Science.gov

Sample records for genome-wide mammalian consistency-based

  1. Enredo and Pecan: Genome-wide mammalian consistency-based multiple alignment with paralogs

    PubMed Central

    Paten, Benedict; Herrero, Javier; Beal, Kathryn; Fitzgerald, Stephen; Birney, Ewan

    2008-01-01

    Pairwise whole-genome alignment involves the creation of a homology map, capable of performing a near complete transformation of one genome into another. For multiple genomes this problem is generalized to finding a set of consistent homology maps for converting each genome in the set of aligned genomes into any of the others. The problem can be divided into two principal stages. First, the partitioning of the input genomes into a set of colinear segments, a process which essentially deals with the complex processes of rearrangement. Second, the generation of a base pair level alignment map for each colinear segment. We have developed a new genome-wide segmentation program, Enredo, which produces colinear segments from extant genomes handling rearrangements, including duplications. We have then applied the new alignment program Pecan, which makes the consistency alignment methodology practical at a large scale, to create a new set of genome-wide mammalian alignments. We test both Enredo and Pecan using novel and existing assessment analyses that incorporate both real biological data and simulations, and show that both independently and in combination they outperform existing programs. Alignments from our pipeline are publicly available within the Ensembl genome browser. PMID:18849524

  2. Genome-wide detection of gene extinction in early mammalian evolution.

    PubMed

    Kuraku, Shigehiro; Kuratani, Shigeru

    2011-01-01

    Detecting gene losses is a novel aspect of evolutionary genomics that has been made feasible by whole-genome sequencing. However, research to date has concentrated on elucidating evolutionary patterns of genomic components shared between species, rather than identifying disparities between genomes. In this study, we searched for gene losses in the lineage leading to eutherian mammals. First, as a pilot analysis, we selected five gene families (Wnt, Fgf, Tbx, TGFβ, and Frizzled) for molecular phylogenetic analyses, and identified mammalian lineage-specific losses of Wnt11b, Tbx6L/VegT/tbx16, Nodal-related, ADMP1, ADMP2, Sizzled, and Crescent. Second, automated genome-wide phylogenetic screening was implemented based on this pilot analysis. As a result, we detected 147 chicken genes without eutherian orthologs, which resulted from 141 gene loss events. Our inventory contained a group of regulatory genes governing early embryonic axis formation, such as Noggins, and multiple members of the opsin and prolactin-releasing hormone receptor ("PRLHR") gene families. Our findings highlight the potential of genome-wide gene phylogeny ("phylome") analysis in detecting possible rearrangement of gene networks and the importance of identifying losses of ancestral genomic components in analyzing the molecular basis underlying phenotypic evolution. PMID:22094861

  3. Mammalian Heat Shock Response and Mechanisms Underlying Its Genome-wide Transcriptional Regulation.

    PubMed

    Mahat, Dig B; Salamanca, H Hans; Duarte, Fabiana M; Danko, Charles G; Lis, John T

    2016-04-01

    The heat shock response (HSR) is critical for survival of all organisms. However, its scope, extent, and the molecular mechanism of regulation are poorly understood. Here we show that the genome-wide transcriptional response to heat shock in mammals is rapid and dynamic and results in induction of several hundred and repression of several thousand genes. Heat shock factor 1 (HSF1), the "master regulator" of the HSR, controls only a fraction of heat shock-induced genes and does so by increasing RNA polymerase II release from promoter-proximal pause. Notably, HSF2 does not compensate for the lack of HSF1. However, serum response factor appears to transiently induce cytoskeletal genes independently of HSF1. The pervasive repression of transcription is predominantly HSF1-independent and is mediated through reduction of RNA polymerase II pause release. Overall, mammalian cells orchestrate rapid, dynamic, and extensive changes in transcription upon heat shock that are largely modulated at pause release, and HSF1 plays a limited and specialized role. PMID:27052732

  4. Mammalian NET-seq analysis defines nascent RNA profiles and associated RNA processing genome-wide.

    PubMed

    Nojima, Takayuki; Gomes, Tomás; Carmo-Fonseca, Maria; Proudfoot, Nicholas J

    2016-03-01

    The transcription cycle of RNA polymerase II (Pol II) correlates with changes to the phosphorylation state of its large subunit C-terminal domain (CTD). We recently developed Native Elongation Transcript sequencing using mammalian cells (mNET-seq), which generates single-nucleotide-resolution genome-wide profiles of nascent RNA and co-transcriptional RNA processing that are associated with different CTD phosphorylation states. Here we provide a detailed protocol for mNET-seq. First, Pol II elongation complexes are isolated with specific phospho-CTD antibodies from chromatin solubilized by micrococcal nuclease digestion. Next, RNA derived from within the Pol II complex is size fractionated and Illumina sequenced. Using mNET-seq, we have previously shown that Pol II pauses at both ends of protein-coding genes but with different CTD phosphorylation patterns, and we have also detected phosphorylation at serine 5 (Ser5-P) CTD-specific splicing intermediates and Pol II accumulation over co-transcriptionally spliced exons. With moderate biochemical and bioinformatic skills, mNET-seq can be completed in ∼6 d, not including sequencing and data analysis. PMID:26844429

  5. Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

    PubMed

    Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario

    2011-01-01

    Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres. PMID:21081712

  6. Genome-Wide siRNA Screening Using Forward Transfection: Identification of Modulators of Membrane Trafficking in Mammalian Cells.

    PubMed

    Bexiga, Mariana G; Simpson, Jeremy C

    2016-01-01

    RNA interference (RNAi) has become an essential tool for molecular and cellular biologists to dissect cell function. In recent years its application has been extended to genome-wide studies, enabling the systematic identification of new cell regulation mechanisms and drug targets. In this chapter, a protocol for a genome-wide RNAi screen coupled to high-content microscopy is presented. Specifically we describe key features of assay design, plate layout, and a protocol for forward transfection of small interfering RNAs (siRNAs) in a 384-well plate format. As an example of its application in identifying modulators of membrane trafficking, we also provide a protocol to measure the efficacy of intracellular delivery of the B subunit of Shiga-like toxin to the Golgi complex. Finally we show an automated image analysis routine that can be used to extract single cell data from the screen, thereby providing a quantitative ranking of how a large panel of siRNAs affects this biological process. PMID:27581283

  7. Detecting DNA double-stranded breaks in mammalian genomes by linear amplification-mediated high-throughput genome-wide translocation sequencing.

    PubMed

    Hu, Jiazhi; Meyers, Robin M; Dong, Junchao; Panchakshari, Rohit A; Alt, Frederick W; Frock, Richard L

    2016-05-01

    Unbiased, high-throughput assays for detecting and quantifying DNA double-stranded breaks (DSBs) across the genome in mammalian cells will facilitate basic studies of the mechanisms that generate and repair endogenous DSBs. They will also enable more applied studies, such as those to evaluate the on- and off-target activities of engineered nucleases. Here we describe a linear amplification-mediated high-throughput genome-wide sequencing (LAM-HTGTS) method for the detection of genome-wide 'prey' DSBs via their translocation in cultured mammalian cells to a fixed 'bait' DSB. Bait-prey junctions are cloned directly from isolated genomic DNA using LAM-PCR and unidirectionally ligated to bridge adapters; subsequent PCR steps amplify the single-stranded DNA junction library in preparation for Illumina Miseq paired-end sequencing. A custom bioinformatics pipeline identifies prey sequences that contribute to junctions and maps them across the genome. LAM-HTGTS differs from related approaches because it detects a wide range of broken end structures with nucleotide-level resolution. Familiarity with nucleic acid methods and next-generation sequencing analysis is necessary for library generation and data interpretation. LAM-HTGTS assays are sensitive, reproducible, relatively inexpensive, scalable and straightforward to implement with a turnaround time of <1 week. PMID:27031497

  8. A Genome-Wide siRNA Screen in Mammalian Cells for Regulators of S6 Phosphorylation

    PubMed Central

    Papageorgiou, Angela; Rapley, Joseph; Mesirov, Jill P.; Tamayo, Pablo; Avruch, Joseph

    2015-01-01

    mTOR complex1, the major regulator of mRNA translation in all eukaryotic cells, is strongly activated in most cancers. We performed a genome-wide RNAi screen in a human cancer cell line, seeking genes that regulate S6 phosphorylation, readout of mTORC1 activity. Applying a stringent selection, we retrieved nearly 600 genes wherein at least two RNAis gave significant reduction in S6-P. This cohort contains known regulators of mTOR complex 1 and is significantly enriched in genes whose depletion affects the proliferation/viability of the large set of cancer cell lines in the Achilles database in a manner paralleling that caused by mTOR depletion. We next examined the effect of RNAi pools directed at 534 of these gene products on S6-P in TSC1 null mouse embryo fibroblasts. 76 RNAis reduced S6 phosphorylation significantly in 2 or 3 replicates. Surprisingly, among this cohort of genes the only elements previously associated with the maintenance of mTORC1 activity are two subunits of the vacuolar ATPase and the CUL4 subunit DDB1. RNAi against a second set of 84 targets reduced S6-P in only one of three replicates. However, an indication that this group also bears attention is the presence of rpS6KB1 itself, Rac1 and MAP4K3, a protein kinase that supports amino acid signaling to rpS6KB1. The finding that S6 phosphorylation requires a previously unidentified, functionally diverse cohort of genes that participate in fundamental cellular processes such as mRNA translation, RNA processing, DNA repair and metabolism suggests the operation of feedback pathways in the regulation of mTORC1 operating through novel mechanisms. PMID:25790369

  9. A genome-wide screen identifies a single Β-defensin gene cluster in the chicken: implications for the origin and evolution of mammalian defensins

    SciTech Connect

    Xiao, Yanjing; Hughes, Austin L.; Ando, Junko; Matsuda, Yoichi; Cheng, Jan-Fang; Skinner-Noble, Donald; Zhang, Guolong

    2004-08-13

    Defensins comprise a large family of cationic antimicrobial peptides that are characterized by the presence of a conserved cysteine-rich defensin motif. Based on the spacing pattern of cysteines, these defensins are broadly divided into five groups, namely plant, invertebrate, {alpha}-, {beta}-, and {theta}-defensins, with the last three groups being mostly found in mammalian species. However, the evolutionary relationships among these five groups of defensins remain controversial.

  10. Advances in genome-wide DNA methylation analysis

    PubMed Central

    Gupta, Romi; Nagarajan, Arvindhan; Wajapeyee, Narendra

    2013-01-01

    The covalent DNA modification of cytosine at position 5 (5-methylcytosine; 5mC) has emerged as an important epigenetic mark most commonly present in the context of CpG dinucleotides in mammalian cells. In pluripotent stem cells and plants, it is also found in non-CpG and CpNpG contexts, respectively. 5mC has important implications in a diverse set of biological processes, including transcriptional regulation. Aberrant DNA methylation has been shown to be associated with a wide variety of human ailments and thus is the focus of active investigation. Methods used for detecting DNA methylation have revolutionized our understanding of this epigenetic mark and provided new insights into its role in diverse biological functions. Here we describe recent technological advances in genome-wide DNA methylation analysis and discuss their relative utility and drawbacks, providing specific examples from studies that have used these technologies for genome-wide DNA methylation analysis to address important biological questions. Finally, we discuss a newly identified covalent DNA modification, 5-hydroxymethylcytosine (5hmC), and speculate on its possible biological function, as well as describe a new methodology that can distinguish 5hmC from 5mC. PMID:20964631

  11. Genome-wide association studies in pharmacogenomics.

    PubMed

    Daly, Ann K

    2010-04-01

    Genome-wide association (GWA) studies for pharmacogenomics-related traits are increasingly being performed to identify loci that affect either drug response or susceptibility to adverse drug reactions. Until now, only the largest effects have been detected, partly because of the challenges of obtaining large numbers of cases for pharmacogenomic studies. Since 2007, a range of pharmacogenomics GWA studies have been published that have identified several interesting and novel associations between drug responses or reactions and clinically relevant loci, showing the value of this approach. PMID:20300088

  12. Genome-wide association studies in neurology

    PubMed Central

    Tan, Meng-Shan; Jiang, Teng

    2014-01-01

    Genome-wide association studies (GWAS) are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWAS in common neurological diseases, including Stroke, Alzheimer’s disease, Parkinson’s disease, epilepsy, multiple sclerosis, migraine, amyotrophic lateral sclerosis, frontotemporal lobar degeneration, restless legs syndrome, intracranial aneurysm, human prion diseases and moyamoya disease. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of potential therapeutic agents. PMID:25568877

  13. Genome-Wide Association Studies: A Primer

    PubMed Central

    Corvin, Aiden; Craddock, Nick; Sullivan, Patrick F.

    2014-01-01

    There have been nearly 400genome-wide association studies published since 2005. The GWAS approach has been exceptionally successful in identifying common genetic variants that predispose to a variety of complex human diseases and biochemical and anthropometric traits. Although this approach is relatively new, there are many excellent reviews of different aspects of the GWAS method. Here, we provide a primer, an annotated overview of the GWAS method with particular reference to psychiatric genetics. We dissect the GWAS methodology into its components and provide a brief description with citations and links to reviews that cover the topic in detail. PMID:19895722

  14. Genome-wide identification of enhancer elements.

    PubMed

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems. PMID:27389984

  15. GWIDD: Genome-wide protein docking database

    PubMed Central

    Kundrotas, Petras J.; Zhu, Zhengwei; Vakser, Ilya A.

    2010-01-01

    Structural information on interacting proteins is important for understanding life processes at the molecular level. Genome-wide docking database is an integrated resource for structural studies of protein–protein interactions on the genome scale, which combines the available experimental data with models obtained by docking techniques. Current database version (August 2009) contains 25 559 experimental and modeled 3D structures for 771 organisms spanned over the entire universe of life from viruses to humans. Data are organized in a relational database with user-friendly search interface allowing exploration of the database content by a number of parameters. Search results can be interactively previewed and downloaded as PDB-formatted files, along with the information relevant to the specified interactions. The resource is freely available at http://gwidd.bioinformatics.ku.edu. PMID:19900970

  16. Genome-wide Membrane Protein Structure Prediction

    PubMed Central

    Piccoli, Stefano; Suku, Eda; Garonzi, Marianna; Giorgetti, Alejandro

    2013-01-01

    Transmembrane proteins allow cells to extensively communicate with the external world in a very accurate and specific way. They form principal nodes in several signaling pathways and attract large interest in therapeutic intervention, as the majority pharmaceutical compounds target membrane proteins. Thus, according to the current genome annotation methods, a detailed structural/functional characterization at the protein level of each of the elements codified in the genome is also required. The extreme difficulty in obtaining high-resolution three-dimensional structures, calls for computational approaches. Here we review to which extent the efforts made in the last few years, combining the structural characterization of membrane proteins with protein bioinformatics techniques, could help describing membrane proteins at a genome-wide scale. In particular we analyze the use of comparative modeling techniques as a way of overcoming the lack of high-resolution three-dimensional structures in the human membrane proteome. PMID:24403851

  17. Genome-Wide Association Studies of Cancer

    PubMed Central

    Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth

    2010-01-01

    Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100

  18. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  19. Genome-wide methylation profiling of schizophrenia.

    PubMed

    Rukova, B; Staneva, R; Hadjidekova, S; Stamenov, G; Milanova; Toncheva, D

    2014-12-01

    Schizophrenia is one of the major psychiatric disorders. It is a disorder of complex inheritance, involving both heritable and environmental factors. DNA methylation is an inheritable epigenetic modification that stably alters gene expression. We reasoned that genetic modifications that are a result of environmental stimuli could also make a contribution. We have performed 26 high-resolution genome-wide methylation array analyses to determine the methylation status of 27,627 CpG islands and compared the data between patients and healthy controls. Methylation profiles of DNAs were analyzed in six pools: 220 schizophrenia patients; 220 age-matched healthy controls; 110 female schizophrenia patients; 110 age-matched healthy females; 110 male schizophrenia patients; 110 age-matched healthy males. We also investigated the methylation status of 20 individual patient DNA samples (eight females and 12 males. We found significant differences in the methylation profile between schizophrenia and control DNA pools. We found new candidate genes that principally participate in apoptosis, synaptic transmission and nervous system development (GABRA2, LIN7B, CASP3). Methylation profiles differed between the genders. In females, the most important genes participate in apoptosis and synaptic transmission (XIAP, GABRD, OXT, KRT7), whereas in the males, the implicated genes in the molecular pathology of the disease were DHX37, MAP2K2, FNDC4 and GIPC1. Data from the individual methylation analyses confirmed, the gender-specific pools results. Our data revealed major differences in methylation profiles between schizophrenia patients and controls and between male and female patients. The dysregulated activity of the candidate genes could play a role in schizophrenia pathogenesis. PMID:25937794

  20. Genome-Wide Methylation Profiling of Schizophrenia

    PubMed Central

    Rukova, B; Staneva, R; Hadjidekova, S; Stamenov, G; Milanova; Toncheva, D

    2014-01-01

    Schizophrenia is one of the major psychiatric disorders. It is a disorder of complex inheritance, involving both heritable and environmental factors. DNA methylation is an inheritable epigenetic modification that stably alters gene expression. We reasoned that genetic modifications that are a result of environmental stimuli could also make a contribution. We have performed 26 high-resolution genome-wide methylation array analyses to determine the methylation status of 27,627 CpG islands and compared the data between patients and healthy controls. Methylation profiles of DNAs were analyzed in six pools: 220 schizophrenia patients; 220 age-matched healthy controls; 110 female schizophrenia patients; 110 age-matched healthy females; 110 male schizophrenia patients; 110 age-matched healthy males. We also investigated the methylation status of 20 individual patient DNA samples (eight females and 12 males. We found significant differences in the methylation profile between schizophrenia and control DNA pools. We found new candidate genes that principally participate in apoptosis, synaptic transmission and nervous system development (GABRA2, LIN7B, CASP3). Methylation profiles differed between the genders. In females, the most important genes participate in apoptosis and synaptic transmission (XIAP, GABRD, OXT, KRT7), whereas in the males, the implicated genes in the molecular pathology of the disease were DHX37, MAP2K2, FNDC4 and GIPC1. Data from the individual methylation analyses confirmed, the gender-specific pools results. Our data revealed major differences in methylation profiles between schizophrenia patients and controls and between male and female patients. The dysregulated activity of the candidate genes could play a role in schizophrenia pathogenesis. PMID:25937794

  1. Genome Wide Methylome Alterations in Lung Cancer.

    PubMed

    Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  2. Genome Wide Methylome Alterations in Lung Cancer

    PubMed Central

    Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  3. Optical mapping discerns genome wide DNA methylation profiles

    PubMed Central

    Ananiev, Gene E; Goldstein, Steve; Runnheim, Rod; Forrest, Dan K; Zhou, Shiguo; Potamousis, Konstantinos; Churas, Chris P; Bergendahl, Veit; Thomson, James A; Schwartz, David C

    2008-01-01

    Background Methylation of CpG dinucleotides is a fundamental mechanism of epigenetic regulation in eukaryotic genomes. Development of methods for rapid genome wide methylation profiling will greatly facilitate both hypothesis and discovery driven research in the field of epigenetics. In this regard, a single molecule approach to methylation profiling offers several unique advantages that include elimination of chemical DNA modification steps and PCR amplification. Results A single molecule approach is presented for the discernment of methylation profiles, based on optical mapping. We report results from a series of pilot studies demonstrating the capabilities of optical mapping as a platform for methylation profiling of whole genomes. Optical mapping was used to discern the methylation profile from both an engineered and wild type Escherichia coli. Furthermore, the methylation status of selected loci within the genome of human embryonic stem cells was profiled using optical mapping. Conclusion The optical mapping platform effectively detects DNA methylation patterns. Due to single molecule detection, optical mapping offers significant advantages over other technologies. This advantage stems from obviation of DNA modification steps, such as bisulfite treatment, and the ability of the platform to assay repeat dense regions within mammalian genomes inaccessible to techniques using array-hybridization technologies. PMID:18667073

  4. Chronic Periodontitis Genome-wide Association Studies

    PubMed Central

    Rhodin, K.; Divaris, K.; North, K.E.; Barros, S.P.; Moss, K.; Beck, J.D.; Offenbacher, S.

    2014-01-01

    Recent genome-wide association studies (GWAS) of chronic periodontitis (CP) offer rich data sources for the investigation of candidate genes, functional elements, and pathways. We used GWAS data of CP (n = 4,504) and periodontal pathogen colonization (n = 1,020) from a cohort of adult Americans of European descent participating in the Atherosclerosis Risk in Communities study and employed a MAGENTA approach (i.e., meta-analysis gene set enrichment of variant associations) to obtain gene-centric and gene set association results corrected for gene size, number of single-nucleotide polymorphisms, and local linkage disequilibrium characteristics based on the human genome build 18 (National Center for Biotechnology Information build 36). We used the Gene Ontology, Ingenuity, KEGG, Panther, Reactome, and Biocarta databases for gene set enrichment analyses. Six genes showed evidence of statistically significant association: 4 with severe CP (NIN, p = 1.6 × 10−7; ABHD12B, p = 3.6 × 10−7; WHAMM, p = 1.7 × 10−6; AP3B2, p = 2.2 × 10−6) and 2 with high periodontal pathogen colonization (red complex–KCNK1, p = 3.4 × 10−7; Porphyromonas gingivalis–DAB2IP, p = 1.0 × 10−6). Top-ranked genes for moderate CP were HGD (p = 1.4 × 10−5), ZNF675 (p = 1.5 × 10−5), TNFRSF10C (p = 2.0 × 10−5), and EMR1 (p = 2.0 × 10−5). Loci containing NIN, EMR1, KCNK1, and DAB2IP had showed suggestive evidence of association in the earlier single-nucleotide polymorphism–based analyses, whereas WHAMM and AP2B2 emerged as novel candidates. The top gene sets included severe CP (“endoplasmic reticulum membrane,” “cytochrome P450,” “microsome,” and “oxidation reduction”) and moderate CP (“regulation of gene expression,” “zinc ion binding,” “BMP signaling pathway,” and “ruffle”). Gene-centric analyses offer a promising avenue for efficient interrogation of large-scale GWAS data. These results highlight genes in previously identified loci and

  5. A Genome-Wide Map of AAV-Mediated Human Gene Targeting

    PubMed Central

    Deyle, David R.; Hansen, R. Scott; Cornea, Anda M.; Li, Li B.; Burt, Amber A.; Alexander, Ian E.; Sandstrom, Richard S.; Stamatoyannopoulos, John A.; Wei, Chia-Lin; Russell, David W.

    2014-01-01

    To determine which genomic features promote homologous recombination, we created a genome-wide map of gene targeting sites. An adeno-associated virus vector was used to target identical loci introduced as transcriptionally active retroviral vector proviruses. A comparison of ~2,000 targeted and untargeted sites showed that targeting occurred throughout the human genome and was not influenced by the presence of nearby CpG islands, sequence repeats, or DNase I hypersensitive sites. Targeted sites were preferentially found within transcription units, especially when the target loci were transcribed in the opposite orientation to their surrounding chromosomal genes. The impact of DNA replication was determined by mapping replication forks, which revealed a preference for recombination at target loci transcribed towards an incoming fork. Our results constitute the first genome-wide screen of gene targeting in mammalian cells, and they demonstrate a strong recombinogenic effect of colliding polymerases. PMID:25282150

  6. Genome-wide activities of Polycomb complexes control pervasive transcription.

    PubMed

    Lee, Hun-Goo; Kahn, Tatyana G; Simcox, Amanda; Schwartz, Yuri B; Pirrotta, Vincenzo

    2015-08-01

    Polycomb group (PcG) complexes PRC1 and PRC2 are well known for silencing specific developmental genes. PRC2 is a methyltransferase targeting histone H3K27 and producing H3K27me3, essential for stable silencing. Less well known but quantitatively much more important is the genome-wide role of PRC2 that dimethylates ∼70% of total H3K27. We show that H3K27me2 occurs in inverse proportion to transcriptional activity in most non-PcG target genes and intergenic regions and is governed by opposing roaming activities of PRC2 and complexes containing the H3K27 demethylase UTX. Surprisingly, loss of H3K27me2 results in global transcriptional derepression proportionally greatest in silent or weakly transcribed intergenic and genic regions and accompanied by an increase of H3K27ac and H3K4me1. H3K27me2 therefore sets a threshold that prevents random, unscheduled transcription all over the genome and even limits the activity of highly transcribed genes. PRC1-type complexes also have global roles. Unexpectedly, we find a pervasive distribution of histone H2A ubiquitylated at lysine 118 (H2AK118ub) outside of canonical PcG target regions, dependent on the RING/Sce subunit of PRC1-type complexes. We show, however, that H2AK118ub does not mediate the global PRC2 activity or the global repression and is predominantly produced by a new complex involving L(3)73Ah, a homolog of mammalian PCGF3. PMID:25986499

  7. Genome-wide survey for biologically functional pseudogenes.

    PubMed

    Svensson, Orjan; Arvestad, Lars; Lagergren, Jens

    2006-05-01

    According to current estimates there exist about 20,000 pseudogenes in a mammalian genome. The vast majority of these are disabled and nonfunctional copies of protein-coding genes which, therefore, evolve neutrally. Recent findings that a Makorin1 pseudogene, residing on mouse Chromosome 5, is, indeed, in vivo vital and also evolutionarily preserved, encouraged us to conduct a genome-wide survey for other functional pseudogenes in human, mouse, and chimpanzee. We identify to our knowledge the first examples of conserved pseudogenes common to human and mouse, originating from one duplication predating the human-mouse species split and having evolved as pseudogenes since the species split. Functionality is one possible way to explain the apparently contradictory properties of such pseudogene pairs, i.e., high conservation and ancient origin. The hypothesis of functionality is tested by comparing expression evidence and synteny of the candidates with proper test sets. The tests suggest potential biological function. Our candidate set includes a small set of long-lived pseudogenes whose unknown potential function is retained since before the human-mouse species split, and also a larger group of primate-specific ones found from human-chimpanzee searches. Two processed sequences are notable, their conservation since the human-mouse split being as high as most protein-coding genes; one is derived from the protein Ataxin 7-like 3 (ATX7NL3), and one from the Spinocerebellar ataxia type 1 protein (ATX1). Our approach is comparative and can be applied to any pair of species. It is implemented by a semi-automated pipeline based on cross-species BLAST comparisons and maximum-likelihood phylogeny estimations. To separate pseudogenes from protein-coding genes, we use standard methods, utilizing in-frame disablements, as well as a probabilistic filter based on Ka/Ks ratios. PMID:16680195

  8. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A ... out to see if a technology called whole genome sequencing would help them find other genetic risk ...

  9. Genome-wide Association Study of Chicken Plumage Pigmentation.

    PubMed

    Park, Mi Na; Choi, Jin Ae; Lee, Kyung-Tai; Lee, Hyun-Jeong; Choi, Bong-Hwan; Kim, Heebal; Kim, Tae-Hun; Cho, Seoae; Lee, Taeheon

    2013-11-01

    To increase plumage color uniformity and understand the genetic background of Korean chickens, we performed a genome-wide association study of different plumage color in Korean native chickens. We analyzed 60K SNP chips on 279 chickens with GEMMA methods for GWAS and estimated the genetic heritability for plumage color. The estimated heritability suggests that plumage coloration is a polygenic trait. We found new loci associated with feather pigmentation at the genome-wide level and from the results infer that there are additional genetic effect for plumage color. The results will be used for selecting and breeding chicken for plumage color uniformity. PMID:25049737

  10. A genome-wide CRISPR screen in primary immune cells to dissect regulatory networks

    PubMed Central

    Parnas, Oren; Jovanovic, Marko; Eisenhaure, Thomas M.; Herbst, Rebecca H.; Dixit, Atray; Ye, Chun Jimmie; Przybylski, Dariusz; Platt, Randall J.; Tirosh, Itay; Sanjana, Neville E.; Shalem, Ophir; Satija, Rahul; Raychowdhury, Raktima; Mertins, Philipp; Carr, Steven A.; Zhang, Feng; Hacohen, Nir; Regev, Aviv

    2015-01-01

    Finding the components of cellular circuits and determining their functions systematically remains a major challenge in mammalian cells. Here, we introduced genome-wide pooled CRISPR-Cas9 libraries into dendritic cells (DCs) to identify genes that control the induction of tumor necrosis factor (Tnf) by bacterial lipopolysaccharide (LPS), a key process in the host response to pathogens, mediated by the Tlr4 pathway. We found many of the known regulators of Tlr4 signaling, as well as dozens of previously unknown candidates that we validated. By measuring protein markers and mRNA profiles in DCs that are deficient in the known or candidate genes, we classified the genes into three functional modules with distinct effects on the canonical responses to LPS, and highlighted functions for the PAF complex and oligosaccharyltransferase (OST) complex. Our findings uncover new facets of innate immune circuits in primary cells, and provide a genetic approach for dissection of mammalian cell circuits. PMID:26189680

  11. Genome-wide association study identifies five new schizophrenia loci

    PubMed Central

    2012-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10−11) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10−9), ANK3 (rs10994359, P = 2.5 × 10−8) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10−9). PMID:21926974

  12. A super powerful method for genome wide association study

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...

  13. Design and bioinformatics analysis of genome-wide CLIP experiments

    PubMed Central

    Wang, Tao; Xiao, Guanghua; Chu, Yongjun; Zhang, Michael Q.; Corey, David R.; Xie, Yang

    2015-01-01

    The past decades have witnessed a surge of discoveries revealing RNA regulation as a central player in cellular processes. RNAs are regulated by RNA-binding proteins (RBPs) at all post-transcriptional stages, including splicing, transportation, stabilization and translation. Defects in the functions of these RBPs underlie a broad spectrum of human pathologies. Systematic identification of RBP functional targets is among the key biomedical research questions and provides a new direction for drug discovery. The advent of cross-linking immunoprecipitation coupled with high-throughput sequencing (genome-wide CLIP) technology has recently enabled the investigation of genome-wide RBP–RNA binding at single base-pair resolution. This technology has evolved through the development of three distinct versions: HITS-CLIP, PAR-CLIP and iCLIP. Meanwhile, numerous bioinformatics pipelines for handling the genome-wide CLIP data have also been developed. In this review, we discuss the genome-wide CLIP technology and focus on bioinformatics analysis. Specifically, we compare the strengths and weaknesses, as well as the scopes, of various bioinformatics tools. To assist readers in choosing optimal procedures for their analysis, we also review experimental design and procedures that affect bioinformatics analyses. PMID:25958398

  14. Genome-wide association mapping of soybean aphid resistance traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...

  15. Genome-wide characterization of maize miRNA genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MicroRNAs (miRNAs) are small non-coding RNAs that play essential roles in plant growth and development. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling ident...

  16. Massively expedited genome-wide heritability analysis (MEGHA)

    PubMed Central

    Ge, Tian; Nichols, Thomas E.; Lee, Phil H.; Holmes, Avram J.; Roffman, Joshua L.; Buckner, Randy L.; Sabuncu, Mert R.; Smoller, Jordan W.

    2015-01-01

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction. PMID:25675487

  17. Massively expedited genome-wide heritability analysis (MEGHA).

    PubMed

    Ge, Tian; Nichols, Thomas E; Lee, Phil H; Holmes, Avram J; Roffman, Joshua L; Buckner, Randy L; Sabuncu, Mert R; Smoller, Jordan W

    2015-02-24

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction. PMID:25675487

  18. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep.

    PubMed

    Mousel, Michelle R; Reynolds, James O; White, Stephen N

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x20(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909

  19. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep

    PubMed Central

    Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10-5) were identified including markers in or near PIK3CB (P = 2.22x10-6; additive model), KCNB1 (P = 2.93x10-6; dominance model), ZC3H12C (P = 3.25x10-6; genotypic model), JPH1 (P = 4.68x20-6; genotypic model), and MYO3B (P = 5.74x10-6; recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909

  20. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  1. Genome-wide patterns of selection in 230 ancient Eurasians

    PubMed Central

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  2. Genome-Wide Significant Loci: How Important Are They?

    PubMed Central

    Björkegren, Johan L.M.; Kovacic, Jason C.; Dudley, Joel T.; Schadt, Eric E.

    2015-01-01

    Genome-wide association studies (GWAS) have been extensively used to study common complex diseases such as coronary artery disease (CAD), revealing 153 suggestive CAD loci, of which at least 46 have been validated as having genome-wide significance. However, these loci collectively explain <10% of the genetic variance in CAD. Thus, we must address the key question of what factors constitute the remaining 90% of CAD heritability. We review possible limitations of GWAS, and contextually consider some candidate CAD loci identified by this method. Looking ahead, we propose systems genetics as a complementary approach to unlocking the CAD heritability and etiology. Systems genetics builds network models of relevant molecular processes by combining genetic and genomic datasets to ultimately identify key “drivers” of disease. By leveraging systems-based genetic approaches, we can help reveal the full genetic basis of common complex disorders, enabling novel diagnostic and therapeutic opportunities. PMID:25720628

  3. Genome-wide association study of schizophrenia in Ashkenazi Jews.

    PubMed

    Goes, Fernando S; McGrath, John; Avramopoulos, Dimitrios; Wolyniec, Paula; Pirooznia, Mehdi; Ruczinski, Ingo; Nestadt, Gerald; Kenny, Eimear E; Vacic, Vladimir; Peters, Inga; Lencz, Todd; Darvasi, Ariel; Mulle, Jennifer G; Warren, Stephen T; Pulver, Ann E

    2015-12-01

    Schizophrenia is a common, clinically heterogeneous disorder associated with lifelong morbidity and early mortality. Several genetic variants associated with schizophrenia have been identified, but the majority of the heritability remains unknown. In this study, we report on a case-control sample of Ashkenazi Jews (AJ), a founder population that may provide additional insights into genetic etiology of schizophrenia. We performed a genome-wide association analysis (GWAS) of 592 cases and 505 controls of AJ ancestry ascertained in the US. Subsequently, we performed a meta-analysis with an Israeli AJ sample of 913 cases and 1640 controls, followed by a meta-analysis and polygenic risk scoring using summary results from Psychiatric GWAS Consortium 2 schizophrenia study. The U.S. AJ sample showed strong evidence of polygenic inheritance (pseudo-R(2) ∼9.7%) and a SNP-heritability estimate of 0.39 (P = 0.00046). We found no genome-wide significant associations in the U.S. sample or in the combined US/Israeli AJ meta-analysis of 1505 cases and 2145 controls. The strongest AJ specific associations (P-values in 10(-6) -10(-7) range) were in the 22q 11.2 deletion region and included the genes TBX1, GLN1, and COMT. Supportive evidence (meta P < 1 × 10(-4) ) was also found for several previously identified genome-wide significant findings, including the HLA region, CNTN4, IMMP2L, and GRIN2A. The meta-analysis of the U.S. sample with the PGC2 results provided initial genome-wide significant evidence for six new loci. Among the novel potential susceptibility genes is PEPD, a gene involved in proline metabolism, which is associated with a Mendelian disorder characterized by developmental delay and cognitive deficits. PMID:26198764

  4. A genome-wide scan for preeclampsia in the Netherlands.

    PubMed

    Lachmeijer, A M; Arngrímsson, R; Bastiaans, E J; Frigge, M L; Pals, G; Sigurdardóttir, S; Stéfansson, H; Pálsson, B; Nicolae, D; Kong, A; Aarnoudse, J G; Gulcher, J R; Dekker, G A; ten Kate, L P; Stéfansson, K

    2001-10-01

    Preeclampsia, hallmarked by de novo hypertension and proteinuria in pregnancy, has a familial tendency. Recently, a large Icelandic genome-wide scan provided evidence for a maternal susceptibility locus for preeclampsia on chromosome 2p13 which was confirmed by a genome scan from Australia and New Zealand (NZ). The current study reports on a genome-wide scan of Dutch affected sib-pair families. In total 67 Dutch affected sib-pair families, comprising at least two siblings with proteinuric preeclampsia, eclampsia or HELLP-syndrome, were typed for 293 polymorphic markers throughout the genome and linkage analysis was performed. The highest allele sharing lod score of 1.99 was seen on chromosome 12q at 109.5 cM. Two peaks overlapped in the same regions between the Dutch and Icelandic genome-wide scan at chromosome 3p and chromosome 15q. No overlap was seen on 2p. Re-analysis in 38 families without HELLP-syndrome (preeclampsia families) and 34 families with at least one sibling with HELLP syndrome (HELLP families), revealed two peaks with suggestive evidence for linkage in the non-HELLP families on chromosome 10q (lod score 2.38, D10S1432, 93.9 cM) and 22q (lod score 2.41, D22S685, 32.4 cM). The peak on 12q appeared to be associated with HELLP syndrome; it increased to a lod score of 2.1 in the HELLP families and almost disappeared in the preeclampsia families. A nominal peak on chromosome 11 in the preeclampsia families showed overlap with the second highest peak in the Australian/NZ study. Results from our Dutch genome-wide scan indicate that HELLP syndrome might have a different genetic background than preeclampsia. PMID:11781687

  5. Genome-Wide Detection and Analysis of Multifunctional Genes

    PubMed Central

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  6. Genome-Wide Detection and Analysis of Multifunctional Genes.

    PubMed

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-10-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms--H. sapiens, D. melanogaster, and S. cerevisiae--and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  7. Genome-wide association study of periodontal pathogen colonization.

    PubMed

    Divaris, K; Monda, K L; North, K E; Olshan, A F; Lange, E M; Moss, K; Barros, S P; Beck, J D; Offenbacher, S

    2012-07-01

    Pathological shifts of the human microbiome are characteristic of many diseases, including chronic periodontitis. To date, there is limited evidence on host genetic risk loci associated with periodontal pathogen colonization. We conducted a genome-wide association (GWA) study among 1,020 white participants of the Atherosclerosis Risk in Communities Study, whose periodontal diagnosis ranged from healthy to severe chronic periodontitis, and for whom "checkerboard" DNA-DNA hybridization quantification of 8 periodontal pathogens was performed. We examined 3 traits: "high red" and "high orange" bacterial complexes, and "high" Aggregatibacter actinomycetemcomitans (Aa) colonization. Genotyping was performed on the Affymetrix 6.0 platform. Imputation to 2.5 million markers was based on HapMap II-CEU, and a multiple-test correction was applied (genome-wide threshold of p < 5 × 10(-8)). We detected no genome-wide significant signals. However, 13 loci, including KCNK1, FBXO38, UHRF2, IL33, RUNX2, TRPS1, CAMTA1, and VAMP3, provided suggestive evidence (p < 5 × 10(-6)) of association. All associations reported for "red" and "orange" complex microbiota, but not for Aa, had the same effect direction in a second sample of 123 African-American participants. None of these polymorphisms was associated with periodontitis diagnosis. Investigations replicating these findings may lead to an improved understanding of the complex nature of host-microbiome interactions that characterizes states of health and disease. PMID:22699663

  8. Genome-wide polymorphisms show unexpected targets of natural selection.

    PubMed

    Pespeni, Melissa H; Garfield, David A; Manier, Mollie K; Palumbi, Stephen R

    2012-04-01

    Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species' wide latitudinal range. We examined 9112 polymorphic loci from upstream non-coding and coding regions of genes for signatures of selection with respect to gene function and tissue- and ontogenetic gene expression. We found that genetic differentiation (F(ST)) varied significantly across functional gene classes. The strongest enrichment occurred in the upstream regions of E3 ligase genes, enzymes known to regulate protein abundance during development and environmental stress. We found enrichment for high heterozygosity in genes directly involved in immune response, particularly NALP genes, which mediate pro-inflammatory signals during bacterial infection. We also found higher heterozygosity in immune genes in the southern population, where disease incidence and pathogen diversity are greater. Similar to the major histocompatibility complex in mammals, balancing selection may enhance genetic diversity in the innate immune system genes of this invertebrate. Overall, our results show that how genome-wide polymorphism data coupled with growing databases on gene function and expression can combine to detect otherwise hidden signals of selection in natural populations. PMID:21993504

  9. A Genome-Wide Association Study of Depressive Symptoms

    PubMed Central

    Cornelis, Marilyn C.; Amin, Najaf; Bakshis, Erin; Baumert, Jens; Ding, Jingzhong; Liu, Yongmei; Marciante, Kristin; Meirelles, Osorio; Nalls, Michael A.; Sun, Yan V.; Vogelzangs, Nicole; Yu, Lei; Bandinelli, Stefania; Benjamin, Emelia J.; Bennett, David A.; Boomsma, Dorret; Cannas, Alessandra; Coker, Laura H.; de Geus, Eco; De Jager, Philip L.; Diez-Roux, Ana V.; Purcell, Shaun; Hu, Frank B.; Rimma, Eric B.; Hunter, David J.; Jensen, Majken K.; Curhan, Gary; Rice, Kenneth; Penman, Alan D.; Rotter, Jerome I.; Sotoodehnia, Nona; Emeny, Rebecca; Eriksson, Johan G.; Evans, Denis A.; Ferrucci, Luigi; Fornage, Myriam; Gudnason, Vilmundur; Hofman, Albert; Illig, Thomas; Kardia, Sharon; Kelly-Hayes, Margaret; Koenen, Karestan; Kraft, Peter; Kuningas, Maris; Massaro, Joseph M.; Melzer, David; Mulas, Antonella; Mulder, Cornelis L.; Murray, Anna; Oostra, Ben A.; Palotie, Aarno; Penninx, Brenda; Petersmann, Astrid; Pilling, Luke C.; Psaty, Bruce; Rawal, Rajesh; Reiman, Eric M.; Schulz, Andrea; Shulman, Joshua M.; Singleton, Andrew B.; Smith, Albert V.; Sutin, Angelina R.; Uitterlinden, André G.; Völzke, Henry; Widen, Elisabeth; Yaffe, Kristine; Zonderman, Alan B.; Cucca, Francesco; Harris, Tamara; Ladwig, Karl-Heinz; Llewellyn, David J.; Räikkönen, Katri; Tanaka, Toshiko

    2013-01-01

    Background Depression is a heritable trait that exists on a continuum of varying severity and duration. Yet, the search for genetic variants associated with depression has had few successes. We exploit the entire continuum of depression to find common variants for depressive symptoms. Methods In this genome-wide association study, we combined the results of 17 population-based studies assessing depressive symptoms with the Center for Epidemiological Studies Depression Scale. Replication of the independent top hits (p < 1 × 10−5) was performed in five studies assessing depressive symptoms with other instruments. In addition, we performed a combined meta-analysis of all 22 discovery and replication studies. Results The discovery sample comprised 34,549 individuals (mean age of 66.5) and no loci reached genome-wide significance (lowest p = 1.05 × 10−7). Seven independent single nucleotide polymorphisms were considered for replication. In the replication set (n = 16,709), we found suggestive association of one single nucleotide polymorphism with depressive symptoms (rs161645, 5q21, p = 9.19 × 10−3). This 5q21 region reached genome-wide significance (p = 4.78 × 10−8) in the overall meta-analysis combining discovery and replication studies (n = 51,258). Conclusions The results suggest that only a large sample comprising more than 50,000 subjects may be sufficiently powered to detect genes for depressive symptoms. PMID:23290196

  10. Genome-wide association discoveries of alcohol dependence

    PubMed Central

    Zuo, Lingjun; Lu, Lingeng; Tan, Yunlong; Pan, Xinghua; Cai, Yiqiang; Wang, Xiaoping; Hong, Jiang; Zhong, Chunlong; Wang, Fei; Zhang, Xiang-yang; Vanderlinden, Lauren A.; Tabakoff, Boris; Luo, Xingguang

    2014-01-01

    Objective To report the genome-wide significant and/or replicable risk variants for alcohol dependence and explore their potential biological functions. Methods We searched in PubMed for all genome-wide association studies (GWASs) of alcohol dependence. The following three types of the results were extracted: (1) genome-wide significant associations in an individual sample, the combined samples, or the meta-analysis (p<5×10−8); (2) top-ranked associations in an individual sample (p<10−5) that were nominally replicated in other samples (p<0.05); and (3) nominally replicable associations across at least three independent GWAS samples (p<0.05). These results were meta-analyzed. cis-eQTLs in human, RNA expression in rat and mouse brain and bioinformatics properties of all of these risk variants were analyzed. Results The variants located within ADH cluster were significantly associated with alcohol dependence at genome-wide level (p<5×10−8) in at least one sample. Some associations with the ADH cluster were replicable across six independent GWAS samples. The variants located within or near SERINC2, KIAA0040, MREG-PECR or PKNOX2 were significantly associated with alcohol dependence at genome-wide level (p<5×10−8) in meta-analysis or combined samples, and these associations were replicable across at least one sample. The associations with the variants within NRD1, GPD1L-CMTM8 or MAP3K9-PCNX were suggestive (5×10−8

  11. Genome-wide non-CpG methylation of the host genome during M. tuberculosis infection.

    PubMed

    Sharma, Garima; Sowpati, Divya Tej; Singh, Prakruti; Khan, Mehak Zahoor; Ganji, Rakesh; Upadhyay, Sandeep; Banerjee, Sharmistha; Nandicoori, Vinay Kumar; Khosla, Sanjeev

    2016-01-01

    A mammalian cell utilizes DNA methylation to modulate gene expression in response to environmental changes during development and differentiation. Aberrant DNA methylation changes as a correlate to diseased states like cancer, neurodegenerative conditions and cardiovascular diseases have been documented. Here we show genome-wide DNA methylation changes in macrophages infected with the pathogen M. tuberculosis. Majority of the affected genomic loci were hypermethylated in M. tuberculosis infected THP1 macrophages. Hotspots of differential DNA methylation were enriched in genes involved in immune response and chromatin reorganization. Importantly, DNA methylation changes were observed predominantly for cytosines present in non-CpG dinucleotide context. This observation was consistent with our previous finding that the mycobacterial DNA methyltransferase, Rv2966c, targets non-CpG dinucleotides in the host DNA during M. tuberculosis infection and reiterates the hypothesis that pathogenic bacteria use non-canonical epigenetic strategies during infection. PMID:27112593

  12. Genome-wide mapping of DNA methylation in the human malaria parasite Plasmodium falciparum

    PubMed Central

    Ponts, Nadia; Fu, Lijuan; Harris, Elena Y.; Zhang, Jing; Chung, Duk-Won D.; Cervantes, Michael C.; Prudhomme, Jacques; Atanasova-Penichon, Vessela; Zehraoui, Enric; Bunnik, Evelien; Rodrigues, Elisandra M.; Lonardi, Stefano; Hicks, Glenn R.; Wang, Yinsheng; Le Roch, Karine G.

    2014-01-01

    SUMMARY Cytosine DNA methylation is an epigenetic mark in most eukaryotic cells that regulates numerous processes, including gene expression and stress responses. We performed a genome-wide analysis of DNA methylation in the human malaria parasite Plasmodium falciparum. We mapped the positions of methylated cytosines and identified a single functional DNA methyltransferase, PfDNMT, that may mediate these genomic modifications. These analyses revealed that the malaria genome is asymmetrically methylated, in which only one DNA strand is methylated, and shares common features with undifferentiated plant and mammalian cells. Notably, core promoters are hypomethylated and transcript levels correlate with intra-exonic methylation. Additionally, there are sharp methylation transitions at nucleosome and exon-intron boundaries. These data suggest that DNA methylation could regulate virulence gene expression and transcription elongation. Furthermore, the broad range of action of DNA methylation and uniqueness of PfDNMT suggest that the methylation pathway is a potential target for anti-malarial strategies. PMID:24331467

  13. Genome-wide non-CpG methylation of the host genome during M. tuberculosis infection

    PubMed Central

    Sharma, Garima; Sowpati, Divya Tej; Singh, Prakruti; Khan, Mehak Zahoor; Ganji, Rakesh; Upadhyay, Sandeep; Banerjee, Sharmistha; Nandicoori, Vinay Kumar; Khosla, Sanjeev

    2016-01-01

    A mammalian cell utilizes DNA methylation to modulate gene expression in response to environmental changes during development and differentiation. Aberrant DNA methylation changes as a correlate to diseased states like cancer, neurodegenerative conditions and cardiovascular diseases have been documented. Here we show genome-wide DNA methylation changes in macrophages infected with the pathogen M. tuberculosis. Majority of the affected genomic loci were hypermethylated in M. tuberculosis infected THP1 macrophages. Hotspots of differential DNA methylation were enriched in genes involved in immune response and chromatin reorganization. Importantly, DNA methylation changes were observed predominantly for cytosines present in non-CpG dinucleotide context. This observation was consistent with our previous finding that the mycobacterial DNA methyltransferase, Rv2966c, targets non-CpG dinucleotides in the host DNA during M. tuberculosis infection and reiterates the hypothesis that pathogenic bacteria use non-canonical epigenetic strategies during infection. PMID:27112593

  14. A genome-wide DNA methylation study in colorectal carcinoma

    PubMed Central

    2011-01-01

    Background We performed a genome-wide scan of 27,578 CpG loci covering 14,475 genes to identify differentially methylated loci (DML) in colorectal carcinoma (CRC). Methods We used Illumina's Infinium methylation assay in paired DNA samples extracted from 24 fresh frozen CRC tissues and their corresponding normal colon tissues from 24 consecutive diagnosed patients at a tertiary medical center. Results We found a total of 627 DML in CRC covering 513 genes, of which 535 are novel DML covering 465 genes. We also validated the Illumina Infinium methylation data for top-ranking genes by non-bisulfite conversion q-PCR-based methyl profiler assay in a subset of the same samples. We also carried out integration of genome-wide copy number and expression microarray along with methylation profiling to see the functional effect of methylation. Gene Set Enrichment Analysis (GSEA) showed that among the major "gene sets" that are hypermethylated in CRC are the sets: "inhibition of adenylate cyclase activity by G-protein signaling", "Rac guanyl-nucleotide exchange factor activity", "regulation of retinoic acid receptor signaling pathway" and "estrogen receptor activity". Two-level nested cross validation showed that DML-based predictive models may offer reasonable sensitivity (around 89%), specificity (around 95%), positive predictive value (around 95%) and negative predictive value (around 89%), suggesting that these markers may have potential clinical application. Conclusion Our genome-wide methylation study in CRC clearly supports most of the previous findings; additionally we found a large number of novel DML in CRC tissue. If confirmed in future studies, these findings may lead to identification of genomic markers for potential clinical application. PMID:21699707

  15. Voxelwise genome-wide association study (vGWAS)

    PubMed Central

    Stein, Jason L.; Hua, Xue; Lee, Suh; Ho, April J.; Leow, Alex D.; Toga, Arthur W.; Saykin, Andrew J.; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J.; Craig, David W.; Gerber, Jill D.; Allen, April N.; Corneveaux, Jason J.; DeChairo, Bryan M.; Potkin, Steven G.; Weiner, Michael W.; Thompson, Paul M.

    2010-01-01

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age±s.d.: 75.52±6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure. PMID:20171287

  16. Genome-Wide Association Study of Metabolic Syndrome in Koreans

    PubMed Central

    Jeong, Seok Won; Chung, Myungguen; Park, Soo-Jung; Cho, Seong Beom

    2014-01-01

    Metabolic syndrome (METS) is a disorder of energy utilization and storage and increases the risk of developing cardiovascular disease and diabetes. To identify the genetic risk factors of METS, we carried out a genome-wide association study (GWAS) for 2,657 cases and 5,917 controls in Korean populations. As a result, we could identify 2 single nucleotide polymorphisms (SNPs) with genome-wide significance level p-values (<5 × 10-8), 8 SNPs with genome-wide suggestive p-values (5 × 10-8 ≤ p < 1 × 10-5), and 2 SNPs of more functional variants with borderline p-values (5 × 10-5 ≤ p < 1 × 10-4). On the other hand, the multiple correction criteria of conventional GWASs exclude false-positive loci, but simultaneously, they discard many true-positive loci. To reconsider the discarded true-positive loci, we attempted to include the functional variants (nonsynonymous SNPs [nsSNPs] and expression quantitative trait loci [eQTL]) among the top 5,000 SNPs based on the proportion of phenotypic variance explained by genotypic variance. In total, 159 eQTLs and 18 nsSNPs were presented in the top 5,000 SNPs. Although they should be replicated in other independent populations, 6 eQTLs and 2 nsSNP loci were located in the molecular pathways of LPL, APOA5, and CHRM2, which were the significant or suggestive loci in the METS GWAS. Conclusively, our approach using the conventional GWAS, reconsidering functional variants and pathway-based interpretation, suggests a useful method to understand the GWAS results of complex traits and can be expanded in other genomewide association studies. PMID:25705157

  17. Genome-Wide Association Study of Parity in Bangladeshi Women

    PubMed Central

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L.; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G.; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10-7 and number of pregnancies at P <10-6. This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10-6. The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10-5 for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis. PMID:25742292

  18. Genome-wide association study of obsessive-compulsive disorder.

    PubMed

    Stewart, S E; Yu, D; Scharf, J M; Neale, B M; Fagerness, J A; Mathews, C A; Arnold, P D; Evans, P D; Gamazon, E R; Davis, L K; Osiecki, L; McGrath, L; Haddad, S; Crane, J; Hezel, D; Illman, C; Mayerfeld, C; Konkashbaev, A; Liu, C; Pluzhnikov, A; Tikhomirov, A; Edlund, C K; Rauch, S L; Moessner, R; Falkai, P; Maier, W; Ruhrmann, S; Grabe, H-J; Lennertz, L; Wagner, M; Bellodi, L; Cavallini, M C; Richter, M A; Cook, E H; Kennedy, J L; Rosenberg, D; Stein, D J; Hemmings, S M J; Lochner, C; Azzam, A; Chavira, D A; Fournier, E; Garrido, H; Sheppard, B; Umaña, P; Murphy, D L; Wendland, J R; Veenstra-VanderWeele, J; Denys, D; Blom, R; Deforce, D; Van Nieuwerburgh, F; Westenberg, H G M; Walitza, S; Egberts, K; Renner, T; Miguel, E C; Cappi, C; Hounie, A G; Conceição do Rosário, M; Sampaio, A S; Vallada, H; Nicolini, H; Lanzagorta, N; Camarena, B; Delorme, R; Leboyer, M; Pato, C N; Pato, M T; Voyiaziakis, E; Heutink, P; Cath, D C; Posthuma, D; Smit, J H; Samuels, J; Bienvenu, O J; Cullen, B; Fyer, A J; Grados, M A; Greenberg, B D; McCracken, J T; Riddle, M A; Wang, Y; Coric, V; Leckman, J F; Bloch, M; Pittenger, C; Eapen, V; Black, D W; Ophoff, R A; Strengman, E; Cusi, D; Turiel, M; Frau, F; Macciardi, F; Gibbs, J R; Cookson, M R; Singleton, A; Hardy, J; Crenshaw, A T; Parkin, M A; Mirel, D B; Conti, D V; Purcell, S; Nestadt, G; Hanna, G L; Jenike, M A; Knowles, J A; Cox, N; Pauls, D L

    2013-07-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1465 cases, 5557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9657 X-chromosome single nucleotide polymorphisms (SNPs). Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two P-values were located within DLGAP1 (P=2.49 × 10(-6) and P=3.44 × 10(-6)), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a P-value=3.84 × 10(-8). However, when trios were meta-analyzed with the case-control samples, the P-value for this variant was 3.62 × 10(-5), losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation QTLs (P<0.001) and frontal lobe expression quantitative trait loci (eQTLs) (P=0.001) was observed within the top-ranked SNPs (P<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  19. Genome-Wide Approaches to Drosophila Heart Development

    PubMed Central

    Frasch, Manfred

    2016-01-01

    The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102

  20. Genome-wide mapping of DNA hydroxymethylation in osteoarthritic chondrocytes

    PubMed Central

    Taylor, Sarah E. B.; Li, Ye Henry; Wong, Wing H.; Bhutani, Nidhi

    2015-01-01

    Objective To examine genome-wide 5hmC distribution in osteoarthritic (OA) and normal chondrocytes to investigate the effect on OA-specific gene expression. Methods Cartilage was obtained from OA patients undergoing total knee arthroplasty or control patients undergoing anterior cruciate ligament reconstruction. Genome-wide sequencing of 5hmC-enriched DNA (5hmC-seq) was performed for a small cohort of normal and OA chondrocytes to identify differentially hydroxymethylated regions (DhMRs) in OA chondrocytes. 5hmC-seq data was intersected with global OA gene expression data to define subsets of genes and pathways potentially affected by increased 5hmC levels in OA chondrocytes. Results 70591 DhMRs were identified in OA chondrocytes compared to normal chondrocytes, 44288 (63%) of which were increased in OA chondrocytes. The majority of DhMRs (66%) were gained in gene bodies. Increased DhMRs were observed in ~50% of genes previously implicated in OA pathology including MMP3, LRP5, GDF5 and COL11A1. Furthermore, analyses of gene expression data revealed gene body gain of 5hmC appears to be preferentially associated with activated but not repressed genes in OA chondrocytes. Conclusion This study provides the first genome-wide profiling of 5hmC distribution in OA chondrocytes. We had previously reported a global increase in 5hmC levels in OA chondrocytes. Gain of 5hmC in the gene body is found to be characteristic of activated genes in OA chondrocytes, highlighting the influence of 5hmC as an epigenetic mark in OA. In addition, this study identifies multiple OA-associated genes that are potentially regulated either singularly by gain of DNA hydroxymethylation or in combination with loss of DNA methylation. PMID:25940674

  1. Genome-wide association study of parity in Bangladeshi women.

    PubMed

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10(-7) and number of pregnancies at P <10(-6). This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10(-6). The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10(-5) for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis. PMID:25742292

  2. [Genome-wide association study for adolescent idiopathic scoliosis].

    PubMed

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS. PMID:27013625

  3. Genome-wide profiling of alternative splicing in Alzheimer's disease

    PubMed Central

    Lai, Mitchell K.P.; Esiri, Margaret M.; Tan, Michelle G.K.

    2014-01-01

    Alternative splicing is a highly regulated process which generates transcriptome and proteome diversity through the skipping or inclusion of exons within gene loci. Identification of aberrant alternative splicing associated with human diseases has become feasible with the development of new genomic technologies and powerful bioinformatics. We have previously reported genome-wide gene alterations in the neocortex of a well-characterized cohort of Alzheimer's disease (AD) patients and matched elderly controls using a commercial exon microarray platform [1]. Here, we provide detailed description of analyses aimed at identifying differential alternative splicing events associated with AD. PMID:26484111

  4. Genome-wide association studies in diverse populations

    PubMed Central

    Rosenberg, Noah A; Huang, Lucy; Jewett, Ethan M; Szpiech, Zachary A; Jankovic, Ivana; Boehnke, Michael

    2011-01-01

    Genome-wide association (GWA) studies have identified a large number of single-nucleotide polymorphisms (SNPs) associated with disease phenotypes. As most GWA studies have been performed primarily in populations of European descent, this review examines the issues involved in extending consideration of GWA studies to diverse worldwide populations. Although challenges exist with such issues as imputation, admixture, and replication, investigation of diverse populations in GWA studies has significant potential to advance the project of mapping the genetic determinants of complex diseases for the human population as a whole. PMID:20395969

  5. [New insight of genome-wide association study (GWAS)].

    PubMed

    Hotta, Kikuko

    2013-02-01

    The number of obese patients is increasing in Japan, due to the westernization of lifestyle. Obesity, especially visceral fat obesity, is important for the development of metabolic syndrome. Genetic factors are important for the development of obesity as well as environmental factors. Importance of genetic factors of fat distribution is also reported. Recent genome-wide association studies (GWASs) have revealed the obesity and fat distribution-related polymorphisms. GWAS will highlight a better understanding of the underlying molecular mechanisms in the regulation of obesity and distribution of body fat. PMID:23631198

  6. Genome-Wide Association Studies for Polycystic Ovary Syndrome.

    PubMed

    Liu, Hongbin; Zhao, Han; Chen, Zi-Jiang

    2016-07-01

    Over the past several years, the field of reproductive medicine has witnessed great advances in genome-wide association studies (GWASs) of polycystic ovary syndrome (PCOS), leading to identification of several promising genes involved in hormone action, type 2 diabetes, and cell proliferation. This review summarizes the key findings and discusses their potential implications with regard to genetic mechanisms of PCOS. Limitations of GWAS are evaluated, emphasizing the understanding of the reasons for variability in results between individual studies. Root causes of misinterpretations of GWASs are also addressed. Finally, the impact of GWAS on future directions of multi- and interdisciplinary studies is discussed. PMID:27513023

  7. Genome-wide analysis of DNA methylation in hepatoblastoma tissues

    PubMed Central

    Cui, Ximao; Liu, Baihui; Zheng, Shan; Dong, Kuiran; Dong, Rui

    2016-01-01

    DNA methylation has a crucial role in cancer biology. In the present study, a genome-wide analysis of DNA methylation in hepatoblastoma (HB) tissues was performed to verify differential methylation levels between HB and normal tissues. As alpha-fetoprotein (AFP) has a critical role in HB, AFP methylation levels were also detected using pyrosequencing. Normal and HB liver tissue samples (frozen tissue) were obtained from patients with HB. Genome-wide analysis of DNA methylation in these tissues was performed using an Infinium HumanMethylation450 BeadChip, and the results were confirmed with reverse transcription-quantitative polymerase chain reaction. The Infinium HumanMethylation450 BeadChip demonstrated distinctively less methylation in HB tissues than in non-tumor tissues. In addition, methylation enrichment was observed in positions near the transcription start site of AFP, which exhibited lower methylation levels in HB tissues than in non-tumor liver tissues. Lastly, a significant negative correlation was observed between AFP messenger RNA expression and DNA methylation percentage, using linear Pearson's R correlation coefficients. The present results demonstrate differential methylation levels between HB and normal tissues, and imply that aberrant methylation of AFP in HB could reflect HB development. Expansion of these findings could provide useful insight into HB biology. PMID:27446465

  8. Genome-wide association study of Tourette's syndrome.

    PubMed

    Scharf, J M; Yu, D; Mathews, C A; Neale, B M; Stewart, S E; Fagerness, J A; Evans, P; Gamazon, E; Edlund, C K; Service, S K; Tikhomirov, A; Osiecki, L; Illmann, C; Pluzhnikov, A; Konkashbaev, A; Davis, L K; Han, B; Crane, J; Moorjani, P; Crenshaw, A T; Parkin, M A; Reus, V I; Lowe, T L; Rangel-Lugo, M; Chouinard, S; Dion, Y; Girard, S; Cath, D C; Smit, J H; King, R A; Fernandez, T V; Leckman, J F; Kidd, K K; Kidd, J R; Pakstis, A J; State, M W; Herrera, L D; Romero, R; Fournier, E; Sandor, P; Barr, C L; Phan, N; Gross-Tsur, V; Benarroch, F; Pollak, Y; Budman, C L; Bruun, R D; Erenberg, G; Naarden, A L; Lee, P C; Weiss, N; Kremeyer, B; Berrío, G B; Campbell, D D; Cardona Silgado, J C; Ochoa, W C; Mesa Restrepo, S C; Muller, H; Valencia Duarte, A V; Lyon, G J; Leppert, M; Morgan, J; Weiss, R; Grados, M A; Anderson, K; Davarya, S; Singer, H; Walkup, J; Jankovic, J; Tischfield, J A; Heiman, G A; Gilbert, D L; Hoekstra, P J; Robertson, M M; Kurlan, R; Liu, C; Gibbs, J R; Singleton, A; Hardy, J; Strengman, E; Ophoff, R A; Wagner, M; Moessner, R; Mirel, D B; Posthuma, D; Sabatti, C; Eskin, E; Conti, D V; Knowles, J A; Ruiz-Linares, A; Rouleau, G A; Purcell, S; Heutink, P; Oostra, B A; McMahon, W M; Freimer, N B; Cox, N J; Pauls, D L

    2013-06-01

    Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (P<5 × 10(-8)); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (P=1.85 × 10(-6)). A secondary analysis including an additional 211 cases and 285 controls from two closely related Latin American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (P=3.6 × 10(-7) for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  9. Genome-Wide Patterns of Nucleotide Polymorphism in Domesticated Rice

    PubMed Central

    Hernandez, Ryan D; Boyko, Adam; Fledel-Alon, Adi; York, Thomas L; Polato, Nicholas R; Olsen, Kenneth M; Nielsen, Rasmus; McCouch, Susan R; Bustamante, Carlos D; Purugganan, Michael D

    2007-01-01

    Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i) selectively neutral population bottleneck model, (ii) bottleneck plus migration model, (iii) multiple selective sweeps model, and (iv) bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation. PMID:17907810

  10. Genome-wide signals of positive selection in human evolution

    PubMed Central

    Enard, David; Messer, Philipp W.; Petrov, Dmitri A.

    2014-01-01

    The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes. PMID:24619126

  11. Genome-wide association study of Tourette Syndrome

    PubMed Central

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  12. Genome-wide association analysis identifies three psoriasis susceptibility loci

    PubMed Central

    Stuart, Philip E.; Nair, Rajan P.; Ellinghaus, Eva; Ding, Jun; Tejasvi, Trilokraj; Gudjonsson, Johann E.; Li, Yun; Weidinger, Stephan; Eberlein, Bernadette; Gieger, Christian; Wichmann, H. Erich; Kunz, Manfred; Ike, Robert; Krueger, Gerald G.; Bowcock, Anne M.; Mroweitz, Ulrich; Lim, Henry W.; Voorhees, John J.; Abecasis, Goncalo R.; Weichenthal, Michael; Franke, Andre; Rahman, Proton; Gladman, Dafna D.; Elder, James T.

    2010-01-01

    To identify novel psoriasis susceptibility loci, we carried out a meta-analysis of two recent genome-wide association studies 1,2, yielding a discovery sample of 1,831 cases and 2,546 controls. 102 of the most promising loci in the discovery analysis were followed up in a three-stage replication study using 4,064 cases and 4,685 controls from Michigan, Toronto, Newfoundland, and Germany. Association at a genome-wide level of significance for the combined discovery and replication samples was found for three genomic regions. One contains NOS2 (rs4795067, p = 4 × 10−11), another contains FBXL19 (rs10782001, p = 9 × 10−10), and a third contains PSMA6 and NFKBIA (rs12586317, p = 2 × 10−8). All three loci were also strongly associated with the subphenotypes of psoriatic arthritis and purely cutaneous psoriasis. Finally, we confirmed a recently identified3 association signal near RNF114. PMID:20953189

  13. Genome-wide gene-based association study.

    PubMed

    Yang, Hsin-Chou; Liang, Yu-Jen; Chung, Chia-Min; Chen, Jia-Wei; Pan, Wen-Harn

    2009-01-01

    Genome-wide association studies, which analyzes hundreds of thousands of single-nucleotide polymorphisms to identify disease susceptibility genes, are challenging because the work involves intensive computation and complex modeling. We propose a two-stage genome-wide association scanning procedure, consisting of a single-locus association scan for the first stage and a gene-based association scan for the second stage. Marginal effects of single-nucleotide polymorphisms are examined by using the exact Armitage trend test or logistic regression, and gene effects are examined by using a p-value combination method. Compared with some existing single-locus and multilocus methods, the proposed method has the following merits: 1) convenient for definition of biologically meaningful regions, 2) powerful for detection of minor-effect genes, 3) helpful for alleviation of a multiple-testing problem, and 4) convenient for result interpretation. The method was applied to study Genetic Analysis Workshop 16 Problem 1 rheumatoid arthritis data, and strong association signals were found. The results show that the human major histocompatibility complex region is the most important genomic region associated with rheumatoid arthritis. Moreover, previously reported genes including PTPN22, C5, and IL2RB were confirmed; novel genes including HLA-DRA, BTNL2, C6orf10, NOTCH4, TAP2, and TNXB were identified by our analysis. PMID:20018002

  14. A Genome-Wide Association Study of Aging

    PubMed Central

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W.; Garcia, Melissa E.; Kaplan, Robert C.; Kumari, Meena; Lunetta, Kathryn L.; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J.; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J.; Biffar, Reiner; Buchman, Aron S.; Boerwinkle, Eric; Couper, David; De Jager, Philip L.; Evans, Denis A.; Harris, Tamara B.; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P.; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J.; Lohman, Kurt K.; Lutsey, Pamela L.; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M.; Reiman, Eric M.; Rotter, Jerome I.; Seshadri, Sudha; Shardell, Michelle D.; Smith, Albert V.; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M. Carola; Bandinelli, Stefania; Baumeister, Sebastian E.; Bennett, David A.; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M.; Newman, Anne B.; Tiemeier, Henning; Franceschini, Nora

    2011-01-01

    Human longevity and healthy aging show moderate heritability (20–50%). We conducted a meta-analysis of genome-wide association studies from nine studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for two outcomes: a) all-cause mortality and b) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10−8). We found fourteen independent SNPs that predicted risk of death, and eight SNPs that predicted event-free survival (p < 10−5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer’s disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. PMID:21782286

  15. Consistency of genome-wide associations across major ancestral groups.

    PubMed

    Ntzani, Evangelia E; Liberopoulos, George; Manolio, Teri A; Ioannidis, John P A

    2012-07-01

    It is not well known whether genetic markers identified through genome-wide association studies (GWAS) confer similar or different risks across people of different ancestry. We screened a regularly updated catalog of all published GWAS curated at the NHGRI website for GWAS-identified associations that had reached genome-wide significance (p ≤ 5 × 10(-8)) in at least one major ancestry group (European, Asian, African) and for which replication data were available for comparison in at least two different major ancestry groups. These groups were compared for the correlation between and differences in risk allele frequencies and genetic effects' estimates. Data on 108 eligible GWAS-identified associations with a total of 900 datasets (European, n = 624; Asian, n = 217; African, n = 60) were analyzed. Risk-allele frequencies were modestly correlated between ancestry groups, with >10% absolute differences in 75-89% of the three pairwise comparisons of ancestry groups. Genetic effect (odds ratio) point estimates between ancestry groups correlated modestly (pairwise comparisons' correlation coefficients: 0.20-0.33) and point estimates of risks were opposite in direction or differed more than twofold in 57%, 79%, and 89% of the European versus Asian, European versus African, and Asian versus African comparisons, respectively. The modest correlations, differing risk estimates, and considerable between-association heterogeneity suggest that differential ancestral effects can be anticipated and genomic risk markers may need separate further evaluation in different ancestry groups. PMID:22183176

  16. Phenome-wide analysis of genome-wide polygenic scores.

    PubMed

    Krapohl, E; Euesden, J; Zabaneh, D; Pingault, J-B; Rimfeld, K; von Stumm, S; Dale, P S; Breen, G; O'Reilly, P F; Plomin, R

    2016-09-01

    Genome-wide polygenic scores (GPS), which aggregate the effects of thousands of DNA variants from genome-wide association studies (GWAS), have the potential to make genetic predictions for individuals. We conducted a systematic investigation of associations between GPS and many behavioral traits, the behavioral phenome. For 3152 unrelated 16-year-old individuals representative of the United Kingdom, we created 13 GPS from the largest GWAS for psychiatric disorders (for example, schizophrenia, depression and dementia) and cognitive traits (for example, intelligence, educational attainment and intracranial volume). The behavioral phenome included 50 traits from the domains of psychopathology, personality, cognitive abilities and educational achievement. We examined phenome-wide profiles of associations for the entire distribution of each GPS and for the extremes of the GPS distributions. The cognitive GPS yielded stronger predictive power than the psychiatric GPS in our UK-representative sample of adolescents. For example, education GPS explained variation in adolescents' behavior problems (~0.6%) and in educational achievement (~2%) but psychiatric GPS were associated with neither. Despite the modest effect sizes of current GPS, quantile analyses illustrate the ability to stratify individuals by GPS and opportunities for research. For example, the highest and lowest septiles for the education GPS yielded a 0.5 s.d. difference in mean math grade and a 0.25 s.d. difference in mean behavior problems. We discuss the usefulness and limitations of GPS based on adult GWAS to predict genetic propensities earlier in development. PMID:26303664

  17. Genome-wide identification of hypoxia-induced enhancer regions

    PubMed Central

    Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.

    2015-01-01

    Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262

  18. Measuring genome-wide nucleosome turnover using CATCH-IT.

    PubMed

    Teves, Sheila S; Deal, Roger B; Henikoff, Steven

    2012-01-01

    The dynamic interplay between DNA-binding proteins and nucleosomes underlies essential nuclear processes such as transcription, replication, and DNA repair. Manifestations of this interplay include the assembly, eviction, and replacement of nucleosomes. Hence, measurements of nucleosome turnover kinetics can lead to insights into the regulation of dynamic chromatin processes. In this chapter, we describe a genome-wide method for measuring nucleosome turnover that uses metabolic labeling followed by capture of newly synthesized histones, which we have termed Covalent Attachment of Tagged Histones to Capture and Identify Turnover (CATCH-IT). Although CATCH-IT can be used with any genome-wide mapping procedure, high-resolution profiling is attainable using paired-end sequencing of native chromatin. Our protocol also includes an efficient Solexa DNA sequencing library preparation protocol that can be used for single base-pair resolution mapping of both nucleosome and subnucleosomal particles. We not only describe the use of these protocols in the context of a Drosophila cell line but also provide the necessary changes for adaptation to other model systems. PMID:22929769

  19. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192

  20. Establishing an analytic pipeline for genome-wide DNA methylation.

    PubMed

    Wright, Michelle L; Dozmorov, Mikhail G; Wolen, Aaron R; Jackson-Cook, Colleen; Starkweather, Angela R; Lyon, Debra E; York, Timothy P

    2016-01-01

    The need for research investigating DNA methylation (DNAm) in clinical studies has increased, leading to the evolution of new analytic methods to improve accuracy and reproducibility of the interpretation of results from these studies. The purpose of this article is to provide clinical researchers with a summary of the major data processing steps routinely applied in clinical studies investigating genome-wide DNAm using the Illumina HumanMethylation 450K BeadChip. In most studies, the primary goal of employing DNAm analysis is to identify differential methylation at CpG sites among phenotypic groups. Experimental design considerations are crucial at the onset to minimize bias from factors related to sample processing and avoid confounding experimental variables with non-biological batch effects. Although there are currently no de facto standard methods for analyzing these data, we review the major steps in processing DNAm data recommended by several research studies. We describe several variations available for clinical researchers to process, analyze, and interpret DNAm data. These insights are applicable to most types of genome-wide DNAm array platforms and will be applicable for the next generation of DNAm array technologies (e.g., the 850K array). Selection of the DNAm analytic pipeline followed by investigators should be guided by the research question and supported by recently published methods. PMID:27127542

  1. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  2. Genome-Wide Scan for Methylation Profiles in Keloids

    PubMed Central

    Jones, Lamont R.; Young, William; Divine, George; Datta, Indrani; Chen, Kang Mei; Ozog, David; Worsham, Maria J.

    2015-01-01

    Keloids are benign fibroproliferative tumors of the skin which commonly occur after injury mainly in darker skinned patients. Medical treatment is fraught with high recurrence rates mainly because of an incomplete understanding of the biological mechanisms that lead to keloids. The purpose of this project was to examine keloid pathogenesis from the epigenome perspective of DNA methylation. Genome-wide profiling used the Infinium HumanMethylation450 BeadChip to interrogate DNA from 6 fresh keloid and 6 normal skin samples from 12 anonymous donors. A 3-tiered approach was used to call out genes most differentially methylated between keloid and normal. When compared to normal, of the 685 differentially methylated CpGs at Tier 3, 510 were hypomethylated and 175 were hypermethylated with 190 CpGs in promoter and 495 in nonpromoter regions. The 190 promoter region CpGs corresponded to 152 genes: 96 (63%) were hypomethylated and 56 (37%) hypermethylated. This exploratory genome-wide scan of the keloid methylome highlights a predominance of hypomethylated genomic landscapes, favoring nonpromoter regions. DNA methylation, as an additional mechanism for gene regulation in keloid pathogenesis, holds potential for novel treatments that reverse deleterious epigenetic changes. As an alternative mechanism for regulating genes, epigenetics may explain why gene mutations alone do not provide definitive mechanisms for keloid formation. PMID:26074660

  3. Genome-Wide Mapping of DNA Strand Breaks

    PubMed Central

    Leduc, Frédéric; Faucher, David; Bikond Nkoma, Geneviève; Grégoire, Marie-Chantal; Arguin, Mélina; Wellinger, Raymund J.; Boissonneault, Guylain

    2011-01-01

    Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed “damaged DNA immunoprecipitation” (dDIP), uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL) to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage. PMID:21364894

  4. A Genome-Wide Association Study of Optic Disc Parameters

    PubMed Central

    Jansonius, Nomdo M.; de Jong, Paulus T. V. M.; Bergen, Arthur A. B.; Isaacs, Aaron; Amin, Najaf; Aulchenko, Yurii S.; Wolfs, Roger C. W.; Hofman, Albert; Rivadeneira, Fernando; Oostra, Ben A.; Uitterlinden, Andre G.; Hysi, Pirro; Hammond, Christopher J.; Lemij, Hans G.; Vingerling, Johannes R.

    2010-01-01

    The optic nerve head is involved in many ophthalmic disorders, including common diseases such as myopia and open-angle glaucoma. Two of the most important parameters are the size of the optic disc area and the vertical cup-disc ratio (VCDR). Both are highly heritable but genetically largely undetermined. We performed a meta-analysis of genome-wide association (GWA) data to identify genetic variants associated with optic disc area and VCDR. The gene discovery included 7,360 unrelated individuals from the population-based Rotterdam Study I and Rotterdam Study II cohorts. These cohorts revealed two genome-wide significant loci for optic disc area, rs1192415 on chromosome 1p22 (p = 6.72×10−19) within 117 kb of the CDC7 gene and rs1900004 on chromosome 10q21.3-q22.1 (p = 2.67×10−33) within 10 kb of the ATOH7 gene. They revealed two genome-wide significant loci for VCDR, rs1063192 on chromosome 9p21 (p = 6.15×10−11) in the CDKN2B gene and rs10483727 on chromosome 14q22.3-q23 (p = 2.93×10−10) within 40 kbp of the SIX1 gene. Findings were replicated in two independent Dutch cohorts (Rotterdam Study III and Erasmus Rucphen Family study; N = 3,612), and the TwinsUK cohort (N = 843). Meta-analysis with the replication cohorts confirmed the four loci and revealed a third locus at 16q12.1 associated with optic disc area, and four other loci at 11q13, 13q13, 17q23 (borderline significant), and 22q12.1 for VCDR. ATOH7 was also associated with VCDR independent of optic disc area. Three of the loci were marginally associated with open-angle glaucoma. The protein pathways in which the loci of optic disc area are involved overlap with those identified for VCDR, suggesting a common genetic origin. PMID:20548946

  5. Genome-wide association study of antisocial personality disorder.

    PubMed

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  6. The CHR site: definition and genome-wide identification of a cell cycle transcriptional element

    PubMed Central

    Müller, Gerd A.; Wintsche, Axel; Stangner, Konstanze; Prohaska, Sonja J.; Stadler, Peter F.; Engeland, Kurt

    2014-01-01

    The cell cycle genes homology region (CHR) has been identified as a DNA element with an important role in transcriptional regulation of late cell cycle genes. It has been shown that such genes are controlled by DREAM, MMB and FOXM1-MuvB and that these protein complexes can contact DNA via CHR sites. However, it has not been elucidated which sequence variations of the canonical CHR are functional and how frequent CHR-based regulation is utilized in mammalian genomes. Here, we define the spectrum of functional CHR elements. As the basis for a computational meta-analysis, we identify new CHR sequences and compile phylogenetic motif conservation as well as genome-wide protein-DNA binding and gene expression data. We identify CHR elements in most late cell cycle genes binding DREAM, MMB, or FOXM1-MuvB. In contrast, Myb- and forkhead-binding sites are underrepresented in both early and late cell cycle genes. Our findings support a general mechanism: sequential binding of DREAM, MMB and FOXM1-MuvB complexes to late cell cycle genes requires CHR elements. Taken together, we define the group of CHR-regulated genes in mammalian genomes and provide evidence that the CHR is the central promoter element in transcriptional regulation of late cell cycle genes by DREAM, MMB and FOXM1-MuvB. PMID:25106871

  7. Genome-wide association studies in pediatric chronic kidney disease.

    PubMed

    Gupta, Jayanta; Kanetsky, Peter A; Wuttke, Matthias; Köttgen, Anna; Schaefer, Franz; Wong, Craig S

    2016-08-01

    The genome-wide association study (GWAS) has become an established scientific method that provides an unbiased screen for genetic loci potentially associated with phenotypes of clinical interest, such as chronic kidney disease (CKD). Thus, GWAS provides opportunities to gain new perspectives regarding the genetic architecture of CKD progression by identifying new candidate genes and targets for intervention. As such, it has become an important arm of translational science providing a complementary line of investigation to identify novel therapeutics to treat CKD. In this review, we describe the method and the challenges of performing GWAS in the pediatric CKD population. We also provide an overview of successful GWAS for kidney disease, and we discuss the established pediatric CKD cohorts in North America and Europe that are poised to identify genetic risk variants associated with CKD progression. PMID:26490952

  8. Ultrafast laser nanosurgery in microfluidics for genome-wide screenings

    PubMed Central

    Ben-Yakar, Adela; Bourgeois, Frederic

    2009-01-01

    Summary The use of ultrafast laser pulses in surgery has allowed for unprecedented precision with minimal collateral damage to surrounding tissues. For these reasons, ultrafast laser nanosurgery, as an injury model, has gained tremendous momentum in experimental biology ranging from in-vitro manipulations of subcellular structures to in-vivo studies in whole living organisms. For example, femtosecond laser nanosurgery on such model organism as the nematode Caenorhabditis elegans (C. elegans) has opened new opportunities for in-vivo nerve regeneration studies. Meanwhile, the development of novel microfluidic devices has brought the control in experimental environment to the level required for precise nanosurgery in various animal models. Merging microfluidics and laser nanosurgery has recently improved the specificities and increased the speed of laser surgeries enabling fast genome-wide screenings that can more readily decode the genetic map of various biological processes. PMID:19278850

  9. Genome-wide nucleosome positioning during embryonic stem cell development.

    PubMed

    Teif, Vladimir B; Vainshtein, Yevhen; Caudron-Herger, Maïwen; Mallm, Jan-Philipp; Marth, Caroline; Höfer, Thomas; Rippe, Karsten

    2012-11-01

    We determined genome-wide nucleosome occupancies in mouse embryonic stem cells and their neural progenitor and embryonic fibroblast counterparts to assess features associated with nucleosome positioning during lineage commitment. Cell-type- and protein-specific binding preferences of transcription factors to sites with either low (Myc, Klf4 and Zfx) or high (Nanog, Oct4 and Sox2) nucleosome occupancy as well as complex patterns for CTCF were identified. Nucleosome-depleted regions around transcription start and transcription termination sites were broad and more pronounced for active genes, with distinct patterns for promoters classified according to CpG content or histone methylation marks. Throughout the genome, nucleosome occupancy was correlated with certain histone methylation or acetylation modifications. In addition, the average nucleosome repeat length increased during differentiation by 5-7 base pairs, with local variations for specific regions. Our results reveal regulatory mechanisms of cell differentiation that involve nucleosome repositioning. PMID:23085715

  10. Quantitative prediction of genome-wide resource allocation in bacteria.

    PubMed

    Goelzer, Anne; Muntel, Jan; Chubukov, Victor; Jules, Matthieu; Prestel, Eric; Nölker, Rolf; Mariadassou, Mahendra; Aymerich, Stéphane; Hecker, Michael; Noirot, Philippe; Becher, Dörte; Fromion, Vincent

    2015-11-01

    Predicting resource allocation between cell processes is the primary step towards decoding the evolutionary constraints governing bacterial growth under various conditions. Quantitative prediction at genome-scale remains a computational challenge as current methods are limited by the tractability of the problem or by simplifying hypotheses. Here, we show that the constraint-based modeling method Resource Balance Analysis (RBA), calibrated using genome-wide absolute protein quantification data, accurately predicts resource allocation in the model bacterium Bacillus subtilis for a wide range of growth conditions. The regulation of most cellular processes is consistent with the objective of growth rate maximization except for a few suboptimal processes which likely integrate more complex objectives such as coping with stressful conditions and survival. As a proof of principle by using simulations, we illustrated how calibrated RBA could aid rational design of strains for maximizing protein production, offering new opportunities to investigate design principles in prokaryotes and to exploit them for biotechnological applications. PMID:26498510

  11. Progress of genome wide association study in domestic animals

    PubMed Central

    2012-01-01

    Domestic animals are invaluable resources for study of the molecular architecture of complex traits. Although the mapping of quantitative trait loci (QTL) responsible for economically important traits in domestic animals has achieved remarkable results in recent decades, not all of the genetic variation in the complex traits has been captured because of the low density of markers used in QTL mapping studies. The genome wide association study (GWAS), which utilizes high-density single-nucleotide polymorphism (SNP), provides a new way to tackle this issue. Encouraging achievements in dissection of the genetic mechanisms of complex diseases in humans have resulted from the use of GWAS. At present, GWAS has been applied to the field of domestic animal breeding and genetics, and some advances have been made. Many genes or markers that affect economic traits of interest in domestic animals have been identified. In this review, advances in the use of GWAS in domestic animals are described. PMID:22958308

  12. Metabolite-based genome-wide association studies in plants.

    PubMed

    Luo, Jie

    2015-04-01

    The plant metabolome is the readout of plant physiological status and is regarded as the bridge between the genome and the phenome of plants. Unraveling the natural variation and the underlying genetic basis of plant metabolism has received increasing interest from plant biologists. Enabled by the recent advances in high-throughput profiling and genotyping technologies, metabolite-based genome-wide association study (mGWAS) has emerged as a powerful alternative forward genetics strategy to dissect the genetic and biochemical bases of metabolism in model and crop plants. In this review, recent progress and applications of mGWAS in understanding the genetic control of plant metabolism and in interactive functional genomics and metabolomics are presented. Further directions and perspectives of mGWAS in plants are also discussed. PMID:25637954

  13. Genome-wide genetic changes during modern breeding of maize.

    PubMed

    Jiao, Yinping; Zhao, Hainan; Ren, Longhui; Song, Weibin; Zeng, Biao; Guo, Jinjie; Wang, Baobao; Liu, Zhipeng; Chen, Jing; Li, Wei; Zhang, Mei; Xie, Shaojun; Lai, Jinsheng

    2012-07-01

    The success of modern maize breeding has been demonstrated by remarkable increases in productivity over the last four decades. However, the underlying genetic changes correlated with these gains remain largely unknown. We report here the sequencing of 278 temperate maize inbred lines from different stages of breeding history, including deep resequencing of 4 lines with known pedigree information. The results show that modern breeding has introduced highly dynamic genetic changes into the maize genome. Artificial selection has affected thousands of targets, including genes and non-genic regions, leading to a reduction in nucleotide diversity and an increase in the proportion of rare alleles. Genetic changes during breeding happen rapidly, with extensive variation (SNPs, indels and copy-number variants (CNVs)) occurring, even within identity-by-descent regions. Our genome-wide assessment of genetic changes during modern maize breeding provides new strategies as well as practical targets for future crop breeding and biotechnology. PMID:22660547

  14. Genome-wide DNA methylation profiling in zebrafish.

    PubMed

    Murphy, P J; Cairns, B R

    2016-01-01

    Genomic DNA methylation functions to repress gene expression by interfering with transcription factor binding and/or recruiting repressive chromatin machinery. Recent data support contribution of regulated DNA methylation to embryonic pluripotency, development, and tissue differentiation; this important epigenetic mark is chemically stable yet enzymatically reversible-and heritable through the germline. Importantly, all the major components involved in dynamic DNA methylation are conserved in zebrafish, including the factors that "write, read, and erase" this mark. Therefore, the zebrafish has become an excellent model for studying most biological processes associated with DNA methylation in mammals. Here we briefly review the zebrafish model for studying DNA methylation and describe a series of methods for performing genome-wide DNA methylation analysis. We address and provide methods for methylated DNA immunoprecipitation followed by sequencing (MeDIP-Seq), bisulfite sequencing (BS-Seq), and reduced representation bisulfite sequencing (RRBS-Seq). PMID:27443935

  15. Quality control procedures for genome-wide association studies.

    PubMed

    Turner, Stephen; Armstrong, Loren L; Bradford, Yuki; Carlson, Christopher S; Crawford, Dana C; Crenshaw, Andrew T; de Andrade, Mariza; Doheny, Kimberly F; Haines, Jonathan L; Hayes, Geoffrey; Jarvik, Gail; Jiang, Lan; Kullo, Iftikhar J; Li, Rongling; Ling, Hua; Manolio, Teri A; Matsumoto, Martha; McCarty, Catherine A; McDavid, Andrew N; Mirel, Daniel B; Paschall, Justin E; Pugh, Elizabeth W; Rasmussen, Luke V; Wilke, Russell A; Zuvich, Rebecca L; Ritchie, Marylyn D

    2011-01-01

    Genome-wide association studies (GWAS) are being conducted at an unprecedented rate in population-based cohorts and have increased our understanding of the pathophysiology of complex disease. Regardless of context, the practical utility of this information will ultimately depend upon the quality of the original data. Quality control (QC) procedures for GWAS are computationally intensive, operationally challenging, and constantly evolving. Here we enumerate some of the challenges in QC of GWAS data and describe the approaches that the electronic MEdical Records and Genomics (eMERGE) network is using for quality assurance in GWAS data, thereby minimizing potential bias and error in GWAS results. We discuss common issues associated with QC of GWAS data, including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We propose best practices and discuss areas of ongoing and future research. PMID:21234875

  16. Genome Wide Association Identifies Novel Loci Involved in Fungal Communication

    PubMed Central

    Kowbel, David; Welch, Juliet; Taylor, John W.; Brem, Rachel B.; Glass, N. Louise

    2013-01-01

    Understanding how genomes encode complex cellular and organismal behaviors has become the outstanding challenge of modern genetics. Unlike classical screening methods, analysis of genetic variation that occurs naturally in wild populations can enable rapid, genome-scale mapping of genotype to phenotype with a medium-throughput experimental design. Here we describe the results of the first genome-wide association study (GWAS) used to identify novel loci underlying trait variation in a microbial eukaryote, harnessing wild isolates of the filamentous fungus Neurospora crassa. We genotyped each of a population of wild Louisiana strains at 1 million genetic loci genome-wide, and we used these genotypes to map genetic determinants of microbial communication. In N. crassa, germinated asexual spores (germlings) sense the presence of other germlings, grow toward them in a coordinated fashion, and fuse. We evaluated germlings of each strain for their ability to chemically sense, chemotropically seek, and undergo cell fusion, and we subjected these trait measurements to GWAS. This analysis identified one gene, NCU04379 (cse-1, encoding a homolog of a neuronal calcium sensor), at which inheritance was strongly associated with the efficiency of germling communication. Deletion of cse-1 significantly impaired germling communication and fusion, and two genes encoding predicted interaction partners of CSE1 were also required for the communication trait. Additionally, mining our association results for signaling and secretion genes with a potential role in germling communication, we validated six more previously unknown molecular players, including a secreted protease and two other genes whose deletion conferred a novel phenotype of increased communication and multi-germling fusion. Our results establish protein secretion as a linchpin of germling communication in N. crassa and shed light on the regulation of communication molecules in this fungus. Our study demonstrates the power

  17. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  18. Systems-Level Analysis of Genome-Wide Association Data

    PubMed Central

    Farber, Charles R.

    2013-01-01

    Genome-wide association studies (GWAS) have emerged as the method of choice for identifying common variants affecting complex disease. In a GWAS, particular attention is placed, for obvious reasons, on single-nucleotide polymorphisms (SNPs) that exceed stringent genome-wide significance thresholds. However, it is expected that many SNPs with only nominal evidence of association (e.g., P < 0.05) truly influence disease. Efforts to extract additional biological information from entire GWAS datasets have primarily focused on pathway-enrichment analyses. However, these methods suffer from a number of limitations and typically fail to lead to testable hypotheses. To evaluate alternative approaches, we performed a systems-level analysis of GWAS data using weighted gene coexpression network analysis. A weighted gene coexpression network was generated for 1918 genes harboring SNPs that displayed nominal evidence of association (P ≤ 0.05) from a GWAS of bone mineral density (BMD) using microarray data on circulating monocytes isolated from individuals with extremely low or high BMD. Thirteen distinct gene modules were identified, each comprising coexpressed and highly interconnected GWAS genes. Through the characterization of module content and topology, we illustrate how network analysis can be used to discover disease-associated subnetworks and characterize novel interactions for genes with a known role in the regulation of BMD. In addition, we provide evidence that network metrics can be used as a prioritizing tool when selecting genes and SNPs for replication studies. Our results highlight the advantages of using systems-level strategies to add value to and inform GWAS. PMID:23316444

  19. The genome-wide structure of the Jewish people.

    PubMed

    Behar, Doron M; Yunusbayev, Bayazit; Metspalu, Mait; Metspalu, Ene; Rosset, Saharon; Parik, Jüri; Rootsi, Siiri; Chaubey, Gyaneshwer; Kutuev, Ildus; Yudkovsky, Guennady; Khusnutdinova, Elza K; Balanovsky, Oleg; Semino, Ornella; Pereira, Luisa; Comas, David; Gurwitz, David; Bonne-Tamir, Batsheva; Parfitt, Tudor; Hammer, Michael F; Skorecki, Karl; Villems, Richard

    2010-07-01

    Contemporary Jews comprise an aggregate of ethno-religious communities whose worldwide members identify with each other through various shared religious, historical and cultural traditions. Historical evidence suggests common origins in the Middle East, followed by migrations leading to the establishment of communities of Jews in Europe, Africa and Asia, in what is termed the Jewish Diaspora. This complex demographic history imposes special challenges in attempting to address the genetic structure of the Jewish people. Although many genetic studies have shed light on Jewish origins and on diseases prevalent among Jewish communities, including studies focusing on uniparentally and biparentally inherited markers, genome-wide patterns of variation across the vast geographic span of Jewish Diaspora communities and their respective neighbours have yet to be addressed. Here we use high-density bead arrays to genotype individuals from 14 Jewish Diaspora communities and compare these patterns of genome-wide diversity with those from 69 Old World non-Jewish populations, of which 25 have not previously been reported. These samples were carefully chosen to provide comprehensive comparisons between Jewish and non-Jewish populations in the Diaspora, as well as with non-Jewish populations from the Middle East and north Africa. Principal component and structure-like analyses identify previously unrecognized genetic substructure within the Middle East. Most Jewish samples form a remarkably tight subcluster that overlies Druze and Cypriot samples but not samples from other Levantine populations or paired Diaspora host populations. In contrast, Ethiopian Jews (Beta Israel) and Indian Jews (Bene Israel and Cochini) cluster with neighbouring autochthonous populations in Ethiopia and western India, respectively, despite a clear paternal link between the Bene Israel and the Levant. These results cast light on the variegated genetic architecture of the Middle East, and trace the origins

  20. A genome-wide association study of anorexia nervosa.

    PubMed

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  1. Genome-Wide Association Studies for Comb Traits in Chickens.

    PubMed

    Shen, Manman; Qu, Liang; Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61-0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  2. Genome-wide linkage in Utah autism pedigrees.

    PubMed

    Allen-Brady, K; Robison, R; Cannon, D; Varvil, T; Villalobos, M; Pingree, C; Leppert, M F; Miller, J; McMahon, W M; Coon, H

    2010-10-01

    Genetic studies of autism over the past decade suggest a complex landscape of multiple genes. In the face of this heterogeneity, studies that include large extended pedigrees may offer valuable insights, as the relatively few susceptibility genes within single large families may be more easily discerned. This genome-wide screen of 70 families includes 20 large extended pedigrees of 6-9 generations, 6 moderate-sized families of 4-5 generations and 44 smaller families of 2-3 generations. The Center for Inherited Disease Research (CIDR) provided genotyping using the Illumina Linkage Panel 12, a 6K single-nucleotide polymorphism (SNP) platform. Results from 192 subjects with an autism spectrum disorder (ASD) and 461 of their relatives revealed genome-wide significance on chromosome 15q, with three possibly distinct peaks: 15q13.1-q14 (heterogeneity LOD (HLOD)=4.09 at 29 459 872 bp); 15q14-q21.1 (HLOD=3.59 at 36 837 208 bp); and 15q21.1-q22.2 (HLOD=5.31 at 55 629 733 bp). Two of these peaks replicate earlier findings. There were additional suggestive results on chromosomes 2p25.3-p24.1 (HLOD=1.87), 7q31.31-q32.3 (HLOD=1.97) and 13q12.11-q12.3 (HLOD=1.93). Affected subjects in families supporting the linkage peaks found in this study did not reveal strong evidence for distinct phenotypic subgroups. PMID:19455147

  3. Genome wide association identifies novel loci involved in fungal communication.

    PubMed

    Palma-Guerrero, Javier; Hall, Charles R; Kowbel, David; Welch, Juliet; Taylor, John W; Brem, Rachel B; Glass, N Louise

    2013-01-01

    Understanding how genomes encode complex cellular and organismal behaviors has become the outstanding challenge of modern genetics. Unlike classical screening methods, analysis of genetic variation that occurs naturally in wild populations can enable rapid, genome-scale mapping of genotype to phenotype with a medium-throughput experimental design. Here we describe the results of the first genome-wide association study (GWAS) used to identify novel loci underlying trait variation in a microbial eukaryote, harnessing wild isolates of the filamentous fungus Neurospora crassa. We genotyped each of a population of wild Louisiana strains at 1 million genetic loci genome-wide, and we used these genotypes to map genetic determinants of microbial communication. In N. crassa, germinated asexual spores (germlings) sense the presence of other germlings, grow toward them in a coordinated fashion, and fuse. We evaluated germlings of each strain for their ability to chemically sense, chemotropically seek, and undergo cell fusion, and we subjected these trait measurements to GWAS. This analysis identified one gene, NCU04379 (cse-1, encoding a homolog of a neuronal calcium sensor), at which inheritance was strongly associated with the efficiency of germling communication. Deletion of cse-1 significantly impaired germling communication and fusion, and two genes encoding predicted interaction partners of CSE1 were also required for the communication trait. Additionally, mining our association results for signaling and secretion genes with a potential role in germling communication, we validated six more previously unknown molecular players, including a secreted protease and two other genes whose deletion conferred a novel phenotype of increased communication and multi-germling fusion. Our results establish protein secretion as a linchpin of germling communication in N. crassa and shed light on the regulation of communication molecules in this fungus. Our study demonstrates the power

  4. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  5. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op t Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O’Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2013-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10−7) in SOX2OT and rs17030795 (P=5.84×10−6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10−6) between CUL3 and FAM124B and rs1886797 (P=8.05×10−6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P= 4×10−6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:21079607

  6. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  7. Genome-wide 5-hydroxymethylcytosine modification pattern is a novel epigenetic feature of globozoospermia.

    PubMed

    Wang, Xiu-Xia; Sun, Bao-Fa; Jiao, Jiao; Chong, Ze-Chen; Chen, Yu-Shen; Wang, Xiao-Li; Zhao, Yue; Zhou, Yi-Ming; Li, Da

    2015-03-30

    Discovery of 5-hydroxymethylcytosine (5hmC) in mammalian genomes has excited the field of epigenetics, but information on the genome-wide distribution of 5hmC is limited. Globozoospermia is a rare but severe cause of male infertility. To date, the epigenetic mechanism, especially 5hmC profiles involved in globozoospermia progression, remains largely unknown. Here, utilizing the chemical labeling and biotin-enrichment approach followed by Illumina HiSeq sequencing, we showed that (i) 6664, 9029 and 6318 genes contain 5hmC in normal, abnormal, and globozoospermia sperm, respectively; (ii) some 5hmC-containing genes significantly involves in spermatogenesis, sperm motility and morphology, and gamete generation; (iii) 5hmC is exclusively localized in sperm intron; (iv) approximately 40% imprinted genes have 5hmC modification in sperm genomes, but globozoospermia sperm exhibiting a large portion of imprinted genes lose the 5hmC modification; (v) six imprinted genes showed different 5hmC patterns in abnormal sperm (GDAP1L1, GNAS, KCNK9, LIN28B, RB1, RTL1), and five imprinted genes showed different 5hmC patterns in globozoospermia sperm (KCNK9, LIN28B, RB1, SLC22A18, ZDBF2). These results suggested that differences in genome-wide 5hmC patterns may in part be responsible for the sperm phenotype. All of this may improve our understanding of the basic molecular mechanism underlying sperm biology and the etiology of male infertility. PMID:25762640

  8. Genome-wide detection of DNase I hypersensitive sites in single cells and FFPE tissue samples.

    PubMed

    Jin, Wenfei; Tang, Qingsong; Wan, Mimi; Cui, Kairong; Zhang, Yi; Ren, Gang; Ni, Bing; Sklar, Jeffrey; Przytycka, Teresa M; Childs, Richard; Levens, David; Zhao, Keji

    2015-12-01

    DNase I hypersensitive sites (DHSs) provide important information on the presence of transcriptional regulatory elements and the state of chromatin in mammalian cells. Conventional DNase sequencing (DNase-seq) for genome-wide DHSs profiling is limited by the requirement of millions of cells. Here we report an ultrasensitive strategy, called single-cell DNase sequencing (scDNase-seq) for detection of genome-wide DHSs in single cells. We show that DHS patterns at the single-cell level are highly reproducible among individual cells. Among different single cells, highly expressed gene promoters and enhancers associated with multiple active histone modifications display constitutive DHS whereas chromatin regions with fewer histone modifications exhibit high variation of DHS. Furthermore, the single-cell DHSs predict enhancers that regulate cell-specific gene expression programs and the cell-to-cell variations of DHS are predictive of gene expression. Finally, we apply scDNase-seq to pools of tumour cells and pools of normal cells, dissected from formalin-fixed paraffin-embedded tissue slides from patients with thyroid cancer, and detect thousands of tumour-specific DHSs. Many of these DHSs are associated with promoters and enhancers critically involved in cancer development. Analysis of the DHS sequences uncovers one mutation (chr18: 52417839G>C) in the tumour cells of a patient with follicular thyroid carcinoma, which affects the binding of the tumour suppressor protein p53 and correlates with decreased expression of its target gene TXNL1. In conclusion, scDNase-seq can reliably detect DHSs in single cells, greatly extending the range of applications of DHS analysis both for basic and for translational research, and may provide critical information for personalized medicine. PMID:26605532

  9. Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis.

    PubMed

    Lee, Young Ho; Bae, Sang-Cheol; Choi, Sung Jae; Ji, Jong Dae; Song, Gwan Gyu

    2012-12-01

    The aim of this study was to explore candidate single nucleotide polymorphisms (SNPs) and candidate mechanisms of systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA). Two SLE genome-wide association studies (GWASs) datasets were included in this study. Meta-analysis was conducted using 737,984 SNPs in 1,527 SLE cases and 3,421 controls of European ancestry, and 4,429 SNPs that met a threshold of p < 0.01 in a Korean RA GWAS dataset was used. ICSNPathway (identify candidate causal SNPs and pathways) analysis was applied to the meta-analysis results of the SLE GWAS datasets, and a RA GWAS dataset. The most significant result of SLE GWAS meta-analysis concerned rs2051549 in the human leukocyte antigen (HLA) region (p = 3.36E-22). In the non-HLA region, meta-analysis identified 6 SNPs associated with SLE with genome-wide significance (STAT4, TNPO3, BLK, FAM167A, and IRF5). ICSNPathway identified five candidate causal SNPs and 13 candidate causal pathways. This pathway-based analysis provides three hypotheses of the biological mechanism involved. First, rs8084 and rs7192 → HLA-DRA → bystander B cell activation. Second, rs1800629 → TNF → cytokine network. Third, rs1150752 and rs185819 → TNXB → collagen metabolic process. ICSNPathway analysis identified three candidate causal non-HLA SNPs and four candidate causal pathways involving the PADI4, MTR, PADI2, and TPH2 genes of RA. We identified five candidate SNPs and thirteen pathways, involving bystander B cell activation, cytokine network, and collagen metabolic processing, which may contribute to SLE susceptibility, and we revealed candidate causal non-HLA SNPs, genes, and pathways of RA. PMID:23053960

  10. Genome-Wide Association Study for Endothelial Growth Factors

    PubMed Central

    Lieb, Wolfgang; Chen, Ming-Huei; Larson, Martin G.; Safa, Radwan; Teumer, Alexander; Baumeister, Sebastian E.; Lin, Honghuang; Smith, Holly M.; Koch, Manja; Lorbeer, Roberto; Völker, Uwe; Nauck, Matthias; Völzke, Henry; Wallaschofski, Henri; Sawyer, Douglas B.; Vasan, Ramachandran S.

    2015-01-01

    Background Endothelial growth factors including angiopoietin-2 (Ang-2), its soluble receptor Tie-2 (sTie-2) and hepatocyte growth factor (HGF) play important roles in angiogenesis, vascular remodeling, local tumor growth and metastatic potential of various cancers. Circulating levels of these biomarkers have a heritable component (between 13% and 56%), but the underlying genetic variation influencing these biomarker levels is largely unknown. Methods and Results We performed a genome-wide association study for circulating Ang-2, sTie-2, and HGF in 3571 Framingham Heart Study (FHS) participants and assessed replication of the top hits for Ang-2 and sTie-2 in 3184 participants of the Study of Health in Pomerania (SHIP). In multivariable-adjusted models, sTie-2 and HGF concentrations were associated with single nucleotide polymorphisms (SNPs) in the genes encoding the respective biomarkers (top p=2.40×10−65 [rs2273720] and 3.64×10−19 [rs5745687], respectively). Likewise, rs2442517 in the MCPH1 gene (in which the Ang-2 gene is embedded) was associated with Ang-2 levels (p=5.05×10−8 in FHS and 8.39×10−5 in SHIP). Furthermore, SNPs in the AB0 gene were associated with sTie-2 (top SNP rs8176693 with p=1.84×10−33 in FHS; p=2.53×10−30 in SHIP) and Ang-2 (rs8176746 with p=2.07×10−8 in FHS; p=0.001 in SHIP) levels on a genome-wide significant level. The top genetic loci explained between 1.7% (Ang-2) and 11.2% (sTie-2) of the inter-individual variation in biomarker levels. Conclusions Genetic variation contributes to the inter-individual variation in growth factor levels and explains a modest proportion of circulating HGF, Ang-2, and Tie-2. This may potentially contribute to the familial susceptibility to cancer, a premise that warrants further studies. PMID:25552591

  11. Minimalist ensemble algorithms for genome-wide protein localization prediction

    PubMed Central

    2012-01-01

    Background Computational prediction of protein subcellular localization can greatly help to elucidate its functions. Despite the existence of dozens of protein localization prediction algorithms, the prediction accuracy and coverage are still low. Several ensemble algorithms have been proposed to improve the prediction performance, which usually include as many as 10 or more individual localization algorithms. However, their performance is still limited by the running complexity and redundancy among individual prediction algorithms. Results This paper proposed a novel method for rational design of minimalist ensemble algorithms for practical genome-wide protein subcellular localization prediction. The algorithm is based on combining a feature selection based filter and a logistic regression classifier. Using a novel concept of contribution scores, we analyzed issues of algorithm redundancy, consensus mistakes, and algorithm complementarity in designing ensemble algorithms. We applied the proposed minimalist logistic regression (LR) ensemble algorithm to two genome-wide datasets of Yeast and Human and compared its performance with current ensemble algorithms. Experimental results showed that the minimalist ensemble algorithm can achieve high prediction accuracy with only 1/3 to 1/2 of individual predictors of current ensemble algorithms, which greatly reduces computational complexity and running time. It was found that the high performance ensemble algorithms are usually composed of the predictors that together cover most of available features. Compared to the best individual predictor, our ensemble algorithm improved the prediction accuracy from AUC score of 0.558 to 0.707 for the Yeast dataset and from 0.628 to 0.646 for the Human dataset. Compared with popular weighted voting based ensemble algorithms, our classifier-based ensemble algorithms achieved much better performance without suffering from inclusion of too many individual predictors. Conclusions We

  12. Genome wide identification of regulatory motifs in Bacillus subtilis

    PubMed Central

    Mwangi, Michael M; Siggia, Eric D

    2003-01-01

    Background To explain the vastly different phenotypes exhibited by the same organism under different conditions, it is essential that we understand how the organism's genes are coordinately regulated. While there are many excellent tools for predicting sequences encoding proteins or RNA genes, few algorithms exist to predict regulatory sequences on a genome wide scale with no prior information. Results To identify motifs involved in the control of transcription, an algorithm was developed that searches upstream of operons for improbably frequent dimers. The algorithm was applied to the B. subtilis genome, which is predicted to encode for approximately 200 DNA binding proteins. The dimers found to be over-represented could be clustered into 317 distinct groups, each thought to represent a class of motifs uniquely recognized by some transcription factor. For each cluster of dimers, a representative weight matrix was derived and scored over the regions upstream of the operons to predict the sites recognized by the cluster's factor, and a putative regulon of the operons immediately downstream of the sites was inferred. The distribution in number of operons per predicted regulon is comparable to that for well characterized transcription factors. The most highly over-represented dimers matched σA, the T-box, and σW sites. We have evidence to suggest that at least 52 of our clusters of dimers represent actual regulatory motifs, based on the groups' weight matrix matches to experimentally characterized sites, the functional similarity of the component operons of the groups' regulons, and the positional biases of the weight matrix matches. All predictions are assigned a significance value, and thresholds are set to avoid false positives. Where possible, we examine our false negatives, drawing examples from known regulatory motifs and regulons inferred from RNA expression data. Conclusions We have demonstrated that in the case of B. subtilis our algorithm allows for the

  13. Defining the RNA polymerase III transcriptome: Genome-wide localization of the RNA polymerase III transcription machinery in human cells

    PubMed Central

    Canella, Donatella; Praz, Viviane; Reina, Jaime H.; Cousin, Pascal; Hernandez, Nouria

    2010-01-01

    Our view of the RNA polymerase III (Pol III) transcription machinery in mammalian cells arises mostly from studies of the RN5S (5S) gene, the Ad2 VAI gene, and the RNU6 (U6) gene, as paradigms for genes with type 1, 2, and 3 promoters. Recruitment of Pol III onto these genes requires prior binding of well-characterized transcription factors. Technical limitations in dealing with repeated genomic units, typically found at mammalian Pol III genes, have so far hampered genome-wide studies of the Pol III transcription machinery and transcriptome. We have localized, genome-wide, Pol III and some of its transcription factors. Our results reveal broad usage of the known Pol III transcription machinery and define a minimal Pol III transcriptome in dividing IMR90hTert fibroblasts. This transcriptome consists of some 500 actively transcribed genes including a few dozen candidate novel genes, of which we confirmed nine as Pol III transcription units by additional methods. It does not contain any of the microRNA genes previously described as transcribed by Pol III, but reveals two other microRNA genes, MIR886 (hsa-mir-886) and MIR1975 (RNY5, hY5, hsa-mir-1975), which are genuine Pol III transcription units. PMID:20413673

  14. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. PMID:25296770

  15. Genome-Wide Association Studies of the Human Gut Microbiota.

    PubMed

    Davenport, Emily R; Cusanovich, Darren A; Michelini, Katelyn; Barreiro, Luis B; Ober, Carole; Gilad, Yoav

    2015-01-01

    The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both). These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%). For example, we identified an association between a taxon known to affect obesity (genus Akkermansia) and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL) mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7). Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut. PMID:26528553

  16. Assessing Predictive Properties of Genome-Wide Selection in Soybeans.

    PubMed

    Xavier, Alencar; Muir, William M; Rainey, Katy Martin

    2016-01-01

    Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set. PMID:27317786

  17. Genome wide association scan for chronic periodontitis implicates novel locus

    PubMed Central

    2014-01-01

    Background There is evidence for a genetic contribution to chronic periodontitis. In this study, we conducted a genome wide association study among 866 participants of the University of Pittsburgh Dental Registry and DNA Repository, whose periodontal diagnosis ranged from healthy (N = 767) to severe chronic periodontitis (N = 99). Methods Genotypingi of over half-million single nucleotide polymorphisms was determined. Analyses were done twice, first in the complete dataset of all ethnicities, and second including only samples defined as self-reported Whites. From the top 100 results, twenty single nucleotide polymorphisms had consistent results in both analyses (borderline p-values ranging from 1E-05 to 1E-6) and were selected to be tested in two independent datasets derived from 1,460 individuals from Porto Alegre, and 359 from Rio de Janeiro, Brazil. Meta-analyses of the Single nucleotide polymorphisms showing a trend for association in the independent dataset were performed. Results The rs1477403 marker located on 16q22.3 showed suggestive association in the discovery phase and in the Porto Alegre dataset (p = 0.05). The meta-analysis suggested the less common allele decreases the risk of chronic periodontitis. Conclusions Our data offer a clear hypothesis to be independently tested regarding the contribution of the 16q22.3 locus to chronic periodontitis. PMID:25008200

  18. Genome-wide association study of aggressive behaviour in chicken.

    PubMed

    Li, Zhenhui; Zheng, Ming; Abdalla, Bahareldin Ali; Zhang, Zhe; Xu, Zhenqiang; Ye, Qiao; Xu, Haiping; Luo, Wei; Nie, Qinghua; Zhang, Xiquan

    2016-01-01

    In the poultry industry, aggressive behaviour is a large animal welfare issue all over the world. To date, little is known about the underlying genetics of the aggressive behaviour. Here, we performed a genome-wide association study (GWAS) to explore the genetic mechanism associated with aggressive behaviour in chickens. The GWAS results showed that a total of 33 SNPs were associated with aggressive behaviour traits (P < 4.6E-6). rs312463697 on chromosome 4 was significantly associated with aggression (P = 2.10905E-07), and it was in the intron region of the sortilin-related VPS10 domain containing receptor 2 (SORCS2) gene. In addition, biological function analysis of the nearest 26 genes around the significant SNPs was performed with Ingenuity Pathway Analysis. An interaction network contained 17 genes was obtained and SORCS2 was involved in this network, interacted with nerve growth factor (NGF), nerve growth factor receptor (NGFR), dopa decarboxylase (L-dopa) and dopamine. After knockdown of SORCS2, the mRNA levels of NGF, L-dopa and dopamine receptor genes DRD1, DRD2, DRD3 and DRD4 were significantly decreased (P < 0.05). In summary, our data indicated that SORCS2 might play an important role in chicken aggressive behaviour through the regulation of dopaminergic pathways and NGF. PMID:27485826

  19. Genome-Wide Discriminatory Information Patterns of Cytosine DNA Methylation.

    PubMed

    Sanchez, Robersy; Mackenzie, Sally A

    2016-01-01

    Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes I R and (2) the uncertainty of not observing a SNP L C R . We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on I R and on LCR, respectively. A statistical-physical relationship between I R and L C R was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251

  20. Imputing Phenotypes for Genome-wide Association Studies.

    PubMed

    Hormozdiari, Farhad; Kang, Eun Yong; Bilow, Michael; Ben-David, Eyal; Vulpe, Chris; McLachlan, Stela; Lusis, Aldons J; Han, Buhm; Eskin, Eleazar

    2016-07-01

    Genome-wide association studies (GWASs) have been successful in detecting variants correlated with phenotypes of clinical interest. However, the power to detect these variants depends on the number of individuals whose phenotypes are collected, and for phenotypes that are difficult to collect, the sample size might be insufficient to achieve the desired statistical power. The phenotype of interest is often difficult to collect, whereas surrogate phenotypes or related phenotypes are easier to collect and have already been collected in very large samples. This paper demonstrates how we take advantage of these additional related phenotypes to impute the phenotype of interest or target phenotype and then perform association analysis. Our approach leverages the correlation structure between phenotypes to perform the imputation. The correlation structure can be estimated from a smaller complete dataset for which both the target and related phenotypes have been collected. Under some assumptions, the statistical power can be computed analytically given the correlation structure of the phenotypes used in imputation. In addition, our method can impute the summary statistic of the target phenotype as a weighted linear combination of the summary statistics of related phenotypes. Thus, our method is applicable to datasets for which we have access only to summary statistics and not to the raw genotypes. We illustrate our approach by analyzing associated loci to triglycerides (TGs), body mass index (BMI), and systolic blood pressure (SBP) in the Northern Finland Birth Cohort dataset. PMID:27292110

  1. Genome-Wide Association Studies of the Human Gut Microbiota

    PubMed Central

    Davenport, Emily R.; Cusanovich, Darren A.; Michelini, Katelyn; Barreiro, Luis B.; Ober, Carole; Gilad, Yoav

    2015-01-01

    The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both). These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%). For example, we identified an association between a taxon known to affect obesity (genus Akkermansia) and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL) mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10−7). Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut. PMID:26528553

  2. Quality Control Procedures for Genome Wide Association Studies

    PubMed Central

    Turner, Stephen; Armstrong, Loren L.; Bradford, Yuki; Carlson, Christopher S.; Crawford, Dana C.; Crenshaw, Andrew T.; de Andrade, Mariza; Doheny, Kimberly F.; Haines, Jonathan L.; Hayes, Geoffrey; Jarvik, Gail; Jiang, Lan; Kullo, Iftikhar J.; Li, Rongling; Ling, Hua; Manolio, Teri A.; Matsumoto, Martha; McCarty, Catherine A.; McDavid, Andrew N.; Mirel, Daniel B.; Paschall, Justin E.; Pugh, Elizabeth W.; Rasmussen, Luke V.; Wilke, Russell A.; Zuvich, Rebecca L.; Ritchie, Marylyn D.

    2011-01-01

    Genome-wide association studies (GWAS) are being conducted at an unprecedented rate in population-based cohorts and have increased our understanding of the pathophysiology of complex disease. The recent application of GWAS to clinic-based cohorts has also yielded genetic predictors of clinical outcomes. Regardless of context, the practical utility of this information will ultimately depend upon the quality of the original data. Quality control (QC) procedures for GWAS are computationally intensive, operationally challenging, and constantly evolving. With each new dataset, new realities are discovered about GWAS data and best practices continue to be developed. The Genomics Workgroup of the National Human Genome Research Institute (NHGRI) funded electronic Medical Records and Genomics (eMERGE) network has invested considerable effort in developing strategies for QC of these data. The lessons learned by this group will be valuable for other investigators dealing with large scale genomic datasets. Here we enumerate some of the challenges in QC of GWAS data and describe the approaches that the eMERGE network is using for quality assurance in GWAS data, thereby minimizing potential bias and error in GWAS results. In this protocol we discuss common issues associated with QC of GWAS data, including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We propose best practices and discuss areas of ongoing and future research. PMID:21234875

  3. Insights into kidney diseases from genome-wide association studies.

    PubMed

    Wuttke, Matthias; Köttgen, Anna

    2016-09-01

    Over the past decade, genome-wide association studies (GWAS) have considerably improved our understanding of the genetic basis of kidney function and disease. Population-based studies, used to investigate traits that define chronic kidney disease (CKD), have identified >50 genomic regions in which common genetic variants associate with estimated glomerular filtration rate or urinary albumin-to-creatinine ratio. Case-control studies, used to study specific CKD aetiologies, have yielded risk loci for specific kidney diseases such as IgA nephropathy and membranous nephropathy. In this Review, we summarize important findings from GWAS and clinical and experimental follow-up studies. We also compare risk allele frequency, effect sizes, and specificity in GWAS of CKD-defining traits and GWAS of specific CKD aetiologies and the implications for study design. Genomic regions identified in GWAS of CKD-defining traits can contain causal genes for monogenic kidney diseases. Population-based research on kidney function traits can therefore generate insights into more severe forms of kidney diseases. Experimental follow-up studies have begun to identify causal genes and variants, which are potential therapeutic targets, and suggest mechanisms underlying the high allele frequency of causal variants. GWAS are thus a useful approach to advance knowledge in nephrology. PMID:27477491

  4. Reconstructing Roma History from Genome-Wide Data

    PubMed Central

    Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I.; Bonin, Michael; Kádaši, Ľudevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla

    2013-01-01

    The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000–1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry–derived from a combination of European and South Asian sources–and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe. PMID:23516520

  5. Assessing Predictive Properties of Genome-Wide Selection in Soybeans

    PubMed Central

    Xavier, Alencar; Muir, William M.; Rainey, Katy Martin

    2016-01-01

    Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set. PMID:27317786

  6. Genome-wide significant risk associations for mucinous ovarian carcinoma

    PubMed Central

    Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790

  7. Genome-Wide Mapping of Yeast RNA Polymerase II Termination

    PubMed Central

    Schaughency, Paul; Merran, Jonathan; Corden, Jeffry L.

    2014-01-01

    Yeast RNA polymerase II (Pol II) terminates transcription of coding transcripts through the polyadenylation (pA) pathway and non-coding transcripts through the non-polyadenylation (non-pA) pathway. We have used PAR-CLIP to map the position of Pol II genome-wide in living yeast cells after depletion of components of either the pA or non-pA termination complexes. We show here that Ysh1, responsible for cleavage at the pA site, is required for efficient removal of Pol II from the template. Depletion of Ysh1 from the nucleus does not, however, lead to readthrough transcription. In contrast, depletion of the termination factor Nrd1 leads to widespread runaway elongation of non-pA transcripts. Depletion of Sen1 also leads to readthrough at non-pA terminators, but in contrast to Nrd1, this readthrough is less processive, or more susceptible to pausing. The data presented here provide delineation of in vivo Pol II termination regions and highlight differences in the sequences that signal termination of different classes of non-pA transcripts. PMID:25299594

  8. Reconstructing Roma history from genome-wide data.

    PubMed

    Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I; Bonin, Michael; Kádaši, Ludevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla

    2013-01-01

    The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000-1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry-derived from a combination of European and South Asian sources-and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe. PMID:23516520

  9. Genome-Wide Specific Selection in Three Domestic Sheep Breeds

    PubMed Central

    Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Background Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. Results We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Conclusions Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding. PMID:26083354

  10. Genome-wide association study of aggressive behaviour in chicken

    PubMed Central

    Li, Zhenhui; Zheng, Ming; Abdalla, Bahareldin Ali; Zhang, Zhe; Xu, Zhenqiang; Ye, Qiao; Xu, Haiping; Luo, Wei; Nie, Qinghua; Zhang, Xiquan

    2016-01-01

    In the poultry industry, aggressive behaviour is a large animal welfare issue all over the world. To date, little is known about the underlying genetics of the aggressive behaviour. Here, we performed a genome-wide association study (GWAS) to explore the genetic mechanism associated with aggressive behaviour in chickens. The GWAS results showed that a total of 33 SNPs were associated with aggressive behaviour traits (P < 4.6E-6). rs312463697 on chromosome 4 was significantly associated with aggression (P = 2.10905E-07), and it was in the intron region of the sortilin-related VPS10 domain containing receptor 2 (SORCS2) gene. In addition, biological function analysis of the nearest 26 genes around the significant SNPs was performed with Ingenuity Pathway Analysis. An interaction network contained 17 genes was obtained and SORCS2 was involved in this network, interacted with nerve growth factor (NGF), nerve growth factor receptor (NGFR), dopa decarboxylase (L-dopa) and dopamine. After knockdown of SORCS2, the mRNA levels of NGF, L-dopa and dopamine receptor genes DRD1, DRD2, DRD3 and DRD4 were significantly decreased (P < 0.05). In summary, our data indicated that SORCS2 might play an important role in chicken aggressive behaviour through the regulation of dopaminergic pathways and NGF. PMID:27485826

  11. A SUPER Powerful Method for Genome Wide Association Study

    PubMed Central

    Pan, Yuchun; Buckler, Edward S.; Zhang, Zhiwu

    2014-01-01

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population structure and cryptic relationships to reduce false positives. However, its intensive computational burden is prohibitive in practice, especially for large samples. The newly developed algorithm, FaST-LMM, solved the computational problem, but requires that the number of SNPs be less than the number of individuals to derive a rank-reduced relationship. This restriction potentially leads to less statistical power when compared to using all SNPs. We developed a method to extract a small subset of SNPs and use them in FaST-LMM. This method not only retains the computational advantage of FaST-LMM, but also remarkably increases statistical power even when compared to using the entire set of SNPs. We named the method SUPER (Settlement of MLM Under Progressively Exclusive Relationship) and made it available within an implementation of the GAPIT software package. PMID:25247812

  12. cgmisc: enhanced genome-wide association analyses and visualization

    PubMed Central

    Kierczak, Marcin; Jabłońska, Jagoda; Forsberg, Simon K. G.; Bianchi, Matteo; Tengvall, Katarina; Pettersson, Mats; Scholz, Veronika; Meadows, Jennifer R. S.; Jern, Patric; Carlborg, Örjan; Lindblad-Toh, Kerstin

    2015-01-01

    Summary: High-throughput genotyping and sequencing technologies facilitate studies of complex genetic traits and provide new research opportunities. The increasing popularity of genome-wide association studies (GWAS) leads to the discovery of new associated loci and a better understanding of the genetic architecture underlying not only diseases, but also other monogenic and complex phenotypes. Several softwares are available for performing GWAS analyses, R environment being one of them. Results: We present cgmisc, an R package that enables enhanced data analysis and visualization of results from GWAS. The package contains several utilities and modules that complement and enhance the functionality of the existing software. It also provides several tools for advanced visualization of genomic data and utilizes the power of the R language to aid in preparation of publication-quality figures. Some of the package functions are specific for the domestic dog (Canis familiaris) data. Availability and implementation: The package is operating system-independent and is available from: https://github.com/cgmisc-team/cgmisc Contact: marcin.kierczak@imbim.uu.se Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26249815

  13. Genome-wide association study of selenium concentrations

    PubMed Central

    Cornelis, Marilyn C.; Fornage, Myriam; Foy, Millennia; Xun, Pengcheng; Gladyshev, Vadim N.; Morris, Steve; Chasman, Daniel I.; Hu, Frank B.; Rimm, Eric B.; Kraft, Peter; Jordan, Joanne M.; Mozaffarian, Dariush; He, Ka

    2015-01-01

    Selenium (Se) is an essential trace element in human nutrition, but its role in certain health conditions, particularly among Se sufficient populations, is controversial. A genome-wide association study (GWAS) of blood Se concentrations previously identified a locus at 5q14 near BHMT. We performed a GW meta-analysis of toenail Se concentrations, which reflect a longer duration of exposure than blood Se concentrations, including 4162 European descendants from four US cohorts. Toenail Se was measured using neutron activation analysis. We identified a GW-significant locus at 5q14 (P < 1 × 10−16), the same locus identified in the published GWAS of blood Se based on independent cohorts. The lead single-nucleotide polymorphism (SNP) explained ∼1% of the variance in toenail Se concentrations. Using GW-summary statistics from both toenail and blood Se, we observed statistical evidence of polygenic overlap (P < 0.001) and meta-analysis of results from studies of either trait (n = 9639) yielded a second GW-significant locus at 21q22.3, harboring CBS (P < 4 × 10−8). Proteins encoded by genes at 5q14 and 21q22.3 function in homocysteine (Hcy) metabolism, and index SNPs for each have previously been associated with betaine and Hcy levels in GWAS. Our findings show evidence of a genetic link between Se and Hcy pathways, both involved in cardiometabolic disease. PMID:25343990

  14. Realizing privacy preserving genome-wide association studies

    PubMed Central

    Simmons, Sean; Berger, Bonnie

    2016-01-01

    Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS) on patient records while ensuring privacy. In particular, there has been growing interest in applying differentially private techniques to this challenge. Unfortunately, up until now all methods for finding high scoring SNPs in a differentially private manner have had major drawbacks in terms of either accuracy or computational efficiency. Results: Here we overcome these limitations with a substantially modified version of the neighbor distance method for performing differentially private GWAS, and thus are able to produce a more viable mechanism. Specifically, we use input perturbation and an adaptive boundary method to overcome accuracy issues. We also design and implement a convex analysis based algorithm to calculate the neighbor distance for each SNP in constant time, overcoming the major computational bottleneck in the neighbor distance method. It is our hope that methods such as ours will pave the way for more widespread use of patient data in biomedical research. Availability and implementation: A python implementation is available at http://groups.csail.mit.edu/cb/DiffPriv/. Contact: bab@csail.mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26769317

  15. Genome-wide significant risk associations for mucinous ovarian carcinoma.

    PubMed

    Kelemen, Linda E; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; Lee, Janet M; Seo, Ji-Heui; Phelan, Catherine M; Beesley, Jonathan; Chen, Xiaoqing; Spindler, Tassja J; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia

    2015-08-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas but not for mucinous ovarian carcinomas (MOCs). Our analysis of 1,644 MOC cases and 21,693 controls with imputation identified 3 new risk associations: rs752590 at 2q13 (P = 3.3 × 10(-8)), rs711830 at 2q31.1 (P = 7.5 × 10(-12)) and rs688187 at 19q13.2 (P = 6.8 × 10(-13)). We identified significant expression quantitative trait locus (eQTL) associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10(-4), false discovery rate (FDR) = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk-associated SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790

  16. Genome wide association study on early puberty in Bos indicus.

    PubMed

    Nascimento, A V; Matos, M C; Seno, L O; Romero, A R S; Garcia, J F; Grisolia, A B

    2016-01-01

    The aim of this study was to evaluate a genome wide association study (GWAS) approach to identify single nucleotide polymorphisms (SNPs) associated with fertility traits (early puberty) in Nellore cattle (Bos indicus). Fifty-five Nellore cows were selected from a herd monitored for early puberty onset (positive pregnancy at 18 months of age). Extremes of this phenotype were selected; 30 and 25 individuals were pregnant and non-pregnant, respectively, at that age. DNA samples were genotyped using a high-density SNP chip (>777.000 SNP). GWAS using a case-control strategy highlighted a number of significant markers based on their proximity with the Bonferroni correction line. Results indicated that chromosomes 5, 6, 9, 10, and 22 were associated with the traits of interest. The most significant SNPs on these chromosomes were rs133039577, rs110013280, rs134702839, rs109551605, and rs41639155. Candidate genes, as well as quantitative trait loci (QTL) previously reported in the Ensembl and Cattle QTLdb databases, were further investigated. Analysis of the regions close to the SNP on chromosomes 9 and 10 revealed that four QTL had been previously classified under the reproduction category. In conclusion, we have identified SNPs in close proximity to genes associated with reproductive traits. Moreover, U6 spliceosomal RNA was present on three different chromosomes, which is possibly associated with age at first calving, suggesting that it might be a strong candidate for future studies. PMID:26909970

  17. Genome-wide DNA Methylation Profiles in Hepatocellular Carcinoma

    PubMed Central

    Shen, Jing; Wang, Shuang; Zhang, Yu-Jing; Kappil, Maya; Wu, Hui-Chen; Kibriya, Muhammad G.; Wang, Qiao; Jasmine, Farzana; Ahsan, Habib; Lee, Po-Huang; Yu, Ming-Whei; Chen, Chien-Jen; Santella, Regina M.

    2012-01-01

    Alterations in DNA methylation frequently occur in hepatocellular cancer (HCC). We have previously demonstrated that hypermethylation in candidate genes can be detected in plasma DNA prior to HCC diagnosis. To identify with a genome-wide approach additional genes hypermethylated in HCC that could be used for more accurate analysis of plasma DNA for early diagnosis, we analyzed tumor and adjacent non-tumor tissues from 62 Taiwanese HCC cases using Illumina methylation arrays that screen 26,486 autosomal CpG sites. After Bonferroni adjustment, a total of 2,324 CpG sites significantly differed in methylation level, with 684 CpG sites significantly hypermethylated and 1,640 hypomethylated in tumor compared to non-tumor tissues. Array data were validated with pyrosequencing in a subset of 5 of these genes; correlation coefficients ranged from 0.92 to 0.97. Analysis of plasma DNA from 38 cases demonstrated that 37% to 63% of cases had detectable hypermethylated DNA (≥5% methylation) for these 5 genes individually. At least one of these genes was hypermethylated in 87% of cases, suggesting that measurement of DNA methylation in plasma samples is feasible. The panel of methylated genes indentified in the current study will be further tested in large cohort of prospectively collected samples to determine their utility as early biomarkers of hepatocellular carcinoma. PMID:22234943

  18. Genome-wide footprinting: ready for prime time?

    PubMed

    Sung, Myong-Hee; Baek, Songjoon; Hager, Gordon L

    2016-03-01

    High-throughput sequencing technologies have allowed many gene locus-level molecular biology assays to become genome-wide profiling methods. DNA-cleaving enzymes such as DNase I have been used to probe accessible chromatin. The accessible regions contain functional regulatory sites, including promoters, insulators and enhancers. Deep sequencing of DNase-seq libraries and computational analysis of the cut profiles have been used to infer protein occupancy in the genome at the nucleotide level, a method introduced as 'digital genomic footprinting'. The approach has been proposed as an attractive alternative to the analysis of transcription factors (TFs) by chromatin immunoprecipitation followed by sequencing (ChIP-seq), and in theory it should overcome antibody issues, poor resolution and batch effects. Recent reports point to limitations of the DNase-based genomic footprinting approach and call into question the scope of detectable protein occupancy, especially for TFs with short-lived chromatin binding. The genomics community is grappling with issues concerning the utility of genomic footprinting and is reassessing the proposed approaches in terms of robust deliverables. Here we summarize the consensus as well as different views emerging from recent reports, and we describe the remaining issues and hurdles for genomic footprinting. PMID:26914206

  19. Genome-wide DNA methylation analysis in hepatocellular carcinoma.

    PubMed

    Yamada, Nobuhisa; Yasui, Kohichiroh; Dohi, Osamu; Gen, Yasuyuki; Tomie, Akira; Kitaichi, Tomoko; Iwai, Naoto; Mitsuyoshi, Hironori; Sumida, Yoshio; Moriguchi, Michihisa; Yamaguchi, Kanji; Nishikawa, Taichiro; Umemura, Atsushi; Naito, Yuji; Tanaka, Shinji; Arii, Shigeki; Itoh, Yoshito

    2016-04-01

    Epigenetic changes as well as genetic changes are mechanisms of tumorigenesis. We aimed to identify novel genes that are silenced by DNA hypermethylation in hepatocellular carcinoma (HCC). We screened for genes with promoter DNA hypermethylation using a genome-wide methylation microarray analysis in primary HCC (the discovery set). The microarray analysis revealed that there were 2,670 CpG sites that significantly differed in regards to the methylation level between the tumor and non-tumor liver tissues; 875 were significantly hypermethylated and 1,795 were significantly hypomethylated in the HCC tumors compared to the non‑tumor tissues. Further analyses using methylation-specific PCR, combined with expression analysis, in the validation set of primary HCC showed that, in addition to three known tumor-suppressor genes (APC, CDKN2A, and GSTP1), eight genes (AKR1B1, GRASP, MAP9, NXPE3, RSPH9, SPINT2, STEAP4, and ZNF154) were significantly hypermethylated and downregulated in the HCC tumors compared to the non-tumor liver tissues. Our results suggest that epigenetic silencing of these genes may be associated with HCC. PMID:26883180

  20. Genome-wide association study of selenium concentrations.

    PubMed

    Cornelis, Marilyn C; Fornage, Myriam; Foy, Millennia; Xun, Pengcheng; Gladyshev, Vadim N; Morris, Steve; Chasman, Daniel I; Hu, Frank B; Rimm, Eric B; Kraft, Peter; Jordan, Joanne M; Mozaffarian, Dariush; He, Ka

    2015-03-01

    Selenium (Se) is an essential trace element in human nutrition, but its role in certain health conditions, particularly among Se sufficient populations, is controversial. A genome-wide association study (GWAS) of blood Se concentrations previously identified a locus at 5q14 near BHMT. We performed a GW meta-analysis of toenail Se concentrations, which reflect a longer duration of exposure than blood Se concentrations, including 4162 European descendants from four US cohorts. Toenail Se was measured using neutron activation analysis. We identified a GW-significant locus at 5q14 (P < 1 × 10(-16)), the same locus identified in the published GWAS of blood Se based on independent cohorts. The lead single-nucleotide polymorphism (SNP) explained ∼1% of the variance in toenail Se concentrations. Using GW-summary statistics from both toenail and blood Se, we observed statistical evidence of polygenic overlap (P < 0.001) and meta-analysis of results from studies of either trait (n = 9639) yielded a second GW-significant locus at 21q22.3, harboring CBS (P < 4 × 10(-8)). Proteins encoded by genes at 5q14 and 21q22.3 function in homocysteine (Hcy) metabolism, and index SNPs for each have previously been associated with betaine and Hcy levels in GWAS. Our findings show evidence of a genetic link between Se and Hcy pathways, both involved in cardiometabolic disease. PMID:25343990

  1. Genome-wide association studies in pharmacogenetics research debate

    PubMed Central

    Bailey, Kent R; Cheng, Cheng

    2016-01-01

    Will genome-wide association studies (GWAS) ‘work’ for pharmacogenetics research? This question was the topic of a staged debate, with pro and con sides, aimed to bring out the strengths and weaknesses of GWAS for pharmacogenetics studies. After a full day of seminars at the Fifth Statistical Analysis Workshop of the Pharmacogenetics Research Network, the lively debate was held – appropriately – at Goonies Comedy Club in Rochester (MN, USA). The pro side emphasized that the many GWAS successes for identifying genetic variants associated with disease risk show that it works; that the current genotyping platforms are efficient, with good imputation methods to fill in missing data; that its global assessment is always a success even if no significant associations are detected; and that genetic effects are likely to be large because humans have not evolved in a drug-therapy environment. By contrast, the con side emphasized that we have limited knowledge of the complexity of the genome; limited clinical phenotypes compromise studies; the likely multifactorial nature of drug response clouding the small genetic effects; and limitations of sample size and replication studies in pharmacogenetic studies. Lively and insightful discussions emphasized further research efforts that might benefit GWAS in pharmacogenetics. PMID:20235786

  2. Genome-Wide Association Studies in Primary Biliary Cirrhosis.

    PubMed

    Gulamhusein, Aliya F; Juran, Brian D; Lazaridis, Konstantinos N

    2015-11-01

    Genome-wide association studies (GWASs) have been a significant technological advance in our ability to evaluate the genetic architecture of complex diseases such as primary biliary cirrhosis (PBC). To date, six large-scale studies have been performed that have identified 27 risk loci in addition to human leukocyte antigen (HLA) associated with PBC. The identified risk variants emphasize important disease concepts; namely, that disturbances in immunoregulatory pathways are important in the pathogenesis of PBC and that such perturbations are shared among a diverse number of autoimmune diseases-suggesting the risk architecture may confer a generalized propensity to autoimmunity not necessarily specific to PBC. Furthermore, the impact of non-HLA risk variants, particularly in genes involved with interleukin-12 signaling, and ethnic variation in conferring susceptibility to PBC have been highlighted. Although GWASs have been a critical stepping stone in understanding common genetic variation contributing to PBC, limitations pertaining to power, sample availability, and strong linkage disequilibrium across genes have left us with an incomplete understanding of the genetic underpinnings of disease pathogenesis. Future efforts to gain insight into this missing heritability, the genetic variation that contributes to important disease outcomes, and the functional consequences of associated variants will be critical if practical clinical translation is to be realized. PMID:26676814

  3. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  4. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  5. Genome-wide linkage-disequilibrium profiles from single individuals.

    PubMed

    Lynch, Michael; Xu, Sen; Maruki, Takahiro; Jiang, Xiaoqian; Pfaffelhuber, Peter; Haubold, Bernhard

    2014-09-01

    Although the analysis of linkage disequilibrium (LD) plays a central role in many areas of population genetics, the sampling variance of LD is known to be very large with high sensitivity to numbers of nucleotide sites and individuals sampled. Here we show that a genome-wide analysis of the distribution of heterozygous sites within a single diploid genome can yield highly informative patterns of LD as a function of physical distance. The proposed statistic, the correlation of zygosity, is closely related to the conventional population-level measure of LD, but is agnostic with respect to allele frequencies and hence likely less prone to outlier artifacts. Application of the method to several vertebrate species leads to the conclusion that >80% of recombination events are typically resolved by gene-conversion-like processes unaccompanied by crossovers, with the average lengths of conversion patches being on the order of one to several kilobases in length. Thus, contrary to common assumptions, the recombination rate between sites does not scale linearly with distance, often even up to distances of 100 kb. In addition, the amount of LD between sites separated by <200 bp is uniformly much greater than can be explained by the conventional neutral model, possibly because of the nonindependent origin of mutations within this spatial scale. These results raise questions about the application of conventional population-genetic interpretations to LD on short spatial scales and also about the use of spatial patterns of LD to infer demographic histories. PMID:24948778

  6. Genome-Wide Discriminatory Information Patterns of Cytosine DNA Methylation

    PubMed Central

    Sanchez, Robersy; Mackenzie, Sally A.

    2016-01-01

    Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes IR and (2) the uncertainty of not observing a SNP LCR. We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on IR and on LCR, respectively. A statistical-physical relationship between IR and LCR was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251

  7. Highly Parallel Genome-Wide Expression Analysis of Single Mammalian Cells

    PubMed Central

    Fan, Jian-Bing; Chen, Jing; April, Craig S.; Fisher, Jeffrey S.; Klotzle, Brandy; Bibikova, Marina; Kaper, Fiona; Ronaghi, Mostafa; Linnarsson, Sten; Ota, Takayo; Chien, Jeremy; Laurent, Louise C.; Nisperos, Sean V.; Chen, Gina Y.; Zhong, Jiang F.

    2012-01-01

    Background We have developed a high-throughput amplification method for generating robust gene expression profiles using single cell or low RNA inputs. Methodology/Principal Findings The method uses tagged priming and template-switching, resulting in the incorporation of universal PCR priming sites at both ends of the synthesized cDNA for global PCR amplification. Coupled with a whole-genome gene expression microarray platform, we routinely obtain expression correlation values of R2∼0.76–0.80 between individual cells and R2∼0.69 between 50 pg total RNA replicates. Expression profiles generated from single cells or 50 pg total RNA correlate well with that generated with higher input (1 ng total RNA) (R2∼0.80). Also, the assay is sufficiently sensitive to detect, in a single cell, approximately 63% of the number of genes detected with 1 ng input, with approximately 97% of the genes detected in the single-cell input also detected in the higher input. Conclusions/Significance In summary, our method facilitates whole-genome gene expression profiling in contexts where starting material is extremely limiting, particularly in areas such as the study of progenitor cells in early development and tumor stem cell biology. PMID:22347404

  8. Genome-wide examination of myoblast cell cycle withdrawal duringdifferentiation

    SciTech Connect

    Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.

    2002-12-02

    Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly

  9. Improved Statistics for Genome-Wide Interaction Analysis

    PubMed Central

    Ueki, Masao; Cordell, Heather J.

    2012-01-01

    Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new “joint effects” statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al

  10. Genome-wide association study of sleep in Drosophila melanogaster

    PubMed Central

    2013-01-01

    Background Sleep is a highly conserved behavior, yet its duration and pattern vary extensively among species and between individuals within species. The genetic basis of natural variation in sleep remains unknown. Results We used the Drosophila Genetic Reference Panel (DGRP) to perform a genome-wide association (GWA) study of sleep in D. melanogaster. We identified candidate single nucleotide polymorphisms (SNPs) associated with differences in the mean as well as the environmental sensitivity of sleep traits; these SNPs typically had sex-specific or sex-biased effects, and were generally located in non-coding regions. The majority of SNPs (80.3%) affecting sleep were at low frequency and had moderately large effects. Additive models incorporating multiple SNPs explained as much as 55% of the genetic variance for sleep in males and females. Many of these loci are known to interact physically and/or genetically, enabling us to place them in candidate genetic networks. We confirmed the role of seven novel loci on sleep using insertional mutagenesis and RNA interference. Conclusions We identified many SNPs in novel loci that are potentially associated with natural variation in sleep, as well as SNPs within genes previously known to affect Drosophila sleep. Several of the candidate genes have human homologues that were identified in studies of human sleep, suggesting that genes affecting variation in sleep are conserved across species. Our discovery of genetic variants that influence environmental sensitivity to sleep may have a wider application to all GWA studies, because individuals with highly plastic genotypes will not have consistent phenotypes. PMID:23617951

  11. Probabilistic Protein Function Prediction from Heterogeneous Genome-Wide Data

    PubMed Central

    Nariai, Naoki; Kolaczyk, Eric D.; Kasif, Simon

    2007-01-01

    Dramatic improvements in high throughput sequencing technologies have led to a staggering growth in the number of predicted genes. However, a large fraction of these newly discovered genes do not have a functional assignment. Fortunately, a variety of novel high-throughput genome-wide functional screening technologies provide important clues that shed light on gene function. The integration of heterogeneous data to predict protein function has been shown to improve the accuracy of automated gene annotation systems. In this paper, we propose and evaluate a probabilistic approach for protein function prediction that integrates protein-protein interaction (PPI) data, gene expression data, protein motif information, mutant phenotype data, and protein localization data. First, functional linkage graphs are constructed from PPI data and gene expression data, in which an edge between nodes (proteins) represents evidence for functional similarity. The assumption here is that graph neighbors are more likely to share protein function, compared to proteins that are not neighbors. The functional linkage graph model is then used in concert with protein domain, mutant phenotype and protein localization data to produce a functional prediction. Our method is applied to the functional prediction of Saccharomyces cerevisiae genes, using Gene Ontology (GO) terms as the basis of our annotation. In a cross validation study we show that the integrated model increases recall by 18%, compared to using PPI data alone at the 50% precision. We also show that the integrated predictor is significantly better than each individual predictor. However, the observed improvement vs. PPI depends on both the new source of data and the functional category to be predicted. Surprisingly, in some contexts integration hurts overall prediction accuracy. Lastly, we provide a comprehensive assignment of putative GO terms to 463 proteins that currently have no assigned function. PMID:17396164

  12. Genome-wide signatures of convergent evolution in echolocating mammals

    PubMed Central

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A.; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J.

    2013-01-01

    Evolution is typically thought to proceed through divergence of genes, proteins, and ultimately phenotypes1-3. However, similar traits might also evolve convergently in unrelated taxa due to similar selection pressures4,5. Adaptive phenotypic convergence is widespread in nature, and recent results from a handful of genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level6-9. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution9,10 although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show for the first time that convergence is not a rare process restricted to a handful of loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four new bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Surprisingly we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognised. PMID:24005325

  13. Genome-wide signatures of convergent evolution in echolocating mammals.

    PubMed

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J

    2013-10-10

    Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized. PMID:24005325

  14. Genome-wide characteristics of de novo mutations in autism

    PubMed Central

    Yuen, Ryan K C; Merico, Daniele; Cao, Hongzhi; Pellecchia, Giovanna; Alipanahi, Babak; Thiruvahindrapuram, Bhooma; Tong, Xin; Sun, Yuhui; Cao, Dandan; Zhang, Tao; Wu, Xueli; Jin, Xin; Zhou, Ze; Liu, Xiaomin; Nalpathamkalam, Thomas; Walker, Susan; Howe, Jennifer L.; Wang, Zhuozhi; MacDonald, Jeffrey R.; Chan, Ada; D’Abate, Lia; Deneault, Eric; Siu, Michelle T.; Tammimies, Kristiina; Uddin, Mohammed; Zarrei, Mehdi; Wang, Mingbang; Li, Yingrui; Wang, Jun; Wang, Jian; Yang, Huanming; Bookman, Matt; Bingham, Jonathan; Gross, Samuel S.; Loy, Dion; Pletcher, Mathew; Marshall, Christian R.; Anagnostou, Evdokia; Zwaigenbaum, Lonnie; Weksberg, Rosanna; Fernandez, Bridget A; Roberts, Wendy; Szatmari, Peter; Glazer, David; Frey, Brendan J.; Ring, Robert H.; Xu, Xun; Scherer, Stephen W.

    2016-01-01

    De novo mutations (DNMs) are important in Autism Spectrum Disorder (ASD), but so far analyses have mainly been on the ~1.5% of the genome encoding genes. Here, we performed whole genome sequencing (WGS) of 200 ASD parent-child trios and characterized germline and somatic DNMs. We confirmed that the majority of germline DNMs (75.6%) originated from the father, and these increased significantly with paternal age only (p=4.2×10−10). However, when clustered DNMs (those within 20kb) were found in ASD, not only did they mostly originate from the mother (p=7.7×10−13), but they could also be found adjacent to de novo copy number variations (CNVs) where the mutation rate was significantly elevated (p=2.4×10−24). By comparing DNMs detected in controls, we found a significant enrichment of predicted damaging DNMs in ASD cases (p=8.0×10−9; OR=1.84), of which 15.6% (p=4.3×10−3) and 22.5% (p=7.0×10−5) were in the non-coding or genic non-coding, respectively. The non-coding elements most enriched for DNM were untranslated regions of genes, boundaries involved in exon-skipping and DNase I hypersensitive regions. Using microarrays and a novel outlier detection test, we also found aberrant methylation profiles in 2/185 (1.1%) of ASD cases. These same individuals carried independently identified DNMs in the ASD risk- and epigenetic- genes DNMT3A and ADNP. Our data begins to characterize different genome-wide DNMs, and highlight the contribution of non-coding variants, to the etiology of ASD. PMID:27525107

  15. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax.

    PubMed

    Sousa, Inês; Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura E Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís; Feijó, Salvato; Oliveira, Sofia A

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22-2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08-2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29-2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  16. Genome-Wide Architecture of Disease Resistance Genes in Lettuce.

    PubMed

    Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W

    2015-12-01

    Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes. PMID:26449254

  17. Identification of Neural Outgrowth Genes using Genome-Wide RNAi

    PubMed Central

    Sepp, Katharine J.; Hong, Pengyu; Lizarraga, Sofia B.; Liu, Judy S.; Mejia, Luis A.; Walsh, Christopher A.; Perrimon, Norbert

    2008-01-01

    While genetic screens have identified many genes essential for neurite outgrowth, they have been limited in their ability to identify neural genes that also have earlier critical roles in the gastrula, or neural genes for which maternally contributed RNA compensates for gene mutations in the zygote. To address this, we developed methods to screen the Drosophila genome using RNA-interference (RNAi) on primary neural cells and present the results of the first full-genome RNAi screen in neurons. We used live-cell imaging and quantitative image analysis to characterize the morphological phenotypes of fluorescently labelled primary neurons and glia in response to RNAi-mediated gene knockdown. From the full genome screen, we focused our analysis on 104 evolutionarily conserved genes that when downregulated by RNAi, have morphological defects such as reduced axon extension, excessive branching, loss of fasciculation, and blebbing. To assist in the phenotypic analysis of the large data sets, we generated image analysis algorithms that could assess the statistical significance of the mutant phenotypes. The algorithms were essential for the analysis of the thousands of images generated by the screening process and will become a valuable tool for future genome-wide screens in primary neurons. Our analysis revealed unexpected, essential roles in neurite outgrowth for genes representing a wide range of functional categories including signalling molecules, enzymes, channels, receptors, and cytoskeletal proteins. We also found that genes known to be involved in protein and vesicle trafficking showed similar RNAi phenotypes. We confirmed phenotypes of the protein trafficking genes Sec61alpha and Ran GTPase using Drosophila embryo and mouse embryonic cerebral cortical neurons, respectively. Collectively, our results showed that RNAi phenotypes in primary neural culture can parallel in vivo phenotypes, and the screening technique can be used to identify many new genes that have

  18. Mosaic paternal genome-wide uniparental isodisomy with down syndrome.

    PubMed

    Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert

    2015-10-01

    We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21[19]/46,XX[4], and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient. PMID:26219535

  19. Genome-wide metabolic (re-) annotation of Kluyveromyces lactis

    PubMed Central

    2012-01-01

    Background Even before having its genome sequence published in 2004, Kluyveromyces lactis had long been considered a model organism for studies in genetics and physiology. Research on Kluyveromyces lactis is quite advanced and this yeast species is one of the few with which it is possible to perform formal genetic analysis. Nevertheless, until now, no complete metabolic functional annotation has been performed to the proteins encoded in the Kluyveromyces lactis genome. Results In this work, a new metabolic genome-wide functional re-annotation of the proteins encoded in the Kluyveromyces lactis genome was performed, resulting in the annotation of 1759 genes with metabolic functions, and the development of a methodology supported by merlin (software developed in-house). The new annotation includes novelties, such as the assignment of transporter superfamily numbers to genes identified as transporter proteins. Thus, the genes annotated with metabolic functions could be exclusively enzymatic (1410 genes), transporter proteins encoding genes (301 genes) or have both metabolic activities (48 genes). The new annotation produced by this work largely surpassed the Kluyveromyces lactis currently available annotations. A comparison with KEGG’s annotation revealed a match with 844 (~90%) of the genes annotated by KEGG, while adding 850 new gene annotations. Moreover, there are 32 genes with annotations different from KEGG. Conclusions The methodology developed throughout this work can be used to re-annotate any yeast or, with a little tweak of the reference organism, the proteins encoded in any sequenced genome. The new annotation provided by this study offers basic knowledge which might be useful for the scientific community working on this model yeast, because new functions have been identified for the so-called metabolic genes. Furthermore, it served as the basis for the reconstruction of a compartmentalized, genome-scale metabolic model of Kluyveromyces lactis, which is

  20. Genome-wide analyses of human perisylvian cerebral cortical patterning

    PubMed Central

    Abrahams, B. S.; Tentler, D.; Perederiy, J. V.; Oldham, M. C.; Coppola, G.; Geschwind, D. H.

    2007-01-01

    Despite the well established role of the frontal and posterior perisylvian cortices in many facets of human-cognitive specializations, including language, little is known about the developmental patterning of these regions in the human brain. We performed a genome-wide analysis of human cerebral patterning during midgestation, a critical epoch in cortical regionalization. A total of 345 genes were identified as differentially expressed between superior temporal gyrus (STG) and the remaining cerebral cortex. Gene ontology categories representing transcription factors were enriched in STG, whereas cell-adhesion and extracellular matrix molecules were enriched in the other cortical regions. Quantitative RT-PCR or in situ hybridization was performed to validate differential expression in a subset of 32 genes, most of which were confirmed. LIM domain-binding 1 (LDB1), which we show to be enriched in the STG, is a recently identified interactor of LIM domain only 4 (LMO4), a gene known to be involved in the asymmetric pattering of the perisylvian region in the developing human brain. Protocadherin 17 (PCDH17), a neuronal cell adhesion molecule, was highly enriched in focal regions of the human prefrontal cortex. Contactin associated protein-like 2 (CNTNAP2), in which mutations are known to cause autism, epilepsy, and language delay, showed a remarkable pattern of anterior-enriched cortical expression in human that was not observed in mouse or rat. These data highlight the importance of expression analysis of human brain and the utility of cross-species comparisons of gene expression. Genes identified here provide a foundation for understanding molecular aspects of human-cognitive specializations and the disorders that disrupt them. PMID:17978184

  1. Genome-Wide Approaches for RNA Structure Probing.

    PubMed

    Silverman, Ian M; Berkowitz, Nathan D; Gosai, Sager J; Gregory, Brian D

    2016-01-01

    RNA molecules of all types fold into complex secondary and tertiary structures that are important for their function and regulation. Structural and catalytic RNAs such as ribosomal RNA (rRNA) and transfer RNA (tRNA) are central players in protein synthesis, and only function through their proper folding into intricate three-dimensional structures. Studies of messenger RNA (mRNA) regulation have also revealed that structural elements embedded within these RNA species are important for the proper regulation of their total level in the transcriptome. More recently, the discovery of microRNAs (miRNAs) and long non-coding RNAs (lncRNAs) has shed light on the importance of RNA structure to genome, transcriptome, and proteome regulation. Due to the relatively small number, high conservation, and importance of structural and catalytic RNAs to all life, much early work in RNA structure analysis mapped out a detailed view of these molecules. Computational and physical methods were used in concert with enzymatic and chemical structure probing to create high-resolution models of these fundamental biological molecules. However, the recent expansion in our knowledge of the importance of RNA structure to coding and regulatory RNAs has left the field in need of faster and scalable methods for high-throughput structural analysis. To address this, nuclease and chemical RNA structure probing methodologies have been adapted for genome-wide analysis. These methods have been deployed to globally characterize thousands of RNA structures in a single experiment. Here, we review these experimental methodologies for high-throughput RNA structure determination and discuss the insights gained from each approach. PMID:27256381

  2. Genome-Wide Association Study of Proneness to Anger

    PubMed Central

    Mick, Eric; McGough, James; Deutsch, Curtis K.; Frazier, Jean A.; Kennedy, David; Goldberg, Robert J.

    2014-01-01

    Background Community samples suggest that approximately 1 in 20 children and adults exhibit clinically significant anger, hostility, and aggression. Individuals with dysregulated emotional control have a greater lifetime burden of psychiatric morbidity, severe impairment in role functioning, and premature mortality due to cardiovascular disease. Methods With publically available data secured from dbGaP, we conducted a genome-wide association study of proneness to anger using the Spielberger State-Trait Anger Scale in the Atherosclerosis Risk in Communities (ARIC) study (n = 8,747). Results Subjects were, on average, 54 (range 45–64) years old at baseline enrollment, 47% (n = 4,117) were male, and all were of European descent by self-report. The mean Angry Temperament and Angry Reaction scores were 5.8±1.8 and 7.6±2.2. We observed a nominally significant finding (p = 2.9E-08, λ = 1.027 - corrected pgc = 2.2E-07, λ = 1.0015) on chromosome 6q21 in the gene coding for the non-receptor protein-tyrosine kinase, Fyn. Conclusions Fyn interacts with NDMA receptors and inositol-1,4,5-trisphosphate (IP3)-gated channels to regulate calcium influx and intracellular release in the post-synaptic density. These results suggest that signaling pathways regulating intracellular calcium homeostasis, which are relevant to memory, learning, and neuronal survival, may in part underlie the expression of Angry Temperament. PMID:24489884

  3. Bioinformatics challenges for genome-wide association studies

    PubMed Central

    Moore, Jason H.; Asselbergs, Folkert W.; Williams, Scott M.

    2010-01-01

    Motivation: The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype–phenotype relationship that is characterized by significant heterogeneity and gene–gene and gene–environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods. Contact: jason.h.moore@dartmouth.edu PMID:20053841

  4. Genome-Wide Architecture of Disease Resistance Genes in Lettuce

    PubMed Central

    Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K.; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W.

    2015-01-01

    Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes. PMID:26449254

  5. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax

    PubMed Central

    Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura e Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22–2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08–2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29–2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  6. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  7. Genome-wide profiling of Populus small RNAs

    PubMed Central

    2009-01-01

    Background Short RNAs, and in particular microRNAs, are important regulators of gene expression both within defined regulatory pathways and at the epigenetic scale. We investigated the short RNA (sRNA) population (18-24 nt) of the transcriptome of green leaves from the sequenced Populus trichocarpa using a concatenation strategy in combination with 454 sequencing. Results The most abundant size class of sRNAs were 24 nt. Long Terminal Repeats were particularly associated with 24 nt sRNAs. Additionally, some repetitive elements were associated with 22 nt sRNAs. We identified an sRNA hot-spot on chromosome 19, overlapping a region containing both the proposed sex-determining locus and a major cluster of NBS-LRR genes. A number of phased siRNA loci were identified, a subset of which are predicted to target PPR and NBS-LRR disease resistance genes, classes of genes that have been significantly expanded in Populus. Additional loci enriched for sRNA production were identified and characterised. We identified 15 novel predicted microRNAs (miRNAs), including miRNA*sequences, and identified a novel locus that may encode a dual miRNA or a miRNA and short interfering RNAs (siRNAs). Conclusions The short RNA population of P. trichocarpa is at least as complex as that of Arabidopsis thaliana. We provide a first genome-wide view of short RNA production for P. trichocarpa and identify new, non-conserved miRNAs. PMID:20021695

  8. Genome-Wide RNAi Screening to Dissect the TGF-β Signal Transduction Pathway.

    PubMed

    Chen, Xiaochu; Xu, Lan

    2016-01-01

    The transforming growth factor-β (TGF-β) family of cytokines figures prominently in regulation of embryonic development and adult tissue homeostasis from Drosophila to mammals. Genetic defects affecting TGF-β signaling underlie developmental disorders and diseases such as cancer in human. Therefore, delineating the molecular mechanism by which TGF-β regulates cell biology is critical for understanding normal biology and disease mechanisms. Forward genetic screens in model organisms and biochemical approaches in mammalian tissue culture were instrumental in initial characterization of the TGF-β signal transduction pathway. With complete sequence information of the genomes and the advent of RNA interference (RNAi) technology, genome-wide RNAi screening emerged as a powerful functional genomics approach to systematically delineate molecular components of signal transduction pathways. Here, we describe a protocol for image-based whole-genome RNAi screening aimed at identifying molecules required for TGF-β signaling into the nucleus. Using this protocol we examined >90 % of annotated Drosophila open reading frames (ORF) individually and successfully uncovered several novel factors serving critical roles in the TGF-β pathway. Thus cell-based high-throughput functional genomics can uncover new mechanistic insights on signaling pathways beyond what the classical genetics had revealed. PMID:26520138

  9. Genome-wide association study of behavioral, physiological and gene expression traits in outbred CFW mice.

    PubMed

    Parker, Clarissa C; Gopalakrishnan, Shyam; Carbonetto, Peter; Gonzales, Natalia M; Leung, Emily; Park, Yeonhee J; Aryee, Emmanuel; Davis, Joe; Blizard, David A; Ackert-Bicknell, Cheryl L; Lionikas, Arimantas; Pritchard, Jonathan K; Palmer, Abraham A

    2016-08-01

    Although mice are the most widely used mammalian model organism, genetic studies have suffered from limited mapping resolution due to extensive linkage disequilibrium (LD) that is characteristic of crosses among inbred strains. Carworth Farms White (CFW) mice are a commercially available outbred mouse population that exhibit rapid LD decay in comparison to other available mouse populations. We performed a genome-wide association study (GWAS) of behavioral, physiological and gene expression phenotypes using 1,200 male CFW mice. We used genotyping by sequencing (GBS) to obtain genotypes at 92,734 SNPs. We also measured gene expression using RNA sequencing in three brain regions. Our study identified numerous behavioral, physiological and expression quantitative trait loci (QTLs). We integrated the behavioral QTL and eQTL results to implicate specific genes, including Azi2 in sensitivity to methamphetamine and Zmynd11 in anxiety-like behavior. The combination of CFW mice, GBS and RNA sequencing constitutes a powerful approach to GWAS in mice. PMID:27376237

  10. Next-generation libraries for robust RNA interference-based genome-wide screens

    PubMed Central

    Kampmann, Martin; Horlbeck, Max A.; Chen, Yuwen; Tsai, Jordan C.; Bassik, Michael C.; Gilbert, Luke A.; Villalta, Jacqueline E.; Kwon, S. Chul; Chang, Hyeshik; Kim, V. Narry; Weissman, Jonathan S.

    2015-01-01

    Genetic screening based on loss-of-function phenotypes is a powerful discovery tool in biology. Although the recent development of clustered regularly interspaced short palindromic repeats (CRISPR)-based screening approaches in mammalian cell culture has enormous potential, RNA interference (RNAi)-based screening remains the method of choice in several biological contexts. We previously demonstrated that ultracomplex pooled short-hairpin RNA (shRNA) libraries can largely overcome the problem of RNAi off-target effects in genome-wide screens. Here, we systematically optimize several aspects of our shRNA library, including the promoter and microRNA context for shRNA expression, selection of guide strands, and features relevant for postscreen sample preparation for deep sequencing. We present next-generation high-complexity libraries targeting human and mouse protein-coding genes, which we grouped into 12 sublibraries based on biological function. A pilot screen suggests that our next-generation RNAi library performs comparably to current CRISPR interference (CRISPRi)-based approaches and can yield complementary results with high sensitivity and high specificity. PMID:26080438

  11. Genome-wide nucleosome specificity and function of chromatin remodellers in ES cells

    PubMed Central

    de Dieuleveult, Maud; Yen, Kuangyu; Hmitou, Isabelle; Depaux, Arnaud; Boussouar, Fayçal; Dargham, Daria Bou; Jounier, Sylvie; Humbertclaude, Hélène; Ribierre, Florence; Baulard, Céline; Farrell, Nina P.; Park, Bongsoo; Keime, Céline; Carrière, Lucie; Berlivet, Soizick; Gut, Marta; Gut, Ivo; Werner, Michel; Deleuze, Jean-François; Olaso, Robert; Aude, Jean-Christophe; Chantalat, Sophie; Pugh, B. Franklin; Gérard, Matthieu

    2015-01-01

    Summary ATP-dependent chromatin remodellers allow access to DNA for transcription factors and the general transcription machinery, but whether mammalian chromatin remodellers1–3 target specific nucleosomes to regulate transcription is unclear. Here, we present genome-wide remodeller-nucleosome interaction profiles for Chd1, Chd2, Chd4, Chd6, Chd8, Chd9, Brg1 and Ep400 in mouse embryonic stem (ES) cells. These remodellers bind one or both full nucleosomes that flank MNase-defined nucleosome-free promoter regions (NFRs), where they separate divergent transcription. Surprisingly, large CpG-rich NFRs that extend downstream of annotated transcriptional start sites (TSSs) are nevertheless chromatinized with non-nucleosomal or subnucleosomal histone variants (H3.3 and H2A.Z) and modifications (H3K4me3 and H3K27ac). RNA polymerase (pol) II therefore navigates hundreds of bp of altered chromatin in the sense direction before encountering an MNase-resistant nucleosome at the 3′ end of the NFR. Transcriptome analysis upon remodeller depletion reveals reciprocal mechanisms of transcriptional regulation by remodellers. Whereas at active genes individual remodellers play either positive or negative roles via altering nucleosome stability, at polycomb-enriched bivalent genes the same remodellers act in an opposite manner. These findings indicate that remodellers target specific nucleosomes at the edge of NFRs, where they regulate ES cell transcriptional programs. PMID:26814966

  12. Genome-wide RNAi screen for nuclear actin reveals a network of cofilin regulators

    PubMed Central

    Dopie, Joseph; Rajakylä, Eeva K.; Joensuu, Merja S.; Huet, Guillaume; Ferrantelli, Evelina; Xie, Tiao; Jäälinoja, Harri; Jokitalo, Eija; Vartiainen, Maria K.

    2015-01-01

    ABSTRACT Nuclear actin plays an important role in many processes that regulate gene expression. Cytoplasmic actin dynamics are tightly controlled by numerous actin-binding proteins, but regulation of nuclear actin has remained unclear. Here, we performed a genome-wide RNA interference (RNAi) screen in Drosophila cells to identify proteins that influence either nuclear polymerization or import of actin. We validate 19 factors as specific hits, and show that Chinmo (known as Bach2 in mammals), SNF4Aγ (Prkag1 in mammals) and Rab18 play a role in nuclear localization of actin in both fly and mammalian cells. We identify several new regulators of cofilin activity, and characterize modulators of both cofilin kinases and phosphatase. For example, Chinmo/Bach2, which regulates nuclear actin levels also in vivo, maintains active cofilin by repressing the expression of the kinase Cdi (Tesk in mammals). Finally, we show that Nup98 and lamin are candidates for regulating nuclear actin polymerization. Our screen therefore reveals new aspects of actin regulation and links nuclear actin to many cellular processes. PMID:26021350

  13. Genome-wide RNAi screen for nuclear actin reveals a network of cofilin regulators.

    PubMed

    Dopie, Joseph; Rajakylä, Eeva K; Joensuu, Merja S; Huet, Guillaume; Ferrantelli, Evelina; Xie, Tiao; Jäälinoja, Harri; Jokitalo, Eija; Vartiainen, Maria K

    2015-07-01

    Nuclear actin plays an important role in many processes that regulate gene expression. Cytoplasmic actin dynamics are tightly controlled by numerous actin-binding proteins, but regulation of nuclear actin has remained unclear. Here, we performed a genome-wide RNA interference (RNAi) screen in Drosophila cells to identify proteins that influence either nuclear polymerization or import of actin. We validate 19 factors as specific hits, and show that Chinmo (known as Bach2 in mammals), SNF4Aγ (Prkag1 in mammals) and Rab18 play a role in nuclear localization of actin in both fly and mammalian cells. We identify several new regulators of cofilin activity, and characterize modulators of both cofilin kinases and phosphatase. For example, Chinmo/Bach2, which regulates nuclear actin levels also in vivo, maintains active cofilin by repressing the expression of the kinase Cdi (Tesk in mammals). Finally, we show that Nup98 and lamin are candidates for regulating nuclear actin polymerization. Our screen therefore reveals new aspects of actin regulation and links nuclear actin to many cellular processes. PMID:26021350

  14. A Genome-wide CRISPR Screen in Toxoplasma Identifies Essential Apicomplexan Genes.

    PubMed

    Sidik, Saima M; Huet, Diego; Ganesan, Suresh M; Huynh, My-Hang; Wang, Tim; Nasamu, Armiyaw S; Thiru, Prathapan; Saeij, Jeroen P J; Carruthers, Vern B; Niles, Jacquin C; Lourido, Sebastian

    2016-09-01

    Apicomplexan parasites are leading causes of human and livestock diseases such as malaria and toxoplasmosis, yet most of their genes remain uncharacterized. Here, we present the first genome-wide genetic screen of an apicomplexan. We adapted CRISPR/Cas9 to assess the contribution of each gene from the parasite Toxoplasma gondii during infection of human fibroblasts. Our analysis defines ∼200 previously uncharacterized, fitness-conferring genes unique to the phylum, from which 16 were investigated, revealing essential functions during infection of human cells. Secondary screens identify as an invasion factor the claudin-like apicomplexan microneme protein (CLAMP), which resembles mammalian tight-junction proteins and localizes to secretory organelles, making it critical to the initiation of infection. CLAMP is present throughout sequenced apicomplexan genomes and is essential during the asexual stages of the malaria parasite Plasmodium falciparum. These results provide broad-based functional information on T. gondii genes and will facilitate future approaches to expand the horizon of antiparasitic interventions. PMID:27594426

  15. Methodological Issues in Multistage Genome-wide Association Studies

    PubMed Central

    Thomas, Duncan C.; Casey, Graham; Conti, David V.; Haile, Robert W.; Lewinger, Juan Pablo; Stram, Daniel O.

    2010-01-01

    Because of the high cost of commercial genotyping chip technologies, many investigations have used a two-stage design for genome-wide association studies, using part of the sample for an initial discovery of “promising” SNPs at a less stringent significance level and the remainder in a joint analysis of just these SNPs using custom genotyping. Typical cost savings of about 50% are possible with this design to obtain comparable levels of overall type I error and power by using about half the sample for stage I and carrying about 0.1% of SNPs forward to the second stage, the optimal design depending primarily upon the ratio of costs per genotype for stages I and II. However, with the rapidly declining costs of the commercial panels, the generally low observed ORs of current studies, and many studies aiming to test multiple hypotheses and multiple endpoints, many investigators are abandoning the two-stage design in favor of simply genotyping all available subjects using a standard high-density panel. Concern is sometimes raised about the absence of a “replication” panel in this approach, as required by some high-profile journals, but it must be appreciated that the two-stage design is not a discovery/replication design but simply a more efficient design for discovery using a joint analysis of the data from both stages. Once a subset of highly-significant associations has been discovered, a truly independent “exact replication” study is needed in a similar population of the same promising SNPs using similar methods. This can then be followed by (1) “generalizability” studies to assess the full scope of replicated associations across different races, different endpoints, different interactions, etc.; (2) fine-mapping or re-sequencing to try to identify the causal variant; and (3) experimental studies of the biological function of these genes. Multistage sampling designs may be more useful at this stage, say for selecting subsets of subjects for deep re

  16. Genome-wide association study of colorectal cancer in Hispanics.

    PubMed

    Schmit, Stephanie L; Schumacher, Fredrick R; Edlund, Christopher K; Conti, David V; Ihenacho, Ugonna; Wan, Peggy; Van Den Berg, David; Casey, Graham; Fortini, Barbara K; Lenz, Heinz-Josef; Tusié-Luna, Teresa; Aguilar-Salinas, Carlos A; Moreno-Macías, Hortensia; Huerta-Chagoya, Alicia; Ordóñez-Sánchez, María Luisa; Rodríguez-Guillén, Rosario; Cruz-Bautista, Ivette; Rodríguez-Torres, Maribel; Muñóz-Hernández, Linda Liliana; Arellano-Campos, Olimpia; Gómez, Donají; Alvirde, Ulices; González-Villalpando, Clicerio; González-Villalpando, María Elena; Le Marchand, Loic; Haiman, Christopher A; Figueiredo, Jane C

    2016-06-01

    Genome-wide association studies (GWAS) have identified 58 susceptibility alleles across 37 regions associated with the risk of colorectal cancer (CRC) with P < 5×10(-8) Most studies have been conducted in non-Hispanic whites and East Asians; however, the generalizability of these findings and the potential for ethnic-specific risk variation in Hispanic and Latino (HL) individuals have been largely understudied. We describe the first GWAS of common genetic variation contributing to CRC risk in HL (1611 CRC cases and 4330 controls). We also examine known susceptibility alleles and implement imputation-based fine-mapping to identify potential ethnicity-specific association signals in known risk regions. We discovered 17 variants across 4 independent regions that merit further investigation due to suggestive CRC associations (P < 1×10(-6)) at 1p34.3 (rs7528276; Odds Ratio (OR) = 1.86 [95% confidence interval (CI): 1.47-2.36); P = 2.5×10(-7)], 2q23.3 (rs1367374; OR = 1.37 (95% CI: 1.21-1.55); P = 4.0×10(-7)), 14q24.2 (rs143046984; OR = 1.65 (95% CI: 1.36-2.01); P = 4.1×10(-7)) and 16q12.2 [rs142319636; OR = 1.69 (95% CI: 1.37-2.08); P=7.8×10(-7)]. Among the 57 previously published CRC susceptibility alleles with minor allele frequency ≥1%, 76.5% of SNPs had a consistent direction of effect and 19 (33.3%) were nominally statistically significant (P < 0.05). Further, rs185423955 and rs60892987 were identified as novel secondary susceptibility variants at 3q26.2 (P = 5.3×10(-5)) and 11q12.2 (P = 6.8×10(-5)), respectively. Our findings demonstrate the importance of fine mapping in HL. These results are informative for variant prioritization in functional studies and future risk prediction modeling in minority populations. PMID:27207650

  17. Genome-wide association study of colorectal cancer in Hispanics

    PubMed Central

    Schmit, Stephanie L.; Schumacher, Fredrick R.; Edlund, Christopher K.; Conti, David V.; Ihenacho, Ugonna; Wan, Peggy; Van Den Berg, David; Casey, Graham; Fortini, Barbara K.; Lenz, Heinz-Josef; Tusié-Luna, Teresa; Aguilar-Salinas, Carlos A.; Moreno-Macías, Hortensia; Huerta-Chagoya, Alicia; Ordóñez-Sánchez, María Luisa; Rodríguez-Guillén, Rosario; Cruz-Bautista, Ivette; Rodríguez-Torres, Maribel; Muñóz-Hernández, Linda Liliana; Arellano-Campos, Olimpia; Gómez, Donají; Alvirde, Ulices; González-Villalpando, Clicerio; González-Villalpando, María Elena; Le Marchand, Loic; Haiman, Christopher A.; Figueiredo, Jane C.

    2016-01-01

    Genome-wide association studies (GWAS) have identified 58 susceptibility alleles across 37 regions associated with the risk of colorectal cancer (CRC) with P < 5×10−8. Most studies have been conducted in non-Hispanic whites and East Asians; however, the generalizability of these findings and the potential for ethnic-specific risk variation in Hispanic and Latino (HL) individuals have been largely understudied. We describe the first GWAS of common genetic variation contributing to CRC risk in HL (1611 CRC cases and 4330 controls). We also examine known susceptibility alleles and implement imputation-based fine-mapping to identify potential ethnicity-specific association signals in known risk regions. We discovered 17 variants across 4 independent regions that merit further investigation due to suggestive CRC associations (P < 1×10−6) at 1p34.3 (rs7528276; Odds Ratio (OR) = 1.86 [95% confidence interval (CI): 1.47–2.36); P = 2.5×10−7], 2q23.3 (rs1367374; OR = 1.37 (95% CI: 1.21–1.55); P = 4.0×10−7), 14q24.2 (rs143046984; OR = 1.65 (95% CI: 1.36–2.01); P = 4.1×10−7) and 16q12.2 [rs142319636; OR = 1.69 (95% CI: 1.37–2.08); P=7.8×10−7]. Among the 57 previously published CRC susceptibility alleles with minor allele frequency ≥1%, 76.5% of SNPs had a consistent direction of effect and 19 (33.3%) were nominally statistically significant (P < 0.05). Further, rs185423955 and rs60892987 were identified as novel secondary susceptibility variants at 3q26.2 (P = 5.3×10–5) and 11q12.2 (P = 6.8×10−5), respectively. Our findings demonstrate the importance of fine mapping in HL. These results are informative for variant prioritization in functional studies and future risk prediction modeling in minority populations. PMID:27207650

  18. A Genome-wide Pleiotropy Scan for Prostate Cancer Risk

    PubMed Central

    Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K

    2014-01-01

    Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did

  19. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  20. Case-Control Genome-Wide Association Study of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…

  1. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  2. Genome-wide screening and identification of antigens for rickettsial vaccine development

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...

  3. Genome-wide gene expression and DNA methylation differences in abnormally cloned and normally natural mating piglets.

    PubMed

    Zou, C; Fu, Y; Li, C; Liu, H; Li, G; Li, J; Zhang, H; Wu, Y; Li, C

    2016-08-01

    Many studies have proved that DNA methylation can regulate gene expression and further affect skeletal muscle growth and development of pig, whereas the mechanisms of how DNA methylation or gene expression alteration ultimately lead to phenotypical differences between the cloned and natural mating pigs remain elusive. This study aimed to investigate genome-wide gene expression and DNA methylation differences between abnormally cloned and normally natural mating piglets and identify molecular markers related to skeletal muscle growth and development in pig. The DNA methylation and genome-wide gene expression in the two groups of piglets were analysed through methylated DNA immunoprecipitation binding high-throughput sequencing and RNA sequencing respectively. We detected 1493 differentially expressed genes between the two groups, of which 382 genes were also differentially methylated. The results of the integrative analysis between DNA methylation and gene expression revealed that the DNA methylation levels showed a significantly negative and monotonic correlation with gene expression levels around the transcription start site of genes. By contrast, no notable monotonic correlation was observed in other regions. Furthermore, we identified some interesting genes and signalling pathways (e.g. myosin, heavy chain 7 and mammalian target of rapamycin) which possibly play essential roles in skeletal muscle growth and development. The results of this study provide insights into the relationship of DNA methylation with gene expression in newborn piglets and into the mechanisms in abnormally cloned animals through somatic cell nuclear transfer. PMID:27028246

  4. No Genome-Wide Protein Sequence Convergence for Echolocation

    PubMed Central

    Zou, Zhengting; Zhang, Jianzhi

    2015-01-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation. PMID:25631925

  5. No genome-wide protein sequence convergence for echolocation.

    PubMed

    Zou, Zhengting; Zhang, Jianzhi

    2015-05-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation. PMID:25631925

  6. Genome-wide RNAi Screening Identifies Protein Modules Required for 40S Subunit Synthesis in Human Cells.

    PubMed

    Badertscher, Lukas; Wild, Thomas; Montellese, Christian; Alexander, Leila T; Bammert, Lukas; Sarazova, Marie; Stebler, Michael; Csucs, Gabor; Mayer, Thomas U; Zamboni, Nicola; Zemp, Ivo; Horvath, Peter; Kutay, Ulrike

    2015-12-29

    Ribosome biogenesis is a highly complex process requiring many assisting factors. Studies in yeast have yielded comprehensive knowledge of the cellular machinery involved in this process. However, many aspects of ribosome synthesis are different in higher eukaryotes, and the global set of mammalian ribosome biogenesis factors remains unexplored. We used an imaging-based, genome-wide RNAi screen to find human proteins involved in 40S ribosomal subunit biogenesis. Our analysis identified ∼ 300 factors, many part of essential protein modules such as the small subunit (SSU) processome, the eIF3 and chaperonin complexes, and the ubiquitin-proteasome system. We demonstrate a role for the vertebrate-specific factor RBIS in ribosome synthesis, uncover a requirement for the CRL4 E3 ubiquitin ligase in nucleolar ribosome biogenesis, and reveal that intracellular glutamine synthesis supports 40S subunit production. PMID:26711351

  7. Use of genome-wide RNAi screens to identify regulators of embryonic stem cell self-renewal and pluripotency

    PubMed Central

    Zheng, Xiaofeng; Hu, Guang

    2014-01-01

    Summary Embryonic stem cells (ESCs) are characterized by two defining features: pluripotency and self-renewal. They hold tremendous promise for both basic research and regenerative medicine. To fully realize their potentials, it is important to understand the molecular mechanisms regulating ESC pluripotency and self-renewal. The development of RNA interference (RNAi) technology has revolutionized functional genetic studies in mammalian cells. In recent years, genome-wide RNAi screens have been adopted to systematically study ESC pluripotency and self-renewal, and have uncovered many previously unknown regulators, including transcription factors, chromatin remodelers, and post-transcriptional modulators. Here, we describe a method for the identification of regulators of ESC pluripotency and self-renewal using RNAi screens, as well as assays for further validation and characterization of the identified candidates. With modifications, this method can also be adapted to study the fate specification events during ESC differentiation. PMID:24743997

  8. Consistency-based rectification of nonrigid registrations

    PubMed Central

    Gass, Tobias; Székely, Gábor; Goksel, Orcun

    2015-01-01

    Abstract. We present a technique to rectify nonrigid registrations by improving their group-wise consistency, which is a widely used unsupervised measure to assess pair-wise registration quality. While pair-wise registration methods cannot guarantee any group-wise consistency, group-wise approaches typically enforce perfect consistency by registering all images to a common reference. However, errors in individual registrations to the reference then propagate, distorting the mean and accumulating in the pair-wise registrations inferred via the reference. Furthermore, the assumption that perfect correspondences exist is not always true, e.g., for interpatient registration. The proposed consistency-based registration rectification (CBRR) method addresses these issues by minimizing the group-wise inconsistency of all pair-wise registrations using a regularized least-squares algorithm. The regularization controls the adherence to the original registration, which is additionally weighted by the local postregistration similarity. This allows CBRR to adaptively improve consistency while locally preserving accurate pair-wise registrations. We show that the resulting registrations are not only more consistent, but also have lower average transformation error when compared to known transformations in simulated data. On clinical data, we show improvements of up to 50% target registration error in breathing motion estimation from four-dimensional MRI and improvements in atlas-based segmentation quality of up to 65% in terms of mean surface distance in three-dimensional (3-D) CT. Such improvement was observed consistently using different registration algorithms, dimensionality (two-dimensional/3-D), and modalities (MRI/CT). PMID:26158083

  9. Impact of high predation risk on genome-wide hippocampal gene expression in snowshoe hares.

    PubMed

    Lavergne, Sophia G; McGowan, Patrick O; Krebs, Charles J; Boonstra, Rudy

    2014-11-01

    The population dynamics of snowshoe hares (Lepus americanus) are fundamental to the ecosystem dynamics of Canada's boreal forest. During the 8- to 11-year population cycle, hare densities can fluctuate up to 40-fold. Predators in this system (lynx, coyotes, great-horned owls) affect population numbers not only through direct mortality but also through sublethal effects. The chronic stress hypothesis posits that high predation risk during the decline severely stresses hares, leading to greater stress responses, heightened ability to mobilize cortisol and energy, and a poorer body condition. These effects may result in, or be mediated by, differential gene expression. We used an oligonucleotide microarray designed for a closely-related species, the European rabbit (Oryctolagus cuniculus), to characterize differences in genome-wide hippocampal RNA transcript abundance in wild hares from the Yukon during peak and decline phases of a single cycle. A total of 106 genes were differentially regulated between phases. Array results were validated with quantitative real-time PCR, and mammalian protein sequence similarity was used to infer gene function. In comparison to hares from the peak, decline phase hares showed increased expression of genes involved in metabolic processes and hormone response, and decreased expression of immune response and blood cell formation genes. We found evidence for predation risk effects on the expression of genes whose putative functions correspond with physiological impacts known to be induced by predation risk in snowshoe hares. This study shows, for the first time, a link between changes in demography and alterations in neural RNA transcript abundance in a natural population. PMID:25234370

  10. Genome-wide siRNA screen for mediators of NF-κB activation

    PubMed Central

    Gewurz, Benjamin E.; Towfic, Fadi; Mar, Jessica C.; Shinners, Nicholas P.; Takasaki, Kaoru; Zhao, Bo; Cahir-McFarland, Ellen D.; Quackenbush, John; Xavier, Ramnik J.; Kieff, Elliott

    2012-01-01

    Although canonical NFκB is frequently critical for cell proliferation, survival, or differentiation, NFκB hyperactivation can cause malignant, inflammatory, or autoimmune disorders. Despite intensive study, mammalian NFκB pathway loss-of-function RNAi analyses have been limited to specific protein classes. We therefore undertook a human genome-wide siRNA screen for novel NFκB activation pathway components. Using an Epstein Barr virus latent membrane protein (LMP1) mutant, the transcriptional effects of which are canonical NFκB-dependent, we identified 155 proteins significantly and substantially important for NFκB activation in HEK293 cells. These proteins included many kinases, phosphatases, ubiquitin ligases, and deubiquinating enzymes not previously known to be important for NFκB activation. Relevance to other canonical NFκB pathways was extended by finding that 118 of the 155 LMP1 NF-κB activation pathway components were similarly important for IL-1β–, and 79 for TNFα–mediated NFκB activation in the same cells. MAP3K8, PIM3, and six other enzymes were uniquely relevant to LMP1-mediated NFκB activation. Most novel pathway components functioned upstream of IκB kinase complex (IKK) activation. Robust siRNA knockdown effects were confirmed for all mRNAs or proteins tested. Although multiple ZC3H-family proteins negatively regulate NFκB, ZC3H13 and ZC3H18 were activation pathway components. ZC3H13 was critical for LMP1, TNFα, and IL-1β NFκB-dependent transcription, but not for IKK activation, whereas ZC3H18 was critical for IKK activation. Down-modulators of LMP1 mediated NFκB activation were also identified. These experiments identify multiple targets to inhibit or stimulate LMP1-, IL-1β–, or TNFα–mediated canonical NFκB activation. PMID:22308454

  11. Genome-Wide Inference of Ancestral Recombination Graphs

    PubMed Central

    Rasmussen, Matthew D.; Hubisz, Melissa J.; Gronau, Ilan; Siepel, Adam

    2014-01-01

    The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the “ancestral recombination graph” (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of chromosomes conditional on an ARG of chromosomes, an operation we call “threading.” Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps. PMID:24831947

  12. Genome-wide inference of ancestral recombination graphs.

    PubMed

    Rasmussen, Matthew D; Hubisz, Melissa J; Gronau, Ilan; Siepel, Adam

    2014-01-01

    The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the "ancestral recombination graph" (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of [Formula: see text] chromosomes conditional on an ARG of [Formula: see text] chromosomes, an operation we call "threading." Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps. PMID:24831947

  13. General metabolism of Laribacter hongkongensis: a genome-wide analysis

    PubMed Central

    2011-01-01

    Background Laribacter hongkongensis is associated with community-acquired gastroenteritis and traveler's diarrhea. In this study, we performed an in-depth annotation of the genes and pathways of the general metabolism of L. hongkongensis and correlated them with its phenotypic characteristics. Results The L. hongkongensis genome possesses the pentose phosphate and gluconeogenesis pathways and tricarboxylic acid and glyoxylate cycles, but incomplete Embden-Meyerhof-Parnas and Entner-Doudoroff pathways, in agreement with its asaccharolytic phenotype. It contains enzymes for biosynthesis and β-oxidation of saturated fatty acids, biosynthesis of all 20 universal amino acids and selenocysteine, the latter not observed in Neisseria gonorrhoeae, Neisseria meningitidis and Chromobacterium violaceum. The genome contains a variety of dehydrogenases, enabling it to utilize different substrates as electron donors. It encodes three terminal cytochrome oxidases for respiration using oxygen as the electron acceptor under aerobic and microaerophilic conditions and four reductases for respiration with alternative electron acceptors under anaerobic conditions. The presence of complete tetrathionate reductase operon may confer survival advantage in mammalian host in association with diarrhea. The genome contains CDSs for incorporating sulfur and nitrogen by sulfate assimilation, ammonia assimilation and nitrate reduction. The existence of both glutamate dehydrogenase and glutamine synthetase/glutamate synthase pathways suggests an importance of ammonia metabolism in the living environments that it may encounter. Conclusions The L. hongkongensis genome possesses a variety of genes and pathways for carbohydrate, amino acid and lipid metabolism, respiratory chain and sulfur and nitrogen metabolism. These allow the bacterium to utilize various substrates for energy production and survive in different environmental niches. PMID:21711917

  14. [Genome-wide association study on complex diseases: study design and genetic markers].

    PubMed

    Yan, Wei-Li

    2008-04-01

    Genome-wide association study used to be a dream of geneticists years ago, but now it came true. Since the first paper reported the finding of genetic variation contributing to human age-related macular degeneration by genome-wide association study in 2005, a numbers of whole genome studies have been published. The present paper reviewed some common comments in whole genome association study on complex diseases, including achievements of genome-wide association studies on complex traits or diseases, principles of study design, selection of genetic marker in genome, and comparisons of different commercial products for whole genome association study. Finally a newly defined genetic variation, copy number variation, was briefly introduced. This paper also summarized the shortcomings of current genome-wide association studies and perspectives of its future. PMID:18424408

  15. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...

  16. Dating the age of admixture via wavelet transform analysis of genome-wide data.

    PubMed

    Pugach, Irina; Matveyev, Rostislav; Wollstein, Andreas; Kayser, Manfred; Stoneking, Mark

    2011-01-01

    We describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixed human populations. The wavelet transform method offers better resolution than existing methods for dating admixture, and can be applied to either SNP or sequence data from humans or other species. PMID:21352535

  17. Dating the age of admixture via wavelet transform analysis of genome-wide data

    PubMed Central

    2011-01-01

    We describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixed human populations. The wavelet transform method offers better resolution than existing methods for dating admixture, and can be applied to either SNP or sequence data from humans or other species. PMID:21352535

  18. Genome-wide Association Study and Meta-Analysis Identify ISL1 as Genome-wide Significant Susceptibility Gene for Bladder Exstrophy

    PubMed Central

    Draaken, Markus; Knapp, Michael; Pennimpede, Tracie; Schmidt, Johanna M.; Ebert, Anne-Karolin; Rösch, Wolfgang; Stein, Raimund; Utsch, Boris; Hirsch, Karin; Boemers, Thomas M.; Mangold, Elisabeth; Heilmann, Stefanie; Ludwig, Kerstin U.; Jenetzky, Ekkehart; Zwink, Nadine; Moebus, Susanne; Herrmann, Bernhard G.; Mattheisen, Manuel; Nöthen, Markus M.

    2015-01-01

    The bladder exstrophy-epispadias complex (BEEC) represents the severe end of the uro-rectal malformation spectrum, and is thought to result from aberrant embryonic morphogenesis of the cloacal membrane and the urorectal septum. The most common form of BEEC is isolated classic bladder exstrophy (CBE). To identify susceptibility loci for CBE, we performed a genome-wide association study (GWAS) of 110 CBE patients and 1,177 controls of European origin. Here, an association was found with a region of approximately 220kb on chromosome 5q11.1. This region harbors the ISL1 (ISL LIM homeobox 1) gene. Multiple markers in this region showed evidence for association with CBE, including 84 markers with genome-wide significance. We then performed a meta-analysis using data from a previous GWAS by our group of 98 CBE patients and 526 controls of European origin. This meta-analysis also implicated the 5q11.1 locus in CBE risk. A total of 138 markers at this locus reached genome-wide significance in the meta-analysis, and the most significant marker (rs9291768) achieved a P value of 2.13 × 10−12. No other locus in the meta-analysis achieved genome-wide significance. We then performed murine expression analyses to follow up this finding. Here, Isl1 expression was detected in the genital region within the critical time frame for human CBE development. Genital regions with Isl1 expression included the peri-cloacal mesenchyme and the urorectal septum. The present study identified the first genome-wide significant locus for CBE at chromosomal region 5q11.1, and provides strong evidence for the hypothesis that ISL1 is the responsible candidate gene in this region. PMID:25763902

  19. Genome-wide expression analysis reveals 100 adrenal gland-dependent circadian genes in the mouse liver.

    PubMed

    Oishi, Katsutaka; Amagai, Noriko; Shirai, Hidenori; Kadota, Koji; Ohkura, Naoki; Ishida, Norio

    2005-01-01

    Recent progress in genome-wide expression analysis has identified hundreds of circadian genes not only in the suprachiasmatic nucleus (the mammalian master clock) but also in peripheral tissues, such as heart, liver and kidney of mammals. Glucocorticoid is thought to be a circadian time cue for mammalian peripheral clocks. To identify the genes of which the circadian expression is regulated by endogenous glucocorticoids, we performed DNA microarray analysis using hepatic RNA from adrenalectomized (ADX) and sham-operated mice. We identified 169 genes that fluctuated between day and night in the livers of the sham-operated mice. Among these, 100 lost circadian rhythmicity in ADX mice. These included the genes for key enzymes of liver metabolic functions, such as glucokinase, HMG-CoA reductase and glucose-6-phosphatase. The circadian expression of Lpin1, FKBP51 and S-adenosyl methionine decarboxylase was also abolished in the ADX mice. On the other hand, although the circadian expression of clock or clock-related genes, such as mPer2, DBP, E4BP4, mDec1, Usp2 and Wee1 remained almost totally intact in the liver of ADX mice, it was extremely damped in homozygous Clock mutant mice. The present findings suggested that one type of hepatic circadian genes in mice is transcriptionally regulated by core components of the circadian clock, such as CLOCK and BMAL1, and that the other depends on the adrenal gland. PMID:16303750

  20. A genome-wide transcriptional analysis of producer and non-producer NS0 myeloma cell lines.

    PubMed

    Khoo, Soo Hean Gary; Falciani, Francesco; Al-Rubeai, Mohamed

    2007-06-01

    'Genome-wide' or 'global' gene expression profiling provides a powerful approach to the characterization of a cell's transcriptional state. Such technology has been used in animal cell culture to create genome-wide snapshots of transcriptional activity in response to environmental factors or cellular triggers under bioprocessing conditions. Furthermore, it allows us to have a fundamental understanding of genetic mechanisms involved in recombinant protein production. One such mechanism adversely affecting the growth of recombinant bacteria is the increased metabolic burden resulting from the maintenance of plasmid copy number and heterologous protein expression. There have also been some reports on the effect of metabolic burden in mammalian cell systems. In the present study, we have used a mouse array representing 6400 genes to assess the expression profile of a WT (wild-type) mouse plasmacytoma cell line, NS0 WT, and a GS (glutamine synthetase)-NS0 6A1-100 cell line expressing chimaeric monoclonal antibody. The producer cells did not exhibit a slower growth as the result of any metabolic burden, but showed differences in metabolic activity. Gene expression profiling revealed that the producer cell line was selected for a higher expression of chromosomal genes, genes for zinc-finger proteins as well as cell-cycle-related events. On the other hand, protein synthesis is greater and ribosomal genes were more expressed in the WT cells. A possible shift from expressing antigen presenting proteins to recombinant protein could also be seen. Hence, gene expression profiling suggests that the effect of the metabolic burden in slowing growth can be mostly negated in producer cell lines by careful clonal selection, where stable transfected cells are selected for both high productivity as well as high growth rates. PMID:17223793

  1. High Quality Genome-Wide Genotyping from Archived Dried Blood Spots without DNA Amplification

    PubMed Central

    St. Julien, Krystal R.; Jelliffe-Pawlowski, Laura L.; Shaw, Gary M.; Stevenson, David K.; O’Brodovich, Hugh M.; Krasnow, Mark A.

    2013-01-01

    Spots of blood are routinely collected from newborn babies onto filter paper called Guthrie cards and used to screen for metabolic and genetic disorders. The archived dried blood spots are an important and precious resource for genomic research. Whole genome amplification of dried blood spot DNA has been used to provide DNA for genome-wide SNP genotyping. Here we describe a 96 well format procedure to extract DNA from a portion of a dried blood spot that provides sufficient unamplified genomic DNA for genome-wide single nucleotide polymorphism (SNP) genotyping. We show that SNP genotyping of the unamplified DNA is more robust than genotyping amplified dried blood spot DNA, is comparable in cost, and can be done with thousands of samples. This procedure can be used for genome-wide association studies and other large-scale genomic analyses that require robust, high-accuracy genotyping of dried blood spot DNA. PMID:23737996

  2. What are genome-wide association studies telling us about B-cell tumor development?

    PubMed Central

    Sherborne, Amy L; Houlston, Richard S

    2010-01-01

    It has long been speculated that common genetic variation influences the development of B-cell malignancy, however until recently evidence for this assertion was lacking. The advent of genome-wide association studies (GWAS) has allowed the search for this class of susceptibility allele to be conducted on a genome-wide basis. Recent GWAS of chronic lymphocytic leukemia (CLL) and acute lymphoblastic leukemia (ALL) have identified novel disease genes for CLL and ALL and underscore the importance of polymorphic variation in B-cell development genes as determinants of leukemia risk. PMID:21307401

  3. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...

  4. Performance of single nucleotide polymorphisms versus haplotypes for genome-wide association analysis in barley

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association studies (GWAS) may benefit from using haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on the genetic architecture of traits, patter...

  5. Genome Wide Scan for Loci influencing Warner Bratzler Shear Force in Five Bos taurus Breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genetic tests for beef tenderness are currently limited to single nucleotide polymorphisms (SNPs) within µ-calpain (CAPN1) and calpastatin (CAST) and explain little of the phenotypic variation in Warner-Bratzler shear force (WBSF). We performed a genome-wide association study for WBSF by genotyping...

  6. Mixed linear model approach adapted for genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called ‘compressed MLM,’ that decrea...

  7. Genetic Variation Between Two Cucumber Genotypes Inferred from Genome-wide Microsatellite Polymorphism Analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Variability at microsatellite loci has been used widely to infer the extent of genetic diversity among related plant taxa. However, typically, only the most polymorphic loci in the genome were analyzed that may result in a biased, and generally overestimated picture of genome-wide microsatellite div...

  8. A genome-wide association study platform built on iPlant cyber-infrastructure

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We demonstrated a flexible Genome-Wide Association (GWA) Study (GWAS) platform built upon the iPlant Collaborative Cyber-infrastructure. The platform supports big data management, sharing, and large scale study of both genotype and phenotype data on clusters. End users can add their own analysis too...

  9. Methods for meta-analysis of genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...

  10. Genome-wide linkage scan of quantitative traits representing symptom dimensions in multiplex schizophrenia families.

    PubMed

    Ryu, Seunghyong; Won, Hong-Hee; Oh, Sohee; Kim, Jong-Won; Park, Taesung; Cho, Eun-Young; Cho, Youngah; Park, Dong Yeon; Lee, Yu-Sang; Kwon, Jun Soo; Hong, Kyung Sue

    2013-12-30

    Symptom dimensions of schizophrenia are likely to be the intermediate phenotypes under the control of disease-susceptibility genes, or separate traits related to disease-modifier genes. This study aimed to identify chromosomal loci linked to symptom dimensions of schizophrenia through genome-wide quantitative trait locus (QTL) linkage analysis. The study subjects consisted of 56 families with 183 members including 123 affected individuals. Symptom evaluations were performed on lifetime basis. Through principal component factor analysis, eight quantitative phenotypes representing symptom dimensions were identified. Genotyping was done for 6008 SNP markers, and genome-wide QTL linkage analysis was performed. No symptom dimension showed a significant linkage attaining genome-wide empirical thresholds. We observed seven regions yielding linkage signals attaining genome-wide empirical thresholds for suggestive linkage (NPL Z score = 2.78-3.49); chromosome 15q26.1 for 'non-paranoid delusion factor', 2p24.3 and 7q31.1 for 'prodromal impairment factor', 1q32.1, 9p21.3, and 9q31.2 for 'negative symptom factor', and 10p13 for 'disorganization factor'. Among these loci, chromosome 2p24.3 and 1q32.1 overlap with susceptibility loci of schizophrenia identified in our previous linkage studies. This study suggests the existence of genetic loci related to various clinical features of schizophrenia. Further genetic analyses for these dimensional phenotypes are warranted. PMID:24035701

  11. Genome-wide computational prediction and analysis of core promoter elements across plant monocots and dicots

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Transcription initiation, essential to gene expression regulation, involves recruitment of basal transcription factors to the core promoter elements (CPEs). The distribution of currently known CPEs across plant genomes is largely unknown. This is the first large scale genome-wide report on the compu...

  12. Linkage Disequilibrium And Genome-Wide Association Studies In O. sativa

    Technology Transfer Automated Retrieval System (TEKTRAN)

    There is increasing evidence that genome-wide association studies provide a powerful approach to find the genetic basis of complex phenotypic variation in all kinds of species. For this purpose, we developed the first generation 44K Affymetrix SNP array in rice (see Tung et al. poster). We genotyped...

  13. Combined analysis of exon splicing and genome wide polymorphism data predict schizophrenia risk loci.

    PubMed

    Oldmeadow, Christopher; Mossman, David; Evans, Tiffany-Jane; Holliday, Elizabeth G; Tooney, Paul A; Cairns, Murray J; Wu, Jingqin; Carr, Vaughan; Attia, John R; Scott, Rodney J

    2014-05-01

    Schizophrenia has a strong genetic basis, and genome-wide association studies (GWAS) have shown that effect sizes for individual genetic variants which increase disease risk are small, making detection and validation of true disease-associated risk variants extremely challenging. Specifically, we first identify genes with exons showing differential expression between cases and controls, indicating a splicing mechanism that may contribute to variation in disease risk and focus on those showing consistent differential expression between blood and brain tissue. We then perform a genome-wide screen for SNPs associated with both normalised exon intensity of these genes (so called splicing QTLs) as well as association with schizophrenia. We identified a number of significantly associated loci with a biologically plausible role in schizophrenia, including MCPH1, DLG3, ZC3H13, and BICD2, and additional loci that influence splicing of these genes, including YWHAH. Our approach of integrating genome-wide exon intensity with genome-wide polymorphism data has identified a number of plausible SZ associated loci. PMID:24507884

  14. Genome-wide association as a means to understanding the mammary gland

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Next-generation sequencing and related technologies have facilitated the creation of enormous public databases that catalogue genomic variation. These databases have facilitated a variety of approaches to discover new genes that regulate normal biology as well as disease. Genome wide association (...

  15. A genome-wide SNP panel for genetic diversity, mapping and breeding studies in rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...

  16. Genome-wide association study of agronomic traits in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) using a global Andean diversity panel (ADP) of 237 genotypes of common bean, Phaseolus vulgaris was conducted to gain insight into the genetic architecture of several agronomic traits controlling phenology, biomass, yield components and seed yield. The panel wa...

  17. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    ERIC Educational Resources Information Center

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  18. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  19. Genome-wide association study of swine farrowing traits. Part II: Bayesian analysis of marker data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Reproductive efficiency has a great impact on the economic success of pork production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome wide association study (GWAS) w...

  20. Implementing meta-analysis from genome-wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality plays an important role in the meat processing industry, thus different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. Howe...

  1. Genome-wide association analysis of symbiotic nitrogen fixation in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for symbiotic nitrogen fixation (SNF) and related traits in the Andean diversity panel (ADP) comprised of 259 common bean (Phaseolus vulgaris) genotypes. The ADP was evaluated for SNF and related traits in...

  2. Sparse principal component analysis for identifying ancestry-informative markers in genome-wide association studies.

    PubMed

    Lee, Seokho; Epstein, Michael P; Duncan, Richard; Lin, Xihong

    2012-05-01

    Genome-wide association studies (GWAS) routinely apply principal component analysis (PCA) to infer population structure within a sample to correct for confounding due to ancestry. GWAS implementation of PCA uses tens of thousands of single-nucleotide polymorphisms (SNPs) to infer structure, despite the fact that only a small fraction of such SNPs provides useful information on ancestry. The identification of this reduced set of ancestry-informative markers (AIMs) from a GWAS has practical value; for example, researchers can genotype the AIM set to correct for potential confounding due to ancestry in follow-up studies that utilize custom SNP or sequencing technology. We propose a novel technique to identify AIMs from genome-wide SNP data using sparse PCA. The procedure uses penalized regression methods to identify those SNPs in a genome-wide panel that significantly contribute to the principal components while encouraging SNPs that provide negligible loadings to vanish from the analysis. We found that sparse PCA leads to negligible loss of ancestry information compared to traditional PCA analysis of genome-wide SNP data. We further demonstrate the value of sparse PCA for AIM selection using real data from the International HapMap Project and a genomewide study of inflammatory bowel disease. We have implemented our approach in open-source R software for public use. PMID:22508067

  3. CNV-based genome wide association study reveals additional variants contributing to meat quality in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....

  4. A genome-wide regulatory framework identifies maize Pericarp Color1 (P1) controlled genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    P1 encodes an R2R3-MYB transcription factor responsible for the accumulation of insecticidal flavones in maize silks and red phlobaphene pigments in pericarps and other floral tissues. Using genome-wide expression analyses (RNA-Seq) in pericarps and silks of plants with contrasting P1 alleles combin...

  5. Genome-wide association of meat quality traits and tenderness in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality has a large impact on consumer preference and perception of eating quality. A genome-wide association was performed for pork quality traits [intramuscular fat (IMF)], slice shear force (SSF), color attributes, purge, cooking loss, and pH] from 531 to 1,237 records on barrows and gilts o...

  6. A genome-wide association study for blood lipid phenotypes in the Framingham Heart Study

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Abstract Background: Blood lipid levels including low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), and triglycerides (TG) are highly heritable. Genome-wide association is a promising approach to map genetic loci related to these heritable phenotypes. Metho...

  7. Application of genome-wide expression analysis to human health and disease

    PubMed Central

    Cobb, J. Perren; Mindrinos, Michael N.; Miller-Graziano, Carol; Calvano, Steve E.; Baker, Henry V.; Xiao, Wenzhong; Laudanski, Krzysztof; Brownstein, Bernard H.; Elson, Constance M.; Hayden, Douglas L.; Herndon, David N.; Lowry, Stephen F.; Maier, Ronald V.; Schoenfeld, David A.; Moldawer, Lyle L.; Davis, Ronald W.; Tompkins, Ronald G.

    2005-01-01

    The application of genome-wide expression analysis to a large-scale, multicentered program in critically ill patients poses a number of theoretical and technical challenges. We describe here an analytical and organizational approach to a systematic evaluation of the variance associated with genome-wide expression analysis specifically tailored to study human disease. We analyzed sources of variance in genome-wide expression analyses performed with commercial oligonucleotide arrays. In addition, variance in gene expression in human blood leukocytes caused by repeated sampling in the same subject, among different healthy subjects, among different leukocyte subpopulations, and the effect of traumatic injury, were also explored. We report that analytical variance caused by sample processing was acceptably small. Blood leukocyte gene expression in the same individual over a 24-h period was remarkably constant. In contrast, genome-wide expression varied significantly among different subjects and leukocyte subpopulations. Expectedly, traumatic injury induced dramatic changes in apparent gene expression that were greater in magnitude than the analytical noise and interindividual variance. We demonstrate that the development of a nation-wide program for gene expression analysis with careful attention to analytical details can reduce the variance in the clinical setting to a level where patterns of gene expression are informative among different healthy human subjects, and can be studied with confidence in human disease. PMID:15781863

  8. Meta-analysis of genome-wide association from genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  9. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    ERIC Educational Resources Information Center

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose: Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a "genome-wide association" approach. Method: The…

  10. Genome-wide significant predictors of metabolites in the one-carbon metabolism pathway

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Low plasma B-vitamin levels and elevated homocysteine have been associated with cancer, cardiovascular disease, and neurodegenerative disorders. Common variants in FUT2 on chromosome 19q13 were associated with plasma vitamin B12 levels among women in a genome-wide association study (GWAS) in the Nur...

  11. Genome-wide scan revealed genetic loci for energy metabolism in Hispanic children and adolescents

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide scans were conducted in a search for genetic locations linked to energy expenditure and substrate oxidation in children. Pedigreed data of 1030 Hispanic children and adolescents were from the Viva La Familia Study, which was designed to investigate genetic and environmental risk factors ...

  12. Software engineering the mixed model for genome-wide association studies on large samples

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample siz...

  13. Genome-wide identification and characterization of simple sequence repeat loci in grape phylloxera, Daktulosphaira vitifoliae

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide sequence search was conducted to identify simple sequence repeat (SSR) loci in phylloxera, Daktulosphaira vitifoliae (Fitch), a key grape pest throughout the world. Collectively, 1,524 SSR loci containing mono, di-, tri-, tetra-, penta- and hexanucleotide motifs were identified. Among...

  14. Genome-wide association study of maize identifies genes affecting leaf architecture

    Technology Transfer Automated Retrieval System (TEKTRAN)

    U.S. maize yield has increased eightfold in the past 80 years with half of the improvement attributed to genetics. Changes in maize leaf angle and size provided a basis for more efficient light capture as plant densities increased. Through a genome wide association study (GWAS) of the maize nested a...

  15. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population

    PubMed Central

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  16. Genome-wide meta-analysis of cerebral white matter hyperintensities in patients with stroke

    PubMed Central

    Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia

    2016-01-01

    Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10−6) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10−8; rs941898 [EVL], p = 4.0 × 10−8; rs962888 [C1QL1], p = 1.1 × 10−8; rs9515201 [COL4A2], p = 6.9 × 10−9). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333

  17. Genome-wide distribution of genetic diversity and linkage disequilibrium in elite sugar beet germplasm

    PubMed Central

    2011-01-01

    Background Characterization of population structure and genetic diversity of germplasm is essential for the efficient organization and utilization of breeding material. The objectives of this study were to (i) explore the patterns of population structure in the pollen parent heterotic pool using different methods, (ii) investigate the genome-wide distribution of genetic diversity, and (iii) assess the extent and genome-wide distribution of linkage disequilibrium (LD) in elite sugar beet germplasm. Results A total of 264 and 238 inbred lines from the yield type and sugar type inbreds of the pollen parent heterotic gene pools, respectively, which had been genotyped with 328 SNP markers, were used in this study. Two distinct subgroups were detected based on different statistical methods within the elite sugar beet germplasm set, which was in accordance with its breeding history. MCLUST based on principal components, principal coordinates, or lapvectors had high correspondence with the germplasm type information as well as the assignment by STRUCTURE, which indicated that these methods might be alternatives to STRUCTURE for population structure analysis. Gene diversity and modified Roger's distance between the examined germplasm types varied considerably across the genome, which might be due to artificial selection. This observation indicates that population genetic approaches could be used to identify candidate genes for the traits under selection. Due to the fact that r2 >0.8 is required to detect marker-phenotype association explaining less than 1% of the phenotypic variance, our observation of a low proportion of SNP loci pairs showing such levels of LD suggests that the number of markers has to be dramatically increased for powerful genome-wide association mapping. Conclusions We provided a genome-wide distribution map of genetic diversity and linkage disequilibrium for the elite sugar beet germplasm, which is useful for the application of genome-wide association

  18. A genome-wide approach to children's aggressive behavior: The EAGLE consortium.

    PubMed

    Pappa, Irene; St Pourcain, Beate; Benke, Kelly; Cavadino, Alana; Hakulinen, Christian; Nivard, Michel G; Nolte, Ilja M; Tiesler, Carla M T; Bakermans-Kranenburg, Marian J; Davies, Gareth E; Evans, David M; Geoffroy, Marie-Claude; Grallert, Harald; Groen-Blokhuis, Maria M; Hudziak, James J; Kemp, John P; Keltikangas-Järvinen, Liisa; McMahon, George; Mileva-Seitz, Viara R; Motazedi, Ehsan; Power, Christine; Raitakari, Olli T; Ring, Susan M; Rivadeneira, Fernando; Rodriguez, Alina; Scheet, Paul A; Seppälä, Ilkka; Snieder, Harold; Standl, Marie; Thiering, Elisabeth; Timpson, Nicholas J; Veenstra, René; Velders, Fleur P; Whitehouse, Andrew J O; Smith, George Davey; Heinrich, Joachim; Hypponen, Elina; Lehtimäki, Terho; Middeldorp, Christel M; Oldehinkel, Albertine J; Pennell, Craig E; Boomsma, Dorret I; Tiemeier, Henning

    2016-07-01

    Individual differences in aggressive behavior emerge in early childhood and predict persisting behavioral problems and disorders. Studies of antisocial and severe aggression in adulthood indicate substantial underlying biology. However, little attention has been given to genome-wide approaches of aggressive behavior in children. We analyzed data from nine population-based studies and assessed aggressive behavior using well-validated parent-reported questionnaires. This is the largest sample exploring children's aggressive behavior to date (N = 18,988), with measures in two developmental stages (N = 15,668 early childhood and N = 16,311 middle childhood/early adolescence). First, we estimated the additive genetic variance of children's aggressive behavior based on genome-wide SNP information, using genome-wide complex trait analysis (GCTA). Second, genetic associations within each study were assessed using a quasi-Poisson regression approach, capturing the highly right-skewed distribution of aggressive behavior. Third, we performed meta-analyses of genome-wide associations for both the total age-mixed sample and the two developmental stages. Finally, we performed a gene-based test using the summary statistics of the total sample. GCTA quantified variance tagged by common SNPs (10-54%). The meta-analysis of the total sample identified one region in chromosome 2 (2p12) at near genome-wide significance (top SNP rs11126630, P = 5.30 × 10(-8) ). The separate meta-analyses of the two developmental stages revealed suggestive evidence of association at the same locus. The gene-based analysis indicated association of variation within AVPR1A with aggressive behavior. We conclude that common variants at 2p12 show suggestive evidence for association with childhood aggression. Replication of these initial findings is needed, and further studies should clarify its biological meaning. © 2015 Wiley Periodicals, Inc. PMID:26087016

  19. ReplicationDomain: a visualization tool and comparative database for genome-wide replication timing data

    PubMed Central

    Weddington, Nodin; Stuy, Alexander; Hiratani, Ichiro; Ryba, Tyrone; Yokochi, Tomoki; Gilbert, David M

    2008-01-01

    Background Eukaryotic DNA replication is regulated at the level of large chromosomal domains (0.5–5 megabases in mammals) within which replicons are activated relatively synchronously. These domains replicate in a specific temporal order during S-phase and our genome-wide analyses of replication timing have demonstrated that this temporal order of domain replication is a stable property of specific cell types. Results We have developed ReplicationDomain as a web-based database for analysis of genome-wide replication timing maps (replication profiles) from various cell lines and species. This database also provides comparative information of transcriptional expression and is configured to display any genome-wide property (for instance, ChIP-Chip or ChIP-Seq data) via an interactive web interface. Our published microarray data sets are publicly available. Users may graphically display these data sets for a selected genomic region and download the data displayed as text files, or alternatively, download complete genome-wide data sets. Furthermore, we have implemented a user registration system that allows registered users to upload their own data sets. Upon uploading, registered users may choose to: (1) view their data sets privately without sharing; (2) share with other registered users; or (3) make their published or "in press" data sets publicly available, which can fulfill journal and funding agencies' requirements for data sharing. Conclusion ReplicationDomain is a novel and powerful tool to facilitate the comparative visualization of replication timing in various cell types as well as other genome-wide chromatin features and is considerably faster and more convenient than existing browsers when viewing multi-megabase segments of chromosomes. Furthermore, the data upload function with the option of private viewing or sharing of data sets between registered users should be a valuable resource for the scientific community. PMID:19077204

  20. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population.

    PubMed

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene-environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of the genome-wide significance, a marginal significant association of interaction between SRRS and rs10510057 with depression were found (p = 4.5 × 10-8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can contribute to find susceptibility locus that are potentially missed by conventional GWAS. PMID:27529621

  1. GENOME-WIDE IDENTIFICATION OF LONG NON-CODING RNAS IN RAT MODELS OF CARDIOVASCULAR AND RENAL DISEASE

    PubMed Central

    Gopalakrishnan, Kathirvel; Kumarasamy, Sivarajan; Mell, Blair; Joe, Bina

    2015-01-01

    Long noncoding RNAs (lncRNAs) are an emerging class of genomic regulatory molecules reported in various species. In the rat, which is one of the major mammalian model organisms, discovery of lncRNAs on a genome-wide scale is lagging. Renal LncRNA sequencing and lncRNA transcriptome analysis was conducted in three rat strains that are widely used in cardiovascular and renal research, the Dahl salt-sensitive (S) rat, the Spontaneously Hypertensive Rat (SHR) and the Dahl salt-resistant (R) rat. Through the RNA sequencing approach, 3,273 transcripts were identified as rat lncRNAs. A majority of lncRNAs were without predicted target genes. Differential expression of 273 and 749 lncRNAs was detected between S versus R and S versus SHR comparisons respectively. To couple the observed differential expression of lncRNAs with the status of mRNAs, an mRNA transcriptome analysis was conducted. Several cis mRNA genes were co-regulated with lncRNAs. Of these, the protein expression status of four target genes, Asb3, Chac2, Pex11b and Sp5, were differentially expressed between the relevant strain comparisons thereby suggesting that the differentially expressed lncRNAs associated with these genes are candidate genetic determinants of blood pressure. This study serves as a first-generation catalog of rat lncRNAs and illustrates the prioritization of lncRNAs as positional candidates for complex polygenic traits. PMID:25385761

  2. Genome-wide association of multiple complex traits in outbred mice by ultra-low-coverage sequencing.

    PubMed

    Nicod, Jérôme; Davies, Robert W; Cai, Na; Hassett, Carl; Goodstadt, Leo; Cosgrove, Cormac; Yee, Benjamin K; Lionikaite, Vikte; McIntyre, Rebecca E; Remme, Carol Ann; Lodder, Elisabeth M; Gregory, Jennifer S; Hough, Tertius; Joynson, Russell; Phelps, Hayley; Nell, Barbara; Rowe, Clare; Wood, Joe; Walling, Alison; Bopp, Nasrin; Bhomra, Amarjit; Hernandez-Pliego, Polinka; Callebert, Jacques; Aspden, Richard M; Talbot, Nick P; Robbins, Peter A; Harrison, Mark; Fray, Martin; Launay, Jean-Marie; Pinto, Yigal M; Blizard, David A; Bezzina, Connie R; Adams, David J; Franken, Paul; Weaver, Tom; Wells, Sara; Brown, Steve D M; Potter, Paul K; Klenerman, Paul; Lionikas, Arimantas; Mott, Richard; Flint, Jonathan

    2016-08-01

    Two bottlenecks impeding the genetic analysis of complex traits in rodents are access to mapping populations able to deliver gene-level mapping resolution and the need for population-specific genotyping arrays and haplotype reference panels. Here we combine low-coverage (0.15×) sequencing with a new method to impute the ancestral haplotype space in 1,887 commercially available outbred mice. We mapped 156 unique quantitative trait loci for 92 phenotypes at a 5% false discovery rate. Gene-level mapping resolution was achieved at about one-fifth of the loci, implicating Unc13c and Pgc1a at loci for the quality of sleep, Adarb2 for home cage activity, Rtkn2 for intensity of reaction to startle, Bmp2 for wound healing, Il15 and Id2 for several T cell measures and Prkca for bone mineral content. These findings have implications for diverse areas of mammalian biology and demonstrate how genome-wide association studies can be extended via low-coverage sequencing to species with highly recombinant outbred populations. PMID:27376238

  3. Bimolecular fluorescence complementation (BiFC) analysis: advances and recent applications for genome-wide interaction studies

    PubMed Central

    Huh, Won-Ki; Park, Hay-Oak

    2015-01-01

    Complex protein networks are involved in nearly all cellular processes. To uncover these vast networks of protein interactions, various high-throughput screening technologies have been developed. Over the last decade, bimolecular fluorescence complementation (BiFC) assay has been widely used to detect protein-protein interactions (PPIs) in living cells. This technique is based on the reconstitution of a fluorescent protein in vivo. Easy quantification of the BiFC signals allows effective cell-based high-throughput screenings for protein-binding partners and drugs that modulate PPIs. Recently, with the development of large screening libraries, BiFC has been effectively applied for genome-wide PPI studies and has uncovered novel protein interactions, providing new insight into protein functions. In this review, we describe the development of reagents and methods used for BiFC-based screens in yeast, plants, and mammalian cells. We also discuss the advantages and drawbacks of these methods and highlight the application of BiFC in large-scale studies. PMID:25772494

  4. Genome-wide transcriptional analysis of the human cell cycle identifies genes differentially regulated in normal and cancer cells

    PubMed Central

    Bar-Joseph, Ziv; Siegfried, Zahava; Brandeis, Michael; Brors, Benedikt; Lu, Yong; Eils, Roland; Dynlacht, Brian D.; Simon, Itamar

    2008-01-01

    Characterization of the transcriptional regulatory network of the normal cell cycle is essential for understanding the perturbations that lead to cancer. However, the complete set of cycling genes in primary cells has not yet been identified. Here, we report the results of genome-wide expression profiling experiments on synchronized primary human foreskin fibroblasts across the cell cycle. Using a combined experimental and computational approach to deconvolve measured expression values into “single-cell” expression profiles, we were able to overcome the limitations inherent in synchronizing nontransformed mammalian cells. This allowed us to identify 480 periodically expressed genes in primary human foreskin fibroblasts. Analysis of the reconstructed primary cell profiles and comparison with published expression datasets from synchronized transformed cells reveals a large number of genes that cycle exclusively in primary cells. This conclusion was supported by both bioinformatic analysis and experiments performed on other cell types. We suggest that this approach will help pinpoint genetic elements contributing to normal cell growth and cellular transformation. PMID:18195366

  5. Genome-wide association study identifies 14 novel risk alleles associated with basal cell carcinoma

    PubMed Central

    Chahal, Harvind S.; Wu, Wenting; Ransohoff, Katherine J.; Yang, Lingyao; Hedlin, Haley; Desai, Manisha; Lin, Yuan; Dai, Hong-Ji; Qureshi, Abrar A.; Li, Wen-Qing; Kraft, Peter; Hinds, David A.; Tang, Jean Y.; Han, Jiali; Sarin, Kavita Y.

    2016-01-01

    Basal cell carcinoma (BCC) is the most common cancer worldwide with an annual incidence of 2.8 million cases in the United States alone. Previous studies have demonstrated an association between 21 distinct genetic loci and BCC risk. Here, we report the results of a two-stage genome-wide association study of BCC, totalling 17,187 cases and 287,054 controls. We confirm 17 previously reported loci and identify 14 new susceptibility loci reaching genome-wide significance (P<5 × 10−8, logistic regression). These newly associated SNPs lie within predicted keratinocyte regulatory elements and in expression quantitative trait loci; furthermore, we identify candidate genes and non-coding RNAs involved in telomere maintenance, immune regulation and tumour progression, providing deeper insight into the pathogenesis of BCC. PMID:27539887

  6. Genome-wide association study for semen quality traits in German Warmblood stallions.

    PubMed

    Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

    2016-08-01

    We performed a genome-wide association study for semen quality traits in 139 German Warmblood stallions. Stallions were genotyped using the Illumina equine SNP50 Beadchip. Traits analysed were de-regressed estimated breeding values (EBVs) for gel-free volume, sperm concentration, total number of sperm, progressive motility and the total number of progressively motile sperm. The GWAS revealed 29 SNPs on 12 different chromosomes as genome-wide significantly associated with semen quality traits. For ten genomic regions we could retrieve candidate genes influencing stallion fertility. Among the candidate genes, we could find the genes encoding cysteine-rich secretory proteins (CRISP1, CRISP2 and CRISP3). This was the first GWAS in horses performed for semen quality traits. PMID:27334685

  7. Partitioning heritability by functional annotation using genome-wide association summary statistics

    PubMed Central

    Finucane, Hilary K.; Bulik-Sullivan, Brendan; Gusev, Alexander; Trynka, Gosia; Reshef, Yakir; Loh, Po-Ru; Anttila, Verneri; Xu, Han; Zang, Chongzhi; Farh, Kyle; Ripke, Stephan; Day, Felix R.; Consortium, ReproGen; Purcell, Shaun; Stahl, Eli; Lindstrom, Sara; Perry, John R. B.; Okada, Yukinori; Raychaudhuri, Soumya; Daly, Mark; Patterson, Nick; Neale, Benjamin M.; Price, Alkes L.

    2015-01-01

    Recent work has demonstrated that some functional categories of the genome contribute disproportionately to the heritability of complex diseases. Here, we analyze a broad set of functional elements, including cell-type-specific elements, to estimate their polygenic contributions to heritability in genome-wide association studies (GWAS) of 17 complex diseases and traits with an average sample size of 73,599. To enable this analysis, we introduce a new method, stratified LD score regression, for partitioning heritability from GWAS summary statistics while accounting for linked markers. This new method is computationally tractable at very large sample sizes, and leverages genome-wide information. Our results include a large enrichment of heritability in conserved regions across many traits; a very large immunological disease-specific enrichment of heritability in FANTOM5 enhancers; and many cell-type-specific enrichments including significant enrichment of central nervous system cell types in body mass index, age at menarche, educational attainment, and smoking behavior. PMID:26414678

  8. Genome-wide association study identifies 14 novel risk alleles associated with basal cell carcinoma.

    PubMed

    Chahal, Harvind S; Wu, Wenting; Ransohoff, Katherine J; Yang, Lingyao; Hedlin, Haley; Desai, Manisha; Lin, Yuan; Dai, Hong-Ji; Qureshi, Abrar A; Li, Wen-Qing; Kraft, Peter; Hinds, David A; Tang, Jean Y; Han, Jiali; Sarin, Kavita Y

    2016-01-01

    Basal cell carcinoma (BCC) is the most common cancer worldwide with an annual incidence of 2.8 million cases in the United States alone. Previous studies have demonstrated an association between 21 distinct genetic loci and BCC risk. Here, we report the results of a two-stage genome-wide association study of BCC, totalling 17,187 cases and 287,054 controls. We confirm 17 previously reported loci and identify 14 new susceptibility loci reaching genome-wide significance (P<5 × 10(-8), logistic regression). These newly associated SNPs lie within predicted keratinocyte regulatory elements and in expression quantitative trait loci; furthermore, we identify candidate genes and non-coding RNAs involved in telomere maintenance, immune regulation and tumour progression, providing deeper insight into the pathogenesis of BCC. PMID:27539887

  9. AUTOGSCAN: powerful tools for automated genome-wide linkage and linkage disequilibrium analysis.

    PubMed

    Hiekkalinna, Tero; Terwilliger, Joseph D; Sammalisto, Sampo; Peltonen, Leena; Perola, Markus

    2005-02-01

    Genome-wide linkage analysis using multiple traits and statistical software packages is a tedious process which requires a significant amount of manual file manipulation. Different linkage analysis programs require different input file formats, making the task of analyzing data with multiple methods even more time-consuming. We have developed a software tool, AUTOGSCAN, that automates file formatting, the running of statistical analyses, and the summarizing of resulting statistics for whole genome scans with a push of a button, using several independent, and often idiosyncratic, statistical software packages such as MERLIN, SOLAR and GENEHUNTER. We also describe a program, ANALYZE, designed to run qualitative linkage analysis with several different statistical strategies and programs to efficiently screen for linkage and linkage disequilibrium for a given discrete trait. The ANALYZE program can also be used by AUTOGSCAN in a genome-wide sense. PMID:15836805

  10. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.

    PubMed

    Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dȩbniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M

    2015-09-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology. PMID:26237428

  11. Singular value decomposition for genome-wide expression data processing and modeling

    PubMed Central

    Alter, Orly; Brown, Patrick O.; Botstein, David

    2000-01-01

    We describe the use of singular value decomposition in transforming genome-wide expression data from genes × arrays space to reduced diagonalized “eigengenes” × “eigenarrays” space, where the eigengenes (or eigenarrays) are unique orthonormal superpositions of the genes (or arrays). Normalizing the data by filtering out the eigengenes (and eigenarrays) that are inferred to represent noise or experimental artifacts enables meaningful comparison of the expression of different genes across different arrays in different experiments. Sorting the data according to the eigengenes and eigenarrays gives a global picture of the dynamics of gene expression, in which individual genes and arrays appear to be classified into groups of similar regulation and function, or similar cellular state and biological phenotype, respectively. After normalization and sorting, the significant eigengenes and eigenarrays can be associated with observed genome-wide effects of regulators, or with measured samples, in which these regulators are overactive or underactive, respectively. PMID:10963673

  12. Quality control and conduct of genome-wide association meta-analyses

    PubMed Central

    Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth JF

    2014-01-01

    Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for [1] organizational aspects of GWAMAs, and for [2] QC at the study file level, the meta-level across studies, and the meta-analysis output level. Real–world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for use of a powerful and flexible software package called EasyQC. For consortia of comparable size to the GIANT consortium, the present protocol takes a minimum of about 10 months to complete. PMID:24762786

  13. Genome-Wide Studies of Type 2 Diabetes and Lipid Traits in Hispanics.

    PubMed

    Below, Jennifer E; Parra, Esteban J

    2016-05-01

    Although disproportionately affected by increasing rates of type 2 diabetes and dyslipidemias, Hispanic populations are underrepresented in efforts to understand genetic susceptibility to these disorders. Where research has been undertaken, these populations have provided substantial insight into identification of novel risk-associated genes and have aided in the ability to fine map previously described risk loci. Genome-wide analyses in Hispanic and trans-ethnic populations have resulted in identification of more than 40 replicated or novel genes with significant effects for type 2 diabetes or lipid traits. Initial investigations into rare variant effects have identified new risk-associated variants private to Hispanic populations, and preliminary results suggest metagenomic approaches in Hispanic populations, such as characterizing the gut microbiome, will enable the development of new predictive tools and therapeutic targets for type 2 diabetes. Future genome-wide studies in expanded cohorts of Hispanics are likely to result in new insights into the genetic etiology of metabolic health. PMID:27007718

  14. Exploring genome-wide datasets of MHC class II antigen presentation.

    PubMed

    Wijdeven, Ruud H; Bakker, Jeroen M; Paul, Petra; Neefjes, Jacques

    2013-09-01

    MHC class II molecules (MHCII) are critical for presenting antigens to CD4(+) T-cells. They control ignition of CD4(+) T cells and are as such involved in most auto-immune diseases. To define proteins and pathways controlling MHCII antigen presentation and expression, we performed a genome-wide flow cytometry based RNAi screen. Hits were subsequently classified by two screens that monitored the intracellular distribution and transcription of MHCII. This multi-dimensional approach allowed subclassification of hits into functional groups as a first step to defining new pathways controlling MHCII antigen presentation. The datasets from this screen are used as a template for several follow-up studies. This overview focuses on how data from genome-wide screens can be used for target-lead finding, data mining, systems biology and systematic cell biology. PMID:23137594

  15. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  16. The HapMap and Genome-Wide Association Studies in Diagnosis and Therapy*

    PubMed Central

    Manolio, Teri A.; Collins, Francis S.

    2009-01-01

    The International HapMap Project produced a genome-wide database of human genetic variation for use in genetic association studies of common diseases. The initial output of these studies has been overwhelming, with over 150 risk loci identified in studies of more than 60 common diseases and traits. These associations have suggested previously unsuspected etiologic pathways for common diseases that will be of use in identifying new therapeutic targets and developing targeted interventions based on genetically defined risk. Here we examine the development and application of the HapMap to genome-wide association (GWA) studies; present and future technologies for GWA research; current major efforts in GWA studies; successes and limitations of the GWA approach in identifying polymorphisms related to complex diseases; data release and privacy polices; use of these findings by clinicians, the public, and academic physicians; and sources of ongoing authoritative information on this rapidly evolving field. PMID:19630580

  17. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast

    PubMed Central

    Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-01-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095

  18. Genome-wide association analysis identifies six new loci associated with forced vital capacity

    PubMed Central

    Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viňuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.

    2014-01-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10−8) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828

  19. High-Throughput, Liquid-Based Genome-Wide RNAi Screening in C. elegans.

    PubMed

    O'Reilly, Linda P; Knoerdel, Ryan R; Silverman, Gary A; Pak, Stephen C

    2016-01-01

    RNA interference (RNAi) is a process in which double-stranded RNA (dsRNA) molecules mediate the inhibition of gene expression. RNAi in C. elegans can be achieved by simply feeding animals with bacteria expressing dsRNA against the gene of interest. This "feeding" method has made it possible to conduct genome-wide RNAi experiments for the systematic knockdown and subsequent investigation of almost every single gene in the genome. Historically, these genome-scale RNAi screens have been labor and time intensive. However, recent advances in automated, high-throughput methodologies have allowed the development of more rapid and efficient screening protocols. In this report, we describe a fast and efficient, liquid-based method for genome-wide RNAi screening. PMID:27581291

  20. Annotation of loci from genome-wide association studies using tissue-specific quantitative interaction proteomics.

    PubMed

    Lundby, Alicia; Rossin, Elizabeth J; Steffensen, Annette B; Acha, Moshe Rav; Newton-Cheh, Christopher; Pfeufer, Arne; Lynch, Stacey N; Olesen, Søren-Peter; Brunak, Søren; Ellinor, Patrick T; Jukema, J Wouter; Trompet, Stella; Ford, Ian; Macfarlane, Peter W; Krijthe, Bouwe P; Hofman, Albert; Uitterlinden, André G; Stricker, Bruno H; Nathoe, Hendrik M; Spiering, Wilko; Daly, Mark J; Asselbergs, Folkert W; van der Harst, Pim; Milan, David J; de Bakker, Paul I W; Lage, Kasper; Olsen, Jesper V

    2014-08-01

    Genome-wide association studies (GWAS) have identified thousands of loci associated with complex traits, but it is challenging to pinpoint causal genes in these loci and to exploit subtle association signals. We used tissue-specific quantitative interaction proteomics to map a network of five genes involved in the Mendelian disorder long QT syndrome (LQTS). We integrated the LQTS network with GWAS loci from the corresponding common complex trait, QT-interval variation, to identify candidate genes that were subsequently confirmed in Xenopus laevis oocytes and zebrafish. We used the LQTS protein network to filter weak GWAS signals by identifying single-nucleotide polymorphisms (SNPs) in proximity to genes in the network supported by strong proteomic evidence. Three SNPs passing this filter reached genome-wide significance after replication genotyping. Overall, we present a general strategy to propose candidates in GWAS loci for functional studies and to systematically filter subtle association signals using tissue-specific quantitative interaction proteomics. PMID:24952909

  1. Annotation of loci from genome-wide association studies using tissue-specific quantitative interaction proteomics

    PubMed Central

    Lundby, Alicia; Rossin, Elizabeth J.; Steffensen, Annette B.; Rav Acha, Moshe; Newton-Cheh, Christopher; Pfeufer, Arne; Lynch, Stacey N.; Olesen, Søren-Peter; Brunak, Søren; Ellinor, Patrick T.; Jukema, J.Wouter; Trompet, Stella; Ford, Ian; Macfarlane, Peter W.; Krijthe, Bouwe P.; Hofman, Albert; Uitterlinden, Andre G.; Stricker, Bruno H.; Nathoe, Hendrik M.; Spiering, Wilko; Daly, Mark J.; Asselbergs, Folkert W.; van der Harst, Pim; Milan, David J.; de Bakker, Paul I.W.; Lage, Kasper; Olsen, Jesper V.

    2014-01-01

    Genome-wide association studies (GWAS) have identified thousands of loci associated wtih complex traits, but it is challenging to pinpoint causal genes in these loci and to exploit subtle association signals. We used tissue-specific quantitative interaction proteomics to map a network of five genes involved in the Mendelian disorder long QT syndrome (LQTS). We integrated the LQTS network with GWAS loci from the corresponding common complex trait, QT interval variation, to identify candidate genes that were subsequently confirmed in Xenopus laevis oocytes and zebrafish. We used the LQTS protein network to filter weak GWAS signals by identifying single nucleotide polymorphisms (SNPs) in proximity to genes in the network supported by strong proteomic evidence. Three SNPs passing this filter reached genome-wide significance after replication genotyping. Overall, we present a general strategy to propose candidates in GWAS loci for functional studies and to systematically filter subtle association signals using tissue-specific quantitative interaction proteomics. PMID:24952909

  2. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma

    PubMed Central

    Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.

    2015-01-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5×10–8), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428

  3. Genetic Studies on Diabetic Microvascular Complications: Focusing on Genome-Wide Association Studies

    PubMed Central

    Kwak, Soo Heon

    2015-01-01

    Diabetes is a common metabolic disorder with a worldwide prevalence of 8.3% and is the leading cause of visual loss, end-stage renal disease and amputation. Recently, genome-wide association studies (GWASs) have identified genetic risk factors for diabetic microvascular complications of retinopathy, nephropathy, and neuropathy. We summarized the recent findings of GWASs on diabetic microvascular complications and highlighted the challenges and our opinion on future directives. Five GWASs were conducted on diabetic retinopathy, nine on nephropathy, and one on neuropathic pain. The majority of recent GWASs were underpowered and heterogeneous in terms of study design, inclusion criteria and phenotype definition. Therefore, few reached the genome-wide significance threshold and the findings were inconsistent across the studies. Recent GWASs provided novel information on genetic risk factors and the possible pathophysiology of diabetic microvascular complications. However, further collaborative efforts to standardize phenotype definition and increase sample size are necessary for successful genetic studies on diabetic microvascular complications. PMID:26194074

  4. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality.

    PubMed

    Kamvar, Zhian N; Brooks, Jonah C; Grünwald, Niklaus J

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  5. Genome-wide Mapping of Nucleosome Positioning and DNA Methylation Within Individual DNA Molecules

    PubMed Central

    Liu, Yaping; Lay, Fides D.; Liang, Gangning; Berman, Benjamin P.; Jones, Peter A.; Kelly, Terry

    2012-01-01

    DNA methylation and nucleosome positioning work together to generate chromatin structures that regulate gene expression. Nucleosomes are typically mapped using nuclease digestion requiring significant amounts of material and varying enzyme concentrations. We have developed a method that uses a GpC methyltransferase (M.CviPI) and next generation sequencing to footprint nucleosome positioning genome-wide using less than 1 million cells, which does not suffer from sequence based biases associated with MNase digestion and retains endogenous DNA methylation information. Using a novel bioinformatics pipeline we identify chromatin configurations associated with a variety of functional genomic loci including distinct promoter types, enhancers, insulators, X-inactivated and imprinted genes. Importantly, DNA methylation and nucleosome positioning information are obtained from the same DNA molecule, giving the first genome-wide DNA methylation and nucleosome positioning correlation at the single molecule level that can be used to monitor disease progression and response to therapy.

  6. Genome-wide analysis of microRNA and mRNA expression signatures in cancer

    PubMed Central

    Li, Ming-hui; Fu, Sheng-bo; Xiao, Hua-sheng

    2015-01-01

    Cancer is an extremely diverse and complex disease that results from various genetic and epigenetic changes such as DNA copy-number variations, mutations, and aberrant mRNA and/or protein expression caused by abnormal transcriptional regulation. The expression profiles of certain microRNAs (miRNAs) and messenger RNAs (mRNAs) are closely related to cancer progression stages. In the past few decades, DNA microarray and next-generation sequencing techniques have been widely applied to identify miRNA and mRNA signatures for cancers on a genome-wide scale and have provided meaningful insights into cancer diagnosis, prognosis and personalized medicine. In this review, we summarize the progress in genome-wide analysis of miRNAs and mRNAs as cancer biomarkers, highlighting their diagnostic and prognostic roles. PMID:26299954

  7. Genome-wide association analysis identifies six new loci associated with forced vital capacity.

    PubMed

    Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J

    2014-07-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10(-8)) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease. PMID:24929828

  8. A guide to genome-wide association analysis and post-analytic interrogation.

    PubMed

    Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S

    2015-12-10

    This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org. PMID:26343929

  9. Discovery and validation of sub-threshold genome-wide association study loci using epigenomic signatures.

    PubMed

    Wang, Xinchen; Tucker, Nathan R; Rizki, Gizem; Mills, Robert; Krijger, Peter Hl; de Wit, Elzo; Subramanian, Vidya; Bartell, Eric; Nguyen, Xinh-Xinh; Ye, Jiangchuan; Leyton-Mange, Jordan; Dolmatova, Elena V; van der Harst, Pim; de Laat, Wouter; Ellinor, Patrick T; Newton-Cheh, Christopher; Milan, David J; Kellis, Manolis; Boyer, Laurie A

    2016-01-01

    Genetic variants identified by genome-wide association studies explain only a modest proportion of heritability, suggesting that meaningful associations lie 'hidden' below current thresholds. Here, we integrate information from association studies with epigenomic maps to demonstrate that enhancers significantly overlap known loci associated with the cardiac QT interval and QRS duration. We apply functional criteria to identify loci associated with QT interval that do not meet genome-wide significance and are missed by existing studies. We demonstrate that these 'sub-threshold' signals represent novel loci, and that epigenomic maps are effective at discriminating true biological signals from noise. We experimentally validate the molecular, gene-regulatory, cellular and organismal phenotypes of these sub-threshold loci, demonstrating that most sub-threshold loci have regulatory consequences and that genetic perturbation of nearby genes causes cardiac phenotypes in mouse. Our work provides a general approach for improving the detection of novel loci associated with complex human traits. PMID:27162171

  10. A genome-wide association study on amyotrophic lateral sclerosis in the Taiwanese Han population.

    PubMed

    Chen, Chi-Jim; Chen, Chien-Ming; Pai, Tun-Wen; Chang, Hao-Teng; Hwang, Chi-Shin

    2016-06-01

    Identification of mutations in patients with amyotrophic lateral sclerosis (ALS) in a genome-wide association study can reveal possible biomarkers of such a rapidly progressive and fatal neurodegenerative disease. It was observed that significant single nucleotide polymorphisms vary when the tested population changes from one ethnic group to another. To identify new loci associated with ALS susceptibility in the Taiwanese Han population, we performed a genome-wide association study on 94 patients with sporadic ALS and 376 matched controls. We uncovered two new susceptibility loci at 13q14.3 (rs2785946) and 11q25 (rs11224052). In addition, we analyzed the functions of all the associated genes among 54 significant single nucleotide polymorphisms using Gene Ontology annotations, and the results showed several statistically significant neural- and muscle-related Gene Ontology terms and the associated diseases. PMID:26580837

  11. FINEMAP: efficient variable selection using summary data from genome-wide association studies

    PubMed Central

    Benner, Christian; Spencer, Chris C.A.; Havulinna, Aki S.; Salomaa, Veikko; Ripatti, Samuli; Pirinen, Matti

    2016-01-01

    Motivation: The goal of fine-mapping in genomic regions associated with complex diseases and traits is to identify causal variants that point to molecular mechanisms behind the associations. Recent fine-mapping methods using summary data from genome-wide association studies rely on exhaustive search through all possible causal configurations, which is computationally expensive. Results: We introduce FINEMAP, a software package to efficiently explore a set of the most important causal configurations of the region via a shotgun stochastic search algorithm. We show that FINEMAP produces accurate results in a fraction of processing time of existing approaches and is therefore a promising tool for analyzing growing amounts of data produced in genome-wide association studies and emerging sequencing projects. Availability and implementation: FINEMAP v1.0 is freely available for Mac OS X and Linux at http://www.christianbenner.com. Contact: christian.benner@helsinki.fi or matti.pirinen@helsinki.fi PMID:26773131

  12. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions.

    PubMed

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile J W; de Moor, Marleen H M; Madden, Pamela A F; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-08-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10(-06), KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene. PMID:23211697

  13. Genome-wide Association Study Identifies New Susceptibility Loci for Posttraumatic Stress Disorder

    PubMed Central

    Xie, Pingxing; Kranzler, Henry R.; Yang, Can; Zhao, Hongyu; Farrer, Lindsay A.; Gelernter, Joel

    2013-01-01

    Background Genetic factors influence the risk for posttraumatic stress disorder (PTSD), a potentially chronic and disabling psychiatric disorder that can arise after exposure to trauma. Candidate gene association studies have identified few genetic variants that contribute to PTSD risk. Methods We conducted genome-wide association analyses in 1578 European Americans (EAs), including 300 PTSD cases, and 2766 African Americans, including 444 PTSD cases, to find novel common risk alleles for PTSD. We used the Illumina Omni1-Quad microarray, which yielded approximately 870,000 single nucleotide polymorphisms (SNPs) suitable for analysis. Results In EAs, we observed that one SNP on chromosome 7p12, rs406001, exceeded genome-wide significance (p = 3.97×10−8). A SNP that maps to the first intron of the Tolloid-Like 1 gene (TLL1) showed the second strongest evidence of association, although no SNPs at this locus reached genome-wide significance. We then tested six SNPs in an independent sample of nearly 2000 EAs and successfully replicated the association findings for two SNPs in the first intron of TLL1, rs6812849 and rs7691872, with p values of 6.3×10−6 and 2.3×10−4, respectively. In the combined sample, rs6812849 had a p value of 3.1 ×10−9. No significant signals were observed in the African American part of the sample. Genome-wide association study analyses restricted to trauma-exposed individuals yielded very similar results. Conclusions This study identified TLL1 as a new susceptibility gene for PTSD. PMID:23726511

  14. Meta-Analyses of Genome-Wide Association Data Hold New Promise for Addiction Genetics.

    PubMed

    Agrawal, Arpana; Edenberg, Howard J; Gelernter, Joel

    2016-09-01

    Meta-analyses of genome-wide association study data have begun to lead to promising new discoveries for behavioral and psychiatrically relevant phenotypes (e.g., schizophrenia, educational attainment). We outline how this methodology can similarly lead to novel discoveries in genomic studies of substance use disorders, and discuss challenges that will need to be overcome to accomplish this goal. We illustrate our approach with the work of the newly established Substance Use Disorders workgroup of the Psychiatric Genomics Consortium. PMID:27588522

  15. Citalopram and escitalopram plasma drug and metabolite concentrations: genome-wide associations

    PubMed Central

    Ji, Yuan; Schaid, Daniel J; Desta, Zeruesenay; Kubo, Michiaki; Batzler, Anthony J; Snyder, Karen; Mushiroda, Taisei; Kamatani, Naoyuki; Ogburn, Evan; Hall-Flavin, Daniel; Flockhart, David; Nakamura, Yusuke; Mrazek, David A; Weinshilboum, Richard M

    2014-01-01

    Aims Citalopram (CT) and escitalopram (S-CT) are among the most widely prescribed selective serotonin reuptake inhibitors used to treat major depressive disorder (MDD). We applied a genome-wide association study to identify genetic factors that contribute to variation in plasma concentrations of CT or S-CT and their metabolites in MDD patients treated with CT or S-CT. Methods Our genome-wide association study was performed using samples from 435 MDD patients. Linear mixed models were used to account for within-subject correlations of longitudinal measures of plasma drug/metabolite concentrations (4 and 8 weeks after the initiation of drug therapy), and single-nucleotide polymorphisms (SNPs) were modelled as additive allelic effects. Results Genome-wide significant associations were observed for S-CT concentration with SNPs in or near the CYP2C19 gene on chromosome 10 (rs1074145, P = 4.1 × 10−9) and with S-didesmethylcitalopram concentration for SNPs near the CYP2D6 locus on chromosome 22 (rs1065852, P = 2.0 × 10−16), supporting the important role of these cytochrome P450 (CYP) enzymes in biotransformation of citalopram. After adjustment for the effect of CYP2C19 functional alleles, the analyses also identified novel loci that will require future replication and functional validation. Conclusions In vitro and in vivo studies have suggested that the biotransformation of CT to monodesmethylcitalopram and didesmethylcitalopram is mediated by CYP isozymes. The results of our genome-wide association study performed in MDD patients treated with CT or S-CT have confirmed those observations but also identified novel genomic loci that might play a role in variation in plasma levels of CT or its metabolites during the treatment of MDD patients with these selective serotonin reuptake inhibitors. PMID:24528284

  16. Genome-wide methylation profiling in archival formalin-fixed paraffin-embedded tissue samples.

    PubMed

    Killian, J Keith; Walker, Robert L; Bilke, Sven; Chen, Yidong; Davis, Sean; Cornelison, Robert; Smith, William I; Meltzer, Paul S

    2012-01-01

    New technologies allow for genome-scale measurement of DNA methylation. In an effort to increase the clinical utility of DNA methylation as a biomarker, we have adapted a commercial bisulfite epigenotyping assay for genome-wide methylation profiling in archival formalin-fixed paraffin-embedded pathology specimens. This chapter takes the reader step by step through a biomarker discovery experiment to identify phenotype-correlated DNA methylation signatures in routine pathology specimens. PMID:22081342

  17. A genome-wide association study of body mass index across early life and childhood

    PubMed Central

    Warrington, Nicole M; Howe, Laura D; Paternoster, Lavinia; Kaakinen, Marika; Herrala, Sauli; Huikari, Ville; Wu, Yan Yan; Kemp, John P; Timpson, Nicholas J; Pourcain, Beate St; Davey Smith, George; Tilling, Kate; Jarvelin, Marjo-Riitta; Pennell, Craig E; Evans, David M; Lawlor, Debbie A; Briollais, Laurent; Palmer, Lyle J

    2015-01-01

    Background: Several studies have investigated the effect of known adult body mass index (BMI) associated single nucleotide polymorphisms (SNPs) on BMI in childhood. There has been no genome-wide association study (GWAS) of BMI trajectories over childhood. Methods: We conducted a GWAS meta-analysis of BMI trajectories from 1 to 17 years of age in 9377 children (77 967 measurements) from the Avon Longitudinal Study of Parents and Children (ALSPAC) and the Western Australian Pregnancy Cohort (Raine) Study. Genome-wide significant loci were examined in a further 3918 individuals (48 530 measurements) from Northern Finland. Linear mixed effects models with smoothing splines were used in each cohort for longitudinal modelling of BMI. Results: A novel SNP, downstream from the FAM120AOS gene on chromosome 9, was detected in the meta-analysis of ALSPAC and Raine. This association was driven by a difference in BMI at 8 years (T allele of rs944990 increased BMI; PSNP = 1.52 × 10−8), with a modest association with change in BMI over time (PWald(Change) = 0.006). Three known adult BMI-associated loci (FTO, MC4R and ADCY3) and one childhood obesity locus (OLFM4) reached genome-wide significance (PWald < 1.13 × 10−8) with BMI at 8 years and/or change over time. Conclusions: This GWAS of BMI trajectories over childhood identified a novel locus that warrants further investigation. We also observed genome-wide significance with previously established obesity loci, making the novel observation that these loci affected both the level and the rate of change in BMI. We have demonstrated that the use of repeated measures data can increase power to allow detection of genetic loci with smaller sample sizes. PMID:25953783

  18. Common Variants Confer Susceptibility to Barrett's Esophagus: Insights from the First Genome-Wide Association Studies.

    PubMed

    Palles, Claire; Findlay, John M; Tomlinson, Ian

    2016-01-01

    Eight loci have been identified by the two genome-wide association studies of Barrett's esophagus that have been conducted to date. Esophageal adenocarcinoma cases were included in the second study following evidence that predisposing genetic variants for this cancer overlap with those for Barrett's esophagus. Genes with roles in embryonic development of the foregut are adjacent to 6 of the loci identified (FOXF1, BARX1, FOXP1, GDF7, TBX5, and ALDH1A2). An additional locus maps to a gene with known oncogenic potential (CREB-regulated transcription coactivator 1), but expression quantitative trait data implicates yet another gene involved in esophageal development (PBX4). These results strongly support a model whereby dysregulation of genes involved in esophageal and thoracic development increases susceptibility to Barrett's esophagus and esophageal adenocarcinoma, probably by reducing anatomical antireflux mechanisms. An additional signal at 6p21 in the major histocompatibility complex also reinforces evidence that immune and inflammatory response to reflux is involved in the development of both diseases. All of the variants identified are intronic or intergenic rather than coding and are presumed to be or to mark regulatory variants. As with genome-wide association studies of other diseases, the functional variants at each locus are yet to be identified and the genes affected need confirming. In this chapter as well as discussing the biology behind each genome-wide association signal, we review the requirements for successfully conducting genome-wide association studies and discuss how progress in understanding the genetic variants that contribute to Barrett's esophagus/esophageal adenocarcinoma susceptibility compares to other cancers. PMID:27573776

  19. HIV Genome-Wide Protein Associations: a Review of 30 Years of Research.

    PubMed

    Li, Guangdi; De Clercq, Erik

    2016-09-01

    The HIV genome encodes a small number of viral proteins (i.e., 16), invariably establishing cooperative associations among HIV proteins and between HIV and host proteins, to invade host cells and hijack their internal machineries. As a known example, the HIV envelope glycoprotein GP120 is closely associated with GP41 for viral entry. From a genome-wide perspective, a hypothesis can be worked out to determine whether 16 HIV proteins could develop 120 possible pairwise associations either by physical interactions or by functional associations mediated via HIV or host molecules. Here, we present the first systematic review of experimental evidence on HIV genome-wide protein associations using a large body of publications accumulated over the past 3 decades. Of 120 possible pairwise associations between 16 HIV proteins, at least 34 physical interactions and 17 functional associations have been identified. To achieve efficient viral replication and infection, HIV protein associations play essential roles (e.g., cleavage, inhibition, and activation) during the HIV life cycle. In either a dispensable or an indispensable manner, each HIV protein collaborates with another viral protein to accomplish specific activities that precisely take place at the proper stages of the HIV life cycle. In addition, HIV genome-wide protein associations have an impact on anti-HIV inhibitors due to the extensive cross talk between drug-inhibited proteins and other HIV proteins. Overall, this study presents for the first time a comprehensive overview of HIV genome-wide protein associations, highlighting meticulous collaborations between all viral proteins during the HIV life cycle. PMID:27357278

  20. Common genetic variation and survival after colorectal cancer diagnosis: a genome-wide analysis.

    PubMed

    Phipps, Amanda I; Passarelli, Michael N; Chan, Andrew T; Harrison, Tabitha A; Jeon, Jihyoun; Hutter, Carolyn M; Berndt, Sonja I; Brenner, Hermann; Caan, Bette J; Campbell, Peter T; Chang-Claude, Jenny; Chanock, Stephen J; Cheadle, Jeremy P; Curtis, Keith R; Duggan, David; Fisher, David; Fuchs, Charles S; Gala, Manish; Giovannucci, Edward L; Hayes, Richard B; Hoffmeister, Michael; Hsu, Li; Jacobs, Eric J; Jansen, Lina; Kaplan, Richard; Kap, Elisabeth J; Maughan, Timothy S; Potter, John D; Schoen, Robert E; Seminara, Daniela; Slattery, Martha L; West, Hannah; White, Emily; Peters, Ulrike; Newcomb, Polly A

    2016-01-01

    Genome-wide association studies have identified several germline single nucleotide polymorphisms (SNPs) significantly associated with colorectal cancer (CRC) incidence. Common germline genetic variation may also be related to CRC survival. We used a discovery-based approach to identify SNPs related to survival outcomes after CRC diagnosis. Genome-wide genotyping arrays were conducted for 3494 individuals with invasive CRC enrolled in six prospective cohort studies (median study-specific follow-up = 4.2-8.1 years). In pooled analyses, we used Cox regression to assess SNP-specific associations with CRC-specific and overall survival, with additional analyses stratified by stage at diagnosis. Top findings were followed-up in independent studies. A P value threshold of P < 5×10(-8) in analyses combining discovery and follow-up studies was required for genome-wide significance. Among individuals with distant-metastatic CRC, several SNPs at 6p12.1, nearest the ELOVL5 gene, were statistically significantly associated with poorer survival, with the strongest associations noted for rs209489 [hazard ratio (HR) = 1.8, P = 7.6×10(-10) and HR = 1.8, P = 3.7×10(-9) for CRC-specific and overall survival, respectively). No SNPs were statistically significantly associated with survival among all cases combined or in cases without distant-metastases. SNPs in 6p12.1/ELOVL5 were associated with survival outcomes in individuals with distant-metastatic CRC, and merit further follow-up for functional significance. Findings from this genome-wide association study highlight the potential importance of genetic variation in CRC prognosis and provide clues to genomic regions of potential interest. PMID:26586795

  1. Genome-Wide Association Study Identifies Novel Loci Associated With Diisocyanate-Induced Occupational Asthma.

    PubMed

    Yucesoy, Berran; Kaufman, Kenneth M; Lummus, Zana L; Weirauch, Matthew T; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B; Bernstein, David I

    2015-07-01

    Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10(-14)). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10(-9) and rs2514805, p = 1.22 × 10(-8), respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10(-6)). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132

  2. Genome-wide analysis of epistasis in body mass index using multiple human populations

    PubMed Central

    Wei, Wen-Hua; Hemani, Gib; Gyenesei, Attila; Vitart, Veronique; Navarro, Pau; Hayward, Caroline; Cabrera, Claudia P; Huffman, Jennifer E; Knott, Sara A; Hicks, Andrew A; Rudan, Igor; Pramstaller, Peter P; Wild, Sarah H; Wilson, James F; Campbell, Harry; Hastie, Nicholas D; Wright, Alan F; Haley, Chris S

    2012-01-01

    We surveyed gene–gene interactions (epistasis) in human body mass index (BMI) in four European populations (n<1200) via exhaustive pair-wise genome scans where interactions were computed as F ratios by testing a linear regression model fitting two single-nucleotide polymorphisms (SNPs) with interactions against the one without. Before the association tests, BMI was corrected for sex and age, normalised and adjusted for relatedness. Neither single SNPs nor SNP interactions were genome-wide significant in either cohort based on the consensus threshold (P=5.0E−08) and a Bonferroni corrected threshold (P=1.1E−12), respectively. Next we compared sub genome-wide significant SNP interactions (P<5.0E−08) across cohorts to identify common epistatic signals, where SNPs were annotated to genes to test for gene ontology (GO) enrichment. Among the epistatic genes contributing to the commonly enriched GO terms, 19 were shared across study cohorts of which 15 are previously published genome-wide association loci, including CDH13 (cadherin 13) associated with height and SORCS2 (sortilin-related VPS10 domain containing receptor 2) associated with circulating insulin-like growth factor 1 and binding protein 3. Interactions between the 19 shared epistatic genes and those involving BMI candidate loci (P<5.0E−08) were tested across cohorts and found eight replicated at the SNP level (P<0.05) in at least one cohort, which were further tested and showed limited replication in a separate European population (n>5000). We conclude that genome-wide analysis of epistasis in multiple populations is an effective approach to provide new insights into the genetic regulation of BMI but requires additional efforts to confirm the findings. PMID:22333899

  3. Acetaminophen-NAPQI Hepatotoxicity: A Cell Line Model System Genome-Wide Association Study

    PubMed Central

    Moyer, Ann M.; Fridley, Brooke L.; Jenkins, Gregory D.; Batzler, Anthony J.; Pelleymounter, Linda L.; Kalari, Krishna R.; Ji, Yuan; Chai, Yubo; Nordgren, Kendra K. S.; Weinshilboum, Richard M.

    2011-01-01

    Acetaminophen is the leading cause of acute hepatic failure in many developed nations. Acetaminophen hepatotoxicity is mediated by the reactive metabolite N-acetyl-p-benzoquinonimine (NAPQI). We performed a “discovery” genome-wide association study using a cell line–based model system to study the possible contribution of genomics to NAPQI-induced cytotoxicity. A total of 176 lymphoblastoid cell lines from healthy subjects were treated with increasing concentrations of NAPQI. Inhibiting concentration 50 values were determined and were associated with “glutathione pathway” gene single nucleotide polymorphisms (SNPs) and genome-wide basal messenger RNA expression, as well as with 1.3 million genome-wide SNPs. A group of SNPs in linkage disequilibrium on chromosome 3 was highly associated with NAPQI toxicity. The p value for rs2880961, the SNP with the lowest p value, was 1.88 × 10−7. This group of SNPs mapped to a “gene desert,” but chromatin immunoprecipitation assays demonstrated binding of several transcription factor proteins including heat shock factor 1 (HSF1) and HSF2, at or near rs2880961. These chromosome 3 SNPs were not significantly associated with variation in basal expression for any of the genome-wide genes represented on the Affymetrix U133 Plus 2.0 GeneChip. We have used a cell line–based model system to identify a SNP signal associated with NAPQI cytotoxicity. If these observations are validated in future clinical studies, this SNP signal might represent a potential biomarker for risk of acetaminophen hepatotoxicity. The mechanisms responsible for this association remain unclear. PMID:21177773

  4. Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer.

    PubMed

    Wolpin, Brian M; Rizzato, Cosmeri; Kraft, Peter; Kooperberg, Charles; Petersen, Gloria M; Wang, Zhaoming; Arslan, Alan A; Beane-Freeman, Laura; Bracci, Paige M; Buring, Julie; Canzian, Federico; Duell, Eric J; Gallinger, Steven; Giles, Graham G; Goodman, Gary E; Goodman, Phyllis J; Jacobs, Eric J; Kamineni, Aruna; Klein, Alison P; Kolonel, Laurence N; Kulke, Matthew H; Li, Donghui; Malats, Núria; Olson, Sara H; Risch, Harvey A; Sesso, Howard D; Visvanathan, Kala; White, Emily; Zheng, Wei; Abnet, Christian C; Albanes, Demetrius; Andreotti, Gabriella; Austin, Melissa A; Barfield, Richard; Basso, Daniela; Berndt, Sonja I; Boutron-Ruault, Marie-Christine; Brotzman, Michelle; Büchler, Markus W; Bueno-de-Mesquita, H Bas; Bugert, Peter; Burdette, Laurie; Campa, Daniele; Caporaso, Neil E; Capurso, Gabriele; Chung, Charles; Cotterchio, Michelle; Costello, Eithne; Elena, Joanne; Funel, Niccola; Gaziano, J Michael; Giese, Nathalia A; Giovannucci, Edward L; Goggins, Michael; Gorman, Megan J; Gross, Myron; Haiman, Christopher A; Hassan, Manal; Helzlsouer, Kathy J; Henderson, Brian E; Holly, Elizabeth A; Hu, Nan; Hunter, David J; Innocenti, Federico; Jenab, Mazda; Kaaks, Rudolf; Key, Timothy J; Khaw, Kay-Tee; Klein, Eric A; Kogevinas, Manolis; Krogh, Vittorio; Kupcinskas, Juozas; Kurtz, Robert C; LaCroix, Andrea; Landi, Maria T; Landi, Stefano; Le Marchand, Loic; Mambrini, Andrea; Mannisto, Satu; Milne, Roger L; Nakamura, Yusuke; Oberg, Ann L; Owzar, Kouros; Patel, Alpa V; Peeters, Petra H M; Peters, Ulrike; Pezzilli, Raffaele; Piepoli, Ada; Porta, Miquel; Real, Francisco X; Riboli, Elio; Rothman, Nathaniel; Scarpa, Aldo; Shu, Xiao-Ou; Silverman, Debra T; Soucek, Pavel; Sund, Malin; Talar-Wojnarowska, Renata; Taylor, Philip R; Theodoropoulos, George E; Thornquist, Mark; Tjønneland, Anne; Tobias, Geoffrey S; Trichopoulos, Dimitrios; Vodicka, Pavel; Wactawski-Wende, Jean; Wentzensen, Nicolas; Wu, Chen; Yu, Herbert; Yu, Kai; Zeleniuch-Jacquotte, Anne; Hoover, Robert; Hartge, Patricia; Fuchs, Charles; Chanock, Stephen J; Stolzenberg-Solomon, Rachael S; Amundadottir, Laufey T

    2014-09-01

    We performed a multistage genome-wide association study including 7,683 individuals with pancreatic cancer and 14,397 controls of European descent. Four new loci reached genome-wide significance: rs6971499 at 7q32.3 (LINC-PINT, per-allele odds ratio (OR) = 0.79, 95% confidence interval (CI) 0.74-0.84, P = 3.0 × 10(-12)), rs7190458 at 16q23.1 (BCAR1/CTRB1/CTRB2, OR = 1.46, 95% CI 1.30-1.65, P = 1.1 × 10(-10)), rs9581943 at 13q12.2 (PDX1, OR = 1.15, 95% CI 1.10-1.20, P = 2.4 × 10(-9)) and rs16986825 at 22q12.1 (ZNRF3, OR = 1.18, 95% CI 1.12-1.25, P = 1.2 × 10(-8)). We identified an independent signal in exon 2 of TERT at the established region 5p15.33 (rs2736098, OR = 0.80, 95% CI 0.76-0.85, P = 9.8 × 10(-14)). We also identified a locus at 8q24.21 (rs1561927, P = 1.3 × 10(-7)) that approached genome-wide significance located 455 kb telomeric of PVT1. Our study identified multiple new susceptibility alleles for pancreatic cancer that are worthy of follow-up studies. PMID:25086665

  5. Genome-wide association study for wool production traits in a Chinese Merino sheep population.

    PubMed

    Wang, Zhipeng; Zhang, Hui; Yang, Hua; Wang, Shouzhi; Rong, Enguang; Pei, Wenyu; Li, Hui; Wang, Ning

    2014-01-01

    Genome-wide association studies (GWAS) provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type) genotyped with 50 K single nucleotide polymorphisms (SNPs). In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep. PMID:25268383

  6. Revisiting the classification of curtoviruses based on genome-wide pairwise identity.

    PubMed

    Varsani, Arvind; Martin, Darren P; Navas-Castillo, Jesús; Moriones, Enrique; Hernández-Zepeda, Cecilia; Idris, Ali; Murilo Zerbini, F; Brown, Judith K

    2014-07-01

    Members of the genus Curtovirus (family Geminiviridae) are important pathogens of many wild and cultivated plant species. Until recently, relatively few full curtovirus genomes have been characterised. However, with the 19 full genome sequences now available in public databases, we revisit the proposed curtovirus species and strain classification criteria. Using pairwise identities coupled with phylogenetic evidence, revised species and strain demarcation guidelines have been instituted. Specifically, we have established 77 % genome-wide pairwise identity as a species demarcation threshold and 94 % genome-wide pairwise identity as a strain demarcation threshold. Hence, whereas curtovirus sequences with >77 % genome-wide pairwise identity would be classified as belonging to the same species, those sharing >94 % identity would be classified as belonging to the same strain. We provide step-by-step guidelines to facilitate the classification of newly discovered curtovirus full genome sequences and a set of defined criteria for naming new species and strains. The revision yields three curtovirus species: Beet curly top virus (BCTV), Spinach severe surly top virus (SpSCTV) and Horseradish curly top virus (HrCTV). PMID:24463952

  7. Genome-Wide Association Study for Wool Production Traits in a Chinese Merino Sheep Population

    PubMed Central

    Wang, Zhipeng; Zhang, Hui; Yang, Hua; Wang, Shouzhi; Rong, Enguang; Pei, Wenyu; Li, Hui; Wang, Ning

    2014-01-01

    Genome-wide association studies (GWAS) provide a powerful approach for identifying quantitative trait loci without prior knowledge of location or function. To identify loci associated with wool production traits, we performed a genome-wide association study on a total of 765 Chinese Merino sheep (JunKen type) genotyped with 50 K single nucleotide polymorphisms (SNPs). In the present study, five wool production traits were examined: fiber diameter, fiber diameter coefficient of variation, fineness dispersion, staple length and crimp. We detected 28 genome-wide significant SNPs for fiber diameter, fiber diameter coefficient of variation, fineness dispersion, and crimp trait in the Chinese Merino sheep. About 43% of the significant SNP markers were located within known or predicted genes, including YWHAZ, KRTCAP3, TSPEAR, PIK3R4, KIF16B, PTPN3, GPRC5A, DDX47, TCF9, TPTE2, EPHA5 and NBEA genes. Our results not only confirm the results of previous reports, but also provide a suite of novel SNP markers and candidate genes associated with wool traits. Our findings will be useful for exploring the genetic control of wool traits in sheep. PMID:25268383

  8. Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia

    PubMed Central

    Ripke, Stephan; O'Dushlaine, Colm; Chambert, Kimberly; Moran, Jennifer L; Kähler, Anna K; Akterin, Susanne; Bergen, Sarah; Collins, Ann L; Crowley, James J; Fromer, Menachem; Kim, Yunjung; Lee, Sang Hong; Magnusson, Patrik KE; Sanchez, Nick; Stahl, Eli A; Williams, Stephanie; Wray, Naomi R; Xia, Kai; Bettella, Francesco; Børglum, Anders D; Bulik-Sullivan, Brendan K; Cormican, Paul; Craddock, Nick; de Leeuw, Christiaan; Durmishi, Naser; Gill, Michael; Golimbet, Vera; Hamshere, Marian L; Holmans, Peter; Hougaard, David M; Kendler, Kenneth S; Lin, Kuang; Morris, Derek W; Mors, Ole; Mortensen, Preben B; Neale, Benjamin M; O'Neill, Francis A; Owen, Michael J; Milovancevic, MilicaPejovic; Posthuma, Danielle; Powell, John; Richards, Alexander L; Riley, Brien P; Ruderfer, Douglas; Rujescu, Dan; Sigurdsson, Engilbert; Silagadze, Teimuraz; Smit, August B; Stefansson, Hreinn; Steinberg, Stacy; Suvisaari, Jaana; Tosato, Sarah; Verhage, Matthijs; Walters, James T; Bramon, Elvira; Corvin, Aiden P; O'Donovan, Michael C; Stefansson, Kari; Scolnick, Edward; Purcell, Shaun; McCarroll, Steve; Sklar, Pamela; Hultman, Christina M; Sullivan, Patrick F

    2013-01-01

    Schizophrenia is a heritable disorder with substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases, 6,243 controls) followed by meta-analysis with prior schizophrenia GWAS (8,832 cases, 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls, and 581 trios). In total, 22 regions met genome-wide significance (14 novel and one previously implicated in bipolar disorder). The results strongly implicate calcium signaling in the etiology of schizophrenia, and include genome-wide significant results for CACNA1C and CACNB2 whose protein products interact. We estimate that ∼8,300 independent and predominantly common SNPs contribute to risk for schizophrenia and that these collectively account for most of its heritability. Common genetic variation plays an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this devastating disorder. PMID:23974872

  9. Genome-wide signatures of male-mediated migration shaping the Indian gene pool.

    PubMed

    ArunKumar, GaneshPrasad; Tatarinova, Tatiana V; Duty, Jeff; Rollo, Debra; Syama, Adhikarla; Arun, Varatharajan Santhakumari; Kavitha, Valampuri John; Triska, Petr; Greenspan, Bennett; Wells, R Spencer; Pitchappan, Ramasamy

    2015-09-01

    Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India. PMID:25994871

  10. Genome-wide patterns of identity-by-descent sharing in the French Canadian founder population

    PubMed Central

    Gauvin, Héloïse; Moreau, Claudia; Lefebvre, Jean-François; Laprise, Catherine; Vézina, Hélène; Labuda, Damian; Roy-Gagnon, Marie-Hélène

    2014-01-01

    In genetics the ability to accurately describe the familial relationships among a group of individuals can be very useful. Recent statistical tools succeeded in assessing the degree of relatedness up to 6–7 generations with good power using dense genome-wide single-nucleotide polymorphism data to estimate the extent of identity-by-descent (IBD) sharing. It is therefore important to describe genome-wide patterns of IBD sharing for more remote and complex relatedness between individuals, such as that observed in a founder population like Quebec, Canada. Taking advantage of the extended genealogical records of the French Canadian founder population, we first compared different tools to identify regions of IBD in order to best describe genome-wide IBD sharing and its correlation with genealogical characteristics. Results showed that the extent of IBD sharing identified with FastIBD correlates best with relatedness measured using genealogical data. Total length of IBD sharing explained 85% of the genealogical kinship's variance. In addition, we observed significantly higher sharing in pairs of individuals with at least one inbred ancestor compared with those without any. Furthermore, patterns of IBD sharing and average sharing were different across regional populations, consistent with the settlement history of Quebec. Our results suggest that, as expected, the complex relatedness present in founder populations is reflected in patterns of IBD sharing. Using these patterns, it is thus possible to gain insight on the types of distant relationships in a sample from a founder population like Quebec. PMID:24129432

  11. On the analysis of a repeated measure design in genome-wide association analysis.

    PubMed

    Lee, Young; Park, Suyeon; Moon, Sanghoon; Lee, Juyoung; Elston, Robert C; Lee, Woojoo; Won, Sungho

    2014-12-01

    Longitudinal data enables detecting the effect of aging/time, and as a repeated measures design is statistically more efficient compared to cross-sectional data if the correlations between repeated measurements are not large. In particular, when genotyping cost is more expensive than phenotyping cost, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS) with longitudinal data have rarely been analyzed taking this into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE). A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs. PMID:25464127

  12. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    PubMed

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Consumption-repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies. PMID:26081443

  13. Genome-wide meta-analysis of longitudinal alcohol consumption across youth and early adulthood

    PubMed Central

    Adkins, Daniel E.; Clark, Shaunna L.; Copeland, William E.; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A.; Silberg, Judy; Brown, Tyson H.; Fergusson, David M.; Horwood, L. John; Eaves, Lindon; van den Oord, Edwin J.C.G.; Sullivan, Patrick F.; Costello, E. J.

    2016-01-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse and dependence increasing across adolescence and peaking in early adulthood. Here we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three, longitudinal community samples (N=2,126, obs=12,166). Consumption repeated measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and 6 others met our “suggestive” criterion (FDR<0.2). Genome-wide significant associations were highly biological plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significant enriched associations to intuitive biological mechanisms including neurotransmission, xenobiotic pharmacodynamics and nuclear hormone receptors. These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies. PMID:26081443

  14. Simulation of genome-wide evolution under heterogeneous substitution models and complex multispecies coalescent histories.

    PubMed

    Arenas, Miguel; Posada, David

    2014-05-01

    Genomic evolution can be highly heterogeneous. Here, we introduce a new framework to simulate genome-wide sequence evolution under a variety of substitution models that may change along the genome and the phylogeny, following complex multispecies coalescent histories that can include recombination, demographics, longitudinal sampling, population subdivision/species history, and migration. A key aspect of our simulation strategy is that the heterogeneity of the whole evolutionary process can be parameterized according to statistical prior distributions specified by the user. We used this framework to carry out a study of the impact of variable codon frequencies across genomic regions on the estimation of the genome-wide nonsynonymous/synonymous ratio. We found that both variable codon frequencies across genes and rate variation among sites and regions can lead to severe underestimation of the global dN/dS values. The program SGWE-Simulation of Genome-Wide Evolution-is freely available from http://code.google.com/p/sgwe-project/, including extensive documentation and detailed examples. PMID:24557445

  15. Genome-wide association analysis to predict optimal antipsychotic dosage in schizophrenia: a pilot study.

    PubMed

    Koga, Arthur T; Strauss, John; Zai, Clement; Remington, Gary; De Luca, Vincenzo

    2016-03-01

    In recent years, several studies have investigated genetic polymorphisms of antipsychotic drug-metabolizing enzymes and receptors. However, most studies focused on drug response and very few have investigated the genetic influence on antipsychotic dosage. The aim of the present study is to test the association between antipsychotic dosages at genome-wide level. The current dosage of antipsychotic medications was collected from 79 schizophrenia patients. The dosage was standardized using three different methods: chlorpromazine equivalent (CPZe), defined daily dose (DDD), and percentage of maximum dose (PM %). The patients were then genotyped using the Illumina HumanOmni2.5-8 BeadChip Kit. All markers were screened for significance using linear regression, and the p values were visualized using a Manhattan plot. The genome-wide analysis showed that the top Single-Nucleotide Polymorphisms (SNPs) associated with dosage variation were rs981975 on chromosome 14 for CPZe, rs4470690 on chromosome 4 for PM %, and rs79323383 on chromosome 8 for DDD. However, no genome-wide significantly associated SNPs were identified. In this pilot sample, we found promising trends for pharmacodynamic targets associated with antipsychotic dosage. Therefore, studies combining large prescription databases may identify genetic predictors to adjust the dose of antipsychotic medication. PMID:26821981

  16. Five endometrial cancer risk loci identified through genome-wide association analysis.

    PubMed

    Cheng, Timothy H T; Thompson, Deborah J; O'Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Li, Mulin Jun; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-06-01

    We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r(2) = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer. PMID:27135401

  17. Reconstructing the population history of European Romani from genome-wide data.

    PubMed

    Mendizabal, Isabel; Lao, Oscar; Marigorta, Urko M; Wollstein, Andreas; Gusmão, Leonor; Ferak, Vladimir; Ioana, Mihai; Jordanova, Albena; Kaneva, Radka; Kouvatsi, Anastasia; Kučinskas, Vaidutis; Makukh, Halyna; Metspalu, Andres; Netea, Mihai G; de Pablo, Rosario; Pamjav, Horolma; Radojkovic, Dragica; Rolleston, Sarah J H; Sertic, Jadranka; Macek, Milan; Comas, David; Kayser, Manfred

    2012-12-18

    The Romani, the largest European minority group with approximately 11 million people, constitute a mosaic of languages, religions, and lifestyles while sharing a distinct social heritage. Linguistic and genetic studies have located the Romani origins in the Indian subcontinent. However, a genome-wide perspective on Romani origins and population substructure, as well as a detailed reconstruction of their demographic history, has yet to be provided. Our analyses based on genome-wide data from 13 Romani groups collected across Europe suggest that the Romani diaspora constitutes a single initial founder population that originated in north/northwestern India ~1.5 thousand years ago (kya). Our results further indicate that after a rapid migration with moderate gene flow from the Near or Middle East, the European spread of the Romani people was via the Balkans starting ~0.9 kya. The strong population substructure and high levels of homozygosity we found in the European Romani are in line with genetic isolation as well as differential gene flow in time and space with non-Romani Europeans. Overall, our genome-wide study sheds new light on the origins and demographic history of European Romani. PMID:23219723

  18. Genome-Wide Pathway Association Studies of Multiple Correlated Quantitative Phenotypes Using Principle Component Analyses

    PubMed Central

    Zhang, Feng; Guo, Xiong; Wu, Shixun; Han, Jing; Liu, Yongjun; Shen, Hui; Deng, Hong-Wen

    2012-01-01

    Genome-wide pathway association studies provide novel insight into the biological mechanism underlying complex diseases. Current pathway association studies primarily focus on single important disease phenotype, which is sometimes insufficient to characterize the clinical manifestations of complex diseases. We present a multi-phenotypes pathway association study(MPPAS) approach using principle component analysis(PCA). In our approach, PCA is first applied to multiple correlated quantitative phenotypes for extracting a set of orthogonal phenotypic components. The extracted phenotypic components are then used for pathway association analysis instead of original quantitative phenotypes. Four statistics were proposed for PCA-based MPPAS in this study. Simulations using the real data from the HapMap project were conducted to evaluate the power and type I error rates of PCA-based MPPAS under various scenarios considering sample sizes, additive and interactive genetic effects. A real genome-wide association study data set of bone mineral density (BMD) at hip and spine were also analyzed by PCA-based MPPAS. Simulation studies illustrated the performance of PCA-based MPPAS for identifying the causal pathways underlying complex diseases. Genome-wide MPPAS of BMD detected associations between BMD and KENNY_CTNNB1_TARGETS_UP as well as LONGEVITYPATHWAY pathways in this study. We aim to provide a applicable MPPAS approach, which may help to gain deep understanding the potential biological mechanism of association results for complex diseases. PMID:23285279

  19. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  20. A twin study of breastfeeding with a preliminary genome wide association scan

    PubMed Central

    Colodro-Conde, L.; Zhu, G.; Power, R. A.; Henders, A.; Heath, A.C.; Madden, P.A.F.; Montgomery, G.W.; Medland, S. E.; Ordoñana, J.R.; Martin, N.G.

    2015-01-01

    Breastfeeding has been an important survival trait during human history, though it has long been recognised that individuals differ in their exact breastfeeding behaviour. Here our aims were, first, to explore to what extent genetic and environmental influences contributed to the individual differences in breastfeeding behaviour; second, to detect possible genetic variants related to breastfeeding; and lastly, to test if the genetic variants associated with breastfeeding have been previously found to be related with breast size. Data were collected from a large community-based cohort of Australian twins, with 3,364 women for the twin modelling analyses and 1,521 of them included in the genome wide association study. Monozygotic twin correlations (rMZ = .52, 95% CI .46 – .57) were larger than dizygotic twin correlations (rDZ = .35, 95% CI .25 – .43) and the best-fitting model was the one composed by additive genetics and unique environmental factors, explaining 53% and 47% of the variance in breastfeeding behaviour, respectively. No breastfeeding-related genetic variants reached genome-wide significance. The polygenic risk score analyses showed no significant results, suggesting breast size does not influence breastfeeding. This study confers a replication of a previous one exploring the sources of variance of breastfeeding and, to our knowledge, is the first one to conduct a Genome-Wide Association Study on breastfeeding and look at the overlap with variants for breast size. PMID:25475840

  1. Genome-wide association studies for hematological traits in Chinese Sutai pigs

    PubMed Central

    2014-01-01

    Background It has been shown that hematological traits are strongly associated with the metabolism and the immune system in domestic pig. However, little is known about the genetic architecture of hematological traits. To identify quantitative trait loci (QTL) controlling hematological traits, we performed single marker Genome-wide association studies (GWAS) and haplotype analysis for 15 hematological traits in 495 Chinese Sutai pigs. Results We identified 161 significant SNPs including 44 genome-wide significant SNPs associated with 11 hematological traits by single marker GWAS. Most of them were located on SSC2. Meanwhile, we detected 499 significant SNPs containing 154 genome-wide significant SNPs associated with 9 hematological traits by haplotype analysis. Most of the identified loci were located on SSC7 and SSC9. Conclusions We detected 4 SNPs with pleiotropic effects on SSC2 by single marker GWAS and (or) on SSC7 by haplotype analysis. Furthermore, through checking the gene functional annotations, positions and their expression variation, we finally selected 7 genes as potential candidates. Specially, we found that three genes (TRIM58, TRIM26 and TRIM21) of them originated from the same gene family and executed similar function of innate and adaptive immune. The findings will contribute to dissection the immune gene network, further identification of causative mutations underlying the identified QTLs and providing insights into the molecular basis of hematological trait in domestic pig. PMID:24674592

  2. NSD1 mutations generate a genome-wide DNA methylation signature

    PubMed Central

    Choufani, S.; Cytrynbaum, C.; Chung, B. H. Y.; Turinsky, A. L.; Grafodatskaya, D.; Chen, Y. A.; Cohen, A. S. A.; Dupuis, L.; Butcher, D. T.; Siu, M. T.; Luk, H. M.; Lo, I. F. M.; Lam, S. T. S.; Caluseriu, O.; Stavropoulos, D. J.; Reardon, W.; Mendoza-Londono, R.; Brudno, M.; Gibson, W. T.; Chitayat, D.; Weksberg, R.

    2015-01-01

    Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1+/−-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1+/− signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing. PMID:26690673

  3. Novel Loci Associated with Usual Sleep Duration: The CHARGE Consortium Genome-Wide Association Study

    PubMed Central

    Gottlieb, Daniel J.; Hek, Karin; Chen, Ting-hsu; Watson, Nathaniel F.; Eiriksdottir, Gudny; Byrne, Enda M.; Cornelis, Marilyn; Warby, Simon C.; Bandinelli, Stefania; Cherkas, Lynn; Evans, Daniel S.; Grabe, Hans J.; Lahti, Jari; Li, Man; Lehtimäki, Terho; Lumley, Thomas; Marciante, Kristin D.; Pérusse, Louis; Psaty, Bruce M.; Robbins, John; Tranah, Gregory J.; Vink, Jacqueline M.; Wilk, Jemma B.; Stafford, Jeanette M.; Bellis, Claire; Biffar, Reiner; Bouchard, Claude; Cade, Brian; Curhan, Gary C.; Eriksson, Johan G.; Ewert, Ralf; Ferrucci, Luigi; Fülöp, Tibor; Gehrman, Philip R.; Goodloe, Robert; Harris, Tamara B.; Heath, Andrew C.; Hernandez, Dena; Hofman, Albert; Hottenga, Jouke-Jan; Hunter, David J.; Jensen, Majken K.; Johnson, Andrew D.; Kähönen, Mika; Kao, Linda; Kraft, Peter; Larkin, Emma K.; Lauderdale, Diane S.; Luik, Annemarie I.; Medici, Marco; Montgomery, Grant W.; Palotie, Aarno; Patel, Sanjay R.; Pistis, Giorgio; Porcu, Eleonora; Quaye, Lydia; Raitakari, Olli; Redline, Susan; Rimm, Eric B.; Rotter, Jerome I.; Smith, Albert V.; Spector, Tim D.; Teumer, Alexander; Uitterlinden, André G.; Vohl, Marie-Claude; Widen, Elisabeth; Willemsen, Gonneke; Young, Terry; Zhang, Xiaoling; Liu, Yongmei; Blangero, John; Boomsma, Dorret I.; Gudnason, Vilmundur; Hu, Frank; Mangino, Massimo; Martin, Nicholas G.; O’Connor, George T.; Stone, Katie L.; Tanaka, Toshiko; Viikari, Jorma; Gharib, Sina A.; Punjabi, Naresh M.; Räikkönen, Katri; Völzke, Henry; Mignot, Emmanuel; Tiemeier, Henning

    2015-01-01

    Usual sleep duration is a heritable trait correlated with psychiatric morbidity, cardiometabolic disease and mortality, although little is known about the genetic variants influencing this trait. A genome-wide association study of usual sleep duration was conducted using 18 population-based cohorts totaling 47,180 individuals of European ancestry. Genome-wide significant association was identified at two loci. The strongest is located on chromosome 2, in an intergenic region 35–80 kb upstream from the thyroid-specific transcription factor PAX8 (lowest p=1.1 ×10−9). This finding was replicated in an African-American sample of 4771 individuals (lowest p=9.3 × 10−4). The strongest combined association was at rs1823125 (p=1.5 × 10−10, minor allele frequency 0.26 in the discovery sample, 0.12 in the replication sample), with each copy of the minor allele associated with a sleep duration 3.1 minutes longer per night. The alleles associated with longer sleep duration were associated in previous genome-wide association studies with a more favorable metabolic profile and a lower risk of attention deficit hyperactivity disorder. Understanding the mechanisms underlying these associations may help elucidate biological mechanisms influencing sleep duration and its association with psychiatric, metabolic and cardiovascular disease. PMID:25469926

  4. Integrated analysis of genome-wide genetic and epigenetic association data for identification of disease mechanisms.

    PubMed

    Ke, Xiayi; Cortina-Borja, Mario; Silva, Bruno Cesar; Lowe, Robert; Rakyan, Vardhman; Balding, David

    2013-11-01

    Many human diseases are multifactorial, involving multiple genetic and environmental factors impacting on one or more biological pathways. Much of the environmental effect is believed to be mediated through epigenetic changes. Although many genome-wide genetic and epigenetic association studies have been conducted for different diseases and traits, it is still far from clear to what extent the genomic loci and biological pathways identified in the genetic and epigenetic studies are shared. There is also a lack of statistical tools to assess these important aspects of disease mechanisms. In the present study, we describe a protocol for the integrated analysis of genome-wide genetic and epigenetic data based on permutation of a sum statistic for the combined effects in a locus or pathway. The method was then applied to published type 1 diabetes (T1D) genome-wide- and epigenome-wide-association studies data to identify genomic loci and biological pathways that are associated with T1D genetically and epigenetically. Through combined analysis, novel loci and pathways were also identified, which could add to our understanding of disease mechanisms of T1D as well as complex diseases in general. PMID:24071862

  5. Cell-Type-Specific Genome-wide Expression Profiling after Laser Capture Microdissection of Living Tissue

    SciTech Connect

    Marchetti, F; Manohar, C F

    2005-02-09

    The purpose of this technical feasibility study was to develop and evaluate robust microgenomic tools for investigations of genome-wide expression of very small numbers of cells isolated from whole tissue sections. Tissues contain large numbers of cell-types that play varied roles in organ function and responses to endogenous and exogenous toxicants whether bacterial, viral, chemical or radiation. Expression studies of whole tissue biopsy are severely limited because heterogeneous cell-types result in an averaging of molecular signals masking subtle but important changes in gene expression in any one cell type(s) or group of cells. Accurate gene expression analysis requires the study of specific cell types in their tissue environment but without contamination from surrounding cells. Laser capture microdissection (LCM) is a new technology to isolate morphologically distinct cells from tissue sections. Alternative methods are available for isolating single cells but not yet for their reliable genome-wide expression analyses. The tasks of this feasibility project were to: (1) Develop efficient protocols for laser capture microdissection of cells from tissues identified by antibody label, or morphological stain. (2) Develop reproducible gene-transcript analyses techniques for single cell-types and determine the numbers of cells needed for reliable genome-wide analyses. (3) Validate the technology for epithelial and endothelial cells isolated from the gastrointestinal tract of mice.

  6. Multi-instance multi-label distance metric learning for genome-wide protein function prediction.

    PubMed

    Xu, Yonghui; Min, Huaqing; Song, Hengjie; Wu, Qingyao

    2016-08-01

    Multi-instance multi-label (MIML) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with not only multiple instances but also multiple class labels. To find an appropriate MIML learning method for genome-wide protein function prediction, many studies in the literature attempted to optimize objective functions in which dissimilarity between instances is measured using the Euclidean distance. But in many real applications, Euclidean distance may be unable to capture the intrinsic similarity/dissimilarity in feature space and label space. Unlike other previous approaches, in this paper, we propose to learn a multi-instance multi-label distance metric learning framework (MIMLDML) for genome-wide protein function prediction. Specifically, we learn a Mahalanobis distance to preserve and utilize the intrinsic geometric information of both feature space and label space for MIML learning. In addition, we try to deal with the sparsely labeled data by giving weight to the labeled data. Extensive experiments on seven real-world organisms covering the biological three-domain system (i.e., archaea, bacteria, and eukaryote; Woese et al., 1990) show that the MIMLDML algorithm is superior to most state-of-the-art MIML learning algorithms. PMID:26923212

  7. Using genome-wide complex trait analysis to quantify 'missing heritability' in Parkinson's disease.

    PubMed

    Keller, Margaux F; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L; Guerreiro, Rita; Hernandez, Dena G; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W; Hardy, John; Martinez, Maria; Singleton, Andrew B; Nalls, Michael A

    2012-11-15

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17-38, P = 8.08E - 08) phenotypic variance associated with all types of PD, 15% (95% CI -0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17-44, P = 1.34E - 05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great deal of common variants of small effect remain to be discovered. PMID:22892372

  8. A novel genome-wide polyadenylation sites recognition system based on condition random field.

    PubMed

    Han, Jiuqiang; Zhang, Shanxin; Liu, Jun; Liu, Ruiling

    2014-01-01

    Polyadenylation including the cleavage of pre-mRNA and addition of a stretch of adenosines to the 3'-end is an essential step of pre-mRNA processing in eukayotes. The known regulatory role of polyadenylation in mRNA localization, stability, and translation and the emerging link between poly(A) and disease states underline the necessary to fully characterize polyadenylation sites. Several artificial intelligence methods have been proposed for poly(A) sites recognition. However, these methods are suitable to small subsets of genome sequences. It is necessary to propose a method for genome-wide recognition of poly(A) sites. Recent efforts have found a lot of poly(A) related factors on DNA level. Here, we proposed a novel genome-wide poly(A) recognition method based on the Condition Random Field (CRF) by integrating multiple features. Compared with the polya_svm (the most accurate program for prediction of poly(A) sites till date), our method had a higher performance with the area under ROC curve(0.8621 versus 0.6796). The result suggests that our method is an effective method in genome wide poly(A) sites recognition. PMID:25571055

  9. Genome-wide mapping of nucleosome positioning and DNA methylation within individual DNA molecules

    PubMed Central

    Kelly, Theresa K.; Liu, Yaping; Lay, Fides D.; Liang, Gangning; Berman, Benjamin P.; Jones, Peter A.

    2012-01-01

    DNA methylation and nucleosome positioning work together to generate chromatin structures that regulate gene expression. Nucleosomes are typically mapped using nuclease digestion requiring significant amounts of material and varying enzyme concentrations. We have developed a method (NOMe-seq) that uses a GpC methyltransferase (M.CviPI) and next generation sequencing to generate a high resolution footprint of nucleosome positioning genome-wide using less than 1 million cells while retaining endogenous DNA methylation information from the same DNA strand. Using a novel bioinformatics pipeline, we show a striking anti-correlation between nucleosome occupancy and DNA methylation at CTCF regions that is not present at promoters. We further show that the extent of nucleosome depletion at promoters is directly correlated to expression level and can accommodate multiple nucleosomes and provide genome-wide evidence that expressed non-CpG island promoters are nucleosome-depleted. Importantly, NOMe-seq obtains DNA methylation and nucleosome positioning information from the same DNA molecule, giving the first genome-wide DNA methylation and nucleosome positioning correlation at the single molecule, and thus, single cell level, that can be used to monitor disease progression and response to therapy. PMID:22960375

  10. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    PubMed

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136

  11. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions

    PubMed Central

    Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2015-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136

  12. Genome-Wide Association Scan for Variants Associated with Early-Onset Prostate Cancer

    PubMed Central

    Lange, Ethan M.; Johnson, Anna M.; Wang, Yunfei; Zuhlke, Kimberly A.; Lu, Yurong; Ribado, Jessica V.; Keele, Gregory R.; Li, Jin; Duan, Qing; Li, Ge; Gao, Zhengrong; Li, Yun; Xu, Jianfeng; Isaacs, William B.; Zheng, Siqun; Cooney, Kathleen A.

    2014-01-01

    Prostate cancer is the most common non-skin cancer and the second leading cause of cancer related mortality for men in the United States. There is strong empirical and epidemiological evidence supporting a stronger role of genetics in early-onset prostate cancer. We performed a genome-wide association scan for early-onset prostate cancer. Novel aspects of this study include the focus on early-onset disease (defined as men with prostate cancer diagnosed before age 56 years) and use of publically available control genotype data from previous genome-wide association studies. We found genome-wide significant (p<5×10−8) evidence for variants at 8q24 and 11p15 and strong supportive evidence for a number of previously reported loci. We found little evidence for individual or systematic inflated association findings resulting from using public controls, demonstrating the utility of using public control data in large-scale genetic association studies of common variants. Taken together, these results demonstrate the importance of established common genetic variants for early-onset prostate cancer and the power of including early-onset prostate cancer cases in genetic association studies. PMID:24740154

  13. Genetic architecture dissection by genome-wide association analysis reveals avian eggshell ultrastructure traits.

    PubMed

    Duan, Zhongyi; Sun, Congjiao; Shen, ManMan; Wang, Kehua; Yang, Ning; Zheng, Jiangxia; Xu, Guiyun

    2016-01-01

    The ultrastructure of an eggshell is considered the major determinant of eggshell quality, which has biological and economic significance for the avian and poultry industries. However, the interrelationships and genome-wide architecture of eggshell ultrastructure remain to be elucidated. Herein, we measured eggshell thickness (EST), effective layer thickness (ET), mammillary layer thickness (MT), and mammillary density (MD) and conducted genome-wide association studies in 927 F2 hens. The SNP-based heritabilities of eggshell ultrastructure traits were estimated to be 0.39, 0.36, 0.17 and 0.19 for EST, ET, MT and MD, respectively, and a total of 719, 784, 1 and 10 genome-wide significant SNPs were associated with EST, ET, MT and MD, respectively. ABCC9, ITPR2, KCNJ8 and WNK1, which are involved in ion transport, were suggested to be the key genes regulating EST and ET. ITM2C and KNDC1 likely affect MT and MD, respectively. Additionally, there were linear relationships between the chromosome lengths and the variance explained per chromosome for EST (R(2) = 0.57) and ET (R(2) = 0.67). In conclusion, the interrelationships and genetic architecture of eggshell ultrastructure traits revealed in this study are valuable for our understanding of the avian eggshell and contribute to research on a variety of other calcified shells. PMID:27456605

  14. Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder

    PubMed Central

    Mahon, Pamela Belmonte; Pirooznia, Mehdi; Goes, Fernando S.; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian; DePaulo, J. Raymond; Kelsoe, John R.; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W.; Craddock, Nick; Schulze, ThomasG.; McMahon, Francis J.; Potash, James B.; Zandi, Peter P.

    2011-01-01

    Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study(BiGS), and a German sample. The combined sample consisted of 2836 BP cases with information on sub-phenotypes and 2744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles. PMID:21305692

  15. Genome-wide detection of DNA double-stranded breaks induced by engineered nucleases.

    PubMed

    Frock, Richard L; Hu, Jiazhi; Meyers, Robin M; Ho, Yu-Jui; Kii, Erina; Alt, Frederick W

    2015-02-01

    Although great progress has been made in the characterization of the off-target effects of engineered nucleases, sensitive and unbiased genome-wide methods for the detection of off-target cleavage events and potential collateral damage are still lacking. Here we describe a linear amplification-mediated modification of a previously published high-throughput, genome-wide, translocation sequencing (HTGTS) method that robustly detects DNA double-stranded breaks (DSBs) generated by engineered nucleases across the human genome based on their translocation to other endogenous or ectopic DSBs. HTGTS with different Cas9:sgRNA or TALEN nucleases revealed off-target hotspot numbers for given nucleases that ranged from a few or none to dozens or more, and extended the number of known off-targets for certain previously characterized nucleases more than tenfold. We also identified translocations between bona fide nuclease targets on homologous chromosomes, an undesired collateral effect that has not been described previously. Finally, HTGTS confirmed that the Cas9D10A paired nickase approach suppresses off-target cleavage genome-wide. PMID:25503383

  16. Genome-wide association study identifies novel loci predisposing to cutaneous melanoma.

    PubMed

    Amos, Christopher I; Wang, Li-E; Lee, Jeffrey E; Gershenwald, Jeffrey E; Chen, Wei V; Fang, Shenying; Kosoy, Roman; Zhang, Mingfeng; Qureshi, Abrar A; Vattathil, Selina; Schacherer, Christopher W; Gardner, Julie M; Wang, Yuling; Bishop, D Tim; Barrett, Jennifer H; MacGregor, Stuart; Hayward, Nicholas K; Martin, Nicholas G; Duffy, David L; Mann, Graham J; Cust, Anne; Hopper, John; Brown, Kevin M; Grimm, Elizabeth A; Xu, Yaji; Han, Younghun; Jing, Kaiyan; McHugh, Caitlin; Laurie, Cathy C; Doheny, Kim F; Pugh, Elizabeth W; Seldin, Michael F; Han, Jiali; Wei, Qingyi

    2011-12-15

    We performed a multistage genome-wide association study of melanoma. In a discovery cohort of 1804 melanoma cases and 1026 controls, we identified loci at chromosomes 15q13.1 (HERC2/OCA2 region) and 16q24.3 (MC1R) regions that reached genome-wide significance within this study and also found strong evidence for genetic effects on susceptibility to melanoma from markers on chromosome 9p21.3 in the p16/ARF region and on chromosome 1q21.3 (ARNT/LASS2/ANXA9 region). The most significant single-nucleotide polymorphisms (SNPs) in the 15q13.1 locus (rs1129038 and rs12913832) lie within a genomic region that has profound effects on eye and skin color; notably, 50% of variability in eye color is associated with variation in the SNP rs12913832. Because eye and skin colors vary across European populations, we further evaluated the associations of the significant SNPs after carefully adjusting for European substructure. We also evaluated the top 10 most significant SNPs by using data from three other genome-wide scans. Additional in silico data provided replication of the findings from the most significant region on chromosome 1q21.3 rs7412746 (P = 6 × 10(-10)). Together, these data identified several candidate genes for additional studies to identify causal variants predisposing to increased risk for developing melanoma. PMID:21926416

  17. NSD1 mutations generate a genome-wide DNA methylation signature.

    PubMed

    Choufani, S; Cytrynbaum, C; Chung, B H Y; Turinsky, A L; Grafodatskaya, D; Chen, Y A; Cohen, A S A; Dupuis, L; Butcher, D T; Siu, M T; Luk, H M; Lo, I F M; Lam, S T S; Caluseriu, O; Stavropoulos, D J; Reardon, W; Mendoza-Londono, R; Brudno, M; Gibson, W T; Chitayat, D; Weksberg, R

    2015-01-01

    Sotos syndrome (SS) represents an important human model system for the study of epigenetic regulation; it is an overgrowth/intellectual disability syndrome caused by mutations in a histone methyltransferase, NSD1. As layered epigenetic modifications are often interdependent, we propose that pathogenic NSD1 mutations have a genome-wide impact on the most stable epigenetic mark, DNA methylation (DNAm). By interrogating DNAm in SS patients, we identify a genome-wide, highly significant NSD1(+/-)-specific signature that differentiates pathogenic NSD1 mutations from controls, benign NSD1 variants and the clinically overlapping Weaver syndrome. Validation studies of independent cohorts of SS and controls assigned 100% of these samples correctly. This highly specific and sensitive NSD1(+/-) signature encompasses genes that function in cellular morphogenesis and neuronal differentiation, reflecting cardinal features of the SS phenotype. The identification of SS-specific genome-wide DNAm alterations will facilitate both the elucidation of the molecular pathophysiology of SS and the development of improved diagnostic testing. PMID:26690673

  18. Genome-wide analysis of Notch signalling in Drosophila by transgenic RNAi

    PubMed Central

    Mummery-Widmer, Jennifer L.; Yamazaki, Masakazu; Stoeger, Thomas; Novatchkova, Maria; Bhalerao, Sheetal; Chen, Doris; Dietzl, Georg; Dickson, Barry J.; Knoblich, Juergen A.

    2010-01-01

    Genome-wide RNA interference (RNAi) screens have identified near-complete sets of genes involved in cellular processes. However, this methodology has not yet been used to study complex developmental processes in a tissue-specific manner. Here we report the use of a library of Drosophila strains expressing inducible hairpin RNAi constructs to study the Notch signalling pathway during external sensory organ development. We assigned putative loss-of-function phenotypes to 21.2% of the protein-coding Drosophila genes. Using secondary assays, we identified 6 new genes involved in asymmetric cell division and 23 novel genes regulating the Notch signalling pathway. By integrating our phenotypic results with protein interaction data, we constructed a genome-wide, functionally validated interaction network governing Notch signalling and asymmetric cell division. We used clustering algorithms to identify nuclear import pathways and the COP9 signallosome as Notch regulators. Our results show that complex developmental processes can be analysed on a genome-wide level and provide a unique resource for functional annotation of the Drosophila genome. PMID:19363474

  19. Genome-wide association study in Chinese identifies novel loci for blood pressure and hypertension.

    PubMed

    Lu, Xiangfeng; Wang, Laiyuan; Lin, Xu; Huang, Jianfeng; Charles Gu, C; He, Meian; Shen, Hongbing; He, Jiang; Zhu, Jingwen; Li, Huaixing; Hixson, James E; Wu, Tangchun; Dai, Juncheng; Lu, Ling; Shen, Chong; Chen, Shufeng; He, Lin; Mo, Zengnan; Hao, Yongchen; Mo, Xingbo; Yang, Xueli; Li, Jianxin; Cao, Jie; Chen, Jichun; Fan, Zhongjie; Li, Ying; Zhao, Liancheng; Li, Hongfan; Lu, Fanghong; Yao, Cailiang; Yu, Lin; Xu, Lihua; Mu, Jianjun; Wu, Xianping; Deng, Ying; Hu, Dongsheng; Zhang, Weidong; Ji, Xu; Guo, Dongshuang; Guo, Zhirong; Zhou, Zhengyuan; Yang, Zili; Wang, Renping; Yang, Jun; Zhou, Xiaoyang; Yan, Weili; Sun, Ningling; Gao, Pingjin; Gu, Dongfeng

    2015-02-01

    Hypertension is a common disorder and the leading risk factor for cardiovascular disease and premature deaths worldwide. Genome-wide association studies (GWASs) in the European population have identified multiple chromosomal regions associated with blood pressure, and the identified loci altogether explain only a small fraction of the variance for blood pressure. The differences in environmental exposures and genetic background between Chinese and European populations might suggest potential different pathways of blood pressure regulation. To identify novel genetic variants affecting blood pressure variation, we conducted a meta-analysis of GWASs of blood pressure and hypertension in 11 816 subjects followed by replication studies including 69 146 additional individuals. We identified genome-wide significant (P < 5.0 × 10(-8)) associations with blood pressure, which included variants at three new loci (CACNA1D, CYP21A2, and MED13L) and a newly discovered variant near SLC4A7. We also replicated 14 previously reported loci, 8 (CASZ1, MOV10, FGF5, CYP17A1, SOX6, ATP2B1, ALDH2, and JAG1) at genome-wide significance, and 6 (FIGN, ULK4, GUCY1A3, HFE, TBX3-TBX5, and TBX3) at a suggestive level of P = 1.81 × 10(-3) to 5.16 × 10(-8). These findings provide new mechanistic insights into the regulation of blood pressure and potential targets for treatments. PMID:25249183

  20. Meta-analysis of genome-wide association studies of attention deficit/hyperactivity disorder

    PubMed Central

    Neale, Benjamin M; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schäfer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J.L.; Langely, Kate; O’Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective Although twin and family studies have shown Attention Deficit/Hyperactivity Disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association scans (GWAS) have not yielded significant results, we conducted a meta-analysis of existing studies to boost statistical power. Method We used data from four projects: a) the Children’s Hospital of Philadelphia (CHOP), b) phase I of the International Multicenter ADHD Genetics project (IMAGE), c) phase II of IMAGE (IMAGE II), and d) the Pfizer funded study from the University of California, Los Angeles, Washington University and the Massachusetts General Hospital (PUWMa). The final sample size consisted of 2,064 trios, 896 cases and 2,455 controls. For each study, we imputed HapMap SNPs, computed association test statistics and transformed them to Z-scores, and then combined weighted Z-scores in a meta-analysis. Results No genome-wide significant associations were found, although an analysis of candidate genes suggests they may be involved in the disorder. Conclusions Given that ADHD is a highly heritable disorder, our negative results suggest that the effects of common ADHD risk variants must, individually, be very small or that other types of variants, e.g. rare ones, account for much of the disorder’s heritability. PMID:20732625

  1. Genetic architecture dissection by genome-wide association analysis reveals avian eggshell ultrastructure traits

    PubMed Central

    Duan, Zhongyi; Sun, Congjiao; Shen, ManMan; Wang, Kehua; Yang, Ning; Zheng, Jiangxia; Xu, Guiyun

    2016-01-01

    The ultrastructure of an eggshell is considered the major determinant of eggshell quality, which has biological and economic significance for the avian and poultry industries. However, the interrelationships and genome-wide architecture of eggshell ultrastructure remain to be elucidated. Herein, we measured eggshell thickness (EST), effective layer thickness (ET), mammillary layer thickness (MT), and mammillary density (MD) and conducted genome-wide association studies in 927 F2 hens. The SNP-based heritabilities of eggshell ultrastructure traits were estimated to be 0.39, 0.36, 0.17 and 0.19 for EST, ET, MT and MD, respectively, and a total of 719, 784, 1 and 10 genome-wide significant SNPs were associated with EST, ET, MT and MD, respectively. ABCC9, ITPR2, KCNJ8 and WNK1, which are involved in ion transport, were suggested to be the key genes regulating EST and ET. ITM2C and KNDC1 likely affect MT and MD, respectively. Additionally, there were linear relationships between the chromosome lengths and the variance explained per chromosome for EST (R2 = 0.57) and ET (R2 = 0.67). In conclusion, the interrelationships and genetic architecture of eggshell ultrastructure traits revealed in this study are valuable for our understanding of the avian eggshell and contribute to research on a variety of other calcified shells. PMID:27456605

  2. DamID-seq: Genome-wide Mapping of Protein-DNA Interactions by High Throughput Sequencing of Adenine-methylated DNA Fragments.

    PubMed

    Wu, Feinan; Olson, Brennan G; Yao, Jie

    2016-01-01

    The DNA adenine methyltransferase identification (DamID) assay is a powerful method to detect protein-DNA interactions both locally and genome-wide. It is an alternative approach to chromatin immunoprecipitation (ChIP). An expressed fusion protein consisting of the protein of interest and the E. coli DNA adenine methyltransferase can methylate the adenine base in GATC motifs near the sites of protein-DNA interactions. Adenine-methylated DNA fragments can then be specifically amplified and detected. The original DamID assay detects the genomic locations of methylated DNA fragments by hybridization to DNA microarrays, which is limited by the availability of microarrays and the density of predetermined probes. In this paper, we report the detailed protocol of integrating high throughput DNA sequencing into DamID (DamID-seq). The large number of short reads generated from DamID-seq enables detecting and localizing protein-DNA interactions genome-wide with high precision and sensitivity. We have used the DamID-seq assay to study genome-nuclear lamina (NL) interactions in mammalian cells, and have noticed that DamID-seq provides a high resolution and a wide dynamic range in detecting genome-NL interactions. The DamID-seq approach enables probing NL associations within gene structures and allows comparing genome-NL interaction maps with other functional genomic data, such as ChIP-seq and RNA-seq. PMID:26862720

  3. Population Stratification in the Context of Diverse Epidemiologic Surveys Sans Genome-Wide Data

    PubMed Central

    Oetjens, Matthew T.; Brown-Gentry, Kristin; Goodloe, Robert; Dilks, Holli H.; Crawford, Dana C.

    2016-01-01

    Population stratification or confounding by genetic ancestry is a potential cause of false associations in genetic association studies. Estimation of and adjustment for genetic ancestry has become common practice thanks in part to the availability of ancestry informative markers on genome-wide association study (GWAS) arrays. While array data is now widespread, these data are not ubiquitous as several large epidemiologic and clinic-based studies lack genome-wide data. One such large epidemiologic-based study lacking genome-wide data accessible to investigators is the National Health and Nutrition Examination Surveys (NHANES), population-based cross-sectional surveys of Americans linked to demographic, health, and lifestyle data conducted by the Centers for Disease Control and Prevention. DNA samples (n = 14,998) were extracted from biospecimens from consented NHANES participants between 1991–1994 (NHANES III, phase 2) and 1999–2002 and represent three major self-identified racial/ethnic groups: non-Hispanic whites (n = 6,634), non-Hispanic blacks (n = 3,458), and Mexican Americans (n = 3,950). We as the Epidemiologic Architecture for Genes Linked to Environment study genotyped candidate gene and GWAS-identified index variants in NHANES as part of the larger Population Architecture using Genomics and Epidemiology I study for collaborative genetic association studies. To enable basic quality control such as estimation of genetic ancestry to control for population stratification in NHANES san genome-wide data, we outline here strategies that use limited genetic data to identify the markers optimal for characterizing genetic ancestry. From among 411 and 295 autosomal SNPs available in NHANES III and NHANES 1999–2002, we demonstrate that markers with ancestry information can be identified to estimate global ancestry. Despite limited resolution, global genetic ancestry is highly correlated with self-identified race for the majority of participants, although less so

  4. A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism

    PubMed Central

    Loukola, Anu; Buchwald, Jadwiga; Gupta, Richa; Palviainen, Teemu; Hällfors, Jenni; Tikkanen, Emmi; Korhonen, Tellervo; Ollikainen, Miina; Sarin, Antti-Pekka; Ripatti, Samuli; Lehtimäki, Terho; Raitakari, Olli; Salomaa, Veikko; Rose, Richard J.; Tyndale, Rachel F.; Kaprio, Jaakko

    2015-01-01

    Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR), an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS) to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70–0.88) was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007), followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL) analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4), the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019) in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31%) in NMR in these study samples. Further, we provide evidence

  5. Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies

    PubMed Central

    2014-01-01

    Summary Background The epilepsies are a clinically heterogeneous group of neurological disorders. Despite strong evidence for heritability, genome-wide association studies have had little success in identification of risk loci associated with epilepsy, probably because of relatively small sample sizes and insufficient power. We aimed to identify risk loci through meta-analyses of genome-wide association studies for all epilepsy and the two largest clinical subtypes (genetic generalised epilepsy and focal epilepsy). Methods We combined genome-wide association data from 12 cohorts of individuals with epilepsy and controls from population-based datasets. Controls were ethnically matched with cases. We phenotyped individuals with epilepsy into categories of genetic generalised epilepsy, focal epilepsy, or unclassified epilepsy. After standardised filtering for quality control and imputation to account for different genotyping platforms across sites, investigators at each site conducted a linear mixed-model association analysis for each dataset. Combining summary statistics, we conducted fixed-effects meta-analyses of all epilepsy, focal epilepsy, and genetic generalised epilepsy. We set the genome-wide significance threshold at p<1·66 × 10−8. Findings We included 8696 cases and 26 157 controls in our analysis. Meta-analysis of the all-epilepsy cohort identified loci at 2q24.3 (p=8·71 × 10−10), implicating SCN1A, and at 4p15.1 (p=5·44 × 10−9), harbouring PCDH7, which encodes a protocadherin molecule not previously implicated in epilepsy. For the cohort of genetic generalised epilepsy, we noted a single signal at 2p16.1 (p=9·99 × 10−9), implicating VRK2 or FANCL. No single nucleotide polymorphism achieved genome-wide significance for focal epilepsy. Interpretation This meta-analysis describes a new locus not previously implicated in epilepsy and provides further evidence about the genetic architecture of these disorders, with the

  6. Genome wide chromatin occupancy of mrhl RNA and its role in gene regulation in mouse spermatogonial cells

    PubMed Central

    Akhade, Vijay Suresh; Arun, Gayatri; Donakonda, Sainitin; Satyanarayana Rao, Manchanahalli R

    2014-01-01

    Mrhl RNA is a nuclear lncRNA encoded in the mouse genome and negatively regulates Wnt signaling in spermatogonial cells through p68/Ddx5 RNA helicase. Mrhl RNA is present in the chromatin fraction of mouse spermatogonial Gc1-Spg cells and genome wide chromatin occupancy of mrhl RNA by ChOP (Chromatin oligo affinity precipitation) technique identified 1370 statistically significant genomic loci. Among these, genes at 37 genomic loci also showed altered expression pattern upon mrhl RNA down regulation which are referred to as GRPAM (Genes Regulated by Physical Association of Mrhl RNA). p68 interacted with mrhl RNA in chromatin at these GRPAM loci. p68 silencing drastically reduced mrhl RNA occupancy at 27 GRPAM loci and also perturbed the expression of GRPAM suggesting a role for p68 mediated mrhl RNA occupancy in regulating GRPAM expression. Wnt3a ligand treatment of Gc1-Spg cells down regulated mrhl RNA expression and also perturbed expression of these 27 GRPAM genes that included genes regulating Wnt signaling pathway and spermatogenesis, one of them being Sox8, a developmentally important transcription factor. We also identified interacting proteins of mrhl RNA associated chromatin fraction which included Pc4, a chromatin organizer protein and hnRNP A/B and hnRNP A2/B1 which have been shown to be associated with lincRNA-Cox2 function in gene regulation. Our findings in the Gc1-Spg cell line also correlate with the results from analysis of mouse testicular tissue which further highlights the in vivo physiological significance of mrhl RNA in the context of gene regulation during mammalian spermatogenesis. PMID:25584904

  7. Genome-Wide Localization Study of Yeast Pex11 Identifies Peroxisome–Mitochondria Interactions through the ERMES Complex

    PubMed Central

    Mattiazzi Ušaj, M.; Brložnik, M.; Kaferle, P.; Žitnik, M.; Wolinski, H.; Leitner, F.; Kohlwein, S.D.; Zupan, B.; Petrovič, U.

    2015-01-01

    Pex11 is a peroxin that regulates the number of peroxisomes in eukaryotic cells. Recently, it was found that a mutation in one of the three mammalian paralogs, PEX11β, results in a neurological disorder. The molecular function of Pex11, however, is not known. Saccharomyces cerevisiae Pex11 has been shown to recruit to peroxisomes the mitochondrial fission machinery, thus enabling proliferation of peroxisomes. This process is essential for efficient fatty acid β-oxidation. In this study, we used high-content microscopy on a genome-wide scale to determine the subcellular localization pattern of yeast Pex11 in all non-essential gene deletion mutants, as well as in temperature-sensitive essential gene mutants. Pex11 localization and morphology of peroxisomes was profoundly affected by mutations in 104 different genes that were functionally classified. A group of genes encompassing MDM10, MDM12 and MDM34 that encode the mitochondrial and cytosolic components of the ERMES complex was analyzed in greater detail. Deletion of these genes caused a specifically altered Pex11 localization pattern, whereas deletion of MMM1, the gene encoding the fourth, endoplasmic-reticulum-associated component of the complex, did not result in an altered Pex11 localization or peroxisome morphology phenotype. Moreover, we found that Pex11 and Mdm34 physically interact and that Pex11 plays a role in establishing the contact sites between peroxisomes and mitochondria through the ERMES complex. Based on these results, we propose that the mitochondrial/cytosolic components of the ERMES complex establish a direct interaction between mitochondria and peroxisomes through Pex11. PMID:25769804

  8. Genome-wide sequencing to identify the cause of hereditary cancer syndromes: with examples from familial pancreatic cancer

    PubMed Central

    Roberts, Nicholas J.; Klein, Alison P.

    2013-01-01

    Advances in our understanding of the human genome and next-generation technologies have facilitated the use of genome-wide sequencing to decipher the genetic basis of Mendelian disease and hereditary cancer syndromes. The application of genome-wide sequencing in hereditary cancer syndromes has had mixed success, in part, due to complex nature of the underlying genetic architecture. In this review we discuss the use of genome-wide sequencing in both Mendelian diseases and hereditary cancer syndromes, highlighting the potential and challenges of this approach using familial pancreatic cancer as an example. PMID:23196058

  9. No evidence that runs of homozygosity are associated with schizophrenia in an Irish genome-wide association dataset

    PubMed Central

    Heron, Elizabeth A.; Cormican, Paul; Donohoe, Gary; O’Neill, Francis A.; Kendler, Kenneth S.; Riley, Brien P.; Gill, Michael; Corvin, Aiden P.; Morris, Derek W.

    2014-01-01

    Runs of homozygosity (ROH), regions of the genome containing many consecutive homozygous SNPs, may represent two copies of a haplotype inherited from a common ancestor. A rare variant on this haplotype could thus be present in a homozygous and potentially recessive state. To detect rare risk variants for schizophrenia, we performed an ROH analysis in a homogeneous Irish genome wide association study (GWAS) dataset consisting of 1606 cases and 1794 controls. There was no genome-wide excess of ROH in cases compared to controls (p = 0.7986). No consensus ROH at individual loci showed association with schizophrenia after genome-wide correction. PMID:24560374

  10. Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries

    PubMed Central

    Baurley, James W.; Edlund, Christopher K.; Pardamean, Carissa I.; Conti, David V.; Krasnow, Ruth; Javitz, Harold S.; Hops, Hyman; Swan, Gary E.; Benowitz, Neal L.

    2016-01-01

    Introduction: Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3′-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Methods: Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. Results: African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). Conclusions: This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan

  11. Longitudinal Genome-Wide Association of Cardiovascular Disease Risk Factors in the Bogalusa Heart Study

    PubMed Central

    Smith, Erin N.; Chen, Wei; Kähönen, Mika; Kettunen, Johannes; Lehtimäki, Terho; Peltonen, Leena; Raitakari, Olli T.; Salem, Rany M.; Schork, Nicholas J.; Shaw, Marian; Srinivasan, Sathanur R.; Topol, Eric J.; Viikari, Jorma S.; Berenson, Gerald S.; Murray, Sarah S.

    2010-01-01

    Cardiovascular disease (CVD) is the leading cause of death worldwide. Recent genome-wide association (GWA) studies have pinpointed many loci associated with CVD risk factors in adults. It is unclear, however, if these loci predict trait levels at all ages, if they are associated with how a trait develops over time, or if they could be used to screen individuals who are pre-symptomatic to provide the opportunity for preventive measures before disease onset. We completed a genome-wide association study on participants in the longitudinal Bogalusa Heart Study (BHS) and have characterized the association between genetic factors and the development of CVD risk factors from childhood to adulthood. We report 7 genome-wide significant associations involving CVD risk factors, two of which have been previously reported. Top regions were tested for replication in the Young Finns Study (YF) and two associations strongly replicated: rs247616 in CETP with HDL levels (combined P = 9.7×10−24), and rs445925 at APOE with LDL levels (combined P = 8.7×10−19). We show that SNPs previously identified in adult cross-sectional studies tend to show age-independent effects in the BHS with effect sizes consistent with previous reports. Previously identified variants were associated with adult trait levels above and beyond those seen in childhood; however, variants with time-dependent effects were also promising predictors. This is the first GWA study to evaluate the role of common genetic variants in the development of CVD risk factors in children as they advance through adulthood and highlights the utility of using longitudinal studies to identify genetic predictors of adult traits in children. PMID:20838585

  12. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity

    PubMed Central

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick

    2013-01-01

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985

  13. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    PubMed Central

    Bendall, Matthew L; Stevens, Sarah LR; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G; Bertilsson, Stefan; Moran, Mary A; Shade, Ashley; Newton, Ryan J; McMahon, Katherine D; Malmstrom, Rex R

    2016-01-01

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Here, from a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. These patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model' of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment. PMID:26744812

  14. Genome-wide landscape of liver X receptor chromatin binding and gene regulation in human macrophages

    PubMed Central

    2012-01-01

    Background The liver X receptors (LXRs) are oxysterol sensing nuclear receptors with multiple effects on metabolism and immune cells. However, the complete genome-wide cistrome of LXR in cells of human origin has not yet been provided. Results We performed ChIP-seq in phorbol myristate acetate-differentiated THP-1 cells (macrophage-type) after stimulation with the potent synthetic LXR ligand T0901317 (T09). Microarray gene expression analysis was performed in the same cellular model. We identified 1357 genome-wide LXR locations (FDR < 1%), of which 526 were observed after T09 treatment. De novo analysis of LXR binding sequences identified a DR4-type element as the major motif. On mRNA level T09 up-regulated 1258 genes and repressed 455 genes. Our results show that LXR actions are focused on 112 genomic regions that contain up to 11 T09 target genes per region under the control of highly stringent LXR binding sites with individual constellations for each region. We could confirm that LXR controls lipid metabolism and transport and observed a strong association with apoptosis-related functions. Conclusions This first report on genome-wide binding of LXR in a human cell line provides new insights into the transcriptional network of LXR and its target genes with their link to physiological processes, such as apoptosis. The gene expression microarray and sequence data have been submitted collectively to the NCBI Gene Expression Omnibus http://www.ncbi.nlm.nih.gov/geo under accession number GSE28319. PMID:22292898

  15. Genome-wide association analysis of red blood cell traits in African Americans: the COGENT Network

    PubMed Central

    Chen, Zhao; Tang, Hua; Qayyum, Rehan; Schick, Ursula M.; Nalls, Michael A.; Handsaker, Robert; Li, Jin; Lu, Yingchang; Yanek, Lisa R.; Keating, Brendan; Meng, Yan; van Rooij, Frank J.A.; Okada, Yukinori; Kubo, Michiaki; Rasmussen-Torvik, Laura; Keller, Margaux F.; Lange, Leslie; Evans, Michele; Bottinger, Erwin P.; Linderman, Michael D.; Ruderfer, Douglas M.; Hakonarson, Hakon; Papanicolaou, George; Zonderman, Alan B.; Gottesman, Omri; Thomson, Cynthia; Ziv, Elad; Singleton, Andrew B.; Loos, Ruth J.F.; Sleiman, Patrick M.A.; Ganesh, Santhi; McCarroll, Steven; Becker, Diane M.; Wilson, James G.; Lettre, Guillaume; Reiner, Alexander P.

    2013-01-01

    Laboratory red blood cell (RBC) measurements are clinically important, heritable and differ among ethnic groups. To identify genetic variants that contribute to RBC phenotypes in African Americans (AAs), we conducted a genome-wide association study in up to ∼16 500 AAs. The alpha-globin locus on chromosome 16pter [lead SNP rs13335629 in ITFG3 gene; P < 1E−13 for hemoglobin (Hgb), RBC count, mean corpuscular volume (MCV), MCH and MCHC] and the G6PD locus on Xq28 [lead SNP rs1050828; P < 1E − 13 for Hgb, hematocrit (Hct), MCV, RBC count and red cell distribution width (RDW)] were each associated with multiple RBC traits. At the alpha-globin region, both the common African 3.7 kb deletion and common single nucleotide polymorphisms (SNPs) appear to contribute independently to RBC phenotypes among AAs. In the 2p21 region, we identified a novel variant of PRKCE distinctly associated with Hct in AAs. In a genome-wide admixture mapping scan, local European ancestry at the 6p22 region containing HFE and LRRC16A was associated with higher Hgb. LRRC16A has been previously associated with the platelet count and mean platelet volume in AAs, but not with Hgb. Finally, we extended to AAs the findings of association of erythrocyte traits with several loci previously reported in Europeans and/or Asians, including CD164 and HBS1L-MYB. In summary, this large-scale genome-wide analysis in AAs has extended the importance of several RBC-associated genetic loci to AAs and identified allelic heterogeneity and pleiotropy at several previously known genetic loci associated with blood cell traits in AAs. PMID:23446634

  16. Genome-wide assessment of Parkinson's disease in a Southern Spanish population.

    PubMed

    Bandrés-Ciga, Sara; Price, Timothy Ryan; Barrero, Francisco Javier; Escamilla-Sevilla, Francisco; Pelegrina, Javier; Arepalli, Sampath; Hernández, Dena; Gutiérrez, Blanca; Cervilla, Jorge; Rivera, Margarita; Rivera, Alberto; Ding, Jing-Hui; Vives, Francisco; Nalls, Michael; Singleton, Andrew; Durán, Raquel

    2016-09-01

    Here, we set out to study the genetic architecture of Parkinson's disease (PD) through a Genome-Wide Association Study in a Southern Spanish population. About 240 PD cases and 192 controls were genotyped on the NeuroX array. We estimated genetic variation associated with PD risk and age at onset (AAO). Risk profile analyses for PD and AAO were performed using a weighted genetic risk score. Total heritability was estimated by genome-wide complex trait analysis. Rare variants were screened with single-variant and burden tests. We also screened for variation in known PD genes. Finally, we explored runs of homozygosity and structural genomic variations. We replicate PD association (uncorrected p-value < 0.05) at the following loci: ACMSD/TMEM163, MAPT, STK39, MIR4697, and SREBF/RAI1. Subjects in the highest genetic risk score quintile showed significantly increased risk of PD versus the lowest quintile (odds ratio = 3.6, p-value < 4e(-7)), but no significant difference in AAO. We found evidence of runs of homozygosity in 2 PD-associated regions: one intersecting the HLA-DQB1 gene in 6 patients and 1 control; and another intersecting the GBA-SYT11 gene in PD case. The GBA N370S and the LRRK2 G2019S variants were found in 8 and 7 cases, respectively, replicating previous work. A structural variant was found in 1 case in the PARK2 gene locus. This current work represents a comprehensive assessment at a genome-wide level characterizing a novel population in PD genetics. PMID:27393345

  17. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology

    PubMed Central

    Raffensperger, Zachary D.; Heike, Carrie L.; Cunningham, Michael L.; Hecht, Jacqueline T.; Kau, Chung How; Moreno, Lina M.; Wehby, George L.; Murray, Jeffrey C.; Laurie, Cecelia A.; Laurie, Cathy C.; Santorico, Stephanie; Klein, Ophir; Feingold, Eleanor; Hallgrimsson, Benedikt; Spritz, Richard A.; Marazita, Mary L.; Weinberg, Seth M.

    2016-01-01

    Numerous lines of evidence point to a genetic basis for facial morphology in humans, yet little is known about how specific genetic variants relate to the phenotypic expression of many common facial features. We conducted genome-wide association meta-analyses of 20 quantitative facial measurements derived from the 3D surface images of 3118 healthy individuals of European ancestry belonging to two US cohorts. Analyses were performed on just under one million genotyped SNPs (Illumina OmniExpress+Exome v1.2 array) imputed to the 1000 Genomes reference panel (Phase 3). We observed genome-wide significant associations (p < 5 x 10−8) for cranial base width at 14q21.1 and 20q12, intercanthal width at 1p13.3 and Xq13.2, nasal width at 20p11.22, nasal ala length at 14q11.2, and upper facial depth at 11q22.1. Several genes in the associated regions are known to play roles in craniofacial development or in syndromes affecting the face: MAFB, PAX9, MIPOL1, ALX3, HDAC8, and PAX1. We also tested genotype-phenotype associations reported in two previous genome-wide studies and found evidence of replication for nasal ala length and SNPs in CACNA2D3 and PRDM16. These results provide further evidence that common variants in regions harboring genes of known craniofacial function contribute to normal variation in human facial features. Improved understanding of the genes associated with facial morphology in healthy individuals can provide insights into the pathways and mechanisms controlling normal and abnormal facial morphogenesis. PMID:27560520

  18. Genome-wide analyses of aggressiveness in attention-deficit hyperactivity disorder.

    PubMed

    Brevik, Erlend J; van Donkelaar, Marjolein M J; Weber, Heike; Sánchez-Mora, Cristina; Jacob, Christian; Rivero, Olga; Kittel-Schneider, Sarah; Garcia-Martínez, Iris; Aebi, Marcel; van Hulzen, Kimm; Cormand, Bru; Ramos-Quiroga, Josep A; Lesch, Klaus-Peter; Reif, Andreas; Ribasés, Marta; Franke, Barbara; Posserud, Maj-Britt; Johansson, Stefan; Lundervold, Astri J; Haavik, Jan; Zayats, Tetyana

    2016-07-01

    Aggressiveness is a behavioral trait that has the potential to be harmful to individuals and society. With an estimated heritability of about 40%, genetics is important in its development. We performed an exploratory genome-wide association (GWA) analysis of childhood aggressiveness in attention deficit hyperactivity disorder (ADHD) to gain insight into the underlying biological processes associated with this trait. Our primary sample consisted of 1,060 adult ADHD patients (aADHD). To further explore the genetic architecture of childhood aggressiveness, we performed enrichment analyses of suggestive genome-wide associations observed in aADHD among GWA signals of dimensions of oppositionality (defiant/vindictive and irritable dimensions) in childhood ADHD (cADHD). No single polymorphism reached genome-wide significance (P < 5.00E-08). The strongest signal in aADHD was observed at rs10826548, within a long noncoding RNA gene (beta = -1.66, standard error (SE) = 0.34, P = 1.07E-06), closely followed by rs35974940 in the neurotrimin gene (beta = 3.23, SE = 0.67, P = 1.26E-06). The top GWA SNPs observed in aADHD showed significant enrichment of signals from both the defiant/vindictive dimension (Fisher's P-value = 2.28E-06) and the irritable dimension in cADHD (Fisher's P-value = 0.0061). In sum, our results identify a number of biologically interesting markers possibly underlying childhood aggressiveness and provide targets for further genetic exploration of aggressiveness across psychiatric disorders. © 2016 The Authors. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics Published by Wiley Periodicals, Inc. PMID:27021288

  19. Using genome-wide complex trait analysis to quantify ‘missing heritability’ in Parkinson's disease

    PubMed Central

    Keller, Margaux F.; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J. Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L.; Guerreiro, Rita; Hernandez, Dena G.; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R.; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W.; Hardy, John; Martinez, Maria; Singleton, Andrew B.; Nalls, Michael A.

    2012-01-01

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17–38, P = 8.08E − 08) phenotypic variance associated with all types of PD, 15% (95% CI −0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17–44, P = 1.34E − 05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great deal of common variants of small effect remain to be discovered. PMID:22892372

  20. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    PubMed Central

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2013-01-01

    Objective . Genes likely play a substantial role in the etiology of attention-deficit hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies have not identified a genome-wide significant association. We have conducted a third, independent multi-site GWAS of DSM-IV-TR ADHD. Method . Families were ascertained at Massachusetts General Hospital (MGH, N=309 trios), Washington University at St Louis (WASH-U, N=272 trios), and University of California at Los Angeles (UCLA, N=156 trios). Genotyping was conducted with the Illumina Human1M or Human1M-Duo BeadChip platforms. After applying quality control filters, association with ADHD was tested with 835,136 SNPs in 735 DSM-IV ADHD trios from 732 families. Results . Our smallest p-value (6.7E-07) did not reach the threshold for genome-wide statistical significance (5.0E-08) but one of the 20 most significant associations was located in a candidate gene of interest for ADHD, (SLC9A9, rs9810857, p=6.4E-6). We also conducted gene-based tests of candidate genes identified in the literature and found additional evidence of association with SLC9A9. Conclusion . We and our colleagues in the Psychiatric GWAS Consortium are working to pool together GWAS samples to establish the large data sets needed to follow-up on these results and to identify genes for ADHD and other disorders. PMID:20732626

  1. Genome-wide evidence for speciation with gene flow in Heliconius butterflies

    PubMed Central

    Martin, Simon H.; Dasmahapatra, Kanchon K.; Nadeau, Nicola J.; Salazar, Camilo; Walters, James R.; Simpson, Fraser; Blaxter, Mark; Manica, Andrea; Mallet, James; Jiggins, Chris D.

    2013-01-01

    Most speciation events probably occur gradually, without complete and immediate reproductive isolation, but the full extent of gene flow between diverging species has rarely been characterized on a genome-wide scale. Documenting the extent and timing of admixture between diverging species can clarify the role of geographic isolation in speciation. Here we use new methodology to quantify admixture at different stages of divergence in Heliconius butterflies, based on whole-genome sequences of 31 individuals. Comparisons between sympatric and allopatric populations of H. melpomene, H. cydno, and H. timareta revealed a genome-wide trend of increased shared variation in sympatry, indicative of pervasive interspecific gene flow. Up to 40% of 100-kb genomic windows clustered by geography rather than by species, demonstrating that a very substantial fraction of the genome has been shared between sympatric species. Analyses of genetic variation shared over different time intervals suggested that admixture between these species has continued since early in speciation. Alleles shared between species during recent time intervals displayed higher levels of linkage disequilibrium than those shared over longer time intervals, suggesting that this admixture took place at multiple points during divergence and is probably ongoing. The signal of admixture was significantly reduced around loci controlling divergent wing patterns, as well as throughout the Z chromosome, consistent with strong selection for Müllerian mimicry and with known Z-linked hybrid incompatibility. Overall these results show that species divergence can occur in the face of persistent and genome-wide admixture over long periods of time. PMID:24045163

  2. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations.

    PubMed

    Bendall, Matthew L; Stevens, Sarah Lr; Chan, Leong-Keat; Malfatti, Stephanie; Schwientek, Patrick; Tremblay, Julien; Schackwitz, Wendy; Martin, Joel; Pati, Amrita; Bushnell, Brian; Froula, Jeff; Kang, Dongwan; Tringe, Susannah G; Bertilsson, Stefan; Moran, Mary A; Shade, Ashley; Newton, Ryan J; McMahon, Katherine D; Malmstrom, Rex R

    2016-07-01

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Here, from a 9-year metagenomic study of a freshwater lake (2005-2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. These patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the 'ecotype model' of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment. PMID:26744812

  3. Genome-Wide Association Study in Immunocompetent Patients with Delayed Hypersensitivity to Sulfonamide Antimicrobials

    PubMed Central

    Motsinger-Reif, Alison; Dickey, Allison; Yale, Steven; Trepanier, Lauren A.

    2016-01-01

    Background Hypersensitivity (HS) reactions to sulfonamide antibiotics occur uncommonly, but with potentially severe clinical manifestations. A familial predisposition to sulfonamide HS is suspected, but robust predictive genetic risk factors have yet to be identified. Strongly linked genetic polymorphisms have been used clinically as screening tests for other HS reactions prior to administration of high-risk drugs. Objective The purpose of this study was to evaluate for genetic risk of sulfonamide HS in the immunocompetent population using genome-wide association. Methods Ninety-one patients with symptoms after trimethoprim-sulfamethoxazole (TMP-SMX) attributable to “probable” drug HS based on medical record review and the Naranjo Adverse Drug Reaction Probability Scale, and 184 age- and sex-matched patients who tolerated a therapeutic course of TMP-SMX, were included in a genome-wide association study using both common and rare variant techniques. Additionally, two subgroups of HS patients with a more refined clinical phenotype (fever and rash; or fever, rash and eosinophilia) were evaluated separately. Results For the full dataset, no single nucleotide polymorphisms were suggestive of or reached genome-wide significance in the common variant analysis, nor was any genetic locus significant in the rare variant analysis. A single, possible gene locus association (COL12A1) was identified in the rare variant analysis for patients with both fever and rash, but the sample size was very small in this subgroup (n = 16), and this may be a false positive finding. No other significant associations were found for the subgroups. Conclusions No convincing genetic risk factors for sulfonamide HS were identified in this population. These negative findings may be due to challenges in accurately confirming the phenotype in exanthematous drug eruptions, or to unidentified gene-environment interactions influencing sulfonamide HS. PMID:27272151

  4. Genome-wide association of anthropometric traits in African- and African-derived populations.

    PubMed

    Kang, Sun J; Chiang, Charleston W K; Palmer, Cameron D; Tayo, Bamidele O; Lettre, Guillaume; Butler, Johannah L; Hackett, Rachel; Adeyemo, Adebowale A; Guiducci, Candace; Berzins, Ilze; Nguyen, Thutrang T; Feng, Tao; Luke, Amy; Shriner, Daniel; Ardlie, Kristin; Rotimi, Charles; Wilks, Rainford; Forrester, Terrence; McKenzie, Colin A; Lyon, Helen N; Cooper, Richard S; Zhu, Xiaofeng; Hirschhorn, Joel N

    2010-07-01

    Genome-wide association (GWA) studies have identified common variants that are associated with a variety of traits and diseases, but most studies have been performed in European-derived populations. Here, we describe the first genome-wide analyses of imputed genotype and copy number variants (CNVs) for anthropometric measures in African-derived populations: 1188 Nigerians from Igbo-Ora and Ibadan, Nigeria, and 743 African-Americans from Maywood, IL. To improve the reach of our study, we used imputation to estimate genotypes at approximately 2.1 million single-nucleotide polymorphisms (SNPs) and also tested CNVs for association. No SNPs or common CNVs reached a genome-wide significance level for association with height or body mass index (BMI), and the best signals from a meta-analysis of the two cohorts did not replicate in approximately 3700 African-Americans and Jamaicans. However, several loci previously confirmed in European populations showed evidence of replication in our GWA panel of African-derived populations, including variants near IHH and DLEU7 for height and MC4R for BMI. Analysis of global burden of rare CNVs suggested that lean individuals possess greater total burden of CNVs, but this finding was not supported in an independent European population. Our results suggest that there are not multiple loci with strong effects on anthropometric traits in African-derived populations and that sample sizes comparable to those needed in European GWA studies will be required to identify replicable associations. Meta-analysis of this data set with additional studies in African-ancestry populations will be helpful to improve power to detect novel associations. PMID:20400458

  5. Disruptive selection without genome-wide evolution across a migratory divide.

    PubMed

    von Rönn, Jan A C; Shafer, Aaron B A; Wolf, Jochen B W

    2016-06-01

    Transcontinental migration is a fascinating example of how animals can respond to climatic oscillation. Yet, quantitative data on fitness components are scarce, and the resulting population genetic consequences are poorly understood. Migratory divides, hybrid zones with a transition in migratory behaviour, provide a natural setting to investigate the micro-evolutionary dynamics induced by migration under sympatric conditions. Here, we studied the effects of migratory programme on survival, trait evolution and genome-wide patterns of population differentiation in a migratory divide of European barn swallows. We sampled a total of 824 individuals from both allopatric European populations wintering in central and southern Africa, respectively, along with two mixed populations from within the migratory divide. While most morphological characters varied by latitude consistent with Bergmann's rule, wing length co-varied with distance to wintering grounds. Survival data collected during a 5-year period provided strong evidence that this covariance is repeatedly generated by disruptive selection against intermediate phenotypes. Yet, selection-induced divergence did not translate into genome-wide genetic differentiation as assessed by microsatellites, mtDNA and >20 000 genome-wide SNP markers; nor did we find evidence of local genomic selection between migratory types. Among breeding populations, a single outlier locus mapped to the BUB1 gene with a role in mitotic and meiotic organization. Overall, this study provides evidence for an adaptive response to variation in migration behaviour continuously eroded by gene flow under current conditions of nonassortative mating. It supports the theoretical prediction that population differentiation is difficult to achieve under conditions of gene flow despite measurable disruptive selection. PMID:26749140

  6. Ion Torrent sequencing for conducting genome-wide scans for mutation mapping analysis.

    PubMed

    Damerla, Rama Rao; Chatterjee, Bishwanath; Li, You; Francis, Richard J B; Fatakia, Sarosh N; Lo, Cecilia W

    2014-04-01

    Mutation mapping in mice can be readily accomplished by genome wide segregation analysis of polymorphic DNA markers. In this study, we showed the efficacy of Ion Torrent next generation sequencing for conducting genome-wide scans to map and identify a mutation causing congenital heart disease in a mouse mutant, Bishu, recovered from a mouse mutagenesis screen. The Bishu mutant line generated in a C57BL/6J (B6) background was intercrossed with another inbred strain, C57BL/10J (B10), and the resulting B6/B10 hybrid offspring were intercrossed to generate mutants used for the mapping analysis. For each mutant sample, a panel of 123 B6/B10 polymorphic SNPs distributed throughout the mouse genome was PCR amplified, bar coded, and then pooled to generate a single library used for Ion Torrent sequencing. Sequencing carried out using the 314 chip yielded >600,000 usable reads. These were aligned and mapped using a custom bioinformatics pipeline. Each SNP was sequenced to a depth >500×, allowing accurate automated calling of the B6/B10 genotypes. This analysis mapped the mutation in Bishu to an interval on the proximal region of mouse chromosome 4. This was confirmed by parallel capillary sequencing of the 123 polymorphic SNPs. Further analysis of genes in the map interval identified a splicing mutation in Dnaic1(c.204+1G>A), an intermediate chain dynein, as the disease causing mutation in Bishu. Overall, our experience shows Ion Torrent amplicon sequencing is high throughput and cost effective for conducting genome-wide mapping analysis and is easily scalable for other high volume genotyping analyses. PMID:24306492

  7. FVGWAS: Fast voxelwise genome wide association analysis of large-scale imaging genetic data.

    PubMed

    Huang, Meiyan; Nichols, Thomas; Huang, Chao; Yu, Yang; Lu, Zhaohua; Knickmeyer, Rebecca C; Feng, Qianjin; Zhu, Hongtu

    2015-09-01

    More and more large-scale imaging genetic studies are being widely conducted to collect a rich set of imaging, genetic, and clinical data to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. Several major big-data challenges arise from testing genome-wide (NC>12 million known variants) associations with signals at millions of locations (NV~10(6)) in the brain from thousands of subjects (n~10(3)). The aim of this paper is to develop a Fast Voxelwise Genome Wide Association analysiS (FVGWAS) framework to efficiently carry out whole-genome analyses of whole-brain data. FVGWAS consists of three components including a heteroscedastic linear model, a global sure independence screening (GSIS) procedure, and a detection procedure based on wild bootstrap methods. Specifically, for standard linear association, the computational complexity is O (nNVNC) for voxelwise genome wide association analysis (VGWAS) method compared with O ((NC+NV)n(2)) for FVGWAS. Simulation studies show that FVGWAS is an efficient method of searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. Finally, we have successfully applied FVGWAS to a large-scale imaging genetic data analysis of ADNI data with 708 subjects, 193,275voxels in RAVENS maps, and 501,584 SNPs, and the total processing time was 203,645s for a single CPU. Our FVGWAS may be a valuable statistical toolbox for large-scale imaging genetic analysis as the field is rapidly advancing with ultra-high-resolution imaging and whole-genome sequencing. PMID:26025292

  8. Genetic Structure of the Han Chinese Population Revealed by Genome-wide SNP Variation

    PubMed Central

    Chen, Jieming; Zheng, Houfeng; Bei, Jin-Xin; Sun, Liangdan; Jia, Wei-hua; Li, Tao; Zhang, Furen; Seielstad, Mark; Zeng, Yi-Xin; Zhang, Xuejun; Liu, Jianjun

    2009-01-01

    Population stratification is a potential problem for genome-wide association studies (GWAS), confounding results and causing spurious associations. Hence, understanding how allele frequencies vary across geographic regions or among subpopulations is an important prelude to analyzing GWAS data. Using over 350,000 genome-wide autosomal SNPs in over 6000 Han Chinese samples from ten provinces of China, our study revealed a one-dimensional “north-south” population structure and a close correlation between geography and the genetic structure of the Han Chinese. The north-south population structure is consistent with the historical migration pattern of the Han Chinese population. Metropolitan cities in China were, however, more diffused “outliers,” probably because of the impact of modern migration of peoples. At a very local scale within the Guangdong province, we observed evidence of population structure among dialect groups, probably on account of endogamy within these dialects. Via simulation, we show that empirical levels of population structure observed across modern China can cause spurious associations in GWAS if not properly handled. In the Han Chinese, geographic matching is a good proxy for genetic matching, particularly in validation and candidate-gene studies in which population stratification cannot be directly accessed and accounted for because of the lack of genome-wide data, with the exception of the metropolitan cities, where geographical location is no longer a good indicator of ancestral origin. Our findings are important for designing GWAS in the Chinese population, an activity that is expected to intensify greatly in the near future. PMID:19944401

  9. Siblings with Ischemic Stroke Study (SWISS): Results of a Genome-wide Scan for Stroke Loci

    PubMed Central

    Meschia, James F.; Nalls, Michael; Matarin, Mar; Brott, Thomas G.; Brown, Robert D.; Hardy, John; Kissela, Brett; Rich, Stephen S.; Singleton, Andrew; Hernandez, Dena; Ferrucci, Luigi; Pearce, Kerra; Keller, Margaret; Worrall, Bradford B.

    2011-01-01

    Background and Purpose Ischemic stroke has a strong familial component to risk. The Siblings with Ischemic Stroke Study (SWISS) is a genome-wide family-based analysis that included use of imputed genotypes. SWISS was conducted to examine associations between SNPs and risk of stroke and stroke subtypes within pairs. Methods SWISS enrolled 312 probands with ischemic stroke across 70 US and Canadian centers. Affected siblings were ascertained by centers and confirmed by central record review; unaffected siblings were ascertained by telephone contact. Ischemic stroke was subtyped using TOAST criteria. Genotyping was performed using an Illumina 610 quad array (probands) and an Illumina linkage V array (affected siblings). SNPs were imputed using 1000 Genomes Project data and MACH software. Family-based association analyses were conducted using the sibling-transmission disequilibrium test. Results For all pairs, the correlation of age at stroke within pairs of affected siblings was r = 0.83 (95%CI, 0.78 to 0.86; P < 2.2×10−16). The correlation did not differ substantially by subtype. The concordance of stroke subtypes among affected pairs was 33.8% (kappa = 0.13; P = 5.06×10−4) and did not differ by age at stroke in the proband. Although no SNP achieved genome-wide significance for risk of ischemic stroke, there was clustering of the most associated SNPs on chromosomes 3p (NOS1) and 6p. Conclusions Stroke subtype and age at stroke in affected sibling pairs exhibit significant clustering. No individual SNP reached genome-wide significance. However, two promising candidate loci were identified, including one that contains NOS1, though these risk loci warrant further examination in larger sample collections. PMID:21940970

  10. Genome-wide patterns of promoter sharing and co-expression in bovine skeletal muscle

    PubMed Central

    2011-01-01

    Background Gene regulation by transcription factors (TF) is species, tissue and time specific. To better understand how the genetic code controls gene expression in bovine muscle we associated gene expression data from developing Longissimus thoracis et lumborum skeletal muscle with bovine promoter sequence information. Results We created a highly conserved genome-wide promoter landscape comprising 87,408 interactions relating 333 TFs with their 9,242 predicted target genes (TGs). We discovered that the complete set of predicted TGs share an average of 2.75 predicted TF binding sites (TFBSs) and that the average co-expression between a TF and its predicted TGs is higher than the average co-expression between the same TF and all genes. Conversely, pairs of TFs sharing predicted TGs showed a co-expression correlation higher that pairs of TFs not sharing TGs. Finally, we exploited the co-occurrence of predicted TFBS in the context of muscle-derived functionally-coherent modules including cell cycle, mitochondria, immune system, fat metabolism, muscle/glycolysis, and ribosome. Our findings enabled us to reverse engineer a regulatory network of core processes, and correctly identified the involvement of E2F1, GATA2 and NFKB1 in the regulation of cell cycle, fat, and muscle/glycolysis, respectively. Conclusion The pivotal implication of our research is two-fold: (1) there exists a robust genome-wide expression signal between TFs and their predicted TGs in cattle muscle consistent with the extent of promoter sharing; and (2) this signal can be exploited to recover the cellular mechanisms underpinning transcription regulation of muscle structure and development in bovine. Our study represents the first genome-wide report linking tissue specific co-expression to co-regulation in a non-model vertebrate. PMID:21226902

  11. Genome Wide Association for Addiction: Replicated Results and Comparisons of Two Analytic Approaches

    PubMed Central

    Drgon, Tomas; Zhang, Ping-Wu; Johnson, Catherine; Walther, Donna; Hess, Judith; Nino, Michelle; Uhl, George R.

    2010-01-01

    Background Vulnerabilities to dependence on addictive substances are substantially heritable complex disorders whose underlying genetic architecture is likely to be polygenic, with modest contributions from variants in many individual genes. “Nontemplate” genome wide association (GWA) approaches can identity groups of chromosomal regions and genes that, taken together, are much more likely to contain allelic variants that alter vulnerability to substance dependence than expected by chance. Methodology/Principal Findings We report pooled “nontemplate” genome-wide association studies of two independent samples of substance dependent vs control research volunteers (n = 1620), one European-American and the other African-American using 1 million SNP (single nucleotide polymorphism) Affymetrix genotyping arrays. We assess convergence between results from these two samples using two related methods that seek clustering of nominally-positive results and assess significance levels with Monte Carlo and permutation approaches. Both “converge then cluster” and “cluster then converge” analyses document convergence between the results obtained from these two independent datasets in ways that are virtually never found by chance. The genes identified in this fashion are also identified by individually-genotyped dbGAP data that compare allele frequencies in cocaine dependent vs control individuals. Conclusions/Significance These overlapping results identify small chromosomal regions that are also identified by genome wide data from studies of other relevant samples to extents much greater than chance. These chromosomal regions contain more genes related to “cell adhesion” processes than expected by chance. They also contain a number of genes that encode potential targets for anti-addiction pharmacotherapeutics. “Nontemplate” GWA approaches that seek chromosomal regions in which nominally-positive associations are found in multiple independent samples are

  12. Genome-wide pharmacogenomic study of citalopram-induced side effects in STAR*D.

    PubMed

    Adkins, D E; Clark, S L; Åberg, K; Hettema, J M; Bukszár, J; McClay, J L; Souza, R P; van den Oord, E J C G

    2012-01-01

    Affecting about 1 in 12 Americans annually, depression is a leading cause of the global disease burden. While a range of effective antidepressants are now available, failure and relapse rates remain substantial, with intolerable side effect burden the most commonly cited reason for discontinuation. Thus, understanding individual differences in susceptibility to antidepressant therapy side effects will be essential to optimize depression treatment. Here we perform genome-wide association studies (GWAS) to identify genetic variation influencing susceptibility to citalopram-induced side effects. The analysis sample consisted of 1762 depression patients, successfully genotyped for 421K single-nucleotide polymorphisms (SNPs), from the Sequenced Treatment Alternatives to Relieve Depression (STAR(*)D) study. Outcomes included five indicators of citalopram side effects: general side effect burden, overall tolerability, sexual side effects, dizziness and vision/hearing side effects. Two SNPs met our genome-wide significance criterion (q<0.1), ensuring that, on average, only 10% of significant findings are false discoveries. In total, 12 additional SNPs demonstrated suggestive associations (q<0.5). The top finding was rs17135437, an intronic SNP within EMID2, mediating the effects of citalopram on vision/hearing side effects (P=3.27 × 10(-8), q=0.026). The second genome-wide significant finding, representing a haplotype spanning ∼30 kb and eight genotyped SNPs in a gene desert on chromosome 13, was associated with general side effect burden (P=3.22 × 10(-7), q=0.096). Suggestive findings were also found for SNPs at LAMA1, AOX2P, EGFLAM, FHIT and RTP2. Although our findings require replication and functional validation, this study demonstrates the potential of GWAS to discover genes and pathways that potentially mediate adverse effects of antidepressant medications. PMID:22760553

  13. Genome-Wide Analysis of miRNA-mRNA Interactions in Marrow Stromal Cells

    PubMed Central

    Balakrishnan, Ilango; Yang, Xiaodong; Brown, Joseph; Ramakrishnan, Aravind; Torok–Storb, Beverly; Kabos, Peter; Hesselberth, Jay R.; Pillai, Manoj M.

    2014-01-01

    Regulation of hematopoietic stem cell proliferation, lineage commitment, and differentiation in adult vertebrates requires extrinsic signals provided by cells in the marrow microenvironment (ME) located within the bone marrow. Both secreted and cell-surface bound factors critical to this regulation have been identified, yet control of their expression by cells within the ME has not been addressed. Herein we hypothesize that microRNAs (miRNAs) contribute to their controlled expression. MiRNAs are small noncoding RNAs that bind to target mRNAs and downregulate gene expression by either initiating mRNA degradation or preventing peptide translation. Testing the role of miRNAs in downregulating gene expression has been difficult since conventional techniques used to define miRNA-mRNA interactions are indirect and have high false-positive and negative rates. In this report, a genome-wide biochemical technique (high-throughput sequencing of RNA isolated by cross-linking immunoprecipitation or HITS-CLIP) was used to generate unbiased genome-wide maps of miRNA-mRNA interactions in two critical cellular components of the marrow ME: marrow stromal cells and bone marrow endothelial cells. Analysis of these datasets identified miRNAs as direct regulators of JAG1, WNT5A, MMP2, and VEGFA; four factors that are important to ME function. Our results show the feasibility and utility of unbiased genome-wide biochemical techniques in dissecting the role of miRNAs in regulation of complex tissues such as the marrow ME. PMID:24038734

  14. Common genetic variation and susceptibility to partial epilepsies: a genome-wide association study

    PubMed Central

    Kasperavičiūtė, Dalia; Catarino, Claudia B.; Heinzen, Erin L.; Depondt, Chantal; Cavalleri, Gianpiero L.; Caboclo, Luis O.; Tate, Sarah K.; Jamnadas-Khoda, Jenny; Chinthapalli, Krishna; Clayton, Lisa M.S.; Shianna, Kevin V.; Radtke, Rodney A.; Mikati, Mohamad A.; Gallentine, William B.; Husain, Aatif M.; Alhusaini, Saud; Leppert, David; Middleton, Lefkos T.; Gibson, Rachel A.; Johnson, Michael R.; Matthews, Paul M.; Hosford, David; Heuser, Kjell; Amos, Leslie; Ortega, Marcos; Zumsteg, Dominik; Wieser, Heinz-Gregor; Steinhoff, Bernhard J.; Krämer, Günter; Hansen, Jörg; Dorn, Thomas; Kantanen, Anne-Mari; Gjerstad, Leif; Peuralinna, Terhi; Hernandez, Dena G.; Eriksson, Kai J.; Kälviäinen, Reetta K.; Doherty, Colin P.; Wood, Nicholas W.; Pandolfo, Massimo; Duncan, John S.; Sander, Josemir W.; Delanty, Norman

    2010-01-01

    Partial epilepsies have a substantial heritability. However, the actual genetic causes are largely unknown. In contrast to many other common diseases for which genetic association-studies have successfully revealed common variants associated with disease risk, the role of common variation in partial epilepsies has not yet been explored in a well-powered study. We undertook a genome-wide association-study to identify common variants which influence risk for epilepsy shared amongst partial epilepsy syndromes, in 3445 patients and 6935 controls of European ancestry. We did not identify any genome-wide significant association. A few single nucleotide polymorphisms may warrant further investigation. We exclude common genetic variants with effect sizes above a modest 1.3 odds ratio for a single variant as contributors to genetic susceptibility shared across the partial epilepsies. We show that, at best, common genetic variation can only have a modest role in predisposition to the partial epilepsies when considered across syndromes in Europeans. The genetic architecture of the partial epilepsies is likely to be very complex, reflecting genotypic and phenotypic heterogeneity. Larger meta-analyses are required to identify variants of smaller effect sizes (odds ratio <1.3) or syndrome-specific variants. Further, our results suggest research efforts should also be directed towards identifying the multiple rare variants likely to account for at least part of the heritability of the partial epilepsies. Data emerging from genome-wide association-studies will be valuable during the next serious challenge of interpreting all the genetic variation emerging from whole-genome sequencing studies. PMID:20522523

  15. Genome-wide association study identifies multiple susceptibility loci for pancreatic cancer

    PubMed Central

    Wolpin, Brian M.; Rizzato, Cosmeri; Kraft, Peter; Kooperberg, Charles; Petersen, Gloria M.; Wang, Zhaoming; Arslan, Alan A.; Beane-Freeman, Laura; Bracci, Paige M.; Buring, Julie; Canzian, Federico; Duell, Eric J.; Gallinger, Steven; Giles, Graham G.; Goodman, Gary E.; Goodman, Phyllis J.; Jacobs, Eric J.; Kamineni, Aruna; Klein, Alison P.; Kolonel, Laurence N.; Kulke, Matthew H.; Li, Donghui; Malats, Núria; Olson, Sara H.; Risch, Harvey A.; Sesso, Howard D.; Visvanathan, Kala; White, Emily; Zheng, Wei; Abnet, Christian C.; Albanes, Demetrius; Andreotti, Gabriella; Austin, Melissa A.; Barfield, Richard; Basso, Daniela; Berndt, Sonja I.; Boutron-Ruault, Marie-Christine; Brotzman, Michelle; Büchler, Markus W.; Bueno-de-Mesquita, H. Bas; Bugert, Peter; Burdette, Laurie; Campa, Daniele; Caporaso, Neil E.; Capurso, Gabriele; Chung, Charles; Cotterchio, Michelle; Costello, Eithne; Elena, Joanne; Funel, Niccola; Gaziano, J. Michael; Giese, Nathalia A.; Giovannucci, Edward L.; Goggins, Michael; Gorman, Megan J.; Gross, Myron; Haiman, Christopher A.; Hassan, Manal; Helzlsouer, Kathy J.; Henderson, Brian E.; Holly, Elizabeth A.; Hu, Nan; Hunter, David J.; Innocenti, Federico; Jenab, Mazda; Kaaks, Rudolf; Key, Timothy J.; Khaw, Kay-Tee; Klein, Eric A.; Kogevinas, Manolis; Krogh, Vittorio; Kupcinskas, Juozas; Kurtz, Robert C.; LaCroix, Andrea; Landi, Maria T.; Landi, Stefano; Le Marchand, Loic; Mambrini, Andrea; Mannisto, Satu; Milne, Roger L.; Nakamura, Yusuke; Oberg, Ann L.; Owzar, Kouros; Patel, Alpa V.; Peeters, Petra H. M.; Peters, Ulrike; Pezzilli, Raffaele; Piepoli, Ada; Porta, Miquel; Real, Francisco X.; Riboli, Elio; Rothman, Nathaniel; Scarpa, Aldo; Shu, Xiao-Ou; Silverman, Debra T.; Soucek, Pavel; Sund, Malin; Talar-Wojnarowska, Renata; Taylor, Philip R.; Theodoropoulos, George E.; Thornquist, Mark; Tjønneland, Anne; Tobias, Geoffrey S.; Trichopoulos, Dimitrios; Vodicka, Pavel; Wactawski-Wende, Jean; Wentzensen, Nicolas; Wu, Chen; Yu, Herbert; Yu, Kai; Zeleniuch-Jacquotte, Anne; Hoover, Robert; Hartge, Patricia; Fuchs, Charles; Chanock, Stephen J.

    2014-01-01

    We performed a multistage genome-wide association study (GWAS) including 7,683 individuals with pancreatic cancer and 14,397 controls of European descent. Four new loci reached genome-wide significance: rs6971499 at 7q32.3 (LINC-PINT; per-allele odds ratio [OR] = 0.79; 95% confidence interval [CI] = 0.74–0.84; P = 3.0×10−12), rs7190458 at 16q23.1 (BCAR1/CTRB1/CTRB2; OR = 1.46; 95% CI = 1.30–1.65; P = 1.1×10−10), rs9581943 at 13q12.2 (PDX1; OR = 1.15; 95% CI = 1.10–1.20; P = 2.4×10−9), and rs16986825 at 22q12.1 (ZNRF3; OR = 1.18; 95% CI = 1.12–1.25; P = 1.2×10−8). An independent signal was identified in exon 2 of TERT at the established region 5p15.33 (rs2736098; OR = 0.80; 95% CI = 0.76–0.85; P = 9.8×10−14). We also identified a locus at 8q24.21 (rs1561927; P = 1.3×10−7) that approached genome-wide significance located 455 kb telomeric of PVT1. Our study has identified multiple new susceptibility alleles for pancreatic cancer worthy of follow-up studies. PMID:25086665

  16. Genome-wide Meta-analysis on the Sense of Smell Among US Older Adults.

    PubMed

    Dong, Jing; Yang, Jingyun; Tranah, Greg; Franceschini, Nora; Parimi, Neeta; Alkorta-Aranburu, Gorka; Xu, Zongli; Alonso, Alvaro; Cummings, Steven R; Fornage, Myriam; Huang, Xuemei; Kritchevsky, Stephen; Liu, Yongmei; London, Stephanie; Niu, Liang; Wilson, Robert S; De Jager, Philip L; Yu, Lei; Singleton, Andrew B; Harris, Tamara; Mosley, Thomas H; Pinto, Jayant M; Bennett, David A; Chen, Honglei

    2015-11-01

    Olfactory dysfunction is common among older adults and affects their safety, nutrition, quality of life, and mortality. More importantly, the decreased sense of smell is an early symptom of neurodegenerative diseases such as Parkinson disease (PD) and Alzheimer disease. However, the genetic determinants for the sense of smell have been poorly investigated. We here performed the first genome-wide meta-analysis on the sense of smell among 6252 US older adults of European descent from the Atherosclerosis Risk in Communities (ARIC) study, the Health, Aging, and Body Composition (Health ABC) study, and the Religious Orders Study and the Rush Memory and Aging Project (ROS/MAP). Genome-wide association study analysis was performed first by individual cohorts and then meta-analyzed using fixed-effect models with inverse variance weights. Although no SNPs reached genome-wide statistical significance, we identified 13 loci with suggestive evidence for an association with the sense of smell (Pmeta < 1 × 10). Of these, 2 SNPs at chromosome 17q21.31 (rs199443 in NSF, P = 3.02 × 10; and rs2732614 in KIAA1267-LRRC37A, P = 6.65 × 10) exhibited cis effects on the expression of microtubule-associated protein tau (MAPT, 17q21.31) in 447 frontal-cortex samples obtained postmortem and profiled by RNA-seq (P < 1 × 10). Gene-based and pathway-enrichment analyses further implicated MAPT in regulating the sense of smell in older adults. Similar results were obtained after excluding participants who reported a physician-diagnosed PD or use of PD medications. In conclusion, we provide preliminary evidence that the MAPT locus may play a role in regulating the sense of smell in older adults and therefore offer a potential genetic link between poor sense of smell and major neurodegenerative diseases. PMID:26632684

  17. Genome-Wide Association Study Reveals Multiple Loci Influencing Normal Human Facial Morphology.

    PubMed

    Shaffer, John R; Orlova, Ekaterina; Lee, Myoung Keun; Leslie, Elizabeth J; Raffensperger, Zachary D; Heike, Carrie L; Cunningham, Michael L; Hecht, Jacqueline T; Kau, Chung How; Nidey, Nichole L; Moreno, Lina M; Wehby, George L; Murray, Jeffrey C; Laurie, Cecelia A; Laurie, Cathy C; Cole, Joanne; Ferrara, Tracey; Santorico, Stephanie; Klein, Ophir; Mio, Washington; Feingold, Eleanor; Hallgrimsson, Benedikt; Spritz, Richard A; Marazita, Mary L; Weinberg, Seth M

    2016-08-01

    Numerous lines of evidence point to a genetic basis for facial morphology in humans, yet little is known about how specific genetic variants relate to the phenotypic expression of many common facial features. We conducted genome-wide association meta-analyses of 20 quantitative facial measurements derived from the 3D surface images of 3118 healthy individuals of European ancestry belonging to two US cohorts. Analyses were performed on just under one million genotyped SNPs (Illumina OmniExpress+Exome v1.2 array) imputed to the 1000 Genomes reference panel (Phase 3). We observed genome-wide significant associations (p < 5 x 10-8) for cranial base width at 14q21.1 and 20q12, intercanthal width at 1p13.3 and Xq13.2, nasal width at 20p11.22, nasal ala length at 14q11.2, and upper facial depth at 11q22.1. Several genes in the associated regions are known to play roles in craniofacial development or in syndromes affecting the face: MAFB, PAX9, MIPOL1, ALX3, HDAC8, and PAX1. We also tested genotype-phenotype associations reported in two previous genome-wide studies and found evidence of replication for nasal ala length and SNPs in CACNA2D3 and PRDM16. These results provide further evidence that common variants in regions harboring genes of known craniofacial function contribute to normal variation in human facial features. Improved understanding of the genes associated with facial morphology in healthy individuals can provide insights into the pathways and mechanisms controlling normal and abnormal facial morphogenesis. PMID:27560520

  18. A genome-wide meta-analysis of association studies of Cloninger's Temperament Scales

    PubMed Central

    Service, S K; Verweij, K J H; Lahti, J; Congdon, E; Ekelund, J; Hintsanen, M; Räikkönen, K; Lehtimäki, T; Kähönen, M; Widen, E; Taanila, A; Veijola, J; Heath, A C; Madden, P A F; Montgomery, G W; Sabatti, C; Järvelin, M-R; Palotie, A; Raitakari, O; Viikari, J; Martin, N G; Eriksson, J G; Keltikangas-Järvinen, L; Wray, N R; Freimer, N B

    2012-01-01

    Temperament has a strongly heritable component, yet multiple independent genome-wide studies have failed to identify significant genetic associations. We have assembled the largest sample to date of persons with genome-wide genotype data, who have been assessed with Cloninger's Temperament and Character Inventory. Sum scores for novelty seeking, harm avoidance, reward dependence and persistence have been measured in over 11 000 persons collected in four different cohorts. Our study had >80% power to identify genome-wide significant loci (P<1.25 × 10−8, with correction for testing four scales) accounting for ⩾0.4% of the phenotypic variance in temperament scales. Using meta-analysis techniques, gene-based tests and pathway analysis we have tested over 1.2 million single-nucleotide polymorphisms (SNPs) for association to each of the four temperament dimensions. We did not discover any SNPs, genes, or pathways to be significantly related to the four temperament dimensions, after correcting for multiple testing. Less than 1% of the variability in any temperament dimension appears to be accounted for by a risk score derived from the SNPs showing strongest association to the temperament dimensions. Elucidation of genetic loci significantly influencing temperament and personality will require potentially very large samples, and/or a more refined phenotype. Item response theory methodology may be a way to incorporate data from cohorts assessed with multiple personality instruments, and might be a method by which a large sample of a more refined phenotype could be acquired. PMID:22832960

  19. Genome-wide gene deletions in Streptococcus sanguinis by high throughput PCR.

    PubMed

    Ge, Xiuchun; Xu, Ping

    2012-01-01

    Transposon mutagenesis and single-gene deletion are two methods applied in genome-wide gene knockout in bacteria (1,2). Although transposon mutagenesis is less time consuming, less costly, and does not require completed genome information, there are two weaknesses in this method: (1) the possibility of a disparate mutants in the mixed mutant library that counter-selects mutants with decreased competition; and (2) the possibility of partial gene inactivation whereby genes do not entirely lose their function following the insertion of a transposon. Single-gene deletion analysis may compensate for the drawbacks associated with transposon mutagenesis. To improve the efficiency of genome-wide single gene deletion, we attempt to establish a high-throughput technique for genome-wide single gene deletion using Streptococcus sanguinis as a model organism. Each gene deletion construct in S. sanguinis genome is designed to comprise 1-kb upstream of the targeted gene, the aphA-3 gene, encoding kanamycin resistance protein, and 1-kb downstream of the targeted gene. Three sets of primers F1/R1, F2/R2, and F3/R3, respectively, are designed and synthesized in a 96-well plate format for PCR-amplifications of those three components of each deletion construct. Primers R1 and F3 contain 25-bp sequences that are complementary to regions of the aphA-3 gene at their 5' end. A large scale PCR amplification of the aphA-3 gene is performed once for creating all single-gene deletion constructs. The promoter of aphA-3 gene is initially excluded to minimize the potential polar effect of kanamycin cassette. To create the gene deletion constructs, high-throughput PCR amplification and purification are performed in a 96-well plate format. A linear recombinant PCR amplicon for each gene deletion will be made up through four PCR reactions using high-fidelity DNA polymerase. The initial exponential growth phase of S. sanguinis cultured in Todd Hewitt broth supplemented with 2.5% inactivated horse

  20. Genome-wide meta-analyses identify three loci associated with primary biliary cirrhosis

    PubMed Central

    Liu, Xiangdong; Invernizzi, Pietro; Lu, Yue; Kosoy, Roman; Lu, Yan; Bianchi, Ilaria; Podda, Mauro; Xu, Chun; Xie, Gang; Macciardi, Fabio; Selmi, Carlo; Lupoli, Sara; Shigeta, Russell; Ransom, Michael; Lleo, Ana; Lee, Annette T; Mason, Andrew L; Myers, Robert P; Peltekian, Kevork M; Ghent, Cameron N; Bernuzzi, Francesca; Zuin, Massimo; Rosina, Floriano; Borghesio, Elisabetta; Floreani, Annarosa; Lazzari, Roberta; Niro, Grazia; Andriulli, Angelo; Muratori, Luigi; Muratori, Paolo; Almasio, Piero L; Andreone, Pietro; Margotti, Marzia; Brunetto, Maurizia; Coco, Barbara; Alvaro, Domenico; Bragazzi, Maria C; Marra, Fabio; Pisano, Alessandro; Rigamonti, Cristina; Colombo, Massimo; Marzioni, Marco; Benedetti, Antonio; Fabris, Luca; Strazzabosco, Mario; Portincasa, Piero; Palmieri, Vincenzo O; Tiribelli, Claudio; Croce, Lory; Bruno, Savino; Rossi, Sonia; Vinci, Maria; Prisco, Cleofe; Mattalia, Alberto; Toniutto, Pierluigi; Picciotto, Antonio; Galli, Andrea; Ferrari, Carlo; Colombo, Silvia; Casella, Giovanni; Morini, Lorenzo; Caporaso, Nicola; Colli, Agostino; Spinzi, Giancarlo; Montanari, Renzo; Gregersen, Peter K; Heathcote, E Jenny; Hirschfield, Gideon M; Siminovitch, Katherine A; Amos, Christopher I; Gershwin, M Eric; Seldin, Michael F

    2011-01-01

    A genome-wide association screen for primary biliary cirrhosis risk alleles was performed in an Italian cohort. The results from the Italian cohort replicated IL12A and IL12RB associations, and a combined meta-analysis using a Canadian dataset identified newly associated loci at SPIB (P = 7.9 × 10–11, odds ratio (OR) = 1.46), IRF5-TNPO3 (P = 2.8 × 10–10, OR = 1.63) and 17q12-21 (P = 1.7 × 10–10, OR = 1.38). PMID:20639880

  1. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data

    PubMed Central

    Liu, Zhi-Ping

    2015-01-01

    Transcriptional regulation plays vital roles in many fundamental biological processes. Reverse engineering of genome-wide regulatory networks from high-throughput transcriptomic data provides a promising way to characterize the global scenario of regulatory relationships between regulators and their targets. In this review, we summarize and categorize the main frameworks and methods currently available for inferring transcriptional regulatory networks from microarray gene expression profiling data. We overview each of strategies and introduce representative methods respectively. Their assumptions, advantages, shortcomings, and possible improvements and extensions are also clarified and commented. PMID:25937810

  2. A mega-analysis of genome-wide association studies for major depressive disorder.

    PubMed

    Ripke, Stephan; Wray, Naomi R; Lewis, Cathryn M; Hamilton, Steven P; Weissman, Myrna M; Breen, Gerome; Byrne, Enda M; Blackwood, Douglas H R; Boomsma, Dorret I; Cichon, Sven; Heath, Andrew C; Holsboer, Florian; Lucae, Susanne; Madden, Pamela A F; Martin, Nicholas G; McGuffin, Peter; Muglia, Pierandrea; Noethen, Markus M; Penninx, Brenda P; Pergadia, Michele L; Potash, James B; Rietschel, Marcella; Lin, Danyu; Müller-Myhsok, Bertram; Shi, Jianxin; Steinberg, Stacy; Grabe, Hans J; Lichtenstein, Paul; Magnusson, Patrik; Perlis, Roy H; Preisig, Martin; Smoller, Jordan W; Stefansson, Kari; Uher, Rudolf; Kutalik, Zoltan; Tansey, Katherine E; Teumer, Alexander; Viktorin, Alexander; Barnes, Michael R; Bettecken, Thomas; Binder, Elisabeth B; Breuer, René; Castro, Victor M; Churchill, Susanne E; Coryell, William H; Craddock, Nick; Craig, Ian W; Czamara, Darina; De Geus, Eco J; Degenhardt, Franziska; Farmer, Anne E; Fava, Maurizio; Frank, Josef; Gainer, Vivian S; Gallagher, Patience J; Gordon, Scott D; Goryachev, Sergey; Gross, Magdalena; Guipponi, Michel; Henders, Anjali K; Herms, Stefan; Hickie, Ian B; Hoefels, Susanne; Hoogendijk, Witte; Hottenga, Jouke Jan; Iosifescu, Dan V; Ising, Marcus; Jones, Ian; Jones, Lisa; Jung-Ying, Tzeng; Knowles, James A; Kohane, Isaac S; Kohli, Martin A; Korszun, Ania; Landen, Mikael; Lawson, William B; Lewis, Glyn; Macintyre, Donald; Maier, Wolfgang; Mattheisen, Manuel; McGrath, Patrick J; McIntosh, Andrew; McLean, Alan; Middeldorp, Christel M; Middleton, Lefkos; Montgomery, Grant M; Murphy, Shawn N; Nauck, Matthias; Nolen, Willem A; Nyholt, Dale R; O'Donovan, Michael; Oskarsson, Högni; Pedersen, Nancy; Scheftner, William A; Schulz, Andrea; Schulze, Thomas G; Shyn, Stanley I; Sigurdsson, Engilbert; Slager, Susan L; Smit, Johannes H; Stefansson, Hreinn; Steffens, Michael; Thorgeirsson, Thorgeir; Tozzi, Federica; Treutlein, Jens; Uhr, Manfred; van den Oord, Edwin J C G; Van Grootheest, Gerard; Völzke, Henry; Weilburg, Jeffrey B; Willemsen, Gonneke; Zitman, Frans G; Neale, Benjamin; Daly, Mark; Levinson, Douglas F; Sullivan, Patrick F

    2013-04-01

    Prior genome-wide association studies (GWAS) of major depressive disorder (MDD) have met with limited success. We sought to increase statistical power to detect disease loci by conducting a GWAS mega-analysis for MDD. In the MDD discovery phase, we analyzed more than 1.2 million autosomal and X chromosome single-nucleotide polymorphisms (SNPs) in 18 759 independent and unrelated subjects of recent European ancestry (9240 MDD cases and 9519 controls). In the MDD replication phase, we evaluated 554 SNPs in independent samples (6783 MDD cases and 50 695 controls). We also conducted a cross-disorder meta-analysis using 819 autosomal SNPs with P<0.0001 for either MDD or the Psychiatric GWAS Consortium bipolar disorder (BIP) mega-analysis (9238 MDD cases/8039 controls and 6998 BIP cases/7775 controls). No SNPs achieved genome-wide significance in the MDD discovery phase, the MDD replication phase or in pre-planned secondary analyses (by sex, recurrent MDD, recurrent early-onset MDD, age of onset, pre-pubertal onset MDD or typical-like MDD from a latent class analyses of the MDD criteria). In the MDD-bipolar cross-disorder analysis, 15 SNPs exceeded genome-wide significance (P<5 × 10(-8)), and all were in a 248 kb interval of high LD on 3p21.1 (chr3:52 425 083-53 822 102, minimum P=5.9 × 10(-9) at rs2535629). Although this is the largest genome-wide analysis of MDD yet conducted, its high prevalence means that the sample is still underpowered to detect genetic effects typical for complex traits. Therefore, we were unable to identify robust and replicable findings. We discuss what this means for genetic research for MDD. The 3p21.1 MDD-BIP finding should be interpreted with caution as the most significant SNP did not replicate in MDD samples, and genotyping in independent samples will be needed to resolve its status. PMID:22472876

  3. Genome-Wide Chromatin Immunoprecipitation in Candida albicans and Other Yeasts.

    PubMed

    Lohse, Matthew B; Kongsomboonvech, Pisiwat; Madrigal, Maria; Hernday, Aaron D; Nobile, Clarissa J

    2016-01-01

    Chromatin immunoprecipitation experiments are critical to investigating the interactions between DNA and a wide range of nuclear proteins within a cell or biological sample. In this chapter we outline an optimized protocol for genome-wide chromatin immunoprecipitation that has been used successfully for several distinct morphological forms of numerous yeast species, and include an optimized method for amplification of chromatin immunoprecipitated DNA samples and hybridization to a high-density oligonucleotide tiling microarray. We also provide detailed suggestions on how to analyze the complex data obtained from these experiments. PMID:26483022

  4. Genome-wide association analysis in primary sclerosing cholangitis identifies two non-HLA susceptibility loci

    PubMed Central

    Melum, Espen; Franke, Andre; Schramm, Christoph; Weismüller, Tobias J; Gotthardt, Daniel Nils; Offner, Felix A; Juran, Brian D; Laerdahl, Jon K; Labi, Verena; Björnsson, Einar; Weersma, Rinse K; Henckaerts, Liesbet; Teufel, Andreas; Rust, Christian; Ellinghaus, Eva; Balschun, Tobias; Boberg, Kirsten Muri; Ellinghaus, David; Bergquist, Annika; Sauer, Peter; Ryu, Euijung; Hov, Johannes Roksund; Wedemeyer, Jochen; Lindkvist, Björn; Wittig, Michael; Porte, Robert J; Holm, Kristian; Gieger, Christian; Wichmann, H-Erich; Stokkers, Pieter; Ponsioen, Cyriel Y; Runz, Heiko; Stiehl, Adolf; Wijmenga, Cisca; Sterneck, Martina; Vermeire, Severine; Beuers, Ulrich; Villunger, Andreas; Schrumpf, Erik; Lazaridis, Konstantinos N; Manns, Michael P; Schreiber, Stefan; Karlsen, Tom H

    2015-01-01

    Primary sclerosing cholangitis (PSC) is a chronic bile duct disease affecting 2.4–7.5% of individuals with inflammatory bowel disease. We performed a genome-wide association analysis of 2,466,182 SNPs in 715 individuals with PSC and 2,962 controls, followed by replication in 1,025 PSC cases and 2,174 controls. We detected non-HLA associations at rs3197999 in MST1 and rs6720394 near BCL2L11 (combined P = 1.1 × 10−16 and P = 4.1 × 10−8, respectively). PMID:21151127

  5. Genome-wide association study identifies variants in TMPRSS6 associated with hemoglobin levels.

    PubMed

    Chambers, John C; Zhang, Weihua; Li, Yun; Sehmi, Joban; Wass, Mark N; Zabaneh, Delilah; Hoggart, Clive; Bayele, Henry; McCarthy, Mark I; Peltonen, Leena; Freimer, Nelson B; Srai, Surjit K; Maxwell, Patrick H; Sternberg, Michael J E; Ruokonen, Aimo; Abecasis, Gonçalo; Jarvelin, Marjo-Riitta; Scott, James; Elliott, Paul; Kooner, Jaspal S

    2009-11-01

    We carried out a genome-wide association study of hemoglobin levels in 16,001 individuals of European and Indian Asian ancestry. The most closely associated SNP (rs855791) results in nonsynonymous (V736A) change in the serine protease domain of TMPRSS6 and a blood hemoglobin concentration 0.13 (95% CI 0.09-0.17) g/dl lower per copy of allele A (P = 1.6 x 10(-13)). Our findings suggest that TMPRSS6, a regulator of hepcidin synthesis and iron handling, is crucial in hemoglobin level maintenance. PMID:19820698

  6. Genome-Wide Chromatin Immunoprecipitation in Candida albicans and Other Yeasts

    PubMed Central

    Lohse, Matthew B.; Kongsomboonvech, Pisiwat; Madrigal, Maria; Hernday, Aaron D.; Nobile, Clarissa J.

    2016-01-01

    Chromatin immunoprecipitation experiments are critical to investigating the interactions between DNA and a wide range of nuclear proteins within a cell or biological sample. In this chapter we outline an optimized protocol for genome-wide chromatin immunoprecipitation that has been used successfully for several distinct morphological forms of numerous yeast species, and include an optimized method for amplification of chromatin immunoprecipitated DNA samples and hybridization to a high-density oligonucleotide tiling microarray. We also provide detailed suggestions on how to analyze the complex data obtained from these experiments. PMID:26483022

  7. Genome-Wide Association Study Identifies Novel Loci Associated With Diisocyanate-Induced Occupational Asthma

    PubMed Central

    Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.

    2015-01-01

    Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132

  8. Inferring Where and When Replication Initiates from Genome-Wide Replication Timing Data

    NASA Astrophysics Data System (ADS)

    Baker, A.; Audit, B.; Yang, S. C.-H.; Bechhoefer, J.; Arneodo, A.

    2012-06-01

    Based on an analogy between DNA replication and one dimensional nucleation-and-growth processes, various attempts to infer the local initiation rate I(x,t) of DNA replication origins from replication timing data have been developed in the framework of phase transition kinetics theories. These works have all used curve-fit strategies to estimate I(x,t) from genome-wide replication timing data. Here, we show how to invert analytically the Kolmogorov-Johnson-Mehl-Avrami model and extract I(x,t) directly. Tests on both simulated and experimental budding-yeast data confirm the location and firing-time distribution of replication origins.

  9. Genome-Wide Association Studies: Progress in Identifying Genetic Biomarkers in Common, Complex Diseases

    PubMed Central

    Kingsmore, Stephen F.; Lindquist, Ingrid E.; Mudge, Joann; Beavis, William D.

    2007-01-01

    Novel, comprehensive approaches for biomarker discovery and validation are urgently needed. One particular area of methodologic need is for discovery of novel genetic biomarkers in complex diseases and traits. Here, we review recent successes in the use of genome wide association (GWA) approaches to identify genetic biomarkers in common human diseases and traits. Such studies are yielding initial insights into the allelic architecture of complex traits. In general, it appears that complex diseases are associated with many common polymorphisms, implying profound genetic heterogeneity between affected individuals. PMID:19662211

  10. Genome-Wide Analysis of DNA Methylation and Cigarette Smoking in a Chinese Population

    PubMed Central

    Zhu, Xiaoyan; Li, Jun; Deng, Siyun; Yu, Kuai; Liu, Xuezhen; Deng, Qifei; Sun, Huizhen; Zhang, Xiaomin; He, Meian; Guo, Huan; Chen, Weihong; Yuan, Jing; Zhang, Bing; Kuang, Dan; He, Xiaosheng; Bai, Yansen; Han, Xu; Liu, Bing; Li, Xiaoliang; Yang, Liangle; Jiang, Haijing; Zhang, Yizhi; Hu, Jie; Cheng, Longxian; Luo, Xiaoting; Mei, Wenhua; Zhou, Zhiming; Sun, Shunchang; Zhang, Liyun; Liu, Chuanyao; Guo, Yanjun; Zhang, Zhihong; Hu, Frank B.; Liang, Liming; Wu, Tangchun

    2016-01-01

    Background: Smoking is a risk factor for many human diseases. DNA methylation has been related to smoking, but genome-wide methylation data for smoking in Chinese populations is limited. Objectives: We aimed to investigate epigenome-wide methylation in relation to smoking in a Chinese population. Methods: We measured the methylation levels at > 485,000 CpG sites (CpGs) in DNA from leukocytes using a methylation array and conducted a genome-wide meta-analysis of DNA methylation and smoking in a total of 596 Chinese participants. We further evaluated the associations of smoking-related CpGs with internal polycyclic aromatic hydrocarbon (PAH) biomarkers and their correlations with the expression of corresponding genes. Results: We identified 318 CpGs whose methylation levels were associated with smoking at a genome-wide significance level (false discovery rate < 0.05), among which 161 CpGs annotated to 123 genes were not associated with smoking in recent studies of Europeans and African Americans. Of these smoking-related CpGs, methylation levels at 80 CpGs showed significant correlations with the expression of corresponding genes (including RUNX3, IL6R, PTAFR, ANKRD11, CEP135 and CDH23), and methylation at 15 CpGs was significantly associated with urinary 2-hydroxynaphthalene, the most representative internal monohydroxy-PAH biomarker for smoking. Conclusion: We identified DNA methylation markers associated with smoking in a Chinese population, including some markers that were also correlated with gene expression. Exposure to naphthalene, a byproduct of tobacco smoke, may contribute to smoking-related methylation. Citation: Zhu X, Li J, Deng S, Yu K, Liu X, Deng Q, Sun H, Zhang X, He M, Guo H, Chen W, Yuan J, Zhang B, Kuang D, He X, Bai Y, Han X, Liu B, Li X, Yang L, Jiang H, Zhang Y, Hu J, Cheng L, Luo X, Mei W, Zhou Z, Sun S, Zhang L, Liu C, Guo Y, Zhang Z, Hu FB, Liang L, Wu T. 2016. Genome-wide analysis of DNA methylation and cigarette smoking in Chinese. Environ

  11. Brewing yeast genomes and genome-wide expression and proteome profiling during fermentation.

    PubMed

    Smart, Katherine A

    2007-11-01

    The genome structure, ancestry and instability of the brewing yeast strains have received considerable attention. The hybrid nature of brewing lager yeast strains provides adaptive potential but yields genome instability which can adversely affect fermentation performance. The requirement to differentiate between production strains and assess master cultures for genomic instability has led to significant adoption of specialized molecular tool kits by the industry. Furthermore, the development of genome-wide transcriptional and protein expression technologies has generated significant interest from brewers. The opportunity presented to explore, and the concurrent requirement to understand both, the constraints and potential of their strains to generate existing and new products during fermentation is discussed. PMID:17879324

  12. Genome-wide association analysis in primary sclerosing cholangitis identifies two non-HLA susceptibility loci.

    PubMed

    Melum, Espen; Franke, Andre; Schramm, Christoph; Weismüller, Tobias J; Gotthardt, Daniel Nils; Offner, Felix A; Juran, Brian D; Laerdahl, Jon K; Labi, Verena; Björnsson, Einar; Weersma, Rinse K; Henckaerts, Liesbet; Teufel, Andreas; Rust, Christian; Ellinghaus, Eva; Balschun, Tobias; Boberg, Kirsten Muri; Ellinghaus, David; Bergquist, Annika; Sauer, Peter; Ryu, Euijung; Hov, Johannes Roksund; Wedemeyer, Jochen; Lindkvist, Björn; Wittig, Michael; Porte, Robert J; Holm, Kristian; Gieger, Christian; Wichmann, H-Erich; Stokkers, Pieter; Ponsioen, Cyriel Y; Runz, Heiko; Stiehl, Adolf; Wijmenga, Cisca; Sterneck, Martina; Vermeire, Severine; Beuers, Ulrich; Villunger, Andreas; Schrumpf, Erik; Lazaridis, Konstantinos N; Manns, Michael P; Schreiber, Stefan; Karlsen, Tom H

    2011-01-01

    Primary sclerosing cholangitis (PSC) is a chronic bile duct disease affecting 2.4-7.5% of individuals with inflammatory bowel disease. We performed a genome-wide association analysis of 2,466,182 SNPs in 715 individuals with PSC and 2,962 controls, followed by replication in 1,025 PSC cases and 2,174 controls. We detected non-HLA associations at rs3197999 in MST1 and rs6720394 near BCL2L11 (combined P = 1.1 × 10⁻¹⁶ and P = 4.1 × 10⁻⁸, respectively). PMID:21151127

  13. The era of genome-wide association studies: opportunities and challenges for asthma genetics.

    PubMed

    Zhang, Guicheng; Goldblatt, Jack; LeSouëf, Peter

    2009-11-01

    In the era of genome-wide association (GWA) studies, delineating pathogenic asthma genetic pathways has provided both challenges and opportunities. Initial GWA studies on asthma and asthma-like phenotypes provided some successes in terms of ascertaining new potential asthma candidate genes. However, due to asthma having a heterogeneous etiology, replications of these genotype-phenotype association studies are generally lacking. Furthermore, genes by environment interactions are generally not considered when GWA studies are conducted. Therefore, there is a need for extensive collaborations in multi-disciplinary research fields, including different environments and populations, to investigate the functional importance of variations in the human genome in relation to asthma pathogenesis. PMID:19816512

  14. Genome-Wide Association Analysis of Blood Biomarkers in Chronic Obstructive Pulmonary Disease

    PubMed Central

    Kim, Deog Kyeom; Cho, Michael H.; Hersh, Craig P.; Lomas, David A.; Miller, Bruce E.; Kong, Xiangyang; Bakke, Per; Gulsvik, Amund; Agustí, Alvar; Wouters, Emiel; Celli, Bartolome; Coxson, Harvey; Vestbo, Jørgen; MacNee, William; Yates, Julie C.; Rennard, Stephen; Litonjua, Augusto; Qiu, Weiliang; Beaty, Terri H.; Crapo, James D.; Riley, John H.; Tal-Singer, Ruth

    2012-01-01

    Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility. Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic determinants of COPD. Methods: GWAS was performed for two pneumoproteins, Clara cell secretory protein (CC16) and surfactant protein D (SP-D), and five systemic inflammatory markers (C-reactive protein, fibrinogen, IL-6, IL-8, and tumor necrosis factor-α) in 1,951 subjects with COPD. For genome-wide significant single nucleotide polymorphisms (SNPs) (P < 1 × 10−8), association with COPD susceptibility was tested in 2,939 cases with COPD and 1,380 smoking control subjects. The association of candidate SNPs with mRNA expression in induced sputum was also elucidated. Measurements and Main Results: Genome-wide significant susceptibility loci affecting biomarker levels were found only for the two pneumoproteins. Two discrete loci affecting CC16, one region near the CC16 coding gene (SCGB1A1) on chromosome 11 and another locus approximately 25 Mb away from SCGB1A1, were identified, whereas multiple SNPs on chromosomes 6 and 16, in addition to SNPs near SFTPD, had genome-wide significant associations with SP-D levels. Several SNPs affecting circulating CC16 levels were significantly associated with sputum mRNA expression of SCGB1A1 (P = 0.009–0.03). Several SNPs highly associated with CC16 or SP-D levels were nominally associated with COPD in a collaborative GWAS (P = 0.001–0.049), although these COPD associations were not replicated in two additional cohorts. Conclusions: Distant genetic loci and biomarker-coding genes affect circulating levels of COPD-related pneumoproteins. A subset of these protein quantitative trait loci may influence their gene expression in the lung and/or COPD susceptibility. Clinical trial registered with www.clinicaltrials.gov (NCT 00292552). PMID

  15. Genome-wide Association Studies from the Cancer Genetic Markers of Susceptibility (CGEMS) Initiative | Office of Cancer Genomics

    Cancer.gov

    CGEMS identifies common inherited genetic variations associated with a number of cancers, including breast and prostate. Data from these genome-wide association studies (GWAS) are available through the Division of Cancer Epidemiology & Genetics website.

  16. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence

    PubMed Central

    Kadeva, Neli; Miller, Mike B.; Iacono, William G.; McGue, Matt; Stergiakouli, Evie; Davey Smith, George; Putallaz, Martha; Lubinski, David; Meaburn, Emma L.; Plomin, Robert; Simpson, Michael A.

    2015-01-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. The present study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1 409 individuals drawn from the top 0.0003 (IQ > 170) of the population distribution of intelligence and 3 253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina Human Exome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (Genome-wide Complex Trait Analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (SE 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence. PMID:26239293

  17. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    PubMed

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50(®) chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep. PMID:26990958

  18. Meta-analysis of genome-wide association studies identifies 10 loci influencing allergic sensitization

    PubMed Central

    Granell, Raquel; Strachan, David P; Alves, Alexessander Couto; Linneberg, Allan; Curtin, John A; Warrington, Nicole M; Standl, Marie; Kerkhof, Marjan; Jonsdottir, Ingileif; Bukvic, Blazenka K; Kaakinen, Marika; Sleimann, Patrick; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Schramm, Katharina; Baltic, Svetlana; Kreiner-Møller, Eskil; Simpson, Angela; St Pourcain, Beate; Coin, Lachlan; Hui, Jennie; Walters, Eugene H; Tiesler, Carla M T; Duffy, David L; Jones, Graham; Ring, Susan M; McArdle, Wendy L; Price, Loren; Robertson, Colin F; Pekkanen, Juha; Tang, Clara S; Thiering, Elisabeth; Montgomery, Grant W; Hartikainen, Anna-Liisa; Dharmage, Shyamali C; Husemoen, Lise L; Herder, Christian; Kemp, John P; Elliot, Paul; James, Alan; Waldenberger, Melanie; Abramson, Michael J; Fairfax, Benjamin P; Knight, Julian C; Gupta, Ramneek; Thompson, Philip J; Holt, Patrick; Sly, Peter; Hirschhorn, Joel N; Blekic, Mario; Weidinger, Stephan; Hakonarsson, Hakon; Stefansson, Kari; Heinrich, Joachim; Postma, Dirkje S; Custovic, Adnan; Pennell, Craig E; Jarvelin, Marjo-Riitta; Koppelman, Gerard H; Timpson, Nicholas; Ferreira, Manuel A; Bisgaard, Hans; Henderson, A John

    2016-01-01

    Allergen-specific IgE (allergic sensitization) plays a central role in the pathogenesis of allergic disease. We performed the first large-scale genome wide association study (GWAS) of allergic sensitization in 5,789 affected individuals and 10,056 controls and followed up the top SNP from 26 loci in 6,114 affected individuals and 9,920 controls. We increased the number of susceptibility loci with genome-wide significant association to allergic sensitization from three to 10, including SNPs in or near TLR6, C11orf30, STAT6, SLC25A46, HLA-DQB1, IL1RL1, LPP, MYC, IL2 and HLA-B. All the top-SNPs were associated with allergic symptoms in an independent study. Risk variants at these 10 loci were estimated to account for at least 25% of allergic sensitization and allergic rhinitis. Understanding the molecular mechanisms underlying these associations may provide novel insight into the etiology of allergic disease. PMID:23817571

  19. Heavy metals induce oxidative stress and genome-wide modulation in transcriptome of rice root.

    PubMed

    Dubey, Sonali; Shri, Manju; Misra, Prashant; Lakhwani, Deepika; Bag, Sumit Kumar; Asif, Mehar H; Trivedi, Prabodh Kumar; Tripathi, Rudro Deo; Chakrabarty, Debasis

    2014-06-01

    Industrial growth, ecological disturbances and agricultural practices have contaminated the soil and water with many harmful compounds, including heavy metals. These heavy metals affect growth and development of plants as well as cause severe human health hazards through food chain contamination. In past, studies have been made to identify biochemical and molecular networks associated with heavy metal toxicity and uptake in plants. Studies suggested that most of the physiological and molecular processes affected by different heavy metals are similar to those affected by other abiotic stresses. To identify common and unique responses by different metals, we have studied biochemical and genome-wide modulation in transcriptome of rice (IR-64 cultivar) root after exposure to cadmium (Cd), arsenate [As(V)], lead (Pb) and chromium [Cr(VI)] in hydroponic condition. We observed that root tissue shows variable responses for antioxidant enzyme system for different heavy metals. Genome-wide expression analysis suggests variable number of genes differentially expressed in root in response to As(V), Cd, Pb and Cr(VI) stresses. In addition to unique genes, each heavy metal modulated expression of a large number of common genes. Study also identified cis-acting regions of the promoters which can be determinants for the modulated expression of the genes in response to different heavy metals. Our study advances understanding related to various processes and networks which might be responsible for heavy metal stresses, accumulation and detoxification. PMID:24553786

  20. Genome-wide association studies in preterm birth: implications for the practicing obstetrician-gynaecologist.

    PubMed

    Dolan, Siobhan M; Christiaens, Inge

    2013-01-01

    Preterm birth has the highest mortality and morbidity of all pregnancy complications. The burden of preterm birth on public health worldwide is enormous, yet there are few effective means to prevent a preterm delivery. To date, much of its etiology is unexplained, but genetic predisposition is thought to play a major role. In the upcoming year, the international Preterm Birth Genome Project (PGP) consortium plans to publish a large genome wide association study in early preterm birth. Genome-wide association studies (GWAS) are designed to identify common genetic variants that influence health and disease. Despite the many challenges that are involved, GWAS can be an important discovery tool, revealing genetic variations that are associated with preterm birth. It is highly unlikely that findings of a GWAS can be directly translated into clinical practice in the short run. Nonetheless, it will help us to better understand the etiology of preterm birth and the GWAS results will generate new hypotheses for further research, thus enhancing our understanding of preterm birth and informing prevention efforts in the long run. PMID:23445776

  1. Genome-wide association study of drought-related resistance traits in Aegilops tauschii.

    PubMed

    Qin, Peng; Lin, Yu; Hu, Yaodong; Liu, Kun; Mao, Shuangshuang; Li, Zhanyi; Wang, Jirui; Liu, Yaxi; Wei, Yuming; Zheng, Youliang

    2016-07-01

    The D-genome progenitor of wheat (Triticum aestivum), Aegilops tauschii, possesses numerous genes for resistance to abiotic stresses, including drought. Therefore, information on the genetic architecture of A. tauschii can aid the development of drought-resistant wheat varieties. Here, we evaluated 13 traits in 373 A. tauschii accessions grown under normal and polyethylene glycol-simulated drought stress conditions and performed a genome-wide association study using 7,185 single nucleotide polymorphism (SNP) markers. We identified 208 and 28 SNPs associated with all traits using the general linear model and mixed linear model, respectively, while both models detected 25 significant SNPs with genome-wide distribution. Public database searches revealed several candidate/flanking genes related to drought resistance that were grouped into three categories according to the type of encoded protein (enzyme, storage protein, and drought-induced protein). This study provided essential information for SNPs and genes related to drought resistance in A. tauschii and wheat, and represents a foundation for breeding drought-resistant wheat cultivars using marker-assisted selection. PMID:27392238

  2. Genome-wide autozygosity is associated with lower general cognitive ability.

    PubMed

    Howrigan, D P; Simonson, M A; Davies, G; Harris, S E; Tenesa, A; Starr, J M; Liewald, D C; Deary, I J; McRae, A; Wright, M J; Montgomery, G W; Hansell, N; Martin, N G; Payton, A; Horan, M; Ollier, W E; Abdellaoui, A; Boomsma, D I; DeRosse, P; Knowles, E E M; Glahn, D C; Djurovic, S; Melle, I; Andreassen, O A; Christoforou, A; Steen, V M; Hellard, S L; Sundet, K; Reinvang, I; Espeseth, T; Lundervold, A J; Giegling, I; Konte, B; Hartmann, A M; Rujescu, D; Roussos, P; Giakoumaki, S; Burdick, K E; Bitsios, P; Donohoe, G; Corley, R P; Visscher, P M; Pendleton, N; Malhotra, A K; Neale, B M; Lencz, T; Keller, M C

    2016-06-01

    Inbreeding depression refers to lower fitness among offspring of genetic relatives. This reduced fitness is caused by the inheritance of two identical chromosomal segments (autozygosity) across the genome, which may expose the effects of (partially) recessive deleterious mutations. Even among outbred populations, autozygosity can occur to varying degrees due to cryptic relatedness between parents. Using dense genome-wide single-nucleotide polymorphism (SNP) data, we examined the degree to which autozygosity associated with measured cognitive ability in an unselected sample of 4854 participants of European ancestry. We used runs of homozygosity-multiple homozygous SNPs in a row-to estimate autozygous tracts across the genome. We found that increased levels of autozygosity predicted lower general cognitive ability, and estimate a drop of 0.6 s.d. among the offspring of first cousins (P=0.003-0.02 depending on the model). This effect came predominantly from long and rare autozygous tracts, which theory predicts as more likely to be deleterious than short and common tracts. Association mapping of autozygous tracts did not reveal any specific regions that were predictive beyond chance after correcting for multiple testing genome wide. The observed effect size is consistent with studies of cognitive decline among offspring of known consanguineous relationships. These findings suggest a role for multiple recessive or partially recessive alleles in general cognitive ability, and that alleles decreasing general cognitive ability have been selected against over evolutionary time. PMID:26390830

  3. HITS-CLIP yields genome-wide insights into brain alternative RNA processing

    NASA Astrophysics Data System (ADS)

    Licatalosi, Donny D.; Mele, Aldo; Fak, John J.; Ule, Jernej; Kayikci, Melis; Chi, Sung Wook; Clark, Tyson A.; Schweitzer, Anthony C.; Blume, John E.; Wang, Xuning; Darnell, Jennifer C.; Darnell, Robert B.

    2008-11-01

    Protein-RNA interactions have critical roles in all aspects of gene expression. However, applying biochemical methods to understand such interactions in living tissues has been challenging. Here we develop a genome-wide means of mapping protein-RNA binding sites in vivo, by high-throughput sequencing of RNA isolated by crosslinking immunoprecipitation (HITS-CLIP). HITS-CLIP analysis of the neuron-specific splicing factor Nova revealed extremely reproducible RNA-binding maps in multiple mouse brains. These maps provide genome-wide in vivo biochemical footprints confirming the previous prediction that the position of Nova binding determines the outcome of alternative splicing; moreover, they are sufficiently powerful to predict Nova action de novo. HITS-CLIP revealed a large number of Nova-RNA interactions in 3' untranslated regions, leading to the discovery that Nova regulates alternative polyadenylation in the brain. HITS-CLIP, therefore, provides a robust, unbiased means to identify functional protein-RNA interactions in vivo.

  4. Partitioning heritability by functional annotation using genome-wide association summary statistics.

    PubMed

    Finucane, Hilary K; Bulik-Sullivan, Brendan; Gusev, Alexander; Trynka, Gosia; Reshef, Yakir; Loh, Po-Ru; Anttila, Verneri; Xu, Han; Zang, Chongzhi; Farh, Kyle; Ripke, Stephan; Day, Felix R; Purcell, Shaun; Stahl, Eli; Lindstrom, Sara; Perry, John R B; Okada, Yukinori; Raychaudhuri, Soumya; Daly, Mark J; Patterson, Nick; Neale, Benjamin M; Price, Alkes L

    2015-11-01

    Recent work has demonstrated that some functional categories of the genome contribute disproportionately to the heritability of complex diseases. Here we analyze a broad set of functional elements, including cell type-specific elements, to estimate their polygenic contributions to heritability in genome-wide association studies (GWAS) of 17 complex diseases and traits with an average sample size of 73,599. To enable this analysis, we introduce a new method, stratified LD score regression, for partitioning heritability from GWAS summary statistics while accounting for linked markers. This new method is computationally tractable at very large sample sizes and leverages genome-wide information. Our findings include a large enrichment of heritability in conserved regions across many traits, a very large immunological disease-specific enrichment of heritability in FANTOM5 enhancers and many cell type-specific enrichments, including significant enrichment of central nervous system cell types in the heritability of body mass index, age at menarche, educational attainment and smoking behavior. PMID:26414678

  5. Case-Control Genome-Wide Association of Attention-Deficit / Hyperactivity Disorder

    PubMed Central

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J.L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schäfer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Biederman, Joseph

    2010-01-01

    Objective Although twin and family studies have shown attention deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus, additional genomewide association studies (GWAS) are needed. Method We used case-control analyses of 896 cases with DSM-IV ADHD genotyped using the Affymetrix 5.0 array and 2,455 repository controls screened for psychotic and bipolar symptoms genotyped using Affymetrix 6.0 arrays. A consensus SNP set was imputed using BEAGLE 3.0, resulting in an analysis dataset of 1,033,244 SNPs. The data were analyzed using a generalized linear model. Results No genome-wide significant associations were found. The most significant results implicated the following genes: PRKG1, FLNC, TCERG1L, PPM1H, NXPH1, PPM1H, CDH13, HK1 and HKDC1. Conclusions The current analyses are a useful addition to the present literature and will make a valuable contribution to future meta-analyses. The candidate gene findings are consistent with a prior meta-analysis in suggesting that the effects of ADHD risk variants must, individually, be very small and/or include multiple rare alleles. PMID:20732627

  6. Heritability and Genome-Wide Association Analyses of Human Gait Suggest Contribution of Common Variants.

    PubMed

    Adams, Hieab H H; Verlinden, Vincentius J A; Callisaya, Michele L; van Duijn, Cornelia M; Hofman, Albert; Thomson, Russell; Uitterlinden, André G; Vernooij, Meike W; van der Geest, Jos N; Srikanth, Velandai; Ikram, M Arfan

    2016-06-01

    Human gait is a complex neurological and musculoskeletal function, of which the genetic basis remains largely unknown. To determine the influence of common genetic variants on gait parameters, we studied 2,946 participants of the Rotterdam Study, a population-based cohort of unrelated elderly individuals. We assessed 30 gait parameters using an electronic walkway, which yielded seven independent gait domains after principal component analysis. Genotypes of participants were imputed to the 1,000 Genomes reference panel for generating genetic relationship matrices to estimate heritability of gait parameters, and for subsequent genome-wide association scans (GWASs) to identify specific variants. Gait domains with the highest age- and sex-adjusted heritability were Variability (h (2) = 61%), Rhythm (37%), and Tandem (32%). For other gait domains, heritability estimates attenuated after adjustment for height and weight. Genome-wide association scans identified a variant on 1p22.3 that was significantly associated with single support time, a variable from the Rhythm domain (rs72953990; N = 2,946; β [SE] = 0.0069 (0.0012), p = 2.30×10(-8)). This variant did not replicate in an independent sample (N = 362; p = .78). In conclusion, human gait has highly heritable components that are explained by common genetic variation, which are partly attributed to height and weight. Collaborative efforts are needed to identify robust single variant associations for the heritable parameters. PMID:26219847

  7. A Genome-wide association study of self-rated health

    PubMed Central

    Mosing, Miriam A.; Verweij, Karin J.H.; Medland, Sarah E.; Painter, Jodie; Gordon, Scott D.; Heath, Andrew C.; Madden, Pamela A.; Montgomery, Grant W.; Martin, Nicholas G.

    2011-01-01

    Self-rated health questions have been proven to be a highly reliable and valid measure of overall health as measured by other indicators in many population groups. It also has been shown to be a very good predictor of mortality, chronic or severe diseases, and the need for services, and is positively correlated with clinical assessments. Genetic factors have been estimated to account for 25 – 64% of the variance in the liability of self-rated health. The aim of the present study was to identify Single Nucleotide Polymorphisms (SNPs) underlying the heritability of self-rated health by conducting a genome-wide association analysis in a large sample of 6,706 Australian individuals aged 18–92. No genome wide significant SNPs associated with self-rated health could be identified, indicating that self-rated health may be influenced by a large number of SNPs with very small effect size. A very large sample will be needed to identify these SNPs. PMID:20707712

  8. Genome Wide Analysis of Fertility and Production Traits in Italian Holstein Cattle

    PubMed Central

    Stella, Alessandra; Biffani, Stefano; Negrini, Riccardo; Lazzari, Barbara; Ajmone-Marsan, Paolo; Williams, John L .

    2013-01-01

    A genome wide scan was performed on a total of 2093 Italian Holstein proven bulls genotyped with 50K single nucleotide polymorphisms (SNPs), with the objective of identifying loci associated with fertility related traits and to test their effects on milk production traits. The analysis was carried out using estimated breeding values for the aggregate fertility index and for each trait contributing to the index: angularity, calving interval, non-return rate at 56 days, days to first service, and 305 day first parity lactation. In addition, two production traits not included in the aggregate fertility index were analysed: fat yield and protein yield. Analyses were carried out using all SNPs treated separately, further the most significant marker on BTA14 associated to milk quality located in the DGAT1 region was treated as fixed effect. Genome wide association analysis identified 61 significant SNPs and 75 significant marker-trait associations. Eight additional SNP associations were detected when SNP located near DGAT1 was included as a fixed effect. As there were no obvious common SNPs between the traits analyzed independently in this study, a network analysis was carried out to identify unforeseen relationships that may link production and fertility traits. PMID:24265800

  9. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    PubMed Central

    Kijas, James W.; Townley, David; Dalrymple, Brian P.; Heaton, Michael P.; Maddox, Jillian F.; McGrath, Annette; Wilson, Peter; Ingersoll, Roxann G.; McCulloch, Russell; McWilliam, Sean; Tang, Dave; McEwan, John; Cockett, Noelle; Oddy, V. Hutton; Nicholas, Frank W.; Raadsma, Herman

    2009-01-01

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability. PMID:19270757

  10. Genetic Diversity in the Modern Horse Illustrated from Genome-Wide SNP Data

    PubMed Central

    Petersen, Jessica L.; Mickelson, James R.; Cothran, E. Gus; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Distl, Ottmar; Felicetti, Michela; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Silvestrelli, Maurizio; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; M. Wade, Claire; McCue, Molly E.

    2013-01-01

    Horses were domesticated from the Eurasian steppes 5,000–6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection. PMID:23383025

  11. Insect herbivory elicits genome-wide alternative splicing responses in Nicotiana attenuata.

    PubMed

    Ling, Zhihao; Zhou, Wenwu; Baldwin, Ian T; Xu, Shuqing

    2015-10-01

    Changes in gene expression and alternative splicing (AS) are involved in many responses to abiotic and biotic stresses in eukaryotic organisms. In response to attack and oviposition by insect herbivores, plants elicit rapid changes in gene expression which are essential for the activation of plant defenses; however, the herbivory-induced changes in AS remain unstudied. Using mRNA sequencing, we performed a genome-wide analysis on tobacco hornworm (Manduca sexta) feeding-induced AS in both leaves and roots of Nicotiana attenuata. Feeding by M. sexta for 5 h reduced total AS events by 7.3% in leaves but increased them in roots by 8.0% and significantly changed AS patterns in leaves and roots of existing AS genes. Feeding by M. sexta also resulted in increased (in roots) and decreased (in leaves) transcript levels of the serine/arginine-rich (SR) proteins that are involved in the AS machinery of plants and induced changes in SR gene expression that were jasmonic acid (JA)-independent in leaves but JA-dependent in roots. Changes in AS and gene expression elicited by M. sexta feeding were regulated independently in both tissues. This study provides genome-wide evidence that insect herbivory induces changes not only in the levels of gene expression but also in their splicing, which might contribute to defense against and/or tolerance of herbivory. PMID:26306554

  12. Genome-wide association analyses in East Asians identify new susceptibility loci for colorectal cancer

    PubMed Central

    Jia, Wei-Hua; Zhang, Ben; Matsuo, Keitaro; Shin, Aesun; Xiang, Yong-Bing; Jee, Sun Ha; Kim, Dong-Hyun; Ren, Zefang; Cai, Qiuyin; Long, Jirong; Shi, Jiajun; Wen, Wanqing; Yang, Gong; Delahanty, Ryan J.; Ji, Bu-Tian; Pan, Zhi-Zhong; Matsuda, Fumihiko; Gao, Yu-Tang; Oh, Jae Hwan; Ahn, Yoon-Ok; Park, Eun Jung; Li, Hong-Lan; Park, Ji Won; Jo, Jaeseong; Jeong, Jin-Young; Hosono, Satoyo; Casey, Graham; Peters, Ulrike; Shu, Xiao-Ou; Zeng, Yi-Xin; Zheng, Wei

    2013-01-01

    To identify novel genetic factors for colorectal cancer (CRC), we conducted a genome-wide association study in East Asians. By analyzing genome-wide data in 2,098 cases and 5,749 controls, we selected 64 promising SNPs for replication in an independent set of samples including up to 5,358 cases and 5,922 controls. We identified four SNPs with a P-value of 8.58 × 10−7 to 3.77 × 10−10 in the combined analysis of all East Asian samples. Three of the four SNPs were replicated in a study conducted among 26,060 European descendants with a combined P-value of 1.22 × 10−10 for rs647161 (5q31.1), 6.64 × 10−9 for rs2423279 (20p12.3), and 3.06 × 10−8 for rs10774214 (12p13.32 near the CCND2 gene), respectively, derived from the meta-analysis of data from both East Asian and European populations. This study identified three new CRC susceptibility loci and provides additional insight into the genetics and biology of CRC. PMID:23263487

  13. Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome

    PubMed Central

    Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed M. Vargas; Parker, Brian J.; Rasmussen, Morten; Lindgreen, Stinus; Lilje, Berit; Tobin, Desmond J.; Kelly, Theresa K.; Vang, Søren; Andersson, Robin; Jones, Peter A.; Hoover, Cindi A.; Tikhonov, Alexei; Prokhortchouk, Egor; Rubin, Edward M.; Sandelin, Albin; Gilbert, M. Thomas P.; Krogh, Anders; Willerslev, Eske; Orlando, Ludovic

    2014-01-01

    Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics. PMID:24299735

  14. Opportunities for genome-wide selection for pig breeding in developing countries.

    PubMed

    Akanno, E C; Schenkel, F S; Sargolzaei, M; Friendship, R M; Robinson, J A B

    2013-07-26

    Genetic improvement of exotic and indigenous pigs in tropical developing countries is desired. Implementations of traditional selection methods on tropical pig populations are limited by lack of data recording and analysis infrastructure. Genome-wide selection (GS) provides an approach for achieving faster genetic progress without developing a pedigree recording system. The implications of GS on long term gain and inbreeding should be studied before actual implementation especially where low linkage disequilibrium (LD) is anticipated in the target population. A simulation case-study of this option was carried out based on the available 60 K SNP panel for porcine genome. Computer simulation was used to explore the effects of various selection methods, trait heritability and different breeding programs when applying GS. Genomic predictions were based on the ridge regression method. Genome-wide selection performed better than BLUP and phenotypic selection methods by increasing genetic gain and maintaining genetic variation while lowering inbreeding especially for traits with low heritability. Indigenous pig populations with low LD can be improved by using GS if high density marker panels are available. The combination of GS with repeated backcrossing of crossbreds to exotic pigs in developing countries promises to rapidly improve the genetic merit of the commercial population. Application of this novel method on a real population will need to be carried out to validate these results. PMID:23893977

  15. Opportunities for genome-wide selection for pig breeding in developing countries.

    PubMed

    Akanno, E C; Schenkel, F S; Sargolzaei, M; Friendship, R M; Robinson, J A B

    2013-10-01

    Genetic improvement of exotic and indigenous pigs in tropical developing countries is desired. Implementations of traditional selection methods on tropical pig populations are limited by lack of data recording and analysis infrastructure. Genome-wide selection (GS) provides an approach for achieving faster genetic progress without developing a pedigree recording system. The implications of GS on long-term gain and inbreeding should be studied before actual implementation, especially where low linkage disequilibrium (LD) is anticipated in the target population. A simulation case study of this option was performed on the basis of the available 60,000 SNP panel for porcine genome. Computer simulation was used to explore the effects of various selection methods, trait heritability, and different breeding programs when applying GS. Genomic predictions were based on the ridge regression method. Genome-wide selection performed better than BLUP and phenotypic selection methods by increasing genetic gain and maintaining genetic variation while lowering inbreeding, especially for traits with low heritability. Indigenous pig populations with low LD can be improved by using GS if high-density marker panels are available. The combination of GS with repeated backcrossing of crossbreds to exotic pigs in developing countries promises to rapidly improve the genetic merit of the commercial population. Application of this novel method on a real population will need to be performed to validate these results. PMID:24078617

  16. Natural CMT2 Variation Is Associated With Genome-Wide Methylation Changes and Temperature Seasonality

    PubMed Central

    Shen, Xia; De Jonge, Jennifer; Forsberg, Simon K. G.; Pettersson, Mats E.; Sheng, Zheya; Hennig, Lars; Carlborg, Örjan

    2014-01-01

    As Arabidopsis thaliana has colonized a wide range of habitats across the world it is an attractive model for studying the genetic mechanisms underlying environmental adaptation. Here, we used public data from two collections of A. thaliana accessions to associate genetic variability at individual loci with differences in climates at the sampling sites. We use a novel method to screen the genome for plastic alleles that tolerate a broader climate range than the major allele. This approach reduces confounding with population structure and increases power compared to standard genome-wide association methods. Sixteen novel loci were found, including an association between Chromomethylase 2 (CMT2) and temperature seasonality where the genome-wide CHH methylation was different for the group of accessions carrying the plastic allele. Cmt2 mutants were shown to be more tolerant to heat-stress, suggesting genetic regulation of epigenetic modifications as a likely mechanism underlying natural adaptation to variable temperatures, potentially through differential allelic plasticity to temperature-stress. PMID:25503602

  17. GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies

    PubMed Central

    Alonso, Arnald; Marsal, Sara; Tortosa, Raül; Canela-Xandri, Oriol; Julià, Antonio

    2013-01-01

    We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. PMID:23844243

  18. Methods for Investigating Gene-Environment Interactions in Candidate Pathway and Genome-Wide Association Studies

    PubMed Central

    Thomas, Duncan

    2010-01-01

    Despite the considerable enthusiasm about the yield of novel and replicated discoveries of genetic associations from the new generation of genome-wide association studies (GWAS), the proportion of the heritability of most complex diseases that have been studied to date remains small. Some of this “dark matter” could be due to gene-environment (G×E) interactions or more complex pathways involving multiple genes and exposures. We review the basic epidemiologic study design and statistical analysis approaches to studying G×E interactions individually and then consider more comprehensive approaches to studying entire pathways or GWAS data. In addition to the usual issues in genetic association studies, particular care is needed in exposure assessment and very large sample sizes are required. Although hypothesis-driven pathway-based and “agnostic” GWAS approaches are generally viewed as opposite poles, we suggest that the two can be usefully married using hierarchical modeling strategies that exploit external pathway knowledge in mining genome-wide data. PMID:20070199

  19. Differential network analysis reveals the genome-wide landscape of estrogen receptor modulation in hormonal cancers

    PubMed Central

    Hsiao, Tzu-Hung; Chiu, Yu-Chiao; Hsu, Pei-Yin; Lu, Tzu-Pin; Lai, Liang-Chuan; Tsai, Mong-Hsun; Huang, Tim H.-M.; Chuang, Eric Y.; Chen, Yidong

    2016-01-01

    Several mutual information (MI)-based algorithms have been developed to identify dynamic gene-gene and function-function interactions governed by key modulators (genes, proteins, etc.). Due to intensive computation, however, these methods rely heavily on prior knowledge and are limited in genome-wide analysis. We present the modulated gene/gene set interaction (MAGIC) analysis to systematically identify genome-wide modulation of interaction networks. Based on a novel statistical test employing conjugate Fisher transformations of correlation coefficients, MAGIC features fast computation and adaption to variations of clinical cohorts. In simulated datasets MAGIC achieved greatly improved computation efficiency and overall superior performance than the MI-based method. We applied MAGIC to construct the estrogen receptor (ER) modulated gene and gene set (representing biological function) interaction networks in breast cancer. Several novel interaction hubs and functional interactions were discovered. ER+ dependent interaction between TGFβ and NFκB was further shown to be associated with patient survival. The findings were verified in independent datasets. Using MAGIC, we also assessed the essential roles of ER modulation in another hormonal cancer, ovarian cancer. Overall, MAGIC is a systematic framework for comprehensively identifying and constructing the modulated interaction networks in a whole-genome landscape. MATLAB implementation of MAGIC is available for academic uses at https://github.com/chiuyc/MAGIC. PMID:26972162

  20. A Genome-Wide Survey of Date Palm Cultivars Supports Two Major Subpopulations in Phoenix dactylifera.

    PubMed

    Mathew, Lisa S; Seidel, Michael A; George, Binu; Mathew, Sweety; Spannagl, Manuel; Haberer, Georg; Torres, Maria F; Al-Dous, Eman K; Al-Azwani, Eman K; Diboun, Ilhem; Krueger, Robert R; Mayer, Klaus F X; Mohamoud, Yasmin Ali; Suhre, Karsten; Malek, Joel A

    2015-07-01

    The date palm (Phoenix dactylifera L.) is one of the oldest cultivated trees and is intimately tied to the history of human civilization. There are hundreds of commercial cultivars with distinct fruit shapes, colors, and sizes growing mainly in arid lands from the west of North Africa to India. The origin of date palm domestication is still uncertain, and few studies have attempted to document genetic diversity across multiple regions. We conducted genotyping-by-sequencing on 70 female cultivar samples from across the date palm-growing regions, including four Phoenix species as the outgroup. Here, for the first time, we generate genome-wide genotyping data for 13,000-65,000 SNPs in a diverse set of date palm fruit and leaf samples. Our analysis provides the first genome-wide evidence confirming recent findings that the date palm cultivars segregate into two main regions of shared genetic background from North Africa and the Arabian Gulf. We identify genomic regions with high densities of geographically segregating SNPs and also observe higher levels of allele fixation on the recently described X-chromosome than on the autosomes. Our results fit a model with two centers of earliest cultivation including date palms autochthonous to North Africa. These results adjust our understanding of human agriculture history and will provide the foundation for more directed functional studies and a better understanding of genetic diversity in date palm. PMID:25957276

  1. Profiling the genome-wide DNA methylation pattern of porcine ovaries using reduced representation bisulfite sequencing

    PubMed Central

    Yuan, Xiao-Long; Gao, Ning; Xing, Yan; Zhang, Hai-Bin; Zhang, Ai-Ling; Liu, Jing; He, Jin-Long; Xu, Yuan; Lin, Wen-Mian; Chen, Zan-Mou; Zhang, Hao; Zhang, Zhe; Li, Jia-Qi

    2016-01-01

    Substantial evidence has shown that DNA methylation regulates the initiation of ovarian and sexual maturation. Here, we investigated the genome-wide profile of DNA methylation in porcine ovaries at single-base resolution using reduced representation bisulfite sequencing. The biological variation was minimal among the three ovarian replicates. We found hypermethylation frequently occurred in regions with low gene abundance, while hypomethylation in regions with high gene abundance. The DNA methylation around transcriptional start sites was negatively correlated with their own CpG content. Additionally, the methylation level in the bodies of genes was higher than that in their 5′ and 3′ flanking regions. The DNA methylation pattern of the low CpG content promoter genes differed obviously from that of the high CpG content promoter genes. The DNA methylation level of the porcine ovary was higher than that of the porcine intestine. Analyses of the genome-wide DNA methylation in porcine ovaries would advance the knowledge and understanding of the porcine ovarian methylome. PMID:26912189

  2. Genome-wide association study on reproductive traits in Jinghai Yellow Chicken.

    PubMed

    Zhang, G X; Fan, Q C; Wang, J Y; Zhang, T; Xue, Q; Shi, H Q

    2015-12-01

    To identify molecular markers and candidate genes associated with reproductive traits, a genome-wide analysis was performed in Jinghai Yellow Chickens to analyze body weight at first oviposition (BWF), age at first oviposition (AFE), weight of the egg at first oviposition (FEW), egg weight at the age of 300 days (EW300), number of eggs produced by 300 days of age (EN300), egg hatchability (HA) and multiple selection index for egg production (MSI). The results showed that seven single nucleotide polymorphisms (SNPs) were associated with reproductive traits (P<1.80E-6, Bonferroni correction). The P-values of the seven SNPs were 5.62E-10, 3.45E-08, 9.76E-07, 8.90E-07, 1.12E-06, 1.42E-07 and 1.48E-07, respectively. These SNPs were located in close proximity to or within the sequence of the five candidate genes, including FAM184B, TTL, RGS1, FBLN5 and PCNX. An additional 46 SNPs that could be associated with reproductive traits were identified (P<3.59E-5, Bonferroni correction). Identification of the candidate genes as well as genome-wide SNPs that may be associated with reproductive traits will greatly advance the understanding of the genetic basis and molecular mechanisms underlying reproductive traits and may have practical significance in breeding programs for the improvements of reproductive traits in the Jinghai Yellow Chicken. PMID:26498507

  3. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development.

    PubMed

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J; Zhou, Xiang; Liu, Bang; Woods, Leah C Solberg; Fuchs, Rita A

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals. PMID:26722221

  4. Genome-wide association study identifies novel susceptibility loci for cutaneous squamous cell carcinoma

    PubMed Central

    Chahal, Harvind S.; Lin, Yuan; Ransohoff, Katherine J.; Hinds, David A.; Wu, Wenting; Dai, Hong-Ji; Qureshi, Abrar A.; Li, Wen-Qing; Kraft, Peter; Tang, Jean Y.; Han, Jiali; Sarin, Kavita Y.

    2016-01-01

    Cutaneous squamous cell carcinoma represents the second most common cutaneous malignancy, affecting 7–11% of Caucasians in the United States. The genetic determinants of susceptibility to cutaneous squamous cell carcinoma remain largely unknown. Here we report the results of a two-stage genome-wide association study of cutaneous squamous cell carcinoma, totalling 7,404 cases and 292,076 controls. Eleven loci reached genome-wide significance (P<5 × 10−8) including seven previously confirmed pigmentation-related loci: MC1R, ASIP, TYR, SLC45A2, OCA2, IRF4 and BNC2. We identify an additional four susceptibility loci: 11q23.3 CADM1, a metastasis suppressor gene involved in modifying tumour interaction with cell-mediated immunity; 2p22.3; 7p21.1 AHR, the dioxin receptor involved in anti-apoptotic pathways and melanoma progression; and 9q34.3 SEC16A, a putative oncogene with roles in secretion and cellular proliferation. These susceptibility loci provide deeper insight into the pathogenesis of squamous cell carcinoma. PMID:27424798

  5. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence.

    PubMed

    Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Smith, G D; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A

    2016-08-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case-control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence. PMID:26239293

  6. Genome-Wide Expression of MicroRNAs Is Regulated by DNA Methylation in Hepatocarcinogenesis

    PubMed Central

    Shen, Jing; Wang, Shuang; Siegel, Abby B.; Remotti, Helen; Wang, Qiao; Sirosh, Iryna; Santella, Regina M.

    2015-01-01

    Background. Previous studies, including ours, have examined the regulation of microRNAs (miRNAs) by DNA methylation, but whether this regulation occurs at a genome-wide level in hepatocellular carcinoma (HCC) is unclear. Subjects/Methods. Using a two-phase study design, we conducted genome-wide screening for DNA methylation and miRNA expression to explore the potential role of methylation alterations in miRNAs regulation. Results. We found that expressions of 25 miRNAs were statistically significantly different between tumor and nontumor tissues and perfectly differentiated HCC tumor from nontumor. Six miRNAs were overexpressed, and 19 were repressed in tumors. Among 133 miRNAs with inverse correlations between methylation and expression, 8 miRNAs (6%) showed statistically significant differences in expression between tumor and nontumor tissues. Six miRNAs were validated in 56 additional paired HCC tissues, and significant inverse correlations were observed for miR-125b and miR-199a, which is consistent with the inactive chromatin pattern found in HepG2 cells. Conclusion. These data suggest that the expressions of miR-125b and miR-199a are dramatically regulated by DNA hypermethylation that plays a key role in hepatocarcinogenesis. PMID:25861255

  7. Genome-wide association study identifies six new loci influencing pulse pressure and mean arterial pressure

    PubMed Central

    Wain, Louise V; Verwoert, Germaine C; O’Reilly, Paul F; Shi, Gang; Johnson, Toby; Johnson, Andrew D; Bochud, Murielle; Rice, Kenneth M; Henneman, Peter; Smith, Albert V; Ehret, Georg B; Amin, Najaf; Larson, Martin G; Mooser, Vincent; Hadley, David; Dörr, Marcus; Bis, Joshua C; Aspelund, Thor; Esko, Tõnu; Janssens, A Cecile JW; Zhao, Jing Hua; Heath, Simon; Laan, Maris; Fu, Jingyuan; Pistis, Giorgio; Luan, Jian’an; Arora, Pankaj; Lucas, Gavin; Pirastu, Nicola; Pichler, Irene; Jackson, Anne U; Webster, Rebecca J; Zhang, Feng; Peden, John F; Schmidt, Helena; Tanaka, Toshiko; Campbell, Harry; Igl, Wilmar; Milaneschi, Yuri; Hotteng, Jouke-Jan; Vitart, Veronique; Chasman, Daniel I; Trompet, Stella; Bragg-Gresham, Jennifer L; Alizadeh, Behrooz Z; Chambers, John C; Guo, Xiuqing; Lehtimäki, Terho; Kühnel, Brigitte; Lopez, Lorna M; Polašek, Ozren; Boban, Mladen; Nelson, Christopher P; Morrison, Alanna C; Pihur, Vasyl; Ganesh, Santhi K; Hofman, Albert; Kundu, Suman; Mattace-Raso, Francesco US; Rivadeneira, Fernando; Sijbrands, Eric JG; Uitterlinden, Andre G; Hwang, Shih-Jen; Vasan, Ramachandran S; Wang, Thomas J; Bergmann, Sven; Vollenweider, Peter; Waeber, Gérard; Laitinen, Jaana; Pouta, Anneli; Zitting, Paavo; McArdle, Wendy L; Kroemer, Heyo K; Völker, Uwe; Völzke, Henry; Glazer, Nicole L; Taylor, Kent D; Harris, Tamara B; Alavere, Helene; Haller, Toomas; Keis, Aime; Tammesoo, Mari-Liis; Aulchenko, Yurii; Barroso, Inês; Khaw, Kay-Tee; Galan, Pilar; Hercberg, Serge; Lathrop, Mark; Eyheramendy, Susana; Org, Elin; Sõber, Siim; Lu, Xiaowen; Nolte, Ilja M; Penninx, Brenda W; Corre, Tanguy; Masciullo, Corrado; Sala, Cinzia; Groop, Leif; Voight, Benjamin F; Melander, Olle; O’Donnell, Christopher J; Salomaa, Veikko; d’Adamo, Adamo Pio; Fabretto, Antonella; Faletra, Flavio; Ulivi, Sheila; Del Greco, M Fabiola; Facheris, Maurizio; Collins, Francis S; Bergman, Richard N; Beilby, John P; Hung, Joseph; Musk, A William; Mangino, Massimo; Shin, So-Youn; Soranzo, Nicole; Watkins, Hugh; Goel, Anuj; Hamsten, Anders; Gider, Pierre; Loitfelder, Marisa; Zeginigg, Marion; Hernandez, Dena; Najjar, Samer S; Navarro, Pau; Wild, Sarah H; Corsi, Anna Maria; Singleton, Andrew; de Geus, Eco JC; Willemsen, Gonneke; Parker, Alex N; Rose, Lynda M; Buckley, Brendan; Stott, David; Orru, Marco; Uda, Manuela; van der Klauw, Melanie M; Zhang, Weihua; Li, Xinzhong; Scott, James; Chen, Yii-Der Ida; Burke, Gregory L; Kähönen, Mika; Viikari, Jorma; Döring, Angela; Meitinger, Thomas; Davies, Gail; Starr, John M; Emilsson, Valur; Plump, Andrew; Lindeman, Jan H; ’t Hoen, Peter AC; König, Inke R; Felix, Janine F; Clarke, Robert; Hopewell, Jemma C; Ongen, Halit; Breteler, Monique; Debette, Stéphanie; DeStefano, Anita L; Fornage, Myriam; Mitchell, Gary F; Smith, Nicholas L; Holm, Hilma; Stefansson, Kari; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Samani, Nilesh J; Preuss, Michael; Rudan, Igor; Hayward, Caroline; Deary, Ian J; Wichmann, H-Erich; Raitakari, Olli T; Palmas, Walter; Kooner, Jaspal S; Stolk, Ronald P; Jukema, J Wouter; Wright, Alan F; Boomsma, Dorret I; Bandinelli, Stefania; Gyllensten, Ulf B; Wilson, James F; Ferrucci, Luigi; Schmidt, Reinhold; Farrall, Martin; Spector, Tim D; Palmer, Lyle J; Tuomilehto, Jaakko; Pfeufer, Arne; Gasparini, Paolo; Siscovick, David; Altshuler, David; Loos, Ruth JF; Toniolo, Daniela; Snieder, Harold; Gieger, Christian; Meneton, Pierre; Wareham, Nicholas J; Oostra, Ben A; Metspalu, Andres; Launer, Lenore; Rettig, Rainer; Strachan, David P; Beckmann, Jacques S; Witteman, Jacqueline CM; Erdmann, Jeanette; van Dijk, Ko Willems; Boerwinkle, Eric; Boehnke, Michael; Ridker, Paul M; Jarvelin, Marjo-Riitta; Chakravarti, Aravinda; Abecasis, Goncalo R; Gudnason, Vilmundur; Newton-Cheh, Christopher; Levy, Daniel; Munroe, Patricia B; Psaty, Bruce M; Caulfield, Mark J; Rao, Dabeeru C

    2012-01-01

    Numerous genetic loci influence systolic blood pressure (SBP) and diastolic blood pressure (DBP) in Europeans 1-3. We now report genome-wide association studies of pulse pressure (PP) and mean arterial pressure (MAP). In discovery (N=74,064) and follow-up studies (N=48,607), we identified at genome-wide significance (P= 2.7×10-8 to P=2.3×10-13) four novel PP loci (at 4q12 near CHIC2/PDGFRAI, 7q22.3 near PIK3CG, 8q24.12 in NOV, 11q24.3 near ADAMTS-8), two novel MAP loci (3p21.31 in MAP4, 10q25.3 near ADRB1) and one locus associated with both traits (2q24.3 near FIGN) which has recently been associated with SBP in east Asians. For three of the novel PP signals, the estimated effect for SBP was opposite to that for DBP, in contrast to the majority of common SBP- and DBP-associated variants which show concordant effects on both traits. These findings indicate novel genetic mechanisms underlying blood pressure variation, including pathways that may differentially influence SBP and DBP. PMID:21909110

  8. Genome-wide association studies for fatty acid metabolic traits in five divergent pig populations

    PubMed Central

    Zhang, Wanchang; Bin Yang; Zhang, Junjie; Cui, Leilei; Ma, Junwu; Chen, Congying; Ai, Huashui; Xiao, Shijun; Ren, Jun; Huang, Lusheng

    2016-01-01

    Fatty acid composition profiles are important indicators of meat quality and tasting flavor. Metabolic indices of fatty acids are more authentic to reflect meat nutrition and public acceptance. To investigate the genetic mechanism of fatty acid metabolic indices in pork, we conducted genome-wide association studies (GWAS) for 33 fatty acid metabolic traits in five pig populations. We identified a total of 865 single nucleotide polymorphisms (SNPs), corresponding to 11 genome-wide significant loci on nine chromosomes and 12 suggestive loci on nine chromosomes. Our findings not only confirmed seven previously reported QTL with stronger association strength, but also revealed four novel population-specific loci, showing that investigations on intermediate phenotypes like the metabolic traits of fatty acids can increase the statistical power of GWAS for end-point phenotypes. We proposed a list of candidate genes at the identified loci, including three novel genes (FADS2, SREBF1 and PLA2G7). Further, we constructed the functional networks involving these candidate genes and deduced the potential fatty acid metabolic pathway. These findings advance our understanding of the genetic basis of fatty acid composition in pigs. The results from European hybrid commercial pigs can be immediately transited into breeding practice for beneficial fatty acid composition. PMID:27097669

  9. Novel loci associated with usual sleep duration: the CHARGE Consortium Genome-Wide Association Study.

    PubMed

    Gottlieb, D J; Hek, K; Chen, T-H; Watson, N F; Eiriksdottir, G; Byrne, E M; Cornelis, M; Warby, S C; Bandinelli, S; Cherkas, L; Evans, D S; Grabe, H J; Lahti, J; Li, M; Lehtimäki, T; Lumley, T; Marciante, K D; Pérusse, L; Psaty, B M; Robbins, J; Tranah, G J; Vink, J M; Wilk, J B; Stafford, J M; Bellis, C; Biffar, R; Bouchard, C; Cade, B; Curhan, G C; Eriksson, J G; Ewert, R; Ferrucci, L; Fülöp, T; Gehrman, P R; Goodloe, R; Harris, T B; Heath, A C; Hernandez, D; Hofman, A; Hottenga, J-J; Hunter, D J; Jensen, M K; Johnson, A D; Kähönen, M; Kao, L; Kraft, P; Larkin, E K; Lauderdale, D S; Luik, A I; Medici, M; Montgomery, G W; Palotie, A; Patel, S R; Pistis, G; Porcu, E; Quaye, L; Raitakari, O; Redline, S; Rimm, E B; Rotter, J I; Smith, A V; Spector, T D; Teumer, A; Uitterlinden, A G; Vohl, M-C; Widen, E; Willemsen, G; Young, T; Zhang, X; Liu, Y; Blangero, J; Boomsma, D I; Gudnason, V; Hu, F; Mangino, M; Martin, N G; O'Connor, G T; Stone, K L; Tanaka, T; Viikari, J; Gharib, S A; Punjabi, N M; Räikkönen, K; Völzke, H; Mignot, E; Tiemeier, H

    2015-10-01

    Usual sleep duration is a heritable trait correlated with psychiatric morbidity, cardiometabolic disease and mortality, although little is known about the genetic variants influencing this trait. A genome-wide association study (GWAS) of usual sleep duration was conducted using 18 population-based cohorts totaling 47 180 individuals of European ancestry. Genome-wide significant association was identified at two loci. The strongest is located on chromosome 2, in an intergenic region 35- to 80-kb upstream from the thyroid-specific transcription factor PAX8 (lowest P=1.1 × 10(-9)). This finding was replicated in an African-American sample of 4771 individuals (lowest P=9.3 × 10(-4)). The strongest combined association was at rs1823125 (P=1.5 × 10(-10), minor allele frequency 0.26 in the discovery sample, 0.12 in the replication sample), with each copy of the minor allele associated with a sleep duration 3.1 min longer per night. The alleles associated with longer sleep duration were associated in previous GWAS with a more favorable metabolic profile and a lower risk of attention deficit hyperactivity disorder. Understanding the mechanisms underlying these associations may help elucidate biological mechanisms influencing sleep duration and its association with psychiatric, metabolic and cardiovascular disease. PMID:25469926

  10. Genome-wide association study identifies 12 new susceptibility loci for primary biliary cirrhosis.

    PubMed

    Mells, George F; Floyd, James A B; Morley, Katherine I; Cordell, Heather J; Franklin, Christopher S; Shin, So-Youn; Heneghan, Michael A; Neuberger, James M; Donaldson, Peter T; Day, Darren B; Ducker, Samantha J; Muriithi, Agnes W; Wheater, Elizabeth F; Hammond, Christopher J; Dawwas, Muhammad F; Jones, David E; Peltonen, Leena; Alexander, Graeme J; Sandford, Richard N; Anderson, Carl A

    2011-04-01

    In addition to the HLA locus, six genetic risk factors for primary biliary cirrhosis (PBC) have been identified in recent genome-wide association studies (GWAS). To identify additional loci, we carried out a GWAS using 1,840 cases from the UK PBC Consortium and 5,163 UK population controls as part of the Wellcome Trust Case Control Consortium 3 (WTCCC3). We followed up 28 loci in an additional UK cohort of 620 PBC cases and 2,514 population controls. We identified 12 new susceptibility loci (at a genome-wide significance level of P < 5 × 10⁻⁸) and replicated all previously associated loci. We identified three further new loci in a meta-analysis of data from our study and previously published GWAS results. New candidate genes include STAT4, DENND1B, CD80, IL7R, CXCR5, TNFRSF1A, CLEC16A and NFKB1. This study has considerably expanded our knowledge of the genetic architecture of PBC. PMID:21399635

  11. The genetics of loneliness: linking evolutionary theory to genome-wide genetics, epigenetics, and social science.

    PubMed

    Goossens, Luc; van Roekel, Eeske; Verhagen, Maaike; Cacioppo, John T; Cacioppo, Stephanie; Maes, Marlies; Boomsma, Dorret I

    2015-03-01

    As a complex trait, loneliness is likely to be influenced by the interplay of numerous genetic and environmental factors. Studies in behavioral genetics indicate that loneliness has a sizable degree of heritability. Candidate-gene and gene-expression studies have pointed to several genes related to neurotransmitters and the immune system. The notion that these genes are related to loneliness is compatible with the basic tenets of the evolutionary theory of loneliness. Research on gene-environment interactions indicates that social-environmental factors (e.g., low social support) may have a more pronounced effect and lead to higher levels of loneliness if individuals carry the sensitive variant of these candidate genes. Currently, there is no extant research on loneliness based on genome-wide association studies, gene-environment-interaction studies, or studies in epigenetics. Such studies would allow researchers to identify networks of genes that contribute to loneliness. The contribution of genetics to loneliness research will become stronger when genome-wide genetics and epigenetics are integrated and used along with well-established methods in psychology to analyze the complex process of gene-environment interplay. PMID:25910391

  12. A genome-wide study on the perception of the odorants androstenone and galaxolide.

    PubMed

    Knaapila, Antti; Zhu, Gu; Medland, Sarah E; Wysocki, Charles J; Montgomery, Grant W; Martin, Nicholas G; Wright, Margaret J; Reed, Danielle R

    2012-07-01

    Twin pairs and their siblings rated the intensity of the odorants amyl acetate, androstenone, eugenol, Galaxolide, mercaptans, and rose (N = 1573). Heritability was established for ratings of androstenone (h (2) = 0.30) and Galaxolide (h(2) = 0.34) but not for the other odorants. Genome-wide association analysis using 2.3 million single nucleotide polymorphisms indicated that the most significant association was between androstenone and a region without known olfactory receptor genes (rs10966900, P = 1.2 × 10(-7)). A previously reported association between the olfactory receptor OR7D4 and the androstenone was not detected until we specifically typed this gene (P = 1.1 × 10(-4)). We also tested these 2 associations in a second independent sample of subjects and replicated the results either fully (OR7D4, P = 0.00002) or partially (rs10966900, P = 0.010; N = 266). These findings suggest that 1) the perceived intensity of some but not all odorants is a heritable trait, 2) use of a current genome-wide marker panel did not detect a known olfactory genotype-phenotype association, and 3) person-to-person differences in androstenone perception are influenced by OR7D4 genotype and perhaps by variants of other genes. PMID:22362865

  13. Genome-wide association study identifies three novel loci for type 2 diabetes.

    PubMed

    Hara, Kazuo; Fujita, Hayato; Johnson, Todd A; Yamauchi, Toshimasa; Yasuda, Kazuki; Horikoshi, Momoko; Peng, Chen; Hu, Cheng; Ma, Ronald C W; Imamura, Minako; Iwata, Minoru; Tsunoda, Tatsuhiko; Morizono, Takashi; Shojima, Nobuhiro; So, Wing Yee; Leung, Ting Fan; Kwan, Patrick; Zhang, Rong; Wang, Jie; Yu, Weihui; Maegawa, Hiroshi; Hirose, Hiroshi; Kaku, Kohei; Ito, Chikako; Watada, Hirotaka; Tanaka, Yasushi; Tobe, Kazuyuki; Kashiwagi, Atsunori; Kawamori, Ryuzo; Jia, Weiping; Chan, Juliana C N; Teo, Yik Ying; Shyong, Tai E; Kamatani, Naoyuki; Kubo, Michiaki; Maeda, Shiro; Kadowaki, Takashi

    2014-01-01

    Although over 60 loci for type 2 diabetes (T2D) have been identified, there still remains a large genetic component to be clarified. To explore unidentified loci for T2D, we performed a genome-wide association study (GWAS) of 6 209 637 single-nucleotide polymorphisms (SNPs), which were directly genotyped or imputed using East Asian references from the 1000 Genomes Project (June 2011 release) in 5976 Japanese patients with T2D and 20 829 nondiabetic individuals. Nineteen unreported loci were selected and taken forward to follow-up analyses. Combined discovery and follow-up analyses (30 392 cases and 34 814 controls) identified three new loci with genome-wide significance, which were MIR129-LEP [rs791595; risk allele = A; risk allele frequency (RAF) = 0.080; P = 2.55 × 10(-13); odds ratio (OR) = 1.17], GPSM1 [rs11787792; risk allele = A; RAF = 0.874; P = 1.74 × 10(-10); OR = 1.15] and SLC16A13 (rs312457; risk allele = G; RAF = 0.078; P = 7.69 × 10(-13); OR = 1.20). This study demonstrates that GWASs based on the imputation of genotypes using modern reference haplotypes such as that from the 1000 Genomes Project data can assist in identification of new loci for common diseases. PMID:23945395

  14. A genome-wide association study of periodontitis in a Japanese population.

    PubMed

    Shimizu, S; Momozawa, Y; Takahashi, A; Nagasawa, T; Ashikawa, K; Terada, Y; Izumi, Y; Kobayashi, H; Tsuji, M; Kubo, M; Furuichi, Y

    2015-04-01

    Periodontitis is a multifactorial disease in which bacterial, lifestyle, and genetic factors are involved. Although previous genetic association studies identified several susceptibility genes for periodontitis in European populations, there is little information for Asian populations. Here, we conducted a genome-wide association study and a replication study consisting of 2,760 Japanese periodontitis patients and 15,158 Japanese controls. Although single-nucleotide polymorphisms that surpassed a stringent genome-wide significance threshold (P < 5 × 10(-8)) were not identified, we found 2 suggestive loci for periodontitis: KCNQ5 on chromosome 6q13 (rs9446777, P = 4.83 × 10(-6), odds ratio = 0.82) and GPR141-NME8 at chromosome 7p14.1 (rs2392510, P = 4.17 × 10(-6), odds ratio = 0.87). A stratified analysis indicated that the GPR141-NME8 locus had a strong genetic effect on the susceptibility to generalized periodontitis in Japanese individuals with a history of smoking. In conclusion, this study identified 2 suggestive loci for periodontitis in a Japanese population. This study should contribute to a further understanding of genetic factors for enhanced susceptibility to periodontitis. PMID:25672891

  15. Genome-wide genetic investigation of serological measures of common infections.

    PubMed

    Rubicz, Rohina; Yolken, Robert; Drigalenko, Eugene; Carless, Melanie A; Dyer, Thomas D; Kent, Jack; Curran, Joanne E; Johnson, Matthew P; Cole, Shelley A; Fowler, Sharon P; Arya, Rector; Puppala, Sobha; Almasy, Laura; Moses, Eric K; Kraig, Ellen; Duggirala, Ravindranath; Blangero, John; Leach, Charles T; Göring, Harald H H

    2015-11-01

    Populations and individuals differ in susceptibility to infections because of a number of factors, including host genetic variation. We previously demonstrated that differences in antibody titer, which reflect infection history, are significantly heritable. Here we attempt to identify the genetic factors influencing variation in these serological phenotypes. Blood samples from >1300 Mexican Americans were quantified for IgG antibody level against 12 common infections, selected on the basis of their reported role in cardiovascular disease risk: Chlamydia pneumoniae; Helicobacter pylori; Toxoplasma gondii; cytomegalovirus; herpes simplex I virus; herpes simplex II virus; human herpesvirus 6 (HHV6); human herpesvirus 8 (HHV8); varicella zoster virus; hepatitis A virus (HAV); influenza A virus; and influenza B virus. Pathogen-specific quantitative antibody levels were analyzed, as were three measures of pathogen burden. Genome-wide linkage and joint linkage and association analyses were performed using ~1 million SNPs. Significant linkage (lod scores >3.0) was obtained for HHV6 (on chromosome 7), HHV8 (on chromosome 6), and HAV (on chromosome 13). SNP rs4812712 on chromosome 20 was significantly associated with C. pneumoniae (P=5.3 × 10(-8)). However, no genome-wide significant loci were obtained for the other investigated antibodies. We conclude that it is possible to localize host genetic factors influencing some of these antibody traits, but that further larger-scale investigations will be required to elucidate the genetic mechanisms contributing to variation in antibody levels. PMID:25758998

  16. Genome-wide association study identifies four loci associated with eruption of permanent teeth.

    PubMed

    Geller, Frank; Feenstra, Bjarke; Zhang, Hao; Shaffer, John R; Hansen, Thomas; Esserlind, Ann-Louise; Boyd, Heather A; Nohr, Ellen A; Timpson, Nicholas J; Fatemifar, Ghazaleh; Paternoster, Lavinia; Evans, David M; Weyant, Robert J; Levy, Steven M; Lathrop, Mark; Smith, George Davey; Murray, Jeffrey C; Olesen, Jes; Werge, Thomas; Marazita, Mary L; Sørensen, Thorkild I A; Melbye, Mads

    2011-09-01

    The sequence and timing of permanent tooth eruption is thought to be highly heritable and can have important implications for the risk of malocclusion, crowding, and periodontal disease. We conducted a genome-wide association study of number of permanent teeth erupted between age 6 and 14 years, analyzed as age-adjusted standard deviation score averaged over multiple time points, based on childhood records for 5,104 women from the Danish National Birth Cohort. Four loci showed association at P<5×10(-8) and were replicated in four independent study groups from the United States and Denmark with a total of 3,762 individuals; all combined P-values were below 10(-11). Two loci agreed with previous findings in primary tooth eruption and were also known to influence height and breast cancer, respectively. The two other loci pointed to genomic regions without any previous significant genome-wide association study results. The intronic SNP rs7924176 in ADK could be linked to gene expression in monocytes. The combined effect of the four genetic variants was most pronounced between age 10 and 12 years, where children with 6 to 8 delayed tooth eruption alleles had on average 3.5 (95% confidence interval: 2.9-4.1) fewer permanent teeth than children with 0 or 1 of these alleles. PMID:21931568

  17. A Genome-Wide Association Study for Regulators of Micronucleus Formation in Mice

    PubMed Central

    McIntyre, Rebecca E.; Nicod, Jérôme; Robles-Espinoza, Carla Daniela; Maciejowski, John; Cai, Na; Hill, Jennifer; Verstraten, Ruth; Iyer, Vivek; Rust, Alistair G.; Balmus, Gabriel; Mott, Richard; Flint, Jonathan; Adams, David J.

    2016-01-01

    In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males. PMID:27233670

  18. phenosim - A software to simulate phenotypes for testing in genome-wide association studies

    PubMed Central

    2011-01-01

    Background There is a great interest in understanding the genetic architecture of complex traits in natural populations. Genome-wide association studies (GWAS) are becoming routine in human, animal and plant genetics to understand the connection between naturally occurring genotypic and phenotypic variation. Coalescent simulations are commonly used in population genetics to simulate genotypes under different parameters and demographic models. Results Here, we present phenosim, a software to add a phenotype to genotypes generated in time-efficient coalescent simulations. Both qualitative and quantitative phenotypes can be generated and it is possible to partition phenotypic variation between additive effects and epistatic interactions between causal variants. The output formats of phenosim are directly usable as input for different GWAS tools. The applicability of phenosim is shown by simulating a genome-wide association study in Arabidopsis thaliana. Conclusions By using the coalescent approach to generate genotypes and phenosim to add phenotypes, the data sets can be used to assess the influence of various factors such as demography, genetic architecture or selection on the statistical power of association methods to detect causal genetic variants under a wide variety of population genetic scenarios. phenosim is freely available from the authors' website http://evoplant.uni-hohenheim.de PMID:21714868

  19. Genome-Wide Tissue-Specific Farnesoid X Receptor Binding in Mouse Liver and Intestine

    PubMed Central

    Thomas, Ann M.; Hart, Steven N.; Kong, Bo; Fang, Jianwen; Zhong, Xiao-bo; Guo, Grace L.

    2016-01-01

    Farnesoid X receptor (FXR) is a bile acid-activated transcription factor belonging to the nuclear receptor superfamily. FXR is highly expressed in liver and intestine and crosstalk mediated by FXR in these two organs is critical in maintaining bile acid homeostasis. FXR deficiency has been implicated in many liver and intestine diseases. However, regulation of transcription by FXR at the genomic level is not known. This study analyzed genome-wide FXR binding in liver and intestine of mice treated with a synthetic FXR ligand (GW4064) by chromatin immunoprecipitation coupled to massively parallel sequencing (ChIP-seq). The results showed a large degree of tissue-specific FXR binding, with only 11% of total sites shared between liver and intestine. The sites were widely distributed between intergenic, upstream, intragenic, and downstream of genes, with novel sites identified within even known FXR target genes. Motif analysis revealed a half nuclear receptor binding site, normally bound by a few orphan nuclear receptors, adjacent to the FXR response elements, indicating possible involvement of some orphan nuclear receptors in modulating FXR function. Furthermore, pathway analysis indicated that FXR may be extensively involved in multiple cellular metabolic pathways. Conclusion This study reports genome-wide FXR binding in vivo and the results clearly demonstrate tissue-specific FXR/gene interaction. In addition, FXR may be involved in regulating broader biological pathways in maintaining hepatic and intestinal homeostasis. PMID:20091679

  20. Genome-wide association study of drought-related resistance traits in Aegilops tauschii

    PubMed Central

    Qin, Peng; Lin, Yu; Hu, Yaodong; Liu, Kun; Mao, Shuangshuang; Li, Zhanyi; Wang, Jirui; Liu, Yaxi; Wei, Yuming; Zheng, Youliang

    2016-01-01

    Abstract The D-genome progenitor of wheat (Triticum aestivum), Aegilops tauschii, possesses numerous genes for resistance to abiotic stresses, including drought. Therefore, information on the genetic architecture of A. tauschii can aid the development of drought-resistant wheat varieties. Here, we evaluated 13 traits in 373 A. tauschii accessions grown under normal and polyethylene glycol-simulated drought stress conditions and performed a genome-wide association study using 7,185 single nucleotide polymorphism (SNP) markers. We identified 208 and 28 SNPs associated with all traits using the general linear model and mixed linear model, respectively, while both models detected 25 significant SNPs with genome-wide distribution. Public database searches revealed several candidate/flanking genes related to drought resistance that were grouped into three categories according to the type of encoded protein (enzyme, storage protein, and drought-induced protein). This study provided essential information for SNPs and genes related to drought resistance in A. tauschii and wheat, and represents a foundation for breeding drought-resistant wheat cultivars using marker-assisted selection. PMID:27560650

  1. Genome-wide association for sensitivity to chronic oxidative stress in Drosophila melanogaster.

    PubMed

    Jordan, Katherine W; Craver, Kyle L; Magwire, Michael M; Cubilla, Carmen E; Mackay, Trudy F C; Anholt, Robert R H

    2012-01-01

    Reactive oxygen species (ROS) are a common byproduct of mitochondrial energy metabolism, and can also be induced by exogenous sources, including UV light, radiation, and environmental toxins. ROS generation is essential for maintaining homeostasis by triggering cellular signaling pathways and host defense mechanisms. However, an imbalance of ROS induces oxidative stress and cellular death and is associated with human disease, including age-related locomotor impairment. To identify genes affecting sensitivity and resistance to ROS-induced locomotor decline, we assessed locomotion of aged flies of the sequenced, wild-derived lines from the Drosophila melanogaster Genetics Reference Panel on standard medium and following chronic exposure to medium supplemented with 3 mM menadione sodium bisulfite (MSB). We found substantial genetic variation in sensitivity to oxidative stress with respect to locomotor phenotypes. We performed genome-wide association analyses to identify candidate genes associated with variation in sensitivity to ROS-induced decline in locomotor performance, and confirmed the effects for 13 of 16 mutations tested in these candidate genes. Candidate genes associated with variation in sensitivity to MSB-induced oxidative stress form networks of genes involved in neural development, immunity, and signal transduction. Many of these genes have human orthologs, highlighting the utility of genome-wide association in Drosophila for studying complex human disease. PMID:22715409

  2. Neuregulin-1 and schizophrenia in the genome-wide association study era.

    PubMed

    Mostaid, Md Shaki; Lloyd, David; Liberg, Benny; Sundram, Suresh; Pereira, Avril; Pantelis, Christos; Karl, Tim; Weickert, Cynthia Shannon; Everall, Ian P; Bousman, Chad A

    2016-09-01

    Clinical and pre-clinical evidence has implicated neuregulin 1 (NRG1) as a critical component in the pathophysiology of schizophrenia. However, the arrival of the genome-wide association study (GWAS) era has yielded results that challenge the relevance of NRG1 in schizophrenia due to the absence of a genome-wide significant NRG1 variant associated with schizophrenia. To assess NRG1's relevance to schizophrenia in the GWAS era, we provide a targeted review of recent preclinical evidence on NRG1's role in regulating several aspects of excitatory/inhibitory neurotransmission and in turn schizophrenia risk. We also present a systematic review of the last decade of clinical research examining NRG1 in the context of schizophrenia. We include concise summaries of genotypic variation, gene-expression, protein expression, structural and functional neuroimaging as well as cognitive studies conducted during this time period. We conclude with recommendations for future clinical and preclinical work that we hope will help prioritize a strategy forward to further advance our understanding of the relationship between NRG1 and schizophrenia. PMID:27283360

  3. Genome-wide characterization of microsatellites in Triticeae species: abundance, distribution and evolution.

    PubMed

    Deng, Pingchuan; Wang, Meng; Feng, Kewei; Cui, Licao; Tong, Wei; Song, Weining; Nie, Xiaojun

    2016-01-01

    Microsatellites are an important constituent of plant genome and distributed across entire genome. In this study, genome-wide analysis of microsatellites in 8 Triticeae species and 9 model plants revealed that microsatellite characteristics were similar among the Triticeae species. Furthermore, genome-wide microsatellite markers were designed in wheat and then used to analyze the evolutionary relationship of wheat and other Triticeae species. Results displayed that Aegilops tauschii was found to be the closest species to Triticum aestivum, followed by Triticum urartu, Triticum turgidum and Aegilops speltoides, while Triticum monococcum, Aegilops sharonensis and Hordeum vulgare showed a relatively lower PCR amplification effectivity. Additionally, a significantly higher PCR amplification effectivity was found in chromosomes at the same subgenome than its homoeologous when these markers were subjected to search against different chromosomes in wheat. After a rigorous screening process, a total of 20,666 markers showed high amplification and polymorphic potential in wheat and its relatives, which were integrated with the public available wheat markers and then anchored to the genome of wheat (CS). This study not only provided the useful resource for SSR markers development in Triticeae species, but also shed light on the evolution of polyploid wheat from the perspective of microsatellites. PMID:27561724

  4. Genome-wide mapping of IBD segments in an Ashkenazi PD cohort identifies associated haplotypes.

    PubMed

    Vacic, Vladimir; Ozelius, Laurie J; Clark, Lorraine N; Bar-Shira, Anat; Gana-Weisz, Mali; Gurevich, Tanya; Gusev, Alexander; Kedmi, Merav; Kenny, Eimear E; Liu, Xinmin; Mejia-Santana, Helen; Mirelman, Anat; Raymond, Deborah; Saunders-Pullman, Rachel; Desnick, Robert J; Atzmon, Gil; Burns, Edward R; Ostrer, Harry; Hakonarson, Hakon; Bergman, Aviv; Barzilai, Nir; Darvasi, Ariel; Peter, Inga; Guha, Saurav; Lencz, Todd; Giladi, Nir; Marder, Karen; Pe'er, Itsik; Bressman, Susan B; Orr-Urtreger, Avi

    2014-09-01

    The recent series of large genome-wide association studies in European and Japanese cohorts established that Parkinson disease (PD) has a substantial genetic component. To further investigate the genetic landscape of PD, we performed a genome-wide scan in the largest to date Ashkenazi Jewish cohort of 1130 Parkinson patients and 2611 pooled controls. Motivated by the reduced disease allele heterogeneity and a high degree of identical-by-descent (IBD) haplotype sharing in this founder population, we conducted a haplotype association study based on mapping of shared IBD segments. We observed significant haplotype association signals at three previously implicated Parkinson loci: LRRK2 (OR = 12.05, P = 1.23 × 10(-56)), MAPT (OR = 0.62, P = 1.78 × 10(-11)) and GBA (multiple distinct haplotypes, OR > 8.28, P = 1.13 × 10(-11) and OR = 2.50, P = 1.22 × 10(-9)). In addition, we identified a novel association signal on chr2q14.3 coming from a rare haplotype (OR = 22.58, P = 1.21 × 10(-10)) and replicated it in a secondary cohort of 306 Ashkenazi PD cases and 2583 controls. Our results highlight the power of our haplotype association method, particularly useful in studies of founder populations, and reaffirm the benefits of studying complex diseases in Ashkenazi Jewish cohorts. PMID:24842889

  5. A powerful test of independent assortment that determines genome-wide significance quickly and accurately

    PubMed Central

    Stewart, W C L; Hager, V R

    2016-01-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD—a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance—the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  6. A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci.

    PubMed

    Rothman, Nathaniel; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Malats, Nuria; Wu, Xifeng; Figueroa, Jonine D; Real, Francisco X; Van Den Berg, David; Matullo, Giuseppe; Baris, Dalsu; Thun, Michael; Kiemeney, Lambertus A; Vineis, Paolo; De Vivo, Immaculata; Albanes, Demetrius; Purdue, Mark P; Rafnar, Thorunn; Hildebrandt, Michelle A T; Kiltie, Anne E; Cussenot, Olivier; Golka, Klaus; Kumar, Rajiv; Taylor, Jack A; Mayordomo, Jose I; Jacobs, Kevin B; Kogevinas, Manolis; Hutchinson, Amy; Wang, Zhaoming; Fu, Yi-Ping; Prokunina-Olsson, Ludmila; Burdett, Laurie; Yeager, Meredith; Wheeler, William; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Johnson, Alison; Schwenn, Molly; Karagas, Margaret R; Schned, Alan; Andriole, Gerald; Grubb, Robert; Black, Amanda; Jacobs, Eric J; Diver, W Ryan; Gapstur, Susan M; Weinstein, Stephanie J; Virtamo, Jarmo; Cortessis, Victoria K; Gago-Dominguez, Manuela; Pike, Malcolm C; Stern, Mariana C; Yuan, Jian-Min; Hunter, David J; McGrath, Monica; Dinney, Colin P; Czerniak, Bogdan; Chen, Meng; Yang, Hushan; Vermeulen, Sita H; Aben, Katja K; Witjes, J Alfred; Makkinje, Remco R; Sulem, Patrick; Besenbacher, Soren; Stefansson, Kari; Riboli, Elio; Brennan, Paul; Panico, Salvatore; Navarro, Carmen; Allen, Naomi E; Bueno-de-Mesquita, H Bas; Trichopoulos, Dimitrios; Caporaso, Neil; Landi, Maria Teresa; Canzian, Federico; Ljungberg, Borje; Tjonneland, Anne; Clavel-Chapelon, Francoise; Bishop, David T; Teo, Mark T W; Knowles, Margaret A; Guarrera, Simonetta; Polidoro, Silvia; Ricceri, Fulvio; Sacerdote, Carlotta; Allione, Alessandra; Cancel-Tassin, Geraldine; Selinski, Silvia; Hengstler, Jan G; Dietrich, Holger; Fletcher, Tony; Rudnai, Peter; Gurzau, Eugen; Koppova, Kvetoslava; Bolick, Sophia C E; Godfrey, Ashley; Xu, Zongli; Sanz-Velez, José I; D García-Prats, María; Sanchez, Manuel; Valdivia, Gabriel; Porru, Stefano; Benhamou, Simone; Hoover, Robert N; Fraumeni, Joseph F; Silverman, Debra T; Chanock, Stephen J

    2010-11-01

    We conducted a multi-stage, genome-wide association study of bladder cancer with a primary scan of 591,637 SNPs in 3,532 affected individuals (cases) and 5,120 controls of European descent from five studies followed by a replication strategy, which included 8,382 cases and 48,275 controls from 16 studies. In a combined analysis, we identified three new regions associated with bladder cancer on chromosomes 22q13.1, 19q12 and 2q37.1: rs1014971, (P = 8 × 10⁻¹²) maps to a non-genic region of chromosome 22q13.1, rs8102137 (P = 2 × 10⁻¹¹) on 19q12 maps to CCNE1 and rs11892031 (P = 1 × 10⁻⁷) maps to the UGT1A cluster on 2q37.1. We confirmed four previously identified genome-wide associations on chromosomes 3q28, 4p16.3, 8q24.21 and 8q24.3, validated previous candidate associations for the GSTM1 deletion (P = 4 × 10⁻¹¹) and a tag SNP for NAT2 acetylation status (P = 4 × 10⁻¹¹), and found interactions with smoking in both regions. Our findings on common variants associated with bladder cancer risk should provide new insights into the mechanisms of carcinogenesis. PMID:20972438

  7. Genome-wide characterization of microsatellites in Triticeae species: abundance, distribution and evolution

    PubMed Central

    Deng, Pingchuan; Wang, Meng; Feng, Kewei; Cui, Licao; Tong, Wei; Song, Weining; Nie, Xiaojun

    2016-01-01

    Microsatellites are an important constituent of plant genome and distributed across entire genome. In this study, genome-wide analysis of microsatellites in 8 Triticeae species and 9 model plants revealed that microsatellite characteristics were similar among the Triticeae species. Furthermore, genome-wide microsatellite markers were designed in wheat and then used to analyze the evolutionary relationship of wheat and other Triticeae species. Results displayed that Aegilops tauschii was found to be the closest species to Triticum aestivum, followed by Triticum urartu, Triticum turgidum and Aegilops speltoides, while Triticum monococcum, Aegilops sharonensis and Hordeum vulgare showed a relatively lower PCR amplification effectivity. Additionally, a significantly higher PCR amplification effectivity was found in chromosomes at the same subgenome than its homoeologous when these markers were subjected to search against different chromosomes in wheat. After a rigorous screening process, a total of 20,666 markers showed high amplification and polymorphic potential in wheat and its relatives, which were integrated with the public available wheat markers and then anchored to the genome of wheat (CS). This study not only provided the useful resource for SSR markers development in Triticeae species, but also shed light on the evolution of polyploid wheat from the perspective of microsatellites. PMID:27561724

  8. Genome Wide Association Study Identifies Variants in NBEA Associated with Migraine in Bipolar Disorder

    PubMed Central

    Jacobsen, Kaya K.; Nievergelt, Caroline M.; Zayats, Tetyana; Greenwood, Tiffany A.; Anttila, Verneri; Akiskal, Hagop S.; Haavik, Jan; Fasmer, Ole Bernt; Kelsoe, John R.; Johansson, Stefan; Oedegaard, Ketil J.

    2014-01-01

    Background Migraine is a common comorbidity among individuals with bipolar disorder, but the underlying mechanisms for this co-occurrence are poorly understood. The aim of this study was to investigate the genetic background of bipolar patients with and without migraine. Methods We performed a genome-wide association analysis contrasting 460 bipolar migraneurs with 914 bipolar patients without migraine from the Bipolar Genome Study (BiGS). Results We identified one genome-wide significant association between migraine in bipolar disorder patients and rs1160720, an intronic single nucleotide polymorphism (SNP) in the NBEA gene (P= 2.97×10-8, OR: 1.82, 95% CI: 1.47-2.25), although this was not replicated in a smaller sample of 289 migraine cases. Limitations Our study is based on self-reported migraine. Conclusions NBEA encodes neurobeachin, a scaffolding protein primarily expressed in the brain and involved in trafficking of vesicles containing neurotransmitter receptors. This locus has not previously been implicated in migraine per se. We found no evidence of association in data from the GWAS migraine meta-analysis consortium (n=118 710 participants) suggesting that the association might be specific to migraine co-morbid with bipolar disorder. PMID:25451450

  9. Genome wide interactions of wild-type and activator bypass forms of σ54.

    PubMed

    Schaefer, Jorrit; Engl, Christoph; Zhang, Nan; Lawton, Edward; Buck, Martin

    2015-09-01

    Enhancer-dependent transcription involving the promoter specificity factor σ(54) is widely distributed amongst bacteria and commonly associated with cell envelope function. For transcription initiation, σ(54)-RNA polymerase yields open promoter complexes through its remodelling by cognate AAA+ ATPase activators. Since activators can be bypassed in vitro, bypass transcription in vivo could be a source of emergent gene expression along evolutionary pathways yielding new control networks and transcription patterns. At a single test promoter in vivo bypass transcription was not observed. We now use genome-wide transcription profiling, genome-wide mutagenesis and gene over-expression strategies in Escherichia coli, to (i) scope the range of bypass transcription in vivo and (ii) identify genes which might alter bypass transcription in vivo. We find little evidence for pervasive bypass transcription in vivo with only a small subset of σ(54) promoters functioning without activators. Results also suggest no one gene limits bypass transcription in vivo, arguing bypass transcription is strongly kept in check. Promoter sequences subject to repression by σ(54) were evident, indicating loss of rpoN (encoding σ(54)) rather than creating rpoN bypass alleles would be one evolutionary route for new gene expression patterns. Finally, cold-shock promoters showed unusual σ(54)-dependence in vivo not readily correlated with conventional σ(54) binding-sites. PMID:26082500

  10. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  11. Discovery and validation of sub-threshold genome-wide association study loci using epigenomic signatures

    PubMed Central

    Wang, Xinchen; Tucker, Nathan R; Rizki, Gizem; Mills, Robert; Krijger, Peter HL; de Wit, Elzo; Subramanian, Vidya; Bartell, Eric; Nguyen, Xinh-Xinh; Ye, Jiangchuan; Leyton-Mange, Jordan; Dolmatova, Elena V; van der Harst, Pim; de Laat, Wouter; Ellinor, Patrick T; Newton-Cheh, Christopher; Milan, David J; Kellis, Manolis; Boyer, Laurie A

    2016-01-01

    Genetic variants identified by genome-wide association studies explain only a modest proportion of heritability, suggesting that meaningful associations lie 'hidden' below current thresholds. Here, we integrate information from association studies with epigenomic maps to demonstrate that enhancers significantly overlap known loci associated with the cardiac QT interval and QRS duration. We apply functional criteria to identify loci associated with QT interval that do not meet genome-wide significance and are missed by existing studies. We demonstrate that these 'sub-threshold' signals represent novel loci, and that epigenomic maps are effective at discriminating true biological signals from noise. We experimentally validate the molecular, gene-regulatory, cellular and organismal phenotypes of these sub-threshold loci, demonstrating that most sub-threshold loci have regulatory consequences and that genetic perturbation of nearby genes causes cardiac phenotypes in mouse. Our work provides a general approach for improving the detection of novel loci associated with complex human traits. DOI: http://dx.doi.org/10.7554/eLife.10557.001 PMID:27162171

  12. Genome-wide uniparental disomy screen in human discarded morphologically abnormal embryos

    PubMed Central

    Xu, Jiawei; Zhang, Meixiang; Niu, Wenbin; Yao, Guidong; Sun, Bo; Bao, Xiao; Wang, Linlin; Du, Linqing; Sun, Yingpu

    2015-01-01

    Uniparental disomy (UPD) has been shown to be rare in human normal blastocysts, but its frequency in discarded morphologically abnormal embryos and its relevance to embryonic self-correction of aneuploid remains unknown. The aim of this study was to detect UPD in discarded morphologically abnormal embryos. Both discarded morphologically abnormal embryos, including zero-pronuclear zygotes (0PN), one-pronuclear zygotes (1PN), three-pronuclear zygotes (3PN) and 2PN embryos scored as low development potential were cultured into blastocysts then underwent trophectoderm biopsy. Genome-wide UPD screening of the trophectoderm of 241 discarded morphologically abnormal embryo sourced blastocysts showed that UPD occurred in nine embryos. Five embryos exhibited UPDs with euploid chromosomes, and four displayed UPDs with chromosomal aneuploid. The percentage of UPDs among the morphologically abnormal sourced blastocysts was 3.73%, which is significant higher than the percentage observed in normal blastocysts. The frequency of UPD in 3PN-sourced blastocysts was 7.69%, which is significantly higher than that in normal blastocysts. This study provides the first systematic genome-wide profile of UPD in discarded morphologically abnormal embryos. Our results indicated that UPD may be a common phenomenon in discarded morphologically abnormal embryos and may be relevant to human embryonic self-correction. PMID:26194013

  13. Genome-Wide Association Study of Copy Number Variations (CNVs) with Opioid Dependence

    PubMed Central

    Li, Dawei; Zhao, Hongyu; Kranzler, Henry R; Li, Ming D; Jensen, Kevin P; Zayats, Tetyana; Farrer, Lindsay A; Gelernter, Joel

    2015-01-01

    Single-nucleotide polymorphisms that have been associated with opioid dependence (OD) altogether account for only a small proportion of the known heritability. Most of the genetic risk factors are unknown. Some of the ‘missing heritability' might be explained by copy number variations (CNVs) in the human genome. We used Illumina HumanOmni1 arrays to genotype 5152 African-American and European-American OD cases and screened controls and implemented combined CNV calling methods. After quality control measures were applied, a genome-wide association study (GWAS) of CNVs with OD was performed. For common CNVs, two deletions and one duplication were significantly associated with OD genome-wide (eg, P=2 × 10−8 and OR (95% CI)=0.64 (0.54–0.74) for a chromosome 18q12.3 deletion). Several rare or unique CNVs showed suggestive or marginal significance with large effect sizes. This study is the first GWAS of OD using CNVs. Some identified CNVs harbor genes newly identified here to be of biological importance in addiction, whereas others affect genes previously known to contribute to substance dependence risk. Our findings augment our specific knowledge of the importance of genomic variation in addictive disorders, and provide an addiction CNV pool for further research. These findings require replication. PMID:25345593

  14. Translation elicits a growth rate-dependent, genome-wide, differential protein production in Bacillus subtilis.

    PubMed

    Borkowski, Olivier; Goelzer, Anne; Schaffer, Marc; Calabre, Magali; Mäder, Ulrike; Aymerich, Stéphane; Jules, Matthieu; Fromion, Vincent

    2016-01-01

    Complex regulatory programs control cell adaptation to environmental changes by setting condition-specific proteomes. In balanced growth, bacterial protein abundances depend on the dilution rate, transcript abundances and transcript-specific translation efficiencies. We revisited the current theory claiming the invariance of bacterial translation efficiency. By integrating genome-wide transcriptome datasets and datasets from a library of synthetic gfp-reporter fusions, we demonstrated that translation efficiencies in Bacillus subtilis decreased up to fourfold from slow to fast growth. The translation initiation regions elicited a growth rate-dependent, differential production of proteins without regulators, hence revealing a unique, hard-coded, growth rate-dependent mode of regulation. We combined model-based data analyses of transcript and protein abundances genome-wide and revealed that this global regulation is extensively used in B. subtilis We eventually developed a knowledge-based, three-step translation initiation model, experimentally challenged the model predictions and proposed that a growth rate-dependent drop in free ribosome abundance accounted for the differential protein production. PMID:27193784

  15. A Genome-Wide Association Study of Total Bilirubin and Cholelithiasis Risk in Sickle Cell Anemia

    PubMed Central

    Milton, Jacqueline N.; Sebastiani, Paola; Solovieff, Nadia; Hartley, Stephen W.; Bhatnagar, Pallav; Arking, Dan E.; Dworkis, Daniel A.; Casella, James F.; Barron-Casella, Emily; Bean, Christopher J.; Hooper, W. Craig; DeBaun, Michael R.; Garrett, Melanie E.; Soldano, Karen; Telen, Marilyn J.; Ashley-Koch, Allison; Gladwin, Mark T.; Baldwin, Clinton T.; Steinberg, Martin H.; Klings, Elizabeth S.

    2012-01-01

    Serum bilirubin levels have been associated with polymorphisms in the UGT1A1 promoter in normal populations and in patients with hemolytic anemias, including sickle cell anemia. When hemolysis occurs circulating heme increases, leading to elevated bilirubin levels and an increased incidence of cholelithiasis. We performed the first genome-wide association study (GWAS) of bilirubin levels and cholelithiasis risk in a discovery cohort of 1,117 sickle cell anemia patients. We found 15 single nucleotide polymorphisms (SNPs) associated with total bilirubin levels at the genome-wide significance level (p value <5×10−8). SNPs in UGT1A1, UGT1A3, UGT1A6, UGT1A8 and UGT1A10, different isoforms within the UGT1A locus, were identified (most significant rs887829, p = 9.08×10−25). All of these associations were validated in 4 independent sets of sickle cell anemia patients. We tested the association of the 15 SNPs with cholelithiasis in the discovery cohort and found a significant association (most significant p value 1.15×10−4). These results confirm that the UGT1A region is the major regulator of bilirubin metabolism in African Americans with sickle cell anemia, similar to what is observed in other ethnicities. PMID:22558097

  16. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  17. Pharmacogenomics: accessing important alleles by imputation from commercial genome-wide SNP arrays.

    PubMed

    Liboredo, R; Pena, S D J

    2014-01-01

    Personalized medicine is becoming a medical reality, as important genotype-phenotype relationships are being unraveled. The availability of pharmacogenomic data is a key element of individualized care. In this study, we explored genotype imputation as a means to infer important pharmacogenomic alleles from a regular commercially available genome-wide SNP array. Using these arrays as a starting point can reduce testing costs, increasing access to these pharmacogenomic data and still retain a larger amount of genome-wide information. IMPUTE2 and MaCH-Admix were used to perform genotype imputation with a dense reference panel from 1000 Genomes data. We were able to correctly infer genotypes for the warfarin-related loci VKORC1 and CYP2C9 alleles 2, 3, 5, and 11 and also clopidogrel-related CYP2C19 alleles 2 and 17 for a small sample of Brazilian individuals, as well as for HapMap samples. The success of an imputation approach in admixed samples using publicly available reference panels can encourage further imputation initiatives in those populations. PMID:25117329

  18. Multi-ethnic genome-wide association study identifies novel locus for type 2 diabetes susceptibility.

    PubMed

    Cook, James P; Morris, Andrew P

    2016-08-01

    Genome-wide association studies (GWAS) have traditionally been undertaken in homogeneous populations from the same ancestry group. However, with the increasing availability of GWAS in large-scale multi-ethnic cohorts, we have evaluated a framework for detecting association of genetic variants with complex traits, allowing for population structure, and developed a powerful test of heterogeneity in allelic effects between ancestry groups. We have applied the methodology to identify and characterise loci associated with susceptibility to type 2 diabetes (T2D) using GWAS data from the Resource for Genetic Epidemiology on Adult Health and Aging, a large multi-ethnic population-based cohort, created for investigating the genetic and environmental basis of age-related diseases. We identified a novel locus for T2D susceptibility at genome-wide significance (P<5 × 10(-8)) that maps to TOMM40-APOE, a region previously implicated in lipid metabolism and Alzheimer's disease. We have also confirmed previous reports that single-nucleotide polymorphisms at the TCF7L2 locus demonstrate the greatest extent of heterogeneity in allelic effects between ethnic groups, with the lowest risk observed in populations of East Asian ancestry. PMID:27189021

  19. Ancestry informative markers for distinguishing between Thai populations based on genome-wide association datasets.

    PubMed

    Vongpaisarnsin, Kornkiat; Listman, Jennifer Beth; Malison, Robert T; Gelernter, Joel

    2015-07-01

    The main purpose of this work was to identify a set of AIMs that stratify the genetic structure and diversity of the Thai population from a high-throughput autosomal genome-wide association study. In this study, more than one million SNPs from the international HapMap database and the Thai depression genome-wide association study have been examined to identify ancestry informative markers (AIMs) that distinguish between Thai populations. An efficient strategy is proposed to identify and characterize such SNPs and to test high-resolution SNP data from international HapMap populations. The best AIMs are identified to stratify the population and to infer genetic ancestry structure. A total of 124 AIMs were clearly clustered geographically across the continent, whereas only 89 AIMs stratified the Thai population from East Asian populations. Finally, a set of 273 AIMs was able to distinguish northern from southern Thai subpopulations. These markers will be of particular value in identifying the ethnic origins in regions where matching by self-reports is unavailable or unreliable, which usually occurs in real forensic cases. PMID:25759192

  20. Genome-Wide Association Study of Behavioral Disinhibition in a Selected Adolescent Sample.

    PubMed

    Derringer, Jaime; Corley, Robin P; Haberstick, Brett C; Young, Susan E; Demmitt, Brittany A; Howrigan, Daniel P; Kirkpatrick, Robert M; Iacono, William G; McGue, Matt; Keller, Matthew C; Brown, Sandra; Tapert, Susan; Hopfer, Christian J; Stallings, Michael C; Crowley, Thomas J; Rhee, Soo Hyun; Krauter, Ken; Hewitt, John K; McQueen, Matthew B

    2015-07-01

    Behavioral disinhibition (BD) is a quantitative measure designed to capture the heritable variation encompassing risky and impulsive behaviors. As a result, BD represents an ideal target for discovering genetic loci that predispose individuals to a wide range of antisocial behaviors and substance misuse that together represent a large cost to society as a whole. Published genome-wide association studies (GWAS) have examined specific phenotypes that fall under the umbrella of BD (e.g. alcohol dependence, conduct disorder); however no GWAS has specifically examined the overall BD construct. We conducted a GWAS of BD using a sample of 1,901 adolescents over-selected for characteristics that define high BD, such as substance and antisocial behavior problems, finding no individual locus that surpassed genome-wide significance. Although no single SNP was significantly associated with BD, restricted maximum likelihood analysis estimated that 49.3 % of the variance in BD within the Caucasian sub-sample was accounted for by the genotyped SNPs (p = 0.06). Gene-based tests identified seven genes associated with BD (p ≤ 2.0 × 10(-6)). Although the current study was unable to identify specific SNPs or pathways with replicable effects on BD, the substantial sample variance that could be explained by all genotyped SNPs suggests that larger studies could successfully identify common variants associated with BD. PMID:25637581

  1. A powerful test of independent assortment that determines genome-wide significance quickly and accurately.

    PubMed

    Stewart, W C L; Hager, V R

    2016-08-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD-a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance-the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  2. Genome-wide scans of genetic variants for psychophysiological endophenotypes: a methodological overview.

    PubMed

    Iacono, William G; Malone, Stephen M; Vaidyanathan, Uma; Vrieze, Scott I

    2014-12-01

    This article provides an introductory overview of the investigative strategy employed to evaluate the genetic basis of 17 endophenotypes examined as part of a 20-year data collection effort from the Minnesota Center for Twin and Family Research. Included are characterization of the study samples, descriptive statistics for key properties of the psychophysiological measures, and rationale behind the steps taken in the molecular genetic study design. The statistical approach included (a) biometric analysis of twin and family data, (b) heritability analysis using 527,829 single nucleotide polymorphisms (SNPs), (c) genome-wide association analysis of these SNPs and 17,601 autosomal genes, (d) follow-up analyses of candidate SNPs and genes hypothesized to have an association with each endophenotype, (e) rare variant analysis of nonsynonymous SNPs in the exome, and (f) whole genome sequencing association analysis using 27 million genetic variants. These methods were used in the accompanying empirical articles comprising this special issue, Genome-Wide Scans of Genetic Variants for Psychophysiological Endophenotypes. PMID:25387703

  3. Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies

    PubMed Central

    Chang, Diana; Keinan, Alon

    2014-01-01

    Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS paradigm by analyzing the diseases themselves, across GWAS datasets, to explore their “shared pathogenetics”. The method applies principal component analysis (PCA) to gene-level significance scores across all genes and across GWASs, thereby revealing shared pathogenetics between diseases in an unsupervised fashion. Importantly, it adjusts for potential sources of heterogeneity present between GWAS which can confound investigation of shared disease etiology. We applied disPCA to 31 GWASs, including autoimmune diseases, cancers, psychiatric disorders, and neurological disorders. The leading principal components separate these disease classes, as well as inflammatory bowel diseases from other autoimmune diseases. Generally, distinct diseases from the same class tend to be less separated, which is in line with their increased shared etiology. Enrichment analysis of genes contributing to leading principal components revealed pathways that are implicated in the immune system, while also pointing to pathways that have yet to be explored before in this context. Our results point to the potential of disPCA in going beyond epidemiological findings of the co-occurrence of distinct diseases, to highlighting novel genes and pathways that unsupervised learning suggest to be key players in the variability across diseases. PMID:25211452

  4. Genome-Wide Analysis Identifies Germ-Line Risk Factors Associated with Canine Mammary Tumours

    PubMed Central

    Melin, Malin; Murén, Eva; Gustafson, Ulla; Starkey, Mike; Borge, Kaja Sverdrup; Lingaas, Frode; Saellström, Sara; Rönnberg, Henrik; Lindblad-Toh, Kerstin

    2016-01-01

    Canine mammary tumours (CMT) are the most common neoplasia in unspayed female dogs. CMTs are suitable naturally occurring models for human breast cancer and share many characteristics, indicating that the genetic causes could also be shared. We have performed a genome-wide association study (GWAS) in English Springer Spaniel dogs and identified a genome-wide significant locus on chromosome 11 (praw = 5.6x10-7, pperm = 0.019). The most associated haplotype spans a 446 kb region overlapping the CDK5RAP2 gene. The CDK5RAP2 protein has a function in cell cycle regulation and could potentially have an impact on response to chemotherapy treatment. Two additional loci, both on chromosome 27, were nominally associated (praw = 1.97x10-5 and praw = 8.30x10-6). The three loci explain 28.1±10.0% of the phenotypic variation seen in the cohort, whereas the top ten associated regions account for 38.2±10.8% of the risk. Furthermore, the ten GWAS loci and regions with reduced genetic variability are significantly enriched for snoRNAs and tumour-associated antigen genes, suggesting a role for these genes in CMT development. We have identified several candidate genes associated with canine mammary tumours, including CDK5RAP2. Our findings enable further comparative studies to investigate the genes and pathways in human breast cancer patients. PMID:27158822

  5. Semiconductor-based sequencing of genome-wide DNA methylation states.

    PubMed

    Corley, Michael J; Zhang, Wei; Zheng, Xin; Lum-Jones, Annette; Maunakea, Alika K

    2015-01-01

    Methylated DNA immunoprecipitation sequencing (MeDIP-Seq) is a widely used approach to study DNA methylation genome-wide. Here, we developed a MeDIP-Seq protocol compatible with the Ion Torrent semiconductor-based sequencing platform that is low cost, rapid, and scalable. We applied this protocol to demonstrate MeDIP-Seq on the Ion Torrent platform provides adequate coverage of CpG cytosines, the methylation states of which we validated at single-base resolution on the Infinium HumanMethylation450 BeadChip array, and accurately identifies sites of differential DNA methylation. Furthermore, we applied an integrative approach to further investigate and confirm the role of DNA methylation in alternative splicing and to profile 5mC and 5hmC variants of DNA methylation in normal human brain tissue that is localized over distinct genomic regions. These applications of MeDIP-Seq on the Ion Torrent platform have broad utility and add to the current methodologies for profiling genome-wide DNA methylation states in normal and disease conditions. PMID:25602802

  6. Gene-based and pathway-based genome-wide association study of alcohol dependence

    PubMed Central

    ZUO, Lingjun; ZHANG, Clarence K.; SAYWARD, Frederick G.; CHEUNG, Kei-Hoi; WANG, Kesheng; KRYSTAL, John H.; ZHAO, Hongyu; LUO, Xingguang

    2015-01-01

    Background The organization of risk genes within signaling pathways may provide clues about the converging neurobiological effects of risk genes for alcohol dependence. Aim Identify risk genes and risk gene pathways for alcohol dependence. Methods We conducted a pathway-based genome-wide association study (GWAS) of alcohol dependence using a gene-set-rich analytic approach. Approximately one million genetic markers were tested in the discovery sample which included 1409 European-American (EA) alcohol dependent individuals and 1518 EA healthy comparison subjects. An additional 681 African-American (AA) cases and 508 AA healthy subjects served as the replication sample. Results We identified several genome-wide replicable risk genes and risk pathways that were significantly associated with alcohol dependence. After applying the Bonferroni correction for multiple testing, the ‘cellextracellular matrix interactions’ pathway (p<2.0E-4 in EAs) and the PXN gene (which encodes paxillin) (p=3.9E-7 in EAs) within this pathway were the most promising risk factors for alcohol dependence. There were also two nominally replicable pathways enriched in alcohol dependence-related genes in both EAs (0.015≤p≤0.035) and AAs (0.025≤p≤0.050): the ‘Na+/Cl- dependent neurotransmitter transporters’ pathway and the ‘other glycan degradation’ pathway. Conclusion These findings provide new evidence highlighting several genes and biological signaling processes that may be related to the risk for alcohol dependence. PMID:26120261

  7. Genome-wide linkage analysis in a Dutch multigenerational family with attention deficit hyperactivity disorder

    PubMed Central

    Vegt, Rinus; Bertoli-Avella, Aida M; Tulen, Joke H M; de Graaf, Bianca; Verkerk, Annemieke J M H; Vervoort, Jeroen; Twigt, Carla M; Maat-Kievit, Anneke; van Tuijl, Ruud; van der Lijn, Marieke; Hengeveld, Michiel W; Oostra, Ben A

    2010-01-01

    Attention deficit hyperactivity disorder (ADHD) is a common neuropsychiatric disorder. Genetics has an important role in the aetiology of this disease. In this study, we describe the clinical findings in a Dutch family with eight patients suffering from ADHD, in whom five had at least one other psychiatric disorder. We performed a genome-wide (parametric and nonparametric) affected-only linkage analysis. Two genomic regions on chromosomes 7 and 14 showed an excess of allele sharing among the definitely affected members of the family with suggestive LOD scores (2.1 and 2.08). Nonparametric linkage analyses (NPL) yielded a maxNPL of 2.92 (P=0.001) for marker D7S502 and a maxNPL score of 2.56 (P=0.003) for marker D14S275. We confirmed that all patients share the same haplotype in each region of 7p15.1–q31.33 and 14q11.2–q22.3. Interestingly, both loci have been reported before in Dutch (affected sib pairs) and German (extended families) ADHD linkage studies. Hopefully, the genome-wide association studies in ADHD will help to highlight specific polymorphisms and genes within the broad areas detected by our, as well as other, linkage studies. PMID:19707245

  8. A Genome-Wide Linkage Scan for Age at Menarche in Three Populations of European Descent

    PubMed Central

    Anderson, Carl A.; Zhu, Gu; Falchi, Mario; van den Berg, Stéphanie M.; Treloar, Susan A.; Spector, Timothy D.; Martin, Nicholas G.; Boomsma, Dorret I.; Visscher, Peter M.; Montgomery, Grant W.

    2008-01-01

    Context: Age at menarche (AAM) is an important trait both biologically and socially, a clearly defined event in female pubertal development, and has been associated with many clinically significant phenotypes. Objective: The objective of the study was to identify genetic loci influencing variation in AAM in large population-based samples from three countries. Design/Participants: Recalled AAM data were collected from 13,697 individuals and 4,899 pseudoindependent sister-pairs from three different populations (Australia, The Netherlands, and the United Kingdom) by mailed questionnaire or interview. Genome-wide variance components linkage analysis was implemented on each sample individually and in combination. Results: The mean, sd, and heritability of AAM across the three samples was 13.1 yr, 1.5 yr, and 0.69, respectively. No loci were detected that reached genome-wide significance in the combined analysis, but a suggestive locus was detected on chromosome 12 (logarithm of the odds = 2.0). Three loci of suggestive significance were seen in the U.K. sample on chromosomes 1, 4, and 18 (logarithm of the odds = 2.4, 2.2 and 3.2, respectively). Conclusions: There was no evidence for common highly penetrant variants influencing AAM. Linkage and association suggest that one trait locus for AAM is located on chromosome 12, but further studies are required to replicate these results. PMID:18647812

  9. Epigenetic Variation in Monozygotic Twins: A Genome-Wide Analysis of DNA Methylation in Buccal Cells

    PubMed Central

    van Dongen, Jenny; Ehli, Erik A.; Slieker, Roderick C.; Bartels, Meike; Weber, Zachary M.; Davies, Gareth E.; Slagboom, P. Eline; Heijmans, Bastiaan T.; Boomsma, Dorret I.

    2014-01-01

    DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ) twins offers a unique experimental design to examine the extent to which such variation is related to individual-specific environmental influences and stochastic events or to familial factors (DNA sequence and shared environment). We measured genome-wide DNA methylation in buccal samples from ten MZ pairs (age 8–19) using the Illumina 450k array and examined twin correlations for methylation level at 420,921 CpGs after QC. After selecting CpGs showing the most variation in the methylation level between subjects, the mean genome-wide correlation (rho) was 0.54. The correlation was higher, on average, for CpGs within CpG islands (CGIs), compared to CGI shores, shelves and non-CGI regions, particularly at hypomethylated CpGs. This finding suggests that individual-specific environmental and stochastic influences account for more variation in DNA methylation in CpG-poor regions. Our findings also indicate that it is worthwhile to examine heritable and shared environmental influences on buccal DNA methylation in larger studies that also include dizygotic twins. PMID:24802513

  10. Epigenetic variation in monozygotic twins: a genome-wide analysis of DNA methylation in buccal cells.

    PubMed

    van Dongen, Jenny; Ehli, Erik A; Slieker, Roderick C; Bartels, Meike; Weber, Zachary M; Davies, Gareth E; Slagboom, P Eline; Heijmans, Bastiaan T; Boomsma, Dorret I

    2014-01-01

    DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ) twins offers a unique experimental design to examine the extent to which such variation is related to individual-specific environmental influences and stochastic events or to familial factors (DNA sequence and shared environment). We measured genome-wide DNA methylation in buccal samples from ten MZ pairs (age 8-19) using the Illumina 450k array and examined twin correlations for methylation level at 420,921 CpGs after QC. After selecting CpGs showing the most variation in the methylation level between subjects, the mean genome-wide correlation (rho) was 0.54. The correlation was higher, on average, for CpGs within CpG islands (CGIs), compared to CGI shores, shelves and non-CGI regions, particularly at hypomethylated CpGs. This finding suggests that individual-specific environmental and stochastic influences account for more variation in DNA methylation in CpG-poor regions. Our findings also indicate that it is worthwhile to examine heritable and shared environmental influences on buccal DNA methylation in larger studies that also include dizygotic twins. PMID:24802513

  11. Genome-wide association study reveals two new risk loci for bipolar disorder.

    PubMed

    Mühleisen, Thomas W; Leber, Markus; Schulze, Thomas G; Strohmaier, Jana; Degenhardt, Franziska; Treutlein, Jens; Mattheisen, Manuel; Forstner, Andreas J; Schumacher, Johannes; Breuer, René; Meier, Sandra; Herms, Stefan; Hoffmann, Per; Lacour, André; Witt, Stephanie H; Reif, Andreas; Müller-Myhsok, Bertram; Lucae, Susanne; Maier, Wolfgang; Schwarz, Markus; Vedder, Helmut; Kammerer-Ciernioch, Jutta; Pfennig, Andrea; Bauer, Michael; Hautzinger, Martin; Moebus, Susanne; Priebe, Lutz; Czerski, Piotr M; Hauser, Joanna; Lissowska, Jolanta; Szeszenia-Dabrowska, Neonila; Brennan, Paul; McKay, James D; Wright, Adam; Mitchell, Philip B; Fullerton, Janice M; Schofield, Peter R; Montgomery, Grant W; Medland, Sarah E; Gordon, Scott D; Martin, Nicholas G; Krasnow, Valery; Chuchalin, Alexander; Babadjanova, Gulja; Pantelejeva, Galina; Abramova, Lilia I; Tiganov, Alexander S; Polonikov, Alexey; Khusnutdinova, Elza; Alda, Martin; Grof, Paul; Rouleau, Guy A; Turecki, Gustavo; Laprise, Catherine; Rivas, Fabio; Mayoral, Fermin; Kogevinas, Manolis; Grigoroiu-Serbanescu, Maria; Propping, Peter; Becker, Tim; Rietschel, Marcella; Nöthen, Markus M; Cichon, Sven

    2014-01-01

    Bipolar disorder (BD) is a common and highly heritable mental illness and genome-wide association studies (GWAS) have robustly identified the first common genetic variants involved in disease aetiology. The data also provide strong evidence for the presence of multiple additional risk loci, each contributing a relatively small effect to BD susceptibility. Large samples are necessary to detect these risk loci. Here we present results from the largest BD GWAS to date by investigating 2.3 million single-nucleotide polymorphisms (SNPs) in a sample of 24,025 patients and controls. We detect 56 genome-wide significant SNPs in five chromosomal regions including previously reported risk loci ANK3, ODZ4 and TRANK1, as well as the risk locus ADCY2 (5p15.31) and a region between MIR2113 and POU3F2 (6q16.1). ADCY2 is a key enzyme in cAMP signalling and our finding provides new insights into the biological mechanisms involved in the development of BD. PMID:24618891

  12. Genome-wide uniparental disomy screen in human discarded morphologically abnormal embryos.

    PubMed

    Xu, Jiawei; Zhang, Meixiang; Niu, Wenbin; Yao, Guidong; Sun, Bo; Bao, Xiao; Wang, Linlin; Du, Linqing; Sun, Yingpu

    2015-01-01

    Uniparental disomy (UPD) has been shown to be rare in human normal blastocysts, but its frequency in discarded morphologically abnormal embryos and its relevance to embryonic self-correction of aneuploid remains unknown. The aim of this study was to detect UPD in discarded morphologically abnormal embryos. Both discarded morphologically abnormal embryos, including zero-pronuclear zygotes (0PN), one-pronuclear zygotes (1PN), three-pronuclear zygotes (3PN) and 2PN embryos scored as low development potential were cultured into blastocysts then underwent trophectoderm biopsy. Genome-wide UPD screening of the trophectoderm of 241 discarded morphologically abnormal embryo sourced blastocysts showed that UPD occurred in nine embryos. Five embryos exhibited UPDs with euploid chromosomes, and four displayed UPDs with chromosomal aneuploid. The percentage of UPDs among the morphologically abnormal sourced blastocysts was 3.73%, which is significant higher than the percentage observed in normal blastocysts. The frequency of UPD in 3PN-sourced blastocysts was 7.69%, which is significantly higher than that in normal blastocysts. This study provides the first systematic genome-wide profile of UPD in discarded morphologically abnormal embryos. Our results indicated that UPD may be a common phenomenon in discarded morphologically abnormal embryos and may be relevant to human embryonic self-correction. PMID:26194013

  13. Technical advances: genome-wide cDNA-AFLP analysis of the Arabidopsis transcriptome.

    PubMed

    Volkmuth, Wayne; Turk, Stefan; Shapiro, Amy; Fang, Yiwen; Kiegle, Ed; van Haaren, Mark; Donson, Jonathan

    2003-01-01

    cDNA-AFLP, a technology historically used to identify small numbers of differentially expressed genes, was adapted as a genome-wide transcript profiling method. mRNA levels were assayed in a diverse range of tissues from Arabidopsis thaliana plants grown under a variety of environmental conditions. The resulting cDNA-AFLP fragments were sequenced. By linking cDNA-AFLP fragments to their corresponding mRNAs via these sequences, a database was generated that contained quantitative expression information for up to two-thirds of gene loci in A. thaliana, ecotype Ws. Using this resource, the expression levels of genes, including those with high nucleotide sequence similarity, could be determined in a high-throughput manner merely by comparing cDNA-AFLP profiles with the database. The lengths of cDNA-AFLP fragments inferred from their electrophoretic mobilities correlated well with actual fragment lengths determined by sequencing. In addition, the concentrations of AFLP fragments from single cDNAs were highly correlated, illustrating the validity of cDNA-AFLP as a quantitative, genome-wide, transcript profiling method. cDNA-AFLP profiles were also qualitatively consistent with mRNA profiles obtained from parallel microarray analysis, and with data from previous studies. PMID:14506844

  14. Genome-Wide Association Study of Down Syndrome-Associated Atrioventricular Septal Defects

    PubMed Central

    Ramachandran, Dhanya; Zeng, Zhen; Locke, Adam E.; Mulle, Jennifer G.; Bean, Lora J.H.; Rosser, Tracie C.; Dooley, Kenneth J.; Cua, Clifford L.; Capone, George T.; Reeves, Roger H.; Maslen, Cheryl L.; Cutler, David J.; Feingold, Eleanor; Sherman, Stephanie L.; Zwick, Michael E.

    2015-01-01

    The goal of this study was to identify the contribution of common genetic variants to Down syndrome−associated atrioventricular septal defect, a severe heart abnormality. Compared with the euploid population, infants with Down syndrome, or trisomy 21, have a 2000-fold increased risk of presenting with atrioventricular septal defects. The cause of this increased risk remains elusive. Here we present data from the largest heart study conducted to date on a trisomic background by using a carefully characterized collection of individuals from extreme ends of the phenotypic spectrum. We performed a genome-wide association study using logistic regression analysis on 452 individuals with Down syndrome, consisting of 210 cases with complete atrioventricular septal defects and 242 controls with structurally normal hearts. No individual variant achieved genome-wide significance. We identified four disomic regions (1p36.3, 5p15.31, 8q22.3, and 17q22) and two trisomic regions on chromosome 21 (around PDXK and KCNJ6 genes) that merit further investigation in large replication studies. Our data show that a few common genetic variants of large effect size (odds ratio >2.0) do not account for the elevated risk of Down syndrome−associated atrioventricular septal defects. Instead, multiple variants of low-to-moderate effect sizes may contribute to this elevated risk, highlighting the complex genetic architecture of atrioventricular septal defects even in the highly susceptible Down syndrome population. PMID:26194203

  15. Conjunctival fibrosis and the innate barriers to Chlamydia trachomatis intracellular infection: a genome wide association study

    PubMed Central

    Roberts, Chrissy h.; Franklin, Christopher S.; Makalo, Pateh; Joof, Hassan; Sarr, Isatou; Mahdi, Olaimatu S.; Sillah, Ansumana; Bah, Momodou; Payne, Felicity; Jeffreys, Anna E.; Bottomley, William; Natividad, Angels; Molina-Gonzalez, Sandra; Burr, Sarah E.; Preston, Mark; Kwiatkowski, Dominic; Rockett, Kirk A.; Clark, Taane G.; Burton, Matthew J.; Mabey, David C. W.; Bailey, Robin; Barroso, Inês; Holland, Martin J.

    2015-01-01

    Chlamydia trachomatis causes both trachoma and sexually transmitted infections. These diseases have similar pathology and potentially similar genetic predisposing factors. We aimed to identify polymorphisms and pathways associated with pathological sequelae of ocular Chlamydia trachomatis infections in The Gambia. We report a discovery phase genome-wide association study (GWAS) of scarring trachoma (1090 cases, 1531 controls) that identified 27 SNPs with strong, but not genome-wide significant, association with disease (5 × 10−6 > P > 5 × 10−8). The most strongly associated SNP (rs111513399, P = 5.38 × 10−7) fell within a gene (PREX2) with homology to factors known to facilitate chlamydial entry to the host cell. Pathway analysis of GWAS data was significantly enriched for mitotic cell cycle processes (P = 0.001), the immune response (P = 0.00001) and for multiple cell surface receptor signalling pathways. New analyses of published transcriptome data sets from Gambia, Tanzania and Ethiopia also revealed that the same cell cycle and immune response pathways were enriched at the transcriptional level in various disease states. Although unconfirmed, the data suggest that genetic associations with chlamydial scarring disease may be focussed on processes relating to the immune response, the host cell cycle and cell surface receptor signalling. PMID:26616738

  16. Genome-wide association study of growth traits in the Jinghai Yellow chicken.

    PubMed

    Zhang, G X; Fan, Q C; Zhang, T; Wang, J Y; Wang, W H; Xue, Q; Wang, Y J

    2015-01-01

    Growth is one of the most economically important traits in the poultry industry. In this study, we identified single-nucleotide polymorphisms (SNPs) and candidate genes associated with growth traits of the Jinghai Yellow chicken. Genome-wide association studies were conducted using the Illumina 60 K SNP Chicken array to genotype 400 Jinghai Yellow chickens. For each bird, the body weights at hatching and at 2, 4, 6, 8, 12, 14, and 16 weeks were recorded. The SNPs that were significantly associated with the growth traits were identified using the general linear regression model. The results revealed a total of 18 SNPs that reached Bonferroni genome-wide significance (P < 1.80E-6). Three proximal genes (BTRC, NLK, and NF1) were found to participate in the Wnt-signaling pathway and mitogen-activated protein kinase signaling pathway. Haplotype analysis identified 19 significant haplotypes and identified a region 152.4-156.3M on GGA1 affecting 3 growth traits (BW4, BW14, and BW16). These results may help identify the exact locations of body weight quantitative trait loci on a genome level and indicate variants that can be used for subsequent investigations for Jinghai Yellow chicken. PMID:26634498

  17. Quantifying the heritability of glioma using genome-wide complex trait analysis

    PubMed Central

    Kinnersley, Ben; Mitchell, Jonathan S.; Gousias, Konstantinos; Schramm, Johannes; Idbaih, Ahmed; Labussière, Marianne; Marie, Yannick; Rahimian, Amithys; Wichmann, H.-Erich; Schreiber, Stefan; Hoang-Xuan, Khe; Delattre, Jean-Yves; Nöthen, Markus M.; Mokhtari, Karima; Lathrop, Mark; Bondy, Melissa; Simon, Matthias; Sanson, Marc; Houlston, Richard S.

    2015-01-01

    Genome-wide association studies (GWAS) have successfully identified a number of common single-nucleotide polymorphisms (SNPs) influencing glioma risk. While these SNPs only explain a small proportion of the genetic risk it is unclear how much is left to be detected by other, yet to be identified, common SNPs. Therefore, we applied Genome-Wide Complex Trait Analysis (GCTA) to three GWAS datasets totalling 3,373 cases and 4,571 controls and performed a meta-analysis to estimate the heritability of glioma. Our results identify heritability estimates of 25% (95% CI: 20–31%, P = 1.15 × 10−17) for all forms of glioma - 26% (95% CI: 17–35%, P = 1.05 × 10−8) for glioblastoma multiforme (GBM) and 25% (95% CI: 17–32%, P = 1.26 × 10−10) for non-GBM tumors. This is a substantial increase from the genetic variance identified by the currently identified GWAS risk loci (~6% of common heritability), indicating that most of the heritable risk attributable to common genetic variants remains to be identified. PMID:26625949

  18. Genome-Wide DNA Methylation in Mixed Ancestry Individuals with Diabetes and Prediabetes from South Africa

    PubMed Central

    Pheiffer, Carmen; Humphries, Stephen E.; Gamieldien, Junaid; Erasmus, Rajiv T.

    2016-01-01

    Aims. To conduct a genome-wide DNA methylation in individuals with type 2 diabetes, individuals with prediabetes, and control mixed ancestry individuals from South Africa. Methods. We used peripheral blood to perform genome-wide DNA methylation analysis in 3 individuals with screen detected diabetes, 3 individuals with prediabetes, and 3 individuals with normoglycaemia from the Bellville South Community, Cape Town, South Africa, who were age-, gender-, body mass index-, and duration of residency-matched. Methylated DNA immunoprecipitation (MeDIP) was performed by Arraystar Inc. (Rockville, MD, USA). Results. Hypermethylated DMRs were 1160 (81.97%) and 124 (43.20%), respectively, in individuals with diabetes and prediabetes when both were compared to subjects with normoglycaemia. Our data shows that genes related to the immune system, signal transduction, glucose transport, and pancreas development have altered DNA methylation in subjects with prediabetes and diabetes. Pathway analysis based on the functional analysis mapping of genes to KEGG pathways suggested that the linoleic acid metabolism and arachidonic acid metabolism pathways are hypomethylated in prediabetes and diabetes. Conclusions. Our study suggests that epigenetic changes are likely to be an early process that occurs before the onset of overt diabetes. Detailed analysis of DMRs that shows gradual methylation differences from control versus prediabetes to prediabetes versus diabetes in a larger sample size is required to confirm these findings. PMID:27555869

  19. Quality control and conduct of genome-wide association meta-analyses.

    PubMed

    Winkler, Thomas W; Day, Felix R; Croteau-Chonka, Damien C; Wood, Andrew R; Locke, Adam E; Mägi, Reedik; Ferreira, Teresa; Fall, Tove; Graff, Mariaelisa; Justice, Anne E; Luan, Jian'an; Gustafsson, Stefan; Randall, Joshua C; Vedantam, Sailaja; Workalemahu, Tsegaselassie; Kilpeläinen, Tuomas O; Scherag, André; Esko, Tonu; Kutalik, Zoltán; Heid, Iris M; Loos, Ruth J F

    2014-05-01

    Rigorous organization and quality control (QC) are necessary to facilitate successful genome-wide association meta-analyses (GWAMAs) of statistics aggregated across multiple genome-wide association studies. This protocol provides guidelines for (i) organizational aspects of GWAMAs, and for (ii) QC at the study file level, the meta-level across studies and the meta-analysis output level. Real-world examples highlight issues experienced and solutions developed by the GIANT Consortium that has conducted meta-analyses including data from 125 studies comprising more than 330,000 individuals. We provide a general protocol for conducting GWAMAs and carrying out QC to minimize errors and to guarantee maximum use of the data. We also include details for the use of a powerful and flexible software package called EasyQC. Precise timings will be greatly influenced by consortium size. For consortia of comparable size to the GIANT Consortium, this protocol takes a minimum of about 10 months to complete. PMID:24762786

  20. Genome-Wide Screen Reveals Valosin-Containing Protein Requirement for Coronavirus Exit from Endosomes

    PubMed Central

    Wong, Hui Hui; Kumar, Pankaj; Tay, Felicia Pei Ling; Moreau, Dimitri

    2015-01-01

    ABSTRACT Coronaviruses are RNA viruses with a large zoonotic reservoir and propensity for host switching, representing a real threat for public health, as evidenced by severe acute respiratory syndrome (SARS) and the emerging Middle East respiratory syndrome (MERS). Cellular factors required for their replication are poorly understood. Using genome-wide small interfering RNA (siRNA) screening, we identified 83 novel genes supporting infectious bronchitis virus (IBV) replication in human cells. Thirty of these hits can be placed in a network of interactions with viral proteins and are involved in RNA splicing, membrane trafficking, and ubiquitin conjugation. In addition, our screen reveals an unexpected role for valosin-containing protein (VCP/p97) in early steps of infection. Loss of VCP inhibits a previously uncharacterized degradation of the nucleocapsid N protein. This inhibition derives from virus accumulation in early endosomes, suggesting a role for VCP in the maturation of virus-loaded endosomes. The several host factors identified in this study may provide avenues for targeted therapeutics. IMPORTANCE Coronaviruses are RNA viruses representing a real threat for public health, as evidenced by SARS and the emerging MERS. However, cellular factors required for their replication are poorly understood. Using genome-wide siRNA screening, we identified novel genes supporting infectious bronchitis virus (IBV) replication in human cells. The several host factors identified in this study may provide directions for future research on targeted therapeutics. PMID:26311884

  1. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development

    PubMed Central

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J.; Zhou, Xiang; Liu, Bang; Woods, Leah C. Solberg; Fuchs, Rita A.

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals. PMID:26722221

  2. Efficiently identifying genome-wide changes with next-generation sequencing data.

    PubMed

    Huang, Weichun; Umbach, David M; Vincent Jordan, Nicole; Abell, Amy N; Johnson, Gary L; Li, Leping

    2011-10-01

    We propose a new and effective statistical framework for identifying genome-wide differential changes in epigenetic marks with ChIP-seq data or gene expression with mRNA-seq data, and we develop a new software tool EpiCenter that can efficiently perform data analysis. The key features of our framework are: (i) providing multiple normalization methods to achieve appropriate normalization under different scenarios, (ii) using a sequence of three statistical tests to eliminate background regions and to account for different sources of variation and (iii) allowing adjustment for multiple testing to control false discovery rate (FDR) or family-wise type I error. Our software EpiCenter can perform multiple analytic tasks including: (i) identifying genome-wide epigenetic changes or differentially expressed genes, (ii) finding transcription factor binding sites and (iii) converting multiple-sample sequencing data into a single read-count data matrix. By simulation, we show that our framework achieves a low FDR consistently over a broad range of read coverage and biological variation. Through two real examples, we demonstrate the