Science.gov

Sample records for pooling-based genome-wide analysis

  1. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10⁻⁵) were significantly different between Prakritis, without any confounding effect of stratification, after 10⁶ permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  2. A novel statistic for genome-wide interaction analysis.

    PubMed

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interaction analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001 […] Genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search for significant interactions between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  3. Analysis of Heritability Using Genome-Wide Data.

    PubMed

    Hall, Jacob B; Bush, William S

    2016-10-11

    Most analyses of genome-wide association data consider each variant independently without considering or adjusting for the genetic background present in the rest of the genome. New approaches to genome analysis use representations of genomic sharing to better account for confounding factors like population stratification or to directly approximate heritability through the estimated sharing of individuals in a dataset. These approaches use mixed linear models, which relate genotypic sharing to phenotypic sharing, and rely on the efficient computation of genetic sharing among individuals in a dataset. This unit describes the principles and practical application of mixed models for the analysis of genome-wide association study data. © 2016 by John Wiley & Sons, Inc.
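As a rough illustration of the "genetic sharing among individuals" that mixed-model approaches rely on, the sketch below computes a genetic relationship matrix from allele counts. The function name, the 0/1/2 genotype coding, and the standardization by 2p(1 − p) are assumptions chosen for this example; they are a common convention, not a description of the method in the unit itself.

```python
from math import sqrt


def genetic_relationship_matrix(genotypes):
    """Return an n x n matrix of average standardized genotype sharing.

    genotypes: list of n individuals, each a list of m allele counts (0, 1 or 2).
    Hypothetical helper for illustration only.
    """
    n = len(genotypes)
    m = len(genotypes[0])
    standardized = [[0.0] * m for _ in range(n)]
    used = 0  # number of SNPs that actually vary in allele frequency
    for j in range(m):
        p = sum(row[j] for row in genotypes) / (2.0 * n)  # allele frequency
        variance = 2.0 * p * (1.0 - p)  # expected variance under Hardy-Weinberg
        if variance == 0.0:
            continue  # a fixed allele carries no information about sharing
        used += 1
        sd = sqrt(variance)
        for i in range(n):
            standardized[i][j] = (genotypes[i][j] - 2.0 * p) / sd
    if used == 0:
        raise ValueError("no polymorphic SNPs in input")
    # Entry (i, k) is the mean product of standardized genotypes over used SNPs.
    return [
        [sum(a * b for a, b in zip(standardized[i], standardized[k])) / used
         for k in range(n)]
        for i in range(n)
    ]


if __name__ == "__main__":
    toy = [[0, 1, 2], [2, 1, 0], [1, 1, 1]]  # three individuals, three SNPs
    for row in genetic_relationship_matrix(toy):
        print([round(value, 3) for value in row])
```

In a mixed linear model, a matrix of this kind would typically serve as the covariance structure of the random genetic effect; fitting the model itself is outside the scope of this sketch.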

  4. Genome-wide functional analysis in Candida albicans.

    PubMed

    Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J

    2017-02-08

    Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis have allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.

  5. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    PubMed

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  6. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). In particular, in the exhaustive genome-wide epistasis screening we identified an AD-associated interacting SNP pair from chromosomes 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis-free screening approach. PMID:24958192

  7. Weighted SNP set analysis in genome-wide association study.

    PubMed

    Dai, Hui; Zhao, Yang; Qian, Cheng; Cai, Min; Zhang, Ruyang; Chu, Minjie; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing; Chen, Feng

    2013-01-01

    Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously, addressing the disadvantages of single locus association analysis. Kernel machine based SNP set analysis, which borrows information from SNPs correlated with causal or tag SNPs, is more powerful than single locus analysis. Four types of kernel machine functions and a principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we extended PCA into a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in the Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.

  8. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use.

  9. Genome-Wide Analysis of DNA Methylation in Human Amnion

    PubMed Central

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  10. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  11. Genome-Wide Analysis of Human Metapneumovirus Evolution.

    PubMed

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs.

  12. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  13. Genome-wide analysis of condensin binding in Caenorhabditis elegans

    PubMed Central

    2013-01-01

    Background: Condensins are multi-subunit protein complexes that are essential for chromosome condensation during mitosis and meiosis, and play key roles in transcription regulation during interphase. Metazoans contain two condensins, I and II, which perform different functions and localize to different chromosomal regions. Caenorhabditis elegans contains a third condensin, IDC, that is targeted to and represses transcription of the X chromosome for dosage compensation. Results: To understand condensin binding and function, we performed ChIP-seq analysis of C. elegans condensins in mixed developmental stage embryos, which contain predominantly interphase nuclei. Condensins bind to a subset of active promoters, tRNA genes and putative enhancers. Expression analysis in kle-2-mutant larvae suggests that the primary effect of condensin II on transcription is repression. A DNA sequence motif, GCGC, is enriched at condensin II binding sites. A sequence extension of this core motif, AGGG, creates the condensin IDC motif. In addition to differences in recruitment that result in X-enrichment of condensin IDC and condensin II binding to all chromosomes, we provide evidence for a shared recruitment mechanism, as condensin IDC recruiter SDC-2 also recruits condensin II to the condensin IDC recruitment sites on the X. In addition, we found that condensin sites overlap extensively with the cohesin loader SCC-2, and that SDC-2 also recruits SCC-2 to the condensin IDC recruitment sites. Conclusions: Our results provide the first genome-wide view of metazoan condensin II binding in interphase, define putative recruitment motifs, and illustrate shared loading mechanisms for condensin IDC and condensin II. PMID:24125077

  14. Genome-wide analysis of differential RNA editing in epilepsy

    PubMed Central

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R.; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M.; Petretto, Enrico; Johnson, Michael R.

    2017-01-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine–temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including “neuron projection” and “seizures.” Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures. PMID:28250018

  15. Genome-wide analysis of differential RNA editing in epilepsy.

    PubMed

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M; Petretto, Enrico; Johnson, Michael R

    2017-03-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine-temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including "neuron projection" and "seizures." Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures.

  16. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  17. Genome-wide analysis links NFATC2 with asparaginase hypersensitivity

    PubMed Central

    Fernandez, Christian A.; Smith, Colton; Yang, Wenjian; Mullighan, Charles G.; Qu, Chunxu; Larsen, Eric; Bowman, W. Paul; Liu, Chengcheng; Ramsey, Laura B.; Chang, Tamara; Karol, Seth E.; Loh, Mignon L.; Raetz, Elizabeth A.; Winick, Naomi J.; Hunger, Stephen P.; Carroll, William L.; Jeha, Sima; Pui, Ching-Hon; Evans, William E.; Devidas, Meenakshi

    2015-01-01

    Asparaginase is used to treat acute lymphoblastic leukemia (ALL); however, hypersensitivity reactions can lead to suboptimal asparaginase exposure. Our objective was to use a genome-wide approach to identify loci associated with asparaginase hypersensitivity in children with ALL enrolled on St. Jude Children’s Research Hospital (SJCRH) protocols Total XIIIA (n = 154), Total XV (n = 498), and Total XVI (n = 271), or Children’s Oncology Group protocols POG 9906 (n = 222) and AALL0232 (n = 2163). Germline DNA was genotyped using the Affymetrix 500K, Affymetrix 6.0, or the Illumina Exome BeadChip array. In multivariate logistic regression, the intronic rs6021191 variant in nuclear factor of activated T cells 2 (NFATC2) had the strongest association with hypersensitivity (P = 4.1 × 10⁻⁸; odds ratio [OR] = 3.11). RNA-seq data available from 65 SJCRH ALL tumor samples and 52 Yoruba HapMap samples showed that samples carrying the rs6021191 variant had higher NFATC2 expression compared with noncarriers (P = 1.1 × 10⁻³ and 0.03, respectively). The top ranked nonsynonymous polymorphism was rs17885382 in HLA-DRB1 (P = 3.2 × 10⁻⁶; OR = 1.63), which is in near complete linkage disequilibrium with the HLA-DRB1*07:01 allele we previously observed in a candidate gene study. The strongest risk factors for asparaginase allergy are variants within genes regulating the immune response. PMID:25987655

  18. Genome-Wide Analysis of Polyadenylation Events in Schmidtea mediterranea

    PubMed Central

    Lakshmanan, Vairavan; Bansal, Dhiru; Kulkarni, Jahnavi; Poduval, Deepak; Krishna, Srikar; Sasidharan, Vidyanand; Anand, Praveen; Seshasayee, Aswin; Palakodeti, Dasaradhi

    2016-01-01

    In eukaryotes, 3′ untranslated regions (UTRs) play important roles in regulating posttranscriptional gene expression. The 3′UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A)-position profiling by sequencing (3P-Seq) to capture all poly(A) sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3′UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA), resulting either in an altered 3′UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration. PMID:27489207

  19. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677
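To make the "high significance threshold needed to correct for multiple testing" concrete, here is a minimal sketch of a plain Bonferroni correction for one-SNP-at-a-time testing. This is a swapped-in textbook technique for illustration, not the hierarchical procedure implemented in hierGWAS; the function names and the default alpha of 0.05 are assumptions.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value cutoff that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def significant_after_bonferroni(p_values, alpha=0.05):
    """Return indices of tests whose p-value passes the Bonferroni cutoff."""
    cutoff = bonferroni_threshold(alpha, len(p_values))
    return [i for i, p in enumerate(p_values) if p <= cutoff]


if __name__ == "__main__":
    # With one million marginal tests the cutoff shrinks to alpha / 1e6.
    print(bonferroni_threshold(0.05, 1_000_000))
    # Hypothetical p-values for three SNPs, for illustration only.
    print(significant_after_bonferroni([0.001, 0.04, 0.2]))
```

The cutoff shrinks in proportion to the number of tests, which is one way to see why marginal single-SNP analysis loses power as the number of genotyped SNPs grows.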

  20. Genome-wide analysis highlights genetic dilution in Algerian sheep.

    PubMed

    Gaouar, S B S; Lafri, M; Djaout, A; El-Bouyahiaoui, R; Bouri, A; Bouchatal, A; Maftah, A; Ciani, E; Da Silva, A B

    2017-03-01

    Algeria represents a reservoir of genetic diversity with local sheep breeds adapted to a large range of environments and showing specific features necessary to deal with harsh conditions. This remarkable diversity results from the traditional management of dryland by pastoralists over centuries. Most of these breeds are poorly productive, and economic pressure leads farmers to practice anarchic cross-breeding (that is, not carried out in the framework of selection plans) in the hope of improving animal conformation. In this study, eight of the nine local Algerian sheep breeds (D'men, Hamra, Ouled-Djellal, Rembi, Sidaoun, Tazegzawt, Berber and Barbarine) were investigated for the first time by genome-wide single-nucleotide polymorphism genotyping. At an international scale, Algerian sheep occupied an original position shaped by relations with African and European (particularly Italian) breeds. The strong genetic proximity with Caribbean and Brazilian breeds confirmed that the genetic make-up of these American breeds was largely influenced by the Atlantic slave trade. At a national scale, an alarming genetic dilution of the Berber (a primitive breed) and the Rembi was observed, as a consequence of uncontrolled mating practices with Ouled-Djellal. A similar, though less pronounced, phenomenon was also detected for the Barbarine, another ancestral breed. Genetic originality appeared to be better preserved in Tazegzawt, Hamra, D'men and Sidaoun. These breeds should be given high priority in the establishment of conservation plans to halt their progressive loss. For Berber and Barbarine, which also occur in neighboring countries, urgent concerted transnational actions are needed.

  1. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    DTIC Science & Technology

    2011-09-01

    ...were down-selected and successfully genotyped for whole genome (WG) single nucleotide polymorphism (SNP) markers by means of the Affymetrix Canine... Subject terms: military working dog, genome-wide association study, genetic marker, intelligence, Canine Intelligence Testing Protocol, classification technique, clustering analysis. Technical Report: September 2011

  2. Genome-wide analysis of promoter architecture in Drosophila melanogaster

    SciTech Connect

    Hoskins, Roger A.; Landolin, Jane M.; Brown, James B.; Sandler, Jeremy E.; Takahashi, Hazuki; Lassmann, Timo; Yu, Charles; Booth, Benjamin W.; Zhang, Dayu; Wan, Kenneth H.; Yang, Li; Boley, Nathan; Andrews, Justen; Kaufman, Thomas C.; Graveley, Brenton R.; Bickel, Peter J.; Carninci, Piero; Carlson, Joseph W.; Celniker, Susan E.

    2010-10-20

    Core promoters are critical regions for gene regulation in higher eukaryotes. However, the boundaries of promoter regions, the relative rates of initiation at the transcription start sites (TSSs) distributed within them, and the functional significance of promoter architecture remain poorly understood. We produced a high-resolution map of promoters active in the Drosophila melanogaster embryo by integrating data from three independent and complementary methods: 21 million cap analysis of gene expression (CAGE) tags, 1.2 million RNA ligase mediated rapid amplification of cDNA ends (RLMRACE) reads, and 50,000 cap-trapped expressed sequence tags (ESTs). We defined 12,454 promoters of 8037 genes. Our analysis indicates that, due to non-promoter-associated RNA background signal, previous studies have likely overestimated the number of promoter-associated CAGE clusters by fivefold. We show that TSS distributions form a complex continuum of shapes, and that promoters active in the embryo and adult have highly similar shapes in 95% of cases. This suggests that these distributions are generally determined by static elements such as local DNA sequence and are not modulated by dynamic signals such as histone modifications. Transcription factor binding motifs are differentially enriched as a function of promoter shape, and peaked promoter shape is correlated with both temporal and spatial regulation of gene expression. Our results contribute to the emerging view that core promoters are functionally diverse and control patterning of gene expression in Drosophila and mammals.

  3. Genome-wide analysis of TCP family in tobacco.

    PubMed

    Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

    2016-05-23

    The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in response to many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in the tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed to predict and analyze the gene structure, gene expression, phylogenetic relationships, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP genes also demonstrated that the majority of these genes play important roles in all tissues, while some genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.

  4. Genome-wide efficient mixed-model analysis for association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2012-06-17

    Linear mixed models have attracted considerable attention recently as a powerful and effective tool for accounting for population stratification and relatedness in genetic association tests. However, existing methods for exact computation of standard test statistics are computationally impractical for even moderate-sized genome-wide association studies. To address this issue, several approximate methods have been proposed. Here, we present an efficient exact method, which we refer to as genome-wide efficient mixed-model association (GEMMA), that makes approximations unnecessary in many contexts. This method is approximately n times faster than the widely used exact method known as efficient mixed-model association (EMMA), where n is the sample size, making exact genome-wide association analysis computationally practical for large numbers of individuals.

  5. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...

  6. Genome-wide Association Analysis of Kernel Weight in Hard Winter Wheat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Wheat kernel weight is an important and heritable component of wheat grain yield and a key predictor of flour extraction. Genome-wide association analysis was conducted to identify genomic regions associated with kernel weight and kernel weight environmental response in 8 trials of 299 hard winter ...

  7. Methods for meta-analysis of genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...
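
    The truncated abstract above does not say which combination method it evaluates. As an illustration only, the sketch below uses fixed-effect inverse-variance weighting, one common way to pool per-study effect estimates for a single marker; the function name and the input numbers are hypothetical.

```python
import math


def inverse_variance_meta(betas, standard_errors):
    """Fixed-effect inverse-variance meta-analysis for one marker.

    Returns the pooled effect, its standard error, the z statistic,
    and the two-sided p-value under a normal approximation.
    """
    if not betas or len(betas) != len(standard_errors):
        raise ValueError("betas and standard_errors must be non-empty and equal in length")
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p_value


if __name__ == "__main__":
    # Two hypothetical studies reporting the effect of the same marker.
    print(inverse_variance_meta([0.10, 0.20], [0.05, 0.05]))
```

    Pooling shrinks the standard error below that of either study alone, which is the gain in detection power that motivates increasing N through meta-analysis.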

  8. Meta-analysis of genome wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Given the importance of pork quality in the meat processing industry, genome-wide association studies were performed for eight meat quality traits and also, a meta-analysis (MA) of GWA was implemented combining independent results from pig populations. Data from three pig datasets (USMARC, Commercia...

  9. On the analysis of a repeated measure design in genome-wide association analysis.

    PubMed

    Lee, Young; Park, Suyeon; Moon, Sanghoon; Lee, Juyoung; Elston, Robert C; Lee, Woojoo; Won, Sungho

    2014-11-28

    Longitudinal data enable detection of the effect of aging/time, and a repeated measures design is statistically more efficient than a cross-sectional design if the correlations between repeated measurements are not large. In particular, when genotyping is more expensive than phenotyping, the collection of longitudinal data can be an efficient strategy for genetic association analysis. However, in spite of these advantages, genome-wide association studies (GWAS) with longitudinal data have rarely been analyzed in a way that takes the repeated measurements into account. In this report, we calculate the required sample size to achieve 80% power at the genome-wide significance level for both longitudinal and cross-sectional data, and compare their statistical efficiency. Furthermore, we analyzed the GWAS of eight phenotypes with three observations on each individual in the Korean Association Resource (KARE). A linear mixed model allowing for the correlations between observations for each individual was applied to analyze the longitudinal data, and linear regression was used to analyze the first observation on each individual as cross-sectional data. We found 12 novel genome-wide significant disease susceptibility loci that were then confirmed in the Health Examination cohort, as well as some significant interactions between age/sex and SNPs.

  10. Genome-wide meta-analysis of cerebral white matter hyperintensities in patients with stroke

    PubMed Central

    Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia

    2016-01-01

    Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10⁻⁶) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10⁻⁸; rs941898 [EVL], p = 4.0 × 10⁻⁸; rs962888 [C1QL1], p = 1.1 × 10⁻⁸; rs9515201 [COL4A2], p = 6.9 × 10⁻⁹). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333

  11. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality

    PubMed Central

    Kamvar, Zhian N.; Brooks, Jonah C.; Grünwald, Niklaus J.

    2015-01-01

    To gain a detailed understanding of how plant microbes evolve and adapt to hosts, pesticides, and other factors, knowledge of the population dynamics and evolutionary history of populations is crucial. Plant pathogen populations are often clonal or partially clonal which requires different analytical tools. With the advent of high throughput sequencing technologies, obtaining genome-wide population genetic data has become easier than ever before. We previously contributed the R package poppr specifically addressing issues with analysis of clonal populations. In this paper we provide several significant extensions to poppr with a focus on large, genome-wide SNP data. Specifically, we provide several new functionalities including the new function mlg.filter to define clone boundaries allowing for inspection and definition of what is a clonal lineage, minimum spanning networks with reticulation, a sliding-window analysis of the index of association, modular bootstrapping of any genetic distance, and analyses across any level of hierarchies. PMID:26113860

  12. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.

    PubMed

    Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dȩbniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M

    2015-09-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10⁻⁸), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology.

  13. Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma

    PubMed Central

    Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.

    2015-01-01

    Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10⁻⁸), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428

  14. Meta-analysis of sex-specific genome-wide association studies.

    PubMed

    Magi, Reedik; Lindgren, Cecilia M; Morris, Andrew P

    2010-12-01

    Despite the success of genome-wide association studies, much of the genetic contribution to complex human traits is still unexplained. One potential source of genetic variation that may contribute to this "missing heritability" is that which differs in magnitude and/or direction between males and females, which could result from sexual dimorphism in gene expression. Such sex-differentiated effects are common in model organisms, and are becoming increasingly evident in human complex traits through large-scale male- and female-specific meta-analyses. In this article, we review the methodology for meta-analysis of sex-specific genome-wide association studies, and propose a sex-differentiated test of association with quantitative or dichotomous traits, which allows for heterogeneity of allelic effects between males and females. We perform detailed simulations to compare the power of the proposed sex-differentiated meta-analysis with the more traditional "sex-combined" approach, which is ambivalent to gender. The results of this study highlight only a small loss in power for the sex-differentiated meta-analysis when the allelic effects of the causal variant are the same in males and females. However, over a range of models of heterogeneity in allelic effects between genders, our sex-differentiated meta-analysis strategy offers substantial gains in power, and thus has the potential to discover novel loci contributing effects to complex human traits with existing genome-wide association data.
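
    The contrast between the sex-combined and sex-differentiated tests described above can be sketched with summary statistics alone. The construction assumed here is a common one and is not taken from the abstract: the sex-combined test pools the male and female effects by inverse-variance weighting (1 degree of freedom), while the sex-differentiated test sums the two sex-specific chi-squared statistics (2 degrees of freedom). Function names and input values are hypothetical.

```python
import math


def chi2_sf_1df(x: float) -> float:
    """Upper-tail probability of a chi-squared variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(x / 2.0))


def chi2_sf_2df(x: float) -> float:
    """Upper-tail probability of a chi-squared variable with 2 degrees of freedom."""
    return math.exp(-x / 2.0)


def sex_combined_p(beta_m, se_m, beta_f, se_f):
    """Pool male and female effects by inverse variance and test the pooled effect."""
    w_m, w_f = 1.0 / se_m ** 2, 1.0 / se_f ** 2
    pooled = (w_m * beta_m + w_f * beta_f) / (w_m + w_f)
    return chi2_sf_1df(pooled ** 2 * (w_m + w_f))


def sex_differentiated_p(beta_m, se_m, beta_f, se_f):
    """Sum the sex-specific chi-squared statistics, allowing the effects to differ."""
    chi2 = (beta_m / se_m) ** 2 + (beta_f / se_f) ** 2
    return chi2_sf_2df(chi2)


if __name__ == "__main__":
    # Identical effects in both sexes: the combined test is slightly stronger.
    print(sex_combined_p(0.2, 0.05, 0.2, 0.05), sex_differentiated_p(0.2, 0.05, 0.2, 0.05))
    # Opposite effects: the combined test sees nothing, the differentiated test does.
    print(sex_combined_p(0.2, 0.05, -0.2, 0.05), sex_differentiated_p(0.2, 0.05, -0.2, 0.05))
```

    Under these assumptions the two hypothetical scenarios mirror the abstract's conclusion: a small cost when effects agree between the sexes and a large gain when they diverge.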

  15. Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder.

    PubMed

    Belmonte Mahon, Pamela; Pirooznia, Mehdi; Goes, Fernando S; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian L; Depaulo, J Raymond; Kelsoe, John R; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W; Craddock, Nick; Schulze, Thomas G; McMahon, Francis J; Potash, James B; Zandi, Peter P

    2011-04-01

    Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2,836 BP cases with information on sub-phenotypes and 2,744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles.

  16. A guide to genome-wide association analysis and post-analytic interrogation.

    PubMed

    Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S

    2015-12-10

    This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org.

  17. Genome-wide analysis of microRNA and mRNA expression signatures in cancer

    PubMed Central

    Li, Ming-hui; Fu, Sheng-bo; Xiao, Hua-sheng

    2015-01-01

    Cancer is an extremely diverse and complex disease that results from various genetic and epigenetic changes such as DNA copy-number variations, mutations, and aberrant mRNA and/or protein expression caused by abnormal transcriptional regulation. The expression profiles of certain microRNAs (miRNAs) and messenger RNAs (mRNAs) are closely related to cancer progression stages. In the past few decades, DNA microarray and next-generation sequencing techniques have been widely applied to identify miRNA and mRNA signatures for cancers on a genome-wide scale and have provided meaningful insights into cancer diagnosis, prognosis and personalized medicine. In this review, we summarize the progress in genome-wide analysis of miRNAs and mRNAs as cancer biomarkers, highlighting their diagnostic and prognostic roles. PMID:26299954

  18. Genome-wide association analysis identifies six new loci associated with forced vital capacity.

    PubMed

    Loth, Daan W; Soler Artigas, María; Gharib, Sina A; Wain, Louise V; Franceschini, Nora; Koch, Beate; Pottinger, Tess D; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P; James, Alan L; Huffman, Jennifer E; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K; Fall, Tove; Viñuela, Ana; Launer, Lenore J; Loehr, Laura R; Fornage, Myriam; Li, Guo; Wilk, Jemma B; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B; North, Kari E; Rudnicka, Alicja R; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F; Hastie, Nicholas D; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A; Pietiläinen, Kirsi H; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M; Wojczynski, Mary; Pouta, Anneli; Johansson, Asa; Wild, Sarah H; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G; Eiriksdottir, Gudny; Morrison, Alanna C; Rotter, Jerome I; Gao, Wei; Postma, Dirkje S; White, Wendy B; Rich, Stephen S; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J; Psaty, Bruce M; Lohman, Kurt; Burchard, Esteban G; Uitterlinden, André G; Garcia, Melissa; Joubert, Bonnie R; McArdle, Wendy L; Musk, A Bill; Hansel, Nadia; Heckbert, Susan R; Zgaga, Lina; van Meurs, Joyce B J; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah L; Zhao, Jing Hua; Rantanen, Taina; O'Connor, George T; Ripatti, Samuli; Scott, Rodney J; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C; Starr, John M; Wijmenga, Cisca; Minster, Ryan L; Lederer, David J; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P; Gläser, Sven; Hammond, Christopher J; Burkart, Kristin M; Beilby, John; Kritchevsky, Stephen B; Gudnason, Vilmundur; Hancock, Dana B; Williams, O Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F; Wjst, Matthias; Kim, Woo Jin; Porteous, David J; Scotland, Generation; Smith, Blair H; Viljanen, Anne; Heliövaara, Markku; Attia, John R; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J; Boezen, H Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F; Lind, Lars; Stricker, Bruno H; Teumer, Alexander; Spector, Timothy D; Melén, Erik; Peters, Marjolein J; Lange, Leslie A; Barr, R Graham; Bracke, Ken R; Verhamme, Fien M; Sung, Joohon; Hiemstra, Pieter S; Cassano, Patricia A; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P; Brusselle, Guy G; Tobin, Martin D; London, Stephanie J

    2014-07-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10⁻⁸) with FVC in or near EFEMP1, BMP6, MIR129-2-HSD17B12, PRDM11, WWOX and KCNJ2. Two loci previously associated with spirometric measures (GSTCD and PTCH1) were related to FVC. Newly implicated regions were followed up in samples from African-American, Korean, Chinese and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and the pathogenesis of restrictive lung disease.

  19. Genome-wide association analysis identifies six new loci associated with forced vital capacity

    PubMed Central

    Loth, Daan W.; Artigas, María Soler; Gharib, Sina A.; Wain, Louise V.; Franceschini, Nora; Koch, Beate; Pottinger, Tess; Smith, Albert Vernon; Duan, Qing; Oldmeadow, Chris; Lee, Mi Kyeong; Strachan, David P.; James, Alan L.; Huffman, Jennifer E.; Vitart, Veronique; Ramasamy, Adaikalavan; Wareham, Nicholas J.; Kaprio, Jaakko; Wang, Xin-Qun; Trochet, Holly; Kähönen, Mika; Flexeder, Claudia; Albrecht, Eva; Lopez, Lorna M.; de Jong, Kim; Thyagarajan, Bharat; Alves, Alexessander Couto; Enroth, Stefan; Omenaas, Ernst; Joshi, Peter K.; Fall, Tove; Viñuela, Ana; Launer, Lenore J.; Loehr, Laura R.; Fornage, Myriam; Li, Guo; Wilk, Jemma B.; Tang, Wenbo; Manichaikul, Ani; Lahousse, Lies; Harris, Tamara B.; North, Kari E.; Rudnicka, Alicja R.; Hui, Jennie; Gu, Xiangjun; Lumley, Thomas; Wright, Alan F.; Hastie, Nicholas D.; Campbell, Susan; Kumar, Rajesh; Pin, Isabelle; Scott, Robert A.; Pietiläinen, Kirsi H.; Surakka, Ida; Liu, Yongmei; Holliday, Elizabeth G.; Schulz, Holger; Heinrich, Joachim; Davies, Gail; Vonk, Judith M.; Wojczynski, Mary; Pouta, Anneli; Johansson, Åsa; Wild, Sarah H.; Ingelsson, Erik; Rivadeneira, Fernando; Völzke, Henry; Hysi, Pirro G.; Eiriksdottir, Gudny; Morrison, Alanna C.; Rotter, Jerome I.; Gao, Wei; Postma, Dirkje S.; White, Wendy B.; Rich, Stephen S.; Hofman, Albert; Aspelund, Thor; Couper, David; Smith, Lewis J.; Psaty, Bruce M.; Lohman, Kurt; Burchard, Esteban G.; Uitterlinden, André G.; Garcia, Melissa; Joubert, Bonnie R.; McArdle, Wendy L.; Musk, A. Bill; Hansel, Nadia; Heckbert, Susan R.; Zgaga, Lina; van Meurs, Joyce B.J.; Navarro, Pau; Rudan, Igor; Oh, Yeon-Mok; Redline, Susan; Jarvis, Deborah; Zhao, Jing Hua; Rantanen, Taina; O’Connor, George T.; Ripatti, Samuli; Scott, Rodney J.; Karrasch, Stefan; Grallert, Harald; Gaddis, Nathan C.; Starr, John M.; Wijmenga, Cisca; Minster, Ryan L.; Lederer, David J.; Pekkanen, Juha; Gyllensten, Ulf; Campbell, Harry; Morris, Andrew P.; Gläser, Sven; Hammond, Christopher J.; Burkart, Kristin M.; Beilby, John; Kritchevsky, Stephen B.; Gudnason, Vilmundur; Hancock, Dana B.; Williams, O. Dale; Polasek, Ozren; Zemunik, Tatijana; Kolcic, Ivana; Petrini, Marcy F.; Wjst, Matthias; Kim, Woo Jin; Porteous, David J.; Scotland, Generation; Smith, Blair H.; Viljanen, Anne; Heliövaara, Markku; Attia, John R.; Sayers, Ian; Hampel, Regina; Gieger, Christian; Deary, Ian J.; Boezen, H. Marike; Newman, Anne; Jarvelin, Marjo-Riitta; Wilson, James F.; Lind, Lars; Stricker, Bruno H.; Teumer, Alexander; Spector, Timothy D.; Melén, Erik; Peters, Marjolein J.; Lange, Leslie A.; Barr, R. Graham; Bracke, Ken R.; Verhamme, Fien M.; Sung, Joohon; Hiemstra, Pieter S.; Cassano, Patricia A.; Sood, Akshay; Hayward, Caroline; Dupuis, Josée; Hall, Ian P.; Brusselle, Guy G.; Tobin, Martin D.; London, Stephanie J.

    2014-01-01

    Forced vital capacity (FVC), a spirometric measure of pulmonary function, reflects lung volume and is used to diagnose and monitor lung diseases. We performed genome-wide association study meta-analysis of FVC in 52,253 individuals from 26 studies and followed up the top associations in 32,917 additional individuals of European ancestry. We found six new regions associated at genome-wide significance (P < 5 × 10⁻⁸) with FVC in or near EFEMP1, BMP6, MIR-129-2/HSD17B12, PRDM11, WWOX, and KCNJ2. Two (GSTCD and PTCH1) loci previously associated with spirometric measures were related to FVC. Newly implicated regions were followed-up in samples of African American, Korean, Chinese, and Hispanic individuals. We detected transcripts for all six newly implicated genes in human lung tissue. The new loci may inform mechanisms involved in lung development and pathogenesis of restrictive lung disease. PMID:24929828

  20. Five endometrial cancer risk loci identified through genome-wide association analysis.

    PubMed

    Cheng, Timothy H T; Thompson, Deborah J; O'Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Li, Mulin Jun; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-06-01

    We conducted a meta-analysis of three endometrial cancer genome-wide association studies (GWAS) and two follow-up phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five new risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1, near SIVA1). We also found a second independent 8q24.21 signal (rs17232730). Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r² = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103[T] allele that is protective in endometrial cancer suppressed gene expression in vitro, suggesting that regulation of the expression of KLF5, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer.
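
    The pairwise linkage disequilibrium value of 0.98 quoted between rs9600103 and rs11841589 is a squared correlation between alleles at two SNPs. As a minimal sketch, assuming phased haplotype counts are available, it can be computed as follows; the function name and the counts are hypothetical and are not data from the study.

```python
def ld_r_squared(n_AB: int, n_Ab: int, n_aB: int, n_ab: int) -> float:
    """Squared allelic correlation between two biallelic SNPs.

    Arguments are counts of the four phased haplotypes, where A/a are the
    alleles at the first SNP and B/b the alleles at the second.
    """
    total = n_AB + n_Ab + n_aB + n_ab
    if total == 0:
        raise ValueError("at least one haplotype is required")
    p_AB = n_AB / total
    p_A = (n_AB + n_Ab) / total
    p_B = (n_AB + n_aB) / total
    denominator = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denominator == 0:
        raise ValueError("both SNPs must be polymorphic")
    # D measures how far the AB haplotype frequency departs from independence.
    d = p_AB - p_A * p_B
    return d * d / denominator


if __name__ == "__main__":
    print(ld_r_squared(50, 0, 0, 50))    # alleles always travel together
    print(ld_r_squared(25, 25, 25, 25))  # alleles are independent
```

    A value close to 1 means either SNP is a near-perfect proxy for the other, which is why a functional follow-up can examine one variant in place of the lead GWAS SNP.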

  1. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    PubMed

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Repeated consumption measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biologically plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significantly enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies.
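The FDR<0.1 and FDR<0.2 criteria in the record above can be illustrated with a small sketch. The abstract does not say which FDR procedure was used; the Benjamini-Hochberg step-up adjustment below is an assumption, and the function name `bh_adjust` is hypothetical.

```python
def bh_adjust(pvalues):
    """Benjamini-Hochberg adjusted p-values (assumed procedure, see lead-in).

    Returns one adjusted value per input p-value, in the input order.
    """
    m = len(pvalues)
    # Indices of the p-values from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def classify(pvalues, significant=0.1, suggestive=0.2):
    """Label each test using the two thresholds quoted in the abstract."""
    labels = []
    for q in bh_adjust(pvalues):
        if q < significant:
            labels.append("significant")
        elif q < suggestive:
            labels.append("suggestive")
        else:
            labels.append("none")
    return labels
```

For example, `classify([0.001, 0.04, 0.5])` applies both cut-offs to three hypothetical marker p-values.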

  2. Genome-wide gene-environment interaction analysis for asbestos exposure in lung cancer susceptibility.

    PubMed

    Wei, Sheng; Wang, Li-E; McHugh, Michelle K; Han, Younghun; Xiong, Momiao; Amos, Christopher I; Spitz, Margaret R; Wei, Qingyi

    2012-08-01

    Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene-environment interactions. To determine gene-asbestos interactions in lung cancer risk, we conducted genome-wide gene-environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP, rs13383928, was 2.17 × 10−6, which did not reach genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90 × 10−5). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene-asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.

  3. Cooperative Genome-Wide Analysis Shows Increased Homozygosity in Early Onset Parkinson's Disease

    PubMed Central

    Nalls, Michael A.; Martinez, Maria; Schulte, Claudia; Holmans, Peter; Gasser, Thomas; Hardy, John; Singleton, Andrew B.; Wood, Nicholas W.; Brice, Alexis; Heutink, Peter; Williams, Nigel; Morris, Huw R.

    2012-01-01

    Parkinson's disease (PD) occurs in both familial and sporadic forms, and both monogenic and complex genetic factors have been identified. Early onset PD (EOPD) is particularly associated with autosomal recessive (AR) mutations, and three genes, PARK2, PARK7 and PINK1, have been found to carry mutations leading to AR disease. Since mutations in these genes account for less than 10% of EOPD patients, we hypothesized that further recessive genetic factors are involved in this disorder, which may appear in extended runs of homozygosity. We carried out genome wide SNP genotyping to look for extended runs of homozygosity (ROHs) in 1,445 EOPD cases and 6,987 controls. Logistic regression analyses showed an increased level of genomic homozygosity in EOPD cases compared to controls. These differences are larger for ROH of 9 Mb and above, where there is a more than three-fold increase in the proportion of cases carrying a ROH. These differences are not explained by occult recessive mutations at existing loci. Controlling for genome wide homozygosity in logistic regression analyses increased the differences between cases and controls, indicating that in EOPD cases ROHs do not simply relate to genome wide measures of inbreeding. Homozygosity at a locus on chromosome 19p13.3 was identified as being more common in EOPD cases as compared to controls. Sequencing analysis of genes and predicted transcripts within this locus failed to identify a novel mutation causing EOPD in our cohort. There is an increased rate of genome wide homozygosity in EOPD, as measured by an increase in ROHs. These ROHs are a signature of inbreeding and do not necessarily harbour disease-causing genetic variants. Although there might be other regions of interest apart from chromosome 19p13.3, we lack the power to detect them with this analysis. PMID:22427796
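As a rough illustration of what a run of homozygosity (ROH) is, the sketch below scans one individual's genotypes along a chromosome and reports stretches with no heterozygous call. It is not the procedure used in the record above, which the abstract does not describe. The function name `find_rohs`, the 0/1/2 genotype coding, the absence of missing calls, and the zero tolerance for heterozygous calls are all assumptions made for brevity.

```python
def find_rohs(positions, genotypes, min_length_bp):
    """Return (start_bp, end_bp) for each run of homozygous calls that
    spans at least min_length_bp.

    positions: base-pair coordinates, sorted ascending (assumed).
    genotypes: alternate-allele counts; 0 or 2 = homozygous, 1 = heterozygous.
    """
    runs = []
    run_start = None   # position of the first SNP in the current run
    last_hom = None    # position of the latest homozygous SNP seen
    for pos, g in zip(positions, genotypes):
        if g == 1:
            # A heterozygous call ends the current run, if any.
            if run_start is not None and last_hom - run_start >= min_length_bp:
                runs.append((run_start, last_hom))
            run_start = None
        else:
            if run_start is None:
                run_start = pos
            last_hom = pos
    # Close a run that reaches the end of the chromosome.
    if run_start is not None and last_hom - run_start >= min_length_bp:
        runs.append((run_start, last_hom))
    return runs
```

Passing `min_length_bp=9_000_000` would correspond to the 9 Mb cut-off mentioned in the abstract.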

  4. Genetic determinants of common epilepsies: a meta-analysis of genome-wide association studies

    PubMed Central

    2014-01-01

    Background: The epilepsies are a clinically heterogeneous group of neurological disorders. Despite strong evidence for heritability, genome-wide association studies have had little success in identification of risk loci associated with epilepsy, probably because of relatively small sample sizes and insufficient power. We aimed to identify risk loci through meta-analyses of genome-wide association studies for all epilepsy and the two largest clinical subtypes (genetic generalised epilepsy and focal epilepsy). Methods: We combined genome-wide association data from 12 cohorts of individuals with epilepsy and controls from population-based datasets. Controls were ethnically matched with cases. We phenotyped individuals with epilepsy into categories of genetic generalised epilepsy, focal epilepsy, or unclassified epilepsy. After standardised filtering for quality control and imputation to account for different genotyping platforms across sites, investigators at each site conducted a linear mixed-model association analysis for each dataset. Combining summary statistics, we conducted fixed-effects meta-analyses of all epilepsy, focal epilepsy, and genetic generalised epilepsy. We set the genome-wide significance threshold at p<1·66 × 10−8. Findings: We included 8696 cases and 26 157 controls in our analysis. Meta-analysis of the all-epilepsy cohort identified loci at 2q24.3 (p=8·71 × 10−10), implicating SCN1A, and at 4p15.1 (p=5·44 × 10−9), harbouring PCDH7, which encodes a protocadherin molecule not previously implicated in epilepsy. For the cohort of genetic generalised epilepsy, we noted a single signal at 2p16.1 (p=9·99 × 10−9), implicating VRK2 or FANCL. No single nucleotide polymorphism achieved genome-wide significance for focal epilepsy. Interpretation: This meta-analysis describes a new locus not previously implicated in epilepsy and provides further evidence about the genetic architecture of these disorders, with the

  5. Genome-wide analysis of long-term evolutionary domestication in Drosophila melanogaster

    PubMed Central

    Phillips, Mark A.; Long, Anthony D.; Greenspan, Zachary S.; Greer, Lee F.; Burke, Molly K.; Villeponteau, Bryant; Matsagas, Kennedy C.; Rizza, Cristina L.; Mueller, Laurence D.; Rose, Michael R.

    2016-01-01

    Experimental evolutionary genomics now allows biologists to test fundamental theories concerning the genetic basis of adaptation. We have conducted one of the longest laboratory evolution experiments with any sexually-reproducing metazoan, Drosophila melanogaster. We used next-generation resequencing data from this experiment to examine genome-wide patterns of genetic variation over an evolutionary time-scale that approaches 1,000 generations. We also compared measures of variation within and differentiation between our populations to simulations based on a variety of evolutionary scenarios. Our analysis yielded no clear evidence of hard selective sweeps, whereby natural selection acts to increase the frequency of a newly-arising mutation in a population until it becomes fixed. We do find evidence for selection acting on standing genetic variation, as independent replicate populations exhibit similar population-genetic dynamics, without obvious fixation of candidate alleles under selection. A hidden-Markov model test for selection also found widespread evidence for selection. We found more genetic variation genome-wide, and less differentiation between replicate populations genome-wide, than arose in any of our simulated evolutionary scenarios. PMID:28004838

  6. Host cell factors in HIV replication: meta-analysis of genome-wide studies.

    PubMed

    Bushman, Frederic D; Malani, Nirav; Fernandes, Jason; D'Orso, Iván; Cagney, Gerard; Diamond, Tracy L; Zhou, Honglin; Hazuda, Daria J; Espeseth, Amy S; König, Renate; Bandyopadhyay, Sourav; Ideker, Trey; Goff, Stephen P; Krogan, Nevan J; Frankel, Alan D; Young, John A T; Chanda, Sumit K

    2009-05-01

    We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an "encyclopedia" of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.

  7. Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia

    PubMed Central

    Ripke, Stephan; O'Dushlaine, Colm; Chambert, Kimberly; Moran, Jennifer L; Kähler, Anna K; Akterin, Susanne; Bergen, Sarah; Collins, Ann L; Crowley, James J; Fromer, Menachem; Kim, Yunjung; Lee, Sang Hong; Magnusson, Patrik KE; Sanchez, Nick; Stahl, Eli A; Williams, Stephanie; Wray, Naomi R; Xia, Kai; Bettella, Francesco; Børglum, Anders D; Bulik-Sullivan, Brendan K; Cormican, Paul; Craddock, Nick; de Leeuw, Christiaan; Durmishi, Naser; Gill, Michael; Golimbet, Vera; Hamshere, Marian L; Holmans, Peter; Hougaard, David M; Kendler, Kenneth S; Lin, Kuang; Morris, Derek W; Mors, Ole; Mortensen, Preben B; Neale, Benjamin M; O'Neill, Francis A; Owen, Michael J; Milovancevic, Milica Pejovic; Posthuma, Danielle; Powell, John; Richards, Alexander L; Riley, Brien P; Ruderfer, Douglas; Rujescu, Dan; Sigurdsson, Engilbert; Silagadze, Teimuraz; Smit, August B; Stefansson, Hreinn; Steinberg, Stacy; Suvisaari, Jaana; Tosato, Sarah; Verhage, Matthijs; Walters, James T; Bramon, Elvira; Corvin, Aiden P; O'Donovan, Michael C; Stefansson, Kari; Scolnick, Edward; Purcell, Shaun; McCarroll, Steve; Sklar, Pamela; Hultman, Christina M; Sullivan, Patrick F

    2013-01-01

    Schizophrenia is a heritable disorder with substantial public health impact. We conducted a multi-stage genome-wide association study (GWAS) for schizophrenia beginning with a Swedish national sample (5,001 cases, 6,243 controls) followed by meta-analysis with prior schizophrenia GWAS (8,832 cases, 12,067 controls) and finally by replication of SNPs in 168 genomic regions in independent samples (7,413 cases, 19,762 controls, and 581 trios). In total, 22 regions met genome-wide significance (14 novel and one previously implicated in bipolar disorder). The results strongly implicate calcium signaling in the etiology of schizophrenia, and include genome-wide significant results for CACNA1C and CACNB2 whose protein products interact. We estimate that ∼8,300 independent and predominantly common SNPs contribute to risk for schizophrenia and that these collectively account for most of its heritability. Common genetic variation plays an important role in the etiology of schizophrenia, and larger studies will allow more detailed understanding of this devastating disorder. PMID:23974872

  8. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  9. Power comparison of admixture mapping and direct association analysis in genome-wide association studies.

    PubMed

    Qin, Huaizhen; Zhu, Xiaofeng

    2012-04-01

    When dense markers are available, one can interrogate almost every common variant across the genome via imputation and single nucleotide polymorphism (SNP) tests, which has become routine in current genome-wide association studies (GWASs). As a complement, admixture mapping exploits the long-range linkage disequilibrium (LD) generated by admixture between genetically distinct ancestral populations. It is then questionable whether admixture mapping analysis is still necessary in detecting the disease-associated variants in admixed populations. We argue that admixture mapping is able to reduce the burden of massive comparisons in GWASs; it therefore can be a powerful tool to locate the disease variants with substantial allele frequency differences between ancestral populations. In this report we studied a two-stage approach, where candidate regions are defined by conducting admixture mapping at stage 1, and single-SNP association tests follow at stage 2 within the candidate regions defined at stage 1. We first established, by simulation, the genome-wide significance levels corresponding to the criteria used to define the candidate regions at stage 1. We next compared the power of the two-stage approach with direct association analysis. Our simulations suggest that the two-stage approach can be more powerful than the standard genome-wide association analysis when the allele frequency difference of a causal variant between ancestral populations is larger than 0.4. Our conclusion is consistent with a theoretical prediction by Risch and Tang ([2006] Am J Hum Genet 79:S254). Surprisingly, our study also suggests that power can be improved when we use less strict criteria to define the candidate regions at stage 1.

  10. Meta-analysis of genome-wide association studies of anxiety disorders

    PubMed Central

    Otowa, Takeshi; Hek, Karin; Lee, Minyoung; Byrne, Enda M.; Mirza, Saira S.; Nivard, Michel G.; Bigdeli, Timothy; Aggen, Steven H.; Adkins, Daniel; Wolen, Aaron; Fanous, Ayman; Keller, Matthew C.; Castelao, Enrique; Kutalik, Zoltan; Van der Auwera, Sandra; Homuth, Georg; Nauck, Matthias; Teumer, Alexander; Milaneschi, Yuri; Hottenga, Jouke-Jan; Direk, Nese; Hofman, Albert; Uitterlinden, Andre; Mulder, Cornelis L.; Henders, Anjali K.; Medland, Sarah E.; Gordon, Scott; Heath, Andrew C.; Madden, Pamela A.F.; Pergadia, Michelle; van der Most, Peter J.; Nolte, Ilja M.; van Oort, Floor V.A.; Hartman, Catharina A.; Oldehinkel, Albertine J.; Preisig, Martin; Grabe, Hans Jörgen; Middeldorp, Christel M.; Penninx, Brenda WJH; Boomsma, Dorret; Martin, Nicholas G.; Montgomery, Grant; Maher, Brion S.; van den Oord, Edwin J.; Wray, Naomi R.; Tiemeier, Henning; Hettema, John M.

    2015-01-01

    Anxiety disorders, namely generalized anxiety disorder, panic disorder, and phobias, are common, etiologically complex conditions with a partially genetic basis. Despite differing on diagnostic definitions based upon clinical presentation, anxiety disorders likely represent various expressions of an underlying common diathesis of abnormal regulation of basic threat-response systems. We conducted genome-wide association analyses in nine samples of European ancestry from seven large, independent studies. To identify genetic variants contributing to genetic susceptibility shared across interview-generated DSM-based anxiety disorders, we applied two phenotypic approaches: (1) comparisons between categorical anxiety disorder cases and super-normal controls, and (2) quantitative phenotypic factor scores derived from a multivariate analysis combining information across the clinical phenotypes. We used logistic and linear regression, respectively, to analyze the association between these phenotypes and genome-wide single nucleotide polymorphisms. Meta-analysis for each phenotype combined results across the nine samples for over 18 000 unrelated individuals. Each meta-analysis identified a different genome-wide significant region, with the following markers showing the strongest association: for case-control contrasts, rs1709393 located in an uncharacterized non-coding RNA locus on chromosomal band 3q12.3 (P=1.65×10−8); for factor scores, rs1067327 within CAMKMT encoding the calmodulin-lysine N-methyltransferase on chromosomal band 2p21 (P=2.86×10−9). Independent replication and further exploration of these findings are needed to more fully understand the role of these variants in risk and expression of anxiety disorders. PMID:26754954

  11. The complex genetics of gait speed: genome-wide meta-analysis approach

    PubMed Central

    Lunetta, Kathryn L.; Smith, Jennifer A.; Eicher, John D.; Vered, Rotem; Deelen, Joris; Arnold, Alice M.; Buchman, Aron S.; Tanaka, Toshiko; Faul, Jessica D.; Nethander, Maria; Fornage, Myriam; Adams, Hieab H.; Matteini, Amy M.; Callisaya, Michele L.; Smith, Albert V.; Yu, Lei; De Jager, Philip L.; Evans, Denis A.; Gudnason, Vilmundur; Hofman, Albert; Pattie, Alison; Corley, Janie; Launer, Lenore J.; Knopman, Davis S.; Parimi, Neeta; Turner, Stephen T.; Bandinelli, Stefania; Beekman, Marian; Gutman, Danielle; Sharvit, Lital; Mooijaart, Simon P.; Liewald, David C.; Houwing-Duistermaat, Jeanine J.; Ohlsson, Claes; Moed, Matthijs; Verlinden, Vincent J.; Mellström, Dan; van der Geest, Jos N.; Karlsson, Magnus; Hernandez, Dena; McWhirter, Rebekah; Liu, Yongmei; Thomson, Russell; Tranah, Gregory J.; Uitterlinden, Andre G.; Weir, David R.; Zhao, Wei; Starr, John M.; Johnson, Andrew D.; Ikram, M. Arfan; Bennett, David A.; Cummings, Steven R.; Deary, Ian J.; Harris, Tamara B.; Kardia, Sharon L. R.; Mosley, Thomas H.; Srikanth, Velandai K.; Windham, Beverly G.; Newman, Ann B.; Walston, Jeremy D.; Davies, Gail; Evans, Daniel S.; Slagboom, Eline P.; Ferrucci, Luigi; Kiel, Douglas P.; Murabito, Joanne M.; Atzmon, Gil

    2017-01-01

    Emerging evidence suggests that the basis for variation in late-life mobility is attributable, in part, to genetic factors, which may become increasingly important with age. Our objective was to systematically assess the contribution of genetic variation to gait speed in older individuals. We conducted a meta-analysis of gait speed GWASs in 31,478 older adults from 17 cohorts of the CHARGE consortium, and validated our results in 2,588 older adults from 4 independent studies. We followed our initial discoveries with network and eQTL analysis of candidate signals in tissues. The meta-analysis resulted in a list of 536 suggestive genome wide significant SNPs in or near 69 genes. Further interrogation with Pathway Analysis placed gait speed as a polygenic complex trait in five major networks. Subsequent eQTL analysis revealed several SNPs significantly associated with the expression of PRSS16, WDSUB1 and PTPRT, which in addition to the meta-analysis and pathway suggested that genetic effects on gait speed may occur through synaptic function and neuronal development pathways. No genome-wide significant signals for gait speed were identified from this moderately large sample of older adults, suggesting that more refined physical function phenotypes will be needed to identify the genetic basis of gait speed in aging. PMID:28077804

  12. Principal components analysis corrects for stratification in genome-wide association studies.

    PubMed

    Price, Alkes L; Patterson, Nick J; Plenge, Robert M; Weinblatt, Michael E; Shadick, Nancy A; Reich, David

    2006-08-01

    Population stratification (allele frequency differences between cases and controls due to systematic ancestry differences) can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.
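The idea in the record above (model ancestry with principal components, then remove it from both marker and phenotype before testing) can be sketched in a few lines. This is a toy illustration, not the authors' software: it uses a single leading axis found by power iteration, expects marker columns that are already mean-centered, and uses (n - k - 1) times the squared correlation of the adjusted values as the test statistic. All function names and these simplifications are assumptions of the sketch.

```python
import math


def leading_axis(centered, iters=100):
    """Leading axis of variation across samples, by power iteration.

    centered: samples x markers matrix whose marker columns are already
    mean-centered (assumption of this sketch).
    """
    n, m = len(centered), len(centered[0])
    # Sample-by-sample similarity matrix K = X X^T.
    kin = [[sum(centered[i][k] * centered[j][k] for k in range(m))
            for j in range(n)] for i in range(n)]
    v = [1.0 + 0.1 * i for i in range(n)]  # arbitrary deterministic start
    for _ in range(iters):
        w = [sum(kin[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            raise ValueError("genotype matrix has no variation")
        v = [x / norm for x in w]
    return v


def remove_axis(values, axis):
    """Center values and subtract their projection onto the ancestry axis."""
    mean = sum(values) / len(values)
    dev = [x - mean for x in values]
    slope = sum(d * a for d, a in zip(dev, axis)) / sum(a * a for a in axis)
    return [d - slope * a for d, a in zip(dev, axis)]


def adjusted_statistic(genotype, phenotype, axis, k=1):
    """(n - k - 1) * r^2 between ancestry-adjusted genotype and phenotype."""
    g = remove_axis(genotype, axis)
    y = remove_axis(phenotype, axis)
    sgg = sum(x * x for x in g)
    syy = sum(x * x for x in y)
    if sgg == 0.0 or syy == 0.0:
        return 0.0  # nothing left to test after adjustment
    sgy = sum(a * b for a, b in zip(g, y))
    return (len(g) - k - 1) * sgy * sgy / (sgg * syy)
```

A real analysis would use several axes and a numerical library for the eigen-decomposition; the pure-Python loop here only keeps the example self-contained.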

  13. MAVEN: a tool for visualization and functional analysis of genome-wide association results

    PubMed Central

    Narayanan, Kanchana; Li, Jing

    2010-01-01

    Summary: We describe the features and implementation of a web application tool named MAVEN—for Management, Analysis, Visualization and rEsults shariNg of genome-wide association data using cutting edge technologies. Main capabilities include user data uploading and management, queries using a variety of criteria, visualization of results, interactive selections and seamless integration of users' data with databases at the National Center for Biotechnology Information (NCBI) for functional annotations of single nucleotide polymorphisms (SNPs) and genes. Availability: http://cbc.case.edu/maven Contact: jingli@case.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19933166

  14. Pharmacogenetic meta-analysis of genome-wide association studies of LDL cholesterol response to statins.

    PubMed

    Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A; Barnes, Michael R; Li, Xiaohui; Warren, Helen R; Chasman, Daniel I; Zhou, Kaixin; Arsenault, Benoit J; Donnelly, Louise A; Wiggins, Kerri L; Avery, Christy L; Griffin, Paula; Feng, QiPing; Taylor, Kent D; Li, Guo; Evans, Daniel S; Smith, Albert V; de Keyser, Catherine E; Johnson, Andrew D; de Craen, Anton J M; Stott, David J; Buckley, Brendan M; Ford, Ian; Westendorp, Rudi G J; Slagboom, P Eline; Sattar, Naveed; Munroe, Patricia B; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C; O'Brien, Eoin; Shaw-Hawkins, Sue; Chen, Y-D Ida; Nickerson, Deborah A; Smith, Joshua D; Dubé, Marie Pierre; Boekholdt, S Matthijs; Hovingh, G Kees; Kastelein, John J P; McKeigue, Paul M; Betteridge, John; Neil, Andrew; Durrington, Paul N; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C; Rice, Kenneth; Smith, Nicholas L; Lumley, Thomas; Whitsel, Eric A; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S; O'Donnell, Christopher J; Vasan, Ramachandran S; Wei, Wei-Qi; Wilke, Russell A; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M; Stafford, Jeanette M; Ding, Jingzhong; Herrington, David M; Kritchevsky, Stephen B; Eiriksdottir, Gudny; Launer, Leonore J; Harris, Tamara B; Chu, Audrey Y; Giulianini, Franco; MacFadyen, Jean G; Barratt, Bryan J; Nyberg, Fredrik; Stricker, Bruno H; Uitterlinden, André G; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H; Ridker, Paul M; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C; Ballantyne, Christie M; Rotter, Jerome I; Adrienne Cupples, L; Psaty, Bruce M; Palmer, Colin N A; Tardif, Jean-Claude; Colhoun, Helen M; Hitman, Graham; Krauss, Ronald M; Wouter Jukema, J; Caulfield, Mark J

    2014-10-28

    Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response.

  15. Genome-wide linkage analysis of blood pressure under locus heterogeneity

    PubMed Central

    Yang, Xinqun; Wang, Kai; Huang, Jian; Vieland, Veronica J

    2003-01-01

    We describe a method for mapping quantitative trait loci that allows for locus heterogeneity. A genome-wide linkage analysis of blood pressure was performed using sib-pair data from the Framingham Heart Study. Evidence of linkage was found on four markers (GATA89G08, GATA23D06, GATA14E09, and 049xd2) at a significance level of 0.01. Two of them (GATA14E09 and 049xd2) seem to overlap with linkage signals reported previously, while the other two are not linked to any known signals. PMID:14975146

  16. Genome-wide Meta-analysis on the Sense of Smell Among US Older Adults.

    PubMed

    Dong, Jing; Yang, Jingyun; Tranah, Greg; Franceschini, Nora; Parimi, Neeta; Alkorta-Aranburu, Gorka; Xu, Zongli; Alonso, Alvaro; Cummings, Steven R; Fornage, Myriam; Huang, Xuemei; Kritchevsky, Stephen; Liu, Yongmei; London, Stephanie; Niu, Liang; Wilson, Robert S; De Jager, Philip L; Yu, Lei; Singleton, Andrew B; Harris, Tamara; Mosley, Thomas H; Pinto, Jayant M; Bennett, David A; Chen, Honglei

    2015-11-01

    Olfactory dysfunction is common among older adults and affects their safety, nutrition, quality of life, and mortality. More importantly, the decreased sense of smell is an early symptom of neurodegenerative diseases such as Parkinson disease (PD) and Alzheimer disease. However, the genetic determinants for the sense of smell have been poorly investigated. We here performed the first genome-wide meta-analysis on the sense of smell among 6252 US older adults of European descent from the Atherosclerosis Risk in Communities (ARIC) study, the Health, Aging, and Body Composition (Health ABC) study, and the Religious Orders Study and the Rush Memory and Aging Project (ROS/MAP). Genome-wide association study analysis was performed first by individual cohorts and then meta-analyzed using fixed-effect models with inverse variance weights. Although no SNPs reached genome-wide statistical significance, we identified 13 loci with suggestive evidence for an association with the sense of smell (Pmeta < 1 × 10). Of these, 2 SNPs at chromosome 17q21.31 (rs199443 in NSF, P = 3.02 × 10; and rs2732614 in KIAA1267-LRRC37A, P = 6.65 × 10) exhibited cis effects on the expression of microtubule-associated protein tau (MAPT, 17q21.31) in 447 frontal-cortex samples obtained postmortem and profiled by RNA-seq (P < 1 × 10). Gene-based and pathway-enrichment analyses further implicated MAPT in regulating the sense of smell in older adults. Similar results were obtained after excluding participants who reported a physician-diagnosed PD or use of PD medications. In conclusion, we provide preliminary evidence that the MAPT locus may play a role in regulating the sense of smell in older adults and therefore offer a potential genetic link between poor sense of smell and major neurodegenerative diseases.
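The record above combines per-cohort results "using fixed-effect models with inverse variance weights". A minimal sketch of that calculation for one SNP follows; the function name `fixed_effect_meta` and the two-sided normal p-value are assumptions of the sketch, not details taken from the study.

```python
import math


def fixed_effect_meta(betas, ses):
    """Inverse-variance-weighted fixed-effect estimate for one SNP.

    betas: per-cohort effect estimates; ses: their standard errors.
    Returns (pooled beta, pooled standard error, z score, two-sided p-value).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("need one standard error per effect estimate")
    # Each cohort is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se * se) for se in ses]
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)
    z = beta / se
    # Two-sided p-value under a standard normal reference distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return beta, se, z, p
```

With three cohorts, a call might look like `fixed_effect_meta([0.12, 0.08, 0.15], [0.05, 0.04, 0.07])`, where the numbers are made up for illustration.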

  17. Genome-wide analysis of a Wnt1-regulated transcriptional network implicates neurodegenerative pathways.

    PubMed

    Wexler, Eric M; Rosen, Ezra; Lu, Daning; Osborn, Gregory E; Martin, Elizabeth; Raybould, Helen; Geschwind, Daniel H

    2011-10-04

    Wnt proteins are critical to mammalian brain development and function. The canonical Wnt signaling pathway involves the stabilization and nuclear translocation of β-catenin; however, Wnt also signals through alternative, noncanonical pathways. To gain a systems-level, genome-wide view of Wnt signaling, we analyzed Wnt1-stimulated changes in gene expression by transcriptional microarray analysis in cultured human neural progenitor (hNP) cells at multiple time points over a 72-hour time course. We observed a widespread oscillatory-like pattern of changes in gene expression, involving components of both the canonical and the noncanonical Wnt signaling pathways. A higher-order, systems-level analysis that combined independent component analysis, waveform analysis, and mutual information-based network construction revealed effects on pathways related to cell death and neurodegenerative disease. Wnt effectors were tightly clustered with presenilin1 (PSEN1) and granulin (GRN), which cause dominantly inherited forms of Alzheimer's disease and frontotemporal dementia (FTD), respectively. We further explored a potential link between Wnt1 and GRN and found that Wnt1 decreased GRN expression by hNPs. Conversely, GRN knockdown increased WNT1 expression, demonstrating that Wnt and GRN reciprocally regulate each other. Finally, we provided in vivo validation of the in vitro findings by analyzing gene expression data from individuals with FTD. These unbiased and genome-wide analyses provide evidence for a connection between Wnt signaling and the transcriptional regulation of neurodegenerative disease genes.

  18. Genome-wide analysis of alternative splicing during human heart development

    NASA Astrophysics Data System (ADS)

    Wang, He; Chen, Yanmei; Li, Xinzhong; Chen, Guojun; Zhong, Lintao; Chen, Gangbing; Liao, Yulin; Liao, Wangjun; Bin, Jianping

    2016-10-01

    Alternative splicing (AS) drives determinative changes during mouse heart development. Recent high-throughput technological advancements have facilitated genome-wide AS analysis, but such analysis of the transition from the human foetal heart to the adult stage has not been reported. Here, we present a high-resolution global analysis of AS transitions between human foetal and adult hearts. RNA-sequencing data showed that extensive AS transitions occurred between human foetal and adult hearts, and AS events occurred more frequently in protein-coding genes than in long non-coding RNA (lncRNA). A significant difference in AS patterns was found between foetal and adult hearts. The predicted difference in AS events was further confirmed using quantitative reverse transcription-polymerase chain reaction analysis of human heart samples. Functional foetal-specific AS event analysis showed enrichment associated with cell proliferation-related pathways including cell cycle, whereas adult-specific AS events were associated with protein synthesis. Furthermore, 42.6% of foetal-specific AS events showed significant changes in gene expression levels between foetal and adult hearts. Genes exhibiting both foetal-specific AS and differential expression were highly enriched in cell cycle-associated functions. In conclusion, we provided a genome-wide profiling of AS transitions between foetal and adult hearts and proposed that AS transitions and differential gene expression may play determinative roles in human heart development.

  19. Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle

    PubMed Central

    2012-01-01

    Background Linkage disequilibrium (LD) between genes at linked or independent loci can occur at gametic and zygotic levels, known as gametic LD and zygotic LD, respectively. Gametic LD is well known for its roles in fine-scale mapping of quantitative trait loci, genomic selection and evolutionary inference. Less well studied are zygotic LD and its components, which can also be estimated directly from unphased SNPs. Results This study was set up to investigate the genome-wide extent and patterns of zygotic LD and its components in a crossbred cattle population using the genomic data from the Illumina BovineSNP50 beadchip. The animal population arose from repeated crossbreeding of multiple breeds and selection for growth and cow reproduction. The study showed that similar genomic structures in gametic and zygotic LD were observed, with zygotic LD decaying faster than gametic LD over marker distance. The trigenic and quadrigenic disequilibria were generally two- to three-fold smaller than the usual digenic disequilibria (gametic or composite LD). There was less power of testing for these high-order genic disequilibria than for the digenic disequilibria. The power estimates decreased with the distance between markers, though the decay trend is more obvious for the digenic disequilibria than for high-order disequilibria. Conclusions This study is the first major genome-wide survey of all non-allelic associations between pairs of SNPs in a cattle population. Such analysis allows us to assess the relative importance of gametic LD vs. all other non-allelic genic LDs regardless of whether or not the population is in HWE. The observed predominance of digenic LD (gametic or composite LD) coupled with insignificant high-order trigenic and quadrigenic disequilibria supports the current intensive focus on the use of high-density SNP markers for genome-wide association studies and genomic selection activities in the cattle population. PMID:22827586

  20. Using genome-wide complex trait analysis to quantify ‘missing heritability’ in Parkinson's disease

    PubMed Central

    Keller, Margaux F.; Saad, Mohamad; Bras, Jose; Bettella, Francesco; Nicolaou, Nayia; Simón-Sánchez, Javier; Mittag, Florian; Büchel, Finja; Sharma, Manu; Gibbs, J. Raphael; Schulte, Claudia; Moskvina, Valentina; Durr, Alexandra; Holmans, Peter; Kilarski, Laura L.; Guerreiro, Rita; Hernandez, Dena G.; Brice, Alexis; Ylikotila, Pauli; Stefánsson, Hreinn; Majamaa, Kari; Morris, Huw R.; Williams, Nigel; Gasser, Thomas; Heutink, Peter; Wood, Nicholas W.; Hardy, John; Martinez, Maria; Singleton, Andrew B.; Nalls, Michael A.

    2012-01-01

    Genome-wide association studies (GWASs) have been successful at identifying single-nucleotide polymorphisms (SNPs) highly associated with common traits; however, a great deal of the heritable variation associated with common traits remains unaccounted for within the genome. Genome-wide complex trait analysis (GCTA) is a statistical method that applies a linear mixed model to estimate phenotypic variance of complex traits explained by genome-wide SNPs, including those not associated with the trait in a GWAS. We applied GCTA to 8 cohorts containing 7096 case and 19 455 control individuals of European ancestry in order to examine the missing heritability present in Parkinson's disease (PD). We meta-analyzed our initial results to produce robust heritability estimates for PD types across cohorts. Our results identify 27% (95% CI 17–38, P = 8.08E − 08) phenotypic variance associated with all types of PD, 15% (95% CI −0.2 to 33, P = 0.09) phenotypic variance associated with early-onset PD and 31% (95% CI 17–44, P = 1.34E − 05) phenotypic variance associated with late-onset PD. This is a substantial increase from the genetic variance identified by top GWAS hits alone (between 3 and 5%) and indicates there are substantially more risk loci to be identified. Our results suggest that although GWASs are a useful tool in identifying the most common variants associated with complex disease, a great deal of common variants of small effect remain to be discovered. PMID:22892372
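
    The abstract above pools per-cohort heritability estimates by meta-analysis but does not say which scheme was used. As a minimal sketch, assuming a fixed-effect inverse-variance weighting (a common choice, not confirmed by the source), the pooling step could look like the following. The helper names and the per-cohort values in the usage lines are hypothetical; only the 17–38 interval is taken from the abstract, and converting it to a standard error assumes a symmetric normal interval.

```python
import math

Z_95 = 1.96  # normal quantile for a two-sided 95% interval


def se_from_ci(lower, upper, z=Z_95):
    """Back out a standard error from a symmetric confidence interval."""
    return (upper - lower) / (2.0 * z)


def inverse_variance_meta(estimates, std_errors, z=Z_95):
    """Fixed-effect inverse-variance pooling of per-cohort estimates.

    Returns (pooled_estimate, pooled_se, (ci_lower, ci_upper)).
    """
    if len(estimates) != len(std_errors) or not estimates:
        raise ValueError("need one standard error per estimate")
    weights = [1.0 / se ** 2 for se in std_errors]
    total = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total
    pooled_se = math.sqrt(1.0 / total)
    return pooled, pooled_se, (pooled - z * pooled_se, pooled + z * pooled_se)


# Standard error implied by the reported interval for all PD (17 to 38 percent).
reported_se = se_from_ci(17.0, 38.0)

# Hypothetical per-cohort heritability estimates (percent) and standard errors.
pooled, pooled_se, ci = inverse_variance_meta([20.0, 40.0], [10.0, 10.0])
```

    Precise cohorts dominate the pooled value under this weighting; a random-effects model would be the usual alternative if the cohorts were thought to be heterogeneous.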

  1. Genome-wide genetic diversity, population structure and admixture analysis in African and Asian cattle breeds.

    PubMed

    Edea, Z; Bhuiyan, M S A; Dessie, T; Rothschild, M F; Dadi, H; Kim, K S

    2015-02-01

    Knowledge about genetic diversity and population structure is useful for designing effective strategies to improve the production, management and conservation of farm animal genetic resources. Here, we present a comprehensive genome-wide analysis of genetic diversity, population structure and admixture based on 244 animals sampled from 10 cattle populations in Asia and Africa and genotyped for 69,903 autosomal single-nucleotide polymorphisms (SNPs) mainly derived from the indicine breed. Principal component analysis, STRUCTURE and distance analysis from high-density SNP data clearly revealed that the largest genetic difference occurred between the two domestic lineages (taurine and indicine), whereas Ethiopian cattle populations represent a mosaic of the humped zebu and taurine. Estimation of the genetic influence of zebu and taurine revealed that Ethiopian cattle were characterized by considerable levels of introgression from South Asian zebu, whereas Bangladeshi populations shared very low taurine ancestry. The relationships among Ethiopian cattle populations reflect their history of origin and admixture rather than phenotype-based distinctions. The high within-individual genetic variability observed in Ethiopian cattle represents an untapped opportunity for adaptation to changing environments and for implementation of within-breed genetic improvement schemes. Our results provide a basis for future applications of genome-wide SNP data to exploit the unique genetic makeup of indigenous cattle breeds and to facilitate their improvement and conservation.

  2. Genome-Wide Analysis of Homeobox Gene Family in Legumes: Identification, Gene Duplication and Expression Profiling

    PubMed Central

    Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development. PMID:25745864
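
    The Ka/Ks argument in the abstract above rests on a simple rule: a ratio of nonsynonymous (Ka) to synonymous (Ks) substitution rates well below 1 is read as purifying selection, near 1 as neutral evolution, and well above 1 as positive selection. The sketch below illustrates that rule only; the function name, the tolerance band around 1 and the example values are assumptions for illustration, and the Ka and Ks values themselves would come from a separate substitution-rate estimate for each duplicated gene pair.

```python
def classify_selection(ka, ks, tolerance=0.05):
    """Interpret a Ka/Ks ratio for one duplicated gene pair.

    ka, ks: nonsynonymous and synonymous substitution rates.
    tolerance: hypothetical band around 1.0 treated as "neutral".
    Returns (ratio, label).
    """
    if ks <= 0:
        raise ValueError("Ks must be positive to form a ratio")
    ratio = ka / ks
    if ratio < 1.0 - tolerance:
        label = "purifying"
    elif ratio > 1.0 + tolerance:
        label = "positive"
    else:
        label = "neutral"
    return ratio, label


# Hypothetical gene pair: few nonsynonymous changes relative to synonymous ones.
ratio, label = classify_selection(ka=0.02, ks=0.20)
```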

  3. Genome-wide association analysis demonstrates the highly polygenic character of age-related hearing impairment

    PubMed Central

    Fransen, Erik; Bonneux, Sarah; Corneveaux, Jason J; Schrauwen, Isabelle; Di Berardino, Federica; White, Cory H; Ohmen, Jeffrey D; Van de Heyning, Paul; Ambrosetti, Umberto; Huentelman, Matthew J; Van Camp, Guy; Friedman, Rick A

    2015-01-01

    We performed a genome-wide association study (GWAS) to identify the genes responsible for age-related hearing impairment (ARHI), the most common form of hearing impairment in the elderly. Analysis of common variants, with and without adjustment for stratification and environmental covariates, rare variants and interactions, as well as gene-set enrichment analysis, showed no variants with genome-wide significance. No evidence for replication of any previously reported genes was found. A study of the genetic architecture indicates for the first time that ARHI is highly polygenic in nature, with probably no major genes involved. The phenotype depends on the aggregated effect of a large number of SNPs, of which the individual effects are undetectable in a modestly powered GWAS. We estimated that 22% of the variance in our data set can be explained by the collective effect of all genotyped SNPs. A score analysis showed a modest enrichment in causative SNPs among the SNPs with a P-value below 0.01. PMID:24939585

  4. Genome-wide analysis of homeobox gene family in legumes: identification, gene duplication and expression profiling.

    PubMed

    Bhattacharjee, Annapurna; Ghangal, Rajesh; Garg, Rohini; Jain, Mukesh

    2015-01-01

    Homeobox genes encode transcription factors that are known to play a major role in different aspects of plant growth and development. In the present study, we identified homeobox genes belonging to 14 different classes in five legume species, including chickpea, soybean, Medicago, Lotus and pigeonpea. The characteristic differences within homeodomain sequences among various classes of homeobox gene family were quite evident. Genome-wide expression analysis using publicly available datasets (RNA-seq and microarray) indicated that homeobox genes are differentially expressed in various tissues/developmental stages and under stress conditions in different legumes. We validated the differential expression of selected chickpea homeobox genes via quantitative reverse transcription polymerase chain reaction. Genome duplication analysis in soybean indicated that segmental duplication has significantly contributed in the expansion of homeobox gene family. The Ka/Ks ratio of duplicated homeobox genes in soybean showed that several members of this family have undergone purifying selection. Moreover, expression profiling indicated that duplicated genes might have been retained due to sub-functionalization. The genome-wide identification and comprehensive gene expression profiling of homeobox gene family members in legumes will provide opportunities for functional analysis to unravel their exact role in plant growth and development.

  5. [Analysis of population stratification using random SNPs in genome-wide association studies].

    PubMed

    Cao, Zong-Fu; Ma, Chuan-Xiang; Wang, Lei; Cai, Bin

    2010-09-01

    Since population genetic structure can increase the false-positive rate in genome-wide association studies (GWAS) for complex diseases, the effect of population stratification should be taken into account in GWAS. However, the effect of using randomly selected SNPs in population stratification analysis has not been determined. In this study, based on the genotype data generated on the Genome-Wide Human SNP Array 6.0 from unrelated individuals of HapMap Phase 2, we randomly selected SNPs that were evenly distributed across the whole genome, and acquired Ancestry Informative Markers (AIMs) by the f-value method and the allelic Fisher exact test. F-statistics and STRUCTURE analysis based on the different selected sets of SNPs were used to evaluate how well they distinguished the populations from HapMap Phase 3. We found that randomly selected SNPs that were evenly distributed across the whole genome could be used to identify the population structure. This study further indicated that more than 3 000 randomly selected SNPs that were evenly distributed across the whole genome could substitute for AIMs in population stratification analysis when no AIMs were available for specific populations.
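
    The abstract above uses F-statistics to judge how well a SNP set separates populations, without giving the estimator. As a rough illustration only, the sketch below computes Wright's F_ST for one biallelic SNP as (H_T − H_S) / H_T from per-population allele frequencies, assuming equal-sized populations and no sample-size correction. The function name and the example frequencies are hypothetical, and the study may well have used a different estimator.

```python
def wright_fst(allele_freqs):
    """Wright's F_ST for one biallelic SNP.

    allele_freqs: frequency of the same allele in each population
    (populations are weighted equally, an assumption of this sketch).
    """
    if len(allele_freqs) < 2:
        raise ValueError("need at least two populations")
    k = len(allele_freqs)
    p_bar = sum(allele_freqs) / k
    # Expected heterozygosity of the pooled population.
    h_total = 2.0 * p_bar * (1.0 - p_bar)
    if h_total == 0.0:
        return 0.0  # allele fixed or absent everywhere: no differentiation
    # Mean expected heterozygosity within populations.
    h_sub = sum(2.0 * p * (1.0 - p) for p in allele_freqs) / k
    return (h_total - h_sub) / h_total


# Hypothetical SNPs: one strongly differentiated, one not differentiated.
fst_informative = wright_fst([0.1, 0.9])
fst_uninformative = wright_fst([0.3, 0.3])
```

    Averaging such per-SNP values over a randomly chosen, evenly spaced SNP set gives one simple way to compare that set against a panel of AIMs.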

  6. A genome-wide analysis of putative functional and exonic variation associated with extremely high intelligence

    PubMed Central

    Spain, S L; Pedroso, I; Kadeva, N; Miller, M B; Iacono, W G; McGue, M; Stergiakouli, E; Smith, G D; Putallaz, M; Lubinski, D; Meaburn, E L; Plomin, R; Simpson, M A

    2016-01-01

    Although individual differences in intelligence (general cognitive ability) are highly heritable, molecular genetic analyses to date have had limited success in identifying specific loci responsible for its heritability. This study is the first to investigate exome variation in individuals of extremely high intelligence. Under the quantitative genetic model, sampling from the high extreme of the distribution should provide increased power to detect associations. We therefore performed a case–control association analysis with 1409 individuals drawn from the top 0.0003 (IQ >170) of the population distribution of intelligence and 3253 unselected population-based controls. Our analysis focused on putative functional exonic variants assayed on the Illumina HumanExome BeadChip. We did not observe any individual protein-altering variants that are reproducibly associated with extremely high intelligence and within the entire distribution of intelligence. Moreover, no significant associations were found for multiple rare alleles within individual genes. However, analyses using genome-wide similarity between unrelated individuals (genome-wide complex trait analysis) indicate that the genotyped functional protein-altering variation yields a heritability estimate of 17.4% (s.e. 1.7%) based on a liability model. In addition, investigation of nominally significant associations revealed fewer rare alleles associated with extremely high intelligence than would be expected under the null hypothesis. This observation is consistent with the hypothesis that rare functional alleles are more frequently detrimental than beneficial to intelligence. PMID:26239293

  7. Quantifying the heritability of glioma using genome-wide complex trait analysis

    PubMed Central

    Kinnersley, Ben; Mitchell, Jonathan S.; Gousias, Konstantinos; Schramm, Johannes; Idbaih, Ahmed; Labussière, Marianne; Marie, Yannick; Rahimian, Amithys; Wichmann, H.-Erich; Schreiber, Stefan; Hoang-Xuan, Khe; Delattre, Jean-Yves; Nöthen, Markus M.; Mokhtari, Karima; Lathrop, Mark; Bondy, Melissa; Simon, Matthias; Sanson, Marc; Houlston, Richard S.

    2015-01-01

    Genome-wide association studies (GWAS) have successfully identified a number of common single-nucleotide polymorphisms (SNPs) influencing glioma risk. While these SNPs only explain a small proportion of the genetic risk it is unclear how much is left to be detected by other, yet to be identified, common SNPs. Therefore, we applied Genome-Wide Complex Trait Analysis (GCTA) to three GWAS datasets totalling 3,373 cases and 4,571 controls and performed a meta-analysis to estimate the heritability of glioma. Our results identify heritability estimates of 25% (95% CI: 20–31%, P = 1.15 × 10^−17) for all forms of glioma - 26% (95% CI: 17–35%, P = 1.05 × 10^−8) for glioblastoma multiforme (GBM) and 25% (95% CI: 17–32%, P = 1.26 × 10^−10) for non-GBM tumors. This is a substantial increase from the genetic variance identified by the currently identified GWAS risk loci (~6% of common heritability), indicating that most of the heritable risk attributable to common genetic variants remains to be identified. PMID:26625949

  8. Genome-wide identification and analysis of the MADS-box gene family in apple.

    PubMed

    Tian, Yi; Dong, Qinglong; Ji, Zhirui; Chi, Fumei; Cong, Peihua; Zhou, Zongshan

    2015-01-25

    The MADS-box gene family is one of the most widely studied families in plants and has diverse developmental roles in flower pattern formation, gametophyte cell division and fruit differentiation. Although the genome-wide analysis of this family has been performed in some species, little is known regarding MADS-box genes in apple (Malus domestica). In this study, 146 MADS-box genes were identified in the apple genome and were phylogenetically clustered into six subgroups (MIKC(c), MIKC*, Mα, Mβ, Mγ and Mδ) with the MADS-box genes from Arabidopsis and rice. The predicted apple MADS-box genes were distributed across all 17 chromosomes at different densities. Additionally, the MADS-box domain, exon length, gene structure and motif compositions of the apple MADS-box genes were analysed. Moreover, the expression of all of the apple MADS-box genes was analysed in the root, stem, leaf, flower tissues and five stages of fruit development. All of the apple MADS-box genes, with the exception of some genes in each group, were expressed in at least one of the tissues tested, which indicates that the MADS-box genes are involved in various aspects of the physiological and developmental processes of the apple. To the best of our knowledge, this report describes the first genome-wide analysis of the apple MADS-box gene family, and the results should provide valuable information for understanding the classification, cloning and putative functions of this family.

  9. Genome Wide Analysis of Fatty Acid Desaturation and Its Response to Temperature

    PubMed Central

    Menard, Guillaume N.; Moreno, Jose Martin; Bryant, Fiona M.; Munoz-Azcarate, Olaya; Hassani-Pak, Keywan; Kurup, Smita

    2017-01-01

    Plants modify the polyunsaturated fatty acid content of their membrane and storage lipids in order to adapt to changes in temperature. In developing seeds, this response is largely controlled by the activities of the microsomal ω-6 and ω-3 fatty acid desaturases, FAD2 and FAD3. Although temperature regulation of desaturation has been studied at the molecular and biochemical levels, the genetic control of this trait is poorly understood. Here, we have characterized the response of Arabidopsis (Arabidopsis thaliana) seed lipids to variation in ambient temperature and found that heat inhibits both ω-6 and ω-3 desaturation in phosphatidylcholine, leading to a proportional change in triacylglycerol composition. Analysis of the 19 parental accessions of the multiparent advanced generation intercross (MAGIC) population showed that significant natural variation exists in the temperature responsiveness of ω-6 desaturation. A combination of quantitative trait locus (QTL) analysis and genome-wide association studies (GWAS) using the MAGIC population suggests that ω-6 desaturation is largely controlled by cis-acting sequence variants in the FAD2 5′ untranslated region intron that determine the expression level of the gene. However, the temperature responsiveness of ω-6 desaturation is controlled by a separate QTL on chromosome 2. The identity of this locus is unknown, but genome-wide association studies identified potentially causal sequence variants within ∼40 genes in an ∼450-kb region of the QTL. PMID:28108698

  10. Genome-wide Comparative Analysis of Atopic Dermatitis and Psoriasis Gives Insight into Opposing Genetic Mechanisms

    PubMed Central

    Baurecht, Hansjörg; Hotze, Melanie; Brand, Stephan; Büning, Carsten; Cormican, Paul; Corvin, Aiden; Ellinghaus, David; Ellinghaus, Eva; Esparza-Gordillo, Jorge; Fölster-Holst, Regina; Franke, Andre; Gieger, Christian; Hubner, Norbert; Illig, Thomas; Irvine, Alan D.; Kabesch, Michael; Lee, Young A.E.; Lieb, Wolfgang; Marenholz, Ingo; McLean, W.H. Irwin; Morris, Derek W.; Mrowietz, Ulrich; Nair, Rajan; Nöthen, Markus M.; Novak, Natalija; O’Regan, Grainne M.; Schreiber, Stefan; Smith, Catherine; Strauch, Konstantin; Stuart, Philip E.; Trembath, Richard; Tsoi, Lam C.; Weichenthal, Michael; Barker, Jonathan; Elder, James T.; Weidinger, Stephan; Cordell, Heather J.; Brown, Sara J.

    2015-01-01

    Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing immune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data from >19,000 individuals and methodologies developed from meta-analysis, we have identified opposing risk alleles at shared loci as well as independent disease-specific loci within the epidermal differentiation complex (chromosome 1q21.3), the Th2 locus control region (chromosome 5q31.1), and the major histocompatibility complex (chromosome 6p21–22). We further identified previously unreported pleiotropic alleles with opposing effects on atopic dermatitis and psoriasis risk in PRKRA and ANXA6/TNIP1. In contrast, there was no evidence for shared loci with effects operating in the same direction on both diseases. Our results show that atopic dermatitis and psoriasis have distinct genetic mechanisms with opposing effects in shared pathways influencing epidermal differentiation and immune response. The statistical analysis methods developed in the conduct of this study have produced additional insight from previously published data sets. The approach is likely to be applicable to the investigation of the genetic basis of other complex traits with overlapping and distinct clinical features. PMID:25574825

  11. Five endometrial cancer risk loci identified through genome-wide association analysis

    PubMed Central

    O’Mara, Tracy A; Painter, Jodie N; Glubb, Dylan M; Flach, Susanne; Lewis, Annabelle; French, Juliet D; Freeman-Mills, Luke; Church, David; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley; Webb, Penelope M; Attia, John; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Henders, Anjali K; Martin, Nicholas G; Montgomery, Grant W; Nyholt, Dale R; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Dennis, Joe; Fasching, Peter A; Beckmann, Matthias W; Hein, Alexander; Ekici, Arif B; Hall, Per; Czene, Kamila; Darabi, Hatef; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo; Amant, Frederic; Schrauwen, Stefanie; Zhao, Hui; Lambrechts, Diether; Depreeuw, Jeroen; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica MJ; Ashton, Katie; Otton, Geoffrey; Proietto, Tony; Liu, Tao; Mints, Miriam; Tham, Emma; Consortium, CHIBCHA; Jun Li, Mulin; Yip, Shun H; Wang, Junwen; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Dunlop, Malcolm; Houlston, Richard; Palles, Claire; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Cunningham, Julie M; Pharoah, Paul D P; Dunning, Alison M; Edwards, Stacey L; Easton, Douglas F; Tomlinson, Ian; Spurdle, Amanda B

    2016-01-01

    We conducted a meta-analysis of three endometrial cancer GWAS and two replication phases totaling 7,737 endometrial cancer cases and 37,144 controls of European ancestry. Genome-wide imputation and meta-analysis identified five novel risk loci of genome-wide significance at likely regulatory regions on chromosomes 13q22.1 (rs11841589, near KLF5), 6q22.31 (rs13328298, in LOC643623 and near HEY2 and NCOA7), 8q24.21 (rs4733613, telomeric to MYC), 15q15.1 (rs937213, in EIF2AK4, near BMF) and 14q32.33 (rs2498796, in AKT1 near SIVA1). A second independent 8q24.21 signal (rs17232730) was found. Functional studies of the 13q22.1 locus showed that rs9600103 (pairwise r^2 = 0.98 with rs11841589) is located in a region of active chromatin that interacts with the KLF5 promoter region. The rs9600103-T endometrial cancer protective allele suppressed gene expression in vitro, suggesting that regulation of KLF5 expression, a gene linked to uterine development, is implicated in tumorigenesis. These findings provide enhanced insight into the genetic and biological basis of endometrial cancer. PMID:27135401

  12. Genome-wide comparative analysis of atopic dermatitis and psoriasis gives insight into opposing genetic mechanisms.

    PubMed

    Baurecht, Hansjörg; Hotze, Melanie; Brand, Stephan; Büning, Carsten; Cormican, Paul; Corvin, Aiden; Ellinghaus, David; Ellinghaus, Eva; Esparza-Gordillo, Jorge; Fölster-Holst, Regina; Franke, Andre; Gieger, Christian; Hubner, Norbert; Illig, Thomas; Irvine, Alan D; Kabesch, Michael; Lee, Young A E; Lieb, Wolfgang; Marenholz, Ingo; McLean, W H Irwin; Morris, Derek W; Mrowietz, Ulrich; Nair, Rajan; Nöthen, Markus M; Novak, Natalija; O'Regan, Grainne M; Schreiber, Stefan; Smith, Catherine; Strauch, Konstantin; Stuart, Philip E; Trembath, Richard; Tsoi, Lam C; Weichenthal, Michael; Barker, Jonathan; Elder, James T; Weidinger, Stephan; Cordell, Heather J; Brown, Sara J

    2015-01-08

    Atopic dermatitis and psoriasis are the two most common immune-mediated inflammatory disorders affecting the skin. Genome-wide studies demonstrate a high degree of genetic overlap, but these diseases have mutually exclusive clinical phenotypes and opposing immune mechanisms. Despite their prevalence, atopic dermatitis and psoriasis very rarely co-occur within one individual. By utilizing genome-wide association study and ImmunoChip data from >19,000 individuals and methodologies developed from meta-analysis, we have identified opposing risk alleles at shared loci as well as independent disease-specific loci within the epidermal differentiation complex (chromosome 1q21.3), the Th2 locus control region (chromosome 5q31.1), and the major histocompatibility complex (chromosome 6p21-22). We further identified previously unreported pleiotropic alleles with opposing effects on atopic dermatitis and psoriasis risk in PRKRA and ANXA6/TNIP1. In contrast, there was no evidence for shared loci with effects operating in the same direction on both diseases. Our results show that atopic dermatitis and psoriasis have distinct genetic mechanisms with opposing effects in shared pathways influencing epidermal differentiation and immune response. The statistical analysis methods developed in the conduct of this study have produced additional insight from previously published data sets. The approach is likely to be applicable to the investigation of the genetic basis of other complex traits with overlapping and distinct clinical features.

  13. Graphic analysis of population structure on genome-wide rheumatoid arthritis data.

    PubMed

    Zhang, Jun; Weng, Chunhua; Niyogi, Partha

    2009-12-15

    Principal-component analysis (PCA) has been used for decades to summarize the human genetic variation across geographic regions and to infer population migration history. Reduction of spurious associations due to population structure is crucial for the success of disease association studies. Recently, PCA has also become a popular method for detecting population structure and correction of population stratification in disease association studies. Inspired by manifold learning, we propose a novel method based on spectral graph theory. Regarding each study subject as a node with suitably defined weights for its edges to close neighbors, one can form a weighted graph. We suggest using the spectrum of the associated graph Laplacian operator, namely, Laplacian eigenfunctions, to infer population structures instead of principal components (PCs). For the whole genome-wide association data for the North American Rheumatoid Arthritis Consortium (NARAC) provided by Genetic Workshop Analysis 16, Laplacian eigenfunctions revealed more meaningful structures of the underlying population than PCA. The proposed method has connection to PCA, and it naturally includes PCA as a special case. Our simple method is computationally fast and is suitable for disease studies at the genome-wide scale.
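
    The construction described above (subjects as nodes, weighted edges, spectrum of the graph Laplacian) can be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation: it uses a fully connected graph with heat-kernel weights and a median-distance bandwidth instead of edges to close neighbours only, and the unnormalised Laplacian L = D − W. The function name, the weighting scheme and the simulated genotypes are all hypothetical.

```python
import numpy as np


def laplacian_eigenfunctions(genotypes, n_components=2):
    """Leading non-trivial eigenvectors of a subject-by-subject graph Laplacian.

    genotypes: (n_subjects, n_snps) array of allele counts (0, 1 or 2).
    Returns (eigenvalues, eigenvectors) with the constant eigenvector dropped.
    """
    x = np.asarray(genotypes, dtype=float)
    n = x.shape[0]

    # Pairwise squared Euclidean distances between subjects.
    sq_norms = (x ** 2).sum(axis=1)
    d2 = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (x @ x.T)
    d2 = np.maximum(d2, 0.0)

    # Heat-kernel edge weights; the bandwidth is the median pairwise distance
    # (an assumption of this sketch, not taken from the abstract).
    bandwidth = np.median(d2[np.triu_indices(n, k=1)])
    if bandwidth <= 0.0:
        raise ValueError("all subjects are identical")
    w = np.exp(-d2 / bandwidth)
    np.fill_diagonal(w, 0.0)

    # Unnormalised graph Laplacian L = D - W, where D holds the node degrees.
    laplacian = np.diag(w.sum(axis=1)) - w
    eigvals, eigvecs = np.linalg.eigh(laplacian)  # eigenvalues in ascending order

    # The first eigenvector is constant (eigenvalue 0); skip it.
    return eigvals[1:n_components + 1], eigvecs[:, 1:n_components + 1]


# Simulated data: two hypothetical populations with different allele frequencies.
rng = np.random.default_rng(0)
pop_a = rng.binomial(2, 0.1, size=(20, 200))
pop_b = rng.binomial(2, 0.9, size=(20, 200))
values, vectors = laplacian_eigenfunctions(np.vstack([pop_a, pop_b]))
```

    In this toy setting the first returned eigenvector takes one sign for the first population and the opposite sign for the second, which is the sense in which the eigenvectors play the role that principal components play in PCA-based stratification analysis.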

  14. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis

    PubMed Central

    2012-01-01

    Background The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups, namely 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB, based on the number and position of MYB repeats. MYB transcription factors are involved in plant development, secondary metabolism, hormone signal transduction, disease resistance and abiotic stress tolerance. A comparative analysis of MYB family genes in rice and Arabidopsis will help reveal the evolution and function of MYB genes in plants. Results A genome-wide analysis identified at least 155 and 197 MYB genes in rice and Arabidopsis, respectively. Gene structure analysis revealed that MYB family genes possess relatively more introns in the middle as compared with the C- and N-terminal regions of the predicted genes. Intronless MYB genes are highly conserved in both rice and Arabidopsis. MYB genes encoding R2R3 repeat MYB proteins retained a conserved gene structure with three exons and two introns, whereas genes encoding R1R2R3 repeat-containing proteins consist of six exons and five introns. The splicing pattern is similar among R1R2R3 MYB genes in Arabidopsis. In contrast, variation in splicing pattern was observed among R1R2R3 MYB members of rice. Consensus motif analysis of the 1 kb upstream region (5′ to the translation initiation codon) of MYB gene ORFs led to the identification of conserved and over-represented cis-motifs in both rice and Arabidopsis. Real-time quantitative RT-PCR analysis showed that several MYB family members are up-regulated by various abiotic stresses in both rice and Arabidopsis. Conclusion A comprehensive genome-wide analysis of chromosomal distribution, tandem repeats and phylogenetic relationship of MYB family genes in rice and Arabidopsis suggested their evolution via duplication. Genome-wide comparative analysis of MYB genes and their expression analysis

  15. Genome-wide analysis of the MYB transcription factor superfamily in soybean

    PubMed Central

    2012-01-01

    Background The MYB superfamily constitutes one of the most abundant groups of transcription factors described in plants. Nevertheless, their functions appear to be highly diverse and remain rather unclear. To date, no genome-wide characterization of this gene family has been conducted in a legume species. Here we report the first genome-wide analysis of the whole MYB superfamily in a legume species, soybean (Glycine max), including the gene structures, phylogeny, chromosome locations, conserved motifs, and expression patterns, as well as a comparative genomic analysis with Arabidopsis. Results A total of 244 R2R3-MYB genes were identified and further classified into 48 subfamilies based on a phylogenetic comparative analysis with their putative orthologs, which showed both gene loss and duplication events. The phylogenetic analysis showed that most characterized MYB genes with similar functions are clustered in the same subfamily; together with the identification of orthologs by synteny analysis, this strongly indicated functional conservation among subgroups of MYB genes. The phylogenetic relationships of each subgroup of MYB genes were well supported by the highly conserved intron/exon structures and motifs outside the MYB domain. Synonymous nucleotide substitution (dN/dS) analysis showed that the soybean MYB DNA-binding domain is under strong negative selection. The chromosome distribution pattern strongly indicated that genome-wide segmental and tandem duplication contribute to the expansion of soybean MYB genes. In addition, we found that ~4% of soybean R2R3-MYB genes had undergone alternative splicing events, producing a variety of transcripts from a single gene, which illustrated the extremely high complexity of transcriptome regulation. Comparative expression profile analysis of R2R3-MYB genes in soybean and Arabidopsis revealed that MYB genes play conserved and various roles in plants, which is indicative of a divergence in function. Conclusions In this

  16. CONAN: copy number variation analysis software for genome-wide association studies

    PubMed Central

    2010-01-01

    Background Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes is constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs. Results CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data. Conclusions CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at. PMID:20546565

  17. Genome-Wide Analysis of Promoters: Clustering by Alignment and Analysis of Regular Patterns

    PubMed Central

    Pettinato, Lucia; Calistri, Elisa; Di Patti, Francesca; Livi, Roberto; Luccioli, Stefano

    2014-01-01

    In this paper we perform a genome-wide analysis of H. sapiens promoters. To this aim, we developed and combined two mathematical methods that allow us to (i) classify promoters into groups characterized by specific global structural features, and (ii) recover, in full generality, any regular sequence in the different classes of promoters. One of the main findings of this analysis is that H. sapiens promoters can be classified into three main groups. Two of them are distinguished by the prevalence of weak or strong nucleotides and are characterized by short compositionally biased sequences, while the most frequent regular sequences in the third group are strongly correlated with transposons. Taking advantage of the generality of these mathematical procedures, we have compared the promoter database of H. sapiens with those of other species. We have found that the above-mentioned features characterize also the evolutionary content appearing in mammalian promoters, at variance with ancestral species in the phylogenetic tree, that exhibit a definitely lower level of differentiation among promoters. PMID:24465517

  18. Analysis of Genome-Wide Association Studies with Multiple Outcomes Using Penalization

    PubMed Central

    Liu, Jin; Huang, Jian; Ma, Shuangge

    2012-01-01

    Genome-wide association studies have been extensively conducted, searching for markers for biologically meaningful outcomes and phenotypes. Penalization methods have been adopted in the analysis of the joint effects of a large number of SNPs (single nucleotide polymorphisms) and marker identification. This study is partly motivated by the analysis of the heterogeneous stock mice dataset, in which multiple correlated phenotypes and a large number of SNPs are available. Existing penalization methods designed to analyze a single response variable cannot accommodate the correlation among multiple response variables. With multiple response variables sharing the same set of markers, joint modeling is first employed to accommodate the correlation. The group Lasso approach is adopted to select markers associated with all the outcome variables. An efficient computational algorithm is developed. A simulation study and analysis of the heterogeneous stock mice dataset show that the proposed method can outperform existing penalization methods. PMID:23272092
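
    The group Lasso selection step described in this abstract can be illustrated with a small sketch. It is not the authors' algorithm: it only shows the row-wise group soft-thresholding operator that group Lasso solvers commonly rely on, under the assumption that coefficients are arranged as a matrix with one row per SNP and one column per outcome. The function name `group_soft_threshold`, the matrix `B`, and the penalty `lam` are hypothetical names chosen for illustration.

```python
import numpy as np

def group_soft_threshold(B, lam):
    """Proximal operator of lam * sum_j ||B[j, :]||_2 (one group per row).

    Each row holds one marker's coefficients across all outcomes, so a
    marker is either kept for every outcome or dropped for every outcome.
    """
    norms = np.linalg.norm(B, axis=1, keepdims=True)
    # Shrink each row's norm by lam; rows with norm <= lam become exactly zero.
    scale = np.maximum(0.0, 1.0 - lam / np.maximum(norms, 1e-12))
    return B * scale

# Toy coefficient matrix (hypothetical values): 2 SNPs x 2 outcomes.
B = np.array([[3.0, 4.0],   # row norm 5.0, survives shrinkage
              [0.3, 0.4]])  # row norm 0.5, removed when lam = 1.0
shrunk = group_soft_threshold(B, lam=1.0)
print(shrunk)
```

    Because the penalty acts on whole rows, a SNP is selected or discarded jointly across outcomes, which is the behaviour the abstract attributes to the group Lasso approach.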

  19. Meta-analysis of New Genome-wide Association Studies of Colorectal Cancer Risk

    PubMed Central

    Peters, Ulrike; Hutter, Carolyn M.; Hsu, Li; Schumacher, Fredrick R.; Conti, David V.; Carlson, Christopher S.; Edlund, Christopher K.; Haile, Robert W.; Gallinger, Steven; Zanke, Brent W.; Lemire, Mathieu; Rangrej, Jagadish; Vijayaraghavan, Raakhee; Chan, Andrew T.; Hazra, Aditi; Hunter, David J.; Ma, Jing; Fuchs, Charles S.; Giovannucci, Edward L.; Kraft, Peter; Liu, Yan; Chen, Lin; Jiao, Shuo; Makar, Karen W.; Taverna, Darin; Gruber, Stephen B.; Rennert, Gad; Moreno, Victor; Ulrich, Cornelia M.; Woods, Michael O.; Green, Roger C.; Parfrey, Patrick S.; Prentice, Ross L.; Kooperberg, Charles; Jackson, Rebecca D.; LaCroix, Andrea Z.; Caan, Bette J.; Hayes, Richard B.; Berndt, Sonja I.; Chanock, Stephen J.; Schoen, Robert E.; Chang-Claude, Jenny; Hoffmeister, Michael; Brenner, Hermann; Frank, Bernd; Bézieau, Stéphane; Küry, Sébastien; Slattery, Martha L.; Hopper, John L.; Jenkins, Mark A.; Le Marchand, Loic; Lindor, Noralane M.; Newcomb, Polly A.; Seminara, Daniela; Hudson, Thomas J.; Duggan, David J.; Potter, John D.; Casey, Graham

    2011-01-01

    Colorectal cancer is the second leading cause of cancer death in developed countries. Genome-wide association studies (GWAS) have successfully identified novel susceptibility loci for colorectal cancer. To follow up on these findings, and try to identify novel colorectal cancer susceptibility loci, we present results for genome-wide association studies (GWAS) of colorectal cancer (2,906 cases, 3,416 controls) that have not previously published main associations. Specifically, we calculated odds ratios (ORs) and 95% confidence intervals (CIs) using log-additive models for each study. In order to improve our power to detect novel colorectal cancer susceptibility loci, we performed a meta-analysis combining the results across studies. We selected the most statistically significant single nucleotide polymorphisms (SNPs) for replication using 10 independent studies (8,161 cases and 9,101 controls). We again used a meta-analysis to summarize results for the replication studies alone, and for a combined analysis of GWAS and replication studies. We measured 10 SNPs previously identified in colorectal cancer susceptibility loci and found eight to be associated with colorectal cancer (p-value range: 0.02 to 1.8 × 10⁻⁸). When we excluded studies that have previously published on these SNPs, five SNPs remained significant at p<0.05 in the combined analysis. No novel susceptibility loci were significant in the replication study after adjustment for multiple testing, and none reached genome-wide significance from a combined analysis of GWAS and replication. We observed marginally significant evidence for a second independent SNP in the BMP2 region at chromosomal location 20p12 (rs4813802; replication p-value 0.03; combined p-value 7.3 × 10⁻⁵). In a region on 5p15.33, which includes the coding regions of the TERT-CLPTM1L genes and has been identified in GWAS to be associated with susceptibility to at least seven other cancers, we observed a marginally significant
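
    The step of combining per-study odds ratios and 95% confidence intervals can be sketched as follows. The abstract does not say which meta-analysis model was used, so this sketch assumes a fixed-effect, inverse-variance model on the log odds ratio scale. The function name `fixed_effect_meta` and the input values are hypothetical and are not taken from the study.

```python
import math

Z_95 = 1.959963984540054  # standard normal quantile for a two-sided 95% CI

def fixed_effect_meta(odds_ratios, ci_lowers, ci_uppers):
    """Pool per-study odds ratios with inverse-variance weights.

    Standard errors are recovered from the 95% CI width on the log scale.
    Returns (pooled OR, lower 95% limit, upper 95% limit).
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for or_, lower, upper in zip(odds_ratios, ci_lowers, ci_uppers):
        beta = math.log(or_)
        se = (math.log(upper) - math.log(lower)) / (2.0 * Z_95)
        weight = 1.0 / se ** 2
        weighted_sum += weight * beta
        total_weight += weight
    beta_pooled = weighted_sum / total_weight
    se_pooled = math.sqrt(1.0 / total_weight)
    return (math.exp(beta_pooled),
            math.exp(beta_pooled - Z_95 * se_pooled),
            math.exp(beta_pooled + Z_95 * se_pooled))

# Two hypothetical studies reporting the same OR and CI.
pooled_or, pooled_lo, pooled_hi = fixed_effect_meta(
    [1.2, 1.2], [1.0, 1.0], [1.44, 1.44])
print(pooled_or, pooled_lo, pooled_hi)
```

    With identical inputs the pooled estimate equals the per-study estimate while the interval narrows, which is the gain in power the abstract refers to.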

  20. Genome-wide identification and expression analysis of MAPK and MAPKK gene family in Malus domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Luo, Xiaocui; Jiang, Zesheng; Shu, Huairui

    2013-12-01

    MAPK signal transduction modules, which are composed of three classes of hierarchically organized protein kinases (MAPKKKs, MAPKKs, and MAPKs), play crucial roles in regulating many biological processes in plants. Although genome-wide analysis of this family has been carried out in some species, little is known about MAPK and MAPKK genes in apple (Malus domestica). In this study, a total of 26 putative apple MAPK genes (MdMPKs) and 9 putative apple MAPKK genes (MdMKKs) have been identified and located within the apple genome. Phylogenetic analysis revealed that MdMAPKs and MdMAPKKs could be divided into 4 subfamilies (groups A, B, C and D), respectively. The predicted MdMAPKs and MdMAPKKs were distributed across 13 out of 17 chromosomes with different densities. In addition, analysis of exon-intron junctions and of intron phase inside the predicted coding region of each candidate gene has revealed high levels of conservation within and between phylogenetic groups. According to the microarray and expressed sequence tag (EST) analysis, the different expression patterns indicate that they may play different roles during fruit development and the rootstock-scion interaction process. Moreover, expression profile analyses of MAPK and MAPKK genes were performed in different tissues (root, stem, leaf, flower and fruit), and all of the selected genes were expressed in at least one of the tissues tested, indicating that the MAPKs and MAPKKs are involved in various aspects of physiological and developmental processes of apple. To our knowledge, this is the first report of a genome-wide analysis of the apple MAPK and MAPKK gene family. This study provides valuable information for understanding the classification and putative functions of the MAPK signal in apple.

  1. Genome-wide association analysis of anti-TNF drug response in rheumatoid arthritis patients

    PubMed Central

    Mirkov, Maša Umiċeviċ; Cui, Jing; Vermeulen, Sita H; Stahl, Eli A.; Toonen, Erik JM; Makkinje, Remco R; Lee, Annette T; Huizinga, Tom WJ; Allaart, Renee; Barton, Anne; Mariette, Xavier; Miceli-Richard, Corinne; Criswell, Lindsey A; Tak, Paul P; de Vries, Niek; Saevarsdottir, Saedis; Padyukov, Leonid; Bridges, S. Louis; van Schaardenburg, Dirk-Jan; Jansen, Tim; Dutmer, Ellen AJ; van de Laar, Mart; Barrera, Pilar; Radstake, Timothy RDJ; van Riel, Piet LCM; Scheffer, Hans; Franke, Barbara; Brunner, Han G; Plenge, Robert M; Gregersen, Peter K; Guchelaar, Henk-Jan; Coenen, Marieke JH

    2014-01-01

    Background Treatment strategies blocking tumor necrosis factor (anti-TNF) have proven very successful in patients with rheumatoid arthritis (RA). However, a significant subset of patients does not respond for unknown reasons. Currently there are no means of identifying these patients prior to treatment. This study was aimed at identifying genetic factors predicting anti-TNF treatment outcome in patients with RA using a genome-wide association approach. Methods We conducted a multi-stage, genome-wide association study with a primary analysis of 2,557,253 single nucleotide polymorphisms (SNPs) in 882 RA patients receiving anti-TNF therapy included through the Dutch Rheumatoid Arthritis Monitoring (DREAM) registry and the database of Apotheekzorg. Linear regression analysis of changes in the Disease Activity Score in 28 joints after 14 weeks of treatment was performed using an additive model. Markers with a p-value < 10⁻³ were selected for replication in 1,821 RA patients from three independent cohorts. Pathway analysis including all SNPs with a p-value < 10⁻³ was performed using Ingenuity. Results Seven hundred seventy-two markers demonstrated evidence of association with treatment outcome in the initial stage. Eight genetic loci showed improved p-value in the overall meta-analysis compared to the first stage, three of which (rs1568885, rs1813443 and rs4411591) showed directional consistency over all four studied cohorts. We were unable to replicate markers previously reported to be associated with anti-TNF outcome. Network analysis indicated strong involvement of biological processes underlying inflammatory response and cell morphology. Conclusion Using a multi-stage strategy, we have identified 8 genetic loci associated with response to anti-TNF treatment. Further studies are required to validate these findings in additional patient collections. PMID:23233654

  2. Genome-wide meta-analysis identifies new susceptibility loci for migraine.

    PubMed

    Anttila, Verneri; Winsvold, Bendik S; Gormley, Padhraig; Kurth, Tobias; Bettella, Francesco; McMahon, George; Kallela, Mikko; Malik, Rainer; de Vries, Boukje; Terwindt, Gisela; Medland, Sarah E; Todt, Unda; McArdle, Wendy L; Quaye, Lydia; Koiranen, Markku; Ikram, M Arfan; Lehtimäki, Terho; Stam, Anine H; Ligthart, Lannie; Wedenoja, Juho; Dunham, Ian; Neale, Benjamin M; Palta, Priit; Hamalainen, Eija; Schürks, Markus; Rose, Lynda M; Buring, Julie E; Ridker, Paul M; Steinberg, Stacy; Stefansson, Hreinn; Jakobsson, Finnbogi; Lawlor, Debbie A; Evans, David M; Ring, Susan M; Färkkilä, Markus; Artto, Ville; Kaunisto, Mari A; Freilinger, Tobias; Schoenen, Jean; Frants, Rune R; Pelzer, Nadine; Weller, Claudia M; Zielman, Ronald; Heath, Andrew C; Madden, Pamela A F; Montgomery, Grant W; Martin, Nicholas G; Borck, Guntram; Göbel, Hartmut; Heinze, Axel; Heinze-Kuhn, Katja; Williams, Frances M K; Hartikainen, Anna-Liisa; Pouta, Anneli; van den Ende, Joyce; Uitterlinden, Andre G; Hofman, Albert; Amin, Najaf; Hottenga, Jouke-Jan; Vink, Jacqueline M; Heikkilä, Kauko; Alexander, Michael; Muller-Myhsok, Bertram; Schreiber, Stefan; Meitinger, Thomas; Wichmann, Heinz Erich; Aromaa, Arpo; Eriksson, Johan G; Traynor, Bryan J; Trabzuni, Daniah; Rossin, Elizabeth; Lage, Kasper; Jacobs, Suzanne B R; Gibbs, J Raphael; Birney, Ewan; Kaprio, Jaakko; Penninx, Brenda W; Boomsma, Dorret I; van Duijn, Cornelia; Raitakari, Olli; Jarvelin, Marjo-Riitta; Zwart, John-Anker; Cherkas, Lynn; Strachan, David P; Kubisch, Christian; Ferrari, Michel D; van den Maagdenberg, Arn M J M; Dichgans, Martin; Wessman, Maija; Smith, George Davey; Stefansson, Kari; Daly, Mark J; Nyholt, Dale R; Chasman, Daniel I; Palotie, Aarno

    2013-08-01

    Migraine is the most common brain disorder, affecting approximately 14% of the adult population, but its molecular mechanisms are poorly understood. We report the results of a meta-analysis across 29 genome-wide association studies, including a total of 23,285 individuals with migraine (cases) and 95,425 population-matched controls. We identified 12 loci associated with migraine susceptibility (P<5×10(-8)). Five loci are new: near AJAP1 at 1p36, near TSPAN2 at 1p13, within FHL5 at 6q16, within C7orf10 at 7p14 and near MMP16 at 8q21. Three of these loci were identified in disease subgroup analyses. Brain tissue expression quantitative trait locus analysis suggests potential functional candidate genes at four loci: APOA1BP, TBC1D7, FUT9, STAT6 and ATP5B.

  3. Genome-Wide Linkage Analysis Identifies Loci for Physical Appearance Traits in Chickens.

    PubMed

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2015-08-06

    Physical appearance traits, such as feather-crested head, comb size and type, beard, wattles size, and feathered feet, are used to distinguish between breeds of chicken and also may be associated with economic traits. In this study, a genome-wide linkage analysis was used to identify candidate regions and genes for physical appearance traits and to potentially provide further knowledge of the molecular mechanisms that underlie these traits. The linkage analysis was conducted with an F2 population derived from Beijing-You chickens and a commercial broiler line. Single-nucleotide polymorphisms were analyzed using the Illumina 60K Chicken SNP Beadchip. The data were used to map quantitative trait loci and genes for six physical appearance traits. A 10-cM/0.51-Mb region (0.0-10.0 cM/0.00-0.51 Mb) with 1% genome-wide significant level on LGE22C19W28_E50C23 linkage group (LGE22) for crest trait was identified, which is likely very closely linked to HOXC8. A QTL with 5% chromosome-wide significant level for comb weight, which partly overlaps with a region identified in a previous study, was identified at 74 cM/25.55 Mb on chicken (Gallus gallus; GG) chromosome 3 (i.e., GGA3). For beard and wattles traits, an identical region 11 cM/2.23 Mb (0.0-11.0 cM/0.00-2.23 Mb) including WNT3 and GH genes on GGA27 was identified. Two QTL with 1% genome-wide significant level for feathered feet trait, one 9-cM/2.80-Mb (48.0-57.0 cM/13.40-16.20 Mb) region on GGA13, and another 12-cM/1.45-Mb (41.0-53.0 cM/11.37-12.82 Mb) region on GGA15 were identified. These candidate regions and genes provide important genetic information for the physical appearance traits in chicken.

  4. Genome-Wide Linkage Analysis Identifies Loci for Physical Appearance Traits in Chickens

    PubMed Central

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2015-01-01

    Physical appearance traits, such as feather-crested head, comb size and type, beard, wattles size, and feathered feet, are used to distinguish between breeds of chicken and also may be associated with economic traits. In this study, a genome-wide linkage analysis was used to identify candidate regions and genes for physical appearance traits and to potentially provide further knowledge of the molecular mechanisms that underlie these traits. The linkage analysis was conducted with an F2 population derived from Beijing-You chickens and a commercial broiler line. Single-nucleotide polymorphisms were analyzed using the Illumina 60K Chicken SNP Beadchip. The data were used to map quantitative trait loci and genes for six physical appearance traits. A 10-cM/0.51-Mb region (0.0−10.0 cM/0.00−0.51 Mb) with 1% genome-wide significant level on LGE22C19W28_E50C23 linkage group (LGE22) for crest trait was identified, which is likely very closely linked to HOXC8. A QTL with 5% chromosome-wide significant level for comb weight, which partly overlaps with a region identified in a previous study, was identified at 74 cM/25.55 Mb on chicken (Gallus gallus; GG) chromosome 3 (i.e., GGA3). For beard and wattles traits, an identical region 11 cM/2.23 Mb (0.0−11.0 cM/0.00−2.23 Mb) including WNT3 and GH genes on GGA27 was identified. Two QTL with 1% genome-wide significant level for feathered feet trait, one 9-cM/2.80-Mb (48.0−57.0 cM/13.40−16.20 Mb) region on GGA13, and another 12-cM/1.45-Mb (41.0−53.0 cM/11.37−12.82 Mb) region on GGA15 were identified. These candidate regions and genes provide important genetic information for the physical appearance traits in chicken. PMID:26248982

  5. Genome-wide interaction-based association analysis identified multiple new susceptibility Loci for common diseases.

    PubMed

    Liu, Yang; Xu, Haiming; Chen, Suchao; Chen, Xianfeng; Zhang, Zhenguo; Zhu, Zhihong; Qin, Xueying; Hu, Landian; Zhu, Jun; Zhao, Guo-Ping; Kong, Xiangyin

    2011-03-01

    Genome-wide interaction-based association (GWIBA) analysis has the potential to identify novel susceptibility loci. These interaction effects could be missed with the prevailing approaches in genome-wide association studies (GWAS). However, no convincing loci have been discovered exclusively from GWIBA methods, and the intensive computation involved is a major barrier for application. Here, we developed a fast, multi-thread/parallel program named "pair-wise interaction-based association mapping" (PIAM) for exhaustive two-locus searches. With this program, we performed a complete GWIBA analysis on seven diseases with stringent control for false positives, and we validated the results for three of these diseases. We identified one pair-wise interaction between a previously identified locus, C1orf106, and one new locus, TEC, that was specific for Crohn's disease, with a Bonferroni corrected P < 0.05 (P = 0.039). This interaction was replicated with a pair of proxy linked loci (P = 0.013) on an independent dataset. Five other interactions had corrected P < 0.5. We identified the allelic effect of a locus close to SLC7A13 for coronary artery disease. This was replicated with a linked locus on an independent dataset (P = 1.09 × 10⁻⁷). Through a local validation analysis that evaluated association signals, rather than locus-based associations, we found that several other regions showed association/interaction signals with nominal P < 0.05. In conclusion, this study demonstrated that the GWIBA approach was successful for identifying novel loci, and the results provide new insights into the genetic architecture of common diseases. In addition, our PIAM program was capable of handling very large GWAS datasets that are likely to be produced in the future.
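
    The computational burden of an exhaustive two-locus search, and the stringency of a Bonferroni correction over all pairs, can be made concrete with a short calculation. This is only an illustration and not part of PIAM. The marker count of 500,000 is a hypothetical value, since the abstract does not state how many SNPs were tested, and the function name `pairwise_bonferroni` is likewise an assumption.

```python
from math import comb

def pairwise_bonferroni(n_snps, alpha=0.05):
    """Return (number of unordered SNP pairs, per-test Bonferroni threshold)."""
    n_pairs = comb(n_snps, 2)  # n * (n - 1) / 2 two-locus tests
    return n_pairs, alpha / n_pairs

# Hypothetical panel size; the abstract does not report the marker count.
n_pairs, threshold = pairwise_bonferroni(500_000)
print(f"{n_pairs:,} pairs, per-test threshold {threshold:.2e}")
```

    Under these assumptions the number of tests grows quadratically with the number of markers, which is why the abstract describes intensive computation as a major barrier and why a multi-threaded program was needed.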

  6. Genome-wide linkage analysis and association study identifies loci for polydactyly in chickens.

    PubMed

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2014-04-21

    Polydactyly occurs in some chicken breeds, but the molecular mechanism remains incompletely understood. Combined genome-wide linkage analysis and association study (GWAS) for chicken polydactyly helps identify loci or candidate genes for the trait and potentially provides further mechanistic understanding of this phenotype in chickens and perhaps other species. The linkage analysis and GWAS for polydactyly were conducted using an F2 population derived from Beijing-You chickens and commercial broilers. The results identified two QTLs through linkage analysis and seven single-nucleotide polymorphisms (SNPs) through GWAS, associated with the polydactyly trait. One QTL located at 35 cM on GGA2 was significant at the 1% genome-wide level and another QTL at the 1% chromosome-wide significance level was detected at 39 cM on GGA19. A total of seven SNPs, four of 5% genome-wide significance (P < 2.98 × 10(-6)) and three of suggestive significance (P < 5.96 × 10(-5)), were identified, including two SNPs (GGaluGA132178 and Gga_rs14135036) in the QTL on GGA2. Of the identified SNPs, the eight nearest genes were sonic hedgehog (SHH), limb region 1 homolog (mouse) (LMBR1), dipeptidyl-peptidase 6, transcript variant 3 (DPP6), thyroid-stimulating hormone, beta (TSHB), sal-like 4 (Drosophila) (SALL4), par-6 partitioning defective 6 homolog beta (Caenorhabditis elegans) (PARD6B), coenzyme Q5 (COQ5), and tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, eta polypeptide (YWHAH). The GWAS supports earlier reports of the importance of SHH and LMBR1 as regulating genes for polydactyly in chickens and other species, and identified others, most of which have not previously been associated with limb development. The genes and associated SNPs revealed here provide detailed information for further exploring the molecular and developmental mechanisms underlying polydactyly.
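
    The two significance thresholds quoted in this abstract are consistent with a Bonferroni-style correction, although the abstract does not say how they were derived. The sketch below assumes that the genome-wide threshold equals 0.05/N and the suggestive threshold equals 1/N for some number N of effectively independent tests, and checks that both thresholds imply the same N. The variable names are hypothetical.

```python
# Thresholds as quoted in the abstract.
genome_wide_threshold = 2.98e-6   # 5% genome-wide significance
suggestive_threshold = 5.96e-5    # suggestive significance

# Assumption (not stated in the abstract): Bonferroni-style thresholds,
# 0.05 / N for genome-wide and 1 / N for suggestive significance.
n_from_genome_wide = 0.05 / genome_wide_threshold
n_from_suggestive = 1.0 / suggestive_threshold

print(round(n_from_genome_wide), round(n_from_suggestive))
```

    Under that assumption both thresholds point to the same number of independent tests, on the order of 17,000, which is smaller than the nominal size of a 60K SNP chip and would be expected after quality filtering and accounting for linkage between markers.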

  7. MPE-seq, a new method for the genome-wide analysis of chromatin structure

    PubMed Central

    Ishii, Haruhiko; Kadonaga, James T.; Ren, Bing

    2015-01-01

    The analysis of chromatin structure is essential for the understanding of transcriptional regulation in eukaryotes. Here we describe methidiumpropyl-EDTA sequencing (MPE-seq), a method for the genome-wide characterization of chromatin that involves the digestion of nuclei with MPE-Fe(II) followed by massively parallel sequencing. Like micrococcal nuclease (MNase), MPE-Fe(II) preferentially cleaves the linker DNA between nucleosomes. However, there are differences in the cleavage of nuclear chromatin by MPE-Fe(II) relative to MNase. Most notably, immediately upstream of the transcription start site of active promoters, we frequently observed nucleosome-sized (141–190 bp) and subnucleosome-sized (such as 101–140 bp) peaks of digested chromatin fragments with MPE-seq but not with MNase-seq. These peaks also correlate with the presence of core histones and could thus be due, at least in part, to noncanonical chromatin structures such as labile nucleosome-like particles that have been observed in other contexts. The subnucleosome-sized MPE-seq peaks exhibit a particularly distinct association with active promoters. In addition, unlike MNase, MPE-Fe(II) cleaves nuclear DNA with little sequence bias. In this regard, we found that DNA sequences at RNA splice sites are hypersensitive to digestion by MNase but not by MPE-Fe(II). This phenomenon may have affected the analysis of nucleosome occupancy over exons. These findings collectively indicate that MPE-seq provides a unique and straightforward means for the genome-wide analysis of chromatin structure with minimal DNA sequence bias. In particular, the combined use of MPE-seq and MNase-seq enables the identification of noncanonical chromatin structures that are likely to be important for the regulation of gene expression. PMID:26080409

  8. Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis.

    PubMed

    Skwark, Marcin J; Croucher, Nicholas J; Puranen, Santeri; Chewapreecha, Claire; Pesonen, Maiju; Xu, Ying Ying; Turner, Paul; Harris, Simon R; Beres, Stephen B; Musser, James M; Parkhill, Julian; Bentley, Stephen D; Aurell, Erik; Corander, Jukka

    2017-02-01

    Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly

  9. A genome-wide 20 K citrus microarray for gene expression analysis

    PubMed Central

    Martinez-Godoy, M Angeles; Mauri, Nuria; Juarez, Jose; Marques, M Carmen; Santiago, Julia; Forment, Javier; Gadea, Jose

    2008-01-01

    Background Understanding of genetic elements that contribute to key aspects of citrus biology will impact future improvements in this economically important crop. Global gene expression analysis demands microarray platforms with high genome coverage. In recent years, genome-wide EST collections have been generated in citrus, opening the possibility to create new tools for functional genomics in this crop plant. Results We have designed and constructed a publicly available genome-wide cDNA microarray that includes 21,081 putative unigenes of citrus. As a functional companion to the microarray, a web-browsable database [1] was created and populated with information about the unigenes represented in the microarray, including cDNA libraries, isolated clones, raw and processed nucleotide and protein sequences, and results of all the structural and functional annotation of the unigenes, like general description, BLAST hits, putative Arabidopsis orthologs, microsatellites, putative SNPs, GO classification and PFAM domains. We have performed a Gene Ontology comparison with the full set of Arabidopsis proteins to estimate the genome coverage of the microarray. We have also performed microarray hybridizations to check its usability. Conclusion This new cDNA microarray replaces the first 7K microarray generated two years ago and allows gene expression analysis at a more global scale. We have followed a rational design to minimize cross-hybridization while maintaining its utility for different citrus species. Furthermore, we also provide access to a website with full structural and functional annotation of the unigenes represented in the microarray, along with the ability to use this site to directly perform gene expression analysis using standard tools at different publicly available servers. Furthermore, we show how this microarray offers a good representation of the citrus genome and present the usefulness of this genomic tool for global studies in citrus by using it to

  10. Interacting networks of resistance, virulence and core machinery genes identified by genome-wide epistasis analysis

    PubMed Central

    Pesonen, Maiju; Musser, James M.; Bentley, Stephen D.; Aurell, Erik; Corander, Jukka

    2017-01-01

    Recent advances in the scale and diversity of population genomic datasets for bacteria now provide the potential for genome-wide patterns of co-evolution to be studied at the resolution of individual bases. Here we describe a new statistical method, genomeDCA, which uses recent advances in computational structural biology to identify the polymorphic loci under the strongest co-evolutionary pressures. We apply genomeDCA to two large population data sets representing the major human pathogens Streptococcus pneumoniae (pneumococcus) and Streptococcus pyogenes (group A Streptococcus). For pneumococcus we identified 5,199 putative epistatic interactions between 1,936 sites. Over three-quarters of the links were between sites within the pbp2x, pbp1a and pbp2b genes, the sequences of which are critical in determining non-susceptibility to beta-lactam antibiotics. A network-based analysis found these genes were also coupled to that encoding dihydrofolate reductase, changes to which underlie trimethoprim resistance. Distinct from these antibiotic resistance genes, a large network component of 384 protein coding sequences encompassed many genes critical in basic cellular functions, while another distinct component included genes associated with virulence. The group A Streptococcus (GAS) data set population represents a clonal population with relatively little genetic variation and a high level of linkage disequilibrium across the genome. Despite this, we were able to pinpoint two RNA pseudouridine synthases, which were each strongly linked to a separate set of loci across the chromosome, representing biologically plausible targets of co-selection. The population genomic analysis method applied here identifies statistically significantly co-evolving locus pairs, potentially arising from fitness selection interdependence reflecting underlying protein-protein interactions, or genes whose product activities contribute to the same phenotype. This discovery approach greatly

  11. Pharmacogenetic meta-analysis of genome-wide association studies of LDL cholesterol response to statins

    PubMed Central

    Postmus, Iris; Trompet, Stella; Deshmukh, Harshal A.; Barnes, Michael R.; Li, Xiaohui; Warren, Helen R.; Chasman, Daniel I.; Zhou, Kaixin; Arsenault, Benoit J.; Donnelly, Louise A.; Wiggins, Kerri L.; Avery, Christy L.; Griffin, Paula; Feng, QiPing; Taylor, Kent D.; Li, Guo; Evans, Daniel S.; Smith, Albert V.; de Keyser, Catherine E.; Johnson, Andrew D.; de Craen, Anton J. M.; Stott, David J.; Buckley, Brendan M.; Ford, Ian; Westendorp, Rudi G. J.; Eline Slagboom, P.; Sattar, Naveed; Munroe, Patricia B.; Sever, Peter; Poulter, Neil; Stanton, Alice; Shields, Denis C.; O’Brien, Eoin; Shaw-Hawkins, Sue; Ida Chen, Y.-D.; Nickerson, Deborah A.; Smith, Joshua D.; Pierre Dubé, Marie; Matthijs Boekholdt, S.; Kees Hovingh, G.; Kastelein, John J. P.; McKeigue, Paul M.; Betteridge, John; Neil, Andrew; Durrington, Paul N.; Doney, Alex; Carr, Fiona; Morris, Andrew; McCarthy, Mark I.; Groop, Leif; Ahlqvist, Emma; Bis, Joshua C.; Rice, Kenneth; Smith, Nicholas L.; Lumley, Thomas; Whitsel, Eric A.; Stürmer, Til; Boerwinkle, Eric; Ngwa, Julius S.; O’Donnell, Christopher J.; Vasan, Ramachandran S.; Wei, Wei-Qi; Wilke, Russell A.; Liu, Ching-Ti; Sun, Fangui; Guo, Xiuqing; Heckbert, Susan R; Post, Wendy; Sotoodehnia, Nona; Arnold, Alice M.; Stafford, Jeanette M.; Ding, Jingzhong; Herrington, David M.; Kritchevsky, Stephen B.; Eiriksdottir, Gudny; Launer, Leonore J.; Harris, Tamara B.; Chu, Audrey Y.; Giulianini, Franco; MacFadyen, Jean G.; Barratt, Bryan J.; Nyberg, Fredrik; Stricker, Bruno H.; Uitterlinden, André G.; Hofman, Albert; Rivadeneira, Fernando; Emilsson, Valur; Franco, Oscar H.; Ridker, Paul M.; Gudnason, Vilmundur; Liu, Yongmei; Denny, Joshua C.; Ballantyne, Christie M.; Rotter, Jerome I.; Adrienne Cupples, L.; Psaty, Bruce M.; Palmer, Colin N. A.; Tardif, Jean-Claude; Colhoun, Helen M.; Hitman, Graham; Krauss, Ronald M.; Wouter Jukema, J; Caulfield, Mark J.; Donnelly, Peter; Barroso, Ines; Blackwell, Jenefer M.; Bramon, Elvira; Brown, Matthew A.; Casas, Juan P.; Corvin, Aiden; Deloukas, Panos; Duncanson, Audrey; Jankowski, Janusz; Markus, Hugh S.; Mathew, Christopher G.; Palmer, Colin N. A.; Plomin, Robert; Rautanen, Anna; Sawcer, Stephen J.; Trembath, Richard C.; Viswanathan, Ananth C.; Wood, Nicholas W.; Spencer, Chris C. A.; Band, Gavin; Bellenguez, Céline; Freeman, Colin; Hellenthal, Garrett; Giannoulatou, Eleni; Pirinen, Matti; Pearson, Richard; Strange, Amy; Su, Zhan; Vukcevic, Damjan; Donnelly, Peter; Langford, Cordelia; Hunt, Sarah E.; Edkins, Sarah; Gwilliam, Rhian; Blackburn, Hannah; Bumpstead, Suzannah J.; Dronov, Serge; Gillman, Matthew; Gray, Emma; Hammond, Naomi; Jayakumar, Alagurevathi; McCann, Owen T.; Liddle, Jennifer; Potter, Simon C.; Ravindrarajah, Radhi; Ricketts, Michelle; Waller, Matthew; Weston, Paul; Widaa, Sara; Whittaker, Pamela; Barroso, Ines; Deloukas, Panos; Mathew, Christopher G.; Blackwell, Jenefer M.; Brown, Matthew A.; Corvin, Aiden; McCarthy, Mark I.; Spencer, Chris C. A.

    2014-01-01

    Statins effectively lower LDL cholesterol levels in large studies and the observed interindividual response variability may be partially explained by genetic variation. Here we perform a pharmacogenetic meta-analysis of genome-wide association studies (GWAS) in studies addressing the LDL cholesterol response to statins, including up to 18,596 statin-treated subjects. We validate the most promising signals in a further 22,318 statin recipients and identify two loci, SORT1/CELSR2/PSRC1 and SLCO1B1, not previously identified in GWAS. Moreover, we confirm the previously described associations with APOE and LPA. Our findings advance the understanding of the pharmacogenetic architecture of statin response. PMID:25350695

  12. A genome-wide resource for the analysis of protein localisation in Drosophila.

    PubMed

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; K J, Vinay Vikas; Krishnan, R T; Krishnamoorthy, Aishwarya; Ferreira, Irene R S; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-02-20

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts.

  13. Genome-Wide Analysis Reveals Novel Regulators of Growth in Drosophila melanogaster

    PubMed Central

    Vonesch, Sibylle Chantal; Lamparter, David; Mackay, Trudy F. C.; Bergmann, Sven; Hafen, Ernst

    2016-01-01

    Organismal size depends on the interplay between genetic and environmental factors. Genome-wide association (GWA) analyses in humans have implied many genes in the control of height but suffer from the inability to control the environment. Genetic analyses in Drosophila have identified conserved signaling pathways controlling size; however, how these pathways control phenotypic diversity is unclear. We performed GWA of size traits using the Drosophila Genetic Reference Panel of inbred, sequenced lines. We find that the top associated variants differ between traits and sexes; do not map to canonical growth pathway genes, but can be linked to these by epistasis analysis; and are enriched for genes and putative enhancers. Performing GWA on well-studied developmental traits under controlled conditions expands our understanding of developmental processes underlying phenotypic diversity. PMID:26751788

  14. Moving toward System Genetics through Multiple Trait Analysis in Genome-Wide Association Studies

    PubMed Central

    Shriner, Daniel

    2011-01-01

    Association studies are a staple of genotype–phenotype mapping studies, whether they are based on single markers, haplotypes, candidate genes, genome-wide genotypes, or whole genome sequences. Although genetic epidemiological studies typically contain data collected on multiple traits which themselves are often correlated, most analyses have been performed on single traits. Here, I review several methods that have been developed to perform multiple trait analysis. These methods range from traditional multivariate models for systems of equations to recently developed graphical approaches based on network theory. The application of network theory to genetics is termed systems genetics and has the potential to address long-standing questions in genetics about complex processes such as coordinate regulation, homeostasis, and pleiotropy. PMID:22303408

  15. Genome-wide Analysis of RNA Polymerase II Termination at Protein-Coding Genes.

    PubMed

    Baejen, Carlo; Andreani, Jessica; Torkler, Phillipp; Battaglia, Sofia; Schwalb, Bjoern; Lidschreiber, Michael; Maier, Kerstin C; Boltendahl, Andrea; Rus, Petra; Esslinger, Stephanie; Söding, Johannes; Cramer, Patrick

    2017-03-06

    At the end of protein-coding genes, RNA polymerase (Pol) II undergoes a concerted transition that involves 3'-processing of the pre-mRNA and transcription termination. Here, we present a genome-wide analysis of the 3'-transition in budding yeast. We find that the 3'-transition globally requires the Pol II elongation factor Spt5 and factors involved in the recognition of the polyadenylation (pA) site and in endonucleolytic RNA cleavage. Pol II release from DNA occurs in a narrow termination window downstream of the pA site and requires the "torpedo" exonuclease Rat1 (XRN2 in human). The Rat1-interacting factor Rai1 contributes to RNA degradation downstream of the pA site. Defects in the 3'-transition can result in increased transcription at downstream genes.

  16. Genome Wide Analysis of Chromatin Regulation by Cocaine Reveals a Novel Role for Sirtuins

    PubMed Central

    Renthal, William; Kumar, Arvind; Xiao, Guanghua; Wilkinson, Matthew; Covington, Herbert E.; Maze, Ian; Sikder, Devanjan; Robison, Alfred J.; LaPlant, Quincey; Dietz, David M.; Russo, Scott J.; Vialou, Vincent; Chakravarty, Sumana; Kodadek, Thomas J.; Stack, Ashley; Kabbaj, Mohammed; Nestler, Eric J.

    2009-01-01

    Changes in gene expression contribute to the long-lasting regulation of the brain’s reward circuitry seen in drug addiction; however, the specific genes regulated and the transcriptional mechanisms underlying such regulation remain poorly understood. Here, we used chromatin immunoprecipitation coupled with promoter microarray analysis to characterize genome-wide chromatin changes in the mouse nucleus accumbens, a crucial brain reward region, after repeated cocaine administration. Our findings reveal several interesting principles of gene regulation by cocaine and of the role of ΔFosB and CREB, two prominent cocaine-induced transcription factors, in this brain region. The findings also provide novel and comprehensive insight into the molecular pathways regulated by cocaine, including a new role for sirtuins (Sirt1 and Sirt2), which are induced in the nucleus accumbens by cocaine and, in turn, dramatically enhance the behavioral effects of the drug. PMID:19447090

  17. Identification of Genetic Susceptibility Loci for Colorectal Tumors in a Genome-wide Meta-analysis

    PubMed Central

    Peters, Ulrike; Jiao, Shuo; Schumacher, Fredrick R.; Hutter, Carolyn M.; Aragaki, Aaron K.; Baron, John A.; Berndt, Sonja I.; Bézieau, Stéphane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Campbell, Peter T.; Carlson, Christopher S.; Casey, Graham; Chan, Andrew T.; Chang-Claude, Jenny; Chanock, Stephen J.; Chen, Lin S.; Coetzee, Gerhard A.; Coetzee, Simon G.; Conti, David V.; Curtis, Keith R.; Duggan, David; Edwards, Todd; Fuchs, Charles S.; Gallinger, Steven; Giovannucci, Edward L.; Gogarten, Stephanie M.; Gruber, Stephen B.; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Henderson, Brian E.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Hunter, David J.; Jackson, Rebecca D.; Jee, Sun Ha; Jenkins, Mark A.; Jia, Wei-Hua; Kolonel, Laurence N.; Kooperberg, Charles; Küry, Sébastien; Lacroix, Andrea Z.; Laurie, Cathy C.; Laurie, Cecelia A.; Le Marchand, Loic; Lemire, Mathieu; Levine, David; Lindor, Noralane M.; Liu, Yan; Ma, Jing; Makar, Karen W.; Matsuo, Keitaro; Newcomb, Polly A.; Potter, John D.; Prentice, Ross L.; Qu, Conghui; Rohan, Thomas; Rosse, Stephanie A.; Schoen, Robert E.; Seminara, Daniela; Shrubsole, Martha; Shu, Xiao-Ou; Slattery, Martha L.; Taverna, Darin; Thibodeau, Stephen N.; Ulrich, Cornelia M.; White, Emily; Xiang, Yongbing; Zanke, Brent W.; Zeng, Yi-Xin; Zhang, Ben; Zheng, Wei; Hsu, Li

    2013-01-01

    BACKGROUND & AIMS: Heritable factors contribute to the development of colorectal cancer. Identifying the genetic loci associated with colorectal tumor formation could elucidate the mechanisms of pathogenesis. METHODS: We conducted a genome-wide association study that included 14 studies, 12,696 cases of colorectal tumors (11,870 cancer, 826 adenoma), and 15,113 controls of European descent. The 10 most statistically significant, previously unreported findings were followed up in 6 studies; these included 3056 colorectal tumor cases (2098 cancer, 958 adenoma) and 6658 controls of European and Asian descent. RESULTS: Based on the combined analysis, we identified a locus that reached the conventional genome-wide significance level at less than 5.0 × 10^−8: an intergenic region on chromosome 2q32.3, close to nucleic acid binding protein 1 (most significant single nucleotide polymorphism: rs11903757; odds ratio [OR], 1.15 per risk allele; P = 3.7 × 10^−8). We also found evidence for 3 additional loci with P values less than 5.0 × 10^−7: a locus within the laminin gamma 1 gene on chromosome 1q25.3 (rs10911251; OR, 1.10 per risk allele; P = 9.5 × 10^−8), a locus within the cyclin D2 gene on chromosome 12p13.32 (rs3217810; OR, 0.84 per risk allele; P = 5.9 × 10^−8), and a locus in the T-box 3 gene on chromosome 12q24.21 (rs59336; OR, 0.91 per risk allele; P = 3.7 × 10^−7). CONCLUSIONS: In a large genome-wide association study, we associated polymorphisms close to nucleic acid binding protein 1 (which encodes a DNA-binding protein involved in DNA repair) with colorectal tumor risk. We also provided evidence for an association between colorectal tumor risk and polymorphisms in laminin gamma 1 (this is the second gene in the laminin family to be associated with colorectal cancers), cyclin D2 (which encodes cyclin D2), and T-box 3 (which encodes a T-box transcription factor and is a target of Wnt signaling to β-catenin). The roles of these genes and their products

  18. Genome-wide linkage and association analysis of cardiometabolic phenotypes in Hispanic Americans.

    PubMed

    Hellwege, Jacklyn N; Palmer, Nicholette D; Dimitrov, Latchezar; Keaton, Jacob M; Tabb, Keri L; Sajuthi, Satria; Taylor, Kent D; Ng, Maggie C Y; Speliotes, Elizabeth K; Hawkins, Gregory A; Long, Jirong; Ida Chen, Yii-Der; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W

    2017-02-01

    Linkage studies of complex genetic diseases have been largely replaced by genome-wide association studies, due in part to limited success in complex trait discovery. However, recent interest in rare and low-frequency variants motivates re-examination of family-based methods. In this study, we investigated the performance of two-point linkage analysis for over 1.6 million single-nucleotide polymorphisms (SNPs) combined with single variant association analysis to identify high impact variants, which are both strongly linked and associated with cardiometabolic traits in up to 1414 Hispanics from the Insulin Resistance Atherosclerosis Family Study (IRASFS). Evaluation of all 50 phenotypes yielded 83 557 000 LOD (logarithm of the odds) scores, with 9214 LOD scores ⩾3.0, 845 ⩾4.0 and 89 ⩾5.0, with a maximal LOD score of 6.49 (rs12956744 in the LAMA1 gene for tumor necrosis factor-α (TNFα) receptor 2). Twenty-seven variants were associated with P<0.005 as well as having an LOD score >4, including variants in the NFIB gene under a linkage peak with TNFα receptor 2 levels on chromosome 9. Linkage regions of interest included a broad peak (31 Mb) on chromosome 1q with acute insulin response (max LOD=5.37). This region was previously documented with type 2 diabetes in family-based studies, providing support for the validity of these results. Overall, we have demonstrated the utility of two-point linkage and association in comprehensive genome-wide array-based SNP genotypes.

  19. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index.

    PubMed

    Felix, Janine F; Bradfield, Jonathan P; Monnereau, Claire; van der Valk, Ralf J P; Stergiakouli, Evie; Chesi, Alessandra; Gaillard, Romy; Feenstra, Bjarke; Thiering, Elisabeth; Kreiner-Møller, Eskil; Mahajan, Anubha; Pitkänen, Niina; Joro, Raimo; Cavadino, Alana; Huikari, Ville; Franks, Steve; Groen-Blokhuis, Maria M; Cousminer, Diana L; Marsh, Julie A; Lehtimäki, Terho; Curtin, John A; Vioque, Jesus; Ahluwalia, Tarunveer S; Myhre, Ronny; Price, Thomas S; Vilor-Tejedor, Natalia; Yengo, Loïc; Grarup, Niels; Ntalla, Ioanna; Ang, Wei; Atalay, Mustafa; Bisgaard, Hans; Blakemore, Alexandra I; Bonnefond, Amelie; Carstensen, Lisbeth; Eriksson, Johan; Flexeder, Claudia; Franke, Lude; Geller, Frank; Geserick, Mandy; Hartikainen, Anna-Liisa; Haworth, Claire M A; Hirschhorn, Joel N; Hofman, Albert; Holm, Jens-Christian; Horikoshi, Momoko; Hottenga, Jouke Jan; Huang, Jinyan; Kadarmideen, Haja N; Kähönen, Mika; Kiess, Wieland; Lakka, Hanna-Maaria; Lakka, Timo A; Lewin, Alexandra M; Liang, Liming; Lyytikäinen, Leo-Pekka; Ma, Baoshan; Magnus, Per; McCormack, Shana E; McMahon, George; Mentch, Frank D; Middeldorp, Christel M; Murray, Clare S; Pahkala, Katja; Pers, Tune H; Pfäffle, Roland; Postma, Dirkje S; Power, Christine; Simpson, Angela; Sengpiel, Verena; Tiesler, Carla M T; Torrent, Maties; Uitterlinden, André G; van Meurs, Joyce B; Vinding, Rebecca; Waage, Johannes; Wardle, Jane; Zeggini, Eleftheria; Zemel, Babette S; Dedoussis, George V; Pedersen, Oluf; Froguel, Philippe; Sunyer, Jordi; Plomin, Robert; Jacobsson, Bo; Hansen, Torben; Gonzalez, Juan R; Custovic, Adnan; Raitakari, Olli T; Pennell, Craig E; Widén, Elisabeth; Boomsma, Dorret I; Koppelman, Gerard H; Sebert, Sylvain; Järvelin, Marjo-Riitta; Hyppönen, Elina; McCarthy, Mark I; Lindi, Virpi; Harri, Niinikoski; Körner, Antje; Bønnelykke, Klaus; Heinrich, Joachim; Melbye, Mads; Rivadeneira, Fernando; Hakonarson, Hakon; Ring, Susan M; Smith, George Davey; Sørensen, Thorkild I A; Timpson, Nicholas J; Grant, Struan F A; 
Jaddoe, Vincent W V

    2016-01-15

    A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We included 35 668 children from 20 studies in the discovery phase and 11 873 children from 13 studies in the replication phase. In total, 15 loci reached genome-wide significance (P-value < 5 × 10^−8) in the joint discovery and replication analysis, of which 12 are previously identified loci in or close to ADCY3, GNPDA2, TMEM18, SEC16B, FAIM2, FTO, TFAP2B, TNNI3K, MC4R, GPR61, LMX1B and OLFM4 associated with adult body mass index or childhood obesity. We identified three novel loci: rs13253111 near ELP3, rs8092503 near RAB27B and rs13387838 near ADAM23. Per additional risk allele, body mass index increased 0.04 Standard Deviation Score (SDS) [Standard Error (SE) 0.007], 0.05 SDS (SE 0.008) and 0.14 SDS (SE 0.025), for rs13253111, rs8092503 and rs13387838, respectively. A genetic risk score combining all 15 SNPs showed that each additional average risk allele was associated with a 0.073 SDS (SE 0.011, P-value = 3.12 × 10^−10) increase in childhood body mass index in a population of 1955 children. This risk score explained 2% of the variance in childhood body mass index. This study highlights the shared genetic background between childhood and adult body mass index and adds three novel loci. These loci likely represent age-related differences in strength of the associations with body mass index.

  20. Genome-wide association analysis identifies three new susceptibility loci for childhood body mass index

    PubMed Central

    Felix, Janine F.; Bradfield, Jonathan P.; Monnereau, Claire; van der Valk, Ralf J.P.; Stergiakouli, Evie; Chesi, Alessandra; Gaillard, Romy; Feenstra, Bjarke; Thiering, Elisabeth; Kreiner-Møller, Eskil; Mahajan, Anubha; Pitkänen, Niina; Joro, Raimo; Cavadino, Alana; Huikari, Ville; Franks, Steve; Groen-Blokhuis, Maria M.; Cousminer, Diana L.; Marsh, Julie A.; Lehtimäki, Terho; Curtin, John A.; Vioque, Jesus; Ahluwalia, Tarunveer S.; Myhre, Ronny; Price, Thomas S.; Vilor-Tejedor, Natalia; Yengo, Loïc; Grarup, Niels; Ntalla, Ioanna; Ang, Wei; Atalay, Mustafa; Bisgaard, Hans; Blakemore, Alexandra I.; Bonnefond, Amelie; Carstensen, Lisbeth; Eriksson, Johan; Flexeder, Claudia; Franke, Lude; Geller, Frank; Geserick, Mandy; Hartikainen, Anna-Liisa; Haworth, Claire M.A.; Hirschhorn, Joel N.; Hofman, Albert; Holm, Jens-Christian; Horikoshi, Momoko; Hottenga, Jouke Jan; Huang, Jinyan; Kadarmideen, Haja N.; Kähönen, Mika; Kiess, Wieland; Lakka, Hanna-Maaria; Lakka, Timo A.; Lewin, Alexandra M.; Liang, Liming; Lyytikäinen, Leo-Pekka; Ma, Baoshan; Magnus, Per; McCormack, Shana E.; McMahon, George; Mentch, Frank D.; Middeldorp, Christel M.; Murray, Clare S.; Pahkala, Katja; Pers, Tune H.; Pfäffle, Roland; Postma, Dirkje S.; Power, Christine; Simpson, Angela; Sengpiel, Verena; Tiesler, Carla M. T.; Torrent, Maties; Uitterlinden, André G.; van Meurs, Joyce B.; Vinding, Rebecca; Waage, Johannes; Wardle, Jane; Zeggini, Eleftheria; Zemel, Babette S.; Dedoussis, George V.; Pedersen, Oluf; Froguel, Philippe; Sunyer, Jordi; Plomin, Robert; Jacobsson, Bo; Hansen, Torben; Gonzalez, Juan R.; Custovic, Adnan; Raitakari, Olli T.; Pennell, Craig E.; Widén, Elisabeth; Boomsma, Dorret I.; Koppelman, Gerard H.; Sebert, Sylvain; Järvelin, Marjo-Riitta; Hyppönen, Elina; McCarthy, Mark I.; Lindi, Virpi; Harri, Niinikoski; Körner, Antje; Bønnelykke, Klaus; Heinrich, Joachim; Melbye, Mads; Rivadeneira, Fernando; Hakonarson, Hakon; Ring, Susan M.; Smith, George Davey; Sørensen, Thorkild I.A.; Timpson, Nicholas J.; Grant, Struan F.A.; Jaddoe, Vincent W.V.

    2016-01-01

    A large number of genetic loci are associated with adult body mass index. However, the genetics of childhood body mass index are largely unknown. We performed a meta-analysis of genome-wide association studies of childhood body mass index, using sex- and age-adjusted standard deviation scores. We included 35 668 children from 20 studies in the discovery phase and 11 873 children from 13 studies in the replication phase. In total, 15 loci reached genome-wide significance (P-value < 5 × 10^−8) in the joint discovery and replication analysis, of which 12 are previously identified loci in or close to ADCY3, GNPDA2, TMEM18, SEC16B, FAIM2, FTO, TFAP2B, TNNI3K, MC4R, GPR61, LMX1B and OLFM4 associated with adult body mass index or childhood obesity. We identified three novel loci: rs13253111 near ELP3, rs8092503 near RAB27B and rs13387838 near ADAM23. Per additional risk allele, body mass index increased 0.04 Standard Deviation Score (SDS) [Standard Error (SE) 0.007], 0.05 SDS (SE 0.008) and 0.14 SDS (SE 0.025), for rs13253111, rs8092503 and rs13387838, respectively. A genetic risk score combining all 15 SNPs showed that each additional average risk allele was associated with a 0.073 SDS (SE 0.011, P-value = 3.12 × 10^−10) increase in childhood body mass index in a population of 1955 children. This risk score explained 2% of the variance in childhood body mass index. This study highlights the shared genetic background between childhood and adult body mass index and adds three novel loci. These loci likely represent age-related differences in strength of the associations with body mass index. PMID:26604143
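
    The genetic risk score mentioned in the abstract above can be illustrated with a minimal sketch. It assumes an unweighted score (a plain count of risk alleles per child) and a simple least-squares slope; the abstract does not say whether the study weighted the alleles, and the genotypes, outcome values and function names below are hypothetical, not study data.

```python
# Illustrative only: toy genotypes and outcomes, not data from the study.
# Each genotype is the number of risk alleles (0, 1 or 2) carried at one SNP.

def risk_score(genotypes):
    """Unweighted genetic risk score: total risk-allele count across SNPs."""
    if any(g not in (0, 1, 2) for g in genotypes):
        raise ValueError("genotypes must be coded 0, 1 or 2")
    return sum(genotypes)

def slope(xs, ys):
    """Ordinary least-squares slope of ys on xs (change in y per unit x)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Hypothetical children with three SNPs each (the study combined 15 SNPs).
children = [[0, 1, 0], [1, 1, 0], [2, 1, 0], [2, 2, 0], [2, 2, 1]]
scores = [risk_score(g) for g in children]
bmi_sds = [0.1, 0.2, 0.3, 0.4, 0.5]  # made-up BMI standard deviation scores

# Estimated change in BMI SDS per additional risk allele in the toy data.
per_allele_effect = slope(scores, bmi_sds)
```

    In this toy setup the slope plays the role of the per-allele effect that the abstract reports as 0.073 SDS; a real analysis would also adjust for covariates and report a standard error.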

  1. Genome-Wide Analysis of A-to-I RNA Editing.

    PubMed

    Savva, Yiannis A; Laurent, Georges St; Reenan, Robert A

    2016-01-01

    Adenosine (A)-to-inosine (I) RNA editing is a fundamental posttranscriptional modification that ensures the deamination of A-to-I in double-stranded (ds) RNA molecules. Intriguingly, the A-to-I RNA editing system is particularly active in the nervous system of higher eukaryotes, altering a plethora of noncoding and coding sequences. Abnormal RNA editing is highly associated with many neurological phenotypes and neurodevelopmental disorders. However, the molecular mechanisms underlying RNA editing-mediated pathogenesis still remain enigmatic and have attracted increasing attention from researchers. Over the last decade, methods available to perform genome-wide transcriptome analysis have evolved rapidly. Within the RNA editing field, researchers have adopted next-generation sequencing technologies to identify RNA-editing sites within genomes and to elucidate the underlying process. However, technical challenges associated with editing site discovery have hindered efforts to uncover comprehensive editing site datasets, resulting in the general perception that the collections of annotated editing sites represent only a small minority of the total number of sites in a given organism, tissue, or cell type of interest. In addition to doubts about sensitivity, existing RNA-editing site lists often contain high percentages of false positives, leading to uncertainty about their validity and usefulness in downstream studies. An accurate investigation of A-to-I editing requires properly validated datasets of editing sites with demonstrated and transparent levels of sensitivity and specificity. Here, we describe a high signal-to-noise method for RNA-editing site detection using single-molecule sequencing (SMS). With this method, authentic RNA-editing sites may be differentiated from artifacts. Machine learning approaches provide a procedure to improve upon and experimentally validate sequencing outcomes through use of computationally predicted, iterative feedback loops

  2. Genome-wide meta-analysis of common variant differences between men and women

    PubMed Central

    Boraska, Vesna; Jerončić, Ana; Colonna, Vincenza; Southam, Lorraine; Nyholt, Dale R.; William Rayner, Nigel; Perry, John R.B.; Toniolo, Daniela; Albrecht, Eva; Ang, Wei; Bandinelli, Stefania; Barbalic, Maja; Barroso, Inês; Beckmann, Jacques S.; Biffar, Reiner; Boomsma, Dorret; Campbell, Harry; Corre, Tanguy; Erdmann, Jeanette; Esko, Tõnu; Fischer, Krista; Franceschini, Nora; Frayling, Timothy M.; Girotto, Giorgia; Gonzalez, Juan R.; Harris, Tamara B.; Heath, Andrew C.; Heid, Iris M.; Hoffmann, Wolfgang; Hofman, Albert; Horikoshi, Momoko; Hua Zhao, Jing; Jackson, Anne U.; Hottenga, Jouke-Jan; Jula, Antti; Kähönen, Mika; Khaw, Kay-Tee; Kiemeney, Lambertus A.; Klopp, Norman; Kutalik, Zoltán; Lagou, Vasiliki; Launer, Lenore J.; Lehtimäki, Terho; Lemire, Mathieu; Lokki, Marja-Liisa; Loley, Christina; Luan, Jian'an; Mangino, Massimo; Mateo Leach, Irene; Medland, Sarah E.; Mihailov, Evelin; Montgomery, Grant W.; Navis, Gerjan; Newnham, John; Nieminen, Markku S.; Palotie, Aarno; Panoutsopoulou, Kalliope; Peters, Annette; Pirastu, Nicola; Polašek, Ozren; Rehnström, Karola; Ripatti, Samuli; Ritchie, Graham R.S.; Rivadeneira, Fernando; Robino, Antonietta; Samani, Nilesh J.; Shin, So-Youn; Sinisalo, Juha; Smit, Johannes H.; Soranzo, Nicole; Stolk, Lisette; Swinkels, Dorine W.; Tanaka, Toshiko; Teumer, Alexander; Tönjes, Anke; Traglia, Michela; Tuomilehto, Jaakko; Valsesia, Armand; van Gilst, Wiek H.; van Meurs, Joyce B.J.; Smith, Albert Vernon; Viikari, Jorma; Vink, Jacqueline M.; Waeber, Gerard; Warrington, Nicole M.; Widen, Elisabeth; Willemsen, Gonneke; Wright, Alan F.; Zanke, Brent W.; Zgaga, Lina; Boehnke, Michael; d'Adamo, Adamo Pio; de Geus, Eco; Demerath, Ellen W.; den Heijer, Martin; Eriksson, Johan G.; Ferrucci, Luigi; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Hengstenberg, Christian; Hudson, Thomas J.; Järvelin, Marjo-Riitta; Kogevinas, Manolis; Loos, Ruth J.F.; Martin, Nicholas G.; Metspalu, Andres; Pennell, Craig E.; Penninx, Brenda W.; 
Perola, Markus; Raitakari, Olli; Salomaa, Veikko; Schreiber, Stefan; Schunkert, Heribert; Spector, Tim D.; Stumvoll, Michael; Uitterlinden, André G.; Ulivi, Sheila; van der Harst, Pim; Vollenweider, Peter; Völzke, Henry; Wareham, Nicholas J.; Wichmann, H.-Erich; Wilson, James F.; Rudan, Igor; Xue, Yali; Zeggini, Eleftheria

    2012-01-01

    The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate the presence of common allele modest effects at autosomal and chromosome X variants that could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10^−8) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ∼115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits. PMID:22843499
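
    The per-SNP comparison of allele frequencies between men and women described above can be sketched as follows. The abstract does not name the test that was used, so this assumes a Pearson chi-square test (1 degree of freedom) on a 2x2 table of allele counts; the counts and the function name are hypothetical. Only the 5 × 10^−8 threshold comes from the abstract.

```python
import math

GENOME_WIDE_ALPHA = 5e-8  # significance threshold quoted in the abstract

def allele_freq_test(alt_men, total_men, alt_women, total_women):
    """Pearson chi-square test (1 df) on a 2x2 table of allele counts.

    Rows are men and women; columns are alternate and reference alleles.
    Returns the chi-square statistic and its p-value.
    """
    table = [
        [alt_men, total_men - alt_men],
        [alt_women, total_women - alt_women],
    ]
    grand = total_men + total_women
    row_sums = [sum(row) for row in table]
    col_sums = [table[0][j] + table[1][j] for j in range(2)]
    chi2 = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_sums[i] * col_sums[j] / grand
            chi2 += (table[i][j] - expected) ** 2 / expected
    # For 1 degree of freedom, the upper-tail probability is erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value

# Hypothetical allele counts: identical frequency (0.30) in both sexes.
chi2_same, p_same = allele_freq_test(300, 1000, 360, 1200)

# Hypothetical allele counts: frequency 0.40 in men versus 0.30 in women.
chi2_diff, p_diff = allele_freq_test(400, 1000, 300, 1000)
is_genome_wide_significant = p_diff < GENOME_WIDE_ALPHA
```

    A meta-analysis such as the one described would combine per-study results rather than pool raw counts; the sketch covers only the single-study, single-SNP step.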

  3. Genome-wide Linkage and Association Analysis of Cardiometabolic Phenotypes in Hispanic Americans

    PubMed Central

    Hellwege, Jacklyn N.; Palmer, Nicholette D.; Dimitrov, Latchezar; Keaton, Jacob M.; Tabb, Keri L.; Sajuthi, Satria; Taylor, Kent D.; Ng, Maggie C.Y.; Speliotes, Elizabeth K.; Hawkins, Gregory A.; Long, Jirong; Chen, Yii-Der Ida; Lorenzo, Carlos; Norris, Jill M.; Rotter, Jerome I.; Langefeld, Carl D.; Wagenknecht, Lynne E.; Bowden, Donald W.

    2016-01-01

    Linkage studies of complex genetic diseases have been largely replaced by Genome-Wide Association studies (GWAS), due in part to limited success in complex trait discovery. However, recent interest in rare and low-frequency variants motivates reexamination of family-based methods. In this study we investigated the performance of two-point linkage analysis for over 1.6 million SNPs combined with single variant association analysis to identify high impact variants which are both strongly linked and associated with cardiometabolic traits in up to 1 414 Hispanics from the Insulin Resistance Atherosclerosis Family Study (IRASFS). Evaluation of all 50 phenotypes yielded 83 557 000 LOD scores with 9 214 LOD scores ≥ 3.0, 845 ≥ 4.0, and 89 ≥ 5.0, with a maximal LOD score of 6.49 (rs12956744 in the LAMA1 gene for TNFα receptor 2). Twenty-seven variants were associated with p < 0.005 as well as having a LOD score > 4, including variants in the NFIB gene under a linkage peak with TNFα receptor 2 levels on chromosome 9. Linkage regions of interest included a broad peak (31Mb) on chromosome 1q with acute insulin response (max LOD = 5.37). This region was previously documented with type 2 diabetes in family-based studies, providing support for the validity of these results. Overall, we have demonstrated the utility of two-point linkage and association in comprehensive genome-wide array-based SNP genotypes. PMID:27535031
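
    For readers unfamiliar with LOD scores, the textbook two-point definition can be sketched as below. This is an assumption-laden simplification: it uses the classical phase-known formula for counted recombinants, whereas the study analysed quantitative traits in pedigrees and its actual likelihood model is not given in the abstract. The function name and the example counts are hypothetical.

```python
import math

def lod_score(recombinants, meioses, theta):
    """Two-point LOD score for phase-known meioses.

    LOD(theta) = log10( L(theta) / L(0.5) ), where
    L(theta) = theta**k * (1 - theta)**(n - k) for k recombinants
    among n informative meioses.
    """
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must lie in (0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie between 0 and meioses")
    non_recombinants = meioses - recombinants
    log_l_theta = (recombinants * math.log10(theta)
                   + non_recombinants * math.log10(1.0 - theta))
    log_l_null = meioses * math.log10(0.5)  # free recombination, no linkage
    return log_l_theta - log_l_null

# Hypothetical example: 1 recombinant among 10 informative meioses.
lod_linked = lod_score(1, 10, 0.1)

# At theta = 0.5 the two likelihoods coincide, so the LOD score is zero.
lod_null = lod_score(1, 10, 0.5)
```

    A LOD score of 3 corresponds to odds of 1000:1 in favour of linkage, which is the conventional threshold the abstract's counts (LOD ≥ 3.0, ≥ 4.0, ≥ 5.0) build on.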

  4. Identification of susceptibility loci for colorectal cancer in a genome-wide meta-analysis

    PubMed Central

    Whiffin, Nicola; Hosking, Fay J.; Farrington, Susan M.; Palles, Claire; Dobbins, Sara E.; Zgaga, Lina; Lloyd, Amy; Kinnersley, Ben; Gorman, Maggie; Tenesa, Albert; Broderick, Peter; Wang, Yufei; Barclay, Ella; Hayward, Caroline; Martin, Lynn; Buchanan, Daniel D.; Win, Aung Ko; Hopper, John; Jenkins, Mark; Lindor, Noralane M.; Newcomb, Polly A.; Gallinger, Steve; Conti, David; Schumacher, Fred; Casey, Graham; Liu, Tao; Campbell, Harry; Lindblom, Annika; Houlston, Richard S.; Tomlinson, Ian P.; Dunlop, Malcolm G.

    2014-01-01

    To identify common variants influencing colorectal cancer (CRC) risk, we performed a meta-analysis of five genome-wide association studies, comprising 5626 cases and 7817 controls of European descent. We conducted replication of top ranked single nucleotide polymorphisms (SNPs) in additional series totalling 14 037 cases and 15 937 controls, identifying a new CRC risk locus at 10q24.2 [rs1035209; odds ratio (OR) = 1.13, P = 4.54 × 10^−11]. We also performed meta-analysis of our studies, with previously published data, of several recently purported CRC risk loci. We failed to find convincing evidence for a previously reported genome-wide association at rs11903757 (2q32.3). Of the three additional loci for which evidence of an association in Europeans has been previously described, we failed to show an association between rs59336 (12q24.21) and CRC risk. However, for the other two SNPs, our analyses demonstrated new, formally significant associations with CRC. These are rs3217810 intronic in CCND2 (12p13.32; OR = 1.19, P = 2.16 × 10^−10) and rs10911251 near LAMC1 (1q25.3; OR = 1.09, P = 1.75 × 10^−8). Additionally, in our European datasets we found some evidence to support a relationship, originally identified in East Asians, between rs647161, rs2423297 and rs10774214 and CRC risk. Our findings provide further insights into the genetic and biological basis of inherited genetic susceptibility to CRC. PMID:24737748

  5. Genome-wide association analysis of autoantibody positivity in type 1 diabetes cases.

    PubMed

    Plagnol, Vincent; Howson, Joanna M M; Smyth, Deborah J; Walker, Neil; Hafler, Jason P; Wallace, Chris; Stevens, Helen; Jackson, Laura; Simmonds, Matthew J; Bingley, Polly J; Gough, Stephen C; Todd, John A

    2011-08-01

    The genetic basis of autoantibody production is largely unknown outside of associations located in the major histocompatibility complex (MHC) human leukocyte antigen (HLA) region. The aim of this study is the discovery of new genetic associations with autoantibody positivity using genome-wide association scan single nucleotide polymorphism (SNP) data in type 1 diabetes (T1D) patients with autoantibody measurements. We measured two anti-islet autoantibodies, glutamate decarboxylase (GADA, n = 2,506), insulinoma-associated antigen 2 (IA-2A, n = 2,498), antibodies to the autoimmune thyroid (Graves') disease (AITD) autoantigen thyroid peroxidase (TPOA, n = 8,300), and antibodies against gastric parietal cells (PCA, n = 4,328) that are associated with autoimmune gastritis. Two loci passed a stringent genome-wide significance level (p < 10^−10): 1q23/FCRL3 with IA-2A and 9q34/ABO with PCA. Eleven of 52 non-MHC T1D loci showed evidence of association with at least one autoantibody at a false discovery rate of 16%: 16p11/IL27-IA-2A, 2q24/IFIH1-IA-2A and PCA, 2q32/STAT4-TPOA, 10p15/IL2RA-GADA, 6q15/BACH2-TPOA, 21q22/UBASH3A-TPOA, 1p13/PTPN22-TPOA, 2q33/CTLA4-TPOA, 4q27/IL2/TPOA, 15q14/RASGRP1/TPOA, and 12q24/SH2B3-GADA and TPOA. Analysis of the TPOA-associated loci in 2,477 cases with Graves' disease identified two new AITD loci (BACH2 and UBASH3A).

  6. Genome-wide enrichment analysis between endometriosis and obesity-related traits reveals novel susceptibility loci

    PubMed Central

    Rahmioglu, Nilufer; Macgregor, Stuart; Drong, Alexander W.; Hedman, Åsa K.; Harris, Holly R.; Randall, Joshua C.; Prokopenko, Inga; Nyholt, Dale R.; Morris, Andrew P.; Montgomery, Grant W.; Missmer, Stacey A.; Lindgren, Cecilia M.; Zondervan, Krina T.

    2015-01-01

    Endometriosis is a chronic inflammatory condition in women that results in pelvic pain and subfertility, and has been associated with decreased body mass index (BMI). Genetic variants contributing to the heritable component have started to emerge from genome-wide association studies (GWAS), although the majority remain unknown. Unexpectedly, we observed an intergenic locus on 7p15.2 that was genome-wide significantly associated with both endometriosis and fat distribution (waist-to-hip ratio adjusted for BMI; WHRadjBMI) in an independent meta-GWAS of European ancestry individuals. This led us to investigate the potential overlap in genetic variants underlying the aetiology of endometriosis, WHRadjBMI and BMI using GWAS data. Our analyses demonstrated significant enrichment of common variants between fat distribution and endometriosis (P = 3.7 × 10⁻³), which was stronger when we restricted the investigation to more severe (Stage B) cases (P = 4.5 × 10⁻⁴). However, no genetic enrichment was observed between endometriosis and BMI (P = 0.79). In addition to 7p15.2, we identify four more variants with statistically significant evidence of involvement in both endometriosis and WHRadjBMI (in/near KIFAP3, CAB39L, WNT4, GRB14); two of these, KIFAP3 and CAB39L, are novel associations for both traits. KIFAP3, WNT4 and 7p15.2 are associated with the WNT signalling pathway; formal pathway analysis confirmed a statistically significant (P = 6.41 × 10⁻⁴) overrepresentation of shared associations in developmental processes/WNT signalling between the two traits. Our results demonstrate an example of potential biological pleiotropy that was hitherto unknown, and represent an opportunity for functional follow-up of loci and further cross-phenotype comparisons to assess how fat distribution and endometriosis pathogenesis research fields can inform each other. PMID:25296917

  7. Integrated analysis of copy number variation and genome-wide expression profiling in colorectal cancer tissues.

    PubMed

    Ali Hassan, Nur Zarina; Mokhtar, Norfilza Mohd; Kok Sin, Teow; Mohamed Rose, Isa; Sagap, Ismail; Harun, Roslan; Jamal, Rahman

    2014-01-01

    Integrative analyses of multiple genomic datasets for selected samples can provide better insight into the overall data and can enhance our knowledge of cancer. The objective of this study was to elucidate the association between copy number variation (CNV) and gene expression in colorectal cancer (CRC) samples and their corresponding non-cancerous tissues. Sixty-four paired CRC samples from the same patients were subjected to CNV profiling using the Illumina HumanOmni1-Quad assay, and validation was performed using multiplex ligation probe amplification method. Genome-wide expression profiling was performed on 15 paired samples from the same group of patients using the Affymetrix Human Gene 1.0 ST array. Significant genes obtained from both array results were then overlapped. To identify molecular pathways, the data were mapped to the KEGG database. Whole genome CNV analysis that compared primary tumor and non-cancerous epithelium revealed gains in 1638 genes and losses in 36 genes. Significant gains were mostly found in chromosome 20 at position 20q12 with a frequency of 45.31% in tumor samples. Examples of genes that were associated at this cytoband were PTPRT, EMILIN3 and CHD6. The highest number of losses was detected at chromosome 8, position 8p23.2 with 17.19% occurrence in all tumor samples. Among the genes found at this cytoband were CSMD1 and DLC1. Genome-wide expression profiling showed 709 genes to be up-regulated and 699 genes to be down-regulated in CRC compared to non-cancerous samples. Integration of these two datasets identified 56 overlapping genes, which were located in chromosomes 8, 20 and 22. MLPA confirmed that the CRC samples had the highest gains in chromosome 20 compared to the reference samples. Interpretation of the CNV data in the context of the transcriptome via integrative analyses may provide more in-depth knowledge of the genomic landscape of CRC.

  8. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Development of Advanced Classification Algorithm for Genome-Wide Single Nucleotide Polymorphism (SNP) Data Analysis

    DTIC Science & Technology

    2011-04-01

    distribution unlimited. QC – quality control; QTL – quantitative trait loci; SNP – single nucleotide polymorphism; TE – Tris + EDTA; TBE – Tris + Boric Acid + EDTA; WGSA – whole genome sampling assay ...canine intelligence testing protocol; EDTA – ethylenediaminetetraacetic acid; GWAS – genome-wide association study; LD – linkage disequilibrium; MWD

  9. Pathway Analysis Based on a Genome-Wide Association Study of Polycystic Ovary Syndrome

    PubMed Central

    Shim, Unjin; Kim, Han-Na; Lee, Hyejin; Oh, Jee-Young

    2015-01-01

    Background Polycystic ovary syndrome (PCOS) is one of the most common endocrine disorders in women of reproductive age, and it is affected by both environmental and genetic factors. Although the genetic component of PCOS is evident, studies aiming to identify susceptibility genes have shown controversial results. This study conducted a pathway-based analysis using a dataset obtained through a genome-wide association study (GWAS) to elucidate the biological pathways that contribute to PCOS susceptibility and the associated genes. Methods We used GWAS data on 636,797 autosomal single nucleotide polymorphisms (SNPs) from 1,221 individuals (432 PCOS patients and 789 controls) for analysis. A pathway analysis was conducted using meta-analysis gene-set enrichment of variant associations (MAGENTA). Top-ranking pathways or gene sets associated with PCOS were identified, and significant genes within the pathways were analyzed. Results The pathway analysis of the GWAS dataset identified significant pathways related to oocyte meiosis and the regulation of insulin secretion by acetylcholine and free fatty acids (all nominal gene-set enrichment analysis (GSEA) P-values < 0.05). In addition, INS, GNAQ, STXBP1, PLCB3, PLCB2, SMC3 and PLCZ1 were significant genes observed within the biological pathways (all gene P-values < 0.05). Conclusions By applying MAGENTA pathway analysis to PCOS GWAS data, we identified significant pathways and candidate genes involved in PCOS. Our findings may provide new leads for understanding the mechanisms underlying the development of PCOS. PMID:26308735

  10. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.).

    PubMed

    Deokar, Amit A; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play a critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of the chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas a few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness

  11. Genome-Wide Analysis of the Aquaporin Gene Family in Chickpea (Cicer arietinum L.)

    PubMed Central

    Deokar, Amit A.; Tar'an, Bunyamin

    2016-01-01

    Aquaporins (AQPs) are essential membrane proteins that play a critical role in the transport of water and many other solutes across cell membranes. In this study, a comprehensive genome-wide analysis identified 40 AQP genes in chickpea (Cicer arietinum L.). A complete overview of the chickpea AQP (CaAQP) gene family is presented, including their chromosomal locations, gene structure, phylogeny, gene duplication, conserved functional motifs, gene expression, and conserved promoter motifs. To understand AQP evolution, a comparative analysis of chickpea AQPs with AQP orthologs from soybean, Medicago, common bean, and Arabidopsis was performed. The chickpea AQP genes were found on all of the chickpea chromosomes, except chromosome 7, with a maximum of six genes on chromosome 6, and a minimum of one gene on chromosome 5. Gene duplication analysis indicated that the expansion of the chickpea AQP gene family might have been due to segmental and tandem duplications. CaAQPs were grouped into four subfamilies including 15 NOD26-like intrinsic proteins (NIPs), 13 tonoplast intrinsic proteins (TIPs), eight plasma membrane intrinsic proteins (PIPs), and four small basic intrinsic proteins (SIPs) based on sequence similarities and phylogenetic position. Gene structure analysis revealed a highly conserved exon-intron pattern within CaAQP subfamilies supporting the CaAQP family classification. Functional prediction based on conserved Ar/R selectivity filters, Froger's residues, and specificity-determining positions suggested wide differences in substrate specificity among the subfamilies of CaAQPs. Expression analysis of the AQP genes indicated that some of the genes are tissue-specific, whereas a few other AQP genes showed differential expression in response to biotic and abiotic stresses. Promoter profiling of CaAQP genes for conserved cis-acting regulatory elements revealed enrichment of cis-elements involved in circadian control, light response, defense and stress responsiveness

  12. A genome-wide meta-analysis identifies novel loci associated with schizophrenia and bipolar disorder.

    PubMed

    Wang, Ke-Sheng; Liu, Xue-Feng; Aragam, Nagesh

    2010-12-01

    Schizophrenia and bipolar disorder both have strong inherited components. Recent studies have indicated that schizophrenia and bipolar disorder may share more than half of their genetic determinants. In this study, we performed a meta-analysis (combined analysis) for genome-wide association data of the Affymetrix Genome-Wide Human SNP array 6.0 to detect genetic variants influencing both schizophrenia and bipolar disorder using European-American samples (653 bipolar cases and 1034 controls, 1172 schizophrenia cases and 1379 controls). The best associated SNP rs11789399 was located at 9q33.1 (p=2.38 × 10(-6), 5.74 × 10(-4), and 5.56 × 10(-9), for schizophrenia, bipolar disorder and meta-analysis of schizophrenia and bipolar disorder, respectively), where one flanking gene, ASTN2 (220kb away) has been associated with attention deficit/hyperactivity disorder and schizophrenia. The next best SNP was rs12201676 located at 6q15 (p=2.67 × 10(-4), 2.12 × 10(-5), 3.88 × 10(-8) for schizophrenia, bipolar disorder and meta-analysis, respectively), near two flanking genes, GABRR1 and GABRR2 (15 and 17kb away, respectively). The third interesting SNP rs802568 was at 7q35 within CNTNAP2 (p=8.92 × 10(-4), 1.38 × 10(-5), and 1.62 × 10(-7) for schizophrenia, bipolar disorder and meta-analysis, respectively). Through meta-analysis, we found two additional associated genes NALCN (the top SNP is rs2044117, p=4.57 × 10(-7)) and NAP5 (the top SNP is rs10496702, p=7.15 × 10(-7)). Haplotype analyses of the above five loci further supported the associations with schizophrenia and bipolar disorder. These results provide evidence of common genetic variants influencing schizophrenia and bipolar disorder. These findings will serve as a resource for replication in other populations to elucidate the potential role of these genetic variants in schizophrenia and bipolar disorder.

  13. A genome-wide analysis of the expansin genes in Malus × Domestica.

    PubMed

    Zhang, Shizhong; Xu, Ruirui; Gao, Zheng; Chen, Changtian; Jiang, Zesheng; Shu, Huairui

    2014-04-01

    Expansins were first identified as cell wall-loosening proteins; they are involved in regulating cell expansion, fruit softening and many other physiological processes. However, our knowledge about the expansin family members and their evolutionary relationships in fruit trees, such as apple, is limited. In this study, we identified 41 members of the expansin gene family in the genome of apple (Malus × Domestica L. Borkh). Phylogenetic analysis revealed that expansin genes in apple could be divided into four subfamilies according to their gene structures and protein motifs. By phylogenetic analysis of the expansins in five plants (Arabidopsis, rice, poplar, grape and apple), the expansins were divided into 17 subgroups. Our gene duplication analysis revealed that whole-genome and chromosomal-segment duplications contributed to the expansion of Mdexpansins. The microarray and expressed sequence tag (EST) data showed that 34 Mdexpansin genes could be divided into five groups by the EST analysis; they may also play different roles during fruit development. An expression model for MdEXPA16 and MdEXPA20 showed their potential role in developing fruit. Overall, our study provides useful data and novel insights into the functions and regulatory mechanisms of the expansin genes in apple, as well as their evolution and divergence. As the first step towards genome-wide analysis of the expansin genes in apple, our results have established a solid foundation for future studies on the function of the expansin genes in fruit development.

  14. Identification of proprotein convertase substrates using genome-wide expression correlation analysis

    PubMed Central

    2011-01-01

    Background Subtilisin/kexin-like proprotein convertase (PCSK) enzymes have important regulatory function in a wide variety of biological processes. PCSKs proteolytically process at a target sequence that contains basic amino acids arginine and lysine, which results in functional maturation of the target protein. In vitro assays have shown significant biochemical redundancy between the seven family members, but the phenotypes of PCSK deficient mice and patients carrying an inactive PCSK allele argue for a specific biological function. Modeling the structures of individual PCSK enzymes has offered little insight into the specificity determinants. However, previous studies have shown that there can be a coordinated expression between a PCSK and its target molecule. Here, we have surveyed the putative PCSK target proteins using genome-wide expression correlation analysis and cleavage site prediction algorithms. Results We first performed a gene expression correlation analysis over the whole genome for all PCSK enzymes. PCSKs were found to cluster differently based on the strength of correlations. The screen for putative PCSK target proteins showed a significant enrichment (p-values from 1.2e-4 to < 1.0e-10) of putative targets among the most positively correlating genes for most PCSKs. Interestingly, there was no enrichment in putative targets among the genes that correlated positively with the biologically redundant PCSK7, whereas PCSK5 showed an inverse correlation. PCSKs also showed a highly variable degree of shared target genes that were identified by expression correlation and cleavage site prediction. Multiple alignments were used to evaluate the putative targets to pinpoint the important residues for the substrate recognition. Finally, we validated our approach and biochemically identified PAPPA1 and ADAMTS6 as novel targets for FURIN proteolytic activity. Conclusions Most PCSK enzymes display strong positive expression correlation with predicted target

  15. Genome-wide characterization of transcriptional start sites in humans by integrative transcriptome analysis

    PubMed Central

    Yamashita, Riu; Sathira, Nuankanya P.; Kanai, Akinori; Tanimoto, Kousuke; Arauchi, Takako; Tanaka, Yoshiaki; Hashimoto, Shin-ichi; Sugano, Sumio; Nakai, Kenta; Suzuki, Yutaka

    2011-01-01

    We performed a genome-wide analysis of transcriptional start sites (TSSs) in human genes by multifaceted use of a massively parallel sequencer. By analyzing 800 million sequences that were obtained from various types of transcriptome analyses, we characterized 140 million TSS tags in 12 human cell types. Despite the large number of TSS clusters (TSCs), the number of TSCs was observed to decrease sharply with increasing expression levels. Highly expressed TSCs exhibited several characteristic features: Nucleosome-seq analysis revealed highly ordered nucleosome structures, ChIP-seq analysis detected clear RNA polymerase II binding signals in their surrounding regions, evaluations of previously sequenced and newly shotgun-sequenced complete cDNA sequences showed that they encode preferable transcripts for protein translation, and RNA-seq analysis of polysome-incorporated RNAs yielded direct evidence that those transcripts are actually translated into proteins. We also demonstrate that integrative interpretation of transcriptome data is essential for the selection of putative alternative promoter TSCs, two of which also have protein consequences. Furthermore, discriminative chromatin features that separate TSCs at different expression levels were found for both genic TSCs and intergenic TSCs. The collected integrative information should provide a useful basis for future biological characterization of TSCs. PMID:21372179

  16. Genome-wide Identification and Structural, Functional and Evolutionary Analysis of WRKY Components of Mulberry

    PubMed Central

    Baranwal, Vinay Kumar; Negi, Nisha; Khurana, Paramjit

    2016-01-01

    Mulberry is known to be sensitive to several biotic and abiotic stresses, which in turn have a direct impact on the yield of silk, because it is the sole food source for the silkworm. WRKYs are a family of transcription factors that play an important role in combating various biotic and abiotic stresses. In this study, we identified 54 genes with conserved WRKY motifs in the Morus notabilis genome. Motif searches coupled with a phylogenetic analysis revealed seven sub-groups as well as the absence of members of Group Ib in mulberry. Analyses of the 2K upstream region in addition to a gene ontology terms enrichment analysis revealed putative functions of mulberry WRKYs under biotic and abiotic stresses. An RNA-seq-based analysis showed that several of the identified WRKYs are preferentially expressed in the leaf, bark, root, male flower, and winter bud of M. notabilis. Finally, expression analysis by qPCR under different stress and hormone treatments revealed genotype-specific responses. Taken together, our results summarize the genome-wide identification of WRKYs as well as their differential response to stresses and hormones. Importantly, these data can also be utilized to identify potential molecular targets for conferring tolerance to various stresses in mulberry. PMID:27477686

  17. Genome-wide Identification and Structural, Functional and Evolutionary Analysis of WRKY Components of Mulberry.

    PubMed

    Baranwal, Vinay Kumar; Negi, Nisha; Khurana, Paramjit

    2016-08-01

    Mulberry is known to be sensitive to several biotic and abiotic stresses, which in turn have a direct impact on the yield of silk, because it is the sole food source for the silkworm. WRKYs are a family of transcription factors that play an important role in combating various biotic and abiotic stresses. In this study, we identified 54 genes with conserved WRKY motifs in the Morus notabilis genome. Motif searches coupled with a phylogenetic analysis revealed seven sub-groups as well as the absence of members of Group Ib in mulberry. Analyses of the 2K upstream region in addition to a gene ontology terms enrichment analysis revealed putative functions of mulberry WRKYs under biotic and abiotic stresses. An RNA-seq-based analysis showed that several of the identified WRKYs are preferentially expressed in the leaf, bark, root, male flower, and winter bud of M. notabilis. Finally, expression analysis by qPCR under different stress and hormone treatments revealed genotype-specific responses. Taken together, our results summarize the genome-wide identification of WRKYs as well as their differential response to stresses and hormones. Importantly, these data can also be utilized to identify potential molecular targets for conferring tolerance to various stresses in mulberry.

  18. Genome-Wide Analysis of Branched-Chain Amino Acid Levels in Arabidopsis Seeds

    PubMed Central

    Angelovici, Ruthie; Lipka, Alexander E.; Deason, Nicholas; Gonzalez-Jorge, Sabrina; Lin, Haining; Cepela, Jason; Buell, Robin; Gore, Michael A.; DellaPenna, Dean

    2013-01-01

    Branched-chain amino acids (BCAAs) are three of the nine essential amino acids in human and animal diets and are important for numerous processes in development and growth. However, seed BCAA levels in major crops are insufficient to meet dietary requirements, making genetic improvement for increased and balanced seed BCAAs an important nutritional target. Addressing this issue requires a better understanding of the genetics underlying seed BCAA content and composition. Here, a genome-wide association study and haplotype analysis for seed BCAA traits in Arabidopsis thaliana revealed a strong association with a chromosomal interval containing two BRANCHED-CHAIN AMINO ACID TRANSFERASES, BCAT1 and BCAT2. Linkage analysis, reverse genetic approaches, and molecular complementation analysis demonstrated that allelic variation at BCAT2 is responsible for the natural variation of seed BCAAs in this interval. Complementation analysis of a bcat2 null mutant with two significantly different alleles from accessions Bayreuth-0 and Shahdara is consistent with BCAT2 contributing to natural variation in BCAA levels, glutamate recycling, and free amino acid homeostasis in seeds in an allele-dependent manner. The seed-specific phenotype of bcat2 null alleles, its strong transcription induction during late seed development, and its subcellular localization to the mitochondria are consistent with a unique, catabolic role for BCAT2 in BCAA metabolism in seeds. PMID:24368787

  19. Pathway analysis of genome-wide association datasets of personality traits.

    PubMed

    Kim, H-N; Kim, B-H; Cho, J; Ryu, S; Shin, H; Sung, J; Shin, C; Cho, N H; Sung, Y A; Choi, B-O; Kim, H-L

    2015-04-01

    Although several genome-wide association (GWA) studies of human personality have been recently published, genetic variants that are highly associated with certain personality traits remain unknown, due to difficulty reproducing results. To further investigate these genetic variants, we assessed biological pathways using GWA datasets. Pathway analysis using GWA data was performed on 1089 Korean women whose personality traits were measured with the Revised NEO Personality Inventory for the 5-factor model of personality. A total of 1042 pathways containing 8297 genes were included in our study. Of these, 14 pathways were highly enriched with association signals that were validated in 1490 independent samples. These pathways include association of: Neuroticism with axon guidance [L1 cell adhesion molecule (L1CAM) interactions]; Extraversion with neuronal system and voltage-gated potassium channels; Agreeableness with L1CAM interaction, neurotransmitter receptor binding and downstream transmission in postsynaptic cells; and Conscientiousness with the interferon-gamma and platelet-derived growth factor receptor beta polypeptide pathways. Several genes that contribute to top-ranked pathways in this study were previously identified in GWA studies or by pathway analysis in schizophrenia or other neuropsychiatric disorders. Here we report the first pathway analysis of all five personality traits. Importantly, our analysis identified novel pathways that contribute to understanding the etiology of personality traits.

  20. Genome-Wide Meta-Analysis of Sciatica in Finnish Population

    PubMed Central

    Lemmelä, Susanna; Solovieva, Svetlana; Shiri, Rahman; Benner, Christian; Heliövaara, Markku; Kettunen, Johannes; Anttila, Verneri; Ripatti, Samuli; Perola, Markus; Seppälä, Ilkka; Juonala, Markus; Kähönen, Mika; Salomaa, Veikko; Viikari, Jorma; Raitakari, Olli T.; Lehtimäki, Terho; Palotie, Aarno; Viikari-Juntura, Eira; Husgafvel-Pursiainen, Kirsti

    2016-01-01

    Sciatica or the sciatic syndrome is a common and often disabling low back disorder in the working-age population. It has a relatively high heritability but poorly understood molecular mechanisms. The Finnish population is a genetic isolate where small founder population and bottleneck events have led to enrichment of certain rare and low frequency variants. We performed here the first genome-wide association (GWAS) and meta-analysis of sciatica. The meta-analysis was conducted across two GWAS covering 291 Finnish sciatica cases and 3671 controls genotyped and imputed at 7.7 million autosomal variants. The most promising loci (p < 1 × 10⁻⁶) were replicated in 776 Finnish sciatica patients and 18,489 controls. We identified five intragenic variants, with relatively low frequencies, at two novel loci associated with sciatica at genome-wide significance. These included chr9:14344410:I (rs71321981) at 9p22.3 (NFIB gene; p = 1.30 × 10⁻⁸, MAF = 0.08) and four variants at 15q21.2: rs145901849, rs80035109, rs190200374 and rs117458827 (MYO5A; p = 1.34 × 10⁻⁸, MAF = 0.06; p = 2.32 × 10⁻⁸, MAF = 0.07; p = 3.85 × 10⁻⁸, MAF = 0.06; p = 4.78 × 10⁻⁸, MAF = 0.07, respectively). The most significant association in the meta-analysis, a single base insertion rs71321981 within the regulatory region of the transcription factor NFIB, replicated in an independent Finnish population sample (p = 0.04). Despite identifying 15q21.2 as a promising locus, we were not able to replicate it. It was differentiated; the lead variants within 15q21.2 were more frequent in Finland (6–7%) than in other European populations (1–2%). Imputation accuracies of the three significantly associated variants (chr9:14344410:I, rs190200374, and rs80035109) were validated by genotyping. In summary, our results suggest a novel locus, 9p22.3 (NFIB), which may be involved in susceptibility to sciatica. In addition, another locus, 15q21.2, emerged as a promising one, but failed to replicate. PMID:27764105

  1. Genome-Wide Identification and Expression Analysis of Calcium-dependent Protein Kinase in Tomato

    PubMed Central

    Hu, Zhangjian; Lv, Xiangzhang; Xia, Xiaojian; Zhou, Jie; Shi, Kai; Yu, Jingquan; Zhou, Yanhong

    2016-01-01

    Calcium-dependent protein kinases (CDPKs) play critical roles in regulating growth, development and stress response in plants. Information about CDPKs in tomato, however, remains obscure although it is one of the most important model crops in the world. In this study, we performed a bioinformatics analysis of the entire tomato genome and identified 29 CDPK genes. These CDPK genes are found to be located in 12 chromosomes, and could be divided into four groups. Analysis of the gene structure and splicing site reflected high structure conservation within different CDPK gene groups both in the exon-intron pattern and mRNA splicing. Transcripts of most CDPK genes varied with plant organs and developmental stages and their transcripts could be differentially induced by abscisic acid (ABA), brassinosteroids (BRs), methyl jasmonate (MeJA), and salicylic acid (SA), as well as after exposure to heat, cold, and drought, respectively. To our knowledge, this is the first report about the genome-wide analysis of the CDPK gene family in tomato, and the findings obtained offer a clue to the elaborated regulatory role of CDPKs in plant growth, development and stress response in tomato. PMID:27092168

  2. Pathway-based analysis of primary biliary cirrhosis genome-wide association studies.

    PubMed

    Kar, S P; Seldin, M F; Chen, W; Lu, E; Hirschfield, G M; Invernizzi, P; Heathcote, J; Cusi, D; Gershwin, M E; Siminovitch, K A; Amos, C I

    2013-04-01

    Genome-wide association studies (GWAS) have successfully identified several loci associated with primary biliary cirrhosis (PBC) risk. Pathway analysis complements conventional GWAS analysis. We applied the recently developed linear combination test for pathways to datasets drawn from independent PBC GWAS in Italian and Canadian subjects. Of the Kyoto Encyclopedia of Genes and Genomes and BioCarta pathways tested, 25 pathways in the Italian dataset (449 cases, 940 controls) and 26 pathways in the Canadian dataset (530 cases, 398 controls) were associated with PBC susceptibility (P<0.05). After correcting for multiple comparisons, only the eight most significant pathways in the Italian dataset had FDR <0.25 with tumor necrosis factor/stress-related signaling emerging as the top pathway (P=7.38 × 10⁻⁴, FDR=0.18). Two pathways, phosphatidylinositol signaling and hedgehog signaling, were replicated in both datasets (P<0.05), and subjected to two additional complementary pathway tests. Both pathway signals remained significant in the Italian dataset on modified gene set enrichment analysis (P<0.05). In both GWAS, variants nominally associated with PBC were significantly overrepresented in the phosphatidylinositol pathway (Fisher exact P<0.05). These results point to established and novel pathway-level associations with inherited predisposition to PBC that, on further independent replication and functional validation, may provide fresh insights into PBC etiology.
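    The overrepresentation result above (Fisher exact P<0.05) can be illustrated with a one-sided Fisher exact test, which is the upper tail of a hypergeometric distribution. This is a minimal sketch under stated assumptions: the function name and all counts are hypothetical, and the record does not say whether the authors used a one-sided or two-sided test.

```python
from math import comb


def fisher_overrepresentation_p(k, K, n, N):
    """One-sided Fisher exact p-value for overrepresentation.

    N: total variants tested (hypothetical background)
    K: variants that map to the pathway
    n: variants nominally associated with the trait
    k: variants that are both in the pathway and nominally associated
    Returns P(overlap >= k) under the hypergeometric null.
    """
    upper = min(K, n)
    total = comb(N, n)
    # Sum the hypergeometric probabilities for overlaps of size k or more.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return tail / total


# Hypothetical counts: 1000 variants tested, 50 in the pathway,
# 60 nominally associated overall, 9 of them inside the pathway.
p_value = fisher_overrepresentation_p(k=9, K=50, n=60, N=1000)
```

    Under the null, about 3 of the 50 pathway variants would be expected among the 60 associated ones, so an overlap of 9 gives a small p-value in this made-up example.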

  3. Genome-wide analysis of plant-specific Dof transcription factor family in tomato.

    PubMed

    Cai, Xiaofeng; Zhang, Yuyang; Zhang, Chanjuan; Zhang, Tingyan; Hu, Tixu; Ye, Jie; Zhang, Junhong; Wang, Taotao; Li, Hanxia; Ye, Zhibiao

    2013-06-01

    The Dof (DNA binding with One Finger) family encoding single zinc finger proteins has been known as a family of plant-specific transcription factors. These transcription factors are involved in a variety of functions of importance for different biological processes in plants. In the current study, we identified 34 Dof family genes in tomato, distributed on 11 chromosomes. A complete overview of SlDof genes in tomato is presented, including the gene structures, chromosome locations, phylogeny, protein motifs and evolution pattern. Phylogenetic analysis of 34 SlDof proteins resulted in four classes constituting six clusters. In addition, a comparative analysis between these genes in tomato, Arabidopsis and rice was also performed. The tomato Dof family expansion has been dated to recent duplication events, and segmental duplication is predominant for the SlDof genes. Furthermore, the SlDof genes displayed differential expression either in their transcript abundance or in their expression patterns under normal growth conditions. This is the first step towards genome-wide analyses of the Dof genes in tomato. Our study provides a very useful reference for cloning and functional analysis of the members of this gene family in tomato and other species.

  4. Genome-wide identification and phylogenetic analysis of the SBP-box gene family in melons.

    PubMed

    Ma, Y; Guo, J W; Bade, R; Men, Z H; Hasi, A

    2014-10-27

    The SBP-box gene family is specific to plants and encodes a class of zinc finger-containing transcription factors with a broad range of functions. Although SBP-box genes have been identified in numerous plants, including green algae, moss, silver birch, snapdragon, Arabidopsis, rice, and maize, there is little information concerning the function of SBP-box genes, or of the corresponding miR156/157, in melon. Using the highly conserved sequence of the Arabidopsis thaliana SBP-box domain protein as a query sequence, the genome-wide protein database of melon was searched to obtain 13 SBP-box protein sequences, which were further divided into 4 groups based on phylogenetic analysis. A further analysis of the melon SBP-box gene family's phylogenetic evolution, sequence similarities, gene structure, and miR156 target sequences was also conducted. Expression analysis of the melon SBP-box family genes showed that they were detected in 7 kinds of tissue, with the highest expression level in fruit. CmSBP11 tended to be expressed specifically in melon fruit and root. CmSBP09 expression was highest in flower. Overall, the molecular evolution and expression pattern of the melon SBP-box gene family revealed by these results suggest functional differentiation following gene duplication.

  5. Genome-Wide DNA Methylation Analysis and Epigenetic Variations Associated with Congenital Aortic Valve Stenosis (AVS)

    PubMed Central

    Radhakrishna, Uppala; Albayrak, Samet; Alpay-Savasan, Zeynep; Zeb, Amna; Turkoglu, Onur; Sobolewski, Paul; Bahado-Singh, Ray O.

    2016-01-01

    Congenital heart defect (CHD) is the most common cause of death from congenital anomaly. Among several candidate epigenetic mechanisms, DNA methylation may play an important role in the etiology of CHDs. We conducted a genome-wide DNA methylation analysis using an Illumina Infinium 450k human methylation assay in a cohort of 24 newborns who had aortic valve stenosis (AVS), with gestational-age matched controls. The study identified significantly altered CpG methylation at 59 sites in 52 genes in AVS subjects as compared to controls (either hypermethylated or demethylated). Gene Ontology analysis identified biological processes and functions for these genes including positive regulation of receptor-mediated endocytosis. Consistent with prior clinical data, the molecular function categories as determined using DAVID identified low-density lipoprotein receptor binding, lipoprotein receptor binding and identical protein binding to be over-represented in the AVS group. A significant epigenetic change in the APOA5 and PCSK9 genes known to be involved in AVS was also observed. A large number of CpG methylation sites individually demonstrated good to excellent diagnostic accuracy for the prediction of AVS status, thus raising the possibility of molecular screening markers for this disorder. Using epigenetic analysis we were able to identify genes significantly involved in the pathogenesis of AVS. PMID:27152866

  6. Pathway Analysis for Genome-Wide Association Study of Lung Cancer in Han Chinese Population

    PubMed Central

    Wu, Chen; Jin, Guangfu; Dai, Juncheng; Wang, Cheng; Hu, Lingmin; Gou, Jianwei; Qian, Chen; Bai, Jianling; Wu, Tangchun; Hu, Zhibin; Lin, Dongxin; Shen, Hongbing; Chen, Feng

    2013-01-01

    Genome-wide association studies (GWAS) have identified a number of genetic variants associated with lung cancer risk. However, these loci explain only a small fraction of lung cancer heritability, and other variants with weak effects may be lost in the GWAS approach due to the stringent significance level after multiple comparison correction. In this study, in order to identify important pathways involved in lung carcinogenesis, we performed a two-stage pathway analysis in GWAS of lung cancer in Han Chinese using the gene set enrichment analysis (GSEA) method. Pathways predefined by the BioCarta and KEGG databases were systematically evaluated in the Nanjing study (Discovery stage: 1,473 cases and 1,962 controls) and the suggestive pathways were further validated in the Beijing study (Replication stage: 858 cases and 1,115 controls). We found that four pathways (achPathway, metPathway, At1rPathway and rac1Pathway) were consistently significant in both studies, and the P values for the combined dataset were 0.012, 0.010, 0.022 and 0.005, respectively. These results were stable after sensitivity analysis based on gene definition and gene overlaps between pathways. These findings may provide new insights into the etiology of lung cancer. PMID:23469231

  7. Genome-wide association analysis for feed efficiency in Angus cattle

    PubMed Central

    Rolf, M M; Taylor, J F; Schnabel, R D; McKay, S D; McClure, M C; Northcutt, S L; Kerley, M S; Weaber, R L

    2012-01-01

    Estimated breeding values for average daily feed intake (AFI; kg/day), residual feed intake (RFI; kg/day) and average daily gain (ADG; kg/day) were generated using a mixed linear model incorporating genomic relationships for 698 Angus steers genotyped with the Illumina BovineSNP50 assay. Association analyses of estimated breeding values (EBVs) were performed for 41 028 single nucleotide polymorphisms (SNPs), and permutation analysis was used to empirically establish the genome-wide significance threshold (P < 0.05) for each trait. SNPs significantly associated with each trait were used in a forward selection algorithm to identify genomic regions putatively harbouring genes with effects on each trait. A total of 53, 66 and 68 SNPs explained 54.12% (24.10%), 62.69% (29.85%) and 55.13% (26.54%) of the additive genetic variation (when accounting for the genomic relationships) in steer breeding values for AFI, RFI and ADG, respectively, within this population. Evaluation by pathway analysis revealed that many of these SNPs are in genomic regions that harbour genes with metabolic functions. The presence of genetic correlations between traits resulted in 13.2% of SNPs selected for AFI and 4.5% of SNPs selected for RFI also being selected for ADG in the analysis of breeding values. While our study identifies panels of SNPs significant for efficiency traits in our population, validation of all SNPs in independent populations will be necessary before commercialization. PMID:22497295
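    The permutation step mentioned in the abstract above (an empirical genome-wide threshold at P < 0.05) can be illustrated with a minimal sketch. This is not the authors' procedure: it assumes a plain Pearson correlation as a stand-in for their association test, uses randomly generated toy genotypes and phenotypes, and the names `pearson_r` and `permutation_threshold` are hypothetical.

```python
import random


def pearson_r(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def permutation_threshold(genotypes, phenotype, n_perm=200, alpha=0.05, seed=0):
    """Empirical genome-wide threshold on |r|.

    For each permutation the phenotype is shuffled, breaking any true
    genotype-phenotype link, and the largest |r| over all SNPs is recorded.
    The threshold is (approximately) the 1 - alpha quantile of those maxima.
    """
    rng = random.Random(seed)
    shuffled = list(phenotype)
    maxima = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        maxima.append(max(abs(pearson_r(snp, shuffled)) for snp in genotypes))
    maxima.sort()
    index = min(n_perm - 1, int((1 - alpha) * n_perm))
    return maxima[index]


# Toy data (assumption): 25 SNPs coded 0/1/2 for 40 animals, random phenotype.
toy_rng = random.Random(1)
phenotype = [toy_rng.gauss(0, 1) for _ in range(40)]
genotypes = [[toy_rng.choice((0, 1, 2)) for _ in range(40)] for _ in range(25)]
threshold = permutation_threshold(genotypes, phenotype)
```

    A SNP whose observed |r| exceeds `threshold` would be called genome-wide significant under this sketch. Taking the maximum over all SNPs within each permutation is what lets the threshold account for the number of tests.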

  8. Pathway-based analysis of primary biliary cirrhosis genome-wide association studies

    PubMed Central

    Kar, SP; Seldin, MF; Chen, W; Lu, E; Hirschfield, GM; Invernizzi, P; Heathcote, J; Cusi, D; Gershwin, ME; Siminovitch, KA; Amos, CI

    2013-01-01

    Genome-wide association studies (GWAS) have successfully identified several loci associated with primary biliary cirrhosis (PBC) risk. Pathway analysis complements conventional GWAS analysis. We applied the recently developed linear combination test for pathways to datasets drawn from independent PBC GWAS in Italian and Canadian subjects. Of the Kyoto Encyclopedia of Genes and Genomes and BioCarta pathways tested, 25 pathways in the Italian dataset (449 cases, 940 controls) and 26 pathways in the Canadian dataset (530 cases, 398 controls) were associated with PBC susceptibility (P < 0.05). After correcting for multiple comparisons, only the eight most significant pathways in the Italian dataset had FDR < 0.25 with tumor necrosis factor/stress-related signaling emerging as the top pathway (P = 7.38 × 10⁻⁴, FDR = 0.18). Two pathways, phosphatidylinositol signaling and hedgehog signaling, were replicated in both datasets (P < 0.05), and subjected to two additional complementary pathway tests. Both pathway signals remained significant in the Italian dataset on modified gene set enrichment analysis (P < 0.05). In both GWAS, variants nominally associated with PBC were significantly overrepresented in the phosphatidylinositol pathway (Fisher exact P < 0.05). These results point to established and novel pathway-level associations with inherited predisposition to PBC that, on further independent replication and functional validation, may provide fresh insights into PBC etiology. PMID:23392275
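    The overrepresentation test named in the abstract above (Fisher exact P) can be sketched as a one-sided hypergeometric tail. This is an illustration only, not the authors' pipeline: the function name `fisher_overrepresentation_p` and all counts in the example are hypothetical.

```python
from math import comb


def fisher_overrepresentation_p(hits_in_pathway, pathway_size, total_hits, total_variants):
    """One-sided Fisher exact P for overrepresentation.

    Probability, under random assignment, that at least `hits_in_pathway`
    of the `total_hits` nominally associated variants fall among the
    `pathway_size` pathway variants, out of `total_variants` tested.
    """
    max_k = min(pathway_size, total_hits)
    denominator = comb(total_variants, total_hits)
    tail = sum(
        comb(pathway_size, k) * comb(total_variants - pathway_size, total_hits - k)
        for k in range(hits_in_pathway, max_k + 1)
    )
    return tail / denominator


# Hypothetical counts: 1000 variants tested, 50 in the pathway,
# 100 nominally associated overall, 12 of those inside the pathway.
p_value = fisher_overrepresentation_p(12, 50, 100, 1000)
```

    With these made-up counts about 5 pathway hits would be expected by chance, so 12 observed hits gives a small one-sided P.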

  9. A genome-wide transcription analysis of a fungal riboflavin overproducer.

    PubMed

    Karos, Marvin; Vilariño, Cristina; Bollschweiler, Claus; Revuelta, Jose Luis

    2004-09-30

    The production of many fine chemicals such as vitamins and amino acids is carried out in bioreactors using microorganisms. Usually, these strains are developed from wild-type organisms by classical mutation and selection. After several generations of strain improvement, no further enhancement can be achieved. Therefore, metabolic engineering (ME) is a rational approach to optimise such producer organisms beyond this point, or for starting all over from the beginning. Metabolic engineering involves detailed analysis of the organism's metabolic and genetic properties, leading to the identification of new target genes. The fungal riboflavin overproducer Ashbya gossypii converts vegetable oil to vitamin B2 in a "one-step reaction". The productivity and selectivity of this microorganism have been optimised significantly over the years, first following a classical approach and now a rational one. The improvement is based on our understanding of vitamin B2 metabolism. We have been able to selectively enhance the pathways that are necessary for the formation of riboflavin and to inhibit those leading to unwanted side products. New targets for further improvements of this process have been found using a genome-wide transcript expression analysis, namely massively parallel signature sequencing (MPSS). With this analysis even completely unknown genes can be used for strain improvement.

  10. Genome-wide analysis of the MADS-box gene family in Brachypodium distachyon.

    PubMed

    Wei, Bo; Zhang, Rong-Zhi; Guo, Juan-Juan; Liu, Dan-Mei; Li, Ai-Li; Fan, Ren-Chun; Mao, Long; Zhang, Xiang-Qi

    2014-01-01

    MADS-box genes are important transcription factors for plant development, especially floral organogenesis. Brachypodium distachyon is a model for biofuel plants and temperate grasses such as wheat and barley, but a comprehensive analysis of MADS-box family proteins in Brachypodium is still missing. We report here a genome-wide analysis of the MADS-box gene family in Brachypodium distachyon. We identified 57 MADS-box genes and classified them into 32 MIKC(c)-type, 7 MIKC*-type, 9 Mα, 7 Mβ and 2 Mγ MADS-box genes according to their phylogenetic relationships to the Arabidopsis and rice MADS-box genes. Detailed gene structure and motif distribution were then studied. Investigation of their chromosomal localizations revealed that Brachypodium MADS-box genes are distributed evenly across five chromosomes. In addition, five pairs of type II MADS-box genes were found on synteny blocks derived from whole genome duplication blocks. We then performed a systematic expression analysis of Brachypodium MADS-box genes in various tissues, particularly floral organs. Further detection under salt, drought, and low-temperature conditions showed that some MADS-box genes may also be involved in abiotic stress responses, including type I genes. Comparative studies of MADS-box genes among Brachypodium, rice and Arabidopsis showed that Brachypodium had fewer gene duplication events. Taken together, this work provides useful data for further functional studies of MADS-box genes in Brachypodium distachyon.

  11. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci.

    PubMed

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-06-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10⁻¹⁵) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  12. Genome-wide analysis of over 106 000 individuals identifies 9 neuroticism-associated loci

    PubMed Central

    Smith, D J; Escott-Price, V; Davies, G; Bailey, M E S; Colodro-Conde, L; Ward, J; Vedernikov, A; Marioni, R; Cullen, B; Lyall, D; Hagenaars, S P; Liewald, D C M; Luciano, M; Gale, C R; Ritchie, S J; Hayward, C; Nicholl, B; Bulik-Sullivan, B; Adams, M; Couvy-Duchesne, B; Graham, N; Mackay, D; Evans, J; Smith, B H; Porteous, D J; Medland, S E; Martin, N G; Holmans, P; McIntosh, A M; Pell, J P; Deary, I J; O'Donovan, M C

    2016-01-01

    Neuroticism is a personality trait of fundamental importance for psychological well-being and public health. It is strongly associated with major depressive disorder (MDD) and several other psychiatric conditions. Although neuroticism is heritable, attempts to identify the alleles involved in previous studies have been limited by relatively small sample sizes. Here we report a combined meta-analysis of genome-wide association study (GWAS) of neuroticism that includes 91 370 participants from the UK Biobank cohort, 6659 participants from the Generation Scotland: Scottish Family Health Study (GS:SFHS) and 8687 participants from a QIMR (Queensland Institute of Medical Research) Berghofer Medical Research Institute (QIMR) cohort. All participants were assessed using the same neuroticism instrument, the Eysenck Personality Questionnaire-Revised (EPQ-R-S) Short Form's Neuroticism scale. We found a single-nucleotide polymorphism-based heritability estimate for neuroticism of ∼15% (s.e.=0.7%). Meta-analysis identified nine novel loci associated with neuroticism. The strongest evidence for association was at a locus on chromosome 8 (P=1.5 × 10⁻¹⁵) spanning 4 Mb and containing at least 36 genes. Other associated loci included interesting candidate genes on chromosome 1 (GRIK3 (glutamate receptor ionotropic kainate 3)), chromosome 4 (KLHL2 (Kelch-like protein 2)), chromosome 17 (CRHR1 (corticotropin-releasing hormone receptor 1) and MAPT (microtubule-associated protein Tau)) and on chromosome 18 (CELF4 (CUGBP elav-like family member 4)). We found no evidence for genetic differences in the common allelic architecture of neuroticism by sex. By comparing our findings with those of the Psychiatric Genetics Consortia, we identified a strong genetic correlation between neuroticism and MDD and a less strong but significant genetic correlation with schizophrenia, although not with bipolar disorder. Polygenic risk scores derived from the primary UK Biobank sample captured

  13. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high-throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amounts of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  14. An across-breed genome wide association analysis of susceptibility to paratuberculosis in dairy cattle.

    PubMed

    Sallam, Ahmed M; Zare, Yalda; Alpay, Fazli; Shook, George E; Collins, Michael T; Alsheikh, Samir; Sharaby, Mahmoud; Kirkpatrick, Brian W

    2017-02-01

    Paratuberculosis is a chronic disease of ruminants caused by Mycobacterium avium subspecies paratuberculosis (MAP). It occurs worldwide and causes a significant loss in the animal production industry. There is no cure for MAP infection and vaccination is problematic. Identification of genetics of susceptibility could be a useful adjunct for programs that focus on management, testing and culling of diseased animals. A case-control, genome-wide association study (GWAS) was conducted using Holstein and Jersey cattle in a combined analysis in order to identify markers and chromosomal regions associated with susceptibility to MAP infection across breeds. A mixed-model method (GRAMMAR-GC) implemented in the GenABEL R package and a Bayes C analysis implemented in GenSel software were used as alternative approaches to conduct GWAS analysis focused on single SNPs and chromosomal segments, respectively. After conducting quality control, 22 406 SNPs from 2157 individuals were available for the GRAMMAR-GC analysis and 45 640 SNPs from 2199 individuals were available for the Bayes C analysis. One SNP located on BTA27 (8·6 Mb) was identified as moderately associated (P < 5 × 10⁻⁵, FDR = 0·44) in the GRAMMAR-GC analysis of the combined breed data. Nine 1 Mb windows located on BTA 2, 3 (3 windows), 6, 8, 25, 27 and 29 each explained ≥1% of the total proportion of genetic variance in the Bayes C analysis. In an analysis ignoring differences in linkage phase, two moderately significantly associated SNPs were identified: ARS-BFGL-NGS-19381 on BTA23 (32 Mb) and Hapmap40994-BTA-46361 on BTA19 (61 Mb). New common genomic regions and candidate genes have been identified from the across-breed analysis that might be involved in the immune response and susceptibility to MAP infection.

  15. Genome-wide analysis and expression profiling of the Solanum tuberosum aquaporins.

    PubMed

    Venkatesh, Jelli; Yu, Jae-Woong; Park, Se Won

    2013-12-01

    Aquaporins belong to the major intrinsic proteins involved in the transcellular membrane transport of water and other small solutes. A comprehensive genome-wide search for the homologues of Solanum tuberosum major intrinsic protein (MIP) revealed 41 full-length potato aquaporin genes. All potato aquaporins are grouped into five subfamilies: plasma membrane intrinsic proteins (PIPs), tonoplast intrinsic proteins (TIPs), NOD26-like intrinsic proteins (NIPs), small basic intrinsic proteins (SIPs) and x-intrinsic proteins (XIPs). Functional predictions based on the aromatic/arginine (ar/R) selectivity filters and Froger's positions showed a remarkable difference in substrate transport specificity among subfamilies. The expression pattern of potato aquaporins, examined by qPCR analysis, showed distinct expression profiles in various organs and tuber developmental stages. Furthermore, qPCR analysis of potato plantlets subjected to various abiotic stresses revealed the marked effect of stresses on expression levels of aquaporins. Taken together, the expression profiles of aquaporins imply that aquaporins play important roles in plant growth and development, in addition to maintaining water homeostasis in response to environmental stresses.

  16. Genome-Wide Analysis of the Lysine Biosynthesis Pathway Network during Maize Seed Development

    PubMed Central

    Liu, Yuwei; Xie, Shaojun; Yu, Jingjuan

    2016-01-01

    Lysine is one of the most limiting essential amino acids for humans and livestock. The nutritional value of maize (Zea mays L.) is reduced by its poor lysine content. To better understand the lysine biosynthesis pathway in maize seed, we conducted a genome-wide analysis of the genes involved in lysine biosynthesis. We identified lysine biosynthesis pathway genes (LBPGs) and investigated whether a diaminopimelate pathway variant exists in maize. We analyzed two genes encoding the key enzyme dihydrodipicolinate synthase, and determined that they contribute differently to lysine synthesis during maize seed development. A coexpression network of LBPGs was constructed using RNA-sequencing data from 21 developmental stages of B73 maize seed. We found a large set of genes encoding ribosomal proteins, elongation factors and zein proteins that were coexpressed with LBPGs. The coexpressed genes were enriched in cellular metabolism terms and protein related terms. A phylogenetic analysis of the LBPGs from different plant species revealed different relationships. Additionally, six transcription factor (TF) families containing 13 TFs were identified as the Hub TFs of the LBPGs modules. Several expression quantitative trait loci of LBPGs were also identified. Our results should help to elucidate the lysine biosynthesis pathway network in maize seed. PMID:26829553

  17. A genome-wide association meta-analysis identifies new childhood obesity loci.

    PubMed

    Bradfield, Jonathan P; Taal, H Rob; Timpson, Nicholas J; Scherag, André; Lecoeur, Cecile; Warrington, Nicole M; Hypponen, Elina; Holst, Claus; Valcarcel, Beatriz; Thiering, Elisabeth; Salem, Rany M; Schumacher, Fredrick R; Cousminer, Diana L; Sleiman, Patrick M A; Zhao, Jianhua; Berkowitz, Robert I; Vimaleswaran, Karani S; Jarick, Ivonne; Pennell, Craig E; Evans, David M; St Pourcain, Beate; Berry, Diane J; Mook-Kanamori, Dennis O; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, André G; van Duijn, Cornelia M; van der Valk, Ralf J P; de Jongste, Johan C; Postma, Dirkje S; Boomsma, Dorret I; Gauderman, W James; Hassanein, Mohamed T; Lindgren, Cecilia M; Mägi, Reedik; Boreham, Colin A G; Neville, Charlotte E; Moreno, Luis A; Elliott, Paul; Pouta, Anneli; Hartikainen, Anna-Liisa; Li, Mingyao; Raitakari, Olli; Lehtimäki, Terho; Eriksson, Johan G; Palotie, Aarno; Dallongeville, Jean; Das, Shikta; Deloukas, Panos; McMahon, George; Ring, Susan M; Kemp, John P; Buxton, Jessica L; Blakemore, Alexandra I F; Bustamante, Mariona; Guxens, Mònica; Hirschhorn, Joel N; Gillman, Matthew W; Kreiner-Møller, Eskil; Bisgaard, Hans; Gilliland, Frank D; Heinrich, Joachim; Wheeler, Eleanor; Barroso, Inês; O'Rahilly, Stephen; Meirhaeghe, Aline; Sørensen, Thorkild I A; Power, Chris; Palmer, Lyle J; Hinney, Anke; Widen, Elisabeth; Farooqi, I Sadaf; McCarthy, Mark I; Froguel, Philippe; Meyre, David; Hebebrand, Johannes; Jarvelin, Marjo-Riitta; Jaddoe, Vincent W V; Smith, George Davey; Hakonarson, Hakon; Grant, Struan F A

    2012-05-01

    Multiple genetic variants have been associated with adult obesity and a few with severe obesity in childhood; however, less progress has been made in establishing genetic influences on common early-onset obesity. We performed a North American, Australian and European collaborative meta-analysis of 14 studies consisting of 5,530 cases (≥95th percentile of body mass index (BMI)) and 8,318 controls (<50th percentile of BMI) of European ancestry. Taking forward the eight newly discovered signals yielding association with P < 5 × 10⁻⁶ in nine independent data sets (2,818 cases and 4,083 controls), we observed two loci that yielded genome-wide significant combined P values near OLFM4 at 13q14 (rs9568856; P = 1.82 × 10⁻⁹; odds ratio (OR) = 1.22) and within HOXB5 at 17q21 (rs9299; P = 3.54 × 10⁻⁹; OR = 1.14). Both loci continued to show association when two extreme childhood obesity cohorts were included (2,214 cases and 2,674 controls). These two loci also yielded directionally consistent associations in a previous meta-analysis of adult BMI(1).

  18. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii

    PubMed Central

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C.; Zhang, Baohong

    2014-01-01

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors have versatile functions in multiple aspects of plant growth and development. However, no systematic study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP-encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of the TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence. PMID:25322260

  19. Genome-wide Analysis of the Homeobox C6 Transcriptional Network in Prostate Cancer

    PubMed Central

    McCabe, Colleen D.; Spyropoulos, Demetri D.; Martin, David; Moreno, Carlos S.

    2008-01-01

    Homeobox (HOX) transcription factors are developmentally regulated genes that play crucial roles in tissue patterning. Homeobox C6 (HOXC6) is overexpressed in prostate cancers and correlated with cancer progression, but the downstream targets of HOXC6 are largely unknown. We have performed genome-wide localization analysis to identify promoters bound by HOXC6 in prostate cancer cells. This analysis identified 468 reproducibly bound promoters whose associated genes are involved in functions such as cell proliferation and apoptosis. We have complemented these data with expression profiling of prostates from mice with homozygous disruption of the Hoxc6 gene to identify 31 direct regulatory target genes of HOXC6. We show that HOXC6 directly regulates expression of BMP7, FGFR2, IGFBP3, and PDGFRA in prostate cells and indirectly influences the Notch and Wnt signaling pathways in vivo. We further show that inhibition of PDGFRA reduces proliferation of prostate cancer cells, and that overexpression of HOXC6 can overcome the effects of PDGFRA inhibition. HOXC6 regulates genes with both oncogenic and tumor suppressor activities as well as several genes such as CD44 that are important for prostate branching morphogenesis and metastasis to the bone microenvironment. PMID:18339881

  20. Genome-wide identification and expression analysis of TCP transcription factors in Gossypium raimondii.

    PubMed

    Ma, Jun; Wang, Qinglian; Sun, Runrun; Xie, Fuliang; Jones, Don C; Zhang, Baohong

    2014-10-16

    Plant-specific TEOSINTE-BRANCHED1/CYCLOIDEA/PCF (TCP) transcription factors have versatile functions in multiple aspects of plant growth and development. However, no systematic study has been performed in cotton. In this study, we performed for the first time the genome-wide identification and expression analysis of the TCP transcription factor family in Gossypium raimondii. A total of 38 non-redundant cotton TCP-encoding genes were identified. The TCP transcription factors were divided into eleven subgroups based on phylogenetic analysis. Most TCP genes within the same subfamily demonstrated similar exon and intron organization and the motif structures were highly conserved among the subfamilies. Additionally, the chromosomal distribution pattern revealed that TCP genes were unevenly distributed across 11 out of the 13 chromosomes; segmental duplication is a predominant duplication event for TCP genes and the major contributor to the expansion of the TCP gene family in G. raimondii. Moreover, the expression profiles of TCP genes shed light on their functional divergence.

  1. Genome-wide analysis uncovers novel recurrent alterations in primary central nervous system lymphomas

    PubMed Central

    Braggio, Esteban; Van Wier, Scott; Ojha, Juhi; McPhail, Ellen; Asmann, Yan W.; Egan, Jan; da Silva, Jackline Ayres; Schiff, David; Lopes, M Beatriz; Decker, Paul A; Valdez, Riccardo; Tibes, Raoul; Eckloff, Bruce; Witzig, Thomas E.; Stewart, A Keith; Fonseca, Rafael; O’Neill, Brian Patrick

    2015-01-01

    Purpose: Primary central nervous system lymphoma (PCNSL) is an aggressive non-Hodgkin lymphoma confined to the CNS. Whether there is a PCNSL-specific genomic signature and, if so, how it differs from systemic diffuse large B-cell lymphoma (DLBCL) is uncertain. Experimental design: We performed a comprehensive genomic study of tumor samples from 19 immunocompetent PCNSL patients. Testing comprised array-comparative genomic hybridization and whole exome sequencing. Results: Biallelic inactivation of TOX and PRKCD was recurrently found in PCNSL but not in systemic DLBCL, suggesting a specific role in PCNSL pathogenesis. Additionally, we found a high prevalence of MYD88 mutations (79%) and CDKN2A biallelic loss (60%). Several genes recurrently affected in PCNSL were common with systemic DLBCL, including loss of TNFAIP3, PRDM1, GNA13, TMEM30A, TBL1XR1, B2M, CD58, activating mutations of CD79B, CARD11 and translocations IgH-BCL6. Overall, BCR/TLR/NF-κB pathways were altered in >90% of PCNSL, highlighting their value for targeted therapeutic approaches. Furthermore, integrated analysis showed enrichment of pathways associated with immune response, proliferation, apoptosis, and lymphocyte differentiation. Conclusions: In summary, genome-wide analysis uncovered novel recurrent alterations, including TOX and PRKCD, helping to differentiate PCNSL from systemic DLBCL and related lymphomas. PMID:25991819

  2. A genome-wide association meta-analysis identifies new childhood obesity loci

    PubMed Central

    Bradfield, Jonathan P.; Taal, H. Rob; Timpson, Nicholas J.; Scherag, André; Lecoeur, Cecile; Warrington, Nicole M.; Hypponen, Elina; Holst, Claus; Valcarcel, Beatriz; Thiering, Elisabeth; Salem, Rany M.; Schumacher, Fredrick R.; Cousminer, Diana L.; Sleiman, Patrick M.A.; Zhao, Jianhua; Berkowitz, Robert I.; Vimaleswaran, Karani S.; Jarick, Ivonne; Pennell, Craig E.; Evans, David M.; St. Pourcain, Beate; Berry, Diane J.; Mook-Kanamori, Dennis O; Hofman, Albert; Rivadeinera, Fernando; Uitterlinden, André G.; van Duijn, Cornelia M.; van der Valk, Ralf J.P.; de Jongste, Johan C.; Postma, Dirkje S.; Boomsma, Dorret I.; Gauderman, William J.; Hassanein, Mohamed T.; Lindgren, Cecilia M.; Mägi, Reedik; Boreham, Colin A.G.; Neville, Charlotte E.; Moreno, Luis A.; Elliott, Paul; Pouta, Anneli; Hartikainen, Anna-Liisa; Li, Mingyao; Raitakari, Olli; Lehtimäki, Terho; Eriksson, Johan G.; Palotie, Aarno; Dallongeville, Jean; Das, Shikta; Deloukas, Panos; McMahon, George; Ring, Susan M.; Kemp, John P.; Buxton, Jessica L.; Blakemore, Alexandra I.F.; Bustamante, Mariona; Guxens, Mònica; Hirschhorn, Joel N.; Gillman, Matthew W.; Kreiner-Møller, Eskil; Bisgaard, Hans; Gilliland, Frank D.; Heinrich, Joachim; Wheeler, Eleanor; Barroso, Inês; O'Rahilly, Stephen; Meirhaeghe, Aline; Sørensen, Thorkild I.A.; Power, Chris; Palmer, Lyle J.; Hinney, Anke; Widen, Elisabeth; Farooqi, I. Sadaf; McCarthy, Mark I.; Froguel, Philippe; Meyre, David; Hebebrand, Johannes; Jarvelin, Marjo-Riitta; Jaddoe, Vincent W.V.; Smith, George Davey; Hakonarson, Hakon; Grant, Struan F.A.

    2012-01-01

    Multiple genetic variants have been associated with adult obesity and a few with severe obesity in childhood; however, less progress has been made to establish genetic influences on common early-onset obesity. We performed a North American-Australian-European collaborative meta-analysis of fourteen studies consisting of 5,530 cases (≥95th percentile of body mass index (BMI)) and 8,318 controls (<50th percentile of BMI) of European ancestry. Taking forward the eight novel signals yielding association with P < 5×10(-6) into nine independent datasets (n = 2,818 cases and 4,083 controls), we observed two loci that yielded a genome-wide significant combined P-value, namely near OLFM4 on 13q14 (rs9568856; P=1.82×10(-9); OR=1.22) and within HOXB5 on 17q21 (rs9299; P=3.54×10(-9); OR=1.14). Both loci continued to show association when including two extreme childhood obesity cohorts (n = 2,214 cases and 2,674 controls). Finally, these two loci yielded directionally consistent associations in the GIANT meta-analysis of adult BMI (ref. 1). PMID:22484627

  3. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia

    PubMed Central

    Berndt, Sonja I.; Camp, Nicola J.; Skibola, Christine F.; Vijai, Joseph; Wang, Zhaoming; Gu, Jian; Nieters, Alexandra; Kelly, Rachel S.; Smedby, Karin E.; Monnereau, Alain; Cozen, Wendy; Cox, Angela; Wang, Sophia S.; Lan, Qing; Teras, Lauren R.; Machado, Moara; Yeager, Meredith; Brooks-Wilson, Angela R.; Hartge, Patricia; Purdue, Mark P.; Birmann, Brenda M.; Vajdic, Claire M.; Cocco, Pierluigi; Zhang, Yawei; Giles, Graham G.; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Montalvan, Rebecca; Burdett, Laurie; Hutchinson, Amy; Ye, Yuanqing; Call, Timothy G.; Shanafelt, Tait D.; Novak, Anne J.; Kay, Neil E.; Liebow, Mark; Cunningham, Julie M.; Allmer, Cristine; Hjalgrim, Henrik; Adami, Hans-Olov; Melbye, Mads; Glimelius, Bengt; Chang, Ellen T.; Glenn, Martha; Curtin, Karen; Cannon-Albright, Lisa A.; Diver, W Ryan; Link, Brian K.; Weiner, George J.; Conde, Lucia; Bracci, Paige M.; Riby, Jacques; Arnett, Donna K.; Zhi, Degui; Leach, Justin M.; Holly, Elizabeth A.; Jackson, Rebecca D.; Tinker, Lesley F.; Benavente, Yolanda; Sala, Núria; Casabonne, Delphine; Becker, Nikolaus; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; McKay, James; Staines, Anthony; Chaffee, Kari G.; Achenbach, Sara J.; Vachon, Celine M.; Goldin, Lynn R.; Strom, Sara S.; Leis, Jose F.; Weinberg, J. Brice; Caporaso, Neil E.; Norman, Aaron D.; De Roos, Anneclaire J.; Morton, Lindsay M.; Severson, Richard K.; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Masala, Giovanna; Weiderpass, Elisabete; Chirlaque, María-Dolores; Vermeulen, Roel C. H.; Travis, Ruth C.; Southey, Melissa C.; Milne, Roger L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Clavel, Jacqueline; Zheng, Tongzhang; Holford, Theodore R.; Villano, Danylo J.; Maria, Ann; Spinelli, John J.; Gascoyne, Randy D.; Connors, Joseph M.; Bertrand, Kimberly A.; Giovannucci, Edward; Kraft, Peter; Kricker, Anne; Turner, Jenny; Ennas, Maria Grazia; Ferri, Giovanni M.; Miligi, Lucia; Liang, Liming; Ma, Baoshan; Huang, Jinyan; Crouch, Simon; Park, Ju-Hyun; Chatterjee, Nilanjan; North, Kari E.; Snowden, John A.; Wright, Josh; Fraumeni, Joseph F.; Offit, Kenneth; Wu, Xifeng; de Sanjose, Silvia; Cerhan, James R.; Chanock, Stephen J.; Rothman, Nathaniel; Slager, Susan L.

    2016-01-01

    Chronic lymphocytic leukemia (CLL) is a common lymphoid malignancy with strong heritability. To further understand the genetic susceptibility for CLL and identify common loci associated with risk, we conducted a meta-analysis of four genome-wide association studies (GWAS) composed of 3,100 cases and 7,667 controls with follow-up replication in 1,958 cases and 5,530 controls. Here we report three new loci at 3p24.1 (rs9880772, EOMES, P=2.55 × 10(-11)), 6p25.2 (rs73718779, SERPINB6, P=1.97 × 10(-8)) and 3q28 (rs9815073, LPP, P=3.62 × 10(-8)), as well as a new independent SNP at the known 2q13 locus (rs9308731, BCL2L11, P=1.00 × 10(-11)) in the combined analysis. We find suggestive evidence (P<5 × 10(-7)) for two additional new loci at 4q24 (rs10028805, BANK1, P=7.19 × 10(-8)) and 3p22.2 (rs1274963, CSRNP1, P=2.12 × 10(-7)). Pathway analyses of new and known CLL loci consistently show a strong role for apoptosis, providing further evidence for the importance of this biological pathway in CLL susceptibility. PMID:26956414

  4. Meta-analysis of genome-wide association studies discovers multiple loci for chronic lymphocytic leukemia.

    PubMed

    Berndt, Sonja I; Camp, Nicola J; Skibola, Christine F; Vijai, Joseph; Wang, Zhaoming; Gu, Jian; Nieters, Alexandra; Kelly, Rachel S; Smedby, Karin E; Monnereau, Alain; Cozen, Wendy; Cox, Angela; Wang, Sophia S; Lan, Qing; Teras, Lauren R; Machado, Moara; Yeager, Meredith; Brooks-Wilson, Angela R; Hartge, Patricia; Purdue, Mark P; Birmann, Brenda M; Vajdic, Claire M; Cocco, Pierluigi; Zhang, Yawei; Giles, Graham G; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Montalvan, Rebecca; Burdett, Laurie; Hutchinson, Amy; Ye, Yuanqing; Call, Timothy G; Shanafelt, Tait D; Novak, Anne J; Kay, Neil E; Liebow, Mark; Cunningham, Julie M; Allmer, Cristine; Hjalgrim, Henrik; Adami, Hans-Olov; Melbye, Mads; Glimelius, Bengt; Chang, Ellen T; Glenn, Martha; Curtin, Karen; Cannon-Albright, Lisa A; Diver, W Ryan; Link, Brian K; Weiner, George J; Conde, Lucia; Bracci, Paige M; Riby, Jacques; Arnett, Donna K; Zhi, Degui; Leach, Justin M; Holly, Elizabeth A; Jackson, Rebecca D; Tinker, Lesley F; Benavente, Yolanda; Sala, Núria; Casabonne, Delphine; Becker, Nikolaus; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; McKay, James; Staines, Anthony; Chaffee, Kari G; Achenbach, Sara J; Vachon, Celine M; Goldin, Lynn R; Strom, Sara S; Leis, Jose F; Weinberg, J Brice; Caporaso, Neil E; Norman, Aaron D; De Roos, Anneclaire J; Morton, Lindsay M; Severson, Richard K; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Masala, Giovanna; Weiderpass, Elisabete; Chirlaque, María-Dolores; Vermeulen, Roel C H; Travis, Ruth C; Southey, Melissa C; Milne, Roger L; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie; Clavel, Jacqueline; Zheng, Tongzhang; Holford, Theodore R; Villano, Danylo J; Maria, Ann; Spinelli, John J; Gascoyne, Randy D; Connors, Joseph M; Bertrand, Kimberly A; Giovannucci, Edward; Kraft, Peter; Kricker, Anne; Turner, Jenny; Ennas, Maria Grazia; Ferri, Giovanni M; Miligi, Lucia; Liang, Liming; Ma, Baoshan; Huang, Jinyan; Crouch, Simon; Park, Ju-Hyun; Chatterjee, Nilanjan; North, Kari E; Snowden, John A; Wright, Josh; Fraumeni, Joseph F; Offit, Kenneth; Wu, Xifeng; de Sanjose, Silvia; Cerhan, James R; Chanock, Stephen J; Rothman, Nathaniel; Slager, Susan L

    2016-03-09

    Chronic lymphocytic leukemia (CLL) is a common lymphoid malignancy with strong heritability. To further understand the genetic susceptibility for CLL and identify common loci associated with risk, we conducted a meta-analysis of four genome-wide association studies (GWAS) composed of 3,100 cases and 7,667 controls with follow-up replication in 1,958 cases and 5,530 controls. Here we report three new loci at 3p24.1 (rs9880772, EOMES, P=2.55 × 10(-11)), 6p25.2 (rs73718779, SERPINB6, P=1.97 × 10(-8)) and 3q28 (rs9815073, LPP, P=3.62 × 10(-8)), as well as a new independent SNP at the known 2q13 locus (rs9308731, BCL2L11, P=1.00 × 10(-11)) in the combined analysis. We find suggestive evidence (P<5 × 10(-7)) for two additional new loci at 4q24 (rs10028805, BANK1, P=7.19 × 10(-8)) and 3p22.2 (rs1274963, CSRNP1, P=2.12 × 10(-7)). Pathway analyses of new and known CLL loci consistently show a strong role for apoptosis, providing further evidence for the importance of this biological pathway in CLL susceptibility.

  5. Pathway Analysis of Seven Common Diseases Assessed by Genome-Wide Association

    PubMed Central

    Torkamani, Ali; Topol, Eric J.; Schork, Nicholas J.

    2008-01-01

    Recent genome wide association studies (GWAS) have identified DNA sequence variations that exhibit unequivocal statistical associations with many common chronic diseases. However, the vast majority of these studies identified variations that explain only a very small fraction of disease burden in the population at large, suggesting that other factors, such as multiple rare or low-penetrance variations and interacting environmental factors, are major contributors to disease susceptibility. Identifying multiple low penetrance variations (or ‘polygenes’) contributing to disease susceptibility will be difficult. We present a pathway analysis approach to characterizing the likely polygenic basis of seven common diseases using the Wellcome Trust Case Control Consortium (WTCCC) GWAS results. We identify numerous pathways implicated in disease predisposition that would have not been revealed using standard single-locus GWAS statistical analysis criteria. Many of these pathways have long been assumed to contain polymorphic genes that lead to disease predisposition. Additionally, we analyze the genetic relationships between the seven diseases, and based upon similarities with respect to the associated genes and pathways affected in each, propose a new way of categorizing the diseases. PMID:18722519

  6. Genome-wide analysis of thyroid hormone receptors shared and specific functions in neural cells

    PubMed Central

    Chatonnet, Fabrice; Guyot, Romain; Benoît, Gérard; Flamant, Frederic

    2013-01-01

    TRα1 and TRβ1, the two main thyroid hormone receptors in mammals, are transcription factors that share similar properties. However, their respective functions are very different. This functional divergence might be explained in two ways: it can reflect different expression patterns or result from different intrinsic properties of the receptors. We tested this second hypothesis by comparing the repertoires of 3,3′,5-triiodo-L-thyronine (T3)-responsive genes of two neural cell lines, expressing either TRα1 or TRβ1. Using transcriptome analysis, we found that a substantial fraction of the T3 target genes display a marked preference for one of the two receptors. So when placed alone in identical situations, the two receptors have different repertoires of target genes. Chromatin occupancy analysis, performed at a genome-wide scale, revealed that TRα1 and TRβ1 cistromes were also different. However, receptor-selective regulation of T3 target genes did not result from receptor-selective chromatin occupancy of their promoter regions. We conclude that modification of TRα1 and TRβ1 intrinsic properties contributes in a large part to the divergent evolution of the receptors’ function, at least during neurodevelopment. PMID:23382204

  7. Genome-wide common and rare variant analysis provides novel insights into clozapine-associated neutropenia

    PubMed Central

    Legge, Sophie E; Hamshere, Marian L; Ripke, Stephan; Pardinas, Antonio F; Goldstein, Jacqueline I; Rees, Elliott; Richards, Alexander L; Leonenko, Ganna; Jorskog, L Fredrik; Chambert, Kimberly D; Collier, David A; Genovese, Giulio; Giegling, Ina; Holmans, Peter; Jonasdottir, Adalbjorg; Kirov, George; McCarroll, Steven A; MacCabe, James H; Mantripragada, Kiran; Moran, Jennifer L; Neale, Benjamin M; Stefansson, Hreinn; Rujescu, Dan; Daly, Mark J; Sullivan, Patrick F; Owen, Michael J; O’Donovan, Michael C; Walters, James T R

    2016-01-01

    The antipsychotic clozapine is uniquely effective in the management of schizophrenia, but its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown although genetic factors play an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed HLA alleles, exome array, and copy number variation analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (OR=4.32, P=1.79×10(-8)), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P = 0.015, positive predictive value = 35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect. PMID:27400856

  8. Integrated genome-wide analysis of genomic changes and gene regulation in human adrenocortical tissue samples.

    PubMed

    Gara, Sudheer Kumar; Wang, Yonghong; Patel, Dhaval; Liu-Chittenden, Yi; Jain, Meenu; Boufraqech, Myriem; Zhang, Lisa; Meltzer, Paul S; Kebebew, Electron

    2015-10-30

    To gain insight into the pathogenesis of adrenocortical carcinoma (ACC) and whether there is progression from normal-to-adenoma-to-carcinoma, we performed genome-wide gene expression, gene methylation, microRNA expression and comparative genomic hybridization (CGH) analysis in human adrenocortical tissue (normal, adrenocortical adenomas and ACC) samples. A pairwise comparison of normal, adrenocortical adenomas and ACC gene expression profiles with more than four-fold expression differences and an adjusted P-value < 0.05 revealed no major differences in normal versus adrenocortical adenoma, whereas there are 808 and 1085 dysregulated genes, respectively, between ACC versus adrenocortical adenoma and ACC versus normal. The majority of the dysregulated genes in ACC were downregulated. By integrating the CGH, gene methylation and expression profiles of potential miRNAs with the gene expression of dysregulated genes, we found that there are more alterations in ACC versus normal than in ACC versus adrenocortical adenoma. Importantly, we identified several novel molecular pathways that are associated with dysregulated genes and further experimentally validated that oncostatin M signaling induces caspase 3 dependent apoptosis and suppresses cell proliferation. Finally, we propose that there is a higher number of genomic changes from normal-to-adenoma-to-carcinoma and identified oncostatin M signaling as a plausible druggable pathway for therapeutics.

  9. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia.

    PubMed

    Law, Philip J; Berndt, Sonja I; Speedy, Helen E; Camp, Nicola J; Sava, Georgina P; Skibola, Christine F; Holroyd, Amy; Joseph, Vijai; Sunter, Nicola J; Nieters, Alexandra; Bea, Silvia; Monnereau, Alain; Martin-Garcia, David; Goldin, Lynn R; Clot, Guillem; Teras, Lauren R; Quintela, Inés; Birmann, Brenda M; Jayne, Sandrine; Cozen, Wendy; Majid, Aneela; Smedby, Karin E; Lan, Qing; Dearden, Claire; Brooks-Wilson, Angela R; Hall, Andrew G; Purdue, Mark P; Mainou-Fowler, Tryfonia; Vajdic, Claire M; Jackson, Graham H; Cocco, Pierluigi; Marr, Helen; Zhang, Yawei; Zheng, Tongzhang; Giles, Graham G; Lawrence, Charles; Call, Timothy G; Liebow, Mark; Melbye, Mads; Glimelius, Bengt; Mansouri, Larry; Glenn, Martha; Curtin, Karen; Diver, W Ryan; Link, Brian K; Conde, Lucia; Bracci, Paige M; Holly, Elizabeth A; Jackson, Rebecca D; Tinker, Lesley F; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Maynadie, Marc; McKay, James; Albanes, Demetrius; Weinstein, Stephanie; Wang, Zhaoming; Caporaso, Neil E; Morton, Lindsay M; Severson, Richard K; Riboli, Elio; Vineis, Paolo; Vermeulen, Roel C H; Southey, Melissa C; Milne, Roger L; Clavel, Jacqueline; Topka, Sabine; Spinelli, John J; Kraft, Peter; Ennas, Maria Grazia; Summerfield, Geoffrey; Ferri, Giovanni M; Harris, Robert J; Miligi, Lucia; Pettitt, Andrew R; North, Kari E; Allsup, David J; Fraumeni, Joseph F; Bailey, James R; Offit, Kenneth; Pratt, Guy; Hjalgrim, Henrik; Pepper, Chris; Chanock, Stephen J; Fegan, Chris; Rosenquist, Richard; de Sanjose, Silvia; Carracedo, Angel; Dyer, Martin J S; Catovsky, Daniel; Campo, Elias; Cerhan, James R; Allan, James M; Rothman, Nathanial; Houlston, Richard; Slager, Susan

    2017-02-06

    Several chronic lymphocytic leukaemia (CLL) susceptibility loci have been reported; however, much of the heritable risk remains unidentified. Here we perform a meta-analysis of six genome-wide association studies, imputed using a merged reference panel of 1,000 Genomes and UK10K data, totalling 6,200 cases and 17,598 controls after replication. We identify nine risk loci at 1p36.11 (rs34676223, P=5.04 × 10(-13)), 1q42.13 (rs41271473, P=1.06 × 10(-10)), 4q24 (rs71597109, P=1.37 × 10(-10)), 4q35.1 (rs57214277, P=3.69 × 10(-8)), 6p21.31 (rs3800461, P=1.97 × 10(-8)), 11q23.2 (rs61904987, P=2.64 × 10(-11)), 18q21.1 (rs1036935, P=3.27 × 10(-8)), 19p13.3 (rs7254272, P=4.67 × 10(-8)) and 22q13.33 (rs140522, P=2.70 × 10(-9)). These new and established risk loci map to areas of active chromatin and show an over-representation of transcription factor binding for the key determinants of B-cell development and immune response.

  10. Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

    PubMed

    Wang, Da-Wei; Li, Da; Wang, Junjun; Zhao, Yue; Wang, Zhaojun; Yue, Guidong; Liu, Xin; Qin, Huanju; Zhang, Kunpu; Dong, Lingli; Wang, Daowen

    2017-03-16

    Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.

  11. Analysis of binary responses with outcome-specific misclassification probability in genome-wide association studies

    PubMed Central

    Rekaya, Romdhane; Smith, Shannon; Hay, El Hamidi; Farhat, Nourhene; Aggrey, Samuel E

    2016-01-01

    Errors in the binary status of some response traits are frequent in human, animal, and plant applications. These error rates tend to differ between cases and controls because diagnostic and screening tests have different sensitivity and specificity. This increases the inaccuracies of classifying individuals into correct groups, giving rise to both false-positive and false-negative cases. The analysis of these noisy binary responses due to misclassification will undoubtedly reduce the statistical power of genome-wide association studies (GWAS). A threshold model that accommodates varying diagnostic errors between cases and controls was investigated. A simulation study was carried out where several binary data sets (case–control) were generated with varying effects for the most influential single nucleotide polymorphisms (SNPs) and different diagnostic error rates for cases and controls. Each simulated data set consisted of 2000 individuals. Ignoring misclassification resulted in biased estimates of true influential SNP effects and inflated estimates for true noninfluential markers. A substantial reduction in bias and increase in accuracy ranging from 12% to 32% was observed when the misclassification procedure was invoked. In fact, the majority of influential SNPs that were not identified using the noisy data were captured using the proposed method. Additionally, truly misclassified binary records were identified with high probability using the proposed method. The superiority of the proposed method was maintained across different simulation parameters (misclassification rates and odds ratios), attesting to its robustness. PMID:27942229

  12. Genome-wide Promoter Analysis of the SOX4 Transcriptional Network in Prostate Cancer Cells

    PubMed Central

    Scharer, Christopher D.; McCabe, Colleen D.; Ali-Seyed, Mohamed; Berger, Michael F.; Bulyk, Martha L.; Moreno, Carlos S.

    2008-01-01

    SOX4 is a critical developmental transcription factor in vertebrates and is required for precise differentiation and proliferation in multiple tissues. In addition, SOX4 is overexpressed in many human malignancies, but the exact role of SOX4 in cancer progression is not well understood. Here we have identified the direct transcriptional targets of SOX4 using a combination of genome-wide localization ChIP-chip analysis and transient overexpression followed by expression profiling in a prostate cancer model cell line. We have also used protein-binding microarrays to derive a novel SOX4-specific position-weight matrix and determined that SOX4 binding sites are enriched in SOX4-bound promoter regions. Direct transcriptional targets of SOX4 include several key cellular regulators such as EGFR, HSP70, Tenascin C, Frizzled-5, Patched-1, and Delta-like 1. We also show that SOX4 targets 23 transcription factors such as MLL, FOXA1, ZNF281, and NKX3-1. In addition, SOX4 directly regulates expression of three components of the RNA-induced silencing complex (RISC), namely Dicer, Argonaute 1, and RNA Helicase A. These data provide new insights into how SOX4 impacts developmental signaling pathways and how these changes may influence cancer progression via regulation of gene networks involved in microRNA processing, transcriptional regulation, the TGFβ, Wnt, Hedgehog, and Notch pathways, growth factor signaling, and tumor metastasis. PMID:19147588

  13. Genome-wide linkage analysis in families with infantile hypertrophic pyloric stenosis indicates novel susceptibility loci.

    PubMed

    Svenningsson, Anna; Söderhäll, Cilla; Persson, Sofia; Lundberg, Fredrik; Luthman, Holger; Chung, Eddie; Gardiner, Mark; Kockum, Ingrid; Nordenskjöld, Agneta

    2012-02-01

    Infantile hypertrophic pyloric stenosis (IHPS) is a common cause of upper gastrointestinal obstruction during infancy. A multifactorial background of the disease is well established. Multiple susceptibility loci including the neuronal nitric oxide synthase (NOS1) gene have previously been linked to IHPS, but contradictory results of linkage studies in different materials indicate genetic heterogeneity. To identify IHPS susceptibility loci, we conducted a genome-wide linkage analysis in 37 Swedish families. In regions where the Swedish material showed most evidence in favor of linkage, 31 additional British IHPS families were analyzed. Evidence in favor of significant linkage was observed in the Swedish material to two loci on chromosome 2q24 (non-parametric linkage (NPL) = 3.77) and 7p21 (NPL = 4.55). In addition, evidence of suggestive linkage was found to two loci on chromosome 6p21 (NPL = 2.97) and 12q24 (NPL = 2.63). Extending the material with British samples did not enhance the level of significance. Regions with linkage harbor interesting candidate genes, such as glucagon-like peptide-2 (GLP-2 encoded by the glucagon gene GCG), NOS1, motilin (MLN) and neuropeptide Y (NPY). The coding exons for GLP-2 and NPY were screened for mutations with negative results. In conclusion, we could confirm suggestive linkage to the region harboring the NOS1 gene and detected additional novel susceptibility loci for IHPS.

  14. The Genome-Wide Analysis of Carcinoembryonic Antigen Signaling by Colorectal Cancer Cells Using RNA Sequencing

    PubMed Central

    Gorbunova, Anna; Evsyukov, Igor; Rayko, Michael; Gapon, Svetlana; Bozhokina, Ekaterina; Shishkin, Alexander; O’Brien, Stephen J.

    2016-01-01

    Carcinoembryonic antigen (CEA, CEACAM5, CD66) is a promoter of metastasis in epithelial cancers that is widely used as a prognostic clinical marker of metastasis. The aim of this study is to identify the network of genes that are associated with CEA-induced colorectal cancer liver metastasis. We compared the genome-wide transcriptomic profiles of CEA positive (MIP101 clone 8) and CEA negative (MIP 101) colorectal cancer cell lines with different metastatic potential in vivo. The CEA-producing cells displayed quantitative changes in the level of expression for 100 genes (over-expressed or down-regulated). They were confirmed by quantitative RT-PCR. The KEGG pathway analysis identified 4 significantly enriched pathways: cytokine-cytokine receptor interaction, MAPK signaling pathway, TGF-beta signaling pathway and pyrimidine metabolism. Our results suggest that CEA production by colorectal cancer cells triggers colorectal cancer progression by inducing the epithelial-mesenchymal transition, increasing tumor cell invasiveness into the surrounding tissues and suppressing stress and apoptotic signaling. The novel gene expression distinctions establish the relationships between the existing cancer markers and implicate new potential biomarkers for colorectal cancer hepatic metastasis. PMID:27583792

  15. Genome-Wide Association Scan Meta-Analysis Identifies Three Loci Influencing Adiposity and Fat Distribution

    PubMed Central

    Qi, Lu; Speliotes, Elizabeth K.; Thorleifsson, Gudmar; Willer, Cristen J.; Herrera, Blanca M.; Jackson, Anne U.; Lim, Noha; Scheet, Paul; Soranzo, Nicole; Amin, Najaf; Aulchenko, Yurii S.; Chambers, John C.; Drong, Alexander; Luan, Jian'an; Lyon, Helen N.; Rivadeneira, Fernando; Sanna, Serena; Timpson, Nicholas J.; Zillikens, M. Carola; Zhao, Jing Hua; Almgren, Peter; Bandinelli, Stefania; Bennett, Amanda J.; Bergman, Richard N.; Bonnycastle, Lori L.; Bumpstead, Suzannah J.; Chanock, Stephen J.; Cherkas, Lynn; Chines, Peter; Coin, Lachlan; Cooper, Cyrus; Crawford, Gabriel; Doering, Angela; Dominiczak, Anna; Doney, Alex S. F.; Ebrahim, Shah; Elliott, Paul; Erdos, Michael R.; Estrada, Karol; Ferrucci, Luigi; Fischer, Guido; Forouhi, Nita G.; Gieger, Christian; Grallert, Harald; Groves, Christopher J.; Grundy, Scott; Guiducci, Candace; Hadley, David; Hamsten, Anders; Havulinna, Aki S.; Hofman, Albert; Holle, Rolf; Holloway, John W.; Illig, Thomas; Isomaa, Bo; Jacobs, Leonie C.; Jameson, Karen; Jousilahti, Pekka; Karpe, Fredrik; Kuusisto, Johanna; Laitinen, Jaana; Lathrop, G. Mark; Lawlor, Debbie A.; Mangino, Massimo; McArdle, Wendy L.; Meitinger, Thomas; Morken, Mario A.; Morris, Andrew P.; Munroe, Patricia; Narisu, Narisu; Nordström, Anna; Nordström, Peter; Oostra, Ben A.; Palmer, Colin N. A.; Payne, Felicity; Peden, John F.; Prokopenko, Inga; Renström, Frida; Ruokonen, Aimo; Salomaa, Veikko; Sandhu, Manjinder S.; Scott, Laura J.; Scuteri, Angelo; Silander, Kaisa; Song, Kijoung; Yuan, Xin; Stringham, Heather M.; Swift, Amy J.; Tuomi, Tiinamaija; Uda, Manuela; Vollenweider, Peter; Waeber, Gerard; Wallace, Chris; Walters, G. Bragi; Weedon, Michael N.; Witteman, Jacqueline C. M.; Zhang, Cuilin; Zhang, Weihua; Caulfield, Mark J.; Collins, Francis S.; Davey Smith, George; Day, Ian N. M.; Franks, Paul W.; Hattersley, Andrew T.; Hu, Frank B.; Jarvelin, Marjo-Riitta; Kong, Augustine; Kooner, Jaspal S.; Laakso, Markku; Lakatta, Edward; Mooser, Vincent; Morris, Andrew D.; Peltonen, Leena; Samani, Nilesh J.; Spector, Timothy D.; Strachan, David P.; Tanaka, Toshiko; Tuomilehto, Jaakko; Uitterlinden, André G.; van Duijn, Cornelia M.; Wareham, Nicholas J.; Watkins for the PROCARDIS consortia, Hugh; Waterworth, Dawn M.; Boehnke, Michael; Deloukas, Panos; Groop, Leif; Hunter, David J.; Thorsteinsdottir, Unnur; Schlessinger, David; Wichmann, H.-Erich; Frayling, Timothy M.; Abecasis, Gonçalo R.; Hirschhorn, Joel N.; Loos, Ruth J. F.; Stefansson, Kari; Mohlke, Karen L.; Barroso, Inês; McCarthy for the GIANT consortium, Mark I.

    2009-01-01

    To identify genetic loci influencing central obesity and fat distribution, we performed a meta-analysis of 16 genome-wide association studies (GWAS, N = 38,580) informative for adult waist circumference (WC) and waist–hip ratio (WHR). We selected 26 SNPs for follow-up, for which the evidence of association with measures of central adiposity (WC and/or WHR) was strong and disproportionate to that for overall adiposity or height. Follow-up studies in a maximum of 70,689 individuals identified two loci strongly associated with measures of central adiposity; these map near TFAP2B (WC, P = 1.9×10(-11)) and MSRA (WC, P = 8.9×10(-9)). A third locus, near LYPLAL1, was associated with WHR in women only (P = 2.6×10(-8)). The variants near TFAP2B appear to influence central adiposity through an effect on overall obesity/fat-mass, whereas LYPLAL1 displays a strong female-only association with fat distribution. By focusing on anthropometric measures of central obesity and fat distribution, we have identified three loci implicated in the regulation of human adiposity. PMID:19557161

  16. Genome-Wide Identification and Expression Analysis of WRKY Gene Family in Capsicum annuum L.

    PubMed Central

    Diao, Wei-Ping; Snyder, John C.; Wang, Shu-Bin; Liu, Jin-Bing; Pan, Bao-Gui; Guo, Guang-Jun; Wei, Ge

    2016-01-01

    The WRKY family of transcription factors is one of the most important families of plant transcriptional regulators with members regulating multiple biological processes, especially in regulating defense against biotic and abiotic stresses. However, little information is available about WRKYs in pepper (Capsicum annuum L.). The recent release of completely assembled genome sequences of pepper allowed us to perform a genome-wide investigation for pepper WRKY proteins. In the present study, a total of 71 WRKY genes were identified in the pepper genome. According to structural features of their encoded proteins, the pepper WRKY genes (CaWRKY) were classified into three main groups, with the second group further divided into five subgroups. Genome mapping analysis revealed that CaWRKY were enriched on four chromosomes, especially on chromosome 1, and 15.5% of the family members were tandemly duplicated genes. A phylogenetic tree was constructed based on WRKY domain sequences derived from pepper and Arabidopsis. The expression of 21 selected CaWRKY genes in response to seven different biotic and abiotic stresses (salt, heat shock, drought, Phytophthora capsici, SA, MeJA, and ABA) was evaluated by quantitative RT-PCR; some CaWRKYs were highly expressed and up-regulated by stress treatment. Our results will provide a platform for functional identification and molecular breeding studies of WRKY genes in pepper. PMID:26941768

  17. Pooled Genome-Wide Analysis to Identify Novel Risk Loci for Pediatric Allergic Asthma

    PubMed Central

    Ricci, Giampaolo; Astolfi, Annalisa; Remondini, Daniel; Cipriani, Francesca; Formica, Serena; Dondi, Arianna; Pession, Andrea

    2011-01-01

    Background: Genome-wide association studies of pooled DNA samples have been shown to be a valuable tool to identify candidate SNPs associated with a phenotype. No such study has so far been applied to childhood allergic asthma, even though the very high complexity of asthma genetics makes it an appropriate field in which to explore the potential of the pooled GWAS approach. Methodology/Principal Findings: We performed a pooled GWAS and individual genotyping in 269 children with allergic respiratory diseases, comparing allergic children with and without asthma. We used a modular approach to identify the most significant loci associated with asthma by combining silhouette statistics and a physical distance method with cluster-adapted thresholding. We found 97% concordance between pooled GWAS and individual genotyping, with 36 out of 37 top-scoring SNPs significant at the individual genotyping level. The most significant SNP is located inside the coding sequence of C5, an already identified asthma susceptibility gene, while the other loci regulate functions that are relevant to bronchial physiopathology, such as immune- or inflammation-mediated mechanisms and airway smooth muscle contraction. Integration with gene expression data showed that almost half of the putative susceptibility genes are differentially expressed in experimental asthma mouse models. Conclusion/Significance: Combined silhouette statistics and cluster-adapted physical distance threshold analysis of pooled GWAS data is an efficient method to identify candidate SNPs associated with asthma development in an allergic pediatric population. PMID:21359210

  18. Genome-wide analysis reveals adaptation to high altitudes in Tibetan sheep

    PubMed Central

    Wei, Caihong; Wang, Huihua; Liu, Gang; Zhao, Fuping; Kijas, James W.; Ma, Youji; Lu, Jian; Zhang, Li; Cao, Jiaxue; Wu, Mingming; Wang, Guangkai; Liu, Ruizao; Liu, Zhen; Zhang, Shuzhen; Liu, Chousheng; Du, Lixin

    2016-01-01

    Tibetan sheep have lived on the Tibetan Plateau for thousands of years; however, the process and consequences of adaptation to this extreme environment have not been elucidated for important livestock such as sheep. Here, seven sheep breeds, representing both highland and lowland breeds from different areas of China, were genotyped for a genome-wide collection of single-nucleotide polymorphisms (SNPs). The FST and XP-EHH approaches were used to identify regions harbouring local positive selection between these highland and lowland breeds, and 236 genes were identified. We detected selection events spanning genes involved in angiogenesis, energy production and erythropoiesis. In particular, several candidate genes were associated with high-altitude hypoxia, including EPAS1, CRYAA, LONP1, NF1, DPP4, SOD1, PPARG and SOCS2. EPAS1 plays a crucial role in hypoxia adaptation; therefore, we investigated the exon sequences of EPAS1 and identified 12 mutations. Analysis of the relationship between blood-related phenotypes and EPAS1 genotypes in additional highland sheep revealed that a homozygous mutation at a relatively conserved site in the EPAS1 3′ untranslated region was associated with increased mean corpuscular haemoglobin concentration and mean corpuscular volume. Taken together, our results provide evidence of the genetic diversity of highland sheep and indicate potential high-altitude hypoxia adaptation mechanisms, including the role of EPAS1 in adaptation. PMID:27230812
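
    As a rough illustration of the FST idea mentioned above, the sketch below computes Wright's fixation index for a single biallelic SNP from the allele frequencies of two equally weighted populations. The function name and the example frequencies are hypothetical, and the study may well have used a different estimator; this is only a minimal sketch of the general technique.

```python
def wright_fst(p_highland: float, p_lowland: float) -> float:
    """Wright's FST for one biallelic SNP and two equally weighted populations.

    p_highland, p_lowland: frequency of the same allele in each population
    (hypothetical inputs; not taken from the study).
    """
    # Mean allele frequency across the two populations.
    p_bar = (p_highland + p_lowland) / 2
    # Expected heterozygosity in the pooled (total) population.
    h_total = 2 * p_bar * (1 - p_bar)
    if h_total == 0:
        # The SNP is monomorphic overall, so there is no differentiation to measure.
        return 0.0
    # Mean expected heterozygosity within the subpopulations.
    h_within = (2 * p_highland * (1 - p_highland)
                + 2 * p_lowland * (1 - p_lowland)) / 2
    return (h_total - h_within) / h_total


# Identical frequencies give no differentiation; fixed differences give FST = 1.
fst_same = wright_fst(0.5, 0.5)
fst_fixed = wright_fst(1.0, 0.0)
fst_partial = wright_fst(0.9, 0.1)
```

    Values near 1 across a window of SNPs would flag a region as strongly differentiated between highland and lowland breeds, which is the kind of signal such scans look for.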

  19. Genome-wide analysis of homeobox genes from Mesobuthus martensii reveals Hox gene duplication in scorpions.

    PubMed

    Di, Zhiyong; Yu, Yao; Wu, Yingliang; Hao, Pei; He, Yawen; Zhao, Huabin; Li, Yixue; Zhao, Guoping; Li, Xuan; Li, Wenxin; Cao, Zhijian

    2015-06-01

    Homeobox genes belong to a large gene group, which encodes the famous DNA-binding homeodomain that plays a key role in development and cellular differentiation during embryogenesis in animals. Here, one hundred forty-nine homeobox genes were identified from the Asian scorpion, Mesobuthus martensii (Chelicerata: Arachnida: Scorpiones: Buthidae) based on our newly assembled genome sequence with approximately 248× coverage. The identified homeobox genes were categorized into eight classes including 82 families: 67 ANTP class genes, 33 PRD genes, 11 LIM genes, five POU genes, six SINE genes, 14 TALE genes, five CUT genes, two ZF genes and six unclassified genes. Transcriptome data confirmed that more than half of the genes were expressed in adults. The homeobox gene diversity of the eight classes is similar to the previously analyzed Mandibulata arthropods. Interestingly, it is hypothesized that the scorpion M. martensii may have two Hox clusters. The first complete genome-wide analysis of homeobox genes in Chelicerata not only reveals the repertoire of scorpion, arachnid and chelicerate homeobox genes, but also shows some insights into the evolution of arthropod homeobox genes.
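
    The per-class counts quoted above can be checked against the stated total with a few lines of arithmetic. The dictionary below simply transcribes the numbers from the abstract; the variable names are illustrative.

```python
# Gene counts per homeobox class, transcribed from the abstract above.
class_counts = {
    "ANTP": 67,
    "PRD": 33,
    "LIM": 11,
    "POU": 5,
    "SINE": 6,
    "TALE": 14,
    "CUT": 5,
    "ZF": 2,
    "unclassified": 6,
}

# Total number of homeobox genes implied by the per-class counts.
total_genes = sum(class_counts.values())

# Named classes, excluding the "unclassified" bucket.
named_classes = [name for name in class_counts if name != "unclassified"]

print(total_genes, len(named_classes))
```

    The counts sum to the 149 genes reported, spread over eight named classes plus the unclassified genes.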

  20. Genome-wide association analysis implicates dysregulation of immunity genes in chronic lymphocytic leukaemia

    PubMed Central

    Law, Philip J.; Berndt, Sonja I.; Speedy, Helen E.; Camp, Nicola J.; Sava, Georgina P.; Skibola, Christine F.; Holroyd, Amy; Joseph, Vijai; Sunter, Nicola J.; Nieters, Alexandra; Bea, Silvia; Monnereau, Alain; Martin-Garcia, David; Goldin, Lynn R.; Clot, Guillem; Teras, Lauren R.; Quintela, Inés; Birmann, Brenda M.; Jayne, Sandrine; Cozen, Wendy; Majid, Aneela; Smedby, Karin E.; Lan, Qing; Dearden, Claire; Brooks-Wilson, Angela R.; Hall, Andrew G.; Purdue, Mark P.; Mainou-Fowler, Tryfonia; Vajdic, Claire M.; Jackson, Graham H.; Cocco, Pierluigi; Marr, Helen; Zhang, Yawei; Zheng, Tongzhang; Giles, Graham G.; Lawrence, Charles; Call, Timothy G.; Liebow, Mark; Melbye, Mads; Glimelius, Bengt; Mansouri, Larry; Glenn, Martha; Curtin, Karen; Diver, W Ryan; Link, Brian K.; Conde, Lucia; Bracci, Paige M.; Holly, Elizabeth A.; Jackson, Rebecca D.; Tinker, Lesley F.; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Maynadie, Marc; McKay, James; Albanes, Demetrius; Weinstein, Stephanie; Wang, Zhaoming; Caporaso, Neil E.; Morton, Lindsay M.; Severson, Richard K.; Riboli, Elio; Vineis, Paolo; Vermeulen, Roel C. H.; Southey, Melissa C.; Milne, Roger L.; Clavel, Jacqueline; Topka, Sabine; Spinelli, John J.; Kraft, Peter; Ennas, Maria Grazia; Summerfield, Geoffrey; Ferri, Giovanni M.; Harris, Robert J.; Miligi, Lucia; Pettitt, Andrew R.; North, Kari E.; Allsup, David J.; Fraumeni, Joseph F.; Bailey, James R.; Offit, Kenneth; Pratt, Guy; Hjalgrim, Henrik; Pepper, Chris; Chanock, Stephen J.; Fegan, Chris; Rosenquist, Richard; de Sanjose, Silvia; Carracedo, Angel; Dyer, Martin J. S.; Catovsky, Daniel; Campo, Elias; Cerhan, James R.; Allan, James M.; Rothman, Nathanial; Houlston, Richard; Slager, Susan

    2017-01-01

    Several chronic lymphocytic leukaemia (CLL) susceptibility loci have been reported; however, much of the heritable risk remains unidentified. Here we perform a meta-analysis of six genome-wide association studies, imputed using a merged reference panel of 1,000 Genomes and UK10K data, totalling 6,200 cases and 17,598 controls after replication. We identify nine risk loci at 1p36.11 (rs34676223, P=5.04 × 10^-13), 1q42.13 (rs41271473, P=1.06 × 10^-10), 4q24 (rs71597109, P=1.37 × 10^-10), 4q35.1 (rs57214277, P=3.69 × 10^-8), 6p21.31 (rs3800461, P=1.97 × 10^-8), 11q23.2 (rs61904987, P=2.64 × 10^-11), 18q21.1 (rs1036935, P=3.27 × 10^-8), 19p13.3 (rs7254272, P=4.67 × 10^-8) and 22q13.33 (rs140522, P=2.70 × 10^-9). These new and established risk loci map to areas of active chromatin and show an over-representation of transcription factor binding for the key determinants of B-cell development and immune response. PMID:28165464
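
    To make the reported P values easier to compare, the sketch below checks each locus against a genome-wide significance threshold. The 5 × 10^-8 cut-off is an assumption here (it is the conventional GWAS threshold, but the abstract does not state which one was applied); the P values are transcribed from the abstract.

```python
# Reported P values per locus, transcribed from the abstract above.
locus_p_values = {
    "1p36.11": 5.04e-13,
    "1q42.13": 1.06e-10,
    "4q24": 1.37e-10,
    "4q35.1": 3.69e-8,
    "6p21.31": 1.97e-8,
    "11q23.2": 2.64e-11,
    "18q21.1": 3.27e-8,
    "19p13.3": 4.67e-8,
    "22q13.33": 2.70e-9,
}

# Assumed threshold: the conventional genome-wide significance level.
GENOME_WIDE_ALPHA = 5e-8

# Loci whose P value falls below the assumed threshold.
significant = [locus for locus, p in locus_p_values.items()
               if p < GENOME_WIDE_ALPHA]

# Locus with the smallest (most significant) P value.
top_locus = min(locus_p_values, key=locus_p_values.get)

print(len(significant), top_locus)
```

    Under that assumed threshold all nine loci pass, with 1p36.11 the strongest signal and 19p13.3 the closest to the cut-off.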

  1. Mammalian NET-seq analysis defines nascent RNA profiles and associated RNA processing genome-wide

    PubMed Central

    Nojima, Takayuki; Gomes, Tomás; Carmo-Fonseca, Maria; Proudfoot, Nicholas J

    2016-01-01

    The transcription cycle of RNA polymerase II (Pol II) correlates with changes to the phosphorylation state of its large subunit C-terminal domain (CTD). We recently developed Native Elongation Transcript sequencing using mammalian cells (mNET-seq), which generates single-nucleotide–resolution genome-wide profiles of nascent RNA and co-transcriptional RNA processing that are associated with different CTD phosphorylation states. Here we provide a detailed protocol for mNET-seq. First, Pol II elongation complexes are isolated with specific phospho-CTD antibodies from chromatin solubilized by micrococcal nuclease digestion. Next, RNA derived from within the Pol II complex is size fractionated and Illumina sequenced. Using mNET-seq, we have previously shown that Pol II pauses at both ends of protein-coding genes but with different CTD phosphorylation patterns, and we have also detected phosphorylation at serine 5 (Ser5-P) CTD-specific splicing intermediates and Pol II accumulation over co-transcriptionally spliced exons. With moderate biochemical and bioinformatic skills, mNET-seq can be completed in ~6 d, not including sequencing and data analysis. PMID:26844429

  2. Meta-analysis and genome-wide interpretation of genetic susceptibility to drug addiction

    PubMed Central

    2011-01-01

    Background Classical genetic studies provide strong evidence for heritable contributions to susceptibility to developing dependence on addictive substances. Candidate gene and genome-wide association studies (GWAS) have sought genes, chromosomal regions and allelic variants likely to contribute to susceptibility to drug addiction. Results Here, we performed a meta-analysis of addiction candidate gene association studies and GWAS to investigate possible functional mechanisms associated with addiction susceptibility. From meta-data retrieved from 212 publications on candidate gene association studies and 5 GWAS reports, we linked a total of 843 haplotypes to addiction susceptibility. We mapped the SNPs in these haplotypes to functional and regulatory elements in the genome and estimated the magnitude of the contributions of different molecular mechanisms to their effects on addiction susceptibility. In addition to SNPs in coding regions, these data suggest that haplotypes in gene regulatory regions may also contribute to addiction susceptibility. When we compared the lists of genes identified by association studies and those identified by molecular biological studies of drug-regulated genes, we observed significantly higher participation in the same gene interaction networks than expected by chance, despite little overlap between the two gene lists. Conclusions These results appear to offer new insights into the genetic factors underlying drug addiction. PMID:21999673

  3. Genome-wide identification, classification and transcriptional analysis of nitrate and ammonium transporters in Coffea.

    PubMed

    Santos, Tiago Benedito Dos; Lima, Joni Esrom; Felicio, Mariane Silva; Soares, João Danillo Moura; Domingues, Douglas Silva

    2017-04-10

    Nitrogen (N) is quantitatively the main nutrient required by coffee plants; it is acquired mainly by the roots and mostly exported to the coffee beans. Nitrate (NO3-) and ammonium (NH4+) are the most important inorganic sources for N uptake. Several N transporters encoded by different gene families mediate the uptake of these compounds. They have an important role in source preference for N uptake in the root system. In this study, we performed a genome-wide analysis, including in silico expression and phylogenetic analyses of AMT1, AMT2, NRT1/PTR, and NRT2 transporters in the recently sequenced Coffea canephora genome. We analyzed the expression of six selected transporters in Coffea arabica roots subjected to N deficiency. N source preference was also analyzed in C. arabica using isotopes. C. canephora N transporters follow the patterns observed for most eudicots, where each member of the AMT and NRT families has a particular role in N mobilization, and where some of these are modulated by N deficiency. Despite the prevalence of putative nitrate transporters in the Coffea genome, ammonium was the preferential inorganic N source for N-starved C. arabica roots. These data provide an important basis for fundamental and applied studies to depict molecular mechanisms involved in N uptake in coffee trees.

  4. A genome-wide resource for the analysis of protein localisation in Drosophila

    PubMed Central

    Sarov, Mihail; Barz, Christiane; Jambor, Helena; Hein, Marco Y; Schmied, Christopher; Suchold, Dana; Stender, Bettina; Janosch, Stephan; KJ, Vinay Vikas; Krishnan, RT; Krishnamoorthy, Aishwarya; Ferreira, Irene RS; Ejsmont, Radoslaw K; Finkl, Katja; Hasse, Susanne; Kämpfer, Philipp; Plewka, Nicole; Vinis, Elisabeth; Schloissnig, Siegfried; Knust, Elisabeth; Hartenstein, Volker; Mann, Matthias; Ramaswami, Mani; VijayRaghavan, K; Tomancak, Pavel; Schnorrer, Frank

    2016-01-01

    The Drosophila genome contains >13000 protein-coding genes, the majority of which remain poorly investigated. Important reasons include the lack of antibodies or reporter constructs to visualise these proteins. Here, we present a genome-wide fosmid library of 10000 GFP-tagged clones, comprising tagged genes and most of their regulatory information. For 880 tagged proteins, we created transgenic lines, and for a total of 207 lines, we assessed protein expression and localisation in ovaries, embryos, pupae or adults by stainings and live imaging approaches. Importantly, we visualised many proteins at endogenous expression levels and found a large fraction of them localising to subcellular compartments. By applying genetic complementation tests, we estimate that about two-thirds of the tagged proteins are functional. Moreover, these tagged proteins enable interaction proteomics from developing pupae and adult flies. Taken together, this resource will boost systematic analysis of protein expression and localisation in various cellular and developmental contexts. DOI: http://dx.doi.org/10.7554/eLife.12068.001 PMID:26896675

  5. Genome-wide analysis of glutathione reductase (GR) genes from rice and Arabidopsis.

    PubMed

    Trivedi, Dipesh Kumar; Gill, Sarvajeet Singh; Yadav, Sandep; Tuteja, Narendra

    2013-02-01

    Plant cells and tissues are always at risk under abiotic and biotic stresses due to increased production of reactive oxygen species (ROS). Plants protect themselves against ROS-induced oxidative damage by upregulating their antioxidant machinery. Among the many components of the antioxidant machinery, glutathione reductase (GR, EC 1.6.4.2) and glutathione (GSH, γ-Glu-Cys-Gly) play an important role in protecting the cell against oxidative damage. Under stress conditions, GR helps maintain the reduced glutathione pool, strengthening the antioxidative processes in plants. The present study reports a genome-wide analysis of GR from rice and Arabidopsis. We were able to identify 3 rice GR genes (LOC_Os02g56850, LOC_Os03g06740, LOC_Os10g28000) and 2 Arabidopsis GR genes (AT3G54660, AT3G24170) from their respective genomes on the basis of their annotation as well as the presence of the pyridine nucleotide-disulphide oxidoreductases class-I active site. The evolutionary relationship of the GR genes from the rice and Arabidopsis genomes was analyzed using multiple sequence alignment and a phylogenetic tree. This revealed an evolutionarily conserved pyridine nucleotide-disulphide oxidoreductases class-I active site among the GR proteins in rice and Arabidopsis. This study should make an important contribution to a better understanding of GR under normal and stress conditions in plants.

  6. A molecular scheme for Yersinia enterocolitica patho-serotyping derived from genome-wide analysis.

    PubMed

    Garzetti, Debora; Susen, Rosa; Fruth, Angelika; Tietze, Erhard; Heesemann, Jürgen; Rakin, Alexander

    2014-05-01

    Yersinia enterocolitica is a food-borne, gastro-intestinal pathogen with world-wide distribution. Only 11 serotypes have been isolated from patients, with O:3, O:9, O:8 and O:5,27 being the serotypes most commonly associated with human yersiniosis. Serotype is an important characteristic of Y. enterocolitica strains, allowing differentiation for epidemiology, diagnosis and phylogeny studies. Conventional serotyping, performed by slide agglutination, is a tedious and laborious procedure whose interpretation tends to be subjective, leading to poor reproducibility. Here we present a PCR-based typing scheme for molecular identification and patho-serotyping of Y. enterocolitica. Genome-wide comparison of Y. enterocolitica sequences allowed analysis of the O-antigen gene clusters of different serotypes, uncovering their formerly unknown genomic locations, and selection of targets for serotype-specific amplification. Two multiplex PCRs and one additional PCR were designed and tested on various reference strains and isolates from different origins. Our genotypic assay proved to be highly specific for identification of Y. enterocolitica species, discrimination between virulent and non-virulent strains, distinguishing the main human-related serotypes, and typing of conventionally untypeable strains. This genotyping scheme could be applied in microbiology laboratories as an alternative or complementary method to the traditional phenotypic assays, providing data for epidemiological studies.

  7. Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes

    PubMed Central

    Wang, Da-Wei; Li, Da; Wang, Junjun; Zhao, Yue; Wang, Zhaojun; Yue, Guidong; Liu, Xin; Qin, Huanju; Zhang, Kunpu; Dong, Lingli; Wang, Daowen

    2017-01-01

    Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 α-, 11 γ-, one δ- and five ω-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of α-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The δ-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic α-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat. PMID:28300172

  8. Genome-wide analysis suggests divergent evolution of lipid phosphotases/phosphotransferase genes in plants.

    PubMed

    Wang, Peng; Chen, Zhenxi; Kasimu, Rena; Chen, Yinhua; Zhang, Xiaoxiao; Gai, Jiangtao

    2016-08-01

    Genes of the LPPT (lipid phosphatase/phosphotransferase) family play important roles in lipid phosphorous transfer and triacylglycerol accumulation in plants. To provide an overview of the plant LPPT family and its overall relationships, here we carried out genome-wide identification and analysis of plant LPPT family members. A total of 643 putative LPPT genes were identified from 48 sequenced plant genomes, among which 205 genes from 14 plants were chosen for further analyses. Plant LPPT genes belonged to three distinctive groups, namely the LPT (lipid phosphotransferase), LPP (lipid phosphatase), and pLPP (plastidic lipid phosphotransferase) groups. Genes of the LPT group could be further partitioned into three groups, two of which were only identified in terrestrial plants. Genes in the LPP and pLPP groups experienced duplications in early stages of plant evolution. Among 17 Zea mays LPPT genes, divergence of temporal-spatial expression patterns was revealed based on microarray data analysis. Peptide sequences of plant LPPT genes harbored different conserved motifs. A test of Branch Model versus One-ratio Model did not support significant selective pressures acting on different groups of LPPT genes, although quite different nonsynonymous evolutionary rates and selective pressures were observed. The complete picture of the plant LPPT family provided here should facilitate further investigations of plant LPPT genes and offer a better understanding of lipid biosynthesis in plants.

  9. The development of an Arabidopsis model system for genome-wide analysis of polyploidy effects

    PubMed Central

    JEFFREY CHEN, Z.; WANG, JIANLIN; TIAN, LU; LEE, HYEON-SE; WANG, JIYUAN J.; CHEN, MENG; LEE, JINSUK J.; JOSEFSSON, CAROLINE; MADLUNG, ANDREAS; WATSON, BRIAN; LIPPMAN, ZACH; VAUGHN, MATT; CHRIS PIRES, J.; COLOT, VINCENT; DOERGE, R. W.; MARTIENSSEN, ROBERT A.; COMAI, LUCA; OSBORN, THOMAS C.

    2007-01-01

    Arabidopsis is a model system not only for studying numerous aspects of plant biology, but also for understanding mechanisms of the rapid evolutionary process associated with genome duplication and polyploidization. Although in animals interspecific hybrids are often sterile and aneuploids are related to disease syndromes, both Arabidopsis autopolyploids and allopolyploids occur in nature and can be readily formed in the laboratory, providing an attractive system for comparing changes in gene expression and genome structure among relatively ‘young’ and ‘established’ or ‘ancient’ polyploids. Powerful reverse and forward genetics in Arabidopsis offer an exceptional means by which regulatory mechanisms of gene and genome duplication may be revealed. Moreover, the Arabidopsis genome is completely sequenced; both coding and non-coding sequences are available. We have developed spotted oligo-gene and chromosome microarrays using the complete Arabidopsis genome sequence. The oligo-gene microarray consists of ~26 000 70-mer oligonucleotides that are designed from all annotated genes in Arabidopsis, and the chromosome microarray contains 1 kb genomic tiling fragments amplified from a chromosomal region or the complete sequence of chromosome 4. We have demonstrated the utility of microarrays for genome-wide analysis of changes in gene expression, genome organization and chromatin structure in Arabidopsis polyploids and related species. PMID:18079994

  10. Genome-wide characterization and comparative analysis of the MLO gene family in cotton.

    PubMed

    Wang, Xiaoyan; Ma, Qifeng; Dou, Lingling; Liu, Zhen; Peng, Renhai; Yu, Shuxun

    2016-06-01

    In plants, the MLO (Mildew Locus O) gene encodes a plant-specific seven transmembrane (TM) domain protein involved in several cellular processes, including susceptibility to powdery mildew (PM). In this study, a genome-wide characterization of the MLO gene family in G. raimondii L., G. arboreum L. and G. hirsutum L. was performed. In total, 22, 17 and 38 homologous sequences were identified for each species, respectively. Gene organization, including chromosomal location, gene clustering and gene duplication, was investigated. Homologues related to PM susceptibility in upland cotton were inferred by phylogenetic relationships with functionally characterized MLO proteins. To conduct a comparative analysis between MLO candidate genes from G. raimondii L., G. arboreum L. and G. hirsutum L., orthologous relationships and conserved synteny blocks were constructed. The transcriptional variation of 38 GhMLO genes in response to exogenous application of salt, mannitol (Man), abscisic acid (ABA), ethylene (ETH), jasmonic acid (JA) and salicylic acid (SA) was monitored. Further studies should be conducted to elucidate the functions of MLO genes in PM susceptibility and phytohormone signalling pathways.

  11. Analysis of copy number variations in Mexican Holstein cattle using axiom genome-wide Bos 1 array

    PubMed Central

    Salomon-Torres, Ricardo; Villa-Angulo, Rafael; Villa-Angulo, Carlos

    2015-01-01

    Recently, for copy number variation (CNV) analysis, bovine researchers have focused mainly on the use of genome-wide SNP genotyping arrays. One of the highest-density commercially available SNP chips for cattle is the Affymetrix axiom genome-wide Bos 1, which assays 648,315 informative SNPs across the whole bovine genome. Here, we describe the microarray data, quality controls and validation implemented in a study published in Genetics and Molecular Research Journal in 2015 [1]. The raw microarray data have been deposited in Gene Expression Omnibus under accession #GSE54813. PMID:26981375

  12. Genome-wide DNA methylation analysis using massively parallel sequencing technologies.

    PubMed

    Suzuki, Masako; Greally, John M

    2013-01-01

    "Epigenetics" refers to a heritable change in transcriptional status without alteration in the primary nucleotide sequence. Epigenetics provides an extra layer of transcriptional control and plays a crucial role in normal development, as well as in pathological conditions. DNA methylation is one of the best known and well-studied epigenetic modifications. Genome-wide DNA methylation profiling has become recognized as a biologically and clinically important epigenomic assay. In this review, we discuss the strengths and weaknesses of the protocols for genome-wide DNA methylation profiling using massively parallel sequencing (MPS) techniques. We will also describe recently discovered DNA modifications, and the protocols to detect these modifications.

  13. Genome-wide identification and analysis of the MADS-box gene family in sesame.

    PubMed

    Wei, Xin; Wang, Linhai; Yu, Jingyin; Zhang, Yanxin; Li, Donghua; Zhang, Xiurong

    2015-09-10

    MADS-box genes encode transcription factors that play crucial roles in plant growth and development. Sesame (Sesamum indicum L.) is an oil crop that contributes to the daily oil and protein requirements of almost half of the world's population; therefore, a genome-wide analysis of the MADS-box gene family is needed. Fifty-seven MADS-box genes were identified from 14 linkage groups of the sesame genome. Analysis of phylogenetic relationships with Arabidopsis thaliana, Utricularia gibba and Solanum lycopersicum MADS-box genes was performed. Sesame MADS-box genes were clustered into four groups: 28 MIKC(c)-type, 5 MIKC(*)-type, 14 Mα-type and 10 Mγ-type. Gene structure analysis revealed that sesame MADS-box genes contain from 1 to 22 exons. The number of exons in type II MADS-box genes greatly exceeded the number in type I genes. Motif distribution analysis of sesame MADS-box genes also indicated that type II MADS-box genes contained more motifs than type I genes. These results suggested that type II sesame MADS-box genes had more complex structures. By analyzing expression profiles of MADS-box genes in seven sesame transcriptomes, we determined that MIKC(c)-type MADS-box genes played significant roles in sesame flower and seed development. Although most MADS-box genes in the same clade showed similar expression features, some gene functions were diversified from the orthologous Arabidopsis genes. This research will contribute to uncovering the role of MADS-box genes in sesame development.

  14. Genome-wide analysis of SAUR gene family in Solanaceae species.

    PubMed

    Wu, Jian; Liu, Songyu; He, Yanjun; Guan, Xiaoyan; Zhu, Xiangfei; Cheng, Lin; Wang, Jie; Lu, Gang

    2012-11-01

    The plant hormone auxin plays a vital role in regulating many aspects of plant growth and development. Small auxin up-regulated RNAs (SAURs) are primary auxin response genes hypothesized to be involved in the auxin signaling pathway, but their functions remain unclear. Here, a genome-wide search for SAUR gene homologues in Solanaceae species identified 99 and 134 members of the SAUR gene family from tomato and potato, respectively. Phylogenetic analysis indicated that the SAUR proteins from Arabidopsis, rice, sorghum, tomato and potato were divided into four major groups with 16 subgroups. Among them, 25 histidine-rich SAUR genes with metal-binding characteristics were found in Arabidopsis, sorghum and Solanaceae species, but not in rice. Using tomato as a model, a comprehensive overview of the SAUR gene family is presented, including the gene structures, phylogeny and chromosome locations. Quantitative real-time PCR analysis indicated that 11 randomly selected SlSAUR genes in tomato were expressed in at least one of the tomato organs/tissues tested. However, different SlSAUR genes displayed distinctive expression levels. SlSAUR16 and SlSAUR71 exhibited highly tissue-specific expression patterns. Almost all of the detected SlSAURs showed an accumulating pattern of mRNA along tomato flower and fruit development. Some of them displayed differential response to exogenous IAA treatment. The abiotic (cold, salt and drought) stresses significantly modified transcript levels of SlSAUR genes. Most of them were down-regulated in response to abiotic stresses (drought, heat and salinity), but SlSAUR58, as a histidine-rich SAUR gene, was up-regulated after salt treatment, indicating that it may play a specific role in the salt signaling transduction pathway. Our comparative analysis provides some basic genomic information for the SAUR genes in the Solanaceae species and will pave the way for deciphering their function during plant development.

  15. Identification of susceptibility modules for coronary artery disease using a genome wide integrated network analysis.

    PubMed

    Duan, Shiwei; Luo, Xuhong; Dong, Changzheng

    2013-12-01

    Although recent genome-wide association studies (GWAS) have identified a handful of highly significant variants for coronary artery disease (CAD), it remains a challenge to summarize the underlying biological information from the abundant genotyping data. Here, we propose an integrated network analysis that effectively combines a GWAS genotyping dataset, a protein-protein interaction (PPI) database, and literature and pathway annotation information. This three-step approach was illustrated for a comprehensive network analysis of CAD as follows. First, a network was constructed from the PPI database and CAD seed genes mined from the available literature. Then, susceptibility network modules were captured from the results of gene-based association tests. Finally, susceptibility modules were annotated with potential mechanisms for CAD via the KEGG pathway database. Our network analysis identified four susceptibility modules for CAD: a complex module consisting of 15 functionally interconnected sub-modules, an AGPAT3-AGPAT4-PPAP2B module, an ITGA11-ITGB1 module and an EMCN-SELL module. MAPK10 and COL4A2, in the top-scoring focal adhesion pathway-related module, were the most significant genes (MAPK10: OR=32.5, P=3.5 × 10^-11; COL4A2: OR=2.7, P=2.8 × 10^-10). The significance of the two genes was further validated by two other gene-based association tests (MAPK10: P=0.009 and 0.007; COL4A2: P=0.001 and 0.023) and another independent GWAS dataset (MAPK10: P=0.001; COL4A2: P=0.0004). Furthermore, 34 out of 44 previously reported CAD susceptibility genes were captured by our CAD PPI network and 17 of them were also significant genes. The susceptibility modules identified in our study might provide novel clues for the clarification of CAD pathogenesis in the future.

  16. Genome Wide Association Analysis of Copy Number Variation in Recurrent Depressive Disorder

    PubMed Central

    Rucker, James J.H.; Breen, Gerome; Pinto, Dalila; Pedroso, Inti; Lewis, Cathryn M.; Cohen-Woods, Sarah; Uher, Rudolf; Schosser, Alexandra; Rivera, Margarita; Aitchison, Katherine J.; Craddock, Nick; Owen, Michael J.; Jones, Lisa; Jones, Ian; Korszun, Ania; Muglia, Pierandrea; Barnes, Michael R.; Preisig, Martin; Mors, Ole; Gill, Mike; Maier, Wolfgang; Rice, John; Rietschel, Marcella; Holsboer, Florian; Farmer, Anne E.; Craig, Ian W.; Scherer, Stephen W.; McGuffin, Peter

    2014-01-01

    Large, rare copy number variants (CNVs) have been implicated in a variety of psychiatric disorders, but the role of CNVs in recurrent depression is unclear. We performed a genome-wide analysis of large, rare CNVs in 3,106 cases of recurrent depression, 459 controls screened for lifetime-absence of psychiatric disorder and 5,619 unscreened controls from phase 2 of the Wellcome Trust Case Control Consortium (WTCCC2). We compared the frequency of cases with CNVs against the frequency observed in each control group, analysing CNVs over the whole genome, genic, intergenic, intronic and exonic regions. We found that deletion CNVs were associated with recurrent depression while duplications were not. The effect was significant when comparing cases to WTCCC2 controls (p=7.7×10^-6, OR=1.25 (95% CI 1.13 - 1.37)) and to screened controls (p=5.6×10^-4, OR=1.52 (95% CI 1.20 - 1.93)). Further analysis showed that CNVs deleting protein coding regions were largely responsible for the association. Within an analysis of regions previously implicated in schizophrenia, we found an overall enrichment of CNVs in our cases when compared to screened controls (p=0.019). We observe an ordered increase of samples with deletion CNVs, with the lowest proportion seen in screened controls, the next highest in unscreened controls and the highest in cases. This may suggest that the absence of deletion CNVs, especially in genes, is associated with resilience to recurrent depression. PMID:22042228
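
    The odds ratios and confidence intervals quoted above can be related to their P values with a standard back-calculation. The sketch below assumes a normal (Wald) approximation on the log-odds scale and symmetric 95% intervals; the helper name is hypothetical and the authors' actual test may differ, so the recovered P values are only expected to land near the reported ones.

```python
import math


def z_and_p_from_odds_ratio(odds_ratio: float, ci_low: float, ci_high: float,
                            z_crit: float = 1.96) -> tuple[float, float]:
    """Approximate z score and two-sided P value from an OR and its 95% CI.

    Assumes the interval is a Wald interval on the log-odds scale
    (an assumption; the abstract does not say how it was derived).
    """
    # Standard error of log(OR), recovered from the width of the interval.
    se_log_or = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se_log_or
    # Two-sided tail probability of a standard normal variate.
    p_two_sided = math.erfc(abs(z) / math.sqrt(2))
    return z, p_two_sided


# Cases versus WTCCC2 controls, and cases versus screened controls.
z_wtccc2, p_wtccc2 = z_and_p_from_odds_ratio(1.25, 1.13, 1.37)
z_screened, p_screened = z_and_p_from_odds_ratio(1.52, 1.20, 1.93)

print(f"WTCCC2:   z={z_wtccc2:.2f}, p={p_wtccc2:.1e}")
print(f"screened: z={z_screened:.2f}, p={p_screened:.1e}")
```

    Both recovered P values fall in the same order of magnitude as those reported, which is a useful consistency check when transcribing effect sizes from an abstract.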

  17. Genome-wide analysis in endangered populations: a case study in Barbaresca sheep.

    PubMed

    Mastrangelo, S; Portolano, B; Di Gerlando, R; Ciampolini, R; Tolone, M; Sardina, M T

    2017-01-12

    Analysis of genomic data is becoming increasingly common in the livestock industry and the findings have been an invaluable resource for effective management of breeding programs in small and endangered populations. In this paper, with the goal of highlighting the potential of genomic analysis for small and endangered populations, genome-wide levels of linkage disequilibrium, measured as the squared correlation coefficient of allele frequencies at a pair of loci, effective population size, runs of homozygosity (ROH) and genetic diversity parameters, were estimated in Barbaresca sheep using Illumina OvineSNP50K array data. Moreover, the breed's genetic structure and its relationship with other breeds were investigated. Levels of pairwise linkage disequilibrium decreased with increasing distance between single nucleotide polymorphisms. An average correlation coefficient <0.25 was found for markers located up to 50 kb apart. Therefore, these results support the need to use denser single nucleotide polymorphism panels for high power association mapping and genomic selection efficiency in future breeding programs. The estimate of past effective population size ranged from 747 animals 250 generations ago to 28 animals five generations ago, whereas the contemporary effective population size was 25 animals. A total of 637 ROH were identified, most of which were short (67%) and ranged from 1 to 10 Mb. The genetic analyses revealed that the Barbaresca breed tended to display lower variability than other Sicilian breeds. Recent inbreeding was evident, according to the ROH analysis. All the investigated parameters showed a comparatively narrow genetic base and indicated an endangered status for Barbaresca. Multidimensional scaling, model-based clustering, measurement of population differentiation, neighbor networks and haplotype sharing distinguished Barbaresca from other breeds, showed a low level of admixture with the other breeds considered in this study, and indicated

  18. Oxidative Stress and Heat-Shock Responses in Desulfovibrio vulgaris by Genome-Wide Transcriptomic Analysis

    SciTech Connect

    Zhang, Weiwen; Culley, David E.; Hogan, Mike; Vitiritti, Luigi; Brockman, Fred J.

    2006-05-30

    Sulfate-reducing bacteria such as Desulfovibrio vulgaris have developed a set of reactions allowing them to survive in environments. To obtain further knowledge of the protective mechanisms employed by D. vulgaris against oxidative stress and heat shock, we performed a genome-wide transcriptomic analysis to determine the cellular responses to both stimuli. The results showed that 130 genes were responsive to oxidative stress and 427 genes were responsive to heat shock. Functional analyses suggested that the regulated genes were involved in a variety of cellular functions. Metabolic analysis showed that amino acid biosynthetic pathways were induced by both oxidative stress and heat shock treatments, while fatty acid metabolism and purine and cofactor biosynthesis were induced by heat shock only. The rubrerythrin gene (rbR) was upregulated by oxidative stress, suggesting an important role in oxidative resistance, whereas expression of the rubredoxin oxidoreductase (rbO), superoxide dismutase (sodB) and catalase (katA) genes was not regulated by oxidative stress in D. vulgaris. In addition, the results showed that thioredoxin reductase (trxB) was responsive to oxidative stress, suggesting that the thiol-specific redox system might be involved in oxidative protection in D. vulgaris. Comparison of cellular responses to oxidative stress and heat shock allowed the identification of 66 genes that showed a similarly drastic response to both environmental stimuli, implying that they might be part of the general stress response (GSR) network in D. vulgaris. This was further supported by the finding of a conserved motif upstream of these common-responsive genes.

  19. Genome-Wide Analysis of Polycistronic MicroRNAs in Cultivated and Wild Rice

    PubMed Central

    Baldrich, Patricia; Hsing, Yue-Ie Caroline; San Segundo, Blanca

    2016-01-01

    MicroRNAs (miRNAs) are small noncoding RNAs that direct posttranscriptional gene silencing in eukaryotes. They are frequently clustered in the genomes of animals and can be independently transcribed or simultaneously transcribed into single polycistronic transcripts. Only a few miRNA clusters have been described in plants, and most of them are generated from independent transcriptional units. Here, we used a combination of bioinformatic tools and experimental analyses to discover new polycistronic miRNAs in rice. A genome-wide analysis of clustering patterns of MIRNA loci in the rice genome was carried out using a criterion of 3 kb as the maximal distance between two miRNAs. This analysis revealed 28 loci with the ability to form the typical hairpin structure of miRNA precursors in which 2 or more mature miRNAs mapped along the same structure. RT-PCR provided evidence for the polycistronic nature of seven miRNA precursors containing homologous or nonhomologous miRNA species. Polycistronic miRNAs and candidate polycistronic miRNAs are located across different rice chromosomes, except chromosome 12, and resided in both duplicated and nonduplicated chromosomal regions. Finally, most polycistronic and candidate polycistronic miRNAs showed a pattern of conservation in the genome of rice species with an AA genome. The diversity in the organization of MIR genes that are transcribed as polycistrons suggests a versatile mechanism for the control of gene expression in different biological processes and supports additional levels of complexity in miRNA functioning in plants. PMID:27190137
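
    The 3 kb distance criterion described in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the tuple layout, the function name cluster_loci, and the choice to measure the gap from the end of one locus to the start of the next are all assumptions made for illustration.

```python
from collections import defaultdict

def cluster_loci(loci, max_gap=3000):
    """Group loci whose neighbours lie within max_gap bp on the same chromosome.

    loci: iterable of (name, chrom, start, end) tuples (hypothetical layout).
    Returns a list of clusters; each cluster is a list of >= 2 locus names.
    """
    by_chrom = defaultdict(list)
    for name, chrom, start, end in loci:
        by_chrom[chrom].append((start, end, name))

    clusters = []
    for chrom in sorted(by_chrom):
        current, prev_end = [], None
        for start, end, name in sorted(by_chrom[chrom]):
            # Close the running cluster when the gap exceeds the criterion.
            if current and start - prev_end > max_gap:
                if len(current) >= 2:
                    clusters.append(current)
                current, prev_end = [], None
            current.append(name)
            prev_end = end if prev_end is None else max(prev_end, end)
        if len(current) >= 2:
            clusters.append(current)
    return clusters

# Made-up coordinates, for illustration only.
example = [
    ("a", "chr1", 100, 200),
    ("b", "chr1", 1500, 1600),
    ("c", "chr1", 10000, 10100),
    ("d", "chr2", 50, 150),
]
clusters = cluster_loci(example)
```

    Loci that end up alone are dropped, since a cluster needs at least two members. Whether a cluster is actually transcribed as one polycistronic unit is a separate experimental question, which the abstract addresses by RT-PCR.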

  20. Metabolic network driven analysis of genome-wide transcription data from Aspergillus nidulans

    PubMed Central

    David, Helga; Hofmann, Gerald; Oliveira, Ana Paula; Jarmer, Hanne; Nielsen, Jens

    2006-01-01

    Background Aspergillus nidulans (the asexual form of Emericella nidulans) is a model organism for aspergilli, which are an important group of filamentous fungi that encompasses human and plant pathogens as well as industrial cell factories. Aspergilli have a highly diversified metabolism and, because of their medical, agricultural and biotechnological importance, it would be valuable to have an understanding of how their metabolism is regulated. We therefore conducted a genome-wide transcription analysis of A. nidulans grown on three different carbon sources (glucose, glycerol, and ethanol) with the objective of identifying global regulatory structures. Furthermore, we reconstructed the complete metabolic network of this organism, which resulted in linking 666 genes to metabolic functions, as well as assigning metabolic roles to 472 genes that were previously uncharacterized. Results Through combination of the reconstructed metabolic network and the transcription data, we identified subnetwork structures that pointed to coordinated regulation of genes that are involved in many different parts of the metabolism. Thus, for a shift from glucose to ethanol, we identified coordinated regulation of the complete pathway for oxidation of ethanol, as well as upregulation of gluconeogenesis and downregulation of glycolysis and the pentose phosphate pathway. Furthermore, on change in carbon source from glucose to ethanol, the cells shift from using the pentose phosphate pathway as the major source of NADPH (nicotinamide adenine dinucleotide phosphate, reduced form) for biosynthesis to use of the malic enzyme. Conclusion Our analysis indicates that some of the genes are regulated by common transcription factors, making it possible to establish new putative links between known transcription factors and genes through clustering. PMID:17107606

  1. GDSL esterase/lipase genes in Brassica rapa L.: genome-wide identification and expression analysis.

    PubMed

    Dong, Xiangshu; Yi, Hankuil; Han, Ching-Tack; Nou, Ill-Sup; Hur, Yoonkang

    2016-04-01

    GDSL esterase/lipase proteins (GELPs), a very large subfamily of lipolytic enzymes, have been identified in microbes and many plants, but only a few have been characterized with respect to their roles in growth, development, and stress responses. In Brassica crops, as in many other species, genome-wide systematic analysis and functional studies of these genes are still lacking. As a first step to study their function in B. rapa ssp. pekinensis (Chinese cabbage), we comprehensively identified all GELP genes in the genome. We found a total of 121 Brassica rapa GDSL esterase/lipase protein genes (BrGELPs), forming three clades in the phylogenetic analysis (two major and one minor), with an asymmetrical chromosomal distribution. Most BrGELPs possess four strictly conserved residues (Ser-Gly-Asn-His) in four separate conserved regions, along with short conserved and clade-specific blocks, suggesting functional diversification of these proteins. Detailed expression profiling revealed that BrGELPs were expressed in various tissues, including floral organs, implying that BrGELPs play diverse roles in various tissues and during development. Ten percent of BrGELPs were specifically expressed in fertile buds, rather than male-sterile buds, implying their involvement in pollen development. Analyses of EXL6 (extracellular lipase 6) expression and its co-expressed genes in both B. rapa and Arabidopsis, as well as knockdown of this gene in Arabidopsis, revealed that this gene plays an important role in pollen development in both species. The data described in this study will facilitate future investigations of other BrGELP functions.

  2. Performance of single nucleotide polymorphisms versus haplotypes for genome-wide association analysis in barley.

    PubMed

    Lorenz, Aaron J; Hamblin, Martha T; Jannink, Jean-Luc

    2010-11-22

    Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well as or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data.
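
    The "simple sliding window" approach to building haplotype markers mentioned in this abstract can be sketched as follows. The abstract gives neither window size nor step, so window=2 and step=1 here are illustrative assumptions, as are the function name and the encoding of each line as a string of phased alleles.

```python
def sliding_window_haplotypes(lines, window=2, step=1):
    """Build haplotype markers from adjacent SNPs with a sliding window.

    lines: list of equal-length strings, one per line/individual, where
           character i is the allele at SNP i (hypothetical encoding).
    Returns a list of (start_index, [window allele string per line]).
    """
    if not lines:
        return []
    n_markers = len(lines[0])
    if any(len(h) != n_markers for h in lines):
        raise ValueError("all lines must be genotyped at the same markers")

    markers = []
    for start in range(0, n_markers - window + 1, step):
        # Each window of adjacent SNPs becomes one multi-allelic marker.
        markers.append((start, [h[start:start + window] for h in lines]))
    return markers

# Made-up genotypes for three lines at four SNPs.
genotypes = ["AACG", "ATCG", "AACT"]
hap_markers = sliding_window_haplotypes(genotypes)
```

    Each resulting marker can have more than two alleles, which is why a haplotype may tag a QTL allele that no single SNP tags on its own.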

  3. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume.

    PubMed

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factor family, are known to have important functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in the petal formation stage and gametophyte development. Genes in the TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petals, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly expressed during the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development.

  4. Genome-wide microarray analysis of tomato roots showed defined responses to iron deficiency

    PubMed Central

    2012-01-01

    Background Plants react to iron deficiency stress by adopting different kinds of adaptive responses. Tomato, a Strategy I plant, improves iron uptake through acidification of the rhizosphere, reduction of Fe3+ to Fe2+ and transport of Fe2+ into the cells. Large-scale transcriptional analyses of roots under iron deficiency are only available for a very limited number of plant species, with particular emphasis on Arabidopsis thaliana. Regarding tomato, an interesting model species for Strategy I plants and an economically important crop, physiological responses to Fe-deficiency have been thoroughly described and molecular analyses have provided evidence for genes involved in iron uptake mechanisms and their regulation. However, no detailed transcriptome analysis has been described so far. Results A genome-wide transcriptional analysis, performed with a chip that monitors the expression of more than 25,000 tomato transcripts, identified 97 differentially expressed transcripts by comparing roots of Fe-deficient and Fe-sufficient tomato plants. These transcripts are related to the physiological responses of tomato roots to the nutrient stress resulting in an improved iron uptake, including regulatory aspects, translocation, root morphological modification and adaptation in primary metabolic pathways, such as glycolysis and the TCA cycle. Other genes play a role in flavonoid biosynthesis and hormonal metabolism. Conclusions The transcriptional characterization confirmed the presence of the previously described mechanisms of adaptation to iron starvation in tomato, but also identified other genes potentially playing a role in this process, thus opening new research perspectives to improve knowledge of the tomato root response to nutrient deficiency. PMID:22433273

  5. Genome-wide linkage analysis for human longevity: Genetics of Healthy Ageing Study

    PubMed Central

    Beekman, Marian; Blanché, Hélène; Perola, Markus; Hervonen, Anti; Bezrukov, Vladyslav; Sikora, Ewa; Flachsbart, Frederieke; Christiansen, Lene; De Craen, Anton J.M.; Kirkwood, Tom B.L.; Rea, I. Meave; Poulain, Michel; Robine, Jean-Marie; Stazi, Maria Antonietta; Passarino, Giuseppe; Deiana, Luca; Gonos, Efstathios S.; Valensin, Silvana; Paternoster, Lavinia; Sørensen, Thorkild I.A.; Tan, Qihua; Helmer, Quinta; Van den Akker, Erik B.; Deelen, Joris; Martella, Francesca; Cordell, Heather J.; Ayers, Kristin L.; Vaupel, James W.; Törnwall, Outi; Johnson, Thomas E.; Schreiber, Stefan; Lathrop, Mark; Skytthe, Axel; Westendorp, Rudi G.J.; Christensen, Kaare; Gampe, Jutta; Nebel, Almut; Houwing-Duistermaat, Jeanine J.; Slagboom, P. Eline; Franceschi, Claudio

    2013-01-01

    Summary Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian sibling pairs that have been enrolled in fifteen study centers of eleven European countries as part of the Genetics of Healthy Ageing (GEHA) project. In the joint linkage analyses we observed four regions that show linkage with longevity: chromosome 14q11.2 (LOD = 3.47), chromosome 17q12-q22 (LOD = 2.95), chromosome 19p13.3-p13.11 (LOD = 3.76) and chromosome 19q13.11-q13.32 (LOD = 3.57). To fine map these regions linked to longevity, we performed association analysis using GWAS data in a subgroup of 1,228 unrelated nonagenarians and 1,907 geographically matched controls. Using a fixed effect meta-analysis approach, rs4420638 at the TOMM40/APOE/APOC1 gene locus showed significant association with longevity (p-value = 9.6 × 10^-8). By combined modeling of linkage and association we showed that association of longevity with APOEε4 and APOEε2 alleles explains the linkage at 19q13.11-q13.32 with p-value = 0.02 and p-value = 1.0 × 10^-5, respectively. In the largest linkage scan thus far performed for human familial longevity, we confirm that the APOE locus is a longevity gene and that additional longevity loci may be identified at 14q11.2, 17q12-q22 and 19p13.3-p13.11. Since the latter linkage results are not explained by common variants, we suggest that rare variants play an important role in human familial longevity. PMID:23286790
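
    The abstract names a fixed effect meta-analysis but does not give its details. A common form is inverse-variance weighting of per-study effect estimates, sketched below under that assumption. The function name and the input numbers are hypothetical and are not taken from the GEHA study.

```python
import math

def fixed_effect_meta(betas, ses):
    """Inverse-variance fixed-effect meta-analysis of one variant.

    betas: per-study effect estimates (e.g. log odds ratios).
    ses:   matching standard errors.
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if len(betas) != len(ses) or not betas:
        raise ValueError("need one standard error per effect estimate")
    weights = [1.0 / se ** 2 for se in ses]
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    # Two-sided p-value from the standard normal distribution.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p

# Two made-up studies with equal precision.
pooled_beta, pooled_se, z, p = fixed_effect_meta([0.2, 0.4], [0.1, 0.1])
```

    Because the weights are the inverse variances, more precise studies dominate the pooled estimate. The model assumes a single shared true effect across studies.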

  6. Genome-wide SNP analysis of the Systemic Capillary Leak Syndrome (Clarkson disease)

    PubMed Central

    Xie, Zhihui; Nagarajan, Vijayaraj; Sturdevant, Daniel E; Iwaki, Shoko; Chan, Eunice; Wisch, Laura; Young, Michael; Nelson, Celeste M; Porcella, Stephen F; Druey, Kirk M

    2013-01-01

    The Systemic Capillary Leak Syndrome (SCLS) is an extremely rare, orphan disease that resembles, and is frequently erroneously diagnosed as, systemic anaphylaxis. The disorder is characterized by repeated, transient, and seemingly unprovoked episodes of hypotensive shock and peripheral edema due to transient endothelial hyperpermeability. SCLS is often accompanied by a monoclonal gammopathy of unknown significance (MGUS). Using Affymetrix Single Nucleotide Polymorphism (SNP) microarrays, we performed the first genome-wide SNP analysis of SCLS in a cohort of 12 disease subjects and 18 controls. Exome capture sequencing was performed on genomic DNA from nine of these patients as validation for the SNP-chip discoveries and de novo data generation. We identified candidate susceptibility loci for SCLS, which included a region flanking CAV3 (3p25.3) as well as SNP clusters in PON1 (7q21.3), PSORS1C1 (6p21.3), and CHCHD3 (7q33). Among the most highly ranked discoveries were gene-associated SNPs in the uncharacterized LOC100130480 gene (rs6417039, rs2004296). Top case-associated SNPs were observed in BTRC (rs12355803, rs4436485), ARHGEF18 (rs11668246), CDH13 (rs4782779), and EDG2 (rs12552348), which encode proteins with known or suspected roles in B cell function and/or vascular integrity. 61 SNPs that were significantly associated with SCLS by microarray analysis were also detected and validated by exome deep sequencing. Functional annotation of highly ranked SNPs revealed enrichment of cell projections, cell junctions and adhesion, and molecules containing pleckstrin homology, Ras/Rho regulatory, and immunoglobulin Ig-like C2/fibronectin type III domains, all of which involve mechanistic functions that correlate with the SCLS phenotype. These results highlight SNPs with potential relevance to SCLS. PMID:24808988

  7. Genome-Wide Identification and Analysis of the TIFY Gene Family in Grape

    PubMed Central

    Zhang, Yucheng; Gao, Min; Singer, Stacy D.; Fei, Zhangjun; Wang, Hua; Wang, Xiping

    2012-01-01

    Background The TIFY gene family constitutes a plant-specific group of genes with a broad range of functions. This family encodes four subfamilies of proteins, including ZML, TIFY, PPD and JASMONATE ZIM-Domain (JAZ) proteins. JAZ proteins are targets of the SCFCOI1 complex, and function as negative regulators in the JA signaling pathway. Recently, it has been reported in both Arabidopsis and rice that TIFY genes, and especially JAZ genes, may be involved in plant defense against insect feeding, wounding, pathogens and abiotic stresses. Nonetheless, knowledge concerning the specific expression patterns and evolutionary history of plant TIFY family members is limited, especially in a woody species such as grape. Methodology/Principal Findings A total of two TIFY, four ZML, two PPD and 11 JAZ genes were identified in the Vitis vinifera genome. Phylogenetic analysis of TIFY protein sequences from grape, Arabidopsis and rice indicated that the grape TIFY proteins are more closely related to those of Arabidopsis than those of rice. Both segmental and tandem duplication events have been major contributors to the expansion of the grape TIFY family. In addition, synteny analysis between grape and Arabidopsis demonstrated that homologues of several grape TIFY genes were found in the corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of lineages that led to grape and Arabidopsis. Analyses of microarray and quantitative real-time RT-PCR expression data revealed that grape TIFY genes are not a major player in the defense against biotrophic pathogens or viruses. However, many of these genes were responsive to JA and ABA, but not SA or ET. Conclusion The genome-wide identification, evolutionary and expression analyses of grape TIFY genes should facilitate further research of this gene family and provide new insights regarding their evolutionary history and regulatory control. PMID:22984514

  8. Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption.

    PubMed

    Cornelis, M C; Byrne, E M; Esko, T; Nalls, M A; Ganna, A; Paynter, N; Monda, K L; Amin, N; Fischer, K; Renstrom, F; Ngwa, J S; Huikari, V; Cavadino, A; Nolte, I M; Teumer, A; Yu, K; Marques-Vidal, P; Rawal, R; Manichaikul, A; Wojczynski, M K; Vink, J M; Zhao, J H; Burlutsky, G; Lahti, J; Mikkilä, V; Lemaitre, R N; Eriksson, J; Musani, S K; Tanaka, T; Geller, F; Luan, J; Hui, J; Mägi, R; Dimitriou, M; Garcia, M E; Ho, W-K; Wright, M J; Rose, L M; Magnusson, P K E; Pedersen, N L; Couper, D; Oostra, B A; Hofman, A; Ikram, M A; Tiemeier, H W; Uitterlinden, A G; van Rooij, F J A; Barroso, I; Johansson, I; Xue, L; Kaakinen, M; Milani, L; Power, C; Snieder, H; Stolk, R P; Baumeister, S E; Biffar, R; Gu, F; Bastardot, F; Kutalik, Z; Jacobs, D R; Forouhi, N G; Mihailov, E; Lind, L; Lindgren, C; Michaëlsson, K; Morris, A; Jensen, M; Khaw, K-T; Luben, R N; Wang, J J; Männistö, S; Perälä, M-M; Kähönen, M; Lehtimäki, T; Viikari, J; Mozaffarian, D; Mukamal, K; Psaty, B M; Döring, A; Heath, A C; Montgomery, G W; Dahmen, N; Carithers, T; Tucker, K L; Ferrucci, L; Boyd, H A; Melbye, M; Treur, J L; Mellström, D; Hottenga, J J; Prokopenko, I; Tönjes, A; Deloukas, P; Kanoni, S; Lorentzon, M; Houston, D K; Liu, Y; Danesh, J; Rasheed, A; Mason, M A; Zonderman, A B; Franke, L; Kristal, B S; Karjalainen, J; Reed, D R; Westra, H-J; Evans, M K; Saleheen, D; Harris, T B; Dedoussis, G; Curhan, G; Stumvoll, M; Beilby, J; Pasquale, L R; Feenstra, B; Bandinelli, S; Ordovas, J M; Chan, A T; Peters, U; Ohlsson, C; Gieger, C; Martin, N G; Waldenberger, M; Siscovick, D S; Raitakari, O; Eriksson, J G; Mitchell, P; Hunter, D J; Kraft, P; Rimm, E B; Boomsma, D I; Borecki, I B; Loos, R J F; Wareham, N J; Vollenweider, P; Caporaso, N; Grabe, H J; Neuhouser, M L; Wolffenbuttel, B H R; Hu, F B; Hyppönen, E; Järvelin, M-R; Cupples, L A; Franks, P W; Ridker, P M; van Duijn, C M; Heiss, G; Metspalu, A; North, K E; Ingelsson, E; Nettleton, J A; van Dam, R M; Chasman, D I

    2015-05-01

    Coffee, a major dietary source of caffeine, is among the most widely consumed beverages in the world and has received considerable attention regarding health risks and benefits. We conducted a genome-wide (GW) meta-analysis of predominantly regular-type coffee consumption (cups per day) among up to 91,462 coffee consumers of European ancestry with top single-nucleotide polymorphisms (SNPs) followed-up in ~30,062 and 7,964 coffee consumers of European and African-American ancestry, respectively. Studies from both stages were combined in a trans-ethnic meta-analysis. Confirmed loci were examined for putative functional and biological relevance. Eight loci, including six novel loci, met GW significance (log10 Bayes factor (BF) > 5.64) with per-allele effect sizes of 0.03-0.14 cups per day. Six are located in or near genes potentially involved in pharmacokinetics (ABCG2, AHR, POR and CYP1A2) and pharmacodynamics (BDNF and SLC6A4) of caffeine. Two map to GCKR and MLXIPL genes related to metabolic traits but lacking known roles in coffee consumption. Enhancer and promoter histone marks populate the regions of many confirmed loci and several potential regulatory SNPs are highly correlated with the lead SNP of each. SNP alleles near GCKR, MLXIPL, BDNF and CYP1A2 that were associated with higher coffee consumption have previously been associated with smoking initiation, higher adiposity and fasting insulin and glucose but lower blood pressure and favorable lipid, inflammatory and liver enzyme profiles (P < 5 × 10^-8). Our genetic findings among European and African-American adults reinforce the role of caffeine in mediating habitual coffee consumption and may point to molecular mechanisms underlying inter-individual variability in pharmacological and health effects of coffee.

  9. Genome-wide analysis reveals positional-nucleosome-oriented binding pattern of pioneer factor FOXA1

    PubMed Central

    Ye, Zhenqing; Chen, Zhong; Sunkel, Benjamin; Frietze, Seth; Huang, Tim H.-M.; Wang, Qianben; Jin, Victor X.

    2016-01-01

    The compaction of nucleosomal structures creates a barrier for DNA-binding transcription factors (TFs) to access their cognate cis-regulatory elements. Pioneer factors (PFs) such as FOXA1 are able to directly access these cis-targets within compact chromatin. However, how these PFs interplay with nucleosomes remains to be elucidated, and is critical for understanding the underlying mechanism of gene regulation. Here, we have conducted a computational analysis of strand-specific paired-end ChIP-exo (termed ChIP-ePENS) data for FOXA1 in LNCaP cells using our novel algorithm ePEST. We find that FOXA1 chromatin binding occurs via four distinct border modes (or footprint boundary patterns), with a preferential footprint boundary pattern relative to FOXA1 motif orientation. In addition, from this analysis three fundamental nucleotide positions (oG, oS and oH) emerged as major determinants for blocking exo-digestion and forming these four distinct border modes. By integrating histone MNase-seq data, we found an astonishingly consistent, ‘well-positioned’ configuration between FOXA1 motifs and dyads of nucleosomes genome-wide. We further performed ChIP-seq of eight chromatin remodelers and found an increased occupancy of these remodelers on FOXA1 motifs for all four border modes, indicating that full occupancy of the FOXA1 complex on the three blocking sites (oG, oS and oH) likely produces an active regulatory status with well-positioned phasing for protein binding events. Together, our results suggest a positional-nucleosome-oriented accessing model for PFs seeking target motifs, in which FOXA1 can examine each underlying DNA nucleotide and is able to sense all potential motifs regardless of whether they face inward or outward from histone octamers along the DNA helix axis. PMID:27458208

  10. A combined analysis of genome-wide expression profiling of bipolar disorder in human prefrontal cortex.

    PubMed

    Wang, Jinglu; Qu, Susu; Wang, Weixiao; Guo, Liyuan; Zhang, Kunlin; Chang, Suhua; Wang, Jing

    2016-11-01

    A number of gene expression profiling studies of bipolar disorder have been published. Besides differences in array chips and tissues, the variety of data processing methods used in different cohorts has aggravated the inconsistency of results among these genome-wide gene expression profiling studies. By searching the gene expression databases, we obtained six data sets for prefrontal cortex (PFC) of bipolar disorder with raw data and combinable platforms. We used standardized pre-processing and quality control procedures to analyze each data set separately and then combined them into a large gene expression matrix with 101 bipolar disorder subjects and 106 controls. A standard linear mixed-effects model was used to calculate the differentially expressed genes (DEGs). Multiple levels of sensitivity analyses and cross validation with genetic data were conducted. Functional and network analyses were carried out on the basis of the DEGs. As a result, we identified 198 unique differentially expressed genes between the PFC of bipolar disorder subjects and controls. Among them, 115 DEGs were robust to at least three leave-one-out tests or different pre-processing methods; 51 DEGs were validated with genetic association signals. Pathway enrichment analysis showed these DEGs were related to regulation of the neurological system, cell death and apoptosis, and several basic binding processes. The protein-protein interaction network further identified one key hub gene. We have contributed the most comprehensive integrated analysis of bipolar disorder expression profiling studies in PFC to date. The DEGs, especially those with multiple validations, may denote a common signature of bipolar disorder and contribute to the pathogenesis of disease.

  11. Genome-Wide Association Analysis of the Anthocyanin and Carotenoid Contents of Rose Petals

    PubMed Central

    Schulz, Dietmar F.; Schott, Rena T.; Voorrips, Roeland E.; Smulders, Marinus J. M.; Linde, Marcus; Debener, Thomas

    2016-01-01

    Petal color is one of the key characteristics determining the attractiveness and therefore the commercial value of an ornamental crop. Here, we present the first genome-wide association study for the important ornamental crop rose, focusing on the anthocyanin and carotenoid contents in petals of 96 diverse tetraploid garden rose genotypes. Cultivated roses display a vast phenotypic and genetic diversity and are therefore ideal targets for association genetics. For marker analysis, we used a recently designed Axiom SNP chip comprising 68,000 SNPs with additionally 281 SSRs, 400 AFLPs and 246 markers from candidate genes. An analysis of the structure of the rose population revealed three subpopulations with most of the genetic variation between individual genotypes rather than between clusters and with a high average proportion of heterozygous loci. The mapping of markers significantly associated with anthocyanin and carotenoid content to the related Fragaria and Prunus genomes revealed clusters of associated markers indicating five genomic regions associated with the total anthocyanin content and two large clusters associated with the carotenoid content. Among the marker clusters associated with the phenotypes, we found several candidate genes with known functions in either the anthocyanin or the carotenoid biosynthesis pathways. Among others, we identified a glutathione-S-transferase, 4CL, an auxin response factor and F3'H as candidate genes affecting anthocyanin concentration, and CCD4 and Zeaxanthine epoxidase as candidates affecting the concentration of carotenoids. These markers are starting points for future validation experiments in independent populations as well as for functional genomic studies to identify the causal factors for the observed color phenotypes. Furthermore, validated markers may be interesting tools for marker-assisted selection in commercial breeding programmes in that they provide the tools to identify superior parental combinations that

  12. Genome-wide variant analysis of simplex autism families with an integrative clinical-bioinformatics pipeline

    PubMed Central

    Jiménez-Barrón, Laura T.; O'Rawe, Jason A.; Wu, Yiyang; Yoon, Margaret; Fang, Han; Iossifov, Ivan; Lyon, Gholson J.

    2015-01-01

    Autism spectrum disorders (ASDs) are a group of developmental disabilities that affect social interaction and communication and are characterized by repetitive behaviors. There is now a large body of evidence that suggests a complex role of genetics in ASDs, in which many different loci are involved. Although many current population-scale genomic studies have been demonstrably fruitful, these studies generally focus on analyzing a limited part of the genome or use a limited set of bioinformatics tools. These limitations preclude the analysis of genome-wide perturbations that may contribute to the development and severity of ASD-related phenotypes. To overcome these limitations, we have developed and utilized an integrative clinical and bioinformatics pipeline for generating a more complete and reliable set of genomic variants for downstream analyses. Our study focuses on the analysis of three simplex autism families consisting of one affected child, unaffected parents, and one unaffected sibling. All members were clinically evaluated and widely phenotyped. Genotyping arrays and whole-genome sequencing were performed on each member, and the resulting sequencing data were analyzed using a variety of available bioinformatics tools. We searched for rare variants of putative functional impact that were found to be segregating according to de novo, autosomal recessive, X-linked, mitochondrial, and compound heterozygote transmission models. The resulting candidate variants included three small heterozygous copy-number variations (CNVs), a rare heterozygous de novo nonsense mutation in MYBBP1A located within exon 1, and a novel de novo missense variant in LAMB3. Our work demonstrates how more comprehensive analyses that include rich clinical data and whole-genome sequencing data can generate reliable results for use in downstream investigations. PMID:27148569

  13. Genome-Wide Analysis of miRNA targets in Brachypodium and Biomass Energy Crops

    SciTech Connect

    Green, Pamela J.

    2015-08-11

    MicroRNAs (miRNAs) contribute to the control of numerous biological processes through the regulation of specific target mRNAs. Although the identities of these targets are essential to elucidate miRNA function, the targets are much more difficult to identify than the small RNAs themselves. Before this work, we pioneered the genome-wide identification of the targets of Arabidopsis miRNAs using an approach called PARE (German et al., Nature Biotech. 2008; Nature Protocols, 2009). Under this project, we applied PARE to Brachypodium distachyon (Brachypodium), a model plant in the Poaceae family, which includes the major food grain and bioenergy crops. Through in-depth global analysis and examination of specific examples, this research greatly expanded our knowledge of miRNAs and target RNAs of Brachypodium. New regulation in response to environmental stress or tissue type was found, and many new miRNAs were discovered. More than 260 targets of new and known miRNAs with PARE sequences at the precise sites of miRNA-guided cleavage were identified and characterized. Combining PARE data with the small RNA data also identified the miRNAs responsible for initiating approximately 500 phased loci, including one of the novel miRNAs. PARE analysis also revealed that differentially expressed miRNAs in the same family guide specific target RNA cleavage in a correspondingly tissue-preferential manner. The project included generation of small RNA and PARE resources for bioenergy crops, to facilitate ongoing discovery of conserved miRNA-target RNA regulation. By associating specific miRNA-target RNA pairs with known physiological functions, the research provides insights about gene regulation in different tissues and in response to environmental stress. This, and release of new PARE and small RNA data sets should contribute basic knowledge to enhance breeding and may suggest new strategies for improvement of biomass energy crops.

  14. Genome-wide common and rare variant analysis provides novel insights into clozapine-associated neutropenia.

    PubMed

    Legge, S E; Hamshere, M L; Ripke, S; Pardinas, A F; Goldstein, J I; Rees, E; Richards, A L; Leonenko, G; Jorskog, L F; Chambert, K D; Collier, D A; Genovese, G; Giegling, I; Holmans, P; Jonasdottir, A; Kirov, G; McCarroll, S A; MacCabe, J H; Mantripragada, K; Moran, J L; Neale, B M; Stefansson, H; Rujescu, D; Daly, M J; Sullivan, P F; Owen, M J; O'Donovan, M C; Walters, J T R

    2016-07-12

    The antipsychotic clozapine is uniquely effective in the management of schizophrenia; however, its use is limited by its potential to induce agranulocytosis. The causes of this, and of its precursor neutropenia, are largely unknown, although genetic factors have an important role. We sought risk alleles for clozapine-associated neutropenia in a sample of 66 cases and 5583 clozapine-treated controls, through a genome-wide association study (GWAS), imputed human leukocyte antigen (HLA) alleles, exome array and copy-number variation (CNV) analyses. We then combined associated variants in a meta-analysis with data from the Clozapine-Induced Agranulocytosis Consortium (up to 163 cases and 7970 controls). In the largest combined sample to date, we identified a novel association with rs149104283 (odds ratio (OR)=4.32, P=1.79 × 10(-8)), intronic to transcripts of SLCO1B3 and SLCO1B7, members of a family of hepatic transporter genes previously implicated in adverse drug reactions including simvastatin-induced myopathy and docetaxel-induced neutropenia. Exome array analysis identified gene-wide associations of uncommon non-synonymous variants within UBAP2 and STARD9. We additionally provide independent replication of a previously identified variant in HLA-DQB1 (OR=15.6, P=0.015, positive predictive value=35.1%). These results implicate biological pathways through which clozapine may act to cause this serious adverse effect. Molecular Psychiatry advance online publication, 12 July 2016; doi:10.1038/mp.2016.97.

  15. Genome-Wide Identification, Characterization and Expression Analysis of the TCP Gene Family in Prunus mume

    PubMed Central

    Zhou, Yuzhen; Xu, Zongda; Zhao, Kai; Yang, Weiru; Cheng, Tangren; Wang, Jia; Zhang, Qixiang

    2016-01-01

    TCP proteins, belonging to a plant-specific transcription factors family, are known to have great functions in plant development, especially flower and leaf development. However, there is little information about this gene family in Prunus mume, which is widely cultivated in China as an ornamental and fruit tree. Here a genome-wide analysis of TCP genes was performed to explore their evolution in P. mume. Nineteen PmTCPs were identified and three of them contained putative miR319 target sites. Phylogenetic and comprehensive bioinformatics analyses of these genes revealed that different types of TCP genes had undergone different evolutionary processes and the genes in the same clade had similar chromosomal location, gene structure, and conserved domains. Expression analysis of these PmTCPs indicated that there were diverse expression patterns among different clades. Most TCP genes were predominantly expressed in flower, leaf, and stem, and showed high expression levels in the different stages of flower bud differentiation, especially in petal formation stage and gametophyte development. Genes in TCP-P subfamily had main roles in both flower development and gametophyte development. The CIN genes in double petal cultivars might have key roles in the formation of petal, while they were correlated with gametophyte development in the single petal cultivar. The CYC/TB1 type genes were highly detected in the formation of petal and pistil. The less-complex flower types of P. mume might result from the fact that there were only two CYC type genes present in P. mume and a lack of CYC2 genes to control the identity of flower types. These results lay the foundation for further study on the functions of TCP genes during flower development. PMID:27630648

  16. Transport genes and chemotaxis in Laribacter hongkongensis: a genome-wide analysis

    PubMed Central

    2011-01-01

    Background: Laribacter hongkongensis is a Gram-negative, sea gull-shaped rod associated with community-acquired gastroenteritis. The bacterium has been found in diverse freshwater environments including fish, frogs and drinking water reservoirs. Using the complete genome sequence data of L. hongkongensis, we performed a comprehensive analysis of putative transport-related genes and genes related to chemotaxis, motility and quorum sensing, which may help the bacterium adapt to the changing environments and combat harmful substances. Results: A genome-wide analysis using the Transport Classification Database (TCDB), similarity and keyword searches revealed the presence of a large diversity of transporters (n = 457) and genes related to chemotaxis (n = 52) and flagellar biosynthesis (n = 40) in the L. hongkongensis genome. The transporters included those from all seven major transporter categories, which may allow the uptake of essential nutrients or ions, and extrusion of metabolic end products and hazardous substances. L. hongkongensis is unique among closely related members of the Neisseriaceae family in possessing a higher number of proteins related to transport of ammonium, urea and dicarboxylate, which may reflect the importance of nitrogen and dicarboxylate metabolism in this asaccharolytic bacterium. Structural modeling of two C4-dicarboxylate transporters showed that they possessed similar structures to the determined structures of other DctP-TRAP transporters, with one having an unusual disulfide bond. Diverse mechanisms for iron transport, including hemin transporters for iron acquisition from host proteins, were also identified. In addition to the chemotaxis and flagella-related genes, the L. hongkongensis genome also contained two copies of qseB/qseC homologues of the AI-3 quorum sensing system. Conclusions: The large number of diverse transporters and genes involved in chemotaxis, motility and quorum sensing suggested that the bacterium may utilize a complex system to

  17. Genome wide association analysis for seedling response traits to thermal stress in sorghum germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The sorghum association panel exhibited extensive variation for seedling traits under cold and heat stress. Genome-wide analyses identified thirty single nucleotide polymorphisms (SNPs) that were strongly associated with traits measured at seedling stage under cold stress and tagged genes that act a...

  18. A population structure and genome-wide association analysis on the USDA soybean germplasm collection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genotype-phenotype associations within the soybean (Glycine max) germplasm collection could provide valuable information on the frequency and distribution of alleles affecting economically important traits. Here we performed a genome-wide association study (GWAS) for seed protein and oil content in ...

  19. Genome-wide association study of swine farrowing traits. Part II: Bayesian analysis of marker data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Reproductive efficiency has a great impact on the economic success of pork production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome wide association study (GWAS) w...

  20. Implementing meta-analysis from genome-wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality plays an important role in the meat processing industry, thus different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. Howe...

  1. Implementing Meta-analysis for genome-wide association studies of pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is a critical concern in the meat industry. Implementation of genome-wide association studies (GWA) allows identification of genomic regions that explain a substantial portion of the variation of relevant traits. It is also important to determine the consistency of results of GWA across...

  2. Meta-analysis of genome-wide association from genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  3. Genome-wide association analysis for drought tolerance and associated traits in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for drought tolerance and related traits in a Middle American diversity panel comprised of 96 common bean (Phaseolus vulgaris) genotypes. The panel grown under irrigated and rainfed conditions and single n...

  4. Genome-wide CNV analysis reveals variants associated with growth traits in Bos indicus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Apart from single nucleotide polymorphism (SNP), copy number variation (CNV) is another important type of genetic variation, which may affect growth traits and play key roles for the production of beef cattle. To date, no genome-wide association study (GWAS) for CNV and body traits in be...

  5. Genome-wide association analysis of symbiotic nitrogen fixation in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for symbiotic nitrogen fixation (SNF) and related traits in the Andean diversity panel (ADP) comprised of 259 common bean (Phaseolus vulgaris) genotypes. The ADP was evaluated for SNF and related traits in...

  6. Genome wide transcriptional profile analysis of Vitis amurensis and Vitis vinifera in response to cold stress.

    PubMed

    Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P; Li, Shaohua

    2013-01-01

    Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate the genome-wide modification of the transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as control to help investigate the distinctive features of V. amurensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags into V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amurensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicates that V. amurensis has fewer differentially expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes provide fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between two kinds of grapevine, indicating that cold stress may trigger species specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription were more abundant in V. amurensis. Several highly expressed transcripts that were found uniquely accumulated in V. amurensis are discussed in detail. This subset of unique candidate

  7. Genome-wide copy number variation analysis in a Chinese autism spectrum disorder cohort

    PubMed Central

    Guo, Hui; Peng, Yu; Hu, Zhengmao; Li, Ying; Xun, Guanglei; Ou, Jianjun; Sun, Liangdan; Xiong, Zhimin; Liu, Yanling; Wang, Tianyun; Chen, Jingjing; Xia, Lu; Bai, Ting; Shen, Yidong; Tian, Qi; Hu, Yiqiao; Shen, Lu; Zhao, Rongjuan; Zhang, Xuejun; Zhang, Fengyu; Zhao, Jingping; Zou, Xiaobing; Xia, Kun

    2017-01-01

    Autism spectrum disorder (ASD) describes a group of neurodevelopmental disorders with high heritability, although the underlying genetic determinants of ASDs remain largely unknown. Large-scale whole-genome studies of copy number variation in Han Chinese samples are still lacking. We performed a genome-wide copy number variation analysis of 343 ASD trios, 203 patients with sporadic cases and 988 controls in a Chinese population using Illumina genotyping platforms to identify CNVs and related genes that may contribute to ASD risk. We identified 32 rare CNVs larger than 1 Mb in 31 patients. ASD patients were found to carry a higher global burden of rare, large CNVs than controls. Recurrent de novo or case-private CNVs were found at 15q11–13, Xp22.3, 15q13.1–13.2, 3p26.3 and 2p12. The de novo 15q11–13 duplication was more prevalent in this Chinese population than in those with European ancestry. Several genes, including GRAMD2 and STAM, were implicated as novel ASD risk genes when integrating whole-genome CNVs and whole-exome sequencing data. We also identified several CNVs that include known ASD genes (SHANK3, CDH10, CSMD1) or genes involved in nervous system development (NYAP2, ST6GAL2, GRM6). Our study also implicated the Contactins-NYAPs-WAVE1 pathway in ASD pathogenesis. Our findings identify ASD-related CNVs in a Chinese population and implicate novel ASD risk genes and a related pathway for further study. PMID:28281572

  8. Genome-Wide Pathway Analysis Identifies Genetic Pathways Associated with Psoriasis.

    PubMed

    Aterido, Adrià; Julià, Antonio; Ferrándiz, Carlos; Puig, Lluís; Fonseca, Eduardo; Fernández-López, Emilia; Dauden, Esteban; Sánchez-Carazo, José Luís; López-Estebaranz, José Luís; Moreno-Ramírez, David; Vanaclocha, Francisco; Herrera, Enrique; de la Cueva, Pablo; Dand, Nick; Palau, Núria; Alonso, Arnald; López-Lasanta, María; Tortosa, Raül; García-Montero, Andrés; Codó, Laia; Gelpí, Josep Lluís; Bertranpetit, Jaume; Absher, Devin; Capon, Francesca; Myers, Richard M; Barker, Jonathan N; Marsal, Sara

    2016-03-01

    Psoriasis is a chronic inflammatory disease with a complex genetic architecture. To date, the psoriasis heritability is only partially explained. However, there is increasing evidence that the missing heritability in psoriasis could be explained by multiple genetic variants of low effect size from common genetic pathways. The objective of this study was to identify new genetic variation associated with psoriasis risk at the pathway level. We genotyped 598,258 single nucleotide polymorphisms in a discovery cohort of 2,281 case-control individuals from Spain. We performed a genome-wide pathway analysis using 1,053 reference biological pathways. A total of 14 genetic pathways (PFDR ≤ 2.55 × 10(-2)) were found to be significantly associated with psoriasis risk. Using an independent validation cohort of 7,353 individuals from the UK, a total of 6 genetic pathways were significantly replicated (PFDR ≤ 3.46 × 10(-2)). We found genetic pathways that had not been previously associated with psoriasis risk such as retinol metabolism (Pcombined = 1.84 × 10(-4)), the transport of inorganic ions and amino acids (Pcombined = 1.57 × 10(-7)), and post-translational protein modification (Pcombined = 1.57 × 10(-7)). In the latter pathway, MGAT5 showed a strong network centrality, and its association with psoriasis risk was further validated in an additional case-control cohort of 3,429 individuals (P < 0.05). These findings provide insights into the biological mechanisms associated with psoriasis susceptibility.
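    The pathway-level thresholds above are reported as false discovery rate (FDR) adjusted P-values. The abstract does not say which FDR procedure was used, so the following is only a minimal sketch that assumes the Benjamini-Hochberg step-up procedure. The pathway names and P-values are hypothetical and are not taken from the study.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical pathway-level p-values (illustrative only, not study data).
pathway_p = {
    "pathway_A": 0.0001,
    "pathway_B": 0.004,
    "pathway_C": 0.03,
    "pathway_D": 0.5,
}
names = list(pathway_p)
q_values = benjamini_hochberg([pathway_p[n] for n in names])

# Keep pathways whose adjusted p-value passes an assumed 5% FDR cutoff.
significant = [n for n, q in zip(names, q_values) if q <= 0.05]
print(significant)
```

    With real data, the same adjustment would be applied to all 1,053 pathway-level P-values at once, since the correction depends on the total number of tests.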

  9. Identification of Six Loci Associated With Pelvic Organ Prolapse Using Genome-Wide Association Analysis

    PubMed Central

    Allen-Brady, Kristina; Cannon-Albright, Lisa; Farnham, James M.; Teerlink, Craig; Vierhout, Mark E.; van Kempen, Léon C.L.; Kluivers, Kirsten B.; Norton, Peggy A.

    2011-01-01

    Objective: There is evidence that both environmental and genetic factors contribute to pelvic organ prolapse (POP). We conducted a genome-wide association study to investigate whether common genetic variants modify the risk of POP. Methods: We recruited women who had been evaluated and treated for POP at the University of Utah from 1996–2008 and their affected female relatives. Cases were genotyped on the Illumina 550K platform. We genetically matched 2,976 Caucasian iControls available from Illumina as controls. Association tests were adjusted for related subjects using two different software programs: Efficient Mixed-Model Association eXpedited (EMMAX) and Genie. Confirmation of findings was performed in a cohort of Dutch women (n=76) with recurrent POP and family history of POP. Results: The Utah study sample included 115 cases treated for POP, in most cases with surgery (n=78) or repeat surgery (n=35). Results from association analyses using EMMAX software identified five single nucleotide polymorphisms (SNPs) significantly associated with POP (p<1×10^−7). Independent association analysis with Genie software identified three of the same SNPs and one additional SNP. The six SNPs were located at 4q21 (rs1455311), 8q24 (rs1036819), 9q22 (rs430794), 15q11 (rs8027714), 20p13 (rs1810636), and 21q22 (rs2236479). Nominally significant findings (p<0.05) or findings trending towards significance (p<0.1) were observed for five of the six SNPs in the Dutch cohort. Conclusion: Six SNPs have been identified that are significantly associated with POP in high-risk familial cases and provide evidence for a genetic contribution to POP. PMID:22105264

  10. Genome Wide Transcriptional Profile Analysis of Vitis amurensis and Vitis vinifera in Response to Cold Stress

    PubMed Central

    Xin, Haiping; Zhu, Wei; Wang, Lina; Xiang, Yue; Fang, Linchuan; Li, Jitao; Sun, Xiaoming; Wang, Nian; Londo, Jason P.; Li, Shaohua

    2013-01-01

    Grape is one of the most important fruit crops worldwide. The suitable geographical locations and productivity of grapes are largely limited by temperature. Vitis amurensis is a wild grapevine species with remarkable cold-tolerance, exceeding that of Vitis vinifera, the dominant cultivated species of grapevine. However, the molecular mechanisms that contribute to the enhanced freezing tolerance of V. amurensis remain unknown. Here we used deep sequencing data from restriction endonuclease-generated cDNA fragments to evaluate the genome-wide modification of the transcriptome of V. amurensis under cold treatment. Vitis vinifera cv. Muscat of Hamburg was used as control to help investigate the distinctive features of V. amurensis in responding to cold stress. Approximately 9 million tags were sequenced from non-cold treatment (NCT) and cold treatment (CT) cDNA libraries in each species of grapevine sampled from shoot apices. Alignment of tags into V. vinifera cv. Pinot noir (PN40024) annotated genome identified over 15,000 transcripts in each library in V. amurensis and more than 16,000 in Muscat of Hamburg. Comparative analysis between NCT and CT libraries indicates that V. amurensis has fewer differentially expressed genes (DEGs, 1314 transcripts) than Muscat of Hamburg (2307 transcripts) when exposed to cold stress. Common DEGs (408 transcripts) suggest that some genes provide fundamental roles during cold stress in grapes. The most robust DEGs (more than 20-fold change) also demonstrated significant differences between two kinds of grapevine, indicating that cold stress may trigger species specific pathways in V. amurensis. Functional categories of DEGs indicated that the proportion of up-regulated transcripts related to metabolism, transport, signal transduction and transcription were more abundant in V. amurensis. Several highly expressed transcripts that were found uniquely accumulated in V. amurensis are discussed in detail. This subset of unique candidate

  11. Genome-wide linkage analysis and physical mapping of the rippling muscle disease gene

    SciTech Connect

    Stephan, D.A.; Buist, N.R.M.; Bhaskar, A.C.

    1994-09-01

    Rippling muscle disease (RMD) is an inherited disorder of skeletal muscle in which mechanical stimuli provoke electrically silent contractions. The patient's symptoms are muscle cramps, pain, and stiffness, particularly during or following exercise. Clinical signs are balling of muscle following percussion, and a characteristic lateral rolling movement of muscle occurring after contraction followed by stretching. We report a new 44-member pedigree segregating RMD as an autosomal dominant trait. A genome-wide genetic linkage study in this family, using a novel approach of testing closely spaced highly polymorphic markers in affected individuals, localized the responsible gene to the distal end of the long arm of chromosome 1 with a maximum multi-point lod score of 3.56 (θ = 0). In this family, RMD is localized to a 6 cM region near D1S235. Physical mapping of the linked region yielded several positive YAC clones, one of which spans the entire 6 cM distance. Several candidate genes not present in the YAC contig, but in the region of 1q4, have been excluded as causative by either linkage analysis of intragenic microsatellite repeats (alpha-actinin, angiotensinogen) or by SSCP of exons (skeletal muscle alpha-actinin). We studied two previously reported German families for linkage to the same locus and this same area did not co-segregate with the disease, a finding that shows that different genetic defects can cause a similar clinical phenotype (genetic heterogeneity). An understanding of the defect in contraction control within the muscle fibers in this disease may lead to a better understanding of muscle force transduction, intracellular calcium homeostasis, or both.
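    As a reading aid for the lod score reported above, the standard two-point definition is sketched here. The abstract reports a multi-point score, which is computed analogously over several markers; the likelihoods themselves are not given in the abstract, so only the conversion of the reported value to an odds ratio is worked out.

```latex
% LOD score: log10 of the likelihood ratio of linkage at recombination
% fraction \theta versus no linkage (\theta = 0.5).
\[
  \mathrm{LOD}(\theta) \;=\; \log_{10}\frac{L(\theta)}{L(\theta = 0.5)}
\]
% The reported maximum of 3.56 at \theta = 0 therefore corresponds to odds of
\[
  10^{3.56} \;\approx\; 3.6 \times 10^{3} : 1
\]
% in favour of linkage, above the conventional threshold of
% \mathrm{LOD} \ge 3 (odds of 1000:1).
```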

  12. Genome-wide methylation analysis in Silver-Russell syndrome patients

    PubMed Central

    Böhm, S; Frost, JM; Puszyk, W; Abu-Amero, S; Stanier, P; Schulz, R; Moore, GE; Oakey, RJ

    2015-01-01

    Silver-Russell Syndrome (SRS) is a clinically heterogeneous disorder characterised by severe in utero growth restriction and poor postnatal growth, body asymmetry, irregular craniofacial features and several additional minor malformations. The aetiology of SRS is complex and current evidence strongly implicates imprinted genes. Approximately half of all patients exhibit DNA hypomethylation at the H19/IGF2 imprinted domain, and around 10% have maternal uniparental disomy of chromosome 7. We measured DNA methylation in 18 SRS patients at >485,000 CpG sites using DNA methylation microarrays. Using a novel bioinformatics methodology specifically designed to identify subsets of patients with a shared epimutation, we analysed methylation changes genome-wide as well as at known imprinted regions to identify SRS-associated epimutations. Our analysis identifies epimutations at the previously characterised domains of H19/IGF2 and at imprinted regions on chromosome 7, providing proof of principle that our methodology can detect DNA methylation changes at imprinted loci. In addition we discovered two novel epimutations associated with SRS and located at imprinted loci previously linked to relevant mouse and human phenotypes. We identify RB1 as an additional imprinted locus associated with SRS, with a region near the RB1 DMR hypermethylated in 13/18 (~70%) patients. We also report that 6/18 (~33%) patients were hypermethylated at a CpG island near the ANKRD11 gene. We do not observe consistent co-occurrence of epimutations at multiple imprinted loci in single SRS individuals. SRS is clinically heterogeneous and the absence of multiple imprinted loci epimutations reflects the heterogeneity at the molecular level. Further stratification of SRS patients by molecular phenotypes might aid the identification of disease causes. PMID:25563730

  13. Bivariate Genome-Wide Association Analysis of the Growth and Intake Components of Feed Efficiency

    PubMed Central

    Beever, Jonathan E.; Bollero, Germán A.; Southey, Bruce R.; Faulkner, Daniel B.; Rodriguez-Zas, Sandra L.

    2013-01-01

    Single nucleotide polymorphisms (SNPs) associated with average daily gain (ADG) and dry matter intake (DMI), two major components of feed efficiency in cattle, were identified in a genome-wide association study (GWAS). Uni- and multi-SNP models were used to describe feed efficiency in a training data set and the results were confirmed in a validation data set. Results from the univariate and bivariate analyses of ADG and DMI, adjusted by the feedlot beef steer maintenance requirements, were compared. The bivariate uni-SNP analysis identified (P-value <0.0001) 11 SNPs, whereas the univariate analyses of ADG and DMI identified 8 and 9 SNPs, respectively. Among the six SNPs confirmed in the validation data set, five SNPs were mapped to KDELC2, PHOX2A, and TMEM40. Findings from the uni-SNP models were used to develop highly accurate predictive multi-SNP models in the training data set. Despite the substantially smaller size of the validation data set, the training multi-SNP models had slightly lower predictive ability when applied to the validation data set. Six Gene Ontology molecular functions related to ion transport activity were enriched (P-value <0.001) among the genes associated with the detected SNPs. The findings from this study demonstrate the complementary value of the uni- and multi-SNP models, and univariate and bivariate GWAS analyses. The identified SNPs can be used for genome-enabled improvement of feed efficiency in feedlot beef cattle, and can aid in the design of empirical studies to further confirm the associations. PMID:24205251

  14. Genome wide analysis of transcript levels after perturbation of the EGFR pathway in the Drosophila ovary.

    PubMed

    Jordan, Katherine C; Hatfield, Steven D; Tworoger, Michael; Ward, Ellen J; Fischer, Karin A; Bowers, Stuart; Ruohola-Baker, Hannele

    2005-03-01

    Defects in the epidermal growth factor receptor (EGFR) pathway can lead to aggressive tumor formation. Activation of this pathway during normal development produces multiple outcomes at the cellular level, leading to cellular differentiation and cell cycle activation. To elucidate the downstream events induced by this pathway, we used genome-wide cDNA microarray technology to identify potential EGFR targets in Drosophila oogenesis. We focused on genes for which the transcriptional responses due to EGFR pathway activation and inactivation were in opposite directions, as this is expected for genes that are directly regulated by the pathway in this tissue type. We perturbed the EGFR pathway in epithelial follicle cells using seven different genetic backgrounds. To activate the pathway, we overexpressed an activated form of the EGFR (UAS-caEGFR), and an activated form of the signal transducer Raf (UAS-caRaf); we also over- or ectopically expressed the downstream homeobox transcription factor Mirror (UAS-mirr) and the ligand-activating serine protease Rhomboid (UAS-rho). To reduce pathway activity we used loss-of-function mutations in the ligand (gurken) and receptor (torpedo). From microarrays containing 6,255 genes, we found 454 genes that responded in an opposite manner in gain-of-function and loss-of-function conditions among which are many Wingless signaling pathway components. Further analysis of two such components, sugarless and pangolin, revealed a function for these genes in late follicle cell patterning. Of interest, components of other signaling pathways were also enriched in the EGFR target group, suggesting that one reason for the pleiotropic effects seen with EGFR activity in cancer progression and development may be its ability to regulate many other signaling pathways.
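    The selection rule described above, keeping genes whose transcript levels move in opposite directions under pathway activation and inactivation, can be sketched as a simple filter. This is an illustration only: the gene names, log2 fold-change values and the cutoff are hypothetical, and the abstract does not state the thresholds or statistics the authors actually used.

```python
# Hypothetical log2 fold changes relative to wild type (illustrative values,
# not data from the study). "gain" = EGFR pathway activated,
# "loss" = EGFR pathway reduced.
log2_fold_change = {
    "gene_1": {"gain": 1.8, "loss": -1.2},   # up on activation, down on loss
    "gene_2": {"gain": 1.5, "loss": 1.1},    # same direction in both
    "gene_3": {"gain": -2.0, "loss": 1.4},   # down on activation, up on loss
    "gene_4": {"gain": 0.2, "loss": -0.1},   # below the cutoff in both
}

THRESHOLD = 1.0  # assumed cutoff on |log2 fold change|


def opposite_responders(table, threshold):
    """Return genes that change in opposite directions in the two conditions."""
    hits = []
    for gene, fc in table.items():
        up_then_down = fc["gain"] >= threshold and fc["loss"] <= -threshold
        down_then_up = fc["gain"] <= -threshold and fc["loss"] >= threshold
        if up_then_down or down_then_up:
            hits.append(gene)
    return hits


candidates = opposite_responders(log2_fold_change, THRESHOLD)
print(candidates)
```

    Requiring opposite signs in both conditions discards genes that respond to the genetic manipulation itself rather than to the level of pathway activity, which is the rationale the abstract gives for the filter.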

  15. Genome-wide analysis of WRKY transcription factors in Solanum lycopersicum.

    PubMed

    Huang, Shengxiong; Gao, Yongfeng; Liu, Jikai; Peng, Xiaoli; Niu, Xiangli; Fei, Zhangjun; Cao, Shuqing; Liu, Yongsheng

    2012-06-01

    The WRKY transcription factors have been implicated in multiple biological processes in plants, especially in regulating defense against biotic and abiotic stresses. However, little information is available about the WRKYs in tomato (Solanum lycopersicum). The recent release of the whole-genome sequence of tomato allowed us to perform a genome-wide investigation for tomato WRKY proteins, and to compare these positively identified proteins with their orthologs in model plants, such as Arabidopsis and rice. In the present study, based on the recently released tomato whole-genome sequences, we identified 81 SlWRKY genes that were classified into three main groups, with the second group further divided into five subgroups. Based on WRKY domain sequences derived from tomato, Arabidopsis and rice, construction of a phylogenetic tree demonstrated distinct clustering and unique gene expansion of WRKY genes among the three species. Genome mapping analysis revealed that tomato WRKY genes were enriched on several chromosomes, especially on chromosome 5, and 16% of the family members were tandemly duplicated genes. The tomato WRKYs from each group were shown to share similar motif compositions. Furthermore, tomato WRKY genes showed distinct temporal and spatial expression patterns in different developmental processes and in response to various biotic and abiotic stresses. The expression of 18 selected tomato WRKY genes in response to drought and salt stresses and Pseudomonas syringae invasion, respectively, was validated by quantitative RT-PCR. Our results will provide a platform for functional identification and molecular breeding study of WRKY genes in tomato and probably other Solanaceae plants.

  16. Genome-wide association analysis of eating disorder-related symptoms, behaviors, and personality traits.

    PubMed

    Boraska, Vesna; Davis, Oliver S P; Cherkas, Lynn F; Helder, Sietske G; Harris, Juliette; Krug, Isabel; Liao, Thomas Pei-Chi; Treasure, Janet; Ntalla, Ioanna; Karhunen, Leila; Keski-Rahkonen, Anna; Christakopoulou, Danai; Raevuori, Anu; Shin, So-Youn; Dedoussis, George V; Kaprio, Jaakko; Soranzo, Nicole; Spector, Tim D; Collier, David A; Zeggini, Eleftheria

    2012-10-01

    Eating disorders (EDs) are common, complex psychiatric disorders thought to be caused by both genetic and environmental factors. They share many symptoms, behaviors, and personality traits, which may have overlapping heritability. The aim of the present study is to perform a genome-wide association scan (GWAS) of six ED phenotypes comprising three symptom traits from the Eating Disorders Inventory 2 [Drive for Thinness (DT), Body Dissatisfaction (BD), and Bulimia], Weight Fluctuation symptom, Breakfast Skipping behavior and Childhood Obsessive-Compulsive Personality Disorder trait (CHIRP). Investigated traits were derived from standardized self-report questionnaires completed by the TwinsUK population-based cohort. We tested 283,744 directly typed SNPs across six phenotypes of interest in the TwinsUK discovery dataset and followed-up signals from various strata using a two-stage replication strategy in two independent cohorts of European ancestry. We meta-analyzed a total of 2,698 individuals for DT, 2,680 for BD, 2,789 (821 cases/1,968 controls) for Bulimia, 1,360 (633 cases/727 controls) for Childhood Obsessive-Compulsive Personality Disorder trait, 2,773 (761 cases/2,012 controls) for Breakfast Skipping, and 2,967 (798 cases/2,169 controls) for Weight Fluctuation symptom. In this GWAS analysis of six ED-related phenotypes, we detected association of eight genetic variants with P < 10⁻⁵. Genetic variants that showed suggestive evidence of association were previously associated with several psychiatric disorders and ED-related phenotypes. Our study indicates that larger-scale collaborative studies will be needed to achieve the necessary power to detect loci underlying ED-related traits.

  17. Genome-Wide Analysis in Brazilians Reveals Highly Differentiated Native American Genome Regions.

    PubMed

    Mychaleckyj, Josyf C; Havt, Alexandre; Nayak, Uma; Pinkerton, Relana; Farber, Emily; Concannon, Patrick; Lima, Aldo A; Guerrant, Richard L

    2017-03-01

    Despite its population, geographic size, and emerging economic importance, disproportionately little genome-scale research exists into genetic factors that predispose Brazilians to disease, or the population genetics of risk. After identification of suitable proxy populations and careful analysis of tri-continental admixture in 1,538 North-Eastern Brazilians to estimate individual ancestry and ancestral allele frequencies, we computed 400,000 genome-wide locus-specific branch length (LSBL) Fst statistics of Brazilian Amerindian ancestry compared to European and African; and a similar set of differentiation statistics for their Amerindian component compared with the closest Asian 1000 Genomes population (surprisingly, Bengalis in Bangladesh). After ranking SNPs by these statistics, we identified the top 10 highly differentiated SNPs in five genome regions in the LSBL tests of Brazilian Amerindian ancestry compared to European and African; and the top 10 SNPs in eight regions comparing their Amerindian component to the closest Asian 1000 Genomes population. We found SNPs within or proximal to the genes CIITA (rs6498115), SMC6 (rs1834619), and KLHL29 (rs2288697) were most differentiated in the Amerindian-specific branch, while SNPs in the genes ADAMTS9 (rs7631391), DOCK2 (rs77594147), SLC28A1 (rs28649017), ARHGAP5 (rs7151991), and CIITA (rs45601437) were most highly differentiated in the Asian comparison. These genes are known to influence immune function, metabolic and anthropometry traits, and embryonic development. These analyses have identified candidate genes for selection within Amerindian ancestry, and by comparison of the two analyses, those for which the differentiation may have arisen during the migration from Asia to the Americas.

  18. Genome-wide transcript analysis of maize hybrids: allelic additive gene expression and yield heterosis.

    PubMed

    Guo, Mei; Rupe, Mary A; Yang, Xiaofeng; Crasta, Oswald; Zinselmeier, Christopher; Smith, Oscar S; Bowen, Ben

    2006-09-01

    Heterosis, or hybrid vigor, has been widely exploited in plant breeding for many decades, but the molecular mechanisms underlying the phenomenon remain unknown. In this study, we applied genome-wide transcript profiling to gain a global picture of the ways in which a large proportion of genes are expressed in the immature ear tissues of a series of 16 maize hybrids that vary in their degree of heterosis. Key observations include: (1) the proportion of allelic additively expressed genes is positively associated with hybrid yield and heterosis; (2) the proportion of genes that exhibit a bias towards the expression level of the paternal parent is negatively correlated with hybrid yield and heterosis; and (3) there is no correlation between the over- or under-expression of specific genes in maize hybrids with either yield or heterosis. The relationship of the expression patterns with hybrid performance is substantiated by analysis of a genetically improved modern hybrid (Pioneer hybrid 3394) versus a less improved older hybrid (Pioneer hybrid 3306) grown at different levels of plant density stress. The proportion of allelic additively expressed genes is positively associated with the modern high yielding hybrid, heterosis and high yielding environments, whereas the converse is true for the paternally biased gene expression. The dynamic changes of gene expression in hybrids responding to genotype and environment may result from differential regulation of the two parental alleles. Our findings suggest that differential allele regulation may play an important role in hybrid yield or heterosis, and provide a new insight to the molecular understanding of the underlying mechanisms of heterosis.

  19. DNA Methylation in Newborns and Maternal Smoking in Pregnancy: Genome-wide Consortium Meta-analysis

    PubMed Central

    Joubert, Bonnie R.; Felix, Janine F.; Yousefi, Paul; Bakulski, Kelly M.; Just, Allan C.; Breton, Carrie; Reese, Sarah E.; Markunas, Christina A.; Richmond, Rebecca C.; Xu, Cheng-Jian; Küpers, Leanne K.; Oh, Sam S.; Hoyo, Cathrine; Gruzieva, Olena; Söderhäll, Cilla; Salas, Lucas A.; Baïz, Nour; Zhang, Hongmei; Lepeule, Johanna; Ruiz, Carlos; Ligthart, Symen; Wang, Tianyuan; Taylor, Jack A.; Duijts, Liesbeth; Sharp, Gemma C.; Jankipersadsing, Soesma A.; Nilsen, Roy M.; Vaez, Ahmad; Fallin, M. Daniele; Hu, Donglei; Litonjua, Augusto A.; Fuemmeler, Bernard F.; Huen, Karen; Kere, Juha; Kull, Inger; Munthe-Kaas, Monica Cheng; Gehring, Ulrike; Bustamante, Mariona; Saurel-Coubizolles, Marie José; Quraishi, Bilal M.; Ren, Jie; Tost, Jörg; Gonzalez, Juan R.; Peters, Marjolein J.; Håberg, Siri E.; Xu, Zongli; van Meurs, Joyce B.; Gaunt, Tom R.; Kerkhof, Marjan; Corpeleijn, Eva; Feinberg, Andrew P.; Eng, Celeste; Baccarelli, Andrea A.; Benjamin Neelon, Sara E.; Bradman, Asa; Merid, Simon Kebede; Bergström, Anna; Herceg, Zdenko; Hernandez-Vargas, Hector; Brunekreef, Bert; Pinart, Mariona; Heude, Barbara; Ewart, Susan; Yao, Jin; Lemonnier, Nathanaël; Franco, Oscar H.; Wu, Michael C.; Hofman, Albert; McArdle, Wendy; Van der Vlies, Pieter; Falahi, Fahimeh; Gillman, Matthew W.; Barcellos, Lisa F.; Kumar, Ashish; Wickman, Magnus; Guerra, Stefano; Charles, Marie-Aline; Holloway, John; Auffray, Charles; Tiemeier, Henning W.; Smith, George Davey; Postma, Dirkje; Hivert, Marie-France; Eskenazi, Brenda; Vrijheid, Martine; Arshad, Hasan; Antó, Josep M.; Dehghan, Abbas; Karmaus, Wilfried; Annesi-Maesano, Isabella; Sunyer, Jordi; Ghantous, Akram; Pershagen, Göran; Holland, Nina; Murphy, Susan K.; DeMeo, Dawn L.; Burchard, Esteban G.; Ladd-Acosta, Christine; Snieder, Harold; Nystad, Wenche; Koppelman, Gerard H.; Relton, Caroline L.; Jaddoe, Vincent W.V.; Wilcox, Allen; Melén, Erik; London, Stephanie J.

    2016-01-01

    Epigenetic modifications, including DNA methylation, represent a potential mechanism for environmental impacts on human disease. Maternal smoking in pregnancy remains an important public health problem that impacts child health in a myriad of ways and has potential lifelong consequences. The mechanisms are largely unknown, but epigenetics most likely plays a role. We formed the Pregnancy And Childhood Epigenetics (PACE) consortium and meta-analyzed, across 13 cohorts (n = 6,685), the association between maternal smoking in pregnancy and newborn blood DNA methylation at over 450,000 CpG sites (CpGs) by using the Illumina 450K BeadChip. Over 6,000 CpGs were differentially methylated in relation to maternal smoking at genome-wide statistical significance (false discovery rate, 5%), including 2,965 CpGs corresponding to 2,017 genes not previously related to smoking and methylation in either newborns or adults. Several genes are relevant to diseases that can be caused by maternal smoking (e.g., orofacial clefts and asthma) or adult smoking (e.g., certain cancers). A number of differentially methylated CpGs were associated with gene expression. We observed enrichment in pathways and processes critical to development. In older children (5 cohorts, n = 3,187), 100% of CpGs gave at least nominal levels of significance, far more than expected by chance (p value < 2.2 × 10⁻¹⁶). Results were robust to different normalization methods used across studies and cell type adjustment. In this large scale meta-analysis of methylation data, we identified numerous loci involved in response to maternal smoking in pregnancy with persistence into later childhood and provide insights into mechanisms underlying effects of this important exposure. PMID:27040690
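
    The 5% false discovery rate threshold mentioned in this record can be illustrated with a minimal sketch of the Benjamini-Hochberg step-up procedure. This assumes Benjamini-Hochberg is the FDR method in use, which the abstract does not state; the function name and the toy p-values are hypothetical and are not data from the study.

```python
def benjamini_hochberg(p_values, fdr=0.05):
    """Return one boolean per p-value: True if it passes the BH step-up rule."""
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])

    # Find the largest rank k such that p_(k) <= (k / m) * fdr.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * fdr:
            cutoff_rank = rank

    # Every p-value ranked at or below the cutoff is declared significant.
    passed = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            passed[idx] = True
    return passed


# Hypothetical p-values, for illustration only.
toy_p = [0.001, 0.008, 0.039, 0.041, 0.60]
flags = benjamini_hochberg(toy_p)
print(flags)
```

    With the toy input, only the two smallest p-values fall under their rank-scaled thresholds (0.01 and 0.02), so only those two are flagged.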

  20. Genome-wide DNA methylation analysis in obsessive-compulsive disorder patients

    PubMed Central

    Yue, Weihua; Cheng, Weiqiu; Liu, Zhaorui; Tang, Yi; Lu, Tianlan; Zhang, Dai; Tang, Muni; Huang, Yueqin

    2016-01-01

    The literature suggests that genetic and environmental factors interact to account for susceptibility to obsessive-compulsive disorder (OCD). As a heritable epigenetic modification, DNA methylation may regulate gene expression. Genome-wide DNA methylation was examined in blood samples from 65 patients with OCD and 96 healthy control subjects, at over 485,000 CpG sites, using the Illumina Infinium Human Methylation450 BeadChip. In total, 8,417 probes corresponding to 2,190 unique genes were found to be differentially methylated between OCD and healthy control subjects. Of those genes, 4,013 loci were located in CpG islands and 2,478 were in promoter regions. These included BCYRN1, BCOR, FGF13, HLA-DRB1, ARX, etc., which have previously been reported to be associated with OCD. Pathway analyses indicated that regulation of actin cytoskeleton, cell adhesion molecules (CAMs), actin binding, transcription regulator activity, and other pathways might be further associated with risk of OCD. Unsupervised clustering analysis of the top 3,000 most variable probes revealed two distinct groups, with significantly more people with OCD in cluster one compared with controls (67.74% of cases vs. 27.13% of controls, Chi-square = 26.011, df = 1, P = 3.41E-07). These results strongly suggest that differential DNA methylation might play an important role in the etiology of OCD. PMID:27527274
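
    As a rough consistency check on the reported test (Chi-square = 26.011, df = 1), the tail probability can be recomputed from the statistic alone. The sketch below relies on the standard identity that a chi-square variable with 1 degree of freedom is a squared standard normal, so P(X > x) = erfc(sqrt(x / 2)); the function name is hypothetical, and the underlying contingency table is not given in the abstract, so it is not reconstructed here.

```python
import math


def chi2_sf_df1(statistic):
    """Upper-tail probability of a chi-square statistic with 1 degree of freedom."""
    # With 1 df, X = Z**2 for a standard normal Z, so
    # P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(statistic / 2.0))


p = chi2_sf_df1(26.011)
print(f"{p:.2e}")
```

    The result is on the order of 3 × 10⁻⁷, in line with the P value quoted in the abstract.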

  1. Genome-wide DNA methylation analysis in obsessive-compulsive disorder patients.

    PubMed

    Yue, Weihua; Cheng, Weiqiu; Liu, Zhaorui; Tang, Yi; Lu, Tianlan; Zhang, Dai; Tang, Muni; Huang, Yueqin

    2016-08-16

    The literature suggests that genetic and environmental factors interact to account for susceptibility to obsessive-compulsive disorder (OCD). As a heritable epigenetic modification, DNA methylation may regulate gene expression. Genome-wide DNA methylation was examined in blood samples from 65 patients with OCD and 96 healthy control subjects, at over 485,000 CpG sites, using the Illumina Infinium Human Methylation450 BeadChip. In total, 8,417 probes corresponding to 2,190 unique genes were found to be differentially methylated between OCD and healthy control subjects. Of those genes, 4,013 loci were located in CpG islands and 2,478 were in promoter regions. These included BCYRN1, BCOR, FGF13, HLA-DRB1, ARX, etc., which have previously been reported to be associated with OCD. Pathway analyses indicated that regulation of actin cytoskeleton, cell adhesion molecules (CAMs), actin binding, transcription regulator activity, and other pathways might be further associated with risk of OCD. Unsupervised clustering analysis of the top 3,000 most variable probes revealed two distinct groups, with significantly more people with OCD in cluster one compared with controls (67.74% of cases vs. 27.13% of controls, Chi-square = 26.011, df = 1, P = 3.41E-07). These results strongly suggest that differential DNA methylation might play an important role in the etiology of OCD.

  2. Genome-wide association analysis of actigraphic sleep phenotypes in the LIFE Adult Study.

    PubMed

    Spada, Janek; Scholz, Markus; Kirsten, Holger; Hensch, Tilman; Horn, Katrin; Jawinski, Philippe; Ulke, Christine; Burkhardt, Ralph; Wirkner, Kerstin; Loeffler, Markus; Hegerl, Ulrich; Sander, Christian

    2016-12-01

    The genetic basis of sleep is still poorly understood. Despite the moderate to high heritability of sleep-related phenotypes, known genetic variants explain only a small proportion of the phenotypical variance. However, most previous studies were based solely upon self-report measures. The present study aimed to conduct the first genome-wide association (GWA) of actigraphic sleep phenotypes. The analyses included 956 middle- to older-aged subjects (40-79 years) from the LIFE Adult Study. The SenseWear Pro 3 Armband was used to collect 11 actigraphic parameters of night- and daytime sleep and three parameters of rest (lying down). The parameters comprised measures of sleep timing, quantity and quality. A total of 7,141,204 single nucleotide polymorphisms (SNPs) were analysed after imputation and quality control. We identified several variants below the significance threshold of P ≤ 5 × 10⁻⁸ (not corrected for analysis of multiple traits). The most significant was a hit near UFL1 associated with sleep efficiency on weekdays (P = 1.39 × 10⁻⁸). Further SNPs were close to significance, including an association between sleep latency and a variant in CSNK2A1 (P = 8.20 × 10⁻⁸), a gene known to be involved in the regulation of circadian rhythm. In summary, our GWAS identified novel, biologically plausible candidate genes that are promising targets for replication and further follow-up studies.
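
    The abstract notes that the genome-wide threshold was not corrected for the analysis of multiple traits. The sketch below shows what a simple Bonferroni adjustment across the 14 reported parameters (11 sleep plus 3 rest) would look like. This is an illustrative assumption, not the authors' analysis: Bonferroni is conservative when traits are correlated, and the constant and function names are hypothetical.

```python
GENOME_WIDE_ALPHA = 5e-8   # threshold quoted in the abstract
N_TRAITS = 11 + 3          # 11 sleep parameters + 3 rest parameters


def bonferroni_threshold(alpha, n_tests):
    """Divide the significance level evenly across n_tests comparisons."""
    return alpha / n_tests


adjusted = bonferroni_threshold(GENOME_WIDE_ALPHA, N_TRAITS)

# P values quoted in the abstract.
reported = {
    "UFL1 / sleep efficiency on weekdays": 1.39e-8,
    "CSNK2A1 / sleep latency": 8.20e-8,
}

print(f"trait-adjusted threshold: {adjusted:.2e}")
for label, p_value in reported.items():
    print(label, "passes" if p_value <= adjusted else "does not pass")
```

    Under this stricter, assumed correction, neither quoted association would pass, which is consistent with the abstract presenting them as candidates for replication.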

  3. A genome-wide association analysis of chromosomal aberrations and Hirschsprung disease.

    PubMed

    Bae, Joon Seol; Koh, InSong; Cheong, Hyun Sub; Seo, Jeong-Meen; Kim, Dae-Yeon; Oh, Jung-Tak; Kim, Hyun-Young; Jung, Kyuwhan; Sul, Jae Hoon; Park, Woong-Yang; Kim, Jeong-Hyun; Shin, Hyoung Doo

    2016-11-01

    Hirschsprung disease (HSCR) is a neurocristopathy characterized by the absence of intramural ganglion cells along variable lengths of the gastrointestinal tract. Although the RET proto-oncogene is considered to be the main risk factor for HSCR, only about 30% of the HSCR cases can be explained by variations in previously known genes including RET. Recently, copy number variation (CNV) and loss of heterozygosity (LOH) have emerged as new ways to understand human genomic variation. The goal of the present study is to identify new HSCR genetic factors related to CNV in Korean patients. In the genome-wide genotyping, using Illumina's HumanOmni1-Quad BeadChip (1,140,419 markers), of 123 HSCR patients and 432 unaffected subjects (total n = 555), a total of 8,188 CNVs (1 kb to 1 Mb) were identified by CNVpartition. As a result, 16 CNV regions and 13 LOH regions were identified as associated with HSCR (minimum P = 0.0005). Two top CNV regions (deletions at chr6:32675155-32680480 and chr22:20733495-21607293) were successfully validated by additional real-time quantitative polymerase chain reaction analysis. In addition, 2 CNV regions (6p21.32 and 22q11.21) and 2 LOH regions (3p22.2 and 14q23.3) were discovered to be unique to the HSCR patients group. Regarding the large-scale chromosomal aberrations (>1 Mb), 11 large aberrations in the HSCR patients group were identified, which suggests that they may be a risk factor for HSCR. Although further replication in a larger cohort is needed, our findings may contribute to the understanding of the etiology of HSCR.

  4. Genome-Wide Transcriptional Analysis of Genes Associated with Acute Desiccation Stress in Anopheles gambiae

    PubMed Central

    Wang, Mei-Hui; Marinotti, Osvaldo; Vardo-Zalik, Anne; Boparai, Rajni; Yan, Guiyun

    2011-01-01

    Malaria transmission in sub-Saharan Africa varies seasonally in intensity. Outbreaks of malaria occur after the beginning of the rainy season, whereas, during the dry season, reports of the disease are less frequent. Anopheles gambiae mosquitoes, the main malaria vector, are observed all year long but their densities are low during the dry season that generally lasts several months. Aestivation, seasonal migration, and local adaptation have been suggested as mechanisms that enable mosquito populations to persist through the dry season. Studies of chromosomal inversions have shown that inversions 2La, 2Rb, 2Rc, 2Rd, and 2Ru are associated with various physiological changes that confer aridity resistance. However, little is known about how phenotypic plasticity responds to seasonally dry conditions. This study examined the effects of desiccation stress on transcriptional regulation in An. gambiae. We exposed female An. gambiae G3 mosquitoes to acute desiccation and conducted a genome-wide analysis of their transcriptomes using the Affymetrix Plasmodium/Anopheles Genome Array. The transcription of 248 genes (1.7% of all transcripts) was significantly affected in all experimental conditions, including 96 with increased expression and 152 with decreased expression. In general, the data indicate a reduction in the metabolic rate of mosquitoes exposed to desiccation. Transcripts accumulated at higher levels during desiccation are associated with oxygen radical detoxification, DNA repair and stress responses. The proportion of transcripts within 2La and 2Rs (2Rb, 2Rc, 2Rd, and 2Ru) (67/248, or 27%) is similar to the percentage of transcripts located within these inversions (31%). These data may be useful in efforts to elucidate the role of chromosomal inversions in aridity tolerance. The scope of application of the anopheline genome demonstrates that examining transcriptional activity in relation to genotypic adaptations greatly expands the number of candidate regions

  5. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa

    PubMed Central

    Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA—BrIAA) and 36 cross species (BrIAA—AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in-house developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we believe our study would provide a foundation in understanding the Aux/IAA genes in B. rapa. PMID

  6. A genome-wide association meta-analysis on apolipoprotein A-IV concentrations.

    PubMed

    Lamina, Claudia; Friedel, Salome; Coassin, Stefan; Rueedi, Rico; Yousri, Noha A; Seppälä, Ilkka; Gieger, Christian; Schönherr, Sebastian; Forer, Lukas; Erhart, Gertraud; Kollerits, Barbara; Marques-Vidal, Pedro; Ried, Janina; Waeber, Gerard; Bergmann, Sven; Dähnhardt, Doreen; Stöckl, Andrea; Kiechl, Stefan; Raitakari, Olli T; Kähönen, Mika; Willeit, Johann; Kedenko, Ludmilla; Paulweber, Bernhard; Peters, Annette; Meitinger, Thomas; Strauch, Konstantin; Study Group, Kora; Lehtimäki, Terho; Hunt, Steven C; Vollenweider, Peter; Kronenberg, Florian

    2016-08-15

    Apolipoprotein A-IV (apoA-IV) is a major component of HDL and chylomicron particles and is involved in reverse cholesterol transport. It is an early marker of impaired renal function. We aimed to identify genetic loci associated with apoA-IV concentrations and to investigate relationships with known susceptibility loci for kidney function and lipids. A genome-wide association meta-analysis on apoA-IV concentrations was conducted in five population-based cohorts (n = 13,813) followed by two additional replication studies (n = 2,267) including approximately 10 M SNPs. Three independent SNPs from two genomic regions were significantly associated with apoA-IV concentrations: rs1729407 near APOA4 (P = 6.77 × 10⁻⁴⁴), rs5104 in APOA4 (P = 1.79 × 10⁻²⁴) and rs4241819 in KLKB1 (P = 5.6 × 10⁻¹⁴). Additionally, a look-up of the replicated SNPs in downloadable GWAS meta-analysis results was performed on kidney function (defined by eGFR), HDL-cholesterol and triglycerides. From these three SNPs mentioned above, only rs1729407 showed an association with HDL-cholesterol (P = 7.1 × 10⁻⁷). Moreover, weighted SNP-scores were built involving known susceptibility loci for the aforementioned traits (53, 70 and 38 SNPs, respectively) and were associated with apoA-IV concentrations. This analysis revealed a significant and an inverse association for kidney function with apoA-IV concentrations (P = 5.5 × 10⁻⁵). Furthermore, an increase of triglyceride-increasing alleles was found to decrease apoA-IV concentrations (P = 0.0078). In summary, we identified two independent SNPs located in or next to the APOA4 gene and one SNP in KLKB1. The association of KLKB1 with apoA-IV suggests an involvement of apoA-IV in renal metabolism and/or an interaction within HDL particles. Analyses of SNP-scores indicate potential causal effects of kidney function and to a lesser extent triglycerides on apoA-IV concentrations.
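
    The weighted SNP-scores mentioned in this record can be sketched in their most common form: a per-individual sum of effect-allele dosages, each multiplied by a per-SNP weight. This is an assumed, generic construction; the abstract does not describe how the authors built their scores, and the SNP identifiers, weights and dosages below are hypothetical placeholders, not study data.

```python
def weighted_snp_score(dosages, weights):
    """Sum of (weight * effect-allele dosage) over a fixed set of SNPs.

    dosages: SNP id -> effect-allele dosage for one individual (0 to 2)
    weights: SNP id -> per-allele effect size taken from a reference study
    """
    if dosages.keys() != weights.keys():
        raise ValueError("dosages and weights must cover the same SNPs")
    return sum(weights[snp] * dosages[snp] for snp in weights)


# Hypothetical placeholder values, for illustration only.
example_weights = {"snp_a": 0.10, "snp_b": -0.05, "snp_c": 0.20}
example_dosages = {"snp_a": 2, "snp_b": 1, "snp_c": 0}

score = weighted_snp_score(example_dosages, example_weights)
print(round(score, 3))
```

    In this form the score would then be used as a single predictor in a regression on the trait of interest, which is the role the abstract assigns to its SNP-scores.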

  7. Genome-Wide Analysis and Characterization of Aux/IAA Family Genes in Brassica rapa.

    PubMed

    Paul, Parameswari; Dhandapani, Vignesh; Rameneni, Jana Jeevan; Li, Xiaonan; Sivanandhan, Ganesan; Choi, Su Ryun; Pang, Wenxing; Im, Subin; Lim, Yong Pyo

    2016-01-01

    Auxins are the key players in plant growth development involving leaf formation, phototropism, root, fruit and embryo development. Auxin/Indole-3-Acetic Acid (Aux/IAA) are early auxin response genes noted as transcriptional repressors in plant auxin signaling. However, many studies focus on Aux/ARF gene families and much less is known about the Aux/IAA gene family in Brassica rapa (B. rapa). Here we performed a comprehensive genome-wide analysis and identified 55 Aux/IAA genes in B. rapa using four conserved motifs of Aux/IAA family (PF02309). Chromosomal mapping of the B. rapa Aux/IAA (BrIAA) genes facilitated understanding cluster rearrangement of the crucifer building blocks in the genome. Phylogenetic analysis of BrIAA with Arabidopsis thaliana, Oryza sativa and Zea mays identified 51 sister pairs including 15 same species (BrIAA-BrIAA) and 36 cross species (BrIAA-AtIAA) IAA genes. Among the 55 BrIAA genes, expression of 43 and 45 genes were verified using Genebank B. rapa ESTs and in-house developed microarray data from mature leaves of Chiifu and RcBr lines. Despite their huge morphological difference, tissue specific expression analysis of BrIAA genes between the parental lines Chiifu and RcBr showed that the genes followed a similar pattern of expression during leaf development and a different pattern during bud, flower and siliqua development stages. The response of the BrIAA genes to abiotic and auxin stress at different time intervals revealed their involvement in stress response. Single Nucleotide Polymorphisms between IAA genes of reference genome Chiifu and RcBr were focused and identified. Our study examines the scope of conservation and divergence of Aux/IAA genes and their structures in B. rapa. Analyzing the expression and structural variation between two parental lines will significantly contribute to functional genomics of Brassica crops and we believe our study would provide a foundation in understanding the Aux/IAA genes in B. rapa.

  8. A genome-wide association meta-analysis on apolipoprotein A-IV concentrations

    PubMed Central

    Lamina, Claudia; Friedel, Salome; Coassin, Stefan; Rueedi, Rico; Yousri, Noha A.; Seppälä, Ilkka; Gieger, Christian; Schönherr, Sebastian; Forer, Lukas; Erhart, Gertraud; Kollerits, Barbara; Marques-Vidal, Pedro; Ried, Janina; Waeber, Gerard; Bergmann, Sven; Dähnhardt, Doreen; Stöckl, Andrea; Kiechl, Stefan; Raitakari, Olli T.; Kähönen, Mika; Willeit, Johann; Kedenko, Ludmilla; Paulweber, Bernhard; Peters, Annette; Meitinger, Thomas; Strauch, Konstantin; Study Group, KORA; Lehtimäki, Terho; Hunt, Steven C.; Vollenweider, Peter; Kronenberg, Florian

    2016-01-01

    Apolipoprotein A-IV (apoA-IV) is a major component of HDL and chylomicron particles and is involved in reverse cholesterol transport. It is an early marker of impaired renal function. We aimed to identify genetic loci associated with apoA-IV concentrations and to investigate relationships with known susceptibility loci for kidney function and lipids. A genome-wide association meta-analysis on apoA-IV concentrations was conducted in five population-based cohorts (n = 13,813) followed by two additional replication studies (n = 2,267) including approximately 10 M SNPs. Three independent SNPs from two genomic regions were significantly associated with apoA-IV concentrations: rs1729407 near APOA4 (P = 6.77 × 10⁻⁴⁴), rs5104 in APOA4 (P = 1.79 × 10⁻²⁴) and rs4241819 in KLKB1 (P = 5.6 × 10⁻¹⁴). Additionally, a look-up of the replicated SNPs in downloadable GWAS meta-analysis results was performed on kidney function (defined by eGFR), HDL-cholesterol and triglycerides. From these three SNPs mentioned above, only rs1729407 showed an association with HDL-cholesterol (P = 7.1 × 10⁻⁷). Moreover, weighted SNP-scores were built involving known susceptibility loci for the aforementioned traits (53, 70 and 38 SNPs, respectively) and were associated with apoA-IV concentrations. This analysis revealed a significant and an inverse association for kidney function with apoA-IV concentrations (P = 5.5 × 10⁻⁵). Furthermore, an increase of triglyceride-increasing alleles was found to decrease apoA-IV concentrations (P = 0.0078). In summary, we identified two independent SNPs located in or next to the APOA4 gene and one SNP in KLKB1. The association of KLKB1 with apoA-IV suggests an involvement of apoA-IV in renal metabolism and/or an interaction within HDL particles. Analyses of SNP-scores indicate potential causal effects of kidney function and to a lesser extent triglycerides on apoA-IV concentrations. PMID

  9. Genome-wide analysis identifies 12 loci influencing human reproductive behavior.

    PubMed

    Barban, Nicola; Jansen, Rick; de Vlaming, Ronald; Vaez, Ahmad; Mandemakers, Jornt J; Tropf, Felix C; Shen, Xia; Wilson, James F; Chasman, Daniel I; Nolte, Ilja M; Tragante, Vinicius; van der Laan, Sander W; Perry, John R B; Kong, Augustine; Ahluwalia, Tarunveer S; Albrecht, Eva; Yerges-Armstrong, Laura; Atzmon, Gil; Auro, Kirsi; Ayers, Kristin; Bakshi, Andrew; Ben-Avraham, Danny; Berger, Klaus; Bergman, Aviv; Bertram, Lars; Bielak, Lawrence F; Bjornsdottir, Gyda; Bonder, Marc Jan; Broer, Linda; Bui, Minh; Barbieri, Caterina; Cavadino, Alana; Chavarro, Jorge E; Turman, Constance; Concas, Maria Pina; Cordell, Heather J; Davies, Gail; Eibich, Peter; Eriksson, Nicholas; Esko, Tõnu; Eriksson, Joel; Falahi, Fahimeh; Felix, Janine F; Fontana, Mark Alan; Franke, Lude; Gandin, Ilaria; Gaskins, Audrey J; Gieger, Christian; Gunderson, Erica P; Guo, Xiuqing; Hayward, Caroline; He, Chunyan; Hofer, Edith; Huang, Hongyan; Joshi, Peter K; Kanoni, Stavroula; Karlsson, Robert; Kiechl, Stefan; Kifley, Annette; Kluttig, Alexander; Kraft, Peter; Lagou, Vasiliki; Lecoeur, Cecile; Lahti, Jari; Li-Gao, Ruifang; Lind, Penelope A; Liu, Tian; Makalic, Enes; Mamasoula, Crysovalanto; Matteson, Lindsay; Mbarek, Hamdi; McArdle, Patrick F; McMahon, George; Meddens, S Fleur W; Mihailov, Evelin; Miller, Mike; Missmer, Stacey A; Monnereau, Claire; van der Most, Peter J; Myhre, Ronny; Nalls, Mike A; Nutile, Teresa; Kalafati, Ioanna Panagiota; Porcu, Eleonora; Prokopenko, Inga; Rajan, Kumar B; Rich-Edwards, Janet; Rietveld, Cornelius A; Robino, Antonietta; Rose, Lynda M; Rueedi, Rico; Ryan, Kathleen A; Saba, Yasaman; Schmidt, Daniel; Smith, Jennifer A; Stolk, Lisette; Streeten, Elizabeth; Tönjes, Anke; Thorleifsson, Gudmar; Ulivi, Sheila; Wedenoja, Juho; Wellmann, Juergen; Willeit, Peter; Yao, Jie; Yengo, Loic; Zhao, Jing Hua; Zhao, Wei; Zhernakova, Daria V; Amin, Najaf; Andrews, Howard; Balkau, Beverley; Barzilai, Nir; Bergmann, Sven; Biino, Ginevra; Bisgaard, Hans; Bønnelykke, Klaus; Boomsma, Dorret I; Buring, Julie E; Campbell, Harry; Cappellani, Stefania; Ciullo, Marina; Cox, Simon R; Cucca, Francesco; Toniolo, Daniela; Davey-Smith, George; Deary, Ian J; Dedoussis, George; Deloukas, Panos; van Duijn, Cornelia M; de Geus, Eco J C; Eriksson, Johan G; Evans, Denis A; Faul, Jessica D; Sala, Cinzia Felicita; Froguel, Philippe; Gasparini, Paolo; Girotto, Giorgia; Grabe, Hans-Jörgen; Greiser, Karin Halina; Groenen, Patrick J F; de Haan, Hugoline G; Haerting, Johannes; Harris, Tamara B; Heath, Andrew C; Heikkilä, Kauko; Hofman, Albert; Homuth, Georg; Holliday, Elizabeth G; Hopper, John; Hyppönen, Elina; Jacobsson, Bo; Jaddoe, Vincent W V; Johannesson, Magnus; Jugessur, Astanand; Kähönen, Mika; Kajantie, Eero; Kardia, Sharon L R; Keavney, Bernard; Kolcic, Ivana; Koponen, Päivikki; Kovacs, Peter; Kronenberg, Florian; Kutalik, Zoltan; La Bianca, Martina; Lachance, Genevieve; Iacono, William G; Lai, Sandra; Lehtimäki, Terho; Liewald, David C; Lindgren, Cecilia M; Liu, Yongmei; Luben, Robert; Lucht, Michael; Luoto, Riitta; Magnus, Per; Magnusson, Patrik K E; Martin, Nicholas G; McGue, Matt; McQuillan, Ruth; Medland, Sarah E; Meisinger, Christa; Mellström, Dan; Metspalu, Andres; Traglia, Michela; Milani, Lili; Mitchell, Paul; Montgomery, Grant W; Mook-Kanamori, Dennis; de Mutsert, Renée; Nohr, Ellen A; Ohlsson, Claes; Olsen, Jørn; Ong, Ken K; Paternoster, Lavinia; Pattie, Alison; Penninx, Brenda W J H; Perola, Markus; Peyser, Patricia A; Pirastu, Mario; Polasek, Ozren; Power, Chris; Kaprio, Jaakko; Raffel, Leslie J; Räikkönen, Katri; Raitakari, Olli; Ridker, Paul M; Ring, Susan M; Roll, Kathryn; Rudan, Igor; Ruggiero, Daniela; Rujescu, Dan; Salomaa, Veikko; Schlessinger, David; Schmidt, Helena; Schmidt, Reinhold; Schupf, Nicole; Smit, Johannes; Sorice, Rossella; Spector, Tim D; Starr, John M; Stöckl, Doris; Strauch, Konstantin; Stumvoll, Michael; Swertz, Morris A; Thorsteinsdottir, Unnur; Thurik, A Roy; Timpson, Nicholas J; Tung, Joyce Y; Uitterlinden, André G; Vaccargiu, Simona; Viikari, Jorma; Vitart, Veronique; Völzke, Henry; Vollenweider, Peter; Vuckovic, Dragana; Waage, Johannes; Wagner, Gert G; Wang, Jie Jin; Wareham, Nicholas J; Weir, David R; Willemsen, Gonneke; Willeit, Johann; Wright, Alan F; Zondervan, Krina T; Stefansson, Kari; Krueger, Robert F; Lee, James J; Benjamin, Daniel J; Cesarini, David; Koellinger, Philipp D; den Hoed, Marcel; Snieder, Harold; Mills, Melinda C

    2016-12-01

    The genetic architecture of human reproductive behavior, namely age at first birth (AFB) and number of children ever born (NEB), has a strong relationship with fitness, human development, infertility and risk of neuropsychiatric disorders. However, very few genetic loci have been identified, and the underlying mechanisms of AFB and NEB are poorly understood. We report a large genome-wide association study of both sexes including 251,151 individuals for AFB and 343,072 individuals for NEB. We identified 12 independent loci that are significantly associated with AFB and/or NEB in a SNP-based genome-wide association study and 4 additional loci associated in a gene-based effort. These loci harbor genes that are likely to have a role, either directly or by affecting non-local gene expression, in human reproduction and infertility, thereby increasing understanding of these complex traits.

  10. Common genetic variation and survival after colorectal cancer diagnosis: a genome-wide analysis

    PubMed Central

    Phipps, Amanda I.; Passarelli, Michael N.; Chan, Andrew T.; Harrison, Tabitha A.; Jeon, Jihyoun; Hutter, Carolyn M.; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cheadle, Jeremy P.; Curtis, Keith R.; Duggan, David; Fisher, David; Fuchs, Charles S.; Gala, Manish; Giovannucci, Edward L.; Hayes, Richard B.; Hoffmeister, Michael; Hsu, Li; Jacobs, Eric J.; Jansen, Lina; Kaplan, Richard; Kap, Elisabeth J.; Maughan, Timothy S.; Potter, John D.; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; West, Hannah; White, Emily; Peters, Ulrike; Newcomb, Polly A.

    2016-01-01

    Genome-wide association studies have identified several germline single nucleotide polymorphisms (SNPs) significantly associated with colorectal cancer (CRC) incidence. Common germline genetic variation may also be related to CRC survival. We used a discovery-based approach to identify SNPs related to survival outcomes after CRC diagnosis. Genome-wide array genotyping was conducted for 3494 individuals with invasive CRC enrolled in six prospective cohort studies (median study-specific follow-up = 4.2–8.1 years). In pooled analyses, we used Cox regression to assess SNP-specific associations with CRC-specific and overall survival, with additional analyses stratified by stage at diagnosis. Top findings were followed up in independent studies. A threshold of P < 5×10^-8 in analyses combining discovery and follow-up studies was required for genome-wide significance. Among individuals with distant-metastatic CRC, several SNPs at 6p12.1, nearest the ELOVL5 gene, were statistically significantly associated with poorer survival, with the strongest associations noted for rs209489 (hazard ratio (HR) = 1.8, P = 7.6×10^-10 and HR = 1.8, P = 3.7×10^-9 for CRC-specific and overall survival, respectively). No SNPs were statistically significantly associated with survival among all cases combined or in cases without distant metastases. SNPs in 6p12.1/ELOVL5 were associated with survival outcomes in individuals with distant-metastatic CRC, and merit further follow-up for functional significance. Findings from this genome-wide association study highlight the potential importance of genetic variation in CRC prognosis and provide clues to genomic regions of potential interest. PMID:26586795

  11. Genome-wide computational analysis of the secretome of brown algae (Phaeophyceae).

    PubMed

    Terauchi, Makoto; Yamagishi, Takahiro; Hanyuda, Takeaki; Kawai, Hiroshi

    2017-04-01

    Brown algae have evolved complex multicellularity in the heterokont lineage. They are phylogenetically distant from land plants, fungi and animals. In particular, the members of Laminariales (so-called kelps) have developed highly differentiated tissues. The extracellular matrix (ECM) plays pivotal roles in a number of essential processes in multicellular organisms, such as cell adhesion, cell and tissue differentiation, cell-to-cell communication, and responses to environmental stimuli. In these processes, a set of extracellular secreted proteins called the secretome remodels the physicochemical nature of the ECM and mediates signal transduction by interacting with cell surface proteins and signaling molecules. Characterization of the secretome is a critical step in clarifying the contributions of the ECM to the multicellularity of brown algae. However, the identity of the brown algal secretome remains poorly understood. In order to reveal the repertory of the brown algal secretome and its involvement in the evolution of Laminariales, we conducted a genome-wide analysis of the brown algal secretome utilizing the published complete genome data of Ectocarpus siliculosus and Saccharina japonica as well as newly obtained RNA-seq data of seven laminarialean species (Agarum clathratum, Alaria crassifolia, Aureophycus aleuticus, Costaria costata, Pseudochorda nagaii, Saccharina angustata and Undaria pinnatifida) largely covering the laminarialean families. We established an in silico pipeline to systematically and accurately detect the secretome by combining multiple prediction algorithms for the N-terminal signal peptide and transmembrane domain within the protein sequence. From 16,189 proteins of E. siliculosus and 18,733 proteins of S. japonica, 552 and 964 proteins respectively were predicted to be classified as the secretome. Conserved domain analysis showed that the domain repertories were very similar to each other, and that of the brown algal secretome was partially common

  12. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium

    DOE PAGES

    Yurchenko, Olga P.; Park, Sunjung; Ilut, Daniel C.; ...

    2014-11-18

    The majority of commercial cotton varieties planted worldwide are derived from Gossypium hirsutum, which is a naturally occurring allotetraploid produced by interspecific hybridization of A- and D-genome diploid progenitor species. While most cotton species are adapted to warm, semi-arid tropical and subtropical regions, and thus perform well in these geographical areas, cotton seedlings are sensitive to cold temperature, which can significantly reduce crop yields. One of the common biochemical responses of plants to cold temperatures is an increase in omega-3 fatty acids, which protects cellular function by maintaining membrane integrity. The purpose of our study was to identify and characterize the omega-3 fatty acid desaturase (FAD) gene family in G. hirsutum, with an emphasis on identifying omega-3 FADs involved in cold temperature adaptation. Results: Eleven omega-3 FAD genes were identified in G. hirsutum, and characterization of the gene family in extant A and D diploid species (G. herbaceum and G. raimondii, respectively) allowed for unambiguous genome assignment of all homoeologs in tetraploid G. hirsutum. The omega-3 FAD family of cotton includes five distinct genes, two of which encode endoplasmic reticulum-type enzymes (FAD3-1 and FAD3-2) and three that encode chloroplast-type enzymes (FAD7/8-1, FAD7/8-2, and FAD7/8-3). The FAD3-2 gene was duplicated in the A genome progenitor species after the evolutionary split from the D progenitor, but before the interspecific hybridization event that gave rise to modern tetraploid cotton. RNA-seq analysis revealed conserved, gene-specific expression patterns in various organs and cell types and semi-quantitative RT-PCR further revealed that FAD7/8-1 was specifically induced during cold temperature treatment of G. hirsutum seedlings. 
Conclusions: The omega-3 FAD gene family in cotton was characterized at the genome-wide level in three species, showing relatively ancient establishment of the gene family prior

  13. Creative Activities in Music – A Genome-Wide Linkage Analysis

    PubMed Central

    Oikkonen, Jaana; Kuusi, Tuire; Peltonen, Petri; Raijas, Pirre; Ukkola-Vuoti, Liisa; Karma, Kai; Onkamo, Päivi; Järvelä, Irma

    2016-01-01

    Creative activities in music represent a complex cognitive function of the human brain, whose biological basis is largely unknown. In order to elucidate the biological background of creative activities in music we performed genome-wide linkage and linkage disequilibrium (LD) scans in musically experienced individuals characterised for self-reported composing, arranging and non-music related creativity. The participants consisted of 474 individuals from 79 families, and 103 sporadic individuals. We found promising evidence for linkage at 16p12.1-q12.1 for arranging (LOD 2.75, 120 cases), 4q22.1 for composing (LOD 2.15, 103 cases) and Xp11.23 for non-music related creativity (LOD 2.50, 259 cases). Surprisingly, statistically significant evidence for linkage was found for the opposite phenotype of creative activity in music (neither composing nor arranging; NCNA) at 18q21 (LOD 3.09, 149 cases), which contains cadherin genes like CDH7 and CDH19. The locus at 4q22.1 overlaps the previously identified region of musical aptitude, music perception and performance giving further support for this region as a candidate region for broad range of music-related traits. The other regions at 18q21 and 16p12.1-q12.1 are also adjacent to the previously identified loci with musical aptitude. Pathway analysis of the genes suggestively associated with composing suggested an overrepresentation of the cerebellar long-term depression pathway (LTD), which is a cellular model for synaptic plasticity. The LTD also includes cadherins and AMPA receptors, whose component GSG1L was linked to arranging. These results suggest that molecular pathways linked to memory and learning via LTD affect music-related creative behaviour. Musical creativity is a complex phenotype where a common background with musicality and intelligence has been proposed. Here, we implicate genetic regions affecting music-related creative behaviour, which also include genes with neuropsychiatric associations. 
We also propose

  14. Creative Activities in Music--A Genome-Wide Linkage Analysis.

    PubMed

    Oikkonen, Jaana; Kuusi, Tuire; Peltonen, Petri; Raijas, Pirre; Ukkola-Vuoti, Liisa; Karma, Kai; Onkamo, Päivi; Järvelä, Irma

    2016-01-01

    Creative activities in music represent a complex cognitive function of the human brain, whose biological basis is largely unknown. In order to elucidate the biological background of creative activities in music we performed genome-wide linkage and linkage disequilibrium (LD) scans in musically experienced individuals characterised for self-reported composing, arranging and non-music related creativity. The participants consisted of 474 individuals from 79 families, and 103 sporadic individuals. We found promising evidence for linkage at 16p12.1-q12.1 for arranging (LOD 2.75, 120 cases), 4q22.1 for composing (LOD 2.15, 103 cases) and Xp11.23 for non-music related creativity (LOD 2.50, 259 cases). Surprisingly, statistically significant evidence for linkage was found for the opposite phenotype of creative activity in music (neither composing nor arranging; NCNA) at 18q21 (LOD 3.09, 149 cases), which contains cadherin genes like CDH7 and CDH19. The locus at 4q22.1 overlaps the previously identified region of musical aptitude, music perception and performance giving further support for this region as a candidate region for broad range of music-related traits. The other regions at 18q21 and 16p12.1-q12.1 are also adjacent to the previously identified loci with musical aptitude. Pathway analysis of the genes suggestively associated with composing suggested an overrepresentation of the cerebellar long-term depression pathway (LTD), which is a cellular model for synaptic plasticity. The LTD also includes cadherins and AMPA receptors, whose component GSG1L was linked to arranging. These results suggest that molecular pathways linked to memory and learning via LTD affect music-related creative behaviour. Musical creativity is a complex phenotype where a common background with musicality and intelligence has been proposed. Here, we implicate genetic regions affecting music-related creative behaviour, which also include genes with neuropsychiatric associations. 
We also propose

  15. Genome Wide Linkage Analysis of 972 Bipolar Pedigrees Using Single Nucleotide Polymorphisms

    PubMed Central

    Badner, Judith A; Koller, Daniel; Foroud, Tatiana; Edenberg, Howard; Nurnberger, John I; Zandi, Peter P; Willour, Virginia L.; McMahon, Francis J; Potash, James B; Hamshere, Marian; Grozeva, Detelina; Green, Elaine; Kirov, George; Jones, Ian; Jones, Lisa; Craddock, Nicholas; Morris, Derek; Segurado, Ricardo; Gill, Mike; Sadovnick, Dessa; Remick, Ronald; Keck, Paul; Kelsoe, John; Ayub, Muhammad; MacLean, Alan; Blackwood, Douglas; Liu, Chun-Yu; Gershon, Elliot S; McMahon, William; Lyon, Gholson; Robinson, Reid; Ross, Jessica; Byerley, William

    2011-01-01

    Because of the high costs associated with ascertainment of families most linkage studies of Bipolar I disorder (BPI) have used relatively small samples. Moreover, the genetic information content reported in most studies has been less than 0.6. While microsatellite markers spaced every 10 centimorgans typically extract most of the genetic information content for larger multiplex families, they can be less informative for smaller pedigrees especially for affected sib pair kindreds. For these reasons we collaborated to pool family resources and carry out higher density genotyping. Approximately 1100 pedigrees of European ancestry were initially selected for study and were genotyped by the Center for Inherited Disease Research using the Illumina Linkage Panel 12 set of 6090 SNPs. Of the ~1100 families, 972 were informative for further analyses and mean information content was 0.86 after pruning for LD. The 972 kindreds include 2284 cases of BPI disorder, 498 individuals with Bipolar II disorder (BPII) and 702 subjects with Recurrent Major Depression. Three affection status models were considered: ASM1 (BPI and schizoaffective disorder, BP cases (SABP) only), ASM2 (ASM1 cases plus BPII) and ASM3 (ASM2 cases plus Recurrent Major Depression). Both parametric and non-parametric linkage methods were carried out. The strongest findings occurred at 6q21 (Nonparametric Pairs Lod 3.4 for rs1046943 at 119 cM) and 9q21 (Nonparametric Pairs Lod 3.4 for rs722642 at 78 cM) using only BPI and SA, BP cases. Both results met genome-wide significant criteria, although neither was significant after correction for multiple analyses. We also inspected parametric scores for the larger multiplex families to identify possible rare susceptibility loci. In this analysis we observed 59 parametric lods of 2 or greater, many of which are likely to be close to maximum possible scores. 
While some linkage findings may be false positives the results could help prioritize the search for rare variants
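
    The remark that many parametric lods are "close to maximum possible scores" can be illustrated with a two-point LOD score. The sketch below is only an illustration and assumes phase-known, fully informative meioses, which is a simplification and not the multipoint parametric or nonparametric analysis used in the study. The function names are hypothetical.

```python
from math import log10

def two_point_lod(recombinants, meioses, theta):
    """LOD = log10[L(theta) / L(0.5)] for phase-known, fully informative meioses.

    Illustrative assumption: each meiosis is scored unambiguously as
    recombinant or non-recombinant.
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    r, n = recombinants, meioses
    if theta == 0.0:
        if r > 0:
            return float("-inf")  # any recombinant rules out theta = 0
        log_l_theta = 0.0         # likelihood is 1 when no recombinants occur
    else:
        log_l_theta = r * log10(theta) + (n - r) * log10(1.0 - theta)
    log_l_null = n * log10(0.5)   # free recombination (no linkage)
    return log_l_theta - log_l_null

def max_lod(meioses):
    """Largest attainable LOD: no recombinants, evaluated at theta = 0."""
    return meioses * log10(2.0)

if __name__ == "__main__":
    # Ten informative meioses with no recombinants reach the ceiling n * log10(2).
    print(round(two_point_lod(0, 10, 0.0), 3), round(max_lod(10), 3))
```

    Under these assumptions a pedigree with n informative meioses cannot exceed a LOD of n × log10(2), so a family with few meioses can sit at its ceiling while still scoring well below the conventional threshold of 3.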

  16. Pharmacogenomic Genome-Wide Meta-Analysis of Blood Pressure Response to β-Blockers in Hypertensive African Americans.

    PubMed

    Gong, Yan; Wang, Zhiying; Beitelshees, Amber L; McDonough, Caitrin W; Langaee, Taimour Y; Hall, Karen; Schmidt, Siegfried O F; Curry, Robert W; Gums, John G; Bailey, Kent R; Boerwinkle, Eric; Chapman, Arlene B; Turner, Stephen T; Cooper-DeHoff, Rhonda M; Johnson, Julie A

    2016-03-01

    African Americans suffer a higher prevalence of hypertension compared with other racial/ethnic groups. In this study, we performed a pharmacogenomic genome-wide association study of blood pressure (BP) response to β-blockers in African Americans with uncomplicated hypertension. Genome-wide meta-analysis was performed in 318 African American hypertensive participants in the 2 Pharmacogenomic Evaluation of Antihypertensive Responses studies: 150 treated with atenolol monotherapy and 168 treated with metoprolol monotherapy. The analysis adjusted for age, sex, baseline BP and principal components for ancestry. Genome-wide significant variants with P<5×10(-8) and suggestive variants with P<5×10(-7) were evaluated in an additional cohort of 141 African Americans treated with the addition of atenolol to hydrochlorothiazide treatment. The validated variants were then meta-analyzed in these 3 groups of African Americans. Two variants discovered in the monotherapy meta-analysis were validated in the add-on therapy. African American participants heterozygous for SLC25A31 rs201279313 deletion versus wild-type genotype had better diastolic BP response to atenolol monotherapy, metoprolol monotherapy, and atenolol add-on therapy: -9.3 versus -4.6, -9.6 versus -4.8, and -9.7 versus -6.4 mm Hg, respectively (3-group meta-analysis P=2.5×10(-8), β=-4.42 mm Hg per variant allele). Similarly, LRRC15 rs11313667 was validated for systolic BP response to β-blocker therapy with 3-group meta-analysis P=7.2×10(-8) and β=-3.65 mm Hg per variant allele. In this first pharmacogenomic genome-wide meta-analysis of BP response to β-blockers in African Americans, we identified novel variants that may provide valuable information for personalized antihypertensive treatment in this group.

  17. Meta-analysis for genome-wide association studies using case-control design: application and practice

    PubMed Central

    Shim, Sungryul; Kim, Jiyoung; Jung, Wonguen; Shin, In-Soo; Bae, Jong-Myon

    2016-01-01

    This review aimed to arrange the process of a systematic review of genome-wide association studies in order to practice and apply a genome-wide meta-analysis (GWMA). The process has a series of five steps: searching and selection, extraction of related information, evaluation of validity, meta-analysis by type of genetic model, and evaluation of heterogeneity. In contrast to intervention meta-analyses, GWMA has to evaluate the Hardy–Weinberg equilibrium (HWE) in the third step and conduct meta-analyses by five potential genetic models, including dominant, recessive, homozygote contrast, heterozygote contrast, and allelic contrast in the fourth step. The ‘genhwcci’ and ‘metan’ commands of STATA software evaluate the HWE and calculate a summary effect size, respectively. A meta-regression using the ‘metareg’ command of STATA should be conducted to evaluate related factors of heterogeneities. PMID:28092928

  18. Meta-analysis for genome-wide association studies using case-control design: application and practice.

    PubMed

    Shim, Sungryul; Kim, Jiyoung; Jung, Wonguen; Shin, In-Soo; Bae, Jong-Myon

    2016-01-01

    This review aimed to arrange the process of a systematic review of genome-wide association studies in order to practice and apply a genome-wide meta-analysis (GWMA). The process has a series of five steps: searching and selection, extraction of related information, evaluation of validity, meta-analysis by type of genetic model, and evaluation of heterogeneity. In contrast to intervention meta-analyses, GWMA has to evaluate the Hardy-Weinberg equilibrium (HWE) in the third step and conduct meta-analyses by five potential genetic models, including dominant, recessive, homozygote contrast, heterozygote contrast, and allelic contrast in the fourth step. The 'genhwcci' and 'metan' commands of STATA software evaluate the HWE and calculate a summary effect size, respectively. A meta-regression using the 'metareg' command of STATA should be conducted to evaluate related factors of heterogeneities.
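
    The third and fourth steps described above (checking HWE, then computing effect sizes under the five genetic models) can be sketched in a few lines. This is a minimal illustration in Python and not the STATA 'genhwcci' or 'metan' commands named in the review. The function names, the choice of B as the risk allele, and the genotype counts are all assumptions made for the example.

```python
from math import erfc, sqrt

def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) of Hardy-Weinberg equilibrium from genotype counts."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)          # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    p_value = erfc(sqrt(chi2 / 2.0))         # chi-square survival function, 1 df
    return chi2, p_value

def odds_ratio(case_exp, case_unexp, ctrl_exp, ctrl_unexp):
    """Odds ratio of a 2x2 table (exposed vs unexposed, cases vs controls)."""
    return (case_exp * ctrl_unexp) / (case_unexp * ctrl_exp)

def genetic_model_odds_ratios(cases, controls):
    """Odds ratios under five genetic models; counts are (AA, AB, BB), B = risk allele."""
    (ca, cb, cc), (ka, kb, kc) = cases, controls
    return {
        "dominant": odds_ratio(cb + cc, ca, kb + kc, ka),      # AB+BB vs AA
        "recessive": odds_ratio(cc, ca + cb, kc, ka + kb),     # BB vs AA+AB
        "homozygote": odds_ratio(cc, ca, kc, ka),              # BB vs AA
        "heterozygote": odds_ratio(cb, ca, kb, ka),            # AB vs AA
        "allelic": odds_ratio(2 * cc + cb, 2 * ca + cb,
                              2 * kc + kb, 2 * ka + kb),       # B vs A alleles
    }

if __name__ == "__main__":
    print(hwe_chi_square(25, 50, 25))                          # controls in exact HWE
    print(genetic_model_odds_ratios((10, 20, 30), (30, 20, 10)))
```

    In practice HWE is usually checked in the control group of each study, and the per-study odds ratios from one model at a time are then pooled in the meta-analysis.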

  19. Genome-wide association analysis of thirty one production, health, reproduction and body conformation traits in contemporary U.S. Holstein cows

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background Genome-wide association analysis is a powerful tool for annotating phenotypic effects on the genome and knowledge of genes and chromosomal regions associated with dairy phenotypes is useful for genome and gene-based selection. Here, we report results of a genome-wide analysis of predicted...

  20. Genome-Wide Association Study to Identify Common Variants Associated with Brachial Circumference: A Meta-Analysis of 14 Cohorts

    PubMed Central

    Boraska, Vesna; Day-Williams, Aaron; Franklin, Christopher S.; Elliott, Katherine S.; Panoutsopoulou, Kalliope; Tachmazidou, Ioanna; Albrecht, Eva; Bandinelli, Stefania; Beilin, Lawrence J.; Bochud, Murielle; Cadby, Gemma; Ernst, Florian; Evans, David M.; Hayward, Caroline; Hicks, Andrew A.; Huffman, Jennifer; Huth, Cornelia; James, Alan L.; Klopp, Norman; Kolcic, Ivana; Kutalik, Zoltán; Lawlor, Debbie A.; Musk, Arthur W.; Pehlic, Marina; Pennell, Craig E.; Perry, John R. B.; Peters, Annette; Polasek, Ozren; Pourcain, Beate St; Ring, Susan M.; Salvi, Erika; Schipf, Sabine; Staessen, Jan A.; Teumer, Alexander; Timpson, Nicholas; Vitart, Veronique; Warrington, Nicole M.; Yaghootkar, Hanieh; Zemunik, Tatijana; Zgaga, Lina; An, Ping; Anttila, Verneri; Borecki, Ingrid B.; Holmen, Jostein; Ntalla, Ioanna; Palotie, Aarno; Pietiläinen, Kirsi H.; Wedenoja, Juho; Winsvold, Bendik S.; Dedoussis, George V.; Kaprio, Jaakko; Province, Michael A.; Zwart, John-Anker; Burnier, Michel; Campbell, Harry; Cusi, Daniele; Davey Smith, George; Frayling, Timothy M.; Gieger, Christian; Palmer, Lyle J.; Pramstaller, Peter P.; Rudan, Igor; Völzke, Henry; Wichmann, H. -Erich; Wright, Alan F.; Zeggini, Eleftheria

    2012-01-01

    Brachial circumference (BC), also known as upper arm or mid arm circumference, can be used as an indicator of muscle mass and fat tissue, which are distributed differently in men and women. Analysis of anthropometric measures of peripheral fat distribution such as BC could help in understanding the complex pathophysiology behind overweight and obesity. The purpose of this study is to identify genetic variants associated with BC through a large-scale genome-wide association scan (GWAS) meta-analysis. We used fixed-effects meta-analysis to synthesise summary results across 14 GWAS discovery and 4 replication cohorts comprising overall 22,376 individuals (12,031 women and 10,345 men) of European ancestry. Individual analyses were carried out for men, women, and combined across sexes using linear regression and an additive genetic model, once adjusted for age and once adjusted for age and BMI. We prioritised signals for follow-up in two stages. We did not detect any signals reaching genome-wide significance. The FTO rs9939609 SNP showed nominal evidence for association (p<0.05) in the age-adjusted strata for men and across both sexes. In this first GWAS meta-analysis for BC to date, we have not identified any genome-wide significant signals and do not observe robust association of previously established obesity loci with BC. Large-scale collaborations will be necessary to achieve higher power to detect loci underlying BC. PMID:22479309
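
    The fixed-effects meta-analysis mentioned above is commonly implemented as inverse-variance weighting of per-cohort effect estimates. The sketch below shows that standard calculation for a single SNP. It is an assumption that the study used this exact form, and the function name and input values are illustrative only.

```python
from math import erfc, sqrt

def fixed_effects_meta(betas, standard_errors):
    """Inverse-variance weighted fixed-effects estimate for one SNP.

    betas: per-cohort regression coefficients (same allele, same units).
    standard_errors: matching per-cohort standard errors.
    Returns (pooled beta, pooled SE, z score, two-sided p-value).
    """
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_weight
    pooled_se = sqrt(1.0 / total_weight)
    z = pooled_beta / pooled_se
    p_value = erfc(abs(z) / sqrt(2.0))   # two-sided normal tail probability
    return pooled_beta, pooled_se, z, p_value

if __name__ == "__main__":
    # Two hypothetical cohorts with equal precision.
    print(fixed_effects_meta([0.1, 0.3], [0.1, 0.1]))
```

    Cohorts with smaller standard errors receive more weight, so large cohorts dominate the pooled estimate. The pooled p-value is then compared against the genome-wide significance threshold.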

  1. Genetic Control of Canine Leishmaniasis: Genome-Wide Association Study and Genomic Selection Analysis

    PubMed Central

    Quilez, Javier; Martínez, Verónica; Woolliams, John A.; Sanchez, Armand; Pong-Wong, Ricardo; Kennedy, Lorna J.; Quinnell, Rupert J.; Ollier, William E. R.; Roura, Xavier; Ferrer, Lluís; Altet, Laura; Francino, Olga

    2012-01-01

    Background: The current disease model for leishmaniasis suggests that only a proportion of infected individuals develop clinical disease, while others are asymptomatically infected due to immune control of infection. The factors that determine whether individuals progress to clinical disease following Leishmania infection are unclear, although previous studies suggest a role for host genetics. Our hypothesis was that canine leishmaniasis is a complex disease with multiple loci responsible for the progression of the disease from Leishmania infection. Methodology/Principal Findings: Genome-wide association and genomic selection approaches were applied to a population-based case-control dataset of 219 dogs from a single breed (Boxer) genotyped for ∼170,000 SNPs. Firstly, we aimed to identify individual disease loci; secondly, we quantified the genetic component of the observed phenotypic variance; and thirdly, we tested whether genome-wide SNP data could accurately predict the disease. Conclusions/Significance: We estimated that a substantial proportion of the genome is affecting the trait and that its heritability could be as high as 60%. Using the genome-wide association approach, the strongest associations were on chromosomes 1, 4 and 20, although none of these were statistically significant at a genome-wide level and after correcting for genetic stratification and lifestyle. Amongst these associations, chromosome 4: 61.2–76.9 Mb maps to a locus that has previously been associated with host susceptibility to human and murine leishmaniasis, and genomic selection estimated markers in this region to have the greatest effect on the phenotype. We therefore propose these regions as candidates for replication studies. An important finding of this study was the significant predictive value from using the genomic information. We found that the phenotype could be predicted with an accuracy of ∼0.29 in new samples and that the affection status was correctly predicted in 60

  2. Genome-Wide Association Analysis of Blood Biomarkers in Chronic Obstructive Pulmonary Disease

    PubMed Central

    Kim, Deog Kyeom; Cho, Michael H.; Hersh, Craig P.; Lomas, David A.; Miller, Bruce E.; Kong, Xiangyang; Bakke, Per; Gulsvik, Amund; Agustí, Alvar; Wouters, Emiel; Celli, Bartolome; Coxson, Harvey; Vestbo, Jørgen; MacNee, William; Yates, Julie C.; Rennard, Stephen; Litonjua, Augusto; Qiu, Weiliang; Beaty, Terri H.; Crapo, James D.; Riley, John H.; Tal-Singer, Ruth

    2012-01-01

    Rationale: A genome-wide association study (GWAS) for circulating chronic obstructive pulmonary disease (COPD) biomarkers could identify genetic determinants of biomarker levels and COPD susceptibility. Objectives: To identify genetic variants of circulating protein biomarkers and novel genetic determinants of COPD. Methods: GWAS was performed for two pneumoproteins, Clara cell secretory protein (CC16) and surfactant protein D (SP-D), and five systemic inflammatory markers (C-reactive protein, fibrinogen, IL-6, IL-8, and tumor necrosis factor-α) in 1,951 subjects with COPD. For genome-wide significant single nucleotide polymorphisms (SNPs) (P < 1 × 10^-8), association with COPD susceptibility was tested in 2,939 cases with COPD and 1,380 smoking control subjects. The association of candidate SNPs with mRNA expression in induced sputum was also elucidated. Measurements and Main Results: Genome-wide significant susceptibility loci affecting biomarker levels were found only for the two pneumoproteins. Two discrete loci affecting CC16, one region near the CC16 coding gene (SCGB1A1) on chromosome 11 and another locus approximately 25 Mb away from SCGB1A1, were identified, whereas multiple SNPs on chromosomes 6 and 16, in addition to SNPs near SFTPD, had genome-wide significant associations with SP-D levels. Several SNPs affecting circulating CC16 levels were significantly associated with sputum mRNA expression of SCGB1A1 (P = 0.009–0.03). Several SNPs highly associated with CC16 or SP-D levels were nominally associated with COPD in a collaborative GWAS (P = 0.001–0.049), although these COPD associations were not replicated in two additional cohorts. Conclusions: Distant genetic loci and biomarker-coding genes affect circulating levels of COPD-related pneumoproteins. A subset of these protein quantitative trait loci may influence their gene expression in the lung and/or COPD susceptibility. Clinical trial registered with www.clinicaltrials.gov (NCT 00292552). PMID

  3. Meta-analysis of genome-wide association studies identifies 10 loci influencing allergic sensitization

    PubMed Central

    Granell, Raquel; Strachan, David P; Alves, Alexessander Couto; Linneberg, Allan; Curtin, John A; Warrington, Nicole M; Standl, Marie; Kerkhof, Marjan; Jonsdottir, Ingileif; Bukvic, Blazenka K; Kaakinen, Marika; Sleimann, Patrick; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Schramm, Katharina; Baltic, Svetlana; Kreiner-Møller, Eskil; Simpson, Angela; St Pourcain, Beate; Coin, Lachlan; Hui, Jennie; Walters, Eugene H; Tiesler, Carla M T; Duffy, David L; Jones, Graham; Ring, Susan M; McArdle, Wendy L; Price, Loren; Robertson, Colin F; Pekkanen, Juha; Tang, Clara S; Thiering, Elisabeth; Montgomery, Grant W; Hartikainen, Anna-Liisa; Dharmage, Shyamali C; Husemoen, Lise L; Herder, Christian; Kemp, John P; Elliot, Paul; James, Alan; Waldenberger, Melanie; Abramson, Michael J; Fairfax, Benjamin P; Knight, Julian C; Gupta, Ramneek; Thompson, Philip J; Holt, Patrick; Sly, Peter; Hirschhorn, Joel N; Blekic, Mario; Weidinger, Stephan; Hakonarsson, Hakon; Stefansson, Kari; Heinrich, Joachim; Postma, Dirkje S; Custovic, Adnan; Pennell, Craig E; Jarvelin, Marjo-Riitta; Koppelman, Gerard H; Timpson, Nicholas; Ferreira, Manuel A; Bisgaard, Hans; Henderson, A John

    2016-01-01

    Allergen-specific IgE (allergic sensitization) plays a central role in the pathogenesis of allergic disease. We performed the first large-scale genome wide association study (GWAS) of allergic sensitization in 5,789 affected individuals and 10,056 controls and followed up the top SNP from 26 loci in 6,114 affected individuals and 9,920 controls. We increased the number of susceptibility loci with genome-wide significant association to allergic sensitization from three to 10, including SNPs in or near TLR6, C11orf30, STAT6, SLC25A46, HLA-DQB1, IL1RL1, LPP, MYC, IL2 and HLA-B. All the top-SNPs were associated with allergic symptoms in an independent study. Risk variants at these 10 loci were estimated to account for at least 25% of allergic sensitization and allergic rhinitis. Understanding the molecular mechanisms underlying these associations may provide novel insight into the etiology of allergic disease. PMID:23817571

  4. Genome-Wide Analysis Identifies Germ-Line Risk Factors Associated with Canine Mammary Tumours

    PubMed Central

    Melin, Malin; Murén, Eva; Gustafson, Ulla; Starkey, Mike; Borge, Kaja Sverdrup; Lingaas, Frode; Saellström, Sara; Rönnberg, Henrik; Lindblad-Toh, Kerstin

    2016-01-01

    Canine mammary tumours (CMT) are the most common neoplasia in unspayed female dogs. CMTs are suitable naturally occurring models for human breast cancer and share many characteristics, indicating that the genetic causes could also be shared. We have performed a genome-wide association study (GWAS) in English Springer Spaniel dogs and identified a genome-wide significant locus on chromosome 11 (p_raw = 5.6 × 10^-7, p_perm = 0.019). The most associated haplotype spans a 446 kb region overlapping the CDK5RAP2 gene. The CDK5RAP2 protein has a function in cell cycle regulation and could potentially have an impact on response to chemotherapy treatment. Two additional loci, both on chromosome 27, were nominally associated (p_raw = 1.97 × 10^-5 and p_raw = 8.30 × 10^-6). The three loci explain 28.1±10.0% of the phenotypic variation seen in the cohort, whereas the top ten associated regions account for 38.2±10.8% of the risk. Furthermore, the ten GWAS loci and regions with reduced genetic variability are significantly enriched for snoRNAs and tumour-associated antigen genes, suggesting a role for these genes in CMT development. We have identified several candidate genes associated with canine mammary tumours, including CDK5RAP2. Our findings enable further comparative studies to investigate the genes and pathways in human breast cancer patients. PMID:27158822
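
    The abstract above reports a permutation-based p-value alongside the raw one. As a rough illustration of how such an empirical, genome-wide corrected p-value can be obtained, here is a minimal Python sketch of a max-statistic permutation test. The test statistic (absolute difference in mean allele count), the function names, and the toy data are all assumptions made for illustration; the abstract does not say which software or statistic the study used.

```python
import random


def allele_count_diff(genotypes, labels):
    """Absolute difference in mean allele count between cases (1) and controls (0)."""
    cases = [g for g, y in zip(genotypes, labels) if y == 1]
    controls = [g for g, y in zip(genotypes, labels) if y == 0]
    return abs(sum(cases) / len(cases) - sum(controls) / len(controls))


def max_stat_permutation_pvalues(snps, labels, n_perm=1000, seed=0):
    """Empirical p-values corrected across all SNPs.

    For each permutation of the case/control labels, the largest statistic
    over all SNPs is recorded; a SNP's p-value is the fraction of
    permutations whose maximum reaches its observed statistic.
    """
    rng = random.Random(seed)
    observed = [allele_count_diff(g, labels) for g in snps]
    exceed = [0] * len(snps)
    shuffled = list(labels)
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # keeps the case/control counts fixed
        perm_max = max(allele_count_diff(g, shuffled) for g in snps)
        for i, obs in enumerate(observed):
            if perm_max >= obs:
                exceed[i] += 1
    # The +1 terms count the observed data as one permutation.
    return [(1 + e) / (1 + n_perm) for e in exceed]


# Hypothetical toy data: 10 cases followed by 10 controls, two SNPs.
labels = [1] * 10 + [0] * 10
snp_associated = [2] * 10 + [0] * 10  # perfectly separates cases from controls
snp_null = [0, 1] * 10                # same mean in both groups
p_values = max_stat_permutation_pvalues([snp_associated, snp_null], labels, n_perm=200)
```

    With this scheme the smallest attainable p-value is 1 / (n_perm + 1), so the number of permutations bounds the reported precision.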

  5. Meta-analysis of Genome-Wide Association Studies for Extraversion: Findings from the Genetics of Personality Consortium.

    PubMed

    van den Berg, Stéphanie M; de Moor, Marleen H M; Verweij, Karin J H; Krueger, Robert F; Luciano, Michelle; Arias Vasquez, Alejandro; Matteson, Lindsay K; Derringer, Jaime; Esko, Tõnu; Amin, Najaf; Gordon, Scott D; Hansell, Narelle K; Hart, Amy B; Seppälä, Ilkka; Huffman, Jennifer E; Konte, Bettina; Lahti, Jari; Lee, Minyoung; Miller, Mike; Nutile, Teresa; Tanaka, Toshiko; Teumer, Alexander; Viktorin, Alexander; Wedenoja, Juho; Abdellaoui, Abdel; Abecasis, Goncalo R; Adkins, Daniel E; Agrawal, Arpana; Allik, Jüri; Appel, Katja; Bigdeli, Timothy B; Busonero, Fabio; Campbell, Harry; Costa, Paul T; Smith, George Davey; Davies, Gail; de Wit, Harriet; Ding, Jun; Engelhardt, Barbara E; Eriksson, Johan G; Fedko, Iryna O; Ferrucci, Luigi; Franke, Barbara; Giegling, Ina; Grucza, Richard; Hartmann, Annette M; Heath, Andrew C; Heinonen, Kati; Henders, Anjali K; Homuth, Georg; Hottenga, Jouke-Jan; Iacono, William G; Janzing, Joost; Jokela, Markus; Karlsson, Robert; Kemp, John P; Kirkpatrick, Matthew G; Latvala, Antti; Lehtimäki, Terho; Liewald, David C; Madden, Pamela A F; Magri, Chiara; Magnusson, Patrik K E; Marten, Jonathan; Maschio, Andrea; Mbarek, Hamdi; Medland, Sarah E; Mihailov, Evelin; Milaneschi, Yuri; Montgomery, Grant W; Nauck, Matthias; Nivard, Michel G; Ouwens, Klaasjan G; Palotie, Aarno; Pettersson, Erik; Polasek, Ozren; Qian, Yong; Pulkki-Råback, Laura; Raitakari, Olli T; Realo, Anu; Rose, Richard J; Ruggiero, Daniela; Schmidt, Carsten O; Slutske, Wendy S; Sorice, Rossella; Starr, John M; St Pourcain, Beate; Sutin, Angelina R; Timpson, Nicholas J; Trochet, Holly; Vermeulen, Sita; Vuoksimaa, Eero; Widen, Elisabeth; Wouda, Jasper; Wright, Margaret J; Zgaga, Lina; Porteous, David; Minelli, Alessandra; Palmer, Abraham A; Rujescu, Dan; Ciullo, Marina; Hayward, Caroline; Rudan, Igor; Metspalu, Andres; Kaprio, Jaakko; Deary, Ian J; Räikkönen, Katri; Wilson, James F; Keltikangas-Järvinen, Liisa; Bierut, Laura J; Hettema, John M; Grabe, Hans J; Penninx, Brenda W J H; 
    van Duijn, Cornelia M; Evans, David M; Schlessinger, David; Pedersen, Nancy L; Terracciano, Antonio; McGue, Matt; Martin, Nicholas G; Boomsma, Dorret I

    2016-03-01

    Extraversion is a relatively stable and heritable personality trait associated with numerous psychosocial, lifestyle and health outcomes. Despite its substantial heritability, no genetic variants have been detected in previous genome-wide association (GWA) studies, which may be due to relatively small sample sizes of those studies. Here, we report on a large meta-analysis of GWA studies for extraversion in 63,030 subjects in 29 cohorts. Extraversion item data from multiple personality inventories were harmonized across inventories and cohorts. No genome-wide significant associations were found at the single nucleotide polymorphism (SNP) level but there was one significant hit at the gene level for a long non-coding RNA site (LOC101928162). Genome-wide complex trait analysis in two large cohorts showed that the additive variance explained by common SNPs was not significantly different from zero, but polygenic risk scores, weighted using linkage information, significantly predicted extraversion scores in an independent cohort. These results show that extraversion is a highly polygenic personality trait, with an architecture possibly different from other complex human traits, including other personality traits. Future studies are required to further determine which genetic variants, by what modes of gene action, constitute the heritable nature of extraversion.
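
    The abstract above mentions polygenic risk scores used to predict extraversion in an independent cohort. As a hedged illustration of the basic idea only, the sketch below computes a score as a weighted sum of allele dosages. The function name, weights, and dosages are hypothetical, and the linkage-informed weighting described in the abstract is not reproduced here.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of allele dosages (0, 1 or 2 copies of the effect allele).

    `weights` stands in for per-SNP effect estimates from a discovery
    GWAS; both inputs are hypothetical placeholders in this sketch.
    """
    if len(dosages) != len(weights):
        raise ValueError("dosages and weights must have the same length")
    return sum(d * w for d, w in zip(dosages, weights))


# Hypothetical individual genotyped at three SNPs.
example_score = polygenic_score([0, 1, 2], [0.5, -0.2, 0.1])
```

    In practice the scores for a whole cohort would then be correlated with, or regressed on, the measured trait to assess prediction.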

  6. Genome-wide association study of swine farrowing traits. Part II: Bayesian analysis of marker data.

    PubMed

    Schneider, J F; Rempel, L A; Snelling, W M; Wiedmann, R T; Nonneman, D J; Rohrer, G A

    2012-10-01

    Reproductive efficiency has a great impact on the economic success of pork (Sus scrofa) production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome-wide association study (GWAS) was undertaken. Samples of DNA were collected and tested using the Illumina PorcineSNP60 BeadChip from 1,152 first parity gilts. Traits included total number born (TNB), NBA, number born dead (NBD), number stillborn (NSB), number of mummies (MUM), total litter birth weight (LBW), and ABW. A total of 41,151 SNP were tested using a Bayesian approach. Beginning with the first 5 SNP on SSC1 and ending with the last 5 SNP on the SSCX, SNP were assigned to groups of 5 consecutive SNP by chromosome-position order and analyzed again using a Bayesian approach. From that analysis, 5-SNP groups were selected having no overlap with any other 5-SNP group and no overlap across chromosomes. These selected 5-SNP non-overlapping groups were defined as QTL. Of the available 8,814 QTL, 124 were found to be statistically significant (P < 0.01). Multiple testing was considered using the probability of false positives. Eleven QTL were found for TNB, 3 on SSC1, 3 on SSC4, 1 on SSC13, 1 on SSC14, 2 on SSC15, and 1 on SSC17. Statistical testing for NBA identified 14 QTL, 4 on SSC1, 1 on SSC4, 1 on SSC6, 1 on SSC10, 1 on SSC13, 3 on SSC15, and 3 on SSC17. A single NBD QTL was found on SSC11. No QTL were identified for NSB or MUM. Thirty-three QTL were found for LBW, 3 on SSC1, 1 on SSC2, 1 on SSC3, 5 on SSC4, 2 on SSC5, 5 on SSC6, 3 on SSC7, 2 on SSC9, 1 on SSC10, 2 on SSC14, 6 on SSC15, and 2 on SSC17. A total of 65 QTL were found for ABW, 9 on SSC1, 3 on SSC2, 9 on SSC5, 5 on SSC6, 1 on SSC7, 2 on SSC8, 2 on SSC9, 3 on SSC10, 1 on SSC11, 3 on SSC12, 2 on SSC13, 8 on SSC14, 8 on SSC15, 1 on SSC17, and 8 on SSC18. Several candidate genes have been identified that overlap QTL locations

  7. Genome-wide random regression analysis for parent-of-origin effects of body composition allometries in mouse

    PubMed Central

    Zhao, Jingli; Li, Shuling; Wang, Lijuan; Jiang, Li; Yang, Runqing; Cui, Yuehua

    2017-01-01

    Genomic imprinting underlying growth and development traits has been recognized, with a focus on the form of absolute or pure growth. However, little is known about the effect of genomic imprinting on relative growth. In this study, we proposed a random regression model to estimate genome-wide imprinting effects on the relative growth of multiple tissues and organs to body weight in mice. A joint static allometry scaling equation is nested as a sub-model within the genetic effects of markers and the polygenic effects caused by a pedigree. Both chromosome-wide and genome-wide statistical tests were conducted to identify imprinted quantitative trait nucleotides (QTNs) associated with relative growth of individual tissues and organs to body weight. Real data analysis showed that three of six analysed tissues and organs are significantly associated with body weight in terms of phenotypic relative growth. At the chromosome-wide level, a total of 122 QTNs were associated with allometries of kidney, spleen and liver weights to body weight, 36 of which were imprinted with different imprinting patterns. Further, only two imprinted QTNs responsible for relative growth of spleen and liver were verified by the genome-wide test. Our approach provides a general framework for statistical inference of genomic imprinting underlying allometry scaling in animals. PMID:28338098

  8. Genome-wide analysis of runs of homozygosity identifies new susceptibility regions of lung cancer in Han Chinese

    PubMed Central

    Wang, Cheng; Xu, Zhengfeng; Jin, Guangfu; Hu, Zhibin; Dai, Juncheng; Ma, Hongxia; Jiang, Yue; Hu, Lingmin; Chu, Minjie; Cao, Songyu; Shen, Hongbing

    2013-01-01

    Runs of homozygosity (ROHs) are a class of important but poorly studied genomic variations and may be involved in individual susceptibility to diseases. To better understand ROH and its relationship with lung cancer, we performed a genome-wide ROH analysis of a subset of a previous genome-wide case-control study (1,473 cases and 1,962 controls) in a Han Chinese population. ROHs were classified into two classes, based on lengths, intermediate and long ROHs, to evaluate their association with lung cancer risk using existing genome-wide single nucleotide polymorphism (SNP) data. We found that the overall level of intermediate ROHs was significantly associated with a decreased risk of lung cancer (odds ratio = 0.63; 95% confidence interval: 0.51-0.77; P = 4.78 × 10^-6), while the long ROHs seemed to be a risk factor of lung cancer. We also identified one ROH region at 14q23.1 that was consistently associated with lung cancer risk in the study. These results indicated that ROHs may be a new class of variation which may be associated with lung cancer risk, and genetic variants at 14q23.1 may be involved in the development of lung cancer. PMID:23720676
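
    The odds ratio, confidence interval and P value quoted above are linked through the standard error of the log odds ratio. The Python sketch below shows one way to back out that standard error and an approximate Wald test from a reported interval. It assumes a symmetric Wald interval on the log scale, which the abstract does not confirm, and the function names are hypothetical. Because the published figures are rounded, the result only approximates the reported P value.

```python
import math
from statistics import NormalDist


def se_from_ci(lower, upper, level=0.95):
    """Standard error of log(OR), assuming a symmetric Wald interval on the log scale."""
    z_crit = NormalDist().inv_cdf(0.5 + level / 2)
    return (math.log(upper) - math.log(lower)) / (2 * z_crit)


def wald_test(odds_ratio, lower, upper, level=0.95):
    """Return (standard error, z statistic, two-sided p-value) for log(OR) = 0."""
    se = se_from_ci(lower, upper, level)
    z = math.log(odds_ratio) / se
    p = 2 * NormalDist().cdf(-abs(z))
    return se, z, p


# Rounded values quoted in the abstract above.
se, z, p = wald_test(0.63, 0.51, 0.77)
```

    A negative z here corresponds to an odds ratio below 1, that is, a decreased risk.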

  9. Genome-wide SNP analysis explains coral diversity and recovery in the Ryukyu Archipelago

    PubMed Central

    Shinzato, Chuya; Mungpakdee, Sutada; Arakaki, Nana; Satoh, Noriyuki

    2015-01-01

    Following a global coral bleaching event in 1998, Acropora corals surrounding most of Okinawa island (OI) were devastated, although they are now gradually recovering. In contrast, the Kerama Islands (KIs), only 30 km west of OI, have continuously hosted a great variety of healthy corals. Taking advantage of the decoded Acropora digitifera genome and using genome-wide SNP analyses, we clarified Acropora population structure in the southern Ryukyu Archipelago (sRA). Despite small genetic distances, we identified distinct clusters corresponding to specific island groups, suggesting infrequent long-distance dispersal within the sRA. Although the KIs were believed to supply coral larvae to OI, admixture analyses showed that such dispersal is much more limited than previously realized, indicating independent recovery of OI coral populations and the necessity of local conservation efforts for each region. We detected strong historical migration from the Yaeyama Islands (YIs) to OI, and suggest that the YIs are the original source of OI corals. In addition, migration edges to the KIs suggest that they are a historical sink population in the sRA, resulting in high diversity. This population genomics study provides the highest resolution data to date regarding coral population structure and history. PMID:26656261

  10. Genome-wide methylation analysis identified sexually dimorphic methylated regions in hybrid tilapia

    PubMed Central

    Wan, Zi Yi; Xia, Jun Hong; Lin, Grace; Wang, Le; Lin, Valerie C. L.; Yue, Gen Hua

    2016-01-01

    Sexual dimorphism is an interesting biological phenomenon. Previous studies showed that DNA methylation might play a role in sexual dimorphism. However, the overall picture of the genome-wide methylation landscape in sexually dimorphic species remains unclear. We analyzed the DNA methylation landscape and transcriptome in hybrid tilapia (Oreochromis spp.) using whole genome bisulfite sequencing (WGBS) and RNA-sequencing (RNA-seq). We found 4,757 sexually dimorphic differentially methylated regions (DMRs), with significant clusters of DMRs located on chromosomal regions associated with sex determination. CpG methylation in promoter regions was negatively correlated with the gene expression level. The MAPK/ERK pathway was upregulated in male tilapia. We also inferred active cis-regulatory regions (ACRs) in skeletal muscle tissues from WGBS datasets, revealing sexually dimorphic cis-regulatory regions. These results suggest that DNA methylation contributes to sex-specific phenotypes and serve as resources for further investigation to analyze the functions of these regions and their contributions towards sexual dimorphisms. PMID:27782217

  11. Genome-wide analysis of genetic susceptibility to language impairment in an isolated Chilean population

    PubMed Central

    Villanueva, Pia; Newbury, Dianne F; Jara, Lilian; De Barbieri, Zulema; Mirza, Ghazala; Palomino, Hernán M; Fernández, María Angélica; Cazier, Jean-Baptiste; Monaco, Anthony P; Palomino, Hernán

    2011-01-01

    Specific language impairment (SLI) is an unexpected deficit in the acquisition of language skills and affects between 5 and 8% of pre-school children. Despite its prevalence and high heritability, our understanding of the aetiology of this disorder is only emerging. In this paper, we apply genome-wide techniques to investigate an isolated Chilean population who exhibit an increased frequency of SLI. Loss of heterozygosity (LOH) mapping and parametric and non-parametric linkage analyses indicate that complex genetic factors are likely to underlie susceptibility to SLI in this population. Across all analyses performed, the most consistently implicated locus was on chromosome 7q. This locus achieved highly significant linkage under all three non-parametric models (max NPL = 6.73, P = 4.0 × 10^-11). In addition, it yielded a HLOD of 1.24 in the recessive parametric linkage analyses and contained a segment that was homozygous in two affected individuals. Further, investigation of this region identified a two-SNP haplotype that occurs at an increased frequency in language-impaired individuals (P = 0.008). We hypothesise that the linkage regions identified here, in particular that on chromosome 7, may contain variants that underlie the high prevalence of SLI observed in this isolated population and may be of relevance to other populations affected by language impairments. PMID:21248734

  12. Genome-wide association analysis of feed intake and residual feed intake in Nellore cattle

    PubMed Central

    2014-01-01

    Background: Feed intake plays an important economic role in beef cattle and is related to feed efficiency, weight gain and carcass traits. However, phenotypes collected for dry matter intake and feed efficiency are scarce compared with other measures such as weight gain and carcass traits. The use of genomic information can improve the power of inference of studies on these measures by identifying genomic regions that affect these phenotypes. This work performed a genome-wide association study (GWAS) for dry matter intake (DMI) and residual feed intake (RFI) in 720 Nellore cattle (Bos taurus indicus). Results: In general, no genomic region strongly associated with both phenotypic traits was observed, as expected for variables whose regulation is controlled by many genes. Three SNPs surpassed the Bonferroni multiple-testing threshold for DMI and two SNPs for RFI. These markers are located on chromosomes 4, 8, 14 and 21 in regions near genes regulating appetite and ion transport and close to important QTL previously reported for RFI and DMI, thus corroborating the literature that points to these two processes as important in the physiological regulation of intake and feed efficiency. Conclusions: This study presents the first GWAS of DMI to identify genomic regions associated with feed intake and efficiency in Nellore cattle. Some genes and QTLs previously described for DMI and RFI in other subspecies (Bos taurus taurus) that influence these phenotypes are confirmed in this study. PMID:24517472
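
    The results above refer to SNPs that surpass a Bonferroni multiple-testing threshold. As a minimal sketch of that correction, the Python below divides a family-wise error rate by the number of tests and keeps the SNPs whose p-values fall under the result. The abstract does not give the number of SNPs tested, so the SNP names, p-values, and the alpha of 0.05 are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold that controls the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def significant_snps(p_values, alpha=0.05):
    """Names of SNPs whose p-value is below the Bonferroni threshold.

    `p_values` maps SNP name -> p-value; the number of tests is taken
    to be the number of entries in the mapping.
    """
    threshold = bonferroni_threshold(alpha, len(p_values))
    return [name for name, p in p_values.items() if p < threshold]


# Hypothetical p-values for three SNPs (threshold = 0.05 / 3).
hits = significant_snps({"snp_a": 1e-9, "snp_b": 0.02, "snp_c": 0.3})
```

    With hundreds of thousands of markers on a typical array, the same rule yields thresholds on the order of 10^-7, which is why few SNPs pass for highly polygenic traits.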

  13. Genome-wide and fine-resolution association analysis of malaria in West Africa.

    PubMed

    Jallow, Muminatou; Teo, Yik Ying; Small, Kerrin S; Rockett, Kirk A; Deloukas, Panos; Clark, Taane G; Kivinen, Katja; Bojang, Kalifa A; Conway, David J; Pinder, Margaret; Sirugo, Giorgio; Sisay-Joof, Fatou; Usen, Stanley; Auburn, Sarah; Bumpstead, Suzannah J; Campino, Susana; Coffey, Alison; Dunham, Andrew; Fry, Andrew E; Green, Angela; Gwilliam, Rhian; Hunt, Sarah E; Inouye, Michael; Jeffreys, Anna E; Mendy, Alieu; Palotie, Aarno; Potter, Simon; Ragoussis, Jiannis; Rogers, Jane; Rowlands, Kate; Somaskantharajah, Elilan; Whittaker, Pamela; Widden, Claire; Donnelly, Peter; Howie, Bryan; Marchini, Jonathan; Morris, Andrew; SanJoaquin, Miguel; Achidi, Eric Akum; Agbenyega, Tsiri; Allen, Angela; Amodu, Olukemi; Corran, Patrick; Djimde, Abdoulaye; Dolo, Amagana; Doumbo, Ogobara K; Drakeley, Chris; Dunstan, Sarah; Evans, Jennifer; Farrar, Jeremy; Fernando, Deepika; Hien, Tran Tinh; Horstmann, Rolf D; Ibrahim, Muntaser; Karunaweera, Nadira; Kokwaro, Gilbert; Koram, Kwadwo A; Lemnge, Martha; Makani, Julie; Marsh, Kevin; Michon, Pascal; Modiano, David; Molyneux, Malcolm E; Mueller, Ivo; Parker, Michael; Peshu, Norbert; Plowe, Christopher V; Puijalon, Odile; Reeder, John; Reyburn, Hugh; Riley, Eleanor M; Sakuntabhai, Anavaj; Singhasivanon, Pratap; Sirima, Sodiomon; Tall, Adama; Taylor, Terrie E; Thera, Mahamadou; Troye-Blomberg, Marita; Williams, Thomas N; Wilson, Michael; Kwiatkowski, Dominic P

    2009-06-01

    We report a genome-wide association (GWA) study of severe malaria in The Gambia. The initial GWA scan included 2,500 children genotyped on the Affymetrix 500K GeneChip, and a replication study included 3,400 children. We used this to examine the performance of GWA methods in Africa. We found considerable population stratification, and also that signals of association at known malaria resistance loci were greatly attenuated owing to weak linkage disequilibrium (LD). To investigate possible solutions to the problem of low LD, we focused on the HbS locus, sequencing this region of the genome in 62 Gambian individuals and then using these data to conduct multipoint imputation in the GWA samples. This increased the signal of association, from P = 4 × 10^-7 to P = 4 × 10^-14, with the peak of the signal located precisely at the HbS causal variant. Our findings provide proof of principle that fine-resolution multipoint imputation, based on population-specific sequencing data, can substantially boost authentic GWA signals and enable fine mapping of causal variants in African populations.

  14. Genome-wide analysis of LXRα activation reveals new transcriptional networks in human atherosclerotic foam cells.

    PubMed

    Feldmann, Radmila; Fischer, Cornelius; Kodelja, Vitam; Behrens, Sarah; Haas, Stefan; Vingron, Martin; Timmermann, Bernd; Geikowski, Anne; Sauer, Sascha

    2013-04-01

    Increased physiological levels of oxysterols are major risk factors for developing atherosclerosis and cardiovascular disease. Lipid-loaded macrophages, termed foam cells, are important during the early development of atherosclerotic plaques. To pursue the hypothesis that ligand-based modulation of the nuclear receptor LXRα is crucial for cell homeostasis during atherosclerotic processes, we analysed genome-wide the action of LXRα in foam cells and macrophages. By integrating chromatin immunoprecipitation-sequencing (ChIP-seq) and gene expression profile analyses, we generated a highly stringent set of 186 LXRα target genes. Treatment with the nanomolar-binding ligand T0901317 and subsequent auto-regulatory LXRα activation resulted in sequence-dependent sharpening of the genome-binding patterns of LXRα. LXRα-binding loci that correlated with differential gene expression revealed 32 novel target genes with potential beneficial effects, which in part explained the implications of disease-associated genetic variation data. These observations identified highly integrated LXRα ligand-dependent transcriptional networks, including the APOE/C1/C4/C2-gene cluster, which contribute to the reversal of cholesterol efflux and the dampening of inflammation processes in foam cells to prevent atherogenesis.

  15. Genome-wide SNP analysis explains coral diversity and recovery in the Ryukyu Archipelago.

    PubMed

    Shinzato, Chuya; Mungpakdee, Sutada; Arakaki, Nana; Satoh, Noriyuki

    2015-12-10

    Following a global coral bleaching event in 1998, Acropora corals surrounding most of Okinawa island (OI) were devastated, although they are now gradually recovering. In contrast, the Kerama Islands (KIs), only 30 km west of OI, have continuously hosted a great variety of healthy corals. Taking advantage of the decoded Acropora digitifera genome and using genome-wide SNP analyses, we clarified Acropora population structure in the southern Ryukyu Archipelago (sRA). Despite small genetic distances, we identified distinct clusters corresponding to specific island groups, suggesting infrequent long-distance dispersal within the sRA. Although the KIs were believed to supply coral larvae to OI, admixture analyses showed that such dispersal is much more limited than previously realized, indicating independent recovery of OI coral populations and the necessity of local conservation efforts for each region. We detected strong historical migration from the Yaeyama Islands (YIs) to OI, and suggest that the YIs are the original source of OI corals. In addition, migration edges to the KIs suggest that they are a historical sink population in the sRA, resulting in high diversity. This population genomics study provides the highest resolution data to date regarding coral population structure and history.

  16. Efficient Genome-Wide Sequencing and Low-Coverage Pedigree Analysis from Noninvasively Collected Samples.

    PubMed

    Snyder-Mackler, Noah; Majoros, William H; Yuan, Michael L; Shaver, Amanda O; Gordon, Jacob B; Kopp, Gisela H; Schlebusch, Stephen A; Wall, Jeffrey D; Alberts, Susan C; Mukherjee, Sayan; Zhou, Xiang; Tung, Jenny

    2016-06-01

    Research on the genetics of natural populations was revolutionized in the 1990s by methods for genotyping noninvasively collected samples. However, these methods have remained largely unchanged for the past 20 years and lag far behind the genomics era. To close this gap, here we report an optimized laboratory protocol for genome-wide capture of endogenous DNA from noninvasively collected samples, coupled with a novel computational approach to reconstruct pedigree links from the resulting low-coverage data. We validated both methods using fecal samples from 62 wild baboons, including 48 from an independently constructed extended pedigree. We enriched fecal-derived DNA samples up to 40-fold for endogenous baboon DNA and reconstructed near-perfect pedigree relationships even with extremely low-coverage sequencing. We anticipate that these methods will be broadly applicable to the many research systems for which only noninvasive samples are available. The lab protocol and software ("WHODAD") are freely available at www.tung-lab.org/protocols-and-software.html and www.xzlab.org/software.html, respectively.

  17. Genome-Wide Analysis of Self-Renewal in Drosophila Neural Stem Cells by Transgenic RNAi

    PubMed Central

    Neumüller, Ralph A.; Richter, Constance; Fischer, Anja; Novatchkova, Maria; Neumüller, Klaus G.; Knoblich, Juergen A.

    2011-01-01

    Summary: The balance between stem cell self-renewal and differentiation is precisely controlled to ensure tissue homeostasis and prevent tumorigenesis. Here we use genome-wide transgenic RNAi to identify 620 genes potentially involved in controlling this balance in Drosophila neuroblasts. We quantify all phenotypes and derive measurements for proliferation, lineage, cell size, and cell shape. We identify a set of transcriptional regulators essential for self-renewal and use hierarchical clustering and integration with interaction data to create functional networks for the control of neuroblast self-renewal and differentiation. Our data identify key roles for the chromatin remodeling Brm complex, the spliceosome, and the TRiC/CCT-complex and show that the alternatively spliced transcription factor Lola and the transcriptional elongation factors Ssrp and Barc control self-renewal in neuroblast lineages. As our data are strongly enriched for genes highly expressed in murine neural stem cells, they are likely to provide valuable insights into mammalian stem cell biology as well. PMID:21549331

  18. Genome-Wide Transcriptional Analysis of the Phosphate Starvation Stimulon of Bacillus subtilis

    PubMed Central

    Allenby, Nicholas E. E.; O'Connor, Nicola; Prágai, Zoltán; Ward, Alan C.; Wipat, Anil; Harwood, Colin R.

    2005-01-01

    Bacillus subtilis responds to phosphate starvation stress by inducing the PhoP and SigB regulons. While the PhoP regulon provides a specific response to phosphate starvation stress, maximizing the acquisition of phosphate (Pi) from the environment and reducing the cellular requirement for this essential nutrient, the SigB regulon provides nonspecific resistance to stress by protecting essential cellular components, such as DNA and membranes. We have characterized the phosphate starvation stress response of B. subtilis at a genome-wide level using DNA macroarrays. A combination of outlier and cluster analyses identified putative new members of the PhoP regulon, namely, yfkN (2′,3′ cyclic nucleotide 2′-phosphodiesterase), yurI (RNase), yjdB (unknown), and vpr (extracellular serine protease). YurI is thought to be responsible for the nonspecific degradation of RNA, while the activity of YfkN on various nucleotide phosphates suggests that it could act on substrates liberated by YurI, which produces 3′ or 5′ phosphoribonucleotides. The putative new PhoP regulon members are either known or predicted to be secreted and are likely to be important for the recovery of inorganic phosphate from a variety of organic sources of phosphate in the environment. PMID:16291680

  19. P-value based analysis for shared controls design in genome-wide association studies.

    PubMed

    Zaykin, Dmitri V; Kozbur, Damian O

    2010-11-01

    An appealing genome-wide association study design compares one large control group against several disease samples. A pioneering study by the Wellcome Trust Case Control Consortium that employed such a design has identified multiple susceptibility regions, many of which have been independently replicated. While reusing a control sample provides effective utilization of data, it also creates correlation between association statistics across diseases. An observation of a large association statistic for one of the diseases may greatly increase chances of observing a spuriously large association for a different disease. Accounting for the correlation is also particularly important when screening for SNPs that might be involved in a set of diseases with overlapping etiology. We describe methods that correct association statistics for dependency due to shared controls, and we describe ways to obtain a measure of overall evidence and to combine association signals across multiple diseases. The methods we describe require no access to individual subject data, instead, they efficiently utilize information contained in P-values for association reported for individual diseases. P-value based combined tests for association are flexible and essentially as powerful as the approach based on aggregating the individual subject data.
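
    The abstract above notes that reusing one control group induces correlation between association statistics across diseases. The Python sketch below illustrates the idea with a standard null-hypothesis approximation for the correlation between two case-control z statistics that share controls, followed by an equal-weight combined test that accounts for it. This is an illustrative sketch, not necessarily the authors' procedure; the function names and sample sizes are hypothetical, and converting a two-sided P-value to a z statistic additionally assumes the direction of effect is known.

```python
import math
from statistics import NormalDist


def shared_control_correlation(n_cases_1, n_cases_2, n_controls):
    """Approximate null correlation between two case-control z statistics
    computed against the same control sample."""
    return math.sqrt(
        (n_cases_1 * n_cases_2)
        / ((n_cases_1 + n_controls) * (n_cases_2 + n_controls))
    )


def z_from_two_sided_p(p, effect_sign):
    """Signed z statistic from a two-sided p-value; effect_sign is +1 or -1."""
    return effect_sign * NormalDist().inv_cdf(1 - p / 2)


def combined_p(z1, z2, r):
    """Two-sided p-value for the equal-weight sum of two z statistics
    whose correlation under the null is r."""
    z = (z1 + z2) / math.sqrt(2 + 2 * r)
    return 2 * NormalDist().cdf(-abs(z))


# Illustrative sizes: two diseases with 2,000 cases each and 3,000 shared controls.
r = shared_control_correlation(2000, 2000, 3000)
z1 = z_from_two_sided_p(0.001, +1)
z2 = z_from_two_sided_p(0.01, +1)
p_corrected = combined_p(z1, z2, r)
p_naive = combined_p(z1, z2, 0.0)  # ignores the shared controls
```

    Ignoring the positive correlation overstates the combined evidence, so `p_naive` is smaller than `p_corrected` whenever the two effects point in the same direction.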

  20. Genome-wide analysis reveals mechanisms modulating autophagy in normal brain aging and in Alzheimer's disease

    PubMed Central

    Lipinski, Marta M.; Zheng, Bin; Lu, Tao; Yan, Zhenyu; Py, Bénédicte F.; Ng, Aylwin; Xavier, Ramnik J.; Li, Cheng; Yankner, Bruce A.; Scherzer, Clemens R.; Yuan, Junying

    2010-01-01

    Dysregulation of autophagy, a cellular catabolic mechanism essential for degradation of misfolded proteins, has been implicated in multiple neurodegenerative diseases. However, the mechanisms that lead to the autophagy dysfunction are still not clear. Based on the results of a genome-wide screen, we show that reactive oxygen species (ROS) serve as common mediators upstream of the activation of the type III PI3 kinase, which is critical for the initiation of autophagy. Furthermore, ROS play an essential function in the induction of the type III PI3 kinase and autophagy in response to amyloid β peptide, the main pathogenic mediator of Alzheimer's disease (AD). However, lysosomal blockage also caused by Aβ is independent of ROS. In addition, we demonstrate that autophagy is transcriptionally down-regulated during normal aging in the human brain. Strikingly, in contrast to normal aging, we observe transcriptional up-regulation of autophagy in the brains of AD patients, suggesting that there might be a compensatory regulation of autophagy. Interestingly, we show that an AD drug and an AD drug candidate have inhibitory effects on autophagy, raising the possibility that decreasing input into the lysosomal system may help to reduce cellular stress in AD. Finally, we provide a list of candidate drug targets that can be used to safely modulate levels of autophagy without causing cell death. PMID:20660724

  1. Efficient Genome-Wide Sequencing and Low-Coverage Pedigree Analysis from Noninvasively Collected Samples

    PubMed Central

    Snyder-Mackler, Noah; Majoros, William H.; Yuan, Michael L.; Shaver, Amanda O.; Gordon, Jacob B.; Kopp, Gisela H.; Schlebusch, Stephen A.; Wall, Jeffrey D.; Alberts, Susan C.; Mukherjee, Sayan; Zhou, Xiang; Tung, Jenny

    2016-01-01

    Research on the genetics of natural populations was revolutionized in the 1990s by methods for genotyping noninvasively collected samples. However, these methods have remained largely unchanged for the past 20 years and lag far behind the genomics era. To close this gap, here we report an optimized laboratory protocol for genome-wide capture of endogenous DNA from noninvasively collected samples, coupled with a novel computational approach to reconstruct pedigree links from the resulting low-coverage data. We validated both methods using fecal samples from 62 wild baboons, including 48 from an independently constructed extended pedigree. We enriched fecal-derived DNA samples up to 40-fold for endogenous baboon DNA and reconstructed near-perfect pedigree relationships even with extremely low-coverage sequencing. We anticipate that these methods will be broadly applicable to the many research systems for which only noninvasive samples are available. The lab protocol and software (“WHODAD”) are freely available at www.tung-lab.org/protocols-and-software.html and www.xzlab.org/software.html, respectively. PMID:27098910

  2. Genome-wide bimolecular fluorescence complementation analysis of SUMO interactome in yeast.

    PubMed

    Sung, Min-Kyung; Lim, Gyubum; Yi, Dae-Gwan; Chang, Yeon Ji; Yang, Eun Bin; Lee, Kiyoung; Huh, Won-Ki

    2013-04-01

    The definition of protein-protein interactions (PPIs) in the natural cellular context is essential for properly understanding various biological processes. So far, however, most large-scale PPI analyses have not been performed in the natural cellular context. Here, we describe the construction of a Saccharomyces cerevisiae fusion library in which each endogenous gene is C-terminally tagged with the N-terminal fragment of Venus (VN) for a genome-wide bimolecular fluorescence complementation assay, a powerful technique for identifying PPIs in living cells. We illustrate the utility of the VN fusion library by systematically analyzing the interactome of the small ubiquitin-related modifier (SUMO) and provide previously unavailable information on the subcellular localization, types, and protease dependence of SUMO interactions. Our data set is highly complementary to the existing data sets and represents a useful resource for expanding the understanding of the physiological roles of SUMO. In addition, the VN fusion library provides a useful research tool that makes it feasible to systematically analyze PPIs in the natural cellular context.

  3. Genome-Wide Methylation Analysis Identifies Specific Epigenetic Marks In Severely Obese Children

    PubMed Central

    Fradin, Delphine; Boëlle, Pierre-Yves; Belot, Marie-Pierre; Lachaux, Fanny; Tost, Jorg; Besse, Céline; Deleuze, Jean-François; De Filippo, Gianpaolo; Bougnères, Pierre

    2017-01-01

    Obesity is a heterogeneous disease with many different subtypes. Epigenetics could contribute to these differences. The aim of this study was to investigate genome-wide DNA methylation, searching for methylation marks associated with obesity in children and adolescents. We studied DNA methylation profiles in whole blood cells from 40 obese children and controls using Illumina Infinium HumanMethylation450 BeadChips. After correction for cell heterogeneity and multiple tests, we found that, compared to lean controls, 31 CpGs are differentially methylated in obese patients. The greatest proportion of these CpGs is hypermethylated in obesity and located in CpG shore regions. We next focused on severely obese children and identified 151 differentially methylated CpGs, among which 10 had a difference in methylation greater than 10%. The top pathways enriched among the identified CpGs included the “IRS1 target genes” and several pathways in cancer diseases. This study represents the first effort to search for differences in methylation in obesity and severe obesity, which may help in understanding these different forms of obesity and their complications. PMID:28387357

  4. Genome-wide analysis of DNA methylation in obese, lean, and miniature pig breeds

    PubMed Central

    Yang, Yalan; Zhou, Rong; Mu, Yulian; Hou, Xinhua; Tang, Zhonglin; Li, Kui

    2016-01-01

    DNA methylation is a crucial epigenetic modification involved in diverse biological processes. There is significant phenotypic variance between Chinese indigenous and western pig breeds. Here, we surveyed the genome-wide DNA methylation profiles of blood leukocytes from three pig breeds (Tongcheng, Landrace, and Wuzhishan) by methylated DNA immunoprecipitation sequencing. The results showed that DNA methylation was enriched in gene body regions and repetitive sequences. LINE/L1 and SINE/tRNA-Glu were the predominant methylated repeats in pigs. The methylation level in the gene body regions was higher than in the 5′ and 3′ flanking regions of genes. About 15% of CpG islands were methylated in the pig genomes. Additionally, 2,807, 2,969, and 5,547 differentially methylated genes (DMGs) were identified in the Tongcheng vs. Landrace, Tongcheng vs. Wuzhishan, and Landrace vs. Wuzhishan comparisons, respectively. A total of 868 DMGs were shared by the three contrasts. The DMGs were significantly enriched in development- and metabolism-related biological processes and pathways. Finally, we identified 32 candidate DMGs associated with phenotype variance in pigs. Our research provides a DNA methylome resource for pigs and furthers understanding of epigenetically regulated phenotype variance in mammals. PMID:27444743

  5. Genome-wide analysis of chromatin packing in Arabidopsis thaliana at single-gene resolution

    PubMed Central

    Liu, Chang; Wang, Congmao; Wang, George; Becker, Claude; Zaidem, Maricris; Weigel, Detlef

    2016-01-01

    The three-dimensional packing of the genome plays an important role in regulating gene expression. We have used Hi-C, a genome-wide chromatin conformation capture (3C) method, to analyze Arabidopsis thaliana chromosomes dissected into subkilobase segments, which is required for gene-level resolution in this species with a gene-dense genome. We found that the repressive H3K27me3 histone mark is overrepresented in the promoter regions of genes that are in conformational linkage over long distances. In line with the globally dispersed distribution of RNA polymerase II in A. thaliana nuclear space, actively transcribed genes do not show a strong tendency to associate with each other. In general, there are often contacts between 5′ and 3′ ends of genes, forming local chromatin loops. Such self-loop structures of genes are more likely to occur in more highly expressed genes, although they can also be found in silent genes. Silent genes with local chromatin loops are highly enriched for the histone variant H3.3 at their 5′ and 3′ ends but depleted of repressive marks such as heterochromatic histone modifications and DNA methylation in flanking regions. Our results suggest that, different from animals, a major theme of genome folding in A. thaliana is the formation of structural units that correspond to gene bodies. PMID:27225844

  6. Genome-wide association analysis of metabolic traits in a birth cohort from a founder population.

    PubMed

    Sabatti, Chiara; Service, Susan K; Hartikainen, Anna-Liisa; Pouta, Anneli; Ripatti, Samuli; Brodsky, Jae; Jones, Chris G; Zaitlen, Noah A; Varilo, Teppo; Kaakinen, Marika; Sovio, Ulla; Ruokonen, Aimo; Laitinen, Jaana; Jakkula, Eveliina; Coin, Lachlan; Hoggart, Clive; Collins, Andrew; Turunen, Hannu; Gabriel, Stacey; Elliot, Paul; McCarthy, Mark I; Daly, Mark J; Järvelin, Marjo-Riitta; Freimer, Nelson B; Peltonen, Leena

    2009-01-01

    Genome-wide association studies (GWAS) of longitudinal birth cohorts enable joint investigation of environmental and genetic influences on complex traits. We report GWAS results for nine quantitative metabolic traits (triglycerides, high-density lipoprotein, low-density lipoprotein, glucose, insulin, C-reactive protein, body mass index, and systolic and diastolic blood pressure) in the Northern Finland Birth Cohort 1966 (NFBC1966), drawn from the most genetically isolated Finnish regions. We replicate most previously reported associations for these traits and identify nine new associations, several of which highlight genes with metabolic functions: high-density lipoprotein with NR1H3 (LXRA), low-density lipoprotein with AR and FADS1-FADS2, glucose with MTNR1B, and insulin with PANK1. Two of these new associations emerged after adjustment of results for body mass index. Gene-environment interaction analyses suggested additional associations, which will require validation in larger samples. The currently identified loci, together with quantified environmental exposures, explain little of the trait variation in NFBC1966. The association observed between low-density lipoprotein and an infrequent variant in AR suggests the potential of such a cohort for identifying associations with both common, low-impact and rarer, high-impact quantitative trait loci.

  7. Genome-wide meta-analysis uncovers novel loci influencing circulating leptin levels.

    PubMed

    Kilpeläinen, Tuomas O; Carli, Jayne F Martin; Skowronski, Alicja A; Sun, Qi; Kriebel, Jennifer; Feitosa, Mary F; Hedman, Åsa K; Drong, Alexander W; Hayes, James E; Zhao, Jinghua; Pers, Tune H; Schick, Ursula; Grarup, Niels; Kutalik, Zoltán; Trompet, Stella; Mangino, Massimo; Kristiansson, Kati; Beekman, Marian; Lyytikäinen, Leo-Pekka; Eriksson, Joel; Henneman, Peter; Lahti, Jari; Tanaka, Toshiko; Luan, Jian'an; Del Greco M, Fabiola; Pasko, Dorota; Renström, Frida; Willems, Sara M; Mahajan, Anubha; Rose, Lynda M; Guo, Xiuqing; Liu, Yongmei; Kleber, Marcus E; Pérusse, Louis; Gaunt, Tom; Ahluwalia, Tarunveer S; Ju Sung, Yun; Ramos, Yolande F; Amin, Najaf; Amuzu, Antoinette; Barroso, Inês; Bellis, Claire; Blangero, John; Buckley, Brendan M; Böhringer, Stefan; I Chen, Yii-Der; de Craen, Anton J N; Crosslin, David R; Dale, Caroline E; Dastani, Zari; Day, Felix R; Deelen, Joris; Delgado, Graciela E; Demirkan, Ayse; Finucane, Francis M; Ford, Ian; Garcia, Melissa E; Gieger, Christian; Gustafsson, Stefan; Hallmans, Göran; Hankinson, Susan E; Havulinna, Aki S; Herder, Christian; Hernandez, Dena; Hicks, Andrew A; Hunter, David J; Illig, Thomas; Ingelsson, Erik; Ioan-Facsinay, Andreea; Jansson, John-Olov; Jenny, Nancy S; Jørgensen, Marit E; Jørgensen, Torben; Karlsson, Magnus; Koenig, Wolfgang; Kraft, Peter; Kwekkeboom, Joanneke; Laatikainen, Tiina; Ladwig, Karl-Heinz; LeDuc, Charles A; Lowe, Gordon; Lu, Yingchang; Marques-Vidal, Pedro; Meisinger, Christa; Menni, Cristina; Morris, Andrew P; Myers, Richard H; Männistö, Satu; Nalls, Mike A; Paternoster, Lavinia; Peters, Annette; Pradhan, Aruna D; Rankinen, Tuomo; Rasmussen-Torvik, Laura J; Rathmann, Wolfgang; Rice, Treva K; Brent Richards, J; Ridker, Paul M; Sattar, Naveed; Savage, David B; Söderberg, Stefan; Timpson, Nicholas J; Vandenput, Liesbeth; van Heemst, Diana; Uh, Hae-Won; Vohl, Marie-Claude; Walker, Mark; Wichmann, Heinz-Erich; Widén, Elisabeth; Wood, Andrew R; Yao, Jie; Zeller, Tanja; Zhang, Yiying; Meulenbelt, 
Ingrid; Kloppenburg, Margreet; Astrup, Arne; Sørensen, Thorkild I A; Sarzynski, Mark A; Rao, D C; Jousilahti, Pekka; Vartiainen, Erkki; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, André G; Kajantie, Eero; Osmond, Clive; Palotie, Aarno; Eriksson, Johan G; Heliövaara, Markku; Knekt, Paul B; Koskinen, Seppo; Jula, Antti; Perola, Markus; Huupponen, Risto K; Viikari, Jorma S; Kähönen, Mika; Lehtimäki, Terho; Raitakari, Olli T; Mellström, Dan; Lorentzon, Mattias; Casas, Juan P; Bandinelli, Stefanie; März, Winfried; Isaacs, Aaron; van Dijk, Ko W; van Duijn, Cornelia M; Harris, Tamara B; Bouchard, Claude; Allison, Matthew A; Chasman, Daniel I; Ohlsson, Claes; Lind, Lars; Scott, Robert A; Langenberg, Claudia; Wareham, Nicholas J; Ferrucci, Luigi; Frayling, Timothy M; Pramstaller, Peter P; Borecki, Ingrid B; Waterworth, Dawn M; Bergmann, Sven; Waeber, Gérard; Vollenweider, Peter; Vestergaard, Henrik; Hansen, Torben; Pedersen, Oluf; Hu, Frank B; Eline Slagboom, P; Grallert, Harald; Spector, Tim D; Jukema, J W; Klein, Robert J; Schadt, Erik E; Franks, Paul W; Lindgren, Cecilia M; Leibel, Rudolph L; Loos, Ruth J F

    2016-02-01

    Leptin is an adipocyte-secreted hormone, the circulating levels of which correlate closely with overall adiposity. Although rare mutations in the leptin (LEP) gene are well known to cause leptin deficiency and severe obesity, no common loci regulating circulating leptin levels have been uncovered. Therefore, we performed a genome-wide association study (GWAS) of circulating leptin levels from 32,161 individuals and followed up loci reaching P<10^-6 in 19,979 additional individuals. We identify five loci robustly associated (P<5 × 10^-8) with leptin levels in/near LEP, SLC32A1, GCKR, CCNL1 and FTO. Although the association of the FTO obesity locus with leptin levels is abolished by adjustment for BMI, associations of the four other loci are independent of adiposity. The GCKR locus was found associated with multiple metabolic traits in previous GWAS and the CCNL1 locus with birth weight. Knockdown experiments in mouse adipose tissue explants show convincing evidence for adipogenin, a regulator of adipocyte differentiation, as the novel causal gene in the SLC32A1 locus influencing leptin levels. Our findings provide novel insights into the regulation of leptin production by adipose tissue and open new avenues for examining the influence of variation in leptin levels on adiposity and metabolic health.

  8. Genome-wide meta-analysis uncovers novel loci influencing circulating leptin levels

    PubMed Central

    Kilpeläinen, Tuomas O.; Carli, Jayne F. Martin; Skowronski, Alicja A.; Sun, Qi; Kriebel, Jennifer; Feitosa, Mary F; Hedman, Åsa K.; Drong, Alexander W.; Hayes, James E.; Zhao, Jinghua; Pers, Tune H.; Schick, Ursula; Grarup, Niels; Kutalik, Zoltán; Trompet, Stella; Mangino, Massimo; Kristiansson, Kati; Beekman, Marian; Lyytikäinen, Leo-Pekka; Eriksson, Joel; Henneman, Peter; Lahti, Jari; Tanaka, Toshiko; Luan, Jian'an; Greco M, Fabiola Del; Pasko, Dorota; Renström, Frida; Willems, Sara M.; Mahajan, Anubha; Rose, Lynda M.; Guo, Xiuqing; Liu, Yongmei; Kleber, Marcus E.; Pérusse, Louis; Gaunt, Tom; Ahluwalia, Tarunveer S.; Ju Sung, Yun; Ramos, Yolande F.; Amin, Najaf; Amuzu, Antoinette; Barroso, Inês; Bellis, Claire; Blangero, John; Buckley, Brendan M.; Böhringer, Stefan; I Chen, Yii-Der; de Craen, Anton J. N.; Crosslin, David R.; Dale, Caroline E.; Dastani, Zari; Day, Felix R.; Deelen, Joris; Delgado, Graciela E.; Demirkan, Ayse; Finucane, Francis M.; Ford, Ian; Garcia, Melissa E.; Gieger, Christian; Gustafsson, Stefan; Hallmans, Göran; Hankinson, Susan E.; Havulinna, Aki S; Herder, Christian; Hernandez, Dena; Hicks, Andrew A.; Hunter, David J.; Illig, Thomas; Ingelsson, Erik; Ioan-Facsinay, Andreea; Jansson, John-Olov; Jenny, Nancy S.; Jørgensen, Marit E.; Jørgensen, Torben; Karlsson, Magnus; Koenig, Wolfgang; Kraft, Peter; Kwekkeboom, Joanneke; Laatikainen, Tiina; Ladwig, Karl-Heinz; LeDuc, Charles A.; Lowe, Gordon; Lu, Yingchang; Marques-Vidal, Pedro; Meisinger, Christa; Menni, Cristina; Morris, Andrew P.; Myers, Richard H.; Männistö, Satu; Nalls, Mike A.; Paternoster, Lavinia; Peters, Annette; Pradhan, Aruna D.; Rankinen, Tuomo; Rasmussen-Torvik, Laura J.; Rathmann, Wolfgang; Rice, Treva K.; Brent Richards, J; Ridker, Paul M.; Sattar, Naveed; Savage, David B.; Söderberg, Stefan; Timpson, Nicholas J.; Vandenput, Liesbeth; van Heemst, Diana; Uh, Hae-Won; Vohl, Marie-Claude; Walker, Mark; Wichmann, Heinz-Erich; Widén, Elisabeth; Wood, Andrew R.; Yao, Jie; Zeller, 
Tanja; Zhang, Yiying; Meulenbelt, Ingrid; Kloppenburg, Margreet; Astrup, Arne; Sørensen, Thorkild I. A.; Sarzynski, Mark A.; Rao, D. C.; Jousilahti, Pekka; Vartiainen, Erkki; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, André G.; Kajantie, Eero; Osmond, Clive; Palotie, Aarno; Eriksson, Johan G.; Heliövaara, Markku; Knekt, Paul B.; Koskinen, Seppo; Jula, Antti; Perola, Markus; Huupponen, Risto K.; Viikari, Jorma S.; Kähönen, Mika; Lehtimäki, Terho; Raitakari, Olli T.; Mellström, Dan; Lorentzon, Mattias; Casas, Juan P.; Bandinelli, Stefanie; März, Winfried; Isaacs, Aaron; van Dijk, Ko W.; van Duijn, Cornelia M.; Harris, Tamara B.; Bouchard, Claude; Allison, Matthew A.; Chasman, Daniel I.; Ohlsson, Claes; Lind, Lars; Scott, Robert A.; Langenberg, Claudia; Wareham, Nicholas J.; Ferrucci, Luigi; Frayling, Timothy M.; Pramstaller, Peter P.; Borecki, Ingrid B.; Waterworth, Dawn M.; Bergmann, Sven; Waeber, Gérard; Vollenweider, Peter; Vestergaard, Henrik; Hansen, Torben; Pedersen, Oluf; Hu, Frank B.; Eline Slagboom, P; Grallert, Harald; Spector, Tim D.; Jukema, J.W.; Klein, Robert J.; Schadt, Erik E; Franks, Paul W.; Lindgren, Cecilia M.; Leibel, Rudolph L.; Loos, Ruth J. F.

    2016-01-01

    Leptin is an adipocyte-secreted hormone, the circulating levels of which correlate closely with overall adiposity. Although rare mutations in the leptin (LEP) gene are well known to cause leptin deficiency and severe obesity, no common loci regulating circulating leptin levels have been uncovered. Therefore, we performed a genome-wide association study (GWAS) of circulating leptin levels from 32,161 individuals and followed up loci reaching P<10^-6 in 19,979 additional individuals. We identify five loci robustly associated (P<5 × 10^-8) with leptin levels in/near LEP, SLC32A1, GCKR, CCNL1 and FTO. Although the association of the FTO obesity locus with leptin levels is abolished by adjustment for BMI, associations of the four other loci are independent of adiposity. The GCKR locus was found associated with multiple metabolic traits in previous GWAS and the CCNL1 locus with birth weight. Knockdown experiments in mouse adipose tissue explants show convincing evidence for adipogenin, a regulator of adipocyte differentiation, as the novel causal gene in the SLC32A1 locus influencing leptin levels. Our findings provide novel insights into the regulation of leptin production by adipose tissue and open new avenues for examining the influence of variation in leptin levels on adiposity and metabolic health. PMID:26833098

  9. GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies.

    PubMed

    Gogarten, Stephanie M; Bhangale, Tushar; Conomos, Matthew P; Laurie, Cecelia A; McHugh, Caitlin P; Painter, Ian; Zheng, Xiuwen; Crosslin, David R; Levine, David; Lumley, Thomas; Nelson, Sarah C; Rice, Kenneth; Shen, Jess; Swarnkar, Rohit; Weir, Bruce S; Laurie, Cathy C

    2012-12-15

    GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within R's memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation.

  10. International genome-wide meta-analysis identifies new primary biliary cirrhosis risk loci and targetable pathogenic pathways.

    PubMed

    Cordell, Heather J; Han, Younghun; Mells, George F; Li, Yafang; Hirschfield, Gideon M; Greene, Casey S; Xie, Gang; Juran, Brian D; Zhu, Dakai; Qian, David C; Floyd, James A B; Morley, Katherine I; Prati, Daniele; Lleo, Ana; Cusi, Daniele; Gershwin, M Eric; Anderson, Carl A; Lazaridis, Konstantinos N; Invernizzi, Pietro; Seldin, Michael F; Sandford, Richard N; Amos, Christopher I; Siminovitch, Katherine A

    2015-09-22

    Primary biliary cirrhosis (PBC) is a classical autoimmune liver disease for which effective immunomodulatory therapy is lacking. Here we perform meta-analyses of discovery data sets from genome-wide association studies of European subjects (n=2,764 cases and 10,475 controls) followed by validation genotyping in an independent cohort (n=3,716 cases and 4,261 controls). We discover and validate six previously unknown risk loci for PBC (P_combined<5 × 10^-8) and use pathway analysis to identify JAK-STAT/IL12/IL27 signalling and cytokine-cytokine pathways, for which relevant therapies exist.

  11. International genome-wide meta-analysis identifies new primary biliary cirrhosis risk loci and targetable pathogenic pathways

    PubMed Central

    Cordell, Heather J.; Han, Younghun; Mells, George F.; Li, Yafang; Hirschfield, Gideon M.; Greene, Casey S.; Xie, Gang; Juran, Brian D.; Zhu, Dakai; Qian, David C.; Floyd, James A. B.; Morley, Katherine I.; Prati, Daniele; Lleo, Ana; Cusi, Daniele; Schlicht, Erik M; Lammert, Craig; Atkinson, Elizabeth J; Chan, Landon L; de Andrade, Mariza; Balschun, Tobias; Mason, Andrew L; Myers, Robert P; Zhang, Jinyi; Milkiewicz, Piotr; Qu, Jia; Odin, Joseph A; Luketic, Velimir A; Bacon, Bruce R; Bodenheimer Jr, Henry C; Liakina, Valentina; Vincent, Catherine; Levy, Cynthia; Gregersen, Peter K; Almasio, Piero L; Alvaro, Domenico; Andreone, Pietro; Andriulli, Angelo; Barlassina, Cristina; Battezzati, Pier Maria; Benedetti, Antonio; Bernuzzi, Francesca; Bianchi, Ilaria; Bragazzi, Maria Consiglia; Brunetto, Maurizia; Bruno, Savino; Casella, Giovanni; Coco, Barbara; Colli, Agostino; Colombo, Massimo; Colombo, Silvia; Cursaro, Carmela; Crocè, Lory Saveria; Crosignani, Andrea; Donato, Maria Francesca; Elia, Gianfranco; Fabris, Luca; Ferrari, Carlo; Floreani, Annarosa; Foglieni, Barbara; Fontana, Rosanna; Galli, Andrea; Lazzari, Roberta; Macaluso, Fabio; Malinverno, Federica; Marra, Fabio; Marzioni, Marco; Mattalia, Alberto; Montanari, Renzo; Morini, Lorenzo; Morisco, Filomena; Hani S, Mousa; Muratori, Luigi; Muratori, Paolo; Niro, Grazia A; Palmieri, Vincenzo O; Picciotto, Antonio; Podda, Mauro; Portincasa, Piero; Ronca, Vincenzo; Rosina, Floriano; Rossi, Sonia; Sogno, Ilaria; Spinzi, Giancarlo; Spreafico, Marta; Strazzabosco, Mario; Tarallo, Sonia; Tarocchi, Mirko; Tiribelli, Claudio; Toniutto, Pierluigi; Vinci, Maria; Zuin, Massimo; Ch'ng, Chin Lye; Rahman, Mesbah; Yapp, Tom; Sturgess, Richard; Healey, Christopher; Czajkowski, Marek; Gunasekera, Anton; Gyawali, Pranab; Premchand, Purushothaman; Kapur, Kapil; Marley, Richard; Foster, Graham; Watson, Alan; Dias, Aruna; Subhani, Javaid; Harvey, Rory; McCorry, Roger; Ramanaden, David; Gasem, Jaber; Evans, Richard; Mathialahan, 
Thiriloganathan; Shorrock, Christopher; Lipscomb, George; Southern, Paul; Tibble, Jeremy; Gorard, David; Palegwala, Altaf; Jones, Susan; Carbone, Marco; Dawwas, Mohamed; Alexander, Graeme; Dolwani, Sunil; Prince, Martin; Foxton, Matthew; Elphick, David; Mitchison, Harriet; Gooding, Ian; Karmo, Mazn; Saksena, Sushma; Mendall, Mike; Patel, Minesh; Ede, Roland; Austin, Andrew; Sayer, Joanna; Hankey, Lorraine; Hovell, Christopher; Fisher, Neil; Carter, Martyn; Koss, Konrad; Piotrowicz, Andrzej; Grimley, Charles; Neal, David; Lim, Guan; Levi, Sass; Ala, Aftab; Broad, Andrea; Saeed, Athar; Wood, Gordon; Brown, Jonathan; Wilkinson, Mark; Gordon, Harriet; Ramage, John; Ridpath, Jo; Ngatchu, Theodore; Grover, Bob; Shaukat, Syed; Shidrawi, Ray; Abouda, George; Ali, Faiz; Rees, Ian; Salam, Imroz; Narain, Mark; Brown, Ashley; Taylor-Robinson, Simon; Williams, Simon; Grellier, Leonie; Banim, Paul; Das, Debashis; Chilton, Andrew; Heneghan, Michael; Curtis, Howard; Gess, Markus; Drake, Ian; Aldersley, Mark; Davies, Mervyn; Jones, Rebecca; McNair, Alastair; Srirajaskanthan, Raj; Pitcher, Maxton; Sen, Sambit; Bird, George; Barnardo, Adrian; Kitchen, Paul; Yoong, Kevin; Chirag, Oza; Sivaramakrishnan, Nurani; MacFaul, George; Jones, David; Shah, Amir; Evans, Chris; Saha, Subrata; Pollock, Katharine; Bramley, Peter; Mukhopadhya, Ashis; Fraser, Andrew; Mills, Peter; Shallcross, Christopher; Campbell, Stewart; Bathgate, Andrew; Shepherd, Alan; Dillon, John; Rushbrook, Simon; Przemioslo, Robert; Macdonald, Christopher; Metcalf, Jane; Shmueli, Udi; Davis, Andrew; Naqvi, Asifabbas; Lee, Tom; Ryder, Stephen D; Collier, Jane; Klass, Howard; Ninkovic, Mary; Cramp, Matthew; Sharer, Nicholas; Aspinall, Richard; Goggin, Patrick; Ghosh, Deb; Douds, Andrew; Hoeroldt, Barbara; Booth, Jonathan; Williams, Earl; Hussaini, Hyder; Stableforth, William; Ayres, Reuben; Thorburn, Douglas; Marshall, Eileen; Burroughs, Andrew; Mann, Steven; Lombard, Martin; Richardson, Paul; Patanwala, Imran; Maltby, Julia; 
Brookes, Matthew; Mathew, Ray; Vyas, Samir; Singhal, Saket; Gleeson, Dermot; Misra, Sharat; Butterworth, Jeff; George, Keith; Harding, Tim; Douglass, Andrew; Panter, Simon; Shearman, Jeremy; Bray, Gary; Butcher, Graham; Forton, Daniel; Mclindon, John; Cowan, Matthew; Whatley, Gregory; Mandal, Aditya; Gupta, Hemant; Sanghi, Pradeep; Jain, Sanjiv; Pereira, Steve; Prasad, Geeta; Watts, Gill; Wright, Mark; Neuberger, James; Gordon, Fiona; Unitt, Esther; Grant, Allister; Delahooke, Toby; Higham, Andrew; Brind, Alison; Cox, Mark; Ramakrishnan, Subramaniam; King, Alistair; Collins, Carole; Whalley, Simon; Li, Andy; Fraser, Jocelyn; Bell, Andrew; Wong, Voi Shim; Singhal, Amit; Gee, Ian; Ang, Yeng; Ransford, Rupert; Gotto, James; Millson, Charles; Bowles, Jane; Thomas, Caradog; Harrison, Melanie; Galaska, Roman; Kendall, Jennie; Whiteman, Jessica; Lawlor, Caroline; Gray, Catherine; Elliott, Keith; Mulvaney-Jones, Caroline; Hobson, Lucie; Van Duyvenvoorde, Greta; Loftus, Alison; Seward, Katie; Penn, Ruth; Maiden, Jane; Damant, Rose; Hails, Janeane; Cloudsdale, Rebecca; Silvestre, Valeria; Glenn, Sue; Dungca, Eleanor; Wheatley, Natalie; Doyle, Helen; Kent, Melanie; Hamilton, Caroline; Braim, Delyth; Wooldridge, Helen; Abrahams, Rachel; Paton, Alison; Lancaster, Nicola; Gibbins, Andrew; Hogben, Karen; Desousa, Phillipa; Muscariu, Florin; Musselwhite, Janine; McKay, Alexandra; Tan, LaiTing; Foale, Carole; Brighton, Jacqueline; Flahive, Kerry; Nambela, Estelle; Townshend, Paula; Ford, Chris; Holder, Sophie; Palmer, Caroline; Featherstone, James; Nasseri, Mariam; Sadeghian, Joy; Williams, Bronwen; Thomas, Carol; Rolls, Sally-Ann; Hynes, Abigail; Duggan, Claire; Jones, Sarah; Crossey, Mary; Stansfield, Glynis; MacNicol, Carolyn; Wilkins, Joy; Wilhelmsen, Elva; Raymode, Parizade; Lee, Hye-Jeong; Durant, Emma; Bishop, Rebecca; Ncube, Noma; Tripoli, Sherill; Casey, Rebecca; Cowley, Caroline; Miller, Richard; Houghton, Kathryn; Ducker, Samantha; Wright, Fiona; Bird, Bridget; Baxter, 
Gwen; Keggans, Janie; Hughes, Maggie; Grieve, Emma; Young, Karin; Williams, D; Ocker, Kate; Hines, Frances; Martin, Kirsty; Innes, Caron; Valliani, Talal; Fairlamb, Helen; Thornthwaite, Sarah; Eastick, Anne; Tanqueray, Elizabeth; Morrison, Jennifer; Holbrook, Becky; Browning, Julie; Walker, Kirsten; Congreave, Susan; Verheyden, Juliette; Slininger, Susan; Stafford, Lizzie; O'Donnell, Denise; Ainsworth, Mark; Lord, Susan; Kent, Linda; March, Linda; Dickson, Christine; Simpson, Diane; Longhurst, Beverley; Hayes, Maria; Shpuza, Ervin; White, Nikki; Besley, Sarah; Pearson, Sallyanne; Wright, Alice; Jones, Linda; Gunter, Emma; Dewhurst, Hannah; Fouracres, Anna; Farrington, Liz; Graves, Lyn; Marriott, Suzie; Leoni, Marina; Tyrer, David; Martin, Kate; Dali-kemmery, Lola; Lambourne, Victoria; Green, Marie; Sirdefield, Dawn; Amor, Kelly; Colley, Julie; Shinder, Bal; Jones, Jayne; Mills, Marisa; Carnahan, Mandy; Taylor, Natalie; Boulton, Kerenza; Tregonning, Julie; Brown, Carly; Clifford, Gayle; Archer, Emily; Hamilton, Maria; Curtis, Janette; Shewan, Tracey; Walsh, Sue; Warner, Karen; Netherton, Kimberley; Mupudzi, Mcdonald; Gunson, Bridget; Gitahi, Jane; Gocher, Denise; Batham, Sally; Pateman, Hilary; Desmennu, Senayon; Conder, Jill; Clement, Darren; Gallagher, Susan; Orpe, Jacky; Chan, PuiChing; Currie, Lynn; O'Donohoe, Lynn; Oblak, Metod; Morgan, Lisa; Quinn, Marie; Amey, Isobel; Baird, Yolanda; Cotterill, Donna; Cumlat, Lourdes; Winter, Louise; Greer, Sandra; Spurdle, Katie; Allison, Joanna; Dyer, Simon; Sweeting, Helen; Kordula, Jean; Gershwin, M. Eric; Anderson, Carl A.; Lazaridis, Konstantinos N.; Invernizzi, Pietro; Seldin, Michael F.; Sandford, Richard N.; Amos, Christopher I.; Siminovitch, Katherine A.

    2015-01-01

    Primary biliary cirrhosis (PBC) is a classical autoimmune liver disease for which effective immunomodulatory therapy is lacking. Here we perform meta-analyses of discovery data sets from genome-wide association studies of European subjects (n=2,764 cases and 10,475 controls) followed by validation genotyping in an independent cohort (n=3,716 cases and 4,261 controls). We discover and validate six previously unknown risk loci for PBC (P_combined<5 × 10^-8) and use pathway analysis to identify JAK-STAT/IL12/IL27 signalling and cytokine–cytokine pathways, for which relevant therapies exist. PMID:26394269

  12. Genome-Wide Analysis of the Musa WRKY Gene Family: Evolution and Differential Expression during Development and Stress

    PubMed Central

    Goel, Ridhi; Pandey, Ashutosh; Trivedi, Prabodh K.; Asif, Mehar H.

    2016-01-01

    The WRKY gene family plays an important role in development and stress responses in plants. As information is not available on the WRKY gene family in Musa species, genome-wide analysis has been carried out in this study using available genomic information from two species, Musa acuminata and Musa balbisiana. Analysis identified 147 and 132 members of the WRKY gene family in M. acuminata and M. balbisiana, respectively. Evolutionary analysis suggests that the WRKY gene family expanded well before the speciation of the two species. Most of the orthologs retained in the two species were from the γ duplication event, which occurred prior to the α and β genome-wide duplication (GWD) events. Analysis also suggests that subtle changes in nucleotide sequences during the course of evolution have led to the development of new motifs, which might be involved in neo-functionalization of different WRKY members in the two species. Expression and cis-regulatory motif analysis suggest possible involvement of Group II and Group III WRKY members during various stresses and growth/development, including the fruit ripening process, respectively. PMID:27014321

  13. Genome-Wide Analysis of MicroRNA Responses to the Phytohormone Abscisic Acid in Populus euphratica

    PubMed Central

    Duan, Hui; Lu, Xin; Lian, Conglong; An, Yi; Xia, Xinli; Yin, Weilun

    2016-01-01

    MicroRNA (miRNA) is a type of non-coding small RNA with a regulatory function at the posttranscriptional level in plant growth and development and in response to abiotic stress. Previous studies have not reported on miRNA responses to the phytohormone abscisic acid (ABA) at a genome-wide level in Populus euphratica, a model tree for studying abiotic stress responses in woody plants. Here we analyzed the miRNA response to ABA at a genome-wide level in P. euphratica utilizing high-throughput sequencing. To systematically perform a genome-wide analysis of ABA-responsive miRNAs in P. euphratica, nine sRNA libraries derived from three groups (control, treated with ABA for 1 day and treated with ABA for 4 days) were constructed. Each group included three libraries from three individual plantlets as biological replicates. In total, 151 unique mature sequences belonging to 75 conserved miRNA families were identified, and 94 unique sequences were determined to be novel miRNAs, including 56 miRNAs with miRNA* sequences. In all, the responses to ABA of 31 conserved miRNAs and 31 novel miRNAs differed significantly among the groups. In addition, 4132 target genes were predicted for the conserved and novel miRNAs. As confirmed by real-time qPCR, expression changes of miRNAs were inversely correlated with the expression profiles of their putative targets. The Populus-specific or novel miRNA-target interactions were predicted to be involved in biological processes related to stress tolerance. Our analysis provides a comprehensive view of how P. euphratica miRNAs respond to ABA; moreover, different temporal dynamics were observed in different ABA-treated libraries. PMID:27582743

  14. Genome-wide association analysis identifies variants associated with nonalcoholic fatty liver disease that have distinct effects on metabolic traits.

    PubMed

    Speliotes, Elizabeth K; Yerges-Armstrong, Laura M; Wu, Jun; Hernaez, Ruben; Kim, Lauren J; Palmer, Cameron D; Gudnason, Vilmundur; Eiriksdottir, Gudny; Garcia, Melissa E; Launer, Lenore J; Nalls, Michael A; Clark, Jeanne M; Mitchell, Braxton D; Shuldiner, Alan R; Butler, Johannah L; Tomas, Marta; Hoffmann, Udo; Hwang, Shih-Jen; Massaro, Joseph M; O'Donnell, Christopher J; Sahani, Dushyant V; Salomaa, Veikko; Schadt, Eric E; Schwartz, Stephen M; Siscovick, David S; Voight, Benjamin F; Carr, J Jeffrey; Feitosa, Mary F; Harris, Tamara B; Fox, Caroline S; Smith, Albert V; Kao, W H Linda; Hirschhorn, Joel N; Borecki, Ingrid B

    2011-03-01

    Nonalcoholic fatty liver disease (NAFLD) clusters in families, but the only known common genetic variants influencing risk are near PNPLA3. We sought to identify additional genetic variants influencing NAFLD using genome-wide association (GWA) analysis of computed tomography (CT) measured hepatic steatosis, a non-invasive measure of NAFLD, in large population based samples. Using variance components methods, we show that CT hepatic steatosis is heritable (∼26%-27%) in family-based Amish, Family Heart, and Framingham Heart Studies (n = 880 to 3,070). By carrying out a fixed-effects meta-analysis of genome-wide association (GWA) results between CT hepatic steatosis and ∼2.4 million imputed or genotyped SNPs in 7,176 individuals from the Old Order Amish, Age, Gene/Environment Susceptibility-Reykjavik study (AGES), Family Heart, and Framingham Heart Studies, we identify variants associated at genome-wide significant levels (p<5×10^-8) in or near PNPLA3, NCAN, and PPP1R3B. We genotype these and 42 other top CT hepatic steatosis-associated SNPs in 592 subjects with biopsy-proven NAFLD from the NASH Clinical Research Network (NASH CRN). In comparisons with 1,405 healthy controls from the Myocardial Genetics Consortium (MIGen), we observe significant associations with histologic NAFLD at variants in or near NCAN, GCKR, LYPLAL1, and PNPLA3, but not PPP1R3B. Variants at these five loci exhibit distinct patterns of association with serum lipids, as well as glycemic and anthropometric traits. We identify common genetic variants influencing CT-assessed steatosis and risk of NAFLD. Hepatic steatosis associated variants are not uniformly associated with NASH/fibrosis or result in abnormalities in serum lipids or glycemic and anthropometric traits, suggesting genetic heterogeneity in the pathways influencing these traits.
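
    The record above names a fixed-effects meta-analysis and a genome-wide significance threshold of 5×10^-8 but does not give the computation. As a rough illustration only, the sketch below shows the standard inverse-variance-weighted form of a fixed-effects estimate for a single SNP. The function name, the constant name, and the per-cohort inputs are hypothetical, and nothing here is taken from the authors' actual pipeline.

```python
import math

# Threshold quoted in the abstract; the constant name is an assumption.
GENOME_WIDE_ALPHA = 5e-8


def fixed_effects_meta(betas, ses):
    """Combine per-cohort effect sizes for one SNP by inverse-variance weighting.

    betas: per-cohort effect estimates (hypothetical inputs)
    ses:   matching standard errors, all > 0
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    if not betas or len(betas) != len(ses):
        raise ValueError("betas and ses must be non-empty and equal in length")
    weights = [1.0 / (se * se) for se in ses]   # w_i = 1 / SE_i^2
    total = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total
    pooled_se = math.sqrt(1.0 / total)
    z = pooled_beta / pooled_se
    # Two-sided p-value under a standard normal approximation.
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled_beta, pooled_se, z, p


# Hypothetical example: two cohorts with equal precision.
beta, se, z, p = fixed_effects_meta([0.1, 0.3], [0.1, 0.1])
is_genome_wide_significant = p < GENOME_WIDE_ALPHA
```

    With equal standard errors the pooled estimate is simply the mean of the cohort estimates; cohorts with smaller standard errors would otherwise dominate the weighted average.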

  15. Genome-Wide Association Analysis Identifies Variants Associated with Nonalcoholic Fatty Liver Disease That Have Distinct Effects on Metabolic Traits

    PubMed Central

    Palmer, Cameron D.; Gudnason, Vilmundur; Eiriksdottir, Gudny; Garcia, Melissa E.; Launer, Lenore J.; Nalls, Michael A.; Clark, Jeanne M.; Mitchell, Braxton D.; Shuldiner, Alan R.; Butler, Johannah L.; Tomas, Marta; Hoffmann, Udo; Hwang, Shih-Jen; Massaro, Joseph M.; O'Donnell, Christopher J.; Sahani, Dushyant V.; Salomaa, Veikko; Schadt, Eric E.; Schwartz, Stephen M.; Siscovick, David S.; Voight, Benjamin F.; Carr, J. Jeffrey; Feitosa, Mary F.; Harris, Tamara B.; Fox, Caroline S.

    2011-01-01

    Nonalcoholic fatty liver disease (NAFLD) clusters in families, but the only known common genetic variants influencing risk are near PNPLA3. We sought to identify additional genetic variants influencing NAFLD using genome-wide association (GWA) analysis of computed tomography (CT) measured hepatic steatosis, a non-invasive measure of NAFLD, in large population-based samples. Using variance components methods, we show that CT hepatic steatosis is heritable (∼26%–27%) in family-based Amish, Family Heart, and Framingham Heart Studies (n = 880 to 3,070). By carrying out a fixed-effects meta-analysis of genome-wide association (GWA) results between CT hepatic steatosis and ∼2.4 million imputed or genotyped SNPs in 7,176 individuals from the Old Order Amish, Age, Gene/Environment Susceptibility-Reykjavik study (AGES), Family Heart, and Framingham Heart Studies, we identify variants associated at genome-wide significant levels (p < 5 × 10^-8) in or near PNPLA3, NCAN, and PPP1R3B. We genotype these and 42 other top CT hepatic steatosis-associated SNPs in 592 subjects with biopsy-proven NAFLD from the NASH Clinical Research Network (NASH CRN). In comparisons with 1,405 healthy controls from the Myocardial Genetics Consortium (MIGen), we observe significant associations with histologic NAFLD at variants in or near NCAN, GCKR, LYPLAL1, and PNPLA3, but not PPP1R3B. Variants at these five loci exhibit distinct patterns of association with serum lipids, as well as glycemic and anthropometric traits. We identify common genetic variants influencing CT-assessed steatosis and risk of NAFLD. Hepatic steatosis-associated variants are not uniformly associated with NASH/fibrosis or result in abnormalities in serum lipids or glycemic and anthropometric traits, suggesting genetic heterogeneity in the pathways influencing these traits. PMID:21423719
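
    The abstract above names a fixed-effects meta-analysis but does not say how the per-cohort results were combined. A common form is inverse-variance weighting, sketched below as an assumption rather than as the authors' pipeline. The function name and the effect sizes in the usage lines are hypothetical.

```python
import math

def fixed_effects_meta(betas, ses):
    """Inverse-variance weighted fixed-effects meta-analysis for one SNP.

    betas: per-cohort effect estimates; ses: their standard errors.
    Returns (pooled_beta, pooled_se, z, two_sided_p).
    """
    weights = [1.0 / se ** 2 for se in ses]       # precision of each cohort
    total_w = sum(weights)
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / total_w
    pooled_se = math.sqrt(1.0 / total_w)
    z = pooled_beta / pooled_se
    p = math.erfc(abs(z) / math.sqrt(2.0))        # two-sided normal p-value
    return pooled_beta, pooled_se, z, p

# Hypothetical per-cohort estimates for a single SNP (not taken from the study).
beta, se, z, p = fixed_effects_meta([0.21, 0.18, 0.25, 0.20],
                                    [0.06, 0.05, 0.09, 0.07])
print(beta, se, z, p)
```

    Under this scheme a SNP would be called genome-wide significant when the pooled p-value falls below 5 × 10^-8, the threshold quoted in the abstract.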

  16. The challenges of genome-wide interaction studies: lessons to learn from the analysis of HDL blood levels.

    PubMed

    van Leeuwen, Elisabeth M; Smouter, Françoise A S; Kam-Thong, Tony; Karbalai, Nazanin; Smith, Albert V; Harris, Tamara B; Launer, Lenore J; Sitlani, Colleen M; Li, Guo; Brody, Jennifer A; Bis, Joshua C; White, Charles C; Jaiswal, Alok; Oostra, Ben A; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, Andre G; Boerwinkle, Eric; Ballantyne, Christie M; Gudnason, Vilmundur; Psaty, Bruce M; Cupples, L Adrienne; Järvelin, Marjo-Riitta; Ripatti, Samuli; Isaacs, Aaron; Müller-Myhsok, Bertram; Karssen, Lennart C; van Duijn, Cornelia M

    2014-01-01

    Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool, which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III), we were able to filter 181 interaction terms with a p-value < 1 × 10^-8 that replicated in the two independent cohorts. We were not able to replicate any of these interaction terms in the AGES, ARIC, CHS, ERF, FHS and NFBC-66 cohorts (Ntotal = 30,011) when adjusting for multiple testing. Our GWIS resulted in the consistent finding of a possible interaction between rs774801 in ARMC8 (ENSG00000114098) and rs12442098 in SPATA8 (ENSG00000185594) being associated with HDL levels. However, the p-values do not reach the preset Bonferroni-corrected significance threshold. Our study suggests that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS.

  17. The Challenges of Genome-Wide Interaction Studies: Lessons to Learn from the Analysis of HDL Blood Levels

    PubMed Central

    van Leeuwen, Elisabeth M.; Smouter, Françoise A. S.; Kam-Thong, Tony; Karbalai, Nazanin; Smith, Albert V.; Harris, Tamara B.; Launer, Lenore J.; Sitlani, Colleen M.; Li, Guo; Brody, Jennifer A.; Bis, Joshua C.; White, Charles C.; Jaiswal, Alok; Oostra, Ben A.; Hofman, Albert; Rivadeneira, Fernando; Uitterlinden, Andre G.; Boerwinkle, Eric; Ballantyne, Christie M.; Gudnason, Vilmundur; Psaty, Bruce M.; Cupples, L. Adrienne; Järvelin, Marjo-Riitta; Ripatti, Samuli; Isaacs, Aaron; Müller-Myhsok, Bertram; Karssen, Lennart C.; van Duijn, Cornelia M.

    2014-01-01

    Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP×SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool, which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III), we were able to filter 181 interaction terms with a p-value < 1 × 10^-8 that replicated in the two independent cohorts. We were not able to replicate any of these interaction terms in the AGES, ARIC, CHS, ERF, FHS and NFBC-66 cohorts (Ntotal = 30,011) when adjusting for multiple testing. Our GWIS resulted in the consistent finding of a possible interaction between rs774801 in ARMC8 (ENSG00000114098) and rs12442098 in SPATA8 (ENSG00000185594) being associated with HDL levels. However, the p-values do not reach the preset Bonferroni-corrected significance threshold. Our study suggests that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP×SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS. PMID:25329471
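
    The abstract above notes that the interaction p-values fail a Bonferroni correction. The following minimal sketch shows why an all-pairs scan makes that correction so strict. The SNP count used in the usage lines is hypothetical (the abstract does not report one), as are the function names.

```python
def pair_count(n_snps):
    """Number of unordered SNP-SNP pairs tested in an all-pairs scan."""
    return n_snps * (n_snps - 1) // 2

def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests

# Hypothetical marker count, chosen only to show the order of magnitude.
n_snps = 500_000
n_pairs = pair_count(n_snps)
print(n_pairs, bonferroni_threshold(0.05, n_pairs))
```

    Because the number of pairs grows quadratically with the number of SNPs, the per-test threshold shrinks far below the usual single-SNP genome-wide level, which is consistent with the abstract's remark that very large samples are needed.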

  18. Genome-wide identification, isolation and expression analysis of auxin response factor (ARF) gene family in sweet orange (Citrus sinensis).

    PubMed

    Li, Si-Bei; OuYang, Wei-Zhi; Hou, Xiao-Jin; Xie, Liang-Liang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2015-01-01

    Auxin response factors (ARFs) are an important family of proteins in auxin-mediated response, with key roles in various physiological and biochemical processes. To date, a genome-wide overview of the ARF gene family in citrus was not available. A systematic analysis of this gene family in citrus was begun by carrying out a genome-wide search for the homologs of ARFs. A total of 19 nonredundant ARF genes (CiARF) were found and validated from the sweet orange. A comprehensive overview of the CiARFs was undertaken, including the gene structures, phylogenetic analysis, chromosome locations, conserved motifs of proteins, and cis-elements in promoters of CiARF. Furthermore, expression profiling using real-time PCR revealed many CiARF genes, albeit with different patterns depending on types of tissues and/or developmental stages. Comprehensive expression analysis of these genes was also performed under two hormone treatments using real-time PCR. Indole-3-acetic acid (IAA) and N-1-naphthylphthalamic acid (NPA) treatment experiments revealed differential up-regulation and down-regulation, respectively, of the 19 citrus ARF genes in the callus of sweet orange. Our comprehensive analysis of ARF genes further elucidates the roles of CiARF family members during citrus growth and development process.

  19. Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4

    PubMed Central

    Sklar, Pamela; Ripke, Stephan; Scott, Laura J.; Andreassen, Ole A.; Cichon, Sven; Craddock, Nick; Edenberg, Howard J.; Nurnberger, John I.; Rietschel, Marcella; Blackwood, Douglas; Corvin, Aiden; Flickinger, Matthew; Guan, Weihua; Mattingsdal, Morten; Mcquillin, Andrew; Kwan, Phoenix; Wienker, Thomas F.; Daly, Mark; Dudbridge, Frank; Holmans, Peter A.; Lin, Danyu; Burmeister, Margit; Greenwood, Tiffany A.; Hamshere, Marian L.; Muglia, Pierandrea; Smith, Erin N.; Zandi, Peter P.; Nievergelt, Caroline M.; Mckinney, Rebecca; Shilling, Paul D.; Schork, Nicholas J.; Bloss, Cinnamon S.; Foroud, Tatiana; Koller, Daniel L.; Gershon, Elliot S.; Liu, Chunyu; Badner, Judith A.; Scheftner, William A.; Lawson, William B.; Nwulia, Evaristus A.; Hipolito, Maria; Coryell, William; Rice, John P.; Byerley, William; McMahon, Francis J.; Schulze, Thomas G.; Berrettini, Wade; Lohoff, Falk W.; Potash, James B.; Mahon, Pamela B.; Mcinnis, Melvin G.; Zöllner, Sebastian; Zhang, Peng; Craig, David W.; Szelinger, Szabocls; Barrett, Thomas B.; Breuer, René; Meier, Sandra; Strohmaier, Jana; Witt, Stephanie H.; Tozzi, Federica; Farmer, Anne; McGuffin, Peter; Strauss, John; Xu, Wei; Kennedy, James L.; Vincent, John B.; Matthews, Keith; Day, Richard; Ferreira, Manuel D.C.; O'Dushlaine, Colm; Perlis, Roy; Raychaudhuri, Soumya; Ruderfer, Douglas; Hyoun, Phil L.; Smoller, Jordan W.; Li, Jun; Absher, Devin; Thompson, Robert C.; Meng, Fan Guo; Schatzberg, Alan F.; Bunney, William E.; Barchas, Jack D.; Jones, Edward G.; Watson, Stanley J.; Myers, Richard M.; Akil, Huda; Boehnke, Michael; Chambert, Kim; Moran, Jennifer; Scolnick, Ed; Djurovic, Srdjan; Melle, Ingrid; Morken, Gunnar; Gill, Michael; Morris, Derek; Quinn, Emma; Mühleisen, Thomas W.; Degenhardt, Franziska A.; Mattheisen, Manuel; Schumacher, Johannes; Maier, Wolfgang; Steffens, Michael; Propping, Peter; Nöthen, Markus M.; Anjorin, Adebayo; Bass, Nick; Gurling, Hugh; Kandaswamy, Radhika; Lawrence, Jacob; Mcghee, Kevin; Mcintosh, Andrew; 
    Mclean, Alan W.; Muir, Walter J.; Pickard, Benjamin S.; Breen, Gerome; St Clair, David; Caesar, Sian; Gordon-Smith, Katherine; Jones, Lisa; Fraser, Christine; Green, Elaine K.; Grozeva, Detelina; Jones, Ian R.; Kirov, George; Moskvina, Valentina; Nikolov, Ivan; O'Donovan, Michael C.; Owen, Michael J.; Collier, David A.; Elkin, Amanda; Williamson, Richard; Young, Allan H.; Ferrier, I Nicol; Stefansson, Kari; Stefansson, Hreinn; Thorgeirsson, Thorgeir; Steinberg, Stacy; Gustafsson, Omar; Bergen, Sarah E.; Nimgaonkar, Vishwajit; Hultman, Christina; Landén, Mikael; Lichtenstein, Paul; Sullivan, Patrick; Schalling, Martin; Osby, Urban; Backlund, Lena; Frisén, Louise; Langstrom, Niklas; Jamain, Stéphane; Leboyer, Marion; Etain, Bruno; Bellivier, Frank; Petursson, Hannes; Sigurdsson, Engilbert; Müller-Myhsok, Bertram; Lucae, Susanne; Schwarz, Markus; Schofield, Peter R.; Martin, Nick; Montgomery, Grant W.; Lathrop, Mark; Oskarsson, Högni; Bauer, Michael; Wright, Adam; Mitchell, Philip B.; Hautzinger, Martin; Reif, Andreas; Kelsoe, John R.; Purcell, Shaun M.

    2011-01-01

    We conducted a combined genome-wide association (GWAS) analysis of 7,481 individuals affected with bipolar disorder and 9,250 control individuals within the Psychiatric Genomewide Association Study Consortium Bipolar Disorder group (PGC-BD). We performed a replication study in which we tested 34 independent SNPs in 4,493 independent bipolar disorder cases and 42,542 independent controls and found strong evidence for replication. In the replication sample, 18 of 34 SNPs had P values < 0.05, and 31 of 34 SNPs had signals with the same direction of effect (P = 3.8 × 10^-7). In the combined analysis of all 63,766 subjects (11,974 cases and 51,792 controls), genome-wide significant evidence for association was confirmed for CACNA1C and found for a novel gene ODZ4. In a combined analysis of non-overlapping schizophrenia and bipolar GWAS samples we observed strong evidence for association with SNPs in CACNA1C and in the region of NEK4/ITIH1,3,4. Pathway analysis identified a pathway comprised of subunits of calcium channels enriched in the bipolar disorder association intervals. The strength of the replication data implies that increasing sample sizes in bipolar disorder will confirm many additional loci. PMID:21926972
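
    The abstract above reports that 31 of 34 replication SNPs shared the discovery direction of effect, with a P value of 3.8 × 10^-7, but does not name the test. A one-sided binomial sign test against a 50% chance of agreement is assumed in the sketch below; it yields a value of the same magnitude. The function name is hypothetical.

```python
from math import comb

def sign_test_one_sided(k, n):
    """P(at least k of n agree in direction) when each agrees with probability 0.5."""
    return sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n

p = sign_test_one_sided(31, 34)
print(f"{p:.1e}")  # prints 3.8e-07
```

    The tail contains only four terms (31, 32, 33 and 34 agreements), so the result can also be checked by hand: (5984 + 561 + 34 + 1) / 2^34.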

  20. Genome-wide loss-of-function analysis of deubiquitylating enzymes for zebrafish development

    PubMed Central

    2009-01-01

    Background Deconjugation of ubiquitin and/or ubiquitin-like modified protein substrates is essential to modulate protein-protein interactions and, thus, signaling processes in cells. Although deubiquitylating (deubiquitinating) enzymes (DUBs) play a key role in this process, their function and regulation remain insufficiently understood. "Loss-of-function" phenotype studies can provide important information to elucidate gene function, and zebrafish is an excellent model for this goal. Results From an in silico genome-wide search, we found more than 90 putative DUBs encoded in the zebrafish genome belonging to six different subclasses. Out of them, 85 from five classical subclasses have been tested with morpholino (MO) knockdown experiments and 57 of them were found to be important in early development of zebrafish. These DUB morphants resulted in a complex and pleiotropic phenotype that, regardless of gene target, always affected the notochord. Based on the huC neuronal marker expression, we grouped them into five sets (groups I to V). Group I DUBs (otud7b, uchl3 and bap1) appear to be involved in the Notch signaling pathway based on the neuronal hyperplasia, while group IV DUBs (otud4, usp5, usp15 and usp25) play a critical role in dorsoventral patterning through the BMP pathway. Conclusion We have identified an exhaustive list of genes in the zebrafish genome belonging to the five established classes of DUBs. Additionally, we performed the corresponding MO knockdown experiments in zebrafish as well as functional studies for a subset of the predicted DUB genes. The screen results in this work will stimulate functional follow-up studies of potential DUB genes using the zebrafish model system. PMID:20040115

  1. Genome-wide analysis links emerin to neuromuscular junction activity in Caenorhabditis elegans

    PubMed Central

    2014-01-01

    Background Laminopathies are diseases characterized by defects in nuclear envelope structure. A well-known example is Emery-Dreifuss muscular dystrophy, which is caused by mutations in the human lamin A/C and emerin genes. While most nuclear envelope proteins are ubiquitously expressed, laminopathies often affect only a subset of tissues. The molecular mechanisms underlying these tissue-specific manifestations remain elusive. We hypothesize that different functional subclasses of genes might be differentially affected by defects in specific nuclear envelope components. Results Here we determine genome-wide DNA association profiles of two nuclear envelope components, lamin/LMN-1 and emerin/EMR-1 in adult Caenorhabditis elegans. Although both proteins bind to transcriptionally inactive regions of the genome, EMR-1 is enriched at genes involved in muscle and neuronal function. Deletion of either EMR-1 or LEM-2, another integral envelope protein, causes local changes in nuclear architecture as evidenced by altered association between DNA and LMN-1. Transcriptome analyses reveal that EMR-1 and LEM-2 are associated with gene repression, particularly of genes implicated in muscle and nervous system function. We demonstrate that emr-1, but not lem-2, mutants are sensitive to the cholinesterase inhibitor aldicarb, indicating altered activity at neuromuscular junctions. Conclusions We identify a class of elements that bind EMR-1 but do not associate with LMN-1, and these are enriched for muscle and neuronal genes. Our data support a redundant function of EMR-1 and LEM-2 in chromatin anchoring to the nuclear envelope and gene repression. We demonstrate a specific role of EMR-1 in neuromuscular junction activity that may contribute to Emery-Dreifuss muscular dystrophy in humans. PMID:24490688

  2. Genome-wide identification and analysis of the SGR gene family in Cucumis melo L.

    PubMed

    Bade, R G; Bao, M L; Jin, W Y; Ma, Y; Niu, Y D; Hasi, A

    2016-10-17

    Chlorophyll (CHL) is present in many plant organs, and its metabolism is strongly regulated throughout plant development. Understanding the fate of CHL in senescent leaves or during fruit ripening is a complex process. The stay-green (SGR) protein has been shown to affect CHL degradation. In this study, we used the conserved sequences of STAY-GREEN domain protein (NP_567673) in Arabidopsis thaliana as a probe to search SGR family genes in the genome-wide melon protein database. Four candidate SGR family genes were identified in melon (Cucumis melo L. Hetao). The phylogenetic evolution, gene structure, and conserved motifs were subsequently analyzed. In order to verify the function of CmSGR genes in CHL degradation, CmSGR1 and CmSGR2 were transiently overexpressed and silenced using different plasmids in melon. Overexpression of CmSGR1 or CmSGR2 induced leaf yellowing or fruit ripening, while silencing of CmSGR1 or CmSGR2 via RNA interference delayed CHL breakdown during fruit ripening or leaf senescence compared with the wild type. Next, the expression profile was analyzed, and we found that CmSGR genes were expressed ubiquitously. Moreover, CmSGR1 and CmSGR2 were upregulated, and promoted fruit ripening. CmSGR3 and CmSGR4 were more highly expressed in leaves, cotyledon, and stem compared with CmSGR1 or CmSGR2. Thus, we conclude that CmSGR genes are crucial for fruit ripening and leaf senescence. CmSGR protein structure and function were further clarified to provide a theoretical foundation and valuable information for improved performance of melon.

  3. Genome-wide analysis of the response to nitric oxide in uropathogenic Escherichia coli CFT073

    PubMed Central

    Mehta, Heer H.; Liu, Yuxuan

    2015-01-01

    Uropathogenic Escherichia coli (UPEC) is the causative agent of urinary tract infections. Nitric oxide (NO) is a toxic water-soluble gas that is encountered by UPEC in the urinary tract. Therefore, UPEC probably requires mechanisms to detoxify NO in the host environment. Thus far, flavohaemoglobin (Hmp), an NO denitrosylase, is the only demonstrated NO detoxification system in UPEC. Here we show that, in E. coli strain CFT073, the NADH-dependent NO reductase flavorubredoxin (FlRd) also plays a major role in NO scavenging. We generated a mutant that lacks all known and candidate NO detoxification pathways (Hmp, FlRd and the respiratory nitrite reductase, NrfA). When grown and assayed anaerobically, this mutant expresses an NO-inducible NO scavenging activity, pointing to the existence of a novel detoxification mechanism. Expression of this activity is inducible by both NO and nitrate, and the enzyme is membrane-associated. Genome-wide transcriptional profiling of UPEC grown under anaerobic conditions in the presence of nitrate (as a source of NO) highlighted various aspects of the response of the pathogen to nitrate and NO. Several virulence-associated genes are upregulated, suggesting that host-derived NO is a potential regulator of UPEC virulence. Chromatin immunoprecipitation and sequencing was used to evaluate the NsrR regulon in CFT073. We identified 49 NsrR binding sites in promoter regions in the CFT073 genome, 29 of which were not previously identified in E. coli K-12. NsrR may regulate some CFT073 genes that do not have homologues in E. coli K-12. PMID:28348816

  4. Genome-wide Association Analysis Identifies PDE4D as an Asthma-Susceptibility Gene

    PubMed Central

    Himes, Blanca E.; Hunninghake, Gary M.; Baurley, James W.; Rafaels, Nicholas M.; Sleiman, Patrick; Strachan, David P.; Wilk, Jemma B.; Willis-Owen, Saffron A.G.; Klanderman, Barbara; Lasky-Su, Jessica; Lazarus, Ross; Murphy, Amy J.; Soto-Quiros, Manuel E.; Avila, Lydiana; Beaty, Terri; Mathias, Rasika A.; Ruczinski, Ingo; Barnes, Kathleen C.; Celedón, Juan C.; Cookson, William O.C.; Gauderman, W. James; Gilliland, Frank D.; Hakonarson, Hakon; Lange, Christoph; Moffatt, Miriam F.; O'Connor, George T.; Raby, Benjamin A.; Silverman, Edwin K.; Weiss, Scott T.

    2009-01-01

    Asthma, a chronic airway disease with known heritability, affects more than 300 million people around the world. A genome-wide association (GWA) study of asthma with 359 cases from the Childhood Asthma Management Program (CAMP) and 846 genetically matched controls from the Illumina ICONdb public resource was performed. The strongest region of association seen was on chromosome 5q12 in PDE4D. The phosphodiesterase 4D, cAMP-specific (phosphodiesterase E3 dunce homolog, Drosophila) gene (PDE4D) is a regulator of airway smooth-muscle contractility, and PDE4 inhibitors have been developed as medications for asthma. Allelic p values for top SNPs in this region were 4.3 × 10^-07 for rs1588265 and 9.7 × 10^-07 for rs1544791. Replications were investigated in ten independent populations with different ethnicities, study designs, and definitions of asthma. In seven white and Hispanic replication populations, two PDE4D SNPs had significant results with p values less than 0.05, and five had results in the same direction as the original population but had p values greater than 0.05. Combined p values for 18,891 white and Hispanic individuals (4,342 cases) in our replication populations were 4.1 × 10^-04 for rs1588265 and 9.2 × 10^-04 for rs1544791. In three black replication populations, which had different linkage disequilibrium patterns than the other populations, original findings were not replicated. Further study of PDE4D variants might lead to improved understanding of the role of PDE4D in asthma pathophysiology and the efficacy of PDE4 inhibitor medications. PMID:19426955

  5. Genome-wide analysis of chimpanzee genes with premature termination codons

    PubMed Central

    Wetterbom, Anna; Gyllensten, Ulf; Cavelier, Lucia; Bergström, Tomas F

    2009-01-01

    Background Premature termination codons (PTCs) cause mRNA degradation or a truncated protein and thereby contribute to the transcriptome and proteome divergence between species. Here we present the first genome-wide study of PTCs in the chimpanzee. By comparing the human and chimpanzee genome sequences we identify and characterize genes with PTCs, in order to understand the contribution of these mutations to the transcriptome diversity between the species. Results We have studied a total of 13,487 human-chimpanzee gene pairs and found that ~8% were affected by PTCs in the chimpanzee. A majority (764/1,109) of PTCs were caused by insertions or deletions and the remaining part was caused by substitutions. The distribution of PTC genes varied between chromosomes, with Y having the highest proportion. Furthermore, the density of PTC genes varied on a megabasepair scale within chromosomes and we found the density to be correlated both with indel divergence and proximity to the telomere. Within genes, PTCs were more common close to the 5' and 3' ends of the amino acid sequence. Gene Ontology classification revealed that olfactory receptor genes were over represented among the PTC genes. Conclusion Our results showed that the density of PTC genes fluctuated across the genome depending on the local genomic context. PTCs were preferentially located in the terminal parts of the transcript, which generally have a lower frequency of functional domains, indicating that selection was operating against PTCs at sites central to protein function. The enrichment of GO terms associated with olfaction suggests that PTCs may have influenced the difference in the repertoire of olfactory genes between humans and chimpanzees. In summary, 8% of the chimpanzee genes were affected by PTCs and this type of variation is likely to have an important effect on the transcript and proteomic divergence between humans and chimpanzees. PMID:19178713

  6. Genome-wide analysis of tandem repeats in plants and green algae.

    PubMed

    Zhao, Zhixin; Guo, Cheng; Sutharzan, Sreeskandarajan; Li, Pei; Echt, Craig S; Zhang, Jie; Liang, Chun

    2014-01-10

    Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among the 31 species, no significant correlation was detected between the TR density and genome size. Interestingly, green alga Chlamydomonas reinhardtii (42,059 bp/Mbp) and castor bean Ricinus communis (55,454 bp/Mbp) showed much higher TR densities than all other species (13,209 bp/Mbp on average). In the 29 land plants, including 22 dicots, 5 monocots, and 2 bryophytes, 5'-UTR and upstream intergenic 200-nt (UI200) regions had the first and second highest TR densities, whereas in the two green algae (C. reinhardtii and Volvox carteri) the first and second highest densities were found in intron and coding sequence (CDS) regions, respectively. In CDS regions, trinucleotide and hexanucleotide motifs were those most frequently represented in all species. In intron regions, especially in the two green algae, significantly more TRs were detected near the intron-exon junctions. Within intergenic regions in dicots and monocots, more TRs were found near both the 5' and 3' ends of genes. GO annotation in two green algae revealed that the genes with TRs in introns are significantly involved in transcriptional and translational processing. As the first systematic examination of TRs in plant and green algal genomes, our study showed that TRs displayed nonrandom distribution for both intragenic and intergenic regions, suggesting that they have potential roles in transcriptional or translational regulation in plants and green algae.

  7. Genome wide analysis of Silurana (Xenopus) tropicalis development reveals dynamic expression using network enrichment analysis.

    PubMed

    Langlois, Valérie S; Martyniuk, Christopher J

    2013-01-01

    Development involves precise timing of gene expression and coordinated pathways for organogenesis and morphogenesis. Functional and sub-network enrichment analysis provides an integrated approach for identifying networks underlying development. The objectives of this study were to characterize early gene regulatory networks over Silurana tropicalis development from NF stage 2 to 46 using a custom Agilent 4×44K microarray. There were >8000 unique gene probes that were differentially expressed between Nieuwkoop-Faber (NF) stage 2 and stage 16, and >2000 gene probes differentially expressed between NF 34 and 46. Gene ontology revealed that genes involved in nucleosome assembly, cell division, pattern specification, neurotransmission, and general metabolism were increasingly regulated throughout development, consistent with active development. Sub-network enrichment analysis revealed that processes such as membrane hyperpolarisation, retinoic acid, cholesterol, and dopamine metabolic gene networks were activated/inhibited over time. This study identifies RNA transcripts that are potentially maternally inherited in an anuran species, provides evidence that the expression of genes involved in retinoic acid receptor signaling may increase prior to those involved in thyroid receptor signaling, and characterizes novel gene expression networks preceding organogenesis which increases understanding of the spatiotemporal embryonic development in frogs.

  8. Meta-analysis of heterogeneous Down Syndrome data reveals consistent genome-wide dosage effects related to neurological processes

    PubMed Central

    2011-01-01

    Background Down syndrome (DS; trisomy 21) is the most common genetic cause of mental retardation in the human population and key molecular networks dysregulated in DS are still unknown. Many different experimental techniques have been applied to analyse the effects of dosage imbalance at the molecular and phenotypical level; however, currently no integrative approach exists that attempts to extract the common information. Results We have performed a statistical meta-analysis from 45 heterogeneous publicly available DS data sets in order to identify consistent dosage effects from these studies. We identified 324 genes with significant genome-wide dosage effects, including well investigated genes like SOD1, APP, RUNX1 and DYRK1A as well as a large proportion of novel genes (N = 62). Furthermore, we characterized these genes using gene ontology, molecular interactions and promoter sequence analysis. In order to judge relevance of the 324 genes for more general cerebral pathologies we used independent publicly available microarray data from brain studies not related to DS and identified a subset of 79 genes with potential impact for neurocognitive processes. All results have been made available through a web server under http://ds-geneminer.molgen.mpg.de/. Conclusions Our study represents a comprehensive integrative analysis of heterogeneous data including genome-wide transcript levels in the domain of trisomy 21. The detected dosage effects build a resource for further studies of DS pathology and the development of new therapies. PMID:21569303

  9. Twenty bone-mineral-density loci identified by large-scale meta-analysis of genome-wide association studies.

    PubMed

    Rivadeneira, Fernando; Styrkársdottir, Unnur; Estrada, Karol; Halldórsson, Bjarni V; Hsu, Yi-Hsiang; Richards, J Brent; Zillikens, M Carola; Kavvoura, Fotini K; Amin, Najaf; Aulchenko, Yurii S; Cupples, L Adrienne; Deloukas, Panagiotis; Demissie, Serkalem; Grundberg, Elin; Hofman, Albert; Kong, Augustine; Karasik, David; van Meurs, Joyce B; Oostra, Ben; Pastinen, Tomi; Pols, Huibert A P; Sigurdsson, Gunnar; Soranzo, Nicole; Thorleifsson, Gudmar; Thorsteinsdottir, Unnur; Williams, Frances M K; Wilson, Scott G; Zhou, Yanhua; Ralston, Stuart H; van Duijn, Cornelia M; Spector, Timothy; Kiel, Douglas P; Stefansson, Kari; Ioannidis, John P A; Uitterlinden, André G

    2009-11-01

    Bone mineral density (BMD) is a heritable complex trait used in the clinical diagnosis of osteoporosis and the assessment of fracture risk. We performed meta-analysis of five genome-wide association studies of femoral neck and lumbar spine BMD in 19,195 subjects of Northern European descent. We identified 20 BMD loci that reached genome-wide significance (GWS; P < 5 x 10(-8)), of which 13 map to regions not previously associated with this trait: 1p31.3 (GPR177), 2p21 (SPTBN1), 3p22 (CTNNB1), 4q21.1 (MEPE), 5q14 (MEF2C), 7p14 (STARD3NL), 7q21.3 (FLJ42280), 11p11.2 (LRP4, ARHGAP1, F2), 11p14.1 (DCDC5), 11p15 (SOX6), 16q24 (FOXL1), 17q21 (HDAC5) and 17q12 (CRHR1). The meta-analysis also confirmed at GWS level seven known BMD loci on 1p36 (ZBTB40), 6q25 (ESR1), 8q24 (TNFRSF11B), 11q13.4 (LRP5), 12q13 (SP7), 13q14 (TNFSF11) and 18q21 (TNFRSF11A). The many SNPs associated with BMD map to genes in signaling pathways with relevance to bone metabolism and highlight the complex genetic architecture that underlies osteoporosis and variation in BMD.

  10. Genome-wide analysis of the DNA-binding with one zinc finger (Dof) transcription factor family in bananas.

    PubMed

    Dong, Chen; Hu, Huigang; Xie, Jianghui

    2016-12-01

    DNA-binding with one finger (Dof) domain proteins are a multigene family of plant-specific transcription factors involved in numerous aspects of plant growth and development. In this study, we report a genome-wide search for Musa acuminata Dof (MaDof) genes and their expression profiles at different developmental stages and in response to various abiotic stresses. In addition, a complete overview of the Dof gene family in bananas is presented, including the gene structures, chromosomal locations, cis-regulatory elements, conserved protein domains, and phylogenetic inferences. Based on the genome-wide analysis, we identified 74 full-length protein-coding MaDof genes unevenly distributed on 11 chromosomes. Phylogenetic analysis with Dof members from diverse plant species showed that MaDof genes can be classified into four subgroups (StDof I, II, III, and IV). The detailed genomic information of the MaDof gene homologs in the present study provides opportunities for functional analyses to unravel the exact role of the genes in plant growth and development.

  11. Large-scale meta-analysis of genome-wide association data identifies six new risk loci for Parkinson's disease.

    PubMed

    Nalls, Mike A; Pankratz, Nathan; Lill, Christina M; Do, Chuong B; Hernandez, Dena G; Saad, Mohamad; DeStefano, Anita L; Kara, Eleanna; Bras, Jose; Sharma, Manu; Schulte, Claudia; Keller, Margaux F; Arepalli, Sampath; Letson, Christopher; Edsall, Connor; Stefansson, Hreinn; Liu, Xinmin; Pliner, Hannah; Lee, Joseph H; Cheng, Rong; Ikram, M Arfan; Ioannidis, John P A; Hadjigeorgiou, Georgios M; Bis, Joshua C; Martinez, Maria; Perlmutter, Joel S; Goate, Alison; Marder, Karen; Fiske, Brian; Sutherland, Margaret; Xiromerisiou, Georgia; Myers, Richard H; Clark, Lorraine N; Stefansson, Kari; Hardy, John A; Heutink, Peter; Chen, Honglei; Wood, Nicholas W; Houlden, Henry; Payami, Haydeh; Brice, Alexis; Scott, William K; Gasser, Thomas; Bertram, Lars; Eriksson, Nicholas; Foroud, Tatiana; Singleton, Andrew B

    2014-09-01

    We conducted a meta-analysis of Parkinson's disease genome-wide association studies using a common set of 7,893,274 variants across 13,708 cases and 95,282 controls. Twenty-six loci were identified as having genome-wide significant association; these and 6 additional previously reported loci were then tested in an independent set of 5,353 cases and 5,551 controls. Of the 32 tested SNPs, 24 replicated, including 6 newly identified loci. Conditional analyses within loci showed that four loci, including GBA, GAK-DGKQ, SNCA and the HLA region, contain a secondary independent risk variant. In total, we identified and replicated 28 independent risk variants for Parkinson's disease across 24 loci. Although the effect of each individual locus was small, risk profile analysis showed substantial cumulative risk in a comparison of the highest and lowest quintiles of genetic risk (odds ratio (OR) = 3.31, 95% confidence interval (CI) = 2.55-4.30; P = 2 × 10(-16)). We also show six risk loci associated with proximal gene expression or DNA methylation.
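
    The odds ratio and confidence interval quoted above can be related to each other with a short calculation. The sketch below assumes the interval is a Wald interval that is symmetric on the log-odds scale with z = 1.96; the abstract does not say how the interval was computed, so this is an illustration only, and the function names are hypothetical.

```python
import math

def log_or_standard_error(ci_low, ci_high, z=1.96):
    """Back-calculate the standard error of log(OR) from a reported CI.

    Assumes (not stated in the abstract) a Wald interval on the log scale.
    """
    return (math.log(ci_high) - math.log(ci_low)) / (2 * z)

def wald_ci(odds_ratio, se, z=1.96):
    """Rebuild the CI from the point estimate and the standard error of log(OR)."""
    centre = math.log(odds_ratio)
    return math.exp(centre - z * se), math.exp(centre + z * se)

# Values reported in the abstract: OR = 3.31, 95% CI = 2.55-4.30
se = log_or_standard_error(2.55, 4.30)
low, high = wald_ci(3.31, se)
print(round(se, 4), round(low, 2), round(high, 2))
```

    Under that assumption the rebuilt interval rounds back to the reported bounds, which suggests the reported estimate and interval are consistent with a log-scale Wald interval.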

  12. Genome-wide association analysis and pathways enrichment for lactation persistency in Canadian Holstein cattle.

    PubMed

    Do, D N; Bissonnette, N; Lacasse, P; Miglior, F; Sargolzaei, M; Zhao, X; Ibeagha-Awemu, E M

    2017-03-01

    Lactation persistency (LP), defined as the rate of declining milk yield after milk peak, is an economically important trait for dairy cattle. Improving LP is considered a good alternative method for increasing overall milk production because it does not cause the negative energy balance and other health issues that cows experience during peak milk production. However, little is known about the biology of LP. A genome-wide association study (GWAS) and pathway enrichment were used to explore the genetic mechanisms underlying LP. The GWAS was performed using a univariate regression mixed linear model on LP data of 3,796 cows and 44,100 single nucleotide polymorphisms (SNP). Eight and 47 SNP were significantly and suggestively associated with LP, respectively. The 2 most important quantitative trait loci regions for LP were (1) a region from 106 to 108 Mb on Bos taurus autosome (BTA) 5, where the most significant SNP (ARS-BFGL-NGS-2399) was located and also formed a linkage disequilibrium block with 3 other SNP; and (2) a region from 29.3 to 31.3 Mb on BTA 20, which contained 3 significant SNP. Based on physical positions, MAN1C1, MAP3K5, HCN1, TSPAN9, MRPS30, TEX14, and CCL28 are potential candidate genes for LP because the significant SNP were located in their intronic regions. Enrichment analyses of a list of 536 genes in 0.5-Mb flanking regions of significant and suggestive SNP indicates that synthesis of milk components, regulation of cell apoptosis processes and insulin, and prolactin signaling pathways are important for LP. Upstream regulators relevant for LP positional candidate genes were prolactin (PRL), peroxisome proliferator-activated receptor gamma (PPARG), and Erb-B2 receptor tyrosine kinase 2 (ERBB2). Several networks related to cellular development, proliferation and death were significantly enriched for LP positional candidate genes. In conclusion, this study detected several SNP, genes, and interesting regions for fine mapping and validation of

  13. Genome-wide DNA methylation analysis identifies MEGF10 as a novel epigenetically repressed candidate tumor suppressor gene in neuroblastoma.

    PubMed

    Charlet, Jessica; Tomari, Ayumi; Dallosso, Anthony R; Szemes, Marianna; Kaselova, Martina; Curry, Thomas J; Almutairi, Bader; Etchevers, Heather C; McConville, Carmel; Malik, Karim T A; Brown, Keith W

    2017-04-01

    Neuroblastoma is a childhood cancer in which many children still have poor outcomes, emphasising the need to better understand its pathogenesis. Despite recent genome-wide mutation analyses, many primary neuroblastomas do not contain recognizable driver mutations, implicating alternate molecular pathologies such as epigenetic alterations. To discover genes that become epigenetically deregulated during neuroblastoma tumorigenesis, we took the novel approach of comparing neuroblastomas to neural crest precursor cells, using genome-wide DNA methylation analysis. We identified 93 genes that were significantly differentially methylated of which 26 (28%) were hypermethylated and 67 (72%) were hypomethylated. Concentrating on hypermethylated genes to identify candidate tumor suppressor loci, we found the cell engulfment and adhesion factor gene MEGF10 to be epigenetically repressed by DNA hypermethylation or by H3K27/K9 methylation in neuroblastoma cell lines. MEGF10 showed significantly down-regulated expression in neuroblastoma tumor samples; furthermore patients with the lowest-expressing tumors had reduced relapse-free survival. Our functional studies showed that knock-down of MEGF10 expression in neuroblastoma cell lines promoted cell growth, consistent with MEGF10 acting as a clinically relevant, epigenetically deregulated neuroblastoma tumor suppressor gene. © 2016 The Authors. Molecular Carcinogenesis Published by Wiley Periodicals, Inc.

  14. Mining Gold Dust under the Genome Wide Significance Level: A Two-Stage Approach to Analysis of GWAS

    PubMed Central

    Shi, Gang; Boerwinkle, Eric; Morrison, Alanna C.; Gu, C. Charles; Chakravarti, Aravinda; Rao, DC

    2013-01-01

    We propose a two-stage approach to analyze genome-wide association (GWA) data in order to identify a set of promising single-nucleotide polymorphisms (SNPs). In stage one, we select a list of top signals from single SNP analyses by controlling false discovery rate (FDR). In stage two, we use the least absolute shrinkage and selection operator (LASSO) regression to reduce false positives. The proposed approach was evaluated using simulated quantitative traits based on genome-wide SNP data on 8,861 Caucasian individuals from the Atherosclerosis Risk in Communities (ARIC) Study. Our first stage, targeted at controlling false negatives, yields better power than using Bonferroni corrected significance level. The LASSO regression reduces the number of significant SNPs in stage two: it reduces false positive SNPs and it reduces true positive SNPs also at simulated causal loci due to linkage disequilibrium. Interestingly, the LASSO regression preserves the power from stage one, i.e., the number of causal loci detected from the LASSO regression in stage two is almost the same as in stage one, while reducing false positives further. Real data on systolic blood pressure in the ARIC study was analyzed using our two-stage approach which identified two significant SNPs, one of which was reported to be genome-significant in a meta-analysis containing a much larger sample size. On the other hand, a single SNP association scan did not yield any significant results. PMID:21254218
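
    The two stages described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes the Benjamini-Hochberg step-up procedure for FDR control in stage one and a plain cyclic coordinate-descent LASSO with a fixed penalty in stage two. The abstract specifies neither, and all function names and parameter values (q, lam) are hypothetical.

```python
def benjamini_hochberg(pvalues, q=0.05):
    """Stage one (assumed BH step-up): indices of SNPs kept at FDR level q."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff = 0
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= q * rank / m:
            cutoff = rank  # keep the largest rank that passes
    return sorted(order[:cutoff])

def soft_threshold(z, lam):
    """Shrink z towards zero by lam; values within [-lam, lam] become 0."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def lasso(X, y, lam, n_iter=200):
    """Minimise (1/2n)*||y - Xb||^2 + lam*||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    residual = list(y)
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0:
                continue
            # Covariance of column j with the residual that excludes column j
            rho = sum(X[i][j] * (residual[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new = soft_threshold(rho, lam) / col_sq[j]
            delta = new - beta[j]
            if delta != 0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                beta[j] = new
    return beta

def two_stage(pvalues, X, y, q=0.05, lam=0.1):
    """Stage one selects SNPs by FDR; stage two keeps those with non-zero LASSO weight."""
    selected = benjamini_hochberg(pvalues, q)
    if not selected:
        return []
    X_sub = [[row[j] for j in selected] for row in X]
    beta = lasso(X_sub, y, lam)
    return [j for j, b in zip(selected, beta) if b != 0.0]
```

    In practice the penalty would be tuned (for example by cross-validation) and genotype columns would usually be standardised; both are left out here to keep the sketch short.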

  15. Genome-wide analysis of transcriptional regulators in human HSPCs reveals a densely interconnected network of coding and noncoding genes.

    PubMed

    Beck, Dominik; Thoms, Julie A I; Perera, Dilmi; Schütte, Judith; Unnikrishnan, Ashwin; Knezevic, Kathy; Kinston, Sarah J; Wilson, Nicola K; O'Brien, Tracey A; Göttgens, Berthold; Wong, Jason W H; Pimanda, John E

    2013-10-03

    Genome-wide combinatorial binding patterns for key transcription factors (TFs) have not been reported for primary human hematopoietic stem and progenitor cells (HSPCs), and have constrained analysis of the global architecture of molecular circuits controlling these cells. Here we provide high-resolution genome-wide binding maps for a heptad of key TFs (FLI1, ERG, GATA2, RUNX1, SCL, LYL1, and LMO2) in human CD34(+) HSPCs, together with quantitative RNA and microRNA expression profiles. We catalog binding of TFs at coding genes and microRNA promoters, and report that combinatorial binding of all 7 TFs is favored and associated with differential expression of genes and microRNA in HSPCs. We also uncover a previously unrecognized association between FLI1 and RUNX1 pairing in HSPCs, we establish a correlation between the density of histone modifications that mark active enhancers and the number of overlapping TFs at a peak, we demonstrate bivalent histone marks at promoters of heptad target genes in CD34(+) cells that are poised for later expression, and we identify complex relationships between specific microRNAs and coding genes regulated by the heptad. Taken together, these data reveal the power of integrating multifactor sequencing of chromatin immunoprecipitates with coding and noncoding gene expression to identify regulatory circuits controlling cell identity.

  16. Meta-analysis of genome-wide association studies identifies common variants in CTNNA2 associated with excitement-seeking

    PubMed Central

    Terracciano, A; Esko, T; Sutin, A R; de Moor, M H M; Meirelles, O; Zhu, G; Tanaka, T; Giegling, I; Nutile, T; Realo, A; Allik, J; Hansell, N K; Wright, M J; Montgomery, G W; Willemsen, G; Hottenga, J-J; Friedl, M; Ruggiero, D; Sorice, R; Sanna, S; Cannas, A; Räikkönen, K; Widen, E; Palotie, A; Eriksson, J G; Cucca, F; Krueger, R F; Lahti, J; Luciano, M; Smoller, J W; van Duijn, C M; Abecasis, G R; Boomsma, D I; Ciullo, M; Costa, P T; Ferrucci, L; Martin, N G; Metspalu, A; Rujescu, D; Schlessinger, D; Uda, M

    2011-01-01

    The tendency to seek stimulating activities and intense sensations defines excitement-seeking, a personality trait akin to some aspects of sensation-seeking. This trait is a central feature of extraversion and is a component of the multifaceted impulsivity construct. Those who score high on measures of excitement-seeking are more likely to smoke, use other drugs, gamble, drive recklessly, have unsafe/unprotected sex and engage in other risky behaviors of clinical and social relevance. To identify common genetic variants associated with the Excitement-Seeking scale of the Revised NEO Personality Inventory, we performed genome-wide association studies in six samples of European ancestry (N=7860), and combined the results in a meta-analysis. We identified a genome-wide significant association between the Excitement-Seeking scale and rs7600563 (P=2 × 10^-8). This single-nucleotide polymorphism maps within the catenin cadherin-associated protein, alpha 2 (CTNNA2) gene, which encodes a brain-expressed α-catenin critical for synaptic contact. The effect of rs7600563 was in the same direction in all six samples, but did not replicate in additional samples (N=5105). The results provide insight into the genetics of excitement-seeking and risk-taking, and are relevant to hyperactivity, substance use, antisocial and bipolar disorders. PMID:22833195

  17. A Genome-Wide Association Study for Agronomic Traits in Soybean Using SNP Markers and SNP-Based Haplotype Analysis

    PubMed Central

    de Oliveira, Marco Antônio Rott; Higashi, Wilson; Scapim, Carlos Alberto; Schuster, Ivan

    2017-01-01

    Mapping quantitative trait loci through the use of linkage disequilibrium (LD) in populations of unrelated individuals provides a valuable approach for dissecting the genetic basis of complex traits in soybean (Glycine max). The haplotype-based genome-wide association study (GWAS) has now been proposed as a complementary approach to intensify benefits from LD, enabling assessment of the genetic determinants of agronomic traits. In this study a GWAS was undertaken to identify genomic regions that control 100-seed weight (SW), plant height (PH) and seed yield (SY) in a soybean association mapping panel using single nucleotide polymorphism (SNP) markers and haplotype information. The soybean cultivars (N = 169) were field-evaluated across four locations of southern Brazil. The genome-wide haplotype association analysis (941 haplotypes) identified eleven, seventeen and fifty-nine SNP-based haplotypes significantly associated with SY, SW and PH, respectively. Although most marker-trait associations were environment and trait specific, stable haplotype associations were identified for SY and SW across environments (i.e., haplotypes Gm12_Hap12). The haplotype block 42 on Chr19 (Gm19_Hap42) was confirmed to be associated with PH in two environments. These findings enable us to refine the breeding strategy for tropical soybean and confirm that haplotype-based GWAS can provide new insights on the genetic determinants that are not captured by the single-marker approach. PMID:28152092

  18. A genome-wide gene-expression analysis and database in transgenic mice during development of amyloid or tau pathology.

    PubMed

    Matarin, Mar; Salih, Dervis A; Yasvoina, Marina; Cummings, Damian M; Guelfi, Sebastian; Liu, Wenfei; Nahaboo Solim, Muzammil A; Moens, Thomas G; Paublete, Rocio Moreno; Ali, Shabinah S; Perona, Marina; Desai, Roshni; Smith, Kenneth J; Latcham, Judy; Fulleylove, Michael; Richardson, Jill C; Hardy, John; Edwards, Frances A

    2015-02-03

    We provide microarray data comparing genome-wide differential expression and pathology throughout life in four lines of "amyloid" transgenic mice (mutant human APP, PSEN1, or APP/PSEN1) and "TAU" transgenic mice (mutant human MAPT gene). Microarray data were validated by qPCR and by comparison to human studies, including genome-wide association study (GWAS) hits. Immune gene expression correlated tightly with plaques whereas synaptic genes correlated negatively with neurofibrillary tangles. Network analysis of immune gene modules revealed six hub genes in hippocampus of amyloid mice, four in common with cortex. The hippocampal network in TAU mice was similar except that Trem2 had hub status only in amyloid mice. The cortical network of TAU mice was entirely different with more hub genes and few in common with the other networks, suggesting reasons for specificity of cortical dysfunction in FTDP17. This Resource opens up many areas for investigation. All data are available and searchable at http://www.mouseac.org.

  19. Mining gold dust under the genome wide significance level: a two-stage approach to analysis of GWAS.

    PubMed

    Shi, Gang; Boerwinkle, Eric; Morrison, Alanna C; Gu, C Charles; Chakravarti, Aravinda; Rao, D C

    2011-02-01

    We propose a two-stage approach to analyze genome-wide association data in order to identify a set of promising single-nucleotide polymorphisms (SNPs). In stage one, we select a list of top signals from single SNP analyses by controlling false discovery rate. In stage two, we use the least absolute shrinkage and selection operator (LASSO) regression to reduce false positives. The proposed approach was evaluated using simulated quantitative traits based on genome-wide SNP data on 8,861 Caucasian individuals from the Atherosclerosis Risk in Communities (ARIC) Study. Our first stage, targeted at controlling false negatives, yields better power than using Bonferroni-corrected significance level. The LASSO regression reduces the number of significant SNPs in stage two: it reduces false-positive SNPs and it reduces true-positive SNPs also at simulated causal loci due to linkage disequilibrium. Interestingly, the LASSO regression preserves the power from stage one, i.e., the number of causal loci detected from the LASSO regression in stage two is almost the same as in stage one, while reducing false positives further. Real data on systolic blood pressure in the ARIC study was analyzed using our two-stage approach which identified two significant SNPs, one of which was reported to be genome-significant in a meta-analysis containing a much larger sample size. On the other hand, a single SNP association scan did not yield any significant results.

  20. Genome-wide association analysis for chronic venous disease identifies EFEMP1 and KCNH8 as susceptibility loci

    PubMed Central

    Ellinghaus, Eva; Ellinghaus, David; Krusche, Petra; Greiner, Aljoscha; Schreiber, Claudia; Nikolaus, Susanna; Gieger, Christian; Strauch, Konstantin; Lieb, Wolfgang; Rosenstiel, Philip; Frings, Norbert; Fiebig, Andreas; Schreiber, Stefan; Franke, Andre

    2017-01-01

    Chronic venous disease (CVD) is a multifactorial condition representing one of the most common disorders among populations of Western countries. The heritability of about 17% suggests genetic risk factors in CVD etiology. However, so far the genetic causes are unknown. We undertook the hitherto first genome-wide association study (GWAS) for CVD, analyzing more than 1.93 M SNPs in 4,942 German individuals, followed by replication in two independent German data sets. The combined analysis of discovery and replication stages (2,269 cases and 7,765 controls) yielded robust associations within the two genes EFEMP1 and KCNH8 (rs17278665, rs727139 with P < 5 × 10^-8), and suggestive association within gene SKAP2 (rs2030136 with P < 5 × 10^-7). Association signals of rs17278665 and rs727139 reside in regions of low linkage disequilibrium containing no other genes. Data from the ENCODE and Roadmap Epigenomics projects show that tissue specific marks overlap with the variants. SNPs rs17278665 and rs2030136 are known eQTLs. Our study demonstrates that GWAS are a valuable tool to study the genetic component of CVD. With our approach, we identified two novel genome-wide significant susceptibility loci for this common disease. Particularly, the extracellular matrix glycoprotein EFEMP1 is promising for future functional studies due to its antagonistic role in vessel development and angiogenesis. PMID:28374850
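
    The two P-value cut-offs used in the abstract above can be written as a small classifier. The thresholds are taken from the abstract; the function names are hypothetical, and the Bonferroni helper reflects the common convention of reading 5 × 10^-8 as 0.05 divided by roughly one million independent tests, which is an assumption here and not something the abstract states.

```python
GENOME_WIDE = 5e-8   # threshold quoted for EFEMP1 and KCNH8
SUGGESTIVE = 5e-7    # threshold quoted for SKAP2

def classify(p_value):
    """Label a P value using the two cut-offs quoted in the abstract."""
    if p_value < GENOME_WIDE:
        return "genome-wide significant"
    if p_value < SUGGESTIVE:
        return "suggestive"
    return "not significant"

def bonferroni_threshold(alpha, n_tests):
    """Per-test threshold that keeps the family-wise error rate at alpha."""
    return alpha / n_tests

print(classify(3e-9), classify(2e-7), classify(1e-4))
print(bonferroni_threshold(0.05, 1_000_000))
```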

  1. Genome-Wide Analysis of Seed Acid Detergent Lignin (ADL) and Hull Content in Rapeseed (Brassica napus L.).

    PubMed

    Wang, Jia; Jian, Hongju; Wei, Lijuan; Qu, Cunmin; Xu, Xinfu; Lu, Kun; Qian, Wei; Li, Jiana; Li, Maoteng; Liu, Liezhao

    2015-01-01

    A stable yellow-seeded variety is the breeding goal for obtaining the ideal rapeseed (Brassica napus L.) plant, and the amount of acid detergent lignin (ADL) in the seeds and the hull content (HC) are often used as yellow-seeded rapeseed screening indices. In this study, a genome-wide association analysis of 520 accessions was performed using the Q + K model with a total of 31,839 single-nucleotide polymorphism (SNP) sites. As a result, three significant associations on the B. napus chromosomes A05, A09, and C05 were detected for seed ADL content. The peak SNPs were within 9.27, 14.22, and 20.86 kb of the key genes BnaA.PAL4, BnaA.CAD2/BnaA.CAD3, and BnaC.CCR1, respectively. Further analyses were performed on the major locus of A05, which was also detected in the seed HC examination. A comparison of our genome-wide association study (GWAS) results and previous linkage mappings revealed a common chromosomal region on A09, which indicates that GWAS can be used as a powerful complementary strategy for dissecting complex traits in B. napus. Genomic selection (GS) utilizing the significant SNP markers based on the GWAS results exhibited increased predictive ability, indicating that the predictive ability of a given model can be substantially improved by using GWAS and GS.

  2. Compare and Contrast Meta Analysis (CCMA): A Method for Identification of Pleiotropic Loci in Genome-Wide Association Studies.

    PubMed

    Baurecht, Hansjörg; Hotze, Melanie; Rodríguez, Elke; Manz, Judith; Weidinger, Stephan; Cordell, Heather J; Augustin, Thomas; Strauch, Konstantin

    2016-01-01

    In recent years, genome-wide association studies (GWAS) have identified many loci that are shared among common disorders and this has raised interest in pleiotropy. For performing appropriate analysis, several methods have been proposed, e.g. conducting a look-up in external sources or exploiting GWAS results by meta-analysis based methods. We recently proposed the Compare & Contrast Meta-Analysis (CCMA) approach where significance thresholds were obtained by simulation. Here we present analytical formulae for the density and cumulative distribution function of the CCMA test statistic under the null hypothesis of no pleiotropy and no association, which, conveniently for practical reasons, turns out to be exponentially distributed. This allows researchers to apply the CCMA method without having to rely on simulations. Finally, we show that CCMA demonstrates power to detect disease-specific, agonistic and antagonistic loci comparable to the frequently used Subset-Based Meta-Analysis approach, while better controlling the type I error rate.

  3. A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease

    PubMed Central

    Kyriakou, Theodosios; Nelson, Christopher P; Hopewell, Jemma C; Webb, Thomas R; Zeng, Lingyao; Dehghan, Abbas; Alver, Maris; Armasu, Sebastian M; Auro, Kirsi; Bjonnes, Andrew; Chasman, Daniel I; Chen, Shufeng; Ford, Ian; Franceschini, Nora; Gieger, Christian; Grace, Christopher; Gustafsson, Stefan; Huang, Jie; Hwang, Shih-Jen; Kim, Yun Kyoung; Kleber, Marcus E; Lau, King Wai; Lu, Xiangfeng; Lu, Yingchang; Lyytikäinen, Leo-Pekka; Mihailov, Evelin; Morrison, Alanna C; Pervjakova, Natalia; Qu, Liming; Rose, Lynda M; Salfati, Elias; Saxena, Richa; Scholz, Markus; Smith, Albert V; Tikkanen, Emmi; Uitterlinden, Andre; Yang, Xueli; Zhang, Weihua; Zhao, Wei; de Andrade, Mariza; de Vries, Paul S; van Zuydam, Natalie R; Anand, Sonia S; Bertram, Lars; Beutner, Frank; Dedoussis, George; Frossard, Philippe; Gauguier, Dominique; Goodall, Alison H; Gottesman, Omri; Haber, Marc; Han, Bok-Ghee; Huang, Jianfeng; Jalilzadeh, Shapour; Kessler, Thorsten; König, Inke R; Lannfelt, Lars; Lieb, Wolfgang; Lind, Lars; Lindgren, Cecilia M; Lokki, Marja-Liisa; Magnusson, Patrik K; Mallick, Nadeem H; Mehra, Narinder; Meitinger, Thomas; Memon, Fazal-ur-Rehman; Morris, Andrew P; Nieminen, Markku S; Pedersen, Nancy L; Peters, Annette; Rallidis, Loukianos S; Rasheed, Asif; Samuel, Maria; Shah, Svati H; Sinisalo, Juha; Stirrups, Kathleen E; Trompet, Stella; Wang, Laiyuan; Zaman, Khan S; Ardissino, Diego; Boerwinkle, Eric; Borecki, Ingrid B; Bottinger, Erwin P; Buring, Julie E; Chambers, John C; Collins, Rory; Cupples, L Adrienne; Danesh, John; Demuth, Ilja; Elosua, Roberto; Epstein, Stephen E; Esko, Tõnu; Feitosa, Mary F; Franco, Oscar H; Franzosi, Maria Grazia; Granger, Christopher B; Gu, Dongfeng; Gudnason, Vilmundur; Hall, Alistair S; Hamsten, Anders; Harris, Tamara B; Hazen, Stanley L; Hengstenberg, Christian; Hofman, Albert; Ingelsson, Erik; Iribarren, Carlos; Jukema, J Wouter; Karhunen, Pekka J; Kim, Bong-Jo; Kooner, Jaspal S; Kullo, Iftikhar J; Lehtimäki, Terho; Loos, Ruth J F; Melander, Olle; Metspalu, Andres; März, Winfried; Palmer, Colin N; Perola, Markus; Quertermous, Thomas; Rader, Daniel J; Ridker, Paul M; Ripatti, Samuli; Roberts, Robert; Salomaa, Veikko; Sanghera, Dharambir K; Schwartz, Stephen M; Seedorf, Udo; Stewart, Alexandre F; Stott, David J; Thiery, Joachim; Zalloua, Pierre A; O’Donnell, Christopher J; Reilly, Muredach P; Assimes, Themistocles L; Thompson, John R; Erdmann, Jeanette; Clarke, Robert; Watkins, Hugh; Kathiresan, Sekar; McPherson, Ruth; Deloukas, Panos; Schunkert, Heribert; Samani, Nilesh J; Farrall, Martin

    2015-01-01

    Existing knowledge of genetic variants affecting risk of coronary artery disease (CAD) is largely based on genome-wide association studies (GWAS) analysis of common SNPs. Leveraging phased haplotypes from the 1000 Genomes Project, we report a GWAS meta-analysis of 185 thousand CAD cases and controls, interrogating 6.7 million common (MAF>0.05) as well as 2.7 million low frequency (0.005<MAF<0.05) variants. [...] analysis provides a comprehensive survey of the fine genetic architecture of CAD showing that genetic susceptibility to this common disease is largely determined by common SNPs of small effect size. PMID:26343387

  4. An integrated pipeline for the genome-wide analysis of transcription factor binding sites from ChIP-Seq.

    PubMed

    Mercier, Eloi; Droit, Arnaud; Li, Leping; Robertson, Gordon; Zhang, Xuekui; Gottardo, Raphael

    2011-02-16

    ChIP-Seq has become the standard method for genome-wide profiling of the DNA association of transcription factors. To simplify analyzing and interpreting ChIP-Seq data, which typically involves using multiple applications, we describe an integrated, open source, R-based analysis pipeline. The pipeline addresses data input, peak detection, sequence and motif analysis, visualization, and data export, and can readily be extended via other R and Bioconductor packages. Using a standard multicore computer, it can be used with datasets consisting of tens of thousands of enriched regions. We demonstrate its effectiveness on published human ChIP-Seq datasets for FOXA1, ER, CTCF and STAT1, where it detected co-occurring motifs that were consistent with the literature but not detected by other methods. Our pipeline provides the first complete set of Bioconductor tools for sequence and motif analysis of ChIP-Seq and ChIP-chip data.

  5. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants.

    PubMed

    Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe

    2016-01-01

    Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies (CYP93A-K), with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution: CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS) analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis.

  6. Genome-wide association analysis and differential expression analysis of resistance to Sclerotinia stem rot in Brassica napus.

    PubMed

    Wei, Lijuan; Jian, Hongju; Lu, Kun; Filardo, Fiona; Yin, Nengwen; Liu, Liezhao; Qu, Cunmin; Li, Wei; Du, Hai; Li, Jiana

    2016-06-01

    Brassica napus is one of the most important oil crops in the world, and stem rot caused by the fungus Sclerotinia sclerotiorum results in major losses in yield and quality. To elucidate resistance genes and pathogenesis-related genes, genome-wide association analysis of 347 accessions was performed using the Illumina 60K Brassica SNP (single nucleotide polymorphism) array. In addition, the detached stem inoculation assay was used to select five highly resistant (R) and susceptible (S) B. napus lines, 48 h postinoculation with S. sclerotiorum for transcriptome sequencing. We identified 17 significant associations for stem resistance on chromosomes A8 and C6, five of which were on A8 and 12 on C6. The SNPs identified on A8 were located in a 409-kb haplotype block, and those on C6 were consistent with previous QTL mapping efforts. Transcriptome analysis suggested that S. sclerotiorum infection activates the immune system, sulphur metabolism, especially glutathione (GSH) and glucosinolates in both R and S genotypes. Genes found to be specific to the R genotype related to the jasmonic acid pathway, lignin biosynthesis, defence response, signal transduction and encoding transcription factors. Twenty-four genes were identified in both the SNP-trait association and transcriptome sequencing analyses, including a tau class glutathione S-transferase (GSTU) gene cluster. This study provides useful insight into the molecular mechanisms underlying the plant's response to S. sclerotiorum.

  7. Genome-Wide Analysis, Classification, Evolution, and Expression Analysis of the Cytochrome P450 93 Family in Land Plants

    PubMed Central

    Du, Hai; Ran, Feng; Dong, Hong-Li; Wen, Jing; Li, Jia-Na; Liang, Zhe

    2016-01-01

    Cytochrome P450 93 family (CYP93) belonging to the cytochrome P450 superfamily plays important roles in diverse plant processes. However, no previous studies have investigated the evolution and expression of the members of this family. In this study, we performed comprehensive genome-wide analysis to identify CYP93 genes in 60 green plants. In all, 214 CYP93 proteins were identified; they were specifically found in flowering plants and could be classified into ten subfamilies—CYP93A–K, with the last two being identified first. CYP93A is the ancestor that was derived in flowering plants, and the remaining showed lineage-specific distribution—CYP93B and CYP93C are present in dicots; CYP93F is distributed only in Poaceae; CYP93G and CYP93J are monocot-specific; CYP93E is unique to legumes; CYP93H and CYP93K are only found in Aquilegia coerulea, and CYP93D is Brassicaceae-specific. Each subfamily generally has conserved gene numbers, structures, and characteristics, indicating functional conservation during evolution. Synonymous nucleotide substitution (dN/dS) analysis showed that CYP93 genes are under strong negative selection. Comparative expression analyses of CYP93 genes in dicots and monocots revealed that they are preferentially expressed in the roots and tend to be induced by biotic and/or abiotic stresses, in accordance with their well-known functions in plant secondary biosynthesis. PMID:27760179

  8. Meta-analysis of genome-wide association studies identifies novel loci that influence cupping and the glaucomatous process

    PubMed Central

    Springelkamp, Henriët.; Höhn, René; Mishra, Aniket; Hysi, Pirro G.; Khor, Chiea-Chuen; Loomis, Stephanie J.; Bailey, Jessica N. Cooke; Gibson, Jane; Thorleifsson, Gudmar; Janssen, Sarah F.; Luo, Xiaoyan; Ramdas, Wishal D.; Vithana, Eranga; Nongpiur, Monisha E.; Montgomery, Grant W.; Xu, Liang; Mountain, Jenny E.; Gharahkhani, Puya; Lu, Yi; Amin, Najaf; Karssen, Lennart C.; Sim, Kar-Seng; van Leeuwen, Elisabeth M.; Iglesias, Adriana I.; Verhoeven, Virginie J. M.; Hauser, Michael A.; Loon, Seng-Chee; Despriet, Dominiek D. G.; Nag, Abhishek; Venturini, Cristina; Sanfilippo, Paul G.; Schillert, Arne; Kang, Jae H.; Landers, John; Jonasson, Fridbert; Cree, Angela J.; van Koolwijk, Leonieke M. E.; Rivadeneira, Fernando; Souzeau, Emmanuelle; Jonsson, Vesteinn; Menon, Geeta; Mitchell, Paul; Wang, Jie Jin; Rochtchina, Elena; Attia, John; Scott, Rodney; Holliday, Elizabeth G.; Wong, Tien-Yin; Baird, Paul N.; Xie, Jing; Inouye, Michael; Viswanathan, Ananth; Sim, Xueling; Weinreb, Robert N.; de Jong, Paulus T. V. M.; Oostra, Ben A.; Uitterlinden, André G.; Hofman, Albert; Ennis, Sarah; Thorsteinsdottir, Unnur; Burdon, Kathryn P.; Allingham, R. Rand; Brilliant, Murray H.; Budenz, Donald L.; Cooke Bailey, Jessica N.; Christen, William G.; Fingert, John; Friedman, David S.; Gaasterland, Douglas; Gaasterland, Terry; Haines, Jonathan L.; Hauser, Michael A.; Kang, Jae Hee; Kraft, Peter; Lee, Richard K.; Lichter, Paul R.; Liu, Yutao; Loomis, Stephanie J.; Moroi, Sayoko E.; Pasquale, Louis R.; Pericak-Vance, Margaret A.; Realini, Anthony; Richards, Julia E.; Schuman, Joel S.; Scott, William K.; Singh, Kuldev; Sit, Arthur J.; Vollrath, Douglas; Weinreb, Robert N.; Wiggs, Janey L.; Wollstein, Gadi; Zack, Donald J.; Zhang, Kang; Donnelly (Chair), Peter; Barroso (Deputy Chair), Ines; Blackwell, Jenefer M.; Bramon, Elvira; Brown, Matthew A.; Casas, Juan P.; Corvin, Aiden; Deloukas, Panos; Duncanson, Audrey; Jankowski, Janusz; Markus, Hugh S.; Mathew, Christopher G.; Palmer, Colin N. A.; Plomin, Robert; Rautanen, Anna; Sawcer, Stephen J.; Trembath, Richard C.; Viswanathan, Ananth C.; Wood, Nicholas W.; Spencer, Chris C. A.; Band, Gavin; Bellenguez, Céline; Freeman, Colin; Hellenthal, Garrett; Giannoulatou, Eleni; Pirinen, Matti; Pearson, Richard; Strange, Amy; Su, Zhan; Vukcevic, Damjan; Donnelly, Peter; Langford, Cordelia; Hunt, Sarah E.; Edkins, Sarah; Gwilliam, Rhian; Blackburn, Hannah; Bumpstead, Suzannah J.; Dronov, Serge; Gillman, Matthew; Gray, Emma; Hammond, Naomi; Jayakumar, Alagurevathi; McCann, Owen T.; Liddle, Jennifer; Potter, Simon C.; Ravindrarajah, Radhi; Ricketts, Michelle; Waller, Matthew; Weston, Paul; Widaa, Sara; Whittaker, Pamela; Barroso, Ines; Deloukas, Panos; Mathew (Chair), Christopher G.; Blackwell, Jenefer M.; Brown, Matthew A.; Corvin, Aiden; Spencer, Chris C. A.; Spector, Timothy D.; Mirshahi, Alireza; Saw, Seang-Mei; Vingerling, Johannes R.; Teo, Yik-Ying; Haines, Jonathan L.; Wolfs, Roger C. W.; Lemij, Hans G.; Tai, E-Shyong; Jansonius, Nomdo M.; Jonas, Jost B.; Cheng, Ching-Yu; Aung, Tin; Viswanathan, Ananth C.; Klaver, Caroline C. W.; Craig, Jamie E.; Macgregor, Stuart; Mackey, David A.; Lotery, Andrew J.; Stefansson, Kari; Bergen, Arthur A. B.; Young, Terri L.; Wiggs, Janey L.; Pfeiffer, Norbert; Wong, Tien-Yin; Pasquale, Louis R.; Hewitt, Alex W.; van Duijn, Cornelia M.; Hammond, Christopher J.

    2014-01-01

    Glaucoma is characterized by irreversible optic nerve degeneration and is the most frequent cause of irreversible blindness worldwide. Here, the International Glaucoma Genetics Consortium conducts a meta-analysis of genome-wide association studies of vertical cup-disc ratio (VCDR), an important disease-related optic nerve parameter. In 21,094 individuals of European ancestry and 6,784 individuals of Asian ancestry, we identify 10 new loci associated with variation in VCDR. In a separate risk-score analysis of five case-control studies, Caucasians in the highest quintile have a 2.5-fold increased risk of primary open-angle glaucoma as compared with those in the lowest quintile. This study has more than doubled the known loci associated with optic disc cupping and will allow greater understanding of mechanisms involved in this common blinding condition. PMID:25241763

  9. Analysis of genome-wide structure, diversity and fine mapping of Mendelian traits in traditional and village chickens.

    PubMed

    Wragg, D; Mwacharo, J M; Alcalde, J A; Hocking, P M; Hanotte, O

    2012-07-01

    Extensive phenotypic variation is a common feature among village chickens found throughout much of the developing world, and in traditional chicken breeds that have been artificially selected for traits such as plumage variety. We present here an assessment of traditional and village chicken populations, for fine mapping of Mendelian traits using genome-wide single-nucleotide polymorphism (SNP) genotyping while providing information on their genetic structure and diversity. Bayesian clustering analysis reveals two main genetic backgrounds in traditional breeds, Kenyan, Ethiopian and Chilean village chickens. Analysis of linkage disequilibrium (LD) reveals useful LD (r² ≥ 0.3) in both traditional and village chickens at pairwise marker distances of ~10 Kb, while haplotype block analysis indicates a median block size of 11-12 Kb. Association mapping yielded refined mapping intervals for duplex comb (Gga 2:38.55-38.89 Mb) and rose comb (Gga 7:18.41-22.09 Mb) phenotypes in traditional breeds. Combined mapping information from traditional breeds and Chilean village chickens allows the oocyan phenotype to be fine mapped to two small regions (Gga 1:67.25-67.28 Mb, Gga 1:67.28-67.32 Mb) totalling ~75 Kb. Mapping the unmapped earlobe pigmentation phenotype supports previous findings that the trait is sex-linked and polygenic. A critical assessment of the number of SNPs required to map simple traits indicates that between 90 and 110K SNPs are required for full genome-wide analysis of haplotype block structure/ancestry, and for association mapping in both traditional and village chickens. Our results demonstrate the importance and uniqueness of phenotypic diversity and genetic structure of traditional chicken breeds for fine-scale mapping of Mendelian traits in the species, with village chicken populations providing further opportunities to enhance mapping resolutions.
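
    The LD threshold of 0.3 cited in the abstract can be illustrated with a minimal sketch of the standard pairwise r² statistic. It assumes phased haplotypes at two biallelic SNPs, each allele coded 0 or 1; the function name ld_r2 and the toy haplotypes are hypothetical and are not taken from the study.

```python
def ld_r2(haplotypes):
    """Return r^2 between two biallelic SNPs.

    haplotypes: list of (a, b) tuples, one per phased chromosome,
    where a and b are 0/1 alleles at the first and second SNP.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n            # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n            # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n  # 1-1 haplotype frequency
    d = p_ab - p_a * p_b                               # LD coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return float("nan")                            # undefined if either SNP is monomorphic
    return d * d / denom


# Toy data (hypothetical): complete LD versus linkage equilibrium.
complete_ld = [(1, 1), (0, 0), (1, 1), (0, 0)]
equilibrium = [(1, 1), (1, 0), (0, 1), (0, 0)]
print(ld_r2(complete_ld) >= 0.3)   # marker pair would count as "useful" LD
print(ld_r2(equilibrium) >= 0.3)   # marker pair would not
```

    In practice the haplotypes would come from phased genotype data, and r² would be averaged over marker pairs binned by physical distance.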

  10. Analysis of genome-wide structure, diversity and fine mapping of Mendelian traits in traditional and village chickens

    PubMed Central

    Wragg, D; Mwacharo, J M; Alcalde, J A; Hocking, P M; Hanotte, O

    2012-01-01

    Extensive phenotypic variation is a common feature among village chickens found throughout much of the developing world, and in traditional chicken breeds that have been artificially selected for traits such as plumage variety. We present here an assessment of traditional and village chicken populations, for fine mapping of Mendelian traits using genome-wide single-nucleotide polymorphism (SNP) genotyping while providing information on their genetic structure and diversity. Bayesian clustering analysis reveals two main genetic backgrounds in traditional breeds, Kenyan, Ethiopian and Chilean village chickens. Analysis of linkage disequilibrium (LD) reveals useful LD (r² ≥ 0.3) in both traditional and village chickens at pairwise marker distances of ~10 Kb, while haplotype block analysis indicates a median block size of 11–12 Kb. Association mapping yielded refined mapping intervals for duplex comb (Gga 2:38.55–38.89 Mb) and rose comb (Gga 7:18.41–22.09 Mb) phenotypes in traditional breeds. Combined mapping information from traditional breeds and Chilean village chickens allows the oocyan phenotype to be fine mapped to two small regions (Gga 1:67.25–67.28 Mb, Gga 1:67.28–67.32 Mb) totalling ~75 Kb. Mapping the unmapped earlobe pigmentation phenotype supports previous findings that the trait is sex-linked and polygenic. A critical assessment of the number of SNPs required to map simple traits indicates that between 90 and 110K SNPs are required for full genome-wide analysis of haplotype block structure/ancestry, and for association mapping in both traditional and village chickens. Our results demonstrate the importance and uniqueness of phenotypic diversity and genetic structure of traditional chicken breeds for fine-scale mapping of Mendelian traits in the species, with village chicken populations providing further opportunities to enhance mapping resolutions. PMID:22395157

  11. Genome-Wide Meta-Analysis of Myopia and Hyperopia Provides Evidence for Replication of 11 Loci

    PubMed Central

    Simpson, Claire L.; Wojciechowski, Robert; Oexle, Konrad; Murgia, Federico; Portas, Laura; Li, Xiaohui; Verhoeven, Virginie J. M.; Vitart, Veronique; Schache, Maria; Hosseini, S. Mohsen; Hysi, Pirro G.; Raffel, Leslie J.; Cotch, Mary Frances; Chew, Emily; Klein, Barbara E. K.; Klein, Ronald; Wong, Tien Yin; van Duijn, Cornelia M.; Mitchell, Paul; Saw, Seang Mei; Fossarello, Maurizio; Wang, Jie Jin; Polašek, Ozren; Campbell, Harry; Rudan, Igor; Oostra, Ben A.; Uitterlinden, André G.; Hofman, Albert; Rivadeneira, Fernando; Amin, Najaf; Karssen, Lennart C.; Vingerling, Johannes R.; Döring, Angela; Bettecken, Thomas; Bencic, Goran; Gieger, Christian; Wichmann, H.-Erich; Wilson, James F.; Venturini, Cristina; Fleck, Brian; Cumberland, Phillippa M.; Rahi, Jugnoo S.; Hammond, Chris J.; Hayward, Caroline; Wright, Alan F.; Paterson, Andrew D.; Baird, Paul N.; Klaver, Caroline C. W.; Rotter, Jerome I.; Pirastu, Mario; Meitinger, Thomas; Bailey-Wilson, Joan E.; Stambolian, Dwight

    2014-01-01

    Refractive error (RE) is a complex, multifactorial disorder characterized by a mismatch between the optical power of the eye and its axial length that causes object images to be focused off the retina. The two major subtypes of RE are myopia (nearsightedness) and hyperopia (farsightedness), which represent opposite ends of the distribution of the quantitative measure of spherical refraction. We performed a fixed effects meta-analysis of genome-wide association results of myopia and hyperopia from 9 studies of European-derived populations: AREDS, KORA, FES, OGP-Talana, MESA, RSI, RSII, RSIII and ERF. One genome-wide significant region was observed for myopia, corresponding to a previously identified myopia locus on 8q12 (p = 1.25×10⁻⁸), which has been reported by Kiefer et al. as significantly associated with myopia age at onset and Verhoeven et al. as significantly associated to mean spherical-equivalent (MSE) refractive error. We observed two genome-wide significant associations with hyperopia. These regions overlapped with loci on 15q14 (minimum p value = 9.11×10⁻¹¹) and 8q12 (minimum p value = 1.82×10⁻¹¹) previously reported for MSE and myopia age at onset. We also used an intermarker linkage-disequilibrium-based method for calculating the effective number of tests in targeted regional replication analyses. We analyzed myopia (which represents the closest phenotype in our data to the one used by Kiefer et al.) and showed replication of 10 additional loci associated with myopia previously reported by Kiefer et al. This is the first replication of these loci using myopia as the trait under analysis. “Replication-level” association was also seen between hyperopia and 12 of Kiefer et al.'s published loci. For the loci that show evidence of association to both myopia and hyperopia, the estimated effects of the risk alleles were in opposite directions for the two traits. This suggests that these loci are important contributors to variation of
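
    The fixed effects meta-analysis named in the abstract can be sketched as follows. This is a minimal illustration that assumes the common inverse-variance weighting scheme, which the abstract does not specify; the function name fixed_effects_meta and the per-study effect sizes and standard errors are hypothetical.

```python
import math


def fixed_effects_meta(betas, ses):
    """Combine per-study effect estimates with inverse-variance weights.

    betas: per-study effect estimates (e.g. log odds ratios) for one SNP.
    ses:   matching standard errors.
    Returns (pooled beta, pooled SE, z score, two-sided p-value).
    """
    weights = [1.0 / (se * se) for se in ses]          # w_i = 1 / SE_i^2
    total_w = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total_w
    se = math.sqrt(1.0 / total_w)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2.0))             # two-sided normal p-value
    return beta, se, z, p


# Hypothetical summary statistics for one SNP in two studies.
beta, se, z, p = fixed_effects_meta([0.1, 0.3], [0.1, 0.1])
print(beta, se, z, p)
```

    A pooled estimate of this kind would then be compared against a genome-wide significance threshold, SNP by SNP.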

  12. Genome-wide meta-analysis of myopia and hyperopia provides evidence for replication of 11 loci.

    PubMed

    Simpson, Claire L; Wojciechowski, Robert; Oexle, Konrad; Murgia, Federico; Portas, Laura; Li, Xiaohui; Verhoeven, Virginie J M; Vitart, Veronique; Schache, Maria; Hosseini, S Mohsen; Hysi, Pirro G; Raffel, Leslie J; Cotch, Mary Frances; Chew, Emily; Klein, Barbara E K; Klein, Ronald; Wong, Tien Yin; van Duijn, Cornelia M; Mitchell, Paul; Saw, Seang Mei; Fossarello, Maurizio; Wang, Jie Jin; Polašek, Ozren; Campbell, Harry; Rudan, Igor; Oostra, Ben A; Uitterlinden, André G; Hofman, Albert; Rivadeneira, Fernando; Amin, Najaf; Karssen, Lennart C; Vingerling, Johannes R; Döring, Angela; Bettecken, Thomas; Bencic, Goran; Gieger, Christian; Wichmann, H-Erich; Wilson, James F; Venturini, Cristina; Fleck, Brian; Cumberland, Phillippa M; Rahi, Jugnoo S; Hammond, Chris J; Hayward, Caroline; Wright, Alan F; Paterson, Andrew D; Baird, Paul N; Klaver, Caroline C W; Rotter, Jerome I; Pirastu, Mario; Meitinger, Thomas; Bailey-Wilson, Joan E; Stambolian, Dwight

    2014-01-01

    Refractive error (RE) is a complex, multifactorial disorder characterized by a mismatch between the optical power of the eye and its axial length that causes object images to be focused off the retina. The two major subtypes of RE are myopia (nearsightedness) and hyperopia (farsightedness), which represent opposite ends of the distribution of the quantitative measure of spherical refraction. We performed a fixed effects meta-analysis of genome-wide association results of myopia and hyperopia from 9 studies of European-derived populations: AREDS, KORA, FES, OGP-Talana, MESA, RSI, RSII, RSIII and ERF. One genome-wide significant region was observed for myopia, corresponding to a previously identified myopia locus on 8q12 (p = 1.25×10⁻⁸), which has been reported by Kiefer et al. as significantly associated with myopia age at onset and Verhoeven et al. as significantly associated to mean spherical-equivalent (MSE) refractive error. We observed two genome-wide significant associations with hyperopia. These regions overlapped with loci on 15q14 (minimum p value = 9.11×10⁻¹¹) and 8q12 (minimum p value = 1.82×10⁻¹¹) previously reported for MSE and myopia age at onset. We also used an intermarker linkage-disequilibrium-based method for calculating the effective number of tests in targeted regional replication analyses. We analyzed myopia (which represents the closest phenotype in our data to the one used by Kiefer et al.) and showed replication of 10 additional loci associated with myopia previously reported by Kiefer et al. This is the first replication of these loci using myopia as the trait under analysis. "Replication-level" association was also seen between hyperopia and 12 of Kiefer et al.'s published loci. For the loci that show evidence of association to both myopia and hyperopia, the estimated effects of the risk alleles were in opposite directions for the two traits. This suggests that these loci are important contributors to variation of refractive

  13. Genome-wide association analysis of young onset stroke identifies a locus on chromosome 10q25 near HABP2

    PubMed Central

    Cheng, Yu-Ching; Stanne, Tara M.; Giese, Anne-Katrin; Ho, Weang Kee; Traylor, Matthew; Amouyel, Philippe; Holliday, Elizabeth G.; Malik, Rainer; Xu, Huichun; Kittner, Steven J.; Cole, John W.; O’Connell, Jeffrey R.; Danesh, John; Rasheed, Asif; Zhao, Wei; Engelter, Stefan; Grond-Ginsbach, Caspar; Kamatani, Yoichiro; Lathrop, Mark; Leys, Didier; Thijs, Vincent; Metso, Tiina M.; Tatlisumak, Turgut; Pezzini, Alessandro; Parati, Eugenio A.; Norrving, Bo; Bevan, Steve; Rothwell, Peter M; Sudlow, Cathie; Slowik, Agnieszka; Lindgren, Arne; Walters, Matthew R; Jannes, Jim; Shen, Jess; Crosslin, David; Doheny, Kimberly; Laurie, Cathy C.; Kanse, Sandip M.; Bis, Joshua C.; Fornage, Myriam; Mosley, Thomas H.; Hopewell, Jemma C.; Strauch, Konstantin; Müller-Nurasyid, Martina; Gieger, Christian; Waldenberger, Melanie; Peters, Annette; Meisinger, Christine; Ikram, M. Arfan; Longstreth, WT; Meschia, James F.; Seshadri, Sudha; Sharma, Pankaj; Worrall, Bradford; Jern, Christina; Levi, Christopher; Dichgans, Martin; Boncoraglio, Giorgio B.; Markus, Hugh S.; Debette, Stephanie; Rolfs, Arndt; Saleheen, Danish; Mitchell, Braxton D.

    2015-01-01

    Background and Purpose: Although a genetic contribution to ischemic stroke is well recognized, only a handful of stroke loci have been identified by large-scale genetic association studies to date. Hypothesizing that genetic effects might be stronger for early- versus late-onset stroke, we conducted a two-stage meta-analysis of genome-wide association studies (GWAS), focusing on stroke cases with an age of onset < 60 years old. Methods: The Discovery stage of our GWAS included 4,505 cases and 21,968 controls of European, South-Asian and African ancestry, drawn from 6 studies. In Stage 2, we selected the lead genetic variants at loci with association P < 5×10⁻⁶ and performed in silico association analyses in an independent sample of up to 1,003 cases and 7,745 controls. Results: One stroke susceptibility locus at 10q25 reached genome-wide significance in the combined analysis of all samples from the Discovery and Follow-up Stages (rs11196288, OR = 1.41, P = 9.5×10⁻⁹). The associated locus is in an intergenic region between TCF7L2 and HABP2. In a further analysis in an independent sample, we found that two SNPs in high linkage disequilibrium with rs11196288 were significantly associated with total plasma factor VII-activating protease levels, a product of HABP2. Conclusions: HABP2, which encodes an extracellular serine protease involved in coagulation, fibrinolysis, and inflammatory pathways, may be a genetic susceptibility locus for early-onset stroke. PMID:26732560

  14. Genome-wide identification, classification, and expression analysis of sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis).

    PubMed

    Tao, P; Guo, W L; Li, B Y; Wang, W H; Yue, Z C; Lei, J L; Zhong, X M

    2015-10-05

    Small heat shock proteins (sHSPs) are essential for the plant's normal development and stress responses, especially the heat stress response. The information regarding sHSP genes in Chinese cabbage (Brassica rapa ssp pekinensis) is sparse, hence we performed a genome-wide analysis to identify sHSP genes in this species. We identified 26 non-redundant sHSP genes distributed on all chromosomes, except chromosome A7, with one additional sHSP gene identified from an expressed sequence tag library. Chinese cabbage was found to contain more sHSP genes than Arabidopsis. The 27 sHSP genes were classified into 11 subfamilies. We identified 22 groups of sHSP syntenic orthologous genes between Chinese cabbage and Arabidopsis. In addition, eight groups of paralogous genes were uncovered in Chinese cabbage. Protein structures of the 27 Chinese cabbage sHSPs were modeled using Phyre2, which revealed that all of them contain several conserved β strands across different subfamilies. In general, gene structure was conserved within each subfamily between Chinese cabbage and Arabidopsis, except for peroxisome sHSP. Analysis of promoter motifs showed that most sHSP genes contain heat shock elements or variants. We also found that biased gene loss has occurred during the evolution of the sHSP subfamily in Chinese cabbage. Expression analysis indicated that the greatest transcript abundance of most Chinese cabbage sHSP genes was found in siliques and early cotyledon embryos. Thus, genome-wide identification and characterization of sHSP genes is a first and important step in the investigation of sHSPs in Chinese cabbage.

  15. GWASTools: an R/Bioconductor package for quality control and analysis of genome-wide association studies

    PubMed Central

    Gogarten, Stephanie M.; Bhangale, Tushar; Conomos, Matthew P.; Laurie, Cecelia A.; McHugh, Caitlin P.; Painter, Ian; Zheng, Xiuwen; Crosslin, David R.; Levine, David; Lumley, Thomas; Nelson, Sarah C.; Rice, Kenneth; Shen, Jess; Swarnkar, Rohit; Weir, Bruce S.; Laurie, Cathy C.

    2012-01-01

    Summary: GWASTools is an R/Bioconductor package for quality control and analysis of genome-wide association studies (GWAS). GWASTools brings the interactive capability and extensive statistical libraries of R to GWAS. Data are stored in NetCDF format to accommodate extremely large datasets that cannot fit within R’s memory limits. The documentation includes instructions for converting data from multiple formats, including variants called from sequencing. GWASTools provides a convenient interface for linking genotypes and intensity data with sample and single nucleotide polymorphism annotation. Availability and implementation: GWASTools is implemented in R and is available from Bioconductor (http://www.bioconductor.org). An extensive vignette detailing a recommended work flow is included. Contact: sdmorris@uw.edu PMID:23052040

  16. Meta-analysis of genome-wide association studies identifies two loci associated with circulating osteoprotegerin levels

    PubMed Central

    Kwan, Johnny S.H.; Hsu, Yi-Hsiang; Cheung, Ching-Lung; Dupuis, Josée; Saint-Pierre, Aude; Eriksson, Joel; Handelman, Samuel K.; Aragaki, Aaron; Karasik, David; Pramstaller, Peter P.; Kooperberg, Charles; Lacroix, Andrea Z.; Larson, Martin G.; Lau, Kam-Shing; Lorentzon, Mattias; Pichler, Irene; Sham, Pak C.; Taliun, Daniel; Vandenput, Liesbeth; Kiel, Douglas P.; Hicks, Andrew A.; Jackson, Rebecca D.; Ohlsson, Claes; Benjamin, Emelia J.; Kung, Annie W.C.

    2014-01-01

    Osteoprotegerin (OPG) is involved in bone homeostasis and tumor cell survival. Circulating OPG levels are also important biomarkers of various clinical traits, such as cancers and atherosclerosis. OPG levels were measured in serum or in plasma. In a meta-analysis of genome-wide association studies in up to 10 336 individuals from European and Asian origin, we discovered that variants >100 kb upstream of the TNFRSF11B gene encoding OPG and another new locus on chromosome 17q11.2 were significantly associated with OPG variation. We also identified a suggestive locus on chromosome 14q21.2 associated with the trait. Moreover, we estimated that over half of the heritability of OPG levels could be explained by all variants examined in our study. Our findings provide further insight into the genetic regulation of circulating OPG levels. PMID:25080503

  17. The histone modification pattern of active genes revealed through genome-wide chromatin analysis of a higher eukaryote

    PubMed Central

    Schübeler, Dirk; MacAlpine, David M.; Scalzo, David; Wirbelauer, Christiane; Kooperberg, Charles; van Leeuwen, Fred; Gottschling, Daniel E.; O'Neill, Laura P.; Turner, Bryan M.; Delrow, Jeffrey; Bell, Stephen P.; Groudine, Mark

    2004-01-01

    The covalent modification of nucleosomal histones has emerged as a major determinant of chromatin structure and gene activity. To understand the interplay between various histone modifications, including acetylation and methylation, we performed a genome-wide chromatin structure analysis in a higher eukaryote. We found a binary pattern of histone modifications among euchromatic genes, with active genes being hyperacetylated for H3 and H4 and hypermethylated at Lys 4 and Lys 79 of H3, and inactive genes being hypomethylated and deacetylated at the same residues. Furthermore, the degree of modification correlates with the level of transcription, and modifications are largely restricted to transcribed regions, suggesting that their regulation is tightly linked to polymerase activity. PMID:15175259

  18. Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi

    PubMed Central

    Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo

    2015-01-01

    The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better

  19. Esophageal Cancer Epigenomics and Integrome Analysis of Genome-Wide Methylation and Expression in High Risk Northeast Indian Population.

    PubMed

    Singh, Virendra; Singh, Laishram Chandreshwor; Vasudevan, Madavan; Chattopadhyay, Indranil; Borthakar, Bibhuti Bhusan; Rai, Avdhesh Kumar; Phukan, Rup Kumar; Sharma, Jagannath; Mahanta, Jagadish; Kataki, Amal Chandra; Kapur, Sujala; Saxena, Sunita

    2015-11-01

    Esophageal cancer is a major global health burden with a strong host-environment interaction component and epigenomics underpinnings that remain to be elucidated further. Certain populations such as the Northeast Indians suffer at a disproportionately higher rate from this devastating disease. Promoter methylation is correlated with transcriptional silencing of various genes in esophageal cancer. Very few studies on genome-wide methylation for esophageal cancer exist and yet, no one has carried out an integromics analysis of methylation and gene expression. In the present study, genome-wide methylation was measured in samples collected from the Northeast Indian population by Infinium 450k array, and integration of the methylation data was performed. To prepare a network of genes displaying enriched pathways, together with the list of genes exhibiting promoter hypermethylation or hypomethylation with inversely correlated expression, we performed an integrome analysis. We identified 23 Integrome network enriched genes with relevance to tumor progression and associated with the processes involved in metastasis such as cell adhesion, integrin signaling, cytoskeleton, and extracellular matrix organizations. These included four genes (PTK2, RND1, RND3, and UBL3) with promoter hypermethylation and downregulation, and 19 genes (SEMG2, CD97, CTNND2, CADM3, OMD, NEFM, FBN2, CTNNB1, DLX6, UGT2B4, CCDC80, PZP, SERPINA4, TNFSF13B, NPC1, COL1A1, TAC3, BMP8A, and IL22RA2) with promoter hypomethylation and upregulation. A Methylation Efficiency Index was further calculated for these genes; the top five gene with the highest index were COL1A1, TAC3, SERPINA4, TNFSF13B, and IL22RA2. In conclusion, we recommend that the circulatory proteins IL22RA2, TNFSF13B, SERPINA4, and TAC3 in serum of patients and disease-free healthy controls can be examined in the future as putative noninvasive biomarkers.

  20. Genome-wide digital transcript analysis of putative fruitlet abscission related genes regulated by ethephon in litchi.

    PubMed

    Li, Caiqin; Wang, Yan; Ying, Peiyuan; Ma, Wuqiang; Li, Jianguo

    2015-01-01

    The high level of physiological fruitlet abscission in litchi (Litchi chinensis Sonn.) causes severe yield loss. Cell separation occurs at the fruit abscission zone (FAZ) and can be triggered by ethylene. However, a deep knowledge of the molecular events occurring in the FAZ is still unknown. Here, genome-wide digital transcript abundance (DTA) analysis of putative fruit abscission related genes regulated by ethephon in litchi were studied. More than 81 million high quality reads from seven ethephon treated and untreated control libraries were obtained by high-throughput sequencing. Through DTA profile analysis in combination with Gene Ontology and KEGG pathway enrichment analyses, a total of 2730 statistically significant candidate genes were involved in the ethephon-promoted litchi fruitlet abscission. Of these, there were 1867 early-responsive genes whose expressions were up- or down-regulated from 0 to 1 d after treatment. The most affected genes included those related to ethylene biosynthesis and signaling, auxin transport and signaling, transcription factors (TFs), protein ubiquitination, ROS response, calcium signal transduction, and cell wall modification. These genes could be clustered into four groups and 13 subgroups according to their similar expression patterns. qRT-PCR displayed the expression pattern of 41 selected candidate genes, which proved the accuracy of our DTA data. Ethephon treatment significantly increased fruit abscission and ethylene production of fruitlet. The possible molecular events to control the ethephon-promoted litchi fruitlet abscission were prompted out. The increased ethylene evolution in fruitlet would suppress the synthesis and polar transport of auxin and trigger abscission signaling. To the best of our knowledge, it is the first time to monitor the gene expression profile occurring in the FAZ-enriched pedicel during litchi fruit abscission induced by ethephon on the genome-wide level. This study will contribute to a better

  1. Correlation of microRNA levels during hypoxia with predicted target mRNAs through genome-wide microarray analysis

    PubMed Central

    Guimbellot, Jennifer S; Erickson, Stephen W; Mehta, Tapan; Wen, Hui; Page, Grier P; Sorscher, Eric J; Hong, Jeong S

    2009-01-01

    Background: Low levels of oxygen in tissues, seen in situations such as chronic lung disease, necrotic tumors, and high altitude exposures, initiate a signaling pathway that results in active transcription of genes possessing a hypoxia response element (HRE). The aim of this study was to investigate whether a change in miRNA expression following hypoxia could account for changes in the cellular transcriptome based on currently available miRNA target prediction tools. Methods: To identify changes induced by hypoxia, we conducted mRNA- and miRNA-array-based experiments in HT29 cells, and performed comparative analysis of the resulting data sets based on multiple target prediction algorithms. To date, few studies have investigated an environmental perturbation for effects on genome-wide miRNA levels, or their consequent influence on mRNA output. Results: Comparison of miRNAs with predicted mRNA targets indicated a lower level of concordance than expected. We did, however, find preliminary evidence of combinatorial regulation of mRNA expression by miRNA. Conclusion: Target prediction programs and expression profiling techniques do not yet adequately represent the complexity of miRNA-mediated gene repression, and new methods may be required to better elucidate these pathways. Our data suggest the physiologic impact of miRNAs on cellular transcription results from a multifaceted network of miRNA and mRNA relationships, working together in an interconnected system and in context of hundreds of RNA species. The methods described here for comparative analysis of cellular miRNA and mRNA will be useful for understanding genome wide regulatory responsiveness and refining miRNA predictive algorithms. PMID:19320992

  2. Pathway-Based Analysis of a Melanoma Genome-Wide Association Study: Analysis of Genes Related to Tumour-Immunosuppression

    PubMed Central

    Schoof, Nils; Iles, Mark M.; Bishop, D. Timothy; Newton-Bishop, Julia A.; Barrett, Jennifer H.; consortium, GenoMEL

    2011-01-01

    Systemic immunosuppression is a risk factor for melanoma, and sunburn-induced immunosuppression is thought to be causal. Genes in immunosuppression pathways are therefore candidate melanoma-susceptibility genes. If variants within these genes individually have a small effect on disease risk, the association may be undetected in genome-wide association (GWA) studies due to low power to reach a high significance level. Pathway-based approaches have been suggested as a method of incorporating a priori knowledge into the analysis of GWA studies. In this study, the association of 1113 single nucleotide polymorphisms (SNPs) in 43 genes (39 genomic regions) related to immunosuppression has been analysed using a gene-set approach in 1539 melanoma cases and 3917 controls from the GenoMEL consortium GWA study. The association between melanoma susceptibility and the whole set of tumour-immunosuppression genes, and also predefined functional subgroups of genes, was considered. The analysis was based on a measure formed by summing the evidence from the most significant SNP in each gene, and significance was evaluated empirically by case-control label permutation. An association was found between melanoma and the complete set of genes (pemp = 0.002), as well as the subgroups related to the generation of tolerogenic dendritic cells (pemp = 0.006) and secretion of suppressive factors (pemp = 0.0004), thus providing preliminary evidence of involvement of tumour-immunosuppression gene polymorphisms in melanoma susceptibility. The analysis was repeated on a second phase of the GenoMEL study, which showed no evidence of an association. As one of the first attempts to replicate a pathway-level association, our results suggest that low power and heterogeneity may present challenges. PMID:22216283
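    The gene-set measure described in this abstract (sum the evidence from the most significant SNP in each gene, then assess significance by permuting case-control labels) can be sketched roughly as follows. This is only an illustration: the per-SNP statistic (an allelic chi-square), the function names, and the toy data are assumptions, not details taken from the study.

```python
import random

def allelic_chi2(genotypes, labels):
    """2x2 allelic chi-square for one SNP (assumed per-SNP statistic).

    genotypes: minor-allele counts (0, 1 or 2) per individual.
    labels: 1 for case, 0 for control.
    """
    n_case = sum(labels)
    n_ctrl = len(labels) - n_case
    a = sum(g for g, l in zip(genotypes, labels) if l == 1)  # minor alleles, cases
    c = sum(g for g, l in zip(genotypes, labels) if l == 0)  # minor alleles, controls
    b = 2 * n_case - a                                       # major alleles, cases
    d = 2 * n_ctrl - c                                       # major alleles, controls
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0
    return (a + b + c + d) * (a * d - b * c) ** 2 / denom

def gene_set_statistic(genes, labels):
    """Sum over genes of the best (largest) SNP statistic within each gene."""
    return sum(max(allelic_chi2(snp, labels) for snp in snps)
               for snps in genes.values())

def empirical_p(genes, labels, n_perm=200, seed=0):
    """Empirical p-value from case-control label permutation."""
    rng = random.Random(seed)
    observed = gene_set_statistic(genes, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if gene_set_statistic(genes, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)

# Toy data (made up): gene G1 has one strongly associated SNP, G2 has none.
toy_genes = {
    "G1": [[2, 2, 2, 2, 0, 0, 0, 0]],
    "G2": [[0, 1, 0, 1, 0, 1, 0, 1]],
}
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
p_emp = empirical_p(toy_genes, toy_labels)
```

    Permuting the labels rather than the genotypes keeps the correlation between SNPs intact, which is why the set-level statistic can be compared directly with its permuted values.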

  3. Genome-wide association analysis identifies TYW3/CRYZ and NDST4 loci associated with circulating resistin levels

    PubMed Central

    Qi, Qibin; Menzaghi, Claudia; Smith, Shelly; Liang, Liming; de Rekeneire, Nathalie; Garcia, Melissa E.; Lohman, Kurt K.; Miljkovic, Iva; Strotmeyer, Elsa S.; Cummings, Steve R.; Kanaya, Alka M.; Tylavsky, Frances A.; Satterfield, Suzanne; Ding, Jingzhong; Rimm, Eric B.; Trischitta, Vincenzo; Hu, Frank B.; Liu, Yongmei; Qi, Lu

    2012-01-01

    Resistin is a polypeptide hormone that was reported to be associated with insulin resistance, inflammation and risk of type 2 diabetes and cardiovascular disease. We conducted a genome-wide association (GWA) study on circulating resistin levels in individuals of European ancestry drawn from the two independent studies: the Nurses' Health Study (n = 1590) and the Health, Aging and Body Composition Study (n = 1658). Single-nucleotide polymorphisms (SNPs) identified in the GWA analysis were replicated in an independent cohort of Europeans: the Gargano Family Study (n = 659). We confirmed the association with a previously known locus, the RETN gene (19p13.2), and identified two novel loci near the TYW3/CRYZ gene (1p31) and the NDST4 gene (4q25), associated with resistin levels at a genome-wide significant level, best represented by SNP rs3931020 (P = 6.37 × 10^-12) and SNP rs13144478 (P = 6.19 × 10^-18), respectively. Gene expression quantitative trait loci analyses showed a significant cis association between the SNP rs3931020 and CRYZ gene expression levels (P = 3.68 × 10^-7). We also found that both of these two SNPs were significantly associated with resistin gene (RETN) mRNA levels in white blood cells from 68 subjects with type 2 diabetes (both P = 0.02). In addition, the resistin-rising allele of the TYW3/CRYZ SNP rs3931020, but not the NDST4 SNP rs13144478, showed a consistent association with increased coronary heart disease risk [odds ratio = 1.18 (95% CI, 1.03–1.34); P = 0.01]. Our results suggest that genetic variants in TYW3/CRYZ and NDST4 loci may be involved in the regulation of circulating resistin levels. More studies are needed to verify the associations of the SNP rs13144478 with NDST4 gene expression and resistin-related disease. PMID:22843503

  4. Genetic factors in nonsmokers with age-related macular degeneration revealed through genome-wide gene-environment interaction analysis.

    PubMed

    Naj, Adam C; Scott, William K; Courtenay, Monique D; Cade, William H; Schwartz, Stephen G; Kovach, Jaclyn L; Agarwal, Anita; Wang, Gaofeng; Haines, Jonathan L; Pericak-Vance, Margaret A

    2013-05-01

    Relatively little is known about the interaction between genes and environment in the complex etiology of age-related macular degeneration (AMD). This study aimed to identify novel factors associated with AMD by analyzing gene-smoking interactions in a genome-wide association study of 1207 AMD cases and 686 controls of Caucasian background with genotype data on 668,238 single nucleotide polymorphisms (SNPs) after quality control. Participants' history of smoking at least 100 cigarettes lifetime was determined by a self-administered questionnaire. SNP associations modeled the effect of the minor allele additively on AMD using logistic regression, with adjustment for age, sex, and ever/never smoking. Joint effects of SNPs and smoking were examined comparing a null model containing only age, sex, and smoking against an extended model including genotypic and interaction terms. Genome-wide significant main effects were detected at three known AMD loci: CFH (P = 7.51 × 10^-30), ARMS2 (P = 1.94 × 10^-23), and RDBP/CFB/C2 (P = 4.37 × 10^-10), while joint effects analysis revealed three genomic regions with P < 10^-5. Analyses stratified by smoking found genetic associations largely restricted to nonsmokers, with one notable exception: the chromosome 18q22.1 intergenic SNP rs17073641 (between SERPINB8 and CDH7), more strongly associated in nonsmokers (OR = 0.57, P = 2.73 × 10^-5), with an inverse association among smokers (OR = 1.42, P = 0.00228), suggesting that smoking modifies the effect of some genetic polymorphisms on AMD risk.
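    The null and extended models compared in this abstract could be written as below. This is a hedged reconstruction: the symbols are introduced here for illustration, G denotes the minor-allele count (0, 1 or 2) under the additive coding the abstract describes, and the use of a 2-degree-of-freedom likelihood-ratio test for the joint effect is an assumption, since the abstract does not name the test.

```latex
% Null model: covariates only
\operatorname{logit} P(\mathrm{AMD}=1)
  = \beta_0 + \beta_1\,\mathrm{age} + \beta_2\,\mathrm{sex} + \beta_3\,\mathrm{smoke}

% Extended model: adds genotype and genotype-by-smoking interaction
\operatorname{logit} P(\mathrm{AMD}=1)
  = \beta_0 + \beta_1\,\mathrm{age} + \beta_2\,\mathrm{sex} + \beta_3\,\mathrm{smoke}
  + \beta_G\,G + \beta_{GS}\,(G \times \mathrm{smoke})

% Assumed joint test of H_0: \beta_G = \beta_{GS} = 0
\Lambda = 2\left(\ell_{\mathrm{ext}} - \ell_{\mathrm{null}}\right) \sim \chi^2_2
```

    Under this reading, a SNP with opposite effects in smokers and nonsmokers (as reported for rs17073641) can produce a weak main effect while still giving a sizeable joint statistic through the interaction term.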

  5. Genome-wide association analysis of agronomic traits in wheat under drought-stressed and non-stressed conditions

    PubMed Central

    Mwadzingeni, Learnmore; Shimelis, Hussein; Rees, D. Jasper G.; Tsilo, Toi J.

    2017-01-01

    This study determined the population structure and genome-wide marker-trait association of agronomic traits of wheat for drought-tolerance breeding. Ninety-three diverse bread wheat genotypes were genotyped using the Diversity Arrays Technology sequencing (DArTseq) protocol. The number of days-to-heading (DTH), number of days-to-maturity (DTM), plant height (PHT), spike length (SPL), number of kernels per spike (KPS), thousand kernel weight (TKW) and grain yield (GYLD), assessed under drought-stressed and non-stressed conditions, were considered for the study. Population structure analysis and genome-wide association mapping were undertaken based on 16,383 silico DArTs loci with < 10% missing data. The population evaluated was grouped into nine distinct genetic structures. Inter-chromosomal linkage disequilibrium showed the existence of linkage decay as physical distance increased. A total of 62 significant (P < 0.001) marker-trait associations (MTAs) were detected explaining more than 20% of the phenotypic variation observed under both drought-stressed and non-stressed conditions. Significant (P < 0.001) MTA event(s) were observed for DTH, PHT, SPL, SPS, and KPS; under both stressed and non-stressed conditions, while additional significant (P < 0.05) associations were observed for TKW, DTM and GYLD under non-stressed condition. The MTAs reported in this population could be useful to initiate marker-assisted selection (MAS) and targeted trait introgression of wheat under drought-stressed and non-stressed conditions, and for fine mapping and cloning of the underlying genes and QTL. PMID:28234945

  6. Genome-Wide Analysis of Host Responses to Four Different Types of Microorganisms in Bombyx Mori (Lepidoptera: Bombycidae)

    PubMed Central

    Lin, Ping; Huang, Lulin; Wu, Yuqian; Jin, Shengkai; Liu, Chun; Xia, Qingyou

    2016-01-01

    Several pathogenic microorganisms have been used to investigate the genome-wide transcriptional responses of Bombyx mori to infection. However, studies have so far each focused on one microorganism, and systematic genome-wide comparison of transcriptional responses to different pathogenic microorganisms has not been undertaken. Here, we surveyed transcriptional responses of B. mori to its natural bacterial, viral, and fungal pathogens, Bacillus bombyseptieus, B. mori nucleopolyhedrovirus (BmNPV), and Beauveria bassiana, respectively, and to nonpathogenic Escherichia coli, by microarray analysis. In total, the expression of 2,436, 1,804, 1,743, and 912 B. mori genes was modulated by infection with B. bombyseptieus, BmNPV, B. bassiana, and E. coli, respectively. Notably, the expression of 620, 400, 177, or 165 of these genes was only modulated by infection with B. bombyseptieus, BmNPV, B. bassiana, or E. coli, respectively. In contrast to the expression of genes related to juvenile hormone synthesis and metabolism, that of genes encoding juvenile hormone binding proteins was microorganism-specific. Three basal metabolic pathways were modulated by infection with any of the four microorganisms, and 3, 14, 5, and 2 metabolic pathways were specifically modulated by infection with B. bombyseptieus, BmNPV, B. bassiana, and E. coli, respectively. Interestingly, BmNPV infection modulated the JAK/STAT signaling pathway, whereas both the Imd and Toll signaling pathways were modulated by infection with B. bombyseptieus, B. bassiana, or E. coli. These results elucidate potential molecular mechanisms of the host response to different microorganisms, and provide a foundation for further work on host–pathogen interaction. PMID:27382132

  7. Genome-Wide Analysis of the Synonymous Codon Usage Patterns in Riemerella anatipestifer

    PubMed Central

    Liu, Jibin; Zhu, Dekang; Ma, Guangpeng; Liu, Mafeng; Wang, Mingshu; Jia, Renyong; Chen, Shun; Sun, Kunfeng; Yang, Qiao; Wu, Ying; Chen, Xiaoyue; Cheng, Anchun

    2016-01-01

    Riemerella anatipestifer (RA) belongs to the Flavobacteriaceae family and can cause a septicemia disease in poultry. The synonymous codon usage patterns of bacteria reflect a series of evolutionary changes that enable bacteria to improve tolerance of the various environments. We detailed the codon usage patterns of RA isolates from the available 12 sequenced genomes by multiple codon and statistical analysis. Nucleotide compositions and relative synonymous codon usage (RSCU) analysis revealed that A or U ending codons are predominant in RA. Neutrality analysis found no significant correlation between GC12 and GC3 (p > 0.05). Correspondence analysis and ENc-plot results showed that natural selection dominated over mutation in the codon usage bias. The tree of cluster analysis based on RSCU was concordant with dendrogram based on genomic BLAST by neighbor-joining method. By comparative analysis, about 50 highly expressed genes that were orthologs across all 12 strains were found in the top 5% of high CAI value. Based on these CAI values, we infer that RA contains a number of predicted highly expressed coding sequences, involved in transcriptional regulation and metabolism, reflecting their requirement for dealing with diverse environmental conditions. These results provide some useful information on the mechanisms that contribute to codon usage bias and evolution of RA. PMID:27517915
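    Relative synonymous codon usage (RSCU), one of the measures named in this abstract, is conventionally the observed count of a codon divided by the count expected if all synonymous codons for that amino acid were used equally. A minimal sketch follows, assuming the standard genetic code and an in-frame coding sequence; the function name and example sequence are hypothetical and not taken from the study.

```python
from collections import Counter

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

def rscu(cds):
    """Return {codon: RSCU} for every sense codon whose amino acid occurs in cds."""
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3          # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    values = {}
    for aa in set(AMINO) - {"*"}:             # skip stop codons
        family = [c for c in CODONS if CODON_TABLE[c] == aa]
        total = sum(counts[c] for c in family)
        if total == 0:
            continue                          # amino acid absent from the sequence
        for codon in family:
            # observed / (total / family size)
            values[codon] = counts[codon] * len(family) / total
    return values

# Hypothetical example: four alanine codons, three GCT and one GCA.
example = rscu("GCTGCTGCTGCA")
```

    An RSCU above 1 means the codon is used more often than expected under equal usage, so a genome dominated by A- or U-ending codons would show values above 1 mostly for those codons.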

  8. Multivariate Analysis of Anthropometric Traits Using Summary Statistics of Genome-Wide Association Studies from GIANT Consortium

    PubMed Central

    Zhu, Xiaofeng

    2016-01-01

    Meta-analysis of single trait for multiple cohorts has been used for increasing statistical power in genome-wide association studies (GWASs). Although hundreds of variants have been identified by GWAS, these variants only explain a small fraction of phenotypic variation. Cross-phenotype association analysis (CPASSOC) can further improve statistical power by searching for variants that contribute to multiple traits, which is often relevant to pleiotropy. In this study, we performed CPASSOC analysis on the summary statistics from the Genetic Investigation of ANthropometric Traits (GIANT) consortium using a novel method recently developed by our group. Sex-specific meta-analysis data for height, body mass index (BMI), and waist-to-hip ratio adjusted for BMI (WHRadjBMI) from discovery phase of the GIANT consortium study were combined using CPASSOC for each trait as well as 3 traits together. The conventional meta-analysis results from the discovery phase data of GIANT consortium studies were used to compare with that from CPASSOC analysis. The CPASSOC analysis was able to identify 17 loci associated with anthropometric traits that were missed by conventional meta-analysis. Among these loci, 16 have been reported in literature by including additional samples and 1 is novel. We also demonstrated that CPASSOC is able to detect pleiotropic effects when analyzing multiple traits. PMID:27701450

  9. High performance computing enabling exhaustive analysis of higher order single nucleotide polymorphism interaction in Genome Wide Association Studies.

    PubMed

    Goudey, Benjamin; Abedini, Mani; Hopper, John L; Inouye, Michael; Makalic, Enes; Schmidt, Daniel F; Wagner, John; Zhou, Zeyu; Zobel, Justin; Reumann, Matthias

    2015-01-01

    Genome-wide association studies (GWAS) are a common approach for systematic discovery of single nucleotide polymorphisms (SNPs) which are associated with a given disease. Univariate analysis approaches commonly employed may miss important SNP associations that only appear through multivariate analysis in complex diseases. However, multivariate SNP analysis is currently limited by its inherent computational complexity. In this work, we present a computational framework that harnesses supercomputers. Based on our results, we estimate that a three-way interaction analysis on GWAS data with 1.1 million SNPs would require over 5.8 years on the full "Avoca" IBM Blue Gene/Q installation at the Victorian Life Sciences Computation Initiative. This is hundreds of times faster than estimates for other CPU based methods and four times faster than runtimes estimated for GPU methods, indicating how the improvement in the level of hardware applied to interaction analysis may alter the types of analysis that can be performed. Furthermore, the same analysis would take under 3 months on the currently largest IBM Blue Gene/Q supercomputer "Sequoia" at the Lawrence Livermore National Laboratory assuming linear scaling is maintained as our results suggest. Given that the implementation used in this study can be further optimised, this runtime means it is becoming feasible to carry out exhaustive analysis of higher order interaction studies on large modern GWAS.
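    To give a sense of the scale behind these runtime estimates, the number of SNP combinations in an exhaustive k-way scan can be counted directly. The sketch below is a back-of-the-envelope illustration only; the function name is hypothetical, and the speedup figure is simply what the abstract's own numbers (5.8 years versus under 3 months) imply under its linear-scaling assumption.

```python
import math

def n_way_tests(n_snps, order):
    """Number of distinct SNP combinations in an exhaustive scan of a given order."""
    return math.comb(n_snps, order)

n_snps = 1_100_000
pairs = n_way_tests(n_snps, 2)      # roughly 6.0e11 two-way combinations
triples = n_way_tests(n_snps, 3)    # roughly 2.2e17 three-way combinations

# Minimum speedup implied by going from 5.8 years to under 3 months,
# if runtime scales linearly with the hardware applied.
implied_min_speedup = 5.8 * 12 / 3
```

    Moving from pairs to triples multiplies the work by several hundred thousand, which is why the abstract frames three-way analysis as a supercomputing problem.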

  10. High performance computing enabling exhaustive analysis of higher order single nucleotide polymorphism interaction in Genome Wide Association Studies

    PubMed Central

    2015-01-01

    Genome-wide association studies (GWAS) are a common approach for systematic discovery of single nucleotide polymorphisms (SNPs) which are associated with a given disease. Univariate analysis approaches commonly employed may miss important SNP associations that only appear through multivariate analysis in complex diseases. However, multivariate SNP analysis is currently limited by its inherent computational complexity. In this work, we present a computational framework that harnesses supercomputers. Based on our results, we estimate that a three-way interaction analysis on GWAS data with 1.1 million SNPs would require over 5.8 years on the full "Avoca" IBM Blue Gene/Q installation at the Victorian Life Sciences Computation Initiative. This is hundreds of times faster than estimates for other CPU based methods and four times faster than runtimes estimated for GPU methods, indicating how the improvement in the level of hardware applied to interaction analysis may alter the types of analysis that can be performed. Furthermore, the same analysis would take under 3 months on the currently largest IBM Blue Gene/Q supercomputer "Sequoia" at the Lawrence Livermore National Laboratory assuming linear scaling is maintained as our results suggest. Given that the implementation used in this study can be further optimised, this runtime means it is becoming feasible to carry out exhaustive analysis of higher order interaction studies on large modern GWAS. PMID:25870758

  11. Comparative Genome-Wide Analysis of the Malate Dehydrogenase Gene Families in Cotton

    PubMed Central

    Imran, Muhammad; Tang, Kai; Liu, Jin-Yuan

    2016-01-01

    Malate dehydrogenases (MDHs) play crucial roles in the physiological processes of plant growth and development. In this study, 13 and 25 MDH genes were identified from Gossypium raimondii and Gossypium hirsutum, respectively. Using these and 13 previously reported Gossypium arboretum MDH genes, a comparative molecular analysis between identified MDH genes from G. raimondii, G. hirsutum, and G. arboretum was performed. Based on multiple sequence alignments, cotton MDHs were divided into five subgroups: mitochondrial MDH, peroxisomal MDH, plastidial MDH, chloroplastic MDH and cytoplasmic MDH. Almost all of the MDHs within the same subgroup shared similar gene structure, amino acid sequence, and conserved motifs in their functional domains. An analysis of chromosomal localization suggested that segmental duplication played a major role in the expansion of cotton MDH gene families. Additionally, a selective pressure analysis indicated that purifying selection acted as a vital force in the evolution of MDH gene families in cotton. Meanwhile, an expression analysis showed the distinct expression profiles of GhMDHs in different vegetative tissues and at different fiber developmental stages, suggesting the functional diversification of these genes in cotton growth and fiber development. Finally, a promoter analysis indicated redundant but typical cis-regulatory elements for the potential functions and stress activity of many MDH genes. This study provides fundamental information for a better understanding of cotton MDH gene families and aids in functional analyses of the MDH genes in cotton fiber development. PMID:27829020

  12. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    PubMed

    Yanbing, Gu; Zhirui, Ji; Fumei, Chi; Zhuang, Qiao; Chengnan, Xu; Junxiang, Zhang; Zongshan, Zhou; Qinglong, Dong

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from the Pfam protein families database was used to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with the DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics software. In total, 61 WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group Ⅰ-N and group Ⅰ-C) were monophyletic. Group Ⅱ was sub-divided into five distinct clades (group Ⅱ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at the N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY genes were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins; motif 5, an unknown domain, was observed in group Ⅱ-d; and two WRKY domains were assigned to Group Ⅰ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal their specific roles.

  13. Genome-wide analysis of spatiotemporal gene expression patterns during early embryogenesis in rice.

    PubMed

    Itoh, Jun-Ichi; Sato, Yutaka; Sato, Yutaka; Hibara, Ken-Ichiro; Shimizu-Sato, Sae; Kobayashi, Hiromi; Takehisa, Hinako; Sanguinet, Karen A; Namiki, Nobukazu; Nagamura, Yoshiaki

    2016-04-01

    Embryogenesis in rice is different from that of most dicotyledonous plants in that it shows a non-stereotypic cell division pattern, formation of dorsal-ventral polarity, and endogenous initiation of the radicle. To reveal the transcriptional features associated with developmental events during rice early embryogenesis, we used microarray analysis coupled with laser microdissection to obtain both spatial and temporal transcription profiles. Our results allowed us to determine spatial expression foci for each expressed gene in the globular embryo, which revealed the importance of phytohormone-related genes and a suite of transcription factors to early embryogenesis. Our analysis showed the polarized expression of a small number of genes along the apical-basal and dorsal-ventral axes in the globular embryo, which tended to fluctuate in later developmental stages. We also analyzed gene expression patterns in the early globular embryo and how these relate to expression in embryonic organs at later stages. We confirmed the accuracy of the expression patterns found by microarray analysis of embryo subdomains using in situ hybridization. Our study identified homologous genes from Arabidopsis thaliana with known functions in embryogenesis in addition to unique and uncharacterized genes that show polarized expression patterns during embryogenesis. The results of this study are presented in a database to provide a framework for spatiotemporal gene expression during rice embryogenesis, to serve as a resource for future functional analysis of genes, and as a basis for comparative studies of plant embryogenesis.

  14. Genome-wide analysis of Aux/IAA and ARF gene families in Populus trichocarpa

    SciTech Connect

    Kalluri, Udaya C; DiFazio, Stephen P; Brunner, A.; Tuskan, Gerald A

    2007-01-01

    Auxin/Indole-3-Acetic Acid (Aux/IAA) and Auxin Response Factor (ARF) transcription factors are key regulators of auxin responses in plants. A total of 35 Aux/IAA and 39 ARF genes were identified in the Populus genome. Comparative phylogenetic analysis revealed that the subgroups PoptrARF2, 6, 9 and 16 and PoptrIAA3, 16, 27 and 29 have differentially expanded in Populus relative to Arabidopsis. Activator ARFs were found to be two fold-overrepresented in the Populus genome. PoptrIAA and PoptrARF gene families appear to have expanded due to high segmental and low tandem duplication events. Furthermore, expression studies showed that genes in the expanded PoptrIAA3 subgroup display differential expression. The gene-family analysis reported here will be useful in conducting future functional genomics studies to understand how the molecular roles of these large gene families translate into a diversity of biologically meaningful auxin effects.

  15. Genome-Wide Analysis of the NADK Gene Family in Plants

    PubMed Central

    Li, Wen-Yan; Wang, Xiang; Li, Ri; Li, Wen-Qiang; Chen, Kun-Ming

    2014-01-01

    Background NAD(H) kinase (NADK) is the key enzyme that catalyzes de novo synthesis of NADP(H) from NAD(H) for NADP(H)-based metabolic pathways. In plants, NADKs form functional subfamilies. Studies of these families in Arabidopsis thaliana indicate that they have undergone considerable evolutionary selection; however, the detailed evolutionary history and functions of the various NADKs in plants are not clearly understood. Principal Findings We performed a comparative genomic analysis that identified 74 NADK gene homologs from 24 species representing the eight major plant lineages within the supergroup Plantae: glaucophytes, rhodophytes, chlorophytes, bryophytes, lycophytes, gymnosperms, monocots and eudicots. Phylogenetic and structural analysis classified these NADK genes into four well-conserved subfamilies with considerable variety in the domain organization and gene structure among subfamily members. In addition to the typical NAD_kinase domain, additional domains, such as adenylate kinase, dual-specificity phosphatase, and protein tyrosine phosphatase catalytic domains, were found in subfamily II. Interestingly, NADKs in subfamily III exhibited low sequence similarity (∼30%) in the kinase domain within the subfamily and with the other subfamilies. These observations suggest that gene fusion and exon shuffling may have occurred after gene duplication, leading to specific domain organization seen in subfamilies II and III, respectively. Further analysis of the exon/intron structures showed that single intron loss and gain had occurred, yielding the diversified gene structures, during the process of structural evolution of NADK family genes. Finally, both available global microarray data analysis and qRT-PCR experiments revealed that the NADK genes in Arabidopsis and Oryza sativa show different expression patterns in different developmental stages and under several different abiotic/biotic stresses and hormone treatments, underscoring the functional diversity

  16. Genome-Wide Transcriptome and Proteome Analysis on Different Developmental Stages of Cordyceps militaris

    PubMed Central

    Yin, Yalin; Yu, Guojun; Chen, Yijie; Jiang, Shuai; Wang, Man; Jin, Yanxia; Lan, Xianqing; Liang, Yi; Sun, Hui

    2012-01-01

    Background Cordyceps militaris, an ascomycete caterpillar fungus, has been used as a traditional Chinese medicine for many years owing to its anticancer and immunomodulatory activities. Currently, artificial culturing of this beneficial fungus has been widely used and can meet the market, but systematic molecular studies on the developmental stages of cultured C. militaris at transcriptional and translational levels have not been determined. Methodology/Principal Findings We utilized high-throughput Illumina sequencing to obtain the transcriptomes of C. militaris mycelium and fruiting body. All clean reads were mapped to C. militaris genome and most of the reads showed perfect coverage. Alternative splicing and novel transcripts were predicted to enrich the database. Gene expression analysis revealed that 2,113 genes were up-regulated in mycelium and 599 in fruiting body. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis were performed to analyze the genes with expression differences. Moreover, the putative cordycepin metabolism difference between different developmental stages was studied. In addition, the proteome data of mycelium and fruiting body were obtained by one-dimensional gel electrophoresis (1-DGE) coupled with nano-electrospray ionization liquid chromatography tandem mass spectrometry (nESI-LC-MS/MS). 359 and 214 proteins were detected from mycelium and fruiting body respectively. GO, KEGG and Cluster of Orthologous Groups (COG) analysis were further conducted to better understand their difference. We analyzed the amounts of some noteworthy proteins in these two samples including lectin, superoxide dismutase, glycoside hydrolase and proteins involved in cordycepin metabolism, providing important information for further protein studies. Conclusions/Significance The results reveal the difference in gene expression between the mycelium and fruiting body of artificially cultivated C. militaris by transcriptome and proteome

  17. Genome-Wide Analysis and Functional Characterization of the Polyadenylation Site in Pigs Using RNAseq Data.

    PubMed

    Wang, Hongyang; Li, Rui; Zhou, Xiang; Xue, Liyao; Xu, Xuewen; Liu, Bang

    2016-11-04

    Polyadenylation, a critical step in the production of mature mRNA for translation in most eukaryotes, involves cleavage and poly(A) tail addition at the 3' end of mRNAs at the polyadenylation site (PAS). Sometimes, one gene can have more than one PAS, which can produce the alternative polyadenylation (APA) phenomenon and affect the stability, localization and translation of the mRNA. In this study, we discovered 28,363 PASs using pig RNAseq data, with 13,033 located in 7,403 genes. Among the genes, 41% were identified to have more than one PAS. PAS distribution analysis indicated that the PAS position was highly variable in genes. Additionally, the analysis of RNAseq data from the liver and testis showed a difference in their PAS number and usage. RT-PCR and qRT-PCR were performed to confirm our findings by detecting the expression of 3'UTR isoforms for five candidate genes. The analysis of RNAseq data under a different androstenone level and salmonella inoculation indicated that the functional usage of PAS might participate in the immune response and may be related to the androstenone level in pigs. This study provides new insights into pig PAS and facilitates further functional research of PAS.

  18. Genome-wide analysis of the WRKY gene family in physic nut (Jatropha curcas L.).

    PubMed

    Xiong, Wangdan; Xu, Xueqin; Zhang, Lin; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2013-07-25

    The WRKY proteins, which contain highly conserved WRKYGQK amino acid sequences and zinc-finger-like motifs, constitute a large family of transcription factors in plants. They participate in diverse physiological and developmental processes. WRKY genes have been identified and characterized in a number of plant species. We identified a total of 58 WRKY genes (JcWRKY) in the genome of the physic nut (Jatropha curcas L.). On the basis of their conserved WRKY domain sequences, all of the JcWRKY proteins could be assigned to one of the previously defined groups, I-III. Phylogenetic analysis of JcWRKY genes with Arabidopsis and rice WRKY genes, and separately with castor bean WRKY genes, revealed no evidence of recent gene duplication in the JcWRKY gene family. The transcript abundance of JcWRKY gene products was tested in different tissues under normal growth conditions. In addition, 47 WRKY genes responded to at least one abiotic stress (drought, salinity, phosphate starvation and nitrogen starvation) in individual tissues (leaf, root and/or shoot cortex). Our study provides a useful reference data set as the basis for cloning and functional analysis of physic nut WRKY genes.

  19. Genome-Wide Analysis of the NAC Gene Family in Physic Nut (Jatropha curcas L.).

    PubMed

    Wu, Zhenying; Xu, Xueqin; Xiong, Wangdan; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Wu, Guojiang; Jiang, Huawu

    2015-01-01

    The NAC proteins (NAM, ATAF1/2 and CUC2) are plant-specific transcriptional regulators that have a conserved NAM domain in the N-terminus. They are involved in various biological processes, including both biotic and abiotic stress responses. In the present study, a total of 100 NAC genes (JcNAC) were identified in physic nut (Jatropha curcas L.). Based on phylogenetic analysis and gene structures, 83 JcNAC genes were classified as members of, or proposed to be diverged from, 39 previously predicted orthologous groups (OGs) of NAC sequences. Physic nut has a single intron-containing NAC gene subfamily that has been lost in many plants. The JcNAC genes are non-randomly distributed across the 11 linkage groups of the physic nut genome, and appear to be preferentially retained duplicates that arose from both ancient and recent duplication events. Digital gene expression analysis indicates that some of the JcNAC genes have tissue-specific expression profiles (e.g. in leaves, roots, stem cortex or seeds), and 29 genes differentially respond to abiotic stresses (drought, salinity, phosphorus deficiency and nitrogen deficiency). Our results will be helpful for further functional analysis of the NAC genes in physic nut.

  20. Genome-Wide Analysis and Functional Characterization of the Polyadenylation Site in Pigs Using RNAseq Data

    PubMed Central

    Wang, Hongyang; Li, Rui; Zhou, Xiang; Xue, Liyao; Xu, Xuewen; Liu, Bang

    2016-01-01

    Polyadenylation, a critical step in the production of mature mRNA for translation in most eukaryotes, involves cleavage and poly(A) tail addition at the 3′ end of mRNAs at the polyadenylation site (PAS). Sometimes, one gene can have more than one PAS, which can produce the alternative polyadenylation (APA) phenomenon and affect the stability, localization and translation of the mRNA. In this study, we discovered 28,363 PASs using pig RNAseq data, with 13,033 located in 7,403 genes. Among the genes, 41% were identified to have more than one PAS. PAS distribution analysis indicated that the PAS position was highly variable in genes. Additionally, the analysis of RNAseq data from the liver and testis showed a difference in their PAS number and usage. RT-PCR and qRT-PCR were performed to confirm our findings by detecting the expression of 3′UTR isoforms for five candidate genes. The analysis of RNAseq data under different androstenone levels and Salmonella inoculation indicated that the functional usage of PAS might participate in the immune response and may be related to the androstenone level in pigs. This study provides new insights into pig PAS and facilitates further functional research of PAS. PMID:27812017

  1. Nonlinear Analysis of Time Series in Genome-Wide Linkage Disequilibrium Data

    NASA Astrophysics Data System (ADS)

    Hernández-Lemus, Enrique; Estrada-Gil, Jesús K.; Silva-Zolezzi, Irma; Fernández-López, J. Carlos; Hidalgo-Miranda, Alfredo; Jiménez-Sánchez, Gerardo

    2008-02-01

    The statistical study of large-scale genomic data has turned out to be a very important tool in population genetics. Quantitative methods are essential to understand and implement association studies in the biomedical and health sciences. Nevertheless, the characterization of recently admixed populations has been an elusive problem due to the presence of a number of complex phenomena. For example, linkage disequilibrium structures are thought to be more complex than their non-recently admixed population counterparts, presenting the so-called ancestry blocks, admixed regions that are not yet smoothed by the effect of genetic recombination. In order to distinguish characteristic features for various populations we have implemented several methods, some of them borrowed or adapted from the analysis of nonlinear time series in statistical physics and quantitative physiology. We calculate the main fractal dimensions (Kolmogorov's capacity, information dimension and correlation dimension, usually denoted D0, D1 and D2). We have also performed detrended fluctuation analysis and information-based similarity index calculations for the probability distribution of correlations of the linkage disequilibrium coefficient of six recently admixed (mestizo) populations within the Mexican Genome Diversity Project [1] and for the non-recently admixed populations in the International HapMap Project [2]. Nonlinear correlations showed up as a consequence of internal structure within the haplotype distributions. The analysis of these correlations as well as the scope and limitations of these procedures within the biomedical sciences are discussed.

  2. Genome-Wide Identification and Expression Analysis of the WRKY Gene Family in Cassava

    PubMed Central

    Wei, Yunxie; Shi, Haitao; Xia, Zhiqiang; Tie, Weiwei; Ding, Zehong; Yan, Yan; Wang, Wenquan; Hu, Wei; Li, Kaimian

    2016-01-01

    The WRKY family, a large family of transcription factors (TFs) found in higher plants, plays central roles in many aspects of physiological processes and adaption to environment. However, little information is available regarding the WRKY family in cassava (Manihot esculenta). In the present study, 85 WRKY genes were identified from the cassava genome and classified into three groups according to conserved WRKY domains and zinc-finger structure. Conserved motif analysis showed that all of the identified MeWRKYs had the conserved WRKY domain. Gene structure analysis suggested that the number of introns in MeWRKY genes varied from 1 to 5, with the majority of MeWRKY genes containing three exons. Expression profiles of MeWRKY genes in different tissues and in response to drought stress were analyzed using the RNA-seq technique. The results showed that 72 MeWRKY genes had differential expression in their transcript abundance and 78 MeWRKY genes were differentially expressed in response to drought stresses in different accessions, indicating their contribution to plant developmental processes and drought stress resistance in cassava. Finally, the expression of 9 WRKY genes was analyzed by qRT-PCR under osmotic, salt, ABA, H2O2, and cold treatments, indicating that MeWRKYs may be involved in different signaling pathways. Taken together, this systematic analysis identifies some tissue-specific and abiotic stress-responsive candidate MeWRKY genes for further functional assays in planta, and provides a solid foundation for understanding of abiotic stress responses and signal transduction mediated by WRKYs in cassava. PMID:26904033

  3. Genome-Wide Identification, Localization, and Expression Analysis of Proanthocyanidin-Associated Genes in Brassica

    PubMed Central

    Liu, Xianjun; Lu, Ying; Yan, Mingli; Sun, Donghong; Hu, Xuefang; Liu, Shuyan; Chen, Sheyuan; Guan, Chunyun; Liu, Zhongsong

    2016-01-01

    Proanthocyanidins (PA) are a prominent type of flavonoid compound deposited in seed coats that control pigmentation in all Brassica species. Annotation of Brassica juncea genome survey sequences showed 72 PA genes; however, a functional description of these genes, especially how their interactions regulate seed pigmentation, remains elusive. In the present study, we designed 19 primer pairs to screen a bacterial artificial chromosome (BAC) library of B. juncea. A total of 284 BAC clones were identified and sequenced. Alignment of the sequences confirmed that 55 genes were cloned, with every Arabidopsis PA gene having 2–7 homologs in B. juncea. BLAST analysis using the recently released B. rapa or B. napus genome database identified 31 and 58 homologous genes, respectively. Mapping and phylogenetic analysis indicated that 30 B. juncea PA genes are located in the A-genome chromosomes except A04, whereas the remaining 25 genes are mapped to the B-genome chromosomes except B05 and B07. RNA-seq data and Fragments Per Kilobase of a transcript per Million mapped reads (FPKM) analysis showed that most of the PA genes were expressed in the seed coat of B. juncea and B. napus, and that BjuTT3, BjuTT18, BjuANR, BjuTT4-2, BjuTT4-3, BjuTT19-1, and BjuTT19-3 are transcriptionally regulated, and not expressed or downregulated in yellow-seeded testa. Importantly, our study facilitates a better understanding of the molecular mechanism underlying Brassica PA profiles and accumulation, as well as further characterization of PA genes. PMID:28018375

  4. Genome-wide analysis of drought induced gene expression changes in flax (Linum usitatissimum)

    PubMed Central

    Dash, Prasanta K; Cao, Yongguo; Jailani, Abdul K; Gupta, Payal; Venglat, Prakash; Xiang, Daoquan; Rai, Rhitu; Sharma, Rinku; Thirunavukkarasu, Nepolean; Abdin, Malik Z; Yadava, Devendra K; Singh, Nagendra K; Singh, Jas; Selvaraj, Gopalan; Deyholos, Mike; Kumar, Polumetla Ananda; Datla, Raju

    2014-01-01

    A robust phenotypic plasticity to ward off adverse environmental conditions determines performance and productivity in crop plants. Flax (linseed) is an important cash crop produced for natural textile fiber (linen) or oilseed with many health promoting products. This crop is prone to drought stress and yield losses in many parts of the world. Despite recent advances in drought research in a number of important crops, related progress in flax is very limited. Since the response of this plant to drought stress has not been addressed at the molecular level, we conducted microarray analysis to capture the transcriptome associated with induced drought in flax. This study identified 183 differentially expressed genes (DEGs) associated with diverse cellular, biophysical and metabolic programs in flax. The analysis revealed, in particular, altered regulation of cellular and metabolic pathways governing photosynthesis. Additionally, comparative transcriptome analysis identified a plethora of genes that displayed differential regulation both spatially and temporally. These results revealed co-regulated expression of 26 genes in both shoot and root tissues with implications for drought stress response. Furthermore, the data also showed that more genes are upregulated in roots compared to shoots, suggesting that roots may play important and additional roles in response to drought in flax. With prolonged drought treatment, the number of DEGs increased in both tissue types. Differential expression of selected genes was confirmed by qRT-PCR, thus supporting the suggested functional association of these intrinsic genes in maintaining growth and homeostasis in response to imminent drought stress in flax. Together, the present study has developed foundational and new transcriptome data sets for drought stress in flax. PMID:25072186

  5. Comparative analysis of genome-wide Mlo gene family in Cajanus cajan and Phaseolus vulgaris.

    PubMed

    Deshmukh, Reena; Singh, V K; Singh, B D

    2016-04-01

    The Mlo gene was discovered in barley because the mutant 'mlo' allele conferred broad-spectrum, non-race-specific resistance to powdery mildew caused by Blumeria graminis f. sp. hordei. The Mlo genes also play important roles in growth and development of plants, and in responses to biotic and abiotic stresses. The Mlo gene family has been characterized in several crop species, but only a single legume species, soybean (Glycine max L.), has been investigated so far. The present report describes in silico identification of 18 CcMlo and 20 PvMlo genes in the important legume crops Cajanus cajan (L.) Millsp. and Phaseolus vulgaris L., respectively. In silico analysis of gene organization, protein properties and conserved domains revealed that the C. cajan and P. vulgaris Mlo gene paralogs are more divergent from each other than from their orthologous pairs. The comparative phylogenetic analysis classified CcMlo and PvMlo genes into three major clades. A comparative analysis of CcMlo and PvMlo proteins with the G. max Mlo proteins indicated close association of one CcMlo, one PvMlo with two GmMlo genes, indicating that there was no further expansion of the Mlo gene family after the separation of these species. Thus, most of the diploid species of eudicots might be expected to contain 15-20 Mlo genes. The genes CcMlo12 and 14, and PvMlo11 and 12 are predicted to participate in powdery mildew resistance. If this prediction were verified, these genes could be targeted by TILLING or CRISPR to isolate powdery mildew resistant mutants.

  6. Genome-wide analysis of the MADS-box gene family in Brassica rapa (Chinese cabbage).

    PubMed

    Duan, Weike; Song, Xiaoming; Liu, Tongkun; Huang, Zhinan; Ren, Jun; Hou, Xilin; Li, Ying

    2015-02-01

    The MADS-box gene family is an ancient and well-studied transcription factor family that functions in almost every developmental process in plants. There are a number of reports about the MADS-box family in different plant species, but systematic analysis of the MADS-box transcription factor family in Brassica rapa (Chinese cabbage) is still lacking. In this study, 160 MADS-box transcription factors were identified from the entire Chinese cabbage genome and compared with the MADS-box factors from 21 other representative plant species. A detailed list of MADS proteins from these 22 species was sorted. Phylogenetic analysis of the BrMADS genes, together with their Arabidopsis and rice counterparts, showed that the BrMADS genes were categorised into type I (Mα, Mβ, Mγ) and type II (MIKC(C), MIKC*) groups, and the MIKC(C) proteins were further divided into 13 subfamilies. The Chinese cabbage type II group has 95 members, which is twice as many as the Arabidopsis type II group, indicating that the Chinese cabbage type II genes have been retained more frequently than the type I genes. Finally, RNA-seq transcriptome data and quantitative real-time PCR analysis revealed that BrMADS genes are expressed in a tissue-specific manner similar to Arabidopsis. Interestingly, a number of BrMIKC genes showed responses to different abiotic stress treatments, suggesting a function for some of the genes in these processes as well. Taken together, the characterization of the B. rapa MADS-box family presented here will certainly help in the selection of appropriate candidate genes and further facilitate functional studies in Chinese cabbage.

  7. Genome-wide linkage analysis of the tracking of systolic blood pressure using a mixed model

    PubMed Central

    Wang, Tao; Zhu, Guohua; Keen, Kevin J

    2003-01-01

    Background Elevated blood pressure in middle age is a major risk factor for subsequent cardiovascular complications. An important longitudinal characteristic of blood pressure is the "tracking phenomenon". Tracking is defined as the persistence of the rank of a person's blood pressure level in a group over a long period of time. In this analysis, we used the Framingham data to investigate whether there are some genes responsible for this phenomenon. Results Both two-point and multipoint linkage analyses were applied to family members with complete data only and to all family data with missing values imputed by a Gaussian model. The results of two-point linkage analysis indicated that two loci for linkage with the intercept were on chromosomes 10 and 13, and two loci for linkage with both slope and intercept were on chromosomes 1 and 3. Multipoint linkage analysis indicated only one region, 200–240 cM on chromosome 1, to be linked with both intercept and slope. For the intercept of SBP, the highest LOD (4.43) was found at 214 cM when missing data were imputed, and the highest LOD (2.81) was at 231 cM for the complete case data. For the slope of SBP, the highest multipoint LODs were 3.63 at 227 cM and 2.02 at 234 cM for the complete case data and imputation data, respectively. Conclusion One or more genes in the range of 200–240 cM on chromosome 1 may be related to the tracking phenomenon of SBP. PMID:14975156

  8. Genome-wide identification and analysis of MAPK and MAPKK gene families in Brachypodium distachyon.

    PubMed

    Chen, Lihong; Hu, Wei; Tan, Shenglong; Wang, Min; Ma, Zhanbing; Zhou, Shiyi; Deng, Xiaomin; Zhang, Yang; Huang, Chao; Yang, Guangxiao; He, Guangyuan

    2012-01-01

    MAPK cascades are universal signal transduction modules and play important roles in plant growth, development and in response to a variety of biotic and abiotic stresses. Although MAPKs and MAPKKs have been systematically investigated in several plant species including Arabidopsis, rice and poplar, no systematic analysis has been conducted in the emerging monocot model plant Brachypodium distachyon. In the present study, a total of 16 MAPK genes and 12 MAPKK genes were identified from B. distachyon. An analysis of the genomic evolution showed that both tandem and segment duplications contributed significantly to the expansion of MAPK and MAPKK families. Evolutionary relationships within subfamilies were supported by exon-intron organizations and the architectures of conserved protein motifs. Synteny analysis between B. distachyon and the other two plant species of rice and Arabidopsis showed that only one homolog of B. distachyon MAPKs was found in the corresponding syntenic blocks of Arabidopsis, while 13 homologs of B. distachyon MAPKs and MAPKKs were found in that of rice, which was consistent with the speciation process of the three species. In addition, several interactive protein pairs between the two families in B. distachyon were found through yeast two hybrid assay, whereas their orthologs of a pair in Arabidopsis and other plant species were not found to interact with each other. Finally, expression studies of closely related family members among B. distachyon, Arabidopsis and rice showed that even recently duplicated representatives may fulfill different functions and be involved in different signal pathways. Taken together, our data would provide a foundation for evolutionary and functional characterization of MAPK and MAPKK gene families in B. distachyon and other plant species to unravel their biological roles.

  9. Genome-Wide Identification, Localization, and Expression Analysis of Proanthocyanidin-Associated Genes in Brassica.

    PubMed

    Liu, Xianjun; Lu, Ying; Yan, Mingli; Sun, Donghong; Hu, Xuefang; Liu, Shuyan; Chen, Sheyuan; Guan, Chunyun; Liu, Zhongsong

    2016-01-01

    Proanthocyanidins (PA) are a prominent type of flavonoid compound deposited in seed coats that control pigmentation in all Brassica species. Annotation of Brassica juncea genome survey sequences showed 72 PA genes; however, a functional description of these genes, especially how their interactions regulate seed pigmentation, remains elusive. In the present study, we designed 19 primer pairs to screen a bacterial artificial chromosome (BAC) library of B. juncea. A total of 284 BAC clones were identified and sequenced. Alignment of the sequences confirmed that 55 genes were cloned, with every Arabidopsis PA gene having 2-7 homologs in B. juncea. BLAST analysis using the recently released B. rapa or B. napus genome database identified 31 and 58 homologous genes, respectively. Mapping and phylogenetic analysis indicated that 30 B. juncea PA genes are located in the A-genome chromosomes except A04, whereas the remaining 25 genes are mapped to the B-genome chromosomes except B05 and B07. RNA-seq data and Fragments Per Kilobase of a transcript per Million mapped reads (FPKM) analysis showed that most of the PA genes were expressed in the seed coat of B. juncea and B. napus, and that BjuTT3, BjuTT18, BjuANR, BjuTT4-2, BjuTT4-3, BjuTT19-1, and BjuTT19-3 are transcriptionally regulated, and not expressed or downregulated in yellow-seeded testa. Importantly, our study facilitates a better understanding of the molecular mechanism underlying Brassica PA profiles and accumulation, as well as further characterization of PA genes.

  10. Genome-wide expression for diagnosis of pulmonary tuberculosis: a multicohort analysis

    PubMed Central

    Sweeney, Timothy E; Braviak, Lindsay; Tato, Cristina M; Khatri, Purvesh

    2016-01-01

    Summary Background Active pulmonary tuberculosis is difficult to diagnose and treatment response is difficult to effectively monitor. A WHO consensus statement has called for new non-sputum diagnostics. The aim of this study was to use an integrated multicohort analysis of samples from publicly available datasets to derive a diagnostic gene set in the peripheral blood of patients with active tuberculosis. Methods We searched two public gene expression microarray repositories and retained datasets that examined clinical cohorts of active pulmonary tuberculosis infection in whole blood. We compared gene expression in patients with either latent tuberculosis or other diseases versus patients with active tuberculosis using our validated multicohort analysis framework. Three datasets were used as discovery datasets and meta-analytical methods were used to assess gene effects in these cohorts. We then validated the diagnostic capacity of the three-gene set in the remaining 11 datasets. Findings A total of 14 datasets containing 2572 samples from 10 countries from both adult and paediatric patients were included in the analysis. Of these, three datasets (N=1023) were used to discover a set of three genes (GBP5, DUSP3, and KLF2) that are highly diagnostic for active tuberculosis. We validated the diagnostic power of the three-gene set to separate active tuberculosis from healthy controls (global area under the ROC curve (AUC) 0·90 [95% CI 0·85–0·95]), latent tuberculosis (0·88 [0·84–0·92]), and other diseases (0·84 [0·80–0·95]) in eight independent datasets composed of both children and adults from ten countries. Expression of the three-gene set was not confounded by HIV infection status, bacterial drug resistance, or BCG vaccination. Furthermore, in four additional cohorts, we showed that the tuberculosis score declined during treatment of patients with active tuberculosis. Interpretation Overall, our integrated multicohort analysis yielded a three

  11. Genome wide expression analysis in HPV16 Cervical Cancer: identification of altered metabolic pathways

    PubMed Central

    Pérez-Plasencia, Carlos; Vázquez-Ortiz, Guelaguetza; López-Romero, Ricardo; Piña-Sanchez, Patricia; Moreno, José; Salcedo, Mauricio

    2007-01-01

    Background Cervical carcinoma (CC) is a leading cause of death among women worldwide. Human papilloma virus (HPV) is a major etiological factor in CC and HPV 16 is the most frequent viral type present. Our aim was to characterize metabolic pathways altered in HPV 16 tumor samples by means of transcriptome-wide analysis and bioinformatics tools for visualizing expression data in the context of KEGG biological pathways. Results We found 2,067 genes significantly up- or down-modulated (at least 2-fold) in tumor clinical samples compared to normal tissues, representing ~3.7% of analyzed genes. Cervical carcinoma was associated with an important up-regulation of the Wnt signaling pathway, which was validated by in situ hybridization in clinical samples. Other up-regulated pathways were those of calcium signaling and MAPK signaling, as well as cell cycle-related genes. There was down-regulation of focal adhesion and TGF-β signaling, among other metabolic pathways. Conclusion This analysis of the HPV 16 tumor transcriptome could be useful for the identification of genes and molecular pathways involved in the pathogenesis of cervical carcinoma. Understanding the possible role of these proteins in the pathogenesis of CC deserves further study. PMID:17822553

  12. Genome-wide transcriptomic analysis of the sporophyte of the moss Physcomitrella patens.

    PubMed

    O'Donoghue, Martin-Timothy; Chater, Caspar; Wallace, Simon; Gray, Julie E; Beerling, David J; Fleming, Andrew J

    2013-09-01

    Bryophytes, the most basal of the extant land plants, diverged at least 450 million years ago. A major feature of these plants is the biphasic alternation of generations between a dominant haploid gametophyte and a minor diploid sporophyte phase. These dramatic differences in form and function occur in a constant genetic background, raising the question of whether the switch from gametophyte-to-sporophyte development reflects major changes in the spectrum of genes being expressed or alternatively whether only limited changes in gene expression occur and the differences in plant form are due to differences in how the gene products are put together. This study performed replicated microarray analyses of RNA from several thousand dissected and developmentally staged sporophytes of the moss Physcomitrella patens, allowing analysis of the transcriptomes of the sporophyte and early gametophyte, as well as the early stages of moss sporophyte development. The data indicate that more significant changes in transcript profile occur during the switch from gametophyte to sporophyte than recently reported, with over 12% of the entire transcriptome of P. patens being altered during this major developmental transition. Analysis of the types of genes contributing to these differences supports the view of the early sporophyte being energetically and nutritionally dependent on the gametophyte, provides a profile of homologues to genes involved in angiosperm stomatal development and physiology which suggests a deeply conserved mechanism of stomatal control, and identifies a novel series of transcription factors associated with moss sporophyte development.

  13. Genome-wide identification of Wig-1 mRNA targets by RIP-Seq analysis

    PubMed Central

    Bersani, Cinzia; Huss, Mikael; Giacomello, Stefania; Xu, Li-Di; Bianchi, Julie; Eriksson, Sofi; Jerhammar, Fredrik; Alexeyenko, Andrey; Vilborg, Anna; Lundeberg, Joakim; Lui, Weng-Onn; Wiman, Klas G.

    2016-01-01

    RNA-binding proteins (RBPs) play important roles in the regulation of gene expression through a variety of post-transcriptional mechanisms. The p53-induced RBP Wig-1 (Zmat3) binds RNA through its zinc finger domains and enhances stability of p53 and N-Myc mRNAs and decreases stability of FAS mRNA. To identify novel Wig-1-bound RNAs, we performed RNA-immunoprecipitation followed by high-throughput sequencing (RIP-Seq) in HCT116 and Saos-2 cells. We identified 286 Wig-1-bound mRNAs common between the two cell lines. Sequence analysis revealed that AU-rich elements (AREs) are highly enriched in the 3′UTR of these Wig-1-bound mRNAs. Network enrichment analysis showed that Wig-1 preferentially binds mRNAs involved in cell cycle regulation. Moreover, we identified a 2D Wig-1 binding motif in HIF1A mRNA. Our findings confirm that Wig-1 is an ARE-BP that regulates cell cycle-related processes and provide a novel view of how Wig-1 may bind mRNA through a putative structural motif. We also significantly extend the repertoire of Wig-1 target mRNAs. Since Wig-1 is a transcriptional target of the tumor suppressor p53, these results have implications for our understanding of p53-dependent stress responses and tumor suppression. PMID:26672765

  14. Supercomputing enabling exhaustive statistical analysis of genome wide association study data: Preliminary results.

    PubMed

    Reumann, Matthias; Makalic, Enes; Goudey, Benjamin W; Inouye, Michael; Bickerstaffe, Adrian; Bui, Minh; Park, Daniel J; Kapuscinski, Miroslaw K; Schmidt, Daniel F; Zhou, Zeyu; Qian, Guoqi; Zobel, Justin; Wagner, John; Hopper, John L

    2012-01-01

    Most published GWAS do not examine SNP interactions due to the high computational complexity of computing p-values for the interaction terms. Our aim is to utilize supercomputing resources to apply complex statistical techniques to the world's accumulating GWAS, epidemiology, survival and pathology data to uncover more information about genetic and environmental risk, biology and aetiology. We performed the Bayesian Posterior Probability test on a pseudo data set with 500,000 single nucleotide polymorphisms and 100 samples as proof of principle. We carried out strong scaling simulations on 2 to 4,096 processing cores with factor 2 increments in partition size. On two processing cores, the run time is 317 h, i.e. almost two weeks, compared to less than 10 minutes on 4,096 processing cores. The speedup factor is 2,020, which is very close to the theoretical value of 2,048. This work demonstrates the feasibility of performing exhaustive higher order analysis of GWAS studies using independence testing for contingency tables. We are now in a position to employ supercomputers with hundreds of thousands of threads for higher order analysis of GWAS data using complex statistics.
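
    The scaling figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is not the authors' code. It assumes that the speedup is measured relative to the 2-core baseline (so the ideal value is 4,096 / 2 = 2,048) and that parallel efficiency is defined as measured speedup divided by ideal speedup. All function and constant names are hypothetical.

```python
# Worked check of the strong-scaling figures quoted in the abstract.
# From the abstract: 317 h on 2 cores; reported speedup of 2,020 on 4,096 cores.
# Assumptions (not from the source): speedup is relative to the 2-core run,
# and efficiency = measured speedup / ideal speedup. Names are hypothetical.

BASE_CORES = 2
MAX_CORES = 4096
BASE_RUNTIME_H = 317.0
REPORTED_SPEEDUP = 2020.0


def ideal_speedup(base_cores: int, cores: int) -> float:
    """Ideal (linear) speedup when scaling from base_cores to cores."""
    return cores / base_cores


def parallel_efficiency(speedup: float, base_cores: int, cores: int) -> float:
    """Fraction of the ideal speedup that was actually achieved."""
    return speedup / ideal_speedup(base_cores, cores)


def implied_runtime_minutes(base_runtime_h: float, speedup: float) -> float:
    """Run time on the larger partition implied by the baseline and speedup."""
    return base_runtime_h * 60.0 / speedup


if __name__ == "__main__":
    ideal = ideal_speedup(BASE_CORES, MAX_CORES)
    eff = parallel_efficiency(REPORTED_SPEEDUP, BASE_CORES, MAX_CORES)
    minutes = implied_runtime_minutes(BASE_RUNTIME_H, REPORTED_SPEEDUP)
    print(f"ideal speedup:      {ideal:.0f}")
    print(f"efficiency:         {eff:.1%}")
    print(f"implied run time:   {minutes:.1f} min")
```

    Under these assumptions, the reported speedup corresponds to a parallel efficiency of roughly 98.6%, and the implied run time on 4,096 cores is consistent with the "less than 10 minutes" stated in the abstract.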

  15. Genome-Wide Identification, Evolution and Expression Analysis of mTERF Gene Family in Maize

    PubMed Central

    Zhao, Yanxin; Cai, Manjun; Zhang, Xiaobo; Li, Yurong; Zhang, Jianhua; Zhao, Hailiang; Kong, Fei; Zheng, Yonglian; Qiu, Fazhan

    2014-01-01

    Plant mitochondrial transcription termination factor (mTERF) genes comprise a large family with important roles in regulating organelle gene expression. In this study, a comprehensive database search yielded 31 potential mTERF genes in maize (Zea mays L.) and most of them were targeted to mitochondria or chloroplasts. Maize mTERFs were divided into nine main groups based on phylogenetic analysis, and group IX represented the mitochondria and species-specific clade that diverged from other groups. Tandem and segmental duplication both contributed to the expansion of the mTERF gene family in the maize genome. Comprehensive expression analysis of these genes, using microarray data and RNA-seq data, revealed that these genes exhibit a variety of expression patterns. Environmental stimulus experiments revealed differential up- or down-regulation of maize mTERF genes in seedlings exposed to light/dark, salts and plant hormones, respectively, suggesting various important roles of maize mTERF genes in light acclimation and stress-related responses. These results will be useful for elucidating the roles of mTERF genes in the growth, development and stress response of maize. PMID:24718683

  16. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    PubMed

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeat (PPR) genes constitute one of the largest gene families in plants and play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also performed phylogenetic, subcellular localization, expression and GO analyses of the identified gene sequences. Our results demonstrate that the tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. The PLS subfamily can be divided into four subclasses, i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found to be highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  17. Genome-Wide Identification, Characterization and Expression Analysis of the Chalcone Synthase Family in Maize

    PubMed Central

    Han, Yahui; Ding, Ting; Su, Bo; Jiang, Haiyang

    2016-01-01

    Members of the chalcone synthase (CHS) family participate in the synthesis of a series of secondary metabolites in plants, fungi and bacteria. The metabolites play important roles in protecting land plants against various environmental stresses during the evolutionary process. We conducted a comprehensive investigation of CHS genes in maize (Zea mays L.), including their phylogenetic relationships, gene structures, chromosomal locations and expression patterns. Fourteen CHS genes (ZmCHS01–14) were identified in the genome of maize, representing one of the largest numbers of CHS family members identified in one organism to date. The gene family was classified into four major classes (classes I–IV) based on their phylogenetic relationships. Most of them contained two exons and one intron. The 14 genes were unevenly located on six chromosomes. Two segmental duplication events were identified, which might contribute to the expansion of the maize CHS gene family to some extent. In addition, quantitative real-time PCR and microarray data analyses suggested that ZmCHS genes exhibited various expression patterns, indicating functional diversification of the ZmCHS genes. Our results will contribute to future studies of the complexity of the CHS gene family in maize and provide valuable information for the systematic analysis of the functions of the CHS gene family. PMID:26828478

  18. Genome-wide transcriptomic analysis of the sporophyte of the moss Physcomitrella patens

    PubMed Central

    O’Donoghue, Martin-Timothy; Chater, Caspar; Wallace, Simon; Gray, Julie E.; Beerling, David J.; Fleming, Andrew J.

    2013-01-01

    Bryophytes, the most basal of the extant land plants, diverged at least 450 million years ago. A major feature of these plants is the biphasic alternation of generations between a dominant haploid gametophyte and a minor diploid sporophyte phase. These dramatic differences in form and function occur in a constant genetic background, raising the question of whether the switch from gametophyte-to-sporophyte development reflects major changes in the spectrum of genes being expressed or alternatively whether only limited changes in gene expression occur and the differences in plant form are due to differences in how the gene products are put together. This study performed replicated microarray analyses of RNA from several thousand dissected and developmentally staged sporophytes of the moss Physcomitrella patens, allowing analysis of the transcriptomes of the sporophyte and early gametophyte, as well as the early stages of moss sporophyte development. The data indicate that more significant changes in transcript profile occur during the switch from gametophyte to sporophyte than recently reported, with over 12% of the entire transcriptome of P. patens being altered during this major developmental transition. Analysis of the types of genes contributing to these differences supports the view of the early sporophyte being energetically and nutritionally dependent on the gametophyte, provides a profile of homologues to genes involved in angiosperm stomatal development and physiology which suggests a deeply conserved mechanism of stomatal control, and identifies a novel series of transcription factors associated with moss sporophyte development. PMID:23888066

  19. Genome-Wide Analysis of Respiratory Burst Oxidase Homologs in Grape (Vitis vinifera L.)

    PubMed Central

    Cheng, Chenxia; Xu, Xiaozhao; Gao, Min; Li, Jun; Guo, Chunlei; Song, Junyang; Wang, Xiping

    2013-01-01

    Plant respiratory burst oxidase homolog (rboh) genes appear to play crucial roles in plant development, defense reactions and hormone signaling. In this study, a total of seven rboh genes from grape were identified and characterized. Genomic structure and predicted protein sequence analysis indicated that the sequences of plant rboh genes are highly conserved. Synteny analysis demonstrated that several Vvrboh genes were found in corresponding syntenic blocks of Arabidopsis, suggesting that these genes arose before the divergence of the respective lineages. The expression pattern of Vvrboh genes in different tissues was assessed by qRT-PCR and two were constitutively expressed in all tissues tested. The expression profiles were similarly analyzed following exposure to various stresses and hormone treatments. It was shown that the expression levels of VvrbohA, VvrbohB and VvrbohC1 were significantly increased by salt and drought treatments. VvrbohB, VvrbohC2, and VvrbohD exhibited a dramatic up-regulation after powdery mildew (Uncinula necator (Schw.) Burr.) inoculation, while VvrbohH was down-regulated. Finally, salicylic acid treatment strongly stimulated the expression of VvrbohD and VvrbohH, while abscisic acid treatment induced the expression of VvrbohB and VvrbohH. These results demonstrate that the expression patterns of grape rboh genes exhibit diverse and complex stress-response expression signatures. PMID:24351809

  20. Genome-wide analysis of caesium and strontium accumulation in Saccharomyces cerevisiae.

    PubMed

    Heuck, Sabine; Gerstmann, Udo C; Michalke, Bernhard; Kanter, Ulrike

    2010-10-01

    (137)Cs and (90)Sr contribute to significant and long-lasting contamination of the environment with radionuclides. Due to their relatively high biological availability, they are transferred rapidly into biotic systems and may enter the food chain. In this study, we analysed 4862 haploid yeast knockout strains of Saccharomyces cerevisiae to identify genes involved in caesium (Cs(+)) and/or strontium (Sr(2+)) accumulation. According to this analysis, 212 mutant strains were associated with reproducibly altered Cs(+) and/or Sr(2+) accumulation. These mutants were deficient for a wide range of cellular processes. Among those, the vacuolar function and biogenesis turned out to be crucial for both Cs(+) and Sr(2+) accumulation. Disruption of the vacuole diminished Cs(+) accumulation, whereas Sr(2+) enrichment was enhanced. Further analyses with a subset of the identified candidates were undertaken, comparing the accumulation of Cs(+) and Sr(2+) with their essential counterparts potassium (K(+)) and calcium (Ca(2+)). Sr(2+) and Ca(2+) accumulation was highly correlated in yeast, excluding the possibility of differential regulation or uptake mechanisms. In direct contrast, the respective results suggest that Cs(+) uptake is at least partially dependent on mechanisms distinct from K(+) uptake. Single candidates (e.g. KHA1) are presented which might be specifically responsible for Cs(+) homeostasis.

  1. Genome-wide analysis of WOX gene family in rice, sorghum, maize, Arabidopsis and poplar.

    PubMed

    Zhang, Xin; Zong, Jie; Liu, Jianhua; Yin, Jinyuan; Zhang, Dabing

    2010-11-01

    WUSCHEL-related homeobox (WOX) genes form a large gene family specifically expressed in plants. They are known to play important roles in regulating the development of plant tissues and organs by determining cell fate. Recently available whole genome sequences allow us to do more comprehensive phylogenetic analysis of the WOX genes in plants. In the present study, we identified 11 and 21 WOXs from sorghum (Sorghum bicolor) and maize (Zea mays), respectively. The 72 WOX genes from rice (Oryza sativa), sorghum, maize, Arabidopsis (Arabidopsis thaliana) and poplar (Populus trichocarpa) were grouped into three well supported clades with nine subgroups according to the amino acid sequences of their homeodomains. Their phylogenetic relationship was also supported by the observation of the motifs outside the homeodomain. We observed the variation of duplication events among the nine subgroups between monocots and eudicots, for instance, more gene duplication events of WOXs within subgroup A for monocots, but fewer for dicots in this subgroup. Furthermore, we observed the conserved intron/exon structural patterns of WOX genes in rice, sorghum and Arabidopsis. In addition, WUS (Wuschel)-box and EAR (the ERF-associated amphiphilic repression)-like motif were observed to be conserved among several WOX subgroups in these five plants. Comparative analysis of expression patterns of WOX genes in rice and Arabidopsis suggests that the WOX genes play conserved and various roles in plants. This work provides insights into the evolution of the WOX gene family and is useful for future research.

  2. Genome-wide network analysis of Wnt signaling in three pediatric cancers

    NASA Astrophysics Data System (ADS)

    Bao, Ju; Lee, Ho-Jin; Zheng, Jie J.

    2013-10-01

    Genomic structural alteration is common in pediatric cancers, and analysis of data generated by the Pediatric Cancer Genome Project reveals such tumor-related alterations in many Wnt signaling-associated genes. Most pediatric cancers are thought to arise within developing tissues that undergo substantial expansion during early organ formation, growth and maturation, and Wnt signaling plays an important role in this development. We examined three pediatric tumors (medulloblastoma, early T-cell precursor acute lymphoblastic leukemia, and retinoblastoma) that show multiple genomic structural variations within Wnt signaling pathways. We mathematically modeled this pathway to investigate the effects of cancer-related structural variations on Wnt signaling. Surprisingly, we found that an outcome measure of canonical Wnt signaling was consistently similar in matched cancer cells and normal cells, even in the context of different cancers, different mutations, and different Wnt-related genes. Our results suggest that the cancer cells maintain a normal level of Wnt signaling by developing multiple mutations.

  3. Genome-wide network analysis of Wnt signaling in three pediatric cancers.

    PubMed

    Bao, Ju; Lee, Ho-Jin; Zheng, Jie J

    2013-10-17

    Genomic structural alteration is common in pediatric cancers, and analysis of data generated by the Pediatric Cancer Genome Project reveals such tumor-related alterations in many Wnt signaling-associated genes. Most pediatric cancers are thought to arise within developing tissues that undergo substantial expansion during early organ formation, growth and maturation, and Wnt signaling plays an important role in this development. We examined three pediatric tumors (medulloblastoma, early T-cell precursor acute lymphoblastic leukemia, and retinoblastoma) that show multiple genomic structural variations within Wnt signaling pathways. We mathematically modeled this pathway to investigate the effects of cancer-related structural variations on Wnt signaling. Surprisingly, we found that an outcome measure of canonical Wnt signaling was consistently similar in matched cancer cells and normal cells, even in the context of different cancers, different mutations, and different Wnt-related genes. Our results suggest that the cancer cells maintain a normal level of Wnt signaling by developing multiple mutations.

  4. Genome-wide analysis of the GRAS gene family in Chinese cabbage (Brassica rapa ssp. pekinensis).

    PubMed

    Song, Xiao-Ming; Liu, Tong-Kun; Duan, Wei-Ke; Ma, Qing-Hua; Ren, Jun; Wang, Zhen; Li, Ying; Hou, Xi-Lin

    2014-01-01

    The GRAS gene family is one of the most important families of transcriptional regulators. In this study, 48 GRAS genes are identified from Chinese cabbage, and they are classified into eight groups according to the classification of Arabidopsis. The characterization, classification, gene structure and phylogenetic construction of GRAS proteins are performed. Distribution mapping shows that GRAS proteins are nonrandomly localized in 10 chromosomes. Fifty-five orthologous gene pairs are shared by Chinese cabbage and Arabidopsis, and interaction networks of these orthologous genes are constructed. The expansion of GRAS genes in Chinese cabbage results from genome triplication. Among the 17 species examined, 14 higher plants carry the GRAS genes, whereas two lower plants and one fungi species do not. Furthermore, the expression patterns of GRAS genes exhibit differences in three tissues based on RNA-seq data. Taken together, this comprehensive analysis will provide rich resources for studying GRAS protein functions in Chinese cabbage.

  5. Genome-wide identification and expression analysis of the metacaspase gene family in Hevea brasiliensis.

    PubMed

    Liu, Hui; Deng, Zhi; Chen, Jiangshu; Wang, Sen; Hao, Lili; Li, Dejun

    2016-08-01

    Metacaspases, a family of cysteine proteases, have been suggested to play important roles in programmed cell death (PCD) during plant development and stress responses. To date, no systematic characterization of this gene family has been reported in rubber tree (Hevea brasiliensis). In the present study, nine metacaspase genes, designated as HbMC1 to HbMC9, were identified from whole-genome sequence of rubber tree. Multiple sequence alignment and phylogenetic analyses suggested that these genes were divided into two types: type I (HbMC1-HbMC7) and type II (HbMC8 and HbMC9). Gene structure analysis demonstrated that type I and type II HbMCs contained four and two introns, respectively, indicating the conserved exon-intron organization of HbMCs. Quantitative real-time PCR analysis revealed that HbMCs showed distinct expression patterns in different tissues, suggesting the functional diversity of HbMCs in various tissues during development. Most of the HbMCs were regulated by drought, cold, and salt stress, implying their possible functions in regulating abiotic stress-induced cell death. Of the nine HbMCs, HbMC1, HbMC2, HbMC5, and HbMC8 displayed a significantly higher relative transcript accumulation in barks of tapping panel dryness (TPD) trees compared with healthy trees. In addition, the four genes were up-regulated by ethephon (ET) and methyl jasmonate (MeJA), indicating their potential involvement in TPD resulting from ET- or JA-induced PCD. In summary, this work provides valuable information for further functional characterization of HbMC genes in rubber tree.

  6. Genome-wide analysis of Pax8 binding provides new insights into thyroid functions

    PubMed Central

    2012-01-01

    Background: The transcription factor Pax8 is essential for the differentiation of thyroid cells. However, there are few data on genes transcriptionally regulated by Pax8 other than thyroid-related genes. To better understand the role of Pax8 in the biology of thyroid cells, we obtained transcriptional profiles of Pax8-silenced PCCl3 thyroid cells using whole genome expression arrays and integrated these signals with global cis-regulatory sequencing studies performed by ChIP-Seq analysis. Results: Exhaustive analysis of Pax8 immunoprecipitated peaks demonstrated preferential binding to intragenic regions and CpG-enriched islands, which suggests a role of Pax8 in transcriptional regulation of orphan CpG regions. In addition, ChIP-Seq allowed us to identify Pax8 partners, including proteins involved in tertiary DNA structure (CTCF) and chromatin remodeling (Sp1), and these direct transcriptional interactions were confirmed in vivo. Moreover, both factors modulate Pax8-dependent transcriptional activation of the sodium iodide symporter (Nis) gene promoter. We ultimately combined putative and novel Pax8 binding sites with actual target gene expression regulation to define Pax8-dependent genes. Functional classification suggests that Pax8-regulated genes may be directly involved in important processes of thyroid cell function such as cell proliferation and differentiation, apoptosis, cell polarity, motion and adhesion, and a plethora of DNA/protein-related processes. Conclusion: Our study provides novel insights into the role of Pax8 in thyroid biology, exerted through transcriptional regulation of important genes involved in critical thyrocyte processes. In addition, we found new transcriptional partners of Pax8, which functionally cooperate with Pax8 in the regulation of thyroid gene transcription. Besides, our data demonstrate preferential location of Pax8 in non-promoter CpG regions. These data point to an orphan CpG island-mediated mechanism that represents a novel role

  7. Genome-wide analysis of shoot growth-associated alternative splicing in moso bamboo.

    PubMed

    Li, Long; Hu, Tao; Li, Xueping; Mu, Shaohua; Cheng, Zhanchao; Ge, Wei; Gao, Jian

    2016-08-01

    Alternative splicing (AS) significantly enhances transcriptome complexity and is differentially regulated in a wide variety of physiological processes in plants, including shoot growth. Presently, the functional implications and conservation of AS occurrences are not well understood in the moso bamboo genome. To analyze the global changes in AS during moso bamboo shoot growth, fast-growing shoots collected at seven different heights and culms after leaf expansion were sequenced using the Illumina HiSeq™ 2000 sequencing platform. It was found that approximately 60.74% of all genes were alternatively spliced, with intron retention (IR) being the most frequent AS event (27.43%). Statistical analysis demonstrated that variations of AS frequency and AS types were significantly correlated with changes in gene features and gene transcriptional level. According to the phylogenetic analysis of isoform expression data and AS frequency, the bamboo shoot growth could be divided into four different growth periods, including winter bamboo shoot (S1), early growth period (S2-S5), late growth period (S6 and S7), and mature period (CK). In addition, our data also showed that the winter bamboo shoot had the highest number of AS events. Twenty-six putative serine/arginine-rich (SR) proteins were identified, producing a total of 109 transcripts. AS events were frequently and specifically regulated by SR splicing factors throughout shoot growth, resulting in changes to the original open reading frame (ORF) and subsequently changes to conserved domains. The AS product isoforms showed regular expression change during the whole shoot growth period, thus influencing shoot growth. Altogether, these data indicate that AS events are adjusted to different growth stages, providing a concise and efficient means of gene regulation. This study will provide a very useful clue for future functional analyses.

  8. Genome-wide analysis of histone modifiers in tomato: gaining an insight into their developmental roles

    PubMed Central

    2013-01-01

    Background: Histone post-translational modifications (HPTMs) including acetylation and methylation have been recognized as playing a crucial role in epigenetic regulation of plant growth and development. Although Solanum lycopersicum is a dicot model plant as well as an important crop, systematic analysis and expression profiling of histone modifier genes (HMs) in tomato are sketchy. Results: Based on recently released tomato whole-genome sequences, we identified in silico 32 histone acetyltransferases (HATs), 15 histone deacetylases (HDACs), 52 histone methyltransferases (HMTs) and 26 histone demethylases (HDMs), and compared them with those detected in Arabidopsis (Arabidopsis thaliana), maize (Zea mays) and rice (Oryza sativa) orthologs. Comprehensive analysis of the protein domain architecture and phylogeny revealed the presence of non-canonical motifs and new domain combinations, thereby suggesting for HATs the existence of a new family in plants. Due to species-specific diversification during evolutionary history tomato has fewer HMs than Arabidopsis. The transcription profiles of HMs within tomato organs revealed a broad functional role for some HMs and a more specific activity for others, suggesting key HM regulators in tomato development. Finally, we explored S. pennellii introgression lines (ILs) and integrated the map position of HMs, their expression profiles and the phenotype of ILs. We thereby proved that the strategy was useful to identify HM candidates involved in carotenoid biosynthesis in tomato fruits. Conclusions: In this study, we reveal the structure, phylogeny and spatial expression of members belonging to the classical families of HMs in tomato. We provide a framework for gene discovery and functional investigation of HMs in other Solanaceae species. PMID:23356725

  9. Genome-wide analysis of eukaryote thaumatin-like proteins (TLPs) with an emphasis on poplar

    PubMed Central

    2011-01-01

    Background: Plant inducible immunity includes the accumulation of a set of defense proteins during infection called pathogenesis-related (PR) proteins, which are grouped into families termed PR-1 to PR-17. The PR-5 family is composed of thaumatin-like proteins (TLPs), which are responsive to biotic and abiotic stress and are widely studied in plants. TLPs were also recently discovered in fungi and animals. In the poplar genome, TLPs are over-represented compared with annual species and their transcripts strongly accumulate during stress conditions. Results: Our analysis of the poplar TLP family suggests that the expansion of this gene family was followed by diversification, as differences in expression patterns and predicted properties correlate with phylogeny. In particular, we identified a clade of poplar TLPs that cluster to a single 350 kb locus of chromosome I and that are up-regulated by poplar leaf rust infection. A wider phylogenetic analysis of eukaryote TLPs - including plant, animal and fungi sequences - shows that TLP gene content and diversity increased markedly during land plant evolution. Mapping the reported functions of characterized TLPs to the eukaryote phylogenetic tree showed that antifungal or glycan-lytic properties are widespread across eukaryote phylogeny, suggesting that these properties are shared by most TLPs and are likely associated with the presence of a conserved acidic cleft in their 3D structure. Also, we established an exhaustive catalog of TLPs with atypical architectures such as small-TLPs, TLP-kinases and small-TLP-kinases, which have potentially developed alternative functions (such as putative receptor kinases for pathogen sensing and signaling). Conclusion: Our study, based on the most recent plant genome sequences, provides evidence for TLP gene family diversification during land plant evolution. We have shown that the diverse functions described for TLPs are not restricted to specific clades but seem to be universal among

  10. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution.

    PubMed

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-06

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening.

  11. Multiple-trait genome-wide association study based on principal component analysis for residual covariance matrix.

    PubMed

    Gao, H; Wu, Y; Zhang, T; Wu, Y; Jiang, L; Zhan, J; Li, J; Yang, R

    2014-12-01

    Given the drawbacks of implementing multivariate analysis for mapping multiple traits in genome-wide association study (GWAS), principal component analysis (PCA) has been widely used to generate independent 'super traits' from the original multivariate phenotypic traits for the univariate analysis. However, parameter estimates in this framework may not be the same as those from the joint analysis of all traits, leading to spurious linkage results. In this paper, we propose to perform the PCA for residual covariance matrix instead of the phenotypical covariance matrix, based on which multiple traits are transformed to a group of pseudo principal components. The PCA for residual covariance matrix allows analyzing each pseudo principal component separately. In addition, all parameter estimates are equivalent to those obtained from the joint multivariate analysis under a linear transformation. However, a fast least absolute shrinkage and selection operator (LASSO) for estimating the sparse oversaturated genetic model greatly reduces the computational costs of this procedure. Extensive simulations show statistical and computational efficiencies of the proposed method. We illustrate this method in a GWAS for 20 slaughtering traits and meat quality traits in beef cattle.
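
    The core idea in the abstract above, rotating the traits by the eigenvectors of the residual covariance matrix so that the resulting pseudo principal components have uncorrelated residuals, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: two traits only, a single SNP genotype as the sole fixed effect, ordinary least squares in place of the authors' full model and LASSO step, and made-up data and function names.

```python
import math

def ols_residuals(x, y):
    """Residuals of y after simple linear regression on x (intercept + slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]

def covariance(a, b):
    """Sample covariance of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / (n - 1)

def residual_pca_2traits(x, y1, y2):
    """Rotate two traits by the eigenvectors of their 2x2 residual covariance."""
    r1, r2 = ols_residuals(x, y1), ols_residuals(x, y2)
    a, b, d = covariance(r1, r1), covariance(r1, r2), covariance(r2, r2)
    # Rotation angle that diagonalises the symmetric matrix [[a, b], [b, d]].
    theta = 0.5 * math.atan2(2 * b, a - d)
    c, s = math.cos(theta), math.sin(theta)
    pc1 = [c * u + s * v for u, v in zip(y1, y2)]
    pc2 = [-s * u + c * v for u, v in zip(y1, y2)]
    return pc1, pc2

# Hypothetical data: genotype coded 0/1/2 and two correlated traits.
genotype = [0, 1, 2, 0, 1, 2, 1, 0]
trait1 = [1.0, 1.8, 3.1, 0.7, 2.2, 2.9, 1.9, 1.2]
trait2 = [0.5, 1.1, 1.4, 0.2, 1.6, 1.7, 0.9, 0.8]

pc1, pc2 = residual_pca_2traits(genotype, trait1, trait2)
# After the rotation, the residuals of the two pseudo components are uncorrelated.
resid_cov = covariance(ols_residuals(genotype, pc1), ols_residuals(genotype, pc2))
```

    Because the rotation is built from the residual (not the phenotypic) covariance, each pseudo component can then be tested against the SNP on its own; with more than two traits a general eigendecomposition routine would replace the closed-form rotation.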

  12. Exploring the Major Sources and Extent of Heterogeneity in a Genome-Wide Association Meta-Analysis.

    PubMed

    Pei, Yu-Fang; Tian, Qing; Zhang, Lei; Deng, Hong-Wen

    2016-03-01

    Genome-wide association (GWA) meta-analysis has become a popular approach for discovering genetic variants responsible for complex diseases. The between-study heterogeneity effect is a severe issue that may complicate the interpretation of results. Aiming to improve the interpretation of meta-analysis results, we empirically explored the extent and source of heterogeneity effect. We analyzed a previously reported GWA meta-analysis of obesity, in which over 21,000 subjects from seven individual samples were meta-analyzed. We first evaluated the extent and distribution of heterogeneity across the entire genome. We then studied the effects of several potentially confounding factors, including age, ethnicity, gender composition, study type, and genotype imputation on heterogeneity with a random-effects meta-regression model. Of the total 4,325,550 SNPs being tested, heterogeneity was moderate to very large for 25.4% of the total SNPs. Heterogeneity was more severe in SNPs with stronger association signals. Ethnicity, average age, and genotype imputation accuracy had significant effects on the heterogeneity. Exploring the effects of ethnicity can provide clues to the potential ethnic-specific effects for two loci known to affect obesity, MC4R, and MTCH2. Our analysis can help to clarify understanding of the obesity mechanism and may provide guidance for an effective design of future GWA meta-analysis.
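
    The abstract above does not say which heterogeneity measure was used. As a hedged illustration, the sketch below computes two commonly used quantities for a single SNP, Cochran's Q and the I² statistic, from per-study effect estimates and standard errors. The function name and the seven effect sizes are hypothetical, and the choice of Q and I² is an assumption rather than the authors' stated method.

```python
def heterogeneity(betas, ses):
    """Return (pooled fixed-effect estimate, Cochran's Q, I^2) for one SNP."""
    weights = [1.0 / se ** 2 for se in ses]          # inverse-variance weights
    pooled = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, betas))
    df = len(betas) - 1
    # I^2 is the share of total variation attributed to between-study
    # heterogeneity; it is truncated at zero when Q is below its df.
    i2 = max(0.0, (q - df) / q) if q > 0 else 0.0
    return pooled, q, i2

# Hypothetical per-study estimates for one SNP across seven samples.
example_betas = [0.10, 0.12, 0.08, 0.30, 0.11, 0.09, -0.05]
example_ses = [0.04, 0.05, 0.04, 0.06, 0.05, 0.04, 0.06]
pooled, q, i2 = heterogeneity(example_betas, example_ses)
```

    Running this per SNP and tabulating the share of SNPs whose I² exceeds a chosen cut-off would give a genome-wide summary of the kind the abstract reports; the cut-offs themselves are not given in the source.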

  13. Genome-wide analysis of the AP2/ERF family in Musa species reveals divergence and neofunctionalisation during evolution

    PubMed Central

    Lakhwani, Deepika; Pandey, Ashutosh; Dhar, Yogeshwar Vikram; Bag, Sumit Kumar; Trivedi, Prabodh Kumar; Asif, Mehar Hasan

    2016-01-01

    AP2/ERF domain containing transcription factor super family is one of the important regulators in the plant kingdom. The involvement of AP2/ERF family members has been elucidated in various processes associated with plant growth, development as well as in response to hormones, biotic and abiotic stresses. In this study, we carried out genome-wide analysis to identify members of AP2/ERF family in Musa acuminata (A genome) and Musa balbisiana (B genome) and changes leading to neofunctionalisation of genes. Analysis identified 265 and 318 AP2/ERF encoding genes in M. acuminata and M. balbisiana respectively which were further classified into ERF, DREB, AP2, RAV and Soloist groups. Comparative analysis indicated that AP2/ERF family has undergone duplication, loss and divergence during evolution and speciation of the Musa A and B genomes. We identified nine genes which are up-regulated during fruit ripening and might be components of the regulatory machinery operating during ethylene-dependent ripening in banana. Tissue-specific expression analysis of the genes suggests that different regulatory mechanisms might be involved in peel and pulp ripening process through recruiting specific ERFs in these tissues. Analysis also suggests that MaRAV-6 and MaERF026 have structurally diverged from their M. balbisiana counterparts and have attained new functions during ripening. PMID:26733055

  14. Multiple-trait genome-wide association study based on principal component analysis for residual covariance matrix

    PubMed Central

    Gao, H; Zhang, T; Wu, Y; Wu, Y; Jiang, L; Zhan, J; Li, J; Yang, R

    2014-01-01

    Given the drawbacks of implementing multivariate analysis for mapping multiple traits in genome-wide association study (GWAS), principal component analysis (PCA) has been widely used to generate independent ‘super traits’ from the original multivariate phenotypic traits for the univariate analysis. However, parameter estimates in this framework may not be the same as those from the joint analysis of all traits, leading to spurious linkage results. In this paper, we propose to perform the PCA for residual covariance matrix instead of the phenotypical covariance matrix, based on which multiple traits are transformed to a group of pseudo principal components. The PCA for residual covariance matrix allows analyzing each pseudo principal component separately. In addition, all parameter estimates are equivalent to those obtained from the joint multivariate analysis under a linear transformation. However, a fast least absolute shrinkage and selection operator (LASSO) for estimating the sparse oversaturated genetic model greatly reduces the computational costs of this procedure. Extensive simulations show statistical and computational efficiencies of the proposed method. We illustrate this method in a GWAS for 20 slaughtering traits and meat quality traits in beef cattle. PMID:24984606

  15. Genome-wide investigation and transcriptome analysis of the WRKY gene family in Gossypium.

    PubMed

    Ding, Mingquan; Chen, Jiadong; Jiang, Yurong; Lin, Lifeng; Cao, YueFen; Wang, Minhua; Zhang, Yuting; Rong, Junkang; Ye, Wuwei

    2015-02-01

    WRKY transcription factors play important roles in various stress responses in diverse plant species. In cotton, this family has not been well studied, especially in relation to fiber development. Here, the genomes and transcriptomes of Gossypium raimondii and Gossypium arboreum were investigated to identify fiber development related WRKY genes. This represents the first comprehensive comparative study of WRKY transcription factors in both diploid A and D cotton species. In total, 112 G. raimondii and 109 G. arboreum WRKY genes were identified. No significant gene structure or domain alterations were detected between the two species, but many SNPs distributed unequally in exon and intron regions. Physical mapping revealed that the WRKY genes in G. arboreum were not located in the corresponding chromosomes of G. raimondii, suggesting great chromosome rearrangement in the diploid cotton genomes. The cotton WRKY genes, especially subgroups I and II, have expanded through multiple whole genome duplications and tandem duplications compared with other plant species. Sequence comparison showed many functionally divergent sites between WRKY subgroups, while the genes within each group are under strong purifying selection. Transcriptome analysis suggested that many WRKY genes participate in specific fiber development processes such as fiber initiation, elongation and maturation with different expression patterns between species. Complex WRKY gene expression such as differential Dt and At allelic gene expression in G. hirsutum and alternative splicing events were also observed in both diploid and tetraploid cottons during fiber development process. In conclusion, this study provides important information on the evolution and function of WRKY gene family in cotton species.

  16. Genome-wide transcriptome analysis revealed organelle specific responses to temperature variations in algae

    PubMed Central

    Shin, HyeonSeok; Hong, Seong-Joo; Yoo, Chan; Han, Mi-Ae; Lee, Hookeun; Choi, Hyung-Kyoon; Cho, Suhyung; Lee, Choul-Gyun; Cho, Byung-Kwan

    2016-01-01

    Temperature is a critical environmental factor that affects microalgal growth. However, microalgal coping mechanisms for temperature variations are unclear. Here, we determined changes in transcriptome, total carbohydrate, total fatty acid methyl ester, and fatty acid composition of Tetraselmis sp. KCTC12432BP, a strain with a broad temperature tolerance range, to elucidate the tolerance mechanisms in response to large temperature variations. Owing to unavailability of genome sequence information, de novo transcriptome assembly coupled with BLAST analysis was performed using strand specific RNA-seq data. This resulted in 26,245 protein-coding transcripts, of which 83.7% could be annotated to putative functions. We identified more than 681 genes differentially expressed, suggesting an organelle-specific response to temperature variation. Among these, the genes related to the photosynthetic electron transfer chain, which are localized in the plastid thylakoid membrane, were upregulated at low temperature. However, the transcripts related to the electron transport chain and biosynthesis of phosphatidylethanolamine localized in mitochondria were upregulated at high temperature. These results show that the low energy uptake by repressed photosynthesis under low and high temperature conditions is compensated by different mechanisms, including photosystem I and mitochondrial oxidative phosphorylation, respectively. This study illustrates that microalgae tolerate different temperature conditions through organelle specific mechanisms. PMID:27883062

  17. Genome-wide Analysis of WD40 Protein Family in Human

    PubMed Central

    Zou, Xu-Dong; Hu, Xue-Jia; Ma, Jing; Li, Tuan; Ye, Zhi-Qiang; Wu, Yun-Dong

    2016-01-01

    The WD40 proteins, often acting as scaffolds to form functional complexes in fundamental cellular processes, are one of the largest families encoded by the eukaryotic genomes. Systematic studies of this family on genome scale are highly required for understanding their detailed functions, but are currently lacking in the animal lineage. Here we present a comprehensive in silico study of the human WD40 family. We have identified 262 non-redundant WD40 proteins, and grouped them into 21 classes according to their domain architectures. Among them, 11 animal-specific domain architectures have been recognized. Sequence alignment indicates the complicated duplication and recombination events in the evolution of this family. Through further phylogenetic analysis, we have revealed that the WD40 family underwent more expansion than the overall average in the evolutionary early stage, and the early emerged WD40 proteins are prone to domain architectures with fundamental cellular roles and more interactions. While most widely and highly expressed human WD40 genes originated early, the tissue-specific ones often have late origin. These results provide a landscape of the human WD40 family concerning their classification, evolution, and expression, serving as a valuable complement to the previous studies in the plant lineage. PMID:27991561

  18. Genome-wide linkage disequilibrium analysis in bread wheat and durum wheat.

    PubMed

    Somers, Daryl J; Banks, Travis; Depauw, Ron; Fox, Stephen; Clarke, John; Pozniak, Curtis; McCartney, Curt

    2007-06-01

    Bread wheat and durum wheat were examined for linkage disequilibrium (LD) using microsatellite markers distributed across the genome. The allele database consisted of 189 bread wheat accessions genotyped at 370 loci and 93 durum wheat accessions genotyped at 245 loci. A significance level of p < 0.001 was set for all comparisons. The bread and durum wheat collections showed that 47.9% and 14.0% of all locus pairs were in LD, respectively. LD was more prevalent between loci on the same chromosome compared with loci on independent chromosomes and was highest between adjacent loci. Only a small fraction (bread wheat, 0.9%; durum wheat, 3.2%) of the locus pairs in LD showed R² values > 0.2. The LD between adjacent locus pairs extended (R² > 0.2) approximately 2-3 cM, on average, but some regions of the bread and durum wheat genomes showed high levels of LD (R² = 0.7 and 1.0, respectively) extending 41.2 and 25.5 cM, respectively. The wheat collections were clustered by similarity into subpopulations using unlinked microsatellite data and the software Structure. Analysis within subpopulations showed 14- to 16-fold fewer locus pairs in LD, higher R² values for those pairs in LD, and LD extending further along the chromosome. The data suggest that LD mapping of wheat can be performed with simple sequence repeats to a resolution of <5 cM.
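
    The R² statistic quoted in this abstract measures how strongly alleles at two loci co-occur. The following is a minimal sketch for the simplest case only, assuming two biallelic loci with phased haplotypes coded 0/1. Microsatellites are multi-allelic, so this is not the computation the authors ran; the function name `ld_r2` and the input layout are illustrative assumptions.

```python
def ld_r2(haplotypes):
    """Squared allelic correlation between two biallelic loci.

    haplotypes: list of (a, b) pairs, each allele coded 0 or 1
    (a hypothetical input layout chosen for this sketch).
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("no haplotypes supplied")
    p_a = sum(a for a, _ in haplotypes) / n          # frequency of allele 1 at locus A
    p_b = sum(b for _, b in haplotypes) / n          # frequency of allele 1 at locus B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("a locus is monomorphic; r^2 is undefined")
    return d * d / denom
```

    Under this definition, a locus pair would count toward the "R² > 0.2" fraction when `ld_r2(...)` exceeds 0.2.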

  19. Genome-Wide Analysis of Codon Usage Bias in Epichloë festucae

    PubMed Central

    Li, Xiuzhang; Song, Hui; Kuang, Yu; Chen, Shuihong; Tian, Pei; Li, Chunjie; Nan, Zhibiao

    2016-01-01

    Analysis of codon usage data has both practical and theoretical applications in understanding the basics of molecular biology. Differences in codon usage patterns among genes reflect variations in local base compositional biases and the intensity of natural selection. Recently, there have been several reports related to codon usage in fungi, but little is known about codon usage bias in Epichloë endophytes. The present study aimed to assess codon usage patterns and biases in 4870 sequences from Epichloë festucae, which may be helpful in revealing the constraint factors such as mutation or selection pressure and improving the bioreactor on the cloning, expression, and characterization of some special genes. The GC content (56.41%) is higher than the AT content (43.59%) in E. festucae. The results of neutrality and effective number of codons plot analyses showed that both mutational bias and natural selection play roles in shaping codon usage in this species. We found that gene length is strongly correlated with codon usage and may contribute to the codon usage patterns observed in genes. Nucleotide composition and gene expression levels also shape codon usage bias in E. festucae. E. festucae exhibits codon usage bias based on the relative synonymous codon usage (RSCU) values of 61 sense codons, with 25 codons showing an RSCU larger than 1. In addition, we identified 27 optimal codons that end in a G or C. PMID:27428961
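
    The RSCU and GC-content figures in this abstract follow from simple counting. The sketch below shows one way such values could be computed, assuming the standard genetic code and an in-frame coding sequence; the function names `rscu` and `gc_content` are illustrative and are not taken from the study.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds):
    """Relative synonymous codon usage for the sense codons seen in `cds`.

    RSCU = observed count of a codon divided by the count expected if all
    synonymous codons for that amino acid were used equally often.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3                 # ignore a trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    counts = Counter(c for c in codons if CODON_TABLE.get(c, "*") != "*")

    families = {}                                    # amino acid -> synonymous codons
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)

    values = {}
    for synonyms in families.values():
        total = sum(counts[c] for c in synonyms)
        if total == 0:
            continue                                 # amino acid absent from the sequence
        for c in synonyms:
            values[c] = counts[c] * len(synonyms) / total
    return values


def gc_content(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)
```

    A codon with RSCU above 1 is used more often than expected under equal synonymous usage, which is the criterion behind the "25 codons showing an RSCU larger than 1" statement.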

  20. Genome-wide analysis for identification of salt-responsive genes in common wheat.

    PubMed

    Kawaura, Kanako; Mochida, Keiichi; Ogihara, Yasunari

    2008-08-01

    To identify salt-responsive genes in wheat, global expression analysis of transcripts was carried out using oligo-DNA microarrays. Microarrays have been designed from approximately 32,000 unique wheat genes classified from a large number of expressed sequence tags (ESTs). Two-week-old seedlings of wheat were treated with 150 mM NaCl for 1, 6, and 24 h, and their roots and shoots were separately subjected to analyses. Consequently, 5,996 genes showed changes in expression of more than twofold and were classified into 12 groups according to correlations in expression patterns. These salt-responsive genes were assigned functions using the Gene Ontology (GO). Genes assigned to transcription factor, transcription-regulator activity, and DNA-binding functions were preferentially classified into early response groups. On the other hand, those assigned transferase and transporter activity were classified into late response groups. These data suggest that multiple signal transduction pathways in response to salinity exist in wheat. Transcription factors (TFs) which have been reported as participants in salt-tolerant pathway changed their expression levels in response to salt treatment. Among them, only a few TFs show high sequence homologies to genes in rice. These investigations suggest that salt-responsive genes identified by this study are candidates for salt-stress tolerance uniquely in wheat.
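
    The "more than twofold" criterion in this abstract corresponds to an absolute log2 ratio greater than 1. A minimal sketch of such a filter follows, assuming expression values are held in plain dictionaries keyed by gene; the function name, the data layout, and the pseudocount are illustrative assumptions, not details from the study.

```python
import math


def twofold_changed(control, treated, pseudocount=1.0):
    """Return {gene: log2 fold change} for genes changed more than twofold.

    control, treated: dicts mapping gene name to an expression value.
    pseudocount: small constant added to both values so that zero
    expression does not cause division by zero (an assumption of this sketch).
    """
    changed = {}
    for gene in control.keys() & treated.keys():     # genes measured in both conditions
        ratio = (treated[gene] + pseudocount) / (control[gene] + pseudocount)
        log2_fc = math.log2(ratio)
        if abs(log2_fc) > 1:                         # more than twofold up or down
            changed[gene] = log2_fc
    return changed
```

    Applying the filter separately to each time point and tissue would yield per-condition gene sets that could then be grouped by expression pattern.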

  1. Genome-wide functional analysis of cotton (Gossypium hirsutum) in response to drought.

    PubMed

    Chen, Yun; Liu, Zhi-Hao; Feng, Li; Zheng, Yong; Li, Deng-Di; Li, Xue-Bao

    2013-01-01

    Cotton is one of the most important crops for its natural textile fibers in the world. However, it often suffers from drought stress during its growth and development, resulting in a drastic reduction in cotton productivity. Therefore, studying the molecular mechanism of cotton drought tolerance is very important for increasing cotton production. To investigate the molecular mechanism of cotton drought resistance, we employed RNA-Seq technology to identify differentially expressed genes in the leaves of two different cultivars (drought-resistant cultivar J-13 and drought-sensitive cultivar Lu-6) of cotton. The results indicated that about 13.38% to 18.75% of all unigenes were differentially expressed between the drought-resistant sample and the drought-sensitive control, and the number of differentially expressed genes increased with prolonged drought treatment. DEG (differentially expressed gene) analysis showed that the normal biophysical profiles of cotton (cultivar J-13) were affected by drought stress, and some cellular metabolic processes (including photosynthesis) were inhibited in cotton under drought conditions. Furthermore, the experimental data revealed that there were significant differences in expression levels of the genes related to abscisic acid signaling, ethylene signaling and jasmonic acid signaling pathways between drought-resistant cultivar J-13 and drought-sensitive cultivar Lu-6, implying that these signaling pathways may participate in cotton response and tolerance to drought stress.

  2. Genome-wide analysis of the GRAS gene family in physic nut (Jatropha curcas L.).

    PubMed

    Wu, Z Y; Wu, P Z; Chen, Y P; Li, M R; Wu, G J; Jiang, H W

    2015-12-29

    GRAS proteins play vital roles in plant growth and development. Physic nut (Jatropha curcas L.) was found to have a total of 48 GRAS family members (JcGRAS), 15 more than those found in Arabidopsis. The JcGRAS genes were divided into 12 subfamilies or 15 ancient monophyletic lineages based on the phylogenetic analysis of GRAS proteins from both flowering and lower plants. The functions of GRAS genes in 9 subfamilies have been reported previously for several plants, while the genes in the remaining 3 subfamilies were of unknown function; we named the latter families U1 to U3. No member of U3 subfamily is present in Arabidopsis and Poaceae species according to public genome sequence data. In comparison with the number of GRAS genes in Arabidopsis, more were detected in physic nut, resulting from the retention of many ancient GRAS subfamilies and the formation of tandem repeats during evolution. No evidence of recent duplication among JcGRAS genes was observed in physic nut. Based on digital gene expression data, 21 of the 48 genes exhibited differential expression in four tissues analyzed. Two members of subfamily U3 were expressed only in buds and flowers, implying that they may play specific roles. Our results provide valuable resources for future studies on the functions of GRAS proteins in physic nut.

  3. Genome-wide identification, evolution and expression analysis of RING finger protein genes in Brassica rapa

    PubMed Central

    Alam, Intikhab; Yang, Yan-Qing; Wang, Yong; Zhu, Mei-Lan; Wang, Heng-Bo; Chalhoub, Boulos; Lu, Yun-Hai

    2017-01-01

    A growing number of RING finger genes have been found to be implicated in various important biological processes. In the present study, a total of 731 RING domains in 715 predicted proteins were identified in the Brassica rapa genome (AA, 2n = 20), which were further divided into eight types: RING-H2 (371), RING-HCa (215), RING-HCb (47), RING-v (44), RING-C2 (38), RING-D (10), RING-S/T (5) and RING-G (1). The 715 RING finger proteins were further classified into 51 groups according to the presence of additional domains. 700 RING finger protein genes were mapped to the 10 chromosomes of B. rapa with a range of 47 to 111 genes for each chromosome. 667 RING finger protein genes were expressed in at least one of the six tissues examined, indicating their involvement in various physiological and developmental processes in B. rapa. Hierarchical clustering analysis of RNA-seq data divided them into seven major groups, one of which includes 231 members preferentially expressed in leaf and thus constitutes a panel of candidate genes for studying the genetic and molecular mechanisms of leafy head traits in Brassica crops. Our results lay the foundation for further studies on the classification, evolution and putative functions of RING finger protein genes in Brassica species. PMID:28094809

  4. Identification of candidate genes associated with porcine meat color traits by genome-wide transcriptome analysis

    PubMed Central

    Li, Bojiang; Dong, Chao; Li, Pinghua; Ren, Zhuqing; Wang, Han; Yu, Fengxiang; Ning, Caibo; Liu, Kaiqing; Wei, Wei; Huang, Ruihua; Chen, Jie; Wu, Wangjun; Liu, Honglin

    2016-01-01

    Meat color is considered to be the most important indicator of meat quality; however, the molecular mechanisms underlying traits related to meat color remain mostly unknown. In this study, to elucidate the molecular basis of meat color, we constructed six cDNA libraries from biceps femoris (Bf) and soleus (Sol), which exhibit obvious differences in meat color, and analyzed the whole-transcriptome differences between Bf (white muscle) and Sol (red muscle) using high-throughput sequencing technology. Using the DESeq2 method, we identified 138 differentially expressed genes (DEGs) between Bf and Sol. Using the DEGseq method, we identified 770, 810, and 476 DEGs in comparisons between Bf and Sol in three separate animals. Of these DEGs, 52 were overlapping DEGs. Using these data, we determined the enriched GO terms, metabolic pathways and candidate genes associated with meat color traits. Additionally, we mapped 114 non-redundant DEGs to the meat color QTLs via a comparative analysis with the porcine quantitative trait loci (QTL) database. Overall, our data serve as a valuable resource for identifying genes whose functions are critical for meat color traits and can accelerate studies of the molecular mechanisms of meat color formation. PMID:27748458

  5. Coelomata and Not Ecdysozoa: Evidence From Genome-Wide Phylogenetic Analysis

    PubMed Central

    Wolf, Yuri I.; Rogozin, Igor B.; Koonin, Eugene V.

    2004-01-01

    Relative positions of nematodes, arthropods, and chordates in animal phylogeny remain uncertain. The traditional tree topology joins arthropods with chordates in a coelomate clade, whereas nematodes, which lack a coelom, occupy a basal position. However, the current leading hypothesis, based on phylogenetic trees for 18S ribosomal RNA and several proteins, joins nematodes with arthropods in a clade of molting animals, Ecdysozoa. We performed a phylogenetic analysis of over 500 sets of orthologous proteins, which are represented in plants, animals, and fungi, using maximum likelihood, maximum parsimony, and distance methods. Additionally, to increase the statistical power of topology tests, the same methods were applied to concatenated alignments of subunits of eight conserved macromolecular complexes. The majority of the methods, when applied to most of the orthologous clusters, both concatenated and individual, grouped the fly with humans to the exclusion of the nematode, in support of the coelomate phylogeny. Trees were also constructed using information on insertions and deletions in orthologous proteins, combinations of domains in multidomain proteins, and presence-absence of species in clusters of orthologs. All of these approaches supported the coelomate clade and showed concordance between evolution of protein sequences and higher-level evolutionary events, such as domain fusion or gene loss. PMID:14707168

  6. Genome-wide transcriptome analysis of Clavibacter michiganensis subsp. michiganensis grown in xylem mimicking medium.

    PubMed

    Hiery, Eva; Adam, Susanne; Reid, Stephen; Hofmann, Jörg; Sonnewald, Sophia; Burkovski, Andreas

    2013-12-01

    The interaction of Clavibacter michiganensis subsp. michiganensis with its host, the tomato plant (Solanum lycopersicum), is poorly understood and only a few virulence factors are known. While studying the bacteria in planta is time-consuming and difficult, analysis in vitro would facilitate research. Therefore, a xylem mimicking medium (XMM) for C. michiganensis subsp. michiganensis was established in this study based on an apoplast medium for Xanthomonas campestris pv. vesicatoria. In contrast to the apoplast medium, XMM contains no sugars, but amino acids which serve as nitrogen and carbon source. As a result, growth in XMM induced transcriptional changes of genes encoding putative sugar, amino acid and iron uptake systems. In summary, mRNA levels of about 8% of all C. michiganensis subsp. michiganensis genes were changed when XMM-grown bacteria were compared to M9 minimal medium-grown cells. Almost no transcriptional changes of genes encoding hydrolytic enzymes were detected, leading to the idea that XMM reflects the situation at the beginning of infection and therefore allows the characterization of virulence factors in this early stage of infection. The addition of the plant wound substance acetosyringone to XMM led to changes in transcript amounts, including those of genes coding for proteins involved in protein transport, iron uptake and regulation processes.

  7. Genome-wide analysis of TIAR RNA ligands in mouse macrophages before and after LPS stimulation

    PubMed Central

    Kharraz, Yacine; Lefort, Anne; Libert, Frédérick; Mann, Christopher J.; Gueydan, Cyril; Kruys, Véronique

    2016-01-01

    TIA-1 related protein (TIAR) is an RNA-binding protein involved in several steps of gene expression such as RNA splicing (Aznarez et al., 2008) and translation (Piecyk et al., 2000). TIAR contains three RNA recognition motifs (RRMs) allowing its interaction with specific sequences localized in the untranslated regions (UTRs) of several mRNAs. In myeloid cells, TIAR has been shown to bind and regulate the translation and stability of various mRNAs encoding proteins important for the inflammatory response, such as TNFα (Piecyk et al., 2000; Gueydan et al., 1999), Cox-2 (Cok et al., 2003) or IL-8 (Suswam et al., 2005). Here, we generated two macrophage-like RAW 264.7 cell lines expressing either a tagged full-length TIAR protein or an RRM2-truncated mutant unable to bind RNA with high affinity (Dember et al., 1996; Kim et al., 2013). By a combination of RNA-IP and microarray analysis (RIP-chip), we identified mRNAs specifically bound by the full-length protein both in basal conditions and in response to LPS (GSE77577). PMID:26981431

  8. Genome-wide analysis of TIAR RNA ligands in mouse macrophages before and after LPS stimulation.

    PubMed

    Kharraz, Yacine; Lefort, Anne; Libert, Frédérick; Mann, Christopher J; Gueydan, Cyril; Kruys, Véronique

    2016-03-01

    TIA-1 related protein (TIAR) is an RNA-binding protein involved in several steps of gene expression such as RNA splicing (Aznarez et al., 2008) and translation (Piecyk et al., 2000). TIAR contains three RNA recognition motifs (RRMs) allowing its interaction with specific sequences localized in the untranslated regions (UTRs) of several mRNAs. In myeloid cells, TIAR has been shown to bind and regulate the translation and stability of various mRNAs encoding proteins important for the inflammatory response, such as TNFα (Piecyk et al., 2000; Gueydan et al., 1999), Cox-2 (Cok et al., 2003) or IL-8 (Suswam et al., 2005). Here, we generated two macrophage-like RAW 264.7 cell lines expressing either a tagged full-length TIAR protein or an RRM2-truncated mutant unable to bind RNA with high affinity (Dember et al., 1996; Kim et al., 2013). By a combination of RNA-IP and microarray analysis (RIP-chip), we identified mRNAs specifically bound by the full-length protein both in basal conditions and in response to LPS (GSE77577).

  9. Genome-wide analysis of transcription factors involved in maize embryonic callus formation.

    PubMed

    Ge, Fei; Luo, Xu; Huang, Xing; Zhang, Yanling; He, Xiujing; Liu, Min; Lin, Haijian; Peng, Huanwei; Li, Lujiang; Zhang, Zhiming; Pan, Guangtang; Shen, Yaou

    2016-12-01

    In this study, a maize inbred line with a strong capacity to induce embryonic callus, 18-599R, was used to analyze the transcription factors expressed during embryonic callus formation. A total of 1180 transcription factors were found to be expressed during three key stages of callus induction. Of these, compared with control, 361, 346 and 328 transcription factors were significantly downregulated during stages I, II and III, respectively. In contrast, 355, 372 and 401 transcription factors (TFs) were upregulated during the respective stages. We constructed a transcription factor-mediated regulatory network and found that plant hormone signal transduction was the pathway most significantly enriched among TFs. This pathway includes 48 TFs regulating cell enlargement, cell differentiation, cell division and cell dedifferentiation via the response to plant hormones. Through real-time polymerase chain reaction (PCR) and degradome sequencing, we identified 23 transcription factors that are regulated by miRNA. Through further analysis, ZmMYB138, a member of the MYB transcription factor family localized in the nucleus, was verified to promote embryonic callus formation in the maize embryo through GA signal transduction.

  10. Genome-wide identification and expression analysis of WNK kinase gene family in rice.

    PubMed

    Manuka, Rakesh; Saddhe, Ankush Ashok; Kumar, Kundan

    2015-12-01

    Eukaryotic protein kinases represent one of the largest gene families involved in diverse regulatory functions. WNK (With No Lysine) kinases are members of the ser/thr protein kinase family that lack the conserved catalytic lysine (K) residue in protein kinase subdomain II, where it is replaced by an asparagine, serine or glycine residue. They are involved in regulation of flowering time, circadian rhythms and abiotic stresses in Arabidopsis thaliana. In the present study, we identified 9 WNK members in rice, which resemble Arabidopsis and human WNKs and cluster phylogenetically into five main clades. The predicted gene structures, bona fide conserved signature motif and domains strongly support their identity as members of the WNK kinase family. We analyzed their chromosomal distribution, physicochemical properties, subcellular localizations and cis-elements in the promoter regions in silico. Further, transcript analysis of OsWNK by qRT-PCR revealed their differential regulation in tissue-specific and abiotic stress libraries. In conclusion, the identification of nine OsWNK genes and their transcript-level expression patterns under abiotic stress using qRT-PCR in rice will significantly contribute towards the understanding of WNK genes in monocots and thus provide a basis for functional genomics studies of WNK protein kinases.

  11. Genome-wide primary transcriptome analysis of H2-producing archaeon Thermococcus onnurineus NA1.

    PubMed

    Cho, Suhyung; Kim, Min-Sik; Jeong, Yujin; Lee, Bo-Rahm; Lee, Jung-Hyun; Kang, Sung Gyun; Cho, Byung-Kwan

    2017-02-20

    In spite of their pivotal roles in transcriptional and post-transcriptional processes, the regulatory elements of archaeal genomes are not yet fully understood. Here, we determine the primary transcriptome of the H2-producing archaeon Thermococcus onnurineus NA1. We identified 1,082 purine-rich transcription initiation sites along with well-conserved TATA box, A-rich B recognition element (BRE), and promoter proximal element (PPE) motif in promoter regions, a high pyrimidine nucleotide content (T/C) at the -1 position, and Shine-Dalgarno (SD) motifs (GGDGRD) in 5' untranslated regions (5' UTRs). Along with differential transcript levels, 117 leaderless genes and 86 non-coding RNAs (ncRNAs) were identified, representing diverse cellular functions and potential regulatory functions under the different growth conditions. Interestingly, we observed low GC content in ncRNAs for RNA-based regulation via unstructured forms or interaction with other cellular components. Further comparative analysis of T. onnurineus upstream regulatory sequences with those of closely related archaeal genomes demonstrated that transcription of orthologous genes is initiated by highly conserved promoter sequences; however, their upstream sequences for transcriptional and translational regulation are largely diverse. These results provide the genetic information of T. onnurineus for its future application in metabolic engineering.

  12. Genome-Wide Comparative Analysis of Flowering-Related Genes in Arabidopsis, Wheat, and Barley

    PubMed Central

    Peng, Fred Y.; Hu, Zhiqiu; Yang, Rong-Cai

    2015-01-01

    Early flowering is an important trait influencing grain yield and quality in wheat (Triticum aestivum L.) and barley (Hordeum vulgare L.) in short-season cropping regions. However, due to large and complex genomes of these species, direct identification of flowering genes and their molecular characterization remain challenging. Here, we used a bioinformatic approach to predict flowering-related genes in wheat and barley from 190 known Arabidopsis (Arabidopsis thaliana (L.) Heynh.) flowering genes. We identified 900 and 275 putative orthologs in wheat and barley, respectively. The annotated flowering-related genes were clustered into 144 orthologous groups with one-to-one, one-to-many, many-to-one, and many-to-many orthology relationships. Our approach was further validated by domain and phylogenetic analyses of flowering-related proteins and comparative analysis of publicly available microarray data sets for in silico expression profiling of flowering-related genes in 13 different developmental stages of wheat and barley. These further analyses showed that orthologous gene pairs in three critical flowering gene families (PEBP, MADS, and BBX) exhibited similar expression patterns among 13 developmental stages in wheat and barley, suggesting similar functions among the orthologous genes with sequence and expression similarities. The predicted candidate flowering genes can be confirmed and incorporated into molecular breeding for early flowering wheat and barley in short-season cropping regions. PMID:26435710

  13. Genome-wide identification and expression analysis of aquaporins in tomato.

    PubMed

    Reuscher, Stefan; Akiyama, Masahito; Mori, Chiharu; Aoki, Koh; Shibata, Daisuke; Shiratake, Katsuhiro

    2013-01-01

    The family of aquaporins, also called water channels or major intrinsic proteins, is characterized by six transmembrane domains that together facilitate the transport of water and a variety of low molecular weight solutes. They are found in all domains of life, but show their highest diversity in plants. Numerous studies identified aquaporins as important targets for improving plant performance under drought stress. The phylogeny of aquaporins is well established based on model species like Arabidopsis thaliana, which can be used as a template to investigate aquaporins in other species. In this study we comprehensively identified aquaporin encoding genes in tomato (Solanum lycopersicum), which is an important vegetable crop and also serves as a model for fleshy fruit development. We found 47 aquaporin genes in the tomato genome and analyzed their structural features. Based on a phylogenetic analysis of the deduced amino acid sequences the aquaporin genes were assigned to five subfamilies (PIPs, TIPs, NIPs, SIPs and XIPs) and their substrate specificity was assessed on the basis of key amino acid residues. As ESTs were available for 32 genes, expression of these genes was analyzed in 13 different tissues and developmental stages of tomato. We detected tissue-specific and development-specific expression of tomato aquaporin genes, which is a first step towards revealing the contribution of aquaporins to water and solute transport in leaves and during fruit development.

  14. Genome-wide primary transcriptome analysis of H2-producing archaeon Thermococcus onnurineus NA1

    PubMed Central

    Cho, Suhyung; Kim, Min-Sik; Jeong, Yujin; Lee, Bo-Rahm; Lee, Jung-Hyun; Kang, Sung Gyun; Cho, Byung-Kwan

    2017-01-01

    In spite of their pivotal roles in transcriptional and post-transcriptional processes, the regulatory elements of archaeal genomes are not yet fully understood. Here, we determine the primary transcriptome of the H2-producing archaeon Thermococcus onnurineus NA1. We identified 1,082 purine-rich transcription initiation sites along with well-conserved TATA box, A-rich B recognition element (BRE), and promoter proximal element (PPE) motif in promoter regions, a high pyrimidine nucleotide content (T/C) at the −1 position, and Shine-Dalgarno (SD) motifs (GGDGRD) in 5′ untranslated regions (5′ UTRs). Along with differential transcript levels, 117 leaderless genes and 86 non-coding RNAs (ncRNAs) were identified, representing diverse cellular functions and potential regulatory functions under the different growth conditions. Interestingly, we observed low GC content in ncRNAs for RNA-based regulation via unstructured forms or interaction with other cellular components. Further comparative analysis of T. onnurineus upstream regulatory sequences with those of closely related archaeal genomes demonstrated that transcription of orthologous genes is initiated by highly conserved promoter sequences; however, their upstream sequences for transcriptional and translational regulation are largely diverse. These results provide the genetic information of T. onnurineus for its future application in metabolic engineering. PMID:28216628

  15. Genome-wide analysis of admixture and adaptation in the Africanized honeybee.

    PubMed

    Nelson, Ronald M; Wallberg, Andreas; Simões, Zilá Luz Paulino; Lawson, Daniel J; Webster, Matthew T

    2017-04-05

    Genetic exchange by hybridization or admixture can make an important contribution to evolution, and introgression of favourable alleles can facilitate adaptation to new environments. A small number of honeybees (Apis mellifera) with African ancestry were introduced to Brazil ~60 years ago, which dispersed and hybridized with existing managed populations of European origin, quickly spreading across much of the Americas in an example of a massive biological invasion. Here we analyse whole genome sequences of 32 Africanized honeybees sampled from throughout Brazil in order to study the effect of this process on genome diversity. By comparison with ancestral populations from Europe and Africa, we infer that these samples have 84% African ancestry, with the remainder from western European populations. However, this proportion varies across the genome and we identify signals of positive selection in regions with high European ancestry proportions. These observations are largely driven by one large gene-rich 1.4-Mbp segment on chromosome 11 where European haplotypes are present at a significantly elevated frequency and likely confer an adaptive advantage in the Africanized honeybee population. This region has previously been implicated in reproductive traits and foraging behaviour in worker bees. Finally, by analysing the distribution of ancestry tract lengths in the context of the known time of the admixture event, we are able to infer an average generation time of 2.0 years. Our analysis highlights the processes by which populations of mixed genetic ancestry form and adapt to new environments.

  16. Genome-wide identification, characterization, and expression analysis of the MLO gene family in Cucumis sativus.

    PubMed

    Zhou, S J; Jing, Z; Shi, J L

    2013-12-11

    Mildew resistance locus o (MLO) is a plant-specific seven-transmembrane (TM) gene family. Several studies have revealed that certain members of the MLO gene family mediate powdery mildew susceptibility in three plant species, namely, Arabidopsis, barley, and tomato. The sequenced cucumber genome provides an opportunity to conduct a comprehensive overview of the MLO gene family. Fourteen genes (designated CsMLO01 through CsMLO14) have been identified within the Cucumis sativus genome by using an in silico cloning method with the MLO amino acid sequences of Arabidopsis thaliana and rice as probes. Sequence alignment revealed that numerous features of the gene family, such as TMs, a calmodulin-binding domain, peptide domains I and II, and 30 important amino acid residues for MLO function, are well conserved. Phylogenetic analysis of the MLO genes from cucumber and other plant species reveals seven different clades (I through VII). Three of these clades comprised MLO genes from A. thaliana, rice, maize, and cucumber, suggesting that these genes may have evolved after the divergence of monocots and dicots. In silico mapping showed that these CsMLOs were located on chromosomes 1, 2, 3, 4, 5, and 6 without any obvious clustering, except CsMLO01. To our knowledge, this paper is the first comprehensive report on MLO genes in C. sativus. These findings will facilitate the functional characterization of the MLOs related to powdery mildew susceptibility and assist in the development of disease resistance in cucumber.

  17. Genome-Wide Comparative Analysis of Chemosensory Gene Families in Five Tsetse Fly Species

    PubMed Central

    Macharia, Rosaline; Mireji, Paul; Murungi, Edwin; Murilla, Grace; Christoffels, Alan; Aksoy, Serap; Masiga, Daniel

    2016-01-01

    For decades, odour-baited traps have been used for control of tsetse flies (Diptera: Glossinidae), vectors of African trypanosomes. However, differential responses to known attractants have been reported in different Glossina species, hindering establishment of a universal vector control tool. Availability of full genome sequences of five Glossina species offers an opportunity to compare their chemosensory repertoire and enhance our understanding of their biology in relation to chemosensation. Here, we identified and annotated the major chemosensory gene families in Glossina. We identified a total of 118, 115, 124, and 123 chemosensory genes in Glossina austeni, G. brevipalpis, G. f. fuscipes, and G. pallidipes, respectively, relative to 127 reported in G. m. morsitans. Our results show that tsetse fly genomes have fewer chemosensory genes when compared to other dipterans such as Musca domestica (n>393), Drosophila melanogaster (n = 246) and Anopheles gambiae (n>247). We also found that Glossina chemosensory genes are dispersed across distantly located scaffolds in their respective genomes, in contrast to other insects like D. melanogaster whose genes occur in clusters. Further, Glossina appears to be devoid of sugar receptors and to have expanded CO2-associated receptors, potentially reflecting Glossina's obligate hematophagy and the need to detect hosts that may be out of sight. We also identified, in all species, homologs of Ir84a, a Drosophila-specific ionotropic receptor that promotes male courtship, suggesting that this is a conserved trait in tsetse flies. Notably, our selection analysis revealed that a total of four gene loci (Gr21a, GluRIIA, Gr28b, and Obp83a) were under positive selection, which confers a fitness advantage on the species. These findings provide a platform for studies to further define the language of communication of tsetse with their environment, and influence development of novel approaches for control. PMID:26886411

  18. Genome-wide analysis of the AP2/ERF superfamily in peach (Prunus persica).

    PubMed

    Zhang, C H; Shangguan, L F; Ma, R J; Sun, X; Tao, R; Guo, L; Korir, N K; Yu, M L

    2012-10-17

    We identified 131 AP2/ERF (APETALA2/ethylene-responsive factor) genes in peach using the AP2/ERF amino acid sequences of Arabidopsis thaliana (Brassicaceae) as probes. Based on the number of AP2/ERF domains and individual gene characteristics, the AP2/ERF superfamily genes in peach can be classified broadly into three families, ERF (ethylene-responsive factor), RAV (related to ABI3/VP1), and AP2 (APETALA2), containing 104, 5, and 21 members, respectively, along with a solo gene (ppa005376m). The 104 genes in the ERF family were further divided into 11 groups based on the group classification made for Arabidopsis. The scaffold localizations indicated that 129 of the AP2/ERF genes were located on scaffolds 1 to 8, while the remaining two genes were on scaffolds 17 and 10. Although the primary structure varied among AP2/ERF superfamily proteins, their tertiary structures were similar. Most ERF family genes have no introns, while members of the AP2 family have more introns than genes in the ERF and RAV families. All sequences of AP2 family genes were disrupted by introns into several segments of varying sizes. The expression of the AP2/ERF superfamily genes was highest in the mesocarp; it was far higher than in the other seven tissues that we examined, implying that AP2/ERF superfamily genes play an important role in fruit growth and development in the peach. These results will be useful for selecting candidate genes from specific subgroups for functional analysis.
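
    The domain-based family assignment mentioned in the abstract above can be sketched as follows. The abstract does not spell out its rules, so this sketch assumes the convention commonly applied to Arabidopsis: two AP2/ERF domains indicate the AP2 family, a single AP2/ERF domain together with a B3 domain indicates RAV, and a single AP2/ERF domain alone indicates ERF. The function name and the example genes are hypothetical, and divergent "solo" genes, which need sequence-level inspection, are not handled.

```python
from collections import Counter


def classify_ap2_erf(n_ap2_domains, has_b3_domain=False):
    """Assign a gene to a family from its domain composition.

    Assumed rules (conventional, not taken from the abstract):
      two or more AP2/ERF domains        -> "AP2"
      one AP2/ERF domain plus a B3 domain -> "RAV"
      one AP2/ERF domain only             -> "ERF"
    """
    if n_ap2_domains < 1:
        raise ValueError("gene has no AP2/ERF domain")
    if n_ap2_domains >= 2:
        return "AP2"
    if has_b3_domain:
        return "RAV"
    return "ERF"


# Hypothetical genes: (name, number of AP2/ERF domains, has B3 domain).
toy_genes = [
    ("geneA", 1, False),
    ("geneB", 2, False),
    ("geneC", 1, True),
    ("geneD", 1, False),
]

# Tally family sizes, as reported per family in the abstract.
family_counts = Counter(classify_ap2_erf(n, b3) for _, n, b3 in toy_genes)
print(dict(family_counts))
```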

  19. Genome-wide analysis of Acetivibrio cellulolyticus provides a blueprint of an elaborate cellulosome system

    PubMed Central

    2012-01-01

    Background Microbial degradation of plant cell walls and its conversion to sugars and other byproducts is a key step in the carbon cycle on Earth. In order to process heterogeneous plant-derived biomass, specialized anaerobic bacteria use an elaborate multi-enzyme cellulosome complex to synergistically deconstruct cellulosic substrates. The cellulosome was first discovered in the cellulolytic thermophile, Clostridium thermocellum, and much of our knowledge of this intriguing type of protein composite is based on the cellulosome of this environmentally and biotechnologically important bacterium. The recently sequenced genome of the cellulolytic mesophile, Acetivibrio cellulolyticus, allows detailed comparison of the cellulosomes of these two select cellulosome-producing bacteria. Results Comprehensive analysis of the A. cellulolyticus draft genome sequence revealed a very sophisticated cellulosome system. Compared to C. thermocellum, the cellulosomal architecture of A. cellulolyticus is much more extensive, whereby the genome encodes for twice the number of cohesin- and dockerin-containing proteins. The A. cellulolyticus genome has thus evolved an inflated number of 143 dockerin-containing genes, coding for multimodular proteins with distinctive catalytic and carbohydrate-binding modules that play critical roles in biomass degradation. Additionally, 41 putative cohesin modules distributed in 16 different scaffoldin proteins were identified in the genome, representing a broader diversity and modularity than those of Clostridium thermocellum. Although many of the A. cellulolyticus scaffoldins appear in unconventional modular combinations, elements of the basic structural scaffoldins are maintained in both species. In addition, both species exhibit similarly elaborate cell-anchoring and cellulosome-related gene-regulatory elements. Conclusions This work portrays a particularly intricate, cell-surface cellulosome system in A. cellulolyticus and provides a blueprint for

  20. Genome-wide analysis of the MYB gene family in physic nut (Jatropha curcas L.).

    PubMed

    Zhou, Changpin; Chen, Yanbo; Wu, Zhenying; Lu, Wenjia; Han, Jinli; Wu, Pingzhi; Chen, Yaping; Li, Meiru; Jiang, Huawu; Wu, Guojiang

    2015-11-01

    The MYB proteins comprise one of the largest transcription factor families in plants, and play key roles in regulatory networks controlling development, metabolism, and stress responses. A total of 125 MYB genes (JcMYB) have been identified in the physic nut (Jatropha curcas L.) genome, including 120 2R-type MYB, 4 3R-MYB, and 1 4R-MYB genes. Based on exon-intron arrangement of MYBs from both lower (Physcomitrella patens) and higher (physic nut, Arabidopsis, and rice) plants, we can classify plant MYB genes into ten groups (MI-X), except for MIX genes which are nonexistent in higher plants. We also observed that MVIII genes may be one of the most ancient MYB types, consisting of both R2R3- and 3R-MYB genes. Most MYB genes (76.8% in physic nut) belong to the MI group, which can be divided into 34 subgroups. The JcMYB genes were nonrandomly distributed on the 11 linkage groups (LGs) of physic nut. The expansion of MYB genes across several subgroups was observed and resulted from genome triplication of ancient dicotyledons and from both ancient and recent tandem duplication events in the physic nut genome. The expression patterns of several MYB duplicates in the physic nut showed differences in four tissues (root, stem, leaf, and seed), and 34 MYB genes responded to at least one abiotic stressor (drought, salinity, phosphate starvation, and nitrogen starvation) in leaves and/or roots based on the data analysis of digital gene expression tags. Overexpression of the JcMYB001 gene in Arabidopsis increased its sensitivity to drought and salinity stresses.