POEM: Identifying Joint Additive Effects on Regulatory Circuits.
Botzman, Maya; Nachshon, Aharon; Brodt, Avital; Gat-Viks, Irit
2016-01-01
Expression Quantitative Trait Locus (eQTL) mapping tackles the problem of identifying variation in DNA sequence that have an effect on the transcriptional regulatory network. Major computational efforts are aimed at characterizing the joint effects of several eQTLs acting in concert to govern the expression of the same genes. Yet, progress toward a comprehensive prediction of such joint effects is limited. For example, existing eQTL methods commonly discover interacting loci affecting the expression levels of a module of co-regulated genes. Such "modularization" approaches, however, are focused on epistatic relations and thus have limited utility for the case of additive (non-epistatic) effects. Here we present POEM (Pairwise effect On Expression Modules), a methodology for identifying pairwise eQTL effects on gene modules. POEM is specifically designed to achieve high performance in the case of additive joint effects. We applied POEM to transcription profiles measured in bone marrow-derived dendritic cells across a population of genotyped mice. Our study reveals widespread additive, trans-acting pairwise effects on gene modules, characterizes their organizational principles, and highlights high-order interconnections between modules within the immune signaling network. These analyses elucidate the central role of additive pairwise effect in regulatory circuits, and provide computational tools for future investigations into the interplay between eQTLs. The software described in this article is available at csgi.tau.ac.il/POEM/.
POEM: Identifying Joint Additive Effects on Regulatory Circuits
Botzman, Maya; Nachshon, Aharon; Brodt, Avital; Gat-Viks, Irit
2016-01-01
Motivation: Expression Quantitative Trait Locus (eQTL) mapping tackles the problem of identifying variation in DNA sequence that have an effect on the transcriptional regulatory network. Major computational efforts are aimed at characterizing the joint effects of several eQTLs acting in concert to govern the expression of the same genes. Yet, progress toward a comprehensive prediction of such joint effects is limited. For example, existing eQTL methods commonly discover interacting loci affecting the expression levels of a module of co-regulated genes. Such “modularization” approaches, however, are focused on epistatic relations and thus have limited utility for the case of additive (non-epistatic) effects. Results: Here we present POEM (Pairwise effect On Expression Modules), a methodology for identifying pairwise eQTL effects on gene modules. POEM is specifically designed to achieve high performance in the case of additive joint effects. We applied POEM to transcription profiles measured in bone marrow-derived dendritic cells across a population of genotyped mice. Our study reveals widespread additive, trans-acting pairwise effects on gene modules, characterizes their organizational principles, and highlights high-order interconnections between modules within the immune signaling network. These analyses elucidate the central role of additive pairwise effect in regulatory circuits, and provide computational tools for future investigations into the interplay between eQTLs. Availability: The software described in this article is available at csgi.tau.ac.il/POEM/. PMID:27148351
Jiang, Wenzhu; Jin, Yong-Mei; Lee, Joohyun; Lee, Kang-Ie; Piao, Rihua; Han, Longzhi; Shin, Jin-Chul; Jin, Rong-De; Cao, Tiehua; Pan, Hong-Yu; Du, Xinglin; Koh, Hee-Jong
2011-01-01
Low temperature is one of the major environmental stresses in rice cultivation in high-altitude and high-latitude regions. In this study, we cultivated a set of recombinant inbred lines (RIL) derived from Dasanbyeo (indica) / TR22183 (japonica) crosses in Yanji (high-latitude area), Kunming (high-altitude area), Chuncheon (cold water irrigation) and Suwon (normal) to evaluate the main effects of quantitative trait loci (QTL) and epistatic QTL (E-QTL) with regard to their interactions with environments for coldrelated traits. Six QTLs for spikelet fertility (SF) were identified in three cold treatment locations. Among them, four QTLs on chromosomes 2, 7, 8, and 10 were validated by several near isogenic lines (NILs) under cold treatment in Chuncheon. A total of 57 QTLs and 76 E-QTLs for nine cold-related traits were identified as distributing on all 12 chromosomes; among them, 19 QTLs and E-QTLs showed significant interactions of QTLs and environments (QEIs). The total phenotypic variation explained by each trait ranged from 13.2 to 29.1% in QTLs, 10.6 to 29.0% in EQTLs, 2.2 to 8.8% in QEIs and 1.0% to 7.7% in E-QTL × environment interactions (E-QEIs). These results demonstrate that epistatic effects and QEIs are important properties of QTL parameters for cold tolerance at the reproductive stage. In order to develop cold tolerant varieties adaptable to wide-ranges of cold stress, a strategy facilitating marker-assisted selection (MAS) is being adopted to accumulate QTLs identified from different environments. PMID:22080374
Detection of eQTL modules mediated by activity levels of transcription factors.
Sun, Wei; Yu, Tianwei; Li, Ker-Chau
2007-09-01
Studies of gene expression quantitative trait loci (eQTL) in different organisms have shown the existence of eQTL hot spots: each being a small segment of DNA sequence that harbors the eQTL of a large number of genes. Two questions of great interest about eQTL hot spots arise: (1) which gene within the hot spot is responsible for the linkages, i.e. which gene is the quantitative trait gene (QTG)? (2) How does a QTG affect the expression levels of many genes linked to it? Answers to the first question can be offered by available biological evidence or by statistical methods. The second question is harder to address. One simple situation is that the QTG encodes a transcription factor (TF), which regulates the expression of genes linked to it. However, previous results have shown that TFs are not overrepresented in the eQTL hot spots. In this article, we consider the scenario that the propagation of genetic perturbation from a QTG to other linked genes is mediated by the TF activity. We develop a procedure to detect the eQTL modules (eQTL hot spots together with linked genes) that are compatible with this scenario. We first detect 27 eQTL modules from a yeast eQTL data, and estimate TF activity profiles using the method of Yu and Li (2005). Then likelihood ratio tests (LRTs) are conducted to find 760 relationships supporting the scenario of TF activity mediation: (DNA polymorphism --> cis-linked gene --> TF activity --> downstream linked gene). They are organized into 4 eQTL modules: an amino acid synthesis module featuring a cis-linked gene LEU2 and the mediating TF Leu3; a pheromone response module featuring a cis-linked gene GPA1 and the mediating TF Ste12; an energy-source control module featuring two cis-linked genes, GSY2 and HAP1, and the mediating TF Hap1; a mitotic exit module featuring four cis-linked genes, AMN1, CSH1, DEM1 and TOS1, and the mediating TF complex Ace2/Swi5. Gene Ontology is utilized to reveal interesting functional groups of the downstream genes in each module. Our methods are implemented in an R package: eqtl.TF, which includes source codes and relevant data. It can be freely downloaded at http://www.stat.ucla.edu/~sunwei/software.htm. http://www.stat.ucla.edu/~sunwei/yeast_eQTL_TF/supplementary.pdf.
Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P
2013-03-21
Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group, we provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.
Cheng, Lirui; Wang, Yun; Meng, Lijun; Hu, Xia; Cui, Yanru; Sun, Yong; Zhu, Linghua; Ali, Jauhar; Xu, Jianlong; Li, Zhikang
2012-01-01
Effect of genetic background on detection of quantitative trait locus (QTL) governing salinity tolerance (ST) was studied using two sets of reciprocal introgression lines (ILs) derived from a cross between a moderately salinity tolerant japonica variety, Xiushui09 from China, and a drought tolerant but salinity susceptible indica breeding line, IR2061-520-6-9 from the Philippines. Salt toxicity symptoms (SST) on leaves, days to seedling survival (DSS), and sodium and potassium uptake by shoots were measured under salinity stress of 140 mmol/L of NaCl. A total of 47 QTLs, including 26 main-effect QTLs (M-QTLs) and 21 epistatic QTLs (E-QTLs), were identified from the two sets of reciprocal ILs. Among the 26 M-QTLs, only four (15.4%) were shared in the reciprocal backgrounds while no shared E-QTLs were detected, indicating that ST QTLs, especially E-QTLs, were very specific to the genetic background. Further, 78.6% of the M-QTLs for SST and DSS identified in the reciprocal ILs were also detected in the recombinant inbred lines (RILs) from the same cross, which clearly brings out the background effect on ST QTL detection and its utilization in ST breeding. The detection of ILs with various levels of pyramiding of nonallelic M-QTL alleles for ST from Xiushui09 into IR2061-520-6-9 allowed us to further improve the ST in rice.
Main and epistatic QTL analyses for Sclerotinia Head Rot resistance in sunflower.
Zubrzycki, Jeremías Enrique; Maringolo, Carla Andrea; Filippi, Carla Valeria; Quiróz, Facundo José; Nishinakamasu, Verónica; Puebla, Andrea Fabiana; Di Rienzo, Julio A; Escande, Alberto; Lia, Verónica Viviana; Heinz, Ruth Amalia; Hopp, Horacio Esteban; Cervigni, Gerardo D L; Paniego, Norma Beatriz
2017-01-01
Sclerotinia Head Rot (SHR), a disease caused by Sclerotinia sclerotiorum, is one of the most limiting factors in sunflower production. In this study, we identified genomic loci associated with resistance to SHR to support the development of assisted breeding strategies. We genotyped 114 Recombinant Inbred Lines (RILs) along with their parental lines (PAC2 -partially resistant-and RHA266 -susceptible-) by using a 384 single nucleotide polymorphism (SNP) Illumina Oligo Pool Assay to saturate a sunflower genetic map. Subsequently, we tested these lines for SHR resistance using assisted inoculations with S. sclerotiorum ascospores. We also conducted a randomized complete-block assays with three replicates to visually score disease incidence (DI), disease severity (DS), disease intensity (DInt) and incubation period (IP) through four field trials (2010-2014). We finally assessed main effect quantitative trait loci (M-QTLs) and epistatic QTLs (E-QTLs) by composite interval mapping (CIM) and mixed-model-based composite interval mapping (MCIM), respectively. As a result of this study, the improved map incorporates 61 new SNPs over candidate genes. We detected a broad range of narrow sense heritability (h2) values (1.86-59.9%) as well as 36 M-QTLs and 13 E-QTLs along 14 linkage groups (LGs). On LG1, LG10, and LG15, we repeatedly detected QTLs across field trials; which emphasizes their putative effectiveness against SHR. In all selected variables, most of the identified QTLs showed high determination coefficients, associated with moderate to high heritability values. Using markers shared with previous Sclerotinia resistance studies, we compared the QTL locations in LG1, LG2, LG8, LG10, LG11, LG15 and LG16. This study constitutes the largest report of QTLs for SHR resistance in sunflower. Further studies focusing on the regions in LG1, LG10, and LG15 harboring the detected QTLs are necessary to identify causal alleles and contribute to unraveling the complex genetic basis governing the resistance.
González, Ana M.; Yuste-Lisbona, Fernando J.; Rodiño, A. Paula; De Ron, Antonio M.; Capel, Carmen; García-Alcázar, Manuel; Lozano, Rafael; Santalla, Marta
2015-01-01
Colletotrichum lindemuthianum is a hemibiotrophic fungal pathogen that causes anthracnose disease in common bean. Despite the genetics of anthracnose resistance has been studied for a long time, few quantitative trait loci (QTLs) studies have been conducted on this species. The present work examines the genetic basis of quantitative resistance to races 23 and 1545 of C. lindemuthianum in different organs (stem, leaf and petiole). A population of 185 recombinant inbred lines (RIL) derived from the cross PMB0225 × PHA1037 was evaluated for anthracnose resistance under natural and artificial photoperiod growth conditions. Using multi-environment QTL mapping approach, 10 and 16 main effect QTLs were identified for resistance to anthracnose races 23 and 1545, respectively. The homologous genomic regions corresponding to 17 of the 26 main effect QTLs detected were positive for the presence of resistance-associated gene cluster encoding nucleotide-binding and leucine-rich repeat (NL) proteins. Among them, it is worth noting that the main effect QTLs detected on linkage group 05 for resistance to race 1545 in stem, petiole and leaf were located within a 1.2 Mb region. The NL gene Phvul.005G117900 is located in this region, which can be considered an important candidate gene for the non-organ-specific QTL identified here. Furthermore, a total of 39 epistatic QTL (E-QTLs) (21 for resistance to race 23 and 18 for resistance to race 1545) involved in 20 epistatic interactions (eleven and nine interactions for resistance to races 23 and 1545, respectively) were identified. None of the main and epistatic QTLs detected displayed significant environment interaction effects. The present research provides essential information not only for the better understanding of the plant-pathogen interaction but also for the application of genomic assisted breeding for anthracnose resistance improvement in common bean through application of marker-assisted selection (MAS). PMID:25852706
Kogelman, Lisette J A; Zhernakova, Daria V; Westra, Harm-Jan; Cirera, Susanna; Fredholm, Merete; Franke, Lude; Kadarmideen, Haja N
2015-10-20
Obesity is a multi-factorial health problem in which genetic factors play an important role. Limited results have been obtained in single-gene studies using either genomic or transcriptomic data. RNA sequencing technology has shown its potential in gaining accurate knowledge about the transcriptome, and may reveal novel genes affecting complex diseases. Integration of genomic and transcriptomic variation (expression quantitative trait loci [eQTL] mapping) has identified causal variants that affect complex diseases. We integrated transcriptomic data from adipose tissue and genomic data from a porcine model to investigate the mechanisms involved in obesity using a systems genetics approach. Using a selective gene expression profiling approach, we selected 36 animals based on a previously created genomic Obesity Index for RNA sequencing of subcutaneous adipose tissue. Differential expression analysis was performed using the Obesity Index as a continuous variable in a linear model. eQTL mapping was then performed to integrate 60 K porcine SNP chip data with the RNA sequencing data. Results were restricted based on genome-wide significant single nucleotide polymorphisms, detected differentially expressed genes, and previously detected co-expressed gene modules. Further data integration was performed by detecting co-expression patterns among eQTLs and integration with protein data. Differential expression analysis of RNA sequencing data revealed 458 differentially expressed genes. The eQTL mapping resulted in 987 cis-eQTLs and 73 trans-eQTLs (false discovery rate < 0.05), of which the cis-eQTLs were associated with metabolic pathways. We reduced the eQTL search space by focusing on differentially expressed and co-expressed genes and disease-associated single nucleotide polymorphisms to detect obesity-related genes and pathways. Building a co-expression network using eQTLs resulted in the detection of a module strongly associated with lipid pathways. Furthermore, we detected several obesity candidate genes, for example, ENPP1, CTSL, and ABHD12B. To our knowledge, this is the first study to perform an integrated genomics and transcriptomics (eQTL) study using, and modeling, genomic and subcutaneous adipose tissue RNA sequencing data on obesity in a porcine model. We detected several pathways and potential causal genes for obesity. Further validation and investigation may reveal their exact function and association with obesity.
Rotival, Maxime; Zeller, Tanja; Wild, Philipp S; Maouche, Seraya; Szymczak, Silke; Schillert, Arne; Castagné, Raphaele; Deiseroth, Arne; Proust, Carole; Brocheton, Jessy; Godefroy, Tiphaine; Perret, Claire; Germain, Marine; Eleftheriadis, Medea; Sinning, Christoph R; Schnabel, Renate B; Lubos, Edith; Lackner, Karl J; Rossmann, Heidi; Münzel, Thomas; Rendon, Augusto; Erdmann, Jeanette; Deloukas, Panos; Hengstenberg, Christian; Diemert, Patrick; Montalescot, Gilles; Ouwehand, Willem H; Samani, Nilesh J; Schunkert, Heribert; Tregouet, David-Alexandre; Ziegler, Andreas; Goodall, Alison H; Cambien, François; Tiret, Laurence; Blankenberg, Stefan
2011-12-01
One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs) have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns-independent component analysis-to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739), previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1) is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178), which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644) was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes. This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the mechanisms linking genome-wide association loci to disease.
Mapping of Gene Expression Reveals CYP27A1 as a Susceptibility Gene for Sporadic ALS
van Rheenen, Wouter; Franke, Lude; Jansen, Ritsert C.; van Es, Michael A.; van Vught, Paul W. J.; Blauw, Hylke M.; Groen, Ewout J. N.; Horvath, Steve; Estrada, Karol; Rivadeneira, Fernando; Hofman, Albert; Uitterlinden, Andre G.; Robberecht, Wim; Andersen, Peter M.; Melki, Judith; Meininger, Vincent; Hardiman, Orla; Landers, John E.; Brown, Robert H.; Shatunov, Aleksey; Shaw, Christopher E.; Leigh, P. Nigel; Al-Chalabi, Ammar; Ophoff, Roel A.
2012-01-01
Amyotrophic lateral sclerosis (ALS) is a progressive, neurodegenerative disease characterized by loss of upper and lower motor neurons. ALS is considered to be a complex trait and genome-wide association studies (GWAS) have implicated a few susceptibility loci. However, many more causal loci remain to be discovered. Since it has been shown that genetic variants associated with complex traits are more likely to be eQTLs than frequency-matched variants from GWAS platforms, we conducted a two-stage genome-wide screening for eQTLs associated with ALS. In addition, we applied an eQTL analysis to finemap association loci. Expression profiles using peripheral blood of 323 sporadic ALS patients and 413 controls were mapped to genome-wide genotyping data. Subsequently, data from a two-stage GWAS (3,568 patients and 10,163 controls) were used to prioritize eQTLs identified in the first stage (162 ALS, 207 controls). These prioritized eQTLs were carried forward to the second sample with both gene-expression and genotyping data (161 ALS, 206 controls). Replicated eQTL SNPs were then tested for association in the second-stage GWAS data to find SNPs associated with disease, that survived correction for multiple testing. We thus identified twelve cis eQTLs with nominally significant associations in the second-stage GWAS data. Eight SNP-transcript pairs of highest significance (lowest p = 1.27×10−51) withstood multiple-testing correction in the second stage and modulated CYP27A1 gene expression. Additionally, we show that C9orf72 appears to be the only gene in the 9p21.2 locus that is regulated in cis, showing the potential of this approach in identifying causative genes in association loci in ALS. This study has identified candidate genes for sporadic ALS, most notably CYP27A1. Mutations in CYP27A1 are causal to cerebrotendinous xanthomatosis which can present as a clinical mimic of ALS with progressive upper motor neuron loss, making it a plausible susceptibility gene for ALS. PMID:22509407
Chauvet, Cristina; Crespo, Kimberley; Ménard, Annie; Roy, Julie; Deng, Alan Y
2013-11-15
Hypertension, the most frequently diagnosed clinical condition world-wide, predisposes individuals to morbidity and mortality, yet its underlying pathological etiologies are poorly understood. So far, a large number of quantitative trait loci (QTLs) have been identified in both humans and animal models, but how they function together in determining overall blood pressure (BP) in physiological settings is unknown. Here, we systematically and comprehensively performed pair-wise comparisons of individual QTLs to create a global picture of their functionality in an inbred rat model. Rather than each of numerous QTLs contributing to infinitesimal BP increments, a modularized pattern arises: two epistatic 'blocks' constitute basic functional 'units' for nearly all QTLs, designated as epistatic module 1 (EM1) and EM2. This modularization dictates the magnitude and scope of BP effects. Any EM1 member can contribute to BP additively to that of EM2, but not to those of the same module. Members of each EM display epistatic hierarchy, which seems to reflect a related functional pathway. Rat homologues of 11 human BP QTLs belong to either EM1 or EM2. Unique insights emerge into the novel genetic mechanism and hierarchy determining BP in the Dahl salt-sensitive SS/Jr (DSS) rat model that implicate a portion of human QTLs. Elucidating the pathways underlying EM1 and EM2 may reveal the genetic regulation of BP.
Chauvet, Cristina; Ménard, Annie; Deng, Alan Y
2015-09-01
Multiple quantitative trait loci (QTLs) for blood pressure (BP) have been detected in rat models of human polygenic hypertension. They influence BP physiologically via epistatic modules. Little is known about the causal genes and virtually nothing is known on modularized mechanisms governing their regulatory connections. Two genes responsible for two individual BP QTLs on rat Chromosome 18 have been identified that belong to the same epistatic module. Treacher Collins-Franceschetti syndrome 1 (Tcof1) gene is the only function candidate for C18QTL3. Haloacid dehalogenase like hydrolase domain containing 2 (Hdhd2), although a gene of previously unknown function, is C18QTL4, and encodes a newly identified phosphatase. The current work has provided the premier evidence that Hdhd2/C18QTL4 and Tcof1/C18QTL3 may be involved in polygenic hypertension. Hdhd2/C18QTL4 can regulate the function of Tcof1/C18QTL3 via de-phosphorylation, and, for the first time, furbishes a molecular mechanism in support of a genetically epistatic hierarchy between two BP QTLs, and thus authenticates the epistasis-common pathway paradigm. The pathway initiated by Hdhd2/C18QTL4 upstream of Tcof1/C18QTL3 reveals novel mechanistic insights into BP modulations. Their discovery might yield innovative therapeutic targets and diagnostic tools predicated on a novel BP cause and mechanism that is determined by a regulatory hierarchy. Optimizing the de-phosphorylation capability and its downstream target could be antihypertensive. The conceptual paradigm of an order and regulatory hierarchy may help unravel genetic and molecular relationships among certain human BP QTLs.
Zhang, Wenchao; Dai, Xinbin; Wang, Qishan; Xu, Shizhong; Zhao, Patrick X
2016-05-01
The term epistasis refers to interactions between multiple genetic loci. Genetic epistasis is important in regulating biological function and is considered to explain part of the 'missing heritability,' which involves marginal genetic effects that cannot be accounted for in genome-wide association studies. Thus, the study of epistasis is of great interest to geneticists. However, estimating epistatic effects for quantitative traits is challenging due to the large number of interaction effects that must be estimated, thus significantly increasing computing demands. Here, we present a new web server-based tool, the Pipeline for estimating EPIStatic genetic effects (PEPIS), for analyzing polygenic epistatic effects. The PEPIS software package is based on a new linear mixed model that has been used to predict the performance of hybrid rice. The PEPIS includes two main sub-pipelines: the first for kinship matrix calculation, and the second for polygenic component analyses and genome scanning for main and epistatic effects. To accommodate the demand for high-performance computation, the PEPIS utilizes C/C++ for mathematical matrix computing. In addition, the modules for kinship matrix calculations and main and epistatic-effect genome scanning employ parallel computing technology that effectively utilizes multiple computer nodes across our networked cluster, thus significantly improving the computational speed. For example, when analyzing the same immortalized F2 rice population genotypic data examined in a previous study, the PEPIS returned identical results at each analysis step with the original prototype R code, but the computational time was reduced from more than one month to about five minutes. These advances will help overcome the bottleneck frequently encountered in genome wide epistatic genetic effect analysis and enable accommodation of the high computational demand. The PEPIS is publically available at http://bioinfo.noble.org/PolyGenic_QTL/.
Expression quantitative trait loci: replication, tissue- and sex-specificity in mice.
van Nas, Atila; Ingram-Drake, Leslie; Sinsheimer, Janet S; Wang, Susanna S; Schadt, Eric E; Drake, Thomas; Lusis, Aldons J
2010-07-01
By treating the transcript abundance as a quantitative trait, gene expression can be mapped to local or distant genomic regions relative to the gene encoding the transcript. Local expression quantitative trait loci (eQTL) generally act in cis (that is, control the expression of only the contiguous structural gene), whereas distal eQTL act in trans. Distal eQTL are more difficult to identify with certainty due to the fact that significant thresholds are very high since all regions of the genome must be tested, and confounding factors such as batch effects can produce false positives. Here, we compare findings from two large genetic crosses between mouse strains C3H/HeJ and C57BL/6J to evaluate the reliability of distal eQTL detection, including "hotspots" influencing the expression of multiple genes in trans. We found that >63% of local eQTL and >18% of distal eQTL were replicable at a threshold of LOD > 4.3 between crosses and 76% of local and >24% of distal eQTL at a threshold of LOD > 6. Additionally, at LOD > 4.3 four tissues studied (adipose, brain, liver, and muscle) exhibited >50% preservation of local eQTL and >17% preservation of distal eQTL. We observed replicated distal eQTL hotspots between the crosses on chromosomes 9 and 17. Finally, >69% of local eQTL and >10% of distal eQTL were preserved in most tissues between sexes. We conclude that most local eQTL are highly replicable between mouse crosses, tissues, and sex as compared to distal eQTL, which exhibited modest replicability.
Ijichi, Shinji; Ijichi, Naomi; Ijichi, Yukina; Imamura, Chikako; Sameshima, Hisami; Kawaike, Yoichi; Morioka, Hirofumi
2018-01-01
The continuing prevalence of a highly heritable and hypo-reproductive extreme tail of a human neurobehavioral quantitative diversity suggests the possibility that the reproductive majority retains the genetic mechanism for the extremes. From the perspective of stochastic epistasis, the effect of an epistatic modifier variant can randomly vary in both phenotypic value and effect direction among the careers depending on the genetic individuality, and the modifier careers are ubiquitous in the population distribution. The neutrality of the mean genetic effect in the careers warrants the survival of the variant under selection pressures. Functionally or metabolically related modifier variants make an epistatic network module and dozens of modules may be involved in the phenotype. To assess the significance of stochastic epistasis, a simplified module-based model was employed. The individual repertoire of the modifier variants in a module also participates in the genetic individuality which determines the genetic contribution of each modifier in the career. Because the entire contribution of a module to the phenotypic outcome is consequently unpredictable in the model, the module effect represents the total contribution of the related modifiers as a stochastic unit in the simulations. As a result, the intrinsic compatibility between distributional robustness and quantitative changeability could mathematically be simulated using the model. The artificial normal distribution shape in large-sized simulations was preserved in each generation even if the lowest fitness tail was un-reproductive. The robustness of normality beyond generations is analogous to the real situations of human complex diversity including neurodevelopmental conditions. The repeated regeneration of the un-reproductive extreme tail may be inevitable for the reproductive majority's competence to survive and change, suggesting implications of the extremes for others. Further model-simulations to illustrate how the fitness of extreme individuals can be low through generations may be warranted to increase the credibility of this stochastic epistasis model.
Dobbyn, Amanda; Huckins, Laura M; Boocock, James; Sloofman, Laura G; Glicksberg, Benjamin S; Giambartolomei, Claudia; Hoffman, Gabriel E; Perumal, Thanneer M; Girdhar, Kiran; Jiang, Yan; Raj, Towfique; Ruderfer, Douglas M; Kramer, Robin S; Pinto, Dalila; Akbarian, Schahram; Roussos, Panos; Domenici, Enrico; Devlin, Bernie; Sklar, Pamela; Stahl, Eli A; Sieberts, Solveig K
2018-06-07
Causal genes and variants within genome-wide association study (GWAS) loci can be identified by integrating GWAS statistics with expression quantitative trait loci (eQTL) and determining which variants underlie both GWAS and eQTL signals. Most analyses, however, consider only the marginal eQTL signal, rather than dissect this signal into multiple conditionally independent signals for each gene. Here we show that analyzing conditional eQTL signatures, which could be important under specific cellular or temporal contexts, leads to improved fine mapping of GWAS associations. Using genotypes and gene expression levels from post-mortem human brain samples (n = 467) reported by the CommonMind Consortium (CMC), we find that conditional eQTL are widespread; 63% of genes with primary eQTL also have conditional eQTL. In addition, genomic features associated with conditional eQTL are consistent with context-specific (e.g., tissue-, cell type-, or developmental time point-specific) regulation of gene expression. Integrating the 2014 Psychiatric Genomics Consortium schizophrenia (SCZ) GWAS and CMC primary and conditional eQTL data reveals 40 loci with strong evidence for co-localization (posterior probability > 0.8), including six loci with co-localization of conditional eQTL. Our co-localization analyses support previously reported genes, identify novel genes associated with schizophrenia risk, and provide specific hypotheses for their functional follow-up. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Cross-Population Joint Analysis of eQTLs: Fine Mapping and Functional Annotation
Wen, Xiaoquan; Luca, Francesca; Pique-Regi, Roger
2015-01-01
Mapping expression quantitative trait loci (eQTLs) has been shown as a powerful tool to uncover the genetic underpinnings of many complex traits at molecular level. In this paper, we present an integrative analysis approach that leverages eQTL data collected from multiple population groups. In particular, our approach effectively identifies multiple independent cis-eQTL signals that are consistent across populations, accounting for population heterogeneity in allele frequencies and linkage disequilibrium patterns. Furthermore, by integrating genomic annotations, our analysis framework enables high-resolution functional analysis of eQTLs. We applied our statistical approach to analyze the GEUVADIS data consisting of samples from five population groups. From this analysis, we concluded that i) jointly analysis across population groups greatly improves the power of eQTL discovery and the resolution of fine mapping of causal eQTL ii) many genes harbor multiple independent eQTLs in their cis regions iii) genetic variants that disrupt transcription factor binding are significantly enriched in eQTLs (p-value = 4.93 × 10-22). PMID:25906321
Ju, Jin Hyun; Crystal, Ronald G.
2017-01-01
Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL. PMID:28505156
Ju, Jin Hyun; Shenoy, Sushila A; Crystal, Ronald G; Mezey, Jason G
2017-05-01
Genome-wide expression Quantitative Trait Loci (eQTL) studies in humans have provided numerous insights into the genetics of both gene expression and complex diseases. While the majority of eQTL identified in genome-wide analyses impact a single gene, eQTL that impact many genes are particularly valuable for network modeling and disease analysis. To enable the identification of such broad impact eQTL, we introduce CONFETI: Confounding Factor Estimation Through Independent component analysis. CONFETI is designed to address two conflicting issues when searching for broad impact eQTL: the need to account for non-genetic confounding factors that can lower the power of the analysis or produce broad impact eQTL false positives, and the tendency of methods that account for confounding factors to model broad impact eQTL as non-genetic variation. The key advance of the CONFETI framework is the use of Independent Component Analysis (ICA) to identify variation likely caused by broad impact eQTL when constructing the sample covariance matrix used for the random effect in a mixed model. We show that CONFETI has better performance than other mixed model confounding factor methods when considering broad impact eQTL recovery from synthetic data. We also used the CONFETI framework and these same confounding factor methods to identify eQTL that replicate between matched twin pair datasets in the Multiple Tissue Human Expression Resource (MuTHER), the Depression Genes Networks study (DGN), the Netherlands Study of Depression and Anxiety (NESDA), and multiple tissue types in the Genotype-Tissue Expression (GTEx) consortium. These analyses identified both cis-eQTL and trans-eQTL impacting individual genes, and CONFETI had better or comparable performance to other mixed model confounding factor analysis methods when identifying such eQTL. In these analyses, we were able to identify and replicate a few broad impact eQTL although the overall number was small even when applying CONFETI. In light of these results, we discuss the broad impact eQTL that have been previously reported from the analysis of human data and suggest that considerable caution should be exercised when making biological inferences based on these reported eQTL.
Extensive epistasis for olfactory behaviour, sleep and waking activity in Drosophila melanogaster.
Swarup, Shilpa; Harbison, Susan T; Hahn, Lauren E; Morozova, Tatiana V; Yamamoto, Akihiko; Mackay, Trudy F C; Anholt, Robert R H
2012-02-01
Epistasis is an important feature of the genetic architecture of quantitative traits, but the dynamics of epistatic interactions in natural populations and the relationship between epistasis and pleiotropy remain poorly understood. Here, we studied the effects of epistatic modifiers that segregate in a wild-derived Drosophila melanogaster population on the mutational effects of P-element insertions in Semaphorin-5C (Sema-5c) and Calreticulin (Crc), pleiotropic genes that affect olfactory behaviour and startle behaviour and, in the case of Crc, sleep phenotypes. We introduced Canton-S B (CSB) third chromosomes with or without a P-element insertion at the Crc or Sema-5c locus in multiple wild-derived inbred lines of the Drosophila melanogaster Genetic Reference Panel (DGRP) and assessed the effects of epistasis on the olfactory response to benzaldehyde and, for Crc, also on sleep. In each case, we found substantial epistasis and significant variation in the magnitude of epistasis. The predominant direction of epistatic effects was to suppress the mutant phenotype. These observations support a previous study on startle behaviour using the same D. melanogaster chromosome substitution lines, which concluded that suppressing epistasis may buffer the effects of new mutations. However, epistatic effects are not correlated among the different phenotypes. Thus, suppressing epistasis appears to be a pervasive general feature of natural populations to protect against the effects of new mutations, but different epistatic interactions modulate different phenotypes affected by mutations at the same pleiotropic gene.
Expression quantitative trait loci (eQTL) mapping in Puerto Rican children.
Chen, Wei; Brehm, John M; Lin, Jerome; Wang, Ting; Forno, Erick; Acosta-Pérez, Edna; Boutaoui, Nadia; Canino, Glorisa; Celedón, Juan C
2015-01-01
Expression quantitative trait loci (eQTL) have been identified using tissue or cell samples from diverse human populations, thus enhancing our understanding of regulation of gene expression. However, few studies have attempted to identify eQTL in racially admixed populations such as Hispanics. We performed a systematic eQTL study to identify regulatory variants of gene expression in whole blood from 121 Puerto Rican children with (n = 63) and without (n = 58) asthma. Genome-wide genotyping was conducted using the Illumina Omni2.5M Bead Chip, and gene expression was assessed using the Illumina HT-12 microarray. After completing quality control, we performed a pair-wise genome analysis of ~15 K transcripts and ~1.3 M SNPs for both local and distal effects. This analysis was conducted under a regression framework adjusting for age, gender and principal components derived from both genotypic and mRNA data. We used a false discovery rate (FDR) approach to identify significant eQTL signals, which were next compared to top eQTL signals from existing eQTL databases. We then performed a pathway analysis for our top genes. We identified 36,720 local pairs in 3,391 unique genes and 1,851 distal pairs in 446 unique genes at FDR <0.05, corresponding to unadjusted P values lower than 1.5x10-4 and 4.5x10-9, respectively. A significant proportion of genes identified in our study overlapped with those identified in previous studies. We also found an enrichment of disease-related genes in our eQTL list. We present results from the first eQTL study in Puerto Rican children, who are members of a unique Hispanic cohort disproportionately affected with asthma, prematurity, obesity and other common diseases. Our study confirmed eQTL signals identified in other ethnic groups, while also detecting additional eQTLs unique to our study population. The identified eQTLs will help prioritize findings from future genome-wide association studies in Puerto Ricans.
A Gene Module-Based eQTL Analysis Prioritizing Disease Genes and Pathways in Kidney Cancer.
Yang, Mary Qu; Li, Dan; Yang, William; Zhang, Yifan; Liu, Jun; Tong, Weida
2017-01-01
Clear cell renal cell carcinoma (ccRCC) is the most common and most aggressive form of renal cell cancer (RCC). The incidence of RCC has increased steadily in recent years. The pathogenesis of renal cell cancer remains poorly understood. Many of the tumor suppressor genes, oncogenes, and dysregulated pathways in ccRCC need to be revealed for improvement of the overall clinical outlook of the disease. Here, we developed a systems biology approach to prioritize the somatic mutated genes that lead to dysregulation of pathways in ccRCC. The method integrated multi-layer information to infer causative mutations and disease genes. First, we identified differential gene modules in ccRCC by coupling transcriptome and protein-protein interactions. Each of these modules consisted of interacting genes that were involved in similar biological processes and their combined expression alterations were significantly associated with disease type. Then, subsequent gene module-based eQTL analysis revealed somatic mutated genes that had driven the expression alterations of differential gene modules. Our study yielded a list of candidate disease genes, including several known ccRCC causative genes such as BAP1 and PBRM1 , as well as novel genes such as NOD2, RRM1, CSRNP1, SLC4A2, TTLL1 and CNTN1. The differential gene modules and their driver genes revealed by our study provided a new perspective for understanding the molecular mechanisms underlying the disease. Moreover, we validated the results in independent ccRCC patient datasets. Our study provided a new method for prioritizing disease genes and pathways.
Peters, James E.; Lyons, Paul A.; Lee, James C.; Richard, Arianne C.; Fortune, Mary D.; Newcombe, Paul J.; Richardson, Sylvia; Smith, Kenneth G. C.
2016-01-01
Genome-wide association studies (GWAS) have transformed our understanding of the genetics of complex traits such as autoimmune diseases, but how risk variants contribute to pathogenesis remains largely unknown. Identifying genetic variants that affect gene expression (expression quantitative trait loci, or eQTLs) is crucial to addressing this. eQTLs vary between tissues and following in vitro cellular activation, but have not been examined in the context of human inflammatory diseases. We performed eQTL mapping in five primary immune cell types from patients with active inflammatory bowel disease (n = 91), anti-neutrophil cytoplasmic antibody-associated vasculitis (n = 46) and healthy controls (n = 43), revealing eQTLs present only in the context of active inflammatory disease. Moreover, we show that following treatment a proportion of these eQTLs disappear. Through joint analysis of expression data from multiple cell types, we reveal that previous estimates of eQTL immune cell-type specificity are likely to have been exaggerated. Finally, by analysing gene expression data from multiple cell types, we find eQTLs not previously identified by database mining at 34 inflammatory bowel disease-associated loci. In summary, this parallel eQTL analysis in multiple leucocyte subsets from patients with active disease provides new insights into the genetic basis of immune-mediated diseases. PMID:27015630
Choi, Ted; Eskin, Eleazar
2013-01-01
Gene expression data, in conjunction with information on genetic variants, have enabled studies to identify expression quantitative trait loci (eQTLs) or polymorphic locations in the genome that are associated with expression levels. Moreover, recent technological developments and cost decreases have further enabled studies to collect expression data in multiple tissues. One advantage of multiple tissue datasets is that studies can combine results from different tissues to identify eQTLs more accurately than examining each tissue separately. The idea of aggregating results of multiple tissues is closely related to the idea of meta-analysis which aggregates results of multiple genome-wide association studies to improve the power to detect associations. In principle, meta-analysis methods can be used to combine results from multiple tissues. However, eQTLs may have effects in only a single tissue, in all tissues, or in a subset of tissues with possibly different effect sizes. This heterogeneity in terms of effects across multiple tissues presents a key challenge to detect eQTLs. In this paper, we develop a framework that leverages two popular meta-analysis methods that address effect size heterogeneity to detect eQTLs across multiple tissues. We show by using simulations and multiple tissue data from mouse that our approach detects many eQTLs undetected by traditional eQTL methods. Additionally, our method provides an interpretation framework that accurately predicts whether an eQTL has an effect in a particular tissue. PMID:23785294
Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression.
Fairfax, Benjamin P; Humburg, Peter; Makino, Seiko; Naranbhai, Vivek; Wong, Daniel; Lau, Evelyn; Jostins, Luke; Plant, Katharine; Andrews, Robert; McGee, Chris; Knight, Julian C
2014-03-07
To systematically investigate the impact of immune stimulation upon regulatory variant activity, we exposed primary monocytes from 432 healthy Europeans to interferon-γ (IFN-γ) or differing durations of lipopolysaccharide and mapped expression quantitative trait loci (eQTLs). More than half of cis-eQTLs identified, involving hundreds of genes and associated pathways, are detected specifically in stimulated monocytes. Induced innate immune activity reveals multiple master regulatory trans-eQTLs including the major histocompatibility complex (MHC), coding variants altering enzyme and receptor function, an IFN-β cytokine network showing temporal specificity, and an interferon regulatory factor 2 (IRF2) transcription factor-modulated network. Induced eQTL are significantly enriched for genome-wide association study loci, identifying context-specific associations to putative causal genes including CARD9, ATM, and IRF8. Thus, applying pathophysiologically relevant immune stimuli assists resolution of functional genetic variants.
eQTL Mapping Using RNA-seq Data
Hu, Yijuan
2012-01-01
As RNA-seq is replacing gene expression microarrays to assess genome-wide transcription abundance, gene expression Quantitative Trait Locus (eQTL) studies using RNA-seq have emerged. RNA-seq delivers two novel features that are important for eQTL studies. First, it provides information on allele-specific expression (ASE), which is not available from gene expression microarrays. Second, it generates unprecedentedly rich data to study RNA-isoform expression. In this paper, we review current methods for eQTL mapping using ASE and discuss some future directions. We also review existing works that use RNA-seq data to study RNA-isoform expression and we discuss the gaps between these works and isoform-specific eQTL mapping. PMID:23667399
Mapping eQTLs in the Norfolk Island Genetic Isolate Identifies Candidate Genes for CVD Risk Traits
Benton, Miles C.; Lea, Rod A.; Macartney-Coxson, Donia; Carless, Melanie A.; Göring, Harald H.; Bellis, Claire; Hanna, Michelle; Eccles, David; Chambers, Geoffrey K.; Curran, Joanne E.; Harper, Jacquie L.; Blangero, John; Griffiths, Lyn R.
2013-01-01
Cardiovascular disease (CVD) affects millions of people worldwide and is influenced by numerous factors, including lifestyle and genetics. Expression quantitative trait loci (eQTLs) influence gene expression and are good candidates for CVD risk. Founder-effect pedigrees can provide additional power to map genes associated with disease risk. Therefore, we identified eQTLs in the genetic isolate of Norfolk Island (NI) and tested for associations between these and CVD risk factors. We measured genome-wide transcript levels of blood lymphocytes in 330 individuals and used pedigree-based heritability analysis to identify heritable transcripts. eQTLs were identified by genome-wide association testing of these transcripts. Testing for association between CVD risk factors (i.e., blood lipids, blood pressure, and body fat indices) and eQTLs revealed 1,712 heritable transcripts (p < 0.05) with heritability values ranging from 0.18 to 0.84. From these, we identified 200 cis-acting and 70 trans-acting eQTLs (p < 1.84 × 10−7) An eQTL-centric analysis of CVD risk traits revealed multiple associations, including 12 previously associated with CVD-related traits. Trait versus eQTL regression modeling identified four CVD risk candidates (NAAA, PAPSS1, NME1, and PRDX1), all of which have known biological roles in disease. In addition, we implicated several genes previously associated with CVD risk traits, including MTHFR and FN3KRP. We have successfully identified a panel of eQTLs in the NI pedigree and used this to implicate several genes in CVD risk. Future studies are required for further assessing the functional importance of these eQTLs and whether the findings here also relate to outbred populations. PMID:24314549
Wang, Xiaoliang; Gawrieh, Samer; Gamazon, Eric R.; Athinarayanan, Shaminie; Liu, Yang-Lin; Darlay, Rebecca; Cordell, Heather J; Daly, Ann K
2017-01-01
The increased expression of PNPLA3148M leads to hepatosteatosis in mice. This study aims to investigate the genetic control of hepatic PNPLA3 transcription and to explore its impact on NAFLD risk in humans. Through a locus-wide expression quantitative trait loci (eQTL) mapping in two human liver sample sets, a PNPLA3 intronic SNP, rs139051 A>G was identified as a significant eQTL (p = 6.6×10−8) influencing PNPLA3 transcription, with the A allele significantly associated with increased PNPLA3 mRNA. An electrophoresis mobility shift assay further demonstrated that the A allele has enhanced affinity to nuclear proteins than the G allele. The impact of this eQTL on NAFLD risk was further tested in three independent populations. We found that rs139051 did not independently affect the NAFLD risk, whilst rs738409 did not significantly modulate PNPLA3 transcription but was associated with NAFLD risk. The A-G haplotype associated with higher transcription of the disease-risk rs738409 G allele conferred similar risk for NAFLD compared to the G-G haplotype that possesses a lower transcription level. Our study suggests that the pathogenic role of PNPLA3148M in NAFLD is independent of the gene transcription in humans, which may be attributed to the high endogenous transcription level of PNPLA3 gene in human livers. PMID:27744419
Liu, Wanqing; Anstee, Quentin M; Wang, Xiaoliang; Gawrieh, Samer; Gamazon, Eric R; Athinarayanan, Shaminie; Liu, Yang-Lin; Darlay, Rebecca; Cordell, Heather J; Daly, Ann K; Day, Chris P; Chalasani, Naga
2016-10-13
The increased expression of PNPLA3 148M leads to hepatosteatosis in mice. This study aims to investigate the genetic control of hepatic PNPLA3 transcription and to explore its impact on NAFLD risk in humans. Through a locus-wide expression quantitative trait loci (eQTL) mapping in two human liver sample sets, a PNPLA3 intronic SNP, rs139051 A>G was identified as a significant eQTL ( p = 6.6×10 -8 ) influencing PNPLA3 transcription, with the A allele significantly associated with increased PNPLA3 mRNA. An electrophoresis mobility shift assay further demonstrated that the A allele has enhanced affinity to nuclear proteins than the G allele. The impact of this eQTL on NAFLD risk was further tested in three independent populations. We found that rs139051 did not independently affect the NAFLD risk, whilst rs738409 did not significantly modulate PNPLA3 transcription but was associated with NAFLD risk. The A-G haplotype associated with higher transcription of the disease-risk rs738409 G allele conferred similar risk for NAFLD compared to the G-G haplotype that possesses a lower transcription level. Our study suggests that the pathogenic role of PNPLA3 148M in NAFLD is independent of the gene transcription in humans, which may be attributed to the high endogenous transcription level of PNPLA3 gene in human livers.
LSCC SNP variant regulates SOX2 modulation of VDAC3.
Chyr, Jacqueline; Guo, Dongmin; Zhou, Xiaobo
2018-04-27
Lung squamous cell carcinoma (LSCC) is a genomically complex malignancy with no effective treatments. Recent studies have found a large number of DNA alterations such as SOX2 amplification in LSCC patients. As a stem cell transcription factor, SOX2 is important for the maintenance of pluripotent cells and may play a role in cancer. To study the downstream mechanisms of SOX2, we employed expression quantitative trait loci (eQTLs) technology to investigate how the presence of SOX2 affects the expression of target genes. We discovered unique eQTLs, such as rs798827-VDAC3 (FDR p -value = 0.0034), that are only found in SOX2-active patients but not in SOX2-inactive patients. SNP rs798827 is within strong linkage disequilibrium ( r 2 = 1) to rs58163073, where rs58163073 [T] allele increases the binding affinity of SOX2 and allele [TA] decreases it. In our analysis, SOX2 silencing downregulates VDAC3 in two LSCC cell lines. Chromatin conformation capturing data indicates that this SNP is located within the same Topologically Associating Domain (TAD) of VDAC3, further suggesting SOX2's role in the regulation of VDAC3 through the binding of rs58163073. By first subgrouping patients based on SOX2 activity, we made more relevant eQTL discoveries and our analysis can be applied to other diseases.
Veyrieras, Jean-Baptiste; Gaffney, Daniel J.; Pickrell, Joseph K.; Gilad, Yoav; Stephens, Matthew; Pritchard, Jonathan K.
2012-01-01
Mapping of expression quantitative trait loci (eQTLs) is an important technique for studying how genetic variation affects gene regulation in natural populations. In a previous study using Illumina expression data from human lymphoblastoid cell lines, we reported that cis-eQTLs are especially enriched around transcription start sites (TSSs) and immediately upstream of transcription end sites (TESs). In this paper, we revisit the distribution of eQTLs using additional data from Affymetrix exon arrays and from RNA sequencing. We confirm that most eQTLs lie close to the target genes; that transcribed regions are generally enriched for eQTLs; that eQTLs are more abundant in exons than introns; and that the peak density of eQTLs occurs at the TSS. However, we find that the intriguing TES peak is greatly reduced or absent in the Affymetrix and RNA-seq data. Instead our data suggest that the TES peak observed in the Illumina data is mainly due to exon-specific QTLs that affect 3′ untranslated regions, where most of the Illumina probes are positioned. Nonetheless, we do observe an overall enrichment of eQTLs in exons versus introns in all three data sets, consistent with an important role for exonic sequences in gene regulation. PMID:22359548
Verta, Jukka-Pekka; Landry, Christian R; MacKay, John
2016-07-01
Regulation of gene expression plays a central role in translating genotypic variation into phenotypic variation. Dissection of the genetic basis of expression variation is key to understanding how expression regulation evolves. Such analyses remain challenging in contexts where organisms are outbreeding, highly heterozygous and long-lived such as in the case of conifer trees. We developed an RNA sequencing (RNA-seq)-based approach for both expression-quantitative trait locus (eQTL) mapping and the detection of cis-acting (allele-specific) vs trans-acting (non-allele-specific) eQTLs. This method can be potentially applied to many conifers. We used haploid and diploid meiotic seed tissues of a single self-fertilized white spruce (Picea glauca) individual to dissect eQTLs according to linkage and allele specificity. The genetic architecture of local eQTLs linked to the expressed genes was particularly complex, consisting of cis-acting, trans-acting and, surprisingly, compensatory cis-trans effects. These compensatory effects influence expression in opposite directions and are neutral when combined in homozygotes. Nearly half of local eQTLs were under compensation, indicating that close linkage between compensatory cis-trans factors is common in spruce. Compensated genes were overrepresented in developmental and cell organization functions. Our haploid-diploid eQTL analysis in spruce revealed that compensatory cis-trans eQTLs segregate within populations and evolve in close genetic linkage. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
The genetic architecture of gene expression levels in wild baboons.
Tung, Jenny; Zhou, Xiang; Alberts, Susan C; Stephens, Matthew; Gilad, Yoav
2015-02-25
Primate evolution has been argued to result, in part, from changes in how genes are regulated. However, we still know little about gene regulation in natural primate populations. We conducted an RNA sequencing (RNA-seq)-based study of baboons from an intensively studied wild population. We performed complementary expression quantitative trait locus (eQTL) mapping and allele-specific expression analyses, discovering substantial evidence for, and surprising power to detect, genetic effects on gene expression levels in the baboons. eQTL were most likely to be identified for lineage-specific, rapidly evolving genes; interestingly, genes with eQTL significantly overlapped between baboons and a comparable human eQTL data set. Our results suggest that genes vary in their tolerance of genetic perturbation, and that this property may be conserved across species. Further, they establish the feasibility of eQTL mapping using RNA-seq data alone, and represent an important step towards understanding the genetic architecture of gene expression in primates.
The genetic architecture of gene expression levels in wild baboons
Tung, Jenny; Zhou, Xiang; Alberts, Susan C; Stephens, Matthew; Gilad, Yoav
2015-01-01
Primate evolution has been argued to result, in part, from changes in how genes are regulated. However, we still know little about gene regulation in natural primate populations. We conducted an RNA sequencing (RNA-seq)-based study of baboons from an intensively studied wild population. We performed complementary expression quantitative trait locus (eQTL) mapping and allele-specific expression analyses, discovering substantial evidence for, and surprising power to detect, genetic effects on gene expression levels in the baboons. eQTL were most likely to be identified for lineage-specific, rapidly evolving genes; interestingly, genes with eQTL significantly overlapped between baboons and a comparable human eQTL data set. Our results suggest that genes vary in their tolerance of genetic perturbation, and that this property may be conserved across species. Further, they establish the feasibility of eQTL mapping using RNA-seq data alone, and represent an important step towards understanding the genetic architecture of gene expression in primates. DOI: http://dx.doi.org/10.7554/eLife.04729.001 PMID:25714927
Cross-Tissue and Tissue-Specific eQTLs: Partitioning the Heritability of a Complex Trait
Torres, Jason M.; Gamazon, Eric R.; Parra, Esteban J.; Below, Jennifer E.; Valladares-Salgado, Adan; Wacher, Niels; Cruz, Miguel; Hanis, Craig L.; Cox, Nancy J.
2014-01-01
Top signals from genome-wide association studies (GWASs) of type 2 diabetes (T2D) are enriched with expression quantitative trait loci (eQTLs) identified in skeletal muscle and adipose tissue. We therefore hypothesized that such eQTLs might account for a disproportionate share of the heritability estimated from all SNPs interrogated through GWASs. To test this hypothesis, we applied linear mixed models to the Wellcome Trust Case Control Consortium (WTCCC) T2D data set and to data sets representing Mexican Americans from Starr County, TX, and Mexicans from Mexico City. We estimated the proportion of phenotypic variance attributable to the additive effect of all variants interrogated in these GWASs, as well as a much smaller set of variants identified as eQTLs in human adipose tissue, skeletal muscle, and lymphoblastoid cell lines. The narrow-sense heritability explained by all interrogated SNPs in each of these data sets was substantially greater than the heritability accounted for by genome-wide-significant SNPs (∼10%); GWAS SNPs explained over 50% of phenotypic variance in the WTCCC, Starr County, and Mexico City data sets. The estimate of heritability attributable to cross-tissue eQTLs was greater in the WTCCC data set and among lean Hispanics, whereas adipose eQTLs significantly explained heritability among Hispanics with a body mass index ≥ 30. These results support an important role for regulatory variants in the genetic component of T2D susceptibility, particularly for eQTLs that elicit effects across insulin-responsive peripheral tissues. PMID:25439722
Functional regression method for whole genome eQTL epistasis analysis with sequencing data.
Xu, Kelin; Jin, Li; Xiong, Momiao
2017-05-18
Epistasis plays an essential rule in understanding the regulation mechanisms and is an essential component of the genetic architecture of the gene expressions. However, interaction analysis of gene expressions remains fundamentally unexplored due to great computational challenges and data availability. Due to variation in splicing, transcription start sites, polyadenylation sites, post-transcriptional RNA editing across the entire gene, and transcription rates of the cells, RNA-seq measurements generate large expression variability and collectively create the observed position level read count curves. A single number for measuring gene expression which is widely used for microarray measured gene expression analysis is highly unlikely to sufficiently account for large expression variation across the gene. Simultaneously analyzing epistatic architecture using the RNA-seq and whole genome sequencing (WGS) data poses enormous challenges. We develop a nonlinear functional regression model (FRGM) with functional responses where the position-level read counts within a gene are taken as a function of genomic position, and functional predictors where genotype profiles are viewed as a function of genomic position, for epistasis analysis with RNA-seq data. Instead of testing the interaction of all possible pair-wises SNPs, the FRGM takes a gene as a basic unit for epistasis analysis, which tests for the interaction of all possible pairs of genes and use all the information that can be accessed to collectively test interaction between all possible pairs of SNPs within two genome regions. By large-scale simulations, we demonstrate that the proposed FRGM for epistasis analysis can achieve the correct type 1 error and has higher power to detect the interactions between genes than the existing methods. The proposed methods are applied to the RNA-seq and WGS data from the 1000 Genome Project. The numbers of pairs of significantly interacting genes after Bonferroni correction identified using FRGM, RPKM and DESeq were 16,2361, 260 and 51, respectively, from the 350 European samples. The proposed FRGM for epistasis analysis of RNA-seq can capture isoform and position-level information and will have a broad application. Both simulations and real data analysis highlight the potential for the FRGM to be a good choice of the epistatic analysis with sequencing data.
Genetics and Genomics of Social Behavior in a Chicken Model.
Johnsson, Martin; Henriksen, Rie; Fogelholm, Jesper; Höglund, Andrey; Jensen, Per; Wright, Dominic
2018-05-01
The identification of genes affecting sociality can give insights into the maintenance and development of sociality and personality. In this study, we used the combination of an advanced intercross between wild and domestic chickens with a combined QTL and eQTL genetical genomics approach to identify genes for social reinstatement, a social and anxiety-related behavior. A total of 24 social reinstatement QTL were identified and overlaid with over 600 eQTL obtained from the same birds using hypothalamic tissue. Correlations between overlapping QTL and eQTL indicated five strong candidate genes, with the gene TTRAP being strongly significantly correlated with multiple aspects of social reinstatement behavior, as well as possessing a highly significant eQTL. Copyright © 2018 by the Genetics Society of America.
Peroxiredoxin 5 modulates immune response in Drosophila
Radyuk, Svetlana N.; Michalak, Katarzyna; Klichko, Vladimir I.; Benes, Judith; Orr, William C.
2010-01-01
Background Peroxiredoxins are redox-sensing enzymes with multiple cellular functions. Previously, we reported on the potent antioxidant function of Drosophila peroxiredoxin 5 (dPrx5). Studies with mammalian and human cells suggest that peroxiredoxins can modulate immune-related signaling. Methods Survivorship studies and bacteriological analysis were used to determine resistance of flies to fungal and bacterial infections. RT-PCR and immunoblot analyses determined expression of dPrx5 and immunity factors in response to bacterial challenge. Double mutants for dprx5 gene and genes comprising the Imd/Relish and dTak1/Basket branches of the immune signaling pathways were used in epistatic analysis. Results The dprx5 mutant flies were more resistant to bacterial infection than controls, while flies overexpressing dPrx5 were more susceptible. The enhanced resistance to bacteria was accompanied by rapid induction of the Imd-dependent antimicrobial peptides, phosphorylation of the JNK kinase Basket and altered transcriptional profiling of the transient response genes, puckered, ets21C and relish, while the opposite effects were observed in flies over-expressing dPrx5. Epistatic analysis of double mutants, using attacin D and Puckered as read outs of activation of the Imd and JNK pathways, implicated dPrx5 function in the control of the dTak1-JNK arm of immune signaling. Conclusions Differential effects on fly survivorship suggested a trade-off between the antioxidant and immune functions of dPrx5. Molecular and epistatic analyses identified dPrx5 as a negative regulator in the dTak1-JNK arm of immune signaling. General significance Our findings suggest that peroxiredoxins play an important modulatory role in the Drosophila immune response. PMID:20600624
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-01-01
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. PMID:26019233
Yang, Fan; Wang, Jiebiao; Pierce, Brandon L; Chen, Lin S
2017-11-01
The impact of inherited genetic variation on gene expression in humans is well-established. The majority of known expression quantitative trait loci (eQTLs) impact expression of local genes ( cis -eQTLs). More research is needed to identify effects of genetic variation on distant genes ( trans -eQTLs) and understand their biological mechanisms. One common trans -eQTLs mechanism is "mediation" by a local ( cis ) transcript. Thus, mediation analysis can be applied to genome-wide SNP and expression data in order to identify transcripts that are " cis -mediators" of trans -eQTLs, including those " cis -hubs" involved in regulation of many trans -genes. Identifying such mediators helps us understand regulatory networks and suggests biological mechanisms underlying trans -eQTLs, both of which are relevant for understanding susceptibility to complex diseases. The multitissue expression data from the Genotype-Tissue Expression (GTEx) program provides a unique opportunity to study cis -mediation across human tissue types. However, the presence of complex hidden confounding effects in biological systems can make mediation analyses challenging and prone to confounding bias, particularly when conducted among diverse samples. To address this problem, we propose a new method: Genomic Mediation analysis with Adaptive Confounding adjustment (GMAC). It enables the search of a very large pool of variables, and adaptively selects potential confounding variables for each mediation test. Analyses of simulated data and GTEx data demonstrate that the adaptive selection of confounders by GMAC improves the power and precision of mediation analysis. Application of GMAC to GTEx data provides new insights into the observed patterns of cis -hubs and trans -eQTL regulation across tissue types. © 2017 Yang et al.; Published by Cold Spring Harbor Laboratory Press.
Durbin, Richard; Winn, John
2010-01-01
Gene expression measurements are influenced by a wide range of factors, such as the state of the cell, experimental conditions and variants in the sequence of regulatory regions. To understand the effect of a variable of interest, such as the genotype of a locus, it is important to account for variation that is due to confounding causes. Here, we present VBQTL, a probabilistic approach for mapping expression quantitative trait loci (eQTLs) that jointly models contributions from genotype as well as known and hidden confounding factors. VBQTL is implemented within an efficient and flexible inference framework, making it fast and tractable on large-scale problems. We compare the performance of VBQTL with alternative methods for dealing with confounding variability on eQTL mapping datasets from simulations, yeast, mouse, and human. Employing Bayesian complexity control and joint modelling is shown to result in more precise estimates of the contribution of different confounding factors resulting in additional associations to measured transcript levels compared to alternative approaches. We present a threefold larger collection of cis eQTLs than previously found in a whole-genome eQTL scan of an outbred human population. Altogether, 27% of the tested probes show a significant genetic association in cis, and we validate that the additional eQTLs are likely to be real by replicating them in different sets of individuals. Our method is the next step in the analysis of high-dimensional phenotype data, and its application has revealed insights into genetic regulation of gene expression by demonstrating more abundant cis-acting eQTLs in human than previously shown. Our software is freely available online at http://www.sanger.ac.uk/resources/software/peer/. PMID:20463871
Robust Linear Models for Cis-eQTL Analysis.
Rantalainen, Mattias; Lindgren, Cecilia M; Holmes, Christopher C
2015-01-01
Expression Quantitative Trait Loci (eQTL) analysis enables characterisation of functional genetic variation influencing expression levels of individual genes. In outbread populations, including humans, eQTLs are commonly analysed using the conventional linear model, adjusting for relevant covariates, assuming an allelic dosage model and a Gaussian error term. However, gene expression data generally have noise that induces heavy-tailed errors relative to the Gaussian distribution and often include atypical observations, or outliers. Such departures from modelling assumptions can lead to an increased rate of type II errors (false negatives), and to some extent also type I errors (false positives). Careful model checking can reduce the risk of type-I errors but often not type II errors, since it is generally too time-consuming to carefully check all models with a non-significant effect in large-scale and genome-wide studies. Here we propose the application of a robust linear model for eQTL analysis to reduce adverse effects of deviations from the assumption of Gaussian residuals. We present results from a simulation study as well as results from the analysis of real eQTL data sets. Our findings suggest that in many situations robust models have the potential to provide more reliable eQTL results compared to conventional linear models, particularly in respect to reducing type II errors due to non-Gaussian noise. Post-genomic data, such as that generated in genome-wide eQTL studies, are often noisy and frequently contain atypical observations. Robust statistical models have the potential to provide more reliable results and increased statistical power under non-Gaussian conditions. The results presented here suggest that robust models should be considered routinely alongside other commonly used methodologies for eQTL analysis.
Lowry, David B.; Logan, Tierney L.; Santuari, Luca; Hardtke, Christian S.; Richards, James H.; DeRose-Wilson, Leah J.; McKay, John K.; Sen, Saunak; Juenger, Thomas E.
2013-01-01
The regulation of gene expression is crucial for an organism’s development and response to stress, and an understanding of the evolution of gene expression is of fundamental importance to basic and applied biology. To improve this understanding, we conducted expression quantitative trait locus (eQTL) mapping in the Tsu-1 (Tsushima, Japan) × Kas-1 (Kashmir, India) recombinant inbred line population of Arabidopsis thaliana across soil drying treatments. We then used genome resequencing data to evaluate whether genomic features (promoter polymorphism, recombination rate, gene length, and gene density) are associated with genes responding to the environment (E) or with genes with genetic variation (G) in gene expression in the form of eQTLs. We identified thousands of genes that responded to soil drying and hundreds of main-effect eQTLs. However, we identified very few statistically significant eQTLs that interacted with the soil drying treatment (GxE eQTL). Analysis of genome resequencing data revealed associations of several genomic features with G and E genes. In general, E genes had lower promoter diversity and local recombination rates. By contrast, genes with eQTLs (G) had significantly greater promoter diversity and were located in genomic regions with higher recombination. These results suggest that genomic architecture may play an important a role in the evolution of gene expression. PMID:24045022
Identification and Correction of Sample Mix-Ups in Expression Genetic Data: A Case Study
Broman, Karl W.; Keller, Mark P.; Broman, Aimee Teo; Kendziorski, Christina; Yandell, Brian S.; Sen, Śaunak; Attie, Alan D.
2015-01-01
In a mouse intercross with more than 500 animals and genome-wide gene expression data on six tissues, we identified a high proportion (18%) of sample mix-ups in the genotype data. Local expression quantitative trait loci (eQTL; genetic loci influencing gene expression) with extremely large effect were used to form a classifier to predict an individual’s eQTL genotype based on expression data alone. By considering multiple eQTL and their related transcripts, we identified numerous individuals whose predicted eQTL genotypes (based on their expression data) did not match their observed genotypes, and then went on to identify other individuals whose genotypes did match the predicted eQTL genotypes. The concordance of predictions across six tissues indicated that the problem was due to mix-ups in the genotypes (although we further identified a small number of sample mix-ups in each of the six panels of gene expression microarrays). Consideration of the plate positions of the DNA samples indicated a number of off-by-one and off-by-two errors, likely the result of pipetting errors. Such sample mix-ups can be a problem in any genetic study, but eQTL data allow us to identify, and even correct, such problems. Our methods have been implemented in an R package, R/lineup. PMID:26290572
Identification and Correction of Sample Mix-Ups in Expression Genetic Data: A Case Study.
Broman, Karl W; Keller, Mark P; Broman, Aimee Teo; Kendziorski, Christina; Yandell, Brian S; Sen, Śaunak; Attie, Alan D
2015-08-19
In a mouse intercross with more than 500 animals and genome-wide gene expression data on six tissues, we identified a high proportion (18%) of sample mix-ups in the genotype data. Local expression quantitative trait loci (eQTL; genetic loci influencing gene expression) with extremely large effect were used to form a classifier to predict an individual's eQTL genotype based on expression data alone. By considering multiple eQTL and their related transcripts, we identified numerous individuals whose predicted eQTL genotypes (based on their expression data) did not match their observed genotypes, and then went on to identify other individuals whose genotypes did match the predicted eQTL genotypes. The concordance of predictions across six tissues indicated that the problem was due to mix-ups in the genotypes (although we further identified a small number of sample mix-ups in each of the six panels of gene expression microarrays). Consideration of the plate positions of the DNA samples indicated a number of off-by-one and off-by-two errors, likely the result of pipetting errors. Such sample mix-ups can be a problem in any genetic study, but eQTL data allow us to identify, and even correct, such problems. Our methods have been implemented in an R package, R/lineup. Copyright © 2015 Broman et al.
An erythroid-specific ATP2B4 enhancer mediates red blood cell hydration and malaria susceptibility
Lessard, Samuel; Gatof, Emily Stern; Schupp, Patrick G.; Sher, Falak; Ali, Adnan; Prehar, Sukhpal; Kurita, Ryo; Nakamura, Yukio; Baena, Esther; Oceandy, Delvac; Bauer, Daniel E.
2017-01-01
The lack of mechanistic explanations for many genotype-phenotype associations identified by GWAS precludes thorough assessment of their impact on human health. Here, we conducted an expression quantitative trait locus (eQTL) mapping analysis in erythroblasts and found erythroid-specific eQTLs for ATP2B4, the main calcium ATPase of red blood cells (rbc). The same SNPs were previously associated with mean corpuscular hemoglobin concentration (MCHC) and susceptibility to severe malaria infection. We showed that Atp2b4–/– mice demonstrate increased MCHC, confirming ATP2B4 as the causal gene at this GWAS locus. Using CRISPR-Cas9, we fine mapped the genetic signal to an erythroid-specific enhancer of ATP2B4. Erythroid cells with a deletion of the ATP2B4 enhancer had abnormally high intracellular calcium levels. These results illustrate the power of combined transcriptomic, epigenomic, and genome-editing approaches in characterizing noncoding regulatory elements in phenotype-relevant cells. Our study supports ATP2B4 as a potential target for modulating rbc hydration in erythroid disorders and malaria infection. PMID:28714864
An erythroid-specific ATP2B4 enhancer mediates red blood cell hydration and malaria susceptibility.
Lessard, Samuel; Gatof, Emily Stern; Beaudoin, Mélissa; Schupp, Patrick G; Sher, Falak; Ali, Adnan; Prehar, Sukhpal; Kurita, Ryo; Nakamura, Yukio; Baena, Esther; Ledoux, Jonathan; Oceandy, Delvac; Bauer, Daniel E; Lettre, Guillaume
2017-08-01
The lack of mechanistic explanations for many genotype-phenotype associations identified by GWAS precludes thorough assessment of their impact on human health. Here, we conducted an expression quantitative trait locus (eQTL) mapping analysis in erythroblasts and found erythroid-specific eQTLs for ATP2B4, the main calcium ATPase of red blood cells (rbc). The same SNPs were previously associated with mean corpuscular hemoglobin concentration (MCHC) and susceptibility to severe malaria infection. We showed that Atp2b4-/- mice demonstrate increased MCHC, confirming ATP2B4 as the causal gene at this GWAS locus. Using CRISPR-Cas9, we fine mapped the genetic signal to an erythroid-specific enhancer of ATP2B4. Erythroid cells with a deletion of the ATP2B4 enhancer had abnormally high intracellular calcium levels. These results illustrate the power of combined transcriptomic, epigenomic, and genome-editing approaches in characterizing noncoding regulatory elements in phenotype-relevant cells. Our study supports ATP2B4 as a potential target for modulating rbc hydration in erythroid disorders and malaria infection.
A powerful approach reveals numerous expression quantitative trait haplotypes in multiple tissues.
Ying, Dingge; Li, Mulin Jun; Sham, Pak Chung; Li, Miaoxin
2018-04-26
Recently many studies showed single nucleotide polymorphisms (SNPs) affect gene expression and contribute to development of complex traits/diseases in a tissue context-dependent manner. However, little is known about haplotype's influence on gene expression and complex traits, which reflects the interaction effect between SNPs. In the present study, we firstly proposed a regulatory region guided eQTL haplotype association analysis approach, and then systematically investigate the expression quantitative trait loci (eQTL) haplotypes in 20 different tissues by the approach. The approach has a powerful design of reducing computational burden by the utilization of regulatory predictions for candidate SNP selection and multiple testing corrections on non-independent haplotypes. The application results in multiple tissues showed that haplotype-based eQTLs not only increased the number of eQTL genes in a tissue specific manner, but were also enriched in loci that associated with complex traits in a tissue-matched manner. In addition, we found that tag SNPs of eQTL haplotypes from whole blood were selectively enriched in certain combination of regulatory elements (e.g. promoters and enhancers) according to predicted chromatin states. In summary, this eQTL haplotype detection approach, together with the application results, shed insights into synergistic effect of sequence variants on gene expression and their susceptibility to complex diseases. The executable application "eHaplo" is implemented in Java and is publicly available at http://grass.cgs.hku.hk/limx/ehaplo/. jonsonfox@gmail.com, limiaoxin@mail.sysu.edu.cn. Supplementary data are available at Bioinformatics online.
Kirsten, Holger; Al-Hasani, Hoor; Holdt, Lesca; Gross, Arnd; Beutner, Frank; Krohn, Knut; Horn, Katrin; Ahnert, Peter; Burkhardt, Ralph; Reiche, Kristin; Hackermüller, Jörg; Löffler, Markus; Teupser, Daniel; Thiery, Joachim; Scholz, Markus
2015-08-15
Genetics of gene expression (eQTLs or expression QTLs) has proved an indispensable tool for understanding biological pathways and pathomechanisms of trait-associated SNPs. However, power of most genome-wide eQTL studies is still limited. We performed a large eQTL study in peripheral blood mononuclear cells of 2112 individuals increasing the power to detect trans-effects genome-wide. Going beyond univariate SNP-transcript associations, we analyse relations of eQTLs to biological pathways, polygenetic effects of expression regulation, trans-clusters and enrichment of co-localized functional elements. We found eQTLs for about 85% of analysed genes, and 18% of genes were trans-regulated. Local eSNPs were enriched up to a distance of 5 Mb to the transcript challenging typically implemented ranges of cis-regulations. Pathway enrichment within regulated genes of GWAS-related eSNPs supported functional relevance of identified eQTLs. We demonstrate that nearest genes of GWAS-SNPs might frequently be misleading functional candidates. We identified novel trans-clusters of potential functional relevance for GWAS-SNPs of several phenotypes including obesity-related traits, HDL-cholesterol levels and haematological phenotypes. We used chromatin immunoprecipitation data for demonstrating biological effects. Yet, we show for strongly heritable transcripts that still little trans-chromosomal heritability is explained by all identified trans-eSNPs; however, our data suggest that most cis-heritability of these transcripts seems explained. Dissection of co-localized functional elements indicated a prominent role of SNPs in loci of pseudogenes and non-coding RNAs for the regulation of coding genes. In summary, our study substantially increases the catalogue of human eQTLs and improves our understanding of the complex genetic regulation of gene expression, pathways and disease-related processes. © The Author 2015. Published by Oxford University Press.
Genome-wide identification of expression quantitative trait loci for human telomerase.
Kim, Hanseol; Ryu, Jihye; Lee, Chaeyoung
2016-10-01
A genome-wide association study was conducted to identify expression quantitative trait loci (eQTL) for human telomerase.We tested the genetic associations of nucleotide variants with expression of the genes encoding human telomerase reverse transcriptase (hTERT) and telomerase RNA components (TERC) in lymphoblastoid cell lines derived from 373 Europeans.Our results revealed 6 eQTLs associated with hTERT (P < 5 × 10). One eQTL (rs17755753) was located in the intron 1 of the gene encoding R-spondin-3 (RSPO3), a well-known Wnt signaling regulator. Transcriptome-wide association analysis for these eQTLs revealed their additional associations with the expression of 29 genes (P < 4.75 × 10), including prickle planar cell polarity protein 2 (PRICKLE2) gene important for the Wnt signaling pathway. This concurs with previous studies in which significant expressional relationships between hTERT and some genes (β-catenin and Wnt-3a) in the Wnt signaling pathway have been observed.This study suggested 6 novel eQTLs for hTERT and the association of hTERT with the Wnt signaling pathway. Further studies are needed to understand their underlying mechanisms to improve our understanding of the role of hTERT in cancer.
kruX: matrix-based non-parametric eQTL discovery.
Qi, Jianlong; Asl, Hassan Foroughi; Björkegren, Johan; Michoel, Tom
2014-01-14
The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitative trait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlying genetic model and expression trait distribution, but testing billions of marker-trait combinations one-by-one can become computationally prohibitive. We developed kruX, an algorithm implemented in Matlab, Python and R that uses matrix multiplications to simultaneously calculate the Kruskal-Wallis test statistic for several millions of marker-trait combinations at once. KruX is more than ten thousand times faster than computing associations one-by-one on a typical human dataset. We used kruX and a dataset of more than 500k SNPs and 20k expression traits measured in 102 human blood samples to compare eQTLs detected by the Kruskal-Wallis test to eQTLs detected by the parametric ANOVA and linear model methods. We found that the Kruskal-Wallis test is more robust against data outliers and heterogeneous genotype group sizes and detects a higher proportion of non-linear associations, but is more conservative for calling additive linear associations. kruX enables the use of robust non-parametric methods for massive eQTL mapping without the need for a high-performance computing infrastructure and is freely available from http://krux.googlecode.com.
Ramakodi, Meganathan P.; Devarajan, Karthik; Blackman, Elizabeth; Gibbs, Denise; Luce, Danièle; Deloumeaux, Jacqueline; Duflo, Suzy; Liu, Jeffrey C.; Mehra, Ranee; Kulathinal, Rob J.; Ragin, Camille C.
2016-01-01
BACKGROUND African-Americans (Afr-Amr) with head and neck squamous cell carcinoma (HNSCC) have a lower survival rate than Caucasians (Cau). This study investigates the functional importance of ancestry-informative SNPs in HNSCC and also examines the effect of functionally important genetic elements on racial disparities in HNSCC survival. METHODS Ancestry-informative SNPs, RNAseq, methylation, and copy number variation data for 316 oral cavity and laryngeal cancer patients were analyzed across 178 DNA repair genes. The results of eQTL analyses were also replicated using a Gene Expression Omnibus (GEO) dataset. The effects of eQTLs on overall survival (OS) and disease-free survival (DFS) were evaluated. RESULTS Five ancestry-related SNPs were identified as cis-eQTLs in the POLB gene (FDR<0.01). The homozygous/ heterozygous genotypes containing the Afr-allele showed higher POLB expression relative to the homozygous Cau-allele genotype (P<0.001). A replication study using a GEO dataset validated all five eQTLs, also showing a statistically significant difference in POLB expression based on genetic ancestry (P=0.002). An association was observed between these eQTLs and OS (P<0.037; FDR<0.0363) as well as DFS of oral cavity and laryngeal cancer patients treated with platinum-based chemotherapy and/or radiotherapy (P=0.018 to 0.0629; FDR<0.079). Genotypes containing the Afr-allele were associated with poor OS/DFS compared to homozygous genotypes harboring the Cau-allele. CONCLUSIONS Our analyses show that ancestry-related alleles could act as eQTLs in HNSCC and support the association of ancestry-related genetic factors with survival disparity in patients diagnosed with oral cavity and laryngeal cancer. PMID:27906459
Morrissey, Catherine; Grieve, Ian C; Heinig, Matthias; Atanur, Santosh; Petretto, Enrico; Pravenec, Michal; Hubner, Norbert; Aitman, Timothy J
2011-11-07
The spontaneously hypertensive rat (SHR) is a widely used rodent model of hypertension and metabolic syndrome. Previously we identified thousands of cis-regulated expression quantitative trait loci (eQTLs) across multiple tissues using a panel of rat recombinant inbred (RI) strains derived from Brown Norway and SHR progenitors. These cis-eQTLs represent potential susceptibility loci underlying physiological and pathophysiological traits manifested in SHR. We have prioritized 60 cis-eQTLs and confirmed differential expression between the parental strains by quantitative PCR in 43 (72%) of the eQTL transcripts. Quantitative trait transcript (QTT) analysis in the RI strains showed highly significant correlation between cis-eQTL transcript abundance and clinically relevant traits such as systolic blood pressure and blood glucose, with the physical location of a subset of the cis-eQTLs colocalizing with "physiological" QTLs (pQTLs) for these same traits. These colocalizing correlated cis-eQTLs (c3-eQTLs) are highly attractive as primary susceptibility loci for the colocalizing pQTLs. Furthermore, sequence analysis of the c3-eQTL genes identified single nucleotide polymorphisms (SNPs) that are predicted to affect transcription factor binding affinity, splicing and protein function. These SNPs, which potentially alter transcript abundance and stability, represent strong candidate factors underlying not just eQTL expression phenotypes, but also the correlated metabolic and physiological traits. In conclusion, by integration of genomic sequence, eQTL and QTT datasets we have identified several genes that are strong positional candidates for pathophysiological traits observed in the SHR strain. These findings provide a basis for the functional testing and ultimate elucidation of the molecular basis of these metabolic and cardiovascular phenotypes.
Genome-wide analysis of the genetic regulation of gene expression in human neutrophils
Andiappan, Anand Kumar; Melchiotti, Rossella; Poh, Tuang Yeow; Nah, Michelle; Puan, Kia Joo; Vigano, Elena; Haase, Doreen; Yusof, Nurhashikin; San Luis, Boris; Lum, Josephine; Kumar, Dilip; Foo, Shihui; Zhuang, Li; Vasudev, Anusha; Irwanto, Astrid; Lee, Bernett; Nardin, Alessandra; Liu, Hong; Zhang, Furen; Connolly, John; Liu, Jianjun; Mortellaro, Alessandra; Wang, De Yun; Poidinger, Michael; Larbi, Anis; Zolezzi, Francesca; Rotzschke, Olaf
2015-01-01
Neutrophils are an abundant immune cell type involved in both antimicrobial defence and autoimmunity. The regulation of their gene expression, however, is still largely unknown. Here we report an eQTL study on isolated neutrophils from 114 healthy individuals of Chinese ethnicity, identifying 21,210 eQTLs on 832 unique genes. Unsupervised clustering analysis of these eQTLs confirms their role in inflammatory responses and immunological diseases but also indicates strong involvement in dermatological pathologies. One of the strongest eQTL identified (rs2058660) is also the tagSNP of a linkage block reported to affect leprosy and Crohn's disease in opposite directions. In a functional study, we can link the C allele with low expression of the β-chain of IL18-receptor (IL18RAP). In neutrophils, this results in a reduced responsiveness to IL-18, detected both on the RNA and protein level. Thus, the polymorphic regulation of human neutrophils can impact beneficial as well as pathological inflammatory responses. PMID:26259071
Fast and robust group-wise eQTL mapping using sparse graphical models.
Cheng, Wei; Shi, Yu; Zhang, Xiang; Wang, Wei
2015-01-16
Genome-wide expression quantitative trait loci (eQTL) studies have emerged as a powerful tool to understand the genetic basis of gene expression and complex traits. The traditional eQTL methods focus on testing the associations between individual single-nucleotide polymorphisms (SNPs) and gene expression traits. A major drawback of this approach is that it cannot model the joint effect of a set of SNPs on a set of genes, which may correspond to hidden biological pathways. We introduce a new approach to identify novel group-wise associations between sets of SNPs and sets of genes. Such associations are captured by hidden variables connecting SNPs and genes. Our model is a linear-Gaussian model and uses two types of hidden variables. One captures the set associations between SNPs and genes, and the other captures confounders. We develop an efficient optimization procedure which makes this approach suitable for large scale studies. Extensive experimental evaluations on both simulated and real datasets demonstrate that the proposed methods can effectively capture both individual and group-wise signals that cannot be identified by the state-of-the-art eQTL mapping methods. Considering group-wise associations significantly improves the accuracy of eQTL mapping, and the successful multi-layer regression model opens a new approach to understand how multiple SNPs interact with each other to jointly affect the expression level of a group of genes.
kruX: matrix-based non-parametric eQTL discovery
2014-01-01
Background The Kruskal-Wallis test is a popular non-parametric statistical test for identifying expression quantitative trait loci (eQTLs) from genome-wide data due to its robustness against variations in the underlying genetic model and expression trait distribution, but testing billions of marker-trait combinations one-by-one can become computationally prohibitive. Results We developed kruX, an algorithm implemented in Matlab, Python and R that uses matrix multiplications to simultaneously calculate the Kruskal-Wallis test statistic for several millions of marker-trait combinations at once. KruX is more than ten thousand times faster than computing associations one-by-one on a typical human dataset. We used kruX and a dataset of more than 500k SNPs and 20k expression traits measured in 102 human blood samples to compare eQTLs detected by the Kruskal-Wallis test to eQTLs detected by the parametric ANOVA and linear model methods. We found that the Kruskal-Wallis test is more robust against data outliers and heterogeneous genotype group sizes and detects a higher proportion of non-linear associations, but is more conservative for calling additive linear associations. Conclusion kruX enables the use of robust non-parametric methods for massive eQTL mapping without the need for a high-performance computing infrastructure and is freely available from http://krux.googlecode.com. PMID:24423115
Qiu, Ying-Hua; Deng, Fei-Yan; Tang, Zai-Xiang; Jiang, Zhen-Huan; Lei, Shu-Feng
2015-10-01
Type 1 diabetes mellitus (type 1 DM) is an autoimmune disease. Although genome-wide association studies (GWAS) and meta-analyses have successfully identified numerous type 1 DM-associated susceptibility loci, the underlying mechanisms for these susceptibility loci are currently largely unclear. Based on publicly available datasets, we performed integrative analyses (i.e., integrated gene relationships among implicated loci, differential gene expression analysis, functional prediction and functional annotation clustering analysis) and combined with expression quantitative trait loci (eQTL) results to further explore function mechanisms underlying the associations between genetic variants and type 1 DM. Among a total of 183 type 1 DM-associated SNPs, eQTL analysis showed that 17 SNPs with cis-regulated eQTL effects on 9 genes. All the 9 eQTL genes enrich in immune-related pathways or Gene Ontology (GO) terms. Functional prediction analysis identified 5 SNPs located in transcription factor (TF) binding sites. Of the 9 eQTL genes, 6 (TAP2, HLA-DOB, HLA-DQB1, HLA-DQA1, HLA-DRB5 and CTSH) were differentially expressed in type 1 DM-associated related cells. Especially, rs3825932 in CTSH has integrative functional evidence supporting the association with type 1 DM. These findings indicated that integrative analyses can yield important functional information to link genetic variants and type 1 DM. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Huang, Yang; Siwo, Geoffrey; Wuchty, Stefan; Ferdig, Michael T; Przytycka, Teresa M
2012-04-01
It is being increasingly recognized that many important phenotypic traits, including various diseases, are governed by a combination of weak genetic effects and their interactions. While the detection of epistatic interactions that involve a non-additive effect of two loci on a quantitative trait is particularly challenging, this interaction type is fundamental for the understanding of genome organization and gene regulation. However, current methods that detect epistatic interactions typically rely on the existence of a strong primary effect, considerably limiting the sensitivity of the search. To fill this gap, we developed a new method, SEE (Symmetric Epistasis Estimation), allowing the genome-wide detection of epistatic interactions without the need for a strong primary effect. We applied our approach to progeny crosses of the human malaria parasite P. falciparum and S. cerevisiae. We found an abundance of epistatic interactions in the parasite and a much smaller number of such interactions in yeast. The genome of P. falciparum also harboured several epistatic interaction hotspots that putatively play a role in drug resistance mechanisms. The abundance of observed epistatic interactions might suggest a mechanism of compensation for the extremely limited repertoire of transcription factors. Interestingly, epistatic interaction hotspots were associated with elevated levels of linkage disequilibrium, an observation that suggests selection pressure acting on P. falciparum, potentially reflecting host-pathogen interactions or drug-induced selection.
Mapping eQTL Networks with Mixed Graphical Markov Models
Tur, Inma; Roverato, Alberto; Castelo, Robert
2014-01-01
Expression quantitative trait loci (eQTL) mapping constitutes a challenging problem due to, among other reasons, the high-dimensional multivariate nature of gene-expression traits. Next to the expression heterogeneity produced by confounding factors and other sources of unwanted variation, indirect effects spread throughout genes as a result of genetic, molecular, and environmental perturbations. From a multivariate perspective one would like to adjust for the effect of all of these factors to end up with a network of direct associations connecting the path from genotype to phenotype. In this article we approach this challenge with mixed graphical Markov models, higher-order conditional independences, and q-order correlation graphs. These models show that additive genetic effects propagate through the network as function of gene–gene correlations. Our estimation of the eQTL network underlying a well-studied yeast data set leads to a sparse structure with more direct genetic and regulatory associations that enable a straightforward comparison of the genetic control of gene expression across chromosomes. Interestingly, it also reveals that eQTLs explain most of the expression variability of network hub genes. PMID:25271303
High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis
Daye, Z. John; Chen, Jinbo; Li, Hongzhe
2011-01-01
Summary We consider the problem of high-dimensional regression under non-constant error variances. Despite being a common phenomenon in biological applications, heteroscedasticity has, so far, been largely ignored in high-dimensional analysis of genomic data sets. We propose a new methodology that allows non-constant error variances for high-dimensional estimation and model selection. Our method incorporates heteroscedasticity by simultaneously modeling both the mean and variance components via a novel doubly regularized approach. Extensive Monte Carlo simulations indicate that our proposed procedure can result in better estimation and variable selection than existing methods when heteroscedasticity arises from the presence of predictors explaining error variances and outliers. Further, we demonstrate the presence of heteroscedasticity in and apply our method to an expression quantitative trait loci (eQTLs) study of 112 yeast segregants. The new procedure can automatically account for heteroscedasticity in identifying the eQTLs that are associated with gene expression variations and lead to smaller prediction errors. These results demonstrate the importance of considering heteroscedasticity in eQTL data analysis. PMID:22547833
Larson, Nicholas B; McDonnell, Shannon K; Fogarty, Zach; Larson, Melissa C; Cheville, John; Riska, Shaun; Baheti, Saurabh; Weber, Alexandra M; Nair, Asha A; Wang, Liang; O'Brien, Daniel; Davila, Jaime; Schaid, Daniel J; Thibodeau, Stephen N
2017-10-17
Large-scale genome-wide association studies have identified multiple single-nucleotide polymorphisms associated with risk of prostate cancer. Many of these genetic variants are presumed to be regulatory in nature; however, follow-up expression quantitative trait loci (eQTL) association studies have to-date been restricted largely to cis -acting associations due to study limitations. While trans -eQTL scans suffer from high testing dimensionality, recent evidence indicates most trans -eQTL associations are mediated by cis -regulated genes, such as transcription factors. Leveraging a data-driven gene co-expression network, we conducted a comprehensive cis -mediator analysis using RNA-Seq data from 471 normal prostate tissue samples to identify downstream regulatory associations of previously identified prostate cancer risk variants. We discovered multiple trans -eQTL associations that were significantly mediated by cis -regulated transcripts, four of which involved risk locus 17q12, proximal transcription factor HNF1B , and target trans -genes with known HNF response elements ( MIA2 , SRC , SEMA6A , KIF12 ). We additionally identified evidence of cis -acting down-regulation of MSMB via rs10993994 corresponding to reduced co-expression of NDRG1 . The majority of these cis -mediator relationships demonstrated trans -eQTL replicability in 87 prostate tissue samples from the Gene-Tissue Expression Project. These findings provide further biological context to known risk loci and outline new hypotheses for investigation into the etiology of prostate cancer.
Fan, Qianrui; Wang, Wenyu; Hao, Jingcan; He, Awen; Wen, Yan; Guo, Xiong; Wu, Cuiyan; Ning, Yujie; Wang, Xi; Wang, Sen; Zhang, Feng
2017-08-01
Neuroticism is a fundamental personality trait with significant genetic determinant. To identify novel susceptibility genes for neuroticism, we conducted an integrative analysis of genomic and transcriptomic data of genome wide association study (GWAS) and expression quantitative trait locus (eQTL) study. GWAS summary data was driven from published studies of neuroticism, totally involving 170,906 subjects. eQTL dataset containing 927,753 eQTLs were obtained from an eQTL meta-analysis of 5311 samples. Integrative analysis of GWAS and eQTL data was conducted by summary data-based Mendelian randomization (SMR) analysis software. To identify neuroticism associated gene sets, the SMR analysis results were further subjected to gene set enrichment analysis (GSEA). The gene set annotation dataset (containing 13,311 annotated gene sets) of GSEA Molecular Signatures Database was used. SMR single gene analysis identified 6 significant genes for neuroticism, including MSRA (p value=2.27×10 -10 ), MGC57346 (p value=6.92×10 -7 ), BLK (p value=1.01×10 -6 ), XKR6 (p value=1.11×10 -6 ), C17ORF69 (p value=1.12×10 -6 ) and KIAA1267 (p value=4.00×10 -6 ). Gene set enrichment analysis observed significant association for Chr8p23 gene set (false discovery rate=0.033). Our results provide novel clues for the genetic mechanism studies of neuroticism. Copyright © 2017. Published by Elsevier Inc.
Accurate and fast multiple-testing correction in eQTL studies.
Sul, Jae Hoon; Raj, Towfique; de Jong, Simone; de Bakker, Paul I W; Raychaudhuri, Soumya; Ophoff, Roel A; Stranger, Barbara E; Eskin, Eleazar; Han, Buhm
2015-06-04
In studies of expression quantitative trait loci (eQTLs), it is of increasing interest to identify eGenes, the genes whose expression levels are associated with variation at a particular genetic variant. Detecting eGenes is important for follow-up analyses and prioritization because genes are the main entities in biological processes. To detect eGenes, one typically focuses on the genetic variant with the minimum p value among all variants in cis with a gene and corrects for multiple testing to obtain a gene-level p value. For performing multiple-testing correction, a permutation test is widely used. Because of growing sample sizes of eQTL studies, however, the permutation test has become a computational bottleneck in eQTL studies. In this paper, we propose an efficient approach for correcting for multiple testing and assess eGene p values by utilizing a multivariate normal distribution. Our approach properly takes into account the linkage-disequilibrium structure among variants, and its time complexity is independent of sample size. By applying our small-sample correction techniques, our method achieves high accuracy in both small and large studies. We have shown that our method consistently produces extremely accurate p values (accuracy > 98%) for three human eQTL datasets with different sample sizes and SNP densities: the Genotype-Tissue Expression pilot dataset, the multi-region brain dataset, and the HapMap 3 dataset. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Cell Specific eQTL Analysis without Sorting Cells
Esko, Tõnu; Peters, Marjolein J.; Schurmann, Claudia; Schramm, Katharina; Kettunen, Johannes; Yaghootkar, Hanieh; Fairfax, Benjamin P.; Andiappan, Anand Kumar; Li, Yang; Fu, Jingyuan; Karjalainen, Juha; Platteel, Mathieu; Visschedijk, Marijn; Weersma, Rinse K.; Kasela, Silva; Milani, Lili; Tserel, Liina; Peterson, Pärt; Reinmaa, Eva; Hofman, Albert; Uitterlinden, André G.; Rivadeneira, Fernando; Homuth, Georg; Petersmann, Astrid; Lorbeer, Roberto; Prokisch, Holger; Meitinger, Thomas; Herder, Christian; Roden, Michael; Grallert, Harald; Ripatti, Samuli; Perola, Markus; Wood, Andrew R.; Melzer, David; Ferrucci, Luigi; Singleton, Andrew B.; Hernandez, Dena G.; Knight, Julian C.; Melchiotti, Rossella; Lee, Bernett; Poidinger, Michael; Zolezzi, Francesca; Larbi, Anis; Wang, De Yun; van den Berg, Leonard H.; Veldink, Jan H.; Rotzschke, Olaf; Makino, Seiko; Salomaa, Veikko; Strauch, Konstantin; Völker, Uwe; van Meurs, Joyce B. J.; Metspalu, Andres; Wijmenga, Cisca; Jansen, Ritsert C.; Franke, Lude
2015-01-01
The functional consequences of trait associated SNPs are often investigated using expression quantitative trait locus (eQTL) mapping. While trait-associated variants may operate in a cell-type specific manner, eQTL datasets for such cell-types may not always be available. We performed a genome-environment interaction (GxE) meta-analysis on data from 5,683 samples to infer the cell type specificity of whole blood cis-eQTLs. We demonstrate that this method is able to predict neutrophil and lymphocyte specific cis-eQTLs and replicate these predictions in independent cell-type specific datasets. Finally, we show that SNPs associated with Crohn’s disease preferentially affect gene expression within neutrophils, including the archetypal NOD2 locus. PMID:25955312
Inferring molecular interactions pathways from eQTL data
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rashid, Imran; McDermott, Jason E.; Samudrala, Ram
Analysis of expression quantitative trait loci (eQTL) helps elucidate the connection between genotype, gene expression levels, and phenotype. However, standard statistical genetics can only attribute changes in expression levels to loci on the genome, not specific genes. Each locus can contain many genes, making it very difficult to discover which gene is controlling the expression levels of other genes. Furthermore, it is even more difficult to find a pathway of molecular interactions responsible for controlling the expression levels. Here we describe a series of techniques for finding explanatory pathways by exploring graphs of molecular interactions. We show several simple methodsmore » can find complete pathways the explain the mechanism of differential expression in eQTL data.« less
Sajuthi, Satria P.; Sharma, Neeraj K.; Chou, Jeff W.; Palmer, Nicholette D.; McWilliams, David R.; Beal, John; Comeau, Mary E.; Ma, Lijun; Calles-Escandon, Jorge; Demons, Jamehl; Rogers, Samantha; Cherry, Kristina; Menon, Lata; Kouba, Ethel; Davis, Donna; Burris, Marcie; Byerly, Sara J.; Ng, Maggie C.Y.; Maruthur, Nisa M.; Patel, Sanjay R.; Bielak, Lawrence F.; Lange, Leslie; Guo, Xiuqing; Sale, Michèle M.; Chan, Kei Hang; Monda, Keri L.; Chen, Gary K.; Taylor, Kira; Palmer, Cameron; Edwards, Todd L; North, Kari E.; Haiman, Christopher A.; Bowden, Donald W.; Freedman, Barry I.; Langefeld, Carl D.; Das, Swapan K.
2016-01-01
Relative to European Americans, type 2 diabetes (T2D) is more prevalent in African Americans (AAs). Genetic variation may modulate transcript abundance in insulin-responsive tissues and contribute to risk; yet published studies identifying expression quantitative trait loci (eQTLs) in African ancestry populations are restricted to blood cells. This study aims to develop a map of genetically regulated transcripts expressed in tissues important for glucose homeostasis in AAs, critical for identifying the genetic etiology of T2D and related traits. Quantitative measures of adipose and muscle gene expression, and genotypic data were integrated in 260 non-diabetic AAs to identify expression regulatory variants. Their roles in genetic susceptibility to T2D, and related metabolic phenotypes were evaluated by mining GWAS datasets. eQTL analysis identified 1,971 and 2,078 cis-eGenes in adipose and muscle, respectively. Cis-eQTLs for 885 transcripts including top cis-eGenes CHURC1, USMG5, and ERAP2, were identified in both tissues. 62.1% of top cis-eSNPs were within ±50kb of transcription start sites and cis-eGenes were enriched for mitochondrial transcripts. Mining GWAS databases revealed association of cis-eSNPs for more than 50 genes with T2D (e.g. PIK3C2A, RBMS1, UFSP1), gluco-metabolic phenotypes, (e.g. INPP5E, SNX17, ERAP2, FN3KRP), and obesity (e.g. POMC, CPEB4). Integration of GWAS meta-analysis data from AA cohorts revealed the most significant association for cis-eSNPs of ATP5SL and MCCC1 genes, with T2D and BMI, respectively. This study developed the first comprehensive map of adipose and muscle tissue eQTLs in AAs (publically accessible at https://mdsetaa.phs.wakehealth.edu) and identified genetically-regulated transcripts for delineating genetic causes of T2D, and related metabolic phenotypes. PMID:27193597
Pandey, Manish K; Wang, Ming Li; Qiao, Lixian; Feng, Suping; Khera, Pawan; Wang, Hui; Tonnis, Brandon; Barkley, Noelle A; Wang, Jianping; Holbrook, C Corley; Culbreath, Albert K; Varshney, Rajeev K; Guo, Baozhu
2014-12-10
Peanut is one of the major source for human consumption worldwide and its seed contain approximately 50% oil. Improvement of oil content and quality traits (high oleic and low linoleic acid) in peanut could be accelerated by exploiting linked markers through molecular breeding. The objective of this study was to identify QTLs associated with oil content, and estimate relative contribution of FAD2 genes (ahFAD2A and ahFAD2B) to oil quality traits in two recombinant inbred line (RIL) populations. Improved genetic linkage maps were developed for S-population (SunOleic 97R × NC94022) with 206 (1780.6 cM) and T-population (Tifrunner × GT-C20) with 378 (2487.4 cM) marker loci. A total of 6 and 9 QTLs controlling oil content were identified in the S- and T-population, respectively. The contribution of each QTL towards oil content variation ranged from 3.07 to 10.23% in the S-population and from 3.93 to 14.07% in the T-population. The mapping positions for ahFAD2A (A sub-genome) and ahFAD2B (B sub-genome) genes were assigned on a09 and b09 linkage groups. The ahFAD2B gene (26.54%, 25.59% and 41.02% PVE) had higher phenotypic effect on oleic acid (C18:1), linoleic acid (C18:2), and oleic/linoleic acid ratio (O/L ratio) than ahFAD2A gene (8.08%, 6.86% and 3.78% PVE). The FAD2 genes had no effect on oil content. This study identified a total of 78 main-effect QTLs (M-QTLs) with up to 42.33% phenotypic variation (PVE) and 10 epistatic QTLs (E-QTLs) up to 3.31% PVE for oil content and quality traits. A total of 78 main-effect QTLs (M-QTLs) and 10 E-QTLs have been detected for oil content and oil quality traits. One major QTL (more than 10% PVE) was identified in both the populations for oil content with source alleles from NC94022 and GT-C20 parental genotypes. FAD2 genes showed high effect for oleic acid (C18:1), linoleic acid (C18:2), and O/L ratio while no effect on total oil content. The information on phenotypic effect of FAD2 genes for oleic acid, linoleic acid and O/L ratio, and oil content will be applied in breeding selection.
Adaptive linear rank tests for eQTL studies
Szymczak, Silke; Scheinhardt, Markus O.; Zeller, Tanja; Wild, Philipp S.; Blankenberg, Stefan; Ziegler, Andreas
2013-01-01
Expression quantitative trait loci (eQTL) studies are performed to identify single-nucleotide polymorphisms that modify average expression values of genes, proteins, or metabolites, depending on the genotype. As expression values are often not normally distributed, statistical methods for eQTL studies should be valid and powerful in these situations. Adaptive tests are promising alternatives to standard approaches, such as the analysis of variance or the Kruskal–Wallis test. In a two-stage procedure, skewness and tail length of the distributions are estimated and used to select one of several linear rank tests. In this study, we compare two adaptive tests that were proposed in the literature using extensive Monte Carlo simulations of a wide range of different symmetric and skewed distributions. We derive a new adaptive test that combines the advantages of both literature-based approaches. The new test does not require the user to specify a distribution. It is slightly less powerful than the locally most powerful rank test for the correct distribution and at least as powerful as the maximin efficiency robust rank test. We illustrate the application of all tests using two examples from different eQTL studies. PMID:22933317
Adaptive linear rank tests for eQTL studies.
Szymczak, Silke; Scheinhardt, Markus O; Zeller, Tanja; Wild, Philipp S; Blankenberg, Stefan; Ziegler, Andreas
2013-02-10
Expression quantitative trait loci (eQTL) studies are performed to identify single-nucleotide polymorphisms that modify average expression values of genes, proteins, or metabolites, depending on the genotype. As expression values are often not normally distributed, statistical methods for eQTL studies should be valid and powerful in these situations. Adaptive tests are promising alternatives to standard approaches, such as the analysis of variance or the Kruskal-Wallis test. In a two-stage procedure, skewness and tail length of the distributions are estimated and used to select one of several linear rank tests. In this study, we compare two adaptive tests that were proposed in the literature using extensive Monte Carlo simulations of a wide range of different symmetric and skewed distributions. We derive a new adaptive test that combines the advantages of both literature-based approaches. The new test does not require the user to specify a distribution. It is slightly less powerful than the locally most powerful rank test for the correct distribution and at least as powerful as the maximin efficiency robust rank test. We illustrate the application of all tests using two examples from different eQTL studies. Copyright © 2012 John Wiley & Sons, Ltd.
Inference on the Strength of Balancing Selection for Epistatically Interacting Loci
Buzbas, Erkan Ozge; Joyce, Paul; Rosenberg, Noah A.
2011-01-01
Existing inference methods for estimating the strength of balancing selection in multi-locus genotypes rely on the assumption that there are no epistatic interactions between loci. Complex systems in which balancing selection is prevalent, such as sets of human immune system genes, are known to contain components that interact epistatically. Therefore, current methods may not produce reliable inference on the strength of selection at these loci. In this paper, we address this problem by presenting statistical methods that can account for epistatic interactions in making inference about balancing selection. A theoretical result due to Fearnhead (2006) is used to build a multi-locus Wright-Fisher model of balancing selection, allowing for epistatic interactions among loci. Antagonistic and synergistic types of interactions are examined. The joint posterior distribution of the selection and mutation parameters is sampled by Markov chain Monte Carlo methods, and the plausibility of models is assessed via Bayes factors. As a component of the inference process, an algorithm to generate multi-locus allele frequencies under balancing selection models with epistasis is also presented. Recent evidence on interactions among a set of human immune system genes is introduced as a motivating biological system for the epistatic model, and data on these genes are used to demonstrate the methods. PMID:21277883
Kertai, Miklos D; Qi, Wenjing; Li, Yi-Ju; Lombard, Frederick W; Liu, Yutao; Smith, Michael P; Stafford-Smith, Mark; Newman, Mark F; Milano, Carmelo A; Mathew, Joseph P; Podgoreanu, Mihai V
2016-03-01
Atrial tissue gene expression profiling may help to determine how differentially expressed genes in the human atrium before cardiopulmonary bypass (CPB) are related to subsequent biologic pathway activation patterns, and whether specific expression profiles are associated with an increased risk for postoperative atrial fibrillation (AF) or altered response to β-blocker (BB) therapy after coronary artery bypass grafting (CABG) surgery. Right atrial appendage (RAA) samples were collected from 45 patients who were receiving perioperative BB treatment, and underwent CABG surgery. The isolated RNA samples were used for microarray gene expression analysis, to identify probes that were expressed differently in patients with and without postoperative AF. Gene expression analysis was performed to identify probes that were expressed differently in patients with and without postoperative AF. Gene set enrichment analysis (GSEA) was performed to determine how sets of genes might be systematically altered in patients with postoperative AF. Of the 45 patients studied, genomic DNA from 42 patients was used for target sequencing of 66 candidate genes potentially associated with AF, and 2,144 single-nucleotide polymorphisms (SNPs) were identified. We then performed expression quantitative trait loci (eQTL) analysis to determine the correlation between SNPs identified in the genotyped patients, and RAA expression. Probes that met a false discovery rate<0.25 were selected for eQTL analysis. Of the 17,678 gene expression probes analyzed, 2 probes met our prespecified significance threshold of false discovery rate<0.25. The most significant probe corresponded to vesicular overexpressed in cancer - prosurvival protein 1 gene (VOPP1; 1.83 fold change; P=3.47×10(-7)), and was up-regulated in patients with postoperative AF, whereas the second most significant probe, which corresponded to the LOC389286 gene (0.49 fold change; P=1.54×10(-5)), was down-regulated in patients with postoperative AF. GSEA highlighted the role of VOPP1 in pathways with biologic relevance to myocardial homeostasis, and oxidative stress and redox modulation. Candidate gene eQTL showed a trans-acting association between variants of G protein-coupled receptor kinase 5 gene, previously linked to altered BB response, and high expression of VOPP1. In patients undergoing CABG surgery, RAA gene expression profiling, and pathway and eQTL analysis suggested that VOPP1 plays a novel etiological role in postoperative AF despite perioperative BB therapy. Copyright © 2016. Published by Elsevier Ltd.
Gahlaut, Vijay; Jaiswal, Vandana; Tyagi, Bhudeva S.; Singh, Gyanendra; Sareen, Sindhu; Balyan, Harindra S.
2017-01-01
In bread wheat, QTL interval mapping was conducted for nine important drought responsive agronomic traits. For this purpose, a doubled haploid (DH) mapping population derived from Kukri/Excalibur was grown over three years at four separate locations in India, both under irrigated and rain-fed environments. Single locus analysis using composite interval mapping (CIM) allowed detection of 98 QTL, which included 66 QTL for nine individual agronomic traits and 32 QTL, which affected drought sensitivity index (DSI) for the same nine traits. Two-locus analysis allowed detection of 19 main effect QTL (M-QTL) for four traits (days to anthesis, days to maturity, grain filling duration and thousand grain weight) and 19 pairs of epistatic QTL (E-QTL) for two traits (days to anthesis and thousand grain weight). Eight QTL were common in single locus analysis and two locus analysis. These QTL (identified both in single- and two-locus analysis) were distributed on 20 different chromosomes (except 4D). Important genomic regions on chromosomes 5A and 7A were also identified (5A carried QTL for seven traits and 7A carried QTL for six traits). Marker-assisted recurrent selection (MARS) involving pyramiding of important QTL reported in the present study, together with important QTL reported earlier, may be used for improvement of drought tolerance in wheat. In future, more closely linked markers for the QTL reported here may be developed through fine mapping, and the candidate genes may be identified and used for developing a better understanding of the genetic basis of drought tolerance in wheat. PMID:28793327
ARLTS1 and Prostate Cancer Risk - Analysis of Expression and Regulation
Siltanen, Sanna; Fischer, Daniel; Rantapero, Tommi; Laitinen, Virpi; Mpindi, John Patrick; Kallioniemi, Olli; Wahlfors, Tiina; Schleutker, Johanna
2013-01-01
Prostate cancer (PCa) is a heterogeneous trait for which several susceptibility loci have been implicated by genome-wide linkage and association studies. The genomic region 13q14 is frequently deleted in tumour tissues of both sporadic and familial PCa patients and is consequently recognised as a possible locus of tumour suppressor gene(s). Deletions of this region have been found in many other cancers. Recently, we showed that homozygous carriers for the T442C variant of the ARLTS1 gene (ADP-ribosylation factor-like tumour suppressor protein 1 or ARL11, located at 13q14) are associated with an increased risk for both unselected and familial PCa. Furthermore, the variant T442C was observed in greater frequency among malignant tissue samples, PCa cell lines and xenografts, supporting its role in PCa tumourigenesis. In this study, 84 PCa cases and 15 controls were analysed for ARLTS1 expression status in blood-derived RNA. A statistically significant (p = 0.0037) decrease of ARLTS1 expression in PCa cases was detected. Regulation of ARLTS1 expression was analysed with eQTL (expression quantitative trait loci) methods. Altogether fourteen significant cis-eQTLs affecting the ARLTS1 expression level were found. In addition, epistatic interactions of ARLTS1 genomic variants with genes involved in immune system processes were predicted with the MDR program. In conclusion, this study further supports the role of ARLTS1 as a tumour suppressor gene and reveals that the expression is regulated through variants localised in regulatory regions. PMID:23940804
Kel, Ivan; Chang, Zisong; Galluccio, Nadia; Romeo, Margherita; Beretta, Stefano; Diomede, Luisa; Mezzelani, Alessandra; Milanesi, Luciano; Dieterich, Christoph; Merelli, Ivan
2016-10-18
The interpretation of genome-wide association study is difficult, as it is hard to understand how polymorphisms can affect gene regulation, in particular for trans-regulatory elements located far from their controlling gene. Using RNA or protein expression data as phenotypes, it is possible to correlate their variations with specific genotypes. This technique is usually referred to as expression Quantitative Trait Loci (eQTLs) analysis and only few packages exist for the integration of genotype patterns and expression profiles. In particular, tools are needed for the analysis of next-generation sequencing (NGS) data on a genome-wide scale, which is essential to identify eQTLs able to control a large number of genes (hotspots). Here we present SPIRE (Software for Polymorphism Identification Regulating Expression), a generic, modular and functionally highly flexible pipeline for eQTL processing. SPIRE integrates different univariate and multivariate approaches for eQTL analysis, paying particular attention to the scalability of the procedure in order to support cis- as well as trans-mapping, thus allowing the identification of hotspots in NGS data. In particular, we demonstrated how SPIRE can handle big association study datasets, reproducing published results and improving the identification of trans-eQTLs. Furthermore, we employed the pipeline to analyse novel data concerning the genotypes of two different C. elegans strains (N2 and Hawaii) and related miRNA expression data, obtained using RNA-Seq. A miRNA regulatory hotspot was identified in chromosome 1, overlapping the transcription factor grh-1, known to be involved in the early phases of embryonic development of C. elegans. In a follow-up qPCR experiment we were able to verify most of the predicted eQTLs, as well as to show, for a novel miRNA, a significant difference in the sequences of the two analysed strains of C. elegans. SPIRE is publicly available as open source software at , together with some example data, a readme file, supplementary material and a short tutorial.
Wang, W; Huang, S; Hou, W; Liu, Y; Fan, Q; He, A; Wen, Y; Hao, J; Guo, X; Zhang, F
2017-10-01
Several genome-wide association studies (GWAS) of bone mineral density (BMD) have successfully identified multiple susceptibility genes, yet isolated susceptibility genes are often difficult to interpret biologically. The aim of this study was to unravel the genetic background of BMD at pathway level, by integrating BMD GWAS data with genome-wide expression quantitative trait loci (eQTLs) and methylation quantitative trait loci (meQTLs) data METHOD: We employed the GWAS datasets of BMD from the Genetic Factors for Osteoporosis Consortium (GEFOS), analysing patients' BMD. The areas studied included 32 735 femoral necks, 28 498 lumbar spines, and 8143 forearms. Genome-wide eQTLs (containing 923 021 eQTLs) and meQTLs (containing 683 152 unique methylation sites with local meQTLs) data sets were collected from recently published studies. Gene scores were first calculated by summary data-based Mendelian randomisation (SMR) software and meQTL-aligned GWAS results. Gene set enrichment analysis (GSEA) was then applied to identify BMD-associated gene sets with a predefined significance level of 0.05. We identified multiple gene sets associated with BMD in one or more regions, including relevant known biological gene sets such as the Reactome Circadian Clock (GSEA p-value = 1.0 × 10 -4 for LS and 2.7 × 10 -2 for femoral necks BMD in eQTLs-based GSEA) and insulin-like growth factor receptor binding (GSEA p-value = 5.0 × 10 -4 for femoral necks and 2.6 × 10 -2 for lumbar spines BMD in meQTLs-based GSEA). Our results provided novel clues for subsequent functional analysis of bone metabolism, and illustrated the benefit of integrating eQTLs and meQTLs data into pathway association analysis for genetic studies of complex human diseases. Cite this article : W. Wang, S. Huang, W. Hou, Y. Liu, Q. Fan, A. He, Y. Wen, J. Hao, X. Guo, F. Zhang. Integrative analysis of GWAS, eQTLs and meQTLs data suggests that multiple gene sets are associated with bone mineral density. Bone Joint Res 2017;6:572-576. © 2017 Wang et al.
Pasricha, Sarina; Kenney-Hunt, Jane; Anderson, Kristy; Jafari, Nadereh; Hall, Rabea A.; Lammert, Frank; Cheverud, James; Green, Richard M.
2015-01-01
Nonalcoholic fatty liver disease (NAFLD) is a highly prevalent form of human hepatic disease and feeding mice a high-fat, high-caloric (HFHC) diet is a standard model of NAFLD. To better understand the genetic basis of NAFLD, we conducted an expression quantitative trait locus (eQTL) analysis of mice fed a HFHC diet. Two-hundred sixty-five (A/J × C57BL/6J) F2 male mice were fed a HFHC diet for 8 wk. eQTL analysis was utilized to identify genomic regions that regulate hepatic gene expression of Xbp1s and Socs3. We identified two overlapping loci for Xbp1s and Socs3 on Chr 1 (164.0–185.4 Mb and 174.4–190.5 Mb, respectively) and Chr 11 (41.1–73.1 Mb and 44.0–68.6 Mb, respectively), and an additional locus for Socs3 on Chr 12 (109.9–117.4 Mb). C57BL/6J-Chr 11A/J/ NaJ mice fed a HFHC diet manifested the A/J phenotype of increased Xbp1s and Socs3 gene expression (P < 0.05), whereas C57BL/6J-Chr 1A/J/ NaJ mice retained the C57BL/6J phenotype. In addition, we replicated the eQTLs on Chr 1 and Chr 12 (LOD scores ≥3.5) using mice from the BXD murine reference panel challenged with CCl4 to induce chronic liver injury and fibrosis. We have identified overlapping eQTLs for Xbp1 and Socs3 on Chr 1 and Chr 11, and consomic mice confirmed that replacing the C57BL/6J Chr 11 with the A/J Chr 11 resulted in an A/J phenotype for Xbp1 and Socs3 gene expression. Identification of the genes for these eQTLs will lead to a better understanding of the genetic factors responsible for NAFLD and potentially other hepatic diseases. PMID:25617409
Polster, Robert; Petropoulos, Christos J; Bonhoeffer, Sebastian; Guillaume, Frédéric
2016-12-01
The genotype-phenotype (GP) map is a central concept in evolutionary biology as it describes the mapping of molecular genetic variation onto phenotypic trait variation. Our understanding of that mapping remains partial, especially when trying to link functional clustering of pleiotropic gene effects with patterns of phenotypic trait co-variation. Only on rare occasions have studies been able to fully explore that link and tend to show poor correspondence between modular structures within the GP map and among phenotypes. By dissecting the structure of the GP map of the replicative capacity of HIV-1 in 15 drug environments, we provide a detailed view of that mapping from mutational pleiotropic variation to phenotypic co-variation, including epistatic effects of a set of amino-acid substitutions in the reverse transcriptase and protease genes. We show that epistasis increases the pleiotropic degree of single mutations and provides modularity to the GP map of drug resistance in HIV-1. Moreover, modules of epistatic pleiotropic effects within the GP map match the phenotypic modules of correlated replicative capacity among drug classes. Epistasis thus increases the evolvability of cross-resistance in HIV by providing more drug- and class-specific pleiotropic profiles to the main effects of the mutations. We discuss the implications for the evolution of cross-resistance in HIV. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Li, Ming; Luo, Xiong-jian; Landén, Mikael; Bergen, Sarah E.; Hultman, Christina M.; Li, Xiao; Zhang, Wen; Yao, Yong-Gang; Zhang, Chen; Liu, Jiewei; Mattheisen, Manuel; Cichon, Sven; Mühleisen, Thomas W.; Degenhardt, Franziska A.; Nöthen, Markus M.; Schulze, Thomas G.; Grigoroiu-Serbanescu, Maria; Li, Hao; Fuller, Chris K.; Chen, Chunhui; Dong, Qi; Chen, Chuansheng; Jamain, Stéphane; Leboyer, Marion; Bellivier, Frank; Etain, Bruno; Kahn, Jean-Pierre; Henry, Chantal; Preisig, Martin; Kutalik, Zoltán; Castelao, Enrique; Wright, Adam; Mitchell, Philip B.; Fullerton, Janice M.; Schofield, Peter R.; Montgomery, Grant W.; Medland, Sarah E.; Gordon, Scott D.; Martin, Nicholas G.; Rietschel, Marcella; Liu, Chunyu; Kleinman, Joel E.; Hyde, Thomas M.; Weinberger, Daniel R.; Su, Bing
2016-01-01
Summary Bipolar disorder (BPD) is a highly heritable polygenic disorder. Recent enrichment analyses suggest that there may be true risk variants for BPD among the expression quantitative trait loci (eQTL) in the brain. Aims We sought to assess the impact of eQTL variants on BPD risk by combining data from both BPD genome-wide association study (GWAS) and brain eQTL. Method To detect single-nucleotide polymorphisms (SNPs) that influence expression levels of genes associated with BPD, we jointly analyzed data from a BPD GWAS (7,481 cases and 9,250 controls) and a genome-wide brain (cortical) eQTL (193 healthy controls) using a Bayesian statistical method, with independent follow-up replications. The identified risk SNP was then further tested for association with hippocampal volume (N=5,775) and cognitive performance (N=342) among healthy subjects. Results Integrative analysis revealed a significant association between a brain eQTL rs6088662 in 20q11.22 and BPD (Log Bayes Factor=5.48; BPD p-val=5.85×10−5). Follow-up studies across multiple independent samples confirmed the association of the risk SNP (rs6088662) with gene expression and BPD susceptibility (p-val=3.54×10−8). Further exploratory analysis revealed that rs6088662 is also associated with hippocampal volume and cognitive performance in healthy subjects. Conclusions Our findings suggest that 20q11.22 is likely a risk region for BPD, highlighting the informativeness of integrating functional annotation of genetic variants for gene expression in advancing our understanding of the biological basis underlying complex diseases such as BPD. PMID:26338991
Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease
Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.
2011-01-01
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713
Li, Ming; Luo, Xiong-jian; Landén, Mikael; Bergen, Sarah E; Hultman, Christina M; Li, Xiao; Zhang, Wen; Yao, Yong-Gang; Zhang, Chen; Liu, Jiewei; Mattheisen, Manuel; Cichon, Sven; Mühleisen, Thomas W; Degenhardt, Franziska A; Nöthen, Markus M; Schulze, Thomas G; Grigoroiu-Serbanescu, Maria; Li, Hao; Fuller, Chris K; Chen, Chunhui; Dong, Qi; Chen, Chuansheng; Jamain, Stéphane; Leboyer, Marion; Bellivier, Frank; Etain, Bruno; Kahn, Jean-Pierre; Henry, Chantal; Preisig, Martin; Kutalik, Zoltán; Castelao, Enrique; Wright, Adam; Mitchell, Philip B; Fullerton, Janice M; Schofield, Peter R; Montgomery, Grant W; Medland, Sarah E; Gordon, Scott D; Martin, Nicholas G; Rietschel, Marcella; Liu, Chunyu; Kleinman, Joel E; Hyde, Thomas M; Weinberger, Daniel R; Su, Bing
2016-02-01
Bipolar disorder is a highly heritable polygenic disorder. Recent enrichment analyses suggest that there may be true risk variants for bipolar disorder in the expression quantitative trait loci (eQTL) in the brain. We sought to assess the impact of eQTL variants on bipolar disorder risk by combining data from both bipolar disorder genome-wide association studies (GWAS) and brain eQTL. To detect single nucleotide polymorphisms (SNPs) that influence expression levels of genes associated with bipolar disorder, we jointly analysed data from a bipolar disorder GWAS (7481 cases and 9250 controls) and a genome-wide brain (cortical) eQTL (193 healthy controls) using a Bayesian statistical method, with independent follow-up replications. The identified risk SNP was then further tested for association with hippocampal volume (n = 5775) and cognitive performance (n = 342) among healthy individuals. Integrative analysis revealed a significant association between a brain eQTL rs6088662 on chromosome 20q11.22 and bipolar disorder (log Bayes factor = 5.48; bipolar disorder P = 5.85 × 10(-5)). Follow-up studies across multiple independent samples confirmed the association of the risk SNP (rs6088662) with gene expression and bipolar disorder susceptibility (P = 3.54 × 10(-8)). Further exploratory analysis revealed that rs6088662 is also associated with hippocampal volume and cognitive performance in healthy individuals. Our findings suggest that 20q11.22 is likely a risk region for bipolar disorder; they also highlight the informative value of integrating functional annotation of genetic variants for gene expression in advancing our understanding of the biological basis underlying complex disorders, such as bipolar disorder. © The Royal College of Psychiatrists 2016.
Griswold, Cortland K
2015-12-21
Epistatic gene action occurs when mutations or alleles interact to produce a phenotype. Theoretically and empirically it is of interest to know whether gene interactions can facilitate the evolution of diversity. In this paper, we explore how epistatic gene action affects the additive genetic component or heritable component of multivariate trait variation, as well as how epistatic gene action affects the evolvability of multivariate traits. The analysis involves a sexually reproducing and recombining population. Our results indicate that under stabilizing selection conditions a population with a mixed additive and epistatic genetic architecture can have greater multivariate additive genetic variation and evolvability than a population with a purely additive genetic architecture. That greater multivariate additive genetic variation can occur with epistasis is in contrast to previous theory that indicated univariate additive genetic variation is decreased with epistasis under stabilizing selection conditions. In a multivariate setting, epistasis leads to less relative covariance among individuals in their genotypic, as well as their breeding values, which facilitates the maintenance of additive genetic variation and increases a population׳s evolvability. Our analysis involves linking the combinatorial nature of epistatic genetic effects to the ancestral graph structure of a population to provide insight into the consequences of epistasis on multivariate trait variation and evolution. Copyright © 2015 Elsevier Ltd. All rights reserved.
Inference of epistatic effects in a key mitochondrial protein
NASA Astrophysics Data System (ADS)
Nelson, Erik D.; Grishin, Nick V.
2018-06-01
We use Potts model inference to predict pair epistatic effects in a key mitochondrial protein—cytochrome c oxidase subunit 2—for ray-finned fishes. We examine the effect of phylogenetic correlations on our predictions using a simple exact fitness model, and we find that, although epistatic effects are underpredicted, they maintain a roughly linear relationship to their true (model) values. After accounting for this correction, epistatic effects in the protein are still relatively weak, leading to fitness valleys of depth 2 N s ≃-5 in compensatory double mutants. Interestingly, positive epistasis is more pronounced than negative epistasis, and the strongest positive effects capture nearly all sites subject to positive selection in fishes, similar to virus proteins evolving under selection pressure in the context of drug therapy.
Use of eQTL Analysis for the Discovery of Target Genes Identified by GWAS
2013-04-01
the biologic pathways affected by these inherited factors, and ultimately to identify targets for disease prediction, risk stratification and...quality using an Agilent chip technology. Cases having a RIN number of 7.0 or greater were considered good quality. Once completed, the optimum set of...AD_________________ Award Number: W81XWH-11-1-0261 TITLE: Use of eQTL Analysis for the Discovery of
Use of eQTL Analysis for the Discovery of Target Genes Identified by GWAS
2014-04-01
technology. Cases having a RIN number of 7.0 or greater were considered good quality. Once completed, the optimum set of 500 samples were then selected for...AD_________________ Award Number: W81XWH-11-1-0261 TITLE: Use of eQTL Analysis for the Discovery...Distribution Unlimited The views, opinions and/or findings contained in this report are those of the author(s) and
Dai, Jiajuan; Wang, Xusheng; Chen, Ying; Wang, Xiaodong; Zhu, Jun; Lu, Lu
2009-11-01
Previous studies have revealed that the subunit alpha 2 (Gabra2) of the gamma-aminobutyric acid receptor plays a critical role in the stress response. However, little is known about the gentetic regulatory network for Gabra2 and the stress response. We combined gene expression microarray analysis and quantitative trait loci (QTL) mapping to characterize the genetic regulatory network for Gabra2 expression in the hippocampus of BXD recombinant inbred (RI) mice. Our analysis found that the expression level of Gabra2 exhibited much variation in the hippocampus across the BXD RI strains and between the parental strains, C57BL/6J, and DBA/2J. Expression QTL (eQTL) mapping showed three microarray probe sets of Gabra2 to have highly significant linkage likelihood ratio statistic (LRS) scores. Gene co-regulatory network analysis showed that 10 genes, including Gria3, Chka, Drd3, Homer1, Grik2, Odz4, Prkag2, Grm5, Gabrb1, and Nlgn1 are directly or indirectly associated with stress responses. Eleven genes were implicated as Gabra2 downstream genes through mapping joint modulation. The genetical genomics approach demonstrates the importance and the potential power of the eQTL studies in identifying genetic regulatory networks that contribute to complex traits, such as stress responses.
Modular analysis of the probabilistic genetic interaction network.
Hou, Lin; Wang, Lin; Qian, Minping; Li, Dong; Tang, Chao; Zhu, Yunping; Deng, Minghua; Li, Fangting
2011-03-15
Epistatic Miniarray Profiles (EMAP) has enabled the mapping of large-scale genetic interaction networks; however, the quantitative information gained from EMAP cannot be fully exploited since the data are usually interpreted as a discrete network based on an arbitrary hard threshold. To address such limitations, we adopted a mixture modeling procedure to construct a probabilistic genetic interaction network and then implemented a Bayesian approach to identify densely interacting modules in the probabilistic network. Mixture modeling has been demonstrated as an effective soft-threshold technique of EMAP measures. The Bayesian approach was applied to an EMAP dataset studying the early secretory pathway in Saccharomyces cerevisiae. Twenty-seven modules were identified, and 14 of those were enriched by gold standard functional gene sets. We also conducted a detailed comparison with state-of-the-art algorithms, hierarchical cluster and Markov clustering. The experimental results show that the Bayesian approach outperforms others in efficiently recovering biologically significant modules.
Lachowiec, Jennifer; Shen, Xia; Queitsch, Christine; Carlborg, Örjan
2015-01-01
Efforts to identify loci underlying complex traits generally assume that most genetic variance is additive. Here, we examined the genetics of Arabidopsis thaliana root length and found that the genomic narrow-sense heritability for this trait in the examined population was statistically zero. The low amount of additive genetic variance that could be captured by the genome-wide genotypes likely explains why no associations to root length could be found using standard additive-model-based genome-wide association (GWA) approaches. However, as the broad-sense heritability for root length was significantly larger, and primarily due to epistasis, we also performed an epistatic GWA analysis to map loci contributing to the epistatic genetic variance. Four interacting pairs of loci were revealed, involving seven chromosomal loci that passed a standard multiple-testing corrected significance threshold. The genotype-phenotype maps for these pairs revealed epistasis that cancelled out the additive genetic variance, explaining why these loci were not detected in the additive GWA analysis. Small population sizes, such as in our experiment, increase the risk of identifying false epistatic interactions due to testing for associations with very large numbers of multi-marker genotypes in few phenotyped individuals. Therefore, we estimated the false-positive risk using a new statistical approach that suggested half of the associated pairs to be true positive associations. Our experimental evaluation of candidate genes within the seven associated loci suggests that this estimate is conservative; we identified functional candidate genes that affected root development in four loci that were part of three of the pairs. The statistical epistatic analyses were thus indispensable for confirming known, and identifying new, candidate genes for root length in this population of wild-collected A. thaliana accessions. We also illustrate how epistatic cancellation of the additive genetic variance explains the insignificant narrow-sense and significant broad-sense heritability by using a combination of careful statistical epistatic analyses and functional genetic experiments.
JBASE: Joint Bayesian Analysis of Subphenotypes and Epistasis.
Colak, Recep; Kim, TaeHyung; Kazan, Hilal; Oh, Yoomi; Cruz, Miguel; Valladares-Salgado, Adan; Peralta, Jesus; Escobedo, Jorge; Parra, Esteban J; Kim, Philip M; Goldenberg, Anna
2016-01-15
Rapid advances in genotyping and genome-wide association studies have enabled the discovery of many new genotype-phenotype associations at the resolution of individual markers. However, these associations explain only a small proportion of theoretically estimated heritability of most diseases. In this work, we propose an integrative mixture model called JBASE: joint Bayesian analysis of subphenotypes and epistasis. JBASE explores two major reasons of missing heritability: interactions between genetic variants, a phenomenon known as epistasis and phenotypic heterogeneity, addressed via subphenotyping. Our extensive simulations in a wide range of scenarios repeatedly demonstrate that JBASE can identify true underlying subphenotypes, including their associated variants and their interactions, with high precision. In the presence of phenotypic heterogeneity, JBASE has higher Power and lower Type 1 Error than five state-of-the-art approaches. We applied our method to a sample of individuals from Mexico with Type 2 diabetes and discovered two novel epistatic modules, including two loci each, that define two subphenotypes characterized by differences in body mass index and waist-to-hip ratio. We successfully replicated these subphenotypes and epistatic modules in an independent dataset from Mexico genotyped with a different platform. JBASE is implemented in C++, supported on Linux and is available at http://www.cs.toronto.edu/∼goldenberg/JBASE/jbase.tar.gz. The genotype data underlying this study are available upon approval by the ethics review board of the Medical Centre Siglo XXI. Please contact Dr Miguel Cruz at mcruzl@yahoo.com for assistance with the application. anna.goldenberg@utoronto.ca Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Zeng, Ping; Mukherjee, Sayan; Zhou, Xiang
2017-01-01
Epistasis, commonly defined as the interaction between multiple genes, is an important genetic component underlying phenotypic variation. Many statistical methods have been developed to model and identify epistatic interactions between genetic variants. However, because of the large combinatorial search space of interactions, most epistasis mapping methods face enormous computational challenges and often suffer from low statistical power due to multiple test correction. Here, we present a novel, alternative strategy for mapping epistasis: instead of directly identifying individual pairwise or higher-order interactions, we focus on mapping variants that have non-zero marginal epistatic effects—the combined pairwise interaction effects between a given variant and all other variants. By testing marginal epistatic effects, we can identify candidate variants that are involved in epistasis without the need to identify the exact partners with which the variants interact, thus potentially alleviating much of the statistical and computational burden associated with standard epistatic mapping procedures. Our method is based on a variance component model, and relies on a recently developed variance component estimation method for efficient parameter inference and p-value computation. We refer to our method as the “MArginal ePIstasis Test”, or MAPIT. With simulations, we show how MAPIT can be used to estimate and test marginal epistatic effects, produce calibrated test statistics under the null, and facilitate the detection of pairwise epistatic interactions. We further illustrate the benefits of MAPIT in a QTL mapping study by analyzing the gene expression data of over 400 individuals from the GEUVADIS consortium. PMID:28746338
Genomic networks of hybrid sterility.
Turner, Leslie M; White, Michael A; Tautz, Diethard; Payseur, Bret A
2014-02-01
Hybrid dysfunction, a common feature of reproductive barriers between species, is often caused by negative epistasis between loci ("Dobzhansky-Muller incompatibilities"). The nature and complexity of hybrid incompatibilities remain poorly understood because identifying interacting loci that affect complex phenotypes is difficult. With subspecies in the early stages of speciation, an array of genetic tools, and detailed knowledge of reproductive biology, house mice (Mus musculus) provide a model system for dissecting hybrid incompatibilities. Male hybrids between M. musculus subspecies often show reduced fertility. Previous studies identified loci and several X chromosome-autosome interactions that contribute to sterility. To characterize the genetic basis of hybrid sterility in detail, we used a systems genetics approach, integrating mapping of gene expression traits with sterility phenotypes and QTL. We measured genome-wide testis expression in 305 male F2s from a cross between wild-derived inbred strains of M. musculus musculus and M. m. domesticus. We identified several thousand cis- and trans-acting QTL contributing to expression variation (eQTL). Many trans eQTL cluster into eleven 'hotspots,' seven of which co-localize with QTL for sterility phenotypes identified in the cross. The number and clustering of trans eQTL-but not cis eQTL-were substantially lower when mapping was restricted to a 'fertile' subset of mice, providing evidence that trans eQTL hotspots are related to sterility. Functional annotation of transcripts with eQTL provides insights into the biological processes disrupted by sterility loci and guides prioritization of candidate genes. Using a conditional mapping approach, we identified eQTL dependent on interactions between loci, revealing a complex system of epistasis. Our results illuminate established patterns, including the role of the X chromosome in hybrid sterility. The integrated mapping approach we employed is applicable in a broad range of organisms and we advocate for widespread adoption of a network-centered approach in speciation genetics.
Refining Susceptibility Loci of Chronic Obstructive Pulmonary Disease with Lung eqtls
Lamontagne, Maxime; Couture, Christian; Postma, Dirkje S.; Timens, Wim; Sin, Don D.; Paré, Peter D.; Hogg, James C.; Nickle, David; Laviolette, Michel; Bossé, Yohan
2013-01-01
Chronic obstructive pulmonary disease (COPD) is the fourth leading cause of mortality worldwide. Recent genome-wide association studies (GWAS) have identified robust susceptibility loci associated with COPD. However, the mechanisms mediating the risk conferred by these loci remain to be found. The goal of this study was to identify causal genes/variants within susceptibility loci associated with COPD. In the discovery cohort, genome-wide gene expression profiles of 500 non-tumor lung specimens were obtained from patients undergoing lung surgery. Blood-DNA from the same patients were genotyped for 1,2 million SNPs. Following genotyping and gene expression quality control filters, 409 samples were analyzed. Lung expression quantitative trait loci (eQTLs) were identified and overlaid onto three COPD susceptibility loci derived from GWAS; 4q31 (HHIP), 4q22 (FAM13A), and 19q13 (RAB4B, EGLN2, MIA, CYP2A6). Significant eQTLs were replicated in two independent datasets (n = 363 and 339). SNPs previously associated with COPD and lung function on 4q31 (rs1828591, rs13118928) were associated with the mRNA expression of HHIP. An association between mRNA expression level of FAM13A and SNP rs2045517 was detected at 4q22, but did not reach statistical significance. At 19q13, significant eQTLs were detected with EGLN2. In summary, this study supports HHIP, FAM13A, and EGLN2 as the most likely causal COPD genes on 4q31, 4q22, and 19q13, respectively. Strong lung eQTL SNPs identified in this study will need to be tested for association with COPD in case-control studies. Further functional studies will also be needed to understand the role of genes regulated by disease-related variants in COPD. PMID:23936167
Contribution of trans regulatory eQTL to cryptic genetic variation in C. elegans.
Snoek, Basten L; Sterken, Mark G; Bevers, Roel P J; Volkers, Rita J M; Van't Hof, Arjen; Brenchley, Rachel; Riksen, Joost A G; Cossins, Andrew; Kammenga, Jan E
2017-06-29
Cryptic genetic variation (CGV) is the hidden genetic variation that can be unlocked by perturbing normal conditions. CGV can drive the emergence of novel complex phenotypes through changes in gene expression. Although our theoretical understanding of CGV has thoroughly increased over the past decade, insight into polymorphic gene expression regulation underlying CGV is scarce. Here we investigated the transcriptional architecture of CGV in response to rapid temperature changes in the nematode Caenorhabditis elegans. We analyzed regulatory variation in gene expression (and mapped eQTL) across the course of a heat stress and recovery response in a recombinant inbred population. We measured gene expression over three temperature treatments: i) control, ii) heat stress, and iii) recovery from heat stress. Compared to control, exposure to heat stress affected the transcription of 3305 genes, whereas 942 were affected in recovering animals. These affected genes were mainly involved in metabolism and reproduction. The gene expression pattern in recovering animals resembled both the control and the heat-stress treatment. We mapped eQTL using the genetic variation of the recombinant inbred population and detected 2626 genes with an eQTL in the heat-stress treatment, 1797 in the control, and 1880 in the recovery. The cis-eQTL were highly shared across treatments. A considerable fraction of the trans-eQTL (40-57%) mapped to 19 treatment specific trans-bands. In contrast to cis-eQTL, trans-eQTL were highly environment specific and thus cryptic. Approximately 67% of the trans-eQTL were only induced in a single treatment, with heat-stress showing the most unique trans-eQTL. These results illustrate the highly dynamic pattern of CGV across three different environmental conditions that can be evoked by a stress response over a relatively short time-span (2 h) and that CGV is mainly determined by response related trans regulatory eQTL.
An eQTL Analysis of Partial Resistance to Puccinia hordei in Barley
Chen, Xinwei; Hackett, Christine A.; Niks, Rients E.; Hedley, Peter E.; Booth, Clare; Druka, Arnis; Marcel, Thierry C.; Vels, Anton; Bayer, Micha; Milne, Iain; Morris, Jenny; Ramsay, Luke; Marshall, David; Cardle, Linda; Waugh, Robbie
2010-01-01
Background Genetic resistance to barley leaf rust caused by Puccinia hordei involves both R genes and quantitative trait loci. The R genes provide higher but less durable resistance than the quantitative trait loci. Consequently, exploring quantitative or partial resistance has become a favorable alternative for controlling disease. Four quantitative trait loci for partial resistance to leaf rust have been identified in the doubled haploid Steptoe (St)/Morex (Mx) mapping population. Further investigations are required to study the molecular mechanisms underpinning partial resistance and ultimately identify the causal genes. Methodology/Principal Findings We explored partial resistance to barley leaf rust using a genetical genomics approach. We recorded RNA transcript abundance corresponding to each probe on a 15K Agilent custom barley microarray in seedlings from St and Mx and 144 doubled haploid lines of the St/Mx population. A total of 1154 and 1037 genes were, respectively, identified as being P. hordei-responsive among the St and Mx and differentially expressed between P. hordei-infected St and Mx. Normalized ratios from 72 distant-pair hybridisations were used to map the genetic determinants of variation in transcript abundance by expression quantitative trait locus (eQTL) mapping generating 15685 eQTL from 9557 genes. Correlation analysis identified 128 genes that were correlated with resistance, of which 89 had eQTL co-locating with the phenotypic quantitative trait loci (pQTL). Transcript abundance in the parents and conservation of synteny with rice allowed us to prioritise six genes as candidates for Rphq11, the pQTL of largest effect, and highlight one, a phospholipid hydroperoxide glutathione peroxidase (HvPHGPx) for detailed analysis. Conclusions/Significance The eQTL approach yielded information that led to the identification of strong candidate genes underlying pQTL for resistance to leaf rust in barley and on the general pathogen response pathway. The dataset will facilitate a systems appraisal of this host-pathogen interaction and, potentially, for other traits measured in this population. PMID:20066049
Ramakodi, Meganathan P; Devarajan, Karthik; Blackman, Elizabeth; Gibbs, Denise; Luce, Danièle; Deloumeaux, Jacqueline; Duflo, Suzy; Liu, Jeffrey C; Mehra, Ranee; Kulathinal, Rob J; Ragin, Camille C
2017-03-01
African Americans with head and neck squamous cell carcinoma (HNSCC) have a lower survival rate than whites. This study investigated the functional importance of ancestry-informative single-nucleotide polymorphisms (SNPs) in HNSCC and also examined the effect of functionally important genetic elements on racial disparities in HNSCC survival. Ancestry-informative SNPs, RNA sequencing, methylation, and copy number variation data for 316 oral cavity and laryngeal cancer patients were analyzed across 178 DNA repair genes. The results of expression quantitative trait locus (eQTL) analyses were also replicated with a Gene Expression Omnibus (GEO) data set. The effects of eQTLs on overall survival (OS) and disease-free survival (DFS) were evaluated. Five ancestry-related SNPs were identified as cis-eQTLs in the DNA polymerase β (POLB) gene (false discovery rate [FDR] < 0.01). The homozygous/heterozygous genotypes containing the African allele showed higher POLB expression than the homozygous white allele genotype (P < .001). A replication study using a GEO data set validated all 5 eQTLs and also showed a statistically significant difference in POLB expression based on genetic ancestry (P = .002). An association was observed between these eQTLs and OS (P < .037; FDR < 0.0363) as well as DFS (P = .018 to .0629; FDR < 0.079) for oral cavity and laryngeal cancer patients treated with platinum-based chemotherapy and/or radiotherapy. Genotypes containing the African allele were associated with poor OS/DFS in comparison with homozygous genotypes harboring the white allele. Analyses show that ancestry-related alleles could act as eQTLs in HNSCC and support the association of ancestry-related genetic factors with survival disparities in patients diagnosed with oral cavity and laryngeal cancer. Cancer 2017;123:849-60. © 2016 American Cancer Society. © 2016 American Cancer Society.
Genomic Networks of Hybrid Sterility
Turner, Leslie M.; White, Michael A.; Tautz, Diethard; Payseur, Bret A.
2014-01-01
Hybrid dysfunction, a common feature of reproductive barriers between species, is often caused by negative epistasis between loci (“Dobzhansky-Muller incompatibilities”). The nature and complexity of hybrid incompatibilities remain poorly understood because identifying interacting loci that affect complex phenotypes is difficult. With subspecies in the early stages of speciation, an array of genetic tools, and detailed knowledge of reproductive biology, house mice (Mus musculus) provide a model system for dissecting hybrid incompatibilities. Male hybrids between M. musculus subspecies often show reduced fertility. Previous studies identified loci and several X chromosome-autosome interactions that contribute to sterility. To characterize the genetic basis of hybrid sterility in detail, we used a systems genetics approach, integrating mapping of gene expression traits with sterility phenotypes and QTL. We measured genome-wide testis expression in 305 male F2s from a cross between wild-derived inbred strains of M. musculus musculus and M. m. domesticus. We identified several thousand cis- and trans-acting QTL contributing to expression variation (eQTL). Many trans eQTL cluster into eleven ‘hotspots,’ seven of which co-localize with QTL for sterility phenotypes identified in the cross. The number and clustering of trans eQTL—but not cis eQTL—were substantially lower when mapping was restricted to a ‘fertile’ subset of mice, providing evidence that trans eQTL hotspots are related to sterility. Functional annotation of transcripts with eQTL provides insights into the biological processes disrupted by sterility loci and guides prioritization of candidate genes. Using a conditional mapping approach, we identified eQTL dependent on interactions between loci, revealing a complex system of epistasis. Our results illuminate established patterns, including the role of the X chromosome in hybrid sterility. The integrated mapping approach we employed is applicable in a broad range of organisms and we advocate for widespread adoption of a network-centered approach in speciation genetics. PMID:24586194
Almlöf, Jonas Carlsson; Lundmark, Per; Lundmark, Anders; Ge, Bing; Maouche, Seraya; Göring, Harald H. H.; Liljedahl, Ulrika; Enström, Camilla; Brocheton, Jessy; Proust, Carole; Godefroy, Tiphaine; Sambrook, Jennifer G.; Jolley, Jennifer; Crisp-Hihn, Abigail; Foad, Nicola; Lloyd-Jones, Heather; Stephens, Jonathan; Gwilliam, Rhian; Rice, Catherine M.; Hengstenberg, Christian; Samani, Nilesh J.; Erdmann, Jeanette; Schunkert, Heribert; Pastinen, Tomi; Deloukas, Panos; Goodall, Alison H.; Ouwehand, Willem H.; Cambien, François; Syvänen, Ann-Christine
2012-01-01
A large number of genome-wide association studies have been performed during the past five years to identify associations between SNPs and human complex diseases and traits. The assignment of a functional role for the identified disease-associated SNP is not straight-forward. Genome-wide expression quantitative trait locus (eQTL) analysis is frequently used as the initial step to define a function while allele-specific gene expression (ASE) analysis has not yet gained a wide-spread use in disease mapping studies. We compared the power to identify cis-acting regulatory SNPs (cis-rSNPs) by genome-wide allele-specific gene expression (ASE) analysis with that of traditional expression quantitative trait locus (eQTL) mapping. Our study included 395 healthy blood donors for whom global gene expression profiles in circulating monocytes were determined by Illumina BeadArrays. ASE was assessed in a subset of these monocytes from 188 donors by quantitative genotyping of mRNA using a genome-wide panel of SNP markers. The performance of the two methods for detecting cis-rSNPs was evaluated by comparing associations between SNP genotypes and gene expression levels in sample sets of varying size. We found that up to 8-fold more samples are required for eQTL mapping to reach the same statistical power as that obtained by ASE analysis for the same rSNPs. The performance of ASE is insensitive to SNPs with low minor allele frequencies and detects a larger number of significantly associated rSNPs using the same sample size as eQTL mapping. An unequivocal conclusion from our comparison is that ASE analysis is more sensitive for detecting cis-rSNPs than standard eQTL mapping. Our study shows the potential of ASE mapping in tissue samples and primary cells which are difficult to obtain in large numbers. PMID:23300628
Komatsu, Masanori; Nishino, Kagetomo; Fujimori, Yuki; Haga, Yasutoshi; Iwama, Nagako; Arakawa, Aisaku; Aihara, Yoshito; Takeda, Hisato; Takahashi, Hideaki
2018-02-01
Growth hormone secretagogue receptor 1a (GHSR1a), growth hormone (GH), growth hormone receptor (GHR), non-SMC condensin I complex, subunit G (NCAPG) and stearoyl-CoA desaturase (SCD), are known to play important roles in growth and lipid metabolisms. Single and epistatic effects of the five genes on carcass, price-related and fatty acid (FA) composition traits were analyzed in a commercial Japanese Black cattle population of Ibaraki Prefecture. A total of 650 steers and 116 heifers for carcass and price-related traits, and 158 steers for FA composition traits were used in this study. Epistatic effects between pairs of the five genes were found in several traits. Alleles showing strain-specific differences in the five genes had significant single and epistatic effects in some traits. The data suggest that a TG-repeat polymorphism of the GHSR1a.5'UTR-(TG) n locus plays a central role in gene-gene epistatic interaction of FA composition traits in the adipose tissue of Japanese Black cattle. © 2017 Japanese Society of Animal Science.
A novel eQTL-based analysis reveals the biology of breast cancer risk loci
Li, Qiyuan; Seo, Ji-Heui; Stranger, Barbara; McKenna, Aaron; Pe'er, Itsik; LaFramboise, Thomas; Brown, Myles; Tyekucheva, Svitlana; Freedman, Matthew L.
2014-01-01
Summary Germline determinants of gene expression in tumors are less studied due to the complexity of transcript regulation caused by somatically acquired alterations. We performed expression quantitative trait locus (eQTL) based analyses using the multi-level information provided in The Cancer Genome Atlas (TCGA). Of the factors we measured, cis-acting eQTL saccounted for 1.2% of the total variation of tumor gene expression, while somatic copy number alteration and CpG methylation accounted for 7.3% and 3.3%, respectively. eQTL analyses of 15 previously reported breast cancer risk loci resulted in discovery of three variants that are significantly associated with transcript levels (FDR<0.1). In a novel trans- based analysis, an additional three risk loci were identified to act through ESR1, MYC, and KLF4. These findings provide a more comprehensive picture of gene expression determinants in breast cancer as well as insights into the underlying biology of breast cancer risk loci. PMID:23374354
Wei, Wen-Hua; Loh, Chia-Yin; Worthington, Jane; Eyre, Stephen
2016-05-01
Studying statistical gene-gene interactions (epistasis) has been limited by the difficulties in performance, both statistically and computationally, in large enough sample numbers to gain sufficient power. Three large Immunochip datasets from cohort samples recruited in the United Kingdom, United States, and Sweden with European ancestry were used to examine epistasis in rheumatoid arthritis (RA). A full pairwise search was conducted in the UK cohort using a high-throughput tool and the resultant significant epistatic signals were tested for replication in the United States and Swedish cohorts. A forward selection approach was applied to remove redundant signals, while conditioning on the preidentified additive effects. We detected abundant genome-wide significant (p < 1.0e-13) epistatic signals, all within the MHC region. These signals were reduced substantially, but a proportion remained significant (p < 1.0e-03) in conditional tests. We identified 11 independent epistatic interactions across the entire MHC, each explaining on average 0.12% of the phenotypic variance, nearly all replicated in both replication cohorts. We also identified non-MHC epistatic interactions between RA susceptible loci LOC100506023 and IRF5 with Immunochip-wide significance (p < 1.1e-08) and between 2 neighboring single-nucleotide polymorphism near PTPN22 that were in low linkage disequilibrium with independent interaction (p < 1.0e-05). Both non-MHC epistatic interactions were statistically replicated with a similar interaction pattern in the US cohort only. There are multiple but relatively weak interactions independent of the additive effects in RA and a larger sample number is required to confidently assign additional non-MHC epistasis.
Fang, Hai; Knezevic, Bogdan; Burnham, Katie L; Knight, Julian C
2016-12-13
Biological interpretation of genomic summary data such as those resulting from genome-wide association studies (GWAS) and expression quantitative trait loci (eQTL) studies is one of the major bottlenecks in medical genomics research, calling for efficient and integrative tools to resolve this problem. We introduce eXploring Genomic Relations (XGR), an open source tool designed for enhanced interpretation of genomic summary data enabling downstream knowledge discovery. Targeting users of varying computational skills, XGR utilises prior biological knowledge and relationships in a highly integrated but easily accessible way to make user-input genomic summary datasets more interpretable. We show how by incorporating ontology, annotation, and systems biology network-driven approaches, XGR generates more informative results than conventional analyses. We apply XGR to GWAS and eQTL summary data to explore the genomic landscape of the activated innate immune response and common immunological diseases. We provide genomic evidence for a disease taxonomy supporting the concept of a disease spectrum from autoimmune to autoinflammatory disorders. We also show how XGR can define SNP-modulated gene networks and pathways that are shared and distinct between diseases, how it achieves functional, phenotypic and epigenomic annotations of genes and variants, and how it enables exploring annotation-based relationships between genetic variants. XGR provides a single integrated solution to enhance interpretation of genomic summary data for downstream biological discovery. XGR is released as both an R package and a web-app, freely available at http://galahad.well.ox.ac.uk/XGR .
Newton, Timothy; Allison, Rachel; Edgar, James R; Lumb, Jennifer H; Rodger, Catherine E; Manna, Paul T; Rizo, Tania; Kohl, Zacharias; Nygren, Anders O H; Arning, Larissa; Schüle, Rebecca; Depienne, Christel; Goldberg, Lisa; Frahm, Christiane; Stevanin, Giovanni; Durr, Alexandra; Schöls, Ludger; Winner, Beate; Beetz, Christian; Reid, Evan
2018-05-01
Many genetic neurological disorders exhibit variable expression within affected families, often exemplified by variations in disease age at onset. Epistatic effects (i.e. effects of modifier genes on the disease gene) may underlie this variation, but the mechanistic basis for such epistatic interactions is rarely understood. Here we report a novel epistatic interaction between SPAST and the contiguous gene DPY30, which modifies age at onset in hereditary spastic paraplegia, a genetic axonopathy. We found that patients with hereditary spastic paraplegia caused by genomic deletions of SPAST that extended into DPY30 had a significantly younger age at onset. We show that, like spastin, the protein encoded by SPAST, the DPY30 protein controls endosomal tubule fission, traffic of mannose 6-phosphate receptors from endosomes to the Golgi, and lysosomal ultrastructural morphology. We propose that additive effects on this pathway explain the reduced age at onset of hereditary spastic paraplegia in patients who are haploinsufficient for both genes.
Collins, Ryan L; Hu, Ting; Wejse, Christian; Sirugo, Giorgio; Williams, Scott M; Moore, Jason H
2013-02-18
Identifying high-order genetics associations with non-additive (i.e. epistatic) effects in population-based studies of common human diseases is a computational challenge. Multifactor dimensionality reduction (MDR) is a machine learning method that was designed specifically for this problem. The goal of the present study was to apply MDR to mining high-order epistatic interactions in a population-based genetic study of tuberculosis (TB). The study used a previously published data set consisting of 19 candidate single-nucleotide polymorphisms (SNPs) in 321 pulmonary TB cases and 347 healthy controls from Guniea-Bissau in Africa. The ReliefF algorithm was applied first to generate a smaller set of the five most informative SNPs. MDR with 10-fold cross-validation was then applied to look at all possible combinations of two, three, four and five SNPs. The MDR model with the best testing accuracy (TA) consisted of SNPs rs2305619, rs187084, and rs11465421 (TA = 0.588) in PTX3, TLR9 and DC-Sign, respectively. A general 1000-fold permutation test of the null hypothesis of no association confirmed the statistical significance of the model (p = 0.008). An additional 1000-fold permutation test designed specifically to test the linear null hypothesis that the association effects are only additive confirmed the presence of non-additive (i.e. nonlinear) or epistatic effects (p = 0.013). An independent information-gain measure corroborated these results with a third-order epistatic interaction that was stronger than any lower-order associations. We have identified statistically significant evidence for a three-way epistatic interaction that is associated with susceptibility to TB. This interaction is stronger than any previously described one-way or two-way associations. This study highlights the importance of using machine learning methods that are designed to embrace, rather than ignore, the complexity of common diseases such as TB. We recommend future studies of the genetics of TB take into account the possibility that high-order epistatic interactions might play an important role in disease susceptibility.
2015-01-01
Background Over the past 50,000 years, shifts in human-environmental or human-human interactions shaped genetic differences within and among human populations, including variants under positive selection. Shaped by environmental factors, such variants influence the genetics of modern health, disease, and treatment outcome. Because evolutionary processes tend to act on gene regulation, we test whether regulatory variants are under positive selection. We introduce a new approach to enhance detection of genetic markers undergoing positive selection, using conditional entropy to capture recent local selection signals. Results We use conditional logistic regression to compare our Adjusted Haplotype Conditional Entropy (H|H) measure of positive selection to existing positive selection measures. H|H and existing measures were applied to published regulatory variants acting in cis (cis-eQTLs), with conditional logistic regression testing whether regulatory variants undergo stronger positive selection than the surrounding gene. These cis-eQTLs were drawn from six independent studies of genotype and RNA expression. The conditional logistic regression shows that, overall, H|H is substantially more powerful than existing positive-selection methods in identifying cis-eQTLs against other Single Nucleotide Polymorphisms (SNPs) in the same genes. When broken down by Gene Ontology, H|H predictions are particularly strong in some biological process categories, where regulatory variants are under strong positive selection compared to the bulk of the gene, distinct from those GO categories under overall positive selection. . However, cis-eQTLs in a second group of genes lack positive selection signatures detectable by H|H, consistent with ancient short haplotypes compared to the surrounding gene (for example, in innate immunity GO:0042742); under such other modes of selection, H|H would not be expected to be a strong predictor.. These conditional logistic regression models are adjusted for Minor allele frequency(MAF); otherwise, ascertainment bias is a huge factor in all eQTL data sets. Relationships between Gene Ontology categories, positive selection and eQTL specificity were replicated with H|H in a single larger data set. Our measure, Adjusted Haplotype Conditional Entropy (H|H), was essential in generating all of the results above because it: 1) is a stronger overall predictor for eQTLs than comparable existing approaches, and 2) shows low sequential auto-correlation, overcoming problems with convergence of these conditional regression statistical models. Conclusions Our new method, H|H, provides a consistently more robust signal associated with cis-eQTLs compared to existing methods. We interpret this to indicate that some cis-eQTLs are under positive selection compared to their surrounding genes. Conditional entropy indicative of a selective sweep is an especially strong predictor of eQTLs for genes in several biological processes of medical interest. Where conditional entropy is a weak or negative predictor of eQTLs, such as innate immune genes, this would be consistent with balancing selection acting on such eQTLs over long time periods. Different measures of selection may be needed for variant prioritization under other modes of evolutionary selection. PMID:26111110
Ameratunga, Rohan; Koopmans, Wikke; Woon, See-Tarn; Leung, Euphemia; Lehnert, Klaus; Slade, Charlotte A; Tempany, Jessica C; Enders, Anselm; Steele, Richard; Browett, Peter; Hodgkin, Philip D; Bryant, Vanessa L
2017-01-01
Common variable immunodeficiency disorders (CVID) are a group of primary immunodeficiencies where monogenetic causes account for only a fraction of cases. On this evidence, CVID is potentially polygenic and epistatic although there are, as yet, no examples to support this hypothesis. We have identified a non-consanguineous family, who carry the C104R (c.310T>C) mutation of the Transmembrane Activator Calcium-modulator and cyclophilin ligand Interactor (TACI, TNFRSF13B) gene. Variants in TNFRSF13B/TACI are identified in up to 10% of CVID patients, and are associated with, but not solely causative of CVID. The proband is heterozygous for the TNFRSF13B/TACI C104R mutation and meets the Ameratunga et al. diagnostic criteria for CVID and the American College of Rheumatology criteria for systemic lupus erythematosus (SLE). Her son has type 1 diabetes, arthritis, reduced IgG levels and IgA deficiency, but has not inherited the TNFRSF13B/TACI mutation. Her brother, homozygous for the TNFRSF13B/TACI mutation, is in good health despite profound hypogammaglobulinemia and mild cytopenias. We hypothesised that a second unidentified mutation contributed to the symptomatic phenotype of the proband and her son. Whole-exome sequencing of the family revealed a de novo nonsense mutation (T168fsX191) in the Transcription Factor 3 (TCF3) gene encoding the E2A transcription factors, present only in the proband and her son. We demonstrate mutations of TNFRSF13B/TACI impair immunoglobulin isotype switching and antibody production predominantly via T-cell-independent signalling, while mutations of TCF3 impair both T-cell-dependent and -independent pathways of B-cell activation and differentiation. We conclude that epistatic interactions between mutations of the TNFRSF13B/TACI and TCF3 signalling networks lead to the severe CVID-like disorder and SLE in the proband. PMID:29114388
Wang, D; Salah El-Basyoni, I; Stephen Baenziger, P; Crossa, J; Eskridge, K M; Dweikat, I
2012-11-01
Though epistasis has long been postulated to have a critical role in genetic regulation of important pathways as well as provide a major source of variation in the process of speciation, the importance of epistasis for genomic selection in the context of plant breeding is still being debated. In this paper, we report the results on the prediction of genetic values with epistatic effects for 280 accessions in the Nebraska Wheat Breeding Program using adaptive mixed least absolute shrinkage and selection operator (LASSO). The development of adaptive mixed LASSO, originally designed for association mapping, for the context of genomic selection is reported. The results show that adaptive mixed LASSO can be successfully applied to the prediction of genetic values while incorporating both marker main effects and epistatic effects. Especially, the prediction accuracy is substantially improved by the inclusion of two-locus epistatic effects (more than onefold in some cases as measured by cross-validation correlation coefficient), which is observed for multiple traits and planting locations. This points to significant potential in using non-additive genetic effects for genomic selection in crop breeding practices.
Transcriptional risk scores link GWAS to eQTLs and predict complications in Crohn's disease.
Marigorta, Urko M; Denson, Lee A; Hyams, Jeffrey S; Mondal, Kajari; Prince, Jarod; Walters, Thomas D; Griffiths, Anne; Noe, Joshua D; Crandall, Wallace V; Rosh, Joel R; Mack, David R; Kellermayer, Richard; Heyman, Melvin B; Baker, Susan S; Stephens, Michael C; Baldassano, Robert N; Markowitz, James F; Kim, Mi-Ok; Dubinsky, Marla C; Cho, Judy; Aronow, Bruce J; Kugathasan, Subra; Gibson, Greg
2017-10-01
Gene expression profiling can be used to uncover the mechanisms by which loci identified through genome-wide association studies (GWAS) contribute to pathology. Given that most GWAS hits are in putative regulatory regions and transcript abundance is physiologically closer to the phenotype of interest, we hypothesized that summation of risk-allele-associated gene expression, namely a transcriptional risk score (TRS), should provide accurate estimates of disease risk. We integrate summary-level GWAS and expression quantitative trait locus (eQTL) data with RNA-seq data from the RISK study, an inception cohort of pediatric Crohn's disease. We show that TRSs based on genes regulated by variants linked to inflammatory bowel disease (IBD) not only outperform genetic risk scores (GRSs) in distinguishing Crohn's disease from healthy samples, but also serve to identify patients who in time will progress to complicated disease. Our dissection of eQTL effects may be used to distinguish genes whose association with disease is through promotion versus protection, thereby linking statistical association to biological mechanism. The TRS approach constitutes a potential strategy for personalized medicine that enhances inference from static genotypic risk assessment.
2014-01-01
Expression quantitative trait loci (eQTL) mapping is a tool that can systematically identify genetic variation affecting gene expression. eQTL mapping studies have shown that certain genomic locations, referred to as regulatory hotspots, may affect the expression levels of many genes. Recently, studies have shown that various confounding factors may induce spurious regulatory hotspots. Here, we introduce a novel statistical method that effectively eliminates spurious hotspots while retaining genuine hotspots. Applied to simulated and real datasets, we validate that our method achieves greater sensitivity while retaining low false discovery rates compared to previous methods. PMID:24708878
ITGB5 and AGFG1 variants are associated with severity of airway responsiveness.
Himes, Blanca E; Qiu, Weiliang; Klanderman, Barbara; Ziniti, John; Senter-Sylvia, Jody; Szefler, Stanley J; Lemanske, Robert F; Zeiger, Robert S; Strunk, Robert C; Martinez, Fernando D; Boushey, Homer; Chinchilli, Vernon M; Israel, Elliot; Mauger, David; Koppelman, Gerard H; Nieuwenhuis, Maartje A E; Postma, Dirkje S; Vonk, Judith M; Rafaels, Nicholas; Hansel, Nadia N; Barnes, Kathleen; Raby, Benjamin; Tantisira, Kelan G; Weiss, Scott T
2013-08-28
Airway hyperresponsiveness (AHR), a primary characteristic of asthma, involves increased airway smooth muscle contractility in response to certain exposures. We sought to determine whether common genetic variants were associated with AHR severity. A genome-wide association study (GWAS) of AHR, quantified as the natural log of the dosage of methacholine causing a 20% drop in FEV1, was performed with 994 non-Hispanic white asthmatic subjects from three drug clinical trials: CAMP, CARE, and ACRN. Genotyping was performed on Affymetrix 6.0 arrays, and imputed data based on HapMap Phase 2, was used to measure the association of SNPs with AHR using a linear regression model. Replication of primary findings was attempted in 650 white subjects from DAG, and 3,354 white subjects from LHS. Evidence that the top SNPs were eQTL of their respective genes was sought using expression data available for 419 white CAMP subjects. The top primary GWAS associations were in rs848788 (P-value 7.2E-07) and rs6731443 (P-value 2.5E-06), located within the ITGB5 and AGFG1 genes, respectively. The AGFG1 result replicated at a nominally significant level in one independent population (LHS P-value 0.012), and the SNP had a nominally significant unadjusted P-value (0.0067) for being an eQTL of AGFG1. Based on current knowledge of ITGB5 and AGFG1, our results suggest that variants within these genes may be involved in modulating AHR. Future functional studies are required to confirm that our associations represent true biologically significant findings.
Farber, Charles R; van Nas, Atila; Ghazalpour, Anatole; Aten, Jason E; Doss, Sudheer; Sos, Brandon; Schadt, Eric E; Ingram-Drake, Leslie; Davis, Richard C; Horvath, Steve; Smith, Desmond J; Drake, Thomas A; Lusis, Aldons J
2009-01-01
Numerous quantitative trait loci (QTLs) affecting bone traits have been identified in the mouse; however, few of the underlying genes have been discovered. To improve the process of transitioning from QTL to gene, we describe an integrative genetics approach, which combines linkage analysis, expression QTL (eQTL) mapping, causality modeling, and genetic association in outbred mice. In C57BL/6J × C3H/HeJ (BXH) F2 mice, nine QTLs regulating femoral BMD were identified. To select candidate genes from within each QTL region, microarray gene expression profiles from individual F2 mice were used to identify 148 genes whose expression was correlated with BMD and regulated by local eQTLs. Many of the genes that were the most highly correlated with BMD have been previously shown to modulate bone mass or skeletal development. Candidates were further prioritized by determining whether their expression was predicted to underlie variation in BMD. Using network edge orienting (NEO), a causality modeling algorithm, 18 of the 148 candidates were predicted to be causally related to differences in BMD. To fine-map QTLs, markers in outbred MF1 mice were tested for association with BMD. Three chromosome 11 SNPs were identified that were associated with BMD within the Bmd11 QTL. Finally, our approach provides strong support for Wnt9a, Rasd1, or both underlying Bmd11. Integration of multiple genetic and genomic data sets can substantially improve the efficiency of QTL fine-mapping and candidate gene identification. PMID:18767929
ADP ribosyl-cyclases (CD38/CD157), social skills and friendship.
Chong, Anne; Malavasi, Fabio; Israel, Salomon; Khor, Chiea Chuen; Yap, Von Bing; Monakhov, Mikhail; Chew, Soo Hong; Lai, Poh San; Ebstein, Richard P
2017-04-01
Why some individuals seek social engagement while others shy away has profound implications for normal and pathological human behavior. Evidence suggests that oxytocin (OT), the paramount human social hormone, and CD38 that governs OT release, contribute to individual differences in social skills from intense social involvement to extreme avoidance that characterize autism. To explore the neurochemical underpinnings of sociality, CD38 expression of peripheral blood leukocytes (PBL) was measured in Han Chinese undergraduates. First, CD38 mRNA levels were correlated with lower Autism Quotient (AQ), indicating enhanced social skills. AQ assesses the extent of autistic-like traits including the propensity and dexterity needed for successful social engagement in the general population. Second, three CD157 eQTL SNPs in the CD38/CD157 gene region were associated with CD38 expression. CD157 is a paralogue of CD38 and is contiguous with it on chromosome 4p15. Third, association was also observed between the CD157 eQTL SNPs, CD38 expression and AQ. In the full model, CD38 expression and CD157 eQTL SNPs altogether account for a substantial 14% of the variance in sociality. Fourth, functionality of CD157 eQTL SNPs was suggested by a significant association with plasma oxytocin immunoreactivity products. Fifth, the ecological validity of these findings was demonstrated with subjects with higher PBL CD38 expression having more friends, especially for males. Furthermore, CD157 sequence variation predicts scores on the Friendship questionnaire. To summarize, this study by uniquely leveraging various measures reveals salient elements contributing to nonkin sociality and friendship, revealing a likely pathway underpinning the transition from normality to psychopathology. Copyright © 2017 Elsevier Ltd. All rights reserved.
Aberrant Gene Expression in Humans
Yang, Ence; Ji, Guoli; Brinkmeyer-Langford, Candice L.; Cai, James J.
2015-01-01
Gene expression as an intermediate molecular phenotype has been a focus of research interest. In particular, studies of expression quantitative trait loci (eQTL) have offered promise for understanding gene regulation through the discovery of genetic variants that explain variation in gene expression levels. Existing eQTL methods are designed for assessing the effects of common variants, but not rare variants. Here, we address the problem by establishing a novel analytical framework for evaluating the effects of rare or private variants on gene expression. Our method starts from the identification of outlier individuals that show markedly different gene expression from the majority of a population, and then reveals the contributions of private SNPs to the aberrant gene expression in these outliers. Using population-scale mRNA sequencing data, we identify outlier individuals using a multivariate approach. We find that outlier individuals are more readily detected with respect to gene sets that include genes involved in cellular regulation and signal transduction, and less likely to be detected with respect to the gene sets with genes involved in metabolic pathways and other fundamental molecular functions. Analysis of polymorphic data suggests that private SNPs of outlier individuals are enriched in the enhancer and promoter regions of corresponding aberrantly-expressed genes, suggesting a specific regulatory role of private SNPs, while the commonly-occurring regulatory genetic variants (i.e., eQTL SNPs) show little evidence of involvement. Additional data suggest that non-genetic factors may also underlie aberrant gene expression. Taken together, our findings advance a novel viewpoint relevant to situations wherein common eQTLs fail to predict gene expression when heritable, rare inter-individual variation exists. The analytical framework we describe, taking into consideration the reality of differential phenotypic robustness, may be valuable for investigating complex traits and conditions. PMID:25617623
Decomposing genomic variance using information from GWA, GWE and eQTL analysis.
Ehsani, A; Janss, L; Pomp, D; Sørensen, P
2016-04-01
A commonly used procedure in genome-wide association (GWA), genome-wide expression (GWE) and expression quantitative trait locus (eQTL) analyses is based on a bottom-up experimental approach that attempts to individually associate molecular variants with complex traits. Top-down modeling of the entire set of genomic data and partitioning of the overall variance into subcomponents may provide further insight into the genetic basis of complex traits. To test this approach, we performed a whole-genome variance components analysis and partitioned the genomic variance using information from GWA, GWE and eQTL analyses of growth-related traits in a mouse F2 population. We characterized the mouse trait genetic architecture by ordering single nucleotide polymorphisms (SNPs) based on their P-values and studying the areas under the curve (AUCs). The observed traits were found to have a genomic variance profile that differed significantly from that expected of a trait under an infinitesimal model. This situation was particularly true for both body weight and body fat, for which the AUCs were much higher compared with that of glucose. In addition, SNPs with a high degree of trait-specific regulatory potential (SNPs associated with subset of transcripts that significantly associated with a specific trait) explained a larger proportion of the genomic variance than did SNPs with high overall regulatory potential (SNPs associated with transcripts using traditional eQTL analysis). We introduced AUC measures of genomic variance profiles that can be used to quantify relative importance of SNPs as well as degree of deviation of a trait's inheritance from an infinitesimal model. The shape of the curve aids global understanding of traits: The steeper the left-hand side of the curve, the fewer the number of SNPs controlling most of the phenotypic variance. © 2015 Stichting International Foundation for Animal Genetics.
Integrative eQTL analysis of tumor and host omics data in individuals with bladder cancer.
Pineda, Silvia; Van Steen, Kristel; Malats, Núria
2017-09-01
Integrative analyses of several omics data are emerging. The data are usually generated from the same source material (i.e., tumor sample) representing one level of regulation. However, integrating different regulatory levels (i.e., blood) with those from tumor may also reveal important knowledge about the human genetic architecture. To model this multilevel structure, an integrative-expression quantitative trait loci (eQTL) analysis applying two-stage regression (2SR) was proposed. This approach first regressed tumor gene expression levels with tumor markers and the adjusted residuals from the previous model were then regressed with the germline genotypes measured in blood. Previously, we demonstrated that penalized regression methods in combination with a permutation-based MaxT method (Global-LASSO) is a promising tool to fix some of the challenges that high-throughput omics data analysis imposes. Here, we assessed whether Global-LASSO can also be applied when tumor and blood omics data are integrated. We further compared our strategy with two 2SR-approaches, one using multiple linear regression (2SR-MLR) and other using LASSO (2SR-LASSO). We applied the three models to integrate genomic, epigenomic, and transcriptomic data from tumor tissue with blood germline genotypes from 181 individuals with bladder cancer included in the TCGA Consortium. Global-LASSO provided a larger list of eQTLs than the 2SR methods, identified a previously reported eQTLs in prostate stem cell antigen (PSCA), and provided further clues on the complexity of APBEC3B loci, with a minimal false-positive rate not achieved by 2SR-MLR. It also represents an important contribution for omics integrative analysis because it is easy to apply and adaptable to any type of data. © 2017 WILEY PERIODICALS, INC.
Regulation of growth-defense balance by the JASMONATE ZIM-DOMAIN (JAZ)-MYC transcriptional module
DOE Office of Scientific and Technical Information (OSTI.GOV)
Major, Ian T.; Yoshida, Yuki; Campos, Marcelo L.
The plant hormone jasmonate (JA) promotes the degradation of JASMONATE ZIM-DOMAIN (JAZ) proteins to relieve repression on diverse transcription factors (TFs) that execute JA responses. However, little is known about how combinatorial complexity among JAZ–TF interactions maintains control over myriad aspects of growth, development, reproduction, and immunity. We used loss-of-function mutations to define epistatic interactions within the core JA signaling pathway and to investigate the contribution of MYC TFs to JA responses in Arabidopsis thaliana. Constitutive JA signaling in a jaz quintuple mutant (jazQ) was largely eliminated by mutations that block JA synthesis or perception. Comparison of jazQ and amore » jazQ myc2 myc3 myc4 octuple mutant validated known functions of MYC2/3/4 in root growth, chlorophyll degradation,and susceptibility to the pathogen Pseudomonas syringae. We found that MYC TFs also control both the enhanced resistance of jazQ leaves to insect herbivory and restricted leaf growth of jazQ. Epistatic transcriptional profiles mirrored these phenotypes and further showed that triterpenoid biosynthetic and glucosinolate catabolic genes are up-regulated in jazQ independently of MYC TFs. Lastly, our study highlights the utility of genetic epistasis to unravel the complexities of JAZ–TF interactions and demonstrates that MYC TFs exert master control over a JAZ-repressible transcriptional hierarchy that governs growth–defense balance.« less
Regulation of growth-defense balance by the JASMONATE ZIM-DOMAIN (JAZ)-MYC transcriptional module
Major, Ian T.; Yoshida, Yuki; Campos, Marcelo L.; ...
2017-06-26
The plant hormone jasmonate (JA) promotes the degradation of JASMONATE ZIM-DOMAIN (JAZ) proteins to relieve repression on diverse transcription factors (TFs) that execute JA responses. However, little is known about how combinatorial complexity among JAZ–TF interactions maintains control over myriad aspects of growth, development, reproduction, and immunity. We used loss-of-function mutations to define epistatic interactions within the core JA signaling pathway and to investigate the contribution of MYC TFs to JA responses in Arabidopsis thaliana. Constitutive JA signaling in a jaz quintuple mutant (jazQ) was largely eliminated by mutations that block JA synthesis or perception. Comparison of jazQ and amore » jazQ myc2 myc3 myc4 octuple mutant validated known functions of MYC2/3/4 in root growth, chlorophyll degradation,and susceptibility to the pathogen Pseudomonas syringae. We found that MYC TFs also control both the enhanced resistance of jazQ leaves to insect herbivory and restricted leaf growth of jazQ. Epistatic transcriptional profiles mirrored these phenotypes and further showed that triterpenoid biosynthetic and glucosinolate catabolic genes are up-regulated in jazQ independently of MYC TFs. Lastly, our study highlights the utility of genetic epistasis to unravel the complexities of JAZ–TF interactions and demonstrates that MYC TFs exert master control over a JAZ-repressible transcriptional hierarchy that governs growth–defense balance.« less
Kothandapani, Anbarasi; Sawant, Akshada; Dangeti, Venkata Srinivas Mohan Nimai; Sobol, Robert W.; Patrick, Steve M.
2013-01-01
Base excision repair (BER) and mismatch repair (MMR) pathways play an important role in modulating cis-Diamminedichloroplatinum (II) (cisplatin) cytotoxicity. In this article, we identified a novel mechanistic role of both BER and MMR pathways in mediating cellular responses to cisplatin treatment. Cells defective in BER or MMR display a cisplatin-resistant phenotype. Targeting both BER and MMR pathways resulted in no additional resistance to cisplatin, suggesting that BER and MMR play epistatic roles in mediating cisplatin cytotoxicity. Using a DNA Polymerase β (Polβ) variant deficient in polymerase activity (D256A), we demonstrate that MMR acts downstream of BER and is dependent on the polymerase activity of Polβ in mediating cisplatin cytotoxicity. MSH2 preferentially binds a cisplatin interstrand cross-link (ICL) DNA substrate containing a mismatch compared with a cisplatin ICL substrate without a mismatch, suggesting a novel mutagenic role of Polβ in activating MMR in response to cisplatin. Collectively, these results provide the first mechanistic model for BER and MMR functioning within the same pathway to mediate cisplatin sensitivity via non-productive ICL processing. In this model, MMR participation in non-productive cisplatin ICL processing is downstream of BER processing and dependent on Polβ misincorporation at cisplatin ICL sites, which results in persistent cisplatin ICLs and sensitivity to cisplatin. PMID:23761438
Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B; Dimas, Antigone S; Gutierrez-Arcelus, Maria; Stranger, Barbara E; Deloukas, Panos; Dermitzakis, Emmanouil T
2010-10-01
Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. http://www.sanger.ac.uk/resources/software/genevar.
eQTL networks unveil enriched mRNA master integrators downstream of complex disease-associated SNPs.
Li, Haiquan; Pouladi, Nima; Achour, Ikbel; Gardeux, Vincent; Li, Jianrong; Li, Qike; Zhang, Hao Helen; Martinez, Fernando D; 'Skip' Garcia, Joe G N; Lussier, Yves A
2015-12-01
The causal and interplay mechanisms of Single Nucleotide Polymorphisms (SNPs) associated with complex diseases (complex disease SNPs) investigated in genome-wide association studies (GWAS) at the transcriptional level (mRNA) are poorly understood despite recent advancements such as discoveries reported in the Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTex). Protein interaction network analyses have successfully improved our understanding of both single gene diseases (Mendelian diseases) and complex diseases. Whether the mRNAs downstream of complex disease genes are central or peripheral in the genetic information flow relating DNA to mRNA remains unclear and may be disease-specific. Using expression Quantitative Trait Loci (eQTL) that provide DNA to mRNA associations and network centrality metrics, we hypothesize that we can unveil the systems properties of information flow between SNPs and the transcriptomes of complex diseases. We compare different conditions such as naïve SNP assignments and stringent linkage disequilibrium (LD) free assignments for transcripts to remove confounders from LD. Additionally, we compare the results from eQTL networks between lymphoblastoid cell lines and liver tissue. Empirical permutation resampling (p<0.001) and theoretic Mann-Whitney U test (p<10(-30)) statistics indicate that mRNAs corresponding to complex disease SNPs via eQTL associations are likely to be regulated by a larger number of SNPs than expected. We name this novel property mRNA hubness in eQTL networks, and further term mRNAs with high hubness as master integrators. mRNA master integrators receive and coordinate the perturbation signals from large numbers of polymorphisms and respond to the personal genetic architecture integratively. This genetic signal integration contrasts with the mechanism underlying some Mendelian diseases, where a genetic polymorphism affecting a single protein hub produces a divergent signal that affects a large number of downstream proteins. Indeed, we verify that this property is independent of the hubness in protein networks for which these mRNAs are transcribed. Our findings provide novel insights into the pleiotropy of mRNAs targeted by complex disease polymorphisms and the architecture of the information flow between the genetic polymorphisms and transcriptomes of complex diseases. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Molecular mechanisms underlying variations in lung function: a systems genetics analysis
Obeidat, Ma’en; Hao, Ke; Bossé, Yohan; Nickle, David C; Nie, Yunlong; Postma, Dirkje S; Laviolette, Michel; Sandford, Andrew J; Daley, Denise D; Hogg, James C; Elliott, W Mark; Fishbane, Nick; Timens, Wim; Hysi, Pirro G; Kaprio, Jaakko; Wilson, James F; Hui, Jennie; Rawal, Rajesh; Schulz, Holger; Stubbe, Beate; Hayward, Caroline; Polasek, Ozren; Järvelin, Marjo-Riitta; Zhao, Jing Hua; Jarvis, Deborah; Kähönen, Mika; Franceschini, Nora; North, Kari E; Loth, Daan W; Brusselle, Guy G; Smith, Albert Vernon; Gudnason, Vilmundur; Bartz, Traci M; Wilk, Jemma B; O’Connor, George T; Cassano, Patricia A; Tang, Wenbo; Wain, Louise V; Artigas, María Soler; Gharib, Sina A; Strachan, David P; Sin, Don D; Tobin, Martin D; London, Stephanie J; Hall, Ian P; Paré, Peter D
2016-01-01
Summary Background Lung function measures reflect the physiological state of the lung, and are essential to the diagnosis of chronic obstructive pulmonary disease (COPD). The SpiroMeta-CHARGE consortium undertook the largest genome-wide association study (GWAS) so far (n=48 201) for forced expiratory volume in 1 s (FEV1) and the ratio of FEV1 to forced vital capacity (FEV1/FVC) in the general population. The lung expression quantitative trait loci (eQTLs) study mapped the genetic architecture of gene expression in lung tissue from 1111 individuals. We used a systems genetics approach to identify single nucleotide polymorphisms (SNPs) associated with lung function that act as eQTLs and change the level of expression of their target genes in lung tissue; termed eSNPs. Methods The SpiroMeta-CHARGE GWAS results were integrated with lung eQTLs to map eSNPs and the genes and pathways underlying the associations in lung tissue. For comparison, a similar analysis was done in peripheral blood. The lung mRNA expression levels of the eSNP-regulated genes were tested for associations with lung function measures in 727 individuals. Additional analyses identified the pleiotropic effects of eSNPs from the published GWAS catalogue, and mapped enrichment in regulatory regions from the ENCODE project. Finally, the Connectivity Map database was used to identify potential therapeutics in silico that could reverse the COPD lung tissue gene signature. Findings SNPs associated with lung function measures were more likely to be eQTLs and vice versa. The integration mapped the specific genes underlying the GWAS signals in lung tissue. The eSNP-regulated genes were enriched for developmental and inflammatory pathways; by comparison, SNPs associated with lung function that were eQTLs in blood, but not in lung, were only involved in inflammatory pathways. Lung function eSNPs were enriched for regulatory elements and were over-represented among genes showing differential expression during fetal lung development. An mRNA gene expression signature for COPD was identified in lung tissue and compared with the Connectivity Map. This in-silico drug repurposing approach suggested several compounds that reverse the COPD gene expression signature, including a nicotine receptor antagonist. These findings represent novel therapeutic pathways for COPD. Interpretation The system genetics approach identified lung tissue genes driving the variation in lung function and susceptibility to COPD. The identification of these genes and the pathways in which they are enriched is essential to understand the pathophysiology of airway obstruction and to identify novel therapeutic targets and biomarkers for COPD, including drugs that reverse the COPD gene signature in silico. Funding The research reported in this article was not specifically funded by any agency. See Acknowledgments for a full list of funders of the lung eQTL study and the Spiro-Meta CHARGE GWAS. PMID:26404118
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers
Jiang, Yong; Schmidt, Renate H.; Reif, Jochen C.
2018-01-01
Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. PMID:29549092
Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering.
Guo, Xuan; Meng, Yu; Yu, Ning; Pan, Yi
2014-04-10
Taking the advantage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions consisting of multiple-locus, which are broadly existing in complex traits. In addition, statistic tests for high order epistatic interactions with more than 2 SNPs propose computational and analytical challenges because the computation increases exponentially as the cardinality of SNPs combinations gets larger. In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare powers performance against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method on two real GWAS datasets, Age-related macular degeneration (AMD) and Rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors which are not shown up in detections of 2-loci epistatic interactions. Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running time of the cloud implementation for our method on AMD dataset and RA dataset are roughly 2 hours and 50 hours on a cluster with forty small virtual machines for detecting two-locus interactions, respectively. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS.
Li, Z K; Jiang, X L; Peng, T; Shi, C L; Han, S X; Tian, B; Zhu, Z L; Tian, J C
2014-02-28
Biomass yield is one of the most important traits for wheat (Triticum aestivum L.)-breeding programs. Increasing the yield of the aerial parts of wheat varieties will be an integral component of future wheat improvement; however, little is known regarding the genetic control of aerial part yield. A doubled haploid population, comprising 168 lines derived from a cross between two winter wheat cultivars, 'Huapei 3' (HP3) and 'Yumai 57' (YM57), was investigated. Quantitative trait loci (QTL) for total biomass yield, grain yield, and straw yield were determined for additive effects and additive x additive epistatic interactions using the QTLNetwork 2.0 software based on the mixed-linear model. Thirteen QTL were determined to have significant additive effects for the three yield traits, of which six also exhibited epistatic effects. Eleven significant additive x additive interactions were detected, of which seven occurred between QTL showing epistatic effects only, two occurred between QTL showing epistatic effects and additive effects, and two occurred between QTL with additive effects. These QTL explained 1.20 to 10.87% of the total phenotypic variation. The QTL with an allele originating from YM57 on chromosome 4B and another QTL contributed by HP3 alleles on chromosome 4D were simultaneously detected on the same or adjacent chromosome intervals for the three traits in two environments. Most of the repeatedly detected QTL across environments were not significant (P > 0.05). These results have implications for selection strategies in wheat biomass yield and for increasing the yield of the aerial part of wheat.
Cloud computing for detecting high-order genome-wide epistatic interaction via dynamic clustering
2014-01-01
Backgroud Taking the advan tage of high-throughput single nucleotide polymorphism (SNP) genotyping technology, large genome-wide association studies (GWASs) have been considered to hold promise for unravelling complex relationships between genotype and phenotype. At present, traditional single-locus-based methods are insufficient to detect interactions consisting of multiple-locus, which are broadly existing in complex traits. In addition, statistic tests for high order epistatic interactions with more than 2 SNPs propose computational and analytical challenges because the computation increases exponentially as the cardinality of SNPs combinations gets larger. Results In this paper, we provide a simple, fast and powerful method using dynamic clustering and cloud computing to detect genome-wide multi-locus epistatic interactions. We have constructed systematic experiments to compare powers performance against some recently proposed algorithms, including TEAM, SNPRuler, EDCF and BOOST. Furthermore, we have applied our method on two real GWAS datasets, Age-related macular degeneration (AMD) and Rheumatoid arthritis (RA) datasets, where we find some novel potential disease-related genetic factors which are not shown up in detections of 2-loci epistatic interactions. Conclusions Experimental results on simulated data demonstrate that our method is more powerful than some recently proposed methods on both two- and three-locus disease models. Our method has discovered many novel high-order associations that are significantly enriched in cases from two real GWAS datasets. Moreover, the running time of the cloud implementation for our method on AMD dataset and RA dataset are roughly 2 hours and 50 hours on a cluster with forty small virtual machines for detecting two-locus interactions, respectively. Therefore, we believe that our method is suitable and effective for the full-scale analysis of multiple-locus epistatic interactions in GWAS. PMID:24717145
Effects of stacked quantitative resistances to downy mildew in lettuce do not simply add up.
den Boer, Erik; Pelgrom, Koen T B; Zhang, Ningwen W; Visser, Richard G F; Niks, Rients E; Jeuken, Marieke J W
2014-08-01
In a stacking study of eight resistance QTLs in lettuce against downy mildew, only three out of ten double combinations showed an increased resistance effect under field conditions. Complete race nonspecific resistance to lettuce downy mildew, as observed for the nonhost wild lettuce species Lactuca saligna, is desired in lettuce cultivation. Genetic dissection of L. saligna's complete resistance has revealed several quantitative loci (QTL) for resistance with field infection reductions of 30-50 %. To test the effect of stacking these QTL, we analyzed interactions between homozygous L. saligna CGN05271 chromosome segments introgressed into the genetic background of L. sativa cv. Olof. Eight different backcross inbred lines (BILs) with single introgressions of 30-70 cM and selected predominately for quantitative resistance in field situations were intercrossed. Ten developed homozygous lines with stacked introgression segments (double combinations) were evaluated for resistance in the field. Seven double combinations showed a similar infection as the individual most resistant parental BIL, revealing epistatic interactions with 'less-than-additive' effects. Three double combinations showed an increased resistance level compared to their parental BILs and their interactions were additive, 'less-than-additive' epistatic and 'more-than-additive' epistatic, respectively. The additive interaction reduced field infection by 73 %. The double combination with a 'more-than-additive' epistatic effect, derived from a combination between a susceptible and a resistant BIL with 0 and 30 % infection reduction, respectively, showed an average field infection reduction of 52 %. For the latter line, an attempt to genetically dissect its underlying epistatic loci by substitution mapping did not result in smaller mapping intervals as none of the 22 substitution lines reached a similar high resistance level. Implications for breeding and the inheritance of L. saligna's complete resistance are discussed.
Haplotype-Based Genome-Wide Prediction Models Exploit Local Epistatic Interactions Among Markers.
Jiang, Yong; Schmidt, Renate H; Reif, Jochen C
2018-05-04
Genome-wide prediction approaches represent versatile tools for the analysis and prediction of complex traits. Mostly they rely on marker-based information, but scenarios have been reported in which models capitalizing on closely-linked markers that were combined into haplotypes outperformed marker-based models. Detailed comparisons were undertaken to reveal under which circumstances haplotype-based genome-wide prediction models are superior to marker-based models. Specifically, it was of interest to analyze whether and how haplotype-based models may take local epistatic effects between markers into account. Assuming that populations consisted of fully homozygous individuals, a marker-based model in which local epistatic effects inside haplotype blocks were exploited (LEGBLUP) was linearly transformable into a haplotype-based model (HGBLUP). This theoretical derivation formally revealed that haplotype-based genome-wide prediction models capitalize on local epistatic effects among markers. Simulation studies corroborated this finding. Due to its computational efficiency the HGBLUP model promises to be an interesting tool for studies in which ultra-high-density SNP data sets are studied. Applying the HGBLUP model to empirical data sets revealed higher prediction accuracies than for marker-based models for both traits studied using a mouse panel. In contrast, only a small subset of the traits analyzed in crop populations showed such a benefit. Cases in which higher prediction accuracies are observed for HGBLUP than for marker-based models are expected to be of immediate relevance for breeders, due to the tight linkage a beneficial haplotype will be preserved for many generations. In this respect the inheritance of local epistatic effects very much resembles the one of additive effects. Copyright © 2018 Jiang et al.
Hu, Xinli; Kim, Hyun; Raj, Towfique; Brennan, Patrick J; Trynka, Gosia; Teslovich, Nikola; Slowikowski, Kamil; Chen, Wei-Min; Onengut, Suna; Baecher-Allan, Clare; De Jager, Philip L; Rich, Stephen S; Stranger, Barbara E; Brenner, Michael B; Raychaudhuri, Soumya
2014-06-01
Genome-wide association studies (GWAS) and subsequent dense-genotyping of associated loci identified over a hundred single-nucleotide polymorphism (SNP) variants associated with the risk of rheumatoid arthritis (RA), type 1 diabetes (T1D), and celiac disease (CeD). Immunological and genetic studies suggest a role for CD4-positive effector memory T (CD+ TEM) cells in the pathogenesis of these diseases. To elucidate mechanisms of autoimmune disease alleles, we investigated molecular phenotypes in CD4+ effector memory T cells potentially affected by these variants. In a cohort of genotyped healthy individuals, we isolated high purity CD4+ TEM cells from peripheral blood, then assayed relative abundance, proliferation upon T cell receptor (TCR) stimulation, and the transcription of 215 genes within disease loci before and after stimulation. We identified 46 genes regulated by cis-acting expression quantitative trait loci (eQTL), the majority of which we detected in stimulated cells. Eleven of the 46 genes with eQTLs were previously undetected in peripheral blood mononuclear cells. Of 96 risk alleles of RA, T1D, and/or CeD in densely genotyped loci, eleven overlapped cis-eQTLs, of which five alleles completely explained the respective signals. A non-coding variant, rs389862A, increased proliferative response (p=4.75 × 10-8). In addition, baseline expression of seventeen genes in resting cells reliably predicted proliferative response after TCR stimulation. Strikingly, however, there was no evidence that risk alleles modulated CD4+ TEM abundance or proliferation. Our study underscores the power of examining molecular phenotypes in relevant cells and conditions for understanding pathogenic mechanisms of disease variants.
Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B.; Dimas, Antigone S.; Gutierrez-Arcelus, Maria; Stranger, Barbara E.; Deloukas, Panos; Dermitzakis, Emmanouil T.
2010-01-01
Summary: Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. Availability: http://www.sanger.ac.uk/resources/software/genevar Contact: emmanouil.dermitzakis@unige.ch PMID:20702402
Jasinska, Anna J; Zelaya, Ivette; Service, Susan K; Peterson, Christine B; Cantor, Rita M; Choi, Oi-Wa; DeYoung, Joseph; Eskin, Eleazar; Fairbanks, Lynn A; Fears, Scott; Furterer, Allison E; Huang, Yu S; Ramensky, Vasily; Schmitt, Christopher A; Svardal, Hannes; Jorgensen, Matthew J; Kaplan, Jay R; Villar, Diego; Aken, Bronwen L; Flicek, Paul; Nag, Rishi; Wong, Emily S; Blangero, John; Dyer, Thomas D; Bogomolov, Marina; Benjamini, Yoav; Weinstock, George M; Dewar, Ken; Sabatti, Chiara; Wilson, Richard K; Jentsch, J David; Warren, Wesley; Coppola, Giovanni; Woods, Roger P; Freimer, Nelson B
2017-12-01
By analyzing multitissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalog of expression quantitative trait loci (eQTLs) in a nonhuman primate model. This catalog contains more genome-wide significant eQTLs per sample than comparable human resources and identifies sex- and age-related expression patterns. Findings include a master regulatory locus that likely has a role in immune function and a locus regulating hippocampal long noncoding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders.
Wang, Ming Li; Khera, Pawan; Pandey, Manish K; Wang, Hui; Qiao, Lixian; Feng, Suping; Tonnis, Brandon; Barkley, Noelle A; Pinnow, David; Holbrook, Corley C; Culbreath, Albert K; Varshney, Rajeev K; Guo, Baozhu
2015-01-01
Peanut, a high-oil crop with about 50% oil content, is either crushed for oil or used as edible products. Fatty acid composition determines the oil quality which has high relevance to consumer health, flavor, and shelf life of commercial products. In addition to the major fatty acids, oleic acid (C18:1) and linoleic acid (C18:2) accounting for about 80% of peanut oil, the six other fatty acids namely palmitic acid (C16:0), stearic acid (C18:0), arachidic acid (C20:0), gadoleic acid (C20:1), behenic acid (C22:0), and lignoceric acid (C24:0) are accounted for the rest 20%. To determine the genetic basis and to improve further understanding on effect of FAD2 genes on these fatty acids, two recombinant inbred line (RIL) populations namely S-population (high oleic line 'SunOleic 97R' × low oleic line 'NC94022') and T-population (normal oleic line 'Tifrunner' × low oleic line 'GT-C20') were developed. Genetic maps with 206 and 378 marker loci for the S- and the T-population, respectively were used for quantitative trait locus (QTL) analysis. As a result, a total of 164 main-effect (M-QTLs) and 27 epistatic (E-QTLs) QTLs associated with the minor fatty acids were identified with 0.16% to 40.56% phenotypic variation explained (PVE). Thirty four major QTLs (>10% of PVE) mapped on five linkage groups and 28 clusters containing more than three QTLs were also identified. These results suggest that the major QTLs with large additive effects would play an important role in controlling composition of these minor fatty acids in addition to the oleic and linoleic acids in peanut oil. The interrelationship among these fatty acids should be considered while breeding for improved peanut genotypes with good oil quality and desired fatty acid composition.
Wang, Hui; Qiao, Lixian; Feng, Suping; Tonnis, Brandon; Barkley, Noelle A.; Pinnow, David; Holbrook, Corley C.; Culbreath, Albert K.; Varshney, Rajeev K.; Guo, Baozhu
2015-01-01
Peanut, a high-oil crop with about 50% oil content, is either crushed for oil or used as edible products. Fatty acid composition determines the oil quality which has high relevance to consumer health, flavor, and shelf life of commercial products. In addition to the major fatty acids, oleic acid (C18:1) and linoleic acid (C18:2) accounting for about 80% of peanut oil, the six other fatty acids namely palmitic acid (C16:0), stearic acid (C18:0), arachidic acid (C20:0), gadoleic acid (C20:1), behenic acid (C22:0), and lignoceric acid (C24:0) are accounted for the rest 20%. To determine the genetic basis and to improve further understanding on effect of FAD2 genes on these fatty acids, two recombinant inbred line (RIL) populations namely S-population (high oleic line ‘SunOleic 97R’ × low oleic line ‘NC94022’) and T-population (normal oleic line ‘Tifrunner’ × low oleic line ‘GT-C20’) were developed. Genetic maps with 206 and 378 marker loci for the S- and the T-population, respectively were used for quantitative trait locus (QTL) analysis. As a result, a total of 164 main-effect (M-QTLs) and 27 epistatic (E-QTLs) QTLs associated with the minor fatty acids were identified with 0.16% to 40.56% phenotypic variation explained (PVE). Thirty four major QTLs (>10% of PVE) mapped on five linkage groups and 28 clusters containing more than three QTLs were also identified. These results suggest that the major QTLs with large additive effects would play an important role in controlling composition of these minor fatty acids in addition to the oleic and linoleic acids in peanut oil. The interrelationship among these fatty acids should be considered while breeding for improved peanut genotypes with good oil quality and desired fatty acid composition. PMID:25849082
Glytsou, Christina; Calvo, Enrique; Cogliati, Sara; Mehrotra, Arpit; Anastasia, Irene; Rigoni, Giovanni; Raimondi, Andrea; Shintani, Norihito; Loureiro, Marta; Vazquez, Jesùs; Pellegrini, Luca; Enriquez, Jose Antonio; Scorrano, Luca; Soriano, Maria Eugenia
2016-12-13
The mitochondrial contact site and cristae organizing system (MICOS) and Optic atrophy 1 (OPA1) control cristae shape, thus affecting mitochondrial function and apoptosis. Whether and how they physically and functionally interact is unclear. Here, we provide evidence that OPA1 is epistatic to MICOS in the regulation of cristae shape. Proteomic analysis identifies multiple MICOS components in native OPA1-containing high molecular weight complexes disrupted during cristae remodeling. MIC60, a core MICOS protein, physically interacts with OPA1, and together, they control cristae junction number and stability, OPA1 being epistatic to MIC60. OPA1 defines cristae width and junction diameter independently of MIC60. Our combination of proteomics, biochemistry, genetics, and electron tomography provides a unifying model for mammalian cristae biogenesis by OPA1 and MICOS. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Stanaway, Ian B.; Gamazon, Eric R.; Smith, Joshua D.; Mirkov, Snezana; Ramirez, Jacqueline; Liu, Wanqing; Lin, Yvonne S.; Moloney, Cliona; Aldred, Shelly Force; Trinklein, Nathan D.; Schuetz, Erin; Nickerson, Deborah A.; Thummel, Ken E.; Rieder, Mark J.; Rettie, Allan E.; Ratain, Mark J.; Cox, Nancy J.; Brown, Christopher D.
2011-01-01
The discovery of expression quantitative trait loci (“eQTLs”) can help to unravel genetic contributions to complex traits. We identified genetic determinants of human liver gene expression variation using two independent collections of primary tissue profiled with Agilent (n = 206) and Illumina (n = 60) expression arrays and Illumina SNP genotyping (550K), and we also incorporated data from a published study (n = 266). We found that ∼30% of SNP-expression correlations in one study failed to replicate in either of the others, even at thresholds yielding high reproducibility in simulations, and we quantified numerous factors affecting reproducibility. Our data suggest that drug exposure, clinical descriptors, and unknown factors associated with tissue ascertainment and analysis have substantial effects on gene expression and that controlling for hidden confounding variables significantly increases replication rate. Furthermore, we found that reproducible eQTL SNPs were heavily enriched near gene starts and ends, and subsequently resequenced the promoters and 3′UTRs for 14 genes and tested the identified haplotypes using luciferase assays. For three genes, significant haplotype-specific in vitro functional differences correlated directly with expression levels, suggesting that many bona fide eQTLs result from functional variants that can be mechanistically isolated in a high-throughput fashion. Finally, given our study design, we were able to discover and validate hundreds of liver eQTLs. Many of these relate directly to complex traits for which liver-specific analyses are likely to be relevant, and we identified dozens of potential connections with disease-associated loci. These included previously characterized eQTL contributors to diabetes, drug response, and lipid levels, and they suggest novel candidates such as a role for NOD2 expression in leprosy risk and C2orf43 in prostate cancer. In general, the work presented here will be valuable for future efforts to precisely identify and functionally characterize genetic contributions to a variety of complex traits. PMID:21637794
A Systematic Survey of an Intragenic Epistatic Landscape
Bank, Claudia; Hietpas, Ryan T.; Jensen, Jeffrey D.; Bolon, Daniel N.A.
2015-01-01
Mutations are the source of evolutionary variation. The interactions of multiple mutations can have important effects on fitness and evolutionary trajectories. We have recently described the distribution of fitness effects of all single mutations for a nine-amino-acid region of yeast Hsp90 (Hsp82) implicated in substrate binding. Here, we report and discuss the distribution of intragenic epistatic effects within this region in seven Hsp90 point mutant backgrounds of neutral to slightly deleterious effect, resulting in an analysis of more than 1,000 double mutants. We find negative epistasis between substitutions to be common, and positive epistasis to be rare—resulting in a pattern that indicates a drastic change in the distribution of fitness effects one step away from the wild type. This can be well explained by a concave relationship between phenotype and genotype (i.e., a concave shape of the local fitness landscape), suggesting mutational robustness intrinsic to the local sequence space. Structural analyses indicate that, in this region, epistatic effects are most pronounced when a solvent-inaccessible position is involved in the interaction. In contrast, all 18 observations of positive epistasis involved at least one mutation at a solvent-exposed position. By combining the analysis of evolutionary and biophysical properties of an epistatic landscape, these results contribute to a more detailed understanding of the complexity of protein evolution. PMID:25371431
Jasinska, Anna J.; Zelaya, Ivette; Service, Susan K.; Peterson, Christine B.; Cantor, Rita M.; Choi, Oi-Wa; DeYoung, Joseph; Eskin, Eleazar; Fairbanks, Lynn A.; Fears, Scott; Furterer, Allison E.; Huang, Yu S.; Ramensky, Vasily; Schmitt, Christopher A.; Svardal, Hannes; Jorgensen, Matthew J.; Kaplan, Jay R.; Villar, Diego; Aken, Bronwen L.; Flicek, Paul; Nag, Rishi; Wong, Emily S.; Blangero, John; Dyer, Thomas D.; Bogomolov, Marina; Benjamini, Yoav; Weinstock, George M.; Dewar, Ken; Sabatti, Chiara; Wilson, Richard K.; Jentsch, J. David; Warren, Wesley; Coppola, Giovanni; Woods, Roger P.; Freimer, Nelson B.
2017-01-01
By analyzing multi-tissue gene expression and genome-wide genetic variation data in samples from a vervet monkey pedigree, we generated a transcriptome resource and produced the first catalogue of expression quantitative trait loci (eQTLs) in a non-human primate model. This catalogue contains more genome-wide significant eQTLs, per sample, than comparable human resources, and reveals sex and age-related expression patterns. Findings include a master regulatory locus that likely plays a role in immune function, and a locus regulating hippocampal long non-coding RNAs (lncRNAs), whose expression correlates with hippocampal volume. This resource will facilitate genetic investigation of quantitative traits, including brain and behavioral phenotypes relevant to neuropsychiatric disorders. PMID:29083405
Fadason, Tayaza; Ekblad, Cameron; Ingram, John R.; Schierding, William S.; O'Sullivan, Justin M.
2017-01-01
The mechanisms that underlie the association between obesity and type 2 diabetes are not fully understood. Here, we investigated the role of the 3D genome organization in the pathogeneses of obesity and type-2 diabetes. We interpreted the combined and differential impacts of 196 diabetes and 390 obesity associated single nucleotide polymorphisms (SNPs) by integrating data on the genes with which they physically interact (as captured by Hi-C) and the functional [i.e., expression quantitative trait loci (eQTL)] outcomes associated with these interactions. We identified 861 spatially regulated genes (e.g., AP3S2, ELP5, SVIP, IRS1, FADS2, WFS1, RBM6, HORMAD1, PYROXD2), which are enriched in tissues (e.g., adipose, skeletal muscle, pancreas) and biological processes and canonical pathways (e.g., lipid metabolism, leptin, and glucose-insulin signaling pathways) that are important for the pathogenesis of type 2 diabetes and obesity. Our discovery-based approach also identifies enrichment for eQTL SNP-gene interactions in tissues that are not classically associated with diabetes or obesity. We propose that the combinatorial action of active obesity and diabetes spatial eQTL SNPs on their gene pairs within different tissues reduces the ability of these tissues to contribute to the maintenance of a healthy energy metabolism. PMID:29081791
Genetic Architecture of Nest Building in Mice LG/J × SM/J
Sauce, Bruno; de Brito, Reinaldo Alves; Peripato, Andrea Cristina
2012-01-01
Maternal care is critical to offspring growth and survival, which is greatly improved by building an effective nest. Some suggest that genetic variation and underlying genetic effects differ between fitness-related traits and other phenotypes. We investigated the genetic architecture of a fitness-related trait, nest building, in F2 female mice intercrossed from inbred strains SM/J and LG/J using a QTL analysis for six related nest phenotypes (Presence and Structure pre- and postpartum, prepartum Material Used and postpartum Temperature). We found 15 direct-effect QTLs explaining from 4 to 13% of the phenotypic variation in nest building, mostly with non-additive effect. Epistatic analyses revealed 71 significant epistatic interactions which together explain from 28.4 to 75.5% of the variation, indicating an important role for epistasis in the adaptive process of nest building behavior in mice. Our results suggest a genetic architecture with small direct effects and a larger number of epistatic interactions as expected for fitness-related phenotypes. PMID:22654894
Epistasis can accelerate adaptive diversification in haploid asexual populations.
Griswold, Cortland K
2015-03-07
A fundamental goal of the biological sciences is to determine processes that facilitate the evolution of diversity. These processes can be separated into ecological, physiological, developmental and genetic. An ecological process that facilitates diversification is frequency-dependent selection caused by competition. Models of frequency-dependent adaptive diversification have generally assumed a genetic basis of phenotype that is non-epistatic. Here, we present a model that indicates diversification is accelerated by an epistatic basis of phenotype in combination with a competition model that invokes frequency-dependent selection. Our model makes use of a genealogical model of epistasis and insights into the effects of balancing selection on the genealogical structure of a population to understand how epistasis can facilitate diversification. The finding that epistasis facilitates diversification may be informative with respect to empirical results that indicate an epistatic basis of phenotype in experimental bacterial populations that experienced adaptive diversification. © 2015 The Author(s) Published by the Royal Society. All rights reserved.
Linkage mapping of beta 2 EEG waves via non-parametric regression.
Ghosh, Saurabh; Begleiter, Henri; Porjesz, Bernice; Chorlian, David B; Edenberg, Howard J; Foroud, Tatiana; Goate, Alison; Reich, Theodore
2003-04-01
Parametric linkage methods for analyzing quantitative trait loci are sensitive to violations in trait distributional assumptions. Non-parametric methods are relatively more robust. In this article, we modify the non-parametric regression procedure proposed by Ghosh and Majumder [2000: Am J Hum Genet 66:1046-1061] to map Beta 2 EEG waves using genome-wide data generated in the COGA project. Significant linkage findings are obtained on chromosomes 1, 4, 5, and 15 with findings at multiple regions on chromosomes 4 and 15. We analyze the data both with and without incorporating alcoholism as a covariate. We also test for epistatic interactions between regions of the genome exhibiting significant linkage with the EEG phenotypes and find evidence of epistatic interactions between a region each on chromosome 1 and chromosome 4 with one region on chromosome 15. While regressing out the effect of alcoholism does not affect the linkage findings, the epistatic interactions become statistically insignificant. Copyright 2003 Wiley-Liss, Inc.
Genomic approaches for the elucidation of genes and gene networks underlying cardiovascular traits.
Adriaens, M E; Bezzina, C R
2018-06-22
Genome-wide association studies have shed light on the association between natural genetic variation and cardiovascular traits. However, linking a cardiovascular trait associated locus to a candidate gene or set of candidate genes for prioritization for follow-up mechanistic studies is all but straightforward. Genomic technologies based on next-generation sequencing technology nowadays offer multiple opportunities to dissect gene regulatory networks underlying genetic cardiovascular trait associations, thereby aiding in the identification of candidate genes at unprecedented scale. RNA sequencing in particular becomes a powerful tool when combined with genotyping to identify loci that modulate transcript abundance, known as expression quantitative trait loci (eQTL), or loci modulating transcript splicing known as splicing quantitative trait loci (sQTL). Additionally, the allele-specific resolution of RNA-sequencing technology enables estimation of allelic imbalance, a state where the two alleles of a gene are expressed at a ratio differing from the expected 1:1 ratio. When multiple high-throughput approaches are combined with deep phenotyping in a single study, a comprehensive elucidation of the relationship between genotype and phenotype comes into view, an approach known as systems genetics. In this review, we cover key applications of systems genetics in the broad cardiovascular field.
Du, Qingzhang; Tian, Jiaxing; Yang, Xiaohui; Pan, Wei; Xu, Baohua; Li, Bailian; Ingvarsson, Pär K.; Zhang, Deqiang
2015-01-01
Economically important traits in many species generally show polygenic, quantitative inheritance. The components of genetic variation (additive, dominant and epistatic effects) of these traits conferred by multiple genes in shared biological pathways remain to be defined. Here, we investigated 11 full-length genes in cellulose biosynthesis, on 10 growth and wood-property traits, within a population of 460 unrelated Populus tomentosa individuals, via multi-gene association. To validate positive associations, we conducted single-marker analysis in a linkage population of 1,200 individuals. We identified 118, 121, and 43 associations (P< 0.01) corresponding to additive, dominant, and epistatic effects, respectively, with low to moderate proportions of phenotypic variance (R2). Epistatic interaction models uncovered a combination of three non-synonymous sites from three unique genes, representing a significant epistasis for diameter at breast height and stem volume. Single-marker analysis validated 61 associations (false discovery rate, Q ≤ 0.10), representing 38 SNPs from nine genes, and its average effect (R2 = 3.8%) nearly 2-fold higher than that identified with multi-gene association, suggesting that multi-gene association can capture smaller individual variants. Moreover, a structural gene–gene network based on tissue-specific transcript abundances provides a better understanding of the multi-gene pathway affecting tree growth and lignocellulose biosynthesis. Our study highlights the importance of pathway-based multiple gene associations to uncover the nature of genetic variance for quantitative traits and may drive novel progress in molecular breeding. PMID:25428896
Learning epistatic interactions from sequence-activity data to predict enantioselectivity
NASA Astrophysics Data System (ADS)
Zaugg, Julian; Gumulya, Yosephine; Malde, Alpeshkumar K.; Bodén, Mikael
2017-12-01
Enzymes with a high selectivity are desirable for improving economics of chemical synthesis of enantiopure compounds. To improve enzyme selectivity mutations are often introduced near the catalytic active site. In this compact environment epistatic interactions between residues, where contributions to selectivity are non-additive, play a significant role in determining the degree of selectivity. Using support vector machine regression models we map mutations to the experimentally characterised enantioselectivities for a set of 136 variants of the epoxide hydrolase from the fungus Aspergillus niger (AnEH). We investigate whether the influence a mutation has on enzyme selectivity can be accurately predicted through linear models, and whether prediction accuracy can be improved using higher-order counterparts. Comparing linear and polynomial degree = 2 models, mean Pearson coefficients (r) from 50 {× } 5 -fold cross-validation increase from 0.84 to 0.91 respectively. Equivalent models tested on interaction-minimised sequences achieve values of r=0.90 and r=0.93 . As expected, testing on a simulated control data set with no interactions results in no significant improvements from higher-order models. Additional experimentally derived AnEH mutants are tested with linear and polynomial degree = 2 models, with values increasing from r=0.51 to r=0.87 respectively. The study demonstrates that linear models perform well, however the representation of epistatic interactions in predictive models improves identification of selectivity-enhancing mutations. The improvement is attributed to higher-order kernel functions that represent epistatic interactions between residues.
Learning epistatic interactions from sequence-activity data to predict enantioselectivity
NASA Astrophysics Data System (ADS)
Zaugg, Julian; Gumulya, Yosephine; Malde, Alpeshkumar K.; Bodén, Mikael
2017-12-01
Enzymes with a high selectivity are desirable for improving economics of chemical synthesis of enantiopure compounds. To improve enzyme selectivity mutations are often introduced near the catalytic active site. In this compact environment epistatic interactions between residues, where contributions to selectivity are non-additive, play a significant role in determining the degree of selectivity. Using support vector machine regression models we map mutations to the experimentally characterised enantioselectivities for a set of 136 variants of the epoxide hydrolase from the fungus Aspergillus niger ( AnEH). We investigate whether the influence a mutation has on enzyme selectivity can be accurately predicted through linear models, and whether prediction accuracy can be improved using higher-order counterparts. Comparing linear and polynomial degree = 2 models, mean Pearson coefficients ( r) from 50 {× } 5-fold cross-validation increase from 0.84 to 0.91 respectively. Equivalent models tested on interaction-minimised sequences achieve values of r=0.90 and r=0.93. As expected, testing on a simulated control data set with no interactions results in no significant improvements from higher-order models. Additional experimentally derived AnEH mutants are tested with linear and polynomial degree = 2 models, with values increasing from r=0.51 to r=0.87 respectively. The study demonstrates that linear models perform well, however the representation of epistatic interactions in predictive models improves identification of selectivity-enhancing mutations. The improvement is attributed to higher-order kernel functions that represent epistatic interactions between residues.
Hybrid male sterility in rice is due to epistatic interactions with a pollen killer locus.
Kubo, Takahiko; Yoshimura, Atsushi; Kurata, Nori
2011-11-01
In intraspecific crosses between cultivated rice (Oryza sativa) subspecies indica and japonica, the hybrid male sterility gene S24 causes the selective abortion of male gametes carrying the japonica allele (S24-j) via an allelic interaction in the heterozygous hybrids. In this study, we first examined whether male sterility is due solely to the single locus S24. An analysis of near-isogenic lines (NIL-F(1)) showed different phenotypes for S24 in different genetic backgrounds. The S24 heterozygote with the japonica genetic background showed male semisterility, but no sterility was found in heterozygotes with the indica background. This result indicates that S24 is regulated epistatically. A QTL analysis of a BC(2)F(1) population revealed a novel sterility locus that interacts with S24 and is found on rice chromosome 2. The locus was named Epistatic Factor for S24 (EFS). Further genetic analyses revealed that S24 causes male sterility when in combination with the homozygous japonica EFS allele (efs-j). The results suggest that efs-j is a recessive sporophytic allele, while the indica allele (EFS-i) can dominantly counteract the pollen sterility caused by S24 heterozygosity. In summary, our results demonstrate that an additional epistatic locus is an essential element in the hybrid sterility caused by allelic interaction at a single locus in rice. This finding provides a significant contribution to our understanding of the complex molecular mechanisms underlying hybrid sterility and microsporogenesis.
Hybrid Male Sterility in Rice Is Due to Epistatic Interactions with a Pollen Killer Locus
Kubo, Takahiko; Yoshimura, Atsushi; Kurata, Nori
2011-01-01
In intraspecific crosses between cultivated rice (Oryza sativa) subspecies indica and japonica, the hybrid male sterility gene S24 causes the selective abortion of male gametes carrying the japonica allele (S24-j) via an allelic interaction in the heterozygous hybrids. In this study, we first examined whether male sterility is due solely to the single locus S24. An analysis of near-isogenic lines (NIL-F1) showed different phenotypes for S24 in different genetic backgrounds. The S24 heterozygote with the japonica genetic background showed male semisterility, but no sterility was found in heterozygotes with the indica background. This result indicates that S24 is regulated epistatically. A QTL analysis of a BC2F1 population revealed a novel sterility locus that interacts with S24 and is found on rice chromosome 2. The locus was named Epistatic Factor for S24 (EFS). Further genetic analyses revealed that S24 causes male sterility when in combination with the homozygous japonica EFS allele (efs-j). The results suggest that efs-j is a recessive sporophytic allele, while the indica allele (EFS-i) can dominantly counteract the pollen sterility caused by S24 heterozygosity. In summary, our results demonstrate that an additional epistatic locus is an essential element in the hybrid sterility caused by allelic interaction at a single locus in rice. This finding provides a significant contribution to our understanding of the complex molecular mechanisms underlying hybrid sterility and microsporogenesis. PMID:21868603
Learning epistatic interactions from sequence-activity data to predict enantioselectivity.
Zaugg, Julian; Gumulya, Yosephine; Malde, Alpeshkumar K; Bodén, Mikael
2017-12-01
Enzymes with a high selectivity are desirable for improving economics of chemical synthesis of enantiopure compounds. To improve enzyme selectivity mutations are often introduced near the catalytic active site. In this compact environment epistatic interactions between residues, where contributions to selectivity are non-additive, play a significant role in determining the degree of selectivity. Using support vector machine regression models we map mutations to the experimentally characterised enantioselectivities for a set of 136 variants of the epoxide hydrolase from the fungus Aspergillus niger (AnEH). We investigate whether the influence a mutation has on enzyme selectivity can be accurately predicted through linear models, and whether prediction accuracy can be improved using higher-order counterparts. Comparing linear and polynomial degree = 2 models, mean Pearson coefficients (r) from [Formula: see text]-fold cross-validation increase from 0.84 to 0.91 respectively. Equivalent models tested on interaction-minimised sequences achieve values of [Formula: see text] and [Formula: see text]. As expected, testing on a simulated control data set with no interactions results in no significant improvements from higher-order models. Additional experimentally derived AnEH mutants are tested with linear and polynomial degree = 2 models, with values increasing from [Formula: see text] to [Formula: see text] respectively. The study demonstrates that linear models perform well, however the representation of epistatic interactions in predictive models improves identification of selectivity-enhancing mutations. The improvement is attributed to higher-order kernel functions that represent epistatic interactions between residues.
Ma, Li; Ballantyne, Christie; Brautbar, Ariel; Keinan, Alon
2014-01-01
Epistasis has been suggested to underlie part of the missing heritability in genome-wide association studies. In this study, we first report an analysis of gene-gene interactions affecting HDL cholesterol (HDL-C) levels in a candidate gene study of 2,091 individuals with mixed dyslipidemia from a clinical trial. Two additional studies, the Atherosclerosis Risk in Communities study (ARIC; n = 9,713) and the Multi-Ethnic Study of Atherosclerosis (MESA; n = 2,685), were considered for replication. We identified a gene-gene interaction between rs1532085 and rs12980554 (P = 7.1×10−7) in their effect on HDL-C levels, which is significant after Bonferroni correction (P c = 0.017) for the number of SNP pairs tested. The interaction successfully replicated in the ARIC study (P = 7.0×10−4; P c = 0.02). Rs1532085, an expression QTL (eQTL) of LIPC, is one of the two SNPs involved in another, well-replicated gene-gene interaction underlying HDL-C levels. To further investigate the role of this eQTL SNP in gene-gene interactions affecting HDL-C, we tested in the ARIC study for interaction between this SNP and any other SNP genome-wide. We found the eQTL to be involved in a few suggestive interactions, one of which significantly replicated in MESA. Importantly, these gene-gene interactions, involving only rs1532085, explain an additional 1.4% variation of HDL-C, on top of the 0.65% explained by rs1532085 alone. LIPC plays a key role in the lipid metabolism pathway and it, and rs1532085 in particular, has been associated with HDL-C and other lipid levels. Collectively, we discovered several novel gene-gene interactions, all involving an eQTL of LIPC, thus suggesting a hub role of LIPC in the gene-gene interaction network that regulates HDL-C levels, which in turn raises the hypothesis that LIPC's contribution is largely via interactions with other lipid metabolism related genes. PMID:24651390
Singh, A; Knox, R E; DePauw, R M; Singh, A K; Cuthbert, R D; Campbell, H L; Shorter, S; Bhavani, S
2014-11-01
In wheat, advantageous gene-rich or pleiotropic regions for stripe, leaf, and stem rust and epistatic interactions between rust resistance loci should be accounted for in plant breeding strategies. Leaf rust (Puccinia triticina Eriks.) and stripe rust (Puccinia striiformis f. tritici Eriks) contribute to major production losses in many regions worldwide. The objectives of this research were to identify and study epistatic interactions of quantitative trait loci (QTL) for stripe and leaf rust resistance in a doubled haploid (DH) population derived from the cross of Canadian wheat cultivars, AC Cadillac and Carberry. The relationship of leaf and stripe rust resistance QTL that co-located with stem rust resistance QTL previously mapped in this population was also investigated. The Carberry/AC Cadillac population was genotyped with DArT(®) and simple sequence repeat markers. The parents and population were phenotyped for stripe rust severity and infection response in field rust nurseries in Kenya (Njoro), Canada (Swift Current), and New Zealand (Lincoln); and for leaf rust severity and infection response in field nurseries in Canada (Swift Current) and New Zealand (Lincoln). AC Cadillac was a source of stripe rust resistance QTL on chromosomes 2A, 2B, 3A, 3B, 5B, and 7B; and Carberry was a source of resistance on chromosomes 2B, 4B, and 7A. AC Cadillac contributed QTL for resistance to leaf rust on chromosome 2A and Carberry contributed QTL on chromosomes 2B and 4B. Stripe rust resistance QTL co-localized with previously reported stem rust resistance QTL on 2B, 3B, and 7B, while leaf rust resistance QTL co-localized with 4B stem rust resistance QTL. Several epistatic interactions were identified both for stripe and leaf rust resistance QTL. We have identified useful combinations of genetic loci with main and epistatic effects. Multiple disease resistance regions identified on chromosomes 2A, 2B, 3B, 4B, 5B, and 7B are prime candidates for further investigation and validation of their broad resistance.
Bipartite Community Structure of eQTLs.
Platig, John; Castaldi, Peter J; DeMeo, Dawn; Quackenbush, John
2016-09-01
Genome Wide Association Studies (GWAS) and expression quantitative trait locus (eQTL) analyses have identified genetic associations with a wide range of human phenotypes. However, many of these variants have weak effects and understanding their combined effect remains a challenge. One hypothesis is that multiple SNPs interact in complex networks to influence functional processes that ultimately lead to complex phenotypes, including disease states. Here we present CONDOR, a method that represents both cis- and trans-acting SNPs and the genes with which they are associated as a bipartite graph and then uses the modular structure of that graph to place SNPs into a functional context. In applying CONDOR to eQTLs in chronic obstructive pulmonary disease (COPD), we found the global network "hub" SNPs were devoid of disease associations through GWAS. However, the network was organized into 52 communities of SNPs and genes, many of which were enriched for genes in specific functional classes. We identified local hubs within each community ("core SNPs") and these were enriched for GWAS SNPs for COPD and many other diseases. These results speak to our intuition: rather than single SNPs influencing single genes, we see groups of SNPs associated with the expression of families of functionally related genes and that disease SNPs are associated with the perturbation of those functions. These methods are not limited in their application to COPD and can be used in the analysis of a wide variety of disease processes and other phenotypic traits.
Accounting for epistatic interactions improves the functional analysis of protein structures.
Wilkins, Angela D; Venner, Eric; Marciano, David C; Erdin, Serkan; Atri, Benu; Lua, Rhonald C; Lichtarge, Olivier
2013-11-01
The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. lichtarge@bcm.edu. Supplementary data are available at Bioinformatics online.
Csilléry, Katalin; Lalagüe, Hadrien; Vendramin, Giovanni G; González-Martínez, Santiago C; Fady, Bruno; Oddou-Muratorio, Sylvie
2014-10-01
Detecting signatures of selection in tree populations threatened by climate change is currently a major research priority. Here, we investigated the signature of local adaptation over a short spatial scale using 96 European beech (Fagus sylvatica L.) individuals originating from two pairs of populations on the northern and southern slopes of Mont Ventoux (south-eastern France). We performed both single and multilocus analysis of selection based on 53 climate-related candidate genes containing 546 SNPs. FST outlier methods at the SNP level revealed a weak signal of selection, with three marginally significant outliers in the northern populations. At the gene level, considering haplotypes as alleles, two additional marginally significant outliers were detected, one on each slope. To account for the uncertainty of haplotype inference, we averaged the Bayes factors over many possible phase reconstructions. Epistatic selection offers a realistic multilocus model of selection in natural populations. Here, we used a test suggested by Ohta based on the decomposition of the variance of linkage disequilibrium. Overall populations, 0.23% of the SNP pairs (haplotypes) showed evidence of epistatic selection, with nearly 80% of them being within genes. One of the between gene epistatic selection signals arose between an FST outlier and a nonsynonymous mutation in a drought response gene. Additionally, we identified haplotypes containing selectively advantageous allele combinations which were unique to high or low elevations and northern or southern populations. Several haplotypes contained nonsynonymous mutations situated in genes with known functional importance for adaptation to climatic factors. © 2014 John Wiley & Sons Ltd.
Accounting for epistatic interactions improves the functional analysis of protein structures
Wilkins, Angela D.; Venner, Eric; Marciano, David C.; Erdin, Serkan; Atri, Benu; Lua, Rhonald C.; Lichtarge, Olivier
2013-01-01
Motivation: The constraints under which sequence, structure and function coevolve are not fully understood. Bringing this mutual relationship to light can reveal the molecular basis of binding, catalysis and allostery, thereby identifying function and rationally guiding protein redesign. Underlying these relationships are the epistatic interactions that occur when the consequences of a mutation to a protein are determined by the genetic background in which it occurs. Based on prior data, we hypothesize that epistatic forces operate most strongly between residues nearby in the structure, resulting in smooth evolutionary importance across the structure. Methods and Results: We find that when residue scores of evolutionary importance are distributed smoothly between nearby residues, functional site prediction accuracy improves. Accordingly, we designed a novel measure of evolutionary importance that focuses on the interaction between pairs of structurally neighboring residues. This measure that we term pair-interaction Evolutionary Trace yields greater functional site overlap and better structure-based proteome-wide functional predictions. Conclusions: Our data show that the structural smoothness of evolutionary importance is a fundamental feature of the coevolution of sequence, structure and function. Mutations operate on individual residues, but selective pressure depends in part on the extent to which a mutation perturbs interactions with neighboring residues. In practice, this principle led us to redefine the importance of a residue in terms of the importance of its epistatic interactions with neighbors, yielding better annotation of functional residues, motivating experimental validation of a novel functional site in LexA and refining protein function prediction. Contact: lichtarge@bcm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24021383
Evolution of genetic architecture under directional selection.
Hansen, Thomas F; Alvarez-Castro, José M; Carter, Ashley J R; Hermisson, Joachim; Wagner, Günter P
2006-08-01
We investigate the multilinear epistatic model under mutation-limited directional selection. We confirm previous results that only directional epistasis, in which genes on average reinforce or diminish each other's effects, contribute to the initial evolution of mutational effects. Thus, either canalization or decanalization can occur under directional selection, depending on whether positive or negative epistasis is prevalent. We then focus on the evolution of the epistatic coefficients themselves. In the absence of higher-order epistasis, positive pairwise epistasis will tend to weaken relative to additive effects, while negative pairwise epistasis will tend to become strengthened. Positive third-order epistasis will counteract these effects, while negative third-order epistasis will reinforce them. More generally, gene interactions of all orders have an inherent tendency for negative changes under directional selection, which can only be modified by higher-order directional epistasis. We identify three types of nonadditive quasi-equilibrium architectures that, although not strictly stable, can be maintained for an extended time: (1) nondirectional epistatic architectures; (2) canalized architectures with strong epistasis; and (3) near-additive architectures in which additive effects keep increasing relative to epistasis.
Genotype to Phenotype Mapping of the E. coli lac Promoter
NASA Astrophysics Data System (ADS)
Otwinowski, Jakub; Nemenman, Ilya
2014-03-01
Genotype-to-phenotype maps and the related fitness landscapes that include epistatic interactions are difficult to measure because of their high dimensional structure. Here we construct such a map using the recently collected corpora of high-throughput sequence data from the 75 base pairs long mutagenized E. coli lac promoter region, where each sequence is associated with induced transcriptional activity measured by a fluorescent reporter. We find that the additive (non-epistatic) contributions of individual mutations account for about two-thirds of the explainable phenotype variance, while pairwise epistasis explains about 7% of the variance for the full mutagenized sequence and about 15% for the subsequence associated with protein binding sites. Surprisingly, there is no evidence for third order epistatic contributions, and our inferred fitness landscape is essentially single peaked, with a small amount of antagonistic epistasis. We identify transcription factor (CRP) and RNA polymerase binding sites in the promotor region and their interactions. We conclude with a cautionary note that inferred properties of fitness landscapes may be severely influenced by biases in the sequence data. Funded in part by HFSP and James S. McDonnell Foundation.
Epistatic determinism of durum wheat resistance to the wheat spindle streak mosaic virus.
Holtz, Yan; Bonnefoy, Michel; Viader, Véronique; Ardisson, Morgane; Rode, Nicolas O; Poux, Gérard; Roumet, Pierre; Marie-Jeanne, Véronique; Ranwez, Vincent; Santoni, Sylvain; Gouache, David; David, Jacques L
2017-07-01
The resistance of durum wheat to the Wheat spindle streak mosaic virus (WSSMV) is controlled by two main QTLs on chromosomes 7A and 7B, with a huge epistatic effect. Wheat spindle streak mosaic virus (WSSMV) is a major disease of durum wheat in Europe and North America. Breeding WSSMV-resistant cultivars is currently the only way to control the virus since no treatment is available. This paper reports studies of the inheritance of WSSMV resistance using two related durum wheat populations obtained by crossing two elite cultivars with a WSSMV-resistant emmer cultivar. In 2012 and 2015, 354 recombinant inbred lines (RIL) were phenotyped using visual notations, ELISA and qPCR and genotyped using locus targeted capture and sequencing. This allowed us to build a consensus genetic map of 8568 markers and identify three chromosomal regions involved in WSSMV resistance. Two major regions (located on chromosomes 7A and 7B) jointly explain, on the basis of epistatic interactions, up to 43% of the phenotypic variation. Flanking sequences of our genetic markers are provided to facilitate future marker-assisted selection of WSSMV-resistant cultivars.
Douvris, Adrianna; Soubeyrand, Sébastien; Naing, Thet; Martinuk, Amy; Nikpay, Majid; Williams, Andrew; Buick, Julie; Yauk, Carole; McPherson, Ruth
2014-06-03
The TRIB1 locus has been linked to hepatic triglyceride metabolism in mice and to plasma triglycerides and coronary artery disease in humans. The lipid-associated single nucleotide polymorphisms (SNPs), identified by genome-wide association studies, are located ≈30 kb downstream from TRIB1, suggesting complex regulatory effects on genes or pathways relevant to hepatic triglyceride metabolism. The goal of this study was to investigate the functional relationship between common SNPs at the TRIB1 locus and plasma lipid traits. Characterization of the risk locus reveals that it encompasses a gene, TRIB1-associated locus (TRIBAL), composed of a well-conserved promoter region and an alternatively spliced transcript. Bioinformatic analysis and resequencing identified a single SNP, rs2001844, within the promoter region that associates with increased plasma triglycerides and reduced high-density lipoprotein cholesterol and coronary artery disease risk. Further, correction for triglycerides as a covariate indicated that the genome-wide association studies association is largely dependent on triglycerides. In addition, we show that rs2001844 is an expression trait locus (eQTL) for TRIB1 expression in blood and alters TRIBAL promoter activity in a reporter assay model. The TRIBAL transcript has features typical of long noncoding RNAs, including poor sequence conservation. Modulation of TRIBAL expression had limited impact on either TRIB1 or lipid regulatory genes mRNA levels in human hepatocyte models. In contrast, TRIB1 knockdown markedly increased TRIBAL expression in HepG2 cells and primary human hepatocytes. These studies demonstrate an interplay between a novel locus, TRIBAL, and TRIB1. TRIBAL is located in the genome-wide association studies identified risk locus, responds to altered expression of TRIB1, harbors a risk SNP that is an eQTL for TRIB1 expression, and associates with plasma triglyceride concentrations. © 2014 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley Blackwell.
Douvris, Adrianna; Soubeyrand, Sébastien; Naing, Thet; Martinuk, Amy; Nikpay, Majid; Williams, Andrew; Buick, Julie; Yauk, Carole; McPherson, Ruth
2014-01-01
Background The TRIB1 locus has been linked to hepatic triglyceride metabolism in mice and to plasma triglycerides and coronary artery disease in humans. The lipid‐associated single nucleotide polymorphisms (SNPs), identified by genome‐wide association studies, are located ≈30 kb downstream from TRIB1, suggesting complex regulatory effects on genes or pathways relevant to hepatic triglyceride metabolism. The goal of this study was to investigate the functional relationship between common SNPs at the TRIB1 locus and plasma lipid traits. Methods and Results Characterization of the risk locus reveals that it encompasses a gene, TRIB1‐associated locus (TRIBAL), composed of a well‐conserved promoter region and an alternatively spliced transcript. Bioinformatic analysis and resequencing identified a single SNP, rs2001844, within the promoter region that associates with increased plasma triglycerides and reduced high‐density lipoprotein cholesterol and coronary artery disease risk. Further, correction for triglycerides as a covariate indicated that the genome‐wide association studies association is largely dependent on triglycerides. In addition, we show that rs2001844 is an expression trait locus (eQTL) for TRIB1 expression in blood and alters TRIBAL promoter activity in a reporter assay model. The TRIBAL transcript has features typical of long noncoding RNAs, including poor sequence conservation. Modulation of TRIBAL expression had limited impact on either TRIB1 or lipid regulatory genes mRNA levels in human hepatocyte models. In contrast, TRIB1 knockdown markedly increased TRIBAL expression in HepG2 cells and primary human hepatocytes. Conclusions These studies demonstrate an interplay between a novel locus, TRIBAL, and TRIB1. TRIBAL is located in the genome‐wide association studies identified risk locus, responds to altered expression of TRIB1, harbors a risk SNP that is an eQTL for TRIB1 expression, and associates with plasma triglyceride concentrations. PMID:24895164
Conlon, Benjamin H; Frey, Eva; Rosenkranz, Peter; Locke, Barbara; Moritz, Robin F A; Routtu, Jarkko
2018-06-01
The Red Queen hypothesis predicts that host-parasite coevolutionary dynamics can select for host resistance through increased genetic diversity, recombination and evolutionary rates. However, in haplodiploid organisms such as the honeybee (Apis mellifera), models suggest the selective pressure is weaker than in diploids. Haplodiploid sex determination, found in A. mellifera, can allow deleterious recessive alleles to persist in the population through the diploid sex with negative effects predominantly expressed in the haploid sex. To overcome these negative effects in haploid genomes, epistatic interactions have been hypothesized to play an important role. Here, we use the interaction between A. mellifera and the parasitic mite Varroa destructor to test epistasis in the expression of resistance, through the inhibition of parasite reproduction, in haploid drones. We find novel loci on three chromosomes which explain over 45% of the resistance phenotype. Two of these loci interact only additively, suggesting their expression is independent of each other, but both loci interact epistatically with the third locus. With drone offspring inheriting only one copy of the queen's chromosomes, the drones will only possess one of two queen alleles throughout the years-long lifetime of the honeybee colony. Varroa, in comparison, completes its highly inbred reproductive cycle in a matter of weeks, allowing it to rapidly evolve resistance. Faced with the rapidly evolving Varroa, a diversity of pathways and epistatic interactions for the inhibition of Varroa reproduction could therefore provide a selective advantage to the high levels of recombination seen in A. mellifera. This allows for the remixing of phenotypes despite a fixed queen genotype. © 2018 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2018 European Society For Evolutionary Biology.
Epistatic Effects Contribute to Variation in BMD in Fischer 344 × Lewis F2 Rats
Koller, Daniel L; Liu, Lixiang; Alam, Imranul; Sun, Qiwei; Econs, Michael J; Foroud, Tatiana; Turner, Charles H
2008-01-01
To further delineate the factors underlying the complex genetic architecture of BMD in the rat model, a genome screen for epistatic interactions was conducted. Several significant interactions were identified, involving both previously identified and novel QTLs. Introduction The variation in several of the risk factors for osteoporotic fracture, including BMD, has been shown to be caused largely by genetic differences. However, the genetic architecture of BMD is complex in both humans and in model organisms. We have previously reported quantitative trait locus (QTL) results for BMD from a genome screen of 595 female F2 progeny of Fischer 344 and Lewis rats. These progeny also provide an excellent opportunity to search for epistatic effects, or interaction between genetic loci, that contribute to fracture risk. Materials and Methods Microsatellite marker data from a 20-cM genome screen was analyzed along with weight-adjusted BMD (DXA and pQCT) phenotypic data using the R/qtl software package. Genotype and phenotype data were permuted to determine a genome-wide significance threshold for the epistasis or interaction LOD score corresponding to an α level of 0.01. Results and Conclusions Novel loci on chromosomes 12 and 15 showed a strong epistatic effect on total BMD at the femoral midshaft by pQCT (LOD = 5.4). A previously reported QTL on chromosome 7 was found to interact with a novel locus on chromosome 20 to affect whole lumbar BMD by pQCT (LOD = 6.2). These results provide new information regarding the mode of action of previously identified rat QTLs, as well as identifying novel loci that act in combination with known QTLs or with other novel loci to contribute to the risk factors for osteoporotic fracture. PMID:17907919
Epistatic effects contribute to variation in BMD in Fischer 344 x Lewis F2 rats.
Koller, Daniel L; Liu, Lixiang; Alam, Imranul; Sun, Qiwei; Econs, Michael J; Foroud, Tatiana; Turner, Charles H
2008-01-01
To further delineate the factors underlying the complex genetic architecture of BMD in the rat model, a genome screen for epistatic interactions was conducted. Several significant interactions were identified, involving both previously identified and novel QTLs. The variation in several of the risk factors for osteoporotic fracture, including BMD, has been shown to be caused largely by genetic differences. However, the genetic architecture of BMD is complex in both humans and in model organisms. We have previously reported quantitative trait locus (QTL) results for BMD from a genome screen of 595 female F(2) progeny of Fischer 344 and Lewis rats. These progeny also provide an excellent opportunity to search for epistatic effects, or interaction between genetic loci, that contribute to fracture risk. Microsatellite marker data from a 20-cM genome screen was analyzed along with weight-adjusted BMD (DXA and pQCT) phenotypic data using the R/qtl software package. Genotype and phenotype data were permuted to determine a genome-wide significance threshold for the epistasis or interaction LOD score corresponding to an alpha level of 0.01. Novel loci on chromosomes 12 and 15 showed a strong epistatic effect on total BMD at the femoral midshaft by pQCT (LOD = 5.4). A previously reported QTL on chromosome 7 was found to interact with a novel locus on chromosome 20 to affect whole lumbar BMD by pQCT (LOD = 6.2). These results provide new information regarding the mode of action of previously identified rat QTLs, as well as identifying novel loci that act in combination with known QTLs or with other novel loci to contribute to the risk factors for osteoporotic fracture.
Gong, Rujun; Liu, Zhihong; Li, Leishi
2007-05-01
Glomerular microthrombosis (GMT) is not uncommon in lupus nephritis and has been associated with active renal injury and progressive kidney destruction. We undertook this study to determine whether genetic variations of hemostasis factors, such as plasminogen activator inhibitor 1 (PAI-1) and fibrinogen, affect the risk of GMT. A cross-sectional cohort of 101 lupus nephritis patients with or without GMT was genotyped for PAI-1 -675 4G/5G and beta-fibrinogen (FGB) -455 G/A gene polymorphisms and analyzed. PAI-1 4G/4G homozygotes and FGB A allele carriers were both at increased risk for GMT. When the data were stratified for both gene polymorphisms, an epistatic effect was detected. The PAI-1 4G/4G genotype was found to predispose to GMT not equally in all lupus nephritis patients, but only in FGB A allele carriers. Likewise, the association between the FGB A allele and GMT was restricted to lupus nephritis patients homozygous for the PAI-1 4G allele. This epistatic effect was revalidated by the multifactor dimensionality reduction (MDR) analysis and further assessed by incorporating a variety of environmental and clinical factors into the MDR analysis. The most parsimonious model that had a cross-validation consistency of 100% included joint effects of PAI-1 and FGB gene polymorphisms and anticardiolipin antibody (aCL) status and yielded the best prediction of GMT, with 66.6% accuracy. Our findings suggest that risk of GMT in lupus nephritis is attributable, at least in part, to an epistatic effect of PAI-1 and FGB genes, likely via an interaction with environmental/clinical factors, such as aCL.
Schott, B H; Assmann, A; Schmierer, P; Soch, J; Erk, S; Garbusow, M; Mohnke, S; Pöhland, L; Romanczuk-Seiferth, N; Barman, A; Wüstenberg, T; Haddad, L; Grimm, O; Witt, S; Richter, S; Klein, M; Schütze, H; Mühleisen, T W; Cichon, S; Rietschel, M; Noethen, M M; Tost, H; Gundelfinger, E D; Düzel, E; Heinz, A; Meyer-Lindenberg, A; Seidenbecher, C I; Walter, H
2014-01-01
Recent genome-wide association studies have pointed to single-nucleotide polymorphisms (SNPs) in genes encoding the neuronal calcium channel CaV1.2 (CACNA1C; rs1006737) and the presynaptic active zone protein Piccolo (PCLO; rs2522833) as risk factors for affective disorders, particularly major depression. Previous neuroimaging studies of depression-related endophenotypes have highlighted the role of the subgenual cingulate cortex (CG25) in negative mood and depressive psychopathology. Here, we aimed to assess how recently associated PCLO and CACNA1C depression risk alleles jointly affect memory-related CG25 activity as an intermediate phenotype in clinically healthy humans. To investigate the combined effects of rs1006737 and rs2522833 on the CG25 response, we conducted three functional magnetic resonance imaging studies of episodic memory formation in three independent cohorts (N=79, 300, 113). An epistatic interaction of PCLO and CACNA1C risk alleles in CG25 during memory encoding was observed in all groups, with carriers of no risk allele and of both risk alleles showing higher CG25 activation during encoding when compared with carriers of only one risk allele. Moreover, PCLO risk allele carriers showed lower memory performance and reduced encoding-related hippocampal activation. In summary, our results point to region-specific epistatic effects of PCLO and CACNA1C risk variants in CG25, potentially related to episodic memory. Our data further suggest that genetic risk factors on the SNP level do not necessarily have additive effects but may show complex interactions. Such epistatic interactions might contribute to the ‘missing heritability' of complex phenotypes. PMID:24643163
GSCALite: A Web Server for Gene Set Cancer Analysis.
Liu, Chun-Jie; Hu, Fei-Fei; Xia, Mengxuan; Han, Leng; Zhang, Qiong; Guo, An-Yuan
2018-05-22
The availability of cancer genomic data makes it possible to analyze genes related to cancer. Cancer is usually the result of a set of genes and the signal of a single gene could be covered by background noise. Here, we present a web server named Gene Set Cancer Analysis (GSCALite) to analyze a set of genes in cancers with the following functional modules. (i) Differential expression in tumor vs normal, and the survival analysis; (ii) Genomic variations and their survival analysis; (iii) Gene expression associated cancer pathway activity; (iv) miRNA regulatory network for genes; (v) Drug sensitivity for genes; (vi) Normal tissue expression and eQTL for genes. GSCALite is a user-friendly web server for dynamic analysis and visualization of gene set in cancer and drug sensitivity correlation, which will be of broad utilities to cancer researchers. GSCALite is available on http://bioinfo.life.hust.edu.cn/web/GSCALite/. guoay@hust.edu.cn or zhangqiong@hust.edu.cn. Supplementary data are available at Bioinformatics online.
Sherlock: Detecting Gene-Disease Associations by Matching Patterns of Expression QTL and GWAS
He, Xin; Fuller, Chris K.; Song, Yi; Meng, Qingying; Zhang, Bin; Yang, Xia; Li, Hao
2013-01-01
Genetic mapping of complex diseases to date depends on variations inside or close to the genes that perturb their activities. A strong body of evidence suggests that changes in gene expression play a key role in complex diseases and that numerous loci perturb gene expression in trans. The information in trans variants, however, has largely been ignored in the current analysis paradigm. Here we present a statistical framework for genetic mapping by utilizing collective information in both cis and trans variants. We reason that for a disease-associated gene, any genetic variation that perturbs its expression is also likely to influence the disease risk. Thus, the expression quantitative trait loci (eQTL) of the gene, which constitute a unique “genetic signature,” should overlap significantly with the set of loci associated with the disease. We translate this idea into a computational algorithm (named Sherlock) to search for gene-disease associations from GWASs, taking advantage of independent eQTL data. Application of this strategy to Crohn disease and type 2 diabetes predicts a number of genes with possible disease roles, including several predictions supported by solid experimental evidence. Importantly, predicted genes are often implicated by multiple trans eQTL with moderate associations. These genes are far from any GWAS association signals and thus cannot be identified from the GWAS alone. Our approach allows analysis of association data from a new perspective and is applicable to any complex phenotype. It is readily generalizable to molecular traits other than gene expression, such as metabolites, noncoding RNAs, and epigenetic modifications. PMID:23643380
Iancu, Ovidiu D; Darakjian, Priscila; Kawane, Sunita; Bottomly, Daniel; Hitzemann, Robert; McWeeney, Shannon
2012-01-01
Complex Mus musculus crosses, e.g., heterogeneous stock (HS), provide increased resolution for quantitative trait loci detection. However, increased genetic complexity challenges detection methods, with discordant results due to low data quality or complex genetic architecture. We quantified the impact of theses factors across three mouse crosses and two different detection methods, identifying procedures that greatly improve detection quality. Importantly, HS populations have complex genetic architectures not fully captured by the whole genome kinship matrix, calling for incorporating chromosome specific relatedness information. We analyze three increasingly complex crosses, using gene expression levels as quantitative traits. The three crosses were an F(2) intercross, a HS formed by crossing four inbred strains (HS4), and a HS (HS-CC) derived from the eight lines found in the collaborative cross. Brain (striatum) gene expression and genotype data were obtained using the Illumina platform. We found large disparities between methods, with concordance varying as genetic complexity increased; this problem was more acute for probes with distant regulatory elements (trans). A suite of data filtering steps resulted in substantial increases in reproducibility. Genetic relatedness between samples generated overabundance of detected eQTLs; an adjustment procedure that includes the kinship matrix attenuates this problem. However, we find that relatedness between individuals is not evenly distributed across the genome; information from distinct chromosomes results in relatedness structure different from the whole genome kinship matrix. Shared polymorphisms from distinct chromosomes collectively affect expression levels, confounding eQTL detection. We suggest that considering chromosome specific relatedness can result in improved eQTL detection.
The structure of a gene co-expression network reveals biological functions underlying eQTLs.
Villa-Vialaneix, Nathalie; Liaubet, Laurence; Laurent, Thibault; Cherel, Pierre; Gamot, Adrien; SanCristobal, Magali
2013-01-01
What are the commonalities between genes, whose expression level is partially controlled by eQTL, especially with regard to biological functions? Moreover, how are these genes related to a phenotype of interest? These issues are particularly difficult to address when the genome annotation is incomplete, as is the case for mammalian species. Moreover, the direct link between gene expression and a phenotype of interest may be weak, and thus difficult to handle. In this framework, the use of a co-expression network has proven useful: it is a robust approach for modeling a complex system of genetic regulations, and to infer knowledge for yet unknown genes. In this article, a case study was conducted with a mammalian species. It showed that the use of a co-expression network based on partial correlation, combined with a relevant clustering of nodes, leads to an enrichment of biological functions of around 83%. Moreover, the use of a spatial statistics approach allowed us to superimpose additional information related to a phenotype; this lead to highlighting specific genes or gene clusters that are related to the network structure and the phenotype. Three main results are worth noting: first, key genes were highlighted as a potential focus for forthcoming biological experiments; second, a set of biological functions, which support a list of genes under partial eQTL control, was set up by an overview of the global structure of the gene expression network; third, pH was found correlated with gene clusters, and then with related biological functions, as a result of a spatial analysis of the network topology.
Poyatos-Pertíñez, Sandra; Quinet, Muriel; Ortíz-Atienza, Ana; Bretones, Sandra; Yuste-Lisbona, Fernando J; Lozano, Rafael
2016-09-01
Genetic interactions of UFD gene support its specific function during reproductive development of tomato; in this process, UFD could play a pivotal role between inflorescence architecture and flower initiation genes. Tomato (Solanum lycopersicum L.) is a major vegetable crop that also constitutes a model species for the study of plant developmental processes. To gain insight into the control of flowering and floral development, a novel tomato mutant, unfinished flower development (ufd), whose inflorescence and flowers were unable to complete their normal development was characterized using double mutant and gene expression analyses. Genetic interactions of ufd with mutations affecting inflorescence fate (uniflora, jointless and single flower truss) were additive and resulted in double mutants displaying the inflorescence structure of the non-ufd parental mutant and the flower phenotype of the ufd mutant. In addition, ufd mutation promotes an earlier inflorescence meristem termination. Taken together, both results indicated that UFD is not involved in the maintenance of inflorescence meristem identity, although it could participate in the regulatory system that modulates the rate of meristem maturation. Regarding the floral meristem identity, the falsiflora mutation was epistatic to the ufd mutation even though FALSIFLORA was upregulated in ufd inflorescences. In terms of floral organ identity, the ufd mutation was epistatic to macrocalyx, and MACROCALYX expression was differently regulated depending on the inflorescence developmental stage. These results suggest that the UFD gene may play a pivotal role between the genes required for flowering initiation and inflorescence development (such as UNIFLORA, FALSIFLORA, JOINTLESS and SINGLE FLOWER TRUSS) and those required for further floral organ development such as the floral organ identity genes.
Cheema, Jitender; Faraldos, Juan A; O'Maille, Paul E
2017-02-01
Epistasis, the interaction between mutations and the genetic background, is a pervasive force in evolution that is difficult to predict yet derives from a simple principle - biological systems are interconnected. Therefore, one effect may be intimately linked to another, hence interdependent. Untangling epistatic interactions between and within genes is a vibrant area of research. Deriving a mechanistic understanding of epistasis is a major challenge. Particularly, elucidating how epistasis can attenuate the effects of otherwise dominant mutations that control phenotypes. Using the emergence of terpene cyclization in specialized metabolism as an excellent example, this review describes the process of discovery and interpretation of dominance and epistasis in relation to current efforts. Specifically, we outline experimental approaches to isolating epistatic networks of mutations in protein structure, formally quantifying epistatic interactions, then building biochemical models with chemical mechanisms in efforts to achieve an understanding of the physical basis for epistasis. From these models we describe informed conjectures about past evolutionary events that underlie the emergence, divergence and specialization of terpene synthases to illustrate key principles of the constraining forces of epistasis in enzyme function. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Mapping of epistatic quantitative trait loci in four-way crosses.
He, Xiao-Hong; Qin, Hongde; Hu, Zhongli; Zhang, Tianzhen; Zhang, Yuan-Ming
2011-01-01
Four-way crosses (4WC) involving four different inbred lines often appear in plant and animal commercial breeding programs. Direct mapping of quantitative trait loci (QTL) in these commercial populations is both economical and practical. However, the existing statistical methods for mapping QTL in a 4WC population are built on the single-QTL genetic model. This simple genetic model fails to take into account QTL interactions, which play an important role in the genetic architecture of complex traits. In this paper, therefore, we attempted to develop a statistical method to detect epistatic QTL in 4WC population. Conditional probabilities of QTL genotypes, computed by the multi-point single locus method, were used to sample the genotypes of all putative QTL in the entire genome. The sampled genotypes were used to construct the design matrix for QTL effects. All QTL effects, including main and epistatic effects, were simultaneously estimated by the penalized maximum likelihood method. The proposed method was confirmed by a series of Monte Carlo simulation studies and real data analysis of cotton. The new method will provide novel tools for the genetic dissection of complex traits, construction of QTL networks, and analysis of heterosis.
Roso, V M; Schenkel, F S; Miller, S P; Schaeffer, L R
2005-08-01
Breed additive, dominance, and epistatic loss effects are of concern in the genetic evaluation of a multibreed population. Multiple regression equations used for fitting these effects may show a high degree of multicollinearity among predictor variables. Typically, when strong linear relationships exist, the regression coefficients have large SE and are sensitive to changes in the data file and to the addition or deletion of variables in the model. Generalized ridge regression methods were applied to obtain stable estimates of direct and maternal breed additive, dominance, and epistatic loss effects in the presence of multicollinearity among predictor variables. Preweaning weight gains of beef calves in Ontario, Canada, from 1986 to 1999 were analyzed. The genetic model included fixed direct and maternal breed additive, dominance, and epistatic loss effects, fixed environmental effects of age of the calf, contemporary group, and age of the dam x sex of the calf, random additive direct and maternal genetic effects, and random maternal permanent environment effect. The degree and the nature of the multicollinearity were identified and ridge regression methods were used as an alternative to ordinary least squares (LS). Ridge parameters were obtained using two different objective methods: 1) generalized ridge estimator of Hoerl and Kennard (R1); and 2) bootstrap in combination with cross-validation (R2). Both ridge regression methods outperformed the LS estimator with respect to mean squared error of predictions (MSEP) and variance inflation factors (VIF) computed over 100 bootstrap samples. The MSEP of R1 and R2 were similar, and they were 3% less than the MSEP of LS. The average VIF of LS, R1, and R2 were equal to 26.81, 6.10, and 4.18, respectively. Ridge regression methods were particularly effective in decreasing the multicollinearity involving predictor variables of breed additive effects. Because of a high degree of confounding between estimates of maternal dominance and direct epistatic loss effects, it was not possible to compare the relative importance of these effects with a high level of confidence. The inclusion of epistatic loss effects in the additive-dominance model did not cause noticeable reranking of sires, dams, and calves based on across-breed EBV. More precise estimates of breed effects as a result of this study may result in more stable across-breed estimated breeding values over the years.
Abad-Grau, María del Mar; Fedetz, María; Izquierdo, Guillermo; Lucas, Miguel; Fernández, Óscar; Ndagire, Dorothy; Catalá-Rabasa, Antonio; Ruiz, Agustín; Gayán, Javier; Delgado, Concepción; Arnal, Carmen
2012-01-01
The human leukocyte antigen (HLA) DRB1*1501 has been consistently associated with multiple sclerosis (MS) in nearly all populations tested. This points to a specific antigen presentation as the pathogenic mechanism though this does not fully explain the disease association. The identification of expression quantitative trait loci (eQTL) for genes in the HLA locus poses the question of the role of gene expression in MS susceptibility. We analyzed the eQTLs in the HLA region with respect to MS-associated HLA-variants obtained from genome-wide association studies (GWAS). We found that the Tag of DRB1*1501, rs3135388 A allele, correlated with high expression of DRB1, DRB5 and DQB1 genes in a Caucasian population. In quantitative terms, the MS-risk AA genotype carriers of rs3135388 were associated with 15.7-, 5.2- and 8.3-fold higher expression of DQB1, DRB5 and DRB1, respectively, than the non-risk GG carriers. The haplotype analysis of expression-associated variants in a Spanish MS cohort revealed that high expression of DRB1 and DQB1 alone did not contribute to the disease. However, in Caucasian, Asian and African American populations, the DRB1*1501 allele was always highly expressed. In other immune related diseases such as type 1 diabetes, inflammatory bowel disease, ulcerative colitis, asthma and IgA deficiency, the best GWAS-associated HLA SNPs were also eQTLs for different HLA Class II genes. Our data suggest that the DR/DQ expression levels, together with specific structural properties of alleles, seem to be the causal effect in MS and in other immunopathologies rather than specific antigen presentation alone. PMID:22253788
Markunas, Christina A; Johnson, Eric O; Hancock, Dana B
2017-07-01
Genome-wide association study (GWAS)-identified variants are enriched for functional elements. However, we have limited knowledge of how functional enrichment may differ by disease/trait and tissue type. We tested a broad set of eight functional elements for enrichment among GWAS-identified SNPs (p < 5×10 -8 ) from the NHGRI-EBI Catalog across seven disease/trait categories: cancer, cardiovascular disease, diabetes, autoimmune disease, psychiatric disease, neurological disease, and anthropometric traits. SNPs were annotated using HaploReg for the eight functional elements across any tissue: DNase sites, expression quantitative trait loci (eQTL), sequence conservation, enhancers, promoters, missense variants, sequence motifs, and protein binding sites. In addition, tissue-specific annotations were considered for brain vs. blood. Disease/trait SNPs were compared to a control set of 4809 SNPs matched to the GWAS SNPs (N = 1639) on allele frequency, gene density, distance to nearest gene, and linkage disequilibrium at ~3:1 ratio. Enrichment analyses were conducted using logistic regression, with Bonferroni correction. Overall, a significant enrichment was observed for all functional elements, except sequence motifs. Missense SNPs showed the strongest magnitude of enrichment. eQTLs were the only functional element significantly enriched across all diseases/traits. Magnitudes of enrichment were generally similar across diseases/traits, where enrichment was statistically significant. Blood vs. brain tissue effects on enrichment were dependent on disease/trait and functional element (e.g., cardiovascular disease: eQTLs P TissueDifference = 1.28 × 10 -6 vs. enhancers P TissueDifference = 0.94). Identifying disease/trait-relevant functional elements and tissue types could provide new insight into the underlying biology, by guiding a priori GWAS analyses (e.g., brain enhancer elements for psychiatric disease) or facilitating post hoc interpretation.
Characterization of candidate genes in inflammatory bowel disease–associated risk loci
Peloquin, Joanna M.; Sartor, R. Balfour; Newberry, Rodney D.; McGovern, Dermot P.; Yajnik, Vijay; Lira, Sergio A.
2016-01-01
GWAS have linked SNPs to risk of inflammatory bowel disease (IBD), but a systematic characterization of disease-associated genes has been lacking. Prior studies utilized microarrays that did not capture many genes encoded within risk loci or defined expression quantitative trait loci (eQTLs) using peripheral blood, which is not the target tissue in IBD. To address these gaps, we sought to characterize the expression of IBD-associated risk genes in disease-relevant tissues and in the setting of active IBD. Terminal ileal (TI) and colonic mucosal tissues were obtained from patients with Crohn’s disease or ulcerative colitis and from healthy controls. We developed a NanoString code set to profile 678 genes within IBD risk loci. A subset of patients and controls were genotyped for IBD-associated risk SNPs. Analyses included differential expression and variance analysis, weighted gene coexpression network analysis, and eQTL analysis. We identified 116 genes that discriminate between healthy TI and colon samples and uncovered patterns in variance of gene expression that highlight heterogeneity of disease. We identified 107 coexpressed gene pairs for which transcriptional regulation is either conserved or reversed in an inflammation-independent or -dependent manner. We demonstrate that on average approximately 60% of disease-associated genes are differentially expressed in inflamed tissue. Last, we identified eQTLs with either genotype-only effects on expression or an interaction effect between genotype and inflammation. Our data reinforce tissue specificity of expression in disease-associated candidate genes, highlight genes and gene pairs that are regulated in disease-relevant tissue and inflammation, and provide a foundation to advance the understanding of IBD pathogenesis. PMID:27668286
Xie, Fang-Fei; Deng, Fei-Yan; Wu, Long-Fei; Mo, Xing-Bo; Zhu, Hong; Wu, Jian; Guo, Yu-Fan; Zeng, Ke-Qin; Wang, Ming-Jun; Zhu, Xiao-Wei; Xia, Wei; Wang, Lan; He, Pei; Bing, Peng-Fei; Lu, Xin; Zhang, Yong-Hong; Lei, Shu-Feng
2018-01-01
DNA methylation is an important regulator on the mRNA expression. However, a genome-wide correlation pattern between DNA methylation and mRNA expression in human peripheral blood mononuclear cells (PBMCs) is largely unknown. The comprehensive relationship between mRNA and DNA methylation was explored by using four types of correlation analyses and a genome-wide methylation-mRNA expression quantitative trait locus (eQTL) analysis in PBMCs in 46 unrelated female subjects. An enrichment analysis was performed to detect biological function for the detected genes. Single pair correlation coefficient (r T1 ) between methylation level and mRNA is moderate (-0.63-0.62) in intensity, and the negative and positive correlations are nearly equal in quantity. Correlation analysis on each gene (T4) found 60.1% genes showed correlations between mRNA and gene-based methylation at P < 0.05 and more than 5.96% genes presented very strong correlation (R T4 > 0.8). Methylation sites have regulation effects on mRNA expression in eQTL analysis, with more often observations in region of transcription start site (TSS). The genes under significant methylation regulation both in correlation analysis and eQTL analysis tend to cluster to the categories (e.g., transcription, translation, regulation of transcription) that are essential for maintaining the basic life activities of cells. Our findings indicated that DNA methylation has predictive regulation effect on mRNA with a very complex pattern in PBMCs. The results increased our understanding on correlation of methylation and mRNA and also provided useful clues for future epigenetic studies in exploring biological and disease-related regulatory mechanisms in PBMC.
A genome-wide survey of CD4+ lymphocyte regulatory genetic variants identifies novel asthma genes
Sharma, Sunita; Zhou, Xiaobo; Thibault, Derek M.; Himes, Blanca E.; Liu, Andy; Szefler, Stanley J.; Strunk, Robert; Castro, Mario; Hansel, Nadia N.; Diette, Gregory B.; Vonakis, Becky M.; Adkinson, N. Franklin; Avila, Lydiana; Soto-Quiros, Manuel; Barraza-Villareal, Albino; Lemanske, Robert F.; Solway, Julian; Krishnan, Jerry; White, Steven R.; Cheadle, Chris; Berger, Alan E.; Fan, Jinshui; Boorgula, Meher Preethi; Nicolae, Dan; Gilliland, Frank; Barnes, Kathleen; London, Stephanie J.; Martinez, Fernando; Ober, Carole; Celedón, Juan C.; Carey, Vincent J.; Weiss, Scott T.; Raby, Benjamin A.
2014-01-01
Background Genome-wide association studies have yet to identify the majority of genetic variants involved in asthma. We hypothesized that expression quantitative trait locus (eQTL) mapping can identify novel asthma genes by enabling prioritization of putative functional variants for association testing. Objective We evaluated 6,706 cis-acting expression-associated variants (eSNP) identified through a genome-wide eQTL survey of CD4+ lymphocytes for association with asthma. Methods eSNP were tested for association with asthma in 359 asthma cases and 846 controls from the Childhood Asthma Management Program, with verification using family-based testing. Significant associations were tested for replication in 579 parent-child trios with asthma from Costa Rica. Further functional validation was performed by Formaldehyde Assisted Isolation of Regulatory Elements (FAIRE)-qPCR and Chromatin-Immunoprecipitation (ChIP)-PCR in lung derived epithelial cell lines (Beas-2B and A549) and Jurkat cells, a leukemia cell line derived from T lymphocytes. Results Cis-acting eSNP demonstrated associations with asthma in both cohorts. We confirmed the previously-reported association of ORMDL3/GSDMB variants with asthma (combined p=2.9 × 108). Reproducible associations were also observed for eSNP in three additional genes: FADS2 (p=0.002), NAGA (p=0.0002), and F13A1 (p=0.0001). We subsequently demonstrated that FADS2 mRNA is increased in CD4+ lymphocytes in asthmatics, and that the associated eSNPs reside within DNA segments with histone modifications that denote open chromatin status and confer enhancer activity. Conclusions Our results demonstrate the utility of eQTL mapping in the identification of novel asthma genes, and provide evidence for the importance of FADS2, NAGA, and F13A1 in the pathogenesis of asthma. PMID:24934276
Koturan, Surya; Ko, Jeong-Hun; Fonseca, Carmen; Harmston, Nathan; Game, Laurence; Martin, Javier; Ong, Voon; Abraham, David J; Denton, Christopher P; Behmoaras, Jacques; Petretto, Enrico
2018-01-01
Objectives Several common and rare risk variants have been reported for systemic sclerosis (SSc), but the effector cell(s) mediating the function of these genetic variants remains to be elucidated. While innate immune cells have been proposed as the critical targets to interfere with the disease process underlying SSc, no studies have comprehensively established their effector role. Here we investigated the contribution of monocyte-derived macrophages (MDMs) in mediating genetic susceptibility to SSc. Methods We carried out RNA sequencing and genome-wide genotyping in MDMs from 57 patients with SSc and 15 controls. Our differential expression and expression quantitative trait locus (eQTL) analysis in SSc was further integrated with epigenetic, expression and eQTL data from skin, monocytes, neutrophils and lymphocytes. Results We identified 602 genes upregulated and downregulated in SSc macrophages that were significantly enriched for genes previously implicated in SSc susceptibility (P=5×10−4), and 270 cis-regulated genes in MDMs. Among these, GSDMA was reported to carry an SSc risk variant (rs3894194) regulating expression of neighbouring genes in blood. We show that GSDMA is upregulated in SSc MDMs (P=8.4×10−4) but not in the skin, and is a significant eQTL in SSc macrophages and lipopolysaccharide/interferon gamma (IFNγ)-stimulated monocytes. Furthermore, we identify an SSc macrophage transcriptome signature characterised by upregulation of glycolysis, hypoxia and mTOR signalling and a downregulation of IFNγ response pathways. Conclusions Our data further establish the link between macrophages and SSc, and suggest that the contribution of the rs3894194 risk variant to SSc susceptibility can be mediated by GSDMA expression in macrophages. PMID:29348297
Leveraging lung tissue transcriptome to uncover candidate causal genes in COPD genetic associations.
Lamontagne, Maxime; Bérubé, Jean-Christophe; Obeidat, Ma'en; Cho, Michael H; Hobbs, Brian D; Sakornsakolpat, Phuwanat; de Jong, Kim; Boezen, H Marike; Nickle, David; Hao, Ke; Timens, Wim; van den Berge, Maarten; Joubert, Philippe; Laviolette, Michel; Sin, Don D; Paré, Peter D; Bossé, Yohan
2018-05-15
Causal genes of chronic obstructive pulmonary disease (COPD) remain elusive. The current study aims at integrating genome-wide association studies (GWAS) and lung expression quantitative trait loci (eQTL) data to map COPD candidate causal genes and gain biological insights into the recently discovered COPD susceptibility loci. Two complementary genomic datasets on COPD were studied. First, the lung eQTL dataset which included whole-genome gene expression and genotyping data from 1038 individuals. Second, the largest COPD GWAS to date from the International COPD Genetics Consortium (ICGC) with 13 710 cases and 38 062 controls. Methods that integrated GWAS with eQTL signals including transcriptome-wide association study (TWAS), colocalization and Mendelian randomization-based (SMR) approaches were used to map causality genes, i.e. genes with the strongest evidence of being the functional effector at specific loci. These methods were applied at the genome-wide level and at COPD risk loci derived from the GWAS literature. Replication was performed using lung data from GTEx. We collated 129 non-overlapping risk loci for COPD from the GWAS literature. At the genome-wide scale, 12 new COPD candidate genes/loci were revealed and six replicated in GTEx including CAMK2A, DMPK, MYO15A, TNFRSF10A, BTN3A2 and TRBV30. In addition, we mapped candidate causal genes for 60 out of the 129 GWAS-nominated loci and 23 of them were replicated in GTEx. Mapping candidate causal genes in lung tissue represents an important contribution to the genetics of COPD, enriches our biological interpretation of GWAS findings, and brings us closer to clinical translation of genetic associations.
Zhang, Xu; Zhang, Wei; Ma, Shwu-Fan; Desai, Ankit A.; Saraf, Santosh; Miasniakova, Galina; Sergueeva, Adelina; Ammosova, Tatiana; Xu, Min; Nekhai, Sergei; Abbasi, Taimur; Casanova, Nancy G.; Steinberg, Martin H.; Baldwin, Clinton T.; Sebastiani, Paola; Prchal, Josef T.; Kittles, Rick; Garcia, Joe G. N.; Machado, Roberto F.; Gordeuk, Victor R.
2014-01-01
Background We postulated that the hypoxic response in sickle cell disease (SCD) contributes to altered gene expression and pulmonary hypertension, a complication associated with early mortality. Methods and Results To identify genes regulated by the hypoxic response and not other effects of chronic anemia, we compared expression variation in peripheral blood mononuclear cells from 13 SCD subjects with hemoglobin SS genotype and 15 Chuvash polycythemia subjects (VHLR200W homozygotes with constitutive up-regulation of hypoxia inducible factors in the absence of anemia or hypoxia). At 5% false discovery rate, 1040 genes exhibited >1.15 fold change in both conditions; 297 were up-regulated and 743 down-regulated including MAPK8 encoding a mitogen-activated protein kinase important for apoptosis, T-cell differentiation and inflammatory responses. Association mapping with a focus on local regulatory polymorphisms in 61 SCD patients identified expression quantitative trait loci (eQTL) for 103 of these hypoxia response genes. In a University of Illinois SCD cohort the A allele of a MAPK8 eQTL, rs10857560, was associated with pre-capillary pulmonary hypertension defined as mean pulmonary artery pressure ≥25 and pulmonary capillary wedge pressure ≤15 mm Hg at right heart catheterization (allele frequency=0.66; OR=13.8, P=0.00036, n=238). This association was confirmed in an independent Walk-PHaSST cohort (allele frequency=0.65; OR=11.3, P=0.0025, n=519). The homozygous AA genotype of rs10857560 was associated with decreased MAPK8 expression and present in all 14 identified pre-capillary pulmonary hypertension cases among the combined 757 patients. Conclusions Our study demonstrates a prominent hypoxic transcription component in SCD and a MAPK8 eQTL associated with pre-capillary pulmonary hypertension. PMID:24515990
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hasin-Brumshtein, Yehudit; Khan, Arshad H.; Hormozdiari, Farhad
2016-09-13
Previous studies had shown that the integration of genome wide expression profiles, in metabolic tissues, with genetic and phenotypic variance, provided valuable insight into the underlying molecular mechanisms. We used RNA-Seq to characterize hypothalamic transcriptome in 99 inbred strains of mice from the Hybrid Mouse Diversity Panel (HMDP), a reference resource population for cardiovascular and metabolic traits. We report numerous novel transcripts supported by proteomic analyses, as well as novel non coding RNAs. High resolution genetic mapping of transcript levels in HMDP, reveals bothlocalandtransexpression Quantitative Trait Loci (eQTLs) demonstrating 2transeQTL 'hotspots' associated with expression of hundreds of genes. We alsomore » report thousands of alternative splicing events regulated by genetic variants. Finally, comparison with about 150 metabolic and cardiovascular traits revealed many highly significant associations. Our data provide a rich resource for understanding the many physiologic functions mediated by the hypothalamus and their genetic regulation.« less
Epistatic interactions influence terrestrial–marine functional shifts in cetacean rhodopsin
2017-01-01
Like many aquatic vertebrates, whales have blue-shifting spectral tuning substitutions in the dim-light visual pigment, rhodopsin, that are thought to increase photosensitivity in underwater environments. We have discovered that known spectral tuning substitutions also have surprising epistatic effects on another function of rhodopsin, the kinetic rates associated with light-activated intermediates. By using absorbance spectroscopy and fluorescence-based retinal release assays on heterologously expressed rhodopsin, we assessed both spectral and kinetic differences between cetaceans (killer whale) and terrestrial outgroups (hippo, bovine). Mutation experiments revealed that killer whale rhodopsin is unusually resilient to pleiotropic effects on retinal release from key blue-shifting substitutions (D83N and A292S), largely due to a surprisingly specific epistatic interaction between D83N and the background residue, S299. Ancestral sequence reconstruction indicated that S299 is an ancestral residue that predates the evolution of blue-shifting substitutions at the origins of Cetacea. Based on these results, we hypothesize that intramolecular epistasis helped to conserve rhodopsin's kinetic properties while enabling blue-shifting spectral tuning substitutions as cetaceans adapted to aquatic environments. Trade-offs between different aspects of molecular function are rarely considered in protein evolution, but in cetacean and other vertebrate rhodopsins, may underlie multiple evolutionary scenarios for the selection of specific amino acid substitutions. PMID:28250185
Gilchrist, A S; Partridge, L
1999-01-01
Body size clines in Drosophila melanogaster have been documented in both Australia and South America, and may exist in Southern Africa. We crossed flies from the northern and southern ends of each of these clines to produce F(1), F(2), and first backcross generations. Our analysis of generation means for wing area and wing length produced estimates of the additive, dominance, epistatic, and maternal effects underlying divergence within each cline. For both females and males of all three clines, the generation means were adequately described by these parameters, indicating that linkage and higher order interactions did not contribute significantly to wing size divergence. Marked differences were apparent between the clines in the occurrence and magnitude of the significant genetic parameters. No cline was adequately described by a simple additive-dominance model, and significant epistatic and maternal effects occurred in most, but not all, of the clines. Generation variances were also analyzed. Only one cline was described sufficiently by a simple additive variance model, indicating significant epistatic, maternal, or linkage effects in the remaining two clines. The diversity in genetic architecture of the clines suggests that natural selection has produced similar phenotypic divergence by different combinations of gene action and interaction. PMID:10581284
Kerwin, Rachel E; Feusier, Julie; Muok, Alise; Lin, Catherine; Larson, Brandon; Copeland, Daniel; Corwin, Jason A; Rubin, Matthew J; Francisco, Marta; Li, Baohua; Joseph, Bindu; Weinig, Cynthia; Kliebenstein, Daniel J
2017-08-01
Despite the growing number of studies showing that genotype × environment and epistatic interactions control fitness, the influences of epistasis × environment interactions on adaptive trait evolution remain largely uncharacterized. Across three field trials, we quantified aliphatic glucosinolate (GSL) defense chemistry, leaf damage, and relative fitness using mutant lines of Arabidopsis thaliana varying at pairs of causal aliphatic GSL defense genes to test the impact of epistatic and epistasis × environment interactions on adaptive trait variation. We found that aliphatic GSL accumulation was primarily influenced by additive and epistatic genetic variation, leaf damage was primarily influenced by environmental variation and relative fitness was primarily influenced by epistasis and epistasis × environment interactions. Epistasis × environment interactions accounted for up to 48% of the relative fitness variation in the field. At a single field site, the impact of epistasis on relative fitness varied significantly over 2 yr, showing that epistasis × environment interactions within a location can be temporally dynamic. These results suggest that the environmental dependency of epistasis can profoundly influence the response to selection, shaping the adaptive trajectories of natural populations in complex ways, and deserves further consideration in future evolutionary studies. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
Lagator, Mato; Colegrave, Nick; Neve, Paul
2014-11-07
In rapidly changing environments, selection history may impact the dynamics of adaptation. Mutations selected in one environment may result in pleiotropic fitness trade-offs in subsequent novel environments, slowing the rates of adaptation. Epistatic interactions between mutations selected in sequential stressful environments may slow or accelerate subsequent rates of adaptation, depending on the nature of that interaction. We explored the dynamics of adaptation during sequential exposure to herbicides with different modes of action in Chlamydomonas reinhardtii. Evolution of resistance to two of the herbicides was largely independent of selection history. For carbetamide, previous adaptation to other herbicide modes of action positively impacted the likelihood of adaptation to this herbicide. Furthermore, while adaptation to all individual herbicides was associated with pleiotropic fitness costs in stress-free environments, we observed that accumulation of resistance mechanisms was accompanied by a reduction in overall fitness costs. We suggest that antagonistic epistasis may be a driving mechanism that enables populations to more readily adapt in novel environments. These findings highlight the potential for sequences of xenobiotics to facilitate the rapid evolution of multiple-drug and -pesticide resistance, as well as the potential for epistatic interactions between adaptive mutations to facilitate evolutionary rescue in rapidly changing environments. © 2014 The Author(s) Published by the Royal Society. All rights reserved.
Gutiérrez, Jayson
2009-01-01
The way in which the information contained in genotypes is translated into complex phenotypic traits (i.e. embryonic expression patterns) depends on its decoding by a multilayered hierarchy of biomolecular systems (regulatory networks). Each layer of this hierarchy displays its own regulatory schemes (i.e. operational rules such as +/− feedback) and associated control parameters, resulting in characteristic variational constraints. This process can be conceptualized as a mapping issue, and in the context of highly-dimensional genotype-phenotype mappings (GPMs) epistatic events have been shown to be ubiquitous, manifested in non-linear correspondences between changes in the genotype and their phenotypic effects. In this study I concentrate on epistatic phenomena pervading levels of biological organization above the genetic material, more specifically the realm of molecular networks. At this level, systems approaches to studying GPMs are specially suitable to shed light on the mechanistic basis of epistatic phenomena. To this aim, I constructed and analyzed ensembles of highly-modular (fully interconnected) networks with distinctive topologies, each displaying dynamic behaviors that were categorized as either arbitrary or functional according to early patterning processes in the Drosophila embryo. Spatio-temporal expression trajectories in virtual syncytial embryos were simulated via reaction-diffusion models. My in silico mutational experiments show that: 1) the average fitness decay tendency to successively accumulated mutations in ensembles of functional networks indicates the prevalence of positive epistasis, whereas in ensembles of arbitrary networks negative epistasis is the dominant tendency; and 2) the evaluation of epistatic coefficients of diverse interaction orders indicates that, both positive and negative epistasis are more prevalent in functional networks than in arbitrary ones. Overall, I conclude that the phenotypic and fitness effects of multiple perturbations are strongly conditioned by both the regulatory architecture (i.e. pattern of coupled feedback structures) and the dynamic nature of the spatio-temporal expression trajectories displayed by the simulated networks. PMID:19738908
Yang, Delong; Liu, Yuan; Cheng, Hongbo; Chang, Lei; Chen, Jingjing; Chai, Shouxi; Li, Mengfei
2016-06-28
Morphological traits related to flag leaves are determinant traits influencing plant architecture and yield potential in wheat (Triticum aestivum L.). However, little is known regarding their genetic controls under drought stress. One hundred and twenty F8-derived recombinant inbred lines from a cross between two common wheat cultivars Longjian 19 and Q9086 were developed to identify quantitative trait loci (QTLs) and to dissect the genetic bases underlying flag leaf width, length, area, length to width ratio and basal angle under drought stress and well-watered conditions consistent over four environments. A total of 55 additive and 51 pairs of epistatic QTLs were identified on all 21 chromosomes except 6D, among which additive loci were highly concentrated in a few of same or adjacent marker intervals in individual chromosomes. Two specific marker intervals of Xwmc694-Xwmc156 on chromosome 1B and Xbarc1072-Xwmc272 on chromosome 2B were co-located by additive QTLs for four tested traits. Twenty additive loci were repeatedly detected in more than two environments, suggestive of stable A-QTLs. A majority of QTLs involved significant additive and epistatic effects, as well as QTL × environment interactions (QEIs). Of these, 72.7 % of additive QEIs and 80 % of epistatic QEIs were related to drought stress with significant genetic effects decreasing phenotypic values. By contrast, additive and QEIs effects contributed more phenotypic variation than epistatic effects. Flag leaf morphology in wheat was predominantly controlled by additive and QEIs effects, where more QEIs effects occurred in drought stress and depressed phenotypic performances. Several QTL clusters indicated tight linkage or pleiotropy in the inheritance of these traits. Twenty stable QTLs for flag leaf morphology are potentially useful for the genetic improvement of drought tolerance in wheat through QTL pyramiding.
Common Genetic Polymorphisms Influence Blood Biomarker Measurements in COPD.
Sun, Wei; Kechris, Katerina; Jacobson, Sean; Drummond, M Bradley; Hawkins, Gregory A; Yang, Jenny; Chen, Ting-Huei; Quibrera, Pedro Miguel; Anderson, Wayne; Barr, R Graham; Basta, Patricia V; Bleecker, Eugene R; Beaty, Terri; Casaburi, Richard; Castaldi, Peter; Cho, Michael H; Comellas, Alejandro; Crapo, James D; Criner, Gerard; Demeo, Dawn; Christenson, Stephanie A; Couper, David J; Curtis, Jeffrey L; Doerschuk, Claire M; Freeman, Christine M; Gouskova, Natalia A; Han, MeiLan K; Hanania, Nicola A; Hansel, Nadia N; Hersh, Craig P; Hoffman, Eric A; Kaner, Robert J; Kanner, Richard E; Kleerup, Eric C; Lutz, Sharon; Martinez, Fernando J; Meyers, Deborah A; Peters, Stephen P; Regan, Elizabeth A; Rennard, Stephen I; Scholand, Mary Beth; Silverman, Edwin K; Woodruff, Prescott G; O'Neal, Wanda K; Bowler, Russell P
2016-08-01
Implementing precision medicine for complex diseases such as chronic obstructive lung disease (COPD) will require extensive use of biomarkers and an in-depth understanding of how genetic, epigenetic, and environmental variations contribute to phenotypic diversity and disease progression. A meta-analysis from two large cohorts of current and former smokers with and without COPD [SPIROMICS (N = 750); COPDGene (N = 590)] was used to identify single nucleotide polymorphisms (SNPs) associated with measurement of 88 blood proteins (protein quantitative trait loci; pQTLs). PQTLs consistently replicated between the two cohorts. Features of pQTLs were compared to previously reported expression QTLs (eQTLs). Inference of causal relations of pQTL genotypes, biomarker measurements, and four clinical COPD phenotypes (airflow obstruction, emphysema, exacerbation history, and chronic bronchitis) were explored using conditional independence tests. We identified 527 highly significant (p < 8 X 10-10) pQTLs in 38 (43%) of blood proteins tested. Most pQTL SNPs were novel with low overlap to eQTL SNPs. The pQTL SNPs explained >10% of measured variation in 13 protein biomarkers, with a single SNP (rs7041; p = 10-392) explaining 71%-75% of the measured variation in vitamin D binding protein (gene = GC). Some of these pQTLs [e.g., pQTLs for VDBP, sRAGE (gene = AGER), surfactant protein D (gene = SFTPD), and TNFRSF10C] have been previously associated with COPD phenotypes. Most pQTLs were local (cis), but distant (trans) pQTL SNPs in the ABO blood group locus were the top pQTL SNPs for five proteins. The inclusion of pQTL SNPs improved the clinical predictive value for the established association of sRAGE and emphysema, and the explanation of variance (R2) for emphysema improved from 0.3 to 0.4 when the pQTL SNP was included in the model along with clinical covariates. Causal modeling provided insight into specific pQTL-disease relationships for airflow obstruction and emphysema. In conclusion, given the frequency of highly significant local pQTLs, the large amount of variance potentially explained by pQTL, and the differences observed between pQTLs and eQTLs SNPs, we recommend that protein biomarker-disease association studies take into account the potential effect of common local SNPs and that pQTLs be integrated along with eQTLs to uncover disease mechanisms. Large-scale blood biomarker studies would also benefit from close attention to the ABO blood group.
Common Genetic Polymorphisms Influence Blood Biomarker Measurements in COPD
Drummond, M. Bradley; Hawkins, Gregory A.; Yang, Jenny; Chen, Ting-huei; Quibrera, Pedro Miguel; Anderson, Wayne; Barr, R. Graham; Bleecker, Eugene R.; Beaty, Terri; Casaburi, Richard; Castaldi, Peter; Cho, Michael H.; Comellas, Alejandro; Crapo, James D.; Criner, Gerard; Demeo, Dawn; Christenson, Stephanie A.; Couper, David J.; Doerschuk, Claire M.; Freeman, Christine M.; Gouskova, Natalia A.; Han, MeiLan K.; Hanania, Nicola A.; Hansel, Nadia N.; Hersh, Craig P.; Hoffman, Eric A.; Kaner, Robert J.; Kanner, Richard E.; Kleerup, Eric C.; Lutz, Sharon; Martinez, Fernando J.; Meyers, Deborah A.; Peters, Stephen P.; Regan, Elizabeth A.; Rennard, Stephen I.; Scholand, Mary Beth; Silverman, Edwin K.; Woodruff, Prescott G.; O’Neal, Wanda K.; Bowler, Russell P.
2016-01-01
Implementing precision medicine for complex diseases such as chronic obstructive lung disease (COPD) will require extensive use of biomarkers and an in-depth understanding of how genetic, epigenetic, and environmental variations contribute to phenotypic diversity and disease progression. A meta-analysis from two large cohorts of current and former smokers with and without COPD [SPIROMICS (N = 750); COPDGene (N = 590)] was used to identify single nucleotide polymorphisms (SNPs) associated with measurement of 88 blood proteins (protein quantitative trait loci; pQTLs). PQTLs consistently replicated between the two cohorts. Features of pQTLs were compared to previously reported expression QTLs (eQTLs). Inference of causal relations of pQTL genotypes, biomarker measurements, and four clinical COPD phenotypes (airflow obstruction, emphysema, exacerbation history, and chronic bronchitis) were explored using conditional independence tests. We identified 527 highly significant (p < 8 X 10−10) pQTLs in 38 (43%) of blood proteins tested. Most pQTL SNPs were novel with low overlap to eQTL SNPs. The pQTL SNPs explained >10% of measured variation in 13 protein biomarkers, with a single SNP (rs7041; p = 10−392) explaining 71%-75% of the measured variation in vitamin D binding protein (gene = GC). Some of these pQTLs [e.g., pQTLs for VDBP, sRAGE (gene = AGER), surfactant protein D (gene = SFTPD), and TNFRSF10C] have been previously associated with COPD phenotypes. Most pQTLs were local (cis), but distant (trans) pQTL SNPs in the ABO blood group locus were the top pQTL SNPs for five proteins. The inclusion of pQTL SNPs improved the clinical predictive value for the established association of sRAGE and emphysema, and the explanation of variance (R2) for emphysema improved from 0.3 to 0.4 when the pQTL SNP was included in the model along with clinical covariates. Causal modeling provided insight into specific pQTL-disease relationships for airflow obstruction and emphysema. In conclusion, given the frequency of highly significant local pQTLs, the large amount of variance potentially explained by pQTL, and the differences observed between pQTLs and eQTLs SNPs, we recommend that protein biomarker-disease association studies take into account the potential effect of common local SNPs and that pQTLs be integrated along with eQTLs to uncover disease mechanisms. Large-scale blood biomarker studies would also benefit from close attention to the ABO blood group. PMID:27532455
Epistatically Interacting Substitutions Are Enriched during Adaptive Protein Evolution
Gong, Lizhi Ian; Bloom, Jesse D.
2014-01-01
Most experimental studies of epistasis in evolution have focused on adaptive changes—but adaptation accounts for only a portion of total evolutionary change. Are the patterns of epistasis during adaptation representative of evolution more broadly? We address this question by examining a pair of protein homologs, of which only one is subject to a well-defined pressure for adaptive change. Specifically, we compare the nucleoproteins from human and swine influenza. Human influenza is under continual selection to evade recognition by acquired immune memory, while swine influenza experiences less such selection due to the fact that pigs are less likely to be infected with influenza repeatedly in a lifetime. Mutations in some types of immune epitopes are therefore much more strongly adaptive to human than swine influenza—here we focus on epitopes targeted by human cytotoxic T lymphocytes. The nucleoproteins of human and swine influenza possess nearly identical numbers of such epitopes. However, mutations in these epitopes are fixed significantly more frequently in human than in swine influenza, presumably because these epitope mutations are adaptive only to human influenza. Experimentally, we find that epistatically constrained mutations are fixed only in the adaptively evolving human influenza lineage, where they occur at sites that are enriched in epitopes. Overall, our results demonstrate that epistatically interacting substitutions are enriched during adaptation, suggesting that the prevalence of epistasis is dependent on the underlying evolutionary forces at play. PMID:24811236
Blevins, Tana; Aliev, Fazil; Adkins, Amy; Hack, Laura; Bigdeli, Tim; D. van der Vaart, Andrew; Web, Bradley Todd; Bacanu, Silviu-Alin; Kalsi, Gursharan; Kendler, Kenneth S.; Miles, Michael F.; Dick, Danielle; Riley, Brien P.; Dumur, Catherine; Vladimirov, Vladimir I.
2015-01-01
Alcohol consumption is known to lead to gene expression changes in the brain. After performing weighted gene co-expression network analyses (WGCNA) on genome-wide mRNA and microRNA (miRNA) expression in Nucleus Accumbens (NAc) of subjects with alcohol dependence (AD; N = 18) and of matched controls (N = 18), six mRNA and three miRNA modules significantly correlated with AD were identified (Bonferoni-adj. p≤ 0.05). Cell-type-specific transcriptome analyses revealed two of the mRNA modules to be enriched for neuronal specific marker genes and downregulated in AD, whereas the remaining four mRNA modules were enriched for astrocyte and microglial specific marker genes and upregulated in AD. Gene set enrichment analysis demonstrated that neuronal specific modules were enriched for genes involved in oxidative phosphorylation, mitochondrial dysfunction and MAPK signaling. Glial-specific modules were predominantly enriched for genes involved in processes related to immune functions, i.e. cytokine signaling (all adj. p≤ 0.05). In mRNA and miRNA modules, 461 and 25 candidate hub genes were identified, respectively. In contrast to the expected biological functions of miRNAs, correlation analyses between mRNA and miRNA hub genes revealed a higher number of positive than negative correlations (χ2 test p≤ 0.0001). Integration of hub gene expression with genome-wide genotypic data resulted in 591 mRNA cis-eQTLs and 62 miRNA cis-eQTLs. mRNA cis-eQTLs were significantly enriched for AD diagnosis and AD symptom counts (adj. p = 0.014 and p = 0.024, respectively) in AD GWAS signals in a large, independent genetic sample from the Collaborative Study on Genetics of Alcohol (COGA). In conclusion, our study identified putative gene network hubs coordinating mRNA and miRNA co-expression changes in the NAc of AD subjects, and our genetic (cis-eQTL) analysis provides novel insights into the etiological mechanisms of AD. PMID:26381263
DOE Office of Scientific and Technical Information (OSTI.GOV)
Saxer, Gerda; Krepps, Michael D.; Merkley, Eric D.
Adaptation to ecologically complex environments can provide insights into the evolutionary dynamics and functional constraints encountered by organisms during natural selection. Unlike adaptation to a single limiting resource, adaptation to a new environment with abundant and varied resources can be difficult to achieve by small incremental changes since many mutations are required to achieve even modest gains in fitness. Since changing complex environments are quite common in nature, we investigated how such an epistatic bottleneck can be avoided to allow rapid adaptation. We show that adaptive mutations arise repeatedly in independently evolved populations in the context of greatly increased geneticmore » and phenotypic diversity. We go on to show that weak selection requiring substantial metabolic reprogramming can be readily achieved by mutations in the global response regulator arcA and the stress response regulator rpoS. We identified 46 unique single-nucleotide variants of arcA and 18 mutations in rpoS, nine of which resulted in stop codons or large deletions, suggesting that a subtle modulation of ArcA function and knockouts of rpoS are largely responsible for the metabolic shifts leading to adaptation. These mutations allow a higher order “metabolic selection” that eliminates epistatic bottlenecks, which could occur when many changes would be required. Proteomic and carbohydrate analysis of adapting E. coli populations revealed an up-regulation of enzymes associated with the TCA cycle and amino acid metabolism and an increase in the secretion of putrescine. The overall effect of adaptation across populations is to redirect and efficiently utilize uptake and catabolism of abundant amino acids. Concomitantly, there is a pronounced spread of more ecologically limited strains that results from specialization through metabolic erosion. Remarkably, the global regulators arcA and rpoS can provide a “one-step” mechanism of adaptation to a novel environment, which highlights the importance of global resource management as a powerful strategy to adaptation.« less
Mutations in Global Regulators Lead to Metabolic Selection during Adaptation to Complex Environments
Saxer, Gerda; Krepps, Michael D.; Merkley, Eric D.; Ansong, Charles; Deatherage Kaiser, Brooke L.; Valovska, Marie-Thérèse; Ristic, Nikola; Yeh, Ping T.; Prakash, Vittal P.; Leiser, Owen P.; Nakhleh, Luay; Gibbons, Henry S.; Kreuzer, Helen W.; Shamoo, Yousif
2014-01-01
Adaptation to ecologically complex environments can provide insights into the evolutionary dynamics and functional constraints encountered by organisms during natural selection. Adaptation to a new environment with abundant and varied resources can be difficult to achieve by small incremental changes if many mutations are required to achieve even modest gains in fitness. Since changing complex environments are quite common in nature, we investigated how such an epistatic bottleneck can be avoided to allow rapid adaptation. We show that adaptive mutations arise repeatedly in independently evolved populations in the context of greatly increased genetic and phenotypic diversity. We go on to show that weak selection requiring substantial metabolic reprogramming can be readily achieved by mutations in the global response regulator arcA and the stress response regulator rpoS. We identified 46 unique single-nucleotide variants of arcA and 18 mutations in rpoS, nine of which resulted in stop codons or large deletions, suggesting that subtle modulations of ArcA function and knockouts of rpoS are largely responsible for the metabolic shifts leading to adaptation. These mutations allow a higher order metabolic selection that eliminates epistatic bottlenecks, which could occur when many changes would be required. Proteomic and carbohydrate analysis of adapting E. coli populations revealed an up-regulation of enzymes associated with the TCA cycle and amino acid metabolism, and an increase in the secretion of putrescine. The overall effect of adaptation across populations is to redirect and efficiently utilize uptake and catabolism of abundant amino acids. Concomitantly, there is a pronounced spread of more ecologically limited strains that results from specialization through metabolic erosion. Remarkably, the global regulators arcA and rpoS can provide a “one-step” mechanism of adaptation to a novel environment, which highlights the importance of global resource management as a powerful strategy to adaptation. PMID:25501822
Mutations in global regulators lead to metabolic selection during adaptation to complex environments
Saxer, Gerda; Krepps, Michael D.; Merkley, Eric D.; ...
2014-12-11
Adaptation to ecologically complex environments can provide insights into the evolutionary dynamics and functional constraints encountered by organisms during natural selection. Unlike adaptation to a single limiting resource, adaptation to a new environment with abundant and varied resources can be difficult to achieve by small incremental changes since many mutations are required to achieve even modest gains in fitness. Since changing complex environments are quite common in nature, we investigated how such an epistatic bottleneck can be avoided to allow rapid adaptation. We show that adaptive mutations arise repeatedly in independently evolved populations in the context of greatly increased geneticmore » and phenotypic diversity. We go on to show that weak selection requiring substantial metabolic reprogramming can be readily achieved by mutations in the global response regulator arcA and the stress response regulator rpoS. We identified 46 unique single-nucleotide variants of arcA and 18 mutations in rpoS, nine of which resulted in stop codons or large deletions, suggesting that a subtle modulation of ArcA function and knockouts of rpoS are largely responsible for the metabolic shifts leading to adaptation. These mutations allow a higher order “metabolic selection” that eliminates epistatic bottlenecks, which could occur when many changes would be required. Proteomic and carbohydrate analysis of adapting E. coli populations revealed an up-regulation of enzymes associated with the TCA cycle and amino acid metabolism and an increase in the secretion of putrescine. The overall effect of adaptation across populations is to redirect and efficiently utilize uptake and catabolism of abundant amino acids. Concomitantly, there is a pronounced spread of more ecologically limited strains that results from specialization through metabolic erosion. Remarkably, the global regulators arcA and rpoS can provide a “one-step” mechanism of adaptation to a novel environment, which highlights the importance of global resource management as a powerful strategy to adaptation.« less
USDA-ARS?s Scientific Manuscript database
We have applied a systems genetics approach to elucidate molecular mechanisms of maize responses to gray leaf spot (GLS) disease, caused by Cercospora zeina, a major threat to maize production globally. We conducted expression QTL (eQTL) analysis of gene expression variation measured in earleaf samp...
USDA-ARS?s Scientific Manuscript database
We have evaluated the new Swine Protein-Annotated Oligonucleotide Microarray (http://www.pigoligoarray.org) by analyzing transcriptional profiles for longissimus dorsi muscle (LD), Bronchial lymph node (BLN) and Lung. Four LD samples were used to assess the stringency of hybridization conditions com...
Convergent evidence from systematic analysis of GWAS revealed genetic basis of esophageal cancer.
Gao, Xue-Xin; Gao, Lei; Wang, Jiu-Qiang; Qu, Su-Su; Qu, Yue; Sun, Hong-Lei; Liu, Si-Dang; Shang, Ying-Li
2016-07-12
Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with risk of esophageal cancer (EC). However, investigation of genetic basis from the perspective of systematic biology and integrative genomics remains scarce.In this study, we explored genetic basis of EC based on GWAS data and implemented a series of bioinformatics methods including functional annotation, expression quantitative trait loci (eQTL) analysis, pathway enrichment analysis and pathway grouped network analysis.Two hundred and thirteen risk SNPs were identified, in which 44 SNPs were found to have significantly differential gene expression in esophageal tissues by eQTL analysis. By pathway enrichment analysis, 170 risk genes mapped by risk SNPs were enriched into 38 significant GO terms and 17 significant KEGG pathways, which were significantly grouped into 9 sub-networks by pathway grouped network analysis. The 9 groups of interconnected pathways were mainly involved with muscle cell proliferation, cellular response to interleukin-6, cell adhesion molecules, and ethanol oxidation, which might participate in the development of EC.Our findings provide genetic evidence and new insight for exploring the molecular mechanisms of EC.
Adrenal cortex expression quantitative trait loci in a German Holstein × Charolais cross.
Brand, Bodo; Scheinhardt, Markus O; Friedrich, Juliane; Zimmer, Daisy; Reinsch, Norbert; Ponsuksili, Siriluck; Schwerin, Manfred; Ziegler, Andreas
2016-10-06
The importance of the adrenal gland in regard to lactation and reproduction in cattle has been recognized early. Caused by interest in animal welfare and the impact of stress on economically important traits in farm animals the adrenal gland and its function within the stress response is of increasing interest. However, the molecular mechanisms and pathways involved in stress-related effects on economically important traits in farm animals are not fully understood. Gene expression is an important mechanism underlying complex traits, and genetic variants affecting the transcript abundance are thought to influence the manifestation of an expressed phenotype. We therefore investigated the genetic background of adrenocortical gene expression by applying an adaptive linear rank test to identify genome-wide expression quantitative trait loci (eQTL) for adrenal cortex transcripts in cattle. A total of 10,986 adrenal cortex transcripts and 37,204 single nucleotide polymorphisms (SNPs) were analysed in 145 F2 cows of a Charolais × German Holstein cross. We identified 505 SNPs that were associated with the abundance of 129 transcripts, comprising 482 cis effects and 17 trans effects. These SNPs were located on all chromosomes but X, 16, 24 and 28. Associated genes are mainly involved in molecular and cellular functions comprising free radical scavenging, cellular compromise, cell morphology and lipid metabolism, including genes such as CYP27A1 and LHCGR that have been shown to affect economically important traits in cattle. In this study we showed that adrenocortical eQTL affect the expression of genes known to contribute to the phenotypic manifestation in cattle. Furthermore, some of the identified genes and related molecular pathways were previously shown to contribute to the phenotypic variation of behaviour, temperament and growth at the onset of puberty in the same population investigated here. We conclude that eQTL analysis appears to be a useful approach providing insight into the molecular and genetic background of complex traits in cattle and will help to understand molecular networks involved.
Giambartolomei, Claudia; Vukcevic, Damjan; Schadt, Eric E; Franke, Lude; Hingorani, Aroon D; Wallace, Chris; Plagnol, Vincent
2014-05-01
Genetic association studies, in particular the genome-wide association study (GWAS) design, have provided a wealth of novel insights into the aetiology of a wide range of human diseases and traits, in particular cardiovascular diseases and lipid biomarkers. The next challenge consists of understanding the molecular basis of these associations. The integration of multiple association datasets, including gene expression datasets, can contribute to this goal. We have developed a novel statistical methodology to assess whether two association signals are consistent with a shared causal variant. An application is the integration of disease scans with expression quantitative trait locus (eQTL) studies, but any pair of GWAS datasets can be integrated in this framework. We demonstrate the value of the approach by re-analysing a gene expression dataset in 966 liver samples with a published meta-analysis of lipid traits including >100,000 individuals of European ancestry. Combining all lipid biomarkers, our re-analysis supported 26 out of 38 reported colocalisation results with eQTLs and identified 14 new colocalisation results, hence highlighting the value of a formal statistical test. In three cases of reported eQTL-lipid pairs (SYPL2, IFT172, TBKBP1) for which our analysis suggests that the eQTL pattern is not consistent with the lipid association, we identify alternative colocalisation results with SORT1, GCKR, and KPNB1, indicating that these genes are more likely to be causal in these genomic intervals. A key feature of the method is the ability to derive the output statistics from single SNP summary statistics, hence making it possible to perform systematic meta-analysis type comparisons across multiple GWAS datasets (implemented online at http://coloc.cs.ucl.ac.uk/coloc/). Our methodology provides information about candidate causal genes in associated intervals and has direct implications for the understanding of complex diseases as well as the design of drugs to target disease pathways.
Ruden, Douglas M.; Chen, Lang; Possidente, Debra; Possidente, Bernard; Rasouli, Parsa; Wang, Luan; Lu, Xiangyi; Garfinkel, Mark D.; Hirsch, Helmut V. B.; Page, Grier P.
2009-01-01
The genetics of gene expression in recombinant inbred lines (RILs) can be mapped as expression quantitative trait loci (eQTLs). So-called “genetical genomics” studies have identified locally-acting eQTLs (cis-eQTLs) for genes that show differences in steady state RNA levels. These studies have also identified distantly-acting master-modulatory trans-eQTLs that regulate tens or hundreds of transcripts (hotspots or transbands). We expand on these studies by performing genetical genomics experiments in two environments in order to identify trans-eQTL that might be regulated by developmental exposure to the neurotoxin lead. Flies from each of 75 RIL were raised from eggs to adults on either control food (made with 250 µM sodium acetate), or lead-treated food (made with 250 µM lead acetate, PbAc). RNA expression analyses of whole adult male flies (5–10 days old) were performed with Affymetrix DrosII whole genome arrays (18,952 probesets). Among the 1,389 genes with cis-eQTL, there were 405 genes unique to control flies and 544 genes unique to lead-treated ones (440 genes had the same cis-eQTLs in both samples). There are 2,396 genes with trans-eQTL which mapped to 12 major transbands with greater than 95 genes. Permutation analyses of the strain labels but not the expression data suggests that the total number of eQTL and the number of transbands are more important criteria for validation than the size of the transband. Two transbands, one located on the 2nd chromosome and one on the 3rd chromosome, co-regulate 33 lead-induced genes, many of which are involved in neurodevelopmental processes. For these 33 genes, rather than allelic variation at one locus exerting differential effects in two environments, we found that variation at two different loci are required for optimal effects on lead-induced expression. PMID:19737576
Stanley, Patrick D.; Ng’oma, Enoch; O’Day, Siri; King, Elizabeth G.
2017-01-01
The nutritional environments that organisms experience are inherently variable, requiring tight coordination of how resources are allocated to different functions relative to the total amount of resources available. A growing body of evidence supports the hypothesis that key endocrine pathways play a fundamental role in this coordination. In particular, the insulin/insulin-like growth factor signaling (IIS) and target of rapamycin (TOR) pathways have been implicated in nutrition-dependent changes in metabolism and nutrient allocation. However, little is known about the genetic basis of standing variation in IIS/TOR or how diet-dependent changes in expression in this pathway influence phenotypes related to resource allocation. To characterize natural genetic variation in the IIS/TOR pathway, we used >250 recombinant inbred lines (RILs) derived from a multiparental mapping population, the Drosophila Synthetic Population Resource, to map transcript-level QTL of genes encoding 52 core IIS/TOR components in three different nutritional environments [dietary restriction (DR), control (C), and high sugar (HS)]. Nearly all genes, 87%, were significantly differentially expressed between diets, though not always in ways predicted by loss-of-function mutants. We identified cis (i.e., local) expression QTL (eQTL) for six genes, all of which are significant in multiple nutrient environments. Further, we identified trans (i.e., distant) eQTL for two genes, specific to a single nutrient environment. Our results are consistent with many small changes in the IIS/TOR pathways. A discriminant function analysis for the C and DR treatments identified a pattern of gene expression associated with the diet treatment. Mapping the composite discriminant function scores revealed a significant global eQTL within the DR diet. A correlation between the discriminant function scores and the median life span (r = 0.46) provides evidence that gene expression changes in response to diet are associated with longevity in these RILs. PMID:28592498
Yin, Honglei; Pantazatos, Spiro P; Galfalvy, Hanga; Huang, Yung-Yu; Rosoklija, Gorazd B; Dwork, Andrew J; Burke, Ainsley; Arango, Victoria; Oquendo, Maria A; Mann, John J
2016-04-01
Gamma-amino butyric acid (GABA) and glutamate are the major inhibitory and excitatory neurotransmitters in the mammalian central nervous system, respectively, and have been associated with suicidal behavior and major depressive disorder (MDD). We examined the relationship between genotype, brain transcriptome, and MDD/suicide for 24 genes involved in GABAergic and glutamatergic signaling. In part 1 of the study, 119 candidate SNPs in 24 genes (4 transporters, 4 enzymes, and 16 receptors) were tested for associations with MDD and suicidal behavior in 276 live participants (86 nonfatal suicide attempters with MDD and 190 non-attempters of whom 70% had MDD) and 209 postmortem cases (121 suicide deaths of whom 62% had MDD and 88 sudden death from other causes of whom 11% had MDD) using logistic regression adjusting for sex and age. In part 2, RNA-seq was used to assay isoform-level expression in dorsolateral prefrontal cortex of 59 postmortem samples (21 with MDD and suicide, 9 MDD without suicide, and 29 sudden death non-suicides and no psychiatric illness) using robust regression adjusting for sex, age, and RIN score. In part 3, SNPs with subthreshold (uncorrected) significance levels below 0.05 for an association with suicidal behavior and/or MDD in part 1 were tested for eQTL effects in prefrontal cortex using the Brain eQTL Almanac (www.braineac.org). No SNPs or transcripts were significant after adjustment for multiple comparisons. However, a protein coding transcript (ENST00000414552) of the GABA A receptor, gamma 2 (GABRG2) had lower brain expression postmortem in suicide (P = 0.01) and evidence for association with suicide death (P = 0.03) in a SNP that may be an eQTL in prefrontal cortex (rs424740, P = 0.02). These preliminary results implicate GABRG2 in suicide and warrant further investigation and replication in larger samples. © 2016 Wiley Periodicals, Inc.
Non-additive genetic variation in growth, carcass and fertility traits of beef cattle.
Bolormaa, Sunduimijid; Pryce, Jennie E; Zhang, Yuandan; Reverter, Antonio; Barendse, William; Hayes, Ben J; Goddard, Michael E
2015-04-02
A better understanding of non-additive variance could lead to increased knowledge on the genetic control and physiology of quantitative traits, and to improved prediction of the genetic value and phenotype of individuals. Genome-wide panels of single nucleotide polymorphisms (SNPs) have been mainly used to map additive effects for quantitative traits, but they can also be used to investigate non-additive effects. We estimated dominance and epistatic effects of SNPs on various traits in beef cattle and the variance explained by dominance, and quantified the increase in accuracy of phenotype prediction by including dominance deviations in its estimation. Genotype data (729 068 real or imputed SNPs) and phenotypes on up to 16 traits of 10 191 individuals from Bos taurus, Bos indicus and composite breeds were used. A genome-wide association study was performed by fitting the additive and dominance effects of single SNPs. The dominance variance was estimated by fitting a dominance relationship matrix constructed from the 729 068 SNPs. The accuracy of predicted phenotypic values was evaluated by best linear unbiased prediction using the additive and dominance relationship matrices. Epistatic interactions (additive × additive) were tested between each of the 28 SNPs that are known to have additive effects on multiple traits, and each of the other remaining 729 067 SNPs. The number of significant dominance effects was greater than expected by chance and most of them were in the direction that is presumed to increase fitness and in the opposite direction to inbreeding depression. Estimates of dominance variance explained by SNPs varied widely between traits, but had large standard errors. The median dominance variance across the 16 traits was equal to 5% of the phenotypic variance. Including a dominance deviation in the prediction did not significantly increase its accuracy for any of the phenotypes. The number of additive × additive epistatic effects that were statistically significant was greater than expected by chance. Significant dominance and epistatic effects occur for growth, carcass and fertility traits in beef cattle but they are difficult to estimate precisely and including them in phenotype prediction does not increase its accuracy.
Bennett, G L; Shackelford, S D; Wheeler, T L; King, D A; Casas, E; Smith, T P L
2013-02-01
Genetic markers in casein (CSN1S1) and thyroglobulin (TG) genes have previously been associated with fat distribution in cattle. Determining the nature of these genetic associations (additive, recessive, or dominant) has been difficult, because both markers have small minor allele frequencies in most beef cattle populations. This results in few animals homozygous for the minor alleles. selection to increase the frequencies of the minor alleles for 2 SNP markers in these genes was undertaken in a composite population. The objective was to obtain better estimates of genetic effects associated with these markers and determine if there were epistatic interactions. Selection increased the frequencies of minor alleles for both SNP from <0.30 to 0.45. Bulls (n = 24) heterozygous for both SNP were used in 3 yr to produce 204 steer progeny harvested at an average age of 474 d. The combined effect of the 9 CSN1S1 × TG genotypes was associated with carcass-adjusted fat thickness (P < 0.06) and meat tenderness predicted at the abattoir by visible and near-infrared reflectance spectroscopy (P < 0.04). Genotype did not affect BW from birth through harvest, ribeye area, marbling score, slice shear force, or image-based yield grade (P > 0.10). Additive, dominance, and epistatic SNP association effects were estimated from genotypic effects for adjusted fat thickness and predicted meat tenderness. Adjusted fat thickness showed a dominance association with TG SNP (P < 0.06) and an epistatic additive CSN1S1 × additive TG association (P < 0.03). For predicted meat tenderness, heterozygous TG meat was more tender than meat from either homozygote (P < 0.002). Dominance and epistatic associations can result in different SNP allele substitution effects in populations where SNP have the same linkage disequilibrium with causal mutations but have different frequencies. Although the complex associations estimated in this study would contribute little to within-population selection response, they could be important for marker-assisted management or reciprocal selection schemes.
Katyal, Sachin; Lee, Youngsoo; Nitiss, Karin C; Downing, Susanna M; Li, Yang; Shimada, Mikio; Zhao, Jingfeng; Russell, Helen R; Petrini, John H J; Nitiss, John L; McKinnon, Peter J
2014-06-01
DNA damage is considered to be a prime factor in several spinocerebellar neurodegenerative diseases; however, the DNA lesions underpinning disease etiology are unknown. We observed the endogenous accumulation of pathogenic topoisomerase-1 (Top1)-DNA cleavage complexes (Top1ccs) in murine models of ataxia telangiectasia and spinocerebellar ataxia with axonal neuropathy 1. We found that the defective DNA damage response factors in these two diseases cooperatively modulated Top1cc turnover in a non-epistatic and ATM kinase-independent manner. Furthermore, coincident neural inactivation of ATM and DNA single-strand break repair factors, including tyrosyl-DNA phosphodiesterase-1 or XRCC1, resulted in increased Top1cc formation and excessive DNA damage and neurodevelopmental defects. Notably, direct Top1 poisoning to elevate Top1cc levels phenocopied the neuropathology of the mouse models described above. Our results identify a critical endogenous pathogenic lesion associated with neurodegenerative syndromes arising from DNA repair deficiency, indicating that genome integrity is important for preventing disease in the nervous system.
MicroRNAs in genetic disease: rethinking the dosage.
Henrion-Caude, Alexandra; Girard, Muriel; Amiel, Jeanne
2012-08-01
To date, the general assumption was that most mutations interested protein-coding genes only. Thus, only few illustrations have mentioned here that mutations may occur in non-protein coding genes such as microRNAs (miRNAs). We thus report progress in delineating their contribution as phenotypic modulators, genetic switches and fine-tuners of gene expression. We reasoned that browsing their contribution to genetic disease may provide a framework for understanding the proper requirements to devise miRNA-based therapy strategies, in particular the relief of an appropriate dosage. Gain and loss of function of miRNA enforce the need to respectively antagonize or supply the miRNAs. We further categorized human disease according to the different ways in which the miRNA was altered arising either de novo, or inherited whether as a mendelian or as an epistatic trait, uncovering its role in epigenetics. We discuss how improving our knowledge on the contribution of miRNAs to genetic disease may be beneficial to devise appropriate gene therapy strategies.
Bershtein, Shimon; Serohijos, Adrian W.R.; Shakhnovich, Eugene I.
2016-01-01
Bridging the gap between the molecular properties of proteins and organismal/population fitness is essential for understanding evolutionary processes. This task requires the integration of the several physical scales of biological organization, each defined by a distinct set of mechanisms and constraints, into a single unifying model. The molecular scale is dominated by the constraints imposed by the physico-chemical properties of proteins and their substrates, which give rise to trade-offs and epistatic (non-additive) effects of mutations. At the systems scale, biological networks modulate protein expression and can either buffer or enhance the fitness effects of mutations. The population scale is influenced by the mutational input, selection regimes, and stochastic changes affecting the size and structure of populations, which eventually determine the evolutionary fate of mutations. Here, we summarize the recent advances in theory, computer simulations, and experiments that advance our understanding of the links between various physical scales in biology. PMID:27810574
Bershtein, Shimon; Serohijos, Adrian Wr; Shakhnovich, Eugene I
2017-02-01
Bridging the gap between the molecular properties of proteins and organismal/population fitness is essential for understanding evolutionary processes. This task requires the integration of the several physical scales of biological organization, each defined by a distinct set of mechanisms and constraints, into a single unifying model. The molecular scale is dominated by the constraints imposed by the physico-chemical properties of proteins and their substrates, which give rise to trade-offs and epistatic (non-additive) effects of mutations. At the systems scale, biological networks modulate protein expression and can either buffer or enhance the fitness effects of mutations. The population scale is influenced by the mutational input, selection regimes, and stochastic changes affecting the size and structure of populations, which eventually determine the evolutionary fate of mutations. Here, we summarize the recent advances in theory, computer simulations, and experiments that advance our understanding of the links between various physical scales in biology. Copyright © 2016 Elsevier Ltd. All rights reserved.
Teng, Weili; Li, Wen; Zhang, Qi; Wu, Depeng; Zhao, Xue; Li, Haiyan; Han, Yingpeng; Li, Wenbin
2017-08-01
The objective here was to identify QTL underlying soybean protein content (PC), and to evaluate the additive and epistatic effects of the QTLs. A mapping population, consisting of 129 recombinant inbred lines (RILs), was created by crossing 'Dongnong 46' and 'L-100'. Phenotypic data of the parents and RILs were collected for 4 years in three locations of Heilongjiang Province of China. A total of 213 SSR markers were used to construct a genetic linkage map. Eight QTLs, located on seven chromosomes (Chr), were identified to be associated with PC among the 10 tested environments. Of the seven QTLs, five QTLs, qPR-2 (Satt710, on Chr9), qPR-3 (Sat_122, on Chr12), qPR-5 (Satt543, on Chr17), qPR-7 (Satt163, on Chr18), and qPR-8 (Satt614, on Chr20), were detected in six, seven, seven, six, and seven environments, respectively, implying relatively stable QTLs. qPR-3 could explain 3.33%-11.26% of the phenotypic variation across eight tested environments. qPR-5 and qPR-8 explained 3.64%-10.1% and 11.86%-18.40% of the phenotypic variation, respectively, across seven tested environments. Eight QTLs associated with PC exhibited additive and (or) additive × environment interaction effects. The results showed that environment-independent QTLs often had higher additive effects. Moreover, five epistatic pairwise QTLs were identified in the 10 environments.
Predicting the difficulty of pure, strict, epistatic models: metrics for simulated model selection.
Urbanowicz, Ryan J; Kiralis, Jeff; Fisher, Jonathan M; Moore, Jason H
2012-09-26
Algorithms designed to detect complex genetic disease associations are initially evaluated using simulated datasets. Typical evaluations vary constraints that influence the correct detection of underlying models (i.e. number of loci, heritability, and minor allele frequency). Such studies neglect to account for model architecture (i.e. the unique specification and arrangement of penetrance values comprising the genetic model), which alone can influence the detectability of a model. In order to design a simulation study which efficiently takes architecture into account, a reliable metric is needed for model selection. We evaluate three metrics as predictors of relative model detection difficulty derived from previous works: (1) Penetrance table variance (PTV), (2) customized odds ratio (COR), and (3) our own Ease of Detection Measure (EDM), calculated from the penetrance values and respective genotype frequencies of each simulated genetic model. We evaluate the reliability of these metrics across three very different data search algorithms, each with the capacity to detect epistatic interactions. We find that a model's EDM and COR are each stronger predictors of model detection success than heritability. This study formally identifies and evaluates metrics which quantify model detection difficulty. We utilize these metrics to intelligently select models from a population of potential architectures. This allows for an improved simulation study design which accounts for differences in detection difficulty attributed to model architecture. We implement the calculation and utilization of EDM and COR into GAMETES, an algorithm which rapidly and precisely generates pure, strict, n-locus epistatic models.
Ma, Li; Runesha, H Birali; Dvorkin, Daniel; Garbe, John R; Da, Yang
2008-01-01
Background Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers provide opportunities to detect epistatic SNPs associated with quantitative traits and to detect the exact mode of an epistasis effect. Computational difficulty is the main bottleneck for epistasis testing in large scale GWAS. Results The EPISNPmpi and EPISNP computer programs were developed for testing single-locus and epistatic SNP effects on quantitative traits in GWAS, including tests of three single-locus effects for each SNP (SNP genotypic effect, additive and dominance effects) and five epistasis effects for each pair of SNPs (two-locus interaction, additive × additive, additive × dominance, dominance × additive, and dominance × dominance) based on the extended Kempthorne model. EPISNPmpi is the parallel computing program for epistasis testing in large scale GWAS and achieved excellent scalability for large scale analysis and portability for various parallel computing platforms. EPISNP is the serial computing program based on the EPISNPmpi code for epistasis testing in small scale GWAS using commonly available operating systems and computer hardware. Three serial computing utility programs were developed for graphical viewing of test results and epistasis networks, and for estimating CPU time and disk space requirements. Conclusion The EPISNPmpi parallel computing program provides an effective computing tool for epistasis testing in large scale GWAS, and the epiSNP serial computing programs are convenient tools for epistasis analysis in small scale GWAS using commonly available computer hardware. PMID:18644146
Emera, Deena; Wagner, Günter P.
2012-01-01
Transposable elements (TEs) are known to provide DNA for host regulatory functions, but the mechanisms underlying the transformation of TEs into cis-regulatory elements are unclear. In humans two TEs—MER20 and MER39—contribute the enhancer/promoter for decidual prolactin (dPRL), which is dramatically induced during pregnancy. We show that evolution of the strong human dPRL promoter was a multistep process that took millions of years. First, MER39 inserted near MER20 in the primate/rodent ancestor, and then there were two phases of activity enhancement in primates. Through the mapping of causal nucleotide substitutions, we demonstrate that strong promoter activity in apes involves epistasis between transcription factor binding sites (TFBSs) ancestral to MER39 and derived sites. We propose a mode of molecular evolution that describes the process by which MER20/MER39 was transformed into a strong promoter, called “epistatic capture.” Epistatic capture is the stabilization of a TFBS that is ancestral but variable in outgroup lineages, and is fixed in the ingroup because of epistatic interactions with derived TFBSs. Finally, we note that evolution of human promoter activity coincides with the emergence of a unique reproductive character in apes, highly invasive placentation. Because prolactin communicates with immune cells during pregnancy, which regulate fetal invasion into maternal tissues, we speculate that ape dPRL promoter activity evolved in response to increased invasiveness of ape fetal tissue. PMID:22733751
Ignácio, Z M; Réus, G Z; Abelaira, H M; Quevedo, J
2014-09-05
Epidemiological studies have shown significant results in the interaction between the functions of brain-derived neurotrophic factor (BDNF) and 5-HT in mood disorders, such as major depressive disorder (MDD). The latest research has provided convincing evidence that gene transcription of these molecules is a target for epigenetic changes, triggered by stressful stimuli that starts in early childhood and continues throughout life, which are subsequently translated into structural and functional phenotypes culminating in depressive disorders. The short variants of 5-HTTLPR and BDNF-Met are seen as forms which are predisposed to epigenetic aberrations, which leads individuals to a susceptibility to environmental adversities, especially when subjected to stress in early life. Moreover, the polymorphic variants also feature epistatic interactions in directing the functional mechanisms elicited by stress and underlying the onset of depressive disorders. Also emphasized are works which show some mediators between stress and epigenetic changes of the 5-HTT and BDNF genes, such as the hypothalamic-pituitary-adrenal (HPA) axis and the cAMP response element-binding protein (CREB), which is a cellular transcription factor. Both the HPA axis and CREB are also involved in epistatic interactions between polymorphic variants of 5-HTTLPR and Val66Met. This review highlights some research studying changes in the epigenetic patterns intrinsic to genes of 5-HTT and BDNF, which are related to lifelong environmental adversities, which in turn increases the risks of developing MDD. Copyright © 2014 IBRO. Published by Elsevier Ltd. All rights reserved.
Howard, Réka; Carriquiry, Alicia L.; Beavis, William D.
2014-01-01
Parametric and nonparametric methods have been developed for purposes of predicting phenotypes. These methods are based on retrospective analyses of empirical data consisting of genotypic and phenotypic scores. Recent reports have indicated that parametric methods are unable to predict phenotypes of traits with known epistatic genetic architectures. Herein, we review parametric methods including least squares regression, ridge regression, Bayesian ridge regression, least absolute shrinkage and selection operator (LASSO), Bayesian LASSO, best linear unbiased prediction (BLUP), Bayes A, Bayes B, Bayes C, and Bayes Cπ. We also review nonparametric methods including Nadaraya-Watson estimator, reproducing kernel Hilbert space, support vector machine regression, and neural networks. We assess the relative merits of these 14 methods in terms of accuracy and mean squared error (MSE) using simulated genetic architectures consisting of completely additive or two-way epistatic interactions in an F2 population derived from crosses of inbred lines. Each simulated genetic architecture explained either 30% or 70% of the phenotypic variability. The greatest impact on estimates of accuracy and MSE was due to genetic architecture. Parametric methods were unable to predict phenotypic values when the underlying genetic architecture was based entirely on epistasis. Parametric methods were slightly better than nonparametric methods for additive genetic architectures. Distinctions among parametric methods for additive genetic architectures were incremental. Heritability, i.e., proportion of phenotypic variability, had the second greatest impact on estimates of accuracy and MSE. PMID:24727289
Efficient Integrative Multi-SNP Association Analysis via Deterministic Approximation of Posteriors.
Wen, Xiaoquan; Lee, Yeji; Luca, Francesca; Pique-Regi, Roger
2016-06-02
With the increasing availability of functional genomic data, incorporating genomic annotations into genetic association analysis has become a standard procedure. However, the existing methods often lack rigor and/or computational efficiency and consequently do not maximize the utility of functional annotations. In this paper, we propose a rigorous inference procedure to perform integrative association analysis incorporating genomic annotations for both traditional GWASs and emerging molecular QTL mapping studies. In particular, we propose an algorithm, named deterministic approximation of posteriors (DAP), which enables highly efficient and accurate joint enrichment analysis and identification of multiple causal variants. We use a series of simulation studies to highlight the power and computational efficiency of our proposed approach and further demonstrate it by analyzing the cross-population eQTL data from the GEUVADIS project and the multi-tissue eQTL data from the GTEx project. In particular, we find that genetic variants predicted to disrupt transcription factor binding sites are enriched in cis-eQTLs across all tissues. Moreover, the enrichment estimates obtained across the tissues are correlated with the cell types for which the annotations are derived. Copyright © 2016 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
The Influence of Higher-Order Epistasis on Biological Fitness Landscape Topography
NASA Astrophysics Data System (ADS)
Weinreich, Daniel M.; Lan, Yinghong; Jaffe, Jacob; Heckendorn, Robert B.
2018-07-01
The effect of a mutation on the organism often depends on what other mutations are already present in its genome. Geneticists refer to such mutational interactions as epistasis. Pairwise epistatic effects have been recognized for over a century, and their evolutionary implications have received theoretical attention for nearly as long. However, pairwise epistatic interactions themselves can vary with genomic background. This is called higher-order epistasis, and its consequences for evolution are much less well understood. Here, we assess the influence that higher-order epistasis has on the topography of 16 published, biological fitness landscapes. We find that on average, their effects on fitness landscape declines with order, and suggest that notable exceptions to this trend may deserve experimental scrutiny. We conclude by highlighting opportunities for further theoretical and experimental work dissecting the influence that epistasis of all orders has on fitness landscape topography and on the efficiency of evolution by natural selection.
The Influence of Higher-Order Epistasis on Biological Fitness Landscape Topography
NASA Astrophysics Data System (ADS)
Weinreich, Daniel M.; Lan, Yinghong; Jaffe, Jacob; Heckendorn, Robert B.
2018-02-01
The effect of a mutation on the organism often depends on what other mutations are already present in its genome. Geneticists refer to such mutational interactions as epistasis. Pairwise epistatic effects have been recognized for over a century, and their evolutionary implications have received theoretical attention for nearly as long. However, pairwise epistatic interactions themselves can vary with genomic background. This is called higher-order epistasis, and its consequences for evolution are much less well understood. Here, we assess the influence that higher-order epistasis has on the topography of 16 published, biological fitness landscapes. We find that on average, their effects on fitness landscape declines with order, and suggest that notable exceptions to this trend may deserve experimental scrutiny. We conclude by highlighting opportunities for further theoretical and experimental work dissecting the influence that epistasis of all orders has on fitness landscape topography and on the efficiency of evolution by natural selection.
Feltus, F Alex
2014-06-01
Understanding the control of any trait optimally requires the detection of causal genes, gene interaction, and mechanism of action to discover and model the biochemical pathways underlying the expressed phenotype. Functional genomics techniques, including RNA expression profiling via microarray and high-throughput DNA sequencing, allow for the precise genome localization of biological information. Powerful genetic approaches, including quantitative trait locus (QTL) and genome-wide association study mapping, link phenotype with genome positions, yet genetics is less precise in localizing the relevant mechanistic information encoded in DNA. The coupling of salient functional genomic signals with genetically mapped positions is an appealing approach to discover meaningful gene-phenotype relationships. Techniques used to define this genetic-genomic convergence comprise the field of systems genetics. This short review will address an application of systems genetics where RNA profiles are associated with genetically mapped genome positions of individual genes (eQTL mapping) or as gene sets (co-expression network modules). Both approaches can be applied for knowledge independent selection of candidate genes (and possible control mechanisms) underlying complex traits where multiple, likely unlinked, genomic regions might control specific complex traits. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Natural sequence variants of yeast environmental sensors confer cell-to-cell expression variability
Fehrmann, Steffen; Bottin-Duplus, Hélène; Leonidou, Andri; Mollereau, Esther; Barthelaix, Audrey; Wei, Wu; Steinmetz, Lars M; Yvert, Gaël
2013-01-01
Living systems may have evolved probabilistic bet hedging strategies that generate cell-to-cell phenotypic diversity in anticipation of environmental catastrophes, as opposed to adaptation via a deterministic response to environmental changes. Evolution of bet hedging assumes that genotypes segregating in natural populations modulate the level of intraclonal diversity, which so far has largely remained hypothetical. Using a fluorescent Pmet17-GFP reporter, we mapped four genetic loci conferring to a wild yeast strain an elevated cell-to-cell variability in the expression of MET17, a gene regulated by the methionine pathway. A frameshift mutation in the Erc1p transmembrane transporter, probably resulting from a release of laboratory strains from negative selection, reduced Pmet17-GFP expression variability. At a second locus, cis-regulatory polymorphisms increased mean expression of the Mup1p methionine permease, causing increased expression variability in trans. These results demonstrate that an expression quantitative trait locus (eQTL) can simultaneously have a deterministic effect in cis and a probabilistic effect in trans. Our observations indicate that the evolution of transmembrane transporter genes can tune intraclonal variation and may therefore be implicated in both reactive and anticipatory strategies of adaptation. PMID:24104478
Beyond the single gene: How epistasis and gene-by-environment effects influence crop domestication.
Doust, Andrew N; Lukens, Lewis; Olsen, Kenneth M; Mauro-Herrera, Margarita; Meyer, Ann; Rogers, Kimberly
2014-04-29
Domestication is a multifaceted evolutionary process, involving changes in individual genes, genetic interactions, and emergent phenotypes. There has been extensive discussion of the phenotypic characteristics of plant domestication, and recent research has started to identify the specific genes and mutational mechanisms that control domestication traits. However, there is an apparent disconnect between the simple genetic architecture described for many crop domestication traits, which should facilitate rapid phenotypic change under selection, and the slow rate of change reported from the archeobotanical record. A possible explanation involves the middle ground between individual genetic changes and their expression during development, where gene-by-gene (epistatic) and gene-by-environment interactions can modify the expression of phenotypes and opportunities for selection. These aspects of genetic architecture have the potential to significantly slow the speed of phenotypic evolution during crop domestication and improvement. Here we examine whether epistatic and gene-by-environment interactions have shaped how domestication traits have evolved. We review available evidence from the literature, and we analyze two domestication-related traits, shattering and flowering time, in a mapping population derived from a cross between domesticated foxtail millet and its wild progenitor. We find that compared with wild progenitor alleles, those favored during domestication often have large phenotypic effects and are relatively insensitive to genetic background and environmental effects. Consistent selection should thus be able to rapidly change traits during domestication. We conclude that if phenotypic evolution was slow during crop domestication, this is more likely due to cultural or historical factors than epistatic or environmental constraints.
The genetic architecture of susceptibility to parasites.
Wilfert, Lena; Schmid-Hempel, Paul
2008-06-30
The antagonistic co-evolution of hosts and their parasites is considered to be a potential driving force in maintaining host genetic variation including sexual reproduction and recombination. The examination of this hypothesis calls for information about the genetic basis of host-parasite interactions - such as how many genes are involved, how big an effect these genes have and whether there is epistasis between loci. We here examine the genetic architecture of quantitative resistance in animal and plant hosts by concatenating published studies that have identified quantitative trait loci (QTL) for host resistance in animals and plants. Collectively, these studies show that host resistance is affected by few loci. We particularly show that additional epistatic interactions, especially between loci on different chromosomes, explain a majority of the effects. Furthermore, we find that when experiments are repeated using different host or parasite genotypes under otherwise identical conditions, the underlying genetic architecture of host resistance can vary dramatically - that is, involves different QTLs and epistatic interactions. QTLs and epistatic loci vary much less when host and parasite types remain the same but experiments are repeated in different environments. This pattern of variability of the genetic architecture is predicted by strong interactions between genotypes and corroborates the prevalence of varying host-parasite combinations over varying environmental conditions. Moreover, epistasis is a major determinant of phenotypic variance for host resistance. Because epistasis seems to occur predominantly between, rather than within, chromosomes, segregation and chromosome number rather than recombination via cross-over should be the major elements affecting adaptive change in host resistance.
Collective feature selection to identify crucial epistatic variants.
Verma, Shefali S; Lucas, Anastasia; Zhang, Xinyuan; Veturi, Yogasudha; Dudek, Scott; Li, Binglan; Li, Ruowang; Urbanowicz, Ryan; Moore, Jason H; Kim, Dokyoon; Ritchie, Marylyn D
2018-01-01
Machine learning methods have gained popularity and practicality in identifying linear and non-linear effects of variants associated with complex disease/traits. Detection of epistatic interactions still remains a challenge due to the large number of features and relatively small sample size as input, thus leading to the so-called "short fat data" problem. The efficiency of machine learning methods can be increased by limiting the number of input features. Thus, it is very important to perform variable selection before searching for epistasis. Many methods have been evaluated and proposed to perform feature selection, but no single method works best in all scenarios. We demonstrate this by conducting two separate simulation analyses to evaluate the proposed collective feature selection approach. Through our simulation study we propose a collective feature selection approach to select features that are in the "union" of the best performing methods. We explored various parametric, non-parametric, and data mining approaches to perform feature selection. We choose our top performing methods to select the union of the resulting variables based on a user-defined percentage of variants selected from each method to take to downstream analysis. Our simulation analysis shows that non-parametric data mining approaches, such as MDR, may work best under one simulation criteria for the high effect size (penetrance) datasets, while non-parametric methods designed for feature selection, such as Ranger and Gradient boosting, work best under other simulation criteria. Thus, using a collective approach proves to be more beneficial for selecting variables with epistatic effects also in low effect size datasets and different genetic architectures. Following this, we applied our proposed collective feature selection approach to select the top 1% of variables to identify potential interacting variables associated with Body Mass Index (BMI) in ~ 44,000 samples obtained from Geisinger's MyCode Community Health Initiative (on behalf of DiscovEHR collaboration). In this study, we were able to show that selecting variables using a collective feature selection approach could help in selecting true positive epistatic variables more frequently than applying any single method for feature selection via simulation studies. We were able to demonstrate the effectiveness of collective feature selection along with a comparison of many methods in our simulation analysis. We also applied our method to identify non-linear networks associated with obesity.
Le Bihan-Duval, Elisabeth; Nadaf, Javad; Berri, Cécile; Pitel, Frédérique; Graulet, Benoît; Godet, Estelle; Leroux, Sophie Y.; Demeure, Olivier; Lagarrigue, Sandrine; Duby, Cécile; Cogburn, Larry A.; Beaumont, Catherine M.; Duclos, Michel J.
2011-01-01
Classical quantitative trait loci (QTL) analysis and gene expression QTL (eQTL) were combined to identify the causal gene (or QTG) underlying a highly significant QTL controlling the variation of breast meat color in a F2 cross between divergent high-growth (HG) and low-growth (LG) chicken lines. Within this meat quality QTL, BCMO1 (Accession number GenBank: AJ271386), encoding the β-carotene 15, 15′-monooxygenase, a key enzyme in the conversion of β-carotene into colorless retinal, was a good functional candidate. Analysis of the abundance of BCMO1 mRNA in breast muscle of the HG x LG F2 population allowed for the identification of a strong cis eQTL. Moreover, reevaluation of the color QTL taking BCMO1 mRNA levels as a covariate indicated that BCMO1 mRNA levels entirely explained the variations in meat color. Two fully-linked single nucleotide polymorphisms (SNP) located within the proximal promoter of BCMO1 gene were identified. Haplotype substitution resulted in a marked difference in BCMO1 promoter activity in vitro. The association study in the F2 population revealed a three-fold difference in BCMO1 expression leading to a difference of 1 standard deviation in yellow color between the homozygous birds at this haplotype. This difference in meat yellow color was fully consistent with the difference in carotenoid content (i.e. lutein and zeaxanthin) evidenced between the two alternative haplotypes. A significant association between the haplotype, the level of BCMO1 expression and the yellow color of the meat was also recovered in an unrelated commercial broiler population. The mutation could be of economic importance for poultry production by making possible a gene-assisted selection for color, a determining aspect of meat quality. Moreover, this natural genetic diversity constitutes a new model for the study of β-carotene metabolism which may act upon diverse biological processes as precursor of the vitamin A. PMID:21750696
Oparina, Nina Y; Delgado-Vega, Angelica M; Martinez-Bueno, Manuel; Magro-Checa, César; Fernández, Concepción; Castro, Rafaela Ortega; Pons-Estel, Bernardo A; D'Alfonso, Sandra; Sebastiani, Gian Domenico; Witte, Torsten; Lauwerys, Bernard R; Endreffy, Emoke; Kovács, László; Escudero, Alejandro; López-Pedrera, Chary; Vasconcelos, Carlos; da Silva, Berta Martins; Frostegård, Johan; Truedsson, Lennart; Martin, Javier; Raya, Enrique; Ortego-Centeno, Norberto; de Los Angeles Aguirre, Maria; de Ramón Garrido, Enrique; Palma, María-Jesús Castillo; Alarcon-Riquelme, Marta E; Kozyrev, Sergey V
2015-03-01
To perform fine mapping of the PXK locus associated with systemic lupus erythematosus (SLE) and study functional effects that lead to susceptibility to the disease. Linkage disequilibrium (LD) mapping was conducted by using 1251 SNPs (single nucleotide polymorphism) covering a 862 kb genomic region on 3p14.3 comprising the PXK locus in 1467 SLE patients and 2377 controls of European origin. Tag SNPs and genotypes imputed with IMPUTE2 were tested for association by using SNPTEST and PLINK. The expression QTLs data included three independent datasets for lymphoblastoid cells of European donors: HapMap3, MuTHER and the cross-platform eQTL catalogue. Correlation analysis of eQTLs was performed using Vassarstats. Alternative splicing for the PXK gene was analysed on mRNA from PBMCs. Fine mapping revealed long-range LD (>200 kb) extended over the ABHD6, RPP14, PXK, and PDHB genes on 3p14.3. The highly correlated variants tagged an SLE-associated haplotype that was less frequent in the patients compared with the controls (OR=0.89, p=0.00684). A robust correlation between the association with SLE and enhanced expression of ABHD6 gene was revealed, while neither expression, nor splicing alterations associated with SLE susceptibility were detected for PXK. The SNP allele frequencies as well as eQTL pattern analysed in the CEU and CHB HapMap3 populations indicate that the SLE association and the effect on ABHD6 expression are specific to Europeans. These results confirm the genetic association of the locus 3p14.3 with SLE in Europeans and point to the ABHD6 and not PXK, as the major susceptibility gene in the region. We suggest a pathogenic mechanism mediated by the upregulation of ABHD6 in individuals carrying the SLE-risk variants. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Branched-chain amino acid catabolism is a conserved regulator of physiological ageing.
Mansfeld, Johannes; Urban, Nadine; Priebe, Steffen; Groth, Marco; Frahm, Christiane; Hartmann, Nils; Gebauer, Juliane; Ravichandran, Meenakshi; Dommaschk, Anne; Schmeisser, Sebastian; Kuhlow, Doreen; Monajembashi, Shamci; Bremer-Streck, Sibylle; Hemmerich, Peter; Kiehntopf, Michael; Zamboni, Nicola; Englert, Christoph; Guthke, Reinhard; Kaleta, Christoph; Platzer, Matthias; Sühnel, Jürgen; Witte, Otto W; Zarse, Kim; Ristow, Michael
2015-12-01
Ageing has been defined as a global decline in physiological function depending on both environmental and genetic factors. Here we identify gene transcripts that are similarly regulated during physiological ageing in nematodes, zebrafish and mice. We observe the strongest extension of lifespan when impairing expression of the branched-chain amino acid transferase-1 (bcat-1) gene in C. elegans, which leads to excessive levels of branched-chain amino acids (BCAAs). We further show that BCAAs reduce a LET-363/mTOR-dependent neuro-endocrine signal, which we identify as DAF-7/TGFβ, and that impacts lifespan depending on its related receptors, DAF-1 and DAF-4, as well as ultimately on DAF-16/FoxO and HSF-1 in a cell-non-autonomous manner. The transcription factor HLH-15 controls and epistatically synergizes with BCAT-1 to modulate physiological ageing. Lastly and consistent with previous findings in rodents, nutritional supplementation of BCAAs extends nematodal lifespan. Taken together, BCAAs act as periphery-derived metabokines that induce a central neuro-endocrine response, culminating in extended healthspan.
Branched-chain amino acid catabolism is a conserved regulator of physiological ageing
Mansfeld, Johannes; Urban, Nadine; Priebe, Steffen; Groth, Marco; Frahm, Christiane; Hartmann, Nils; Gebauer, Juliane; Ravichandran, Meenakshi; Dommaschk, Anne; Schmeisser, Sebastian; Kuhlow, Doreen; Monajembashi, Shamci; Bremer-Streck, Sibylle; Hemmerich, Peter; Kiehntopf, Michael; Zamboni, Nicola; Englert, Christoph; Guthke, Reinhard; Kaleta, Christoph; Platzer, Matthias; Sühnel, Jürgen; Witte, Otto W.; Zarse, Kim; Ristow, Michael
2015-01-01
Ageing has been defined as a global decline in physiological function depending on both environmental and genetic factors. Here we identify gene transcripts that are similarly regulated during physiological ageing in nematodes, zebrafish and mice. We observe the strongest extension of lifespan when impairing expression of the branched-chain amino acid transferase-1 (bcat-1) gene in C. elegans, which leads to excessive levels of branched-chain amino acids (BCAAs). We further show that BCAAs reduce a LET-363/mTOR-dependent neuro-endocrine signal, which we identify as DAF-7/TGFβ, and that impacts lifespan depending on its related receptors, DAF-1 and DAF-4, as well as ultimately on DAF-16/FoxO and HSF-1 in a cell-non-autonomous manner. The transcription factor HLH-15 controls and epistatically synergizes with BCAT-1 to modulate physiological ageing. Lastly and consistent with previous findings in rodents, nutritional supplementation of BCAAs extends nematodal lifespan. Taken together, BCAAs act as periphery-derived metabokines that induce a central neuro-endocrine response, culminating in extended healthspan. PMID:26620638
Astudillo, Pablo; Carrasco, Héctor; Larraín, Juan
2014-01-01
Regulation of Wnt signaling is crucial for embryonic development and adult homeostasis. Here we study the role of Syndecan-4 (SDC4), a cell-surface heparan sulphate proteoglycan, and Fibronectin (FN), in Wnt/β-catenin signaling. Gain- and loss-of-function experiments in mammalian cell lines and Xenopus embryos demonstrate that SDC4 and FN inhibit Wnt/β-catenin signaling. Epistatic and biochemical experiments show that this inhibition occurs at the cell membrane level through regulation of LRP6. R-spondin 3, a ligand that promotes canonical and non-canonical Wnt signaling, is more prone to potentiate Wnt/β-catenin signaling when SDC4 levels are reduced, suggesting a model whereby SDC4 tunes the ability of R-spondin to modulate the different Wnt signaling pathways. Since SDC4 has been previously related to non-canonical Wnt signaling, our results also suggest that this proteoglycan can be a key component in the regulation of Wnt signaling. Copyright © 2013 Elsevier Ltd. All rights reserved.
Kim, Ju Young; Liu, Cindy Y; Zhang, Fengyu; Duan, Xin; Wen, Zhexing; Song, Juan; Feighery, Emer; Lu, Bai; Rujescu, Dan; St Clair, David; Christian, Kimberly; Callicott, Joseph H; Weinberger, Daniel R; Song, Hongjun; Ming, Guo-li
2012-03-02
How extrinsic stimuli and intrinsic factors interact to regulate continuous neurogenesis in the postnatal mammalian brain is unknown. Here we show that regulation of dendritic development of newborn neurons by Disrupted-in-Schizophrenia 1 (DISC1) during adult hippocampal neurogenesis requires neurotransmitter GABA-induced, NKCC1-dependent depolarization through a convergence onto the AKT-mTOR pathway. In contrast, DISC1 fails to modulate early-postnatal hippocampal neurogenesis when conversion of GABA-induced depolarization to hyperpolarization is accelerated. Extending the period of GABA-induced depolarization or maternal deprivation stress restores DISC1-dependent dendritic regulation through mTOR pathway during early-postnatal hippocampal neurogenesis. Furthermore, DISC1 and NKCC1 interact epistatically to affect risk for schizophrenia in two independent case control studies. Our study uncovers an interplay between intrinsic DISC1 and extrinsic GABA signaling, two schizophrenia susceptibility pathways, in controlling neurogenesis and suggests critical roles of developmental tempo and experience in manifesting the impact of susceptibility genes on neuronal development and risk for mental disorders. Copyright © 2012 Elsevier Inc. All rights reserved.
An empirical comparison of several recent epistatic interaction detection methods.
Wang, Yue; Liu, Guimei; Feng, Mengling; Wong, Limsoon
2011-11-01
Many new methods have recently been proposed for detecting epistatic interactions in GWAS data. There is, however, no in-depth independent comparison of these methods yet. Five recent methods-TEAM, BOOST, SNPHarvester, SNPRuler and Screen and Clean (SC)-are evaluated here in terms of power, type-1 error rate, scalability and completeness. In terms of power, TEAM performs best on data with main effect and BOOST performs best on data without main effect. In terms of type-1 error rate, TEAM and BOOST have higher type-1 error rates than SNPRuler and SNPHarvester. SC does not control type-1 error rate well. In terms of scalability, we tested the five methods using a dataset with 100 000 SNPs on a 64 bit Ubuntu system, with Intel (R) Xeon(R) CPU 2.66 GHz, 16 GB memory. TEAM takes ~36 days to finish and SNPRuler reports heap allocation problems. BOOST scales up to 100 000 SNPs and the cost is much lower than that of TEAM. SC and SNPHarvester are the most scalable. In terms of completeness, we study how frequently the pruning techniques employed by these methods incorrectly prune away the most significant epistatic interactions. We find that, on average, 20% of datasets without main effect and 60% of datasets with main effect are pruned incorrectly by BOOST, SNPRuler and SNPHarvester. The software for the five methods tested are available from the URLs below. TEAM: http://csbio.unc.edu/epistasis/download.php BOOST: http://ihome.ust.hk/~eeyang/papers.html. SNPHarvester: http://bioinformatics.ust.hk/SNPHarvester.html. SNPRuler: http://bioinformatics.ust.hk/SNPRuler.zip. Screen and Clean: http://wpicr.wpic.pitt.edu/WPICCompGen/. wangyue@nus.edu.sg.
Genome-wide analysis of epistasis in body mass index using multiple human populations.
Wei, Wen-Hua; Hemani, Gib; Gyenesei, Attila; Vitart, Veronique; Navarro, Pau; Hayward, Caroline; Cabrera, Claudia P; Huffman, Jennifer E; Knott, Sara A; Hicks, Andrew A; Rudan, Igor; Pramstaller, Peter P; Wild, Sarah H; Wilson, James F; Campbell, Harry; Hastie, Nicholas D; Wright, Alan F; Haley, Chris S
2012-08-01
We surveyed gene-gene interactions (epistasis) in human body mass index (BMI) in four European populations (n<1200) via exhaustive pair-wise genome scans where interactions were computed as F ratios by testing a linear regression model fitting two single-nucleotide polymorphisms (SNPs) with interactions against the one without. Before the association tests, BMI was corrected for sex and age, normalised and adjusted for relatedness. Neither single SNPs nor SNP interactions were genome-wide significant in either cohort based on the consensus threshold (P=5.0E-08) and a Bonferroni corrected threshold (P=1.1E-12), respectively. Next we compared sub genome-wide significant SNP interactions (P<5.0E-08) across cohorts to identify common epistatic signals, where SNPs were annotated to genes to test for gene ontology (GO) enrichment. Among the epistatic genes contributing to the commonly enriched GO terms, 19 were shared across study cohorts of which 15 are previously published genome-wide association loci, including CDH13 (cadherin 13) associated with height and SORCS2 (sortilin-related VPS10 domain containing receptor 2) associated with circulating insulin-like growth factor 1 and binding protein 3. Interactions between the 19 shared epistatic genes and those involving BMI candidate loci (P<5.0E-08) were tested across cohorts and found eight replicated at the SNP level (P<0.05) in at least one cohort, which were further tested and showed limited replication in a separate European population (n>5000). We conclude that genome-wide analysis of epistasis in multiple populations is an effective approach to provide new insights into the genetic regulation of BMI but requires additional efforts to confirm the findings.
Videau, Patrick; Rivers, Orion S; Hurd, Kathryn; Ushijima, Blake; Oshiro, Reid T; Ende, Rachel J; O'Hanlon, Samantha M; Cozy, Loralyn M
2016-10-24
The commitment of differentiating cells to a specialized fate is fundamental to the correct assembly of tissues within a multicellular organism. Because commitment is often irreversible, entry into and progression through this phase of development must be tightly regulated. Under nitrogen-limiting conditions, the multicellular cyanobacterium Anabaena sp. strain PCC 7120 terminally commits ∼10% of its cells to become specialized nitrogen-fixing heterocysts. Although commitment is known to occur 9-14 h after the induction of differentiation, the factors that regulate the initiation and duration of this phase have yet to be elucidated. Here, we report the identification of four genes that share a functional domain and modulate heterocyst commitment: hetP (alr2818), asl1930, alr2902, and alr3234 Epistatic relationships between all four genes relating to commitment were revealed by deleting them individually and in combination; asl1930 and alr3234 acted most upstream to delay commitment, alr2902 acted next in the pathway to inhibit development, and hetP acted most downstream to drive commitment forward. Possible protein-protein interactions between HetP, its homologs, and the heterocyst master regulator, HetR, were assessed, and interaction partners were defined. Finally, patterns of gene expression for each homolog, as determined by promoter fusions to gfp and reverse transcription-quantitative PCR, were distinct from that of hetP in both spatiotemporal organization and regulation. We posit that a dynamic succession of protein-protein interactions modulates the timing and efficiency of the commitment phase of development and note that this work highlights the utility of a multicellular cyanobacterium as a model for the study of developmental processes.
Delprato, A; Algéo, M-P; Bonheur, B; Bubier, J A; Lu, L; Williams, R W; Chesler, E J; Crusio, W E
2017-11-01
The open field is a classic test used to assess exploratory behavior, anxiety and locomotor activity in rodents. Here, we mapped quantitative trait loci (QTLs) underlying behaviors displayed in an open field, using a panel of 53 BXD recombinant inbred mouse strains with deep replication (10 per strain and sex). The use of these strains permits the integration and comparison of data obtained in different laboratories, and also offers the possibility to study trait covariance by exploiting powerful bioinformatics tools and resources. We quantified behavioral traits during 20-min test sessions including (1) percent time spent and distance traveled near the wall (thigmotaxis), (2) leaning against the wall, (3) rearing, (4) jumping, (5) grooming duration, (6) grooming frequency, (7) locomotion and (8) defecation. All traits exhibit moderate heritability making them amenable to genetic analysis. We identified a significant QTL on chromosome M.m. 4 at approximately 104 Mb that modulates grooming duration in both males and females (likelihood ratio statistic values of approximately 18, explaining 25% and 14% of the variance, respectively) and a suggestive QTL modulating locomotion that maps to the same locus. Bioinformatic analysis indicates Disabled 1 (Dab1, a key protein in the reelin signaling pathway) as a particularly strong candidate gene modulating these behaviors. We also found 2 highly suggestive QTLs for a sex by strain interaction for grooming duration on chromosomes 13 and 17. In addition, we identified a pairwise epistatic interaction between loci on chromosomes 12 at 36-37 Mb and 14 at 34-36 Mb that influences rearing frequency in males. © 2017 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Pritchard, Victoria L; Viitaniemi, Heidi M; McCairns, R J Scott; Merilä, Juha; Nikinmaa, Mikko; Primmer, Craig R; Leder, Erica H
2017-01-05
Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus), an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL) underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats. Copyright © 2017 Pritchard et al.
Genome-wide significant risk associations for mucinous ovarian carcinoma
Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew
2015-01-01
Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790
WormQTLHD—a web database for linking human disease to natural variation data in C. elegans
van der Velde, K. Joeri; de Haan, Mark; Zych, Konrad; Arends, Danny; Snoek, L. Basten; Kammenga, Jan E.; Jansen, Ritsert C.; Swertz, Morris A.; Li, Yang
2014-01-01
Interactions between proteins are highly conserved across species. As a result, the molecular basis of multiple diseases affecting humans can be studied in model organisms that offer many alternative experimental opportunities. One such organism—Caenorhabditis elegans—has been used to produce much molecular quantitative genetics and systems biology data over the past decade. We present WormQTLHD (Human Disease), a database that quantitatively and systematically links expression Quantitative Trait Loci (eQTL) findings in C. elegans to gene–disease associations in man. WormQTLHD, available online at http://www.wormqtl-hd.org, is a user-friendly set of tools to reveal functionally coherent, evolutionary conserved gene networks. These can be used to predict novel gene-to-gene associations and the functions of genes underlying the disease of interest. We created a new database that links C. elegans eQTL data sets to human diseases (34 337 gene–disease associations from OMIM, DGA, GWAS Central and NHGRI GWAS Catalogue) based on overlapping sets of orthologous genes associated to phenotypes in these two species. We utilized QTL results, high-throughput molecular phenotypes, classical phenotypes and genotype data covering different developmental stages and environments from WormQTL database. All software is available as open source, built on MOLGENIS and xQTL workbench. PMID:24217915
WormQTLHD--a web database for linking human disease to natural variation data in C. elegans.
van der Velde, K Joeri; de Haan, Mark; Zych, Konrad; Arends, Danny; Snoek, L Basten; Kammenga, Jan E; Jansen, Ritsert C; Swertz, Morris A; Li, Yang
2014-01-01
Interactions between proteins are highly conserved across species. As a result, the molecular basis of multiple diseases affecting humans can be studied in model organisms that offer many alternative experimental opportunities. One such organism-Caenorhabditis elegans-has been used to produce much molecular quantitative genetics and systems biology data over the past decade. We present WormQTL(HD) (Human Disease), a database that quantitatively and systematically links expression Quantitative Trait Loci (eQTL) findings in C. elegans to gene-disease associations in man. WormQTL(HD), available online at http://www.wormqtl-hd.org, is a user-friendly set of tools to reveal functionally coherent, evolutionary conserved gene networks. These can be used to predict novel gene-to-gene associations and the functions of genes underlying the disease of interest. We created a new database that links C. elegans eQTL data sets to human diseases (34 337 gene-disease associations from OMIM, DGA, GWAS Central and NHGRI GWAS Catalogue) based on overlapping sets of orthologous genes associated to phenotypes in these two species. We utilized QTL results, high-throughput molecular phenotypes, classical phenotypes and genotype data covering different developmental stages and environments from WormQTL database. All software is available as open source, built on MOLGENIS and xQTL workbench.
The complex genetics of gait speed: genome-wide meta-analysis approach
Lunetta, Kathryn L.; Smith, Jennifer A.; Eicher, John D.; Vered, Rotem; Deelen, Joris; Arnold, Alice M.; Buchman, Aron S.; Tanaka, Toshiko; Faul, Jessica D.; Nethander, Maria; Fornage, Myriam; Adams, Hieab H.; Matteini, Amy M.; Callisaya, Michele L.; Smith, Albert V.; Yu, Lei; De Jager, Philip L.; Evans, Denis A.; Gudnason, Vilmundur; Hofman, Albert; Pattie, Alison; Corley, Janie; Launer, Lenore J.; Knopman, Davis S.; Parimi, Neeta; Turner, Stephen T.; Bandinelli, Stefania; Beekman, Marian; Gutman, Danielle; Sharvit, Lital; Mooijaart, Simon P.; Liewald, David C.; Houwing-Duistermaat, Jeanine J.; Ohlsson, Claes; Moed, Matthijs; Verlinden, Vincent J.; Mellström, Dan; van der Geest, Jos N.; Karlsson, Magnus; Hernandez, Dena; McWhirter, Rebekah; Liu, Yongmei; Thomson, Russell; Tranah, Gregory J.; Uitterlinden, Andre G.; Weir, David R.; Zhao, Wei; Starr, John M.; Johnson, Andrew D.; Ikram, M. Arfan; Bennett, David A.; Cummings, Steven R.; Deary, Ian J.; Harris, Tamara B.; Kardia, Sharon L. R.; Mosley, Thomas H.; Srikanth, Velandai K.; Windham, Beverly G.; Newman, Ann B.; Walston, Jeremy D.; Davies, Gail; Evans, Daniel S.; Slagboom, Eline P.; Ferrucci, Luigi; Kiel, Douglas P.; Murabito, Joanne M.; Atzmon, Gil
2017-01-01
Emerging evidence suggests that the basis for variation in late-life mobility is attributable, in part, to genetic factors, which may become increasingly important with age. Our objective was to systematically assess the contribution of genetic variation to gait speed in older individuals. We conducted a meta-analysis of gait speed GWASs in 31,478 older adults from 17 cohorts of the CHARGE consortium, and validated our results in 2,588 older adults from 4 independent studies. We followed our initial discoveries with network and eQTL analysis of candidate signals in tissues. The meta-analysis resulted in a list of 536 suggestive genome wide significant SNPs in or near 69 genes. Further interrogation with Pathway Analysis placed gait speed as a polygenic complex trait in five major networks. Subsequent eQTL analysis revealed several SNPs significantly associated with the expression of PRSS16, WDSUB1 and PTPRT, which in addition to the meta-analysis and pathway suggested that genetic effects on gait speed may occur through synaptic function and neuronal development pathways. No genome-wide significant signals for gait speed were identified from this moderately large sample of older adults, suggesting that more refined physical function phenotypes will be needed to identify the genetic basis of gait speed in aging. PMID:28077804
Pritchard, Victoria L.; Viitaniemi, Heidi M.; McCairns, R. J. Scott; Merilä, Juha; Nikinmaa, Mikko; Primmer, Craig R.; Leder, Erica H.
2016-01-01
Much adaptive evolutionary change is underlain by mutational variation in regions of the genome that regulate gene expression rather than in the coding regions of the genes themselves. An understanding of the role of gene expression variation in facilitating local adaptation will be aided by an understanding of underlying regulatory networks. Here, we characterize the genetic architecture of gene expression variation in the threespine stickleback (Gasterosteus aculeatus), an important model in the study of adaptive evolution. We collected transcriptomic and genomic data from 60 half-sib families using an expression microarray and genotyping-by-sequencing, and located expression quantitative trait loci (eQTL) underlying the variation in gene expression in liver tissue using an interval mapping approach. We identified eQTL for several thousand expression traits. Expression was influenced by polymorphism in both cis- and trans-regulatory regions. Trans-eQTL clustered into hotspots. We did not identify master transcriptional regulators in hotspot locations: rather, the presence of hotspots may be driven by complex interactions between multiple transcription factors. One observed hotspot colocated with a QTL recently found to underlie salinity tolerance in the threespine stickleback. However, most other observed hotspots did not colocate with regions of the genome known to be involved in adaptive divergence between marine and freshwater habitats. PMID:27836907
Maize transgenes containing zein promoters are regulated by opaque2
USDA-ARS?s Scientific Manuscript database
Transgenes have great potential in crop improvement, but relatively little is known about the epistatic interaction of transgenes with the native genes in the genome. Understanding these interactions is critical for predicting the response of transgenes to different genetic backgrounds and environm...
HU, TING; DARABOS, CHRISTIAN; CRICCO, MARIA E.; KONG, EMILY; MOORE, JASON H.
2014-01-01
The large volume of GWAS data poses great computational challenges for analyzing genetic interactions associated with common human diseases. We propose a computational framework for characterizing epistatic interactions among large sets of genetic attributes in GWAS data. We build the human phenotype network (HPN) and focus around a disease of interest. In this study, we use the GLAUGEN glaucoma GWAS dataset and apply the HPN as a biological knowledge-based filter to prioritize genetic variants. Then, we use the statistical epistasis network (SEN) to identify a significant connected network of pairwise epistatic interactions among the prioritized SNPs. These clearly highlight the complex genetic basis of glaucoma. Furthermore, we identify key SNPs by quantifying structural network characteristics. Through functional annotation of these key SNPs using Biofilter, a software accessing multiple publicly available human genetic data sources, we find supporting biomedical evidences linking glaucoma to an array of genetic diseases, proving our concept. We conclude by suggesting hypotheses for a better understanding of the disease. PMID:25592582
Systems Genetic Analysis of Osteoblast-Lineage Cells
Calabrese, Gina; Bennett, Brian J.; Orozco, Luz; Kang, Hyun M.; Eskin, Eleazar; Dombret, Carlos; De Backer, Olivier; Lusis, Aldons J.; Farber, Charles R.
2012-01-01
The osteoblast-lineage consists of cells at various stages of maturation that are essential for skeletal development, growth, and maintenance. Over the past decade, many of the signaling cascades that regulate this lineage have been elucidated; however, little is known of the networks that coordinate, modulate, and transmit these signals. Here, we identify a gene network specific to the osteoblast-lineage through the reconstruction of a bone co-expression network using microarray profiles collected on 96 Hybrid Mouse Diversity Panel (HMDP) inbred strains. Of the 21 modules that comprised the bone network, module 9 (M9) contained genes that were highly correlated with prototypical osteoblast maker genes and were more highly expressed in osteoblasts relative to other bone cells. In addition, the M9 contained many of the key genes that define the osteoblast-lineage, which together suggested that it was specific to this lineage. To use the M9 to identify novel osteoblast genes and highlight its biological relevance, we knocked-down the expression of its two most connected “hub” genes, Maged1 and Pard6g. Their perturbation altered both osteoblast proliferation and differentiation. Furthermore, we demonstrated the mice deficient in Maged1 had decreased bone mineral density (BMD). It was also discovered that a local expression quantitative trait locus (eQTL) regulating the Wnt signaling antagonist Sfrp1 was a key driver of the M9. We also show that the M9 is associated with BMD in the HMDP and is enriched for genes implicated in the regulation of human BMD through genome-wide association studies. In conclusion, we have identified a physiologically relevant gene network and used it to discover novel genes and regulatory mechanisms involved in the function of osteoblast-lineage cells. Our results highlight the power of harnessing natural genetic variation to generate co-expression networks that can be used to gain insight into the function of specific cell-types. PMID:23300464
The Concept of Horizontal Linkage and Its Application to Genetics and Breeding
USDA-ARS?s Scientific Manuscript database
In diploid organisms, all dominance and many epistatic interactions are lost during the formation of gametes by meiosis. While outcrossing strategies may help to restore heterozygosity upon fertilization, there are few examples of mechanisms that retain inter- and intra-locus interactions during se...
Ueda, Sho; Oryoji, Daisuke; Yamamoto, Ken; Noh, Jaeduk Yoshimura; Okamura, Ken; Noda, Mitsuhiko; Kashiwase, Koichi; Kosuga, Yuka; Sekiya, Kenichi; Inoue, Kaori; Yamada, Hisakata; Oyamada, Akiko; Nishimura, Yasuharu; Yoshikai, Yasunobu; Ito, Koichi; Sasazuki, Takehiko
2014-02-01
Autoimmune thyroid disease (AITD) includes Graves disease (GD) and Hashimoto thyroiditis (HT), which partially share immunological features. Determining the genetic basis that distinguishes GD and HT is a key to understanding the differences between these 2 related diseases. The aims of this study were to identify HLA antigens that can explain the immunopathological difference between GD and HT and to elucidate epistatic interactions between protective and susceptible HLA alleles, which can delineate the distinct function of HLA in AITD etiology. We genotyped 991 patients with AITD (547 patients with GD and 444 patients with HT) and 481 control subjects at the HLA-A, HLA-C, HLA-B, DRB1, DQB1, and DPB1 loci. A direct comparison of HLA antigen frequencies between GD and HT was performed. We further analyzed an epistatic interaction between the susceptible and protective HLA alleles in the development of GD and HT. We identified 4 and 2 susceptible HLA molecules primarily associated with GD and HT, respectively, HLA-B*35:01, HLA-B*46:01, HLA-DRB1*14:03, and HLA-DPB1*05:01 for GD and HLA-A*02:07 and HLA-DRB4 for HT. In a direct comparison between GD and HT, we identified GD-specific susceptible class II molecules, HLA-DP5 (HLA-DPB1*05:01; Pc = 1.0 × 10(-9)) and HLA-DR14 (HLA-DRB*14:03; Pc = .0018). In contrast, HLA components on 3 common haplotypes in Japanese showed significant protective effects against the development of GD and HT (HLA-A*24:02-C*12:02-B*52:01-DRB1*15:02-DQB1*06:01-DPB1*09:01 and HLA-A*24:02-C*07:02-B*07:02-DRB1*01:01-DQB1*05:01-DPB1*04:02 haplotypes for GD and HLA-A*33:03-C*14:03-B*44:03-DRB1*13:02-DQB1*06:04-DPB1*04:01 haplotype for GD and HT). Interestingly, the representative protective HLA, HLA-DR13 (HLA-DRB1*13:02), was epistatic to susceptible HLA-DP5 in controlling the development of GD. We show that HLA exerts a dual function, susceptibility and resistance, in controlling the development of GD and HT. We also show that the protective HLA allele is partially epistatic to the susceptible HLA allele in GD.
Lee, Kyounghee; Lee, Hong Gil; Kim, Hyun Uk; Seo, Pil Joon
2015-01-01
Seed germination is a key developmental transition that initiates the plant life cycle. The timing of germination is determined by the coordinated action of two phytohormones, gibberellin and abscisic acid (ABA). In particular, ABA plays a key role in integrating environmental information and inhibiting the germination process. The utilization of embryonic lipid reserves contributes to seed germination by acting as an energy source, and ABA suppresses lipid degradation to modulate the germination process. Here, we report that the ABA-responsive R2R3-type MYB transcription factor MYB96, which is highly expressed in embryo, regulates seed germination by controlling the expression of ABSCISIC ACID-INSENSITIVE4 (ABI4) in Arabidopsis (Arabidopsis thaliana). In the presence of ABA, germination was accelerated in MYB96-deficient myb96-1 seeds, whereas the process was significantly delayed in MYB96-overexpressing activation-tagging myb96-ox seeds. Consistently, myb96-1 seeds degraded a larger extent of lipid reserves even in the presence of ABA, while reduced lipid mobilization was observed in myb96-ox seeds. MYB96 directly regulates ABI4, which acts as a repressor of lipid breakdown, to define its spatial and temporal expression. Genetic analysis further demonstrated that ABI4 is epistatic to MYB96 in the control of seed germination. Taken together, the MYB96-ABI4 module regulates lipid mobilization specifically in the embryo to ensure proper seed germination under suboptimal conditions. PMID:25869652
The identification and characterization of genetic and environmental factors that predict common, complex disease is a major goal of human genetics. The ubiquitous nature of epistatic interaction in the underlying genetic etiology of such disease presents a difficult analytical ...
SiNoPsis: Single Nucleotide Polymorphisms selection and promoter profiling.
Boloc, Daniel; Rodríguez, Natalia; Gassó, Patricia; Abril, Josep F; Bernardo, Miquel; Lafuente, Amalia; Mas, Sergi
2017-09-14
The selection of a Single Nucleotide Polymorphism (SNP) using bibliographic methods can be a very time-consuming task. Moreover, a SNP selected in this way may not be easily visualized in its genomic context by a standard user hoping to correlate it with other valuable information. Here we propose a web form built on top of Circos that can assist SNP-centred screening, based on their location in the genome and the regulatory modules they can disrupt. Its use may allow researchers to prioritize SNPs in genotyping and disease studies. SiNoPsis is bundled as a web portal. It focuses on the different structures involved in the genomic expression of a gene, especially those found in the core promoter upstream region. These structures include transcription factor binding sites (for promoter and enhancer signals), histones, and promoter flanking regions. Additionally, the tool provides eQTL and linkage disequilibrium (LD) properties for a given SNP query, yielding further clues about other indirectly associated SNPs. Possible disruptions of the aforementioned structures affecting gene transcription are reported using multiple resource databases. SiNoPsis has a simple user-friendly interface, which allows single queries by gene symbol, genomic coordinates, Ensembl gene identifiers, RefSeq transcript identifiers and SNPs. It is the only portal providing useful SNP selection based on regulatory modules and LD with functional variants in both textual and graphic modes (by properly defining the arguments and parameters needed to run Circos). SiNoPsis is freely available at https://compgen.bio.ub.edu/SiNoPsis /. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Ward-Caviness, Cavin K.; Neas, Lucas M.; Blach, Colette; Haynes, Carol S.; LaRocque-Abramson, Karen; Grass, Elizabeth; Dowdy, Elaine; Devlin, Robert B.; Diaz-Sanchez, David; Cascio, Wayne E.; Lynn Miranda, Marie; Gregory, Simon G.; Shah, Svati H.; Kraus, William E.; Hauser, Elizabeth R.
2016-01-01
There is a growing literature indicating that genetic variants modify many of the associations between environmental exposures and clinical outcomes, potentially by increasing susceptibility to these exposures. However, genome-scale investigations of these interactions have been rarely performed particularly in the case of air pollution exposures. We performed race-stratified genome-wide gene-environment interaction association studies on European-American (EA, N = 1623) and African-American (AA, N = 554) cohorts to investigate the joint influence of common single nucleotide polymorphisms (SNPs) and residential exposure to traffic (“traffic exposure”)—a recognized vascular disease risk factor—on peripheral arterial disease (PAD). Traffic exposure was estimated via the distance from the primary residence to the nearest major roadway, defined as the nearest limited access highways or major arterial. The rs755249-traffic exposure interaction was associated with PAD at a genome-wide significant level (P = 2.29x10-8) in European-Americans. Rs755249 is located in the 3’ untranslated region of BMP8A, a member of the bone morphogenic protein (BMP) gene family. Further investigation revealed several variants in BMP genes associated with PAD via an interaction with traffic exposure in both the EA and AA cohorts; this included interactions with non-synonymous variants in BMP2, which is regulated by air pollution exposure. The BMP family of genes is linked to vascular growth and calcification and is a novel gene family for the study of PAD pathophysiology. Further investigation of BMP8A using the Genotype Tissue Expression Database revealed multiple variants with nominally significant (P < 0.05) interaction P-values in our EA cohort were significant BMP8A eQTLs in tissue types highlight relevant for PAD such as rs755249 (tibial nerve, eQTL P = 3.6x10-6) and rs1180341 (tibial artery, eQTL P = 5.3x10-6). Together these results reveal a novel gene, and possibly gene family, associated with PAD via an interaction with traffic air pollution exposure. These results also highlight the potential for interactions studies, particularly at the genome scale, to reveal novel biology linking environmental exposures to clinical outcomes. PMID:27082954
Dumas, Marc-Emmanuel; Domange, Céline; Calderari, Sophie; Martínez, Andrea Rodríguez; Ayala, Rafael; Wilder, Steven P; Suárez-Zamorano, Nicolas; Collins, Stephan C; Wallis, Robert H; Gu, Quan; Wang, Yulan; Hue, Christophe; Otto, Georg W; Argoud, Karène; Navratil, Vincent; Mitchell, Steve C; Lindon, John C; Holmes, Elaine; Cazier, Jean-Baptiste; Nicholson, Jeremy K; Gauguier, Dominique
2016-09-30
The genetic regulation of metabolic phenotypes (i.e., metabotypes) in type 2 diabetes mellitus occurs through complex organ-specific cellular mechanisms and networks contributing to impaired insulin secretion and insulin resistance. Genome-wide gene expression profiling systems can dissect the genetic contributions to metabolome and transcriptome regulations. The integrative analysis of multiple gene expression traits and metabolic phenotypes (i.e., metabotypes) together with their underlying genetic regulation remains a challenge. Here, we introduce a systems genetics approach based on the topological analysis of a combined molecular network made of genes and metabolites identified through expression and metabotype quantitative trait locus mapping (i.e., eQTL and mQTL) to prioritise biological characterisation of candidate genes and traits. We used systematic metabotyping by 1 H NMR spectroscopy and genome-wide gene expression in white adipose tissue to map molecular phenotypes to genomic blocks associated with obesity and insulin secretion in a series of rat congenic strains derived from spontaneously diabetic Goto-Kakizaki (GK) and normoglycemic Brown-Norway (BN) rats. We implemented a network biology strategy approach to visualize the shortest paths between metabolites and genes significantly associated with each genomic block. Despite strong genomic similarities (95-99 %) among congenics, each strain exhibited specific patterns of gene expression and metabotypes, reflecting the metabolic consequences of series of linked genetic polymorphisms in the congenic intervals. We subsequently used the congenic panel to map quantitative trait loci underlying specific mQTLs and genome-wide eQTLs. Variation in key metabolites like glucose, succinate, lactate, or 3-hydroxybutyrate and second messenger precursors like inositol was associated with several independent genomic intervals, indicating functional redundancy in these regions. To navigate through the complexity of these association networks we mapped candidate genes and metabolites onto metabolic pathways and implemented a shortest path strategy to highlight potential mechanistic links between metabolites and transcripts at colocalized mQTLs and eQTLs. Minimizing the shortest path length drove prioritization of biological validations by gene silencing. These results underline the importance of network-based integration of multilevel systems genetics datasets to improve understanding of the genetic architecture of metabotype and transcriptomic regulation and to characterize novel functional roles for genes determining tissue-specific metabolism.
Genetic and Environmental Pathways in Type 1 Diabetes Complications
2010-09-01
active duty members of the military, their families and retired military personnel will potentially allow focused preventative treatment of at- risk...association and assess potential heterogeneity of association signals, such as by ancestry. Query eQTL databases for relevant associations. Goals: 1a1...1, so that at most an average of 7 SNPs remain as potential risk SNPs at each of the 30 loci. In Stage 2 we will genotype another 2000 cases and
Efficiently Identifying Significant Associations in Genome-wide Association Studies
Eskin, Eleazar
2013-01-01
Abstract Over the past several years, genome-wide association studies (GWAS) have implicated hundreds of genes in common disease. More recently, the GWAS approach has been utilized to identify regions of the genome that harbor variation affecting gene expression or expression quantitative trait loci (eQTLs). Unlike GWAS applied to clinical traits, where only a handful of phenotypes are analyzed per study, in eQTL studies, tens of thousands of gene expression levels are measured, and the GWAS approach is applied to each gene expression level. This leads to computing billions of statistical tests and requires substantial computational resources, particularly when applying novel statistical methods such as mixed models. We introduce a novel two-stage testing procedure that identifies all of the significant associations more efficiently than testing all the single nucleotide polymorphisms (SNPs). In the first stage, a small number of informative SNPs, or proxies, across the genome are tested. Based on their observed associations, our approach locates the regions that may contain significant SNPs and only tests additional SNPs from those regions. We show through simulations and analysis of real GWAS datasets that the proposed two-stage procedure increases the computational speed by a factor of 10. Additionally, efficient implementation of our software increases the computational speed relative to the state-of-the-art testing approaches by a factor of 75. PMID:24033261
Wang, Lu; Mariño-Ramírez, Leonardo
2017-01-01
Abstract Transposable element (TE) derived sequences are known to contribute to the regulation of the human genome. The majority of known TE-derived regulatory sequences correspond to relatively ancient insertions, which are fixed across human populations. The extent to which human genetic variation caused by recent TE activity leads to regulatory polymorphisms among populations has yet to be thoroughly explored. In this study, we searched for associations between polymorphic TE (polyTE) loci and human gene expression levels using an expression quantitative trait loci (eQTL) approach. We compared locus-specific polyTE insertion genotypes to B cell gene expression levels among 445 individuals from 5 human populations. Numerous human polyTE loci correspond to both cis and trans eQTL, and their regulatory effects are directly related to cell type-specific function in the immune system. PolyTE loci are associated with differences in expression between European and African population groups, and a single polyTE loci is indirectly associated with the expression of numerous genes via the regulation of the B cell-specific transcription factor PAX5. The polyTE-gene expression associations we found indicate that human TE genetic variation can have important phenotypic consequences. Our results reveal that TE-eQTL are involved in population-specific gene regulation as well as transcriptional network modification. PMID:27998931
Kang, Eun Yong; Martin, Lisa J.; Mangul, Serghei; Isvilanonda, Warin; Zou, Jennifer; Ben-David, Eyal; Han, Buhm; Lusis, Aldons J.; Shifman, Sagiv; Eskin, Eleazar
2016-01-01
The study of the genetics of gene expression is of considerable importance to understanding the nature of common, complex diseases. The most widely applied approach to identifying relationships between genetic variation and gene expression is the expression quantitative trait loci (eQTL) approach. Here, we increased the computational power of eQTL with an alternative and complementary approach based on analyzing allele specific expression (ASE). We designed a novel analytical method to identify cis-acting regulatory variants based on genome sequencing and measurements of ASE from RNA-sequencing (RNA-seq) data. We evaluated the power and resolution of our method using simulated data. We then applied the method to map regulatory variants affecting gene expression in lymphoblastoid cell lines (LCLs) from 77 unrelated northern and western European individuals (CEU), which were part of the HapMap project. A total of 2309 SNPs were identified as being associated with ASE patterns. The SNPs associated with ASE were enriched within promoter regions and were significantly more likely to signal strong evidence for a regulatory role. Finally, among the candidate regulatory SNPs, we identified 108 SNPs that were previously associated with human immune diseases. With further improvements in quantifying ASE from RNA-seq, the application of our method to other datasets is expected to accelerate our understanding of the biological basis of common diseases. PMID:27765809
Shaneka S. Lawson; Paula M. Pijut; Charles H. Michler
2014-01-01
Recent physiological analysis of Arabidopsis stomatal density (SD) mutants indicated that SD was not the major factor controlling aboveground biomass accumulation. Despite the general theory that plants with fewer stomata have limited biomass acquisition capabilities, epf1 and several other Arabidopsis mutants varied significantly in leaf fresh...
Molecular genetic analysis of the Phaseolus vulgaris P locus
USDA-ARS?s Scientific Manuscript database
Common bean market classes are distinguished by their many seed colors, patterns, and size. At least 23 genes, acting independently or in an epistatic manner, affect the seed coat color and pattern. The P locus which is described as the “ground factor” by Emerson, has multiple alleles and controls a...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jenson, Justin M.; Ryan, Jeremy A.; Grant, Robert A.
Overexpression of anti-apoptotic Bcl-2 family proteins contributes to cancer progression and confers resistance to chemotherapy. Small molecules that target Bcl-2 are used in the clinic to treat leukemia, but tight and selective inhibitors are not available for Bcl-2 paralog Bfl-1. Guided by computational analysis, we designed variants of the native BH3 motif PUMA that are > 150-fold selective for Bfl-1 binding. The designed peptides potently trigger disruption of the mitochondrial outer membrane in cells dependent on Bfl-1, but not in cells dependent on other anti-apoptotic homologs. High-resolution crystal structures show that designed peptide FS2 binds Bfl-1 in a shifted geometry,more » relative to PUMA and other binding partners, due to a set of epistatic mutations. FS2 modified with an electrophile reacts with a cysteine near the peptide-binding groove to augment specificity. Designed Bfl-1 binders provide reagents for cellular profiling and leads for developing enhanced and cell-permeable peptide or small-molecule inhibitors.« less
Kubo, Takahiko; Yoshimura, Atsushi; Kurata, Nori
2018-02-10
Hybrid male sterility genes are important factors in creating postzygotic reproductive isolation barriers in plants. One such gene, S25, is known to cause severe transmission ratio distortion in inter-subspecific progeny of cultivated rice Oryza sativa ssp. indica and japonica. To further characterize the S25 gene, we fine-mapped and genetically characterized the S25 gene using near-isogenic lines with reciprocal genetic backgrounds. We mapped the S25 locus within the 0.67-1.02 Mb region on rice chromosome 12. Further genetic analyses revealed that S25 substantially reduced male fertility in the japonica background, but not in the indica background. In first-generation hybrid progeny, S25 had a milder effect than it had in the japonica background. These results suggest that the expression of S25 is epistatically regulated by at least one partially dominant gene present in the indica genome. This finding supports our previous studies showing that hybrid male sterility due to pollen killer genes results from epistatic interaction with other genes that are hidden in the genetic background.
Jenson, Justin M; Ryan, Jeremy A; Grant, Robert A; Letai, Anthony; Keating, Amy E
2017-01-01
Overexpression of anti-apoptotic Bcl-2 family proteins contributes to cancer progression and confers resistance to chemotherapy. Small molecules that target Bcl-2 are used in the clinic to treat leukemia, but tight and selective inhibitors are not available for Bcl-2 paralog Bfl-1. Guided by computational analysis, we designed variants of the native BH3 motif PUMA that are > 150-fold selective for Bfl-1 binding. The designed peptides potently trigger disruption of the mitochondrial outer membrane in cells dependent on Bfl-1, but not in cells dependent on other anti-apoptotic homologs. High-resolution crystal structures show that designed peptide FS2 binds Bfl-1 in a shifted geometry, relative to PUMA and other binding partners, due to a set of epistatic mutations. FS2 modified with an electrophile reacts with a cysteine near the peptide-binding groove to augment specificity. Designed Bfl-1 binders provide reagents for cellular profiling and leads for developing enhanced and cell-permeable peptide or small-molecule inhibitors. DOI: http://dx.doi.org/10.7554/eLife.25541.001 PMID:28594323
Odagiri, Nao; Seki, Masayuki; Onoda, Fumitoshi; Yoshimura, Akari; Watanabe, Sei; Enomoto, Takemi
2003-03-01
MMS4 of Saccharomyces cerevisiae was originally identified as the gene responsible for one of the collection of methyl methanesulfonate (MMS)-sensitive mutants, mms4. Recently it was identified as a synthetic lethal gene with an SGS1 mutation. Epistatic analyses revealed that MMS4 is involved in a pathway leading to homologous recombination requiring Rad52 or in the recombination itself, in which SGS1 is also involved. MMS sensitivity of mms4 but not sgs1, was suppressed by introducing a bacterial Holliday junction (HJ) resolvase, RusA. The frequencies of spontaneously occurring unequal sister chromatid recombination (SCR) and loss of marker in the rDNA in haploid mms4 cells and interchromosomal recombination between heteroalleles in diploid mms4 cells were essentially the same as those of wild-type cells. Although UV- and MMS-induced interchromosomal recombination was defective in sgs1 diploid cells, hyper-induction of interchromosomal recombination was observed in diploid mms4 cells, indicating that the function of Mms4 is dispensable for this type of recombination.
Genetic Analysis of Reduced γ-Tocopherol Content in Ethiopian Mustard Seeds.
García-Navarro, Elena; Fernández-Martínez, José M; Pérez-Vich, Begoña; Velasco, Leonardo
2016-01-01
Ethiopian mustard (Brassica carinata A. Braun) line BCT-6, with reduced γ-tocopherol content in the seeds, has been previously developed. The objective of this research was to conduct a genetic analysis of seed tocopherols in this line. BCT-6 was crossed with the conventional line C-101 and the F1, F2, and BC plant generations were analyzed. Generation mean analysis using individual scaling tests indicated that reduced γ-tocopherol content fitted an additive-dominant genetic model with predominance of additive effects and absence of epistatic interactions. This was confirmed through a joint scaling test and additional testing of the goodness of fit of the model. Conversely, epistatic interactions were identified for total tocopherol content. Estimation of the minimum number of genes suggested that both γ- and total tocopherol content may be controlled by two genes. A positive correlation between total tocopherol content and the proportion of γ-tocopherol was identified in the F2 generation. Additional research on the feasibility of developing germplasm with high tocopherol content and reduced concentration of γ-tocopherol is required.
Genetic control of plant height in European winter wheat cultivars.
Würschum, Tobias; Langer, Simon M; Longin, C Friedrich H
2015-05-01
Plant height variation in European winter wheat cultivars is mainly controlled by the Rht - D1 and Rht - B1 semi-dwarfing genes, but also by other medium- or small-effect QTL and potentially epistatic QTL enabling fine adjustments of plant height. Plant height is an important goal in wheat (Triticum aestivum L.) breeding as it affects crop performance and thus yield and quality. The aim of this study was to investigate the genetic control of plant height in European winter wheat cultivars. To this end, a panel of 410 winter wheat varieties from across Europe was evaluated for plant height in multi-location field trials and genotyped for the candidate loci Rht-B1, Rht-D1, Rht8, Ppd-B1 copy number variation and Ppd-D1 as well as by a genotyping-by-sequencing approach yielding 23,371 markers with known map position. We found that Rht-D1 and Rht-B1 had the largest effects on plant height in this cultivar collection explaining 40.9 and 15.5% of the genotypic variance, respectively, while Ppd-D1 and Rht8 accounted for 3.0 and 2.0% of the variance, respectively. A genome-wide scan for marker-trait associations yielded two additional medium-effect QTL located on chromosomes 6A and 5B explaining 11.0 and 5.7% of the genotypic variance after the effects of the candidate loci were accounted for. In addition, we identified several small-effect QTL as well as epistatic QTL contributing to the genetic architecture of plant height. Taken together, our results show that the two Rht-1 semi-dwarfing genes are the major sources of variation in European winter wheat cultivars and that other small- or medium-effect QTL and potentially epistatic QTL enable fine adjustments in plant height.
Jing, Jing-Jing; Lu, You-Zhu; Sun, Li-Ping; Liu, Jing-Wei; Gong, Yue-Hua; Xu, Qian; Dong, Nan-Nan; Yuan, Yuan
2017-06-27
Excision repair cross-complementing group 6 and 8 (ERCC6 and ERCC8) are two indispensable genes for the initiation of transcription-coupled nucleotide excision repair pathway. This study aimed to evaluate the interactions between single nucleotide polymorphisms of ERCC6 (rs1917799) and ERCC8 (rs158572 and rs158916) in gastric cancer and its precancerous diseases. Besides, protein level analysis were performed to compare ERCC6 and ERCC8 expression in different stages of gastric diseases, and to correlate SNPs jointly with gene expression. Sequenom MassARRAY platform method was used to detect polymorphisms of ERCC6 and ERCC8 in 1916 subjects. In situ ERCC6 and ERCC8 protein expression were detected by immunohistochemistry in 109 chronic superficial gastritis, 109 chronic atrophic gastritis and 109 gastric cancer cases. Our results demonstrated pairwise epistatic interactions between ERCC6 and ERCC8 SNPs that ERCC6 rs1917799-ERCC8 rs158572 combination was associated with decreased risk of chronic atrophic gastritis and increased risk of gastric cancer. ERCC6 rs1917799 also showed a significant interaction with ERCC8 rs158916 to reduce gastric cancer risk. The expressions of ERCC6, ERCC8 and ERCC6-ERCC8 combination have similarities that higher positivity was observed in chronic superficial gastritis compared with chronic atrophic gastritis and gastric cancer. As for the effects of ERCC6 and ERCC8 SNPs on the protein expression, single SNP had no correlation with corresponding gene expression, whereas the ERCC6 rs1917799-ERCC8 rs158572 pair had significant influence on ERCC6 and ERCC6-ERCC8 expression. In conclusion, ERCC6 rs1917799, ERCC8 rs158572 and rs158916 demonstrated pairwise epistatic interactions to associate with chronic atrophic gastritis and gastric cancer risk. The ERCC6 rs1917799-ERCC8 rs158572 pair significantly influence ERCC6 and ERCC6-ERCC8 expression.
Li, Li; Paulo, Maria-João; van Eeuwijk, Fred
2010-01-01
Association mapping using DNA-based markers is a novel tool in plant genetics for the analysis of complex traits. Potato tuber yield, starch content, starch yield and chip color are complex traits of agronomic relevance, for which carbohydrate metabolism plays an important role. At the functional level, the genes and biochemical pathways involved in carbohydrate metabolism are among the best studied in plants. Quantitative traits such as tuber starch and sugar content are therefore models for association genetics in potato based on candidate genes. In an association mapping experiment conducted with a population of 243 tetraploid potato varieties and breeding clones, we previously identified associations between individual candidate gene alleles and tuber starch content, starch yield and chip quality. In the present paper, we tested 190 DNA markers at 36 loci scored in the same association mapping population for pairwise statistical epistatic interactions. Fifty marker pairs were associated mainly with tuber starch content and/or starch yield, at a cut-off value of q ≤ 0.20 for the experiment-wide false discovery rate (FDR). Thirteen marker pairs had an FDR of q ≤ 0.10. Alleles at loci encoding ribulose-bisphosphate carboxylase/oxygenase activase (Rca), sucrose phosphate synthase (Sps) and vacuolar invertase (Pain1) were most frequently involved in statistical epistatic interactions. The largest effect on tuber starch content and starch yield was observed for the paired alleles Pain1-8c and Rca-1a, explaining 9 and 10% of the total variance, respectively. The combination of these two alleles increased the means of tuber starch content and starch yield. Biological models to explain the observed statistical epistatic interactions are discussed. Electronic supplementary material The online version of this article (doi:10.1007/s00122-010-1389-3) contains supplementary material, which is available to authorized users. PMID:20603706
Urbanowicz, Ryan J; Kiralis, Jeff; Sinnott-Armstrong, Nicholas A; Heberling, Tamra; Fisher, Jonathan M; Moore, Jason H
2012-10-01
Geneticists who look beyond single locus disease associations require additional strategies for the detection of complex multi-locus effects. Epistasis, a multi-locus masking effect, presents a particular challenge, and has been the target of bioinformatic development. Thorough evaluation of new algorithms calls for simulation studies in which known disease models are sought. To date, the best methods for generating simulated multi-locus epistatic models rely on genetic algorithms. However, such methods are computationally expensive, difficult to adapt to multiple objectives, and unlikely to yield models with a precise form of epistasis which we refer to as pure and strict. Purely and strictly epistatic models constitute the worst-case in terms of detecting disease associations, since such associations may only be observed if all n-loci are included in the disease model. This makes them an attractive gold standard for simulation studies considering complex multi-locus effects. We introduce GAMETES, a user-friendly software package and algorithm which generates complex biallelic single nucleotide polymorphism (SNP) disease models for simulation studies. GAMETES rapidly and precisely generates random, pure, strict n-locus models with specified genetic constraints. These constraints include heritability, minor allele frequencies of the SNPs, and population prevalence. GAMETES also includes a simple dataset simulation strategy which may be utilized to rapidly generate an archive of simulated datasets for given genetic models. We highlight the utility and limitations of GAMETES with an example simulation study using MDR, an algorithm designed to detect epistasis. GAMETES is a fast, flexible, and precise tool for generating complex n-locus models with random architectures. While GAMETES has a limited ability to generate models with higher heritabilities, it is proficient at generating the lower heritability models typically used in simulation studies evaluating new algorithms. In addition, the GAMETES modeling strategy may be flexibly combined with any dataset simulation strategy. Beyond dataset simulation, GAMETES could be employed to pursue theoretical characterization of genetic models and epistasis.
Dorababu, Patchva; Nagesh, Narayana; Linga, Vijay Gandhi; Gundeti, Sadashivudu; Kutala, Vijay Kumar; Reddanna, Pallu; Digumarti, Raghunadharao
2012-04-01
To explore the role of genetic variants of thiopurine methyltransferase (TPMT) and inosine triphosphate pyrophosphatase (ITPA) in 6-mercaptopurine (6-MP)-induced toxicity in Indian children with acute lymphoblastic leukemia (ALL). Children with ALL receiving 6-MP in maintenance phase of treatment (n = 90) were enrolled in the study. Bidirectional sequencing of TPMT (whole gene) and ITPA (exon 2, exon 3, and intron 2) was undertaken, and correlation between genotype and 6-MP toxicity was assessed. Five variations were observed in TPMT, including two exonic variations, TPMT*12 (374 C > T) and TPMT*3C (719A > G), and three intronic, intron 3 (12356 C > T), intron 4 (16638 C > T), and TPMT rs2842949. Two exonic, ITPA exon -2 (94 C → A) and exon 3 of ITPA (138 G > A), and one intronic, ITPA intron 2 (A→C), variations were observed in ITPA. Multifactor dimensionality reduction analysis of all the genetic variants showed independent association of ITPA 94 C→A as well as synergic epistatic interactions, i.e., TPMT*12 × ITPA ex3, ITPA ex2 × TPMT*12 × ITPA ex3, and TPMT*3C × ITPA ex2 × TPMT*12 × ITPA ex3, in determining hematological toxicity. This is further substantiated by a multiple linear regression model, which showed moderate predictability of toxicity with these variants (area under the curve = 0.70, p = 0.004). Our results suggest that apart from the individual effect of ITPA 94 C→A, epistatic interactions between the variations of TPMT (*3C, *12) and ITPA (ex2, ex3) are associated with the 6-MP toxicity. Testing these variants facilitates tailoring of the 6-MP therapy in children with ALL.
Predictable Phenotypes of Antibiotic Resistance Mutations.
Knopp, M; Andersson, D I
2018-05-15
Antibiotic-resistant bacteria represent a major threat to our ability to treat bacterial infections. Two factors that determine the evolutionary success of antibiotic resistance mutations are their impact on resistance level and the fitness cost. Recent studies suggest that resistance mutations commonly show epistatic interactions, which would complicate predictions of their stability in bacterial populations. We analyzed 13 different chromosomal resistance mutations and 10 host strains of Salmonella enterica and Escherichia coli to address two main questions. (i) Are there epistatic interactions between different chromosomal resistance mutations? (ii) How does the strain background and genetic distance influence the effect of chromosomal resistance mutations on resistance and fitness? Our results show that the effects of combined resistance mutations on resistance and fitness are largely predictable and that epistasis remains rare even when up to four mutations were combined. Furthermore, a majority of the mutations, especially target alteration mutations, demonstrate strain-independent phenotypes across different species. This study extends our understanding of epistasis among resistance mutations and shows that interactions between different resistance mutations are often predictable from the characteristics of the individual mutations. IMPORTANCE The spread of antibiotic-resistant bacteria imposes an urgent threat to public health. The ability to forecast the evolutionary success of resistant mutants would help to combat dissemination of antibiotic resistance. Previous studies have shown that the phenotypic effects (fitness and resistance level) of resistance mutations can vary substantially depending on the genetic context in which they occur. We conducted a broad screen using many different resistance mutations and host strains to identify potential epistatic interactions between various types of resistance mutations and to determine the effect of strain background on resistance phenotypes. Combinations of several different mutations showed a large amount of phenotypic predictability, and the majority of the mutations displayed strain-independent phenotypes. However, we also identified a few outliers from these patterns, illustrating that the choice of host organism can be critically important when studying antibiotic resistance mutations. Copyright © 2018 Knopp and Andersson.
Su, Guosheng; Christensen, Ole F.; Ostersen, Tage; Henryon, Mark; Lund, Mogens S.
2012-01-01
Non-additive genetic variation is usually ignored when genome-wide markers are used to study the genetic architecture and genomic prediction of complex traits in human, wild life, model organisms or farm animals. However, non-additive genetic effects may have an important contribution to total genetic variation of complex traits. This study presented a genomic BLUP model including additive and non-additive genetic effects, in which additive and non-additive genetic relation matrices were constructed from information of genome-wide dense single nucleotide polymorphism (SNP) markers. In addition, this study for the first time proposed a method to construct dominance relationship matrix using SNP markers and demonstrated it in detail. The proposed model was implemented to investigate the amounts of additive genetic, dominance and epistatic variations, and assessed the accuracy and unbiasedness of genomic predictions for daily gain in pigs. In the analysis of daily gain, four linear models were used: 1) a simple additive genetic model (MA), 2) a model including both additive and additive by additive epistatic genetic effects (MAE), 3) a model including both additive and dominance genetic effects (MAD), and 4) a full model including all three genetic components (MAED). Estimates of narrow-sense heritability were 0.397, 0.373, 0.379 and 0.357 for models MA, MAE, MAD and MAED, respectively. Estimated dominance variance and additive by additive epistatic variance accounted for 5.6% and 9.5% of the total phenotypic variance, respectively. Based on model MAED, the estimate of broad-sense heritability was 0.506. Reliabilities of genomic predicted breeding values for the animals without performance records were 28.5%, 28.8%, 29.2% and 29.5% for models MA, MAE, MAD and MAED, respectively. In addition, models including non-additive genetic effects improved unbiasedness of genomic predictions. PMID:23028912
Kubo, Takahiko; Yamagata, Yoshiyuki; Eguchi, Maki; Yoshimura, Atsushi
2008-12-01
Postzygotic reproductive isolation (RI) often arises in inter-subspecific crosses as well as inter-specific crosses of rice (Oryza sativa L.). To further understand the genetic architecture of the postzygotic RI, we analyzed genes causing hybrid sterility and hybrid breakdown in a rice inter-subspecific cross. Here we report hybrid male sterility caused by epistatic interaction between two novel genes, S24 and S35, which were identified on rice chromosomes 5 and 1, respectively. Genetic analysis using near-isogenic lines (NILs) carrying IR24 (ssp. indica) segments with Asominori (ssp. japonica) genetic background revealed a complicated aspect of the epistasis. Allelic interaction at the S24 locus in the heterozygous plants caused abortion of male gametes carrying the Asominori allele (S24-as) independent of the S35 genotype. On the other hand, male gametes carrying the Asominori allele at the S35 locus (S35-as) showed abortion only when the IR24 allele at the S24 locus (S24-ir) was concurrently introgressed into the S35 heterozygous plants, indicating that the sterility phenotype due to S35 was dependent on the S24 genotype through negative epistasis between S24-ir and S35-as alleles. Due to the interaction between S24 and S35, self-pollination of the double heterozygous plants produced pollen-sterile progeny carrying the S24-ir/S24-ir S35-as/S35-ir genotype in addition to the S24 heterozygous plants. This result suggests that the S35 gene might function as a modifier of S24. This study presents strong evidence for the importance of epistatic interaction as a part of the genetic architecture of hybrid sterility in rice. In addition, it suggests that diverse systems have been developed as postzygotic RI mechanisms within the rice.
The genetic architecture of maize (Zea mays L.) kernel weight determination.
Alvarez Prado, Santiago; López, César G; Senior, M Lynn; Borrás, Lucas
2014-09-18
Individual kernel weight is an important trait for maize yield determination. We have identified genomic regions controlling this trait by using the B73xMo17 population; however, the effect of genetic background on control of this complex trait and its physiological components is not yet known. The objective of this study was to understand how genetic background affected our previous results. Two nested stable recombinant inbred line populations (N209xMo17 and R18xMo17) were designed for this purpose. A total of 408 recombinant inbred lines were genotyped and phenotyped at two environments for kernel weight and five other traits related to kernel growth and development. All traits showed very high and significant (P < 0.001) phenotypic variability and medium-to-high heritability (0.60-0.90). When N209xMo17 and R18xMo17 were analyzed separately, a total of 23 environmentally stable quantitative trait loci (QTL) and five epistatic interactions were detected for N209xMo17. For R18xMo17, 59 environmentally stable QTL and 17 epistatic interactions were detected. A joint analysis detected 14 stable QTL regardless of the genetic background. Between 57 and 83% of detected QTL were population specific, denoting medium-to-high genetic background effects. This percentage was dependent on the trait. A meta-analysis including our previous B73xMo17 results identified five relevant genomic regions deserving further characterization. In summary, our grain filling traits were dominated by small additive QTL with several epistatic and few environmental interactions and medium-to-high genetic background effects. This study demonstrates that the number of detected QTL and additive effects for different physiologically related grain filling traits need to be understood relative to the specific germplasm. Copyright © 2014 Alvarez Prado et al.
Advances in Genetical Genomics of Plants
Joosen, R.V.L.; Ligterink, W.; Hilhorst, H.W.M.; Keurentjes, J.J.B.
2009-01-01
Natural variation provides a valuable resource to study the genetic regulation of quantitative traits. In quantitative trait locus (QTL) analyses this variation, captured in segregating mapping populations, is used to identify the genomic regions affecting these traits. The identification of the causal genes underlying QTLs is a major challenge for which the detection of gene expression differences is of major importance. By combining genetics with large scale expression profiling (i.e. genetical genomics), resulting in expression QTLs (eQTLs), great progress can be made in connecting phenotypic variation to genotypic diversity. In this review we discuss examples from human, mouse, Drosophila, yeast and plant research to illustrate the advances in genetical genomics, with a focus on understanding the regulatory mechanisms underlying natural variation. With their tolerance to inbreeding, short generation time and ease to generate large families, plants are ideal subjects to test new concepts in genetics. The comprehensive resources which are available for Arabidopsis make it a favorite model plant but genetical genomics also found its way to important crop species like rice, barley and wheat. We discuss eQTL profiling with respect to cis and trans regulation and show how combined studies with other ‘omics’ technologies, such as metabolomics and proteomics may further augment current information on transcriptional, translational and metabolomic signaling pathways and enable reconstruction of detailed regulatory networks. The fast developments in the ‘omics’ area will offer great potential for genetical genomics to elucidate the genotype-phenotype relationships for both fundamental and applied research. PMID:20514216
Tang, Yuping; Jin, Bo; Zhou, Lingling; Lu, Weifeng
2017-01-10
Earlier GWAS has identified that rs17782313 near MC4R was associated with obesity. However, subsequent studies showed conflicting results, especially among childhood. Besides, the mechanisms underlying the association between rs17782313 and childhood obesity remain largely unexplored, and genetic and epigenetic may interact and together affect the development of childhood obesity. We conducted a comprehensive meta-analysis to assess the association between rs17782313 and childhood obesity. MeQTL and eQTL analysis was applied to explore the effect of rs17782313 on DNA methylation and MC4R expression. We found that rs17782313 near MC4R was associated with increased childhood obesity risk and BMI z-score in several inheritable models (P < 0.05). Additionally, the similar trend was observed among subgroups of Asians, Caucasian. Furthermore, meQTL and eQTL analysis indicated that individuals carrying rs17782313 TT genotype were significantly associated with increased methylation level of cg10097150 located in MC4R promoter and decreased expression of MC4R than those with CT/CC genotype (P = 1.7 × 10-4 and P = 1.9 × 10-3 respectively). Our results strongly confirmed that rs17782313 was associated with increased risk of childhood obesity. Furthermore, rs17782313 T allele was correlated with promoter hypermethylation and decreased expression of MC4R, thus involved in the development of childhood obesity.
Ponsuksili, Siriluck; Zebunke, Manuela; Murani, Eduard; Trakooljul, Nares; Krieter, Joachim; Puppe, Birger; Schwerin, Manfred; Wimmers, Klaus
2015-01-01
Animal personality and coping styles are basic concepts for evaluating animal welfare. Struggling response of piglets in so-called backtests early in life reflects their coping strategy. Behavioral reactions of piglets in backtests have a moderate heritability, but their genetic basis largely remains unknown. Here, latency, duration and frequency of struggling attempts during one-minute backtests were repeatedly recorded of piglets at days 5, 12, 19, and 26. A genome-wide association study for backtest traits revealed 465 significant SNPs (FDR ≤ 0.05) mostly located in QTL (quantitative trait locus) regions on chromosome 3, 5, 12 and 16. In order to capture genes in these regions, 37 transcripts with significant SNPs were selected for expressionQTL analysis in the hypothalamus. Eight genes (ASGR1, CPAMD8, CTC1, FBXO39, IL19, LOC100511790, RAD51B, UBOX5) had cis- and five (RANGRF, PER1, PDZRN3, SH2D4B, LONP2) had trans-expressionQTL. In particular, for PER1, with known physiological implications for maintenance of circadian rhythms, a role in coping behavior was evidenced by confirmed association in an independent population. For CTC1 a cis-expression QTL and the consistent relationship of gene polymorphism, mRNA expression level and backtest traits promoted its link to coping style. GWAS and eQTL analyses uncovered positional and functional gene candidates for coping behavior. PMID:26537429
Ponsuksili, Siriluck; Zebunke, Manuela; Murani, Eduard; Trakooljul, Nares; Krieter, Joachim; Puppe, Birger; Schwerin, Manfred; Wimmers, Klaus
2015-11-05
Animal personality and coping styles are basic concepts for evaluating animal welfare. Struggling response of piglets in so-called backtests early in life reflects their coping strategy. Behavioral reactions of piglets in backtests have a moderate heritability, but their genetic basis largely remains unknown. Here, latency, duration and frequency of struggling attempts during one-minute backtests were repeatedly recorded of piglets at days 5, 12, 19, and 26. A genome-wide association study for backtest traits revealed 465 significant SNPs (FDR ≤ 0.05) mostly located in QTL (quantitative trait locus) regions on chromosome 3, 5, 12 and 16. In order to capture genes in these regions, 37 transcripts with significant SNPs were selected for expressionQTL analysis in the hypothalamus. Eight genes (ASGR1, CPAMD8, CTC1, FBXO39, IL19, LOC100511790, RAD51B, UBOX5) had cis- and five (RANGRF, PER1, PDZRN3, SH2D4B, LONP2) had trans-expressionQTL. In particular, for PER1, with known physiological implications for maintenance of circadian rhythms, a role in coping behavior was evidenced by confirmed association in an independent population. For CTC1 a cis-expression QTL and the consistent relationship of gene polymorphism, mRNA expression level and backtest traits promoted its link to coping style. GWAS and eQTL analyses uncovered positional and functional gene candidates for coping behavior.
Inherited variants in regulatory T cell genes and outcome of ovarian cancer.
Goode, Ellen L; DeRycke, Melissa; Kalli, Kimberly R; Oberg, Ann L; Cunningham, Julie M; Maurer, Matthew J; Fridley, Brooke L; Armasu, Sebastian M; Serie, Daniel J; Ramar, Priya; Goergen, Krista; Vierkant, Robert A; Rider, David N; Sicotte, Hugues; Wang, Chen; Winterhoff, Boris; Phelan, Catherine M; Schildkraut, Joellen M; Weber, Rachel P; Iversen, Ed; Berchuck, Andrew; Sutphen, Rebecca; Birrer, Michael J; Hampras, Shalaka; Preus, Leah; Gayther, Simon A; Ramus, Susan J; Wentzensen, Nicolas; Yang, Hannah P; Garcia-Closas, Montserrat; Song, Honglin; Tyrer, Jonathan; Pharoah, Paul P D; Konecny, Gottfried; Sellers, Thomas A; Ness, Roberta B; Sucheston, Lara E; Odunsi, Kunle; Hartmann, Lynn C; Moysich, Kirsten B; Knutson, Keith L
2013-01-01
Although ovarian cancer is the most lethal of gynecologic malignancies, wide variation in outcome following conventional therapy continues to exist. The presence of tumor-infiltrating regulatory T cells (Tregs) has a role in outcome of this disease, and a growing body of data supports the existence of inherited prognostic factors. However, the role of inherited variants in genes encoding Treg-related immune molecules has not been fully explored. We analyzed expression quantitative trait loci (eQTL) and sequence-based tagging single nucleotide polymorphisms (tagSNPs) for 54 genes associated with Tregs in 3,662 invasive ovarian cancer cases. With adjustment for known prognostic factors, suggestive results were observed among rarer histological subtypes; poorer survival was associated with minor alleles at SNPs in RGS1 (clear cell, rs10921202, p=2.7×10(-5)), LRRC32 and TNFRSF18/TNFRSF4 (mucinous, rs3781699, p=4.5×10(-4), and rs3753348, p=9.0×10(-4), respectively), and CD80 (endometrioid, rs13071247, p=8.0×10(-4)). Fo0r the latter, correlative data support a CD80 rs13071247 genotype association with CD80 tumor RNA expression (p=0.006). An additional eQTL SNP in CD80 was associated with shorter survival (rs7804190, p=8.1×10(-4)) among all cases combined. As the products of these genes are known to affect induction, trafficking, or immunosuppressive function of Tregs, these results suggest the need for follow-up phenotypic studies.
Coleman, Jonathan R I; Lester, Kathryn J; Roberts, Susanna; Keers, Robert; Lee, Sang Hyuck; De Jong, Simone; Gaspar, Héléna; Teismann, Tobias; Wannemüller, André; Schneider, Silvia; Jöhren, Peter; Margraf, Jürgen; Breen, Gerome; Eley, Thalia C
2017-04-01
Exposure-based cognitive behavioural therapy (eCBT) is an effective treatment for anxiety disorders. Response varies between individuals. Gene expression integrates genetic and environmental influences. We analysed the effect of gene expression and genetic markers separately and together on treatment response. Adult participants (n ≤ 181) diagnosed with panic disorder or a specific phobia underwent eCBT as part of standard care. Percentage decrease in the Clinical Global Impression severity rating was assessed across treatment, and between baseline and a 6-month follow-up. Associations with treatment response were assessed using expression data from 3,233 probes, and expression profiles clustered in a data- and literature-driven manner. A total of 3,343,497 genetic variants were used to predict treatment response alone and combined in polygenic risk scores. Genotype and expression data were combined in expression quantitative trait loci (eQTL) analyses. Expression levels were not associated with either treatment phenotype in any analysis. A total of 1,492 eQTLs were identified with q < 0.05, but interactions between genetic variants and treatment response did not affect expression levels significantly. Genetic variants did not significantly predict treatment response alone or in polygenic risk scores. We assessed gene expression alone and alongside genetic variants. No associations with treatment outcome were identified. Future studies require larger sample sizes to discover associations.
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs
Cheng, Zhongshan; Chu, Hin; Fan, Yanhui; Li, Cun; Song, You-Qiang; Zhou, Jie; Yuen, Kwok-Yung
2015-01-01
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated a LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable; while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found cis-eQTLs, especially the population-shared cis-eQTLs are significantly enriched toward transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline Post-GWAS Explorer for Functional Indels and SNPs (PExFInS) has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets. PMID:26612672
Genetical Genomics Identifies the Genetic Architecture for Growth and Weevil Resistance in Spruce
Porth, Ilga; White, Richard; Jaquish, Barry; Alfaro, René; Ritland, Carol; Ritland, Kermit
2012-01-01
In plants, relationships between resistance to herbivorous insect pests and growth are typically controlled by complex interactions between genetically correlated traits. These relationships often result in tradeoffs in phenotypic expression. In this study we used genetical genomics to elucidate genetic relationships between tree growth and resistance to white pine terminal weevil (Pissodes strobi Peck.) in a pedigree population of interior spruce (Picea glauca, P. engelmannii and their hybrids) that was growing at Vernon, B.C. and segregating for weevil resistance. Genetical genomics uses genetic perturbations caused by allelic segregation in pedigrees to co-locate quantitative trait loci (QTLs) for gene expression and quantitative traits. Bark tissue of apical leaders from 188 trees was assayed for gene expression using a 21.8K spruce EST-spotted microarray; the same individuals were genotyped for 384 SNP markers for the genetic map. Many of the expression QTLs (eQTL) co-localized with resistance trait QTLs. For a composite resistance phenotype of six attack and oviposition traits, 149 positional candidate genes were identified. Resistance and growth QTLs also overlapped with eQTL hotspots along the genome suggesting that: 1) genetic pleiotropy of resistance and growth traits in interior spruce was substantial, and 2) master regulatory genes were important for weevil resistance in spruce. These results will enable future work on functional genetic studies of insect resistance in spruce, and provide valuable information about candidate genes for genetic improvement of spruce. PMID:22973444
Chen, Wenan; McDonnell, Shannon K; Thibodeau, Stephen N; Tillmans, Lori S; Schaid, Daniel J
2016-11-01
Functional annotations have been shown to improve both the discovery power and fine-mapping accuracy in genome-wide association studies. However, the optimal strategy to incorporate the large number of existing annotations is still not clear. In this study, we propose a Bayesian framework to incorporate functional annotations in a systematic manner. We compute the maximum a posteriori solution and use cross validation to find the optimal penalty parameters. By extending our previous fine-mapping method CAVIARBF into this framework, we require only summary statistics as input. We also derived an exact calculation of Bayes factors using summary statistics for quantitative traits, which is necessary when a large proportion of trait variance is explained by the variants of interest, such as in fine mapping expression quantitative trait loci (eQTL). We compared the proposed method with PAINTOR using different strategies to combine annotations. Simulation results show that the proposed method achieves the best accuracy in identifying causal variants among the different strategies and methods compared. We also find that for annotations with moderate effects from a large annotation pool, screening annotations individually and then combining the top annotations can produce overly optimistic results. We applied these methods on two real data sets: a meta-analysis result of lipid traits and a cis-eQTL study of normal prostate tissues. For the eQTL data, incorporating annotations significantly increased the number of potential causal variants with high probabilities. Copyright © 2016 by the Genetics Society of America.
Qu, Wen; Gurdziel, Katherine; Pique-Regi, Roger; Ruden, Douglas M
2017-01-01
Lead (Pb) poisoning has been a major public health issue globally and the recent Flint water crisis has drawn nation-wide attention to its effects. To better understand how lead plays a role as a neurotoxin, we utilized the Drosophila melanogaster model to study the genetic effects of lead exposure during development and identified lead-responsive genes. In our previous studies, we have successfully identified hundreds of lead-responsive expression QTLs (eQTLs) by using RNA-seq analysis on heads collected from the Drosophila Synthetic Population Resource. Cis -eQTLs, also known as allele-specific expression (ASE) polymorphisms, are generally single-nucleotide polymorphisms in the promoter regions of genes that affect expression of the gene, such as by inhibiting the binding of transcription factors. Trans -eQTLs are genes that regulate mRNA levels for many genes, and are generally thought to be SNPs in trans -acting transcription or translation factors. In this study, we focused our attention on alternative splicing events that are affected by lead exposure. Splicing QTLs (sQTLs), which can be caused by SNPs that alter splicing or alternative splicing (AS), such as by changing the sequence-specific binding affinity of splicing factors to the pre-mRNA. We applied two methods in search for sQTLs by using RNA-seq data from control and lead-exposed w 1118 Drosophila heads. First, we used the fraction of reads in a gene that falls in each exon as the phenotype. Second, we directly compared the transcript counts among the various splicing isoforms as the phenotype. Among the 1,236 potential Pb-responsive sQTLs ( p < 0.0001, FDR < 0.39), mostly cis -sQTLs, one of the most distinct genes is Dscam1 (Down Syndrome Cell Adhesion Molecule), which has over 30,000 potential alternative splicing isoforms. We have also identified a candidate Pb-responsive trans -sQTL hotspot that appears to regulate 129 genes that are enriched in the "cation channel" gene ontology category, suggesting a model in which alternative splicing of these channels might lead to an increase in the elimination of Pb 2+ from the neurons encoding these channels. To our knowledge, this is the first paper that uses sQTL analyses to understand the neurotoxicology of an environmental toxin in any organism, and the first reported discovery of a candidate trans -sQTL hotspot.
Hu, Ting; Chen, Yuanzhu; Kiralis, Jeff W; Collins, Ryan L; Wejse, Christian; Sirugo, Giorgio; Williams, Scott M; Moore, Jason H
2013-01-01
Background Epistasis has been historically used to describe the phenomenon that the effect of a given gene on a phenotype can be dependent on one or more other genes, and is an essential element for understanding the association between genetic and phenotypic variations. Quantifying epistasis of orders higher than two is very challenging due to both the computational complexity of enumerating all possible combinations in genome-wide data and the lack of efficient and effective methodologies. Objectives In this study, we propose a fast, non-parametric, and model-free measure for three-way epistasis. Methods Such a measure is based on information gain, and is able to separate all lower order effects from pure three-way epistasis. Results Our method was verified on synthetic data and applied to real data from a candidate-gene study of tuberculosis in a West African population. In the tuberculosis data, we found a statistically significant pure three-way epistatic interaction effect that was stronger than any lower-order associations. Conclusion Our study provides a methodological basis for detecting and characterizing high-order gene-gene interactions in genetic association studies. PMID:23396514
Lachance, Joseph; True, John R.
2010-01-01
Substantial genetic variation exists in natural populations of Drosophila melanogaster. This segregating variation includes alleles at different loci that interact to cause lethality or sterility (synthetic incompatibilities). Fitness epistasis in natural populations has important implications for speciation and the rate of adaptive evolution. To assess the prevalence of epistatic fitness interactions, we placed naturally occurring X chromosomes into genetic backgrounds derived from different geographic locations. Considerable amounts of synthetic incompatibilities were observed between X chromosomes and autosomes: greater than 44% of all combinations were either lethal or sterile. Sex-specific lethality and sterility were also tested to determine whether Haldane's rule holds for within-species variation. Surprisingly, we observed an excess of female sterility in genotypes that were homozygous, but not heterozygous, for the X chromosome. The recessive nature of these incompatibilities is similar to that predicted for incompatibilities underlying Haldane’s rule. Our study also found higher levels of sterility and lethality for genomes that contain chromosomes from different geographical regions. These findings are consistent with the view that genomes are co-adapted gene complexes and that geography affects the likelihood of epistatic fitness interactions. PMID:20455929
Xu, Z C; Zhu, J
2000-01-01
According to the double-cross mating design and using principles of Cockerham's general genetic model, a genetic model with additive, dominance and epistatic effects (ADAA model) was proposed for the analysis of agronomic traits. Components of genetic effects were derived for different generations. Monte Carlo simulation was conducted for analyzing the ADAA model and its reduced AD model by using different generations. It was indicated that genetic variance components could be estimated without bias by MINQUE(1) method and genetic effects could be predicted effectively by AUP method; at least three generations (including parent, F1 of single cross and F1 of double-cross) were necessary for analyzing the ADAA model and only two generations (including parent and F1 of double-cross) were enough for the reduced AD model. When epistatic effects were taken into account, a new approach for predicting the heterosis of agronomic traits of double-crosses was given on the basis of unbiased prediction of genotypic merits of parents and their crosses. In addition, genotype x environment interaction effects and interaction heterosis due to G x E interaction were discussed briefly.
Yang, Cheng-Hong; Chuang, Li-Yeh; Lin, Yu-Da
2017-08-01
Detecting epistatic interactions in genome-wide association studies (GWAS) is a computational challenge. Such huge numbers of single-nucleotide polymorphism (SNP) combinations limit the some of the powerful algorithms to be applied to detect the potential epistasis in large-scale SNP datasets. We propose a new algorithm which combines the differential evolution (DE) algorithm with a classification based multifactor-dimensionality reduction (CMDR), termed DECMDR. DECMDR uses the CMDR as a fitness measure to evaluate values of solutions in DE process for scanning the potential statistical epistasis in GWAS. The results indicated that DECMDR outperforms the existing algorithms in terms of detection success rate by the large simulation and real data obtained from the Wellcome Trust Case Control Consortium. For running time comparison, DECMDR can efficient to apply the CMDR to detect the significant association between cases and controls amongst all possible SNP combinations in GWAS. DECMDR is freely available at https://goo.gl/p9sLuJ . chuang@isu.edu.tw or e0955767257@yahoo.com.tw. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Branham, Sandra E; Stansell, Zachary J; Couillard, David M; Farnham, Mark W
2017-03-01
Five quantitative trait loci and one epistatic interaction were associated with heat tolerance in a doubled haploid population of broccoli evaluated in three summer field trials. Predicted rising global temperatures due to climate change have generated a demand for crops that are resistant to yield and quality losses from heat stress. Broccoli (Brassica oleracea var. italica) is a cool weather crop with high temperatures during production decreasing both head quality and yield. Breeding for heat tolerance in broccoli has potential to both expand viable production areas and extend the growing season but breeding efficiency is constrained by limited genetic information. A doubled haploid (DH) broccoli population segregating for heat tolerance was evaluated for head quality in three summer fields in Charleston, SC, USA. Multiple quantitative trait loci (QTL) mapping of 1,423 single nucleotide polymorphisms developed through genotyping-by-sequencing identified five QTL and one positive epistatic interaction that explained 62.1% of variation in heat tolerance. The QTL identified here can be used to develop markers for marker-assisted selection and to increase our understanding of the molecular mechanisms underlying plant response to heat stress.
a/alpha-specific effect on the mms3 mutation on ultraviolet mutagenesis in Saccharomyces cerevisiae.
Martin, P; Prakash, L; Prakash, S
1981-05-01
A new gene involved in error-prone repair of ultraviolet (UV) damage has been identified in Saccharomyces cerevisiae by the mms3-1 mutation. UV-induced reversion is reduced in diploids that are homozygous for mms3-1, only if they are also heterozygous (MATa/MAT alpha) at the mating type locus. The mms3-1 mutation has no effect on UV-induced reversion either in haploids or MATa/MATa or MAT alpha/MAT alpha diploids. The mutation confers sensitivity to UV and methyl methane sulfonate in both haploids and diploids. Even though mutation induction by UV is restored to wild-type levels in MATa/MATa mms3-1/mms3-1 or MAT alpha/MAT alpha mms3-1/mms3-1 diploids, such strains still retain sensitivity to the lethal effects of UV. Survival after UV irradiation in mms3-1 rad double mutant combinations indicates that mms3-1 is epistatic to rad6-1 whereas non-epistatic interactions are observed with rad3 and rad52 mutants. When present in the homozygous state in MATa/MAT alpha his1-1/his1-315 heteroallelic diploids, mms3-1 was found to lower UV-induced mitotic recombination.
2010-01-01
We present an extensible software model for the genotype and phenotype community, XGAP. Readers can download a standard XGAP (http://www.xgap.org) or auto-generate a custom version using MOLGENIS with programming interfaces to R-software and web-services or user interfaces for biologists. XGAP has simple load formats for any type of genotype, epigenotype, transcript, protein, metabolite or other phenotype data. Current functionality includes tools ranging from eQTL analysis in mouse to genome-wide association studies in humans. PMID:20214801
Swertz, Morris A; Velde, K Joeri van der; Tesson, Bruno M; Scheltema, Richard A; Arends, Danny; Vera, Gonzalo; Alberts, Rudi; Dijkstra, Martijn; Schofield, Paul; Schughart, Klaus; Hancock, John M; Smedley, Damian; Wolstencroft, Katy; Goble, Carole; de Brock, Engbert O; Jones, Andrew R; Parkinson, Helen E; Jansen, Ritsert C
2010-01-01
We present an extensible software model for the genotype and phenotype community, XGAP. Readers can download a standard XGAP (http://www.xgap.org) or auto-generate a custom version using MOLGENIS with programming interfaces to R-software and web-services or user interfaces for biologists. XGAP has simple load formats for any type of genotype, epigenotype, transcript, protein, metabolite or other phenotype data. Current functionality includes tools ranging from eQTL analysis in mouse to genome-wide association studies in humans.
Ogbunugafor, C Brandon; Hartl, Daniel
2016-01-25
The study of reverse evolution from resistant to susceptible phenotypes can reveal constraints on biological evolution, a topic for which evolutionary theory has relatively few general principles. The public health catastrophe of antimicrobial resistance in malaria has brought these constraints on evolution into a practical realm, with one proposed solution: withdrawing anti-malarial medication use in high resistance settings, built on the assumption that reverse evolution occurs readily enough that populations of pathogens may revert to their susceptible states. While past studies have suggested limits to reverse evolution, there have been few attempts to properly dissect its mechanistic constraints. Growth rates were determined from empirical data on the growth and resistance from a set of combinatorially complete set of mutants of a resistance protein (dihydrofolate reductase) in Plasmodium vivax, to construct reverse evolution trajectories. The fitness effects of individual mutations were calculated as a function of drug environment, revealing the magnitude of epistatic interactions between mutations and genetic backgrounds. Evolution across the landscape was simulated in two settings: starting from the population fixed for the quadruple mutant, and from a polymorphic population evenly distributed between double mutants. A single mutation of large effect (S117N) serves as a pivot point for evolution to high resistance regions of the landscape. Through epistatic interactions with other mutations, this pivot creates an epistatic ratchet against reverse evolution towards the wild type ancestor, even in environments where the wild type is the most fit of all genotypes. This pivot mutation underlies the directional bias in evolution across the landscape, where evolution towards the ancestor is precluded across all examined drug concentrations from various starting points in the landscape. The presence of pivot mutations can dictate dynamics of evolution across adaptive landscape through epistatic interactions within a protein, leaving a population trapped on local fitness peaks in an adaptive landscape, unable to locate ancestral genotypes. This irreversibility suggests that the structure of an adaptive landscape for a resistance protein should be understood before considering resistance management strategies. This proposed mechanism for constraints on reverse evolution corroborates evidence from the field indicating that phenotypic reversal often occurs via compensatory mutation at sites independent of those associated with the forward evolution of resistance. Because of this, molecular methods that identify resistance patterns via single SNPs in resistance-associated markers might be missing signals for resistance and compensatory mutation throughout the genome. In these settings, whole genome sequencing efforts should be used to identify resistance patterns, and will likely reveal a more complicated genomic signature for resistance and susceptibility, especially in settings where anti-malarial medications have been used intermittently. Lastly, the findings suggest that, given their role in dictating the dynamics of evolution across the landscape, pivot mutations might serve as future targets for therapy.
Functional Analysis of Somatic Mutations in Lung Cancer
2015-10-01
antibody cetuximab [11]. Finally, we have developed novel single cell sequencing approaches to uncover EGFR mutational variants in glioblastoma and their...assessed which mutations are epistatic to EGFR or capable of initiating xenograft tumor formation in vivo. Using eVIP, we identified 69% of mutations...analyzed as impactful whereas 31% appear functionally neutral. A subset of the impactful mutations induce xenograft tumor formation in mice and/or
QTL Analysis of Kernel-Related Traits in Maize Using an Immortalized F2 Population
Hu, Yanmin; Li, Weihua; Fu, Zhiyuan; Ding, Dong; Li, Haochuan; Qiao, Mengmeng; Tang, Jihua
2014-01-01
Kernel size and weight are important determinants of grain yield in maize. In this study, multivariate conditional and unconditional quantitative trait loci (QTL), and digenic epistatic analyses were utilized in order to elucidate the genetic basis for these kernel-related traits. Five kernel-related traits, including kernel weight (KW), volume (KV), length (KL), thickness (KT), and width (KWI), were collected from an immortalized F2 (IF2) maize population comprising of 243 crosses performed at two separate locations over a span of two years. A total of 54 unconditional main QTL for these five kernel-related traits were identified, many of which were clustered in chromosomal bins 6.04–6.06, 7.02–7.03, and 10.06–10.07. In addition, qKL3, qKWI6, qKV10a, qKV10b, qKW10a, and qKW7a were detected across multiple environments. Sixteen main QTL were identified for KW conditioned on the other four kernel traits (KL, KWI, KT, and KV). Thirteen main QTL were identified for KV conditioned on three kernel-shape traits. Conditional mapping analysis revealed that KWI and KV had the strongest influence on KW at the individual QTL level, followed by KT, and then KL; KV was mostly strongly influenced by KT, followed by KWI, and was least impacted by KL. Digenic epistatic analysis identified 18 digenic interactions involving 34 loci over the entire genome. However, only a small proportion of them were identical to the main QTL we detected. Additionally, conditional digenic epistatic analysis revealed that the digenic epistasis for KW and KV were entirely determined by their constituent traits. The main QTL identified in this study for determining kernel-related traits with high broad-sense heritability may play important roles during kernel development. Furthermore, digenic interactions were shown to exert relatively large effects on KL (the highest AA and DD effects were 4.6% and 6.7%, respectively) and KT (the highest AA effects were 4.3%). PMID:24586932
GDSL LIPASE1 Modulates Plant Immunity through Feedback Regulation of Ethylene Signaling1[W
Kim, Hye Gi; Kwon, Sun Jae; Jang, Young Jin; Nam, Myung Hee; Chung, Joo Hee; Na, Yun-Cheol; Guo, Hongwei; Park, Ohkmae K.
2013-01-01
Ethylene is a key signal in the regulation of plant defense responses. It is required for the expression and function of GDSL LIPASE1 (GLIP1) in Arabidopsis (Arabidopsis thaliana), which plays an important role in plant immunity. Here, we explore molecular mechanisms underlying the relationship between GLIP1 and ethylene signaling by an epistatic analysis of ethylene response mutants and GLIP1-overexpressing (35S:GLIP1) plants. We show that GLIP1 expression is regulated by ethylene signaling components and, further, that GLIP1 expression or application of petiole exudates from 35S:GLIP1 plants affects ethylene signaling both positively and negatively, leading to ETHYLENE RESPONSE FACTOR1 activation and ETHYLENE INSENSITIVE3 (EIN3) down-regulation, respectively. Additionally, 35S:GLIP1 plants or their exudates increase the expression of the salicylic acid biosynthesis gene SALICYLIC ACID INDUCTION-DEFICIENT2, known to be inhibited by EIN3 and EIN3-LIKE1. These results suggest that GLIP1 regulates plant immunity through positive and negative feedback regulation of ethylene signaling, and this is mediated by its activity to accumulate a systemic signal(s) in the phloem. We propose a model explaining how GLIP1 regulates the fine-tuning of ethylene signaling and ethylene-salicylic acid cross talk. PMID:24170202
Heritability and genetic basis of protein level variation in an outbred population
Liu, Yi-Chun; Tekkedil, Manu M.; Steinmetz, Lars M.; Caudy, Amy A.; Fraser, Andrew G.
2014-01-01
The genetic basis of heritable traits has been studied for decades. Although recent mapping efforts have elucidated genetic determinants of transcript levels, mapping of protein abundance has lagged. Here, we analyze levels of 4084 GFP-tagged yeast proteins in the progeny of a cross between a laboratory and a wild strain using flow cytometry and high-content microscopy. The genotype of trans variants contributed little to protein level variation between individual cells but explained >50% of the variance in the population’s average protein abundance for half of the GFP fusions tested. To map trans-acting factors responsible, we performed flow sorting and bulk segregant analysis of 25 proteins, finding a median of five protein quantitative trait loci (pQTLs) per GFP fusion. Further, we find that cis-acting variants predominate; the genotype of a gene and its surrounding region had a large effect on protein level six times more frequently than the rest of the genome combined. We present evidence for both shared and independent genetic control of transcript and protein abundance: More than half of the expression QTLs (eQTLs) contribute to changes in protein levels of regulated genes, but several pQTLs do not affect their cognate transcript levels. Allele replacements of genes known to underlie trans eQTL hotspots confirmed the correlation of effects on mRNA and protein levels. This study represents the first genome-scale measurement of genetic contribution to protein levels in single cells and populations, identifies more than a hundred trans pQTLs, and validates the propagation of effects associated with transcript variation to protein abundance. PMID:24823668
SZDB: A Database for Schizophrenia Genetic Research
Wu, Yong; Yao, Yong-Gang
2017-01-01
Abstract Schizophrenia (SZ) is a debilitating brain disorder with a complex genetic architecture. Genetic studies, especially recent genome-wide association studies (GWAS), have identified multiple variants (loci) conferring risk to SZ. However, how to efficiently extract meaningful biological information from bulk genetic findings of SZ remains a major challenge. There is a pressing need to integrate multiple layers of data from various sources, eg, genetic findings from GWAS, copy number variations (CNVs), association and linkage studies, gene expression, protein–protein interaction (PPI), co-expression, expression quantitative trait loci (eQTL), and Encyclopedia of DNA Elements (ENCODE) data, to provide a comprehensive resource to facilitate the translation of genetic findings into SZ molecular diagnosis and mechanism study. Here we developed the SZDB database (http://www.szdb.org/), a comprehensive resource for SZ research. SZ genetic data, gene expression data, network-based data, brain eQTL data, and SNP function annotation information were systematically extracted, curated and deposited in SZDB. In-depth analyses and systematic integration were performed to identify top prioritized SZ genes and enriched pathways. Multiple types of data from various layers of SZ research were systematically integrated and deposited in SZDB. In-depth data analyses and integration identified top prioritized SZ genes and enriched pathways. We further showed that genes implicated in SZ are highly co-expressed in human brain and proteins encoded by the prioritized SZ risk genes are significantly interacted. The user-friendly SZDB provides high-confidence candidate variants and genes for further functional characterization. More important, SZDB provides convenient online tools for data search and browse, data integration, and customized data analyses. PMID:27451428
Inherited Variants in Regulatory T Cell Genes and Outcome of Ovarian Cancer
Goode, Ellen L.; DeRycke, Melissa; Kalli, Kimberly R.; Oberg, Ann L.; Cunningham, Julie M.; Maurer, Matthew J.; Fridley, Brooke L.; Armasu, Sebastian M.; Serie, Daniel J.; Ramar, Priya; Goergen, Krista; Vierkant, Robert A.; Rider, David N.; Sicotte, Hugues; Wang, Chen; Winterhoff, Boris; Phelan, Catherine M.; Schildkraut, Joellen M.; Weber, Rachel P.; Iversen, Ed; Berchuck, Andrew; Sutphen, Rebecca; Birrer, Michael J.; Hampras, Shalaka; Preus, Leah; Gayther, Simon A.; Ramus, Susan J.; Wentzensen, Nicolas; Yang, Hannah P.; Garcia-Closas, Montserrat; Song, Honglin; Tyrer, Jonathan; Pharoah, Paul P. D.; Konecny, Gottfried; Sellers, Thomas A.; Ness, Roberta B.; Sucheston, Lara E.; Odunsi, Kunle; Hartmann, Lynn C.; Moysich, Kirsten B.; Knutson, Keith L.
2013-01-01
Although ovarian cancer is the most lethal of gynecologic malignancies, wide variation in outcome following conventional therapy continues to exist. The presence of tumor-infiltrating regulatory T cells (Tregs) has a role in outcome of this disease, and a growing body of data supports the existence of inherited prognostic factors. However, the role of inherited variants in genes encoding Treg-related immune molecules has not been fully explored. We analyzed expression quantitative trait loci (eQTL) and sequence-based tagging single nucleotide polymorphisms (tagSNPs) for 54 genes associated with Tregs in 3,662 invasive ovarian cancer cases. With adjustment for known prognostic factors, suggestive results were observed among rarer histological subtypes; poorer survival was associated with minor alleles at SNPs in RGS1 (clear cell, rs10921202, p = 2.7×10−5), LRRC32 and TNFRSF18/TNFRSF4 (mucinous, rs3781699, p = 4.5×10−4, and rs3753348, p = 9.0×10−4, respectively), and CD80 (endometrioid, rs13071247, p = 8.0×10−4). Fo0r the latter, correlative data support a CD80 rs13071247 genotype association with CD80 tumor RNA expression (p = 0.006). An additional eQTL SNP in CD80 was associated with shorter survival (rs7804190, p = 8.1×10−4) among all cases combined. As the products of these genes are known to affect induction, trafficking, or immunosuppressive function of Tregs, these results suggest the need for follow-up phenotypic studies. PMID:23382860
Shan, Jingxuan; Al-Rumaihi, Khalid; Rabah, Danny; Al-Bozom, Issam; Kizhakayil, Dhanya; Farhat, Karim; Al-Said, Sami; Kfoury, Hala; Dsouza, Shoba P; Rowe, Jillian; Khalak, Hanif G; Jafri, Shahzad; Aigha, Idil I; Chouchane, Lotfi
2013-05-13
Large databases focused on genetic susceptibility to prostate cancer have been accumulated from population studies of different ancestries, including Europeans and African-Americans. Arab populations, however, have been only rarely studied. Using Affymetrix Genome-Wide Human SNP Array 6, we conducted a genome-wide association study (GWAS) in which 534,781 single nucleotide polymorphisms (SNPs) were genotyped in 221 Tunisians (90 prostate cancer patients and 131 age-matched healthy controls). TaqMan SNP Genotyping Assays on 11 prostate cancer associated SNPs were performed in a distinct cohort of 337 individuals from Arab ancestry living in Qatar and Saudi Arabia (155 prostate cancer patients and 182 age-matched controls). In-silico expression quantitative trait locus (eQTL) analysis along with mRNA quantification of nearby genes was performed to identify loci potentially cis-regulated by the identified SNPs. Three chromosomal regions, encompassing 14 SNPs, are significantly associated with prostate cancer risk in the Tunisian population (P = 1 × 10-4 to P = 1 × 10-5). In addition to SNPs located on chromosome 17q21, previously found associated with prostate cancer in Western populations, two novel chromosomal regions are revealed on chromosome 9p24 and 22q13. eQTL analysis and mRNA quantification indicate that the prostate cancer associated SNPs of chromosome 17 could enhance the expression of STAT5B gene. Our findings, identifying novel GWAS prostate cancer susceptibility loci, indicate that prostate cancer genetic risk factors could be ethnic specific.
QuASAR: quantitative allele-specific analysis of reads
Harvey, Chris T.; Moyerbrailean, Gregory A.; Davis, Gordon O.; Wen, Xiaoquan; Luca, Francesca; Pique-Regi, Roger
2015-01-01
Motivation: Expression quantitative trait loci (eQTL) studies have discovered thousands of genetic variants that regulate gene expression, enabling a better understanding of the functional role of non-coding sequences. However, eQTL studies are costly, requiring large sample sizes and genome-wide genotyping of each sample. In contrast, analysis of allele-specific expression (ASE) is becoming a popular approach to detect the effect of genetic variation on gene expression, even within a single individual. This is typically achieved by counting the number of RNA-seq reads matching each allele at heterozygous sites and testing the null hypothesis of a 1:1 allelic ratio. In principle, when genotype information is not readily available, it could be inferred from the RNA-seq reads directly. However, there are currently no existing methods that jointly infer genotypes and conduct ASE inference, while considering uncertainty in the genotype calls. Results: We present QuASAR, quantitative allele-specific analysis of reads, a novel statistical learning method for jointly detecting heterozygous genotypes and inferring ASE. The proposed ASE inference step takes into consideration the uncertainty in the genotype calls, while including parameters that model base-call errors in sequencing and allelic over-dispersion. We validated our method with experimental data for which high-quality genotypes are available. Results for an additional dataset with multiple replicates at different sequencing depths demonstrate that QuASAR is a powerful tool for ASE analysis when genotypes are not available. Availability and implementation: http://github.com/piquelab/QuASAR. Contact: fluca@wayne.edu or rpique@wayne.edu Supplementary information: Supplementary Material is available at Bioinformatics online. PMID:25480375
Host genetic variation influences gene expression response to rhinovirus infection.
Çalışkan, Minal; Baker, Samuel W; Gilad, Yoav; Ober, Carole
2015-04-01
Rhinovirus (RV) is the most prevalent human respiratory virus and is responsible for at least half of all common colds. RV infections may result in a broad spectrum of effects that range from asymptomatic infections to severe lower respiratory illnesses. The basis for inter-individual variation in the response to RV infection is not well understood. In this study, we explored whether host genetic variation is associated with variation in gene expression response to RV infections between individuals. To do so, we obtained genome-wide genotype and gene expression data in uninfected and RV-infected peripheral blood mononuclear cells (PBMCs) from 98 individuals. We mapped local and distant genetic variation that is associated with inter-individual differences in gene expression levels (eQTLs) in both uninfected and RV-infected cells. We focused specifically on response eQTLs (reQTLs), namely, genetic associations with inter-individual variation in gene expression response to RV infection. We identified local reQTLs for 38 genes, including genes with known functions in viral response (UBA7, OAS1, IRF5) and genes that have been associated with immune and RV-related diseases (e.g., ITGA2, MSR1, GSTM3). The putative regulatory regions of genes with reQTLs were enriched for binding sites of virus-activated STAT2, highlighting the role of condition-specific transcription factors in genotype-by-environment interactions. Overall, we suggest that the 38 loci associated with inter-individual variation in gene expression response to RV-infection represent promising candidates for affecting immune and RV-related respiratory diseases.
Human brain arousal in the resting state: a genome-wide association study.
Jawinski, Philippe; Kirsten, Holger; Sander, Christian; Spada, Janek; Ulke, Christine; Huang, Jue; Burkhardt, Ralph; Scholz, Markus; Hensch, Tilman; Hegerl, Ulrich
2018-04-27
Arousal affects cognition, emotion, and behavior and has been implicated in the etiology of psychiatric disorders. Although environmental conditions substantially contribute to the level of arousal, stable interindividual characteristics are well-established and a genetic basis has been suggested. Here we investigated the molecular genetics of brain arousal in the resting state by conducting a genome-wide association study (GWAS). We selected N = 1877 participants from the population-based LIFE-Adult cohort. Participants underwent a 20-min eyes-closed resting state EEG, which was analyzed using the computerized VIGALL 2.1 (Vigilance Algorithm Leipzig). At the SNP-level, GWAS analyses revealed no genome-wide significant locus (p < 5E-8), although seven loci were suggestive (p < 1E-6). The strongest hit was an expression quantitative trait locus (eQTL) of TMEM159 (lead-SNP: rs79472635, p = 5.49E-8). Importantly, at the gene-level, GWAS analyses revealed significant evidence for TMEM159 (p = 0.013, Bonferroni-corrected). By mapping our SNPs to the GWAS results from the Psychiatric Genomics Consortium, we found that all corresponding markers of TMEM159 showed nominally significant associations with Major Depressive Disorder (MDD; 0.006 ≤ p ≤ 0.011). More specifically, variants associated with high arousal levels have previously been linked to an increased risk for MDD. In line with this, the MetaXcan database suggests increased expression levels of TMEM159 in MDD, as well as Autism Spectrum Disorder, and Alzheimer's Disease. Furthermore, our pathway analyses provided evidence for a role of sodium/calcium exchangers in resting state arousal. In conclusion, the present GWAS identifies TMEM159 as a novel candidate gene which may modulate the risk for psychiatric disorders through arousal mechanisms. Our results also encourage the elaboration of the previously reported interrelations between ion-channel modulators, sleep-wake behavior, and psychiatric disorders.
USDA-ARS?s Scientific Manuscript database
Genetic marker effects and type of inheritance are estimated with poor precision when minor marker allele frequencies are low. A stable composite population (MARC II) was subjected to marker assisted selection for two years to equalize CSN1S1 and TG genetic marker frequencies to evaluate the epista...
Evo-Devo-EpiR: a genome-wide search platform for epistatic control on the evolution of development.
Jiang, Libo; Zhang, Miaomiao; Sang, Mengmeng; Ye, Meixia; Wu, Rongling
2017-09-01
Evo-devo is a theory proposed to study how phenotypes evolve by comparing the developmental processes of different organisms or the same organism experiencing changing environments. It has been recognized that nonallelic interactions at different genes or quantitative trait loci, known as epistasis, may play a pivotal role in the evolution of development, but it has proven difficult to quantify and elucidate this role into a coherent picture. We implement a high-dimensional genome-wide association study model into the evo-devo paradigm and pack it into the R-based Evo-Devo-EpiR, aimed at facilitating the genome-wide landscaping of epistasis for the diversification of phenotypic development. By analyzing a high-throughput assay of DNA markers and their pairs simultaneously, Evo-Devo-EpiR is equipped with a capacity to systematically characterize various epistatic interactions that impact on the pattern and timing of development and its evolution. Enabling a global search for all possible genetic interactions for developmental processes throughout the whole genome, Evo-Devo-EpiR provides a computational tool to illustrate a precise genotype-phenotype map at interface between epistasis, development and evolution. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Mathew, Boby; Léon, Jens; Sannemann, Wiebke; Sillanpää, Mikko J.
2018-01-01
Gene-by-gene interactions, also known as epistasis, regulate many complex traits in different species. With the availability of low-cost genotyping it is now possible to study epistasis on a genome-wide scale. However, identifying genome-wide epistasis is a high-dimensional multiple regression problem and needs the application of dimensionality reduction techniques. Flowering Time (FT) in crops is a complex trait that is known to be influenced by many interacting genes and pathways in various crops. In this study, we successfully apply Sure Independence Screening (SIS) for dimensionality reduction to identify two-way and three-way epistasis for the FT trait in a Multiparent Advanced Generation Inter-Cross (MAGIC) barley population using the Bayesian multilocus model. The MAGIC barley population was generated from intercrossing among eight parental lines and thus, offered greater genetic diversity to detect higher-order epistatic interactions. Our results suggest that SIS is an efficient dimensionality reduction approach to detect high-order interactions in a Bayesian multilocus model. We also observe that many of our findings (genomic regions with main or higher-order epistatic effects) overlap with known candidate genes that have been already reported in barley and closely related species for the FT trait. PMID:29254994
Bergman, Juraj; Mitrikeski, Petar T.
2015-01-01
Summary Sporulation efficiency in the yeast Saccharomyces cerevisiae is a well-established model for studying quantitative traits. A variety of genes and nucleotides causing different sporulation efficiencies in laboratory, as well as in wild strains, has already been extensively characterised (mainly by reciprocal hemizygosity analysis and nucleotide exchange methods). We applied a different strategy in order to analyze the variation in sporulation efficiency of laboratory yeast strains. Coupling classical quantitative genetic analysis with simulations of phenotypic distributions (a method we call phenotype modelling) enabled us to obtain a detailed picture of the quantitative trait loci (QTLs) relationships underlying the phenotypic variation of this trait. Using this approach, we were able to uncover a dominant epistatic inheritance of loci governing the phenotype. Moreover, a molecular analysis of known causative quantitative trait genes and nucleotides allowed for the detection of novel alleles, potentially responsible for the observed phenotypic variation. Based on the molecular data, we hypothesise that the observed dominant epistatic relationship could be caused by the interaction of multiple quantitative trait nucleotides distributed across a 60--kb QTL region located on chromosome XIV and the RME1 locus on chromosome VII. Furthermore, we propose a model of molecular pathways which possibly underlie the phenotypic variation of this trait. PMID:27904371
Linkage and Segregation Analysis of Black and Brindle Coat Color in Domestic Dogs
Kerns, Julie A.; Cargill, Edward J.; Clark, Leigh Anne; Candille, Sophie I.; Berryere, Tom G.; Olivier, Michael; Lust, George; Todhunter, Rory J.; Schmutz, Sheila M.; Murphy, Keith E.; Barsh, Gregory S.
2007-01-01
Mutations of pigment type switching have provided basic insight into melanocortin physiology and evolutionary adaptation. In all vertebrates that have been studied to date, two key genes, Agouti and Melanocortin 1 receptor (Mc1r), encode a ligand-receptor system that controls the switch between synthesis of red–yellow pheomelanin vs. black–brown eumelanin. However, in domestic dogs, historical studies based on pedigree and segregation analysis have suggested that the pigment type-switching system is more complicated and fundamentally different from other mammals. Using a genomewide linkage scan on a Labrador × greyhound cross segregating for black, yellow, and brindle coat colors, we demonstrate that pigment type switching is controlled by an additional gene, the K locus. Our results reveal three alleles with a dominance order of black (KB) > brindle (kbr) > yellow (ky), whose genetic map position on dog chromosome 16 is distinct from the predicted location of other pigmentation genes. Interaction studies reveal that Mc1r is epistatic to variation at Agouti or K and that the epistatic relationship between Agouti and K depends on the alleles being tested. These findings suggest a molecular model for a new component of the melanocortin signaling pathway and reveal how coat-color patterns and pigmentary diversity have been shaped by recent selection. PMID:17483404
Epistatic interaction between the lipase-encoding genes Pnpla2 and Lipe causes liposarcoma in mice
Wang, Shu Pei; Yang, Hao; Ji, Bo; Gladdy, Rebecca; Andelfinger, Gregor; Mitchell, Grant A.
2017-01-01
Liposarcoma is an often fatal cancer of fat cells. Mechanisms of liposarcoma development are incompletely understood. The cleavage of fatty acids from acylglycerols (lipolysis) has been implicated in cancer. We generated mice with adipose tissue deficiency of two major enzymes of lipolysis, adipose triglyceride lipase (ATGL) and hormone-sensitive lipase (HSL), encoded respectively by Pnpla2 and Lipe. Adipocytes from double adipose knockout (DAKO) mice, deficient in both ATGL and HSL, showed near-complete deficiency of lipolysis. All DAKO mice developed liposarcoma between 11 and 14 months of age. No tumors occurred in single knockout or control mice. The transcriptome of DAKO adipose tissue showed marked differences from single knockout and normal controls as early as 3 months. Gpnmb and G0s2 were among the most highly dysregulated genes in premalignant and malignant DAKO adipose tissue, suggesting a potential utility as early markers of the disease. Similar changes of GPNMB and G0S2 expression were present in a human liposarcoma database. These results show that a previously-unknown, fully penetrant epistatic interaction between Pnpla2 and Lipe can cause liposarcoma in mice. DAKO mice provide a promising model for studying early premalignant changes that lead to late-onset malignant disease. PMID:28459858
Bocianowski, Jan
2013-03-01
Epistasis, an additive-by-additive interaction between quantitative trait loci, has been defined as a deviation from the sum of independent effects of individual genes. Epistasis between QTLs assayed in populations segregating for an entire genome has been found at a frequency close to that expected by chance alone. Recently, epistatic effects have been considered by many researchers as important for complex traits. In order to understand the genetic control of complex traits, it is necessary to clarify additive-by-additive interactions among genes. Herein we compare estimates of a parameter connected with the additive gene action calculated on the basis of two models: a model excluding epistasis and a model with additive-by-additive interaction effects. In this paper two data sets were analysed: 1) 150 barley doubled haploid lines derived from the Steptoe × Morex cross, and 2) 145 DH lines of barley obtained from the Harrington × TR306 cross. The results showed that in cases when the effect of epistasis was different from zero, the coefficient of determination was larger for the model with epistasis than for the one excluding epistasis. These results indicate that epistatic interaction plays an important role in controlling the expression of complex traits.
An Evolutionary Perspective on Epistasis and the Missing Heritability
Hemani, Gibran; Knott, Sara; Haley, Chris
2013-01-01
The relative importance between additive and non-additive genetic variance has been widely argued in quantitative genetics. By approaching this question from an evolutionary perspective we show that, while additive variance can be maintained under selection at a low level for some patterns of epistasis, the majority of the genetic variance that will persist is actually non-additive. We propose that one reason that the problem of the “missing heritability” arises is because the additive genetic variation that is estimated to be contributing to the variance of a trait will most likely be an artefact of the non-additive variance that can be maintained over evolutionary time. In addition, it can be shown that even a small reduction in linkage disequilibrium between causal variants and observed SNPs rapidly erodes estimates of epistatic variance, leading to an inflation in the perceived importance of additive effects. We demonstrate that the perception of independent additive effects comprising the majority of the genetic architecture of complex traits is biased upwards and that the search for causal variants in complex traits under selection is potentially underpowered by parameterising for additive effects alone. Given dense SNP panels the detection of causal variants through genome-wide association studies may be improved by searching for epistatic effects explicitly. PMID:23509438
Otoupal, Peter B; Erickson, Keesha E; Escalas-Bordoy, Antoni; Chatterjee, Anushree
2017-01-20
The evolution of antibiotic resistance has engendered an impending global health crisis that necessitates a greater understanding of how resistance emerges. The impact of nongenetic factors and how they influence the evolution of resistance is a largely unexplored area of research. Here we present a novel application of CRISPR-Cas9 technology for investigating how gene expression governs the adaptive pathways available to bacteria during the evolution of resistance. We examine the impact of gene expression changes on bacterial adaptation by constructing a library of deactivated CRISPR-Cas9 synthetic devices to tune the expression of a set of stress-response genes in Escherichia coli. We show that artificially inducing perturbations in gene expression imparts significant synthetic control over fitness and growth during stress exposure. We present evidence that these impacts are reversible; strains with synthetically perturbed gene expression regained wild-type growth phenotypes upon stress removal, while maintaining divergent growth characteristics under stress. Furthermore, we demonstrate a prevailing trend toward negative epistatic interactions when multiple gene perturbations are combined simultaneously, thereby posing an intrinsic constraint on gene expression underlying adaptive trajectories. Together, these results emphasize how CRISPR-Cas9 can be employed to engineer gene expression changes that shape bacterial adaptation, and present a novel approach to synthetically control the evolution of antimicrobial resistance.
Riveira-Munoz, Eva; He, Su-Min; Escaramís, Georgia; Stuart, Philip E; Hüffmeier, Ulrike; Lee, Catherine; Kirby, Brian; Oka, Akira; Giardina, Emiliano; Liao, Wilson; Bergboer, Judith; Kainu, Kati; de Cid, Rafael; Munkhbat, Batmunkh; Zeeuwen, Patrick L J M; Armour, John A L; Poon, Annie; Mabuchi, Tomotaka; Ozawa, Akira; Zawirska, Agnieszka; Burden, David A; Barker, Jonathan N; Capon, Francesca; Traupe, Heiko; Sun, Liang-Dan; Cui, Yong; Yin, Xian-Yong; Chen, Gang; Lim, Henry; Nair, Rajan; Voorhess, John; Tejasvi, Trilokraj; Pujol, Ramón; Munkhtuvshin, Namid; Fischer, Judith; Kere, Juha; Schalkwijk, Joost; Bowcock, Anne; Kwok, Pui-Yan; Novelli, Giuseppe; Inoko, Hidetoshi; Ryan, Anthony W; Trembath, Richard C; Reis, André; Zhang, Xue-Jun; Elder, James T; Estivill, Xavier
2012-01-01
A multicenter meta-analysis including data from 9389 psoriasis patients and 9477 control subjects was performed to investigate the contribution of the deletion of genes LCE3C and LCE3B, involved in skin barrier defense, to psoriasis susceptibility in different populations. The study confirms that the deletion of LCE3C and LCE3B is a common genetic factor for susceptibility to psoriasis in European populations [OROverall = 1.21 (1.15–1.27)], and for the first time directly demonstrated the deletion's association with psoriasis in [Chinese OR = 1.27 (1.16–1.34); Mongolian OR = 2.08 (1.44–2.99)] populations. The analysis of the HLA-Cw6 locus showed significant differences in the epistatic interaction with the LCE3C and LCE3B deletion in at least some European populations, indicating epistatic effects between these two major genetic contributors to psoriasis. The study highlights the value of examining genetic risk factors in multiple populations to identify genetic interactions, and indicates the need of further studies to understand the interaction of the skin barrier and the immune system in susceptibility to psoriasis. PMID:21107349
Ugolini, S; Bruschi, C V
1996-12-01
In the yeast Saccharomyces cerevisiae the ade2, and/or the ade1, mutation in the adenine biosynthetic pathway leads to the accumulation of a cell-limited red pigment, while epistatic mutations in the same pathway, i.e. ade8, preclude this phenomenon, resulting in normal white colonies. The shift in color from red to white (or vice versa) with a combination of appropriate wild-type and mutant alleles of the adenine-pathway genes has been widely utilized as a non-selective phenotype to visualise and quantify the occurrence of various genetic events such as recombination, conversion and aneuploidy. It has provided an invaluable tool for the study of gene dosage and plasmid stability. In competition experiments between disrupted ade2, ade8-18 transformants carrying either a functional or non-functional episomal ADE8 gene, we verified that white ade8 ade2 cells show a remarkable selective advantage over red ade2 cells, with important implications on the use of this assay for the monitoring of genetic events. The accumulation of the red pigment in ade2 cells is likely to be the cause for impaired growth in these cells.
Sabidó, Eduard; Bosch, Elena
2016-01-01
Essential trace elements possess vital functions at molecular, cellular, and physiological levels in health and disease, and they are tightly regulated in the human body. In order to assess variability and potential adaptive evolution of trace element homeostasis, we quantified 18 trace elements in 150 liver samples, together with the expression levels of 90 genes and abundances of 40 proteins involved in their homeostasis. Additionally, we genotyped 169 single nucleotide polymorphism (SNPs) in the same sample set. We detected significant associations for 8 protein quantitative trait loci (pQTL), 10 expression quantitative trait loci (eQTLs), and 15 micronutrient quantitative trait loci (nutriQTL). Six of these exceeded the false discovery rate cutoff and were related to essential trace elements: 1) one pQTL for GPX2 (rs10133290); 2) two previously described eQTLs for HFE (rs12346) and SELO (rs4838862) expression; and 3) three nutriQTLs: The pathogenic C282Y mutation at HFE affecting iron (rs1800562), and two SNPs within several clustered metallothionein genes determining selenium concentration (rs1811322 and rs904773). Within the complete set of significant QTLs (which involved 30 SNPs and 20 gene regions), we identified 12 SNPs with extreme patterns of population differentiation (FST values in the top 5% percentile in at least one HapMap population pair) and significant evidence for selective sweeps involving QTLs at GPX1, SELENBP1, GPX3, SLC30A9, and SLC39A8. Overall, this detailed study of various molecular phenotypes illustrates the role of regulatory variants in explaining differences in trace element homeostasis among populations and in the human adaptive response to environmental pressures related to micronutrients. PMID:26582562
Nieuwenhuis, Maartje A.; Siedlinski, Matteusz; van den Berge, Maarten; Granell, Raquel; Li, Xingnan; Niens, Marijke; van der Vlies, Pieter; Altmüller, Janine; Nürnberg, Peter; Kerkhof, Marjan; van Schayck, Onno C.; Riemersma, Ronald A.; van der Molen, Thys; de Monchy, Jan G.; Bossé, Yohan; Sandford, Andrew; Bruijnzeel-Koomen, Carla A.; van Wijk, Roy G.; ten Hacken, Nick H.; Timens, Wim; Boezen, H. Marike; Henderson, John; Kabesch, Michael; Vonk, Judith M.; Postma, Dirkje S.; Koppelman, Gerard H.
2016-01-01
Background Genome wide association studies (GWAS) of asthma have identified single nucleotide polymorphisms (SNPs) that modestly increase the risk for asthma. This could be due to phenotypic heterogeneity of asthma. Bronchial hyperresponsiveness (BHR) is a phenotypic hallmark of asthma. We aim to identify susceptibility genes for asthma combined with BHR and analyse the presence of cis-eQTLs among replicated SNPs. Secondly, we compare the genetic association of SNPs previously associated with (doctor diagnosed) asthma to our GWAS of asthma with BHR. Methods A GWAS was performed in 920 asthmatics with BHR and 980 controls. Top SNPs of our GWAS were analysed in four replication cohorts and lung cis-eQTL analysis was performed on replicated SNPs. We investigated association of SNPs previously associated with asthma in our data. Results 368 SNPs were followed up for replication. Six SNPs in genes encoding ABI3BP, NAF1, MICA and the 17q21 locus replicated in one or more cohorts, with one locus (17q21) achieving genome wide significance after meta-analysis. Five out of 6 replicated SNPs regulated 35 gene transcripts in whole lung. Eight of 20 asthma associated SNPs from previous GWAS were significantly associated with asthma and BHR. Three SNPs, in IL-33 and GSDMB, showed larger effect sizes in our data compared to published literature. Conclusions Combining GWAS with subsequent lung eQTL analysis revealed disease associated SNPs regulating lung mRNA expression levels of potential new asthma genes. Adding BHR to the asthma definition does not lead to an overall larger genetic effect size than analysing (doctor’s diagnosed) asthma. PMID:27439200
Sex difference in EGFR pathways in mouse kidney-potential impact on the immune system.
Liu, Fengxia; Jiao, Yan; Jiao, Yun; Garcia-Godoy, Franklin; Gu, Weikuan; Liu, Qingyi
2016-11-24
Epidermal growth factor receptor (Egfr) has been the target of several drugs for cancers. The potential gender differences in genes in the Egfr axis have been suggested in humans and in animal models. Female and male mice from the same recombinant inbred (RI) strain have the same genomic components except the sex difference. A population of different RI mouse strains allows to conduct precise analysis of molecular pathways and regulation of Egfr between female and male mice. The whole genome expression profiles of 70 genetically diverse RI strains of mice were used to compare three major molecular aspects of Egfr gene: the relative expression levels, gene network and expression quantitative trait loci (eQTL) that regulate the expression of Egfr between female and male mice. Our data showed that there is a significant sex difference in the expression levels in kidney. A considerable number of genes in the gene network of Egfr are sex differentially expressed. The expression levels of Egfr in mice are statistical significant different between C57BL/6 J (B6) and DBA/2 J (D2) genotypes in male while no difference in female mice. The eQTLs that regulate the expression levels of Egfr between female and male mice are also different. Furthermore, the differential expression levels of Egfr showed significantly different correlations with two known biological traits between male and female mice. Overall there is a substantial sex difference in the Egfr pathways in mice. These data may have significant impact on drug target design, development, formulation, and dosage determinant for women and men in clinical trials.
Wang, Qian; Yang, Can; Gelernter, Joel; Zhao, Hongyu
2015-01-01
Although some existing epidemiological observations and molecular experiments suggested that brain disorders in the realm of psychiatry may be influenced by immune dysregulation, the degree of genetic overlap between psychiatric disorders and immune disorders has not been well established. We investigated this issue by integrative analysis of genome-wide association studies of 18 complex human traits/diseases (five psychiatric disorders, seven immune disorders, and others) and multiple genome-wide annotation resources (Central nervous system genes, immune-related expression-quantitative trait loci (eQTL) and DNase I hypertensive sites from 98 cell-lines). We detected pleiotropy in 24 of the 35 psychiatric-immune disorder pairs. The strongest pleiotropy was observed for schizophrenia-rheumatoid arthritis with MHC region included in the analysis (p = 3.9 × 10−285), and schizophrenia-Crohns disease with MHC region excluded (p = 1.1 × 10−36). Significant enrichment (>1.4 fold) of immune-related eQTL was observed in four psychiatric disorders. Genomic regions responsible for pleiotropy between psychiatric disorders and immune disorders were detected. The MHC region on chromosome 6 appears to be the most important with other regions, such as cytoband 1p13.2, also playing significant roles in pleiotropy. We also found that most alleles shared between schizophrenia and Crohns disease have the same effect direction, with similar trend found for other disorder pairs, such as bipolar-Crohn’s disease. Our results offer a novel birds-eye view of the genetic relationship and demonstrate strong evidence for pervasive pleiotropy between psychiatric disorders and immune disorders. Our findings might open new routes for prevention and treatment strategies for these disorders based on a new appreciation of the importance of immunological mechanisms in mediating risk of many psychiatric diseases. PMID:26340901
Surana, Priyanka; Xu, Ruo; Fuerst, Gregory; Chapman, Antony V. E.; Nettleton, Dan; Wise, Roger P.
2017-01-01
Powdery mildew pathogens colonize over 9500 plant species, causing critical yield loss. The Ascomycete fungus, Blumeria graminis f. sp. hordei (Bgh), causes powdery mildew disease in barley (Hordeum vulgare L.). Successful infection begins with penetration of host epidermal cells, culminating in haustorial feeding structures, facilitating delivery of fungal effectors to the plant and exchange of nutrients from host to pathogen. We used expression Quantitative Trait Locus (eQTL) analysis to dissect the temporal control of immunity-associated gene expression in a doubled haploid barley population challenged with Bgh. Two highly significant regions possessing trans eQTL were identified near the telomeric ends of chromosomes (Chr) 2HL and 1HS. Within these regions reside diverse resistance loci derived from barley landrace H. laevigatum (MlLa) and H. vulgare cv. Algerian (Mla1), which associate with the altered expression of 961 and 3296 genes during fungal penetration of the host and haustorial development, respectively. Regulatory control of transcript levels for 299 of the 961 genes is reprioritized from MlLa on 2HL to Mla1 on 1HS as infection progresses, with 292 of the 299 alternating the allele responsible for higher expression, including Adaptin Protein-2 subunit μ AP2M and Vesicle Associated Membrane Protein VAMP72 subfamily members VAMP721/722. AP2M mediates effector-triggered immunity (ETI) via endocytosis of plasma membrane receptor components. VAMP721/722 and SNAP33 form a Soluble N-ethylmaleimide-sensitive factor Attachment Protein REceptor (SNARE) complex with SYP121 (PEN1), which is engaged in pathogen associated molecular pattern (PAMP)-triggered immunity via exocytosis. We postulate that genes regulated by alternate chromosomal positions are repurposed as part of a conserved immune complex to respond to different pathogen attack scenarios. PMID:28790145
Backenroth, Daniel; He, Zihuai; Kiryluk, Krzysztof; Boeva, Valentina; Pethukova, Lynn; Khurana, Ekta; Christiano, Angela; Buxbaum, Joseph D; Ionita-Laza, Iuliana
2018-05-03
We describe a method based on a latent Dirichlet allocation model for predicting functional effects of noncoding genetic variants in a cell-type- and/or tissue-specific way (FUN-LDA). Using this unsupervised approach, we predict tissue-specific functional effects for every position in the human genome in 127 different tissues and cell types. We demonstrate the usefulness of our predictions by using several validation experiments. Using eQTL data from several sources, including the GTEx project, Geuvadis project, and TwinsUK cohort, we show that eQTLs in specific tissues tend to be most enriched among the predicted functional variants in relevant tissues in Roadmap. We further show how these integrated functional scores can be used for (1) deriving the most likely cell or tissue type causally implicated for a complex trait by using summary statistics from genome-wide association studies and (2) estimating a tissue-based correlation matrix of various complex traits. We found large enrichment of heritability in functional components of relevant tissues for various complex traits, and FUN-LDA yielded higher enrichment estimates than existing methods. Finally, using experimentally validated functional variants from the literature and variants possibly implicated in disease by previous studies, we rigorously compare FUN-LDA with state-of-the-art functional annotation methods and show that FUN-LDA has better prediction accuracy and higher resolution than these methods. In particular, our results suggest that tissue- and cell-type-specific functional prediction methods tend to have substantially better prediction accuracy than organism-level prediction methods. Scores for each position in the human genome and for each ENCODE and Roadmap tissue are available online (see Web Resources). Copyright © 2018 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Zhou, Dan; Zhang, Dandan; Sun, Xiaohui; Li, Zhiqiang; Ni, Yaqin; Shan, Zhongyan; Li, Hong; Liu, Chengguo; Zhang, Shuai; Liu, Yi; Zheng, Ruizhi; Pan, Feixia; Zhu, Yimin; Shi, Yongyong; Lai, Maode
2018-06-01
Although numbers of genome-wide association studies (GWAS) have been performed for serum lipid levels, limited heritability has been explained. Studies showed that combining data from GWAS and expression quantitative trait loci (eQTLs) signals can both enhance the discovery of trait-associated SNPs and gain a better understanding of the mechanism. We performed an annotation-based, multistage genome-wide screening for serum-lipid-level-associated loci in totally 6863 Han Chinese. A serum high-density lipoprotein cholesterol (HDL-C) associated variant rs1880118 (hg19 chr7:g. 6435220G>C) was replicated (P combined = 1.4E-10). rs1880118 was associated with DAGLB (diacylglycerol lipase, beta) expression levels in subcutaneous adipose tissue (P = 5.9E-42) and explained 47.7% of the expression variance. After the replication, an active segment covering variants tagged by rs1880118 near 5' of DAGLB was annotated using histone modification and transcription factor binding signals. The luciferase report assay revealed that the segment containing the minor alleles showed increased transcriptional activity compared with segment contains the major alleles, which was consistent with the eQTL analyses. The expression-trait association tests indicated the association between the DAGLB and serum HDL-C levels using gene-based approaches called "TWAS" (P = 3.0E-8), "SMR" (P = 1.1E-4), and "Sherlock" (P = 1.6E-6). To summarize, we identified a novel HDL-C-associated variant which explained nearly half of the expression variance of DAGLB. Integrated analyses established a genotype-gene-phenotype three-way association and expanded our knowledge of DAGLB in lipid metabolism.
Le Clerc, Sigrid; van Manen, Daniëlle; Coulonges, Cédric; Ulveling, Damien; Laville, Vincent; Labib, Taoufik; Taing, Lieng; Delaneau, Olivier; Montes, Matthieu; Schuitemaker, Hanneke; Zagury, Jean-François
2015-01-01
Background Many genome-wide association studies have been performed on progression towards the acquired immune deficiency syndrome (AIDS) and they mainly identified associations within the HLA loci. In this study, we demonstrate that the integration of biological information, namely gene expression data, can enhance the sensitivity of genetic studies to unravel new genetic associations relevant to AIDS. Methods We collated the biological information compiled from three databases of expression quantitative trait loci (eQTLs) involved in cells of the immune system. We derived a list of single nucleotide polymorphisms (SNPs) that are functional in that they correlate with differential expression of genes in at least two of the databases. We tested the association of those SNPs with AIDS progression in two cohorts, GRIV and ACS. Tests on permuted phenotypes of the GRIV and ACS cohorts or on randomised sets of equivalent SNPs allowed us to assess the statistical robustness of this method and to estimate the true positive rate. Results Eight genes were identified with high confidence (p = 0.001, rate of true positives 75%). Some of those genes had previously been linked with HIV infection. Notably, ENTPD4 belongs to the same family as CD39, whose expression has already been associated with AIDS progression; while DNAJB12 is part of the HSP90 pathway, which is involved in the control of HIV latency. Our study also drew our attention to lesser-known functions such as mitochondrial ribosomal proteins and a zinc finger protein, ZFP57, which could be central to the effectiveness of HIV infection. Interestingly, for six out of those eight genes, down-regulation is associated with non-progression, which makes them appealing targets to develop drugs against HIV. PMID:26367535
Williams, Stephen R; Hsu, Fang-Chi; Keene, Keith L; Chen, Wei-Min; Dzhivhuho, Godfrey; Rowles, Joe L; Southerland, Andrew M; Furie, Karen L; Rich, Stephen S; Worrall, Bradford B; Sale, Michèle M
2017-01-01
Background and Purpose von Willebrand Factor (vWF) plays an important role in thrombus formation during cerebrovascular damage. We sought to investigate the potential role of circulating vWF in recurrent cerebrovascular events and identify genetic contributors to variation in vWF level in an ischemic stroke population. Methods We analyzed the effect of circulating vWF on risk of recurrent stroke using survival models in the Vitamin Intervention for Stroke Prevention (VISP) trial as well as the utility of vWF in reclassification over traditional factors. We conducted a genome-wide association study (GWAS) with imputation, based upon 1000 Genomes Project data, for circulating vWF levels and then interrogated loci previously associated with vWF levels. We performed expression quantitative trait locus (eQTL) analysis for vWF across different tissues. Results Elevated vWF levels were associated with increased risk for recurrent stroke in VISP. Adding vWF to traditional clinical parameters also improved recurrent stroke risk prediction. We identified SNPs significantly associated with circulating vWF at the ABO locus (p < 5×10-8) and replicated findings from previous genetic associations of vWF levels in humans. eQTL analyses demonstrate that most associated ABO SNPs were also associated with vWF gene expression. Conclusions Elevated vWF levels are associated with recurrent stroke in VISP. In the VISP population, genetic determinants of vWF levels that impact vWF gene expression were identified. These data add to our knowledge of the pathophysiologic and genetic basis for recurrent stroke risk, and may have implications for clinical care decision-making. PMID:28495826
Tissue-specific impact of FADS cluster variants on FADS1 and FADS2 gene expression.
Reynolds, Lindsay M; Howard, Timothy D; Ruczinski, Ingo; Kanchan, Kanika; Seeds, Michael C; Mathias, Rasika A; Chilton, Floyd H
2018-01-01
Omega-6 (n-6) and omega-3 (n-3) long (≥ 20 carbon) chain polyunsaturated fatty acids (LC-PUFAs) play a critical role in human health and disease. Biosynthesis of LC-PUFAs from dietary 18 carbon PUFAs in tissues such as the liver is highly associated with genetic variation within the fatty acid desaturase (FADS) gene cluster, containing FADS1 and FADS2 that encode the rate-limiting desaturation enzymes in the LC-PUFA biosynthesis pathway. However, the molecular mechanisms by which FADS genetic variants affect LC-PUFA biosynthesis, and in which tissues, are unclear. The current study examined associations between common single nucleotide polymorphisms (SNPs) within the FADS gene cluster and FADS1 and FADS2 gene expression in 44 different human tissues (sample sizes ranging 70-361) from the Genotype-Tissue Expression (GTEx) Project. FADS1 and FADS2 expression were detected in all 44 tissues. Significant cis-eQTLs (within 1 megabase of each gene, False Discovery Rate, FDR<0.05, as defined by GTEx) were identified in 12 tissues for FADS1 gene expression and 23 tissues for FADS2 gene expression. Six tissues had significant (FDR< 0.05) eQTLs associated with both FADS1 and FADS2 (including artery, esophagus, heart, muscle, nerve, and thyroid). Interestingly, the identified eQTLs were consistently found to be associated in opposite directions for FADS1 and FADS2 expression. Taken together, findings from this study suggest common SNPs within the FADS gene cluster impact the transcription of FADS1 and FADS2 in numerous tissues and raise important questions about how the inverse expression of these two genes impact intermediate molecular (such a LC-PUFA and LC-PUFA-containing glycerolipid levels) and ultimately clinical phenotypes associated with inflammatory diseases and brain health.
Zhou, Fei; Wang, Yanru; Liu, Hongliang; Ready, Neal; Han, Younghun; Hung, Rayjean J.; Brhane, Yonathan; McLaughlin, John; Brennan, Paul; Bickeböller, Heike; Rosenberger, Albert; Houlston, Richard S.; Caporaso, Neil; Landi, Maria Teresa; Brüske, Irene; Risch, Angela; Ye, Yuanqing; Wu, Xifeng; Christiani, David C.; Goodman, Gary; Chen, Chu; Amos, Christopher I.; Qingyi, Wei
2017-01-01
Purpose mRNA degradation is an important regulatory step for controlling gene expression and cell functions. Genetic abnormalities of the genes involved in mRNA degradation were found to be associated with cancer risks. Therefore, we systematically investigated the roles of genetic variants of genes in the general mRNA degradation pathway in lung cancer risk. Experimental design Meta-analyses were conducted in six lung cancer genome-wide association studies (GWASs) from the Transdisciplinary Research in Cancer of the Lung and additional two GWASs from Harvard University and deCODE in the International Lung Cancer Consortium. Expression quantitative trait loci analysis (eQTL) was used for in silico functional validation of the identified significant susceptibility loci. Results This pathway-based analysis included 4,603 single nucleotide polymorphisms (SNP) in 68 genes in 14,463 lung cancer cases and 44,188 controls, of which 20 SNPs were found to be associated with lung cancer risk with a false discovery rate threshold of <0.05. Among the 11 newly identified SNPs in CNOT6, which were in high linkage disequilibrium, the rs2453176 with a RegulomDB score “1f” was chosen as the tag SNP for further analysis. We found that the rs2453176 T allele was significantly associated with lung cancer risk (odds ratio=1.11, 95% confidence interval=1.04–1.18, P=0.001) in the eight GWASs. In the eQTL analysis, we found that levels of CNOT6 mRNA expression were significantly correlated with the rs2453176 T allele, which provided additional biological basis for the observed positive association. Conclusion The CNOT6 rs2453176 SNP may be a new functional susceptible locus for lung cancer risk. PMID:27805284
Zhou, Fei; Wang, Yanru; Liu, Hongliang; Ready, Neal; Han, Younghun; Hung, Rayjean J; Brhane, Yonathan; McLaughlin, John; Brennan, Paul; Bickeböller, Heike; Rosenberger, Albert; Houlston, Richard S; Caporaso, Neil; Landi, Maria Teresa; Brüske, Irene; Risch, Angela; Ye, Yuanqing; Wu, Xifeng; Christiani, David C; Goodman, Gary; Chen, Chu; Amos, Christopher I; Wei, Qingyi
2017-04-01
mRNA degradation is an important regulatory step for controlling gene expression and cell functions. Genetic abnormalities involved in mRNA degradation genes were found to be associated with cancer risks. Therefore, we systematically investigated the roles of genetic variants in the general mRNA degradation pathway in lung cancer risk. Meta-analyses were conducted using summary data from six lung cancer genome-wide association studies (GWASs) from the Transdisciplinary Research in Cancer of the Lung and additional two GWASs from Harvard University and deCODE in the International Lung Cancer Consortium. Expression quantitative trait loci analysis (eQTL) was used for in silico functional validation of the identified significant susceptibility loci. This pathway-based analysis included 6816 single nucleotide polymorphisms (SNP) in 68 genes in 14 463 lung cancer cases and 44 188 controls. In the single-locus analysis, we found that 20 SNPs were associated with lung cancer risk with a false discovery rate threshold of <0.05. Among the 11 newly identified SNPs in CNOT6, which were in high linkage disequilibrium, the rs2453176 with a RegulomDB score "1f" was chosen as the tagSNP for further analysis. We found that the rs2453176 T allele was significantly associated with lung cancer risk (odds ratio = 1.11, 95% confidence interval = 1.04-1.18) in the eight GWASs. In the eQTL analysis, we found that levels of CNOT6 mRNA expression were significantly correlated with the rs2453176 T allele, which provided additional biological basis for the observed positive association. The CNOT6 rs2453176 SNP may be a new functional susceptible locus for lung cancer risk. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Piasecka, Barbara; Duffy, Darragh; Urrutia, Alejandra; Quach, Hélène; Patin, Etienne; Posseme, Céline; Bergstedt, Jacob; Charbit, Bruno; Rouilly, Vincent; MacPherson, Cameron R; Hasan, Milena; Albaud, Benoit; Gentien, David; Fellay, Jacques; Albert, Matthew L; Quintana-Murci, Lluis
2018-01-16
The contribution of host genetic and nongenetic factors to immunological differences in humans remains largely undefined. Here, we generated bacterial-, fungal-, and viral-induced immune transcriptional profiles in an age- and sex-balanced cohort of 1,000 healthy individuals and searched for the determinants of immune response variation. We found that age and sex affected the transcriptional response of most immune-related genes, with age effects being more stimulus-specific relative to sex effects, which were largely shared across conditions. Although specific cell populations mediated the effects of age and sex on gene expression, including CD8 + T cells for age and CD4 + T cells and monocytes for sex, we detected a direct effect of these intrinsic factors for the majority of immune genes. The mapping of expression quantitative trait loci (eQTLs) revealed that genetic factors had a stronger effect on immune gene regulation than age and sex, yet they affected a smaller number of genes. Importantly, we identified numerous genetic variants that manifested their regulatory effects exclusively on immune stimulation, including a Candida albicans -specific master regulator at the CR1 locus. These response eQTLs were enriched in disease-associated variants, particularly for autoimmune and inflammatory disorders, indicating that differences in disease risk may result from regulatory variants exerting their effects only in the presence of immune stress. Together, this study quantifies the respective effects of age, sex, genetics, and cellular heterogeneity on the interindividual variability of immune responses and constitutes a valuable resource for further exploration in the context of different infection risks or disease outcomes. Copyright © 2018 the Author(s). Published by PNAS.
Genetic neuropathology of obsessive psychiatric syndromes
Jaffe, A E; Deep-Soboslay, A; Tao, R; Hauptman, D T; Kaye, W H; Arango, V; Weinberger, D R; Hyde, T M; Kleinman, J E
2014-01-01
Anorexia nervosa (AN), bulimia nervosa (BN) and obsessive-compulsive disorder (OCD) are complex psychiatric disorders with shared obsessive features, thought to arise from the interaction of multiple genes of small effect with environmental factors. Potential candidate genes for AN, BN and OCD have been identified through clinical association and neuroimaging studies; however, recent genome-wide association studies of eating disorders (ED) so far have failed to report significant findings. In addition, few, if any, studies have interrogated postmortem brain tissue for evidence of expression quantitative trait loci (eQTLs) associated with candidate genes, which has particular promise as an approach to elucidating molecular mechanisms of association. We therefore selected single-nucleotide polymorphisms (SNPs) based on candidate gene studies for AN, BN and OCD from the literature, and examined the association of these SNPs with gene expression across the lifespan in prefrontal cortex of a nonpsychiatric control cohort (N=268). Several risk-predisposing SNPs were significantly associated with gene expression among control subjects. We then measured gene expression in the prefrontal cortex of cases previously diagnosed with obsessive psychiatric disorders, for example, ED (N=15) and OCD/obsessive-compulsive personality disorder or tics (OCD/OCPD/Tic; N=16), and nonpsychiatric controls (N=102) and identified 6 and 286 genes that were differentially expressed between ED compared with controls and OCD cases compared with controls, respectively (false discovery rate (FDR) <5%). However, none of the clinical risk SNPs were among the eQTLs and none were significantly associated with gene expression within the broad obsessive cohort, suggesting larger sample sizes or other brain regions may be required to identify candidate molecular mechanisms of clinical association in postmortem brain data sets. PMID:25180571
Genetic neuropathology of obsessive psychiatric syndromes.
Jaffe, A E; Deep-Soboslay, A; Tao, R; Hauptman, D T; Kaye, W H; Arango, V; Weinberger, D R; Hyde, T M; Kleinman, J E
2014-09-02
Anorexia nervosa (AN), bulimia nervosa (BN) and obsessive-compulsive disorder (OCD) are complex psychiatric disorders with shared obsessive features, thought to arise from the interaction of multiple genes of small effect with environmental factors. Potential candidate genes for AN, BN and OCD have been identified through clinical association and neuroimaging studies; however, recent genome-wide association studies of eating disorders (ED) so far have failed to report significant findings. In addition, few, if any, studies have interrogated postmortem brain tissue for evidence of expression quantitative trait loci (eQTLs) associated with candidate genes, which has particular promise as an approach to elucidating molecular mechanisms of association. We therefore selected single-nucleotide polymorphisms (SNPs) based on candidate gene studies for AN, BN and OCD from the literature, and examined the association of these SNPs with gene expression across the lifespan in prefrontal cortex of a nonpsychiatric control cohort (N=268). Several risk-predisposing SNPs were significantly associated with gene expression among control subjects. We then measured gene expression in the prefrontal cortex of cases previously diagnosed with obsessive psychiatric disorders, for example, ED (N=15) and OCD/obsessive-compulsive personality disorder or tics (OCD/OCPD/Tic; N=16), and nonpsychiatric controls (N=102) and identified 6 and 286 genes that were differentially expressed between ED compared with controls and OCD cases compared with controls, respectively (false discovery rate (FDR) <5%). However, none of the clinical risk SNPs were among the eQTLs and none were significantly associated with gene expression within the broad obsessive cohort, suggesting larger sample sizes or other brain regions may be required to identify candidate molecular mechanisms of clinical association in postmortem brain data sets.
QuASAR: quantitative allele-specific analysis of reads.
Harvey, Chris T; Moyerbrailean, Gregory A; Davis, Gordon O; Wen, Xiaoquan; Luca, Francesca; Pique-Regi, Roger
2015-04-15
Expression quantitative trait loci (eQTL) studies have discovered thousands of genetic variants that regulate gene expression, enabling a better understanding of the functional role of non-coding sequences. However, eQTL studies are costly, requiring large sample sizes and genome-wide genotyping of each sample. In contrast, analysis of allele-specific expression (ASE) is becoming a popular approach to detect the effect of genetic variation on gene expression, even within a single individual. This is typically achieved by counting the number of RNA-seq reads matching each allele at heterozygous sites and testing the null hypothesis of a 1:1 allelic ratio. In principle, when genotype information is not readily available, it could be inferred from the RNA-seq reads directly. However, there are currently no existing methods that jointly infer genotypes and conduct ASE inference, while considering uncertainty in the genotype calls. We present QuASAR, quantitative allele-specific analysis of reads, a novel statistical learning method for jointly detecting heterozygous genotypes and inferring ASE. The proposed ASE inference step takes into consideration the uncertainty in the genotype calls, while including parameters that model base-call errors in sequencing and allelic over-dispersion. We validated our method with experimental data for which high-quality genotypes are available. Results for an additional dataset with multiple replicates at different sequencing depths demonstrate that QuASAR is a powerful tool for ASE analysis when genotypes are not available. http://github.com/piquelab/QuASAR. fluca@wayne.edu or rpique@wayne.edu Supplementary Material is available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Johnsson, Martin; Jonsson, Kenneth B; Andersson, Leif; Jensen, Per; Wright, Dominic
2015-05-01
Birds have a unique bone physiology, due to the demands placed on them through egg production. In particular their medullary bone serves as a source of calcium for eggshell production during lay and undergoes continuous and rapid remodelling. We take advantage of the fact that bone traits have diverged massively during chicken domestication to map the genetic basis of bone metabolism in the chicken. We performed a quantitative trait locus (QTL) and expression QTL (eQTL) mapping study in an advanced intercross based on Red Junglefowl (the wild progenitor of the modern domestic chicken) and White Leghorn chickens. We measured femoral bone traits in 456 chickens by peripheral computerised tomography and femoral gene expression in a subset of 125 females from the cross with microarrays. This resulted in 25 loci for female bone traits, 26 loci for male bone traits and 6318 local eQTL loci. We then overlapped bone and gene expression loci, before checking for an association between gene expression and trait values to identify candidate quantitative trait genes for bone traits. A handful of our candidates have been previously associated with bone traits in mice, but our results also implicate unexpected and largely unknown genes in bone metabolism. In summary, by utilising the unique bone metabolism of an avian species, we have identified a number of candidate genes affecting bone allocation and metabolism. These findings can have ramifications not only for the understanding of bone metabolism genetics in general, but could also be used as a potential model for osteoporosis as well as revealing new aspects of vertebrate bone regulation or features that distinguish avian and mammalian bone.
Genetic mapping of common bunt resistance and plant height QTL in wheat.
Singh, Arti; Knox, Ron E; DePauw, R M; Singh, A K; Cuthbert, R D; Kumar, S; Campbell, H L
2016-02-01
Breeding for field resistance to common bunt in wheat will need to account for multiple genes and epistatic and QTL by environment interactions. Loci associated with quantitative resistance to common bunt are co-localized with other beneficial traits including plant height and rust resistance. Common bunt, also known as stinking smut, is caused by seed borne fungi Tilletia tritici (Bjerk.) Wint. [syn. Tilletia caries (DC.) Tul.] and Tilletia laevis Kühn [syn. Tilletia foetida (Wallr.) Liro.]. Common bunt is known to cause grain yield and quality losses in wheat due to bunt ball formation and infestation of the grain. The objectives of this research were to identify and map quantitative trait loci (QTL) for common bunt resistance, to study the epistatic interactions between the identified QTL, and investigate the co-localization of bunt resistance with plant height. A population of 261 doubled haploid lines from the cross Carberry/AC Cadillac and checks were genotyped with polymorphic genome wide microsatellite and DArT(®) markers. The lines were grown in 2011, 2012, and 2013 in separate nurseries for common bunt incidence and height evaluation. AC Cadillac contributed a QTL (QCbt.spa-6D) for common bunt resistance on chromosome 6D at markers XwPt-1695, XwPt-672044, and XwPt-5114. Carberry contributed QTL for bunt resistance on chromosomes 1B (QCbt.spa-1B at XwPt743523) 4B (QCbt.spa-4B at XwPt-744434-Xwmc617), 4D (QCbt.spa-4D at XwPt-9747), 5B (QCbt.spa-5B at XtPt-3719) and 7D (QCbt.spa-7D at Xwmc273). Significant epistatic interactions were identified for percent bunt incidence between QCbt.spa-1B × QCbt.spa-4B and QCbt.spa-1B × QCbt.spa-6D, and QTL by environment interaction between QCbt.spa-1B × QCbt.spa-6D. Plant height QTL were found on chromosomes 4B (QPh.spa-4B) and 6D (QPh.spa-6D) that co-located with bunt resistance QTL. The identification of previously unreported common bunt resistance QTL (on chromosomes 4B, 4D and 7D), and new understanding of QTL × QTL interactions will facilitate marker-assisted breeding for common bunt resistance.
Aliloo, Hassan; Pryce, Jennie E; González-Recio, Oscar; Cocks, Benjamin G; Hayes, Ben J
2015-07-22
It has been suggested that traits with low heritability, such as fertility, may have proportionately more genetic variation arising from non-additive effects than traits with higher heritability, such as milk yield. Here, we performed a large genome scan with 408,255 single nucleotide polymorphism (SNP) markers to identify chromosomal regions associated with additive, dominance and epistatic (pairwise additive × additive) variability in milk yield and a measure of fertility, calving interval, using records from a population of 7,055 Holstein cows. The results were subsequently validated in an independent set of 3,795 Jerseys. We identified genomic regions with validated additive effects on milk yield on Bos taurus autosomes (BTA) 5, 14 and 20, whereas SNPs with suggestive additive effects on fertility were observed on BTA 5, 9, 11, 18, 22, 27, 29 and the X chromosome. We also confirmed genome regions with suggestive dominance effects for milk yield (BTA 2, 3, 5, 26 and 27) and for fertility (BTA 1, 2, 3, 7, 23, 25 and 28). A number of significant epistatic effects for milk yield on BTA 14 were found across breeds. However on close inspection, these were likely to be associated with the mutation in the diacylglycerol O-acyltransferase 1 (DGAT1) gene, given that the associations were no longer significant when the additive effect of the DGAT1 mutation was included in the epistatic model. In general, we observed a low statistical power (high false discovery rates and small number of significant SNPs) for non-additive genetic effects compared with additive effects for both traits which could be an artefact of higher dependence on linkage disequilibrium between markers and causative mutations or smaller size of non-additive effects relative to additive effects. The results of our study suggest that individual non-additive effects make a small contribution to the genetic variation of milk yield and fertility. Although we found no individual mutation with large dominance effect for both traits under investigation, a contribution to genetic variance is still possible from a large number of small dominance effects, so methods that simultaneously incorporate genotypes across all loci are suggested to test the variance explained by dominance gene actions.
Quenouille, J; Paulhiac, E; Moury, B; Palloix, A
2014-06-01
The combination of major resistance genes with quantitative resistance factors is hypothesized as a promising breeding strategy to preserve the durability of resistant cultivar, as recently observed in different pathosystems. Using the pepper (Capsicum annuum)/Potato virus Y (PVY, genus Potyvirus) pathosystem, we aimed at identifying plant genetic factors directly affecting the frequency of virus adaptation to the major resistance gene pvr2(3) and at comparing them with genetic factors affecting quantitative resistance. The resistance breakdown frequency was a highly heritable trait (h(2)=0.87). Four loci including additive quantitative trait loci (QTLs) and epistatic interactions explained together 70% of the variance of pvr2(3) breakdown frequency. Three of the four QTLs controlling pvr2(3) breakdown frequency were also involved in quantitative resistance, strongly suggesting that QTLs controlling quantitative resistance have a pleiotropic effect on the durability of the major resistance gene. With the first mapping of QTLs directly affecting resistance durability, this study provides a rationale for sustainable resistance breeding. Surprisingly, a genetic trade-off was observed between the durability of PVY resistance controlled by pvr2(3) and the spectrum of the resistance against different potyviruses. This trade-off seemed to have been resolved by the combination of minor-effect durability QTLs under long-term farmer selection.
Yu, Yunqing; Assmann, Sarah M
2018-06-15
Plant heterotrimeric G proteins modulate numerous developmental stress responses. Recently, receptor-like kinases (RLKs) have been implicated as functioning with G proteins, and may serve as plant G-protein-coupled-receptors (GPCRs). The RLK FERONIA (FER), in the Catharantus roseus RLK1-like subfamily, is activated by a family of polypeptides called Rapid Alkalinization Factors (RALFs). We previously showed that the Arabidopsis G protein β subunit, AGB1, physically interacts with FER, and that RALF1 regulation of stomatal movement through FER requires AGB1. Here, we investigated genetic interactions of AGB1 and FER in plant salinity response by comparing salt responses in the single and double mutants of agb1 and fer. We show that AGB1 and FER act additively or synergistically depending on the conditions of the NaCl treatments. We further show that the synergism likely occurs through salt-induced ROS production. In addition, we show that RALF1 enhances salt toxicity through increasing Na + accumulation and decreasing K + accumulation rather than by inducing ROS production, and that the RALF1 effect on salt response occurs in an AGB1-independent manner. Our results indicate that RLK epistatic relationships are not fixed, as AGB1 and FER display different genetic relationships to RALF1 in stomatal vs. salinity responses. This article is protected by copyright. All rights reserved.
Shankar, Jayendra; Walker, Rachel G; Wilkinson, Mark C; Ward, Deborah; Horsburgh, Malcolm J
2012-07-01
The culture supernatant fraction of an Enterococcus faecalis gelE mutant of strain OG1RF contained elevated levels of the secreted antigen SalB. Using differential fluorescence gel electrophoresis (DIGE) the salB mutant was shown to possess a unique complement of exoproteins. Differentially abundant exoproteins were identified using matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry. Stress-related proteins including DnaK, Dps family protein, SOD, and NADH peroxidase were present in greater quantity in the OG1RF salB mutant culture supernatant. Moreover, several proteins involved in cell wall synthesis and cell division, including d-Ala-d-Lac ligase and EzrA, were present in reduced quantity in OG1RF salB relative to the parent strain. The salB mutant displayed reduced viability and anomalous cell division, and these phenotypes were exacerbated in a gelE salB double mutant. An epistatic relationship between gelE and salB was not identified with respect to increased autolysis and cell morphological changes observed in the salB mutant. SalB was purified as a six-histidine-tagged protein to investigate peptidoglycan hydrolytic activity; however, activity was not evident. High-pressure liquid chromatography (HPLC) analysis of reduced muropeptides from peptidoglycan digested with mutanolysin revealed that the salB mutant and OG1RF were indistinguishable.
Fu, Guifang; Dai, Xiaotian; Symanzik, Jürgen; Bushman, Shaun
2017-01-01
Leaf shape traits have long been a focus of many disciplines, but the complex genetic and environmental interactive mechanisms regulating leaf shape variation have not yet been investigated in detail. The question of the respective roles of genes and environment and how they interact to modulate leaf shape is a thorny evolutionary problem, and sophisticated methodology is needed to address it. In this study, we investigated a framework-level approach that inputs shape image photographs and genetic and environmental data, and then outputs the relative importance ranks of all variables after integrating shape feature extraction, dimension reduction, and tree-based statistical models. The power of the proposed framework was confirmed by simulation and a Populus szechuanica var. tibetica data set. This new methodology resulted in the detection of novel shape characteristics, and also confirmed some previous findings. The quantitative modeling of a combination of polygenetic, plastic, epistatic, and gene-environment interactive effects, as investigated in this study, will improve the discernment of quantitative leaf shape characteristics, and the methods are ready to be applied to other leaf morphology data sets. Unlike the majority of approaches in the quantitative leaf shape literature, this framework-level approach is data-driven, without assuming any pre-known shape attributes, landmarks, or model structures. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Chu, Pei-Yu; Ke, Guan-Ming; Chen, Po-Chih; Liu, Li-Teh; Tsai, Yen-Chun; Tsai, Jih-Jin
2013-01-01
The continuing threat of dengue fever necessitates a comprehensive characterisation of its epidemiological trends. Phylogenetic and recombination events were reconstructed based on 100 worldwide dengue virus (DENV) type 1 genome sequences with an outgroup (prototypes of DENV2-4). The phylodynamic characteristics and site-specific variation were then analysed using data without the outgroup. Five genotypes (GI-GV) and a ladder-like structure with short terminal branch topology were observed in this study. Apparently, the transmission of DENV1 was geographically random before gradual localising with human activity as GI-GIII in South Asia, GIV in the South Pacific, and GV in the Americas. Genotypes IV and V have recently shown higher population densities compared to older genotypes. All codon regions and all tree branches were skewed toward a negative selection, which indicated that their variation was restricted by protein function. Notably, multi-epistatic interaction sites were found in both PrM 221 and NS3 1730. Recombination events accumulated in regions E, NS3-NS4A, and particularly in region NS5. The estimated coevolution pattern also highlights the need for further study of the biological role of protein PrM 221 and NS3 1730. The recent transmission of emergent GV sublineages into Central America and Europe mandates closely monitoring of genotype interaction and succession. PMID:24040199
Lack of genetic interaction between Tbx20 and Tbx3 in early mouse heart development.
Gavrilov, Svetlana; Harvey, Richard P; Papaioannou, Virginia E
2013-01-01
Members of the T-box family of transcription factors are important regulators orchestrating the complex regionalization of the developing mammalian heart. Individual mutations in Tbx20 and Tbx3 cause distinct congenital heart abnormalities in the mouse: Tbx20 mutations result in failure of heart looping, developmental arrest and lack of chamber differentiation, while hearts of Tbx3 mutants progress further, loop normally but show atrioventricular convergence and outflow tract defects. The two genes have overlapping areas of expression in the atrioventricular canal and outflow tract of the heart but their potential genetic interaction has not been previously investigated. In this study we produced compound mutants to investigate potential genetic interactions at the earliest stages of heart development. We find that Tbx20; Tbx3 double heterozygous mice are viable and fertile with no apparent abnormalities, while double homozygous mutants are embryonic lethal by midgestation. Double homozygous mutant embryos display abnormal cardiac morphogenesis, lack of heart looping, expression patterns of cardiac genes and time of death that are indistinguishable from Tbx20 homozygous mutants. Prior to death, the double homozygotes show an overall developmental delay similar to Tbx3 homozygous mutants. Thus the effects of Tbx20 are epistatic to Tbx3 in the heart but Tbx3 is epistatic to Tbx20 with respect to developmental delay.
Moore, Jason H; Amos, Ryan; Kiralis, Jeff; Andrews, Peter C
2015-01-01
Simulation plays an essential role in the development of new computational and statistical methods for the genetic analysis of complex traits. Most simulations start with a statistical model using methods such as linear or logistic regression that specify the relationship between genotype and phenotype. This is appealing due to its simplicity and because these statistical methods are commonly used in genetic analysis. It is our working hypothesis that simulations need to move beyond simple statistical models to more realistically represent the biological complexity of genetic architecture. The goal of the present study was to develop a prototype genotype–phenotype simulation method and software that are capable of simulating complex genetic effects within the context of a hierarchical biology-based framework. Specifically, our goal is to simulate multilocus epistasis or gene–gene interaction where the genetic variants are organized within the framework of one or more genes, their regulatory regions and other regulatory loci. We introduce here the Heuristic Identification of Biological Architectures for simulating Complex Hierarchical Interactions (HIBACHI) method and prototype software for simulating data in this manner. This approach combines a biological hierarchy, a flexible mathematical framework, a liability threshold model for defining disease endpoints, and a heuristic search strategy for identifying high-order epistatic models of disease susceptibility. We provide several simulation examples using genetic models exhibiting independent main effects and three-way epistatic effects. PMID:25395175
The evolution of recombination in a heterogeneous environment.
Lenormand, T; Otto, S P
2000-01-01
Most models describing the evolution of recombination have focused on the case of a single population, implicitly assuming that all individuals are equally likely to mate and that spatial heterogeneity in selection is absent. In these models, the evolution of recombination is driven by linkage disequilibria generated either by epistatic selection or drift. Models based on epistatic selection show that recombination can be favored if epistasis is negative and weak compared to directional selection and if the recombination modifier locus is tightly linked to the selected loci. In this article, we examine the joint effects of spatial heterogeneity in selection and epistasis on the evolution of recombination. In a model with two patches, each subject to different selection regimes, we consider the cases of mutation-selection and migration-selection balance as well as the spread of beneficial alleles. We find that including spatial heterogeneity extends the range of epistasis over which recombination can be favored. Indeed, recombination can be favored without epistasis, with negative and even with positive epistasis depending on environmental circumstances. The selection pressure acting on recombination-modifier loci is often much stronger with spatial heterogeneity, and even loosely linked modifiers and free linkage may evolve. In each case, predicting whether recombination is favored requires knowledge of both the type of environmental heterogeneity and epistasis, as none of these factors alone is sufficient to predict the outcome. PMID:10978305
Peng, Jianling; Yu, Jianbin; Wang, Hongliang; Guo, Yingqing; Li, Guangming; Bai, Guihua; Chen, Rujin
2011-01-01
Medicago truncatula is a legume species belonging to the inverted repeat lacking clade (IRLC) with trifoliolate compound leaves. However, the regulatory mechanisms underlying development of trifoliolate leaves in legumes remain largely unknown. Here, we report isolation and characterization of fused compound leaf1 (fcl1) mutants of M. truncatula. Phenotypic analysis suggests that FCL1 plays a positive role in boundary separation and proximal-distal axis development of compound leaves. Map-based cloning indicates that FCL1 encodes a class M KNOX protein that harbors the MEINOX domain but lacks the homeodomain. Yeast two-hybrid assays show that FCL1 interacts with a subset of Arabidopsis thaliana BEL1-like proteins with slightly different substrate specificities from the Arabidopsis homolog KNATM-B. Double mutant analyses with M. truncatula single leaflet1 (sgl1) and palmate-like pentafoliata1 (palm1) leaf mutants show that fcl1 is epistatic to palm1 and sgl1 is epistatic to fcl1 in terms of leaf complexity and that SGL1 and FCL1 act additively and are required for petiole development. Previous studies have shown that the canonical KNOX proteins are not involved in compound leaf development in IRLC legumes. The identification of FCL1 supports the role of a truncated KNOX protein in compound leaf development in M. truncatula. PMID:22080596
Beretta, Lorenzo; Santaniello, Alessandro; van Riel, Piet L C M; Coenen, Marieke J H; Scorza, Raffaella
2010-08-06
Epistasis is recognized as a fundamental part of the genetic architecture of individuals. Several computational approaches have been developed to model gene-gene interactions in case-control studies, however, none of them is suitable for time-dependent analysis. Herein we introduce the Survival Dimensionality Reduction (SDR) algorithm, a non-parametric method specifically designed to detect epistasis in lifetime datasets. The algorithm requires neither specification about the underlying survival distribution nor about the underlying interaction model and proved satisfactorily powerful to detect a set of causative genes in synthetic epistatic lifetime datasets with a limited number of samples and high degree of right-censorship (up to 70%). The SDR method was then applied to a series of 386 Dutch patients with active rheumatoid arthritis that were treated with anti-TNF biological agents. Among a set of 39 candidate genes, none of which showed a detectable marginal effect on anti-TNF responses, the SDR algorithm did find that the rs1801274 SNP in the Fc gamma RIIa gene and the rs10954213 SNP in the IRF5 gene non-linearly interact to predict clinical remission after anti-TNF biologicals. Simulation studies and application in a real-world setting support the capability of the SDR algorithm to model epistatic interactions in candidate-genes studies in presence of right-censored data. http://sourceforge.net/projects/sdrproject/.
Yi, Qiang; Liu, Yinghong; Zhang, Xiangge; Hou, Xianbin; Zhang, Junjie; Liu, Hanmei; Hu, Yufeng; Yu, Guowu; Huang, Yubi
2018-03-01
Tassel architecture is an important trait in maize breeding and hybrid seed production. In this study, we investigated total tassel length (TTL) and tassel branch number (TBN) in 266 F 2:3 families across six environments and in 301 recombinant inbred lines (RILs) across three environments, where all the plants were derived from a cross between 08-641 and Ye478. We compared the genetic architecture of the two traits across two generations through combined analysis. In total, 27 quantitative trait loci (QTLs) (15 in F 2:3 ; 16 in RIL), two QTL × environment interactions (both in F 2:3 ), 11 pairs of epistatic interactions (seven in F 2:3 ; four in RIL) and four stable QTLs in both the F 2:3 and RILs were detected. The RIL population had higher detection power than the F 2:3 population. Nevertheless, QTL × environment interactions and epistatic interactions could be more easily detected in the F 2:3 population than in the RILs. Overall, the QTL mapping results in the F 2:3 and RILs were greatly influenced by genetic generations and environments. Finally, fine mapping for a novel and major QTL, qTTL-2-3 (bin 2.07), which accounted for over 8.49% of the phenotypic variation across different environments and generations, could be useful in marker-assisted breeding.
Branham, Sandra E; Levi, Amnon; Katawczik, Melanie; Fei, Zhangjun; Wechter, W Patrick
2018-04-01
Four QTLs and an epistatic interaction were associated with disease severity in response to inoculation with Fusarium oxysporum f. sp. melonis race 1 in a recombinant inbred line population of melon. The USDA Cucumis melo inbred line, MR-1, harbors a wealth of alleles associated with resistance to several major diseases of melon, including powdery mildew, downy mildew, Alternaria leaf blight, and Fusarium wilt. MR-1 was crossed to an Israeli cultivar, Ananas Yok'neam, which is susceptible to all of these diseases, to generate a recombinant inbred line (RIL) population of 172 lines. In this study, the RIL population was genotyped to construct an ultra-dense genetic linkage map with 5663 binned SNPs anchored to the C. melo genome and exhibits the overall high quality of the assembly. The utility of the densely genotyped population was demonstrated through QTL mapping of a well-studied trait, resistance to Fusarium wilt caused by Fusarium oxysporum f. sp. melonis (Fom) race 1. A major QTL co-located with the previously validated resistance gene Fom-2. In addition, three minor QTLs and an epistatic interaction contributing to Fom race 1 resistance were identified. The MR-1 × AY RIL population provides a valuable resource for future QTL mapping studies and marker-assisted selection of disease resistance in melon.
O'Tuathaigh, Colm M P; Fumagalli, Fabio; Desbonnet, Lieve; Perez-Branguli, Francesc; Moloney, Gerard; Loftus, Samim; O'Leary, Claire; Petit, Emilie; Cox, Rachel; Tighe, Orna; Clarke, Gerard; Lai, Donna; Harvey, Richard P; Cryan, John F; Mitchell, Kevin J; Dinan, Timothy G; Riva, Marco A; Waddington, John L
2017-01-01
Few studies have addressed likely gene × gene (ie, epistatic) interactions in mediating risk for schizophrenia. Using a preclinical genetic approach, we investigated whether simultaneous disruption of the risk factors Neuregulin-1 (NRG1) and Disrupted-in-schizophrenia 1 (DISC1) would produce a disease-relevant phenotypic profile different from that observed following disruption to either gene alone. NRG1 heterozygotes exhibited hyperactivity and disruption to prepulse inhibition, both reversed by antipsychotic treatment, and accompanied by reduced striatal dopamine D2 receptor protein expression, impaired social cognition, and altered glutamatergic synaptic protein expression in selected brain areas. Single gene DISC1 mutants demonstrated a disruption in social cognition and nest-building, altered brain 5-hydroxytryptamine levels and hippocampal ErbB4 expression, and decreased cortical expression of the schizophrenia-associated microRNA miR-29b. Co-disruption of DISC1 and NRG1, indicative of epistasis, evoked an impairment in sociability and enhanced self-grooming, accompanied by changes in hypothalamic oxytocin/vasopressin gene expression. The findings indicate specific behavioral correlates and underlying cellular pathways downstream of main effects of DNA variation in the schizophrenia-associated genes NRG1 and DISC1. © The Author 2016. Published by Oxford University Press on behalf of the Maryland Psychiatric Research Center. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Transgressive Hybrids as Hopeful Monsters.
Dittrich-Reed, Dylan R; Fitzpatrick, Benjamin M
2013-06-01
The origin of novelty is a critical subject for evolutionary biologists. Early geneticists speculated about the sudden appearance of new species via special macromutations, epitomized by Goldschmidt's infamous "hopeful monster". Although these ideas were easily dismissed by the insights of the Modern Synthesis, a lingering fascination with the possibility of sudden, dramatic change has persisted. Recent work on hybridization and gene exchange suggests an underappreciated mechanism for the sudden appearance of evolutionary novelty that is entirely consistent with the principles of modern population genetics. Genetic recombination in hybrids can produce transgressive phenotypes, "monstrous" phenotypes beyond the range of parental populations. Transgressive phenotypes can be products of epistatic interactions or additive effects of multiple recombined loci. We compare several epistatic and additive models of transgressive segregation in hybrids and find that they are special cases of a general, classic quantitative genetic model. The Dobzhansky-Muller model predicts "hopeless" monsters, sterile and inviable transgressive phenotypes. The Bateson model predicts "hopeful" monsters with fitness greater than either parental population. The complementation model predicts both. Transgressive segregation after hybridization can rapidly produce novel phenotypes by recombining multiple loci simultaneously. Admixed populations will also produce many similar recombinant phenotypes at the same time, increasing the probability that recombinant "hopeful monsters" will establish true-breeding evolutionary lineages. Recombination is not the only (or even most common) process generating evolutionary novelty, but might be the most credible mechanism for sudden appearance of new forms.
Erceg, Jelena; Saunders, Timothy E.; Girardot, Charles; Devos, Damien P.; Hufnagel, Lars; Furlong, Eileen E. M.
2014-01-01
Deciphering the specific contribution of individual motifs within cis-regulatory modules (CRMs) is crucial to understanding how gene expression is regulated and how this process is affected by sequence variation. But despite vast improvements in the ability to identify where transcription factors (TFs) bind throughout the genome, we are limited in our ability to relate information on motif occupancy to function from sequence alone. Here, we engineered 63 synthetic CRMs to systematically assess the relationship between variation in the content and spacing of motifs within CRMs to CRM activity during development using Drosophila transgenic embryos. In over half the cases, very simple elements containing only one or two types of TF binding motifs were capable of driving specific spatio-temporal patterns during development. Different motif organizations provide different degrees of robustness to enhancer activity, ranging from binary on-off responses to more subtle effects including embryo-to-embryo and within-embryo variation. By quantifying the effects of subtle changes in motif organization, we were able to model biophysical rules that explain CRM behavior and may contribute to the spatial positioning of CRM activity in vivo. For the same enhancer, the effects of small differences in motif positions varied in developmentally related tissues, suggesting that gene expression may be more susceptible to sequence variation in one tissue compared to another. This result has important implications for human eQTL studies in which many associated mutations are found in cis-regulatory regions, though the mechanism for how they affect tissue-specific gene expression is often not understood. PMID:24391522
Manry, Jérémy; Nédélec, Yohann; Fava, Vinicius M; Cobat, Aurélie; Orlova, Marianna; Thuc, Nguyen Van; Thai, Vu Hong; Laval, Guillaume; Barreiro, Luis B; Schurr, Erwin
2017-08-01
Leprosy is a human infectious disease caused by Mycobacterium leprae. A strong host genetic contribution to leprosy susceptibility is well established. However, the modulation of the transcriptional response to infection and the mechanism(s) of disease control are poorly understood. To address this gap in knowledge of leprosy pathogenicity, we conducted a genome-wide search for expression quantitative trait loci (eQTL) that are associated with transcript variation before and after stimulation with M. leprae sonicate in whole blood cells. We show that M. leprae antigen stimulation mainly triggered the upregulation of immune related genes and that a substantial proportion of the differential gene expression is genetically controlled. Indeed, using stringent criteria, we identified 318 genes displaying cis-eQTL at an FDR of 0.01, including 66 genes displaying response-eQTL (reQTL), i.e. cis-eQTL that showed significant evidence for interaction with the M. leprae stimulus. Such reQTL correspond to regulatory variations that affect the interaction between human whole blood cells and M. leprae sonicate and, thus, likely between the human host and M. leprae bacilli. We found that reQTL were significantly enriched among binding sites of transcription factors that are activated in response to infection, and that they were enriched among single nucleotide polymorphisms (SNPs) associated with susceptibility to leprosy per se and Type-I Reaction, and seven of them have been targeted by recent positive selection. Our study suggested that natural selection shaped our genomic diversity to face pathogen exposure including M. leprae infection.
Lai, Wen-Sung
2014-01-01
AKT1 (also known as protein kinase B, α), a serine/threonine kinase of AKT family, has been implicated in both schizophrenia and methamphetamine (Meth) use disorders. AKT1 or its protein also has epistatic effects on the regulation of dopamine-dependent behaviors or drug effects, especially in the striatum. The aim of this study is to investigate the sex-specific role of Akt1 in the regulation of Meth-induced behavioral sensitization and the alterations of striatal neurons using Akt1 −/− mice and wild-type littermates as a model. A series of 4 Experiments were conducted. Meth-induced hyperlocomotion and Meth-related alterations of brain activity were measured. The neural properties of striatal medium spiny neurons (MSNs) were also characterized. Further, 17β-estradiol was applied to examine its protective effect in Meth-sensitized male mice. Our findings indicate that (1) Akt1 −/− males were less sensitive to Meth-induced hyperlocomotion during Meth challenge compared with wild-type controls and Akt1 −/− females, (2) further sex differences were revealed by coinjection of Meth with raclopride but not SCH23390 in Meth-sensitized Akt1 −/− males, (3) Meth-induced alterations of striatal activity were confirmed in Akt1 −/− males using microPET scan with 18F-flurodeoxyglucose, (4) Akt1 deficiency had a significant impact on the electrophysiological and neuromorphological properties of striatal MSNs in male mice, and (5) subchronic injections of 17β-estradiol prevented the reduction of Meth-induced hyperactivity in Meth-sensitized Akt1 −/− male mice. This study highlights a sex- and region-specific effect of Akt1 in the regulation of dopamine-dependent behaviors and implies the importance of AKT1 in the modulation of sex differences in Meth sensitivity and schizophrenia. PMID:23474853
Sotelo, Tamara; Soengas, Pilar; Velasco, Pablo; Rodríguez, Víctor M.; Cartea, María Elena
2014-01-01
Glucosinolates are major secondary metabolites found in the Brassicaceae family. These compounds play an essential role in plant defense against biotic and abiotic stresses, but more interestingly they have beneficial effects on human health. We performed a genetic analysis in order to identify the genome regions regulating glucosinolates biosynthesis in a DH mapping population of Brassica oleracea. In order to obtain a general overview of regulation in the whole plant, analyses were performed in the three major organs where glucosinolates are synthesized (leaves, seeds and flower buds). Eighty two significant QTLs were detected, which explained a broad range of variability in terms of individual and total glucosinolate (GSL) content. A meta-analysis rendered eighteen consensus QTLs. Thirteen of them regulated more than one glucosinolate and its content. In spite of the considerable variability of glucosinolate content and profiles across the organ, some of these consensus QTLs were identified in more than one tissue. Consensus QTLs control the GSL content by interacting epistatically in complex networks. Based on in silico analysis within the B. oleracea genome along with synteny with Arabidopsis, we propose seven major candidate loci that regulate GSL biosynthesis in the Brassicaceae family. Three of these loci control the content of aliphatic GSL and four of them control the content of indolic glucosinolates. GSL-ALK plays a central role in determining aliphatic GSL variation directly and by interacting epistatically with other loci, thus suggesting its regulatory effect. PMID:24614913
2010-01-01
Background Epistasis is recognized as a fundamental part of the genetic architecture of individuals. Several computational approaches have been developed to model gene-gene interactions in case-control studies, however, none of them is suitable for time-dependent analysis. Herein we introduce the Survival Dimensionality Reduction (SDR) algorithm, a non-parametric method specifically designed to detect epistasis in lifetime datasets. Results The algorithm requires neither specification about the underlying survival distribution nor about the underlying interaction model and proved satisfactorily powerful to detect a set of causative genes in synthetic epistatic lifetime datasets with a limited number of samples and high degree of right-censorship (up to 70%). The SDR method was then applied to a series of 386 Dutch patients with active rheumatoid arthritis that were treated with anti-TNF biological agents. Among a set of 39 candidate genes, none of which showed a detectable marginal effect on anti-TNF responses, the SDR algorithm did find that the rs1801274 SNP in the FcγRIIa gene and the rs10954213 SNP in the IRF5 gene non-linearly interact to predict clinical remission after anti-TNF biologicals. Conclusions Simulation studies and application in a real-world setting support the capability of the SDR algorithm to model epistatic interactions in candidate-genes studies in presence of right-censored data. Availability: http://sourceforge.net/projects/sdrproject/ PMID:20691091
Zhou, Daling; Du, Qingzhang; Chen, Jinhui; Wang, Qingshi; Zhang, Deqiang
2017-10-01
Long non-coding RNAs (lncRNAs) function in various biological processes. However, their roles in secondary growth of plants remain poorly understood. Here, 15,691 lncRNAs were identified from vascular cambium, developing xylem, and mature xylem of Populus tomentosa with high and low biomass using RNA-seq, including 1,994 lncRNAs that were differentially expressed (DE) among the six libraries. 3,569 cis-regulated and 3,297 trans-regulated protein-coding genes were predicted as potential target genes (PTGs) of the DE lncRNAs to participate in biological regulation. Then, 476 and 28 lncRNAs were identified as putative targets and endogenous target mimics (eTMs) of Populus known microRNAs (miRNAs), respectively. Genome re-sequencing of 435 individuals from a natural population of P. tomentosa found 34,015 single nucleotide polymorphisms (SNPs) within 178 lncRNA loci and 522 PTGs. Single-SNP associations analysis detected 2,993 associations with 10 growth and wood-property traits under additive and dominance model. Epistasis analysis identified 17,656 epistatic SNP pairs, providing evidence for potential regulatory interactions between lncRNAs and their PTGs. Furthermore, a reconstructed epistatic network, representing interactions of 8 lncRNAs and 15 PTGs, might enrich regulation roles of genes in the phenylpropanoid pathway. These findings may enhance our understanding of non-coding genes in plants. © The Author 2017. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Fixation Probability in a Two-Locus Model by the Ancestral Recombination–Selection Graph
Lessard, Sabin; Kermany, Amir R.
2012-01-01
We use the ancestral influence graph (AIG) for a two-locus, two-allele selection model in the limit of a large population size to obtain an analytic approximation for the probability of ultimate fixation of a single mutant allele A. We assume that this new mutant is introduced at a given locus into a finite population in which a previous mutant allele B is already segregating with a wild type at another linked locus. We deduce that the fixation probability increases as the recombination rate increases if allele A is either in positive epistatic interaction with B and allele B is beneficial or in no epistatic interaction with B and then allele A itself is beneficial. This holds at least as long as the recombination fraction and the selection intensity are small enough and the population size is large enough. In particular this confirms the Hill–Robertson effect, which predicts that recombination renders more likely the ultimate fixation of beneficial mutants at different loci in a population in the presence of random genetic drift even in the absence of epistasis. More importantly, we show that this is true from weak negative epistasis to positive epistasis, at least under weak selection. In the case of deleterious mutants, the fixation probability decreases as the recombination rate increases. This supports Muller’s ratchet mechanism to explain the accumulation of deleterious mutants in a population lacking recombination. PMID:22095080
Data Imputation in Epistatic MAPs by Network-Guided Matrix Completion
Žitnik, Marinka; Zupan, Blaž
2015-01-01
Abstract Epistatic miniarray profile (E-MAP) is a popular large-scale genetic interaction discovery platform. E-MAPs benefit from quantitative output, which makes it possible to detect subtle interactions with greater precision. However, due to the limits of biotechnology, E-MAP studies fail to measure genetic interactions for up to 40% of gene pairs in an assay. Missing measurements can be recovered by computational techniques for data imputation, in this way completing the interaction profiles and enabling downstream analysis algorithms that could otherwise be sensitive to missing data values. We introduce a new interaction data imputation method called network-guided matrix completion (NG-MC). The core part of NG-MC is low-rank probabilistic matrix completion that incorporates prior knowledge presented as a collection of gene networks. NG-MC assumes that interactions are transitive, such that latent gene interaction profiles inferred by NG-MC depend on the profiles of their direct neighbors in gene networks. As the NG-MC inference algorithm progresses, it propagates latent interaction profiles through each of the networks and updates gene network weights toward improved prediction. In a study with four different E-MAP data assays and considered protein–protein interaction and gene ontology similarity networks, NG-MC significantly surpassed existing alternative techniques. Inclusion of information from gene networks also allowed NG-MC to predict interactions for genes that were not included in original E-MAP assays, a task that could not be considered by current imputation approaches. PMID:25658751
Conformational suppression of inter-receptor signaling defects
Ames, Peter; Parkinson, John S.
2006-01-01
Motile bacteria follow gradients of attractant and repellent chemicals with high sensitivity. Their chemoreceptors are physically clustered, which may enable them to function as a cooperative array. Although native chemoreceptor molecules are typically transmembrane homodimers, they appear to associate through their cytoplasmic tips to form trimers of dimers, which may be an important architectural element in the assembly and operation of receptor clusters. The five receptors of Escherichia coli that mediate most of its chemotactic and aerotactic behaviors have identical trimer contact residues and have been shown by in vivo crosslinking methods to form mixed trimers of dimers. Mutations at the trimer contact sites of Tsr, the serine chemoreceptor, invariably abrogate Tsr function, but some of those lesions (designated Tsr*) are epistatic and block the function of heterologous chemoreceptors. We isolated and characterized mutations (designated Tar⋀) in the aspartate chemoreceptor that restored function to Tsr* receptors. The suppressors arose at or near the Tar trimer contact sites and acted in an allele-specific fashion on Tsr* partners. Alone, many Tar⋀ receptors were unable to mediate chemotactic responses to aspartate, but all formed clusters with varying efficiencies. Most of those Tar⋀ receptors were epistatic to WT Tsr, but some regained Tar function in combination with a suppressible Tsr* partner. Tar⋀–Tsr* suppression most likely occurs through compensatory changes in the conformation or dynamics of a mixed receptor signaling complex, presumably based on trimer-of-dimer interactions. These collaborative teams may be responsible for the high-gain signaling properties of bacterial chemoreceptors. PMID:16751275
Epistasis between QTLs for bone density variation in Copenhagen × dark agouti F2 rats
Liu, Lixiang; Alam, Imranul; Sun, Qiwei; Econs, Michael J.; Foroud, Tatiana; Turner, Charles H.
2010-01-01
The variation in several of the risk factors for osteoporotic fracture, including bone mineral density (BMD), has been shown to be strongly influenced by genetic differences. However, the genetic architecture of BMD is complex in both humans and in model organisms. We previously reported quantitative trait locus (QTL) results for BMD from a genome screen of 828 F2 progeny of Copenhagen and dark agouti rats. These progeny also provide an excellent opportunity to search for epistatic effects, or interaction between genetic loci, that contribute to fracture risk. Microsatellite marker data from a 20-cM genome screen was analyzed along with weight-adjusted bone density (DXA and pQCT) phenotypic data using the R/qtl software package. Genotype and phenotype data were permuted to determine genome-wide significance thresholds for the full model and epistasis (interaction) LOD scores corresponding to an alpha level of 0.01. A novel locus on chromosome 15 and a previously reported chromosome 14 QTL demonstrated a strong epistatic effect on BMD at the femur by DXA (LOD = 5.4). Two novel QTLs on chromosomes 2 and 12 were found to interact to affect total BMD at the femur midshaft by pQCT (LOD = 5.0). These results provide new information regarding the mode of action of previously identified QTL in the rat, as well as identifying novel loci that act in combination with known QTL or with other novel loci to contribute to BMD variation. PMID:19153792
Epistasis between QTLs for bone density variation in Copenhagen x dark agouti F2 rats.
Koller, Daniel L; Liu, Lixiang; Alam, Imranul; Sun, Qiwei; Econs, Michael J; Foroud, Tatiana; Turner, Charles H
2009-03-01
The variation in several of the risk factors for osteoporotic fracture, including bone mineral density (BMD), has been shown to be strongly influenced by genetic differences. However, the genetic architecture of BMD is complex in both humans and in model organisms. We previously reported quantitative trait locus (QTL) results for BMD from a genome screen of 828 F2 progeny of Copenhagen and dark agouti rats. These progeny also provide an excellent opportunity to search for epistatic effects, or interaction between genetic loci, that contribute to fracture risk. Microsatellite marker data from a 20-cM genome screen was analyzed along with weight-adjusted bone density (DXA and pQCT) phenotypic data using the R/qtl software package. Genotype and phenotype data were permuted to determine genome-wide significance thresholds for the full model and epistasis (interaction) LOD scores corresponding to an alpha level of 0.01. A novel locus on chromosome 15 and a previously reported chromosome 14 QTL demonstrated a strong epistatic effect on BMD at the femur by DXA (LOD = 5.4). Two novel QTLs on chromosomes 2 and 12 were found to interact to affect total BMD at the femur midshaft by pQCT (LOD = 5.0). These results provide new information regarding the mode of action of previously identified QTL in the rat, as well as identifying novel loci that act in combination with known QTL or with other novel loci to contribute to BMD variation.
TEAM: efficient two-locus epistasis tests in human genome-wide association study.
Zhang, Xiang; Huang, Shunping; Zou, Fei; Wang, Wei
2010-06-15
As a promising tool for identifying genetic markers underlying phenotypic differences, genome-wide association study (GWAS) has been extensively investigated in recent years. In GWAS, detecting epistasis (or gene-gene interaction) is preferable over single locus study since many diseases are known to be complex traits. A brute force search is infeasible for epistasis detection in the genome-wide scale because of the intensive computational burden. Existing epistasis detection algorithms are designed for dataset consisting of homozygous markers and small sample size. In human study, however, the genotype may be heterozygous, and number of individuals can be up to thousands. Thus, existing methods are not readily applicable to human datasets. In this article, we propose an efficient algorithm, TEAM, which significantly speeds up epistasis detection for human GWAS. Our algorithm is exhaustive, i.e. it does not ignore any epistatic interaction. Utilizing the minimum spanning tree structure, the algorithm incrementally updates the contingency tables for epistatic tests without scanning all individuals. Our algorithm has broader applicability and is more efficient than existing methods for large sample study. It supports any statistical test that is based on contingency tables, and enables both family-wise error rate and false discovery rate controlling. Extensive experiments show that our algorithm only needs to examine a small portion of the individuals to update the contingency tables, and it achieves at least an order of magnitude speed up over the brute force approach.
Jiang, Wenzhu; Lee, Joohyun; Jin, Yong-Mei; Qiao, Yongli; Piao, Rihua; Jang, Sun Mi; Woo, Mi-Ok; Kwon, Soon-Wook; Liu, Xianhu; Pan, Hong-Yu; Du, Xinglin; Koh, Hee-Jong
2011-04-01
Seed germination capability of rice is one of the important traits in the production and storage of seeds. Quantitative trait loci (QTL) associated with seed germination capability in various storage periods was identified using two sets of recombinant inbred lines (RILs) which derived from crosses between Milyang 23 and Tong 88-7 (MT-RILs) and between Dasanbyeo and TR22183 (DT-RILs). A total of five and three main additive effects (QTLs) associated with seed germination capability were identified in MT-RILs and DT-RILs, respectively. Among them, six QTLs were identified repeatedly in various seed storage periods designated as qMT-SGC5.1, qMT-SGC7.2, and qMT-SGC9.1 on chromosomes 5, 7, and 9 in MT-RILs, and qDT-SGC2.1, qDT-SGC3.1, and qDT-SGC9.1 on chromosomes 2, 3, and 9 in DT-RILs, respectively. The QTL on chromosome 9 was identified in both RIL populations under all three storage periods, explaining up to 40% of the phenotypic variation. Eight and eighteen pairs additive × additive epistatic effect (epistatic QTL) were identified in MT-RILs and DT-RILs, respectively. In addition, several near isogenic lines (NILs) were developed to confirm six repeatable QTL effects using controlled deterioration test (CDT). The identified QTLs will be further studied to elucidate the mechanisms controlling seed germination capability, which have important implications for long-term seed storage.
A Model of Substitution Trajectories in Sequence Space and Long-Term Protein Evolution
Usmanova, Dinara R.; Ferretti, Luca; Povolotskaya, Inna S.; Vlasov, Peter K.; Kondrashov, Fyodor A.
2015-01-01
The nature of factors governing the tempo and mode of protein evolution is a fundamental issue in evolutionary biology. Specifically, whether or not interactions between different sites, or epistasis, are important in directing the course of evolution became one of the central questions. Several recent reports have scrutinized patterns of long-term protein evolution claiming them to be compatible only with an epistatic fitness landscape. However, these claims have not yet been substantiated with a formal model of protein evolution. Here, we formulate a simple covarion-like model of protein evolution focusing on the rate at which the fitness impact of amino acids at a site changes with time. We then apply the model to the data on convergent and divergent protein evolution to test whether or not the incorporation of epistatic interactions is necessary to explain the data. We find that convergent evolution cannot be explained without the incorporation of epistasis and the rate at which an amino acid state switches from being acceptable at a site to being deleterious is faster than the rate of amino acid substitution. Specifically, for proteins that have persisted in modern prokaryotic organisms since the last universal common ancestor for one amino acid substitution approximately ten amino acid states switch from being accessible to being deleterious, or vice versa. Thus, molecular evolution can only be perceived in the context of rapid turnover of which amino acids are available for evolution. PMID:25415964
Gamal El-Dien, Omnia; Ratcliffe, Blaise; Klápště, Jaroslav; Porth, Ilga; Chen, Charles; El-Kassaby, Yousry A.
2016-01-01
The open-pollinated (OP) family testing combines the simplest known progeny evaluation and quantitative genetics analyses as candidates’ offspring are assumed to represent independent half-sib families. The accuracy of genetic parameter estimates is often questioned as the assumption of “half-sibling” in OP families may often be violated. We compared the pedigree- vs. marker-based genetic models by analysing 22-yr height and 30-yr wood density for 214 white spruce [Picea glauca (Moench) Voss] OP families represented by 1694 individuals growing on one site in Quebec, Canada. Assuming half-sibling, the pedigree-based model was limited to estimating the additive genetic variances which, in turn, were grossly overestimated as they were confounded by very minor dominance and major additive-by-additive epistatic genetic variances. In contrast, the implemented genomic pairwise realized relationship models allowed the disentanglement of additive from all nonadditive factors through genetic variance decomposition. The marker-based models produced more realistic narrow-sense heritability estimates and, for the first time, allowed estimating the dominance and epistatic genetic variances from OP testing. In addition, the genomic models showed better prediction accuracies compared to pedigree models and were able to predict individual breeding values for new individuals from untested families, which was not possible using the pedigree-based model. Clearly, the use of marker-based relationship approach is effective in estimating the quantitative genetic parameters of complex traits even under simple and shallow pedigree structure. PMID:26801647
Unraveling additive from nonadditive effects using genomic relationship matrices.
Muñoz, Patricio R; Resende, Marcio F R; Gezan, Salvador A; Resende, Marcos Deon Vilela; de Los Campos, Gustavo; Kirst, Matias; Huber, Dudley; Peter, Gary F
2014-12-01
The application of quantitative genetics in plant and animal breeding has largely focused on additive models, which may also capture dominance and epistatic effects. Partitioning genetic variance into its additive and nonadditive components using pedigree-based models (P-genomic best linear unbiased predictor) (P-BLUP) is difficult with most commonly available family structures. However, the availability of dense panels of molecular markers makes possible the use of additive- and dominance-realized genomic relationships for the estimation of variance components and the prediction of genetic values (G-BLUP). We evaluated height data from a multifamily population of the tree species Pinus taeda with a systematic series of models accounting for additive, dominance, and first-order epistatic interactions (additive by additive, dominance by dominance, and additive by dominance), using either pedigree- or marker-based information. We show that, compared with the pedigree, use of realized genomic relationships in marker-based models yields a substantially more precise separation of additive and nonadditive components of genetic variance. We conclude that the marker-based relationship matrices in a model including additive and nonadditive effects performed better, improving breeding value prediction. Moreover, our results suggest that, for tree height in this population, the additive and nonadditive components of genetic variance are similar in magnitude. This novel result improves our current understanding of the genetic control and architecture of a quantitative trait and should be considered when developing breeding strategies. Copyright © 2014 by the Genetics Society of America.
Assessing non-additive effects in GBLUP model.
Vieira, I C; Dos Santos, J P R; Pires, L P M; Lima, B M; Gonçalves, F M A; Balestre, M
2017-05-10
Understanding non-additive effects in the expression of quantitative traits is very important in genotype selection, especially in species where the commercial products are clones or hybrids. The use of molecular markers has allowed the study of non-additive genetic effects on a genomic level, in addition to a better understanding of its importance in quantitative traits. Thus, the purpose of this study was to evaluate the behavior of the GBLUP model in different genetic models and relationship matrices and their influence on the estimates of genetic parameters. We used real data of the circumference at breast height in Eucalyptus spp and simulated data from a population of F 2 . Three commonly reported kinship structures in the literature were adopted. The simulation results showed that the inclusion of epistatic kinship improved prediction estimates of genomic breeding values. However, the non-additive effects were not accurately recovered. The Fisher information matrix for real dataset showed high collinearity in estimates of additive, dominant, and epistatic variance, causing no gain in the prediction of the unobserved data and convergence problems. Estimates presented differences of genetic parameters and correlations considering the different kinship structures. Our results show that the inclusion of non-additive effects can improve the predictive ability or even the prediction of additive effects. However, the high distortions observed in the variance estimates when the Hardy-Weinberg equilibrium assumption is violated due to the presence of selection or inbreeding can converge at zero gains in models that consider epistasis in genomic kinship.
Ephemeral association between gene CG5762 and hybrid male sterility in Drosophila sibling species.
Ma, Daina; Michalak, Pawel
2011-10-01
Interspecies divergence in regulatory pathways may result in hybrid male sterility (HMS) when dominance and epistatic interactions between alleles that are functional within one genome are disrupted in hybrid genomes. The identification of genes contributing to HMS and other hybrid dysfunctions is essential for understanding the origin of new species (speciation). Previously, we identified a panel of male-specific loci misexpressed in sterile male hybrids of Drosophila simulans and D. mauritiana relative to parental species. In the current work, we attempt to dissect the genetic associations between HMS and one of the genes, CG5762, a Drosophila-unique locus characterized by rapid sequence divergence within the genus, presumably driven by positive natural selection. CG5762 is underexpressed in sterile backcross males compared with their fertile brothers. In CG5762 heterozygotes, the D. mauritiana allele is consistently overexpressed on both the D. simulans and D. mauritiana backcross genomic background, suggesting a cis-acting regulation factor. There is a significant association between heterozygosity and HMS in hybrid males from early but not later backcross generations. Microsatellite markers spanning CG5762 fail to associate with HMS. These observations lead to a conclusion that CG5762 is not a causative factor of HMS. Although genetic linkage between CG5762 and a neighboring causative introgression cannot be ruled out, it seems that the pattern is most consistent with CG5762 participating in epistatic interactions that are disrupted in flies with HMS.
The reality and importance of founder speciation in evolution.
Templeton, Alan R
2008-05-01
A founder event occurs when a new population is established from a small number of individuals drawn from a large ancestral population. Mayr proposed that genetic drift in an isolated founder population could alter the selective forces in an epistatic system, an observation supported by recent studies. Carson argued that a period of relaxed selection could occur when a founder population is in an open ecological niche, allowing rapid population growth after the founder event. Selectable genetic variation can actually increase during this founder-flush phase due to recombination, enhanced survival of advantageous mutations, and the conversion of non-additive genetic variance into additive variance in an epistatic system, another empirically confirmed prediction. Templeton combined the theories of Mayr and Carson with population genetic models to predict the conditions under which founder events can contribute to speciation, and these predictions are strongly confirmed by the empirical literature. Much of the criticism of founder speciation is based upon equating founder speciation to an adaptive peak shift opposed by selection. However, Mayr, Carson and Templeton all modeled a positive interaction of selection and drift, and Templeton showed that founder speciation is incompatible with peak-shift conditions. Although rare, founder speciation can have a disproportionate importance in adaptive innovation and radiation, and examples are given to show that "rare" does not mean "unimportant" in evolution. Founder speciation also interacts with other speciation mechanisms such that a speciation event is not a one-dimensional process due to either selection alone or drift alone. (c) 2008 Wiley Periodicals, Inc.
Xu, X. F.; Mei, H. W.; Luo, L. J.; Cheng, X. N.; Li, Z. K.
2002-02-01
Quantitative trait loci (QTLs), conferring quantitative resistance to rice brown planthopper (BPH), were investigated using 160 F(11) recombinant inbred lines (RILs) from the Lemont/Teqing cross, a complete RFLP map, and replicated phenotyping of seedbox inoculation. The paternal indica parent, Teqing, was more-resistant to BPH than the maternal japonica parent, Lemont. The RILs showed transgressive segregation for resistance to BPH. Seven main-effect QTLs and many epistatic QTL pairs were identified and mapped on the 12 rice chromosomes. Collectively, the main-effect and epistatic QTLs accounted for over 70% of the total variation in damage scores. Teqing has the resistance allele at four main-effect QTLs, and the Lemont allele resulted in resistance at the other three. Of the main-effect QTLs identified, QBphr5b was mapped to the vicinity of gl1, a major gene controlling leaf and stem pubescence. The Teqing allele controlling leaf and stem pubescence was associated with resistance, while the Lemont allele for glabrous stem and leaves was associated with susceptibility, indicating that this gene may have contributed to resistance through antixenosis. Similar to the reported BPH resistance genes, the other six detected main-effect QTLs were all mapped to regions where major disease resistance genes locate, suggesting they might have contributed either to antibiosis or tolerance. Our results indicated that marker-aided pyramiding of major resistance genes and QTLs should provide effective and stable control over this devastating pest.
Eskandari, Mehrzad; Cober, Elroy R; Rajcan, Istvan
2013-06-01
Soybean [Glycine max (L.) Merrill] seed oil is the primary global source of edible oil and a major renewable and sustainable feedstock for biodiesel production. Therefore, increasing the relative oil concentration in soybean is desirable; however, that goal is complex due to the quantitative nature of the oil concentration trait and possible effects on major agronomic traits such as seed yield or protein concentration. The objectives of the present study were to study the relationship between seed oil concentration and important agronomic and seed quality traits, including seed yield, 100-seed weight, protein concentration, plant height, and days to maturity, and to identify oil quantitative trait loci (QTL) that are co-localized with the traits evaluated. A population of 203 F4:6 recombinant inbred lines, derived from a cross between moderately high oil soybean genotypes OAC Wallace and OAC Glencoe, was developed and grown across multiple environments in Ontario, Canada, in 2009 and 2010. Among the 11 QTL associated with seed oil concentration in the population, which were detected using either single-factor ANOVA or multiple QTL mapping methods, the number of QTL that were co-localized with other important traits QTL were six for protein concentration, four for seed yield, two for 100-seed weight, one for days to maturity, and one for plant height. The oil-beneficial allele of the QTL tagged by marker Sat_020 was positively associated with seed protein concentration. The oil favorable alleles of markers Satt001 and GmDGAT2B were positively correlated with seed yield. In addition, significant two-way epistatic interactions, where one of the interacting markers was solely associated with seed oil concentration, were identified for the selected traits in this study. The number of significant epistatic interactions was seven for yield, four for days to maturity, two for 100-seed weight, one for protein concentration, and one for plant height. The identified molecular markers associated with oil-related QTL in this study, which also have positive effects on other important traits such as seed yield and protein concentration, could be used in the soybean marker breeding programs aimed at developing either higher seed yield and oil concentration or higher seed protein and oil concentration per hectare. Alternatively, selecting complementary parents with greater breeding values due to positive epistatic interactions could lead to the development of higher oil soybean cultivars.
de Torres Zabala, Marta; Bennett, Mark H; Truman, William H; Grant, Murray R
2009-08-01
The importance of phytohormone balance is increasingly recognized as central to the outcome of plant-pathogen interactions. Recently it has been demonstrated that abscisic acid signalling pathways are utilized by the bacterial phytopathogen Pseudomonas syringae to promote pathogenesis. In this study, we examined the dynamics, inter-relationship and impact of three key acidic phytohormones, salicylic acid, abscisic acid and jasmonic acid, and the bacterial virulence factor, coronatine, during progression of P. syringae infection of Arabidopsis thaliana. We show that levels of SA and ABA, but not JA, appear to play important early roles in determining the outcome of the infection process. SA is required in order to mount a full innate immune responses, while bacterial effectors act rapidly to activate ABA biosynthesis. ABA suppresses inducible innate immune responses by down-regulating SA biosynthesis and SA-mediated defences. Mutant analyses indicated that endogenous ABA levels represent an important reservoir that is necessary for effector suppression of plant-inducible innate defence responses and SA synthesis prior to subsequent pathogen-induced increases in ABA. Enhanced susceptibility due to loss of SA-mediated basal resistance is epistatically dominant over acquired resistance due to ABA deficiency, although ABA also contributes to symptom development. We conclude that pathogen-modulated ABA signalling rapidly antagonizes SA-mediated defences. We predict that hormonal perturbations, either induced or as a result of environmental stress, have a marked impact on pathological outcomes, and we provide a mechanistic basis for understanding priming events in plant defence.
The effect of gene interactions on the long-term response to selection.
Paixão, Tiago; Barton, Nicholas H
2016-04-19
The role of gene interactions in the evolutionary process has long been controversial. Although some argue that they are not of importance, because most variation is additive, others claim that their effect in the long term can be substantial. Here, we focus on the long-term effects of genetic interactions under directional selection assuming no mutation or dominance, and that epistasis is symmetrical overall. We ask by how much the mean of a complex trait can be increased by selection and analyze two extreme regimes, in which either drift or selection dominate the dynamics of allele frequencies. In both scenarios, epistatic interactions affect the long-term response to selection by modulating the additive genetic variance. When drift dominates, we extend Robertson's [Robertson A (1960)Proc R Soc Lond B Biol Sci153(951):234-249] argument to show that, for any form of epistasis, the total response of a haploid population is proportional to the initial total genotypic variance. In contrast, the total response of a diploid population is increased by epistasis, for a given initial genotypic variance. When selection dominates, we show that the total selection response can only be increased by epistasis when some initially deleterious alleles become favored as the genetic background changes. We find a simple approximation for this effect and show that, in this regime, it is the structure of the genotype-phenotype map that matters and not the variance components of the population.
Alleles versus genotypes: Genetic interactions and the dynamics of selection in sexual populations
NASA Astrophysics Data System (ADS)
Neher, Richard
2010-03-01
Physical interactions between amino-acids are essential for protein structure and activity, while protein-protein interactions and regulatory interactions are central to cellular function. As a consequence of these interactions, the combined effect of two mutations can differ from the sum of the individual effects of the mutations. This phenomenon of genetic interaction is known as epistasis. However, the importance of epistasis and its effects on evolutionary dynamics are poorly understood, especially in sexual populations where recombination breaks up existing combinations of alleles to produce new ones. Here, we present a computational model of selection dynamics involving many epistatic loci in a recombining population. We demonstrate that a large number of polymorphic interacting loci can, despite frequent recombination, exhibit cooperative behavior that locks alleles into favorable genotypes leading to a population consisting of a set of competing clones. As the recombination rate exceeds a certain critical value this ``genotype selection'' phase disappears in an abrupt transition giving way to ``allele selection'' - the phase where different loci are only weakly correlated as expected in sexually reproducing populations. Clustering of interacting sets of genes on a chromosome leads to the emergence of an intermediate regime, where localized blocks of cooperating alleles lock into genetic modules. Large populations attain highest fitness at a recombination rate just below critical, suggesting that natural selection might tune recombination rates to balance the beneficial aspect of exploration of genotype space with the breaking up of synergistic allele combinations.
Hidding, E; Swaab, H; de Sonneville, L M J; van Engeland, H; Vorstman, J A S
2016-11-01
This paper examines how COMT 158 genotypes and plasma proline levels are associated with variable penetrance of social behavioural and social cognitive problems in 22q11.2 deletion syndrome (22q11DS). Severity of autistic spectrum symptoms of 45 participants with 22q11DS was assessed using the Autism Diagnostic Interview Revised. Face and facial emotion recognition was evaluated using standardized computer-based test-paradigms. Associations with COMT 158 genotypes and proline levels were examined. High proline levels and poor face recognition in individuals with the COMT MET allele, and poor facial emotion recognition, explained almost 50% of the variance in severity of autism symptomatology in individuals with 22q11DS. High proline levels and a decreased capacity to break down dopamine as a result of the COMT MET variant are both relevant in the expression of the social phenotype in patients. This epistatic interaction effect between the COMT 158 genotype and proline on the expression of social deficits in 22q11DS shows how factors other than the direct effects of the deletion itself can modulate the penetrance of associated cognitive and behavioural outcomes. These findings are not only relevant to our insight into 22q11DS, but also provide a model to better understand the phenomenon of variable penetrance in other pathogenic genetic variants. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Mallik, Saurav; Basu, Sudipto; Hait, Suman; Kundu, Sudip
2018-04-21
Do coding and regulatory segments of a gene co-evolve with each-other? Seeking answers to this question, here we analyze the case of Escherichia coli ribosomal protein S15, that represses its own translation by specifically binding its messenger RNA (rpsO mRNA) and stabilizing a pseudoknot structure at the upstream untranslated region, thus trapping the ribosome into an incomplete translation initiation complex. In the absence of S15, ribosomal protein S1 recognizes rpsO and promotes translation by melting this very pseudoknot. We employ a robust statistical method to detect signatures of positive epistasis between residue site pairs and find that biophysical constraints of translational regulation (S15-rpsO and S1-rpsO recognition, S15-mediated rpsO structural rearrangement, and S1-mediated melting) are strong predictors of positive epistasis. Transforming the epistatic pairs into a network, we find that signatures of two different, but interconnected regulatory cascades are imprinted in the sequence-space and can be captured in terms of two dense network modules that are sparsely connected to each other. This network topology further reflects a general principle of how functionally coupled components of biological networks are interconnected. These results depict a model case, where translational regulation drives characteristic residue-level epistasis-not only between a protein and its own mRNA but also between a protein and the mRNA of an entirely different protein. © 2018 Wiley Periodicals, Inc.
Epistasis in protein evolution
Starr, Tyler N.
2016-01-01
Abstract The structure, function, and evolution of proteins depend on physical and genetic interactions among amino acids. Recent studies have used new strategies to explore the prevalence, biochemical mechanisms, and evolutionary implications of these interactions—called epistasis—within proteins. Here we describe an emerging picture of pervasive epistasis in which the physical and biological effects of mutations change over the course of evolution in a lineage‐specific fashion. Epistasis can restrict the trajectories available to an evolving protein or open new paths to sequences and functions that would otherwise have been inaccessible. We describe two broad classes of epistatic interactions, which arise from different physical mechanisms and have different effects on evolutionary processes. Specific epistasis—in which one mutation influences the phenotypic effect of few other mutations—is caused by direct and indirect physical interactions between mutations, which nonadditively change the protein's physical properties, such as conformation, stability, or affinity for ligands. In contrast, nonspecific epistasis describes mutations that modify the effect of many others; these typically behave additively with respect to the physical properties of a protein but exhibit epistasis because of a nonlinear relationship between the physical properties and their biological effects, such as function or fitness. Both types of interaction are rampant, but specific epistasis has stronger effects on the rate and outcomes of evolution, because it imposes stricter constraints and modulates evolutionary potential more dramatically; it therefore makes evolution more contingent on low‐probability historical events and leaves stronger marks on the sequences, structures, and functions of protein families. PMID:26833806
Wang, Ben-Gang; Xu, Qian; Lv, Zhi; Fang, Xin-Xin; Ding, Han-Xi; Wen, Jing; Yuan, Yuan
2018-01-01
AIM To evaluate the association of 12 tag single nucleotide polymorphisms (tagSNPs) in three onco-long non-coding RNA (lncRNA) genes (HOTTIP, CCAT2, MALAT1) with the risk and prognosis of hepatocellular cancer (HCC). METHODS Twelve tagSNPs covering the three onco-lncRNAs were genotyped by the KASP method in a total of 1338 samples, including 521 HCC patients and frequency-matched 817 controls. The samples were obtained from an unrelated Chinese population at the First Hospital of China Medical University from 2012-2015. The expression quantitative trait loci (eQTL) analyses were conducted to explore further the potential function of the promising SNPs. RESULTS Three SNPs in HOTTIP, one promoter SNP in MALAT1, and one haplotype of HOTTIP were associated with HCC risk. The HOTTIP rs17501292, rs2067087, and rs17427960 SNPs were increased to 1.55-, 1.20-, and 1.18-fold HCC risk under allelic models (P = 0.012, 0.017 and 0.049, respectively). MALAT1 rs4102217 SNP was increased to a 1.32-fold HCC risk under dominant models (P = 0.028). In addition, the two-way interaction of HOTTIP rs17501292-MALAT1 rs619586 polymorphisms showed a decreased effect on HCC risk (Pinteraction = 0.028, OR = 0.30) and epistasis with each other. HOTTIP rs3807598 variant genotype showed significantly longer survival time in HBV negative subgroup (P = 0.049, HR = 0.12), and MALAT1 rs591291 showed significantly better prognosis in female and HBV negative subgroups (P = 0.022, HR = 0.37; P = 0.042, HR = 0.25, respectively). In the study, no significant effect was observed in eQTL analysis. CONCLUSION Specific lncRNA (HOTTIP and MALAT1) SNPs have potential to be biomarkers for HCC risk and prognosis. PMID:29930469
Moscou, Matthew J; Lauter, Nick; Steffenson, Brian; Wise, Roger P
2011-07-01
Stem rust (Puccinia graminis f. sp. tritici; Pgt) is a devastating fungal disease of wheat and barley. Pgt race TTKSK (isolate Ug99) is a serious threat to these Triticeae grain crops because resistance is rare. In barley, the complex Rpg-TTKSK locus on chromosome 5H is presently the only known source of qualitative resistance to this aggressive Pgt race. Segregation for resistance observed on seedlings of the Q21861 × SM89010 (QSM) doubled-haploid (DH) population was found to be predominantly qualitative, with little of the remaining variance explained by loci other than Rpg-TTKSK. In contrast, analysis of adult QSM DH plants infected by field inoculum of Pgt race TTKSK in Njoro, Kenya, revealed several additional quantitative trait loci that contribute to resistance. To molecularly characterize these loci, Barley1 GeneChips were used to measure the expression of 22,792 genes in the QSM population after inoculation with Pgt race TTKSK or mock-inoculation. Comparison of expression Quantitative Trait Loci (eQTL) between treatments revealed an inoculation-dependent expression polymorphism implicating Actin depolymerizing factor3 (within the Rpg-TTKSK locus) as a candidate susceptibility gene. In parallel, we identified a chromosome 2H trans-eQTL hotspot that co-segregates with an enhancer of Rpg-TTKSK-mediated, adult plant resistance discovered through the Njoro field trials. Our genome-wide eQTL studies demonstrate that transcript accumulation of 25% of barley genes is altered following challenge by Pgt race TTKSK, but that few of these genes are regulated by the qualitative Rpg-TTKSK on chromosome 5H. It is instead the chromosome 2H trans-eQTL hotspot that orchestrates the largest inoculation-specific responses, where enhanced resistance is associated with transcriptional suppression of hundreds of genes scattered throughout the genome. Hence, the present study associates the early suppression of genes expressed in this host-pathogen interaction with enhancement of R-gene mediated resistance.
Moscou, Matthew J.; Lauter, Nick; Steffenson, Brian; Wise, Roger P.
2011-01-01
Stem rust (Puccinia graminis f. sp. tritici; Pgt) is a devastating fungal disease of wheat and barley. Pgt race TTKSK (isolate Ug99) is a serious threat to these Triticeae grain crops because resistance is rare. In barley, the complex Rpg-TTKSK locus on chromosome 5H is presently the only known source of qualitative resistance to this aggressive Pgt race. Segregation for resistance observed on seedlings of the Q21861 × SM89010 (QSM) doubled-haploid (DH) population was found to be predominantly qualitative, with little of the remaining variance explained by loci other than Rpg-TTKSK. In contrast, analysis of adult QSM DH plants infected by field inoculum of Pgt race TTKSK in Njoro, Kenya, revealed several additional quantitative trait loci that contribute to resistance. To molecularly characterize these loci, Barley1 GeneChips were used to measure the expression of 22,792 genes in the QSM population after inoculation with Pgt race TTKSK or mock-inoculation. Comparison of expression Quantitative Trait Loci (eQTL) between treatments revealed an inoculation-dependent expression polymorphism implicating Actin depolymerizing factor3 (within the Rpg-TTKSK locus) as a candidate susceptibility gene. In parallel, we identified a chromosome 2H trans-eQTL hotspot that co-segregates with an enhancer of Rpg-TTKSK-mediated, adult plant resistance discovered through the Njoro field trials. Our genome-wide eQTL studies demonstrate that transcript accumulation of 25% of barley genes is altered following challenge by Pgt race TTKSK, but that few of these genes are regulated by the qualitative Rpg-TTKSK on chromosome 5H. It is instead the chromosome 2H trans-eQTL hotspot that orchestrates the largest inoculation-specific responses, where enhanced resistance is associated with transcriptional suppression of hundreds of genes scattered throughout the genome. Hence, the present study associates the early suppression of genes expressed in this host–pathogen interaction with enhancement of R-gene mediated resistance. PMID:21829384
Reid, Brett M; Permuth, Jennifer B; Chen, Y Ann; Teer, Jamie K; Monteiro, Alvaro N A; Chen, Zhihua; Tyrer, Jonathan; Berchuck, Andrew; Chenevix-Trench, Georgia; Doherty, Jennifer A; Goode, Ellen L; Iverson, Edwin S; Lawrenson, Kate; Pearce, Celeste L; Pharoah, Paul D; Phelan, Catherine M; Ramus, Susan J; Rossing, Mary Anne; Schildkraut, Joellen M; Cheng, Jin Q; Gayther, Simon A; Sellers, Thomas A
2017-01-01
Genome-wide association studies (GWAS) have identified multiple loci associated with epithelial ovarian cancer (EOC) susceptibility, but further progress requires integration of epidemiology and biology to illuminate true risk loci below genome-wide significance levels (P < 5 × 10 -8 ). Most risk SNPs lie within non-protein-encoding regions, and we hypothesize that long noncoding RNA (lncRNA) genes are enriched at EOC risk regions and represent biologically relevant functional targets. Using imputed GWAS data from about 18,000 invasive EOC cases and 34,000 controls of European ancestry, the GENCODE (v19) lncRNA database was used to annotate SNPs from 13,442 lncRNAs for permutation-based enrichment analysis. Tumor expression quantitative trait locus (eQTL) analysis was performed for sub-genome-wide regions (1 × 10 -5 > P > 5 × 10 -8 ) overlapping lncRNAs. Of 5,294 EOC-associated SNPs (P < 1.0 × 10 -5 ), 1,464 (28%) mapped within 53 unique lncRNAs and an additional 3,484 (66%) SNPs were correlated (r 2 > 0.2) with SNPs within 115 lncRNAs. EOC-associated SNPs comprised 130 independent regions, of which 72 (55%) overlapped with lncRNAs, representing a significant enrichment (P = 5.0 × 10 -4 ) that was more pronounced among a subset of 5,401 lncRNAs with active epigenetic regulation in normal ovarian tissue. EOC-associated lncRNAs and their putative promoters and transcription factors were enriched for biologically relevant pathways and eQTL analysis identified five novel putative risk regions with allele-specific effects on lncRNA gene expression. lncRNAs are significantly enriched at EOC risk regions, suggesting a mechanistic role for lncRNAs in driving predisposition to EOC. lncRNAs represent key candidates for integrative epidemiologic and functional studies. Further research on their biologic role in ovarian cancer is indicated. Cancer Epidemiol Biomarkers Prev; 26(1); 116-25. ©2016 AACR. ©2016 American Association for Cancer Research.
Genotype-based gene signature of glioma risk.
Huang, Yen-Tsung; Zhang, Yi; Wu, Zhijin; Michaud, Dominique S
2017-07-01
Glioma accounts for 80% of malignant brain tumors, but its etiologic determinants remain elusive. Despite genetic susceptibility loci identified by genome-wide association study (GWAS), the agnostic approach leaves open the possibility that other susceptibility genes remain to be discovered. Here we conduct a gene-centric integrative GWAS (iGWAS) of glioma risk that combines transcriptomics and genetics. We synthesized a brain transcriptomics dataset (n = 354), a GWAS dataset (n = 4203), and an advanced glioma tumor transcriptomic dataset (n = 483) to conduct an iGWAS. Using the expression quantitative trait loci (eQTL) dataset, we built models to predict gene expression for the GWAS data, based on eQTL genotypes. With the predicted gene expression, iGWAS analyses were performed using a novel statistical method. Gene signature risk score was constructed using a penalized logistic regression model. A total of 30527 transcripts were analyzed using the iGWAS approach. Four novel glioma susceptibility genes were identified with internal and external validation, including DRD5 (P = 3.0 × 10-79), WDR1 (P = 8.4 × 10-77), NOMO1 (P = 1.3 × 10-25), and PDXDC1 (P = 8.3 × 10-24). The genotype-predicted transcription pattern between cases and controls is consistent with that between tumor and its matched normal tissue. The genotype-based 4-gene signature improved the classification between glioma cases and controls based on age, gender, and population stratification, with area under the receiver operating characteristic curve increasing from 0.77 to 0.85 (P = 8.1 × 10-23). A new genotype-based gene signature of glioma was identified using a novel iGWAS approach, which integrates multiplatform genomic data as well as different genetic association studies. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Neuro-Oncology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
Lamontagne, Maxime; Timens, Wim; Hao, Ke; Bossé, Yohan; Laviolette, Michel; Steiling, Katrina; Campbell, Joshua D; Couture, Christian; Conti, Massimo; Sherwood, Karen; Hogg, James C; Brandsma, Corry-Anke; van den Berge, Maarten; Sandford, Andrew; Lam, Stephen; Lenburg, Marc E; Spira, Avrum; Paré, Peter D; Nickle, David; Sin, Don D; Postma, Dirkje S
2014-11-01
COPD is a complex chronic disease with poorly understood pathogenesis. Integrative genomic approaches have the potential to elucidate the biological networks underlying COPD and lung function. We recently combined genome-wide genotyping and gene expression in 1111 human lung specimens to map expression quantitative trait loci (eQTL). To determine causal associations between COPD and lung function-associated single nucleotide polymorphisms (SNPs) and lung tissue gene expression changes in our lung eQTL dataset. We evaluated causality between SNPs and gene expression for three COPD phenotypes: FEV(1)% predicted, FEV(1)/FVC and COPD as a categorical variable. Different models were assessed in the three cohorts independently and in a meta-analysis. SNPs associated with a COPD phenotype and gene expression were subjected to causal pathway modelling and manual curation. In silico analyses evaluated functional enrichment of biological pathways among newly identified causal genes. Biologically relevant causal genes were validated in two separate gene expression datasets of lung tissues and bronchial airway brushings. High reliability causal relations were found in SNP-mRNA-phenotype triplets for FEV(1)% predicted (n=169) and FEV(1)/FVC (n=80). Several genes of potential biological relevance for COPD were revealed. eQTL-SNPs upregulating cystatin C (CST3) and CD22 were associated with worse lung function. Signalling pathways enriched with causal genes included xenobiotic metabolism, apoptosis, protease-antiprotease and oxidant-antioxidant balance. By using integrative genomics and analysing the relationships of COPD phenotypes with SNPs and gene expression in lung tissue, we identified CST3 and CD22 as potential causal genes for airflow obstruction. This study also augmented the understanding of previously described COPD pathways. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Kim, Sewon; Kim, Kyunglan; Heo, Dong Won; Kim, Jong-Sung; Park, Chan Kee; Kim, Chang-sik
2015-01-01
Purpose The human CAV1-CAV2 locus has been associated with susceptibility to primary open-angle glaucoma in four studies of Caucasian, Chinese, and Pakistani populations, although not in several other studies of non-Korean populations. In this study with Korean participants, the CAV1-CAV2 locus was investigated for associations with susceptibility to primary open-angle glaucoma accompanied by elevated intraocular pressure (IOP), namely, high-tension glaucoma (HTG), as well as with IOP elevation, which is a strong risk factor for glaucoma. Methods Two single nucleotide polymorphisms (SNPs) were genotyped in 1,161 Korean participants including 229 patients with HTG and 932 healthy controls and statistically examined for association with HTG susceptibility and IOP. One SNP was rs4236601 G>A, which had been reported in the original study, and the other SNP was rs17588172 T>G, which was perfectly correlated (r2=1) with another reported SNP rs1052990. Expression quantitative trait loci (eQTL) analysis was performed using GENe Expression VARiation (Genevar) data. Results Both SNPs were associated with HTG susceptibility, but the rs4236601 association disappeared when adjusted for the rs17588172 genotype and not vice versa. The minor allele G of rs17588172 was associated significantly with 1.5-fold increased susceptibility to HTG (p=0.0069) and marginally with IOP elevation (p=0.043) versus the major allele T. This minor allele was also associated with decreased CAV1 and CAV2 mRNA in skin and adipose according to the Genevar eQTL analysis. Conclusions The minor allele G of rs17588172 in the CAV1-CAV2 locus is associated with decreased expression of CAV1 and CAV2 in some tissues, marginally with IOP elevation, and consequently with increased susceptibility to HTG. PMID:26015768
Pinsonneault, Julia K; Frater, John T; Kompa, Benjamin; Mascarenhas, Roshan; Wang, Danxin; Sadee, Wolfgang
2017-01-01
Genetic variants of ESR1 have been implicated in multiple diseases, including behavioral disorders, but causative variants remain uncertain. We have searched for regulatory variants affecting ESR1 expression in human brain, measuring allelic ESR1 mRNA expression in human brain tissues with marker SNPs in exon4 representing ESR1-008 (or ESRα-36), and in the 3'UTR of ESR1-203, two main ESR1 isoforms in brain. In prefrontal cortex from subjects with bipolar disorder, schizophrenia, and controls (n = 35 each; Stanley Foundation brain bank), allelic ESR1 mRNA ratios deviated from unity up to tenfold at the exon4 marker SNP, with large allelic ratios observed primarily in bipolar and schizophrenic subjects. SNP scanning and targeted sequencing identified rs2144025, associated with large allelic mRNA ratios (p = 1.6E10-6). Moreover, rs2144025 was significantly associated with ESR1 mRNA levels in the Brain eQTL Almanac and in brain regions in the Genotype-Tissue Expression project. In four GWAS cohorts, rs2104425 was significantly associated with behavioral traits, including: hypomanic episodes in female bipolar disorder subjects (GAIN bipolar disorder study; p = 0.0004), comorbid psychological symptoms in both males and females with attention deficit hyperactivity disorder (GAIN ADHD, p = 0.00002), psychological diagnoses in female children (eMERGE study of childhood health, subject age ≥9, p = 0.0009), and traits in schizophrenia (e.g., grandiose delusions, GAIN schizophrenia, p = 0.0004). The first common ESR1 variant (MAF 12-33% across races) linked to regulatory functions, rs2144025 appears conditionally to affect ESR1 mRNA expression in the brain and modulate traits in behavioral disorders.
Kompa, Benjamin; Mascarenhas, Roshan; Wang, Danxin; Sadee, Wolfgang
2017-01-01
Genetic variants of ESR1 have been implicated in multiple diseases, including behavioral disorders, but causative variants remain uncertain. We have searched for regulatory variants affecting ESR1 expression in human brain, measuring allelic ESR1 mRNA expression in human brain tissues with marker SNPs in exon4 representing ESR1-008 (or ESRα-36), and in the 3’UTR of ESR1-203, two main ESR1 isoforms in brain. In prefrontal cortex from subjects with bipolar disorder, schizophrenia, and controls (n = 35 each; Stanley Foundation brain bank), allelic ESR1 mRNA ratios deviated from unity up to tenfold at the exon4 marker SNP, with large allelic ratios observed primarily in bipolar and schizophrenic subjects. SNP scanning and targeted sequencing identified rs2144025, associated with large allelic mRNA ratios (p = 1.6E10-6). Moreover, rs2144025 was significantly associated with ESR1 mRNA levels in the Brain eQTL Almanac and in brain regions in the Genotype-Tissue Expression project. In four GWAS cohorts, rs2104425 was significantly associated with behavioral traits, including: hypomanic episodes in female bipolar disorder subjects (GAIN bipolar disorder study; p = 0.0004), comorbid psychological symptoms in both males and females with attention deficit hyperactivity disorder (GAIN ADHD, p = 0.00002), psychological diagnoses in female children (eMERGE study of childhood health, subject age ≥9, p = 0.0009), and traits in schizophrenia (e.g., grandiose delusions, GAIN schizophrenia, p = 0.0004). The first common ESR1 variant (MAF 12–33% across races) linked to regulatory functions, rs2144025 appears conditionally to affect ESR1 mRNA expression in the brain and modulate traits in behavioral disorders. PMID:28617822
Faraji, Farhoud; Hu, Ying; Wu, Gang; Goldberger, Natalie E.; Walker, Renard C.; Zhang, Jinghui; Hunter, Kent W.
2014-01-01
Metastasis is the result of stochastic genomic and epigenetic events leading to gene expression profiles that drive tumor dissemination. Here we exploit the principle that metastatic propensity is modified by the genetic background to generate prognostic gene expression signatures that illuminate regulators of metastasis. We also identify multiple microRNAs whose germline variation is causally linked to tumor progression and metastasis. We employ network analysis of global gene expression profiles in tumors derived from a panel of recombinant inbred mice to identify a network of co-expressed genes centered on Cnot2 that predicts metastasis-free survival. Modulating Cnot2 expression changes tumor cell metastatic potential in vivo, supporting a functional role for Cnot2 in metastasis. Small RNA sequencing of the same tumor set revealed a negative correlation between expression of the Mir216/217 cluster and tumor progression. Expression quantitative trait locus analysis (eQTL) identified cis-eQTLs at the Mir216/217 locus, indicating that differences in expression may be inherited. Ectopic expression of Mir216/217 in tumor cells suppressed metastasis in vivo. Finally, small RNA sequencing and mRNA expression profiling data were integrated to reveal that miR-3470a/b target a high proportion of network transcripts. In vivo analysis of Mir3470a/b demonstrated that both promote metastasis. Moreover, Mir3470b is a likely regulator of the Cnot2 network as its overexpression down-regulated expression of network hub genes and enhanced metastasis in vivo, phenocopying Cnot2 knockdown. The resulting data from this strategy identify Cnot2 as a novel regulator of metastasis and demonstrate the power of our systems-level approach in identifying modifiers of metastasis. PMID:24322557
Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma.
Law, Matthew H; Bishop, D Timothy; Lee, Jeffrey E; Brossard, Myriam; Martin, Nicholas G; Moses, Eric K; Song, Fengju; Barrett, Jennifer H; Kumar, Rajiv; Easton, Douglas F; Pharoah, Paul D P; Swerdlow, Anthony J; Kypreou, Katerina P; Taylor, John C; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A; Andresen, Per A; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M; Dębniak, Tadeusz; Duffy, David L; Elder, David E; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M; Goldstein, Alisa M; Gruis, Nelleke A; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A; Chen, Wei V; Landi, Maria Teresa; Lang, Julie; Lathrop, G Mark; Lubiński, Jan; Mackie, Rona M; Mann, Graham J; Molven, Anders; Montgomery, Grant W; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A; Radford-Smith, Graham L; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C; Craig, Jamie E; Schadendorf, Dirk; Simms, Lisa A; Burdon, Kathryn P; Nyholt, Dale R; Pooley, Karen A; Orr, Nick; Stratigos, Alexander J; Cust, Anne E; Ward, Sarah V; Hayward, Nicholas K; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M; Bishop, Julia A Newton; Demenais, Florence; Amos, Christopher I; MacGregor, Stuart; Iles, Mark M
2015-09-01
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5 × 10(-8)), as did 2 previously reported but unreplicated loci and all 13 established loci. Newly associated SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes in the associated regions, including one involved in telomere biology.
Genome-wide meta-analysis identifies five new susceptibility loci for cutaneous malignant melanoma
Law, Matthew H.; Bishop, D. Timothy; Martin, Nicholas G.; Moses, Eric K.; Song, Fengju; Barrett, Jennifer H.; Kumar, Rajiv; Easton, Douglas F.; Pharoah, Paul D. P.; Swerdlow, Anthony J.; Kypreou, Katerina P.; Taylor, John C.; Harland, Mark; Randerson-Moor, Juliette; Akslen, Lars A.; Andresen, Per A.; Avril, Marie-Françoise; Azizi, Esther; Scarrà, Giovanna Bianchi; Brown, Kevin M.; Dębniak, Tadeusz; Duffy, David L.; Elder, David E.; Fang, Shenying; Friedman, Eitan; Galan, Pilar; Ghiorzo, Paola; Gillanders, Elizabeth M.; Goldstein, Alisa M.; Gruis, Nelleke A.; Hansson, Johan; Helsing, Per; Hočevar, Marko; Höiom, Veronica; Ingvar, Christian; Kanetsky, Peter A.; Chen, Wei V.; Landi, Maria Teresa; Lang, Julie; Lathrop, G. Mark; Lubiński, Jan; Mackie, Rona M.; Mann, Graham J.; Molven, Anders; Montgomery, Grant W.; Novaković, Srdjan; Olsson, Håkan; Puig, Susana; Puig-Butille, Joan Anton; Qureshi, Abrar A.; Radford-Smith, Graham L.; van der Stoep, Nienke; van Doorn, Remco; Whiteman, David C.; Craig, Jamie E.; Schadendorf, Dirk; Simms, Lisa A.; Burdon, Kathryn P.; Nyholt, Dale R.; Pooley, Karen A.; Orr, Nick; Stratigos, Alexander J.; Cust, Anne E.; Ward, Sarah V.; Hayward, Nicholas K.; Han, Jiali; Schulze, Hans-Joachim; Dunning, Alison M.; Bishop, Julia A. Newton; MacGregor, Stuart; Iles, Mark M.
2015-01-01
Thirteen common susceptibility loci have been reproducibly associated with cutaneous malignant melanoma (CMM). We report the results of an international 2-stage meta-analysis of CMM genome-wide association studies (GWAS). This meta-analysis combines 11 GWAS (5 previously unpublished) and a further three stage 2 data sets, totaling 15,990 CMM cases and 26,409 controls. Five loci not previously associated with CMM risk reached genome-wide significance (P < 5×10–8), as did two previously-reported but un-replicated loci and all thirteen established loci. Novel SNPs fall within putative melanocyte regulatory elements, and bioinformatic and expression quantitative trait locus (eQTL) data highlight candidate genes including one involved in telomere biology. PMID:26237428
Urwin, Ruth Elizabeth; Nunn, Kenneth Patrick
2005-03-01
The serotonin (5-HT) and norepinephrine (NE) systems are likely involved in the aetiology of anorexia nervosa (AN) as sufferers are premorbidly anxious. Specifically, we hypothesize that genes encoding proteins, which clear 5-HT and NE from the synapse, are prime candidates for affecting susceptibility to AN. Supporting our hypothesis, we earlier showed that the NE transporter (NET) and monoamine oxidase A (MAOA) genes appear to contribute additively to increased risk of developing restricting AN (AN-R). With regard to the MAOA gene, a sequence variant that increases MAOA activity and has suggested association with the anxiety condition, panic disorder was preferentially transmitted from parents to affected children. Here we provide evidence in support of interaction between the MAOA and serotonin transporter (SERT) genes in 114 AN nuclear families (patient with AN plus biological parents). A SERT gene genotype with no apparent individual effect on risk and known to be associated with anxiety is preferentially transmitted to children with AN (chi2 trend=9.457, 1 df, P=0.0021) and AN-R alone (chi2 trend=7.477, 1 df, P=0.0063) when the 'more active' MAOA gene variant is also transmitted. The increased risk of developing the disorder is up to eight times greater than the risk imposed by the MAOA gene variant alone--an example of synergistic epistatic interaction. If independently replicated, our findings to date suggest that we may have identified three genes affecting susceptibility to AN, particularly AN-R: the MAOA, SERT, and NET genes.
Multigenerational hybridisation and its consequences for maternal effects in Atlantic salmon
Debes, P V; Fraser, D J; McBride, M C; Hutchings, J A
2013-01-01
Outbreeding between segregating populations can be important from an evolutionary, conservation and economical-agricultural perspective. Whether and how outbreeding influences maternal effects in wild populations has rarely been studied, despite both the prominent maternal influence on early offspring survival and the known presence of fitness effects resulting from outbreeding in many taxa. We studied several traits during the yolk-feeding stage in multigenerational crosses between a wild and a domesticated Atlantic salmon (Salmo salar) population up to their third-generation hybrid in a common laboratory environment. Using cross-means analysis, we inferred that maternal additive outbreeding effects underlie most offspring traits but that yolk mass also underlies maternal dominant effects. As a consequence of the interplay between additive and dominant maternally controlled traits, offspring from first-generation hybrid mothers expressed an excessive proportion of residual yolk mass, relative to total mass, at the time of first feeding. Their residual yolk mass was 23–97% greater than those of other crosses and 31% more than that predicted by a purely additive model. Offspring additive, epistatic and epistatic offspring-by-maternal outbreeding effects appeared to further modify this largely maternally controlled cross-means pattern, resulting in an increase in offspring size with the percentage of domesticated alleles. Fitness implications remain elusive because of unknown phenotype-by-environment interactions. However, these results suggest how mechanistically co-adapted genetic maternal control on early offspring development can be disrupted by the effects of combining alleles from divergent populations. Complex outbreeding effects at both the maternal and offspring levels make the prediction of hybrid phenotypes difficult. PMID:23652564
Asselbergs, Folkert W.; Williams, Scott M.; Hebert, Patricia R.; Coffey, Christopher S.; Hillege, Hans L.; Navis, Gerjan; Vaughan, Douglas E.; van Gilst, Wiek H.; Moore, Jason H.
2007-01-01
Tissue plasminogen activator (t-PA) and plasminogen activator inhibitor 1 (PAI-1) directly influence thrombus formation and degradation and thereby risk for arterial thrombosis. Activation of the renin-angiotensin system has been linked to the production of PAI-1 expression via the angiotensin II type 1 receptor (AT1R). In addition, bradykinin can induce the release of t-PA through a B2 receptor mechanism. In the present study, we aimed to investigate the epistatic effects of polymorphisms in genes from the renin-angiotensin, bradykinin and fibrinolytic systems on plasma t-PA and PAI-1 levels in a large population-based sample (n=2,527). We demonstrated a strong significant interaction within genetic variations of the bradykinin B2 gene (p=0.002) and between ACE and bradykinin B2 (p=0.003) polymorphisms on t-PA levels in females. In males, polymorphisms in the bradykinin B2 and AT1R gene showed the most strong effect on t-PA levels (p=0.006). In both females as well as males, the bradykinin B2 gene interacted with AT1R gene on plasma PAI-1 levels (p=0.026 and p=0.039, respectively). In addition, the current study found a borderline significant interaction between PAI 4G5G and ACE I/D on plasma t-PA and PAI-1 levels. These results support the idea that the interplay between the renin-angiotensin, bradykinin, and fibrinolytic systems might play an important role in t-PA and PAI-1 biology. PMID:17207964
Deepe, Raymond N; Kistner-Griffin, Emily; Martin, Jeffrey N; Deeks, Steven G; Pandey, Janardan P
2012-03-01
Host genetic factors are thought to contribute to the interindividual differences in the control of human immunodeficiency virus (HIV) replication. The aim of the present investigation was to determine whether genes encoding GM and KM allotypes-genetic markers of immunoglobulin γ and κ chains, respectively-and those encoding Fcγ receptor (FcγR) IIa and IIIa are associated with the host control of HIV replication. A case-control design was employed among HIV-infected subjects, with a group that spontaneously controlled HIV replication ("controllers") as cases (n = 73) and those who did not control replication as controls (n = 100). Genotyping was performed by polymerase chain reaction-restriction fragment length polymorphism, direct DNA sequencing, and TaqMan genotyping assays. In Caucasian Americans, certain combinations of FcγR and GM genotypes were differentially distributed between controllers and noncontrollers. Among the noncarriers of the FcγRIIa arginine allele, GM21 noncarriers had over 7-fold greater odds of being controllers than the carriers of this allele (odds ratio [OR] = 7.47). These GM determinants also interacted with FcγRIIIa alleles. Among the carriers of the FcγRIIIa valine allele, GM21 noncarriers had over 3-fold greater odds of being controllers than the carriers of this allele (OR = 3.26). These results demonstrate epistatic interactions of genes on chromosomes 14 (GM) and 1 (FcγR) in influencing the control of HIV replication. Copyright © 2012 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.
Burgio, Gaétan; Baylac, Michel; Heyer, Evelyne; Montagutelli, Xavier
2012-01-01
Background Genetic determinism of cranial morphology in the mouse is still largely unknown, despite the localization of putative QTLs and the identification of genes associated with Mendelian skull malformations. To approach the dissection of this multigenic control, we have used a set of interspecific recombinant congenic strains (IRCS) produced between C57BL/6 and mice of the distant species Mus spretus (SEG/Pas). Each strain has inherited 1.3% of its genome from SEG/Pas under the form of few, small-sized, chromosomal segments. Results The shape of the nasal bone was studied using outline analysis combined with Fourier descriptors, and differential features were identified between IRCS BcG-66H and C57BL/6. An F2 cross between BcG-66H and C57BL/6 revealed that, out of the three SEG/Pas-derived chromosomal regions present in BcG-66H, two were involved. Segments on chromosomes 1 (∼32 Mb) and 18 (∼13 Mb) showed additive effect on nasal bone shape. The three chromosomal regions present in BcG-66H were isolated in congenic strains to study their individual effect. Epistatic interactions were assessed in bicongenic strains. Conclusions Our results show that, besides a strong individual effect, the QTL on chromosome 1 interacts with genes on chromosomes 13 and 18. This study demonstrates that nasal bone shape is under complex genetic control but can be efficiently dissected in the mouse using appropriate genetic tools and shape descriptors. PMID:22662199
Characterization of a Moraxella species that causes epistaxis in macaques
Embers, Monica E.; Doyle, Lara A.; Whitehouse, Chris A.; Selby, Edward B.; Chappell, Mark; Philipp, Mario T.
2014-01-01
Bacteria of the genus Moraxella have been isolated from a variety of mammalian hosts. In a prior survey of bacteria that colonize the rhesus macaque nasopharynx, performed at the Tulane National Primate Research Center, organisms of the Moraxella genus were isolated from animals with epistaxis, or “bloody nose syndrome.” They were biochemically identified as Moraxella catarrhalis, and cryopreserved. Another isolate was obtained from an epistatic cynomolgus macaque at the U.S. Army Medical Research Institute of Infectious Diseases. Based on differences in colony and cell morphologies between rhesus and human M. catarrhalis isolates, we hypothesized that the nonhuman primate Moraxella might instead be a different species. Despite morphological differences, the rhesus isolates, by several biochemical tests, were indistinguishable from M. catarrhalis. Analysis of the cynomolgus isolate by Vitek 2 Compact indicated that it belonged to a Moraxella group, but could not differentiate among species. However, sequencing of the 16S ribosomal RNA gene from four representative rhesus isolates and the cynomolgus isolate showed closest homology to Moraxella lincolnii, a human respiratory tract inhabitant, with 90.16% identity. To examine rhesus macaques as potential hosts for M. catarrhalis, eight animals were inoculated with human M. catarrhalis isolates. Only one of the animals was colonized and showed disease, whereas four of four macaques became epistatic after inoculation with the rhesus Moraxella isolate. The nasopharyngeal isolates in this study appear uniquely adapted to a macaque host and, though they share many of the phenotypic characteristics of M. catarrhalis, appear to form a genotypically distinct species. PMID:20667430
Roux, Fabrice; Camilleri, Christine; Giancola, Sandra; Brunel, Dominique; Reboud, Xavier
2005-01-01
The type of interactions among deleterious mutations is considered to be crucial in numerous areas of evolutionary biology, including the evolution of sex and recombination, the evolution of ploidy, the evolution of selfing, and the conservation of small populations. Because the herbicide resistance genes could be viewed as slightly deleterious mutations in the absence of the pesticide selection pressure, the epistatic interactions among three herbicide resistance genes (acetolactate synthase CSR, cellulose synthase IXR1, and auxin-induced AXR1 target genes) were estimated in both the homozygous and the heterozygous states, giving 27 genotype combinations in the model plant Arabidopsis thaliana. By analyzing eight quantitative traits in a segregating population for the three herbicide resistances in the absence of herbicide, we found that most interactions in both the homozygous and the heterozygous states were best explained by multiplicative effects (each additional resistance gene causes a comparable reduction in fitness) rather than by synergistic effects (each additional resistance gene causes a disproportionate fitness reduction). Dominance coefficients of the herbicide resistance cost ranged from partial dominance to underdominance, with a mean dominance coefficient of 0.07. It was suggested that the csr1-1, ixr1-2, and axr1-3 resistance alleles are nearly fully recessive for the fitness cost. More interestingly, the dominance of a specific resistance gene in the absence of herbicide varied according to, first, the presence of the other resistance genes and, second, the quantitative trait analyzed. These results and their implications for multiresistance evolution are discussed in relation to the maintenance of polymorphism at resistance loci in a heterogeneous environment. PMID:16020787
Rodriguez-Murillo, Laura; Subaran, Ryan; Stewart, William C. L.; Pramanik, Sreemanta; Marathe, Sudhir; Barst, Robyn J.; Chung, Wendy K.; Greenberg, David A.
2009-01-01
Background Familial pulmonary arterial hypertension (FPAH) is a rare, autosomal-dominant inherited disease with low penetrance. Mutations in the Bone Morphogenetic Protein Receptor 2 (BMPR2) have been identified in at least 70% of FPAH patients. However, the lifetime penetrance of these BMPR2 mutations is 10-20%, suggesting that genetic and/or environmental modifiers are required for disease expression. Our goal in this study is to identify genetic loci that may influence FPAH expression in BMPR2-mutation-carriers. Methods We performed a genome-wide linkage scan in 15 FPAH families segregating for BMPR2 mutations. We used a dense SNP array and a novel multi-scan linkage procedure that provides increased power and precision for the localization of linked loci. Results We observed linkage evidence in four regions: 3q22 (median LOD=3.43), 3p12 (median LOD = 2.35), 2p22 (median LOD = 2.21), and 13q21 (median LOD = 2.09). When used in conjunction with the nonparametric bootstrap, our approach yields high-resolution to identify candidate gene regions containing putative BMPR2-interacting genes. Imputation of the disease model by LOD score maximization indicates that the 3q22 locus alone predicts most FPAH cases in BMPR2-mutation carriers, providing strong evidence that BMPR2 and the 3q22 locus interact epistatically. Conclusions Our findings suggest that genotypes at loci in the newly-identified regions, especially at 3q22, could improve FPAH risk prediction in FPAH families and suggest other targets for therapeutic intervention. PMID:19864167
Rodriguez-Murillo, Laura; Subaran, Ryan; Stewart, William C L; Pramanik, Sreemanta; Marathe, Sudhir; Barst, Robyn J; Chung, Wendy K; Greenberg, David A
2010-02-01
Familial pulmonary arterial hypertension (FPAH) is a rare, autosomal-dominant, inherited disease with low penetrance. Mutations in the bone morphogenetic protein receptor 2 (BMPR2) have been identified in at least 70% of FPAH patients. However, the lifetime penetrance of these BMPR2 mutations is 10% to 20%, suggesting that genetic and/or environmental modifiers are required for disease expression. Our goal in this study was to identify genetic loci that may influence FPAH expression in BMPR2 mutation carriers. We performed a genome-wide linkage scan in 15 FPAH families segregating for BMPR2 mutations. We used a dense single-nucleotide polymorphism (SNP) array and a novel multi-scan linkage procedure that provides increased power and precision for the localization of linked loci. We observed linkage evidence in four regions: 3q22 ([median log of the odds (LOD) = 3.43]), 3p12 (median LOD) = 2.35), 2p22 (median LOD = 2.21), and 13q21 (median LOD = 2.09). When used in conjunction with the non-parametric bootstrap, our approach yields high-resolution to identify candidate gene regions containing putative BMPR2-interacting genes. Imputation of the disease model by LOD-score maximization indicates that the 3q22 locus alone predicts most FPAH cases in BMPR2 mutation carriers, providing strong evidence that BMPR2 and the 3q22 locus interact epistatically. Our findings suggest that genotypes at loci in the newly identified regions, especially at 3q22, could improve FPAH risk prediction in FPAH families. We also suggest other targets for therapeutic intervention.
Cromer, Laurence; Heyman, Jefri; Touati, Sandra; Harashima, Hirofumi; Araou, Emilie; Girard, Chloe; Horlow, Christine; Wassmann, Katja; Schnittger, Arp; De Veylder, Lieven; Mercier, Raphael
2012-01-01
Cell cycle control is modified at meiosis compared to mitosis, because two divisions follow a single DNA replication event. Cyclin-dependent kinases (CDKs) promote progression through both meiosis and mitosis, and a central regulator of their activity is the APC/C (Anaphase Promoting Complex/Cyclosome) that is especially required for exit from mitosis. We have shown previously that OSD1 is involved in entry into both meiosis I and meiosis II in Arabidopsis thaliana; however, the molecular mechanism by which OSD1 controls these transitions has remained unclear. Here we show that OSD1 promotes meiotic progression through APC/C inhibition. Next, we explored the functional relationships between OSD1 and the genes known to control meiotic cell cycle transitions in Arabidopsis. Like osd1, cyca1;2/tam mutation leads to a premature exit from meiosis after the first division, while tdm mutants perform an aberrant third meiotic division after normal meiosis I and II. Remarkably, while tdm is epistatic to tam, osd1 is epistatic to tdm. We further show that the expression of a non-destructible CYCA1;2/TAM provokes, like tdm, the entry into a third meiotic division. Finally, we show that CYCA1;2/TAM forms an active complex with CDKA;1 that can phosphorylate OSD1 in vitro. We thus propose that a functional network composed of OSD1, CYCA1;2/TAM, and TDM controls three key steps of meiotic progression, in which OSD1 is a meiotic APC/C inhibitor.
Rane, Rahul V; Rako, Lea; Kapun, Martin; Lee, Siu F; Hoffmann, Ary A
2015-05-01
Chromosomal inversion polymorphisms are common in animals and plants, and recent models suggest that alternative arrangements spread by capturing different combinations of alleles acting additively or epistatically to favour local adaptation. It is also thought that inversions typically maintain favoured combinations for a long time by suppressing recombination between alternative chromosomal arrangements. Here, we consider patterns of linkage disequilibrium and genetic divergence in an old inversion polymorphism in Drosophila melanogaster (In(3R)Payne) known to be associated with climate change adaptation and a recent invasion event into Australia. We extracted, karyotyped and sequenced whole chromosomes from two Australian populations, so that changes in the arrangement of the alleles between geographically separated tropical and temperate areas could be compared. Chromosome-wide linkage disequilibrium (LD) analysis revealed strong LD within the region spanned by In(3R)Payne. This genomic region also showed strong differentiation between the tropical and the temperate populations, but no differentiation between different karyotypes from the same population, after controlling for chromosomal arrangement. Patterns of differentiation across the chromosome arm and in gene ontologies were enhanced by the presence of the inversion. These data support the notion that inversions are strongly selected by bringing together combinations of genes, but it is still not clear if such combinations act additively or epistatically. Our data suggest that climatic adaptation through inversions can be dynamic, reflecting changes in the relative abundance of different forms of an inversion and ongoing evolution of allelic content within an inversion. © 2015 John Wiley & Sons Ltd.
Raadsma, H W; Jonas, E; Fleet, M R; Fullard, K; Gongora, J; Cavanagh, C R; Tammen, I; Thomson, P C
2013-08-01
The pursuits of white features and white fleeces free of pigmented fibre have been important selection objectives for many sheep breeds. The cause and inheritance of non-white colour patterns in sheep has been studied since the early 19th century. Discovery of genetic causes, especially those which predispose pigmentation in white sheep, may lead to more accurate selection tools for improved apparel wool. This article describes an extended QTL study for 13 skin and fibre pigmentation traits in sheep. A total of 19 highly significant, 10 significant and seven suggestive QTL were identified in a QTL mapping experiment using an Awassi × Merino × Merino backcross sheep population. All QTL on chromosome 2 exceeded a LOD score of greater than 4 (range 4.4-30.1), giving very strong support for a major gene for pigmentation on this chromosome. Evidence of epistatic interactions was found for QTL for four traits on chromosomes 2 and 19. The ovine TYRP1 gene on OAR 2 was sequenced as a strong positional candidate gene. A highly significant association (P < 0.01) of grandparental haplotypes across nine segregating SNP/microsatellite markers including one non-synonymous SNP with pigmentation traits could be shown. Up to 47% of the observed variation in pigmentation was accounted for by models using TYRP1 haplotypes and 83% for models with interactions between two QTL probabilities, offering scope for marker-assisted selection for these traits. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
Multi-location wheat stripe rust QTL analysis: genetic background and epistatic interactions.
Vazquez, M Dolores; Zemetra, Robert; Peterson, C James; Chen, Xianming M; Heesacker, Adam; Mundt, Christopher C
2015-07-01
Epistasis and genetic background were important influences on expression of stripe rust resistance in two wheat RIL populations, one with resistance conditioned by two major genes and the other conditioned by several minor QTL. Stripe rust is a foliar disease of wheat (Triticum aestivum L.) caused by the air-borne fungus Puccinia striiformis f. sp. tritici and is present in most regions around the world where commercial wheat is grown. Breeding for durable resistance to stripe rust continues to be a priority, but also is a challenge due to the complexity of interactions among resistance genes and to the wide diversity and continuous evolution of the pathogen races. The goal of this study was to detect chromosomal regions for resistance to stripe rust in two winter wheat populations, 'Tubbs'/'NSA-98-0995' (T/N) and 'Einstein'/'Tubbs' (E/T), evaluated across seven environments and mapped with diversity array technology and simple sequence repeat markers covering polymorphic regions of ≈1480 and 1117 cM, respectively. Analysis of variance for phenotypic data revealed significant (P < 0.01) genotypic differentiation for stripe rust among the recombinant inbred lines. Results for quantitative trait loci/locus (QTL) analysis in the E/T population indicated that two major QTL located in chromosomes 2AS and 6AL, with epistatic interaction between them, were responsible for the main phenotypic response. For the T/N population, eight QTL were identified, with those in chromosomes 2AL and 2BL accounting for the largest percentage of the phenotypic variance.
2014-01-01
Background Drought is one of the most important abiotic stresses that cause drastic reduction in rice grain yield (GY) in rainfed environments. The identification and introgression of QTL leading to high GY under drought have been advocated to be the preferred breeding strategy to improve drought tolerance of popular rice varieties. Genetic control of GY under reproductive-stage drought stress (RS) was studied in two BC1F4 mapping populations derived from crosses of Kali Aus, a drought-tolerant aus cultivar, with high-yielding popular varieties MTU1010 and IR64. The aim was to identify QTL for GY under RS that show a large and consistent effect for the trait. Bulk segregant analysis (BSA) was used to identify significant markers putatively linked with high GY under drought. Results QTL analysis revealed major-effect GY QTL: qDTY 1.2 , qDTY 2.2 and qDTY 1.3 , qDTY 2.3 (DTY; Drought grain yield) under drought consistently over two seasons in Kali Aus/2*MTU1010 and Kali Aus/2*IR64 populations, respectively. qDTY 1.2 and qDTY 2.2 explained an additive effect of 288 kg ha−1 and 567 kg ha−1 in Kali Aus/2*MTU1010, whereas qDTY 1.3 and qDTY 2.3 explained an additive effect of 198 kg ha−1 and 147 kg ha−1 in Kali Aus/2*IR64 populations, respectively. Epistatic interaction was observed for DTF (days to flowering) between regions on chromosome 2 flanked by markers RM154–RM324 and RM263–RM573 and major epistatic QTL for GY showing interaction between genomic locations on chromosome 1 at marker interval RM488–RM315 and chromosome 2 at RM324–RM263 in 2012 DS and 2013 DS RS in Kali Aus/2*IR64 mapping populations. Conclusion The QTL, qDTY 1.2 , qDTY 1.3 , qDTY 2.2 , and qDTY 2.3, identified in this study can be used to improve GY of mega varieties MTU1010 and IR64 under different degrees of severity of drought stress through marker-aided backcrossing and provide farmers with improved varieties that effectively combine high yield potential with good yield under drought. The observed epistatic interaction for GY and DTF will contribute to our understanding of the genetic basis of agronomically important traits and enhance predictive ability at an individualized level in agriculture. PMID:24885990
Cerqueira, Gustavo C; Cheeseman, Ian H; Schaffner, Steve F; Nair, Shalini; McDew-White, Marina; Phyo, Aung Pyae; Ashley, Elizabeth A; Melnikov, Alexandre; Rogov, Peter; Birren, Bruce W; Nosten, François; Anderson, Timothy J C; Neafsey, Daniel E
2017-04-28
Artemisinin-based combination therapies are the first line of treatment for Plasmodium falciparum infections worldwide, but artemisinin resistance has risen rapidly in Southeast Asia over the past decade. Mutations in the kelch13 gene have been implicated in this resistance. We used longitudinal genomic surveillance to detect signals in kelch13 and other loci that contribute to artemisinin or partner drug resistance. We retrospectively sequenced the genomes of 194 P. falciparum isolates from five sites in Northwest Thailand, over the period of a rapid increase in the emergence of artemisinin resistance (2001-2014). We evaluate statistical metrics for temporal change in the frequency of individual SNPs, assuming that SNPs associated with resistance increase in frequency over this period. After Kelch13-C580Y, the strongest temporal change is seen at a SNP in phosphatidylinositol 4-kinase, which is involved in a pathway recently implicated in artemisinin resistance. Furthermore, other loci exhibit strong temporal signatures which warrant further investigation for involvement in artemisinin resistance evolution. Through genome-wide association analysis we identify a variant in a kelch domain-containing gene on chromosome 10 that may epistatically modulate artemisinin resistance. This analysis demonstrates the potential of a longitudinal genomic surveillance approach to detect resistance-associated gene loci to improve our mechanistic understanding of how resistance develops. Evidence for additional genomic regions outside of the kelch13 locus associated with artemisinin-resistant parasites may yield new molecular markers for resistance surveillance, which may be useful in efforts to reduce the emergence or spread of artemisinin resistance in African parasite populations.
Pedros, Christophe; Gaud, Guillaume; Bernard, Isabelle; Kassem, Sahar; Chabod, Marianne; Lagrange, Dominique; Andréoletti, Olivier; Dejean, Anne S; Lesourne, Renaud; Fournié, Gilbert J; Saoudi, Abdelhadi
2015-08-15
The development of inflammatory diseases depends on complex interactions between several genes and various environmental factors. Discovering new genetic risk factors and understanding the mechanisms whereby they influence disease development is of paramount importance. We previously reported that deficiency in Themis1, a new actor of TCR signaling, impairs regulatory T cell (Treg) function and predisposes Brown-Norway (BN) rats to spontaneous inflammatory bowel disease (IBD). In this study, we reveal that the epistasis between Themis1 and Vav1 controls the occurrence of these phenotypes. Indeed, by contrast with BN rats, Themis1 deficiency in Lewis rats neither impairs Treg suppressive functions nor induces pathological manifestations. By using congenic lines on the BN genomic background, we show that the impact of Themis1 deficiency on Treg suppressive functions depends on a 117-kb interval coding for a R63W polymorphism that impacts Vav1 expression and functions. Indeed, the introduction of a 117-kb interval containing the Lewis Vav1-R63 variant restores Treg function and protects Themis1-deficient BN rats from spontaneous IBD development. We further show that Themis1 binds more efficiently to the BN Vav1-W63 variant and is required to stabilize its recruitment to the transmembrane adaptor LAT and to fully promote the activation of Erk kinases. Together, these results highlight the importance of the signaling pathway involving epistasis between Themis1 and Vav1 in the control of Treg suppressive function and susceptibility to IBD development. Copyright © 2015 by The American Association of Immunologists, Inc.
Zhang, Yu'e; Xu, Wenying; Li, Zhonghui; Deng, Xing Wang; Wu, Weihua; Xue, Yongbiao
2008-12-01
Guard cells, which form stoma in leaf epidermis, sense and integrate environmental signals to modulate stomatal aperture in response to diverse conditions. Under drought stress, plants synthesize abscisic acid (ABA), which in turn induces a rapid closing of stoma, to prevent water loss by transpiration. However, many aspects of the molecular mechanism for ABA-mediated stomatal closure are still not understood. Here, we report a novel negative regulator of guard cell ABA signaling, DOR, in Arabidopsis (Arabidopsis thaliana). The DOR gene encodes a putative F-box protein, a member of the S-locus F-box-like family related to AhSLF-S(2) and specifically interacting with ASK14 and CUL1. A null mutation in DOR resulted in a hypersensitive ABA response of stomatal closing and a substantial increase of drought tolerance; in contrast, the transgenic plants overexpressing DOR were more susceptible to the drought stress. DOR is strongly expressed in guard cells and suppressed by ABA treatment, suggesting a negative feedback loop of DOR in ABA responses. Double-mutant analyses of dor with ABA-insensitive mutant abi1-1 showed that abi1-1 is epistatic to dor, but no apparent change of phospholipase Dalpha1 was detected between the wild type and dor. Affymetrix GeneChip analysis showed that DOR likely regulates ABA biosynthesis under drought stress. Taken together, our results demonstrate that DOR acts independent of phospholipase Dalpha1 in an ABA signaling pathway to inhibit the ABA-induced stomatal closure under drought stress.
Pleiotropic and Epistatic Network-Based Discovery: Integrated Networks for Target Gene Discovery
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weighill, Deborah; Jones, Piet; Shah, Manesh
Biological organisms are complex systems that are composed of functional networks of interacting molecules and macro-molecules. Complex phenotypes are the result of orchestrated, hierarchical, heterogeneous collections of expressed genomic variants. However, the effects of these variants are the result of historic selective pressure and current environmental and epigenetic signals, and, as such, their co-occurrence can be seen as genome-wide correlations in a number of different manners. Biomass recalcitrance (i.e., the resistance of plants to degradation or deconstruction, which ultimately enables access to a plant's sugars) is a complex polygenic phenotype of high importance to biofuels initiatives. This study makes usemore » of data derived from the re-sequenced genomes from over 800 different Populus trichocarpa genotypes in combination with metabolomic and pyMBMS data across this population, as well as co-expression and co-methylation networks in order to better understand the molecular interactions involved in recalcitrance, and identify target genes involved in lignin biosynthesis/degradation. A Lines Of Evidence (LOE) scoring system is developed to integrate the information in the different layers and quantify the number of lines of evidence linking genes to target functions. This new scoring system was applied to quantify the lines of evidence linking genes to lignin-related genes and phenotypes across the network layers, and allowed for the generation of new hypotheses surrounding potential new candidate genes involved in lignin biosynthesis in P. trichocarpa, including various AGAMOUS-LIKE genes. Lastly, the resulting Genome Wide Association Study networks, integrated with Single Nucleotide Polymorphism (SNP) correlation, co-methylation, and co-expression networks through the LOE scores are proving to be a powerful approach to determine the pleiotropic and epistatic relationships underlying cellular functions and, as such, the molecular basis for complex phenotypes, such as recalcitrance.« less
A high-density association screen of 155 ion transport genes for involvement with common migraine
Nyholt, Dale R.; LaForge, K. Steven; Kallela, Mikko; Alakurtti, Kirsi; Anttila, Verneri; Färkkilä, Markus; Hämaläinen, Eija; Kaprio, Jaakko; Kaunisto, Mari A.; Heath, Andrew C.; Montgomery, Grant W.; Göbel, Hartmut; Todt, Unda; Ferrari, Michel D.; Launer, Lenore J.; Frants, Rune R.; Terwindt, Gisela M.; de Vries, Boukje; Verschuren, W.M. Monique; Brand, Jan; Freilinger, Tobias; Pfaffenrath, Volker; Straube, Andreas; Ballinger, Dennis G.; Zhan, Yiping; Daly, Mark J.; Cox, David R.; Dichgans, Martin; van den Maagdenberg, Arn M.J.M.; Kubisch, Christian; Martin, Nicholas G.; Wessman, Maija; Peltonen, Leena; Palotie, Aarno
2008-01-01
The clinical overlap between monogenic Familial Hemiplegic Migraine (FHM) and common migraine subtypes, and the fact that all three FHM genes are involved in the transport of ions, suggest that ion transport genes may underlie susceptibility to common forms of migraine. To test this leading hypothesis, we examined common variation in 155 ion transport genes using 5257 single nucleotide polymorphisms (SNPs) in a Finnish sample of 841 unrelated migraine with aura cases and 884 unrelated non-migraine controls. The top signals were then tested for replication in four independent migraine case–control samples from the Netherlands, Germany and Australia, totalling 2835 unrelated migraine cases and 2740 unrelated controls. SNPs within 12 genes (KCNB2, KCNQ3, CLIC5, ATP2C2, CACNA1E, CACNB2, KCNE2, KCNK12, KCNK2, KCNS3, SCN5A and SCN9A) with promising nominal association (0.00041 < P < 0.005) in the Finnish sample were selected for replication. Although no variant remained significant after adjusting for multiple testing nor produced consistent evidence for association across all cohorts, a significant epistatic interaction between KCNB2 SNP rs1431656 (chromosome 8q13.3) and CACNB2 SNP rs7076100 (chromosome 10p12.33) (pointwise P = 0.00002; global P = 0.02) was observed in the Finnish case–control sample. We conclude that common variants of moderate effect size in ion transport genes do not play a major role in susceptibility to common migraine within these European populations, although there is some evidence for epistatic interaction between potassium and calcium channel genes, KCNB2 and CACNB2. Multiple rare variants or trans-regulatory elements of these genes are not ruled out. PMID:18676988
Powers, Natalie R; Eicher, John D; Miller, Laura L; Kong, Yong; Smith, Shelley D; Pennington, Bruce F; Willcutt, Erik G; Olson, Richard K; Ring, Susan M; Gruen, Jeffrey R
2016-01-01
Background Reading disability (RD) and language impairment (LI) are heritable learning disabilities that obstruct acquisition and use of written and spoken language, respectively. We previously reported that two risk haplotypes, each in strong linkage disequilibrium (LD) with an allele of READ1, a polymorphic compound short tandem repeat within intron 2 of risk gene DCDC2, are associated with RD and LI. Additionally, we showed a non-additive genetic interaction between READ1 and KIAHap, a previously reported risk haplotype in risk gene KIAA0319, and that READ1 binds the transcriptional regulator ETV6. Objective To examine the hypothesis that READ1 is a transcriptional regulator of KIAA0319. Methods We characterised associations between READ1 alleles and RD and LI in a large European cohort, and also assessed interactions between READ1 and KIAHap and their effect on performance on measures of reading, language and IQ. We also used family-based data to characterise the genetic interaction, and chromatin conformation capture (3C) to investigate the possibility of a physical interaction between READ1 and KIAHap. Results and conclusions READ1 and KIAHap show interdependence—READ1 risk alleles synergise with KIAHap, whereas READ1 protective alleles act epistatically to negate the effects of KIAHap. The family data suggest that these variants interact in trans genetically, while the 3C results show that a region of DCDC2 containing READ1 interacts physically with the region upstream of KIAA0319. These data support a model in which READ1 regulates KIAA0319 expression through KIAHap and in which the additive effects of READ1 and KIAHap alleles are responsible for the trans genetic interaction. PMID:26660103
Pool, John E
2015-12-01
North American populations of Drosophila melanogaster derive from both European and African source populations, but despite their importance for genetic research, patterns of ancestry along their genomes are largely undocumented. Here, I infer geographic ancestry along genomes of the Drosophila Genetic Reference Panel (DGRP) and the D. melanogaster reference genome, which may have implications for reference alignment, association mapping, and population genomic studies in Drosophila. Overall, the proportion of African ancestry was estimated to be 20% for the DGRP and 9% for the reference genome. Combining my estimate of admixture timing with historical records, I provide the first estimate of natural generation time for this species (approximately 15 generations per year). Ancestry levels were found to vary strikingly across the genome, with less African introgression on the X chromosome, in regions of high recombination, and at genes involved in specific processes (e.g., circadian rhythm). An important role for natural selection during the admixture process was further supported by evidence that many unlinked pairs of loci showed a deficiency of Africa-Europe allele combinations between them. Numerous epistatic fitness interactions may therefore exist between African and European genotypes, leading to ongoing selection against incompatible variants. By focusing on hubs in this network of fitness interactions, I identified a set of interacting loci that include genes with roles in sensation and neuropeptide/hormone reception. These findings suggest that admixed D. melanogaster samples could become an important study system for the genetics of early-stage isolation between populations. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Macek, M. Jr.; Nash, E.; Cutting, G.R.
1994-09-01
Cystic fibrosis (CF) is one of the more common lethal autosomal recessive disorders in Caucasian populations. Numerous hypotheses including genetic drift, founder effect, sex ratio, segregation distortions and various forms of heterozygote advantage have been proposed to explain the relatively high frequency of CF alleles. The observation of high linkage disequilibrium between markers at the 5{prime} end of CFTR and mutations that cause CF raised the possibility of epistatic selection. CF-linked marker allele frequencies were determined in 417 elderly individuals from a stable Czech population that survived high levels of infant and childhood mortality in the pre-antibiotic era. These datamore » were compared with allele frequencies of 646 contemporary newborns and 345 young adults drawn from the same population who had significantly lower mortality rates in the antibiotic era. Allele frequencies of markers CS7/Hhal and KM19/Pstl from the D7S23 locus are significantly different (p<0.05) between elderly female and male subjects in this population. Furthermore, there is a significant difference in the allele frequencies of marker CS7/Hhal when newborn females and elderly women are compared (p<0.05). Taken together, these data suggest that the allele status at the CS7 region influenced female survival in the period of high infant and childhood mortality in the pre-antibiotic era. Under this selective pressure, CFTR mutations that occurred on the {open_quotes}favorable{close_quotes} background would marginally increase in frequency in each successive generation and more ancient mutations residing on this background would become the most frequent in the general population.« less
Chapman, Natalie H; Bonnet, Julien; Grivet, Laurent; Lynn, James; Graham, Neil; Smith, Rebecca; Sun, Guiping; Walley, Peter G; Poole, Mervin; Causse, Mathilde; King, Graham J; Baxter, Charles; Seymour, Graham B
2012-08-01
Fruit firmness in tomato (Solanum lycopersicum) is determined by a number of factors including cell wall structure, turgor, and cuticle properties. Firmness is a complex polygenic trait involving the coregulation of many genes and has proved especially challenging to unravel. In this study, a quantitative trait locus (QTL) for fruit firmness was mapped to tomato chromosome 2 using the Zamir Solanum pennellii interspecific introgression lines (ILs) and fine-mapped in a population consisting of 7,500 F2 and F3 lines from IL 2-3 and IL 2-4. This firmness QTL contained five distinct subpeaks, Fir(s.p.)QTL2.1 to Fir(s.p.)QTL2.5, and an effect on a distal region of IL 2-4 that was nonoverlapping with IL 2-3. All these effects were located within an 8.6-Mb region. Using genetic markers, each subpeak within this combinatorial locus was mapped to a physical location within the genome, and an ethylene response factor (ERF) underlying Fir(s.p.)QTL2.2 and a region containing three pectin methylesterase (PME) genes underlying Fir(s.p.)QTL2.5 were nominated as QTL candidate genes. Statistical models used to explain the observed variability between lines indicated that these candidates and the nonoverlapping portion of IL 2-4 were sufficient to account for the majority of the fruit firmness effects. Quantitative reverse transcription-polymerase chain reaction was used to quantify the expression of each candidate gene. ERF showed increased expression associated with soft fruit texture in the mapping population. In contrast, PME expression was tightly linked with firm fruit texture. Analysis of a range of recombinant lines revealed evidence for an epistatic interaction that was associated with this combinatorial locus.
Santos, R M C; do Rêgo, E R; Borém, A; Nascimento, M F; Nascimento, N F F; Finger, F L; Rêgo, M M
2014-10-31
Two accessions of ornamental pepper Capsicum annuum L., differing in most of the characters studied, were crossed, resulting in the F1 generation, and the F2 generation was obtained through self-fertilization of the F1 generation. The backcross generations RC1 and RC2 were obtained through crossing between F1 and the parents P1 and P2, respectively. Morpho-agronomic characterization was performed based on the 19 quantitative descriptors of Capsicum. The data obtained were subjected to generation analysis, in which the means and additive variance (σa(2)), variance due to dominance deviation (σd(2)), phenotypic variance (σf(2)), genetic variance (σg(2)) and environmental variance (σm(2)) were calculated. For the full model, we estimated the mean effects of all possible homozygotes, additives, dominants, and epistatics: additive-additive, additive-dominant, and dominant-dominant. For the additive-dominant model, we estimated the additive effects, dominant effects and mean effects of possible homozygotes. The character fruit dry matter had the lowest value for broad sense heritability (0.42), and the highest values were found for fresh matter and fruit weight, 0.91 and 0.92, respectively. The lowest value for narrow sense heritability was for the minor fruit diameter character (0.33), and the highest values were found for seed yield per fruit and fresh matter, 0.87 and 0.84, respectively. The additive-dominant model explained only the variation found in plant height, canopy width, stem length, corolla diameter, leaf width, and pedicel length, but in the other characters, the epistatic effects showed significant values.
Werren, John H.; Cohen, Lorna B.; Gadau, Juergen; Ponce, Rita; Baudry, Emmanuelle; Lynch, Jeremy A.
2016-01-01
The animal head is a complex structure where numerous sensory, structural and alimentary structures are concentrated and integrated, and its ontogeny requires precise and delicate interactions among genes, cells, and tissues. Thus, it is perhaps unsurprising that craniofacial abnormalities are among the most common birth defects in people, or that these defects have a complex genetic basis involving interactions among multiple loci. Developmental processes that depend on such epistatic interactions become exponentially more difficult to study in diploid organisms as the number of genes involved increases. Here, we present hybrid haploid males of the wasp species pair Nasonia vitripennis and Nasonia giraulti, which have distinct male head morphologies, as a genetic model of craniofacial development that possesses the genetic advantages of haploidy, along with many powerful genomic tools. Viable, fertile hybrids can be made between the species, and quantitative trail loci related to shape differences have been identified. In addition, a subset of hybrid males show head abnormalities, including clefting at the midline and asymmetries. Crucially, epistatic interactions among multiple loci underlie several developmental differences and defects observed in the F2 hybrid males. Furthermore, we demonstrate an introgression of a chromosomal region from N. giraulti into N. vitripennis that shows an abnormality in relative eye size, which maps to a region containing a major QTL for this trait. Therefore, the genetic sources of head morphology can, in principle, be identified by positional cloning. Thus, Nasonia is well positioned to be a uniquely powerful model invertebrate system with which to probe both development and complex genetics of craniofacial patterning and defects. PMID:26721604
Sato, Trey K.; Tremaine, Mary; Parreiras, Lucas S.; ...
2016-10-14
The inability of native Saccharomyces cerevisiae to convert xylose from plant biomass into biofuels remains a major challenge for the production of renewable bioenergy. Despite extensive knowledge of the regulatory networks controlling carbon metabolism in yeast, little is known about how to reprogram S. cerevisiae to ferment xylose at rates comparable to glucose. Here we combined genome sequencing, proteomic profiling, and metabolomic analyses to identify and characterize the responsible mutations in a series of evolved strains capable of metabolizing xylose aerobically or anaerobically. We report that rapid xylose conversion by engineered and evolved S. cerevisiae strains depends upon epistatic interactionsmore » among genes encoding a xylose reductase ( GRE3), a component of MAP Kinase (MAPK) signaling ( HOG1), a regulator of Protein Kinase A (PKA) signaling ( IRA2), and a scaffolding protein for mitochondrial iron-sulfur (Fe-S) cluster biogenesis ( ISU1). Interestingly, the mutation in IRA2 only impacted anaerobic xylose consumption and required the loss of ISU1 function, indicating a previously unknown connection between PKA signaling, Fe-S cluster biogenesis, and anaerobiosis. Proteomic and metabolomic comparisons revealed that the xylose-metabolizing mutant strains exhibit altered metabolic pathways relative to the parental strain when grown in xylose. Further analyses revealed that interacting mutations in HOG1 and ISU1 unexpectedly elevated mitochondrial respiratory proteins and enabled rapid aerobic respiration of xylose and other non-fermentable carbon substrates. Lastly, our findings suggest a surprising connection between Fe-S cluster biogenesis and signaling that facilitates aerobic respiration and anaerobic fermentation of xylose, underscoring how much remains unknown about the eukaryotic signaling systems that regulate carbon metabolism.« less
Kumar, Satish; Molloy, Claire; Muñoz, Patricio; Daetwyler, Hans; Chagné, David; Volz, Richard
2015-01-01
The nonadditive genetic effects may have an important contribution to total genetic variation of phenotypes, so estimates of both the additive and nonadditive effects are desirable for breeding and selection purposes. Our main objectives were to: estimate additive, dominance and epistatic variances of apple (Malus × domestica Borkh.) phenotypes using relationship matrices constructed from genome-wide dense single nucleotide polymorphism (SNP) markers; and compare the accuracy of genomic predictions using genomic best linear unbiased prediction models with or without including nonadditive genetic effects. A set of 247 clonally replicated individuals was assessed for six fruit quality traits at two sites, and also genotyped using an Illumina 8K SNP array. Across several fruit quality traits, the additive, dominance, and epistatic effects contributed about 30%, 16%, and 19%, respectively, to the total phenotypic variance. Models ignoring nonadditive components yielded upwardly biased estimates of additive variance (heritability) for all traits in this study. The accuracy of genomic predicted genetic values (GEGV) varied from about 0.15 to 0.35 for various traits, and these were almost identical for models with or without including nonadditive effects. However, models including nonadditive genetic effects further reduced the bias of GEGV. Between-site genotypic correlations were high (>0.85) for all traits, and genotype-site interaction accounted for <10% of the phenotypic variability. The accuracy of prediction, when the validation set was present only at one site, was generally similar for both sites, and varied from about 0.50 to 0.85. The prediction accuracies were strongly influenced by trait heritability, and genetic relatedness between the training and validation families. PMID:26497141
Pleiotropic and Epistatic Network-Based Discovery: Integrated Networks for Target Gene Discovery
Weighill, Deborah; Jones, Piet; Shah, Manesh; ...
2018-05-11
Biological organisms are complex systems that are composed of functional networks of interacting molecules and macro-molecules. Complex phenotypes are the result of orchestrated, hierarchical, heterogeneous collections of expressed genomic variants. However, the effects of these variants are the result of historic selective pressure and current environmental and epigenetic signals, and, as such, their co-occurrence can be seen as genome-wide correlations in a number of different manners. Biomass recalcitrance (i.e., the resistance of plants to degradation or deconstruction, which ultimately enables access to a plant's sugars) is a complex polygenic phenotype of high importance to biofuels initiatives. This study makes usemore » of data derived from the re-sequenced genomes from over 800 different Populus trichocarpa genotypes in combination with metabolomic and pyMBMS data across this population, as well as co-expression and co-methylation networks in order to better understand the molecular interactions involved in recalcitrance, and identify target genes involved in lignin biosynthesis/degradation. A Lines Of Evidence (LOE) scoring system is developed to integrate the information in the different layers and quantify the number of lines of evidence linking genes to target functions. This new scoring system was applied to quantify the lines of evidence linking genes to lignin-related genes and phenotypes across the network layers, and allowed for the generation of new hypotheses surrounding potential new candidate genes involved in lignin biosynthesis in P. trichocarpa, including various AGAMOUS-LIKE genes. Lastly, the resulting Genome Wide Association Study networks, integrated with Single Nucleotide Polymorphism (SNP) correlation, co-methylation, and co-expression networks through the LOE scores are proving to be a powerful approach to determine the pleiotropic and epistatic relationships underlying cellular functions and, as such, the molecular basis for complex phenotypes, such as recalcitrance.« less
Prediction of individualized therapeutic vulnerabilities in cancer from genomic profiles
Aksoy, Bülent Arman; Demir, Emek; Babur, Özgün; Wang, Weiqing; Jing, Xiaohong; Schultz, Nikolaus; Sander, Chris
2014-01-01
Motivation: Somatic homozygous deletions of chromosomal regions in cancer, while not necessarily oncogenic, may lead to therapeutic vulnerabilities specific to cancer cells compared with normal cells. A recently reported example is the loss of one of the two isoenzymes in glioblastoma cancer cells such that the use of a specific inhibitor selectively inhibited growth of the cancer cells, which had become fully dependent on the second isoenzyme. We have now made use of the unprecedented conjunction of large-scale cancer genomics profiling of tumor samples in The Cancer Genome Atlas (TCGA) and of tumor-derived cell lines in the Cancer Cell Line Encyclopedia, as well as the availability of integrated pathway information systems, such as Pathway Commons, to systematically search for a comprehensive set of such epistatic vulnerabilities. Results: Based on homozygous deletions affecting metabolic enzymes in 16 TCGA cancer studies and 972 cancer cell lines, we identified 4104 candidate metabolic vulnerabilities present in 1019 tumor samples and 482 cell lines. Up to 44% of these vulnerabilities can be targeted with at least one Food and Drug Administration-approved drug. We suggest focused experiments to test these vulnerabilities and clinical trials based on personalized genomic profiles of those that pass preclinical filters. We conclude that genomic profiling will in the future provide a promising basis for network pharmacology of epistatic vulnerabilities as a promising therapeutic strategy. Availability and implementation: A web-based tool for exploring all vulnerabilities and their details is available at http://cbio.mskcc.org/cancergenomics/statius/ along with supplemental data files. Contact: statius@cbio.mskcc.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:24665131
Genome-Assisted Prediction of Quantitative Traits Using the R Package sommer.
Covarrubias-Pazaran, Giovanny
2016-01-01
Most traits of agronomic importance are quantitative in nature, and genetic markers have been used for decades to dissect such traits. Recently, genomic selection has earned attention as next generation sequencing technologies became feasible for major and minor crops. Mixed models have become a key tool for fitting genomic selection models, but most current genomic selection software can only include a single variance component other than the error, making hybrid prediction using additive, dominance and epistatic effects unfeasible for species displaying heterotic effects. Moreover, Likelihood-based software for fitting mixed models with multiple random effects that allows the user to specify the variance-covariance structure of random effects has not been fully exploited. A new open-source R package called sommer is presented to facilitate the use of mixed models for genomic selection and hybrid prediction purposes using more than one variance component and allowing specification of covariance structures. The use of sommer for genomic prediction is demonstrated through several examples using maize and wheat genotypic and phenotypic data. At its core, the program contains three algorithms for estimating variance components: Average information (AI), Expectation-Maximization (EM) and Efficient Mixed Model Association (EMMA). Kernels for calculating the additive, dominance and epistatic relationship matrices are included, along with other useful functions for genomic analysis. Results from sommer were comparable to other software, but the analysis was faster than Bayesian counterparts in the magnitude of hours to days. In addition, ability to deal with missing data, combined with greater flexibility and speed than other REML-based software was achieved by putting together some of the most efficient algorithms to fit models in a gentle environment such as R.
Rouse, Matthew N; Talbert, Luther E; Singh, Davinder; Sherman, Jamie D
2014-07-01
Quantitative trait loci conferring adult plant resistance to Ug99 stem rust in Thatcher wheat display complementary gene action suggesting multiple quantitative trait loci are needed for effective resistance. Adult plant resistance (APR) in wheat (Triticum aestivum L.) to stem rust, caused by Puccinia graminis f. sp. tritici (Pgt), is desirable because this resistance can be Pgt race non-specific. Resistance derived from cultivar Thatcher can confer high levels of APR to the virulent Pgt race TTKSK (Ug99) when combined with stem rust resistance gene Sr57 (Lr34). To identify the loci conferring APR in Thatcher, we evaluated 160 RILs derived from Thatcher crossed to susceptible cultivar McNeal for field stem rust reaction in Kenya for two seasons and in St. Paul for one season. All RILs and parents were susceptible as seedlings to race TTKSK. However, adult plant stem rust severities in Kenya varied from 5 to 80 %. Composite interval mapping identified four quantitative trait loci (QTL). Three QTL were inherited from Thatcher and one, Sr57, was inherited from McNeal. The markers closest to the QTL peaks were used in an ANOVA to determine the additive and epistatic effects. A QTL on 3BS was detected in all three environments and explained 27-35 % of the variation. The peak of this QTL was at the same location as the Sr12 seedling resistance gene effective to race SCCSC. Epistatic interactions were significant between Sr12 and QTL on chromosome arms 1AL and 2BS. Though Sr12 cosegregated with the largest effect QTL, lines with Sr12 were not always resistant. The data suggest that Sr12 or a linked gene, though not effective to race TTKSK alone, confers APR when combined with other resistance loci.
Scurrah, Katrina J; Lamantia, Angela; Ellis, Justine A; Harrap, Stephen B
2017-06-01
Renin-angiotensin-aldosterone system genes have been inconsistently associated with blood pressure, possibly because of unrecognized influences of sex-dependent genetic effects or gene-gene interactions (epistasis). We tested association of systolic blood pressure with single-nucleotide polymorphisms (SNPs) at renin ( REN ), angiotensinogen ( AGT ), angiotensin-converting enzyme ( ACE ), angiotensin II type 1 receptor ( AGTR1 ), and aldosterone synthase ( CYP11B2 ), including sex-SNP or SNP-SNP interactions. Eighty-eight tagSNPs were tested in 2872 white individuals in 809 pedigrees from the Victorian Family Heart Study using variance components models. Three SNPs (rs8075924 and rs4277404 at ACE and rs12721297 at AGTR1 ) were individually associated with lower systolic blood pressure with significant ( P <0.00076) effect sizes ≈1.7 to 2.5 mm Hg. Sex-specific associations were seen for 3 SNPs in men (rs2468523 and rs2478544 at AGT and rs11658531 at ACE ) and 1 SNP in women (rs12451328 at ACE ). SNP-SNP interaction was suggested ( P <0.005) for 14 SNP pairs, none of which had shown individual association with systolic blood pressure. Four SNP pairs were at the same gene (2 for REN , 1 for AGT , and 1 for AGTR1 ). The SNP rs3097 at CYP11B2 was represented in 5 separate pairs. SNPs at key renin-angiotensin-aldosterone system genes associate with systolic blood pressure individually in both sexes, individually in one sex only and only when combined with another SNP. Analyses that incorporate sex-dependent and epistatic effects could reconcile past inconsistencies and account for some of the missing heritability of blood pressure and are generally relevant to SNP association studies for any phenotype. © 2017 American Heart Association, Inc.
Eskandari, Mehrzad; Cober, Elroy R; Rajcan, Istvan
2013-02-01
Soybean seed is a major source of oil for human consumption worldwide and the main renewable feedstock for biodiesel production in North America. Increasing seed oil concentration in soybean [Glycine max (L.) Merrill] with no or minimal impact on protein concentration could be accelerated by exploiting quantitative trait loci (QTL) or gene-specific markers. Oil concentration in soybean is a polygenic trait regulated by many genes with mostly small effects and which is negatively associated with protein concentration. The objectives of this study were to discover and validate oil QTL in two recombinant inbred line (RIL) populations derived from crosses between three moderately high-oil soybean cultivars, OAC Wallace, OAC Glencoe, and RCAT Angora. The RIL populations were grown across several environments over 2 years in Ontario, Canada. In a population of 203 F(3:6) RILs from a cross of OAC Wallace and OAC Glencoe, a total of 11 genomic regions on nine different chromosomes were identified as associated with oil concentration using multiple QTL mapping and single-factor ANOVA. The percentage of the phenotypic variation accounted for by each QTL ranged from 4 to 11 %. Of the five QTL that were tested in a population of 211 F(3:5) RILs from the cross RCAT Angora × OAC Wallace, a "trait-based" bidirectional selective genotyping analysis validated four QTL (80 %). In addition, a total of seven two-way epistatic interactions were identified for oil concentration in this study. The QTL and epistatic interactions identified in this study could be used in marker-assisted introgression aimed at pyramiding high-oil alleles in soybean cultivars to increase oil concentration for biodiesel as well as edible oil applications.
Sato, Trey K; Tremaine, Mary; Parreiras, Lucas S; Hebert, Alexander S; Myers, Kevin S; Higbee, Alan J; Sardi, Maria; McIlwain, Sean J; Ong, Irene M; Breuer, Rebecca J; Avanasi Narasimhan, Ragothaman; McGee, Mick A; Dickinson, Quinn; La Reau, Alex; Xie, Dan; Tian, Mingyuan; Reed, Jennifer L; Zhang, Yaoping; Coon, Joshua J; Hittinger, Chris Todd; Gasch, Audrey P; Landick, Robert
2016-10-01
The inability of native Saccharomyces cerevisiae to convert xylose from plant biomass into biofuels remains a major challenge for the production of renewable bioenergy. Despite extensive knowledge of the regulatory networks controlling carbon metabolism in yeast, little is known about how to reprogram S. cerevisiae to ferment xylose at rates comparable to glucose. Here we combined genome sequencing, proteomic profiling, and metabolomic analyses to identify and characterize the responsible mutations in a series of evolved strains capable of metabolizing xylose aerobically or anaerobically. We report that rapid xylose conversion by engineered and evolved S. cerevisiae strains depends upon epistatic interactions among genes encoding a xylose reductase (GRE3), a component of MAP Kinase (MAPK) signaling (HOG1), a regulator of Protein Kinase A (PKA) signaling (IRA2), and a scaffolding protein for mitochondrial iron-sulfur (Fe-S) cluster biogenesis (ISU1). Interestingly, the mutation in IRA2 only impacted anaerobic xylose consumption and required the loss of ISU1 function, indicating a previously unknown connection between PKA signaling, Fe-S cluster biogenesis, and anaerobiosis. Proteomic and metabolomic comparisons revealed that the xylose-metabolizing mutant strains exhibit altered metabolic pathways relative to the parental strain when grown in xylose. Further analyses revealed that interacting mutations in HOG1 and ISU1 unexpectedly elevated mitochondrial respiratory proteins and enabled rapid aerobic respiration of xylose and other non-fermentable carbon substrates. Our findings suggest a surprising connection between Fe-S cluster biogenesis and signaling that facilitates aerobic respiration and anaerobic fermentation of xylose, underscoring how much remains unknown about the eukaryotic signaling systems that regulate carbon metabolism.
Tremaine, Mary; Hebert, Alexander S.; Myers, Kevin S.; Sardi, Maria; Dickinson, Quinn; Reed, Jennifer L.; Zhang, Yaoping; Coon, Joshua J.; Hittinger, Chris Todd; Gasch, Audrey P.; Landick, Robert
2016-01-01
The inability of native Saccharomyces cerevisiae to convert xylose from plant biomass into biofuels remains a major challenge for the production of renewable bioenergy. Despite extensive knowledge of the regulatory networks controlling carbon metabolism in yeast, little is known about how to reprogram S. cerevisiae to ferment xylose at rates comparable to glucose. Here we combined genome sequencing, proteomic profiling, and metabolomic analyses to identify and characterize the responsible mutations in a series of evolved strains capable of metabolizing xylose aerobically or anaerobically. We report that rapid xylose conversion by engineered and evolved S. cerevisiae strains depends upon epistatic interactions among genes encoding a xylose reductase (GRE3), a component of MAP Kinase (MAPK) signaling (HOG1), a regulator of Protein Kinase A (PKA) signaling (IRA2), and a scaffolding protein for mitochondrial iron-sulfur (Fe-S) cluster biogenesis (ISU1). Interestingly, the mutation in IRA2 only impacted anaerobic xylose consumption and required the loss of ISU1 function, indicating a previously unknown connection between PKA signaling, Fe-S cluster biogenesis, and anaerobiosis. Proteomic and metabolomic comparisons revealed that the xylose-metabolizing mutant strains exhibit altered metabolic pathways relative to the parental strain when grown in xylose. Further analyses revealed that interacting mutations in HOG1 and ISU1 unexpectedly elevated mitochondrial respiratory proteins and enabled rapid aerobic respiration of xylose and other non-fermentable carbon substrates. Our findings suggest a surprising connection between Fe-S cluster biogenesis and signaling that facilitates aerobic respiration and anaerobic fermentation of xylose, underscoring how much remains unknown about the eukaryotic signaling systems that regulate carbon metabolism. PMID:27741250
Li, Shuwei; Xie, Lisheng; Du, Mulong; Xu, Kaili; Zhu, Lingjun; Chu, Haiyan; Chen, Jinfei; Wang, Meilin; Zhang, Zhengdong; Gu, Dongying
2018-05-16
Although studies have investigated the association of genetic variants and the abnormal expression of estrogen-related genes with colorectal cancer risk, the evidence remains inconsistent. We clarified the relationship of genetic variants in estrogen metabolic pathway genes with colorectal cancer risk and survival. A case-control study was performed to assess the association of single-nucleotide polymorphisms (SNPs) in ten candidate genes with colorectal cancer risk in a Chinese population. A logistic regression model and Cox regression model were used to calculate SNP effects on colorectal cancer susceptibility and survival, respectively. Expression quantitative trait loci (eQTL) analysis was conducted using the Genotype-Tissue Expression (GTEx) project dataset. The sequence kernel association test (SKAT) was used to perform gene-set analysis. Colorectal cancer risk and rs3760806 in SULT2B1 were significantly associated in both genders [male: OR = 1.38 (1.15-1.66); female: OR = 1.38 (1.13-1.68)]. Two SNPs in SULT1E1 were related to progression-free survival (PFS) [rs1238574: HR = 1.24 (1.02-1.50), P = 2.79 × 10 -2 ; rs3822172: HR = 1.30 (1.07-1.57), P = 8.44 × 10 -3 ] and overall survival (OS) [rs1238574: HR = 1.51 (1.16-1.97), P = 2.30 × 10 -3 ; rs3822172: HR = 1.53 (1.67-2.00), P = 2.03 × 10 -3 ]. Moreover, rs3760806 was an eQTL for SULT2B1 in colon samples (transverse: P = 3.6 × 10 -3 ; sigmoid: P = 1.0 × 10 -3 ). SULT2B1 expression was significantly higher in colorectal tumor tissues than in normal tissues in the Cancer Genome Atlas (TCGA) database (P < 1.0 × 10 -4 ). Our results indicated that SNPs in estrogen metabolic pathway genes confer colorectal cancer susceptibility and survival.
Pierce, Brandon L.; Tong, Lin; Chen, Lin S.; Rahaman, Ronald; Argos, Maria; Jasmine, Farzana; Roy, Shantanu; Paul-Brutus, Rachelle; Westra, Harm-Jan; Franke, Lude; Esko, Tonu; Zaman, Rakibuz; Islam, Tariqul; Rahman, Mahfuzar; Baron, John A.; Kibriya, Muhammad G.; Ahsan, Habibul
2014-01-01
A large fraction of human genes are regulated by genetic variation near the transcribed sequence (cis-eQTL, expression quantitative trait locus), and many cis-eQTLs have implications for human disease. Less is known regarding the effects of genetic variation on expression of distant genes (trans-eQTLs) and their biological mechanisms. In this work, we use genome-wide data on SNPs and array-based expression measures from mononuclear cells obtained from a population-based cohort of 1,799 Bangladeshi individuals to characterize cis- and trans-eQTLs and determine if observed trans-eQTL associations are mediated by expression of transcripts in cis with the SNPs showing trans-association, using Sobel tests of mediation. We observed 434 independent trans-eQTL associations at a false-discovery rate of 0.05, and 189 of these trans-eQTLs were also cis-eQTLs (enrichment P<0.0001). Among these 189 trans-eQTL associations, 39 were significantly attenuated after adjusting for a cis-mediator based on Sobel P<10-5. We attempted to replicate 21 of these mediation signals in two European cohorts, and while only 7 trans-eQTL associations were present in one or both cohorts, 6 showed evidence of cis-mediation. Analyses of simulated data show that complete mediation will be observed as partial mediation in the presence of mediator measurement error or imperfect LD between measured and causal variants. Our data demonstrates that trans-associations can become significantly stronger or switch directions after adjusting for a potential mediator. Using simulated data, we demonstrate that this phenomenon is expected in the presence of strong cis-trans confounding and when the measured cis-transcript is correlated with the true (unmeasured) mediator. In conclusion, by applying mediation analysis to eQTL data, we show that a substantial fraction of observed trans-eQTL associations can be explained by cis-mediation. Future studies should focus on understanding the mechanisms underlying widespread cis-mediation and their relevance to disease biology, as well as using mediation analysis to improve eQTL discovery. PMID:25474530
Wang, Min; Hancock, Timothy P; Chamberlain, Amanda J; Vander Jagt, Christy J; Pryce, Jennie E; Cocks, Benjamin G; Goddard, Mike E; Hayes, Benjamin J
2018-05-24
Topological association domains (TADs) are chromosomal domains characterised by frequent internal DNA-DNA interactions. The transcription factor CTCF binds to conserved DNA sequence patterns called CTCF binding motifs to either prohibit or facilitate chromosomal interactions. TADs and CTCF binding motifs control gene expression, but they are not yet well defined in the bovine genome. In this paper, we sought to improve the annotation of bovine TADs and CTCF binding motifs, and assess whether the new annotation can reduce the search space for cis-regulatory variants. We used genomic synteny to map TADs and CTCF binding motifs from humans, mice, dogs and macaques to the bovine genome. We found that our mapped TADs exhibited the same hallmark properties of those sourced from experimental data, such as housekeeping genes, transfer RNA genes, CTCF binding motifs, short interspersed elements, H3K4me3 and H3K27ac. We showed that runs of genes with the same pattern of allele-specific expression (ASE) (either favouring paternal or maternal allele) were often located in the same TAD or between the same conserved CTCF binding motifs. Analyses of variance showed that when averaged across all bovine tissues tested, TADs explained 14% of ASE variation (standard deviation, SD: 0.056), while CTCF explained 27% (SD: 0.078). Furthermore, we showed that the quantitative trait loci (QTLs) associated with gene expression variation (eQTLs) or ASE variation (aseQTLs), which were identified from mRNA transcripts from 141 lactating cows' white blood and milk cells, were highly enriched at putative bovine CTCF binding motifs. The linearly-furthermost, and most-significant aseQTL and eQTL for each genic target were located within the same TAD as the gene more often than expected (Chi-Squared test P-value < 0.001). Our results suggest that genomic synteny can be used to functionally annotate conserved transcriptional components, and provides a tool to reduce the search space for causative regulatory variants in the bovine genome.
Pauly, Matthew D.; Lyons, Daniel M.; Fitzsimmons, William J.
2017-01-01
ABSTRACT Lethal mutagenesis is a broad-spectrum antiviral strategy that employs mutagenic nucleoside analogs to exploit the high mutation rate and low mutational tolerance of many RNA viruses. Studies of mutagen-resistant viruses have identified determinants of replicative fidelity and the importance of mutation rate to viral population dynamics. We have previously demonstrated the effective lethal mutagenesis of influenza A virus using three nucleoside analogs as well as the virus’s high genetic barrier to mutagen resistance. Here, we investigate the mutagen-resistant phenotypes of mutations that were enriched in drug-treated populations. We find that PB1 T123A has higher replicative fitness than the wild type, PR8, and maintains its level of genome production during 5-fluorouracil (2,4-dihydroxy-5-fluoropyrimidine) treatment. Surprisingly, this mutagen-resistant variant also has an increased baseline rate of C-to-U and G-to-A mutations. A second drug-selected mutation, PA T97I, interacts epistatically with PB1 T123A to mediate high-level mutagen resistance, predominantly by limiting the inhibitory effect of nucleosides on polymerase activity. Consistent with the importance of epistatic interactions in the influenza virus polymerase, our data suggest that nucleoside analog resistance and replication fidelity are strain dependent. Two previously identified ribavirin {1-[(2R,3R,4S,5R)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1H-1,2,4-triazole-3-carboxamide} resistance mutations, PB1 V43I and PB1 D27N, do not confer drug resistance in the PR8 background, and the PR8-PB1 V43I polymerase exhibits a normal baseline mutation rate. Our results highlight the genetic complexity of the influenza A virus polymerase and demonstrate that increased replicative capacity is a mechanism by which an RNA virus can counter the negative effects of elevated mutation rates. IMPORTANCE RNA viruses exist as genetically diverse populations. This standing genetic diversity gives them the potential to adapt rapidly, evolve resistance to antiviral therapeutics, and evade immune responses. Viral mutants with altered mutation rates or mutational tolerance have provided insights into how genetic diversity arises and how it affects the behavior of RNA viruses. To this end, we identified variants within the polymerase complex of influenza virus that are able to tolerate drug-mediated increases in viral mutation rates. We find that drug resistance is highly dependent on interactions among mutations in the polymerase complex. In contrast to other viruses, influenza virus counters the effect of higher mutation rates primarily by maintaining high levels of genome replication. These findings suggest the importance of maintaining large population sizes for viruses with high mutation rates and show that multiple proteins can affect both mutation rate and genome synthesis. PMID:28815216
Gao, Yong-Ming; Wan, Ping
2002-06-01
Screening markers efficiently is the foundation of mapping QTLs by composite interval mapping. Main and interaction markers distinguished, besides using background control for genetic variation, could also be used to construct intervals of two-way searching for mapping QTLs with epistasis, which can save a lot of calculation time. Therefore, the efficiency of marker screening would affect power and precision of QTL mapping. A doubled haploid population with 200 individuals and 5 chromosomes was constructed, with 50 markers evenly distributed at 10 cM space. Among a total of 6 QTLs, one was placed on chromosome I, two linked on chromosome II, and the other three linked on chromosome IV. QTL setting included additive effects and epistatic effects of additive x additive, the corresponding QTL interaction effects were set if data were collected under multiple environments. The heritability was assumed to be 0.5 if no special declaration. The power of marker screening by stepwise regression, forward regression, and three methods for random effect prediction, e.g. best linear unbiased prediction (BLUP), linear unbiased prediction (LUP) and adjusted unbiased prediction (AUP), was studied and compared through 100 Monte Carlo simulations. The results indicated that the marker screening power by stepwise regression at 0.1, 0.05 and 0.01 significant level changed from 2% to 68%, the power changed from 2% to 72% by forward regression. The larger the QTL effects, the higher the marker screening power. While the power of marker screening by three random effect prediction was very low, the maximum was only 13%. That suggested that regression methods were much better than those by using the approaches of random effect prediction to identify efficient markers flanking QTLs, and forward selection method was more simple and efficient. The results of simulation study on heritability showed that heightening of both general heritability and interaction heritability of genotype x environments could enhance marker screening power, the former had a greater influence on QTLs with larger main and/or epistatic effects, while the later on QTLs with small main and/or epistatic effects. The simulation of 100 times was also conducted to study the influence of different marker number and density on marker screening power. It is indicated that the marker screening power would decrease if there were too many markers, especially with high density in a mapping population, which suggested that a mapping population with definite individuals could only hold limited markers. According to the simulation study, the reasonable number of markers should not be more than individuals. The simulation study of marker screening under multiple environments showed high total power of marker screening. In order to relieve the problem that marker screening power restricted the efficiency of QTL mapping, markers identified in multiple environments could be used to construct two search intervals.
Neufeld, Stanley J.; Wang, Fan; Cobb, John
2014-01-01
The growth and development of the vertebrate limb relies on homeobox genes of the Hox and Shox families, with their independent mutation often giving dose-dependent effects. Here we investigate whether Shox2 and Hox genes function together during mouse limb development by modulating their relative dosage and examining the limb for nonadditive effects on growth. Using double mRNA fluorescence in situ hybridization (FISH) in single embryos, we first show that Shox2 and Hox genes have associated spatial expression dynamics, with Shox2 expression restricted to the proximal limb along with Hoxd9 and Hoxa11 expression, juxtaposing the distal expression of Hoxa13 and Hoxd13. By generating mice with all possible dosage combinations of mutant Shox2 alleles and HoxA/D cluster deletions, we then show that their coordinated proximal limb expression is critical to generate normally proportioned limb segments. These epistatic interactions tune limb length, where Shox2 underexpression enhances, and Shox2 overexpression suppresses, Hox-mutant phenotypes. Disruption of either Shox2 or Hox genes leads to a similar reduction in Runx2 expression in the developing humerus, suggesting their concerted action drives cartilage maturation during normal development. While we furthermore provide evidence that Hox gene function influences Shox2 expression, this regulation is limited in extent and is unlikely on its own to be a major explanation for their genetic interaction. Given the similar effect of human SHOX mutations on regional limb growth, Shox and Hox genes may generally function as genetic interaction partners during the growth and development of the proximal vertebrate limb. PMID:25217052
Neufeld, Stanley J; Wang, Fan; Cobb, John
2014-11-01
The growth and development of the vertebrate limb relies on homeobox genes of the Hox and Shox families, with their independent mutation often giving dose-dependent effects. Here we investigate whether Shox2 and Hox genes function together during mouse limb development by modulating their relative dosage and examining the limb for nonadditive effects on growth. Using double mRNA fluorescence in situ hybridization (FISH) in single embryos, we first show that Shox2 and Hox genes have associated spatial expression dynamics, with Shox2 expression restricted to the proximal limb along with Hoxd9 and Hoxa11 expression, juxtaposing the distal expression of Hoxa13 and Hoxd13. By generating mice with all possible dosage combinations of mutant Shox2 alleles and HoxA/D cluster deletions, we then show that their coordinated proximal limb expression is critical to generate normally proportioned limb segments. These epistatic interactions tune limb length, where Shox2 underexpression enhances, and Shox2 overexpression suppresses, Hox-mutant phenotypes. Disruption of either Shox2 or Hox genes leads to a similar reduction in Runx2 expression in the developing humerus, suggesting their concerted action drives cartilage maturation during normal development. While we furthermore provide evidence that Hox gene function influences Shox2 expression, this regulation is limited in extent and is unlikely on its own to be a major explanation for their genetic interaction. Given the similar effect of human SHOX mutations on regional limb growth, Shox and Hox genes may generally function as genetic interaction partners during the growth and development of the proximal vertebrate limb. Copyright © 2014 by the Genetics Society of America.
Wang, Ying; Yan, Jie; Lee, Haeryun; Lu, Qiuheng; Adler, Paul N.
2014-01-01
The frizzled/starry night pathway regulates planar cell polarity in a wide variety of tissues in many types of animals. It was discovered and has been most intensively studied in the Drosophila wing where it controls the formation of the array of distally pointing hairs that cover the wing. The pathway does this by restricting the activation of the cytoskeleton to the distal edge of wing cells. This results in hairs initiating at the distal edge and growing in the distal direction. All of the proteins encoded by genes in the pathway accumulate asymmetrically in wing cells. The pathway is a hierarchy with the Planar Cell Polarity (PCP) genes (aka the core genes) functioning as a group upstream of the Planar Polarity Effector (PPE) genes which in turn function as a group upstream of multiple wing hairs. Upstream proteins, such as Frizzled accumulate on either the distal and/or proximal edges of wing cells. Downstream PPE proteins accumulate on the proximal edge under the instruction of the upstream proteins. A variety of types of data support this hierarchy, however, we have found that when over expressed the PPE proteins can alter both the subcellular location and level of accumulation of the upstream proteins. Thus, the epistatic relationship is context dependent. We further show that the PPE proteins interact physically and can modulate the accumulation of each other in wing cells. We also find that over expression of Frtz results in a marked delay in hair initiation suggesting that it has a separate role/activity in regulating the cytoskeleton that is not shared by other members of the group. PMID:25072625
Smith, J S; Brachmann, C B; Pillus, L; Boeke, J D
1998-01-01
Transcriptional silencing in Saccharomyces cerevisiae occurs at the silent mating-type loci HML and HMR, at telomeres, and at the ribosomal DNA (rDNA) locus RDN1. Silencing in the rDNA occurs by a novel mechanism that depends on a single Silent Information Regulator (SIR) gene, SIR2. SIR4, essential for other silenced loci, paradoxically inhibits rDNA silencing. In this study, we elucidate a regulatory mechanism for rDNA silencing based on the finding that rDNA silencing strength directly correlates with cellular Sir2 protein levels. The endogenous level of Sir2p was shown to be limiting for rDNA silencing. Furthermore, small changes in Sir2p levels altered rDNA silencing strength. In rDNA silencing phenotypes, sir2 mutations were shown to be epistatic to sir4 mutations, indicating that SIR4 inhibition of rDNA silencing is mediated through SIR2. Furthermore, rDNA silencing is insensitive to SIR3 overexpression, but is severely reduced by overexpression of full-length Sir4p or a fragment of Sir4p that interacts with Sir2p. This negative effect of SIR4 overexpression was overridden by co-overexpression of SIR2, suggesting that SIR4 directly inhibits the rDNA silencing function of SIR2. Finally, genetic manipulations of SIR4 previously shown to promote extended life span also resulted in enhanced rDNA silencing. We propose a simple model in which telomeres act as regulators of rDNA silencing by competing for limiting amounts of Sir2 protein. PMID:9649515
Garcia-Garcia, Manuel; Via, Marc; Zarnowiec, Katarzyna; SanMiguel, Iria; Escera, Carles; Clemente, Immaculada C
2017-01-01
Attention capture by potentially relevant environmental stimuli is critical for human survival, yet it varies considerably among individuals. A large series of studies has suggested that attention capture may depend on the cognitive balance between maintenance and manipulation of mental representations and the flexible switch between goal-directed representations and potentially relevant stimuli outside the focus of attention; a balance that seems modulated by a prefrontostriatal dopamine pathway. Here, we examined inter-individual differences in the cognitive control of attention through studying the effects of two single nucleotide polymorphisms regulating dopamine at the prefrontal cortex and the striatum (i.e., COMTMet108/158Val and ANKK1/DRD2TaqIA) on stimulus-driven attention capture. Healthy adult participants (N = 40) were assigned to different groups according to the combination of the polymorphisms COMTMet108/158Val and ANKK1/DRD2TaqIA, and were instructed to perform on a well-established distraction protocol. Performance in individuals with a balance between prefrontal dopamine display and striatal receptor density was slowed down by the occurrence of unexpected distracting events, while those with a rather unbalanced dopamine activity were able maintain task performance with no time delay, yet at the expense of a slightly lower accuracy. This advantage, associated to their distinct genetic profiles, was paralleled by an electrophysiological mechanism of phase-resetting of gamma neural oscillation to the novel, distracting events. Taken together, the current results suggest that the epistatic interaction between COMTVal108/158Met and ANKK1/DRD2 TaqIa genetic polymorphisms lies at the basis of stimulus-driven attention capture.
Genetics of Inflammatory Bowel Diseases
McGovern, Dermot; Kugathasan, Subra; Cho, Judy H.
2015-01-01
In this Review, we provide an update on genome-wide association studies (GWAS) in inflammatory bowel disease (IBD). In addition, we summarize progress in defining the functional consequences of associated alleles for coding and non-coding genetic variation. In the small minority of loci where major association signals correspond to non-synonymous variation, we summarize studies defining their functional effects and implications for therapeutic targeting. Importantly, the large majority of GWAS-associated loci involve non-coding variation, many of which modulate levels of gene expression. Recent expression quantitative trait loci (eQTL) studies have established that expression of the large majority of human genes is regulated by non-coding genetic variation. Significant advances in defining the epigenetic landscape have demonstrated that IBD GWAS signals are highly enriched within cell-specific active enhancer marks. Studies in European ancestry populations have dominated the landscape of IBD genetics studies, but increasingly, studies in Asian and African-American populations are being reported. Common variation accounts for only a modest fraction of the predicted heritability and the role of rare genetic variation of higher effects (i.e. odds ratios markedly deviating from one) is increasingly being identified through sequencing efforts. These sequencing studies have been particularly productive in very-early onset, more severe cases. A major challenge in IBD genetics will be harnessing the vast array of genetic discovery for clinical utility, through emerging precision medicine initiatives. We discuss the rapidly evolving area of direct to consumer genetic testing, as well as the current utility of clinical exome sequencing, especially in very early onset, severe IBD cases. We summarize recent progress in the pharmacogenetics of IBD with respect of partitioning patient responses to anti-TNF and thiopurine therapies. Highly collaborative studies across research centers and across subspecialties and disciplines will be required to fully realize the promise of genetic discovery in IBD. PMID:26255561
van der Vaart, Andrew D.; Wolstenholme, Jennifer T.; Smith, Maren L.; Harris, Guy M.; Lopez, Marcelo F.; Wolen, Aaron R.; Becker, Howard C.; Williams, Robert W.; Miles, Michael F.
2016-01-01
The transition from acute to chronic ethanol exposure leads to lasting behavioral and physiological changes such as increased consumption, dependence, and withdrawal. Changes in brain gene expression are hypothesized to underlie these adaptive responses to ethanol. Previous studies on acute ethanol identified genetic variation in brain gene expression networks and behavioral responses to ethanol across the BXD panel of recombinant inbred mice. In this work, we have performed the first joint genetic and genomic analysis of transcriptome shifts in response to chronic intermittent ethanol (CIE) by vapor chamber exposure in a BXD cohort. CIE treatment is known to produce significant and sustained changes in ethanol consumption with repeated cycles of ethanol vapor. Using Affymetrix microarray analysis of prefrontal cortex (PFC) and nucleus accumbens (NAC) RNA, we compared CIE expression responses to those seen following acute ethanol treatment, and to voluntary ethanol consumption. Gene expression changes in PFC and NAC after CIE overlapped significantly across brain regions and with previously published expression following acute ethanol. Genes highly modulated by CIE were enriched for specific biological processes including synaptic transmission, neuron ensheathment, intracellular signaling, and neuronal projection development. Expression quantitative trait locus (eQTL) analyses identified genomic loci associated with ethanol-induced transcriptional changes with largely distinct loci identified between brain regions. Correlating CIE-regulated genes to ethanol consumption data identified specific genes highly associated with variation in the increase in drinking seen with repeated cycles of CIE. In particular, multiple myelin-related genes were identified. Furthermore, genetic variance in or near dynamin3 (Dnm3) on Chr1 at ~164 Mb may have a major regulatory role in CIE-responsive gene expression. Dnm3 expression correlates significantly with ethanol consumption, is contained in a highly ranked functional group of CIE-regulated genes in the NAC, and has a cis-eQTL within a genomic region linked with multiple CIE-responsive genes. PMID:27838001
Gui, Jiang; Moore, Jason H.; Williams, Scott M.; Andrews, Peter; Hillege, Hans L.; van der Harst, Pim; Navis, Gerjan; Van Gilst, Wiek H.; Asselbergs, Folkert W.; Gilbert-Diamond, Diane
2013-01-01
We present an extension of the two-class multifactor dimensionality reduction (MDR) algorithm that enables detection and characterization of epistatic SNP-SNP interactions in the context of a quantitative trait. The proposed Quantitative MDR (QMDR) method handles continuous data by modifying MDR’s constructive induction algorithm to use a T-test. QMDR replaces the balanced accuracy metric with a T-test statistic as the score to determine the best interaction model. We used a simulation to identify the empirical distribution of QMDR’s testing score. We then applied QMDR to genetic data from the ongoing prospective Prevention of Renal and Vascular End-Stage Disease (PREVEND) study. PMID:23805232
Inheritance of flower color in periwinkle: orange-red corolla and white eye.
Sreevalli, Y; Kulkarni, R N; Baskaran, K
2002-01-01
The commonly found flower colors in periwinkle (Catharanthus roseus)--pink, white, red-eyed, and pale pink center--are reported to be governed by the epistatic interaction between four genes--A, R, W, and I. The mode of inheritance of an uncommon flower color, orange-red corolla and white eye, was studied by crossing an accession possessing this corolla color with a white flowered variety (Nirmal). The phenotype of the F(1) plants and segregation data of F(2) and backcross generations suggested the involvement of two more interacting and independently inherited genes, one (proposed symbol E) determining the presence or absence of red eye and another (proposed symbol O) determining orange-red corolla.
Kuntz, Sara; Poeck, Burkhard; Sokolowski, Marla B.; Strauss, Roland
2012-01-01
Orientation and navigation in a complex environment requires path planning and recall to exert goal-driven behavior. Walking Drosophila flies possess a visual orientation memory for attractive targets which is localized in the central complex of the adult brain. Here we show that this type of working memory requires the cGMP-dependent protein kinase encoded by the foraging gene in just one type of ellipsoid-body ring neurons. Moreover, genetic and epistatic interaction studies provide evidence that Foraging functions upstream of the Ignorant Ribosomal-S6 Kinase 2, thus revealing a novel neuronal signaling pathway necessary for this type of memory in Drosophila. PMID:22815538
Population- and individual-specific regulatory variation in Sardinia.
Pala, Mauro; Zappala, Zachary; Marongiu, Mara; Li, Xin; Davis, Joe R; Cusano, Roberto; Crobu, Francesca; Kukurba, Kimberly R; Gloudemans, Michael J; Reinier, Frederic; Berutti, Riccardo; Piras, Maria G; Mulas, Antonella; Zoledziewska, Magdalena; Marongiu, Michele; Sorokin, Elena P; Hess, Gaelen T; Smith, Kevin S; Busonero, Fabio; Maschio, Andrea; Steri, Maristella; Sidore, Carlo; Sanna, Serena; Fiorillo, Edoardo; Bassik, Michael C; Sawcer, Stephen J; Battle, Alexis; Novembre, John; Jones, Chris; Angius, Andrea; Abecasis, Gonçalo R; Schlessinger, David; Cucca, Francesco; Montgomery, Stephen B
2017-05-01
Genetic studies of complex traits have mainly identified associations with noncoding variants. To further determine the contribution of regulatory variation, we combined whole-genome and transcriptome data for 624 individuals from Sardinia to identify common and rare variants that influence gene expression and splicing. We identified 21,183 expression quantitative trait loci (eQTLs) and 6,768 splicing quantitative trait loci (sQTLs), including 619 new QTLs. We identified high-frequency QTLs and found evidence of selection near genes involved in malarial resistance and increased multiple sclerosis risk, reflecting the epidemiological history of Sardinia. Using family relationships, we identified 809 segregating expression outliers (median z score of 2.97), averaging 13.3 genes per individual. Outlier genes were enriched for proximal rare variants, providing a new approach to study large-effect regulatory variants and their relevance to traits. Our results provide insight into the effects of regulatory variants and their relationship to population history and individual genetic risk.
xQTL workbench: a scalable web environment for multi-level QTL analysis.
Arends, Danny; van der Velde, K Joeri; Prins, Pjotr; Broman, Karl W; Möller, Steffen; Jansen, Ritsert C; Swertz, Morris A
2012-04-01
xQTL workbench is a scalable web platform for the mapping of quantitative trait loci (QTLs) at multiple levels: for example gene expression (eQTL), protein abundance (pQTL), metabolite abundance (mQTL) and phenotype (phQTL) data. Popular QTL mapping methods for model organism and human populations are accessible via the web user interface. Large calculations scale easily on to multi-core computers, clusters and Cloud. All data involved can be uploaded and queried online: markers, genotypes, microarrays, NGS, LC-MS, GC-MS, NMR, etc. When new data types come available, xQTL workbench is quickly customized using the Molgenis software generator. xQTL workbench runs on all common platforms, including Linux, Mac OS X and Windows. An online demo system, installation guide, tutorials, software and source code are available under the LGPL3 license from http://www.xqtl.org. m.a.swertz@rug.nl.
xQTL workbench: a scalable web environment for multi-level QTL analysis
Arends, Danny; van der Velde, K. Joeri; Prins, Pjotr; Broman, Karl W.; Möller, Steffen; Jansen, Ritsert C.; Swertz, Morris A.
2012-01-01
Summary: xQTL workbench is a scalable web platform for the mapping of quantitative trait loci (QTLs) at multiple levels: for example gene expression (eQTL), protein abundance (pQTL), metabolite abundance (mQTL) and phenotype (phQTL) data. Popular QTL mapping methods for model organism and human populations are accessible via the web user interface. Large calculations scale easily on to multi-core computers, clusters and Cloud. All data involved can be uploaded and queried online: markers, genotypes, microarrays, NGS, LC-MS, GC-MS, NMR, etc. When new data types come available, xQTL workbench is quickly customized using the Molgenis software generator. Availability: xQTL workbench runs on all common platforms, including Linux, Mac OS X and Windows. An online demo system, installation guide, tutorials, software and source code are available under the LGPL3 license from http://www.xqtl.org. Contact: m.a.swertz@rug.nl PMID:22308096
Application of Response Surface Methods To Determine Conditions for Optimal Genomic Prediction
Howard, Réka; Carriquiry, Alicia L.; Beavis, William D.
2017-01-01
An epistatic genetic architecture can have a significant impact on prediction accuracies of genomic prediction (GP) methods. Machine learning methods predict traits comprised of epistatic genetic architectures more accurately than statistical methods based on additive mixed linear models. The differences between these types of GP methods suggest a diagnostic for revealing genetic architectures underlying traits of interest. In addition to genetic architecture, the performance of GP methods may be influenced by the sample size of the training population, the number of QTL, and the proportion of phenotypic variability due to genotypic variability (heritability). Possible values for these factors and the number of combinations of the factor levels that influence the performance of GP methods can be large. Thus, efficient methods for identifying combinations of factor levels that produce most accurate GPs is needed. Herein, we employ response surface methods (RSMs) to find the experimental conditions that produce the most accurate GPs. We illustrate RSM with an example of simulated doubled haploid populations and identify the combination of factors that maximize the difference between prediction accuracies of best linear unbiased prediction (BLUP) and support vector machine (SVM) GP methods. The greatest impact on the response is due to the genetic architecture of the population, heritability of the trait, and the sample size. When epistasis is responsible for all of the genotypic variance and heritability is equal to one and the sample size of the training population is large, the advantage of using the SVM method vs. the BLUP method is greatest. However, except for values close to the maximum, most of the response surface shows little difference between the methods. We also determined that the conditions resulting in the greatest prediction accuracy for BLUP occurred when genetic architecture consists solely of additive effects, and heritability is equal to one. PMID:28720710
Zhang, Qingrun; Long, Quan; Ott, Jurg
2014-06-01
Identifying gene-gene interaction is a hot topic in genome wide association studies. Two fundamental challenges are: (1) how to smartly identify combinations of variants that may be associated with the trait from astronomical number of all possible combinations; and (2) how to test epistatic interaction when all potential combinations are available. We developed AprioriGWAS, which brings two innovations. (1) Based on Apriori, a successful method in field of Frequent Itemset Mining (FIM) in which a pattern growth strategy is leveraged to effectively and accurately reduce search space, AprioriGWAS can efficiently identify genetically associated genotype patterns. (2) To test the hypotheses of epistasis, we adopt a new conditional permutation procedure to obtain reliable statistical inference of Pearson's chi-square test for the [Formula: see text] contingency table generated by associated variants. By applying AprioriGWAS to age-related macular degeneration (AMD) data, we found that: (1) angiopoietin 1 (ANGPT1) and four retinal genes interact with Complement Factor H (CFH). (2) GO term "glycosaminoglycan biosynthetic process" was enriched in AMD interacting genes. The epistatic interactions newly found by AprioriGWAS on AMD data are likely true interactions, since genes interacting with CFH are retinal genes, and GO term enrichment also verified that interaction between glycosaminoglycans (GAGs) and CFH plays an important role in disease pathology of AMD. By applying AprioriGWAS on Bipolar disorder in WTCCC data, we found variants without marginal effect show significant interactions. For example, multiple-SNP genotype patterns inside gene GABRB2 and GRIA1 (AMPA subunit 1 receptor gene). AMPARs are found in many parts of the brain and are the most commonly found receptor in the nervous system. The GABRB2 mediates the fastest inhibitory synaptic transmission in the central nervous system. GRIA1 and GABRB2 are relevant to mental disorders supported by multiple evidences.
Arenas-Alfonseca, Lucía; Gotor, Cecilia; Romero, Luis C; García, Irene
2018-05-01
In Arabidopsis thaliana, cyanide is produced concomitantly with ethylene biosynthesis and is mainly detoxified by the ß-cyanoalanine synthase CAS-C1. In roots, CAS-C1 activity is essential to maintain a low level of cyanide for proper root hair development. Root hair elongation relies on polarized cell expansion at the growing tip, and we have observed that CAS-C1 locates in mitochondria and accumulates in root hair tips during root hair elongation, as shown by observing the fluorescence in plants transformed with the translational construct ProC1:CASC1-GFP, containing the complete CAS-C1 gene fused to green fluorescent protein (GFP). Mutants in the SUPERCENTIPEDE (SCN1) gene, that regulate the NADPH oxidase gene ROOT HAIR DEFECTIVE 2 (RHD2)/AtrbohC, are affected at the very early steps of the development of root hair that do not elongate and do not show a preferential localization of the GFP accumulation in the tips of the root hair primordia. Root hairs of mutants in CAS-C1 or RHD2/AtrbohC, whose protein product catalyzes the generation of ROS and the Ca2+ gradient, start to grow out correctly, but they do not elongate. Genetic crosses between the cas-c1 mutant and scn1 or rhd2 mutants were performed, and the detailed phenotypic and molecular characterization of the double mutants demonstrates that scn1 mutation is epistatic to cas-c1 and cas-c1 is epistatic to rhd2 mutation, indicating that CAS-C1 acts in early steps of the root hair development process. Moreover, our results show that the role of CAS-C1 in root hair elongation is independent of H2O2 production and of a direct NADPH oxidase inhibition by cyanide.
Genetic dissection of main and epistatic effects of QTL based on augmented triple test cross design
Zhang, Zheng; Dai, Zhijun; Chen, Yuan; Yuan, Xiong; Yuan, Zheming; Tang, Wenbang; Li, Lanzhi; Hu, Zhongli
2017-01-01
The use of heterosis has considerably increased the productivity of many crops; however, the biological mechanism underpinning the technique remains elusive. The North Carolina design III (NCIII) and the triple test cross (TTC) are powerful and popular genetic mating design that can be used to decipher the genetic basis of heterosis. However, when using the NCIII design with the present quantitative trait locus (QTL) mapping method, if epistasis exists, the estimated additive or dominant effects are confounded with epistatic effects. Here, we propose a two-step approach to dissect all genetic effects of QTL and digenic interactions on a whole genome without sacrificing statistical power based on an augmented TTC (aTTC) design. Because the aTTC design has more transformation combinations than do the NCIII and TTC designs, it greatly enriches the QTL mapping for studying heterosis. When the basic population comprises recombinant inbred lines (RIL), we can use the same materials in the NCIII design for aTTC-design QTL mapping with transformation combination Z1, Z2, and Z4 to obtain genetic effect of QTL and digenic interactions. Compared with RIL-based TTC design, RIL-based aTTC design saves time, money, and labor for basic population crossed with F1. Several Monte Carlo simulation studies were carried out to confirm the proposed approach; the present genetic parameters could be identified with high statistical power, precision, and calculation speed, even at small sample size or low heritability. Additionally, two elite rice hybrid datasets for nine agronomic traits were estimated for real data analysis. We dissected the genetic effects and calculated the dominance degree of each QTL and digenic interaction. Real mapping results suggested that the dominance degree in Z2 that mainly characterize heterosis showed overdominance and dominance for QTL and digenic interactions. Dominance and overdominance were the major genetic foundations of heterosis in rice. PMID:29240818
Non-additive and epistatic effects of HLA polymorphisms contributing to risk of adult glioma.
Zhang, Chenan; de Smith, Adam J; Smirnov, Ivan V; Wiencke, John K; Wiemels, Joseph L; Witte, John S; Walsh, Kyle M
2017-11-01
Although genome-wide association studies have identified several susceptibility loci for adult glioma, little is known regarding the potential contribution of genetic variation in the human leukocyte antigen (HLA) region to glioma risk. HLA associations have been reported for various malignancies, with many studies investigating selected candidate HLA polymorphisms. However, no systematic analysis has been conducted in glioma patients, and no investigation into potential non-additive effects has been described. We conducted comprehensive genetic analyses of HLA variants among 1746 adult glioma patients and 2312 controls of European-ancestry from the GliomaScan Consortium. Genotype data were generated with the Illumina 660-Quad array, and we imputed HLA alleles using a reference panel of 5225 individuals in the Type 1 Diabetes Genetics Consortium who underwent high-resolution HLA typing via next-generation sequencing. Case-control comparisons were adjusted for population stratification using ancestry-informative principal components. Because alleles in different loci across the HLA region are linked, we created multigene haplotypes consisting of the genes DRB1, DQA1, and DQB1. Although none of the haplotypes were associated with glioma in additive models, inclusion of a dominance term significantly improved the model for multigene haplotype HLA-DRB1*1501-DQA1*0102-DQB1*0602 (P = 0.002). Heterozygous carriers of the haplotype had an increased risk of glioma [odds ratio (OR) 1.23; 95% confidence interval (CI) 1.01-1.49], while homozygous carriers were at decreased risk compared with non-carriers (OR 0.64; 95% CI 0.40-1.01). Our results suggest that the DRB1*1501-DQA1*0102-DQB1*0602 haplotype may contribute to the risk of glioma in a non-additive manner, with the positive dominance effect partly explained by an epistatic interaction with HLA-DRB1*0401-DQA1*0301-DQB1*0301.
Jia, Dongjie; Shen, Fei; Wang, Yi; Wu, Ting; Xu, Xuefeng; Zhang, Xinzhong; Han, Zhenhai
2018-05-11
Many efforts have been made to map quantitative trait loci (QTLs) to facilitate practical marker-assisted selection (MAS) in plants. In the present study, we identified four genome-wide major QTLs responsible for apple fruit acidity by MapQTL and BSA-seq analyses using two independent pedigree-based populations. Candidate genes were screened in major QTL regions, and three functional gene markers, including a non-synonymous A/G single nucleotide polymorphism (SNP) in the coding region of MdPP2CH, a 36-bp insertion in the promoter of MdSAUR37, and a previously reported SNP in MdALMTII, were validated to influence the malate content of apple fruits. In addition, MdPP2CH inactivated three vacuolar H + -ATPases (MdVHA-A3, MdVHA-B2 and MdVHA-D2) and one aluminium-activated malate transporter (MdALMTII) via dephosphorylation and negatively influenced fruit malate accumulation. The dephosphotase activity of MdPP2CH was suppressed by MdSAUR37, which implied a higher hierarchy of genetic interaction. Therefore, the MdSAUR37/MdPP2CH/MdALMTII chain cascaded hierarchical epistatic genetic effects to precisely determine apple fruit malate content. An A/G SNP (-1010) on MdMYB44 promoter region from a major QTL (qtl08.1) was closely associated with fruit malate content. The predicted phenotype values (PPVs) were estimated using the tentative genotype values of the gene markers, and the PPVs were significantly correlated with the observed phenotype values. Our findings provide an insight into plant genome-based selection in apples and will aid in conducting research to understand the physiological fundamentals of quantitative genetics. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Dixit, Shalabh; Huang, B Emma; Sta Cruz, Ma Teresa; Maturan, Paul T; Ontoy, Jhon Christian E; Kumar, Arvind
2014-01-01
The coupling of biotic and abiotic stresses leads to high yield losses in rainfed rice (Oryza sativa L.) growing areas. While several studies target these stresses independently, breeding strategies to combat multiple stresses seldom exist. This study reports an integrated strategy that combines QTL mapping and phenotypic selection to develop rice lines with high grain yield (GY) under drought stress and non-stress conditions, and tolerance of rice blast. A blast-tolerant BC2F3-derived population was developed from the cross of tropical japonica cultivar Moroberekan (blast- and drought-tolerant) and high-yielding indica variety Swarna (blast- and drought-susceptible) through phenotypic selection for blast tolerance at the BC2F2 generation. The population was studied for segregation distortion patterns and QTLs for GY under drought were identified along with study of epistatic interactions for the trait. Segregation distortion, in favour of Moroberekan, was observed at 50 of the 59 loci. Majority of these marker loci co-localized with known QTLs for blast tolerance or NBS-LRR disease resistance genes. Despite the presence of segregation distortion, high variation for DTF, PH and GY was observed and several QTLs were identified under drought stress and non-stress conditions for the three traits. Epistatic interactions were also detected for GY which explained a large proportion of phenotypic variance observed in the population. This strategy allowed us to identify QTLs for GY along with rapid development of high-yielding purelines tolerant to blast and drought with considerably reduced efforts. Apart from this, it also allowed us to study the effects of the selection cycle for blast tolerance. The developed lines were screened at IRRI and in the target environment, and drought and blast tolerant lines with high yield were identified. With tolerance to two major stresses and high yield potential, these lines may provide yield stability in rainfed rice areas.
Cardinal, Andrea J; Whetten, Rebecca; Wang, Sanbao; Auclair, Jérôme; Hyten, David; Cregan, Perry; Bachlava, Eleni; Gillman, Jason; Ramirez, Martha; Dewey, Ralph; Upchurch, Greg; Miranda, Lilian; Burton, Joseph W
2014-01-01
fap 1 mutation is caused by a G174A change in GmKASIIIA that disrupts a donor splice site recognition and creates a GATCTG motif that enhanced its expression. Soybean oil with reduced palmitic acid content is desirable to reduce the health risks associated with consumption of this fatty acid. The objectives of this study were: to identify the genomic location of the reduced palmitate fap1 mutation, determine its molecular basis, estimate the amount of phenotypic variation in fatty acid composition explained by this locus, determine if there are epistatic interactions between the fap1 and fap nc loci and, determine if the fap1 mutation has pleiotropic effects on seed yield, oil and protein content in three soybean populations. This study detected two major QTL for 16:0 content located in chromosome 5 (GmFATB1a, fap nc) and chromosome 9 near BARCSOYSSR_09_1707 that explained, with their interaction, 66-94 % of the variation in 16:0 content in the three populations. Sequencing results of a putative candidate gene, GmKASIIIA, revealed a single unique polymorphism in the germplasm line C1726, which was predicted to disrupt the donor splice site recognition between exon one and intron one and produce a truncated KASIIIA protein. This G to A change also created the GATCTG motif that enhanced gene expression of the mutated GmKASIIIA gene. Lines homozygous for the GmKASIIIA mutation (fap1) had a significant reduction in 16:0, 18:0, and oil content; and an increase in unsaturated fatty acids content. There were significant epistatic interactions between GmKASIIIA (fap1) and fap nc for 16:0 and oil contents, and seed yield in two populations. In conclusion, the fap1 phenotype is caused by a single unique SNP in the GmKASIIIA gene.
Hernandez, W; Gamazon, E R; Aquino-Michaels, K; Smithberger, E; O'Brien, T J; Harralson, A F; Tuck, M; Barbour, A; Cavallari, L H; Perera, M A
2017-04-01
Essentials Genetic variants controlling gene regulation have not been explored in pharmacogenomics. We tested liver expression quantitative trait loci for association with warfarin dose response. A novel predictor for increased warfarin dose response in African Americans was identified. Precision medicine must take into account population-specific variation in gene regulation. Background Warfarin is commonly used to control and prevent thromboembolic disorders. However, because of warfarin's complex dose-requirement relationship, safe and effective use is challenging. Pharmacogenomics-guided warfarin dosing algorithms that include the well-established VKORC1 and CYP2C9 polymorphisms explain only a small proportion of inter-individual variability in African Americans (AAs). Objectives We aimed to assess whether transcriptomic analyses could be used to identify regulatory variants associated with warfarin dose response in AAs. Patients/Methods We identified a total of 56 expression quantitative trait loci (eQTLs) for CYP2C9, VKORC1 and CALU derived from human livers and evaluated their association with warfarin dose response in two independent AA warfarin patient cohorts. Results We found that rs4889606, a strong cis-eQTL for VKORC1 (log 10 Bayes Factor = 12.02), is significantly associated with increased warfarin daily dose requirement (β = 1.1; 95% confidence interval [CI] 0.46 to 1.8) in the discovery cohort (n = 305) and in the replication cohort (β = 1.04; 95% CI 0.33 -1.7; n = 141) after conditioning on relevant covariates and the VKORC1 -1639G>A (rs9923231) variant. Inclusion of rs4889606 genotypes, along with CYP2C9 alleles, rs9923231 genotypes and clinical variables, explained 31% of the inter-patient variability in warfarin dose requirement. We demonstrate different linkage disequilibrium patterns in the region encompassing rs4889606 and rs9923231 between AAs and European Americans, which may explain the increased dose requirement found in AAs. Conclusion Our approach of interrogating eQTLs identified in liver has revealed a novel predictor of warfarin dose response in AAs. Our work highlights the utility of leveraging information from regulatory variants mapped in the liver to uncover novel variants associated with drug response and the importance of population-specific research. © 2017 International Society on Thrombosis and Haemostasis.
Nielson, Carrie M; Liu, Ching-Ti; Smith, Albert V; Ackert-Bicknell, Cheryl L; Reppe, Sjur; Jakobsdottir, Johanna; Wassel, Christina; Register, Thomas C; Oei, Ling; Alonso, Nerea; Oei, Edwin H; Parimi, Neeta; Samelson, Elizabeth J; Nalls, Mike A; Zmuda, Joseph; Lang, Thomas; Bouxsein, Mary; Latourelle, Jeanne; Claussnitzer, Melina; Siggeirsdottir, Kristin; Srikanth, Priya; Lorentzen, Erik; Vandenput, Liesbeth; Langefeld, Carl; Raffield, Laura; Terry, Greg; Cox, Amanda J; Allison, Matthew A; Criqui, Michael H; Bowden, Don; Ikram, M Arfan; Mellstrom, Dan; Karlsson, Magnus K; Carr, John; Budoff, Matthew; Phillips, Caroline; Cupples, L Adrienne; Chou, Wen-Chi; Myers, Richard H; Ralston, Stuart H; Gautvik, Kaare M; Cawthon, Peggy M; Cummings, Steven; Karasik, David; Rivadeneira, Fernando; Gudnason, Vilmundur; Orwoll, Eric S; Harris, Tamara B; Ohlsson, Claes; Kiel, Douglas P; Hsu, Yi-Hsiang
2017-01-01
Genome-wide association studies (GWASs) have revealed numerous loci for areal bone mineral density (aBMD). We completed the first GWAS meta-analysis (n = 15,275) of lumbar spine volumetric BMD (vBMD) measured by quantitative computed tomography (QCT), allowing for examination of the trabecular bone compartment. SNPs that were significantly associated with vBMD were also examined in two GWAS meta-analyses to determine associations with morphometric vertebral fracture (n = 21,701) and clinical vertebral fracture (n = 5893). Expression quantitative trait locus (eQTL) analyses of iliac crest biopsies were performed in 84 postmenopausal women, and murine osteoblast expression of genes implicated by eQTL or by proximity to vBMD-associated SNPs was examined. We identified significant vBMD associations with five loci, including: 1p36.12, containing WNT4 and ZBTB40; 8q24, containing TNFRSF11B; and 13q14, containing AKAP11 and TNFSF11. Two loci (5p13 and 1p36.12) also contained associations with radiographic and clinical vertebral fracture, respectively. In 5p13, rs2468531 (minor allele frequency [MAF] = 3%) was associated with higher vBMD (β = 0.22, p = 1.9 × 10−8) and decreased risk of radiographic vertebral fracture (odds ratio [OR] = 0.75; false discovery rate [FDR] p = 0.01). In 1p36.12, rs12742784 (MAF = 21%) was associated with higher vBMD (β = 0.09, p = 1.2 × 10−10) and decreased risk of clinical vertebral fracture (OR = 0.82; FDR p = 7.4 × 10−4). Both SNPs are noncoding and were associated with increased mRNA expression levels in human bone biopsies: rs2468531 with SLC1A3 (β = 0.28, FDR p = 0.01, involved in glutamate signaling and osteogenic response to mechanical loading) and rs12742784 with EPHB2 (β = 0.12, FDR p = 1.7 × 10−3, functions in bone-related ephrin signaling). Both genes are expressed in murine osteoblasts. This is the first study to linkSLC1A3 and EPHB2 to clinically relevant vertebral osteoporosis phenotypes. These results may help elucidate vertebral bone biology and novel approaches to reducing vertebral fracture incidence. © 2016 American Society for Bone and Mineral Research. PMID:27476799
O'Brien, Timothy D; Jia, Peilin; Caporaso, Neil E; Landi, Maria Teresa; Zhao, Zhongming
2018-02-27
There are two main types of lung cancer: small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). NSCLC has many subtypes, but the two most common are lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC). These subtypes are mainly classified by physiological and pathological characteristics, although there is increasing evidence of genetic and molecular differences as well. Although some work has been done at the somatic level to explore the genetic and biological differences among subtypes, little work has been done that interrogates these differences at the germline level to characterize the unique and shared susceptibility genes for each subtype. We used single-nucleotide polymorphisms (SNPs) from a genome-wide association study (GWAS) of European samples to interrogate the similarity of the subtypes at the SNP, gene, pathway, and regulatory levels. We expanded these genotyped SNPs to include all SNPs in linkage disequilibrium (LD) using data from the 1000 Genomes Project. We mapped these SNPs to several lung tissue expression quantitative trait loci (eQTL) and enhancer datasets to identify regulatory SNPs and their target genes. We used these genes to perform a biological pathway analysis for each subtype. We identified 8295, 8734, and 8361 SNPs with moderate association signals for LUAD, LUSC, and SCLC, respectively. Those SNPs had p < 1 × 10 - 3 in the original GWAS or were within LD (r 2 > 0.8, Europeans) to the genotyped SNPs. We identified 215, 320, and 172 disease-associated genes for LUAD, LUSC, and SCLC, respectively. Only five genes (CHRNA5, IDH3A, PSMA4, RP11-650 L12.2, and TBC1D2B) overlapped all subtypes. Furthermore, we observed only two pathways from the Kyoto Encyclopedia of Genes and Genomes shared by all subtypes. At the regulatory level, only three eQTL target genes and two enhancer target genes overlapped between all subtypes. Our results suggest that the three lung cancer subtypes do not share much genetic signal at the SNP, gene, pathway, or regulatory level, which differs from the common subtype classification based upon histology. However, three (CHRNA5, IDH3A, and PSMA4) of the five genes shared between the subtypes are well-known lung cancer genes that may act as general lung cancer genes regardless of subtype.
2012-01-01
Background F1 hybrid clones of Eucalyptus grandis and E. urophylla are widely grown for pulp and paper production in tropical and subtropical regions. Volume growth and wood quality are priority objectives in Eucalyptus tree improvement. The molecular basis of quantitative variation and trait expression in eucalypt hybrids, however, remains largely unknown. The recent availability of a draft genome sequence (http://www.phytozome.net) and genome-wide genotyping platforms, combined with high levels of genetic variation and high linkage disequilibrium in hybrid crosses, greatly facilitate the detection of quantitative trait loci (QTLs) as well as underlying candidate genes for growth and wood property traits. In this study, we used Diversity Arrays Technology markers to assess the genetic architecture of volume growth (diameter at breast height, DBH) and wood basic density in four-year-old progeny of an interspecific backcross pedigree of E. grandis and E. urophylla. In addition, we used Illumina RNA-Seq expression profiling in the E. urophylla backcross family to identify cis- and trans-acting polymorphisms (eQTLs) affecting transcript abundance of genes underlying QTLs for wood basic density. Results A total of five QTLs for DBH and 12 for wood basic density were identified in the two backcross families. Individual QTLs for DBH and wood basic density explained 3.1 to 12.2% of phenotypic variation. Candidate genes underlying QTLs for wood basic density on linkage groups 8 and 9 were found to share trans-acting eQTLs located on linkage groups 4 and 10, which in turn coincided with QTLs for wood basic density suggesting that these QTLs represent segregating components of an underlying transcriptional network. Conclusion This is the first demonstration of the use of next-generation expression profiling to quantify transcript abundance in a segregating tree population and identify candidate genes potentially affecting wood property variation. The QTLs identified in this study provide a resource for identifying candidate genes and developing molecular markers for marker-assisted breeding of volume growth and wood basic density. Our results suggest that integrated analysis of transcript and trait variation in eucalypt hybrids can be used to dissect the molecular basis of quantitative variation in wood property traits. PMID:22817272
Origins of magic: review of genetic and epigenetic effects.
Ramagopalan, Sreeram V; Knight, Marian; Ebers, George C; Knight, Julian C
2007-12-22
To assess the evidence for a genetic basis to magic. Literature review. Harry Potter novels of J K Rowling. Muggles, witches, wizards, and squibs. Limited. Family and twin studies, magical ability, and specific magical skills. Magic shows strong evidence of heritability, with familial aggregation and concordance in twins. Evidence suggests magical ability to be a quantitative trait. Specific magical skills, notably being able to speak to snakes, predict the future, and change hair colour, all seem heritable. A multilocus model with a dominant gene for magic might exist, controlled epistatically by one or more loci, possibly recessive in nature. Magical enhancers regulating gene expressionmay be involved, combined with mutations at specific genes implicated in speech and hair colour such as FOXP2 and MCR1.
Revealing evolutionary pathways by fitness landscape reconstruction.
Kogenaru, Manjunatha; de Vos, Marjon G J; Tans, Sander J
2009-01-01
The concept of epistasis has since long been used to denote non-additive fitness effects of genetic changes and has played a central role in understanding the evolution of biological systems. Owing to an array of novel experimental methodologies, it has become possible to experimentally determine epistatic interactions as well as more elaborate genotype-fitness maps. These data have opened up the investigation of a host of long-standing questions in evolutionary biology, such as the ruggedness of fitness landscapes and the accessibility of mutational trajectories, the evolution of sex, and the origin of robustness and modularity. Here we review this recent and timely marriage between systems biology and evolutionary biology, which holds the promise to understand evolutionary dynamics in a more mechanistic and predictive manner.
Gene × Environment Interactions in Schizophrenia: Evidence from Genetic Mouse Models
Marr, Julia; Bock, Gavin; Desbonnet, Lieve; Waddington, John
2016-01-01
The study of gene × environment, as well as epistatic interactions in schizophrenia, has provided important insight into the complex etiopathologic basis of schizophrenia. It has also increased our understanding of the role of susceptibility genes in the disorder and is an important consideration as we seek to translate genetic advances into novel antipsychotic treatment targets. This review summarises data arising from research involving the modelling of gene × environment interactions in schizophrenia using preclinical genetic models. Evidence for synergistic effects on the expression of schizophrenia-relevant endophenotypes will be discussed. It is proposed that valid and multifactorial preclinical models are important tools for identifying critical areas, as well as underlying mechanisms, of convergence of genetic and environmental risk factors, and their interaction in schizophrenia. PMID:27725886
Calcitonin Gene-Related Peptide (CGRP)
Russo, Andrew F.
2015-01-01
Migraine is a neurological disorder that manifests as a debilitating headache associated with altered sensory perception. The neuropeptide calcitonin gene-related peptide (CGRP) is now firmly established as a key player in migraine. Clinical trials carried out during the past decade have proved that CGRP receptor antagonists are effective for treating migraine, and antibodies to the receptor and CGRP are currently under investigation. Despite this progress in the clinical arena, the mechanisms by which CGRP triggers migraine remain uncertain. This review discusses mechanisms whereby CGRP enhances sensitivity to sensory input at multiple levels in both the periphery and central nervous system. Future studies on epistatic and epigenetic regulators of CGRP actions are expected to shed further light on CGRP actions in migraine. In conclusion, targeting CGRP represents an approachable therapeutic strategy for migraine. PMID:25340934
Stam, L. F.; Laurie, C. C.
1996-01-01
A molecular mapping experiment shows that a major gene effect on a quantitative trait, the level of alcohol dehydrogenase expression in Drosophila melanogaster, is due to multiple polymorphisms within the Adh gene. These polymorphisms are located in an intron, the coding sequence, and the 3' untranslated region. Because of nonrandom associations among polymorphisms at different sites, the individual effects combine (in some cases epistatically) to produce ``superalleles'' with large effect. These results have implications for the interpretation of major gene effects detected by quantitative trait locus mapping methods. They show that large effects due to a single locus may be due to multiple associated polymorphisms (or sequential fixations in isolated populations) rather than individual mutations of large effect. PMID:8978044
Chromosomes, conflict, and epigenetics: chromosomal speciation revisited.
Brown, Judith D; O'Neill, Rachel J
2010-01-01
Since Darwin first noted that the process of speciation was indeed the "mystery of mysteries," scientists have tried to develop testable models for the development of reproductive incompatibilities-the first step in the formation of a new species. Early theorists proposed that chromosome rearrangements were implicated in the process of reproductive isolation; however, the chromosomal speciation model has recently been questioned. In addition, recent data from hybrid model systems indicates that simple epistatic interactions, the Dobzhansky-Muller incompatibilities, are more complex. In fact, incompatibilities are quite broad, including interactions among heterochromatin, small RNAs, and distinct, epigenetically defined genomic regions such as the centromere. In this review, we will examine both classical and current models of chromosomal speciation and describe the "evolving" theory of genetic conflict, epigenetics, and chromosomal speciation.
Quigley, David A; Kandyba, Eve; Huang, Phillips; Halliwill, Kyle D; Sjölund, Jonas; Pelorosso, Facundo; Wong, Christine E; Hirst, Gillian L; Wu, Di; Delrosario, Reyno; Kumar, Atul; Balmain, Allan
2016-07-26
Inherited germline polymorphisms can cause gene expression levels in normal tissues to differ substantially between individuals. We present an analysis of the genetic architecture of normal adult skin from 470 genetically unique mice, demonstrating the effect of germline variants, skin tissue location, and perturbation by exogenous inflammation or tumorigenesis on gene signaling pathways. Gene networks related to specific cell types and signaling pathways, including sonic hedgehog (Shh), Wnt, Lgr family stem cell markers, and keratins, differed at these tissue sites, suggesting mechanisms for the differential susceptibility of dorsal and tail skin to development of skin diseases and tumorigenesis. The Pten tumor suppressor gene network is rewired in premalignant tumors compared to normal tissue, but this response to perturbation is lost during malignant progression. We present a software package for expression quantitative trait loci (eQTL) network analysis and demonstrate how network analysis of whole tissues provides insights into interactions between cell compartments and signaling molecules. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Graham, Deborah S Cunninghame; Pinder, Christopher L; Tombleson, Philip; Behrens, Timothy W; Martín, Javier; Fairfax, Benjamin P; Knight, Julian C; Chen, Lingyan; Replogle, Joseph; Syvänen, Ann-Christine; Rönnblom, Lars; Graham, Robert R; Wither, Joan E; Rioux, John D; Alarcón-Riquelme, Marta E; Vyse, Timothy J
2015-01-01
Systemic lupus erythematosus (SLE; OMIM 152700) is a genetically complex autoimmune disease characterized by loss of immune tolerance to nuclear and cell surface antigens. Previous genome-wide association studies (GWAS) had modest sample sizes, reducing their scope and reliability. Our study comprised 7,219 cases and 15,991 controls of European ancestry: a new GWAS, meta-analysis with a published GWAS and a replication study. We have mapped 43 susceptibility loci, including 10 novel associations. Assisted by dense genome coverage, imputation provided evidence for missense variants underpinning associations in eight genes. Other likely causal genes were established by examining associated alleles for cis-acting eQTL effects in a range of ex vivo immune cells. We found an over-representation (n=16) of transcription factors among SLE susceptibility genes. This supports the view that aberrantly regulated gene expression networks in multiple cell types in both the innate and adaptive immune response contribute to the risk of developing SLE. PMID:26502338
Li, Jin; Wang, Limei; Guo, Maozu; Zhang, Ruijie; Dai, Qiguo; Liu, Xiaoyan; Wang, Chunyu; Teng, Zhixia; Xuan, Ping; Zhang, Mingming
2015-01-01
In humans, despite the rapid increase in disease-associated gene discovery, a large proportion of disease-associated genes are still unknown. Many network-based approaches have been used to prioritize disease genes. Many networks, such as the protein-protein interaction (PPI), KEGG, and gene co-expression networks, have been used. Expression quantitative trait loci (eQTLs) have been successfully applied for the determination of genes associated with several diseases. In this study, we constructed an eQTL-based gene-gene co-regulation network (GGCRN) and used it to mine for disease genes. We adopted the random walk with restart (RWR) algorithm to mine for genes associated with Alzheimer disease. Compared to the Human Protein Reference Database (HPRD) PPI network alone, the integrated HPRD PPI and GGCRN networks provided faster convergence and revealed new disease-related genes. Therefore, using the RWR algorithm for integrated PPI and GGCRN is an effective method for disease-associated gene mining.
[Fine mapping of complex disease susceptibility loci].
Song, Qingfeng; Zhang, Hongxing; Ma, Yilong; Zhou, Gangqiao
2014-01-01
Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers have identified more than 3800 susceptibility loci for more than 660 diseases or traits. However, the most significantly associated variants or causative variants in these loci and their biological functions have remained to be clarified. These causative variants can help to elucidate the pathogenesis and discover new biomarkers of complex diseases. One of the main goals in the post-GWAS era is to identify the causative variants and susceptibility genes, and clarify their functional aspects by fine mapping. For common variants, imputation or re-sequencing based strategies were implemented to increase the number of analyzed variants and help to identify the most significantly associated variants. In addition, functional element, expression quantitative trait locus (eQTL) and haplotype analyses were performed to identify functional common variants and susceptibility genes. For rare variants, fine mapping was carried out by re-sequencing, rare haplotype analysis, family-based analysis, burden test, etc.This review summarizes the strategies and problems for fine mapping.
Mapping cis- and trans-regulatory effects across multiple tissues in twins
Grundberg, Elin; Small, Kerrin S.; Hedman, Åsa K.; Nica, Alexandra C.; Buil, Alfonso; Keildson, Sarah; Bell, Jordana T.; Yang, Tsun-Po; Meduri, Eshwar; Barrett, Amy; Nisbett, James; Sekowska, Magdalena; Wilk, Alicja; Shin, So-Youn; Glass, Daniel; Travers, Mary; Min, Josine L.; Ring, Sue; Ho, Karen; Thorleifsson, Gudmar; Kong, Augustine; Thorsteindottir, Unnur; Ainali, Chrysanthi; Dimas, Antigone S.; Hassanali, Neelam; Ingle, Catherine; Knowles, David; Krestyaninova, Maria; Lowe, Christopher E.; Di Meglio, Paola; Montgomery, Stephen B.; Parts, Leopold; Potter, Simon; Surdulescu, Gabriela; Tsaprouni, Loukia; Tsoka, Sophia; Bataille, Veronique; Durbin, Richard; Nestle, Frank O.; O’Rahilly, Stephen; Soranzo, Nicole; Lindgren, Cecilia M.; Zondervan, Krina T.; Ahmadi, Kourosh R.; Schadt, Eric E.; Stefansson, Kari; Smith, George Davey; McCarthy, Mark I.; Deloukas, Panos; Dermitzakis, Emmanouil T.; Spector, Tim D.
2013-01-01
Sequence-based variation in gene expression is a key driver of disease risk. Common variants regulating expression in cis have been mapped in many eQTL studies typically in single tissues from unrelated individuals. Here, we present a comprehensive analysis of gene expression across multiple tissues conducted in a large set of mono- and dizygotic twins that allows systematic dissection of genetic (cis and trans) and non-genetic effects on gene expression. Using identity-by-descent estimates, we show that at least 40% of the total heritable cis-effect on expression cannot be accounted for by common cis-variants, a finding which exposes the contribution of low frequency and rare regulatory variants with respect to both transcriptional regulation and complex trait susceptibility. We show that a substantial proportion of gene expression heritability is trans to the structural gene and identify several replicating trans-variants which act predominantly in a tissue-restricted manner and may regulate the transcription of many genes. PMID:22941192
McKay, James D.; Hung, Rayjean J.; Han, Younghun; Zong, Xuchen; Carreras-Torres, Robert; Christiani, David C.; Caporaso, Neil E.; Johansson, Mattias; Xiao, Xiangjun; Li, Yafang; Byun, Jinyoung; Dunning, Alison; Pooley, Karen A.; Qian, David C.; Ji, Xuemei; Liu, Geoffrey; Timofeeva, Maria N.; Bojesen, Stig E.; Wu, Xifeng; Le Marchand, Loic; Albanes, Demetrios; Bickeböller, Heike; Aldrich, Melinda C.; Bush, William S.; Tardon, Adonina; Rennert, Gad; Teare, M. Dawn; Field, John K.; Kiemeney, Lambertus A.; Lazarus, Philip; Haugen, Aage; Lam, Stephen; Schabath, Matthew B.; Andrew, Angeline S.; Shen, Hongbing; Hong, Yun-Chul; Yuan, Jian-Min; Bertazzi, Pier Alberto; Pesatori, Angela C.; Ye, Yuanqing; Diao, Nancy; Su, Li; Zhang, Ruyang; Brhane, Yonathan; Leighl, Natasha; Johansen, Jakob S.; Mellemgaard, Anders; Saliba, Walid; Haiman, Christopher A.; Wilkens, Lynne R.; Fernandez-Somoano, Ana; Fernandez-Tardon, Guillermo; van der Heijden, Henricus F.M.; Kim, Jin Hee; Dai, Juncheng; Hu, Zhibin; Davies, Michael PA; Marcus, Michael W.; Brunnström, Hans; Manjer, Jonas; Melander, Olle; Muller, David C.; Overvad, Kim; Trichopoulou, Antonia; Tumino, Rosario; Doherty, Jennifer A.; Barnett, Matt P.; Chen, Chu; Goodman, Gary E.; Cox, Angela; Taylor, Fiona; Woll, Penella; Brüske, Irene; Wichmann, H.-Erich; Manz, Judith; Muley, Thomas R.; Risch, Angela; Rosenberger, Albert; Grankvist, Kjell; Johansson, Mikael; Shepherd, Frances A.; Tsao, Ming-Sound; Arnold, Susanne M.; Haura, Eric B.; Bolca, Ciprian; Holcatova, Ivana; Janout, Vladimir; Kontic, Milica; Lissowska, Jolanta; Mukeria, Anush; Ognjanovic, Simona; Orlowski, Tadeusz M.; Scelo, Ghislaine; Swiatkowska, Beata; Zaridze, David; Bakke, Per; Skaug, Vidar; Zienolddiny, Shanbeh; Duell, Eric J.; Butler, Lesley M.; Koh, Woon-Puay; Gao, Yu-Tang; Houlston, Richard S.; McLaughlin, John; Stevens, Victoria L.; Joubert, Philippe; Lamontagne, Maxime; Nickle, David C.; Obeidat, Ma’en; Timens, Wim; Zhu, Bin; Song, Lei; Kachuri, Linda; Artigas, María Soler; Tobin, Martin D.; Wain, Louise V.; Rafnar, Thorunn; Thorgeirsson, Thorgeir E.; Reginsson, Gunnar W.; Stefansson, Kari; Hancock, Dana B.; Bierut, Laura J.; Spitz, Margaret R.; Gaddis, Nathan C.; Lutz, Sharon M.; Gu, Fangyi; Johnson, Eric O.; Kamal, Ahsan; Pikielny, Claudio; Zhu, Dakai; Lindströem, Sara; Jiang, Xia; Tyndale, Rachel F.; Chenevix-Trench, Georgia; Beesley, Jonathan; Bossé, Yohan; Chanock, Stephen; Brennan, Paul; Landi, Maria Teresa; Amos, Christopher I.
2017-01-01
Summary While several lung cancer susceptibility loci have been identified, much of lung cancer heritability remains unexplained. Here, 14,803 cases and 12,262 controls of European descent were genotyped on the OncoArray and combined with existing data for an aggregated GWAS analysis of lung cancer on 29,266 patients and 56,450 controls. We identified 18 susceptibility loci achieving genome wide significance, including 10 novel loci. The novel loci highlighted the striking heterogeneity in genetic susceptibility across lung cancer histological subtypes, with four loci associated with lung cancer overall and six with lung adenocarcinoma. Gene expression quantitative trait analysis (eQTL) in 1,425 normal lung tissues highlighted RNASET2, SECISBP2L and NRG1 as candidate genes. Other loci include genes such as a cholinergic nicotinic receptor, CHRNA2, and the telomere-related genes, OFBC1 and RTEL1. Further exploration of the target genes will continue to provide new insights into the etiology of lung cancer. PMID:28604730
Ron, Gil; Globerson, Yuval; Moran, Dror; Kaplan, Tommy
2017-12-21
Proximity-ligation methods such as Hi-C allow us to map physical DNA-DNA interactions along the genome, and reveal its organization into topologically associating domains (TADs). As the Hi-C data accumulate, computational methods were developed for identifying domain borders in multiple cell types and organisms. Here, we present PSYCHIC, a computational approach for analyzing Hi-C data and identifying promoter-enhancer interactions. We use a unified probabilistic model to segment the genome into domains, which we then merge hierarchically and fit using a local background model, allowing us to identify over-represented DNA-DNA interactions across the genome. By analyzing the published Hi-C data sets in human and mouse, we identify hundreds of thousands of putative enhancers and their target genes, and compile an extensive genome-wide catalog of gene regulation in human and mouse. As we show, our predictions are highly enriched for ChIP-seq and DNA accessibility data, evolutionary conservation, eQTLs and other DNA-DNA interaction data.
An xQTL map integrates the genetic architecture of the human brain's transcriptome and epigenome.
Ng, Bernard; White, Charles C; Klein, Hans-Ulrich; Sieberts, Solveig K; McCabe, Cristin; Patrick, Ellis; Xu, Jishu; Yu, Lei; Gaiteri, Chris; Bennett, David A; Mostafavi, Sara; De Jager, Philip L
2017-10-01
We report a multi-omic resource generated by applying quantitative trait locus (xQTL) analyses to RNA sequence, DNA methylation and histone acetylation data from the dorsolateral prefrontal cortex of 411 older adults who have all three data types. We identify SNPs significantly associated with gene expression, DNA methylation and histone modification levels. Many of these SNPs influence multiple molecular features, and we demonstrate that SNP effects on RNA expression are fully mediated by epigenetic features in 9% of these loci. Further, we illustrate the utility of our new resource, xQTL Serve, by using it to prioritize the cell type(s) most affected by an xQTL. We also reanalyze published genome wide association studies using an xQTL-weighted analysis approach and identify 18 new schizophrenia and 2 new bipolar susceptibility variants, which is more than double the number of loci that can be discovered with a larger blood-based expression eQTL resource.
DHU1 negatively regulates UV-B signaling via its direct interaction with COP1 and RUP1.
Kim, Sang-Hoon; Kim, Hani; Chung, Sunglan; Lee, Jae-Hoon
2017-09-16
Although DWD HYPERSENSITIVE TO UV-B 1 (DHU1) is reported to be a negative regulator in UV-B mediated cellular responses, its detailed role in UV-B signaling is still elusive. To further understand the action mechanism of DHU1 in UV-B response, physical and genetic interactions of DHU1 with various UV-B signaling components were investigated. Yeast two hybrid assay results suggested that DHU1 directly interacts with COP1 and RUP1, implying a functional connection with both COP1 and RUP1. In spite of the physical association between DHU1 and COP1, loss of DHU1 did not affect protein stability of COP1. Epistatic analysis showed that the functional loss of both DHU1 and UVR8 leads to alleviation of UV-B hypersensitivity displayed in dhu1-1. Moreover, phenotypic studies with dhu1-1 cop1-6 and dhu1-1 hy5-215 revealed that COP1 and HY5 are epistatic to DHU1, indicating that UV-B hypersensitivity of dhu1-1 requires both COP1 and HY5. In the case of dhu1-1 rup1-1, UV-B responsiveness was similar to that of both dhu1-1 and rup1-1, implying that DHU1 and RUP1 are required for each other's function. Collectively, these results show that the role of DHU1 as a negative regulator in UV-B response may be derived from its direct interaction with COP1 by sequestering COP1 from the active UVR8-COP1 complex, resulting in a decrease in the COP1 population that positively participates in UV-B signaling together with UVR8. Furthermore, this inhibitory role of DHU1 in UV-B signaling is likely to be functionally connected to RUP1. This study will serve as a platform to further understand more detailed action mechanism of DHU1 in UV-B response and DHU1-mediated core UV-B signaling in Arabidopsis. Copyright © 2017 Elsevier Inc. All rights reserved.
Reflections on the pathogenesis of Down syndrome.
Opitz, J M; Gilbert-Barness, E F
1990-01-01
Present efforts to identify, isolate, and characterize in molecular terms the "consensus" segment of 21q sufficient to cause most of the major and some of the most characteristic minor manifestations of Down syndrome will soon provide answers to many questions. However, we think that a reductionist approach to explain the Down syndrome phenotype in a "linear" manner from the DNA sequence of the segment will be doomed to failure from the outset because of the open, complex, nonlinear, hierarchical nature of morphogenetic systems. Neo-Darwinism is under strong attack; most genetic changes accumulated over time may very well be of neutral effect, and detailed studies in several related groups of vertebrate species has shown that molecular and organismal evolution are largely independent of one another. It has been pointed out recently that biology lacks a theory of ontogenetic and phylogenetic development, and that a purely "genocentric" view of biology at the expense of the complexly hierarchical intrinsic epigenetic attributes of developmental systems is "out of focus with respect to ... biological organization and morphogenesis," and may be "a residue of nineteenth century romantic idealism." Down syndrome impresses us as a paradigm of increased developmental variability due to a deceleration of the rate of development (neoteny) with many anomalies of incomplete morphogenesis (vestigia), atavisms, increased morphometric variability with many decreased means, increased variances, and increased fluctuating asymmetry. These abnormalities, together with highly increased risk of prenatal death and postnatal morbidity, impaired growth, and abnormal CNS and gonadal structure and function characteristic of most aneuploidy syndromes, suggest to us that the pathogenesis of Down syndrome is best viewed in terms of the mechanisms of speciation. Transgenic experiment involving sequential or overlapping pieces of "the consensus segment" on distal 21q22.1-22.3 may help decide to what extent the Down syndrome phenotype can be resolved into the additive effect of several pleiotropic oligogenes with epistatic interaction or the indirect secondary "mass" effect of a specific segment of 21q with epistatic interaction involving multiple loci on 21q and other chromosomes.
2012-01-01
Background Proanthocyanidins (PAs), or condensed tannins, are flavonoid polymers, widespread throughout the plant kingdom, which provide protection against herbivores while conferring organoleptic and nutritive values to plant-derived foods, such as wine. However, the genetic basis of qualitative and quantitative PA composition variation is still poorly understood. To elucidate the genetic architecture of the complex grape PA composition, we first carried out quantitative trait locus (QTL) analysis on a 191-individual pseudo-F1 progeny. Three categories of PA variables were assessed: total content, percentages of constitutive subunits and composite ratio variables. For nine functional candidate genes, among which eight co-located with QTLs, we performed association analyses using a diversity panel of 141 grapevine cultivars in order to identify causal SNPs. Results Multiple QTL analysis revealed a total of 103 and 43 QTLs, respectively for seed and skin PA variables. Loci were mainly of additive effect while some loci were primarily of dominant effect. Results also showed a large involvement of pairwise epistatic interactions in shaping PA composition. QTLs for PA variables in skin and seeds differed in number, position, involvement of epistatic interaction and allelic effect, thus revealing different genetic determinisms for grape PA composition in seeds and skin. Association results were consistent with QTL analyses in most cases: four out of nine tested candidate genes (VvLAR1, VvMYBPA2, VvCHI1, VvMYBPA1) showed at least one significant association with PA variables, especially VvLAR1 revealed as of great interest for further functional investigation. Some SNP-phenotype associations were observed only in the diversity panel. Conclusions This study presents the first QTL analysis on grape berry PA composition with a comparison between skin and seeds, together with an association study. Our results suggest a complex genetic control for PA traits and different genetic architectures for grape PA composition between berry skin and seeds. This work also uncovers novel genomic regions for further investigation in order to increase our knowledge of the genetic basis of PA composition. PMID:22369244
Ibeh, Neke; Nshogozabahizi, Jean Claude; Aris-Brosou, Stéphane
2016-06-01
Throughout the last 3 decades, Ebola virus (EBOV) outbreaks have been confined to isolated areas within Central Africa; however, the 2014 variant reached unprecedented transmission and mortality rates. While the outbreak was still under way, it was reported that the variant leading up to this outbreak evolved faster than previous EBOV variants, but evidence for diversifying selection was undetermined. Here, we test this selection hypothesis and show that while previous EBOV outbreaks were preceded by bursts of diversification, evidence for site-specific diversifying selection during the emergence of the 2014 EBOV clade is weak. However, we show strong evidence supporting an interplay between selection and correlated evolution (epistasis), particularly in the mucin-like domain (MLD) of the EBOV glycoprotein. By reconstructing ancestral structures of the MLD, we further propose a structural mechanism explaining how the substitutions that accumulated between 1918 and 1969 distorted the MLD, while more recent epistatic substitutions restored part of the structure, with the most recent substitution being adaptive. We suggest that it is this complex interplay between weak selection, epistasis, and structural constraints that has shaped the evolution of the 2014 EBOV variant. The role that selection plays in the emergence of viral epidemics remains debated, particularly in the context of the 2014 EBOV outbreak. Most critically, should such evidence exist, it is generally unclear how this relates to function and increased virulence. Here, we show that the viral lineage leading up to the 2014 outbreak underwent a complex interplay between selection and correlated evolution (epistasis) in a protein region that is critical for immune evasion. We then reconstructed the three-dimensional structure of this domain and showed that the initial mutations in this lineage deformed the structure, while subsequent mutations restored part of the structure. Along this mutational path, the first and last mutations were adaptive, while the intervening ones were epistatic. Altogether, we provide a mechanistic model that explains how selection and epistasis acted on the structural constraints that materialized during the 2014 EBOV outbreak. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Kos, Mark Z; Carless, Melanie A; Peralta, Juan; Curran, Joanne E; Quillen, Ellen E; Almeida, Marcio; Blackburn, August; Blondell, Lucy; Roalf, David R; Pogue-Geile, Michael F; Gur, Ruben C; Göring, Harald H H; Nimgaonkar, Vishwajit L; Gur, Raquel E; Almasy, Laura
2017-12-01
Schizophrenia is a serious mental illness, involving disruptions in thought and behavior, with a worldwide prevalence of about one percent. Although highly heritable, much of the genetic liability of schizophrenia is yet to be explained. We searched for susceptibility loci in multiplex, multigenerational families affected by schizophrenia, targeting protein-altering variation with in silico predicted functional effects. Exome sequencing was performed on 136 samples from eight European-American families, including 23 individuals diagnosed with schizophrenia or schizoaffective disorder. In total, 11,878 non-synonymous variants from 6,396 genes were tested for their association with schizophrenia spectrum disorders. Pathway enrichment analyses were conducted on gene-based test results, protein-protein interaction (PPI) networks, and epistatic effects. Using a significance threshold of FDR < 0.1, association was detected for rs10941112 (p = 2.1 × 10 -5 ; q-value = 0.073) in AMACR, a gene involved in fatty acid metabolism and previously implicated in schizophrenia, with significant cis effects on gene expression (p = 5.5 × 10 -4 ), including brain tissue data from the Genotype-Tissue Expression project (minimum p = 6.0 × 10 -5 ). A second SNP, rs10378 located in TMEM176A, also shows risk effects in the exome data (p = 2.8 × 10 -5 ; q-value = 0.073). PPIs among our top gene-based association results (p < 0.05; n = 359 genes) reveal significant enrichment of genes involved in NCAM-mediated neurite outgrowth (p = 3.0 × 10 -5 ), while exome-wide SNP-SNP interaction effects for rs10941112 and rs10378 indicate a potential role for kinase-mediated signaling involved in memory and learning. In conclusion, these association results implicate AMACR and TMEM176A in schizophrenia risk, whose effects may be modulated by genes involved in synaptic plasticity and neurocognitive performance. © 2017 Wiley Periodicals, Inc.
Hong, Jungeui; Gresham, David
2014-01-01
One of the central goals of evolutionary biology is to explain and predict the molecular basis of adaptive evolution. We studied the evolution of genetic networks in Saccharomyces cerevisiae (budding yeast) populations propagated for more than 200 generations in different nitrogen-limiting conditions. We find that rapid adaptive evolution in nitrogen-poor environments is dominated by the de novo generation and selection of copy number variants (CNVs), a large fraction of which contain genes encoding specific nitrogen transporters including PUT4, DUR3 and DAL4. The large fitness increases associated with these alleles limits the genetic heterogeneity of adapting populations even in environments with multiple nitrogen sources. Complete identification of acquired point mutations, in individual lineages and entire populations, identified heterogeneity at the level of genetic loci but common themes at the level of functional modules, including genes controlling phosphatidylinositol-3-phosphate metabolism and vacuole biogenesis. Adaptive strategies shared with other nutrient-limited environments point to selection of genetic variation in the TORC1 and Ras/PKA signaling pathways as a general mechanism underlying improved growth in nutrient-limited environments. Within a single population we observed the repeated independent selection of a multi-locus genotype, comprised of the functionally related genes GAT1, MEP2 and LST4. By studying the fitness of individual alleles, and their combination, as well as the evolutionary history of the evolving population, we find that the order in which these mutations are acquired is constrained by epistasis. The identification of repeatedly selected variation at functionally related loci that interact epistatically suggests that gene network polymorphisms (GNPs) may be a frequent outcome of adaptive evolution. Our results provide insight into the mechanistic basis by which cells adapt to nutrient-limited environments and suggest that knowledge of the selective environment and the regulatory mechanisms important for growth and survival in that environment greatly increase the predictability of adaptive evolution.
Kachroo, Aardra; Venugopal, Srivathsa C.; Lapchyk, Ludmila; Falcone, Deane; Hildebrand, David; Kachroo, Pradeep
2004-01-01
Stearoyl-acyl-carrier-protein-desaturase-mediated conversion of stearic acid (18:0) to oleic acid (18:1) is a key step, which regulates levels of unsaturated fatty acids in cells. We previously showed that stearoyl-acyl-carrier-protein-desaturase mutants ssi2/fab2 carrying a loss-of-function mutation in the plastidial glycerol-3-phosphate (G3P) acyltransferase (act1) have elevated 18:1 levels and are restored in their altered defense signaling. Because G3P is required for the acylation of 18:1 by G3P acyltransferase, it was predicted that reduction of G3P levels should increase 18:1 levels and thereby revert ssi2-triggered phenotypes. Here we show that a mutation in G3P dehydrogenase restores both salicylic acid- and jasmonic acid-mediated phenotypes of ssi2 plants. The G3P dehydrogenase gene was identified by map-based cloning of the ssi2 suppressor mutant rdc8 (gly1-3) and confirmed by epistatic analysis of ssi2 with gly1-1. Restoration of ssi2-triggered phenotypes by the gly1-3 mutation was age-dependent and correlated with the levels of 18:1. Regeneration of G3P pools by glycerol application in ssi2 and ssi2 gly1-3 plants caused a marked reduction in the 18:1 levels, which rendered these plants hypersensitive to glycerol. This hypersensitivity in ssi2 was rescued by the act1 mutation. Furthermore, overexpression of the ACT1 gene resulted in enhanced sensitivity to glycerol. Glycerol application also lowered the 18:1 content in SSI2 plants and converted these into ssi2-mimics. Our results show that 18:1 levels in plastids are regulated by means of acylation with G3P, and a balance between G3P and 18:1 is critical for the regulation of salicylic acid- and jasmonic acid-mediated signaling pathways. PMID:15044700
Veturi, Yogasudha; Ritchie, Marylyn D
2018-01-01
Transcriptome-wide association studies (TWAS) have recently been employed as an approach that can draw upon the advantages of genome-wide association studies (GWAS) and gene expression studies to identify genes associated with complex traits. Unlike standard GWAS, summary level data suffices for TWAS and offers improved statistical power. Two popular TWAS methods include either (a) imputing the cis genetic component of gene expression from smaller sized studies (using multi-SNP prediction or MP) into much larger effective sample sizes afforded by GWAS - TWAS-MP or (b) using summary-based Mendelian randomization - TWAS-SMR. Although these methods have been effective at detecting functional variants, it remains unclear how extensive variability in the genetic architecture of complex traits and diseases impacts TWAS results. Our goal was to investigate the different scenarios under which these methods yielded enough power to detect significant expression-trait associations. In this study, we conducted extensive simulations based on 6000 randomly chosen, unrelated Caucasian males from Geisinger's MyCode population to compare the power to detect cis expression-trait associations (within 500 kb of a gene) using the above-described approaches. To test TWAS across varying genetic backgrounds we simulated gene expression and phenotype using different quantitative trait loci per gene and cis-expression /trait heritability under genetic models that differentiate the effect of causality from that of pleiotropy. For each gene, on a training set ranging from 100 to 1000 individuals, we either (a) estimated regression coefficients with gene expression as the response using five different methods: LASSO, elastic net, Bayesian LASSO, Bayesian spike-slab, and Bayesian ridge regression or (b) performed eQTL analysis. We then sampled with replacement 50,000, 150,000, and 300,000 individuals respectively from the testing set of the remaining 5000 individuals and conducted GWAS on each set. Subsequently, we integrated the GWAS summary statistics derived from the testing set with the weights (or eQTLs) derived from the training set to identify expression-trait associations using (a) TWAS-MP (b) TWAS-SMR (c) eQTL-based GWAS, or (d) standalone GWAS. Finally, we examined the power to detect functionally relevant genes using the different approaches under the considered simulation scenarios. In general, we observed great similarities among TWAS-MP methods although the Bayesian methods resulted in improved power in comparison to LASSO and elastic net as the trait architecture grew more complex while training sample sizes and expression heritability remained small. Finally, we observed high power under causality but very low to moderate power under pleiotropy.
Using genetic markers to orient the edges in quantitative trait networks: the NEO software.
Aten, Jason E; Fuller, Tova F; Lusis, Aldons J; Horvath, Steve
2008-04-15
Systems genetic studies have been used to identify genetic loci that affect transcript abundances and clinical traits such as body weight. The pairwise correlations between gene expression traits and/or clinical traits can be used to define undirected trait networks. Several authors have argued that genetic markers (e.g expression quantitative trait loci, eQTLs) can serve as causal anchors for orienting the edges of a trait network. The availability of hundreds of thousands of genetic markers poses new challenges: how to relate (anchor) traits to multiple genetic markers, how to score the genetic evidence in favor of an edge orientation, and how to weigh the information from multiple markers. We develop and implement Network Edge Orienting (NEO) methods and software that address the challenges of inferring unconfounded and directed gene networks from microarray-derived gene expression data by integrating mRNA levels with genetic marker data and Structural Equation Model (SEM) comparisons. The NEO software implements several manual and automatic methods for incorporating genetic information to anchor traits. The networks are oriented by considering each edge separately, thus reducing error propagation. To summarize the genetic evidence in favor of a given edge orientation, we propose Local SEM-based Edge Orienting (LEO) scores that compare the fit of several competing causal graphs. SEM fitting indices allow the user to assess local and overall model fit. The NEO software allows the user to carry out a robustness analysis with regard to genetic marker selection. We demonstrate the utility of NEO by recovering known causal relationships in the sterol homeostasis pathway using liver gene expression data from an F2 mouse cross. Further, we use NEO to study the relationship between a disease gene and a biologically important gene co-expression module in liver tissue. The NEO software can be used to orient the edges of gene co-expression networks or quantitative trait networks if the edges can be anchored to genetic marker data. R software tutorials, data, and supplementary material can be downloaded from: http://www.genetics.ucla.edu/labs/horvath/aten/NEO.
Reyes Fernandez, Perla C.; Replogle, Rebecca A.; Wang, Libo; Zhang, Min; Fleet, James C.
2016-01-01
Low dietary calcium (Ca) intake during growth limits peak bone mass but physiological adaptation can prevent this adverse effect. To assess the genetic control on the physiologic response to dietary Ca restriction (RCR) we conducted a study in 51 BXD lines fed either 0.5% (basal) or 0.25% (low) Ca diets from 4–12 wks of age (n=8/line/diet). Ca absorption (CaAbs), femur bone mineral density (BMD), and bone mineral content (BMC) were examined. ANCOVA with body size as covariate was used to detect significant line and diet main effects, and line-by-diet interactions. Body size-corrected residuals were used for linkage mapping and to estimate heritability (h2). Loci controlling the phenotypes were identified using composite interval mapping on each diet and for the RCR. h2 of basal phenotypes (0.37– 0.43) and their RCR (0.32–0.38) was moderate. For each phenotype we identified multiple QTL on each diet and for the RCR. Several loci affected multiple traits: Chr 1 (88.3–90.6 cM, CaAbs, BMC), Chr 4 (45.8–49.2 cM, CaAbs, BMD, BMC), Chr 8 (28.6–31.6 cM, CaAbs, BMD RCR), and Chr 15 (13.6–24 cM, BMD, BMC), and (32.3–36 cM, CaAbs RCR, BMD). This suggests that gene clusters may regulate interdependent bone-related phenotypes. Using in silico expression QTL (eQTL) mapping and bioinformatic tools we identified novel candidates for the regulation of bone under Ca stress (Ext1, Deptor), and for the first time, we report genes modulating Ca absorption (Inadl, Sc4mol, Sh3rf1 and Dennd3), and both Ca and bone metabolism (Tceanc2, Tll1 and Aadat). Our data reveal gene-by-diet interactions and the existence of novel relationships between bone and Ca metabolism during growth. This article is protected by copyright. All rights reserved PMID:26636428
ViSEN: methodology and software for visualization of statistical epistasis networks
Hu, Ting; Chen, Yuanzhu; Kiralis, Jeff W.; Moore, Jason H.
2013-01-01
The non-linear interaction effect among multiple genetic factors, i.e. epistasis, has been recognized as a key component in understanding the underlying genetic basis of complex human diseases and phenotypic traits. Due to the statistical and computational complexity, most epistasis studies are limited to interactions with an order of two. We developed ViSEN to analyze and visualize epistatic interactions of both two-way and three-way. ViSEN not only identifies strong interactions among pairs or trios of genetic attributes, but also provides a global interaction map that shows neighborhood and clustering structures. This visualized information could be very helpful to infer the underlying genetic architecture of complex diseases and to generate plausible hypotheses for further biological validations. ViSEN is implemented in Java and freely available at https://sourceforge.net/projects/visen/. PMID:23468157
Sadee, Wolfgang
2013-09-01
Pharmacogenetic biomarker tests include mostly specific single gene-drug pairs, capable of accounting for a portion of interindividual variability in drug response and toxicity. However, multiple genes are likely to contribute, either acting independently or epistatically, with the CYP2C9-VKORC1-warfarin test panel, an example of a clinically used gene-gene-dug interaction. I discuss here further instances of gene-gene-drug interactions, including a proposed dynamic effect on statin therapy by genetic variants in both a transporter (SLCO1B1) and a metabolizing enzyme (CYP3A4) in liver cells, the main target site where statins block cholesterol synthesis. These examples set a conceptual framework for developing diagnostic panels involving multiple gene-drug combinations. Copyright © 2013 Wiley Periodicals, Inc.
Multiplexed droplet single-cell RNA-sequencing using natural genetic variation.
Kang, Hyun Min; Subramaniam, Meena; Targ, Sasha; Nguyen, Michelle; Maliskova, Lenka; McCarthy, Elizabeth; Wan, Eunice; Wong, Simon; Byrnes, Lauren; Lanata, Cristina M; Gate, Rachel E; Mostafavi, Sara; Marson, Alexander; Zaitlen, Noah; Criswell, Lindsey A; Ye, Chun Jimmie
2018-01-01
Droplet single-cell RNA-sequencing (dscRNA-seq) has enabled rapid, massively parallel profiling of transcriptomes. However, assessing differential expression across multiple individuals has been hampered by inefficient sample processing and technical batch effects. Here we describe a computational tool, demuxlet, that harnesses natural genetic variation to determine the sample identity of each droplet containing a single cell (singlet) and detect droplets containing two cells (doublets). These capabilities enable multiplexed dscRNA-seq experiments in which cells from unrelated individuals are pooled and captured at higher throughput than in standard workflows. Using simulated data, we show that 50 single-nucleotide polymorphisms (SNPs) per cell are sufficient to assign 97% of singlets and identify 92% of doublets in pools of up to 64 individuals. Given genotyping data for each of eight pooled samples, demuxlet correctly recovers the sample identity of >99% of singlets and identifies doublets at rates consistent with previous estimates. We apply demuxlet to assess cell-type-specific changes in gene expression in 8 pooled lupus patient samples treated with interferon (IFN)-β and perform eQTL analysis on 23 pooled samples.
Katsumata, Yuriko; Nelson, Peter T.; Ellingson, Sally R.; Fardo, David W.
2017-01-01
Hippocampal sclerosis of aging (HS-Aging) is a common neurodegenerative condition associated with dementia. To learn more about genetic risk of HS-Aging pathology, we tested gene-based associations of the GRN, TMEM106B, ABCC9, and KCNMB2 genes, which were reported to be associated with HS-Aging pathology in previous studies. Genetic data were obtained from the Alzheimer’s Disease Genetics Consortium (ADGC), linked to autopsy-derived neuropathological outcomes from the National Alzheimer’s Coordinating Center (NACC). Of the 3,251 subjects included in the study, 271 (8.3%) were identified as an HS-Aging case. The significant gene-based association between the ABCC9 gene and HS-Aging appeared to be driven by a region in which a significant haplotype-based association was found. We tested this haplotype as an expression Quantitative Trait Locus (eQTL) using two different public-access brain gene expression databases. The HS-Aging pathology protective ABCC9 haplotype was associated with decreased ABCC9 expression, indicating a possible toxic gain of function. PMID:28131462
Dadaev, Tokhir; Saunders, Edward J; Newcombe, Paul J; Anokian, Ezequiel; Leongamornlert, Daniel A; Brook, Mark N; Cieza-Borrella, Clara; Mijuskovic, Martina; Wakerell, Sarah; Olama, Ali Amin Al; Schumacher, Fredrick R; Berndt, Sonja I; Benlloch, Sara; Ahmed, Mahbubl; Goh, Chee; Sheng, Xin; Zhang, Zhuo; Muir, Kenneth; Govindasami, Koveela; Lophatananon, Artitaya; Stevens, Victoria L; Gapstur, Susan M; Carter, Brian D; Tangen, Catherine M; Goodman, Phyllis; Thompson, Ian M; Batra, Jyotsna; Chambers, Suzanne; Moya, Leire; Clements, Judith; Horvath, Lisa; Tilley, Wayne; Risbridger, Gail; Gronberg, Henrik; Aly, Markus; Nordström, Tobias; Pharoah, Paul; Pashayan, Nora; Schleutker, Johanna; Tammela, Teuvo L J; Sipeky, Csilla; Auvinen, Anssi; Albanes, Demetrius; Weinstein, Stephanie; Wolk, Alicja; Hakansson, Niclas; West, Catharine; Dunning, Alison M; Burnet, Neil; Mucci, Lorelei; Giovannucci, Edward; Andriole, Gerald; Cussenot, Olivier; Cancel-Tassin, Géraldine; Koutros, Stella; Freeman, Laura E Beane; Sorensen, Karina Dalsgaard; Orntoft, Torben Falck; Borre, Michael; Maehle, Lovise; Grindedal, Eli Marie; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Martin, Richard M; Travis, Ruth C; Key, Tim J; Hamilton, Robert J; Fleshner, Neil E; Finelli, Antonio; Ingles, Sue Ann; Stern, Mariana C; Rosenstein, Barry; Kerns, Sarah; Ostrer, Harry; Lu, Yong-Jie; Zhang, Hong-Wei; Feng, Ninghan; Mao, Xueying; Guo, Xin; Wang, Guomin; Sun, Zan; Giles, Graham G; Southey, Melissa C; MacInnis, Robert J; FitzGerald, Liesel M; Kibel, Adam S; Drake, Bettina F; Vega, Ana; Gómez-Caamaño, Antonio; Fachal, Laura; Szulkin, Robert; Eklund, Martin; Kogevinas, Manolis; Llorca, Javier; Castaño-Vinyals, Gemma; Penney, Kathryn L; Stampfer, Meir; Park, Jong Y; Sellers, Thomas A; Lin, Hui-Yi; Stanford, Janet L; Cybulski, Cezary; Wokolorczyk, Dominika; Lubinski, Jan; Ostrander, Elaine A; Geybels, Milan S; Nordestgaard, Børge G; Nielsen, Sune F; Weisher, Maren; Bisbjerg, Rasmus; Røder, Martin Andreas; Iversen, Peter; Brenner, Hermann; Cuk, Katarina; Holleczek, Bernd; Maier, Christiane; Luedeke, Manuel; Schnoeller, Thomas; Kim, Jeri; Logothetis, Christopher J; John, Esther M; Teixeira, Manuel R; Paulo, Paula; Cardoso, Marta; Neuhausen, Susan L; Steele, Linda; Ding, Yuan Chun; De Ruyck, Kim; De Meerleer, Gert; Ost, Piet; Razack, Azad; Lim, Jasmine; Teo, Soo-Hwang; Lin, Daniel W; Newcomb, Lisa F; Lessel, Davor; Gamulin, Marija; Kulis, Tomislav; Kaneva, Radka; Usmani, Nawaid; Slavov, Chavdar; Mitev, Vanio; Parliament, Matthew; Singhal, Sandeep; Claessens, Frank; Joniau, Steven; Van den Broeck, Thomas; Larkin, Samantha; Townsend, Paul A; Aukim-Hastie, Claire; Gago-Dominguez, Manuela; Castelao, Jose Esteban; Martinez, Maria Elena; Roobol, Monique J; Jenster, Guido; van Schaik, Ron H N; Menegaux, Florence; Truong, Thérèse; Koudou, Yves Akoli; Xu, Jianfeng; Khaw, Kay-Tee; Cannon-Albright, Lisa; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Thibodeau, Stephen N; McDonnell, Shannon K; Schaid, Daniel J; Lindstrom, Sara; Turman, Constance; Ma, Jing; Hunter, David J; Riboli, Elio; Siddiq, Afshan; Canzian, Federico; Kolonel, Laurence N; Le Marchand, Loic; Hoover, Robert N; Machiela, Mitchell J; Kraft, Peter; Freedman, Matthew; Wiklund, Fredrik; Chanock, Stephen; Henderson, Brian E; Easton, Douglas F; Haiman, Christopher A; Eeles, Rosalind A; Conti, David V; Kote-Jarai, Zsofia
2018-06-11
Prostate cancer is a polygenic disease with a large heritable component. A number of common, low-penetrance prostate cancer risk loci have been identified through GWAS. Here we apply the Bayesian multivariate variable selection algorithm JAM to fine-map 84 prostate cancer susceptibility loci, using summary data from a large European ancestry meta-analysis. We observe evidence for multiple independent signals at 12 regions and 99 risk signals overall. Only 15 original GWAS tag SNPs remain among the catalogue of candidate variants identified; the remainder are replaced by more likely candidates. Biological annotation of our credible set of variants indicates significant enrichment within promoter and enhancer elements, and transcription factor-binding sites, including AR, ERG and FOXA1. In 40 regions at least one variant is colocalised with an eQTL in prostate cancer tissue. The refined set of candidate variants substantially increase the proportion of familial relative risk explained by these known susceptibility regions, which highlights the importance of fine-mapping studies and has implications for clinical risk profiling.
Small RNA sequencing in cells and exosomes identifies eQTLs and 14q32 as a region of active export
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tsang, Emily K.; Abell, Nathan S.; Li, Xin
Exosomes are small extracellular vesicles that carry heterogeneous cargo, including RNA, between cells. Increasing evidence suggests that exosomes are important mediators of intercellular communication and biomarkers of disease. Despite this, the variability of exosomal RNA between individuals has not been well quantified. To assess this variability, we sequenced the small RNA of cells and exosomes from a 17-member family. Across individuals, we show that selective export of miRNAs occurs not only at the level of specific transcripts, but that a cluster of 74 mature miRNAs on chromosome 14q32 is massively exported in exosomes while mostly absent from cells. We alsomore » observe more interindividual variability between exosomal samples than between cellular ones and identify four miRNA expression quantitative trait loci shared between cells and exosomes. Lastly, our findings indicate that genomically colocated miRNAs can be exported together and highlight the variability in exosomal miRNA levels between individuals as relevant for exosome use as diagnostics.« less
Heinig, Matthias; Adriaens, Michiel E; Schafer, Sebastian; van Deutekom, Hanneke W M; Lodder, Elisabeth M; Ware, James S; Schneider, Valentin; Felkin, Leanne E; Creemers, Esther E; Meder, Benjamin; Katus, Hugo A; Rühle, Frank; Stoll, Monika; Cambien, François; Villard, Eric; Charron, Philippe; Varro, Andras; Bishopric, Nanette H; George, Alfred L; Dos Remedios, Cristobal; Moreno-Moral, Aida; Pesce, Francesco; Bauerfeind, Anja; Rüschendorf, Franz; Rintisch, Carola; Petretto, Enrico; Barton, Paul J; Cook, Stuart A; Pinto, Yigal M; Bezzina, Connie R; Hubner, Norbert
2017-09-14
Genetic variation is an important determinant of RNA transcription and splicing, which in turn contributes to variation in human traits, including cardiovascular diseases. Here we report the first in-depth survey of heart transcriptome variation using RNA-sequencing in 97 patients with dilated cardiomyopathy and 108 non-diseased controls. We reveal extensive differences of gene expression and splicing between dilated cardiomyopathy patients and controls, affecting known as well as novel dilated cardiomyopathy genes. Moreover, we show a widespread effect of genetic variation on the regulation of transcription, isoform usage, and allele-specific expression. Systematic annotation of genome-wide association SNPs identifies 60 functional candidate genes for heart phenotypes, representing 20% of all published heart genome-wide association loci. Focusing on the dilated cardiomyopathy phenotype we found that eQTL variants are also enriched for dilated cardiomyopathy genome-wide association signals in two independent cohorts. RNA transcription, splicing, and allele-specific expression are each important determinants of the dilated cardiomyopathy phenotype and are controlled by genetic factors. Our results represent a powerful resource for the field of cardiovascular genetics.
Jin, Ying; Andersen, Genevieve; Yorgov, Daniel; Ferrara, Tracey M; Ben, Songtao; Brownson, Kelly M; Holland, Paulene J; Birlea, Stanca A; Siebert, Janet; Hartmann, Anke; Lienert, Anne; van Geel, Nanja; Lambert, Jo; Luiten, Rosalie M; Wolkerstorfer, Albert; Wietze van der Veen, J P; Bennett, Dorothy C; Taïeb, Alain; Ezzedine, Khaled; Kemp, E Helen; Gawkrodger, David J; Weetman, Anthony P; Kõks, Sulev; Prans, Ele; Kingo, Külli; Karelson, Maire; Wallace, Margaret R; McCormack, Wayne T; Overbeck, Andreas; Moretti, Silvia; Colucci, Roberta; Picardo, Mauro; Silverberg, Nanette B; Olsson, Mats; Valle, Yan; Korobko, Igor; Böhm, Markus; Lim, Henry W; Hamzavi, Iltefat; Zhou, Li; Mi, Qing-Sheng; Fain, Pamela R; Santorico, Stephanie A; Spritz, Richard A
2016-11-01
Vitiligo is an autoimmune disease in which depigmented skin results from the destruction of melanocytes, with epidemiological association with other autoimmune diseases. In previous linkage and genome-wide association studies (GWAS1 and GWAS2), we identified 27 vitiligo susceptibility loci in patients of European ancestry. We carried out a third GWAS (GWAS3) in European-ancestry subjects, with augmented GWAS1 and GWAS2 controls, genome-wide imputation, and meta-analysis of all three GWAS, followed by an independent replication. The combined analyses, with 4,680 cases and 39,586 controls, identified 23 new significantly associated loci and 7 suggestive loci. Most encode immune and apoptotic regulators, with some also associated with other autoimmune diseases, as well as several melanocyte regulators. Bioinformatic analyses indicate a predominance of causal regulatory variation, some of which corresponds to expression quantitative trait loci (eQTLs) at these loci. Together, the identified genes provide a framework for the genetic architecture and pathobiology of vitiligo, highlight relationships with other autoimmune diseases and melanoma, and offer potential targets for treatment.
Genome-wide significant risk associations for mucinous ovarian carcinoma.
Kelemen, Linda E; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; Lee, Janet M; Seo, Ji-Heui; Phelan, Catherine M; Beesley, Jonathan; Chen, Xiaoqing; Spindler, Tassja J; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia
2015-08-01
Genome-wide association studies have identified several risk associations for ovarian carcinomas but not for mucinous ovarian carcinomas (MOCs). Our analysis of 1,644 MOC cases and 21,693 controls with imputation identified 3 new risk associations: rs752590 at 2q13 (P = 3.3 × 10(-8)), rs711830 at 2q31.1 (P = 7.5 × 10(-12)) and rs688187 at 19q13.2 (P = 6.8 × 10(-13)). We identified significant expression quantitative trait locus (eQTL) associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10(-4), false discovery rate (FDR) = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk-associated SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease.
Predicting enhancer activity and variant impact using gkm-SVM.
Beer, Michael A
2017-09-01
We participated in the Critical Assessment of Genome Interpretation eQTL challenge to further test computational models of regulatory variant impact and their association with human disease. Our prediction model is based on a discriminative gapped-kmer SVM (gkm-SVM) trained on genome-wide chromatin accessibility data in the cell type of interest. The comparisons with massively parallel reporter assays (MPRA) in lymphoblasts show that gkm-SVM is among the most accurate prediction models even though all other models used the MPRA data for model training, and gkm-SVM did not. In addition, we compare gkm-SVM with other MPRA datasets and show that gkm-SVM is a reliable predictor of expression and that deltaSVM is a reliable predictor of variant impact in K562 cells and mouse retina. We further show that DHS (DNase-I hypersensitive sites) and ATAC-seq (assay for transposase-accessible chromatin using sequencing) data are equally predictive substrates for training gkm-SVM, and that DHS regions flanked by H3K27Ac and H3K4me1 marks are more predictive than DHS regions alone. © 2017 Wiley Periodicals, Inc.
Wang, Jianan; He, Max M; Li, Liren; Zhang, Jinfeng
2016-01-01
Asian Americans (AS) have significantly lower incidence and mortality rates of breast cancer (BRCA) than Caucasian Americans (CA). While this racial disparity has been documented the underlying pathogenetic factors explaining it are obscure. We addressed this issue by an integrative genomics approach to compare mRNA expression between AS and CA cases of BRCA. RNA-seq data from the Cancer Genome Atlas showed that mRNA expression revealed significant differences at gene and pathway levels. Increased susceptibility and severity in CA patients were likely the result of synergistic environmental and genetic risk factors, with arachidonic acid metabolism and PPAR signaling pathways implicated in linking environmental and genetic factors. An analysis that also added eQTL data from the Genotype-Tissue Expression Project and single nucleotide polymorphism (SNP) data from the 1000 Genomes Project identified several SNPs associated with differentially expressed genes. Overall, the associations we identified may enable a more focused study of genotypic differences that may help explain the disparity in BRCA incidence and mortality rates in CA and AS populations and inform precision medicine. PMID:28069798
Small RNA sequencing in cells and exosomes identifies eQTLs and 14q32 as a region of active export
Tsang, Emily K.; Abell, Nathan S.; Li, Xin; ...
2016-10-31
Exosomes are small extracellular vesicles that carry heterogeneous cargo, including RNA, between cells. Increasing evidence suggests that exosomes are important mediators of intercellular communication and biomarkers of disease. Despite this, the variability of exosomal RNA between individuals has not been well quantified. To assess this variability, we sequenced the small RNA of cells and exosomes from a 17-member family. Across individuals, we show that selective export of miRNAs occurs not only at the level of specific transcripts, but that a cluster of 74 mature miRNAs on chromosome 14q32 is massively exported in exosomes while mostly absent from cells. We alsomore » observe more interindividual variability between exosomal samples than between cellular ones and identify four miRNA expression quantitative trait loci shared between cells and exosomes. Lastly, our findings indicate that genomically colocated miRNAs can be exported together and highlight the variability in exosomal miRNA levels between individuals as relevant for exosome use as diagnostics.« less
McKay, James D; Hung, Rayjean J; Han, Younghun; Zong, Xuchen; Carreras-Torres, Robert; Christiani, David C; Caporaso, Neil E; Johansson, Mattias; Xiao, Xiangjun; Li, Yafang; Byun, Jinyoung; Dunning, Alison; Pooley, Karen A; Qian, David C; Ji, Xuemei; Liu, Geoffrey; Timofeeva, Maria N; Bojesen, Stig E; Wu, Xifeng; Le Marchand, Loic; Albanes, Demetrios; Bickeböller, Heike; Aldrich, Melinda C; Bush, William S; Tardon, Adonina; Rennert, Gad; Teare, M Dawn; Field, John K; Kiemeney, Lambertus A; Lazarus, Philip; Haugen, Aage; Lam, Stephen; Schabath, Matthew B; Andrew, Angeline S; Shen, Hongbing; Hong, Yun-Chul; Yuan, Jian-Min; Bertazzi, Pier Alberto; Pesatori, Angela C; Ye, Yuanqing; Diao, Nancy; Su, Li; Zhang, Ruyang; Brhane, Yonathan; Leighl, Natasha; Johansen, Jakob S; Mellemgaard, Anders; Saliba, Walid; Haiman, Christopher A; Wilkens, Lynne R; Fernandez-Somoano, Ana; Fernandez-Tardon, Guillermo; van der Heijden, Henricus F M; Kim, Jin Hee; Dai, Juncheng; Hu, Zhibin; Davies, Michael P A; Marcus, Michael W; Brunnström, Hans; Manjer, Jonas; Melander, Olle; Muller, David C; Overvad, Kim; Trichopoulou, Antonia; Tumino, Rosario; Doherty, Jennifer A; Barnett, Matt P; Chen, Chu; Goodman, Gary E; Cox, Angela; Taylor, Fiona; Woll, Penella; Brüske, Irene; Wichmann, H-Erich; Manz, Judith; Muley, Thomas R; Risch, Angela; Rosenberger, Albert; Grankvist, Kjell; Johansson, Mikael; Shepherd, Frances A; Tsao, Ming-Sound; Arnold, Susanne M; Haura, Eric B; Bolca, Ciprian; Holcatova, Ivana; Janout, Vladimir; Kontic, Milica; Lissowska, Jolanta; Mukeria, Anush; Ognjanovic, Simona; Orlowski, Tadeusz M; Scelo, Ghislaine; Swiatkowska, Beata; Zaridze, David; Bakke, Per; Skaug, Vidar; Zienolddiny, Shanbeh; Duell, Eric J; Butler, Lesley M; Koh, Woon-Puay; Gao, Yu-Tang; Houlston, Richard S; McLaughlin, John; Stevens, Victoria L; Joubert, Philippe; Lamontagne, Maxime; Nickle, David C; Obeidat, Ma'en; Timens, Wim; Zhu, Bin; Song, Lei; Kachuri, Linda; Artigas, María Soler; Tobin, Martin D; Wain, Louise V; Rafnar, Thorunn; Thorgeirsson, Thorgeir E; Reginsson, Gunnar W; Stefansson, Kari; Hancock, Dana B; Bierut, Laura J; Spitz, Margaret R; Gaddis, Nathan C; Lutz, Sharon M; Gu, Fangyi; Johnson, Eric O; Kamal, Ahsan; Pikielny, Claudio; Zhu, Dakai; Lindströem, Sara; Jiang, Xia; Tyndale, Rachel F; Chenevix-Trench, Georgia; Beesley, Jonathan; Bossé, Yohan; Chanock, Stephen; Brennan, Paul; Landi, Maria Teresa; Amos, Christopher I
2017-07-01
Although several lung cancer susceptibility loci have been identified, much of the heritability for lung cancer remains unexplained. Here 14,803 cases and 12,262 controls of European descent were genotyped on the OncoArray and combined with existing data for an aggregated genome-wide association study (GWAS) analysis of lung cancer in 29,266 cases and 56,450 controls. We identified 18 susceptibility loci achieving genome-wide significance, including 10 new loci. The new loci highlight the striking heterogeneity in genetic susceptibility across the histological subtypes of lung cancer, with four loci associated with lung cancer overall and six loci associated with lung adenocarcinoma. Gene expression quantitative trait locus (eQTL) analysis in 1,425 normal lung tissue samples highlights RNASET2, SECISBP2L and NRG1 as candidate genes. Other loci include genes such as a cholinergic nicotinic receptor, CHRNA2, and the telomere-related genes OFBC1 and RTEL1. Further exploration of the target genes will continue to provide new insights into the etiology of lung cancer.
Fehringer, Gordon; Kraft, Peter; Pharoah, Paul D.; Eeles, Rosalind A.; Chatterjee, Nilanjan; Schumacher, Fred; Schildkraut, Joellen; Lindström, Sara; Brennan, Paul; Bickeböller, Heike; Houlston, Richard S.; Landi, Maria Teresa; Caporaso, Neil; Risch, Angela; Olama, Ali Amin Al; Berndt, Sonja I; Giovannucci, Edward; Grönberg, Henrik; Kote-Jarai, Zsofia; Ma, Jing; Muir, Kenneth; Stampfer, Meir; Stevens, Victoria L.; Wiklund, Fredrik; Willett, Walter; Goode, Ellen L.; Permuth, Jennifer; Risch, Harvey A.; Reid, Brett M.; Bezieau, Stephane; Brenner, Hermann; Chan, Andrew T.; Chang-Claude, Jenny; Hudson, Thomas J.; Kocarnik, Jonathan K.; Newcomb, Polly A.; Schoen, Robert E.; Slattery, Martha L.; White, Emily; Adank, Muriel A.; Ahsan, Habibul; Aittomäki, Kristiina; Baglietto, Laura; Blomquist, Carl; Canzian, Federico; Czene, Kamila; dos-Santos-Silva, Isabel; Eliassen, A. Heather; Figueroa, Jonine; Flesch-Janys, Dieter; Fletcher, Olivia; Garcia-Closas, Montserrat; Gaudet, Mia M.; Johnson, Nichola; Hall, Per; Hazra, Aditi; Hein, Rebecca; Hofman, Albert; Hopper, John L.; Irwanto, Astrid; Johansson, Mattias; Kaaks, Rudolf; Kibriya, Muhammad G.; Lichtner, Peter; Liu, Jianjun; Lund, Eiliv; Makalic, Enes; Meindl, Alfons; Müller-Myhsok, Bertram; Muranen, Taru A.; Nevanlinna, Heli; Peeters, Petra H.; Peto, Julian; Prentice, Ross L.; Rahman, Nazneen; Sanchez, Maria Jose; Schmidt, Daniel F.; Schmutzler, Rita K.; Southey, Melissa C.; Tamimi, Rulla; Travis, Ruth C.; Turnbull, Clare; Uitterlinden, Andre G.; Wang, Zhaoming; Whittemore, Alice S.; Yang, Xiaohong R.; Zheng, Wei; Rafnar, Thorunn; Gudmundsson, Julius; Stacey, Simon N.; Stefansson, Kari; Sulem, Patrick; Chen, Y. Ann; Tyrer, Jonathan P.; Christiani, David C.; Wei, Yongyue; Shen, Hongbing; Hu, Zhibin; Shu, Xiao-Ou; Shiraishi, Kouya; Takahashi, Atsushi; Bossé, Yohan; Obeidat, Ma’en; Nickle, David; Timens, Wim; Freedman, Matthew L.; Li, Qiyuan; Seminara, Daniela; Chanock, Stephen J.; Gong, Jian; Peters, Ulrike; Gruber, Stephen B.; Amos, Christopher I.; Sellers, Thomas A.; Easton, Douglas F.; Hunter, David J.; Haiman, Christopher A.; Henderson, Brian E.; Hung, Rayjean J.
2016-01-01
Identifying genetic variants with pleiotropic associations can uncover common pathways influencing multiple cancers. We took a two-staged approach to conduct genome-wide association studies for lung, ovary, breast, prostate and colorectal cancer from the GAME-ON/GECCO Network (61,851 cases, 61,820 controls) to identify pleiotropic loci. Findings were replicated in independent association studies (55,789 cases, 330,490 controls). We identified a novel pleiotropic association at 1q22 involving breast and lung squamous cell carcinoma, with eQTL analysis showing an association with ADAM15/THBS3 gene expression in lung. We also identified a known breast cancer locus CASP8/ALS2CR12 associated with prostate cancer, a known cancer locus at CDKN2B-AS1 with different variants associated with lung adenocarcinoma and prostate cancer and confirmed the associations of a breast BRCA2 locus with lung and serous ovarian cancer. This is the largest study to date examining pleiotropy across multiple cancer-associated loci, identifying common mechanisms of cancer development and progression. PMID:27197191
Jin, Ying; Andersen, Genevieve; Yorgov, Daniel; Ferrara, Tracey M; Ben, Songtao; Brownson, Kelly M; Holland, Paulene J; Birlea, Stanca A; Siebert, Janet; Hartmann, Anke; Lienert, Anne; van Geel, Nanja; Lambert, Jo; Luiten, Rosalie M; Wolkerstorfer, Albert; van der Veen, JP Wietze; Bennett, Dorothy C; Taïeb, Alain; Ezzedine, Khaled; Kemp, E Helen; Gawkrodger, David J; Weetman, Anthony P; Kõks, Sulev; Prans, Ele; Kingo, Külli; Karelson, Maire; Wallace, Margaret R; McCormack, Wayne T; Overbeck, Andreas; Moretti, Silvia; Colucci, Roberta; Picardo, Mauro; Silverberg, Nanette B; Olsson, Mats; Valle, Yan; Korobko, Igor; Böhm, Markus; Lim, Henry W.; Hamzavi, Iltefat; Zhou, Li; Mi, Qing-Sheng; Fain, Pamela R.; Santorico, Stephanie A; Spritz, Richard A
2016-01-01
Vitiligo is an autoimmune disease in which depigmented skin results from destruction of melanocytes1, with epidemiologic association with other autoimmune diseases2. In previous linkage and genome-wide association studies (GWAS1, GWAS2), we identified 27 vitiligo susceptibility loci in patients of European (EUR) ancestry. We carried out a third GWAS (GWAS3) in EUR subjects, with augmented GWAS1 and GWAS2 controls, genome-wide imputation, and meta-analysis of all three GWAS, followed by an independent replication. The combined analyses, with 4,680 cases and 39,586 controls, identified 23 new loci and 7 suggestive loci, most encoding immune and apoptotic regulators, some also associated with other autoimmune diseases, as well as several melanocyte regulators. Bioinformatic analyses indicate a predominance of causal regulatory variation, some corresponding to eQTL at these loci. Together, the identified genes provide a framework for vitiligo genetic architecture and pathobiology, highlight relationships to other autoimmune diseases and melanoma, and offer potential targets for treatment. PMID:27723757
Genetic loci associated with heart rate variability and their effects on cardiac disease risk
Nolte, Ilja M.; Munoz, M. Loretto; Tragante, Vinicius; Amare, Azmeraw T.; Jansen, Rick; Vaez, Ahmad; von der Heyde, Benedikt; Avery, Christy L.; Bis, Joshua C.; Dierckx, Bram; van Dongen, Jenny; Gogarten, Stephanie M.; Goyette, Philippe; Hernesniemi, Jussi; Huikari, Ville; Hwang, Shih-Jen; Jaju, Deepali; Kerr, Kathleen F.; Kluttig, Alexander; Krijthe, Bouwe P.; Kumar, Jitender; van der Laan, Sander W.; Lyytikäinen, Leo-Pekka; Maihofer, Adam X.; Minassian, Arpi; van der Most, Peter J.; Müller-Nurasyid, Martina; Nivard, Michel; Salvi, Erika; Stewart, James D.; Thayer, Julian F.; Verweij, Niek; Wong, Andrew; Zabaneh, Delilah; Zafarmand, Mohammad H.; Abdellaoui, Abdel; Albarwani, Sulayma; Albert, Christine; Alonso, Alvaro; Ashar, Foram; Auvinen, Juha; Axelsson, Tomas; Baker, Dewleen G.; de Bakker, Paul I. W.; Barcella, Matteo; Bayoumi, Riad; Bieringa, Rob J.; Boomsma, Dorret; Boucher, Gabrielle; Britton, Annie R.; Christophersen, Ingrid; Dietrich, Andrea; Ehret, George B.; Ellinor, Patrick T.; Eskola, Markku; Felix, Janine F.; Floras, John S.; Franco, Oscar H.; Friberg, Peter; Gademan, Maaike G. J.; Geyer, Mark A.; Giedraitis, Vilmantas; Hartman, Catharina A.; Hemerich, Daiane; Hofman, Albert; Hottenga, Jouke-Jan; Huikuri, Heikki; Hutri-Kähönen, Nina; Jouven, Xavier; Junttila, Juhani; Juonala, Markus; Kiviniemi, Antti M.; Kors, Jan A.; Kumari, Meena; Kuznetsova, Tatiana; Laurie, Cathy C.; Lefrandt, Joop D.; Li, Yong; Li, Yun; Liao, Duanping; Limacher, Marian C.; Lin, Henry J.; Lindgren, Cecilia M.; Lubitz, Steven A.; Mahajan, Anubha; McKnight, Barbara; zu Schwabedissen, Henriette Meyer; Milaneschi, Yuri; Mononen, Nina; Morris, Andrew P.; Nalls, Mike A.; Navis, Gerjan; Neijts, Melanie; Nikus, Kjell; North, Kari E.; O'Connor, Daniel T.; Ormel, Johan; Perz, Siegfried; Peters, Annette; Psaty, Bruce M.; Raitakari, Olli T.; Risbrough, Victoria B.; Sinner, Moritz F.; Siscovick, David; Smit, Johannes H.; Smith, Nicholas L.; Soliman, Elsayed Z.; Sotoodehnia, Nona; Staessen, Jan A.; Stein, Phyllis K.; Stilp, Adrienne M.; Stolarz-Skrzypek, Katarzyna; Strauch, Konstantin; Sundström, Johan; Swenne, Cees A.; Syvänen, Ann-Christine; Tardif, Jean-Claude; Taylor, Kent D.; Teumer, Alexander; Thornton, Timothy A.; Tinker, Lesley E.; Uitterlinden, André G.; van Setten, Jessica; Voss, Andreas; Waldenberger, Melanie; Wilhelmsen, Kirk C.; Willemsen, Gonneke; Wong, Quenna; Zhang, Zhu-Ming; Zonderman, Alan B.; Cusi, Daniele; Evans, Michele K.; Greiser, Halina K.; van der Harst, Pim; Hassan, Mohammad; Ingelsson, Erik; Järvelin, Marjo-Riitta; Kääb, Stefan; Kähönen, Mika; Kivimaki, Mika; Kooperberg, Charles; Kuh, Diana; Lehtimäki, Terho; Lind, Lars; Nievergelt, Caroline M.; O'Donnell, Chris J.; Oldehinkel, Albertine J.; Penninx, Brenda; Reiner, Alexander P.; Riese, Harriëtte; van Roon, Arie M.; Rioux, John D.; Rotter, Jerome I.; Sofer, Tamar; Stricker, Bruno H.; Tiemeier, Henning; Vrijkotte, Tanja G. M.; Asselbergs, Folkert W.; Brundel, Bianca J. J. M.; Heckbert, Susan R.; Whitsel, Eric A.; den Hoed, Marcel; Snieder, Harold; de Geus, Eco J. C.
2017-01-01
Reduced cardiac vagal control reflected in low heart rate variability (HRV) is associated with greater risks for cardiac morbidity and mortality. In two-stage meta-analyses of genome-wide association studies for three HRV traits in up to 53,174 individuals of European ancestry, we detect 17 genome-wide significant SNPs in eight loci. HRV SNPs tag non-synonymous SNPs (in NDUFA11 and KIAA1755), expression quantitative trait loci (eQTLs) (influencing GNG11, RGS6 and NEO1), or are located in genes preferentially expressed in the sinoatrial node (GNG11, RGS6 and HCN4). Genetic risk scores account for 0.9 to 2.6% of the HRV variance. Significant genetic correlation is found for HRV with heart rate (−0.74
Identifying gene networks underlying the neurobiology of ethanol and alcoholism.
Wolen, Aaron R; Miles, Michael F
2012-01-01
For complex disorders such as alcoholism, identifying the genes linked to these diseases and their specific roles is difficult. Traditional genetic approaches, such as genetic association studies (including genome-wide association studies) and analyses of quantitative trait loci (QTLs) in both humans and laboratory animals already have helped identify some candidate genes. However, because of technical obstacles, such as the small impact of any individual gene, these approaches only have limited effectiveness in identifying specific genes that contribute to complex diseases. The emerging field of systems biology, which allows for analyses of entire gene networks, may help researchers better elucidate the genetic basis of alcoholism, both in humans and in animal models. Such networks can be identified using approaches such as high-throughput molecular profiling (e.g., through microarray-based gene expression analyses) or strategies referred to as genetical genomics, such as the mapping of expression QTLs (eQTLs). Characterization of gene networks can shed light on the biological pathways underlying complex traits and provide the functional context for identifying those genes that contribute to disease development.
PDF receptor signaling in Drosophila contributes to both circadian and geotactic behaviors.
Mertens, Inge; Vandingenen, Anick; Johnson, Erik C; Shafer, Orie T; Li, W; Trigg, J S; De Loof, Arnold; Schoofs, Liliane; Taghert, Paul H
2005-10-20
The neuropeptide Pigment-Dispersing Factor (PDF) is a principle transmitter regulating circadian locomotor rhythms in Drosophila. We have identified a Class II (secretin-related) G protein-coupled receptor (GPCR) that is specifically responsive to PDF and also to calcitonin-like peptides and to PACAP. In response to PDF, the PDF receptor (PDFR) elevates cAMP levels when expressed in HEK293 cells. As predicted by in vivo studies, cotransfection of Neurofibromatosis Factor 1 significantly improves coupling of PDFR to adenylate cyclase. pdfr mutant flies display increased circadian arrhythmicity, and also display altered geotaxis that is epistatic to that of pdf mutants. PDFR immunosignals are expressed by diverse neurons, but only by a small subset of circadian pacemakers. These data establish the first synapse within the Drosophila circadian neural circuit and underscore the importance of Class II peptide GPCR signaling in circadian neural systems.
Dynamically correlated mutations drive human Influenza A evolution.
Tria, F; Pompei, S; Loreto, V
2013-01-01
Human Influenza A virus undergoes recurrent changes in the hemagglutinin (HA) surface protein, primarily involved in the human antibody recognition. Relevant antigenic changes, enabling the virus to evade host immune response, have been recognized to occur in parallel to multiple mutations at antigenic sites in HA. Yet, the role of correlated mutations (epistasis) in driving the molecular evolution of the virus still represents a challenging puzzle. Further, though circulation at a global geographic level is key for the survival of Influenza A, its role in shaping the viral phylodynamics remains largely unexplored. Here we show, through a sequence based epidemiological model, that epistatic effects between amino acids substitutions, coupled with a reservoir that mimics worldwide circulating viruses, are key determinants that drive human Influenza A evolution. Our approach explains all the up-to-date observations characterizing the evolution of H3N2 subtype, including phylogenetic properties, nucleotide fixation patterns, and composition of antigenic clusters.
Rice-arsenate interactions in hydroponics: a three-gene model for tolerance.
Norton, Gareth J; Nigar, Meher; Williams, Paul N; Dasgupta, Tapash; Meharg, Andrew A; Price, Adam H
2008-01-01
In this study, the genetic mapping of the tolerance of root growth to 13.3 muM arsenate [As(V)] using the BalaxAzucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice.
Rice–arsenate interactions in hydroponics: a three-gene model for tolerance
Norton, Gareth J.; Nigar, Meher; Dasgupta, Tapash; Meharg, Andrew A.; Price, Adam H.
2008-01-01
In this study, the genetic mapping of the tolerance of root growth to 13.3 μM arsenate [As(V)] using the Bala×Azucena population is improved, and candidate genes for further study are identified. A remarkable three-gene model of tolerance is advanced, which appears to involve epistatic interaction between three major genes, two on chromosome 6 and one on chromosome 10. Any combination of two of these genes inherited from the tolerant parent leads to the plant having tolerance. Lists of potential positional candidate genes are presented. These are then refined using whole genome transcriptomics data and bioinformatics. Physiological evidence is also provided that genes related to phosphate transport are unlikely to be behind the genetic loci conferring tolerance. These results offer testable hypotheses for genes related to As(V) tolerance that might offer strategies for mitigating arsenic (As) accumulation in consumed rice. PMID:18453529
High-resolution mapping reveals hundreds of genetic incompatibilities in hybridizing fish species.
Schumer, Molly; Cui, Rongfeng; Powell, Daniel L; Dresner, Rebecca; Rosenthal, Gil G; Andolfatto, Peter
2014-06-04
Hybridization is increasingly being recognized as a common process in both animal and plant species. Negative epistatic interactions between genes from different parental genomes decrease the fitness of hybrids and can limit gene flow between species. However, little is known about the number and genome-wide distribution of genetic incompatibilities separating species. To detect interacting genes, we perform a high-resolution genome scan for linkage disequilibrium between unlinked genomic regions in naturally occurring hybrid populations of swordtail fish. We estimate that hundreds of pairs of genomic regions contribute to reproductive isolation between these species, despite them being recently diverged. Many of these incompatibilities are likely the result of natural or sexual selection on hybrids, since intrinsic isolation is known to be weak. Patterns of genomic divergence at these regions imply that genetic incompatibilities play a significant role in limiting gene flow even in young species.
Deep epistasis in human metabolism
NASA Astrophysics Data System (ADS)
Imielinski, Marcin; Belta, Calin
2010-06-01
We extend and apply a method that we have developed for deriving high-order epistatic relationships in large biochemical networks to a published genome-scale model of human metabolism. In our analysis we compute 33 328 reaction sets whose knockout synergistically disables one or more of 43 important metabolic functions. We also design minimal knockouts that remove flux through fumarase, an enzyme that has previously been shown to play an important role in human cancer. Most of these knockout sets employ more than eight mutually buffering reactions, spanning multiple cellular compartments and metabolic subsystems. These reaction sets suggest that human metabolic pathways possess a striking degree of parallelism, inducing "deep" epistasis between diversely annotated genes. Our results prompt specific chemical and genetic perturbation follow-up experiments that could be used to query in vivo pathway redundancy. They also suggest directions for future statistical studies of epistasis in genetic variation data sets.
Hippo Signaling Suppresses Cell Ploidy and Tumorigenesis through Skp2.
Zhang, Shihao; Chen, Qinghua; Liu, Qingxu; Li, Yuxi; Sun, Xiufeng; Hong, Lixin; Ji, Suyuan; Liu, Chengyan; Geng, Jing; Zhang, Weiji; Lu, Zhonglei; Yin, Zhen-Yu; Zeng, Yuanyuan; Lin, Kwang-Huei; Wu, Qiao; Li, Qiyuan; Nakayama, Keiko; Nakayama, Keiich I; Deng, Xianming; Johnson, Randy L; Zhu, Liang; Gao, Daming; Chen, Lanfen; Zhou, Dawang
2017-05-08
Polyploidy can lead to aneuploidy and tumorigenesis. Here, we report that the Hippo pathway effector Yap promotes the diploid-polyploid conversion and polyploid cell growth through the Akt-Skp2 axis. Yap strongly induces the acetyltransferase p300-mediated acetylation of the E3 ligase Skp2 via Akt signaling. Acetylated Skp2 is exclusively localized to the cytosol, which causes hyper-accumulation of the cyclin-dependent kinase inhibitor p27, leading to mitotic arrest and subsequently cell polyploidy. In addition, the pro-apoptotic factors FoxO1/3 are overly degraded by acetylated Skp2, resulting in polyploid cell division, genomic instability, and oncogenesis. Importantly, the depletion or inactivation of Akt or Skp2 abrogated Hippo signal deficiency-induced liver tumorigenesis, indicating their epistatic interaction. Thus, we conclude that Hippo-Yap signaling suppresses cell polyploidy and oncogenesis through Skp2. Copyright © 2017 Elsevier Inc. All rights reserved.
Genomic investigations of evolutionary dynamics and epistasis in microbial evolution experiments.
Jerison, Elizabeth R; Desai, Michael M
2015-12-01
Microbial evolution experiments enable us to watch adaptation in real time, and to quantify the repeatability and predictability of evolution by comparing identical replicate populations. Further, we can resurrect ancestral types to examine changes over evolutionary time. Until recently, experimental evolution has been limited to measuring phenotypic changes, or to tracking a few genetic markers over time. However, recent advances in sequencing technology now make it possible to extensively sequence clones or whole-population samples from microbial evolution experiments. Here, we review recent work exploiting these techniques to understand the genomic basis of evolutionary change in experimental systems. We first focus on studies that analyze the dynamics of genome evolution in microbial systems. We then survey work that uses observations of sequence evolution to infer aspects of the underlying fitness landscape, concentrating on the epistatic interactions between mutations and the constraints these interactions impose on adaptation. Copyright © 2015 Elsevier Ltd. All rights reserved.
Pietra, Stefano; Gustavsson, Anna; Kiefer, Christian; Kalmbach, Lothar; Hörstedt, Per; Ikeda, Yoshihisa; Stepanova, Anna N; Alonso, Jose M; Grebe, Markus
2013-01-01
The orientation of cell division and the coordination of cell polarity within the plane of the tissue layer (planar polarity) contribute to shape diverse multicellular organisms. The root of Arabidopsis thaliana displays regularly oriented cell divisions, cell elongation and planar polarity providing a plant model system to study these processes. Here we report that the SABRE protein, which shares similarity with proteins of unknown function throughout eukaryotes, has important roles in orienting cell division and planar polarity. SABRE localizes at the plasma membrane, endomembranes, mitotic spindle and cell plate. SABRE stabilizes the orientation of CLASP-labelled preprophase band microtubules predicting the cell division plane, and of cortical microtubules driving cell elongation. During planar polarity establishment, sabre is epistatic to clasp at directing polar membrane domains of Rho-of-plant GTPases. Our findings mechanistically link SABRE to CLASP-dependent microtubule organization, shedding new light on the function of SABRE-related proteins in eukaryotes.
He, Kuang; Marden, Jeremiah N.; Quardokus, Ellen M.; Bauer, Carl E.
2013-01-01
Genomic and genetic analyses have demonstrated that many species contain multiple chemotaxis-like signal transduction cascades that likely control processes other than chemotaxis. The Che3 signal transduction cascade from Rhodospirillum centenum is one such example that regulates development of dormant cysts. This Che-like cascade contains two hybrid response regulator-histidine kinases, CheA3 and CheS3, and a single-domain response regulator CheY3. We demonstrate that cheS3 is epistatic to cheA3 and that only CheS3∼P can phosphorylate CheY3. We further show that CheA3 derepresses cyst formation by phosphorylating a CheS3 receiver domain. These results demonstrate that the flow of phosphate as defined by the paradigm E. coli chemotaxis cascade does not necessarily hold true for non-chemotactic Che-like signal transduction cascades. PMID:24367276
Associations among multiple markers and complex disease: models, algorithms, and applications.
Assimes, Themistocles L; Olshen, Adam B; Narasimhan, Balasubramanian; Olshen, Richard A
2008-01-01
This chapter is a report on collaborations among its authors and others over many years. It devolves from our goal of understanding genes, their main and epistatic effects combined with interactions involving demographic and environmental features also, as together they predict genetically complex diseases. Thus, our goal is "association." Particular phenotypes of interest to us are hypertension, insulin resistance, angina, and myocardial infarction. Prediction of complex disease is notoriously difficult, though it would be made easier were we given strand-specific information on genotype. Unfortunately, with current technology, genotypic information comes to us "unphased." While obviously we have strand-specific information when genotype is homozygous, we do not have such information when genotype is heterozygous. To summarize, the ultimate goals of approaches we provide is to predict phenotype, typically untoward or not, within a specific window of time. Our approach is neither through linkage nor from finding haplotype frequencies per se.
Inferring genetic interactions from comparative fitness data
2017-01-01
Darwinian fitness is a central concept in evolutionary biology. In practice, however, it is hardly possible to measure fitness for all genotypes in a natural population. Here, we present quantitative tools to make inferences about epistatic gene interactions when the fitness landscape is only incompletely determined due to imprecise measurements or missing observations. We demonstrate that genetic interactions can often be inferred from fitness rank orders, where all genotypes are ordered according to fitness, and even from partial fitness orders. We provide a complete characterization of rank orders that imply higher order epistasis. Our theory applies to all common types of gene interactions and facilitates comprehensive investigations of diverse genetic interactions. We analyzed various genetic systems comprising HIV-1, the malaria-causing parasite Plasmodium vivax, the fungus Aspergillus niger, and the TEM-family of β-lactamase associated with antibiotic resistance. For all systems, our approach revealed higher order interactions among mutations. PMID:29260711
Inferring genetic interactions from comparative fitness data.
Crona, Kristina; Gavryushkin, Alex; Greene, Devin; Beerenwinkel, Niko
2017-12-20
Darwinian fitness is a central concept in evolutionary biology. In practice, however, it is hardly possible to measure fitness for all genotypes in a natural population. Here, we present quantitative tools to make inferences about epistatic gene interactions when the fitness landscape is only incompletely determined due to imprecise measurements or missing observations. We demonstrate that genetic interactions can often be inferred from fitness rank orders, where all genotypes are ordered according to fitness, and even from partial fitness orders. We provide a complete characterization of rank orders that imply higher order epistasis. Our theory applies to all common types of gene interactions and facilitates comprehensive investigations of diverse genetic interactions. We analyzed various genetic systems comprising HIV-1, the malaria-causing parasite Plasmodium vivax , the fungus Aspergillus niger , and the TEM-family of β-lactamase associated with antibiotic resistance. For all systems, our approach revealed higher order interactions among mutations.
Inheritance of tristyly in Oxalis tuberosa (Oxalidaceae).
Trognitz, B R; Hermann, M
2001-05-01
Frequencies of floral morphs in progenies obtained from a complete set of diallelic crosses among three accessions of tristylous, octoploid oca (Oxalis tuberosa) were used for a Mendelian analysis of floral morph inheritance. The frequencies observed had the best fit to a model of tetrasomic inheritance with two diallelic factors, S, s and M, m, with S being epistatic over M. No explanation could be found for the unexpected formation of a small percentage of short-styled individuals in crosses between the mid-styled and the long-styled parent. For the acceptance of models of disomic and octosomic inheritance several additional assumptions would have to be made and therefore these modes of inheritance are less likely. Dosage-dependent inheritance of floral morph was rejected. Only a small frequency (36%) of the cross progenies flowered, in contrast to the greater propensity for flowering of O. tuberosa accessions held at gene banks.
Neuroglian activates Echinoid to antagonize the Drosophila EGF receptor signaling pathway.
Islam, Rafique; Wei, Shu-Yi; Chiu, Wei-Hsin; Hortsch, Michael; Hsu, Jui-Chou
2003-05-01
echinoid (ed) encodes an cell-adhesion molecule (CAM) that contains immunoglobulin domains and regulates the EGFR signaling pathway during Drosophila eye development. Based on our previous genetic mosaic and epistatic analysis, we proposed that Ed, via homotypic interactions, activates a novel, as yet unknown pathway that antagonizes EGFR signaling. In this report, we demonstrate that Ed functions as a homophilic adhesion molecule and also engages in a heterophilic trans-interaction with Drosophila Neuroglian (Nrg), an L1-type CAM. Co-expression of ed and nrg in the eye exhibits a strong genetic synergy in inhibiting EGFR signaling. This synergistic effect requires the intracellular domain of Ed, but not that of Nrg. In addition, Ed and Nrg colocalize in the Drosophila eye and are efficiently co-immunoprecipitated. Together, our results suggest a model in which Nrg acts as a heterophilic ligand and activator of Ed, which in turn antagonizes EGFR signaling.
Song, Ci; Nutile, Teresa; Vernon Smith, Albert; Concas, Maria Pina; Traglia, Michela; Barbieri, Caterina; Ndiaye, Ndeye Coumba; Stathopoulou, Maria G.; Lagou, Vasiliki; Maestrale, Giovanni Battista; Sala, Cinzia; Debette, Stephanie; Kovacs, Peter; Lind, Lars; Lamont, John; Fitzgerald, Peter; Tönjes, Anke; Gudnason, Vilmundur; Toniolo, Daniela; Pirastu, Mario; Bellenguez, Celine; Vasan, Ramachandran S.; Ingelsson, Erik; Leutenegger, Anne-Louise; Johnson, Andrew D.; DeStefano, Anita L.; Visvikis-Siest, Sophie; Seshadri, Sudha; Ciullo, Marina
2016-01-01
Vascular endothelial growth factor (VEGF) is an angiogenic and neurotrophic factor, secreted by endothelial cells, known to impact various physiological and disease processes from cancer to cardiovascular disease and to be pharmacologically modifiable. We sought to identify novel loci associated with circulating VEGF levels through a genome-wide association meta-analysis combining data from European-ancestry individuals and using a dense variant map from 1000 genomes imputation panel. Six discovery cohorts including 13,312 samples were analyzed, followed by in-silico and de-novo replication studies including an additional 2,800 individuals. A total of 10 genome-wide significant variants were identified at 7 loci. Four were novel loci (5q14.3, 10q21.3, 16q24.2 and 18q22.3) and the leading variants at these loci were rs114694170 (MEF2C, P = 6.79x10-13), rs74506613 (JMJD1C, P = 1.17x10-19), rs4782371 (ZFPM1, P = 1.59x10-9) and rs2639990 (ZADH2, P = 1.72x10-8), respectively. We also identified two new independent variants (rs34528081, VEGFA, P = 1.52x10-18; rs7043199, VLDLR-AS1, P = 5.12x10-14) at the 3 previously identified loci and strengthened the evidence for the four previously identified SNPs (rs6921438, LOC100132354, P = 7.39x10-1467; rs1740073, C6orf223, P = 2.34x10-17; rs6993770, ZFPM2, P = 2.44x10-60; rs2375981, KCNV2, P = 1.48x10-100). These variants collectively explained up to 52% of the VEGF phenotypic variance. We explored biological links between genes in the associated loci using Ingenuity Pathway Analysis that emphasized their roles in embryonic development and function. Gene set enrichment analysis identified the ERK5 pathway as enriched in genes containing VEGF associated variants. eQTL analysis showed, in three of the identified regions, variants acting as both cis and trans eQTLs for multiple genes. Most of these genes, as well as some of those in the associated loci, were involved in platelet biogenesis and functionality, suggesting the importance of this process in regulation of VEGF levels. This work also provided new insights into the involvement of genes implicated in various angiogenesis related pathologies in determining circulating VEGF levels. The understanding of the molecular mechanisms by which the identified genes affect circulating VEGF levels could be important in the development of novel VEGF-related therapies for such diseases. PMID:26910538