Science.gov

Sample records for genome-wide breeding values

  1. Why breeding values estimated using familial data should not be used for genome-wide association studies.

    PubMed

    Ekine, Chinyere C; Rowe, Suzanne J; Bishop, Stephen C; de Koning, Dirk-Jan

    2014-02-19

    In animal breeding, the genetic potential of an animal is summarized as its estimated breeding value, which is derived from its own performance as well as the performance of related individuals. Here, we illustrate why estimated breeding values are not suitable as a phenotype for genome-wide association studies. We simulated human-type and pig-type pedigrees with a range of quantitative trait loci (QTL) effects (0.5-3% of phenotypic variance) and heritabilities (0.3-0.8). We analyzed 1000 replicates of each scenario with four models: (a) a full mixed model including a polygenic effect, (b) a regression analysis using the residual of a mixed model as a trait score (the so-called GRAMMAR approach), (c) a regression analysis using the estimated breeding value as a trait score, and (d) a regression analysis that uses the raw phenotype as a trait score. We show that using breeding values as a trait score gives very high false-positive rates (up to 14% in human pedigrees and >60% in pig pedigrees). Simulations based on a real pedigree show that additional generations of pedigree increase the type I error. Including the family relationship as a random effect provides the greatest power to detect QTL while controlling for type I error at the desired level and providing the most accurate estimates of the QTL effect. Both the use of residuals and the use of breeding values result in deflated estimates of the QTL effect. We derive the contributions of QTL effects to the breeding value and residual and show how this affects the estimates.

  2. Genome Wide Screening of Candidate Genes for Improving Piglet Birth Weight Using High and Low Estimated Breeding Value Populations

    PubMed Central

    Zhang, Lifan; Zhou, Xiang; Michal, Jennifer J.; Ding, Bo; Li, Rui; Jiang, Zhihua

    2014-01-01

    Birth weight is an economically important trait in pig production because it directly impacts piglet growth and survival rate. In the present study, we performed a genome wide survey of candidate genes and pathways associated with individual birth weight (IBW) using the Illumina PorcineSNP60 BeadChip on 24 high (HEBV) and 24 low estimated breeding value (LEBV) animals. These animals were selected from a reference population of 522 individuals produced by three sires and six dam lines, which were crossbreds with multiple breeds. After quality-control, 43,257 SNPs (single nucleotide polymorphisms), including 42,243 autosomal SNPs and 1,014 SNPs on chromosome X, were used in the data analysis. A total of 27 differentially selected regions (DSRs), including 1 on Sus scrofa chromosome 1 (SSC1), 1 on SSC4, 2 on SSC5, 4 on SSC6, 2 on SSC7, 5 on SSC8, 3 on SSC9, 1 on SSC14, 3 on SSC18, and 5 on SSCX, were identified to show the genome wide separations between the HEBV and LEBV groups for IBW in piglets. A DSR with the most number of significant SNPs (including 7 top 0.1% and 31 top 5% SNPs) was located on SSC6, while another DSR with the largest genetic differences in FST was found on SSC18. These regions harbor known functionally important genes involved in growth and development, such as TNFRSF9 (tumor necrosis factor receptor superfamily member 9), CA6 (carbonic anhydrase VI) and MDFIC (MyoD family inhibitor domain containing). A DSR rich in imprinting genes appeared on SSC9, which included PEG10 (paternally expressed 10), SGCE (sarcoglycan, epsilon), PPP1R9A (protein phosphatase 1, regulatory subunit 9A) and ASB4 (ankyrin repeat and SOCS box containing 4). More importantly, our present study provided evidence to support six quantitative trait loci (QTL) regions for pig birth weight, six QTL regions for average birth weight (ABW) and three QTL regions for litter birth weight (LBW) reported previously by other groups. Furthermore, gene ontology analysis with 183 genes

  3. Genome wide screening of candidate genes for improving piglet birth weight using high and low estimated breeding value populations.

    PubMed

    Zhang, Lifan; Zhou, Xiang; Michal, Jennifer J; Ding, Bo; Li, Rui; Jiang, Zhihua

    2014-01-01

    Birth weight is an economically important trait in pig production because it directly impacts piglet growth and survival rate. In the present study, we performed a genome wide survey of candidate genes and pathways associated with individual birth weight (IBW) using the Illumina PorcineSNP60 BeadChip on 24 high (HEBV) and 24 low estimated breeding value (LEBV) animals. These animals were selected from a reference population of 522 individuals produced by three sires and six dam lines, which were crossbreds with multiple breeds. After quality-control, 43,257 SNPs (single nucleotide polymorphisms), including 42,243 autosomal SNPs and 1,014 SNPs on chromosome X, were used in the data analysis. A total of 27 differentially selected regions (DSRs), including 1 on Sus scrofa chromosome 1 (SSC1), 1 on SSC4, 2 on SSC5, 4 on SSC6, 2 on SSC7, 5 on SSC8, 3 on SSC9, 1 on SSC14, 3 on SSC18, and 5 on SSCX, were identified to show the genome wide separations between the HEBV and LEBV groups for IBW in piglets. A DSR with the most number of significant SNPs (including 7 top 0.1% and 31 top 5% SNPs) was located on SSC6, while another DSR with the largest genetic differences in FST was found on SSC18. These regions harbor known functionally important genes involved in growth and development, such as TNFRSF9 (tumor necrosis factor receptor superfamily member 9), CA6 (carbonic anhydrase VI) and MDFIC (MyoD family inhibitor domain containing). A DSR rich in imprinting genes appeared on SSC9, which included PEG10 (paternally expressed 10), SGCE (sarcoglycan, epsilon), PPP1R9A (protein phosphatase 1, regulatory subunit 9A) and ASB4 (ankyrin repeat and SOCS box containing 4). More importantly, our present study provided evidence to support six quantitative trait loci (QTL) regions for pig birth weight, six QTL regions for average birth weight (ABW) and three QTL regions for litter birth weight (LBW) reported previously by other groups. Furthermore, gene ontology analysis with 183 genes

  4. Genome wide selection in Citrus breeding.

    PubMed

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  5. Genome-wide Association Study (GWAS) and Its Application for Improving the Genomic Estimated Breeding Values (GEBV) of the Berkshire Pork Quality Traits.

    PubMed

    Lee, Young-Sup; Jeong, Hyeonsoo; Taye, Mengistie; Kim, Hyeon Jeong; Ka, Sojeong; Ryu, Youn-Chul; Cho, Seoae

    2015-11-01

    Missing heritability has been a major problem in analyses based on best linear unbiased prediction (BLUP). We introduced a traditional genome-wide association study (GWAS) into BLUP to improve heritability estimation. We analyzed eight pork quality traits of the Berkshire breed using GWAS and BLUP. GWAS detects putative quantitative trait loci regions for the given traits. Single nucleotide polymorphisms (SNPs) were selected from the GWAS results at a p value <0.01. BLUP using the significant SNPs was much more accurate than BLUP using all genotyped SNPs in terms of narrow-sense heritability. This implies that genomic estimated breeding values (GEBVs) of pork quality traits can be calculated by BLUP via GWAS. The GWAS model was linear regression in PLINK, and the BLUP models were G-BLUP and SNP-GBLUP; SNP-GBLUP uses a SNP-SNP relationship matrix. BLUP analysis with GWAS preprocessing may be one possible way of addressing the missing heritability problem, and it can provide an alternative BLUP method that yields more accurate GEBVs.

  6. Genome-wide association and genomic selection in animal breeding.

    PubMed

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, have led to the conclusion that the effects of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL are likely to be small. This has led to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effects of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have led to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.
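
    The statement above that a genomic breeding value is the sum of SNP effects across the genome can be sketched in a few lines. This is only an illustration, not the authors' implementation: the function names, the 0/1/2 allele-dosage coding, and the particular relationship-matrix formula (centered dosages scaled by twice the sum of p(1-p), a commonly used construction) are assumptions, since the abstract does not specify them.

```python
from typing import List


def genomic_breeding_values(genotypes: List[List[int]],
                            snp_effects: List[float]) -> List[float]:
    """GEBV per animal: sum over SNPs of (allele dosage x estimated SNP effect).

    genotypes[i][j] is the dosage (0, 1 or 2) of animal i at SNP j (assumed coding).
    """
    return [sum(d * e for d, e in zip(row, snp_effects)) for row in genotypes]


def genomic_relationship_matrix(genotypes: List[List[int]]) -> List[List[float]]:
    """One common genomic relationship matrix: G = Z Z' / (2 * sum p_j (1 - p_j)).

    Z holds dosages centered by twice the allele frequency. This formula is an
    assumption for illustration; the abstract does not state which one is used.
    """
    n_animals = len(genotypes)
    n_snps = len(genotypes[0])
    # Allele frequency of each SNP, estimated from the sample itself.
    freqs = [sum(row[j] for row in genotypes) / (2.0 * n_animals)
             for j in range(n_snps)]
    centered = [[row[j] - 2.0 * freqs[j] for j in range(n_snps)]
                for row in genotypes]
    scale = 2.0 * sum(p * (1.0 - p) for p in freqs)
    if scale == 0.0:
        raise ValueError("all SNPs are monomorphic; G is undefined")
    return [[sum(a * b for a, b in zip(zi, zk)) / scale for zk in centered]
            for zi in centered]
```

    With two animals genotyped as [0, 1, 2] and [2, 1, 0] and hypothetical SNP effects [0.5, -0.2, 0.1], the first function returns one GEBV per animal, and the second returns a 2 x 2 matrix in which the two animals have a negative genomic relationship.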

  7. Genome-wide association and genomic prediction of breeding values for fatty acid composition in subcutaneous adipose and longissimus lumborum muscle of beef cattle.

    PubMed

    Chen, Liuhong; Ekine-Dzivenu, Chinyere; Vinsky, Michael; Basarab, John; Aalhus, Jennifer; Dugan, Mike E R; Fitzsimmons, Carolyn; Stothard, Paul; Li, Changxi

    2015-11-21

    Identification of genetic variants that are associated with fatty acid composition in beef will enhance our understanding of host genetic influence on the trait and also allow for more effective improvement of beef fatty acid profiles through genomic selection and marker-assisted diet management. In this study, 81 and 83 fatty acid traits were measured in subcutaneous adipose (SQ) and longissimus lumborum muscle (LL), respectively, from 1366 purebred and crossbred beef steers and heifers that were genotyped on the Illumina BovineSNP50 Beadchip. The objective was to conduct genome-wide association studies (GWAS) for the fatty acid traits and to evaluate the accuracy of genomic prediction for fatty acid composition using genomic best linear unbiased prediction (GBLUP) and Bayesian methods. In total, 302 and 360 significant SNPs spanning all autosomal chromosomes were identified to be associated with fatty acid composition in SQ and LL tissues, respectively. Proportions of total genetic variance explained by individual significant SNPs ranged from 0.03 to 11.06% in SQ, and from 0.005 to 24.28% in the LL muscle. Markers with relatively large effects were located near fatty acid synthase (FASN), stearoyl-CoA desaturase (SCD), and thyroid hormone responsive (THRSP) genes. For the majority of the fatty acid traits studied, the accuracy of genomic prediction was relatively low (<0.40). Relatively high accuracies (≥0.50) were achieved for 10:0, 12:0, 14:0, 15:0, 16:0, 9c-14:1, 12c-16:1, 13c-18:1, and health index (HI) in LL, and for 12:0, 14:0, 15:0, 10t,12c-18:2, and 11t,13c + 11c,13t-18:2 in SQ. The Bayesian method performed similarly to GBLUP for most of the traits but substantially better for traits that were affected by SNPs of large effects as identified by GWAS. Fatty acid composition in beef is influenced by a few host genes with major effects and many genes of smaller effects. With the current training population size and marker density, genomic

  8. Genome-Wide Specific Selection in Three Domestic Sheep Breeds.

    PubMed

    Wang, Huihua; Zhang, Li; Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found that these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found that two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness), were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Our results indicated that combining locus-specific branch lengths and di statistical approaches can reduce the search range for specific selection. We also obtained many credible candidate genes, which not only confirm the results of previous reports but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding.

  9. Genome-Wide Specific Selection in Three Domestic Sheep Breeds

    PubMed Central

    Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Background: Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. Results: We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found that these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found that two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness), were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Conclusions: Our results indicated that combining locus-specific branch lengths and di statistical approaches can reduce the search range for specific selection. We also obtained many credible candidate genes, which not only confirm the results of previous reports but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding. PMID:26083354

  10. Adjusted P values for genome-wide scans.

    PubMed Central

    Lystig, Theodore C

    2003-01-01

    Genome-wide scans for quantitative trait loci (QTL) have traditionally been summarized with plots of logarithm of odds (LOD) scores. A valuable modification is to supplement such plots with an additional vertical axis displaying quantiles of adjusted P values and labeling local maxima of the LOD scores with location-specific adjusted P values. This provides a visible gradation of genome-wide significance for the LOD score curve, instead of the stark dichotomy that a single threshold yields. Adjusted P values give genome-wide significance of individual LOD scores and are obtained through a straightforward modification of the familiar algorithm for generating permutation-based thresholds. PMID:12930772
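
    The adjusted P values described above build on the familiar permutation-threshold algorithm: each permutation contributes its genome-wide maximum score, and a location's adjusted P value is the share of those maxima that reach the observed score there. The sketch below illustrates that idea under stated assumptions and is not the author's code. The `scan` callable (a stand-in for any routine that returns one LOD score per location) and both function names are hypothetical.

```python
import random
from typing import Callable, List, Sequence


def permutation_maxima(phenotypes: Sequence[float],
                       scan: Callable[[List[float]], List[float]],
                       n_perm: int,
                       seed: int = 0) -> List[float]:
    """Genome-wide maximum score from each of n_perm phenotype permutations.

    `scan` is a hypothetical user-supplied function: given a phenotype vector
    it returns one score (e.g. a LOD score) per genome location.
    """
    rng = random.Random(seed)
    maxima = []
    for _ in range(n_perm):
        shuffled = list(phenotypes)
        rng.shuffle(shuffled)  # break the genotype-phenotype link
        maxima.append(max(scan(shuffled)))
    return maxima


def adjusted_p_values(observed_scores: Sequence[float],
                      perm_maxima: Sequence[float]) -> List[float]:
    """Location-specific adjusted P values.

    For each observed score, the fraction of permutation maxima that are at
    least as large. A single genome-wide threshold would use only one quantile
    of perm_maxima; this uses the whole distribution.
    """
    n = len(perm_maxima)
    return [sum(1 for m in perm_maxima if m >= score) / n
            for score in observed_scores]
```

    For example, observed scores of 3.0 and 1.0 against permutation maxima [0.5, 2.0, 3.5, 1.5] give adjusted P values of 0.25 and 0.75, which could then label the two local maxima on a LOD plot.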

  11. Genome wide linkage disequilibrium and genetic structure in Sicilian dairy sheep breeds.

    PubMed

    Mastrangelo, Salvatore; Di Gerlando, Rosalia; Tolone, Marco; Tortorici, Lina; Sardina, Maria Teresa; Portolano, Baldassare

    2014-10-10

    The recent availability of sheep genome-wide SNP panels allows providing background information concerning genome structure in domestic animals. The aim of this work was to investigate the patterns of linkage disequilibrium (LD), the genetic diversity and population structure in Valle del Belice, Comisana, and Pinzirita dairy sheep breeds using the Illumina Ovine SNP50K Genotyping array. Average r² between adjacent SNPs across all chromosomes was 0.155 ± 0.204 for Valle del Belice, 0.156 ± 0.208 for Comisana, and 0.128 ± 0.188 for Pinzirita breeds, and some variations in LD value across chromosomes were observed, in particular for Valle del Belice and Comisana breeds. Average values of r² estimated for all pairwise combinations of SNPs pooled over all autosomes were 0.058 ± 0.023 for Valle del Belice, 0.056 ± 0.021 for Comisana, and 0.037 ± 0.017 for Pinzirita breeds. The LD declined as a function of distance and average r² was lower than the values observed in other sheep breeds. Consistency of results among the several used approaches (Principal component analysis, Bayesian clustering, FST, Neighbor networks) showed that while Valle del Belice and Pinzirita breeds formed a unique cluster, Comisana breed showed the presence of substructure. In Valle del Belice breed, the high level of genetic differentiation within breed, the heterogeneous cluster in Admixture analysis, but at the same time the highest inbreeding coefficient, suggested that the breed had a wide genetic base with inbred individuals belonging to the same flock. The Sicilian breeds were characterized by low genetic differentiation and high level of admixture. Pinzirita breed displayed the highest genetic diversity (He, Ne) whereas the lowest value was found in Valle del Belice breed. This study has reported for the first time estimates of LD and genetic diversity from a genome-wide perspective in Sicilian dairy sheep breeds. Our results indicate that breeds formed non
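
    For readers unfamiliar with the LD statistic reported above, the standard definition of r² between two biallelic loci can be written as a short function. This is a minimal sketch from haplotype and allele frequencies; the abstract does not say how the authors estimated r² from array genotypes, so the function name and its inputs are illustrative assumptions.

```python
def ld_r_squared(p_ab: float, p_a: float, p_b: float) -> float:
    """Standard r^2 between two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A at locus 1 and B at locus 2
    p_a, p_b: frequencies of allele A and allele B
    D = p_ab - p_a * p_b, and r^2 = D^2 / (p_a (1 - p_a) p_b (1 - p_b)).
    """
    d = p_ab - p_a * p_b
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        raise ValueError("both loci must be polymorphic")
    return d * d / denom
```

    Two loci with allele frequencies of 0.5 give r² = 1 when the A-B haplotype frequency is also 0.5 (complete LD) and r² = 0 when it is 0.25 (independence).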

  12. The genome-wide structure of two economically important indigenous Sicilian cattle breeds.

    PubMed

    Mastrangelo, S; Saura, M; Tolone, M; Salces-Ortiz, J; Di Gerlando, R; Bertolini, F; Fontanesi, L; Sardina, M T; Serrano, M; Portolano, B

    2014-11-01

    Genomic technologies, such as high-throughput genotyping based on SNP arrays, provided background information concerning genome structure in domestic animals. The aim of this work was to investigate the genetic structure, the genome-wide estimates of inbreeding, coancestry, effective population size (Ne), and the patterns of linkage disequilibrium (LD) in 2 economically important Sicilian local cattle breeds, Cinisara (CIN) and Modicana (MOD), using the Illumina Bovine SNP50K v2 BeadChip. To understand the genetic relationship and to place both Sicilian breeds in a global context, genotypes from 134 other domesticated bovid breeds were used. Principal component analysis showed that the Sicilian cattle breeds were closer to individuals of Bos taurus taurus from Eurasia and formed nonoverlapping clusters with other breeds. Between the Sicilian cattle breeds, MOD was the most differentiated, whereas the animals belonging to the CIN breed showed a lower value of assignment, the presence of substructure, and genetic links with the MOD breed. The average molecular inbreeding and coancestry coefficients were moderately high, and the current estimates of Ne were low in both breeds. These values indicated a low genetic variability. Considering levels of LD between adjacent markers, the average r² in the MOD breed was comparable to those reported for other cattle breeds, whereas CIN showed a lower value. Therefore, these results support the need for denser SNP arrays for high-power association mapping and genomic selection efficiency, particularly for the CIN cattle breed. Controlling molecular inbreeding and coancestry would restrict inbreeding depression, the probability of losing beneficial rare alleles, and therefore the risk of extinction. The results generated from this study have important implications for the development of conservation and/or selection breeding programs in these 2 local cattle breeds.

  13. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    USDA-ARS's Scientific Manuscript database

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...

  14. Genome-wide haplotype changes produced by artificial selection during modern rice breeding in Japan.

    PubMed

    Yonemaru, Jun-ichi; Yamamoto, Toshio; Ebana, Kaworu; Yamamoto, Eiji; Nagasaki, Hideki; Shibaya, Taeko; Yano, Masahiro

    2012-01-01

    During the last 90 years, the breeding of rice has delivered cultivars with improved agronomic and economic characteristics. Crossing of different lines and successive artificial selection of progeny based on their phenotypes have changed the chromosomal constitution of the ancestors of modern rice; however, the nature of these changes is unclear. The recent accumulation of data for genome-wide single-nucleotide polymorphisms (SNPs) in rice has allowed us to investigate the change in haplotype structure and composition. To assess the impact of these changes during modern breeding, we studied 177 Japanese rice accessions, which were categorized into three groups: landraces, improved cultivars developed from 1931 to 1974 (the early breeding phase), and improved cultivars developed from 1975 to 2005 (the late breeding phase). Phylogenetic tree and structure analysis indicated genetic differentiation between non-irrigated (upland) and irrigated (lowland) rice groups as well as genetic structuring within the irrigated rice group that corresponded to the existence of three subgroups. Pedigree analysis revealed that a limited number of landraces and cultivars was used for breeding at the beginning of the period of systematic breeding and that 11 landraces accounted for 70% of the ancestors of the modern improved cultivars. The values for linkage disequilibrium estimated from SNP alleles and the haplotype diversity determined from consecutive alleles in five-SNP windows indicated that haplotype blocks became less diverse over time as a result of the breeding process. A decrease in haplotype diversity, caused by a reduced number of polymorphisms in the haplotype blocks, was observed in several chromosomal regions. However, our results also indicate that new haplotype polymorphisms have been generated across the genome during the breeding process. These findings will facilitate our understanding of the association between particular haplotypes and desirable phenotypes in

  15. A genome-wide scan for signatures of differential artificial selection in ten cattle breeds.

    PubMed

    Rothammer, Sophie; Seichter, Doris; Förster, Martin; Medugorac, Ivica

    2013-12-21

    Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass weight, back fat thickness and

  16. A genome-wide scan for signatures of differential artificial selection in ten cattle breeds

    PubMed Central

    2013-01-01

    Background: Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). Results: We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. Conclusions: These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass

  17. Genome-wide association analysis for quantitative trait loci influencing Warner–Bratzler shear force in five taurine cattle breeds

    PubMed Central

    McClure, M C; Ramey, H R; Rolf, M M; McKay, S D; Decker, J E; Chapple, R H; Kim, J W; Taxis, T M; Weaber, R L; Schnabel, R D; Taylor, J F

    2012-01-01

    Summary: We performed a genome-wide association study for Warner–Bratzler shear force (WBSF), a measure of meat tenderness, by genotyping 3360 animals from five breeds with 54 790 BovineSNP50 and 96 putative single-nucleotide polymorphisms (SNPs) within μ-calpain [HUGO nomenclature calpain 1, (mu/I) large subunit; CAPN1] and calpastatin (CAST). Within- and across-breed analyses estimated SNP allele substitution effects (ASEs) by genomic best linear unbiased prediction (GBLUP) and variance components by restricted maximum likelihood under an animal model incorporating a genomic relationship matrix. GBLUP estimates of ASEs from the across-breed analysis were moderately correlated (0.31–0.66) with those from the individual within-breed analyses, indicating that prediction equations for molecular estimates of breeding value developed from across-breed analyses should be effective for genomic selection within breeds. We identified 79 genomic regions associated with WBSF in at least three breeds, but only eight were detected in all five breeds, suggesting that the within-breed analyses were underpowered, that different quantitative trait loci (QTL) underlie variation between breeds or that the BovineSNP50 SNP density is insufficient to detect common QTL among breeds. In the across-breed analysis, CAPN1 was followed by CAST as the most strongly associated WBSF QTL genome-wide, and associations with both were detected in all five breeds. We show that none of the four commercialized CAST and CAPN1 SNP diagnostics are causal for associations with WBSF, and we putatively fine-map the CAPN1 causal mutation to a 4581-bp region. We estimate that variation in CAST and CAPN1 explains 1.02 and 1.85% of the phenotypic variation in WBSF respectively. PMID:22497286

  18. A genome-wide association study of malting quality across eight U.S. barley breeding programs

    USDA-ARS's Scientific Manuscript database

    This study leverages the breeding data of 1,862 breeding lines evaluated in 97 field trials for genome-wide association study of malting quality traits in barley. The breeding lines were six-row and two-row barley advanced breeding lines from eight barley breeding populations established at six pub...

  19. A genome-wide scan for signatures of selection in Chinese indigenous and commercial pig breeds

    PubMed Central

    2014-01-01

    Background Modern breeding and artificial selection play critical roles in pig domestication and shape the genetic variation of different breeds. China has many indigenous pig breeds with various characteristics in morphology and production performance that differ from those of foreign commercial pig breeds. However, the signatures of selection on genes implying for economic traits between Chinese indigenous and commercial pigs have been poorly understood. Results We identified footprints of positive selection at the whole genome level, comprising 44,652 SNPs genotyped in six Chinese indigenous pig breeds, one developed breed and two commercial breeds. An empirical genome-wide distribution of Fst (F-statistics) was constructed based on estimations of Fst for each SNP across these nine breeds. We detected selection at the genome level using the High-Fst outlier method and found that 81 candidate genes show high evidence of positive selection. Furthermore, the results of network analyses showed that the genes that displayed evidence of positive selection were mainly involved in the development of tissues and organs, and the immune response. In addition, we calculated the pairwise Fst between Chinese indigenous and commercial breeds (CHN VS EURO) and between Northern and Southern Chinese indigenous breeds (Northern VS Southern). The IGF1R and ESR1 genes showed evidence of positive selection in the CHN VS EURO and Northern VS Southern groups, respectively. Conclusions In this study, we first identified the genomic regions that showed evidences of selection between Chinese indigenous and commercial pig breeds using the High-Fst outlier method. These regions were found to be involved in the development of tissues and organs, the immune response, growth and litter size. The results of this study provide new insights into understanding the genetic variation and domestication in pigs. PMID:24422716

  20. A genome-wide scan for signatures of selection in Chinese indigenous and commercial pig breeds.

    PubMed

    Yang, Songbai; Li, Xiuling; Li, Kui; Fan, Bin; Tang, Zhonglin

    2014-01-15

    Modern breeding and artificial selection play critical roles in pig domestication and shape the genetic variation of different breeds. China has many indigenous pig breeds with various characteristics in morphology and production performance that differ from those of foreign commercial pig breeds. However, the signatures of selection on genes implying for economic traits between Chinese indigenous and commercial pigs have been poorly understood. We identified footprints of positive selection at the whole genome level, comprising 44,652 SNPs genotyped in six Chinese indigenous pig breeds, one developed breed and two commercial breeds. An empirical genome-wide distribution of Fst (F-statistics) was constructed based on estimations of Fst for each SNP across these nine breeds. We detected selection at the genome level using the High-Fst outlier method and found that 81 candidate genes show high evidence of positive selection. Furthermore, the results of network analyses showed that the genes that displayed evidence of positive selection were mainly involved in the development of tissues and organs, and the immune response. In addition, we calculated the pairwise Fst between Chinese indigenous and commercial breeds (CHN VS EURO) and between Northern and Southern Chinese indigenous breeds (Northern VS Southern). The IGF1R and ESR1 genes showed evidence of positive selection in the CHN VS EURO and Northern VS Southern groups, respectively. In this study, we first identified the genomic regions that showed evidences of selection between Chinese indigenous and commercial pig breeds using the High-Fst outlier method. These regions were found to be involved in the development of tissues and organs, the immune response, growth and litter size. The results of this study provide new insights into understanding the genetic variation and domestication in pigs.
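
    The abstract above describes a High-Fst outlier scan: Fst is estimated per SNP across breeds, an empirical genome-wide distribution is built, and SNPs in the upper tail are treated as candidates. The sketch below illustrates that idea under stated assumptions. The estimator (Fst = (Ht − Hs) / Ht with equal breed weights) and the tail cutoff are assumptions, since the abstract specifies neither, and all names and frequencies are hypothetical.

```python
from statistics import mean


def fst_per_snp(freqs):
    """Fst for one biallelic SNP from per-breed allele frequencies.

    Assumption: Fst = (Ht - Hs) / Ht with every breed weighted equally.
    """
    p_bar = mean(freqs)
    h_total = 2.0 * p_bar * (1.0 - p_bar)  # heterozygosity of the pooled population
    if h_total == 0.0:
        return 0.0  # monomorphic SNP: no differentiation to measure
    h_within = mean([2.0 * p * (1.0 - p) for p in freqs])  # mean within-breed value
    return (h_total - h_within) / h_total


def high_fst_outliers(snp_freqs, top_fraction=0.01):
    """Return per-SNP Fst values and the SNPs in the upper empirical tail.

    snp_freqs: dict mapping SNP name -> list of per-breed allele frequencies.
    top_fraction: share of SNPs to flag (hypothetical cutoff).
    """
    fst = {snp: fst_per_snp(freqs) for snp, freqs in snp_freqs.items()}
    ranked = sorted(fst, key=fst.get, reverse=True)
    n_keep = max(1, int(len(ranked) * top_fraction))
    return fst, ranked[:n_keep]


# Toy example with two breeds and three SNPs (hypothetical frequencies).
toy_freqs = {
    "snp_a": [0.1, 0.9],  # strongly differentiated
    "snp_b": [0.5, 0.5],  # no differentiation
    "snp_c": [0.4, 0.6],  # weakly differentiated
}
fst_values, outliers = high_fst_outliers(toy_freqs, top_fraction=0.34)
```

    Candidate genes would then be taken from annotation around the flagged SNPs; that step depends on a genome annotation and is not shown.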

  1. Genome-wide genetic diversity, population structure and admixture analysis in African and Asian cattle breeds.

    PubMed

    Edea, Z; Bhuiyan, M S A; Dessie, T; Rothschild, M F; Dadi, H; Kim, K S

    2015-02-01

    Knowledge about genetic diversity and population structure is useful for designing effective strategies to improve the production, management and conservation of farm animal genetic resources. Here, we present a comprehensive genome-wide analysis of genetic diversity, population structure and admixture based on 244 animals sampled from 10 cattle populations in Asia and Africa and genotyped for 69,903 autosomal single-nucleotide polymorphisms (SNPs) mainly derived from the indicine breed. Principal component analysis, STRUCTURE and distance analysis from high-density SNP data clearly revealed that the largest genetic difference occurred between the two domestic lineages (taurine and indicine), whereas Ethiopian cattle populations represent a mosaic of the humped zebu and taurine. Estimation of the genetic influence of zebu and taurine revealed that Ethiopian cattle were characterized by considerable levels of introgression from South Asian zebu, whereas Bangladeshi populations shared very low taurine ancestry. The relationships among Ethiopian cattle populations reflect their history of origin and admixture rather than phenotype-based distinctions. The high within-individual genetic variability observed in Ethiopian cattle represents an untapped opportunity for adaptation to changing environments and for implementation of within-breed genetic improvement schemes. Our results provide a basis for future applications of genome-wide SNP data to exploit the unique genetic makeup of indigenous cattle breeds and to facilitate their improvement and conservation.

  2. Genome-wide association studies of growth traits in three dairy cattle breeds using whole-genome sequence data.

    PubMed

    Mao, X; Sahana, G; De Koning, D-J; Guldbrandtsen, B

    2016-04-01

    Male calves and culled cows of dairy cattle are used for beef production. However, unlike beef breeds, the genetics of growth performance traits in dairy breeds have not been extensively studied. Here, we performed a genome-wide association study (GWAS) on Holsteins (n = 5,519), Jerseys (n = 1,231), and Red Dairy Cattle (n = 4,410) to identify QTL for growth traits. First, a GWAS was performed within breeds using whole-genome sequence variants. Later, a meta-analysis was performed to combine information across the 3 breeds. We have identified several QTL that have large effects on growth traits in Holsteins and Red Dairy Cattle but with little overlap across breeds. Only 1 QTL located on chromosome 10 was shared between Holsteins and Red Dairy Cattle. The most significant variant (BTA10:59,164,533, rs43636323; P-value = 2.8 × 10) in this QTL explained 2.4% of the total additive genetic variance in Red Dairy Cattle. The gene is a strong candidate for the underlying gene of this QTL. In Red Dairy Cattle, a QTL near 25 Mb on chromosome 14 was very significantly associated with growth traits, consistent with the previously reported gene , which affects growth in beef cattle and humans. No QTL for growth performance was statistically significant in Jerseys, possibly due to the low power of detection with the small sample size. The meta-analysis of the 3 breeds increased the power to detect QTL.

  3. Genome-wide association mapping of quantitative traits in a breeding population of sugarcane.

    PubMed

    Racedo, Josefina; Gutiérrez, Lucía; Perera, María Francisca; Ostengo, Santiago; Pardo, Esteban Mariano; Cuenya, María Inés; Welin, Bjorn; Castagnaro, Atilio Pedro

    2016-06-24

    Molecular markers associated with relevant agronomic traits could significantly reduce the time and cost involved in developing new sugarcane varieties. Previous sugarcane genome-wide association analyses (GWAS) have found few molecular markers associated with relevant traits at plant-cane stage. The aim of this study was to establish an appropriate GWAS to find molecular markers associated with yield related traits consistent across harvesting seasons in a breeding population. Sugarcane clones were genotyped with DArT (Diversity Array Technology) and TRAP (Target Region Amplified Polymorphism) markers, and evaluated for cane yield (CY) and sugar content (SC) at two locations during three successive crop cycles. GWAS mapping was applied within a novel mixed-model framework accounting for population structure with Principal Component Analysis scores as random component. A total of 43 markers significantly associated with CY in plant-cane, 42 in first ratoon, and 41 in second ratoon were detected. Out of these markers, 20 were associated with CY in 2 years. Additionally, 38 significant associations for SC were detected in plant-cane, 34 in first ratoon, and 47 in second ratoon. For SC, one marker-trait association was found significant for the 3 years of the study, while twelve markers presented association for 2 years. In the multi-QTL model several markers with large allelic substitution effect were found. Sequences of four DArT markers showed high similitude and e-value with coding sequences of Sorghum bicolor, confirming the high gene microlinearity between sorghum and sugarcane. In contrast with other sugarcane GWAS studies reported earlier, the novel methodology to analyze multi-QTLs through successive crop cycles used in the present study allowed us to find several markers associated with relevant traits. Combining existing phenotypic trial data and genotypic DArT and TRAP marker characterizations within a GWAS approach including population structure as

  4. Genome-wide analysis of DNA methylation in obese, lean, and miniature pig breeds

    PubMed Central

    Yang, Yalan; Zhou, Rong; Mu, Yulian; Hou, Xinhua; Tang, Zhonglin; Li, Kui

    2016-01-01

    DNA methylation is a crucial epigenetic modification involved in diverse biological processes. There is significant phenotypic variance between Chinese indigenous and western pig breeds. Here, we surveyed the genome-wide DNA methylation profiles of blood leukocytes from three pig breeds (Tongcheng, Landrace, and Wuzhishan) by methylated DNA immunoprecipitation sequencing. The results showed that DNA methylation was enriched in gene body regions and repetitive sequences. LINE/L1 and SINE/tRNA-Glu were the predominant methylated repeats in pigs. The methylation level in the gene body regions was higher than in the 5′ and 3′ flanking regions of genes. About 15% of CpG islands were methylated in the pig genomes. Additionally, 2,807, 2,969, and 5,547 differentially methylated genes (DMGs) were identified in the Tongcheng vs. Landrace, Tongcheng vs. Wuzhishan, and Landrace vs. Wuzhishan comparisons, respectively. A total of 868 DMGs were shared by the three contrasts. The DMGs were significantly enriched in development- and metabolism-related biological processes and pathways. Finally, we identified 32 candidate DMGs associated with phenotype variance in pigs. Our research provides a DNA methylome resource for pigs and furthers understanding of epigenetically regulated phenotype variance in mammals. PMID:27444743

  5. Genome-wide linkage disequilibrium and past effective population size in three Korean cattle breeds.

    PubMed

    Sudrajad, P; Seo, D W; Choi, T J; Park, B H; Roh, S H; Jung, W Y; Lee, S S; Lee, J H; Kim, S; Lee, S H

    2017-02-01

    The routine collection and use of genomic data are useful for effectively managing breeding programs for endangered populations. Linkage disequilibrium (LD) using high-density DNA markers has been widely used to determine population structures and predict the genomic regions that are associated with economic traits in beef cattle. The extent of LD also provides information about historical events, including past effective population size (Ne), and it allows inferences on the genetic diversity of breeds. The objective of this study was to estimate the LD and Ne in three Korean cattle breeds that are genetically similar but have different coat colors (Brown, Brindle and Jeju Black Hanwoo). Brindle and Jeju Black are endangered breeds with small populations, whereas Brown Hanwoo is the main breeding population in Korea. DNA samples from these cattle breeds were genotyped using the Illumina BovineSNP50 Bead Chip. We examined 13 cattle breeds, including European taurines, African taurines and indicines, and hybrids to compare their LD values. Brown Hanwoo consistently had the lowest mean LD compared to Jeju Black, Brindle and the other 13 cattle breeds (0.13, 0.19, 0.21 and 0.15-0.22 respectively). The high LD values of Brindle and Jeju Black contributed to small Ne values (53 and 60 respectively), which were distinct from that of Brown Hanwoo (531) 11 generations ago. The differences in LD and Ne for each breed reflect the breeding strategy applied. The Ne for these endangered cattle breeds remains low; thus, effort is needed to bring them back to a sustainable tract.
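
    The abstract above infers past effective population size from the extent of LD but does not state the estimator. A minimal sketch follows, assuming the commonly used Sved approximation E[r²] ≈ 1 / (1 + 4·Ne·c), where c is the recombination distance in Morgans, together with the usual convention that markers c Morgans apart inform about Ne roughly 1/(2c) generations ago. No sample-size correction is applied, and the function names and inputs are hypothetical.

```python
def ne_from_ld(mean_r2, c_morgans):
    """Effective population size implied by mean r^2 at distance c (Morgans).

    Assumption: Sved's approximation E[r^2] = 1 / (1 + 4 * Ne * c),
    rearranged to Ne = (1 / r^2 - 1) / (4 * c).
    """
    if not 0.0 < mean_r2 < 1.0:
        raise ValueError("mean_r2 must lie strictly between 0 and 1")
    if c_morgans <= 0.0:
        raise ValueError("c_morgans must be positive")
    return (1.0 / mean_r2 - 1.0) / (4.0 * c_morgans)


def generations_ago(c_morgans):
    """Approximate number of generations in the past that distance c reflects."""
    if c_morgans <= 0.0:
        raise ValueError("c_morgans must be positive")
    return 1.0 / (2.0 * c_morgans)


# Hypothetical inputs: mean r^2 of 0.2 between markers 0.01 Morgans apart.
example_ne = ne_from_ld(0.2, 0.01)
example_generations = generations_ago(0.05)
```

    Because higher r² at a given distance yields a smaller Ne under this approximation, the sketch is consistent with the abstract's observation that the breeds with higher LD have the smaller Ne estimates.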

  6. An across-breed genome wide association analysis of susceptibility to paratuberculosis in dairy cattle.

    PubMed

    Sallam, Ahmed M; Zare, Yalda; Alpay, Fazli; Shook, George E; Collins, Michael T; Alsheikh, Samir; Sharaby, Mahmoud; Kirkpatrick, Brian W

    2017-02-01

    Paratuberculosis is a chronic disease of ruminants caused by Mycobacterium avium subspecies paratuberculosis (MAP). It occurs worldwide and causes a significant loss in the animal production industry. There is no cure for MAP infection and vaccination is problematic. Identification of genetics of susceptibility could be a useful adjunct for programs that focus on management, testing and culling of diseased animals. A case-control, genome-wide association study (GWAS) was conducted using Holstein and Jersey cattle in a combined analysis in order to identify markers and chromosomal regions associated with susceptibility to MAP infection across-breed. A mixed-model method (GRAMMAR-GC) implemented in the GenABEL R package and a Bayes C analysis implemented in GenSel software were used as alternative approaches to conduct GWAS analysis focused on single SNPs and chromosomal segments, respectively. After conducting quality control, 22 406 SNPs from 2157 individuals were available for the GRAMMAR-GC (Bayes C) analysis and 45 640 SNPs from 2199 individuals were available for the Bayes C analysis. One SNP located on BTA27 (8·6 Mb) was identified as moderately associated (P < 5 × 10(-5), FDR = 0·44) in the GRAMMAR-GC analysis of the combined breed data. Nine 1 Mb windows located on BTA 2, 3 (3 windows), 6, 8, 25, 27 and 29 each explained ≥1% of the total proportion of genetic variance in the Bayes C analysis. In an analysis ignoring differences in linkage phase, two moderately significantly associated SNPs were identified; ARS-BFGL-NGS-19381 on BTA23 (32 Mb) and Hapmap40994-BTA-46361 on BTA19 (61 Mb). New common genomic regions and candidate genes have been identified from the across-breed analysis that might be involved in the immune response and susceptibility to MAP infection.

  7. Genome-wide association studies for reproductive seasonality traits in Rasa Aragonesa sheep breed.

    PubMed

    Martinez-Royo, Albert; Alabart, José Luis; Sarto, Pilar; Serrano, Magdalena; Lahoz, Belén; Folch, José; Calvo, Jorge Hugo

    2017-09-01

    Sheep breeds from the Mediterranean area show reproductive seasonal patterns of oestrous behaviour and ovulatory activity, mainly regulated by variation in the photoperiod. Maximal reproductive activity is associated with short days from August to March. The aim of this study was therefore to identify new SNPs and genes associated with reproductive seasonality in sheep by using the Illumina OvineSNP50 Beadchip. A total of 239 adult Rasa Aragonesa breed ewes from one flock were controlled from January to August. Three reproductive seasonality traits were considered: the total days of anoestrus (TDA), based on weekly individual plasma progesterone levels and defined as the sum of days in anoestrus, considering anoestrus those periods with three or more consecutive P4 concentrations lower than 0.5 ng/ml; the progesterone cycling months (P4CM), defined for each ewe as the rate of cycling months between January and August based on progesterone determinations; and the oestrus cycling months (OCM), defined for each ewe as the rate of months cycling between January and August based on oestrus records. Genotyping of 123 ewes was performed with the OvineSNP50 Infinium Beadchip. After the quality control (QC) performed on the raw genotypes, a total of 47,206 SNPs distributed over the 27 ovine chromosomes and 110 ewes were included in subsequent analyses. Principal component analysis revealed a substructure within the total dataset and identified 4 principal clusters in the experimental flock. None of the SNPs overcame the genome-wide significance level (P = 1.06 × 10(-6)). However, the SNPs OAR4_66002395 (9.41E-6), and OAR8_25877010 (1.86E-5) reached the genome-wide suggestive significance level (set to 2.32 × 10(-5)) for TDA and P4CM traits, respectively, while OAR23_14608581 was significant for both TDA (2.02E-5) and P4CM (1.05E-5) traits. Five SNPs evidenced association at chromosome-wise level: SNPs OAR4_66002395, OAR23_14608581 and s20800 (TDA), and OAR8_25877010, OAR
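
    The record above reports a genome-wide significance level of 1.06 × 10(-6) for 47,206 SNPs without stating how it was derived. As an illustration only, the sketch below shows that a Bonferroni correction at α = 0.05 over 47,206 tests gives approximately the same value; whether the authors used this correction is an assumption. The suggestive level of 2.32 × 10(-5) is taken from the abstract as given, and the function names are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def classify_p_value(p_value, genome_wide, suggestive):
    """Label a p-value against genome-wide and suggestive thresholds."""
    if p_value < genome_wide:
        return "genome-wide significant"
    if p_value < suggestive:
        return "suggestive"
    return "not significant"


# 47,206 SNPs passed quality control according to the abstract.
genome_wide_level = bonferroni_threshold(0.05, 47206)  # about 1.06e-6
suggestive_level = 2.32e-5  # value quoted in the abstract

# P-values quoted in the abstract for the TDA trait.
tda_labels = {
    "OAR4_66002395": classify_p_value(9.41e-6, genome_wide_level, suggestive_level),
    "OAR23_14608581": classify_p_value(2.02e-5, genome_wide_level, suggestive_level),
}
```

    Under these thresholds both quoted SNPs fall in the suggestive band, which matches the abstract's statement that no SNP reached genome-wide significance.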

  8. Genome wide association study of seedling and adult plant leaf rust resistance in elite spring wheat breeding lines

    USDA-ARS's Scientific Manuscript database

    Leaf rust is an important disease, threatening wheat production annually. Identification of resistance genes or QTLs for effective field resistance could greatly enhance our ability to breed durably resistant varieties. We applied a genome wide association study (GWAS) approach to identify resista...

  9. Genetic parameters and genome-wide associations of twinning rate in a local breed, the Maremmana cattle.

    PubMed

    Moioli, B; Steri, R; Marchitelli, C; Catillo, G; Buttazzoni, L

    2017-10-01

    This study seeks to verify the feasibility of increasing twinning in a herd of the Italian autochthonous Maremmana breed. The data set included 1260 individuals born from 1963 to 2014, 527 males and 733 females, 402 of them calving at least once from 1983 through 2015. Breeding values for twinning were estimated by a single-trait linear animal model. However, since twinning is a dichotomous trait and the frequency of twins is far smaller than the frequency of single births, breeding values were also estimated by a single-trait animal threshold model. Heritability of twinning was 0.014±0.018 and 0.062±0.093 for the linear and the threshold models, respectively. Repeatability was 0.071±0.004 and 0.286±0.012, respectively, for the two models. Genotyping with the Illumina BovineSNP54 BeadChip was performed for cows living on farm in 2012 (119 cows) and a genome-wide association analysis was performed on the corrected phenotype of all calving during the lifespan of each cow, using the GenABEL package in R and a three step GRAMMAR-GC approach. Genomic heritability, calculated from the genomic kinship matrix estimated through genomic marker data, was 0.29±0.021. The most significant single nucleotide polymorphism detected (Hapmap22923-BTA-129564) was located in proximity of two genes, ARHGAP8 and TMEM200C, which might be potential functional candidates for twinning rate in cattle.

  10. A genome-wide SNP panel for genetic diversity, mapping and breeding studies in rice

    USDA-ARS's Scientific Manuscript database

    A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...

  11. Characterizing the population structure and genetic diversity of maize breeding germplasm in Southwest China using genome-wide SNP markers.

    PubMed

    Zhang, Xiao; Zhang, Hua; Li, Lujiang; Lan, Hai; Ren, Zhiyong; Liu, Dan; Wu, Ling; Liu, Hailan; Jaqueth, Jennifer; Li, Bailin; Pan, Guangtang; Gao, Shibin

    2016-08-31

    representatively not only illustrates the foundation and evolution trend of maize breeding resource as a theoretical reference for the improvement of heterosis, but also provides plenty of information for genetic researches such as genome-wide association study and marker-assisted selection in the future.

  12. Merino and Merino-derived sheep breeds: a genome-wide intercontinental study.

    PubMed

    Ciani, Elena; Lasagna, Emiliano; D'Andrea, Mariasilvia; Alloggio, Ingrid; Marroni, Fabio; Ceccobelli, Simone; Delgado Bermejo, Juan V; Sarti, Francesca M; Kijas, James; Lenstra, Johannes A; Pilla, Fabio

    2015-08-14

    Merino and Merino-derived sheep breeds have been widely distributed across the world, both as purebred and admixed populations. They represent an economically and historically important genetic resource which over time has been used as the basis for the development of new breeds. In order to examine the genetic influence of Merino in the context of a global collection of domestic sheep breeds, we analyzed genotype data that were obtained with the OvineSNP50 BeadChip (Illumina) for 671 individuals from 37 populations, including a subset of breeds from the Sheep HapMap dataset. Based on a multi-dimensional scaling analysis, we highlighted four main clusters in this dataset, which corresponded to wild sheep, mouflon, primitive North European breeds and modern sheep (including Merino), respectively. The neighbor-network analysis further differentiated North-European and Mediterranean domestic breeds, with subclusters of Merino and Merino-derived breeds, other Spanish breeds and other Italian breeds. Model-based clustering, migration analysis and haplotype sharing indicated that genetic exchange occurred between archaic populations and also that a more recent Merino-mediated gene flow to several Merino-derived populations around the world took place. The close relationship between Spanish Merino and other Spanish breeds was consistent with an Iberian origin for the Merino breed, with possible earlier contributions from other Mediterranean stocks. The Merino populations from Australia, New Zealand and China were clearly separated from their European ancestors. We observed a genetic substructuring in the Spanish Merino population, which reflects recent herd management practices. Our data suggest that intensive gene flow, founder effects and geographic isolation are the main factors that determined the genetic makeup of current Merino and Merino-derived breeds. To explain how the current Merino and Merino-derived breeds were obtained, we propose a scenario that includes

  13. Sniffing out significant "Pee values": genome wide association study of asparagus anosmia.

    PubMed

    Markt, Sarah C; Nuttall, Elizabeth; Turman, Constance; Sinnott, Jennifer; Rimm, Eric B; Ecsedy, Ethan; Unger, Robert H; Fall, Katja; Finn, Stephen; Jensen, Majken K; Rider, Jennifer R; Kraft, Peter; Mucci, Lorelei A

    2016-12-13

    Objective To determine the inherited factors associated with the ability to smell asparagus metabolites in urine. Design Genome wide association study. Setting Nurses' Health Study and Health Professionals Follow-up Study cohorts. Participants 6909 men and women of European-American descent with available genetic data from genome wide association studies. Main outcome measures Participants were characterized as asparagus smellers if they strongly agreed with the prompt "after eating asparagus, you notice a strong characteristic odor in your urine," and anosmic if otherwise. We calculated per-allele estimates of asparagus anosmia for about nine million single nucleotide polymorphisms using logistic regression. P values <5×10(-8) were considered as genome wide significant. Results 58.0% of men (n=1449/2500) and 61.5% of women (n=2712/4409) had anosmia. 871 single nucleotide polymorphisms reached genome wide significance for asparagus anosmia, all in a region on chromosome 1 (1q44: 248139851-248595299) containing multiple genes in the olfactory receptor 2 (OR2) family. Conditional analyses revealed three independent markers associated with asparagus anosmia: rs13373863, rs71538191, and rs6689553. Conclusions A large proportion of people have asparagus anosmia. Genetic variation near multiple olfactory receptor genes is associated with the ability of an individual to smell the metabolites of asparagus in urine. Future replication studies are necessary before considering targeted therapies to help anosmic people discover what they are missing.

  14. Genome-Wide Survey of SNP Variation Uncovers the Genetic Structure of Cattle Breeds

    PubMed Central

    2009-01-01

    The imprints of domestication and breed development on the genomes of livestock likely differ from those of companion animals. A deep draft sequence assembly of shotgun reads from a single Hereford female and comparative sequences sampled from six additional breeds were used to develop probes to interrogate 37,470 single-nucleotide polymorphisms (SNPs) in 497 cattle from 19 geographically and biologically diverse breeds. These data show that cattle have undergone a rapid recent decrease in effective population size from a very large ancestral population, possibly due to bottlenecks associated with domestication, selection, and breed formation. Domestication and artificial selection appear to have left detectable signatures of selection within the cattle genome, yet the current levels of diversity within breeds are at least as great as exists within humans. PMID:19390050

  15. Multi-breed genome-wide association study reveals heterogeneous loci associated with loin eye area in pigs.

    PubMed

    He, Yuna; Ma, Junwu; Zhang, Feng; Hou, Lijuan; Chen, Hao; Guo, Yuanmei; Zhang, Zhiyan

    2016-11-01

    Numerous quantitative trait loci (QTL) for loin eye area had been identified by linkage mapping studies, but the lack of their precise position hinders their application in the pig breeding industry. To map QTL for loin eye area to a precise genomic region, we conducted a genome-wide association study (GWAS) using Illumina 60 K PorcineSNP60 Beadchip in four swine populations: 819 F2 pigs, 273 Laiwu pigs, 434 Sutai pigs, and 326 Erhualian pigs. In total, 26 single nucleotide polymorphisms (SNPs) deposited on seven chromosomes associated with loin eye area were identified, 11 of which surpassed the genome-wide significant threshold; of the 11 SNPs, seven located on SSC2 in F2 pigs and four located on SSC12 and SSC18 in Laiwu pigs. Of note, all of the identified QTL were breed specific and no common QTL was identified across the four populations in our study. These findings not only confirmed a previous QTL on SSC2 harboring the candidate gene insulin-like growth factor 2 (IGF2), but also identified some novel candidate genes, far upstream element binding protein 3 (FUBP3), myosin heavy chain (MYH) family, leucine-rich repeats and guanylate kinase domain containing (LRGUK). Our study will contribute to the further identification of the causal mutation underlying these QTL and improve our knowledge of the complex genetic architecture for loin eye area in pigs.

  16. Genomic prediction in contrast to a genome-wide association study in explaining heritable variation of complex growth traits in breeding populations of Eucalyptus.

    PubMed

    Müller, Bárbara S F; Neves, Leandro G; de Almeida Filho, Janeo E; Resende, Márcio F R; Muñoz, Patricio R; Dos Santos, Paulo E T; Filho, Estefano Paludzyszyn; Kirst, Matias; Grattapaglia, Dario

    2017-07-11

    The advent of high-throughput genotyping technologies coupled to genomic prediction methods established a new paradigm to integrate genomics and breeding. We carried out whole-genome prediction and contrasted it to a genome-wide association study (GWAS) for growth traits in breeding populations of Eucalyptus benthamii (n = 505) and Eucalyptus pellita (n = 732). Both species are of increasing commercial interest for the development of germplasm adapted to environmental stresses. Predictive ability reached 0.16 in E. benthamii and 0.44 in E. pellita for diameter growth. Predictive abilities using either Genomic BLUP or different Bayesian methods were similar, suggesting that growth adequately fits the infinitesimal model. Genomic prediction models using ~5000-10,000 SNPs provided predictive abilities equivalent to using all 13,787 and 19,506 SNPs genotyped in the E. benthamii and E. pellita populations, respectively. No difference was detected in predictive ability when different sets of SNPs were utilized, based on position (equidistantly genome-wide, inside genes, linkage disequilibrium pruned or on single chromosomes), as long as the total number of SNPs used was above ~5000. Predictive abilities obtained by removing relatedness between training and validation sets fell near zero for E. benthamii and were halved for E. pellita. These results corroborate the current view that relatedness is the main driver of genomic prediction, although some short-range historical linkage disequilibrium (LD) was likely captured for E. pellita. A GWAS identified only one significant association for volume growth in E. pellita, illustrating the fact that while genome-wide regression is able to account for large proportions of the heritability, very little or none of it is captured into significant associations using GWAS in breeding populations of the size evaluated in this study. This study provides further experimental data supporting positive prospects of using genome-wide data to

  17. Genome-Wide Analysis Reveals Selection for Important Traits in Domestic Horse Breeds

    PubMed Central

    Petersen, Jessica L.; Mickelson, James R.; Rendahl, Aaron K.; Valberg, Stephanie J.; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Capomaccio, Stefano; Cappelli, Katia; Cothran, E. Gus; Distl, Ottmar; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T.; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; Wade, Claire M.; McCue, Molly E.

    2013-01-01

    Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an FST-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse. PMID:23349635

  18. Genome-wide analysis reveals selection for important traits in domestic horse breeds.

    PubMed

    Petersen, Jessica L; Mickelson, James R; Rendahl, Aaron K; Valberg, Stephanie J; Andersson, Lisa S; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M; Borges, Alexandre S; Brama, Pieter; da Câmara Machado, Artur; Capomaccio, Stefano; Cappelli, Katia; Cothran, E Gus; Distl, Ottmar; Fox-Clipsham, Laura; Graves, Kathryn T; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A; Mikko, Sofia; Orr, Nicholas; Penedo, M Cecilia T; Piercy, Richard J; Raekallio, Marja; Rieder, Stefan; Røed, Knut H; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; Wade, Claire M; McCue, Molly E

    2013-01-01

    Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an FST-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse.

  19. Genome Wide Association Study of Seedling and Adult Plant Leaf Rust Resistance in Elite Spring Wheat Breeding Lines

    PubMed Central

    Gao, Liangliang; Turner, M. Kathryn; Chao, Shiaoman; Kolmer, James; Anderson, James A.

    2016-01-01

    Leaf rust is an important disease, threatening wheat production annually. Identification of resistance genes or QTLs for effective field resistance could greatly enhance our ability to breed durably resistant varieties. We applied a genome wide association study (GWAS) approach to identify resistance genes or QTLs in 338 spring wheat breeding lines from public and private sectors that were predominately developed in the Americas. A total of 46 QTLs were identified for field and seedling traits and approximately 20–30 confer field resistance in varying degrees. The 10 QTLs accounting for the most variation in field resistance explained 26–30% of the total variation (depending on traits: percent severity, coefficient of infection or response type). Similarly, the 10 QTLs accounting for most of the variation in seedling resistance to different races explained 24–34% of the variation, after correcting for population structure. Two potentially novel QTLs (QLr.umn-1AL, QLr.umn-4AS) were identified. Identification of novel genes or QTLs and validation of previously identified genes or QTLs for seedling and especially adult plant resistance will enhance understanding of leaf rust resistance and assist breeding for resistant wheat varieties. We also developed computer programs to automate field and seedling rust phenotype data conversions. This is the first GWAS study of leaf rust resistance in elite wheat breeding lines genotyped with high density 90K SNP arrays. PMID:26849364

  20. Genome-wide association study and genomic prediction in citrus: Potential of genomics-assisted breeding for fruit quality traits.

    PubMed

    Minamikawa, Mai F; Nonaka, Keisuke; Kaminuma, Eli; Kajiya-Kanegae, Hiromi; Onogi, Akio; Goto, Shingo; Yoshioka, Terutaka; Imai, Atsushi; Hamada, Hiroko; Hayashi, Takeshi; Matsumoto, Satomi; Katayose, Yuichi; Toyoda, Atsushi; Fujiyama, Asao; Nakamura, Yasukazu; Shimizu, Tokurou; Iwata, Hiroyoshi

    2017-07-05

    Novel genomics-based approaches such as genome-wide association studies (GWAS) and genomic selection (GS) are expected to be useful in fruit tree breeding, which requires much time from the cross to the release of a cultivar because of the long generation time. In this study, a citrus parental population (111 varieties) and a breeding population (676 individuals from 35 full-sib families) were genotyped for 1,841 single nucleotide polymorphisms (SNPs) and phenotyped for 17 fruit quality traits. GWAS power and prediction accuracy were increased by combining the parental and breeding populations. A multi-kernel model considering both additive and dominance effects improved prediction accuracy for acidity and juiciness, implying that the effects of both types are important for these traits. Genomic best linear unbiased prediction (GBLUP) with linear ridge kernel regression (RR) was more robust and accurate than GBLUP with non-linear Gaussian kernel regression (GAUSS) in the tails of the phenotypic distribution. The results of this study suggest that both GWAS and GS are effective for genetic improvement of citrus fruit traits. Furthermore, the data collected from breeding populations are beneficial for increasing the detection power of GWAS and the prediction accuracy of GS.

  1. Genome-wide copy number variations using SNP genotyping in a mixed breed swine population

    USDA-ARS's Scientific Manuscript database

    Copy number variations (CNVs) are increasingly understood to affect phenotypic variation. This study uses SNP genotyping of trios of mixed breed swine to add to the catalog of known genotypic variation in an important agricultural animal. Porcine SNP60 BeadChip genotypes were collected from 1802 pi...

  2. Population genomic structure and linkage disequilibrium analysis of South African goat breeds using genome-wide SNP data.

    PubMed

    Mdladla, K; Dzomba, E F; Huson, H J; Muchadeyi, F C

    2016-08-01

    The sustainability of goat farming in marginal areas of southern Africa depends on local breeds that are adapted to specific agro-ecological conditions. Unimproved non-descript goats are the main genetic resources used for the development of commercial meat-type breeds of South Africa. Little is known about genetic diversity and the genetics of adaptation of these indigenous goat populations. This study investigated the genetic diversity, population structure and breed relations, linkage disequilibrium, effective population size and persistence of gametic phase in goat populations of South Africa. Three locally developed meat-type breeds of the Boer (n = 33), Savanna (n = 31), Kalahari Red (n = 40), a feral breed of Tankwa (n = 25) and unimproved non-descript village ecotypes (n = 110) from four goat-producing provinces of the Eastern Cape, KwaZulu-Natal, Limpopo and North West were assessed using the Illumina Goat 50K SNP Bead Chip assay. The proportion of SNPs with minor allele frequencies >0.05 ranged from 84.22% in the Tankwa to 97.58% in the Xhosa ecotype, with a mean of 0.32 ± 0.13 across populations. Principal components analysis, admixture and pairwise FST identified Tankwa as a genetically distinct population and supported clustering of the populations according to their historical origins. Genome-wide FST identified 101 markers potentially under positive selection in the Tankwa. Average linkage disequilibrium was highest in the Tankwa (r² = 0.25 ± 0.26) and lowest in the village ecotypes (r² range = 0.09 ± 0.12 to 0.11 ± 0.14). We observed an effective population size of <150 for all populations 13 generations ago. The estimated correlations for all breed pairs were lower than 0.80 at marker distances >100 kb with the exception of those in Savanna and Tswana populations. This study highlights the high level of genetic diversity in South African indigenous goats as well as the utility of the genome-wide SNP marker panels in
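
    Two of the summary quantities named in this abstract, minor allele frequency and pairwise linkage disequilibrium (r²), can be illustrated with a short sketch. It assumes unphased genotypes coded as 0, 1 or 2 copies of one allele and uses the squared genotype correlation as an approximation to r²; the study itself may have used a different estimator. Function names and example genotypes are hypothetical.

```python
def minor_allele_freq(genotypes):
    """Minor allele frequency from genotypes coded 0/1/2 (allele copies)."""
    p = sum(genotypes) / (2 * len(genotypes))
    return min(p, 1 - p)


def genotype_r2(g1, g2):
    """Squared Pearson correlation between two SNPs' genotype codes,
    a common approximation to LD r² when phase is unknown."""
    n = len(g1)
    if n == 0 or n != len(g2):
        raise ValueError("genotype vectors must be non-empty and equal length")
    m1, m2 = sum(g1) / n, sum(g2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    v1 = sum((a - m1) ** 2 for a in g1)
    v2 = sum((b - m2) ** 2 for b in g2)
    if v1 == 0 or v2 == 0:
        return float("nan")                # undefined for a monomorphic SNP
    return cov * cov / (v1 * v2)


# Hypothetical genotypes for four animals at two SNPs.
snp_a = [0, 1, 2, 1]
snp_b = [2, 1, 0, 1]
maf_a = minor_allele_freq(snp_a)
r2_ab = genotype_r2(snp_a, snp_b)
```

    Averaging such pairwise values within distance bins gives the kind of LD decay summary the abstract reports.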

  3. Genome-wide analysis reveals population structure and selection in Chinese indigenous sheep breeds.

    PubMed

    Wei, Caihong; Wang, Huihua; Liu, Gang; Wu, Mingming; Cao, Jiaxve; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Zhang, Li; Lu, Jian; Liu, Chousheng; Du, Lixin

    2015-03-17

    Traditionally, Chinese indigenous sheep were classified geographically and morphologically into three groups: Mongolian, Kazakh and Tibetan. Herein, we aimed to evaluate the population structure and genome selection among 140 individuals from ten representative Chinese indigenous sheep breeds: Ujimqin, Hu, Tong, Large-Tailed Han and Lop breed (Mongolian group); Duolang and Kazakh (Kazakh group); and Diqing, Plateau-type Tibetan, and Valley-type Tibetan breed (Tibetan group). We analyzed the population using principal component analysis (PCA), STRUCTURE and a Neighbor-Joining (NJ)-tree. In the PCA plot, the Tibetan and Mongolian groups were clustered as expected; however, Duolang and Kazakh (Kazakh group) were segregated. STRUCTURE analyses suggested two subpopulations: one from North China (Kazakh and Mongolian groups) and the other from the Southwest (Tibetan group). In the NJ-tree, the Tibetan group formed an independent branch and the Kazakh and Mongolian groups were mixed. We then used the di statistic approach to reveal selection in Chinese indigenous sheep breeds. Among the 599 genome sequence windows analyzed, sixteen (2.7%) exhibited signatures of selection in four or more breeds. We detected three strong selection windows involving three functional genes: RXFP2, PPP1CC and PDGFD. PDGFD, one of the four subfamilies of PDGF, which promotes proliferation and inhibits differentiation of preadipocytes, was significantly selected in fat type breeds by the Rsb (across pairs of populations) approach. Two consecutive selection regions in Duolang sheep were clearly different from those in other breeds. One region was in OAR2 including three genes (NPR2, SPAG8 and HINT2) that influence growth traits. The other region was in OAR6 including four genes (PKD2, SPP1, MEPE, and IBSP) associated with a milk production quantitative trait locus. We also identified known candidate genes such as BMPR1B, MSRB3, and three genes (KIT, MC1R, and FRY) that influence lambing percentage, ear size

  4. Genome-Wide Detection of Copy Number Variations among Diverse Horse Breeds by Array CGH

    PubMed Central

    Hou, Chenglin; Xing, Yanping; Cao, Junwei; Wu, Kaifeng; Liu, Chunxia; Zhang, Dong; Zhang, Li; Zhang, Yanru; Zhou, Huanmin

    2014-01-01

    Recent studies have found that copy number variations (CNVs) are widespread in human and animal genomes. CNVs are a significant source of genetic variation, and have been shown to be associated with phenotypic diversity. However, the effect of CNVs on genetic variation in horses is not well understood. In the present study, CNVs in mares of 6 different breeds, namely Mongolia horse, Abaga horse, Hequ horse and Kazakh horse (all plateau breeds), Debao pony and Thoroughbred, were determined using aCGH. In total, seven hundred CNVs were identified ranging in size from 6.1 Kb to 0.57 Mb across all autosomes, with an average size of 43.08 Kb and a median size of 15.11 Kb. By merging overlapping CNVs, we found a total of three hundred and fifty-three CNV regions (CNVRs). The length of the CNVRs ranged from 6.1 Kb to 1.45 Mb with average and median sizes of 38.49 Kb and 13.1 Kb. Collectively, 13.59 Mb of copy number variation was identified among the horses investigated and accounted for approximately 0.61% of the horse genome sequence. Five hundred and eighteen annotated genes were affected by CNVs, which corresponded to about 2.26% of all horse genes. Through the gene ontology (GO), genetic pathway analysis and comparison of CNV genes among different breeds, we found evidence that CNVs involving 7 genes may be related to the adaptation of these plateau horses to a severe environment. This study is the first report of copy number variations in Chinese horses, which indicates that CNVs are ubiquitous in the horse genome and influence many biological processes of the horse. These results will be helpful not only in mapping the horse whole-genome CNVs, but also in further research into the adaptation of plateau horses to the severe high-altitude environment. PMID:24497987
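
    The step of merging overlapping CNVs into CNV regions (CNVRs) amounts to an interval merge per chromosome. The sketch below is an assumption about how such a merge is commonly done, not the authors' pipeline; the function name, the coordinate convention (touching intervals are merged) and the example coordinates are hypothetical.

```python
def merge_cnvs(cnvs):
    """Merge overlapping CNV calls into CNV regions.

    cnvs: iterable of (chrom, start, end) with start <= end.
    Intervals on the same chromosome that overlap or touch are merged.
    """
    regions = []
    for chrom, start, end in sorted(cnvs):
        last = regions[-1] if regions else None
        if last and last[0] == chrom and start <= last[2]:
            last[2] = max(last[2], end)    # extend the current region
        else:
            regions.append([chrom, start, end])
    return [tuple(r) for r in regions]


# Hypothetical CNV calls pooled across animals.
calls = [
    ("chr1", 150, 300),
    ("chr1", 100, 200),
    ("chr2", 120, 180),
    ("chr1", 400, 500),
]
cnvrs = merge_cnvs(calls)
```

    Because merged regions can span several calls, CNVRs are fewer than CNVs and can be longer than any single call, which is consistent with the counts and size ranges reported in the abstract.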

  5. Genome-wide detection of copy number variations among diverse horse breeds by array CGH.

    PubMed

    Wang, Wei; Wang, Shenyuan; Hou, Chenglin; Xing, Yanping; Cao, Junwei; Wu, Kaifeng; Liu, Chunxia; Zhang, Dong; Zhang, Li; Zhang, Yanru; Zhou, Huanmin

    2014-01-01

    Recent studies have found that copy number variations (CNVs) are widespread in human and animal genomes. CNVs are a significant source of genetic variation, and have been shown to be associated with phenotypic diversity. However, the effect of CNVs on genetic variation in horses is not well understood. In the present study, CNVs in mares of 6 different breeds, namely Mongolia horse, Abaga horse, Hequ horse and Kazakh horse (all plateau breeds), Debao pony and Thoroughbred, were determined using aCGH. In total, seven hundred CNVs were identified ranging in size from 6.1 Kb to 0.57 Mb across all autosomes, with an average size of 43.08 Kb and a median size of 15.11 Kb. By merging overlapping CNVs, we found a total of three hundred and fifty-three CNV regions (CNVRs). The length of the CNVRs ranged from 6.1 Kb to 1.45 Mb with average and median sizes of 38.49 Kb and 13.1 Kb. Collectively, 13.59 Mb of copy number variation was identified among the horses investigated and accounted for approximately 0.61% of the horse genome sequence. Five hundred and eighteen annotated genes were affected by CNVs, which corresponded to about 2.26% of all horse genes. Through the gene ontology (GO), genetic pathway analysis and comparison of CNV genes among different breeds, we found evidence that CNVs involving 7 genes may be related to the adaptation of these plateau horses to a severe environment. This study is the first report of copy number variations in Chinese horses, which indicates that CNVs are ubiquitous in the horse genome and influence many biological processes of the horse. These results will be helpful not only in mapping the horse whole-genome CNVs, but also in further research into the adaptation of plateau horses to the severe high-altitude environment.

  6. Use of modern tomato breeding germplasm for deciphering the genetic control of agronomical traits by Genome Wide Association study.

    PubMed

    Bauchet, Guillaume; Grenier, Stéphane; Samson, Nicolas; Bonnet, Julien; Grivet, Laurent; Causse, Mathilde

    2017-05-01

    A panel of 300 tomato accessions including breeding materials was built and characterized with >11,000 SNP. A population structure in six subgroups was identified. Strong heterogeneity in linkage disequilibrium and recombination landscape among groups and chromosomes was shown. GWAS identified several associations for fruit weight, earliness and plant growth. Genome-wide association studies (GWAS) have become a method of choice in quantitative trait dissection. First limited to highly polymorphic and outcrossing species, it is now applied in horticultural crops, notably in tomato. Until now GWAS in tomato has been performed on panels of heirloom and wild accessions. Using modern breeding materials would be of direct interest for breeding purpose. To implement GWAS on a large panel of 300 tomato accessions including 168 breeding lines, this study assessed the genetic diversity and linkage disequilibrium decay and revealed the population structure and performed GWA experiment. Genetic diversity and population structure analyses were based on molecular markers (>11,000 SNP) covering the whole genome. Six genetic subgroups were revealed and associated to traits of agronomical interest, such as fruit weight and disease resistance. Estimates of linkage disequilibrium highlighted the heterogeneity of its decay among genetic subgroups. Haplotype definition allowed a fine characterization of the groups and their recombination landscape revealing the patterns of admixture along the genome. Selection footprints showed results in congruence with introgressions. Taken together, all these elements refined our knowledge of the genetic material included in this panel and allowed the identification of several associations for fruit weight, plant growth and earliness, deciphering the genetic architecture of these complex traits and identifying several new loci useful for tomato breeding.

  7. Genome-Wide Copy Number Variations Using SNP Genotyping in a Mixed Breed Swine Population

    PubMed Central

    Wiedmann, Ralph T.; Nonneman, Dan J.; Rohrer, Gary A.

    2015-01-01

    Copy number variations (CNVs) are increasingly understood to affect phenotypic variation. This study uses SNP genotyping of trios of mixed breed swine to add to the catalog of known genotypic variation in an important agricultural animal. PorcineSNP60 BeadChip genotypes were collected from 1802 pigs that combined to form 1621 trios. These trios were from the crosses of 50 boars with 525 sows producing 1621 piglets. The pigs were part of a population that was a mix of ¼ Duroc, ½ Landrace and ¼ Yorkshire breeds. Merging the overlapping CNVs that were observed in two or more individuals to form CNV regions (CNVRs) yielded 502 CNVRs across the autosomes. The CNVRs intersected genes, as defined by RefSeq, 84% of the time – 420 out of 502. The results of this study are compared and contrasted to other swine studies using similar and different methods of detecting CNVR. While progress is being made in this field, more work needs to be done to improve consistency and confidence in CNVR results. PMID:26172260

  8. Diversifying Selection Between Pure-Breed and Free-Breeding Dogs Inferred from Genome-Wide SNP Analysis

    PubMed Central

    Pilot, Małgorzata; Malewski, Tadeusz; Moura, Andre E.; Grzybowski, Tomasz; Oleński, Kamil; Kamiński, Stanisław; Fadel, Fernanda Ruiz; Alagaili, Abdulaziz N.; Mohammed, Osama B.; Bogdanowicz, Wiesław

    2016-01-01

    Domesticated species are often composed of distinct populations differing in the character and strength of artificial and natural selection pressures, providing a valuable model to study adaptation. In contrast to pure-breed dogs that constitute artificially maintained inbred lines, free-ranging dogs are typically free-breeding, i.e., unrestrained in mate choice. Many traits in free-breeding dogs (FBDs) may be under similar natural and sexual selection conditions to wild canids, while relaxation of sexual selection is expected in pure-breed dogs. We used a Bayesian approach with strict false-positive control criteria to identify FST-outlier SNPs between FBDs and either European or East Asian breeds, based on 167,989 autosomal SNPs. By identifying outlier SNPs located within coding genes, we found four candidate genes under diversifying selection shared by these two comparisons. Three of them are associated with the Hedgehog (HH) signaling pathway regulating vertebrate morphogenesis. A comparison between FBDs and East Asian breeds also revealed diversifying selection on the BBS6 gene, which was earlier shown to cause snout shortening and dental crowding via disrupted HH signaling. Our results suggest that relaxation of natural and sexual selection in pure-breed dogs as opposed to FBDs could have led to mild changes in regulation of the HH signaling pathway. HH inhibits adhesion and the migration of neural crest cells from the neural tube, and minor deficits of these cells during embryonic development have been proposed as the underlying cause of “domestication syndrome.” This suggests that the process of breed formation involved the same genetic and developmental pathways as the process of domestication. PMID:27233669

  9. P-value based analysis for shared controls design in genome-wide association studies.

    PubMed

    Zaykin, Dmitri V; Kozbur, Damian O

    2010-11-01

    An appealing genome-wide association study design compares one large control group against several disease samples. A pioneering study by the Wellcome Trust Case Control Consortium that employed such a design has identified multiple susceptibility regions, many of which have been independently replicated. While reusing a control sample provides effective utilization of data, it also creates correlation between association statistics across diseases. An observation of a large association statistic for one of the diseases may greatly increase chances of observing a spuriously large association for a different disease. Accounting for the correlation is also particularly important when screening for SNPs that might be involved in a set of diseases with overlapping etiology. We describe methods that correct association statistics for dependency due to shared controls, and we describe ways to obtain a measure of overall evidence and to combine association signals across multiple diseases. The methods we describe require no access to individual subject data, instead, they efficiently utilize information contained in P-values for association reported for individual diseases. P-value based combined tests for association are flexible and essentially as powerful as the approach based on aggregating the individual subject data. © 2010 Wiley-Liss, Inc.
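
    The correlation that shared controls induce between association statistics can be made concrete with a small sketch. It assumes a simple null model in which each disease's statistic is a standardized difference in allele frequency between its cases and the common control group; under that assumption the correlation between two such statistics depends only on the sample sizes. The combined statistic shown is a generic correlated z-score sum, offered as an illustration rather than as the method of the paper, and all function names are hypothetical.

```python
import math


def shared_control_corr(n_cases_a, n_cases_b, n_controls):
    """Null correlation between two case-control z statistics that
    reuse the same control sample (simple allele-frequency model)."""
    return math.sqrt(
        (n_cases_a * n_cases_b)
        / ((n_cases_a + n_controls) * (n_cases_b + n_controls))
    )


def combined_z(z_scores, corr):
    """Sum of z statistics, rescaled by the standard deviation of the sum.
    corr is the full correlation matrix as a list of lists."""
    variance_of_sum = sum(sum(row) for row in corr)
    return sum(z_scores) / math.sqrt(variance_of_sum)


def two_sided_p(z):
    """Two-sided P-value for a standard normal statistic."""
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical design: 2000 cases per disease, 3000 shared controls.
rho = shared_control_corr(2000, 2000, 3000)
z_naive = combined_z([2.0, 2.0], [[1.0, 0.0], [0.0, 1.0]])
z_adjusted = combined_z([2.0, 2.0], [[1.0, rho], [rho, 1.0]])
```

    Ignoring the correlation overstates the combined evidence: with these hypothetical sizes the adjusted statistic is smaller than the naive one, so its P-value is larger.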

  10. Genome-wide scan for visceral leishmaniasis in mixed-breed dogs identifies candidate genes involved in T helper cells and macrophage signaling

    USDA-ARS's Scientific Manuscript database

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....

  11. FAPI: Fast and accurate P-value Imputation for genome-wide association study.

    PubMed

    Kwan, Johnny S H; Li, Miao-Xin; Deng, Jia-En; Sham, Pak C

    2016-05-01

    Imputing individual-level genotypes (or genotype imputation) is now a standard procedure in genome-wide association studies (GWAS) to examine disease associations at untyped common genetic variants. Meta-analysis of publicly available GWAS summary statistics can allow more disease-associated loci to be discovered, but these data are usually provided for various variant sets. Thus imputing these summary statistics of different variant sets into a common reference panel for meta-analyses is impossible using traditional genotype imputation methods. Here we develop a fast and accurate P-value imputation (FAPI) method that utilizes summary statistics of common variants only. Its computational cost is linear with the number of untyped variants and has similar accuracy compared with IMPUTE2 with prephasing, one of the leading methods in genotype imputation. In addition, based on the FAPI idea, we develop a metric to detect abnormal association at a variant and showed that it had a significantly greater power compared with LD-PAC, a method that quantifies the evidence of spurious associations based on likelihood ratio. Our method is implemented in a user-friendly software tool, which is available at http://statgenpro.psychiatry.hku.hk/fapi.

  12. Genome-wide association mapping for yield and other agronomic traits in an elite breeding population of tropical rice (Oryza sativa).

    PubMed

    Begum, Hasina; Spindel, Jennifer E; Lalusin, Antonio; Borromeo, Teresita; Gregorio, Glenn; Hernandez, Jose; Virk, Parminder; Collard, Bertrand; McCouch, Susan R

    2015-01-01

    Genome-wide association mapping studies (GWAS) are frequently used to detect QTL in diverse collections of crop germplasm, based on historic recombination events and linkage disequilibrium across the genome. Generally, diversity panels genotyped with high density SNP panels are utilized in order to assay a wide range of alleles and haplotypes and to monitor recombination breakpoints across the genome. By contrast, GWAS have not generally been performed in breeding populations. In this study we performed association mapping for 19 agronomic traits including yield and yield components in a breeding population of elite irrigated tropical rice breeding lines so that the results would be more directly applicable to breeding than those from a diversity panel. The population was genotyped with 71,710 SNPs using genotyping-by-sequencing (GBS), and GWAS performed with the explicit goal of expediting selection in the breeding program. Using this breeding panel we identified 52 QTL for 11 agronomic traits, including large effect QTLs for flowering time and grain length/grain width/grain-length-breadth ratio. We also identified haplotypes that can be used to select plants in our population for short stature (plant height), early flowering time, and high yield, and thus demonstrate the utility of association mapping in breeding populations for informing breeding decisions. We conclude by exploring how the newly identified significant SNPs and insights into the genetic architecture of these quantitative traits can be leveraged to build genomic-assisted selection models.

  13. Genome-wide study of an elite rice pedigree reveals a complex history of genetic architecture for breeding improvement

    PubMed Central

    Chen, Shaoxia; Lin, Zechuan; Zhou, Degui; Wang, Chongrong; Li, Hong; Yu, Renbo; Deng, Hanchao; Tang, Xiaoyan; Zhou, Shaochuan; Wang Deng, Xing; He, Hang

    2017-01-01

    Improving breeding has been widely utilized in crop breeding and has contributed to yield and quality improvement, yet few studies have comprehensively analyzed the genetic architecture underlying breeding improvement. Here, we collected genotype and phenotype data of 99 cultivars from the complete pedigree including Huanghuazhan, an elite, high-quality, conventional indica rice that has been grown over 4.5 million hectares in southern China and from which more than 20 excellent cultivars have been derived. We identified 1,313 selective sweeps (SSWs) revealing four stage-specific selection patterns corresponding to improvement preference during 65 years, and 1,113 conserved Huanghuazhan traceable blocks (cHTBs) introduced from different donors and conserved in >3 breeding generations were the core genomic regions for superior performance of Huanghuazhan. Based on 151 quantitative trait loci (QTLs) identified for 13 improved traits in the pedigree, we reproduced their improvement process in silico, highlighting that improving breeding works well for traits controlled by major/major + minor effect QTLs, but is inefficient for traits controlled by QTLs with complex interactions or explaining low levels of phenotypic variation. These results indicate long-term breeding improvement is efficient to construct superior genetic architecture for elite performance, yet molecular breeding with designed genotype of QTLs can facilitate improvement of complex traits. PMID:28374863

  14. Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase.

    PubMed

    Dossa, Komivi; Yu, Jingyin; Liao, Boshou; Cisse, Ndiaga; Zhang, Xiurong

    2017-01-01

    The sequencing of the full nuclear genome of sesame (Sesamum indicum L.) provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellite markers or simple sequence repeats (SSR) in crop genotyping, genetics, and breeding applications is well established, little information exists concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primer pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78%) were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/), which provides free access to SSR data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.

  15. Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase

    PubMed Central

    Dossa, Komivi; Yu, Jingyin; Liao, Boshou; Cisse, Ndiaga; Zhang, Xiurong

    2017-01-01

    The sequencing of the full nuclear genome of sesame (Sesamum indicum L.) provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellite markers or simple sequence repeats (SSR) in crop genotyping, genetics, and breeding applications is well established, little information exists concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primer pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78%) were polymorphic between the three genomes thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/), which provides free access to SSR data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries. PMID:28878802

  16. Genome-Wide Analysis of the World's Sheep Breeds Reveals High Levels of Historic Mixture and Strong Recent Selection

    PubMed Central

    Kijas, James W.; Lenstra, Johannes A.; Hayes, Ben; Boitard, Simon; Porto Neto, Laercio R.; San Cristobal, Magali; Servin, Bertrand; McCulloch, Russell; Whan, Vicki; Gietzen, Kimberly; Paiva, Samuel; Barendse, William; Ciani, Elena; Raadsma, Herman; McEwan, John; Dalrymple, Brian

    2012-01-01

    Through their domestication and subsequent selection, sheep have been adapted to thrive in a diverse range of environments. To characterise the genetic consequence of both domestication and selection, we genotyped 49,034 SNP in 2,819 animals from a diverse collection of 74 sheep breeds. We find the majority of sheep populations contain high SNP diversity and have retained an effective population size much higher than most cattle or dog breeds, suggesting domestication occurred from a broad genetic base. Extensive haplotype sharing and generally low divergence time between breeds reveal frequent genetic exchange has occurred during the development of modern breeds. A scan of the genome for selection signals revealed 31 regions containing genes for coat pigmentation, skeletal morphology, body size, growth, and reproduction. We demonstrate the strongest selection signal has occurred in response to breeding for the absence of horns. The high density map of genetic variability provides an in-depth view of the genetic history for this important livestock species. PMID:22346734

  17. Breed-specific ancestry studies and genome-wide association analysis highlight an association between the MYH9 gene and heat tolerance in Alaskan sprint racing sled dogs.

    PubMed

    Huson, Heather J; vonHoldt, Bridgett M; Rimbault, Maud; Byers, Alexandra M; Runstadler, Jonathan A; Parker, Heidi G; Ostrander, Elaine A

    2012-02-01

    Alaskan sled dogs are a genetically distinct population shaped by generations of selective interbreeding with purebred dogs to create a group of high-performance athletes. As a result of selective breeding strategies, sled dogs present a unique opportunity to employ admixture-mapping techniques to investigate how breed composition and trait selection impact genomic structure. We used admixture mapping to investigate genetic ancestry across the genomes of two classes of sled dogs, sprint and long-distance racers, and combined that with genome-wide association studies (GWAS) to identify regions that correlate with performance-enhancing traits. The sled dog genome is enhanced by differential contributions from four non-admixed breeds (Alaskan Malamute, Siberian Husky, German Shorthaired Pointer, and Borzoi). A principal components analysis (PCA) of 115,000 genome-wide SNPs clearly resolved the sprint and distance populations as distinct genetic groups, with longer blocks of linkage disequilibrium (LD) observed in the distance versus sprint dogs (7.5-10 and 2.5-3.75 kb, respectively). Furthermore, we identified eight regions with the genomic signal from either a selective sweep or an association analysis, corroborated by an excess of ancestry when comparing sprint and distance dogs. A comparison of elite and poor-performing sled dogs identified a single region significantly associated with heat tolerance. Within the region we identified seven SNPs within the myosin heavy chain 9 gene (MYH9) that were significantly associated with heat tolerance in sprint dogs, two of which correspond to conserved promoter and enhancer regions in the human ortholog.

  18. Breed-Specific Ancestry Studies and Genome-Wide Association Analysis Highlight an Association Between the MYH9 Gene and Heat Tolerance in Alaskan Sprint Racing Sled Dogs

    PubMed Central

    Huson, Heather J.; vonHoldt, Bridgett M.; Rimbault, Maud; Byers, Alexandra M.; Runstadler, Jonathan A.; Parker, Heidi G.; Ostrander, Elaine A.

    2012-01-01

    Alaskan sled dogs are a genetically distinct population shaped by generations of selective interbreeding with purebred dogs to create a group of high performance athletes. As a result of selective breeding strategies, sled dogs present a unique opportunity to employ admixture-mapping techniques to investigate how breed composition and trait selection impact genomic structure. We used admixture mapping to investigate genetic ancestry across the genomes of two classes of sled dogs, sprint and long distance racers, and combined that with genome wide association studies (GWAS) to identify regions correlating with performance enhancing traits. The sled dog genome is enhanced by differential contributions from four non-admixed breeds (Alaskan Malamute, Siberian Husky, German Shorthaired Pointer, and Borzoi). A principal components analysis (PCA) of 115,000 genome-wide SNPs clearly resolved the sprint and distance populations as distinct genetic groups, with longer blocks of linkage disequilibrium (LD) observed in the distance versus sprint dogs (7.5–10 and 2.5–3.75 kb, respectively). Further, we identified eight regions with the genomic signal either from a selective sweep or an association analysis, corroborated by an excess of ancestry when comparing sprint and distance dogs. A comparison of elite and poor performing sled dogs identified a single region significantly associated with heat tolerance. Within the region we identified seven SNPs within the myosin heavy chain 9 gene (MYH9) that were significantly associated with heat tolerance in sprint dogs, two of which correspond to conserved promoter and enhancer regions in the human ortholog. PMID:22105876

  19. A Multi-Breed Genome-Wide Association Analysis for Canine Hypothyroidism Identifies a Shared Major Risk Locus on CFA12

    PubMed Central

    Massey, Jonathan; Dietschi, Elisabeth; Kierczak, Marcin; Lund-Ziener, Martine; Sundberg, Katarina; Thoresen, Stein Istre; Kämpe, Olle; Andersson, Göran; Ollier, William E. R.; Hedhammar, Åke; Leeb, Tosso; Lindblad-Toh, Kerstin; Kennedy, Lorna J.; Lingaas, Frode; Rosengren Pielberg, Gerli

    2015-01-01

    Hypothyroidism is a complex clinical condition found in both humans and dogs, thought to be caused by a combination of genetic and environmental factors. In this study we present a multi-breed analysis of predisposing genetic risk factors for hypothyroidism in dogs using three high-risk breeds—the Gordon Setter, Hovawart and the Rhodesian Ridgeback. Using a genome-wide association approach and meta-analysis, we identified a major hypothyroidism risk locus shared by these breeds on chromosome 12 (p = 2.1 × 10⁻¹¹). Further characterisation of the candidate region revealed a shared ~167 kb risk haplotype (4,915,018–5,081,823 bp), tagged by two SNPs in almost complete linkage disequilibrium. This breed-shared risk haplotype includes three genes (LHFPL5, SRPK1 and SLC26A8) and does not extend to the dog leukocyte antigen (DLA) class II gene cluster located in the vicinity. These three genes have not been identified as candidate genes for hypothyroid disease previously, but have functions that could potentially contribute to the development of the disease. Our results implicate the potential involvement of novel genes and pathways for the development of canine hypothyroidism, raising new possibilities for screening, breeding programmes and treatments in dogs. This study may also contribute to our understanding of the genetic etiology of human hypothyroid disease, which is one of the most common endocrine disorders in humans. PMID:26261983

  20. Bayes factors for genome-wide association studies: comparison with P-values.

    PubMed

    Wakefield, Jon

    2009-01-01

    The Bayes factor is a summary measure that provides an alternative to the P-value for the ranking of associations, or the flagging of associations as "significant". We describe an approximate Bayes factor that is straightforward to use and is appropriate when sample sizes are large. We consider various choices of the prior on the effect size, including those that allow effect size to vary with the minor allele frequency (MAF) of the marker. An important contribution is the description of a specific prior that gives identical rankings between Bayes factors and P-values, providing a link between the two approaches, and allowing the implications of the use of P-values to be more easily understood. As a summary measure of noteworthiness P-values are difficult to calibrate since their interpretation depends on MAF and, crucially, on sample size. A consequence is that a consistent decision-making procedure using P-values requires a threshold for significance that reduces with sample size, contrary to common practice.
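    The abstract above does not state the formula, so the following is only a minimal sketch of the commonly cited form of the approximate Bayes factor, assuming a normal prior N(0, W) on the effect size and an estimate with sampling variance V. The function and parameter names (approximate_bayes_factor, beta_hat, se, prior_sd) are illustrative, not taken from the paper. Values above 1 favour the null hypothesis of no association.

```python
import math


def approximate_bayes_factor(beta_hat, se, prior_sd):
    """Approximate Bayes factor for H0 (no association) versus H1.

    Assumed form: ABF = sqrt((V + W) / V) * exp(-(z^2 / 2) * W / (V + W)),
    where V = se^2 is the sampling variance of the effect estimate,
    W = prior_sd^2 is the prior variance of the effect, and z = beta_hat / se.
    """
    V = se ** 2
    W = prior_sd ** 2
    z = beta_hat / se
    shrinkage = W / (V + W)
    return math.sqrt((V + W) / V) * math.exp(-0.5 * z * z * shrinkage)


# Same z-score (about 4) at two precisions: the smaller standard error
# (larger sample) gives a larger ABF, i.e. weaker evidence against H0.
abf_small_study = approximate_bayes_factor(beta_hat=0.4, se=0.1, prior_sd=0.2)
abf_large_study = approximate_bayes_factor(beta_hat=0.04, se=0.01, prior_sd=0.2)
print(abf_small_study, abf_large_study)
```

    Under these assumptions, a fixed z-score (and hence a fixed P-value) carries less evidence against the null as the standard error shrinks, which is consistent with the abstract's point that a consistent P-value threshold should decrease with sample size.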

  1. Genome-wide assessment of worldwide chicken SNP genetic diversity indicates significant absence of rare alleles in commercial breeds

    USDA-ARS's Scientific Manuscript database

    Breed utilization, genetic improvement, and industry consolidation are predicted to have major impacts on the genetic composition of commercial chickens. Consequently, the question arises as to whether sufficient genetic diversity remains within industry stocks to address future needs. With the ch...

  2. Genome-wide assessment of worldwide chicken SNP genetic diversity indicates significant absence of rare alleles in commercial breeds.

    PubMed

    Muir, William M; Wong, Gane Ka-Shu; Zhang, Yong; Wang, Jun; Groenen, Martien A M; Crooijmans, Richard P M A; Megens, Hendrik-Jan; Zhang, Huanmin; Okimoto, Ron; Vereijken, Addie; Jungerius, Annemieke; Albers, Gerard A A; Lawley, Cindy Taylor; Delany, Mary E; MacEachern, Sean; Cheng, Hans H

    2008-11-11

    Breed utilization, genetic improvement, and industry consolidation are predicted to have major impacts on the genetic composition of commercial chickens. Consequently, the question arises as to whether sufficient genetic diversity remains within industry stocks to address future needs. With the chicken genome sequence and more than 2.8 million single-nucleotide polymorphisms (SNPs), it is now possible to address biodiversity using a previously unattainable metric: missing alleles. To achieve this assessment, 2551 informative SNPs were genotyped on 2580 individuals, including 1440 commercial birds. The proportion of alleles lacking in commercial populations was assessed by (1) estimating the global SNP allele frequency distribution from a hypothetical ancestral population as a reference, then determining the portion of the distribution lost, and then (2) determining the relationship between allele loss and the inbreeding coefficient. The results indicate that 50% or more of the genetic diversity in ancestral breeds is absent in commercial pure lines. The missing genetic diversity resulted from the limited number of incorporated breeds. As such, hypothetically combining stocks within a company could recover only preexisting within-breed variability, but not more rare ancestral alleles. We establish that SNP weights act as sentinels of biodiversity and provide an objective assessment of the strains that are most valuable for preserving genetic diversity. This is the first experimental analysis investigating the extant genetic diversity of virtually an entire agricultural commodity. The methods presented are the first to characterize biodiversity in terms of allelic diversity and to objectively link rate of allele loss with the inbreeding coefficient.

  3. Genome-wide association and prediction of grain and semolina quality traits in durum wheat breeding populations

    USDA-ARS's Scientific Manuscript database

    Grain yield and semolina quality traits are essential selection criteria in durum wheat breeding. However, the high cost of phenotypic screening limits selection to only a small number of lines and to later generations. This leads to relatively low selection efficiency due to the advancement of undes...

  4. Genome-Wide SNP Markers Based on SLAF-Seq Uncover Breeding Traces in Rapeseed (Brassica napus L.)

    PubMed Central

    Zhou, Qinghong; Zhou, Can; Zheng, Wei; Mason, Annaliese S.; Fan, Shuying; Wu, Caijun; Fu, Donghui; Huang, Yingjin

    2017-01-01

    Single Nucleotide Polymorphisms (SNPs) are the most abundant and richest form of genomic polymorphism, and hence make highly favorable markers for genetic map construction and genome-wide association studies. In this study, a total of 300 rapeseed accessions (278 representative of Chinese germplasm, plus 22 outgroup accessions of different origins and ecotypes) were collected and sequenced using Specific-Locus Amplified Fragment Sequencing (SLAF-seq) technology, obtaining 660.25M reads with an average sequencing depth of 6.27× and a mean Q30 of 85.96%. Based on the 238,711 polymorphic SLAF tags, a total of 1,197,282 SNPs were discovered, and a subset of 201,817 SNPs with minor allele frequency >0.05 and integrity >0.8 were selected. Of these, 30,877 were designated SNP “hotspots,” and 41 SNP-rich genomic regions could be delineated, with 100 genes associated with plant resistance, vernalization response, and signal transduction detected in these regions. Subsequent analysis of genetic diversity, linkage disequilibrium (LD), and population structure in the 300 accessions was carried out based on the 201,817 SNPs. Nine subpopulations were observed based on the population structure analysis. Hierarchical clustering and principal component analysis divided the 300 varieties roughly in accordance with their ecotype origins. However, spring-type varieties were intermingled with semi-winter type varieties, indicating frequent hybridization between spring and semi-winter ecotypes in China. In addition, LD decay across the whole genome averaged 299 kb when r² = 0.1, but the LD decay in the A genome (43 kb) was much shorter than in the C genome (1,455 kb), supporting the targeted introgression of the A genome from progenitor species B. rapa into Chinese rapeseed. This study also lays the foundation for genetic analysis of important agronomic traits using this rapeseed population. PMID:28503182
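    The SNP selection step described above (minor allele frequency >0.05 and integrity >0.8) can be illustrated with a small sketch. It assumes genotypes coded as 0, 1 or 2 copies of the alternate allele with None for a missing call, and it assumes "integrity" means the fraction of accessions with a called genotype. The function names are hypothetical, and this is not the authors' pipeline.

```python
def maf_and_integrity(genotypes):
    """Return (minor allele frequency, integrity) for one SNP.

    genotypes: list of 0/1/2 alternate-allele counts, None for missing calls.
    Integrity is taken here to be the fraction of non-missing calls (assumption).
    """
    called = [g for g in genotypes if g is not None]
    integrity = len(called) / len(genotypes)
    if not called:
        return 0.0, integrity
    alt_freq = sum(called) / (2 * len(called))  # diploid: two alleles per call
    return min(alt_freq, 1.0 - alt_freq), integrity


def keep_snp(genotypes, min_maf=0.05, min_integrity=0.8):
    """Apply the two thresholds quoted in the abstract (strict inequalities)."""
    maf, integrity = maf_and_integrity(genotypes)
    return maf > min_maf and integrity > min_integrity


# Toy data for three SNPs across ten accessions (made up for illustration).
snps = {
    "snp_a": [0, 1, 2, 1, None, 0, 0, 1, 0, 0],          # common, well called
    "snp_b": [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],              # monomorphic
    "snp_c": [0, 1, None, None, None, 2, None, 1, 0, 1],  # too many missing
}
kept = [name for name, calls in snps.items() if keep_snp(calls)]
print(kept)
```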

  5. Genome-wide assessment of worldwide chicken SNP genetic diversity indicates significant absence of rare alleles in commercial breeds

    PubMed Central

    Muir, William M.; Wong, Gane Ka-Shu; Zhang, Yong; Wang, Jun; Groenen, Martien A. M.; Crooijmans, Richard P. M. A.; Megens, Hendrik-Jan; Zhang, Huanmin; Okimoto, Ron; Vereijken, Addie; Jungerius, Annemieke; Albers, Gerard A. A.; Lawley, Cindy Taylor; Delany, Mary E.; MacEachern, Sean; Cheng, Hans H.

    2008-01-01

    Breed utilization, genetic improvement, and industry consolidation are predicted to have major impacts on the genetic composition of commercial chickens. Consequently, the question arises as to whether sufficient genetic diversity remains within industry stocks to address future needs. With the chicken genome sequence and more than 2.8 million single-nucleotide polymorphisms (SNPs), it is now possible to address biodiversity using a previously unattainable metric: missing alleles. To achieve this assessment, 2551 informative SNPs were genotyped on 2580 individuals, including 1440 commercial birds. The proportion of alleles lacking in commercial populations was assessed by (1) estimating the global SNP allele frequency distribution from a hypothetical ancestral population as a reference, then determining the portion of the distribution lost, and then (2) determining the relationship between allele loss and the inbreeding coefficient. The results indicate that 50% or more of the genetic diversity in ancestral breeds is absent in commercial pure lines. The missing genetic diversity resulted from the limited number of incorporated breeds. As such, hypothetically combining stocks within a company could recover only preexisting within-breed variability, but not more rare ancestral alleles. We establish that SNP weights act as sentinels of biodiversity and provide an objective assessment of the strains that are most valuable for preserving genetic diversity. This is the first experimental analysis investigating the extant genetic diversity of virtually an entire agricultural commodity. The methods presented are the first to characterize biodiversity in terms of allelic diversity and to objectively link rate of allele loss with the inbreeding coefficient. PMID:18981413

  6. Using an Inbred Horse Breed in a High Density Genome-Wide Scan for Genetic Risk Factors of Insect Bite Hypersensitivity (IBH).

    PubMed

    Velie, Brandon D; Shrestha, Merina; François, Liesbeth; Schurink, Anouk; Tesfayonas, Yohannes G; Stinckens, Anneleen; Blott, Sarah; Ducro, Bart J; Mikko, Sofia; Thomas, Ruth; Swinburne, June E; Sundqvist, Marie; Eriksson, Susanne; Buys, Nadine; Lindgren, Gabriella

    2016-01-01

    While susceptibility to hypersensitive reactions is a common problem amongst humans and animals alike, the population structure of certain animal species and breeds provides a more advantageous route to better understanding the biology underpinning these conditions. The current study uses Exmoor ponies, a highly inbred breed of horse known to frequently suffer from insect bite hypersensitivity, to identify genomic regions associated with a type I and type IV hypersensitive reaction. A total of 110 cases and 170 controls were genotyped on the 670K Axiom Equine Genotyping Array. Quality control resulted in 452,457 SNPs and 268 individuals being tested for association. Genome-wide association analyses were performed using the GenABEL package in R and resulted in the identification of two regions of interest on Chromosome 8. The first region contained the most significant SNP identified, which was located in an intron of the DCC netrin 1 receptor gene. The second region identified contained multiple top SNPs and encompassed the PIGN, KIAA1468, TNFRSF11A, ZCCHC2, and PHLPP1 genes. Although additional studies will be needed to validate the importance of these regions in horses and the relevance of these regions in other species, the knowledge gained from the current study has the potential to be a step forward in unraveling the complex nature of hypersensitive reactions.

  7. Using an Inbred Horse Breed in a High Density Genome-Wide Scan for Genetic Risk Factors of Insect Bite Hypersensitivity (IBH)

    PubMed Central

    Velie, Brandon D.; Shrestha, Merina; François, Liesbeth; Schurink, Anouk; Tesfayonas, Yohannes G.; Stinckens, Anneleen; Blott, Sarah; Ducro, Bart J.; Mikko, Sofia; Thomas, Ruth; Swinburne, June E.; Sundqvist, Marie; Eriksson, Susanne; Buys, Nadine; Lindgren, Gabriella

    2016-01-01

    While susceptibility to hypersensitive reactions is a common problem amongst humans and animals alike, the population structure of certain animal species and breeds provides a more advantageous route to better understanding the biology underpinning these conditions. The current study uses Exmoor ponies, a highly inbred breed of horse known to frequently suffer from insect bite hypersensitivity, to identify genomic regions associated with a type I and type IV hypersensitive reaction. A total of 110 cases and 170 controls were genotyped on the 670K Axiom Equine Genotyping Array. Quality control resulted in 452,457 SNPs and 268 individuals being tested for association. Genome-wide association analyses were performed using the GenABEL package in R and resulted in the identification of two regions of interest on Chromosome 8. The first region contained the most significant SNP identified, which was located in an intron of the DCC netrin 1 receptor gene. The second region identified contained multiple top SNPs and encompassed the PIGN, KIAA1468, TNFRSF11A, ZCCHC2, and PHLPP1 genes. Although additional studies will be needed to validate the importance of these regions in horses and the relevance of these regions in other species, the knowledge gained from the current study has the potential to be a step forward in unraveling the complex nature of hypersensitive reactions. PMID:27070818

  8. Genome-Wide Study of Structural Variants in Bovine Holstein, Montbéliarde and Normande Dairy Breeds

    PubMed Central

    Boussaha, Mekki; Esquerré, Diane; Barbieri, Johanna; Djari, Anis; Pinton, Alain; Letaief, Rabia; Salin, Gérald; Escudié, Frédéric; Roulet, Alain; Fritz, Sébastien; Samson, Franck; Grohs, Cécile; Bernard, Maria; Klopp, Christophe; Boichard, Didier; Rocha, Dominique

    2015-01-01

    High-throughput sequencing technologies have offered in recent years new opportunities to study genome variations. These studies have mostly focused on single nucleotide polymorphisms, small insertions or deletions and on copy number variants. Other structural variants, such as large insertions or deletions, tandem duplications, translocations, and inversions are less well-studied, even though some have an important impact on phenotypes. In the present study, we performed a large-scale survey of structural variants in cattle. We report the identification of 6,426 putative structural variants in cattle extracted from whole-genome sequence data of 62 bulls representing the three major French dairy breeds. These genomic variants affect DNA segments greater than 50 base pairs and correspond to deletions, inversions and tandem duplications. Out of these, we identified a total of 547 deletions and 410 tandem duplications which could potentially code for CNVs. Experimental validation was carried out on 331 structural variants using a novel high-throughput genotyping method. Out of these, 255 structural variants (77%) generated good quality genotypes and 191 (75%) of them were validated. Gene content analyses in structural variant regions revealed 941 large deletions completely removing one or several genes, including 10 single-copy genes. In addition, some of the structural variants are located within quantitative trait loci for dairy traits. This study is a pan-genome assessment of genomic variations in cattle and may provide a new glimpse into the bovine genome architecture. Our results may also help to study the effects of structural variants on gene expression and consequently their effect on certain phenotypes of interest. PMID:26317361

  9. Genome-Wide Study of Structural Variants in Bovine Holstein, Montbéliarde and Normande Dairy Breeds.

    PubMed

    Boussaha, Mekki; Esquerré, Diane; Barbieri, Johanna; Djari, Anis; Pinton, Alain; Letaief, Rabia; Salin, Gérald; Escudié, Frédéric; Roulet, Alain; Fritz, Sébastien; Samson, Franck; Grohs, Cécile; Bernard, Maria; Klopp, Christophe; Boichard, Didier; Rocha, Dominique

    2015-01-01

    High-throughput sequencing technologies have offered in recent years new opportunities to study genome variations. These studies have mostly focused on single nucleotide polymorphisms, small insertions or deletions and on copy number variants. Other structural variants, such as large insertions or deletions, tandem duplications, translocations, and inversions are less well-studied, even though some have an important impact on phenotypes. In the present study, we performed a large-scale survey of structural variants in cattle. We report the identification of 6,426 putative structural variants in cattle extracted from whole-genome sequence data of 62 bulls representing the three major French dairy breeds. These genomic variants affect DNA segments greater than 50 base pairs and correspond to deletions, inversions and tandem duplications. Out of these, we identified a total of 547 deletions and 410 tandem duplications which could potentially code for CNVs. Experimental validation was carried out on 331 structural variants using a novel high-throughput genotyping method. Out of these, 255 structural variants (77%) generated good quality genotypes and 191 (75%) of them were validated. Gene content analyses in structural variant regions revealed 941 large deletions completely removing one or several genes, including 10 single-copy genes. In addition, some of the structural variants are located within quantitative trait loci for dairy traits. This study is a pan-genome assessment of genomic variations in cattle and may provide a new glimpse into the bovine genome architecture. Our results may also help to study the effects of structural variants on gene expression and consequently their effect on certain phenotypes of interest.

  10. Genome-wide insertion–deletion (InDel) marker discovery and genotyping for genomics-assisted breeding applications in chickpea

    PubMed Central

    Das, Shouvik; Upadhyaya, Hari D.; Srivastava, Rishi; Bajaj, Deepak; Gowda, C.L.L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    We developed 21,499 genome-wide insertion–deletion (InDel) markers (2- to 54-bp in silico fragment length polymorphism) by comparing the genomic sequences of four (desi, kabuli and wild C. reticulatum) chickpea [Cicer arietinum (L.)] accessions. InDel markers showing 2- to 6-bp fragment length polymorphism among accessions were abundant (76.8%) in the chickpea genome. The physically mapped 7,643 and 13,856 markers on eight chromosomes and unanchored scaffolds, respectively, were structurally and functionally annotated. The 4,506 coding (23% large-effect frameshift mutations) and regulatory InDel markers were identified from 3,228 genes (representing 11.7% of total 27,571 desi genes), suggesting their functional relevance for trait association/genetic mapping. High amplification (97%) and intra-specific polymorphic (60–83%) potential and wider genetic diversity (15–89%) were detected by genome-wide 6,254 InDel markers among desi, kabuli and wild accessions using even a simpler cost-effective agarose gel-based assay. This signifies added advantages of this user-friendly genetic marker system for manifold large-scale genotyping applications in laboratories with limited infrastructure and resources. Utilizing 6,254 InDel markers-based high-density (inter-marker distance: 0.212 cM) inter-specific genetic linkage map (ICC 4958 × ICC 17160) of chickpea as a reference, three major genomic regions harboring six flowering and maturity time robust QTLs (16.4–27.5% phenotypic variation explained, 8.1–11.5 logarithm of odds) were identified. Integration of genetic and physical maps at these target QTL intervals mapped on three chromosomes delineated five InDel markers-containing candidate genes tightly linked to the QTLs governing flowering and maturity time in chickpea. Taken together, our study demonstrated the practical utility of developing and high-throughput genotyping of such beneficial InDel markers at a genome-wide scale to expedite genomics

  11. Genome-wide insertion-deletion (InDel) marker discovery and genotyping for genomics-assisted breeding applications in chickpea.

    PubMed

    Das, Shouvik; Upadhyaya, Hari D; Srivastava, Rishi; Bajaj, Deepak; Gowda, C L L; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K; Parida, Swarup K

    2015-10-01

    We developed 21,499 genome-wide insertion-deletion (InDel) markers (2- to 54-bp in silico fragment length polymorphism) by comparing the genomic sequences of four (desi, kabuli and wild C. reticulatum) chickpea [Cicer arietinum (L.)] accessions. InDel markers showing 2- to 6-bp fragment length polymorphism among accessions were abundant (76.8%) in the chickpea genome. The physically mapped 7,643 and 13,856 markers on eight chromosomes and unanchored scaffolds, respectively, were structurally and functionally annotated. The 4,506 coding (23% large-effect frameshift mutations) and regulatory InDel markers were identified from 3,228 genes (representing 11.7% of total 27,571 desi genes), suggesting their functional relevance for trait association/genetic mapping. High amplification (97%) and intra-specific polymorphic (60-83%) potential and wider genetic diversity (15-89%) were detected by genome-wide 6,254 InDel markers among desi, kabuli and wild accessions using even a simpler cost-effective agarose gel-based assay. This signifies added advantages of this user-friendly genetic marker system for manifold large-scale genotyping applications in laboratories with limited infrastructure and resources. Utilizing 6,254 InDel markers-based high-density (inter-marker distance: 0.212 cM) inter-specific genetic linkage map (ICC 4958 × ICC 17160) of chickpea as a reference, three major genomic regions harboring six flowering and maturity time robust QTLs (16.4-27.5% phenotypic variation explained, 8.1-11.5 logarithm of odds) were identified. Integration of genetic and physical maps at these target QTL intervals mapped on three chromosomes delineated five InDel markers-containing candidate genes tightly linked to the QTLs governing flowering and maturity time in chickpea. Taken together, our study demonstrated the practical utility of developing and high-throughput genotyping of such beneficial InDel markers at a genome-wide scale to expedite genomics-assisted breeding

  12. Efficient approximation of P-value of the maximum of correlated tests, with applications to genome-wide association studies.

    PubMed

    Li, Qizhai; Zheng, Gang; Li, Zhaohai; Yu, Kai

    2008-05-01

    Genome-wide association study (GWAS), typically involving 100,000 to 500,000 single-nucleotide polymorphisms (SNPs), is a powerful approach to identify disease susceptibility loci. In a GWAS, single-marker analysis, which tests one SNP at a time, is usually used as the first stage to screen SNPs across the genome in order to identify a small fraction of promising SNPs with relatively low p-values for further and more focused studies. For single-marker analysis, the trend test derived for an additive genetic model is often used. This may not be robust when the additive assumption is not appropriate for the true underlying disease model. A robust test, MAX, based on the maximum of three trend test statistics derived for recessive, additive, and dominant models, has been proposed recently for GWAS. But its p-value has to be evaluated through a resampling-based procedure, which is computationally challenging for the analysis of GWAS. Obtaining the p-value for MAX with adjustment for the covariates can be even more time-consuming. In this article, we provide a simple approximation for the p-value of the MAX test with or without adjusting for the covariates. The new method avoids resampling steps and thus makes the MAX test readily applicable to GWAS. We use simulation studies as well as real datasets on 17 confirmed disease-associated SNPs to assess the accuracy of the proposed method. We also apply the method to the GWAS of coronary artery disease.
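    To make the MAX statistic described above concrete, here is a minimal sketch that computes the three Cochran-Armitage trend statistics from a 2×3 table of genotype counts and takes the largest absolute value. The genotype scores (0, 0, 1), (0, 0.5, 1) and (0, 1, 1) for the recessive, additive and dominant models are the conventional choice and are an assumption here. The function names and counts are illustrative, and the paper's p-value approximation is not reproduced.

```python
import math

# Conventional genotype scores (0, x, 1) for 0, 1 and 2 copies of the risk allele.
SCORES = {
    "recessive": (0.0, 0.0, 1.0),
    "additive": (0.0, 0.5, 1.0),
    "dominant": (0.0, 1.0, 1.0),
}


def trend_z(cases, controls, scores):
    """Cochran-Armitage trend statistic for one set of genotype scores.

    cases, controls: genotype counts (n0, n1, n2) in each group.
    """
    R, S = sum(cases), sum(controls)
    N = R + S
    totals = [r + s for r, s in zip(cases, controls)]
    t = sum(x * (S * r - R * s) for x, r, s in zip(scores, cases, controls))
    sum_x2n = sum(x * x * n for x, n in zip(scores, totals))
    sum_xn = sum(x * n for x, n in zip(scores, totals))
    variance = (R * S / N) * (N * sum_x2n - sum_xn ** 2)
    return t / math.sqrt(variance)


def max_statistic(cases, controls):
    """Return (MAX, per-model z values), MAX being the largest |z|."""
    z = {model: trend_z(cases, controls, x) for model, x in SCORES.items()}
    return max(abs(v) for v in z.values()), z


# Made-up counts in which only the homozygous risk genotype is enriched in cases.
stat, z_by_model = max_statistic(cases=(40, 40, 20), controls=(45, 45, 10))
print(stat, z_by_model)
```

    Because the three statistics are computed on the same table they are correlated, so MAX cannot be referred to a standard normal distribution. That is why its p-value needs either resampling or an approximation such as the one the abstract proposes.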

  13. Cassava Breeding I: The Value of Breeding Value

    PubMed Central

    Ceballos, Hernán; Pérez, Juan C.; Joaqui Barandica, Orlando; Lenis, Jorge I.; Morante, Nelson; Calle, Fernando; Pino, Lizbeth; Hershey, Clair H.

    2016-01-01

    Breeding cassava relies on several selection stages (single row trial-SRT; preliminary; advanced; and uniform yield trials—UYT). This study uses data from 14 years of evaluations. From more than 20,000 genotypes initially evaluated only 114 reached the last stage. The objective was to assess how the data at SRT could be used to predict the probabilities of genotypes reaching the UYT. Phenotypic data from each genotype at SRT was integrated into the selection index (SIN) used by the cassava breeding program. Average SIN from all the progenies derived from each progenitor was then obtained. Average SIN is an approximation of the breeding value of each progenitor. Data clearly suggested that some genotypes were better progenitors than others (e.g., high number of their progenies reaching the UYT), suggesting important variation in breeding values of progenitors. However, regression of average SIN of each parental genotype on the number of their respective progenies reaching UYT resulted in a negligible coefficient of determination (r² = 0.05). Breeding value (e.g., average SIN) at SRT was not efficient in predicting which genotypes were more likely to reach the UYT stage. Number of families and progenies derived from a given progenitor were more efficient in predicting the probabilities of the progeny from a given parent reaching the UYT stage. Large within-family genetic variation tends to mask the true breeding value of each progenitor. The use of partially inbred progenitors (e.g., S1 or S2 genotypes) would reduce the within-family genetic variation thus making the assessment of breeding value more accurate. Moreover, partial inbreeding of progenitors can improve the breeding value of the original (S0) parental material and sharply accelerate genetic gains. For instance, homozygous S1 genotypes for the dominant resistance to cassava mosaic disease (CMD) could be generated and selected. All gametes from these selected S1 genotypes would carry the desirable allele and

  14. Genome-wide resequencing of KRICE_CORE reveals their potential for future breeding, as well as functional and evolutionary studies in the post-genomic era.

    PubMed

    Kim, Tae-Sung; He, Qiang; Kim, Kyu-Won; Yoon, Min-Young; Ra, Won-Hee; Li, Feng Peng; Tong, Wei; Yu, Jie; Oo, Win Htet; Choi, Buung; Heo, Eun-Beom; Yun, Byoung-Kook; Kwon, Soon-Jae; Kwon, Soon-Wook; Cho, Yoo-Hyun; Lee, Chang-Yong; Park, Beom-Seok; Park, Yong-Jin

    2016-05-26

    Rice germplasm collections continue to grow in number and size around the world. Since maintaining and screening such massive resources remains challenging, it is important to establish practical methods to manage them. A core collection, by definition, refers to a subset of the entire population that preserves the majority of genetic diversity, enhancing the efficiency of germplasm utilization. Here, we report whole-genome resequencing of the 137 rice mini core collection or Korean rice core set (KRICE_CORE) that represents 25,604 rice germplasms deposited in the Korean genebank of the Rural Development Administration (RDA). We used the Illumina HiSeq 2000 and 2500 platforms to produce short reads and then assembled them to a depth of 9.8× using Nipponbare as a reference. Comparisons of the sequences with the reference genome yielded more than 15 million (M) single nucleotide polymorphisms (SNPs) and 1.3 M INDELs. Phylogenetic and population analyses using 2,046,529 high-quality SNPs successfully assigned rice accessions to the relevant rice subgroups, suggesting that these SNPs capture evolutionary signatures that have accumulated in rice subpopulations. Furthermore, genome-wide association studies (GWAS) for four exemplary agronomic traits in the KRICE_CORE manifest the utility of KRICE_CORE; that is, identifying previously defined genes or novel genetic factors that potentially regulate important phenotypes. This study provides strong evidence that the size of KRICE_CORE is small but contains high genetic and functional diversity across the genome. Thus, our resequencing results will be useful for future breeding, as well as functional and evolutionary studies, in the post-genomic era.

  15. The complete mitochondrial genome of a purebred Tibetan Mastiff (Canis lupus familiaris breed Tibetan Mastiff) from Lijiang, China, and comparison of genome-wide sequence variations.

    PubMed

    Deng, Li Xin; He, Cong

    2016-01-01

    In this study, the complete mitochondrial genome sequence of the Tibetan Mastiff was reported. The total length of the mitogenome is 16,729 bp. It has the typical structure, including 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region, in line with other canine animals. We further identified genome-wide variations among different canine mitochondrial genomes and showed that the D-loop region harbors the most sequence variation, which will provide sequence variation information for the protection and utilization of the Tibetan Mastiff germplasm resource.

  16. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high-throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amounts of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  17. A new statistical approach to combining p-values using gamma distribution and its application to genome-wide association study

    PubMed Central

    2014-01-01

    Background Combining information from different studies is an important and useful practice in bioinformatics, including genome-wide association study, rare variant data analysis and other set-based analyses. Many statistical methods have been proposed to combine p-values from independent studies. However, it is known that there is no uniformly most powerful test under all conditions; therefore, finding a powerful test in a specific situation is important and desirable. Results In this paper, we propose a new statistical approach to combining p-values based on the gamma distribution, which uses the inverse of the p-value as the shape parameter in the gamma distribution. Conclusions Simulation study and real data application demonstrate that the proposed method has good performance under some situations. PMID:25559433
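
    As a point of comparison for the abstract above, the following is a minimal sketch of Fisher's method, one of the classical ways to combine p-values from independent studies. It is not the gamma-based method the paper proposes; the function name `fisher_combine` and the input values are illustrative assumptions. Under the null hypothesis, -2 ln(p) for each study follows a chi-square distribution with 2 degrees of freedom, so the sum over k studies follows a chi-square distribution with 2k degrees of freedom, whose survival function has a closed form.

```python
import math


def fisher_combine(p_values):
    """Combine independent p-values with Fisher's method.

    Returns (statistic, combined_p). This is the classical baseline,
    NOT the gamma-based method proposed in the paper.
    """
    if not p_values:
        raise ValueError("need at least one p-value")
    if any(not (0.0 < p <= 1.0) for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")

    k = len(p_values)
    statistic = -2.0 * sum(math.log(p) for p in p_values)

    # Survival function of a chi-square with 2k degrees of freedom:
    # exp(-x/2) * sum_{i=0}^{k-1} (x/2)^i / i!
    half = statistic / 2.0
    term = 1.0
    total = 1.0
    for i in range(1, k):
        term *= half / i
        total += term
    return statistic, math.exp(-half) * total


# Hypothetical p-values from three independent studies of the same SNP.
stat, p_combined = fisher_combine([0.04, 0.10, 0.03])
print(f"statistic = {stat:.3f}, combined p = {p_combined:.4g}")
```

    With a single input the combined p-value equals that input, which is a quick sanity check on the closed-form tail probability.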

  18. Genome-wide development and deployment of informative intron-spanning and intron-length polymorphism markers for genomics-assisted breeding applications in chickpea.

    PubMed

    Srivastava, Rishi; Bajaj, Deepak; Sayal, Yogesh K; Meher, Prabina K; Upadhyaya, Hari D; Kumar, Rajendra; Tripathi, Shailesh; Bharadwaj, Chellapilla; Rao, Atmakuri R; Parida, Swarup K

    2016-11-01

    The discovery and large-scale genotyping of informative gene-based markers is essential for rapid delineation of genes/QTLs governing stress tolerance and yield component traits in order to drive genetic enhancement in chickpea. A genome-wide 119169 and 110491 ISM (intron-spanning markers) from 23129 desi and 20386 kabuli protein-coding genes and 7454 in silico InDel (insertion-deletion) (1-45-bp)-based ILP (intron-length polymorphism) markers from 3283 genes were developed that were structurally and functionally annotated on eight chromosomes and unanchored scaffolds of chickpea. A much higher amplification efficiency (83%) and intra-specific polymorphic potential (86%) detected by these markers than that of other sequence-based genetic markers among desi and kabuli chickpea accessions was apparent even by a cost-effective agarose gel-based assay. The genome-wide physically mapped 1718 ILP markers assayed a wider level of functional genetic diversity (19-81%) and well-defined phylogenetics among domesticated chickpea accessions. The gene-derived 1424 ILP markers were anchored on a high-density (inter-marker distance: 0.65cM) desi intra-specific genetic linkage map/functional transcript map (ICC 4958×ICC 2263) of chickpea. This reference genetic map identified six major genomic regions harbouring six robust QTLs mapped on five chromosomes, which explained 11-23% seed weight trait variation (7.6-10.5 LOD) in chickpea. The integration of high-resolution QTL mapping with differential expression profiling detected six including one potential serine carboxypeptidase gene with ILP markers (linked tightly to the major seed weight QTLs) exhibiting seed-specific expression as well as pronounced up-regulation especially in seeds of high (ICC 4958) as compared to low (ICC 2263) seed weight mapping parental accessions. The marker information generated in the present study was made publicly accessible through a user-friendly web-resource, "Chickpea ISM-ILP Marker Database

  19. Breeding experiments and genome-wide association analysis elucidate two genetically different forms of non-syndromic congenital cleft lip and jaw in Vorderwald × Montbéliarde cattle.

    PubMed

    Reinartz, S; Distl, O

    2017-10-01

    Non-syndromic congenital cleft lip and jaw (CLJ) is a condition reported in Vorderwald × Montbéliarde cattle. The objective of the present study was to perform a genome-wide association study (GWAS) for 10 CLJ-affected and 50 unaffected Vorderwald × Montbéliarde cattle using the bovine Illumina high density bead chip to identify loci for this condition. Phenotypic classification of CLJ was based on a detailed recording of orofacial structures using computed tomography. A breeding experiment among CLJ-affected Vorderwald × Montbéliarde cattle and CLJ-affected Vorderwald × Montbéliarde cattle with unaffected Holsteins confirmed recessive inheritance and different loci for bilateral or left-sided versus right-sided CLJ. The GWAS for the five cases with right-sided CLJ gave a genome-wide signal on bovine chromosome (BTA) 29 at 16 Mb. For the four left-sided and one bilateral CLJ case, a genome-wide significant association was identified on BTA4 at 32 Mb. Two different loci are very likely to be involved in CLJ in Vorderwald × Montbéliarde cattle because experimental matings among affected cows and bulls with different types of CLJ did not result in CLJ-affected progeny, and in addition, two different loci were also found through GWAS and mapped on two different bovine chromosomes. Validation in 346 Vorderwald × Montbéliarde cattle for the highly associated SNPs on BTA4 and 29 gave ratios of 33/346 (0.095, BTA4) and 6/346 (0.017, BTA29) homozygous mutant genotypes. Further studies should elucidate the responsible mutations underlying the different types of CLJ in Vorderwald × Montbéliarde cattle. © 2017 Stichting International Foundation for Animal Genetics.

  20. Establishing an adjusted p-value threshold to control the family-wide type 1 error in genome wide association studies.

    PubMed

    Duggal, Priya; Gillanders, Elizabeth M; Holmes, Taura N; Bailey-Wilson, Joan E

    2008-10-31

    By assaying hundreds of thousands of single nucleotide polymorphisms, genome wide association studies (GWAS) allow for a powerful, unbiased review of the entire genome to localize common genetic variants that influence health and disease. Although it is widely recognized that some correction for multiple testing is necessary, in order to control the family-wide Type 1 Error in genetic association studies, it is not clear which method to utilize. One simple approach is to perform a Bonferroni correction using all n single nucleotide polymorphisms (SNPs) across the genome; however this approach is highly conservative and would "overcorrect" for SNPs that are not truly independent. Many SNPs fall within regions of strong linkage disequilibrium (LD) ("blocks") and should not be considered "independent". We proposed to approximate the number of "independent" SNPs by counting 1 SNP per LD block, plus all SNPs outside of blocks (interblock SNPs). We examined the effective number of independent SNPs for Genome Wide Association Study (GWAS) panels. In the CEPH Utah (CEU) population, by considering the interdependence of SNPs, we could reduce the total number of effective tests within the Affymetrix and Illumina SNP panels from 500,000 and 317,000 to 67,000 and 82,000 "independent" SNPs, respectively. For the Affymetrix 500 K and Illumina 317 K GWAS SNP panels we recommend using 10^-5, 10^-7 and 10^-8 and for the Phase II HapMap CEPH Utah and Yoruba populations we recommend using 10^-6, 10^-7 and 10^-9 as "suggestive", "significant" and "highly significant" p-value thresholds to properly control the family-wide Type 1 error. By approximating the effective number of independent SNPs across the genome we are able to 'correct' for a more accurate number of tests and therefore develop 'LD adjusted' Bonferroni corrected p-value thresholds that account for the interdependence of SNPs on well-utilized commercially available SNP "chips". These thresholds will serve as guides
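
    The arithmetic behind an LD-adjusted Bonferroni threshold can be sketched as below. The SNP counts are the ones quoted in the abstract; the family-wide error rate of 0.05 and the names `bonferroni_threshold` and `PANELS` are assumptions made for illustration, and the sketch does not reproduce how the authors rounded their recommended thresholds.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test p-value threshold that keeps the family-wide error at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


ALPHA = 0.05  # assumption: the abstract does not state the family-wide alpha

# panel name -> (SNPs on the panel, effective "independent" SNPs in CEU),
# both numbers as quoted in the abstract
PANELS = {
    "Affymetrix 500K": (500_000, 67_000),
    "Illumina 317K": (317_000, 82_000),
}

for name, (n_snps, n_effective) in PANELS.items():
    naive = bonferroni_threshold(ALPHA, n_snps)
    adjusted = bonferroni_threshold(ALPHA, n_effective)
    print(f"{name}: naive {naive:.2e}, LD-adjusted {adjusted:.2e}")
```

    Because the effective number of tests is smaller than the raw SNP count, the LD-adjusted threshold is always less stringent than the naive one for the same alpha.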

  1. Genome-wide differential expression of genes and small RNAs in testis of two different porcine breeds and at two different ages

    PubMed Central

    Li, Yao; Li, Jialian; Fang, Chengchi; Shi, Liang; Tan, Jiajian; Xiong, Yuanzhu; Bin Fan; Li, Changchun

    2016-01-01

    Published evidence indicates that small RNAs (sRNAs) and their target genes are involved in mammalian testicular development and spermatogenesis. However, their detailed molecular regulatory mechanisms remain largely unknown. In this study, we obtained a total of 10,716 mRNAs, 67 miRNAs and 16,953 piRNAs that were differentially expressed between LC and LW pig breeds or between the two sexual maturity stages. Of these, we identified 16 miRNAs and 28 target genes possibly related to spermatogenesis, and 14 miRNAs and 18 target genes probably associated with cell adhesion-related testis development. We also annotated 579 piRNAs that could potentially regulate cell death, nucleosome organization and other basic biological processes, which implies that those piRNAs might be involved in differences in sexual maturation. The integrated network analysis suggested that some differentially expressed genes were involved in spermatogenesis through the ECM–receptor interaction, focal adhesion, Wnt and PI3K–Akt signaling pathways, that some particular miRNAs have negative regulatory roles, and that some piRNAs have both positive and negative regulatory roles in testicular development. Our data provide novel insights into the molecular expression and regulation similarities and diversities of spermatogenesis and testicular development in different pig breeds at different stages of sexual maturity. PMID:27229484

  2. Genome-Wide Association Study among Four Horse Breeds Identifies a Common Haplotype Associated with In Vitro CD3+ T Cell Susceptibility/Resistance to Equine Arteritis Virus Infection ▿

    PubMed Central

    Go, Yun Young; Bailey, Ernest; Cook, Deborah G.; Coleman, Stephen J.; MacLeod, James N.; Chen, Kuey-Chu; Timoney, Peter J.; Balasuriya, Udeni B. R.

    2011-01-01

    Previously, we have shown that horses could be divided into susceptible and resistant groups based on an in vitro assay using dual-color flow cytometric analysis of CD3+ T cells infected with equine arteritis virus (EAV). Here, we demonstrate that the differences in in vitro susceptibility of equine CD3+ T lymphocytes to EAV infection have a genetic basis. To investigate the possible hereditary basis for this trait, we conducted a genome-wide association study (GWAS) to compare susceptible and resistant phenotypes. Testing of 267 DNA samples from four horse breeds that had a susceptible or a resistant CD3+ T lymphocyte phenotype using both Illumina Equine SNP50 BeadChip and Sequenom's MassARRAY system identified a common, genetically dominant haplotype associated with the susceptible phenotype in a region of equine chromosome 11 (ECA11), positions 49572804 to 49643932. The presence of a common haplotype indicates that the trait occurred in a common ancestor of all four breeds, suggesting that it may be segregated among other modern horse breeds. Biological pathway analysis revealed several cellular genes within this region of ECA11 encoding proteins associated with virus attachment and entry, cytoskeletal organization, and NF-κB pathways that may be associated with the trait responsible for the in vitro susceptibility/resistance of CD3+ T lymphocytes to EAV infection. The data presented in this study demonstrated a strong association of genetic markers with the trait, representing de facto proof that the trait is under genetic control. To our knowledge, this is the first GWAS of an equine infectious disease and the first GWAS of equine viral arteritis. PMID:21994447

  3. Comparison of molecular breeding values based on within- and across-breed training in beef cattle.

    PubMed

    Kachman, Stephen D; Spangler, Matthew L; Bennett, Gary L; Hanford, Kathryn J; Kuehn, Larry A; Snelling, Warren M; Thallman, R Mark; Saatchi, Mahdi; Garrick, Dorian J; Schnabel, Robert D; Taylor, Jeremy F; Pollak, E John

    2013-08-16

    Although the efficacy of genomic predictors based on within-breed training looks promising, it is necessary to develop and evaluate across-breed predictors for the technology to be fully applied in the beef industry. The efficacies of genomic predictors trained in one breed and utilized to predict genetic merit in differing breeds based on simulation studies have been reported, as have the efficacies of predictors trained using data from multiple breeds to predict the genetic merit of purebreds. However, comparable studies using beef cattle field data have not been reported. Molecular breeding values for weaning and yearling weight were derived and evaluated using a database containing BovineSNP50 genotypes for 7294 animals from 13 breeds in the training set and 2277 animals from seven breeds (Angus, Red Angus, Hereford, Charolais, Gelbvieh, Limousin, and Simmental) in the evaluation set. Six single-breed and four across-breed genomic predictors were trained using pooled data from purebred animals. Molecular breeding values were evaluated using field data, including genotypes for 2227 animals and phenotypic records of animals born in 2008 or later. Accuracies of molecular breeding values were estimated based on the genetic correlation between the molecular breeding value and trait phenotype. With one exception, the estimated genetic correlations of within-breed molecular breeding values with trait phenotype were greater than 0.28 when evaluated in the breed used for training. Most estimated genetic correlations for the across-breed trained molecular breeding values were moderate (> 0.30). When molecular breeding values were evaluated in breeds that were not in the training set, estimated genetic correlations clustered around zero. Even for closely related breeds, within- or across-breed trained molecular breeding values have limited prediction accuracy for breeds that were not in the training set. For breeds in the training set, across- and within-breed trained

  4. Domestic estimated breeding values and genomic enhanced breeding values of bulls in comparison with their foreign genomic enhanced breeding values.

    PubMed

    Přibyl, J; Bauer, J; Čermák, V; Pešek, P; Přibylová, J; Šplíchal, J; Vostrá-Vydrová, H; Vostrý, L; Zavadilová, L

    2015-10-01

    Estimated breeding values (EBVs) and genomic enhanced breeding values (GEBVs) for milk production of young genotyped Holstein bulls were predicted using a conventional BLUP - Animal Model, a method fitting regression coefficients for loci (RRBLUP), a method utilizing the realized genomic relationship matrix (GBLUP), by a single-step procedure (ssGBLUP) and by a one-step blending procedure. Information sources for prediction were the nation-wide database of domestic Czech production records in the first lactation combined with deregressed proofs (DRP) from Interbull files (August 2013) and domestic test-day (TD) records for the first three lactations. Data from 2627 genotyped bulls were used, of which 2189 were already proven under domestic conditions. Analyses were run that used Interbull values for genotyped bulls only or that used Interbull values for all available sires. Resultant predictions were compared with GEBV of 96 young foreign bulls evaluated abroad and whose proofs were from Interbull method GMACE (August 2013) on the Czech scale. Correlations of predictions with GMACE values of foreign bulls ranged from 0.33 to 0.75. Combining domestic data with Interbull EBVs improved prediction of both EBV and GEBV. Predictions by Animal Model (traditional EBV) using only domestic first lactation records and GMACE values were correlated by only 0.33. Combining the nation-wide domestic database with all available DRP for genotyped and un-genotyped sires from Interbull resulted in an EBV correlation of 0.60, compared with 0.47 when only Interbull data were used. In all cases, GEBVs had higher correlations than traditional EBVs, and the highest correlations were for predictions from the ssGBLUP procedure using combined data (0.75), or with all available DRP from Interbull records only (one-step blending approach, 0.69). The ssGBLUP predictions using the first three domestic lactation records in the TD model were correlated with GMACE predictions by 0.69, 0.64 and 0

  5. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia.

    PubMed

    Iwata, Hiroyoshi; Hayashi, Takeshi; Terakami, Shingo; Takada, Norio; Sawamura, Yutaka; Yamamoto, Toshiya

    2013-03-01

    Although the potential of marker-assisted selection (MAS) in fruit tree breeding has been reported, bi-parental QTL mapping before MAS has hindered the introduction of MAS to fruit tree breeding programs. Genome-wide association studies (GWAS) are an alternative to bi-parental QTL mapping in long-lived perennials. Selection based on genomic predictions of breeding values (genomic selection: GS) is another alternative for MAS. This study examined the potential of GWAS and GS in pear breeding with 76 Japanese pear cultivars to detect significant associations of 162 markers with nine agronomic traits. We applied multilocus Bayesian models accounting for ordinal categorical phenotypes for GWAS and GS model training. Significant associations were detected at harvest time, black spot resistance and the number of spurs and two of the associations were closely linked to known loci. Genome-wide predictions for GS were accurate at the highest level (0.75) in harvest time, at medium levels (0.38-0.61) in resistance to black spot, firmness of flesh, fruit shape in longitudinal section, fruit size, acid content and number of spurs and at low levels (<0.2) in all soluble solid content and vigor of tree. Results suggest the potential of GWAS and GS for use in future breeding programs in Japanese pear.

  6. Genome-wide association study for semen quality traits in German Warmblood stallions.

    PubMed

    Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

    2016-08-01

    We performed a genome-wide association study for semen quality traits in 139 German Warmblood stallions. Stallions were genotyped using the Illumina equine SNP50 Beadchip. Traits analysed were de-regressed estimated breeding values (EBVs) for gel-free volume, sperm concentration, total number of sperm, progressive motility and the total number of progressively motile sperm. The GWAS revealed 29 SNPs on 12 different chromosomes as genome-wide significantly associated with semen quality traits. For ten genomic regions we could retrieve candidate genes influencing stallion fertility. Among the candidate genes, we could find the genes encoding cysteine-rich secretory proteins (CRISP1, CRISP2 and CRISP3). This was the first GWAS in horses performed for semen quality traits.

  7. Accuracies of genomically estimated breeding values from pure-breed and across-breed predictions in Australian beef cattle.

    PubMed

    Boerner, Vinzent; Johnston, David J; Tier, Bruce

    2014-10-24

    The major obstacles for the implementation of genomic selection in Australian beef cattle are the variety of breeds and in general, small numbers of genotyped and phenotyped individuals per breed. The Australian Beef Cooperative Research Center (Beef CRC) investigated these issues by deriving genomic prediction equations (PE) from a training set of animals that covers a range of breeds and crosses including Angus, Murray Grey, Shorthorn, Hereford, Brahman, Belmont Red, Santa Gertrudis and Tropical Composite. This paper presents accuracies of genomically estimated breeding values (GEBV) that were calculated from these PE in the commercial pure-breed beef cattle seed stock sector. PE derived by the Beef CRC from multi-breed and pure-breed training populations were applied to genotyped Angus, Limousin and Brahman sires and young animals, but with no pure-breed Limousin in the training population. The accuracy of the resulting GEBV was assessed by their genetic correlation to their phenotypic target trait in a bi-variate REML approach that models GEBV as trait observations. Accuracies of most GEBV for Angus and Brahman were between 0.1 and 0.4, with accuracies for abattoir carcass traits generally greater than for live animal body composition traits and reproduction traits. Estimated accuracies greater than 0.5 were only observed for Brahman abattoir carcass traits and for Angus carcass rib fat. Averaged across traits within breeds, accuracies of GEBV were highest when PE from the pooled across-breed training population were used. However, for the Angus and Brahman breeds the difference in accuracy from using pure-breed PE was small. For the Limousin breed no reasonable results could be achieved for any trait. Although accuracies were generally low compared to published accuracies estimated within breeds, they are in line with those derived in other multi-breed populations. Thus PE developed by the Beef CRC can contribute to the implementation of genomic selection in

  8. Compression distance can discriminate animals by genetic profile, build relationship matrices and estimate breeding values.

    PubMed

    Hudson, Nicholas J; Porto-Neto, Laercio; Kijas, James W; Reverter, Antonio

    2015-10-13

    Genetic relatedness is currently estimated by a combination of traditional pedigree-based approaches (i.e. numerator relationship matrices, NRM) and, given the recent availability of molecular information, using marker genotypes (via genomic relationship matrices, GRM). To date, GRM are computed by genome-wide pair-wise SNP (single nucleotide polymorphism) correlations. We describe a new estimate of genetic relatedness using the concept of normalised compression distance (NCD) that is borrowed from Information Theory. Analogous to GRM, the resultant compression relationship matrix (CRM) exploits numerical patterns in genome-wide allele order and proportion, which are known to vary systematically with relatedness. We explored properties of the CRM in two industry cattle datasets by analysing the genetic basis of yearling weight, a phenotype of moderate heritability. In both Brahman (Bos indicus) and Tropical Composite (Bos taurus by Bos indicus) populations, the clustering inferred by NCD was comparable to that based on SNP correlations using standard principal component analysis approaches. One of the versions of the CRM modestly increased the amount of explained genetic variance, slightly reduced the 'missing heritability' and tended to improve the prediction accuracy of breeding values in both populations when compared to both NRM and GRM. Finally, a sliding window-based application of the compression approach on these populations identified genomic regions influenced by introgression of taurine haplotypes. For these two bovine populations, CRM reduced the missing heritability and increased the amount of explained genetic variation for a moderately heritable complex trait. Given that NCD can sensitively discriminate closely related individuals, we foresee CRM having possible value for estimating breeding values in highly inbred populations.
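
    The normalised compression distance mentioned above has a standard definition, NCD(x, y) = (C(xy) - min(C(x), C(y))) / max(C(x), C(y)), where C is the compressed size. The sketch below illustrates it on simulated genotype strings. The choice of zlib as the compressor, the 0/1/2 genotype coding, and the simulated animals are all assumptions for illustration; the abstract does not say which compressor or encoding the authors used.

```python
import random
import zlib


def compressed_size(data: bytes) -> int:
    """Size in bytes of the zlib-compressed input (compressor is an assumption)."""
    return len(zlib.compress(data, 9))


def ncd(x: bytes, y: bytes) -> float:
    """Normalised compression distance between two byte strings."""
    cx, cy = compressed_size(x), compressed_size(y)
    cxy = compressed_size(x + y)
    return (cxy - min(cx, cy)) / max(cx, cy)


# Simulated genotypes coded as allele counts 0/1/2 at 2000 hypothetical SNPs.
rng = random.Random(0)
n_snps = 2000
base = [rng.choice("012") for _ in range(n_snps)]

# A "relative" that differs from the base animal at 1% of the markers.
relative = base.copy()
for i in rng.sample(range(n_snps), 20):
    relative[i] = rng.choice([g for g in "012" if g != base[i]])

# An "unrelated" animal drawn independently.
unrelated = [rng.choice("012") for _ in range(n_snps)]

animal_a = "".join(base).encode()
animal_b = "".join(relative).encode()
animal_c = "".join(unrelated).encode()

print("NCD(a, relative)  =", round(ncd(animal_a, animal_b), 3))
print("NCD(a, unrelated) =", round(ncd(animal_a, animal_c), 3))
```

    Computing `ncd` for every pair of animals would give a distance matrix; how the authors turn such distances into their compression relationship matrix is not described in the abstract and is not attempted here.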

  9. Genome-association analysis of Korean Holstein milk traits using genomic estimated breeding value

    PubMed Central

    Shin, Donghyun; Lee, Chul; Park, Kyoung-Do; Kim, Heebal; Cho, Kwang-hyeon

    2017-01-01

    Objective Holsteins are known as the world's highest milk-producing dairy cattle. The purpose of this study was to identify genetic regions strongly associated with milk traits (milk production, fat, and protein) using Korean Holstein data. Methods This study was performed using single nucleotide polymorphism (SNP) chip data (Illumina BovineSNP50 Beadchip) of 911 Korean Holstein individuals. We inferred genomic estimated breeding values based on best linear unbiased prediction (BLUP) and ridge regression using BLUPF90 and R. We then performed a genome-wide association study and identified genetic regions related to milk traits. Results We identified 9, 6, and 17 significant genetic regions related to milk production, fat and protein, respectively. These genes are newly reported as associated with milk traits in Holsteins. Conclusion This study complements recent Holstein genome-wide association studies that identified other SNPs and genes as the most significant variants. These results will help to expand the knowledge of the polygenic nature of milk production in Holsteins. PMID:26954162
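
    To make the ridge-regression step concrete, here is a minimal sketch of how marker effects and genomic estimated breeding values can be obtained by solving (Z'Z + λI)b = Z'y, where Z holds centred genotypes and y holds phenotypes. It is a toy illustration in plain Python, not the BLUPF90 and R pipeline the authors used; the function names, the penalty λ, and the small data set are hypothetical.

```python
def center_columns(Z):
    """Subtract each marker's mean genotype so columns have mean zero."""
    n, m = len(Z), len(Z[0])
    means = [sum(row[j] for row in Z) / n for j in range(m)]
    return [[row[j] - means[j] for j in range(m)] for row in Z]


def ridge_marker_effects(Z, y, lam):
    """Solve (Z'Z + lam*I) b = Z'y by Gaussian elimination with pivoting."""
    n, m = len(Z), len(Z[0])
    A = [[sum(Z[k][i] * Z[k][j] for k in range(n)) + (lam if i == j else 0.0)
          for j in range(m)] for i in range(m)]
    rhs = [sum(Z[k][i] * y[k] for k in range(n)) for i in range(m)]

    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        rhs[col], rhs[pivot] = rhs[pivot], rhs[col]
        for r in range(col + 1, m):
            f = A[r][col] / A[col][col]
            for c in range(col, m):
                A[r][c] -= f * A[col][c]
            rhs[r] -= f * rhs[col]

    b = [0.0] * m
    for i in reversed(range(m)):
        s = sum(A[i][j] * b[j] for j in range(i + 1, m))
        b[i] = (rhs[i] - s) / A[i][i]
    return b


def gebv(Z, b):
    """Breeding value of each animal: sum of genotype times marker effect."""
    return [sum(z * bj for z, bj in zip(row, b)) for row in Z]


# Hypothetical data: 5 animals, 3 SNPs coded 0/1/2, centred phenotypes.
genotypes = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [1, 2, 1], [0, 0, 2]]
phenotypes = [-1.2, 0.3, 1.1, 0.4, -0.6]
Zc = center_columns(genotypes)
effects = ridge_marker_effects(Zc, phenotypes, lam=1.0)  # lam is an assumption
print("marker effects:", [round(e, 3) for e in effects])
print("GEBV:", [round(g, 3) for g in gebv(Zc, effects)])
```

    The penalty λ shrinks all marker effects towards zero; in practice it is tied to the ratio of residual to marker variance, which this sketch does not estimate.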

  10. Gaussian covariance graph models accounting for correlated marker effects in genome-wide prediction.

    PubMed

    Martínez, C A; Khare, K; Rahman, S; Elzo, M A

    2017-10-01

    Several statistical models used in genome-wide prediction assume uncorrelated marker allele substitution effects, but it is known that these effects may be correlated. In statistics, graphical models have been identified as a useful tool for covariance estimation in high-dimensional problems, an area that has recently expanded greatly. In Gaussian covariance graph models (GCovGM), the joint distribution of a set of random variables is assumed to be Gaussian and the pattern of zeros of the covariance matrix is encoded in terms of an undirected graph G. In this study, methods adapting the theory of GCovGM to genome-wide prediction were developed (Bayes GCov, Bayes GCov-KR and Bayes GCov-H). In simulated data sets, improvements in correlation between phenotypes and predicted breeding values and accuracies of predicted breeding values were found. Our models account for correlation of marker effects and can accommodate general structures, as opposed to models proposed in previous studies, which consider spatial correlation only. In addition, they allow incorporation of biological information in the prediction process through its use when constructing graph G, and their extension to the multi-allelic loci case is straightforward. © 2017 Blackwell Verlag GmbH.

  11. Genome-wide association analysis for feed efficiency in Angus cattle

    PubMed Central

    Rolf, M M; Taylor, J F; Schnabel, R D; McKay, S D; McClure, M C; Northcutt, S L; Kerley, M S; Weaber, R L

    2012-01-01

    Estimated breeding values for average daily feed intake (AFI; kg/day), residual feed intake (RFI; kg/day) and average daily gain (ADG; kg/day) were generated using a mixed linear model incorporating genomic relationships for 698 Angus steers genotyped with the Illumina BovineSNP50 assay. Association analyses of estimated breeding values (EBVs) were performed for 41 028 single nucleotide polymorphisms (SNPs), and permutation analysis was used to empirically establish the genome-wide significance threshold (P < 0.05) for each trait. SNPs significantly associated with each trait were used in a forward selection algorithm to identify genomic regions putatively harbouring genes with effects on each trait. A total of 53, 66 and 68 SNPs explained 54.12% (24.10%), 62.69% (29.85%) and 55.13% (26.54%) of the additive genetic variation (when accounting for the genomic relationships) in steer breeding values for AFI, RFI and ADG, respectively, within this population. Evaluation by pathway analysis revealed that many of these SNPs are in genomic regions that harbour genes with metabolic functions. The presence of genetic correlations between traits resulted in 13.2% of SNPs selected for AFI and 4.5% of SNPs selected for RFI also being selected for ADG in the analysis of breeding values. While our study identifies panels of SNPs significant for efficiency traits in our population, validation of all SNPs in independent populations will be necessary before commercialization. PMID:22497295

  12. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    USDA-ARS's Scientific Manuscript database

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  13. Methods to estimate breeding values in honey bees.

    PubMed

    Brascamp, Evert W; Bijma, Piter

    2014-09-19

    Efficient methodologies based on animal models are widely used to estimate breeding values in farm animals. These methods are not applicable in honey bees because of their mode of reproduction. Observations are recorded on colonies, which consist of a single queen and thousands of workers that descended from the queen mated to 10 to 20 drones. Drones are haploid and sperms are copies of a drone's genotype. As a consequence, Mendelian sampling terms of full-sibs are correlated, such that the covariance matrix of Mendelian sampling terms is not diagonal. In this paper, we show how the numerator relationship matrix and its inverse can be obtained for honey bee populations. We present algorithms to derive the covariance matrix of Mendelian sampling terms that accounts for correlated terms. The resulting matrix is a block-diagonal matrix, with a small block for each full-sib family, and is easy to invert numerically. The method allows incorporating the within-colony distribution of progeny from drone-producing queens and drones, such that estimates of breeding values weigh information from relatives appropriately. Simulation shows that the resulting estimated breeding values are unbiased predictors of true breeding values. Benefits for response to selection, compared to an existing approximate method, appear to be limited (~5%). Benefits may however be greater when estimating genetic parameters. This work shows how the relationship matrix and its inverse can be developed for honey bee populations, and used to estimate breeding values and variance components.

  14. Genome-Wide Detection of CNVs and Their Association with Meat Tenderness in Nelore Cattle.

    PubMed

    Silva, Vinicius Henrique da; Regitano, Luciana Correia de Almeida; Geistlinger, Ludwig; Pértille, Fábio; Giachetto, Poliana Fernanda; Brassaloti, Ricardo Augusto; Morosini, Natália Silva; Zimmer, Ralf; Coutinho, Luiz Lehmann

    2016-01-01

    Brazil is one of the largest beef producers and exporters in the world with the Nelore breed representing the vast majority of Brazilian cattle (Bos taurus indicus). Despite the great adaptability of the Nelore breed to tropical climate, meat tenderness (MT) remains to be improved. Several factors including genetic composition can influence MT. In this article, we report a genome-wide analysis of copy number variation (CNV) inferred from Illumina® High Density SNP-chip data for a Nelore population of 723 males. We detected >2,600 CNV regions (CNVRs) representing ≈6.5% of the genome. Comparing our results with previous studies revealed an overlap in ≈1400 CNVRs (>50%). A total of 1,155 CNVRs (43.6%) overlapped 2,750 genes. They were enriched for processes involving guanosine triphosphate (GTP), previously reported to influence skeletal muscle physiology and morphology. Nelore CNVRs also overlapped QTLs for MT reported in other breeds (8.9%, 236 CNVRs) and from a previous study with this population (4.1%, 109 CNVRs). Two CNVRs were also proximal to glutathione metabolism genes that were previously associated with MT. Genome-wide association study of CN state with estimated breeding values derived from meat shear force identified 6 regions, including a region on BTA3 that contains genes of the cAMP and cGMP pathway. Ten CNVRs that overlapped regions associated with MT were successfully validated by qPCR. Our results represent the first comprehensive CNV study in Bos taurus indicus cattle and identify regions in which copy number changes are potentially of importance for the MT phenotype.

  15. Genome-Wide Detection of CNVs and Their Association with Meat Tenderness in Nelore Cattle

    PubMed Central

    da Silva, Vinicius Henrique; Regitano, Luciana Correia de Almeida; Geistlinger, Ludwig; Pértille, Fábio; Morosini, Natália Silva; Zimmer, Ralf; Coutinho, Luiz Lehmann

    2016-01-01

    Brazil is one of the largest beef producers and exporters in the world with the Nelore breed representing the vast majority of Brazilian cattle (Bos taurus indicus). Despite the great adaptability of the Nelore breed to tropical climate, meat tenderness (MT) remains to be improved. Several factors including genetic composition can influence MT. In this article, we report a genome-wide analysis of copy number variation (CNV) inferred from Illumina® High Density SNP-chip data for a Nelore population of 723 males. We detected >2,600 CNV regions (CNVRs) representing ≈6.5% of the genome. Comparing our results with previous studies revealed an overlap in ≈1400 CNVRs (>50%). A total of 1,155 CNVRs (43.6%) overlapped 2,750 genes. They were enriched for processes involving guanosine triphosphate (GTP), previously reported to influence skeletal muscle physiology and morphology. Nelore CNVRs also overlapped QTLs for MT reported in other breeds (8.9%, 236 CNVRs) and from a previous study with this population (4.1%, 109 CNVRs). Two CNVRs were also proximal to glutathione metabolism genes that were previously associated with MT. Genome-wide association study of CN state with estimated breeding values derived from meat shear force identified 6 regions, including a region on BTA3 that contains genes of the cAMP and cGMP pathway. Ten CNVRs that overlapped regions associated with MT were successfully validated by qPCR. Our results represent the first comprehensive CNV study in Bos taurus indicus cattle and identify regions in which copy number changes are potentially of importance for the MT phenotype. PMID:27348523

  16. Multibreed analysis by splitting the breeding values

    PubMed Central

    García-Cortés, Luis Alberto; Toro, Miguel Ángel

    2006-01-01

    An equivalent model for multibreed variance covariance estimation is presented. It considers the additive case including or not the segregation variances. The model is based on splitting the additive genetic values in several independent parts depending on their genetic origin. For each part, it expresses the covariance between relatives as a partial numerator relationship matrix times the corresponding variance component. Estimation of fixed effects, random effects or variance components provided by the model are as simple as any model including several random factors. We present a small example describing the mixed model equations for genetic evaluations and two simulated examples to illustrate the Bayesian variance component estimation. PMID:17129562

  17. Genome-wide association mapping in plants.

    PubMed

    George, Andrew W; Cavanagh, Colin

    2015-06-01

    We present new association mapping methods which address the unique challenges of analyzing genome-wide data from multi-environment plant studies. Association studies on a genome-wide scale are being performed in plants. Unlike human studies, plant studies contain replicates whose data may be recorded across different environments. Plant studies also often employ elaborate experimental designs for controlling extraneous phenotypic variation. As a result, the genome-wide analysis of data from plant studies can be challenging. In this paper, we present QK-based association mapping for the analysis of data from plant association studies. In doing so, we have developed: (a) a general multivariate QK framework for association mapping in plant studies of arbitrary complexity; (b) a new weighted two-stage analysis approach for QK-based association mapping; (c) a heuristic procedure for determining when two-stage analysis is appropriate; and (d) a Monte Carlo sampling procedure for controlling the genome-wide type I error rate. We conduct a simulation study to evaluate the performance of our genome-wide mapping technique. We also analyze data from a multi-environment association study in wheat.

  18. Genome-wide analysis highlights genetic dilution in Algerian sheep.

    PubMed

    Gaouar, S B S; Lafri, M; Djaout, A; El-Bouyahiaoui, R; Bouri, A; Bouchatal, A; Maftah, A; Ciani, E; Da Silva, A B

    2017-03-01

    Algeria represents a reservoir of genetic diversity with local sheep breeds adapted to a large range of environments and showing specific features necessary to deal with harsh conditions. This remarkable diversity results from the traditional management of dryland by pastoralists over centuries. Most of these breeds are poorly productive, and the economic pressure leads farmers to realize anarchic cross-breeding (that is, not carried out in the framework of selection plans) with the hope to increase animal's conformation. In this study, eight of the nine local Algerian sheep breeds (D'men, Hamra, Ouled-Djellal, Rembi, Sidaoun, Tazegzawt, Berber and Barbarine) were investigated for the first time by genome-wide single-nucleotide polymorphism genotyping. At an international scale, Algerian sheep occupied an original position shaped by relations with African and European (particularly Italian) breeds. The strong genetic proximity with Caribbean and Brazilian breeds confirmed that the genetic make-up of these American breeds was largely influenced by the Atlantic slave trade. At a national scale, an alarming genetic dilution of the Berber (a primitive breed) and the Rembi was observed, as a consequence of uncontrolled mating practices with Ouled-Djellal. A similar, though less pronounced, phenomenon was also detected for the Barbarine, another ancestral breed. Genetic originality appeared to be better preserved in Tazegzawt, Hamra, D'men and Sidaoun. These breeds should be given high priority in the establishment of conservation plans to halt their progressive loss. For Berber and Barbarine that also occur in the bordering neighbor countries, urgent concerted transnational actions are needed.

  19. Genome-wide association studies of cancer.

    PubMed

    Jorgenson, Eric; Witte, John S

    2007-08-01

    Genome-wide association studies provide a new and powerful approach to investigate the effect of inherited genetic variation on the risk of human disease. These studies rely on high throughput DNA microarray technology to genotype hundreds of thousands of genetic variants across the human genome. The first genome-wide association studies have identified previously unknown genetic risk factors that influence a range of diseases, including prostate cancer, breast cancer, myocardial infarction, age-related macular degeneration, diabetes, Crohn's disease and obesity. Many more studies are currently underway, including a number that will focus on other cancers (e.g., colorectal). Here we discuss the major issues involved in conducting genome-wide association studies and how these studies can be used to examine cancer phenotypes.

  20. Accuracy of genomic breeding values in multibreed beef cattle populations derived from deregressed breeding values and phenotypes.

    PubMed

    Weber, K L; Thallman, R M; Keele, J W; Snelling, W M; Bennett, G L; Smith, T P L; McDaneld, T G; Allan, M F; Van Eenennaam, A L; Kuehn, L A

    2012-12-01

    Genomic selection involves the assessment of genetic merit through prediction equations that allocate genetic variation with dense marker genotypes. It has the potential to provide accurate breeding values for selection candidates at an early age and facilitate selection for expensive or difficult to measure traits. Accurate across-breed prediction would allow genomic selection to be applied on a larger scale in the beef industry, but the limited availability of large populations for the development of prediction equations has delayed researchers from providing genomic predictions that are accurate across multiple beef breeds. In this study, the accuracy of genomic predictions for 6 growth and carcass traits were derived and evaluated using 2 multibreed beef cattle populations: 3,358 crossbred cattle of the U.S. Meat Animal Research Center Germplasm Evaluation Program (USMARC_GPE) and 1,834 high accuracy bull sires of the 2,000 Bull Project (2000_BULL) representing influential breeds in the U.S. beef cattle industry. The 2000_BULL EPD were deregressed, scaled, and weighted to adjust for between- and within-breed heterogeneous variance before use in training and validation. Molecular breeding values (MBV) trained in each multibreed population and in Angus and Hereford purebred sires of 2000_BULL were derived using the GenSel BayesCπ function (Fernando and Garrick, 2009) and cross-validated. Less than 10% of large effect loci were shared between prediction equations trained on (USMARC_GPE) relative to 2000_BULL although locus effects were moderately to highly correlated for most traits and the traits themselves were highly correlated between populations. Prediction of MBV accuracy was low and variable between populations. For growth traits, MBV accounted for up to 18% of genetic variation in a pooled, multibreed analysis and up to 28% in single breeds. For carcass traits, MBV explained up to 8% of genetic variation in a pooled, multibreed analysis and up to 42% in

  1. Genome-wide linkage disequilibrium and genetic diversity in five populations of Australian domestic sheep.

    PubMed

    Al-Mamun, Hawlader Abdullah; Clark, Samuel A; Kwan, Paul; Gondro, Cedric

    2015-11-24

    Knowledge of the genetic structure and overall diversity of livestock species is important to maximise the potential of genome-wide association studies and genomic prediction. Commonly used measures such as linkage disequilibrium (LD), effective population size (Ne), heterozygosity, fixation index (FST) and runs of homozygosity (ROH) are widely used and help to improve our knowledge about genetic diversity in animal populations. The development of high-density single nucleotide polymorphism (SNP) arrays and the subsequent genotyping of large numbers of animals have greatly increased the accuracy of these population-based estimates. In this study, we used the Illumina OvineSNP50 BeadChip array to estimate and compare LD (measured by r² and D'), Ne, heterozygosity, FST and ROH in five Australian sheep populations: three pure breeds, i.e., Merino (MER), Border Leicester (BL), Poll Dorset (PD) and two crossbred populations i.e. F1 crosses of Merino and Border Leicester (MxB) and MxB crossed to Poll Dorset (MxBxP). Compared to other livestock species, the sheep populations that were analysed in this study had low levels of LD and high levels of genetic diversity. The rate of LD decay was greater in Merino than in the other pure breeds. Over short distances (<10 kb), the levels of LD were higher in BL and PD than in MER. Similarly, BL and PD had comparatively smaller Ne than MER. Observed heterozygosity in the pure breeds ranged from 0.3 in BL to 0.38 in MER. Genetic distances between breeds were modest compared to other livestock species (highest FST = 0.063) but the genetic diversity within breeds was high. Based on ROH, two chromosomal regions showed evidence of strong recent selection. This study shows that there is a large range of genome diversity in Australian sheep breeds, especially in Merino sheep. The observed range of diversity will influence the design of genome-wide association studies and the results that can be obtained from them. This

  2. Genetic Diversity in the Modern Horse Illustrated from Genome-Wide SNP Data

    PubMed Central

    Petersen, Jessica L.; Mickelson, James R.; Cothran, E. Gus; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Distl, Ottmar; Felicetti, Michela; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Silvestrelli, Maurizio; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; M. Wade, Claire; McCue, Molly E.

    2013-01-01

    Horses were domesticated from the Eurasian steppes 5,000–6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection. PMID:23383025

  3. Genetic Control of Canine Leishmaniasis: Genome-Wide Association Study and Genomic Selection Analysis

    PubMed Central

    Quilez, Javier; Martínez, Verónica; Woolliams, John A.; Sanchez, Armand; Pong-Wong, Ricardo; Kennedy, Lorna J.; Quinnell, Rupert J.; Ollier, William E. R.; Roura, Xavier; Ferrer, Lluís; Altet, Laura; Francino, Olga

    2012-01-01

    Background: The current disease model for leishmaniasis suggests that only a proportion of infected individuals develop clinical disease, while others are asymptomatically infected due to immune control of infection. The factors that determine whether individuals progress to clinical disease following Leishmania infection are unclear, although previous studies suggest a role for host genetics. Our hypothesis was that canine leishmaniasis is a complex disease with multiple loci responsible for the progression of the disease from Leishmania infection. Methodology/Principal Findings: Genome-wide association and genomic selection approaches were applied to a population-based case-control dataset of 219 dogs from a single breed (Boxer) genotyped for ∼170,000 SNPs. Firstly, we aimed to identify individual disease loci; secondly, we quantified the genetic component of the observed phenotypic variance; and thirdly, we tested whether genome-wide SNP data could accurately predict the disease. Conclusions/Significance: We estimated that a substantial proportion of the genome is affecting the trait and that its heritability could be as high as 60%. Using the genome-wide association approach, the strongest associations were on chromosomes 1, 4 and 20, although none of these were statistically significant at a genome-wide level and after correcting for genetic stratification and lifestyle. Amongst these associations, chromosome 4: 61.2–76.9 Mb maps to a locus that has previously been associated with host susceptibility to human and murine leishmaniasis, and genomic selection estimated markers in this region to have the greatest effect on the phenotype. We therefore propose these regions as candidates for replication studies. An important finding of this study was the significant predictive value from using the genomic information. We found that the phenotype could be predicted with an accuracy of ∼0.29 in new samples and that the affection status was correctly predicted in 60

  4. Genome-wide association study of schizophrenia in Ashkenazi Jews.

    PubMed

    Goes, Fernando S; McGrath, John; Avramopoulos, Dimitrios; Wolyniec, Paula; Pirooznia, Mehdi; Ruczinski, Ingo; Nestadt, Gerald; Kenny, Eimear E; Vacic, Vladimir; Peters, Inga; Lencz, Todd; Darvasi, Ariel; Mulle, Jennifer G; Warren, Stephen T; Pulver, Ann E

    2015-12-01

    Schizophrenia is a common, clinically heterogeneous disorder associated with lifelong morbidity and early mortality. Several genetic variants associated with schizophrenia have been identified, but the majority of the heritability remains unknown. In this study, we report on a case-control sample of Ashkenazi Jews (AJ), a founder population that may provide additional insights into genetic etiology of schizophrenia. We performed a genome-wide association analysis (GWAS) of 592 cases and 505 controls of AJ ancestry ascertained in the US. Subsequently, we performed a meta-analysis with an Israeli AJ sample of 913 cases and 1640 controls, followed by a meta-analysis and polygenic risk scoring using summary results from Psychiatric GWAS Consortium 2 schizophrenia study. The U.S. AJ sample showed strong evidence of polygenic inheritance (pseudo-R² ∼9.7%) and a SNP-heritability estimate of 0.39 (P = 0.00046). We found no genome-wide significant associations in the U.S. sample or in the combined US/Israeli AJ meta-analysis of 1505 cases and 2145 controls. The strongest AJ specific associations (P-values in the 10⁻⁶ to 10⁻⁷ range) were in the 22q11.2 deletion region and included the genes TBX1, GLN1, and COMT. Supportive evidence (meta P < 1 × 10⁻⁴) was also found for several previously identified genome-wide significant findings, including the HLA region, CNTN4, IMMP2L, and GRIN2A. The meta-analysis of the U.S. sample with the PGC2 results provided initial genome-wide significant evidence for six new loci. Among the novel potential susceptibility genes is PEPD, a gene involved in proline metabolism, which is associated with a Mendelian disorder characterized by developmental delay and cognitive deficits. © 2015 Wiley Periodicals, Inc.

  5. Genome-wide association study of paliperidone efficacy

    PubMed Central

    Wineinger, Nathan E.; Fu, Dong-Jing; Libiger, Ondrej; Alphs, Larry; Savitz, Adam; Gopal, Srihari; Cohen, Nadine; Schork, Nicholas J.

    2017-01-01

    Objective: Clinical response to the atypical antipsychotic paliperidone is known to vary among schizophrenic patients. We carried out a genome-wide association study to identify common genetic variants predictive of paliperidone efficacy. Methods: We leveraged a collection of 1390 samples from individuals of European ancestry enrolled in 12 clinical studies investigating the efficacy of the extended-release tablet paliperidone ER (n₁=490) and the once-monthly injection paliperidone palmitate (n₂=550 and n₃=350). We carried out a genome-wide association study using a general linear model (GLM) analysis on three separate cohorts, followed by meta-analysis and using a mixed linear model analysis on all samples. The variations in response explained by each single nucleotide polymorphism (h²SNP) were estimated. Results: No SNP passed genome-wide significance in the GLM-based analyses with suggestive signals from rs56240334 [P=7.97×10⁻⁸ for change in the Clinical Global Impression Scale-Severity (CGI-S); P=8.72×10⁻⁷ for change in the total Positive and Negative Syndrome Scale (PANSS)] in the intron of ADCK1. The mixed linear model-based association P-values for rs56240334 were consistent with the results from GLM-based analyses and the association with change in CGI-S (P=4.26×10⁻⁸) reached genome-wide significance (i.e. P<5×10⁻⁸). We also found suggestive evidence for a polygenic contribution toward paliperidone treatment response with estimates of heritability, h²SNP, ranging from 0.31 to 0.43 for change in the total PANSS score, the PANSS positive Marder factor score, and CGI-S. Conclusion: Genetic variations in the ADCK1 gene may differentially predict paliperidone efficacy in schizophrenic patients. However, this finding should be replicated in additional samples. PMID:27846195
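
    As a rough illustration of the significance cutoff quoted in this abstract, the sketch below checks the three reported P-values for rs56240334 against the 5×10⁻⁸ threshold. The derivation of that threshold as 0.05 divided by about one million independent tests is the commonly cited rationale and is an assumption here, not something the abstract states; the function and variable names are hypothetical.

```python
# Conventional genome-wide significance threshold, sketched as a
# Bonferroni-style correction of alpha = 0.05 for about one million
# independent tests (an assumed rationale, not taken from the abstract).
ALPHA = 0.05
N_INDEPENDENT_TESTS = 1_000_000
THRESHOLD = ALPHA / N_INDEPENDENT_TESTS  # approximately 5e-8


def is_genome_wide_significant(p_value: float, threshold: float = THRESHOLD) -> bool:
    """Return True when p_value is strictly below the threshold."""
    return p_value < threshold


# P-values reported in the abstract for rs56240334.
reported = {
    "GLM, change in CGI-S": 7.97e-8,
    "GLM, change in total PANSS": 8.72e-7,
    "Mixed linear model, change in CGI-S": 4.26e-8,
}

# Map each analysis label to whether it clears the threshold.
results = {label: is_genome_wide_significant(p) for label, p in reported.items()}
```

    Under these assumptions only the mixed linear model result for CGI-S falls below the cutoff, which matches the abstract's statement.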

  6. Genomic selection & association mapping in rice: effect of trait genetic architecture, training population composition, marker number & statistical model on accuracy of rice genomic selection in elite, tropical rice breeding

    USDA-ARS's Scientific Manuscript database

    Genomic Selection (GS) is a new breeding method in which genome-wide markers are used to predict the breeding value of individuals in a breeding population. GS has been shown to improve breeding efficiency in dairy cattle and several crop plant species, and here we evaluate for the first time its ef...

  7. Meta-analysis of genome-wide association from genomic prediction models

    USDA-ARS's Scientific Manuscript database

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  8. Methods for meta-analysis of genome-wide association studies

    USDA-ARS's Scientific Manuscript database

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...

  9. Genome-wide estimates of coancestry, inbreeding and effective population size in the Spanish Holstein population.

    PubMed

    Rodríguez-Ramilo, Silvia Teresa; Fernández, Jesús; Toro, Miguel Angel; Hernández, Delfino; Villanueva, Beatriz

    2015-01-01

    Estimates of effective population size in the Holstein cattle breed have usually been low despite the large number of animals that constitute this breed. Effective population size is inversely related to the rates at which coancestry and inbreeding increase and these rates have been high as a consequence of intense and accurate selection. Traditionally, coancestry and inbreeding coefficients have been calculated from pedigree data. However, the development of genome-wide single nucleotide polymorphisms has increased the interest of calculating these coefficients from molecular data in order to improve their accuracy. In this study, genomic estimates of coancestry, inbreeding and effective population size were obtained in the Spanish Holstein population and then compared with pedigree-based estimates. A total of 11,135 animals genotyped with the Illumina BovineSNP50 BeadChip were available for the study. After applying filtering criteria, the final genomic dataset included 36,693 autosomal SNPs and 10,569 animals. Pedigree data from those genotyped animals included 31,203 animals. These individuals represented only the last five generations in order to homogenise the amount of pedigree information across animals. Genomic estimates of coancestry and inbreeding were obtained from identity by descent segments (coancestry) or runs of homozygosity (inbreeding). The results indicate that the percentage of variance of pedigree-based coancestry estimates explained by genomic coancestry estimates was higher than that for inbreeding. Estimates of effective population size obtained from genome-wide and pedigree information were consistent and ranged from about 66 to 79. These low values emphasize the need of controlling the rate of increase of coancestry and inbreeding in Holstein selection programmes.
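
    The inverse relationship between effective population size and the rate of increase in inbreeding mentioned in this abstract can be sketched with the textbook formulas ΔF = (F_t − F_{t−1}) / (1 − F_{t−1}) and Ne = 1 / (2ΔF). This is a minimal sketch under those standard assumptions, not the estimation procedure used in the study; the function names and the input values are hypothetical.

```python
def rate_of_inbreeding(f_prev: float, f_curr: float) -> float:
    """Per-generation rate of inbreeding: (F_t - F_{t-1}) / (1 - F_{t-1})."""
    if not 0.0 <= f_prev < 1.0:
        raise ValueError("f_prev must lie in [0, 1)")
    return (f_curr - f_prev) / (1.0 - f_prev)


def effective_population_size(delta_f: float) -> float:
    """Effective population size from the rate of inbreeding: Ne = 1 / (2 * delta_F)."""
    if delta_f <= 0.0:
        raise ValueError("delta_f must be positive")
    return 1.0 / (2.0 * delta_f)


# Hypothetical mean inbreeding coefficients in two successive generations.
delta_f = rate_of_inbreeding(f_prev=0.0, f_curr=0.00625)
ne = effective_population_size(delta_f)
```

    With these illustrative inputs the rate is 0.625% per generation and Ne comes out at about 80, the same order of magnitude as the range reported in the abstract. The same formula applies to the rate of coancestry.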

  10. Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement

    PubMed Central

    Spindel, J E; Begum, H; Akdemir, D; Collard, B; Redoña, E; Jannink, J-L; McCouch, S

    2016-01-01

    To address the multiple challenges to food security posed by global climate change, population growth and rising incomes, plant breeders are developing new crop varieties that can enhance both agricultural productivity and environmental sustainability. Current breeding practices, however, are unable to keep pace with demand. Genomic selection (GS) is a new technique that helps accelerate the rate of genetic gain in breeding by using whole-genome data to predict the breeding value of offspring. Here, we describe a new GS model that combines RR-BLUP with markers fit as fixed effects selected from the results of a genome-wide-association study (GWAS) on the RR-BLUP training data. We term this model GS + de novo GWAS. In a breeding population of tropical rice, GS + de novo GWAS outperformed six other models for a variety of traits and in multiple environments. On the basis of these results, we propose an extended, two-part breeding design that can be used to efficiently integrate novel variation into elite breeding populations, thus expanding genetic diversity and enhancing the potential for sustainable productivity gains. PMID:26860200

  11. Progress of genome wide association study in domestic animals.

    PubMed

    Zhang, Hui; Wang, Zhipeng; Wang, Shouzhi; Li, Hui

    2012-08-22

    Domestic animals are invaluable resources for study of the molecular architecture of complex traits. Although the mapping of quantitative trait loci (QTL) responsible for economically important traits in domestic animals has achieved remarkable results in recent decades, not all of the genetic variation in the complex traits has been captured because of the low density of markers used in QTL mapping studies. The genome wide association study (GWAS), which utilizes high-density single-nucleotide polymorphism (SNP), provides a new way to tackle this issue. Encouraging achievements in dissection of the genetic mechanisms of complex diseases in humans have resulted from the use of GWAS. At present, GWAS has been applied to the field of domestic animal breeding and genetics, and some advances have been made. Many genes or markers that affect economic traits of interest in domestic animals have been identified. In this review, advances in the use of GWAS in domestic animals are described.

  12. Profiling genome-wide DNA methylation.

    PubMed

    Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang

    2016-01-01

    DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.

  13. Genome-Wide Approaches to Schizophrenia

    PubMed Central

    Duan, Jubao; Sanders, Alan R.; Gejman, Pablo V.

    2010-01-01

    Schizophrenia (SZ) is a common and severe psychiatric disorder with both environmental and genetic risk factors, and a high heritability. After over 20 years of molecular genetics research, new molecular strategies, primarily genome-wide association studies (GWAS), have generated major tangible progress. This new data provides evidence for: 1) A number of chromosomal regions with common polymorphisms showing genome-wide association with SZ (the major histocompatibility complex, MHC, region at 6p22-p21; 18q21.2; and 2q32.1). The associated alleles present small odds ratios (the odds of a risk variant being present in cases versus controls) and suggest causative involvement of gene regulatory mechanisms in SZ. 2) Polygenic inheritance. 3) Involvement of rare (<1%) and large (>100kb) copy number variants (CNVs). 4) A genetic overlap of SZ with autism and with bipolar disorder (BP) challenging the classical clinical classifications. Most new SZ findings (chromosomal regions and genes) have generated new biological leads. These new findings, however, still need to be translated into a better understanding of the underlying biology and into causal mechanisms. Furthermore, a considerable amount of heritability still remains unexplained (missing heritability). Deep resequencing for rare variants and system biology approaches (e.g., integrating DNA sequence and functional data) are expected to further improve our understanding of the genetic architecture of SZ and its underlying biology. PMID:20433910
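
    The odds ratio mentioned in this abstract can be made concrete with a small worked example. The sketch below uses the standard case-control definition (odds of carrying the risk variant among cases divided by the odds among controls); the function name and the counts are hypothetical and are not data from the review.

```python
def odds_ratio(case_carriers: int, case_noncarriers: int,
               control_carriers: int, control_noncarriers: int) -> float:
    """Odds of carrying the variant in cases divided by the odds in controls."""
    counts = (case_carriers, case_noncarriers, control_carriers, control_noncarriers)
    if min(counts) <= 0:
        raise ValueError("all four counts must be positive")
    odds_cases = case_carriers / case_noncarriers
    odds_controls = control_carriers / control_noncarriers
    return odds_cases / odds_controls


# Hypothetical 2x2 table: 300 of 1000 cases and 250 of 1000 controls carry the variant.
example_or = odds_ratio(300, 700, 250, 750)
```

    For these made-up counts the odds ratio is roughly 1.29, illustrating the kind of small effect the abstract describes for common risk alleles.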

  14. Genome-wide Association Study of Obsessive-Compulsive Disorder

    PubMed Central

    Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10⁻⁶ and p=3.44×10⁻⁶), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84×10⁻⁸. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10⁻⁵, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  15. Genome Wide Association Study of Sepsis in Extremely Premature Infants

    PubMed Central

    Srinivasan, Lakshmi; Page, Grier; Kirpalani, Haresh; Murray, Jeffrey C.; Das, Abhik; Higgins, Rosemary D.; Carlo, Waldemar A.; Bell, Edward F.; Goldberg, Ronald N.; Schibler, Kurt; Sood, Beena G.; Stevenson, David K.; Stoll, Barbara J.; Van Meurs, Krisa P.; Johnson, Karen J.; Levy, Joshua; McDonald, Scott A.; Zaterka-Baxter, Kristin M.; Kennedy, Kathleen A.; Sánchez, Pablo J.; Duara, Shahnaz; Walsh, Michele C.; Shankaran, Seetha; Wynn, James L.; Cotten, C. Michael

    2017-01-01

    Objective: To identify genetic variants associated with sepsis (early and late-onset) using a genome wide association (GWA) analysis in a cohort of extremely premature infants. Study Design: Previously generated GWA data from the Neonatal Research Network’s anonymized genomic database biorepository of extremely premature infants were used for this study. Sepsis was defined as culture-positive early-onset or late-onset sepsis or culture-proven meningitis. Genomic and whole genome amplified DNA was genotyped for 1.2 million single nucleotide polymorphisms (SNPs); 91% of SNPs were successfully genotyped. We imputed 7.2 million additional SNPs. P values and false discovery rates were calculated from multivariate logistic regression analysis adjusting for gender, gestational age and ancestry. Target statistical value was p<10⁻⁵. Secondary analyses assessed associations of SNPs with pathogen type. Pathway analyses were also run on primary and secondary end points. Results: Data from 757 extremely premature infants were included: 351 infants with sepsis and 406 infants without sepsis. No SNPs reached genome-wide significance levels (5×10⁻⁸); two SNPs in proximity to FOXC2 and FOXL1 genes achieved target levels of significance. In secondary analyses, SNPs for ELMO1, IRAK2 (Gram positive sepsis), RALA, IMMP2L (Gram negative sepsis) and PIEZO2 (fungal sepsis) met target significance levels. Pathways associated with sepsis and Gram negative sepsis included gap junctions, fibroblast growth factor receptors, regulators of cell division and Interleukin-1 associated receptor kinase 2 (p values<0.001 and FDR<20%). Conclusions: No SNPs met genome-wide significance in this cohort of ELBW infants; however, areas of potential association and pathways meriting further study were identified. PMID:28283553

  16. Genome-Wide Association Study of Metabolic Syndrome in Koreans

    PubMed Central

    Jeong, Seok Won; Chung, Myungguen; Park, Soo-Jung; Cho, Seong Beom

    2014-01-01

    Metabolic syndrome (METS) is a disorder of energy utilization and storage and increases the risk of developing cardiovascular disease and diabetes. To identify the genetic risk factors of METS, we carried out a genome-wide association study (GWAS) for 2,657 cases and 5,917 controls in Korean populations. As a result, we could identify 2 single nucleotide polymorphisms (SNPs) with genome-wide significance level p-values (<5 × 10⁻⁸), 8 SNPs with genome-wide suggestive p-values (5 × 10⁻⁸ ≤ p < 1 × 10⁻⁵), and 2 SNPs of more functional variants with borderline p-values (5 × 10⁻⁵ ≤ p < 1 × 10⁻⁴). On the other hand, the multiple correction criteria of conventional GWASs exclude false-positive loci, but simultaneously, they discard many true-positive loci. To reconsider the discarded true-positive loci, we attempted to include the functional variants (nonsynonymous SNPs [nsSNPs] and expression quantitative trait loci [eQTL]) among the top 5,000 SNPs based on the proportion of phenotypic variance explained by genotypic variance. In total, 159 eQTLs and 18 nsSNPs were presented in the top 5,000 SNPs. Although they should be replicated in other independent populations, 6 eQTLs and 2 nsSNP loci were located in the molecular pathways of LPL, APOA5, and CHRM2, which were the significant or suggestive loci in the METS GWAS. Conclusively, our approach using the conventional GWAS, reconsidering functional variants and pathway-based interpretation, suggests a useful method to understand the GWAS results of complex traits and can be expanded in other genomewide association studies. PMID:25705157
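
The three p-value tiers quoted in the abstract above can be written as a small classifier. This is a sketch for illustration only: the function name and tier labels are hypothetical, and the cut-offs are copied from the abstract as printed, so p-values from 1e-5 up to (but not including) 5e-5 fall in none of the three tiers.

```python
def classify_p_value(p):
    """Assign a p-value to one of the tiers quoted in the abstract (labels are illustrative)."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    if p < 5e-8:
        return "genome-wide significant"
    if p < 1e-5:                 # 5e-8 <= p < 1e-5
        return "suggestive"
    if 5e-5 <= p < 1e-4:
        return "borderline"
    return "unclassified"        # includes the gap 1e-5 <= p < 5e-5


for example in (1e-9, 5e-8, 2e-5, 7e-5, 0.5):
    print(example, classify_p_value(example))
```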

  17. Patterns of Genome-Wide VDR Locations

    PubMed Central

    Tuoresmäki, Pauli; Väisänen, Sami; Neme, Antonio

    2014-01-01

    The genome-wide analysis of the binding sites of the transcription factor vitamin D receptor (VDR) is essential for a global appreciation of the physiological impact of the nuclear hormone 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3). Genome-wide analysis of lipopolysaccharide (LPS)-polarized THP-1 human monocytic leukemia cells via chromatin immunoprecipitation sequencing (ChIP-seq) resulted in 1,318 high-confidence VDR binding sites, of which 789 and 364 occurred uniquely with and without 1,25(OH)2D3 stimulation, while only 165 were common. We re-analyzed five public VDR ChIP-seq datasets with identical peak calling settings (MACS, version 2) and found, using a novel consensus summit identification strategy, in total 23,409 non-overlapping VDR binding sites, 75% of which are unique within the six analyzed cellular models. LPS-differentiated THP-1 cells have 22% more genomic VDR locations than undifferentiated cells and both cell types display more overlap in their VDR locations than the other investigated cell types. In general, the intersection of VDR binding profiles of ligand-stimulated cells is higher than those of unstimulated cells. De novo binding site searches and HOMER screening for binding motifs formed by direct repeats spaced by three nucleotides (DR3) suggest for all six VDR ChIP-seq datasets that these sequences are found preferentially at highly ligand responsive VDR loci. Importantly, all VDR ChIP-seq datasets display the same relationship between the VDR occupancy and the percentage of DR3-type sequences below the peak summits. The comparative analysis of six VDR ChIP-seq datasets demonstrated that the mechanistic basis for the action of the VDR is independent of the cell type. Only the minority of genome-wide VDR binding sites contains a DR3-type sequence. Moreover, the total number of identified VDR binding sites in each ligand-stimulated cell line inversely correlates with the percentage of peak summits with DR3 sites. PMID:24787735

  18. Genome-wide association study of parity in Bangladeshi women.

    PubMed

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P < 10⁻⁷ and number of pregnancies at P < 10⁻⁶. This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P < 10⁻⁶. The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P < 10⁻⁵ for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (h_g² = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (h_g² = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis.

  19. Genome-wide determination of drug localization

    PubMed Central

    Anders, Lars; Guenther, Matthew G.; Qi, Jun; Fan, Zi Peng; Marineau, Jason J.; Rahl, Peter B.; Lovén, Jakob; Sigova, Alla A.; Smith, William B.; Lee, Tong Ihn; Bradner, James E.; Young, Richard A.

    2014-01-01

    A vast number of small-molecule ligands, including therapeutic drugs under development and in clinical use, elicit their effects by binding specific proteins associated with the genome. An ability to map the direct interactions of a chemical entity with chromatin genome-wide could provide new and important insights into chemical perturbation of cellular function. Here we describe a method that couples ligand-affinity capture and massively parallel DNA sequencing (Chem-seq) to identify the sites bound by small chemical molecules throughout the human genome. We show how Chem-seq can be combined with ChIP-seq to gain unique insights into the interaction of drugs with their target proteins throughout the genome of tumor cells. These methods provide a powerful approach to enhance understanding of therapeutic action and characterize the specificity of chemical entities that interact with DNA or genome-associated proteins. PMID:24336317

  20. Genome-Wide Association Studies of Cancer

    PubMed Central

    Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth

    2010-01-01

    Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100

  1. Replication in genome-wide association studies

    PubMed Central

    Kraft, Peter; Zeggini, Eleftheria; Ioannidis, John P. A.

    2009-01-01

    Summary Replication helps ensure that a genotype-phenotype association observed in a genome-wide association (GWA) study represents a credible association and is not a chance finding or an artifact due to uncontrolled biases. We discuss prerequisites for exact replication; issues of heterogeneity; advantages and disadvantages of different methods of data synthesis across multiple studies; frequentist vs. Bayesian inferences for replication; and challenges that arise from multi-team collaborations. While consistent replication can greatly improve the credibility of a genotype-phenotype association, it may not eliminate spurious associations due to biases shared by many studies. Conversely, lack of replication in well-powered follow-up studies usually invalidates the initially proposed association, although occasionally it may point to differences in linkage disequilibrium or effect modifiers across studies. PMID:20454541

  2. Comparison of molecular breeding values based on within- and across-breed training in beef cattle

    USDA-ARS's Scientific Manuscript database

    Background Although the efficacy of genomic predictors based on within-breed training looks promising, it is necessary to develop and evaluate across-breed predictors for the technology to be fully applied in the beef industry. The efficacies of genomic predictors trained in one breed and utilized ...

  3. An Efficient Resampling Method for Assessing Genome-Wide Statistical Significance in Mapping Quantitative Trait Loci

    PubMed Central

    Zou, Fei; Fine, Jason P.; Hu, Jianhua; Lin, D. Y.

    2004-01-01

    Assessing genome-wide statistical significance is an important and difficult problem in multipoint linkage analysis. Due to multiple tests on the same genome, the usual pointwise significance level based on the chi-square approximation is inappropriate. Permutation is widely used to determine genome-wide significance. Theoretical approximations are available for simple experimental crosses. In this article, we propose a resampling procedure to assess the significance of genome-wide QTL mapping for experimental crosses. The proposed method is computationally much less intensive than the permutation procedure (in the order of 10² or higher) and is applicable to complex breeding designs and sophisticated genetic models that cannot be handled by the permutation and theoretical methods. The usefulness of the proposed method is demonstrated through simulation studies and an application to a Drosophila backcross. PMID:15611194
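
For orientation, the permutation baseline that the abstract above compares against can be sketched in a few lines of Python. This is not the authors' resampling procedure. It is a generic max-statistic permutation test under simplifying assumptions: backcross genotypes coded 0/1, a single-marker correlation as the test statistic, and simulated data. All function names and parameters are hypothetical.

```python
import random


def correlation(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / (sxx * syy) ** 0.5


def max_abs_correlation(genotypes, phenotype):
    """Largest absolute marker-phenotype correlation across all markers."""
    return max(abs(correlation(marker, phenotype)) for marker in genotypes)


def permutation_threshold(genotypes, phenotype, n_perm=200, alpha=0.05, seed=0):
    """Empirical (1 - alpha) quantile of the genome-wide maximum under shuffled phenotypes."""
    rng = random.Random(seed)
    shuffled = list(phenotype)
    null_maxima = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # breaks any genotype-phenotype link
        null_maxima.append(max_abs_correlation(genotypes, shuffled))
    null_maxima.sort()
    index = min(n_perm - 1, int((1 - alpha) * n_perm))
    return null_maxima[index]


# Simulated backcross: 20 markers, 100 individuals, marker 0 affects the trait.
rng = random.Random(42)
n_individuals, n_markers = 100, 20
genotypes = [[rng.choice((0, 1)) for _ in range(n_individuals)] for _ in range(n_markers)]
phenotype = [genotypes[0][i] + rng.gauss(0.0, 0.3) for i in range(n_individuals)]

observed = max_abs_correlation(genotypes, phenotype)
threshold = permutation_threshold(genotypes, phenotype)
print(observed, threshold, observed > threshold)
```

Every permutation rescans all markers, so the cost grows with the number of permutations. That is the computational burden the abstract says its resampling method reduces.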

  4. Risk aversion affects economic values of blue fox breeding scheme.

    PubMed

    Peura, J; Kempe, R; Strandén, I; Rydhmer, L

    2016-12-01

    The profit and production of an average Finnish blue fox farm was simulated using a deterministic bio-economic farm model. Risk was included using the Arrow-Pratt absolute risk aversion coefficient and profit variance. Risk-rated economic values were calculated for pregnancy rate, litter loss, litter size, pelt size, pelt quality, pelt colour clarity, feed efficiency and eye infection. With high absolute risk aversion, economic values were lower than with low absolute risk aversion. Economic values were highest for litter loss (18.16 and 26.42 EUR), litter size (13.27 and 19.40 EUR), pregnancy (11.99 and 18.39 EUR) and eye infection (12.39 and 13.81 EUR). Sensitivity analysis showed that selection pressure for improved eye health depended strongly on proportion of culled animals among infected animals and much less on the proportion of infected animals. The economic value of feed efficiency was lower than expected (6.06 and 8.03 EUR). However, it was almost the same magnitude as pelt quality (7.30 and 7.30 EUR) and higher than the economic value of pelt size (3.37 and 5.26 EUR). Risk factors should be considered in blue fox breeding scheme because they change the relative importance of traits.

  5. Efficient multivariate linear mixed model algorithms for genome-wide association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2014-04-01

    Multivariate linear mixed models (mvLMMs) are powerful tools for testing associations between single-nucleotide polymorphisms and multiple correlated phenotypes while controlling for population stratification in genome-wide association studies. We present efficient algorithms in the genome-wide efficient mixed model association (GEMMA) software for fitting mvLMMs and computing likelihood ratio tests. These algorithms offer improved computation speed, power and P-value calibration over existing methods, and can deal with more than two phenotypes.

  6. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  7. Genome-Wide Association of Heroin Dependence in Han Chinese.

    PubMed

    Kalsi, Gursharan; Euesden, Jack; Coleman, Jonathan R I; Ducci, Francesca; Aliev, Fazil; Newhouse, Stephen J; Liu, Xiehe; Ma, Xiaohong; Wang, Yingcheng; Collier, David A; Asherson, Philip; Li, Tao; Breen, Gerome

    2016-01-01

    Drug addiction is a costly and recurring healthcare problem, necessitating a need to understand risk factors and mechanisms of addiction, and to identify new biomarkers. To date, genome-wide association studies (GWAS) for heroin addiction have been limited; moreover they have been restricted to examining samples of European and African-American origin due to difficulty of recruiting samples from other populations. This is the first study to test a Han Chinese population; we performed a GWAS on a homogeneous sample of 370 Han Chinese subjects diagnosed with heroin dependence using the DSM-IV criteria and 134 ethnically matched controls. Analysis using the diagnostic criteria of heroin dependence yielded suggestive evidence for association between variants in the genes CCDC42 (coiled coil domain 42; p = 2.8×10⁻⁷) and BRSK2 (BR serine/threonine 2; p = 4.1×10⁻⁶). In addition, we found evidence for risk variants within the ARHGEF10 (Rho guanine nucleotide exchange factor 10) gene on chromosome 8 and variants in a region on chromosome 20q13, which is gene-poor but has a concentration of mRNAs and predicted miRNAs. Gene-based association analysis identified genome-wide significant association between variants in CCDC42 and heroin addiction. Additionally, when we investigated shared risk variants between heroin addiction and risk of other addiction-related and psychiatric phenotypes using polygenic risk scores, we found a suggestive relationship with variants predicting tobacco addiction, and a significant relationship with variants predicting schizophrenia. Our genome wide association study of heroin dependence provides data in a novel sample, with functionally plausible results and evidence of genetic data of value to the field.

  8. Genome-Wide Association of Heroin Dependence in Han Chinese

    PubMed Central

    Coleman, Jonathan R. I.; Ducci, Francesca; Aliev, Fazil; Newhouse, Stephen J.; Liu, Xiehe; Ma, Xiaohong; Wang, Yingcheng; Collier, David A.; Asherson, Philip; Li, Tao; Breen, Gerome

    2016-01-01

    Drug addiction is a costly and recurring healthcare problem, necessitating a need to understand risk factors and mechanisms of addiction, and to identify new biomarkers. To date, genome-wide association studies (GWAS) for heroin addiction have been limited; moreover they have been restricted to examining samples of European and African-American origin due to difficulty of recruiting samples from other populations. This is the first study to test a Han Chinese population; we performed a GWAS on a homogeneous sample of 370 Han Chinese subjects diagnosed with heroin dependence using the DSM-IV criteria and 134 ethnically matched controls. Analysis using the diagnostic criteria of heroin dependence yielded suggestive evidence for association between variants in the genes CCDC42 (coiled coil domain 42; p = 2.8×10⁻⁷) and BRSK2 (BR serine/threonine 2; p = 4.1×10⁻⁶). In addition, we found evidence for risk variants within the ARHGEF10 (Rho guanine nucleotide exchange factor 10) gene on chromosome 8 and variants in a region on chromosome 20q13, which is gene-poor but has a concentration of mRNAs and predicted miRNAs. Gene-based association analysis identified genome-wide significant association between variants in CCDC42 and heroin addiction. Additionally, when we investigated shared risk variants between heroin addiction and risk of other addiction-related and psychiatric phenotypes using polygenic risk scores, we found a suggestive relationship with variants predicting tobacco addiction, and a significant relationship with variants predicting schizophrenia. Our genome wide association study of heroin dependence provides data in a novel sample, with functionally plausible results and evidence of genetic data of value to the field. PMID:27936112

  9. Genome-wide mapping of 10 calving and fertility traits in Holstein dairy cattle with special regard to chromosome 18.

    PubMed

    Müller, M-P; Rothammer, S; Seichter, D; Russ, I; Hinrichs, D; Tetens, J; Thaller, G; Medugorac, I

    2017-03-01

    Over the last decades, a dramatic decrease in reproductive performance has been observed in Holstein cattle and fertility problems have become the most common reason for a cow to leave the herd. The premature removal of animals with high breeding values results in both economic and breeding losses. For efficient future Holstein breeding, the identification of loci associated with low fertility is of major interest and thus constitutes the aim of this study. To reach this aim, a genome-wide combined linkage disequilibrium and linkage analysis (cLDLA) was conducted using data on the following 10 calving and fertility traits in the form of estimated breeding values: days from first service to conception of heifers and cows, nonreturn rate on d 56 of heifers and cows, days from calving to first insemination, days open, paternal and maternal calving ease, paternal and maternal stillbirth. The animal data set contained 2,527 daughter-proven Holstein bulls from Germany that were genotyped with Illumina's BovineSNP50 BeadChip (Illumina Inc., San Diego, CA). For the cLDLA, 41,635 sliding windows of 40 adjacent single nucleotide polymorphisms (SNP) were used. At each window midpoint, a variance component analysis was executed using ASReml. The underlying mixed linear model included random quantitative trait locus (QTL) and polygenic effects. We identified 50 genome-wide significant QTL. The most significant peak was detected for direct calving ease at 59,179,424 bp on chromosome 18 (BTA18). Next, a mixed-linear model association (MLMA) analysis was conducted. A comparison of the cLDLA and MLMA results with special regard to BTA18 showed that the genome-wide most significant SNP from the MLMA was associated with the same trait and located on the same chromosome at 57,589,121 bp (i.e., about 1.5 Mb apart from the cLDLA peak). The results of 5 different cLDLA and 2 MLMA models, which included the fixed effects of either SNP or haplotypes, suggested that the cLDLA method
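
The sliding-window layout mentioned in the abstract above (windows of 40 adjacent SNPs, each analyzed at its midpoint) can be sketched as follows. The step of one SNP, the midpoint taken halfway between the two central SNPs, and the function name are assumptions made for illustration. The variance component analysis itself (ASReml in the study) is not reproduced here.

```python
def sliding_windows(positions, window_size=40, step=1):
    """Yield (start_index, midpoint_bp) for windows of adjacent SNPs on one chromosome.

    `positions` is a sorted list of SNP base-pair positions (hypothetical input).
    """
    half = window_size // 2
    for start in range(0, len(positions) - window_size + 1, step):
        left = positions[start + half - 1]   # last SNP of the left half
        right = positions[start + half]      # first SNP of the right half
        yield start, (left + right) / 2


# 50 made-up SNPs spaced 100 bp apart, starting at 1,000 bp.
positions = list(range(1000, 1000 + 50 * 100, 100))
windows = list(sliding_windows(positions))
print(len(windows), windows[0], windows[-1])
```

With a step of one SNP, a chromosome carrying n SNPs yields n - 39 windows, so chromosomes with fewer than 40 SNPs contribute none.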

  10. Genomic selection accuracy using multi-family prediction models in a wheat breeding program

    USDA-ARS's Scientific Manuscript database

    Genomic selection (GS) uses genome-wide molecular marker data to predict the genetic value of selection candidates in breeding programs. In plant breeding, the ability to produce large numbers of progeny per cross allows GS to be conducted within each family. However, this approach requires phenotyp...

  11. Genome-Wide Association Study for Indicator Traits of Sexual Precocity in Nellore Cattle

    PubMed Central

    Irano, Natalia; de Camargo, Gregório Miguel Ferreira; Costa, Raphael Bermal; Terakado, Ana Paula Nascimento; Magalhães, Ana Fabrícia Braga; Silva, Rafael Medeiros de Oliveira; Dias, Marina Mortati; Bignardi, Annaiza Braga; Baldi, Fernando; Carvalheiro, Roberto; de Oliveira, Henrique Nunes; de Albuquerque, Lucia Galvão

    2016-01-01

    The objective of this study was to perform a genome-wide association study (GWAS) to detect chromosome regions associated with indicator traits of sexual precocity in Nellore cattle. Data from Nellore animals belonging to farms that participate in the DeltaGen® and Paint® animal breeding programs were used. The traits used in this study were the occurrence of early pregnancy (EP) and scrotal circumference (SC). Data from 72,675 females and 83,911 males with phenotypes were used; of these, 1,770 females and 1,680 males were genotyped. The SNP effects were estimated with a single-step procedure (WssGBLUP) and the observed phenotypes were used as dependent variables. All animals with available genotypes and phenotypes, in addition to those with only phenotypic information, were used. A single-trait animal model was applied to predict breeding values and the solutions of SNP effects were obtained from these breeding values. The results of GWAS are reported as the proportion of variance explained by windows with 150 adjacent SNPs. The 10 windows that explained the highest proportion of variance were identified. The results of this study indicate the polygenic nature of EP and SC, demonstrating that the indicator traits of sexual precocity studied here are probably controlled by many genes, including some of moderate effect. The 10 windows with large effects obtained for EP are located on chromosomes 5, 6, 7, 14, 18, 21 and 27, and together explained 7.91% of the total genetic variance. For SC, these windows are located on chromosomes 4, 8, 11, 13, 14, 19, 22 and 23, explaining 6.78% of total variance. The GWAS identified chromosome regions associated with EP and SC. The identification of these regions contributes to a better understanding and evaluation of these traits, and points to candidate genes for future investigation of causal mutations. PMID:27494397

  12. A Genome-Wide Perspective on Metabolism.

    PubMed

    Rauch, Alexander; Mandrup, Susanne

    2016-01-01

    Mammals have at least 210 histologically diverse cell types (Alberts, Molecular biology of the cell. Garland Science, New York, 2008) and the number would be even higher if functional differences are taken into account. The genome in each of these cell types is differentially programmed to express the specific set of genes needed to fulfill the phenotypical requirements of the cell. Furthermore, in each of these cell types, the gene program can be differentially modulated by exposure to external signals such as hormones or nutrients. The basis for the distinct gene programs relies on cell type-selective activation of transcriptional enhancers, which in turn are particularly sensitive to modulation. Until recently we had only fragmented insight into the regulation of a few of these enhancers; however, the recent advances in high-throughput sequencing technologies have enabled the development of a large number of technologies that can be used to obtain genome-wide insight into how genomes are reprogrammed during development and in response to specific external signals. By applying such technologies, we have begun to reveal the cross-talk between metabolism and the genome, i.e., how genomes are reprogrammed in response to metabolites, and how the regulation of metabolic networks is coordinated at the genomic level.

  13. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10⁻⁵) were significantly different between Prakritis, without any confounding effect of stratification, after 10⁶ permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  14. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    PubMed

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and have a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1 × 10⁻⁵. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  15. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and have a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1 × 10⁻⁵. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  16. Genome-wide positioning of bivalent mononucleosomes.

    PubMed

    Sen, Subhojit; Block, Kirsten F; Pasini, Alice; Baylin, Stephen B; Easwaran, Hariharan

    2016-09-15

    Bivalent chromatin refers to overlapping regions containing activating histone H3 Lys4 trimethylation (H3K4me3) and inactivating H3K27me3 marks. Existence of such bivalent marks on the same nucleosome has only recently been suggested. Previous genome-wide efforts to characterize bivalent chromatin have focused primarily on individual marks to define overlapping zones of bivalency rather than mapping positions of truly bivalent mononucleosomes. Here, we developed an efficacious sequential ChIP technique for examining global positioning of individual bivalent nucleosomes. Using next generation sequencing approaches we show that although individual H3K4me3 and H3K27me3 marks overlap in broad zones, bivalent nucleosomes are focally enriched in the vicinity of the transcription start site (TSS). These seem to occupy the H2A.Z nucleosome positions previously described as salt-labile nucleosomes, and are correlated with low gene expression. Although the enrichment profiles of bivalent nucleosomes show a clear dependency on CpG island content, they demonstrate a stark anti-correlation with methylation status. We show that regional overlap of H3K4me3 and H3K27me3 chromatin tends to be upstream of the TSS, while bivalent nucleosomes with both marks are mainly promoter proximal near the TSS of CpG island-containing genes with poised/low expression. We discuss the implications of the focal enrichment of bivalent nucleosomes around the TSS on the poised chromatin state of promoters in stem cells.

  17. Genome Wide Methylome Alterations in Lung Cancer.

    PubMed

    Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme-based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents.

  18. Genome Wide Methylome Alterations in Lung Cancer

    PubMed Central

    Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme-based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  19. Genome-wide DNA methylation profile in mungbean

    PubMed Central

    Kang, Yang Jae; Bae, Ahra; Shim, Sangrea; Lee, Taeyoung; Lee, Jayern; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

    2017-01-01

    DNA methylation on cytosine residues is known to affect gene expression and is potentially responsible for the phenotypic variations among different crop cultivars. Here, we present the whole-genome DNA methylation profiles and assess the potential effects of single nucleotide polymorphisms (SNPs) for two mungbean cultivars, Sunhwanogdu (VC1973A) and Kyunggijaerae#5 (V2984). By measuring the DNA methylation levels in leaf tissue with the bisulfite sequencing (BSseq) approach, we show both the frequencies of the various types of DNA methylation and the distribution of weighted gene methylation levels. SNPs that cause nucleotide changes from/to CHH – where C is cytosine and H is any other nucleotide – were found to affect DNA methylation status in VC1973A and V2984. In order to better understand the correlation between gene expression and DNA methylation levels, we surveyed gene expression in leaf tissues of VC1973A and V2984 using RNAseq. Transcript expressions of paralogous genes were controlled by DNA methylation within the VC1973A genome. Moreover, genes that were differentially expressed between the two cultivars showed distinct DNA methylation patterns. Our mungbean genome-wide methylation profiles will be valuable resources for understanding the phenotypic variations between different cultivars, as well as for molecular breeding. PMID:28084412

  20. Genome-wide distribution of genetic diversity and linkage disequilibrium in elite sugar beet germplasm

    PubMed Central

    2011-01-01

    BACKGROUND: Characterization of population structure and genetic diversity of germplasm is essential for the efficient organization and utilization of breeding material. The objectives of this study were to (i) explore the patterns of population structure in the pollen parent heterotic pool using different methods, (ii) investigate the genome-wide distribution of genetic diversity, and (iii) assess the extent and genome-wide distribution of linkage disequilibrium (LD) in elite sugar beet germplasm. RESULTS: A total of 264 and 238 inbred lines from the yield type and sugar type inbreds of the pollen parent heterotic gene pools, respectively, which had been genotyped with 328 SNP markers, were used in this study. Two distinct subgroups were detected based on different statistical methods within the elite sugar beet germplasm set, which was in accordance with its breeding history. MCLUST based on principal components, principal coordinates, or lapvectors had high correspondence with the germplasm type information as well as the assignment by STRUCTURE, which indicated that these methods might be alternatives to STRUCTURE for population structure analysis. Gene diversity and modified Rogers' distance between the examined germplasm types varied considerably across the genome, which might be due to artificial selection. This observation indicates that population genetic approaches could be used to identify candidate genes for the traits under selection. Because r² > 0.8 is required to detect marker-phenotype associations explaining less than 1% of the phenotypic variance, our observation of a low proportion of SNP loci pairs showing such levels of LD suggests that the number of markers has to be dramatically increased for powerful genome-wide association mapping. CONCLUSIONS: We provided a genome-wide distribution map of genetic diversity and linkage disequilibrium for the elite sugar beet germplasm, which is useful for the application of genome-wide association

  1. Breeding Value of Primary Synthetic Wheat Genotypes for Grain Yield

    PubMed Central

    Jafarzadeh, Jafar; Bonnett, David; Jannink, Jean-Luc; Akdemir, Deniz; Dreisigacker, Susanne; Sorrells, Mark E.

    2016-01-01

    To introduce new genetic diversity into the bread wheat gene pool from its progenitor, Aegilops tauschii (Coss.) Schmalh, 33 primary synthetic hexaploid wheat genotypes (SYN) were crossed to 20 spring bread wheat (BW) cultivars at the International Wheat and Maize Improvement Center. Modified single seed descent was used to develop 97 populations with 50 individuals per population using first back-cross, biparental, and three-way crosses. Individuals from each cross were selected for short stature, early heading, flowering and maturity, minimal lodging, and free threshing. Yield trials were conducted under irrigated, drought, and heat-stress conditions from 2011 to 2014 in Ciudad Obregon, Mexico. Genomic estimated breeding values (GEBVs) of parents and synthetic derived lines (SDLs) were estimated using a genomic best linear unbiased prediction (GBLUP) model with markers in each trial. In each environment, there were SDLs that had higher GEBVs than their recurrent BW parent for yield. The GEBVs of BW parents for yield ranged from -0.32 in heat to 1.40 in irrigated trials. The range of the SYN parent GEBVs for yield was from -2.69 in the irrigated to 0.26 in the heat trials and were mostly negative across environments. The contribution of the SYN parents to improved grain yield of the SDLs was highest under heat stress, with an average GEBV for the top 10% of the SDLs of 0.55 while the weighted average GEBV of their corresponding recurrent BW parents was 0.26. Using the pedigree-based model, the accuracy of genomic prediction for yield was 0.42, 0.43, and 0.49 in the drought, heat and irrigated trials, respectively, while for the marker-based model these values were 0.43, 0.44, and 0.55. The SYN parents introduced novel diversity into the wheat gene pool. Higher GEBVs of progenies were due to introgression and retention of some positive alleles from SYN parents. PMID:27656893

  2. Genome-Wide Association Study of Septoria tritici Blotch Resistance in Ethiopian Durum Wheat Landraces.

    PubMed

    Kidane, Yosef G; Hailemariam, Bogale N; Mengistu, Dejene K; Fadda, Carlo; Pè, Mario Enrico; Dell'Acqua, Matteo

    2017-01-01

    Septoria tritici blotch (STB) is a devastating fungal disease affecting durum and bread wheat cultivation worldwide. The identification, development, and employment of resistant wheat genetic material is the key to overcoming costs and limitations of fungicide treatments. The search for resistance sources in untapped genetic material may speed up the deployment of STB genetic resistance in the field. Ethiopian durum wheat landraces represent a valuable source of such diversity. In this study, 318 Ethiopian durum wheat genotypes, for the most part traditional landraces, were phenotyped for resistance to different aspects of STB infection. Phenology, yield and yield component traits were concurrently measured in the collection. Here we describe the distribution of STB resistance traits in modern varieties and in landraces, and the relation existing between STB resistance and other agronomic traits. STB resistance sources were found in landraces as well as in modern varieties tested, suggesting the presence of alleles of breeding relevance. The genetic material was genotyped with more than 16 thousand genome-wide polymorphic markers to describe the linkage disequilibrium and genetic structure existing within the panel of genotypes, and a genome-wide association (GWA) study was run to allow the identification of genomic loci involved in STB resistance. High diversity and low genetic structure in the panel allowed high efficiency GWA. The GWA scan detected five major putative QTL for STB resistance, only partially overlapping those already reported in the wheat literature. We report four putative loci for Septoria resistance with no match in previous literature: two highly significant ones on Chr 3A and 5A, and two suggestive ones on Chr 4B and 5B. Markers underlying these QTL explained as much as 10% of the phenotypic variance for disease resistance. We found three cases in which putative QTL for agronomic traits overlapped marker trait association deriving from STB GWA

  3. Bioinformatics challenges in genome-wide association studies (GWAS).

    PubMed

    De, Rishika; Bush, William S; Moore, Jason H

    2014-01-01

    Genome-wide association studies (GWAS) are a powerful tool for investigators to examine the human genome to detect genetic risk factors, reveal the genetic architecture of diseases and open up new opportunities for treatment and prevention. However, despite their successes, GWAS have not been able to identify genetic loci that are effective classifiers of disease, limiting their value for genetic testing. This chapter highlights the challenges that lie ahead for GWAS in better identifying disease risk predictors, and how we may address them. In this regard, we review basic concepts regarding GWAS, the technologies used for capturing genetic variation, the missing heritability problem, the need for efficient study design especially for replication efforts, reducing the bias introduced into a dataset, and how to utilize new resources available, such as electronic medical records. We also look to what lies ahead for the field, and the approaches that can be taken to realize the full potential of GWAS.

  4. A Discovery Genome-Wide Association Study of Entrepreneurship

    ERIC Educational Resources Information Center

    Quaye, Lydia; Nicolaou, Nicos; Shane, Scott; Mangino, Massimo

    2012-01-01

    To identify specific genetic variants influencing the phenotype of entrepreneurship, we conducted a genome-wide association study (GWAS) with 3,933 Caucasian females from the TwinsUK Adult Twin Registry. Following stringent genotype quality control, GWAF (genome-wide association analyses for family data) software was used to assess the association…

  5. Assessing genomic selection prediction accuracy in a dynamic barley breeding

    USDA-ARS's Scientific Manuscript database

    Genomic selection is a method to improve quantitative traits in crops and livestock by estimating breeding values of selection candidates using phenotype and genome-wide marker data sets. Prediction accuracy has been evaluated through simulation and cross-validation, however validation based on prog...

  6. Genome-wide discovery of loci influencing chemotherapy cytotoxicity.

    PubMed

    Watters, James W; Kraja, Aldi; Meucci, Melissa A; Province, Michael A; McLeod, Howard L

    2004-08-10

    Little is known about the heritability of chemotherapy activity or the identity of genes that may enable the individualization of cancer chemotherapy. Although numerous genes are likely to influence chemotherapy response, current candidate gene-based pharmacogenetics approaches require a priori knowledge and the selection of a small number of candidate genes for hypothesis testing. In this study, an ex vivo familial genetics strategy using lymphoblastoid cells derived from Centre d'Etude du Polymorphisme Humain reference pedigrees was used to discover genetic determinants of chemotherapy cytotoxicity. Cytotoxicity to the mechanistically distinct chemotherapy agents 5-fluorouracil and docetaxel were shown to be heritable traits, with heritability values ranging from 0.26 to 0.65 for 5-fluorouracil and 0.21 to 0.70 for docetaxel, varying with dose. Genome-wide linkage analysis was also used to map a quantitative trait locus influencing the cellular effects of 5-fluorouracil to chromosome 9q13-q22 [logarithm of odds (LOD) = 3.44], and two quantitative trait loci influencing the cellular effects of docetaxel to chromosomes 5q11-21 (LOD = 2.21) and 9q13-q22 (LOD = 2.73). Finally, 5-fluorouracil and docetaxel were shown to cause apoptotic cell death involving caspase-3 cleavage in Centre d'Etude du Polymorphisme Humain lymphoblastoid cells. This study identifies genomic regions likely to harbor genes important for chemotherapy cytotoxicity using genome-wide linkage analysis in human pedigrees and provides a widely applicable strategy for pharmacogenomic discovery without the requirement for a priori candidate gene selection.

  7. Genome-Wide Expression Profiling of Complex Regional Pain Syndrome

    PubMed Central

    Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung

    2013-01-01

    Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4 × 10⁻⁴). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS, may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504

  8. Systems-Level Analysis of Genome-Wide Association Data

    PubMed Central

    Farber, Charles R.

    2013-01-01

    Genome-wide association studies (GWAS) have emerged as the method of choice for identifying common variants affecting complex disease. In a GWAS, particular attention is placed, for obvious reasons, on single-nucleotide polymorphisms (SNPs) that exceed stringent genome-wide significance thresholds. However, it is expected that many SNPs with only nominal evidence of association (e.g., P < 0.05) truly influence disease. Efforts to extract additional biological information from entire GWAS datasets have primarily focused on pathway-enrichment analyses. However, these methods suffer from a number of limitations and typically fail to lead to testable hypotheses. To evaluate alternative approaches, we performed a systems-level analysis of GWAS data using weighted gene coexpression network analysis. A weighted gene coexpression network was generated for 1918 genes harboring SNPs that displayed nominal evidence of association (P ≤ 0.05) from a GWAS of bone mineral density (BMD) using microarray data on circulating monocytes isolated from individuals with extremely low or high BMD. Thirteen distinct gene modules were identified, each comprising coexpressed and highly interconnected GWAS genes. Through the characterization of module content and topology, we illustrate how network analysis can be used to discover disease-associated subnetworks and characterize novel interactions for genes with a known role in the regulation of BMD. In addition, we provide evidence that network metrics can be used as a prioritizing tool when selecting genes and SNPs for replication studies. Our results highlight the advantages of using systems-level strategies to add value to and inform GWAS. PMID:23316444

  9. A Genome-Wide Association Study (GWAS) for Bronchopulmonary Dysplasia

    PubMed Central

    Wang, Hui; St. Julien, Krystal R.; Stevenson, David K.; Hoffmann, Thomas J.; Witte, John S.; Lazzeroni, Laura C.; Krasnow, Mark A.; Quaintance, Cecele C.; Oehlert, John W.; Jelliffe-Pawlowski, Laura L.; Gould, Jeffrey B.; Shaw, Gary M.

    2013-01-01

    OBJECTIVE: Twin studies suggest that heritability of moderate-severe bronchopulmonary dysplasia (BPD) is 53% to 79%; we conducted a genome-wide association study (GWAS) to identify genetic variants associated with the risk for BPD. METHODS: The discovery GWAS was completed on 1726 very low birth weight infants (gestational age = 25 0/7 to 29 6/7 weeks) who had a minimum of 3 days of intermittent positive pressure ventilation and were in the hospital at 36 weeks’ postmenstrual age. At 36 weeks’ postmenstrual age, moderate-severe BPD cases (n = 899) were defined as requiring continuous supplemental oxygen, whereas controls (n = 827) inhaled room air. An additional 795 comparable infants (371 cases, 424 controls) were a replication population. Genomic DNA from case and control newborn screening bloodspots was used for the GWAS. The replication study interrogated single-nucleotide polymorphisms (SNPs) identified in the discovery GWAS and those within the HumanExome beadchip. RESULTS: Genotyping using genomic DNA was successful. We did not identify SNPs associated with BPD at the genome-wide significance level (5 × 10⁻⁸) and no SNP identified in previous studies reached statistical significance (Bonferroni-corrected P value threshold .0018). Pathway analyses were not informative. CONCLUSIONS: We did not identify genomic loci or pathways that account for the previously described heritability for BPD. Potential explanations include causal mutations that are genetic variants and were not assayed or are mapped to many distributed loci, inadequate sample size, race ethnicity of our study population, or case-control differences investigated are not attributable to underlying common genetic variation. PMID:23897914

  10. Beef cattle body temperature during climatic stress: a genome-wide association study

    NASA Astrophysics Data System (ADS)

    Howard, Jeremy T.; Kachman, Stephen D.; Snelling, Warren M.; Pollak, E. John; Ciobanu, Daniel C.; Kuehn, Larry A.; Spangler, Matthew L.

    2014-09-01

    Cattle are reared in diverse environments and collecting phenotypic body temperature (BT) measurements to characterize BT variation across diverse environments is difficult and expensive. To better understand the genetic basis of BT regulation, a genome-wide association study was conducted utilizing crossbred steers and heifers totaling 239 animals of unknown pedigree and breed fraction. During predicted extreme heat and cold stress events, hourly tympanic and vaginal BT devices were placed in steers and heifers, respectively. Individuals were genotyped with the BovineSNP50K_v2 assay and data analyzed using Bayesian models for area under the curve (AUC), a measure of BT over time, using hourly BT observations summed across 5-days (AUC summer 5-day (AUCS5D) and AUC winter 5-day (AUCW5D)). Posterior heritability estimates were moderate to high and were estimated to be 0.68 and 0.21 for AUCS5D and AUCW5D, respectively. Moderately positive correlations between direct genomic values for AUCS5D and AUCW5D (0.40) were found, although a small percentage of the top 5 % 1-Mb windows were in common. Different sets of genes were associated with BT during winter and summer, thus simultaneous selection for animals tolerant to both heat and cold appears possible.

  11. Identification of genes related to intramuscular fat content of pig using genome-wide association study.

    PubMed

    Won, Sohyoung; Jung, Jaehoon; Park, Eungwoo; Kim, H B

    2017-06-27

    The aim of this study is to identify SNPs and genes related to pig IMF and estimate the heritability of IMF. A genome-wide association study (GWAS) on 704 inbred Berkshires was performed for intramuscular fat content (IMF). To account for inbreeding among samples, associations of the SNPs with IMF were tested as random effects in a mixed linear model using the genetic relationship matrix by GEMMA. Significant genes were compared with reported pig IMF QTL regions, and functional classification of the identified genes was also performed. Heritability of IMF was estimated with the GCTA tool. A total of 365 SNPs were significant at a p-value cutoff of <0.01, and these 365 SNPs were annotated across 120 genes. Twenty-five genes were in pig IMF QTL regions. BMPER, FOXO1, EDAR, RNF149, CD40, PTPN1, SOX9, MYC, and MIF were related to the mitogen-activated protein kinase (MAPK) pathway, which regulates differentiation into adipocytes. These genes and the genes mapped on QTLs could be candidate genes affecting IMF. Heritability of IMF was estimated as 0.52, which is relatively high, suggesting that a considerable portion of the total variance of IMF is explained by the SNP information. Our results can contribute to breeding pigs with better IMF and, therefore, to producing pork with better sensory qualities.

  12. Genome-Wide Detection of Selective Signatures in Chicken through High Density SNPs

    PubMed Central

    Liu, Zhuang; Sun, Congjiao; Qu, Liang; Wang, Kehua; Yang, Ning

    2016-01-01

    Chicken is recognized as an excellent model for studies of genetic mechanism of phenotypic and genomic evolution, with large effective population size and strong human-driven selection. In the present study, we performed Extended Haplotype Homozygosity (EHH) tests to identify significant core regions employing 600K SNP Chicken chip in an F2 population of 1,534 hens, which was derived from reciprocal crosses between White Leghorn and Dongxiang chicken. Results indicated that a total of 49,151 core regions with an average length of 9.79 Kb were identified, which occupied approximately 52.15% of genome across all autosomes, and 806 significant core regions attracted us mostly. Genes in candidate regions may experience positive selection and were considered to have possible influence on beneficial economic traits. A panel of genes including AASDHPPT, GDPD5, PAR3, SOX6, GPC1 and a signal pathway of AKT1 were detected with the most extreme P-values. Further enrichment analyses indicated that these genes were associated with immune function, sensory organ development and neurogenesis, and may have experienced positive selection in chicken. Moreover, some of core regions exactly overlapped with genes excavated in our previous GWAS, suggesting that these genes have undergone positive selection may affect egg production. Findings in our study could draw a comparatively integrate genome-wide map of selection signature in the chicken genome, and would be worthy for explicating the genetic mechanisms of phenotypic diversity in poultry breeding. PMID:27820849

  13. Genome-Wide Detection of Selective Signatures in Chicken through High Density SNPs.

    PubMed

    Liu, Zhuang; Sun, Congjiao; Qu, Liang; Wang, Kehua; Yang, Ning

    2016-01-01

    Chicken is recognized as an excellent model for studies of genetic mechanism of phenotypic and genomic evolution, with large effective population size and strong human-driven selection. In the present study, we performed Extended Haplotype Homozygosity (EHH) tests to identify significant core regions employing 600K SNP Chicken chip in an F2 population of 1,534 hens, which was derived from reciprocal crosses between White Leghorn and Dongxiang chicken. Results indicated that a total of 49,151 core regions with an average length of 9.79 Kb were identified, which occupied approximately 52.15% of genome across all autosomes, and 806 significant core regions attracted us mostly. Genes in candidate regions may experience positive selection and were considered to have possible influence on beneficial economic traits. A panel of genes including AASDHPPT, GDPD5, PAR3, SOX6, GPC1 and a signal pathway of AKT1 were detected with the most extreme P-values. Further enrichment analyses indicated that these genes were associated with immune function, sensory organ development and neurogenesis, and may have experienced positive selection in chicken. Moreover, some of core regions exactly overlapped with genes excavated in our previous GWAS, suggesting that these genes have undergone positive selection may affect egg production. Findings in our study could draw a comparatively integrate genome-wide map of selection signature in the chicken genome, and would be worthy for explicating the genetic mechanisms of phenotypic diversity in poultry breeding.

  14. Beef cattle body temperature during climatic stress: a genome-wide association study.

    PubMed

    Howard, Jeremy T; Kachman, Stephen D; Snelling, Warren M; Pollak, E John; Ciobanu, Daniel C; Kuehn, Larry A; Spangler, Matthew L

    2014-09-01

    Cattle are reared in diverse environments and collecting phenotypic body temperature (BT) measurements to characterize BT variation across diverse environments is difficult and expensive. To better understand the genetic basis of BT regulation, a genome-wide association study was conducted utilizing crossbred steers and heifers totaling 239 animals of unknown pedigree and breed fraction. During predicted extreme heat and cold stress events, hourly tympanic and vaginal BT devices were placed in steers and heifers, respectively. Individuals were genotyped with the BovineSNP50K_v2 assay and data analyzed using Bayesian models for area under the curve (AUC), a measure of BT over time, using hourly BT observations summed across 5-days (AUC summer 5-day (AUCS5D) and AUC winter 5-day (AUCW5D)). Posterior heritability estimates were moderate to high and were estimated to be 0.68 and 0.21 for AUCS5D and AUCW5D, respectively. Moderately positive correlations between direct genomic values for AUCS5D and AUCW5D (0.40) were found, although a small percentage of the top 5% 1-Mb windows were in common. Different sets of genes were associated with BT during winter and summer, thus simultaneous selection for animals tolerant to both heat and cold appears possible.
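
    The AUC trait described in this abstract (hourly BT observations summed across 5 days) can be illustrated with a minimal sketch. The function name, the requirement of exactly 24 readings per day, and the sample readings are assumptions made for illustration; the abstract does not say how missing hours were handled or which units were used.

```python
def auc_5day(hourly_bt, days=5):
    """Sum hourly body-temperature readings over a fixed window of days.

    Assumes (hypothetically) one reading per hour and no missing values.
    """
    hours_needed = days * 24
    if len(hourly_bt) != hours_needed:
        raise ValueError(
            f"expected {hours_needed} hourly readings, got {len(hourly_bt)}"
        )
    return sum(hourly_bt)


# Hypothetical animal with a constant reading of 38.5 for 120 hours.
constant_readings = [38.5] * 120
print(auc_5day(constant_readings))
```

    Because the trait is a plain sum, an animal that runs warmer for more hours accumulates a larger AUC, which is what makes it a measure of BT over time rather than at a single time point.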

  15. Genome-Wide Association Study of a Varroa-Specific Defense Behavior in Honeybees (Apis mellifera)

    PubMed Central

    Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert

    2016-01-01

    Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes. PMID:26774061

  16. Genome-Wide Association Study of a Varroa-Specific Defense Behavior in Honeybees (Apis mellifera).

    PubMed

    Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert; Bienefeld, Kaspar

    2016-05-01

    Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes.
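
    The false-discovery rate correction mentioned in this abstract can be sketched as follows. The abstract does not name the procedure used, so the Benjamini-Hochberg step-up procedure is an assumption here, and the function name and p-values are hypothetical.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return one boolean per p-value: True if rejected at FDR level alpha.

    Benjamini-Hochberg step-up procedure (assumed, not taken from the
    abstract): find the largest rank k such that p_(k) <= (k / m) * alpha,
    then reject the k smallest p-values.
    """
    m = len(p_values)
    # Indices of the p-values, ordered from smallest to largest p.
    order = sorted(range(m), key=lambda i: p_values[i])

    max_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_rank = rank

    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_rank:
            rejected[idx] = True
    return rejected


# Hypothetical per-SNP p-values, not data from the study.
snp_p_values = [0.039, 0.001, 0.5, 0.008]
print(benjamini_hochberg(snp_p_values, alpha=0.05))
```

    With these four hypothetical p-values, only the two smallest (0.001 and 0.008) fall under their rank-scaled thresholds, so only those two markers would be declared significant.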

  17. [Phenotypic trends and breeding values for canine congenital sensorineural deafness in Dalmatian dogs].

    PubMed

    Blum, Meike; Distl, Ottmar

    2014-01-01

    In the present study, breeding values for canine congenital sensorineural deafness, the presence of blue eyes and patches have been predicted using multivariate animal models to test the reliability of the breeding values for planned matings. The dataset consisted of 6669 German Dalmatian dogs born between 1988 and 2009. Data were provided by the Dalmatian kennel clubs which are members of the German Association for Dog Breeding and Husbandry (VDH). The hearing status for all dogs was evaluated using brainstem auditory evoked potentials. The reliability using the prediction error variance of breeding values and the realized reliability of the prediction of the phenotype of future progeny born in each one year between 2006 and 2009 were used as parameters to evaluate the goodness of prediction through breeding values. All animals from the previous birth years were used for prediction of the breeding values of the progeny in each of the up-coming birth years. The breeding values based on pedigree records achieved an average reliability of 0.19 for the future 1951 progeny. The predictive accuracy (R2) for the hearing status of single future progeny was at 1.3%. Combining breeding values for littermates increased the predictive accuracy to 3.5%. Corresponding values for maternal and paternal half-sib groups were at 3.2 and 7.3%. The use of breeding values for planned matings increases the phenotypic selection response over mass selection. The breeding values of sires may be used for planned matings because reliabilities and predictive accuracies for future paternal progeny groups were highest.

  18. Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms

    USDA-ARS's Scientific Manuscript database

    Background: A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a...

  19. Identification and characterization of a genome-wide significant region associated with red blood cell phenotypes in domestic sheep

    USDA-ARS's Scientific Manuscript database

    A genome wide association study (GWAS) investigating red blood cell (RBC) phenotypes was performed with over 500 domestic sheep (Ovis aries) from three economically important breeds in the US (Columbia, Polypay, and Rambouillet). A single nucleotide polymorphism (SNP, hereafter the discovery SNP) sh...

  20. Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms

    USDA-ARS's Scientific Manuscript database

    Background A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a ...

  1. Genome-wide association mapping of fusarium head blight resistance in wheat (Triticum aestivum L.) using genotyping by sequencing

    USDA-ARS's Scientific Manuscript database

    Fusarium head blight (FHB) is one of the most important wheat diseases worldwide and host resistance displays complex genetic control. A genome-wide association study (GWAS) was performed on 273 winter wheat breeding lines from the mid-western and eastern regions of the US to identify chromosomal re...

  2. Genome Wide Allele Frequency Fingerprints (GWAFFs) of Populations via Genotyping by Sequencing

    PubMed Central

    Byrne, Stephen; Czaban, Adrian; Studer, Bruno; Panitz, Frank; Bendixen, Christian; Asp, Torben

    2013-01-01

    Genotyping-by-Sequencing (GBS) is an excellent tool for characterising genetic variation between plant genomes. To date, its use has been reported only for genotyping of single individuals. However, there are many applications where resolving allele frequencies within populations on a genome-wide scale would be very powerful, examples include the breeding of outbreeding species, varietal protection in outbreeding species, monitoring changes in population allele frequencies. This motivated us to test the potential to use GBS to evaluate allele frequencies within populations. Perennial ryegrass is an outbreeding species, and breeding programs are based upon selection on populations. We tested two restriction enzymes for their efficiency in complexity reduction of the perennial ryegrass genome. The resulting profiles have been termed Genome Wide Allele Frequency Fingerprints (GWAFFs), and we have shown how these fingerprints can be used to distinguish between plant populations. Even at current costs and throughput, using sequencing to directly evaluate populations on a genome-wide scale is viable. GWAFFs should find many applications, from varietal development in outbreeding species right through to playing a role in protecting plant breeders’ rights. PMID:23469194

  3. Genome-Wide Association for Growth Traits in Canchim Beef Cattle

    PubMed Central

    Buzanskas, Marcos E.; Grossi, Daniela A.; Ventura, Ricardo V.; Schenkel, Flávio S.; Sargolzaei, Mehdi; Meirelles, Sarah L. C.; Mokry, Fabiana B.; Higa, Roberto H.; Mudadu, Maurício A.; da Silva, Marcos V. G. Barbosa.; Niciura, Simone C. M.; Júnior, Roberto A. A. Torres.; Alencar, Maurício M.; Regitano, Luciana C. A.; Munari, Danísio P.

    2014-01-01

    Studies are being conducted on the applicability of genomic data to improve the accuracy of the selection process in livestock, and genome-wide association studies (GWAS) provide valuable information to enhance the understanding on the genetics of complex traits. The aim of this study was to identify genomic regions and genes that play roles in birth weight (BW), weaning weight adjusted for 210 days of age (WW), and long-yearling weight adjusted for 420 days of age (LYW) in Canchim cattle. GWAS were performed by means of the Generalized Quasi-Likelihood Score (GQLS) method using genotypes from the BovineHD BeadChip and estimated breeding values for BW, WW, and LYW. Data consisted of 285 animals from the Canchim breed and 114 from the MA genetic group (derived from crossings between Charolais sires and ½ Canchim + ½ Zebu dams). After applying a false discovery rate correction at a 10% significance level, a total of 4, 12, and 10 SNPs were significantly associated with BW, WW, and LYW, respectively. These SNPs were surveyed to their corresponding genes or to surrounding genes within a distance of 250 kb. The genes DPP6 (dipeptidyl-peptidase 6) and CLEC3B (C-type lectin domain family 3 member B) were highlighted, considering its functions on the development of the brain and skeletal system, respectively. The GQLS method identified regions on chromosome associated with birth weight, weaning weight, and long-yearling weight in Canchim and MA animals. New candidate regions for body weight traits were detected and some of them have interesting biological functions, of which most have not been previously reported. The observation of QTL reports for body weight traits, covering areas surrounding the genes (SNPs) herein identified provides more evidence for these associations. Future studies targeting these areas could provide further knowledge to uncover the genetic architecture underlying growth traits in Canchim cattle. PMID:24733441

  4. Genome-wide association for heifer reproduction and calf performance traits in beef cattle.

    PubMed

    Akanno, Everestus C; Plastow, Graham; Fitzsimmons, Carolyn; Miller, Stephen P; Baron, Vern; Ominski, Kimberly; Basarab, John A

    2015-12-01

    The aim of this study was to identify SNP markers that associate with variation in beef heifer reproduction and performance of their calves. A genome-wide association study was performed by means of the generalized quasi-likelihood score (GQLS) method using heifer genotypes from the BovineSNP50 BeadChip and estimated breeding values for pre-breeding body weight (PBW), pregnancy rate (PR), calving difficulty (CD), age at first calving (AFC), calf birth weight (BWT), calf weaning weight (WWT), and calf pre-weaning average daily gain (ADG). Data consisted of 785 replacement heifers from three Canadian research herds, namely Brandon Research Centre, Brandon, Manitoba, University of Alberta Roy Berg Kinsella Ranch, Kinsella, Alberta, and Lacombe Research Centre, Lacombe, Alberta. After applying a false discovery rate correction at a 5% significance level, a total of 4, 3, 3, 9, 6, 2, and 1 SNPs were significantly associated with PBW, PR, CD, AFC, BWT, WWT, and ADG, respectively. These SNPs were located on chromosomes 1, 5-7, 9, 13-16, 19-21, 24, 25, and 27-29. Chromosomes 1, 5, and 24 had SNPs with pleiotropic effects. New significant SNPs that impact functional traits were detected, many of which have not been previously reported. The results of this study support quantitative genetic studies related to the inheritance of these traits, and provides new knowledge regarding beef cattle quantitative trait loci effects. The identification of these SNPs provides a starting point to identify genes affecting heifer reproduction traits and performance of their calves (BWT, WWT, and ADG). They also contribute to a better understanding of the biology underlying these traits and will be potentially useful in marker- and genome-assisted selection and management.

  5. Genome Wide Association Study Identifies New Loci Associated with Undesired Coat Color Phenotypes in Saanen Goats

    PubMed Central

    Martin, Pauline Marie; Palhière, Isabelle; Ricard, Anne; Tosser-Klopp, Gwenola; Rupp, Rachel

    2016-01-01

    This paper reports a quantitative genetics and genomic analysis of undesirable coat color patterns in goats. Two undesirable coat colors have routinely been recorded for the past 15 years in French Saanen goats. One fifth of Saanen females have been phenotyped “pink” (8.0%) or “pink neck” (11.5%) and consequently have not been included in the breeding program as elite animals. Heritability of the binary “pink” and “pink neck” phenotype, estimated from 103,443 females was 0.26 for “pink” and 0.21 for “pink neck”. Genome wide association studies (using haplotypes or single SNPs) were implemented using a daughter design of 810 Saanen goats sired by 9 Artificial Insemination bucks genotyped with the goatSNP50 chip. A highly significant signal (-log10 p-value = 10.2) was associated with the “pink neck” phenotype on chromosome 11, suggesting the presence of a major gene. Highly significant signals for the “pink” phenotype were found on chromosomes 5 and 13 (-log10 p-values of 7.2 and 7.7, respectively). The most significant SNP on chromosome 13 was in the ASIP gene region, well known for its association with coat color phenotypes. Nine significant signals were also found for both traits. The highest signal for each trait was detected by both single SNP and haplotype approaches, whereas the smaller signals were not consistently detected by the two methods. Altogether these results demonstrated a strong genetic control of the “pink” and “pink neck” phenotypes in French Saanen goats suggesting that SNP information could be used to identify and remove undesired colored animals from the breeding program. PMID:27030980
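
    The signal strengths in this abstract are reported on the -log10 scale, and a short conversion shows the p-values they correspond to. The helper name is hypothetical; the three scores are the ones quoted in the abstract.

```python
def p_from_neg_log10(neg_log10_p):
    """Convert a -log10(p) score back to the p-value it represents."""
    return 10.0 ** (-neg_log10_p)


# Scores quoted in the abstract, keyed by phenotype and chromosome.
reported_scores = [
    ("pink neck, chromosome 11", 10.2),
    ("pink, chromosome 5", 7.2),
    ("pink, chromosome 13", 7.7),
]

for label, score in reported_scores:
    print(f"{label}: p = {p_from_neg_log10(score):.2e}")
```

    A score of 10.2 corresponds to a p-value of roughly 6.3e-11, about a thousand times smaller than the p-values behind the scores of 7.2 and 7.7.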

  6. Genome-wide association for growth traits in Canchim beef cattle.

    PubMed

    Buzanskas, Marcos E; Grossi, Daniela A; Ventura, Ricardo V; Schenkel, Flávio S; Sargolzaei, Mehdi; Meirelles, Sarah L C; Mokry, Fabiana B; Higa, Roberto H; Mudadu, Maurício A; da Silva, Marcos V G Barbosa; Niciura, Simone C M; Torres, Roberto A A; Alencar, Maurício M; Regitano, Luciana C A; Munari, Danísio P

    2014-01-01

    Studies are being conducted on the applicability of genomic data to improve the accuracy of the selection process in livestock, and genome-wide association studies (GWAS) provide valuable information to enhance the understanding on the genetics of complex traits. The aim of this study was to identify genomic regions and genes that play roles in birth weight (BW), weaning weight adjusted for 210 days of age (WW), and long-yearling weight adjusted for 420 days of age (LYW) in Canchim cattle. GWAS were performed by means of the Generalized Quasi-Likelihood Score (GQLS) method using genotypes from the BovineHD BeadChip and estimated breeding values for BW, WW, and LYW. Data consisted of 285 animals from the Canchim breed and 114 from the MA genetic group (derived from crossings between Charolais sires and ½ Canchim + ½ Zebu dams). After applying a false discovery rate correction at a 10% significance level, a total of 4, 12, and 10 SNPs were significantly associated with BW, WW, and LYW, respectively. These SNPs were surveyed to their corresponding genes or to surrounding genes within a distance of 250 kb. The genes DPP6 (dipeptidyl-peptidase 6) and CLEC3B (C-type lectin domain family 3 member B) were highlighted, considering its functions on the development of the brain and skeletal system, respectively. The GQLS method identified regions on chromosome associated with birth weight, weaning weight, and long-yearling weight in Canchim and MA animals. New candidate regions for body weight traits were detected and some of them have interesting biological functions, of which most have not been previously reported. The observation of QTL reports for body weight traits, covering areas surrounding the genes (SNPs) herein identified provides more evidence for these associations. Future studies targeting these areas could provide further knowledge to uncover the genetic architecture underlying growth traits in Canchim cattle.

  7. Genome-wide characterization of genetic diversity and population structure in Secale

    PubMed Central

    2014-01-01

    Background: Numerous rye accessions are stored in ex situ genebanks worldwide. Little is known about the extent of genetic diversity contained in any of them and its relation to contemporary varieties, since to date rye genetic diversity studies had a very limited scope, analyzing few loci and/or few accessions. Development of high throughput genotyping methods for rye opened the possibility for genome wide characterizations of large accessions sets. In this study we used 1054 Diversity Array Technology (DArT) markers with defined chromosomal location to characterize genetic diversity and population structure in a collection of 379 rye accessions including wild species, landraces, cultivated materials, historical and contemporary rye varieties. Results: Average genetic similarity (GS) coefficients and average polymorphic information content (PIC) values varied among chromosomes. Comparison of chromosome specific average GS within and between germplasm sub-groups indicated regions of chromosomes 1R and 4R as being targeted by selection in current breeding programs. Bayesian clustering, principal coordinate analysis and Neighbor Joining clustering demonstrated that source and improvement status contributed significantly to the structure observed in the analyzed set of Secale germplasm. We revealed a relatively limited diversity in improved rye accessions, both historical and contemporary, as well as lack of correlation between clustering of improved accessions and geographic origin, suggesting common genetic background of rye accessions from diverse geographic regions and extensive germplasm exchange. Moreover, contemporary varieties were distinct from the remaining accessions. Conclusions: Our results point to an influence of reproduction methods on the observed diversity patterns and indicate potential of ex situ collections for broadening the genetic diversity in rye breeding programs. Obtained data show that DArT markers provide a realistic picture of the genetic

  8. Control selection options for genome-wide association studies in cohorts.

    PubMed

    Wacholder, Sholom; Rotunno, Melissa

    2009-03-01

    Investigators planning studies within cohorts have many options for choosing an efficient sampling design for genome-wide association and other molecular epidemiology studies. Consideration of person-year and proportional hazards analyses of full cohorts may add further insight into ramifications of different designs. Empirical evidence from genome-wide association studies can supplement intuition and simulations in comparing properties of various case-control designs within cohorts. Additional theoretical and empirical work, justification of sampling choice in publications, and consideration of context and scientific aims can improve designs and, thereby, increase the scientific value and cost effectiveness of future studies.

  9. Genome-Wide Association Mapping of Anther Extrusion in Hexaploid Spring Wheat

    PubMed Central

    Muqaddasi, Quddoos H.; Lohwasser, Ulrike; Nagel, Manuela; Börner, Andreas; Pillen, Klaus; Röder, Marion S.

    2016-01-01

    In a number of crop species hybrids are able to outperform line varieties. The anthers of the autogamous bread wheat plant are normally extruded post anthesis, a trait which is unfavourable for the production of F1 hybrid grain. Higher anther extrusion (AE) promotes cross fertilization for more efficient hybrid seed production. Therefore, this study aimed at the genetic dissection of AE by genome wide association studies (GWAS) and determination of the main effect QTL. We applied GWAS approach to identify DArT markers potentially linked to AE to unfold its genetic basis in a panel of spring wheat accessions. Phenotypic data were collected for three years and best linear unbiased estimate (BLUE) values were calculated across all years. The extent of the AE correlation between growing years and BLUE values ranged from r = +0.56 (2013 vs 2015) to 0.91 (2014 vs BLUE values). The broad sense heritability was 0.84 across all years. Six accessions displayed stable AE >80% across all the years. Genotyping data included 2,575 DArT markers (with minimum of 0.05 minor allele frequency applied). AE was influenced both by genotype and by the growing environment. In all, 131 significant marker trait associations (MTAs) (|log10 (P)| >FDR) were established for AE. AE behaved as a quantitative trait, with five consistently significant markers (significant across at least two years with a significant BLUE value) contributing a minor to modest proportion (4.29% to 8.61%) of the phenotypic variance and affecting the trait either positively or negatively. For this reason, there is potential for breeding for improved AE by gene pyramiding. The consistently significant markers linked to AE could be helpful for marker assisted selection to transfer AE to high yielding varieties allowing to promote the exploitation of hybrid-heterosis in the key crop wheat. PMID:27191600

  10. The past, present and future of genome-wide re-annotation

    PubMed Central

    Ouzounis, Christos A; Karp, Peter D

    2002-01-01

    Annotation, the process by which structural or functional information is inferred for genes or proteins, is crucial for obtaining value from genome sequences. We define the process of annotating a previously annotated genome sequence as 're-annotation', and examine the strengths and weaknesses of current manual and automatic genome-wide re-annotation approaches. PMID:11864365

  11. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits.

    PubMed

    Biazzi, Elisa; Nazzicari, Nelson; Pecetti, Luciano; Brummer, E Charles; Palmonari, Alberto; Tava, Aldo; Annicchiarico, Paolo

    2017-01-01

    Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3-0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits

  12. Genome-wide mapping reveals conservation of promoter DNA methylation following chicken domestication.

    PubMed

    Li, Qinghe; Wang, Yuanyuan; Hu, Xiaoxiang; Zhao, Yaofeng; Li, Ning

    2015-03-04

    It is well-known that environment influences DNA methylation, however, the extent of heritable DNA methylation variation following animal domestication remains largely unknown. Using meDIP-chip we mapped the promoter methylomes for 23,316 genes in muscle tissues of ancestral and domestic chickens. We systematically examined the variation of promoter DNA methylation in terms of different breeds, differentially expressed genes, SNPs and genes undergo genetic selection sweeps. While considerable changes in DNA sequence and gene expression programs were prevalent, we found that the inter-strain DNA methylation patterns were highly conserved in promoter region between the wild and domestic chicken breeds. Our data suggests a global preservation of DNA methylation between the wild and domestic chicken breeds in either a genome-wide or locus-specific scale in chick muscle tissues.

  13. Multiple Genes Related to Muscle Identified through a Joint Analysis of a Two-stage Genome-wide Association Study for Racing Performance of 1,156 Thoroughbreds.

    PubMed

    Shin, Dong-Hyun; Lee, Jin Woo; Park, Jong-Eun; Choi, Ik-Young; Oh, Hee-Seok; Kim, Hyeon Jeong; Kim, Heebal

    2015-06-01

    Thoroughbred, a relatively recent horse breed, is best known for its use in horse racing. Although myostatin (MSTN) variants have been reported to be highly associated with horse racing performance, the trait is more likely to be polygenic in nature. The purpose of this study was to identify genetic variants strongly associated with racing performance by using estimated breeding value (EBV) for race time as a phenotype. We conducted a two-stage genome-wide association study to search for genetic variants associated with the EBV. In the first stage of genome-wide association study, a relatively large number of markers (~54,000 single-nucleotide polymorphisms, SNPs) were evaluated in a small number of samples (240 horses). In the second stage, a relatively small number of markers identified to have large effects (170 SNPs) were evaluated in a much larger number of samples (1,156 horses). We also validated the SNPs related to MSTN known to have large effects on racing performance and found significant associations in the stage two analysis, but not in stage one. We identified 28 significant SNPs related to 17 genes. Among these, six genes have a function related to myogenesis and five genes are involved in muscle maintenance. To our knowledge, these genes are newly reported for the genetic association with racing performance of Thoroughbreds. It complements a recent horse genome-wide association studies of racing performance that identified other SNPs and genes as the most significant variants. These results will help to expand our knowledge of the polygenic nature of racing performance in Thoroughbreds.

  14. Mapping the sensory perception of apple using descriptive sensory evaluation in a genome wide association study

    PubMed Central

    Amyotte, Beatrice; Bowen, Amy J.; Banks, Travis; Rajcan, Istvan; Somers, Daryl J.

    2017-01-01

    Breeding apples is a long-term endeavour and it is imperative that new cultivars are selected to have outstanding consumer appeal. This study has taken the approach of merging sensory science with genome wide association analyses in order to map the human perception of apple flavour and texture onto the apple genome. The goal was to identify genomic associations that could be used in breeding apples for improved fruit quality. A collection of 85 apple cultivars was examined over two years through descriptive sensory evaluation by a trained sensory panel. The trained sensory panel scored randomized sliced samples of each apple cultivar for seventeen taste, flavour and texture attributes using controlled sensory evaluation practices. In addition, the apple collection was subjected to genotyping by sequencing for marker discovery. A genome wide association analysis suggested significant genomic associations for several sensory traits including juiciness, crispness, mealiness and fresh green apple flavour. The findings include previously unreported genomic regions that could be used in apple breeding and suggest that similar sensory association mapping methods could be applied in other plants. PMID:28231290

  15. Mapping the sensory perception of apple using descriptive sensory evaluation in a genome wide association study.

    PubMed

    Amyotte, Beatrice; Bowen, Amy J; Banks, Travis; Rajcan, Istvan; Somers, Daryl J

    2017-01-01

    Breeding apples is a long-term endeavour and it is imperative that new cultivars are selected to have outstanding consumer appeal. This study has taken the approach of merging sensory science with genome wide association analyses in order to map the human perception of apple flavour and texture onto the apple genome. The goal was to identify genomic associations that could be used in breeding apples for improved fruit quality. A collection of 85 apple cultivars was examined over two years through descriptive sensory evaluation by a trained sensory panel. The trained sensory panel scored randomized sliced samples of each apple cultivar for seventeen taste, flavour and texture attributes using controlled sensory evaluation practices. In addition, the apple collection was subjected to genotyping by sequencing for marker discovery. A genome wide association analysis suggested significant genomic associations for several sensory traits including juiciness, crispness, mealiness and fresh green apple flavour. The findings include previously unreported genomic regions that could be used in apple breeding and suggest that similar sensory association mapping methods could be applied in other plants.

  16. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    PubMed

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50® chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep. © 2016 Food and Agriculture Organization of the United Nations. Animal Genetics © 2016 Stichting International Foundation.

  17. Comparison of performance records and national breeding values as input into international genetic evaluation.

    PubMed

    Fikse, W F

    2004-08-01

    The purpose of this investigation was to compare accuracy and precision of variance components and breeding values for international genetic evaluations based on national breeding values or animal performance records. A conventional progeny test scheme was simulated for 3 countries. True breeding values and observations were generated specific to production environments. Two production environments were considered, and both balanced and unbalanced distribution of production environments over countries were considered. True breeding values for both production environments were generated as bivariate normal deviates, and low (0.70) and high (0.90) genetic correlations between performance in production environments were considered. Each cow had an observation in one country only. Performance records were generated as the sum of the true breeding value, a contemporary group effect, and a random residual. Eight generations of data were simulated, and the entire simulated data set was used to compare 3 methods for international genetic evaluation: 1) multiple-trait across-country evaluation based on national predicted breeding values of bulls (Mace), 2) international genetic evaluation across country using performance records, and 3) international genetic evaluation across production environment using performance records. Estimated genetic parameters were biased for all models in this study. Genetic correlations between countries were generally more biased for Mace than for the across-country analyses using performance records. Bias in within-country genetic variances was smaller for Mace. Even genetic parameters obtained with the international evaluation across production environment using performance records were biased, despite the fact that this model was closest to the true, simulated model. The root mean square error of predicted breeding values was similar between models for most of the situations considered. The difference between models was largest when the

  18. Genome wide association scan for chronic periodontitis implicates novel locus

    PubMed Central

    2014-01-01

    Background: There is evidence for a genetic contribution to chronic periodontitis. In this study, we conducted a genome wide association study among 866 participants of the University of Pittsburgh Dental Registry and DNA Repository, whose periodontal diagnosis ranged from healthy (N = 767) to severe chronic periodontitis (N = 99). Methods: Genotyping of over half a million single nucleotide polymorphisms was performed. Analyses were done twice, first in the complete dataset of all ethnicities, and second including only samples defined as self-reported Whites. From the top 100 results, twenty single nucleotide polymorphisms had consistent results in both analyses (borderline p-values ranging from 1E-05 to 1E-06) and were selected to be tested in two independent datasets derived from 1,460 individuals from Porto Alegre, and 359 from Rio de Janeiro, Brazil. Meta-analyses of the single nucleotide polymorphisms showing a trend for association in the independent datasets were performed. Results: The rs1477403 marker located on 16q22.3 showed suggestive association in the discovery phase and in the Porto Alegre dataset (p = 0.05). The meta-analysis suggested the less common allele decreases the risk of chronic periodontitis. Conclusions: Our data offer a clear hypothesis to be independently tested regarding the contribution of the 16q22.3 locus to chronic periodontitis. PMID:25008200

  19. Biostatistical aspects of genome-wide association studies.

    PubMed

    Ziegler, Andreas; König, Inke R; Thompson, John R

    2008-02-01

    To search the entire human genome for association is a novel and promising approach to unravelling the genetic basis of complex genetic diseases. In these genome-wide association studies (GWAs), several hundreds of thousands of single nucleotide polymorphisms (SNPs) are analyzed at the same time, posing substantial biostatistical and computational challenges. In this paper, we discuss a number of biostatistical aspects of GWAs in detail. We specifically consider quality control issues and show that signal intensity plots are a conditio sine qua non in today's GWAs. Approaches to detect and adjust for population stratification are briefly examined. We discuss different strategies aimed at tackling the problem of multiple testing, including adjustment of p-values, the false positive report probability and the false discovery rate. Another aspect of GWAs requiring special attention is the search for gene-gene and gene-environment interactions. We finally describe multistage approaches to GWAs. (c) 2008 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
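
    As an illustration of the multiple-testing strategies named in this abstract, the following minimal sketch adjusts a list of per-SNP p-values in two common ways. It assumes the Bonferroni correction as the p-value adjustment and the Benjamini-Hochberg step-up procedure for the false discovery rate; the abstract does not say which procedures the paper covers, and the function names are hypothetical.

```python
def bonferroni(p_values):
    """Bonferroni-adjusted p-values: multiply by the number of tests, cap at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def benjamini_hochberg(p_values):
    """Benjamini-Hochberg adjusted p-values (assumed FDR procedure)."""
    m = len(p_values)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    p = [0.01, 0.04, 0.03, 0.005]  # made-up p-values for four SNPs
    print(bonferroni(p))
    print(benjamini_hochberg(p))
```

    With hundreds of thousands of SNPs, the Bonferroni adjustment controls the family-wise error rate and is the more conservative of the two, while the Benjamini-Hochberg adjustment controls the expected proportion of false discoveries.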

  20. Heritability and genome-wide association mapping for supernumerary teats in French Alpine and Saanen dairy goats.

    PubMed

    Martin, Pauline; Palhière, Isabelle; Tosser-Klopp, Gwenola; Rupp, Rachel

    2016-11-01

    This paper reports a quantitative genetics and genomic analysis of undesired presence of supernumerary teats (SNT) in goats. Supernumerary teats are a problem in goat breeding as they can considerably impede machine milking efficiency, leading to increased milking time and injury. This phenotype has routinely been recorded for the past 15 yr in French Alpine and Saanen goats. Around 4% of the females had been assigned the SNT phenotype and consequently could not be included in the breeding program as elite animals. The heritability of this binary trait, estimated by applying linear logistic polygenic models to 32,908 Alpine and 23,217 Saanen females, was 0.40 and 0.44, respectively. A genome-wide association study was implemented using a daughter design composed of 810 Saanen goats sired by 9 artificial insemination bucks and 1,185 Alpine goats sired by 11 bucks, genotyped with the goatSNP50 chip (Illumina Inc., San Diego, CA). This association study was based on logistic polygenic models, one with separately taken single nucleotide polymorphisms and the other with haplotypes as fixed effects. The 2 breeds were analyzed together and separately. No region was found to be significant at the genome level, but 17 regions on 10 chromosomes were significant at the chromosome level. These signals were always only slightly above the chromosome significance threshold and only a few of them overlapped across analyses. No evidence of segregation of a major gene in our Saanen and Alpine populations was observed, suggesting that SNT presence is inherited in a polygenic fashion. This conclusion regarding SNT determinism agrees with recent association analyses in cattle, and one locus was even found in an orthologous region. The possibility of applying markers-based selection on the SNT trait is therefore unlikely, but, as this trait is heritable and routinely recorded, it could be managed by attributing a dedicated estimated breeding value. Copyright © 2016 American Dairy

  1. Genome-Wide Association Study of Grain Appearance and Milling Quality in a Worldwide Collection of Indica Rice Germplasm

    PubMed Central

    Yuan, Zhihua; Xing, Danying; Xu, Jianlong; Dingkuhn, Michael; Li, Zhikang; Ye, Guoyou

    2015-01-01

    Grain appearance quality and milling quality are the main determinants of market value of rice. Breeding for improved grain quality is a major objective of rice breeding worldwide. Identification of genes/QTL controlling quality traits is the prerequisite for increasing breeding efficiency through marker-assisted selection. Here, we report a genome-wide association study in indica rice to identify QTL associated with 10 appearance and milling quality related traits, including grain length, grain width, grain length to width ratio, grain thickness, thousand grain weight, degree of endosperm chalkiness, percentage of grains with chalkiness, brown rice rate, milled rice rate and head milled rice rate. A diversity panel consisting of 272 indica accessions collected worldwide was evaluated in four locations (Hangzhou, Jingzhou, Sanya and Shenzhen) representing indica rice production environments in China, and genotyped using genotyping-by-sequencing and DArTseq™, a Diversity Arrays Technology method based on next-generation sequencing. A wide range of variation was observed for all traits in all environments. A total of 16 different association analysis models were compared to determine the best model for each trait-environment combination. Association mapping based on 18,824 high quality markers yielded 38 QTL for the 10 traits. Five of the detected QTL corresponded to known genes or fine mapped QTL. Among the 33 novel QTL identified, qDEC1.1 (qGLWR1.1), qBRR2.2 (qGL2.1), qTGW2.1 (qGL2.2), qGW11.1 (qMRR11.1) and qGL7.1 affected multiple traits with relatively large effects and/or were detected in multiple environments. The research provides insight into the genetic architecture of rice grain quality and important information for mining genes/QTL with large effects within indica accessions for rice breeding. PMID:26714258

  2. Genome wide association and genomic prediction for growth traits in juvenile farmed Atlantic salmon using a high density SNP array.

    PubMed

    Tsai, Hsin-Yuan; Hamilton, Alastair; Tinch, Alan E; Guy, Derrick R; Gharbi, Karim; Stear, Michael J; Matika, Oswald; Bishop, Steve C; Houston, Ross D

    2015-11-18

    The genetic architecture of complex traits in farmed animal populations is of interest from a scientific and practical perspective. The use of genetic markers to predict the genetic merit (breeding values) of individuals is commonplace in modern farm animal breeding schemes. Recently, high density SNP arrays have become available for Atlantic salmon, which facilitates genomic prediction and association studies using genome-wide markers and economically important traits. The aims of this study were (i) to use a high density SNP array to investigate the genetic architecture of weight and length in juvenile Atlantic salmon; (ii) to assess the utility of genomic prediction for these traits, including testing different marker densities; (iii) to identify potential candidate genes underpinning variation in early growth. A pedigreed population of farmed Atlantic salmon (n = 622) were measured for weight and length traits at one year of age, and genotyped for 111,908 segregating SNP markers using a high density SNP array. The heritability of both traits was estimated using pedigree and genomic relationship matrices, and was comparable at around 0.5 and 0.6 respectively. The results of the GWA analysis pointed to a polygenic genetic architecture, with no SNPs surpassing the genome-wide significance threshold, and one SNP associated with length at the chromosome-wide level. SNPs surpassing an arbitrary threshold of significance (P < 0.005, ~ top 0.5 % of markers) were aligned to an Atlantic salmon reference transcriptome, identifying 109 SNPs in transcribed regions that were annotated by alignment to human, mouse and zebrafish protein databases. Prediction of breeding values was more accurate when applying genomic (GBLUP) than pedigree (PBLUP) relationship matrices (accuracy ~ 0.7 and 0.58 respectively) and 5,000 SNPs were sufficient for obtaining this accuracy increase over PBLUP in this specific population. The high density SNP array can effectively capture

  3. Reproduction and In-Depth Evaluation of Genome-Wide Association Studies and Genome-Wide Meta-analyses Using Summary Statistics

    PubMed Central

    Niu, Yao-Fang; Ye, Chengyin; He, Ji; Han, Fang; Guo, Long-Biao; Zheng, Hou-Feng; Chen, Guo-Bo

    2017-01-01

    In line with open-source genetics, we report a novel linear regression technique for genome-wide association studies (GWAS), called Open GWAS algoriTHm (OATH). When individual-level data are not available, OATH can not only completely reproduce reported results from an experimental model, but also recover underreported results from other alternative models with a different combination of nuisance parameters using naïve summary statistics (NSS). OATH can also reliably evaluate all reported results in-depth (e.g., p-value variance analysis), as demonstrated for 42 Arabidopsis phenotypes under three magnesium (Mg) conditions. In addition, OATH can be used for consortium-driven genome-wide association meta-analyses (GWAMA), and can greatly improve the flexibility of GWAMA. A prototype of OATH is available in the Genetic Analysis Repository (https://github.com/gc5k/GEAR). PMID:28122950

  4. Genome-wide association for grain morphology in synthetic hexaploid wheats using digital imaging analysis

    PubMed Central

    2014-01-01

    with TKW, grain width and thickness. In silico functional analysis predicted a range of biological functions for 32 DArT loci, and receptor-like kinase, known to affect plant development, appeared to be a common protein family encoded by several loci responsible for grain size and shape. Conclusion: We demonstrated the application and integration of multiple approaches, including high throughput phenotyping using DI, genome wide association studies (GWAS) and in silico functional analysis of candidate loci, to analyze target traits and identify candidate genomic regions underlying these traits. These approaches provided a great opportunity to understand the breeding value of SHWs for improving grain weight and deepened our understanding of the molecular genetics of grain weight in wheat. PMID:24884376

  5. Genome-Wide Association Studies of Multiple Keratinocyte Cancers

    PubMed Central

    Verkouteren, Joris A. C.; Hofman, Albert; Uitterlinden, André G.; Kraft, Peter; Turman, Constance; Han, Jiali; Cho, Eunyoung; Murabito, Joanne M.; Levy, Daniel; Qureshi, Abrar A.; Nijsten, Tamar

    2017-01-01

    There is strong evidence for a role of environmental risk factors in susceptibility to developing multiple keratinocyte cancers (mKCs), but whether genes are also involved in mKC susceptibility has not been thoroughly investigated. We investigated whether single nucleotide polymorphisms (SNPs) are associated with susceptibility to mKCs. A genome-wide association study (GWAS) of 1,666 cases with mKCs and 1,950 cases with single KC (sKCs; controls) from Harvard cohorts (the Nurses' Health Study [NHS], NHS II, and the Health Professionals Follow-Up Study) and the Framingham Heart Study was carried out using over 8 million SNPs (stage-1). We sought to replicate the most significant statistical associations (p-value ≤ 5.5×10⁻⁶) in an independent cohort of 574 mKCs and 872 sKCs from the Rotterdam Study. In the discovery stage, 40 SNPs with suggestive associations (p-value ≤ 5.5×10⁻⁶) were identified, with eight independent SNPs tagging all 40 SNPs. The most significant SNP was located on chromosome 9 (rs7468390; p-value = 3.92×10⁻⁷). In stage-2, none of these SNPs replicated and only two of them were associated with mKCs in the same direction in the combined meta-analysis. We tested the associations for 19 previously reported basal cell carcinoma-related SNPs (candidate gene association analysis), and found that rs1805007 (MC1R locus) was significantly associated with risk of mKCs (p-value = 2.80×10⁻⁴). Although the suggestive SNPs for mKC susceptibility were not replicated, we found that previously identified BCC variants may also be associated with mKC, with the most significant association (rs1805007) located at the MC1R gene. PMID:28081215

  6. Machine learning in genome-wide association studies.

    PubMed

    Szymczak, Silke; Biernacka, Joanna M; Cordell, Heather J; González-Recio, Oscar; König, Inke R; Zhang, Heping; Sun, Yan V

    2009-01-01

    Recently, genome-wide association studies have substantially expanded our knowledge about genetic variants that influence the susceptibility to complex diseases. Although standard statistical tests for each single-nucleotide polymorphism (SNP) separately are able to capture main genetic effects, different approaches are necessary to identify SNPs that influence disease risk jointly or in complex interactions. Experimental and simulated genome-wide SNP data provided by the Genetic Analysis Workshop 16 afforded an opportunity to analyze the applicability and benefit of several machine learning methods. Penalized regression, ensemble methods, and network analyses resulted in several new findings while known and simulated genetic risk variants were also identified. In conclusion, machine learning approaches are promising complements to standard single- and multi-SNP analysis methods for understanding the overall genetic architecture of complex human diseases. However, because they are not optimized for genome-wide SNP data, improved implementations and new variable selection procedures are required. (c) 2009 Wiley-Liss, Inc.

  7. Combining Genome-Wide Information with a Functional Structural Plant Model to Simulate 1-Year-Old Apple Tree Architecture

    PubMed Central

    Migault, Vincent; Pallas, Benoît; Costes, Evelyne

    2017-01-01

    In crops, optimizing target traits in breeding programs can be fostered by selecting appropriate combinations of architectural traits which determine light interception and carbon acquisition. In apple tree, architectural traits were observed to be under genetic control. However, architectural traits also result from many organogenetic and morphological processes interacting with the environment. The present study aimed at combining a FSPM built for apple tree, MAppleT, with genetic determinisms of architectural traits, previously described in a bi-parental population. We focused on parameters related to organogenesis (phyllochron and immediate branching) and morphogenesis processes (internode length and leaf area) during the first year of tree growth. Two independent datasets collected in 2004 and 2007 on 116 genotypes, issued from a ‘Starkrimson’ × ‘Granny Smith’ cross, were used. The phyllochron was estimated as a function of thermal time and sylleptic branching was modeled subsequently depending on phyllochron. From a genetic map built with SNPs, marker effects were estimated on four MAppleT parameters with rrBLUP, using 2007 data. These effects were then considered in MAppleT to simulate tree development in the two climatic conditions. The genome wide prediction model gave consistent estimations of parameter values with correlation coefficients between observed values and estimated values from SNP markers ranging from 0.79 to 0.96. However, the accuracy of the prediction model following cross validation schemas was lower. Three integrative traits (the number of leaves, trunk length, and number of sylleptic laterals) were considered for validating MAppleT simulations. In 2007 climatic conditions, simulated values were close to observations, highlighting the correct simulation of genetic variability. However, in 2004 conditions which were not used for model calibration, the simulations differed from observations. This study demonstrates the possibility

  8. A novel statistic for genome-wide interaction analysis.

    PubMed

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interaction analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001 [...] genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search for significant interactions between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  9. Genome-wide mining, characterization, and development of microsatellite markers in Marsupenaeus japonicus by genome survey sequencing

    NASA Astrophysics Data System (ADS)

    Lu, Xia; Luan, Sheng; Kong, Jie; Hu, Longyang; Mao, Yong; Zhong, Shengping

    2017-01-01

    The kuruma prawn, Marsupenaeus japonicus, is one of the most cultivated and consumed species of shrimp. However, very few molecular genetic/genomic resources are publicly available for it. Thus, the characterization and distribution of simple sequence repeats (SSRs) remain ambiguous, and the use of SSR markers in genomic studies and marker-assisted selection is limited. The goal of this study is to characterize and develop genome-wide SSR markers in M. japonicus by genome survey sequencing for application in comparative genomics and breeding. A total of 326 945 perfect SSRs were identified, among which dinucleotide repeats were the most frequent class (44.08%), followed by mononucleotides (29.67%), trinucleotides (18.96%), tetranucleotides (5.66%), hexanucleotides (1.07%), and pentanucleotides (0.56%). In total, 151 541 SSR loci primers were successfully designed. A subset of 30 SSR primer pairs were synthesized and tested in 42 individuals from a wild population, of which 27 loci (90.0%) were successfully amplified with specific products and 24 (80.0%) were polymorphic. For the amplified polymorphic loci, the number of alleles ranged from 5 to 17 (with an average of 9.63), and the average PIC value was 0.796. A total of 58 256 SSR-containing sequences had significant Gene Ontology annotation; these are good functional molecular marker candidates for association studies and comparative genomic analysis. The newly identified SSRs significantly contribute to the M. japonicus genomic resources and will facilitate a number of genetic and genomic studies, including high density linkage mapping, genome-wide association analysis, marker-aided selection, comparative genomics analysis, population genetics, and evolution.
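
    The PIC (polymorphism information content) value reported in this abstract can be computed from allele frequencies at a locus. The sketch below assumes the commonly used definition PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2; the abstract does not state which formula or software the authors used, and the function name and example frequencies are hypothetical.

```python
from itertools import combinations


def pic(allele_frequencies):
    """Polymorphism information content for one locus.

    allele_frequencies: allele frequencies summing to 1 (assumed input).
    """
    if abs(sum(allele_frequencies) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    squares = [p * p for p in allele_frequencies]
    homozygosity = sum(squares)
    # Correction for matings between identical heterozygotes.
    cross_term = sum(2.0 * a * b for a, b in combinations(squares, 2))
    return 1.0 - homozygosity - cross_term


if __name__ == "__main__":
    # Made-up frequencies: a biallelic locus and a five-allele SSR locus.
    print(pic([0.5, 0.5]))
    print(pic([0.2, 0.2, 0.2, 0.2, 0.2]))
```

    Under this definition a locus with many alleles at similar frequencies scores close to 1, which is consistent with highly polymorphic SSR loci having high average PIC values.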

  10. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  11. Heritability and genome-wide linkage scan of subjective happiness.

    PubMed

    Bartels, Meike; Saviouk, Viatcheslav; de Moor, Marleen H M; Willemsen, Gonneke; van Beijsterveldt, Toos C E M; Hottenga, Jouke-Jan; de Geus, Eco J C; Boomsma, Dorret I

    2010-04-01

    Causes of individual differences in happiness, as assessed with the Subjective Happiness Scale, are investigated in a large sample of twins and siblings from the Netherlands Twin Register. Over 12,000 twins and siblings, average age 24.7 years (range 12 to 88), took part in the study. A genetic model with an age by sex design was fitted to the data with structural equation modeling in Mx. The heritability of happiness was estimated at 22% for males and 41% for females. No effect of age was observed. To identify the genomic regions contributing to this heritability, a genome-wide linkage study for happiness was conducted in sibling pairs. A subsample of 1157 offspring from 441 families was genotyped with an average of 371 micro-satellite markers per individual. Phenotype and genotype data were analyzed in MERLIN with multipoint variance component linkage analysis and age and sex as covariates. A linkage signal (logarithm of odds score 2.73, empirical p value 0.095) was obtained at the end of the long arm of chromosome 19 for marker D19S254 at 110 cM. A second suggestive linkage peak was found at the short arm of chromosome 1 (LOD of 2.37) at 153 cM, marker D1S534 (empirical p value of 0.209). These two regions of interest do not overlap with the regions found for contrasting phenotypes (such as depression, which is negatively associated with happiness). Further linkage and future association studies are warranted.

  12. A genome-wide DNA methylation study in azoospermia.

    PubMed

    Ferfouri, F; Boitrelle, F; Ghout, I; Albert, M; Molina Gomes, D; Wainer, R; Bailly, M; Selva, J; Vialard, F

    2013-11-01

    The objective of this study was to assess genome-wide DNA methylation in testicular tissue from azoospermic patients. A total of 94 azoospermic patients were recruited and classified into three groups: 29 patients presented obstructive azoospermia (OA), 26 displayed non-obstructive azoospermia (NOA) and successful retrieval of spermatozoa by testicular sperm extraction (TESE+) and 39 displayed NOA and failure to retrieve spermatozoa by TESE (TESE-). An Illumina Infinium Human Methylation27 BeadChip DNA methylation array was used to establish a testicular DNA methylation pattern for each type of azoospermic patient. The OA and NOA groups were compared in terms of the relative M-value (the log2 ratio between methylated and non-methylated probe intensities) for each CpG site. We observed significantly different DNA methylation profiles for the NOA and OA groups, with differences at over 9000 of the 27 578 CpG sites; 212 CpG sites had a relative M-value >3. The results highlighted 14 testis-specific genes. Patient clustering with respect to these 212 CpG sites corresponded closely to the clinical classification. The DNA methylation patterns showed that in the NOA group, 78 of the 212 CpG sites were hypomethylated and 134 were hypermethylated (relative to the OA group). On the basis of these DNA methylation profiles, azoospermic patients could be classified as OA or NOA by considering the 212 CpG sites with the greatest methylation differences. Furthermore, we identified genes that may provide insight into the mechanism of idiopathic NOA.
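
    The M-value defined in this abstract (the log2 ratio between methylated and non-methylated probe intensities) can be sketched as follows. The small offset added to both intensities is an assumption made here to avoid taking the logarithm of zero; the abstract does not specify one, nor how the relative M-value between the OA and NOA groups was derived. The function name is hypothetical.

```python
import math


def m_value(methylated, unmethylated, offset=1.0):
    """log2 ratio of methylated to non-methylated probe intensity.

    offset is an assumed stabilising constant, not taken from the study.
    """
    return math.log2((methylated + offset) / (unmethylated + offset))


if __name__ == "__main__":
    # Made-up intensities for three CpG sites: (methylated, unmethylated).
    probes = [(7000.0, 1000.0), (1500.0, 1500.0), (200.0, 6400.0)]
    for meth, unmeth in probes:
        print(round(m_value(meth, unmeth), 3))
```

    Positive values indicate that the methylated signal dominates at a site, negative values that the non-methylated signal dominates, and values near zero indicate roughly equal intensities.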

  13. Genome-Wide Association Analysis of the Anthocyanin and Carotenoid Contents of Rose Petals

    PubMed Central

    Schulz, Dietmar F.; Schott, Rena T.; Voorrips, Roeland E.; Smulders, Marinus J. M.; Linde, Marcus; Debener, Thomas

    2016-01-01

    Petal color is one of the key characteristics determining the attractiveness and therefore the commercial value of an ornamental crop. Here, we present the first genome-wide association study for the important ornamental crop rose, focusing on the anthocyanin and carotenoid contents in petals of 96 diverse tetraploid garden rose genotypes. Cultivated roses display a vast phenotypic and genetic diversity and are therefore ideal targets for association genetics. For marker analysis, we used a recently designed Axiom SNP chip comprising 68,000 SNPs, with an additional 281 SSRs, 400 AFLPs and 246 markers from candidate genes. An analysis of the structure of the rose population revealed three subpopulations with most of the genetic variation between individual genotypes rather than between clusters and with a high average proportion of heterozygous loci. The mapping of markers significantly associated with anthocyanin and carotenoid content to the related Fragaria and Prunus genomes revealed clusters of associated markers indicating five genomic regions associated with the total anthocyanin content and two large clusters associated with the carotenoid content. Among the marker clusters associated with the phenotypes, we found several candidate genes with known functions in either the anthocyanin or the carotenoid biosynthesis pathways. Among others, we identified a glutathione-S-transferase, 4CL, an auxin response factor and F3'H as candidate genes affecting anthocyanin concentration, and CCD4 and zeaxanthin epoxidase as candidates affecting the concentration of carotenoids. These markers are starting points for future validation experiments in independent populations as well as for functional genomic studies to identify the causal factors for the observed color phenotypes. Furthermore, validated markers may be interesting tools for marker-assisted selection in commercial breeding programmes in that they provide the tools to identify superior parental combinations that

  14. Multi-generational genome wide association studies identify chromosomal regions associated with ascites phenotype.

    PubMed

    Tarrant, K J; Dey, S; Kinney, R; Anthony, N B; Rhoads, D D

    2017-02-21

    Ascites is a multi-faceted disease commonly observed in fast growing broilers, which is initiated when the body is insufficiently oxygenated. A series of events follow, including an increase in pulmonary artery pressure, right ventricle hypertrophy, and accumulation of fluid in the abdominal cavity and pericardium. Advances in management practices along with improved selection programs have decreased ascites incidence in modern broilers. However, ascites syndrome remains an economically important disease throughout the world, causing estimated losses of $100 million per year. In this study, a 60 K Illumina SNP BeadChip was used to perform a series of genome wide association studies (GWAS) on the 16th and 18th generation of our relaxed (REL) line descended from a commercial elite broiler line beginning in 1995. Regions significantly associated with ascites incidence were identified on chromosome 2 around 70 megabase pairs (Mbp) and on chromosome Z around 60 Mbp. Five candidate single nucleotide polymorphisms (SNP) were evaluated as indicators for these 2 regions in order to identify association with ascites and right ventricle to total ventricle weight (RVTV) ratios. Chromosome 2 SNP showed an association with RVTV ratios in males phenotyped as ascites resistant and ascites susceptible (P = 0.02 and P = 0.03, respectively). The chromosome Z region also indicates an association with resistant female RVTV values (P = 0.02). Regions of significance identified on chromosomes 2 and Z described in this study will be used as proposed candidate regions for further investigation into the genetics of ascites. This information will lead to a better understanding of the underlying genetics and gene networks contributing to ascites, and thus advances in ascites reduction through commercial breeding schemes.

  15. Genome-wide association analysis of forage quality in maize mature stalk.

    PubMed

    Wang, Hongwu; Li, Kun; Hu, Xiaojiao; Liu, Zhifang; Wu, Yujin; Huang, Changling

    2016-10-21

    Plant digestibility of silage maize (Zea mays L.) has a large influence on nutrition intake for animal feeding. Improving forage quality will enhance the utilization efficiency and feeding value of forage maize. Dissecting the genetic basis of forage quality will improve our understanding of the complex nature of cell wall biosynthesis and degradation, which is also helpful for breeding good quality silage maize. Acid detergent fiber (ADF), neutral detergent fiber (NDF) and in vitro dry matter digestibility (IVDMD) of stalk were evaluated in a diverse maize population, which is comprised of 368 inbred lines and planted across seven environments. Using a mixed model accounting for population structure and polygenic background effects, a genome-wide association study was conducted to identify single nucleotide polymorphisms (SNPs) significantly associated with forage quality. Scanning 559,285 SNPs across the whole genome, 73, 41 and 82 SNPs were found to be associated with ADF, NDF, and IVDMD, respectively. Each significant SNP explained 4.2 %-6.2 % of the phenotypic variation. Underlying these associated loci, 56 genes were proposed as candidate genes for forage quality. Of all the candidate genes proposed by GWAS, we only found a C3H gene (ZmC3H2) that is directly involved in cell wall component biosynthesis. The candidate genes found in this study are mainly involved in signal transduction, stress resistance, and transcriptional regulation of cell wall biosynthetic gene expression. Adding high digestibility maize into the association panel would be helpful for increasing genetic variability and identifying more genes associated with forage quality traits. Cloning and functional validation of these genes would be helpful for understanding the molecular mechanism of the fiber content and digestibility. These findings provide us new insights into cell wall formation and deposition.

  16. Hot topic: Definition and implementation of a breeding value for feed efficiency in dairy cows.

    PubMed

    Pryce, J E; Gonzalez-Recio, O; Nieuwhof, G; Wales, W J; Coffey, M P; Hayes, B J; Goddard, M E

    2015-10-01

    A new breeding value that combines the amount of feed saved through improved metabolic efficiency with predicted maintenance requirements is described. The breeding value includes a genomic component for residual feed intake (RFI) combined with maintenance requirements calculated from either a genomic or pedigree estimated breeding value (EBV) for body weight (BW) predicted using conformation traits. Residual feed intake is only available for genotyped Holsteins; however, BW is available for all breeds. The RFI component of the "feed saved" EBV has 2 parts: Australian calf RFI and Australian lactating cow RFI. Genomic breeding values for RFI were estimated from a reference population of 2,036 individuals in a multi-trait analysis including Australian calf RFI (n=843), Australian lactating cow RFI (n=234), and UK and Dutch lactating cow RFI (n=958). In all cases, the RFI phenotypes were deviations from a mean of 0, calculated by correcting dry matter intake for BW, growth, and milk yield (in the case of lactating cows). Single nucleotide polymorphism effects were calculated from the output of genomic BLUP and used to predict breeding values of 4,106 Holstein sires that were genotyped but did not have RFI phenotypes themselves. These bulls already had BW breeding values calculated from type traits, from which maintenance requirements in kilograms of feed per year were inferred. Finally, RFI and the feed required for maintenance (through BW) were used to calculate a feed saved breeding value and expressed as the predicted amount of feed saved per year. Animals that were 1 standard deviation above the mean were predicted to eat 66 kg dry matter less per year at the same level of milk production. In a data set of genotyped Holstein sires, the mean reliability of the feed saved breeding value was 0.37. For Holsteins that are not genotyped and for breeds other than Holsteins, feed saved is calculated using BW only. From April 2015, feed saved has been included as part of

  17. Genome-wide analysis in endangered populations: a case study in Barbaresca sheep.

    PubMed

    Mastrangelo, S; Portolano, B; Di Gerlando, R; Ciampolini, R; Tolone, M; Sardina, M T

    2017-01-12

    Analysis of genomic data is becoming increasingly common in the livestock industry and the findings have been an invaluable resource for effective management of breeding programs in small and endangered populations. In this paper, with the goal of highlighting the potential of genomic analysis for small and endangered populations, genome-wide levels of linkage disequilibrium, measured as the squared correlation coefficient of allele frequencies at a pair of loci, effective population size, runs of homozygosity (ROH) and genetic diversity parameters, were estimated in Barbaresca sheep using Illumina OvineSNP50K array data. Moreover, the breed's genetic structure and its relationship with other breeds were investigated. Levels of pairwise linkage disequilibrium decreased with increasing distance between single nucleotide polymorphisms. An average correlation coefficient <0.25 was found for markers located up to 50 kb apart. Therefore, these results support the need to use denser single nucleotide polymorphism panels for high power association mapping and genomic selection efficiency in future breeding programs. The estimate of past effective population size ranged from 747 animals 250 generations ago to 28 animals five generations ago, whereas the contemporary effective population size was 25 animals. A total of 637 ROH were identified, most of which were short (67%) and ranged from 1 to 10 Mb. The genetic analyses revealed that the Barbaresca breed tended to display lower variability than other Sicilian breeds. Recent inbreeding was evident, according to the ROH analysis. All the investigated parameters showed a comparatively narrow genetic base and indicated an endangered status for Barbaresca. Multidimensional scaling, model-based clustering, measurement of population differentiation, neighbor networks and haplotype sharing distinguished Barbaresca from other breeds, showed a low level of admixture with the other breeds considered in this study, and indicated
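
    The linkage disequilibrium measure named above, the squared correlation coefficient at a pair of loci, can be sketched as follows. This is an illustration only: it assumes unphased genotypes coded as allele counts (0, 1 or 2), for which the squared Pearson correlation of the two genotype vectors is a commonly used approximation. The function name and the example genotypes are hypothetical and are not taken from the study.

```python
def ld_r2(genotypes_a, genotypes_b):
    """Squared Pearson correlation between allele counts at two loci.

    ASSUMPTION: genotypes are coded 0/1/2 (copies of one allele) and both
    lists describe the same animals in the same order.
    """
    if len(genotypes_a) != len(genotypes_b):
        raise ValueError("both loci must be genotyped on the same individuals")
    n = len(genotypes_a)
    mean_a = sum(genotypes_a) / n
    mean_b = sum(genotypes_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(genotypes_a, genotypes_b))
    var_a = sum((a - mean_a) ** 2 for a in genotypes_a)
    var_b = sum((b - mean_b) ** 2 for b in genotypes_b)
    if var_a == 0 or var_b == 0:
        # r2 is undefined when a locus is monomorphic in the sample.
        raise ValueError("r2 is undefined for a monomorphic locus")
    return (cov * cov) / (var_a * var_b)


if __name__ == "__main__":
    # Hypothetical genotypes for five animals at two nearby SNPs.
    snp1 = [0, 1, 2, 1, 0]
    snp2 = [0, 1, 2, 2, 0]
    print(f"r2 = {ld_r2(snp1, snp2):.3f}")
```

    Averaging such pairwise values within bins of physical distance gives the kind of decay curve the abstract summarizes.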

  18. Genome-wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease

    PubMed Central

    Edwards, Todd L.; Scott, William K.; Almonte, Cherylyn; Burt, Amber; Powell, Eric H.; Beecham, Gary W.; Wang, Liyong; Züchner, Stephan; Konidari, Ioanna; Wang, Gaofeng; Singer, Carlos; Nahab, Fatta; Scott, Burton; Stajich, Jeffrey M.; Pericak-Vance, Margaret; Haines, Jonathan; Vance, Jeffery M.; Martin, Eden R.

    2010-01-01

    SUMMARY Parkinson disease (PD) is a chronic neurodegenerative disorder with a cumulative prevalence of greater than one per thousand. To date three independent genome-wide association studies (GWAS) have investigated the genetic susceptibility to PD. These studies have also implicated several genes as PD risk loci with strong, but not genome-wide significant, associations. In this study, we combined data from two previously published GWAS of Caucasian subjects with our GWAS of 604 cases and 619 controls for a joint analysis with a combined sample size of 1752 cases and 1745 controls. SNPs in SNCA (rs2736990, p-value = 6.7 × 10^−8; genome-wide adjusted p = 0.0109, odds ratio (OR) = 1.29 [95% CI: 1.17–1.42] G vs. A allele, population attributable risk percent (PAR%) = 12%) and the MAPT region (rs11012, p-value = 5.6 × 10^−8; genome-wide adjusted p = 0.0079, OR = 0.70 [95% CI: 0.62–0.79] T vs. C allele, PAR% = 8%) were genome-wide significant. No other SNPs were genome-wide significant in this analysis. This study confirms that SNCA and the MAPT region are major genes whose common variants are influencing risk of PD. PMID:20070850

  19. Genome-wide association study confirms SNPs in SNCA and the MAPT region as common risk factors for Parkinson disease.

    PubMed

    Edwards, Todd L; Scott, William K; Almonte, Cherylyn; Burt, Amber; Powell, Eric H; Beecham, Gary W; Wang, Liyong; Züchner, Stephan; Konidari, Ioanna; Wang, Gaofeng; Singer, Carlos; Nahab, Fatta; Scott, Burton; Stajich, Jeffrey M; Pericak-Vance, Margaret; Haines, Jonathan; Vance, Jeffery M; Martin, Eden R

    2010-03-01

    Parkinson disease (PD) is a chronic neurodegenerative disorder with a cumulative prevalence of greater than one per thousand. To date three independent genome-wide association studies (GWAS) have investigated the genetic susceptibility to PD. These studies implicated several genes as PD risk loci with strong, but not genome-wide significant, associations. In this study, we combined data from two previously published GWAS of Caucasian subjects with our GWAS of 604 cases and 619 controls for a joint analysis with a combined sample size of 1752 cases and 1745 controls. SNPs in SNCA (rs2736990, p-value = 6.7 x 10(-8); genome-wide adjusted p = 0.0109, odds ratio (OR) = 1.29 [95% CI: 1.17-1.42] G vs. A allele, population attributable risk percent (PAR%) = 12%) and the MAPT region (rs11012, p-value = 5.6 x 10(-8); genome-wide adjusted p = 0.0079, OR = 0.70 [95% CI: 0.62-0.79] T vs. C allele, PAR%= 8%) were genome-wide significant. No other SNPs were genome-wide significant in this analysis. This study confirms that SNCA and the MAPT region are major genes whose common variants are influencing risk of PD.

  20. Genetic parameters and breeding values for semen characteristics in Hanoverian stallions.

    PubMed

    Labitzke, D; Sieme, H; Martinsson, G; Distl, O

    2014-08-01

    The objective of this study was to show whether semen traits of 30 Hanoverian stallions regularly used in AI may be useful for breeding purposes. Semen characteristics were studied using 15 149 ejaculates from 30 Hanoverian stallions of the State Stud Celle of Lower Saxony. Semen samples were collected between 2005 and 2009. Traits analysed were gel-free volume, sperm concentration, total and motile sperm number and progressive motility. A linear multivariate animal model was employed to estimate heritabilities and permanent environmental variances for stallions. The same model was used to predict breeding values for all traits simultaneously. Heritabilities were high for gel-free volume (h(2) = 0.43) and moderate for total number of sperm (h(2) = 0.29) and progressive motility (h(2) = 0.20). Gel-free volume, sperm concentration and total number of sperm were genetically negatively correlated with progressive motility. The effect of the permanent environment for stallions accounted for 9-55% of the trait variance. The total variance among stallions explained 37-69% of the trait variance. The average reliabilities of the breeding values were 0.43-0.76 for the 30 Hanoverian stallions. In conclusion, the study demonstrated large effects of stallions, routinely employed in a breeding programme, on the semen characteristics analysed here. We could demonstrate that estimated breeding values (EBV) with sufficiently high reliabilities can be predicted using data from these stallions and that these EBV are useful in horse breeding programmes to achieve genetic improvement in semen quality.

  1. A Genome-wide Association Study of Myasthenia Gravis

    PubMed Central

    Renton, Alan E.; Pliner, Hannah A.; Provenzano, Carlo; Evoli, Amelia; Ricciardi, Roberta; Nalls, Michael A.; Marangi, Giuseppe; Abramzon, Yevgeniya; Arepalli, Sampath; Chong, Sean; Hernandez, Dena G.; Johnson, Janel O.; Bartoccioni, Emanuela; Scuderi, Flavia; Maestri, Michelangelo; Raphael Gibbs, J.; Errichiello, Edoardo; Chiò, Adriano; Restagno, Gabriella; Sabatelli, Mario; Macek, Mark; Scholz, Sonja W.; Corse, Andrea; Chaudhry, Vinay; Benatar, Michael; Barohn, Richard J.; McVey, April; Pasnoor, Mamatha; Dimachkie, Mazen M.; Rowin, Julie; Kissel, John; Freimer, Miriam; Kaminski, Henry J.; Sanders, Donald B.; Lipscomb, Bernadette; Massey, Janice M.; Chopra, Manisha; Howard, James F.; Koopman, Wilma J.; Nicolle, Michael W.; Pascuzzi, Robert M.; Pestronk, Alan; Wulf, Charlie; Florence, Julaine; Blackmore, Derrick; Soloway, Aimee; Siddiqi, Zaeem; Muppidi, Srikanth; Wolfe, Gil; Richman, David; Mezei, Michelle M.; Jiwa, Theresa; Oger, Joel; Drachman, Daniel B.; Traynor, Bryan J.

    2016-01-01

    IMPORTANCE Myasthenia gravis is a chronic, autoimmune, neuromuscular disease characterized by fluctuating weakness of voluntary muscle groups. Although genetic factors are known to play a role in this neuroimmunological condition, the genetic etiology underlying myasthenia gravis is not well understood. OBJECTIVE To identify genetic variants that alter susceptibility to myasthenia gravis, we performed a genome-wide association study. DESIGN, SETTING, AND PARTICIPANTS DNA was obtained from 1032 white individuals from North America diagnosed as having acetylcholine receptor antibody–positive myasthenia gravis and 1998 race/ethnicity-matched control individuals from January 2010 to January 2011. These samples were genotyped on Illumina OmniExpress single-nucleotide polymorphism arrays. An independent cohort of 423 Italian cases and 467 Italian control individuals were used for replication. MAIN OUTCOMES AND MEASURES We calculated P values for association between 8,114,394 genotyped and imputed variants across the genome and risk for developing myasthenia gravis using logistic regression modeling. A threshold P value of 5.0 × 10^−8 was set for genome-wide significance after Bonferroni correction for multiple testing. RESULTS In the overall case-control cohort, we identified association signals at CTLA4 (rs231770; P = 3.98 × 10^−8; odds ratio, 1.37; 95% CI, 1.25–1.49), HLA-DQA1 (rs9271871; P = 1.08 × 10^−8; odds ratio, 2.31; 95% CI, 2.02–2.60), and TNFRSF11A (rs4263037; P = 1.60 × 10^−9; odds ratio, 1.41; 95% CI, 1.29–1.53). These findings replicated for CTLA4 and HLA-DQA1 in an independent cohort of Italian cases and control individuals. Further analysis revealed distinct, but overlapping, disease-associated loci for early- and late-onset forms of myasthenia gravis. In the late-onset cases, we identified 2 association peaks: one was located in TNFRSF11A (rs4263037; P = 1.32 × 10^−12; odds ratio, 1.56; 95% CI, 1.44–1.68) and the other was detected
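
    The significance threshold quoted above can be illustrated with a small sketch of a Bonferroni correction. The abstract does not state the family-wise error rate or the number of tests behind the threshold; the values used here (alpha = 0.05 and one million tests, which together give 5.0e-8) are assumptions for illustration, and the function names are hypothetical. The P values are the three reported for the overall cohort.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def is_significant(p_value, threshold):
    """True when a P value falls below the chosen threshold."""
    return p_value < threshold


if __name__ == "__main__":
    # ASSUMPTION: alpha and the number of tests are illustrative choices,
    # not values stated in the abstract.
    threshold = bonferroni_threshold(alpha=0.05, n_tests=1_000_000)
    reported = {
        "CTLA4 (rs231770)": 3.98e-8,
        "HLA-DQA1 (rs9271871)": 1.08e-8,
        "TNFRSF11A (rs4263037)": 1.60e-9,
    }
    for locus, p in reported.items():
        print(f"{locus}: P = {p:.2e}, significant = {is_significant(p, threshold)}")
```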

  2. A genome-wide association study of myasthenia gravis.

    PubMed

    Renton, Alan E; Pliner, Hannah A; Provenzano, Carlo; Evoli, Amelia; Ricciardi, Roberta; Nalls, Michael A; Marangi, Giuseppe; Abramzon, Yevgeniya; Arepalli, Sampath; Chong, Sean; Hernandez, Dena G; Johnson, Janel O; Bartoccioni, Emanuela; Scuderi, Flavia; Maestri, Michelangelo; Gibbs, J Raphael; Errichiello, Edoardo; Chiò, Adriano; Restagno, Gabriella; Sabatelli, Mario; Macek, Mark; Scholz, Sonja W; Corse, Andrea; Chaudhry, Vinay; Benatar, Michael; Barohn, Richard J; McVey, April; Pasnoor, Mamatha; Dimachkie, Mazen M; Rowin, Julie; Kissel, John; Freimer, Miriam; Kaminski, Henry J; Sanders, Donald B; Lipscomb, Bernadette; Massey, Janice M; Chopra, Manisha; Howard, James F; Koopman, Wilma J; Nicolle, Michael W; Pascuzzi, Robert M; Pestronk, Alan; Wulf, Charlie; Florence, Julaine; Blackmore, Derrick; Soloway, Aimee; Siddiqi, Zaeem; Muppidi, Srikanth; Wolfe, Gil; Richman, David; Mezei, Michelle M; Jiwa, Theresa; Oger, Joel; Drachman, Daniel B; Traynor, Bryan J

    2015-04-01

    Myasthenia gravis is a chronic, autoimmune, neuromuscular disease characterized by fluctuating weakness of voluntary muscle groups. Although genetic factors are known to play a role in this neuroimmunological condition, the genetic etiology underlying myasthenia gravis is not well understood. To identify genetic variants that alter susceptibility to myasthenia gravis, we performed a genome-wide association study. DNA was obtained from 1032 white individuals from North America diagnosed as having acetylcholine receptor antibody-positive myasthenia gravis and 1998 race/ethnicity-matched control individuals from January 2010 to January 2011. These samples were genotyped on Illumina OmniExpress single-nucleotide polymorphism arrays. An independent cohort of 423 Italian cases and 467 Italian control individuals were used for replication. We calculated P values for association between 8,114,394 genotyped and imputed variants across the genome and risk for developing myasthenia gravis using logistic regression modeling. A threshold P value of 5.0×10(-8) was set for genome-wide significance after Bonferroni correction for multiple testing. In the overall case-control cohort, we identified association signals at CTLA4 (rs231770; P=3.98×10(-8); odds ratio, 1.37; 95% CI, 1.25-1.49), HLA-DQA1 (rs9271871; P=1.08×10(-8); odds ratio, 2.31; 95% CI, 2.02-2.60), and TNFRSF11A (rs4263037; P=1.60×10(-9); odds ratio, 1.41; 95% CI, 1.29-1.53). These findings replicated for CTLA4 and HLA-DQA1 in an independent cohort of Italian cases and control individuals. Further analysis revealed distinct, but overlapping, disease-associated loci for early- and late-onset forms of myasthenia gravis. In the late-onset cases, we identified 2 association peaks: one was located in TNFRSF11A (rs4263037; P=1.32×10(-12); odds ratio, 1.56; 95% CI, 1.44-1.68) and the other was detected in the major histocompatibility complex on chromosome 6p21 (HLA-DQA1; rs9271871; P=7.02×10(-18); odds ratio, 4.27; 95

  3. The First Pilot Genome-Wide Gene-Environment Study of Depression in the Japanese Population

    PubMed Central

    Otowa, Takeshi; Kawamura, Yoshiya; Tsutsumi, Akizumi; Kawakami, Norito; Kan, Chiemi; Shimada, Takafumi; Umekage, Tadashi; Kasai, Kiyoto; Tokunaga, Katsushi; Sasaki, Tsukasa

    2016-01-01

    Stressful events have been identified as a risk factor for depression. Although gene–environment (G × E) interaction in a limited number of candidate genes has been explored, no genome-wide search has been reported. The aim of the present study is to identify genes that influence the association of stressful events with depression. Therefore, we performed a genome-wide G × E interaction analysis in the Japanese population. A genome-wide screen with 320 subjects was performed using the Affymetrix Genome-Wide Human Array 6.0. Stressful life events were assessed using the Social Readjustment Rating Scale (SRRS) and depression symptoms were assessed with self-rating questionnaires using the Center for Epidemiologic Studies Depression (CES-D) scale. The p values for interactions between single nucleotide polymorphisms (SNPs) and stressful events were calculated using the linear regression model adjusted for sex and age. After quality control of genotype data, a total of 534,848 SNPs on autosomal chromosomes were further analyzed. Although none surpassed the level of genome-wide significance, a marginally significant interaction between SRRS and rs10510057 in association with depression was found (p = 4.5 × 10^−8). The SNP is located on 10q26 near Regulators of G-protein signaling 10 (RGS10), which encodes a regulatory molecule involved in stress response. When we investigated a similar G × E interaction between depression (K6 scale) and work-related stress in an independent sample (n = 439), a significant G × E effect on depression was observed (p = 0.015). Our findings suggest that rs10510057, interacting with stressors, may be involved in depression risk. Incorporating G × E interaction into GWAS can help to find susceptibility loci that are potentially missed by conventional GWAS. PMID:27529621

  4. Genome-Wide Association Study of Schizophrenia in Japanese Population

    PubMed Central

    Yamada, Kazuo; Iwayama, Yoshimi; Hattori, Eiji; Iwamoto, Kazuya; Toyota, Tomoko; Ohnishi, Tetsuo; Ohba, Hisako; Maekawa, Motoko; Kato, Tadafumi; Yoshikawa, Takeo

    2011-01-01

    Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations to search for schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that were located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in the Chinese population (p = 0.026). The current data in Asian populations would be helpful for deciphering the ethnic diversity of schizophrenia etiology. PMID:21674006

  5. Genome-wide SNP typing reveals signatures of population history.

    PubMed

    Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith

    2008-07-01

    Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.

  6. Accuracy of predicting genomic breeding values for residual feed intake in Angus and Charolais beef cattle.

    PubMed

    Chen, L; Schenkel, F; Vinsky, M; Crews, D H; Li, C

    2013-10-01

    In beef cattle, phenotypic data that are difficult and/or costly to measure, such as feed efficiency, and DNA marker genotypes are usually available on a small number of animals of different breeds or populations. To achieve a maximal accuracy of genomic prediction using the phenotype and genotype data, strategies for forming a training population to predict genomic breeding values (GEBV) of the selection candidates need to be evaluated. In this study, we examined the accuracy of predicting GEBV for residual feed intake (RFI) based on 522 Angus and 395 Charolais steers genotyped on SNP with the Illumina Bovine SNP50 Beadchip for 3 training population forming strategies: within breed, across breed, and by pooling data from the 2 breeds (i.e., combined). Two other scenarios with the training and validation data split by birth year and by sire family within a breed were also investigated to assess the impact of genetic relationships on the accuracy of genomic prediction. Three statistical methods including the best linear unbiased prediction with the relationship matrix defined based on the pedigree (PBLUP), based on the SNP genotypes (GBLUP), and a Bayesian method (BayesB) were used to predict the GEBV. The results showed that the accuracy of the GEBV prediction was the highest when the prediction was within breed and when the validation population had greater genetic relationships with the training population, with a maximum of 0.58 for Angus and 0.64 for Charolais. The within-breed prediction accuracies dropped to 0.29 and 0.38, respectively, when the validation populations had a minimal pedigree link with the training population. When the training population of a different breed was used to predict the GEBV of the validation population, that is, across-breed genomic prediction, the accuracies were further reduced to 0.10 to 0.22, depending on the prediction method used. Pooling data from the 2 breeds to form the training population resulted in accuracies increased

  7. Genome-wide association studies with proteomics data reveal genes important for synthesis, transport and packaging of globulins in legume seeds.

    PubMed

    Le Signor, Christine; Aimé, Delphine; Bordat, Amandine; Belghazi, Maya; Labas, Valérie; Gouzy, Jérôme; Young, Nevin D; Prosperi, Jean-Marie; Leprince, Olivier; Thompson, Richard D; Buitink, Julia; Burstin, Judith; Gallardo, Karine

    2017-06-01

    Improving nutritional seed quality is an important challenge in grain legume breeding. However, the genes controlling the differential accumulation of globulins, which are major contributors to seed nutritional value in legumes, remain largely unknown. We combined a search for protein quantity loci with genome-wide association studies on the abundance of 7S and 11S globulins in seeds of the model legume species Medicago truncatula. Identified genomic regions and genes carrying polymorphisms linked to globulin variations were then cross-compared with pea (Pisum sativum), leading to the identification of candidate genes for the regulation of globulin abundance in this crop. Key candidates identified include genes involved in transcription, chromatin remodeling, post-translational modifications, transport and targeting of proteins to storage vacuoles. Inference of a gene coexpression network of 12 candidate transcription factors and globulin genes revealed the transcription factor ABA-insensitive 5 (ABI5) as a highly connected hub. Characterization of loss-of-function abi5 mutants in pea uncovered a role for ABI5 in controlling the relative abundance of vicilin, a sulfur-poor 7S globulin, in pea seeds. This demonstrates the feasibility of using genome-wide association studies in M. truncatula to reveal genes that can be modulated to improve seed nutritional value. © 2017 INRA. New Phytologist © 2017 New Phytologist Trust.

  8. Genome-wide association mapping of soybean aphid resistance traits

    USDA-ARS's Scientific Manuscript database

    Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...

  9. A super powerful method for genome wide association study

    USDA-ARS's Scientific Manuscript database

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...

  10. Genome-wide characterization of maize miRNA genes

    USDA-ARS's Scientific Manuscript database

    MicroRNAs (miRNAs) are small non-coding RNAs that play essential roles in plant growth and development. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling ident...

  11. Genome-wide association studies in maize: praise and stargaze

    USDA-ARS's Scientific Manuscript database

    Genome-wide association study (GWAS) has appeared as a widespread strategy in decoding genotype-phenotype associations in many species thanks to technical advances in next-generation sequencing (NGS) applications. Maize is an ideal crop for GWAS and significant progress has been made in the last dec...

  12. Genome-wide association study identifies five new schizophrenia loci

    PubMed Central

    2012-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10⁻¹¹) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10⁻⁹), ANK3 (rs10994359, P = 2.5 × 10⁻⁸) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10⁻⁹). PMID:21926974

  13. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    PubMed Central

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus × domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP per 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs were established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  14. Improving the Accuracy of Whole Genome Prediction for Complex Traits Using the Results of Genome Wide Association Studies

    PubMed Central

    Zhang, Zhe; Ober, Ulrike; Erbe, Malena; Zhang, Hao; Gao, Ning; He, Jinlong; Li, Jiaqi; Simianer, Henner

    2014-01-01

    Utilizing the whole genomic variation of complex traits to predict the yet-to-be observed phenotypes or unobserved genetic values via whole genome prediction (WGP) and to infer the underlying genetic architecture via genome wide association study (GWAS) is an interesting and fast developing area in the context of human disease studies as well as in animal and plant breeding. Though thousands of significant loci for several species were detected via GWAS in the past decade, they were not used directly to improve WGP due to lack of proper models. Here, we propose a generalized way of building trait-specific genomic relationship matrices which can exploit GWAS results in WGP via a best linear unbiased prediction (BLUP) model for which we suggest the name BLUP|GA. Results from two illustrative examples show that using already existing GWAS results from public databases in BLUP|GA improved the accuracy of WGP for two out of the three model traits in a dairy cattle data set, and for nine out of the 11 traits in a rice diversity data set, compared to the reference methods GBLUP and BayesB. While BLUP|GA outperforms BayesB, its required computing time is comparable to GBLUP. Further simulation results suggest that accounting for publicly available GWAS results is potentially more useful for WGP utilizing smaller data sets and/or traits of low heritability, depending on the genetic architecture of the trait under consideration. To our knowledge, this is the first study incorporating public GWAS results formally into the standard GBLUP model and we think that the BLUP|GA approach deserves further investigations in animal breeding, plant breeding as well as human genetics. PMID:24663104

  15. A Genome-Wide Association Study Identifies the Genomic Region Associated with Shell Color in Yesso Scallop, Patinopecten yessoensis.

    PubMed

    Zhao, Liang; Li, Yangping; Li, Yajuan; Yu, Jiachen; Liao, Huan; Wang, Shuyue; Lv, Jia; Liang, Jun; Huang, Xiaoting; Bao, Zhenmin

    2017-06-01

    Shell color polymorphism is widespread in economically important shellfish; it not only improves visual appeal but also has great value as an economic trait for breeding. Small numbers of reddish-orange shell Yesso scallops, Patinopecten yessoensis, were found in cultured populations compared to the brown majority. In this study, a genome-wide association study was conducted to understand the genetic basis of shell color. Sixty-six 2b-RAD libraries with equal numbers of reddish-orange and brown shell individuals were constructed and sequenced using the Illumina HiSeq 2000 platform. A total of 322,332,684 high-quality reads were obtained, and the average sequencing depth was 18.4×. One genomic region on chromosome 11 that included 239 single-nucleotide polymorphisms (SNPs) was identified as significantly associated with shell color. After verification by high-resolution melting in another population, two SNPs were selected as specific loci for reddish-orange shell color. These two SNPs could be used to improve the selective breeding progress of true-breeding strains with complete reddish-orange scallops. In addition, within the significantly associated genomic region, candidate genes were identified using marker sequences to search the draft genome of Yesso scallop. Three genes (LDLR, FRIS, and FRIY) with known functions in carotenoid metabolism were identified. Further study using high-performance liquid chromatography showed that the relative level of carotenoids in the reddish-orange shells was 40 times higher than that in the brown shells. These results suggested that the accumulation of carotenoids contributes to the formation of reddish-orange shells.

  16. Genome-wide Association and Functional Studies Identify a Role for IGFBP3 in Hip Osteoarthritis

    PubMed Central

    Evans, Daniel S.; Cailotto, Frederic; Parimi, Neeta; Valdes, Ana M.; Castaño-Betancourt, Martha C.; Liu, Youfang; Kaplan, Robert C.; Bidlingmaier, Martin; Vasan, Ramachandran S.; Teumer, Alexander; Tranah, Gregory J.; Nevitt, Michael C.; Cummings, Steven R.; Orwoll, Eric S.; Barrett-Connor, Elizabeth; Renner, Jordan B.; Jordan, Joanne M.; Doherty, Michael; Doherty, Sally A.; Uitterlinden, Andre G.; van Meurs, Joyce B.J.; Spector, Tim D.; Lories, Rik J.; Lane, Nancy E.

    2015-01-01

    Objectives: To identify genetic associations with hip osteoarthritis (HOA), we performed a meta-analysis of genome-wide association studies (GWAS) of HOA. Methods: The GWAS meta-analysis included approximately 2.5 million imputed HapMap single nucleotide polymorphisms (SNPs). HOA cases and controls defined radiographically and by total hip replacement were selected from the Osteoporotic Fractures in Men (MrOS) Study and the Study of Osteoporotic Fractures (SOF) (654 cases and 4697 controls, combined). Replication of genome-wide significant SNP associations (P-value ≤ 5 × 10⁻⁸) was examined in five studies (3243 cases and 6891 controls, combined). Functional studies were performed using in vitro models of chondrogenesis and osteogenesis. Results: The A allele of rs788748, located 65 kb upstream of the IGFBP3 gene, was associated with lower HOA odds at the genome-wide significance level in the discovery stage (OR = 0.71, P-value = 2 × 10⁻⁸). The association replicated in five studies (OR = 0.92, P-value = 0.020), but the joint analysis of discovery and replication results was not genome-wide significant (P-value = 1 × 10⁻⁶). In separate study populations, the rs788748 A allele was also associated with lower circulating IGFBP3 protein levels (P-value = 4 × 10⁻¹³), suggesting that this SNP or a variant in linkage disequilibrium (LD) could be an IGFBP3 regulatory variant. Results from functional studies were consistent with association results. Chondrocyte hypertrophy, a deleterious event in OA pathogenesis, was largely prevented upon IGFBP3 knockdown in chondrocytes. Furthermore, IGFBP3 overexpression induced cartilage catabolism and osteogenic differentiation. Conclusions: Results from GWAS and functional studies provided suggestive links between IGFBP3 and HOA. PMID:24928840
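
    The threshold logic in the abstract above can be illustrated with a minimal sketch. It takes the three P-values reported for rs788748 and compares each with the stated genome-wide cutoff of 5e-8, which is commonly motivated as a Bonferroni correction of 0.05 for roughly one million independent tests. The function name and the dictionary layout are hypothetical and are not taken from the study.

```python
# Hypothetical helper names; only the P-values and the 5e-8 cutoff come from the abstract.
GENOME_WIDE_THRESHOLD = 5e-8  # cutoff stated in the abstract


def is_genome_wide_significant(p_value, threshold=GENOME_WIDE_THRESHOLD):
    """Return True when a P-value meets the genome-wide cutoff."""
    return p_value <= threshold


# P-values reported for rs788748 at each stage of the study.
stage_p_values = {
    "discovery": 2e-8,
    "replication": 0.020,
    "joint": 1e-6,
}

significance = {
    stage: is_genome_wide_significant(p) for stage, p in stage_p_values.items()
}

# Common rationale for the cutoff (an assumption here, not stated in the
# abstract): Bonferroni correction of alpha = 0.05 over about 1e6 tests.
bonferroni_cutoff = 0.05 / 1_000_000

for stage, passed in significance.items():
    print(f"{stage}: genome-wide significant = {passed}")
```

    Under these assumptions only the discovery stage passes the cutoff, which matches the abstract's statement that the joint analysis was not genome-wide significant.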

  17. A sampling method for estimating the accuracy of predicted breeding values in genetic evaluation

    PubMed Central

    Fouilloux, Marie-Noëlle; Laloë, Denis

    2001-01-01

    A sampling-based method for estimating the accuracy of estimated breeding values using an animal model is presented. Empirical variances of true and estimated breeding values were estimated from a simulated n-sample. The method was validated using a small data set from the Parthenaise breed with the estimated coefficient of determination converging to the true values. It was applied to the French Salers data file used for the 2000 on-farm evaluation (IBOVAL) of muscle development score. A drawback of the method is its computational demand. Consequently, convergence cannot be achieved in a reasonable time for very large data files. Two advantages of the method are that a) it is applicable to any model (animal, sire, multivariate, maternal effects...) and b) it supplies off-diagonal coefficients of the inverse of the mixed model equations and can therefore be the basis of connectedness studies. PMID:11712970

  18. A Comparison of Phenotypic Traits Related to Trypanotolerance in Five West African Cattle Breeds Highlights the Value of Shorthorn Taurine Breeds

    PubMed Central

    Berthier, David; Peylhard, Moana; Dayo, Guiguigbaza-Kossigan; Flori, Laurence; Sylla, Souleymane; Bolly, Seydou; Sakande, Hassane; Chantal, Isabelle; Thevenon, Sophie

    2015-01-01

    Background: Animal African Trypanosomosis particularly affects cattle and dramatically impairs livestock development in sub-Saharan Africa. African Zebu (AFZ) or European taurine breeds usually die of the disease in the absence of treatment, whereas West African taurine breeds (AFT), considered trypanotolerant, are able to control the pathogenic effects of trypanosomosis. Up to now, only one AFT breed, the longhorn N’Dama (NDA), has been largely studied and is considered as the reference trypanotolerant breed. Shorthorn taurine trypanotolerance has never been properly assessed and compared to NDA and AFZ breeds. Methodology/Principal Findings: This study compared the trypanotolerant/susceptible phenotype of five West African local breeds that differ in their demographic history. Thirty-six individuals belonging to the longhorn taurine NDA breed, two shorthorn taurine Lagune (LAG) and Baoulé (BAO) breeds, the Zebu Fulani (ZFU) and the Borgou (BOR), an admixed breed between AFT and AFZ, were infected by Trypanosoma congolense IL1180. All the cattle were genetically characterized using dense SNP markers, and parameters linked to parasitaemia, anaemia and leukocytes were analysed using synthetic variables and mixed models. We showed that LAG, followed by NDA and BAO, displayed the best control of anaemia. ZFU showed the greatest anaemia and the BOR breed had an intermediate value, as expected from its admixed origin. Large differences in leukocyte counts were also observed, with higher leukocytosis for AFT. Nevertheless, no differences in parasitaemia were found, except a tendency to take longer to display detectable parasites in ZFU. Conclusions: We demonstrated that LAG and BAO are as trypanotolerant as NDA. This study highlights the value of shorthorn taurine breeds, which display strong local adaptation to trypanosomosis. Thanks to further analyses based on comparisons of the genome or transcriptome of the breeds, these results open up the way for better knowledge

  19. Accuracy of predicting genomic breeding values for carcass merit traits in Angus and Charolais beef cattle.

    PubMed

    Chen, L; Vinsky, M; Li, C

    2015-02-01

    Accuracy of predicting genomic breeding values for carcass merit traits including hot carcass weight, longissimus muscle area (REA), carcass average backfat thickness (AFAT), lean meat yield (LMY) and carcass marbling score (CMAR) was evaluated based on 543 Angus and 400 Charolais steers genotyped on the Illumina BovineSNP50 Beadchip. For the genomic prediction within Angus, the average accuracy was 0.35 with a range from 0.32 (LMY) to 0.37 (CMAR) across different training/validation data-splitting strategies and statistical methods. The within-breed genomic prediction for Charolais yielded an average accuracy of 0.36 with a range from 0.24 (REA) to 0.46 (AFAT). The across-breed prediction had the lowest accuracy, which was on average near zero. When the data from the two breeds were combined to predict the breeding values of either breed, the prediction accuracy averaged 0.35 for Angus with a range from 0.33 (REA) to 0.39 (CMAR) and averaged 0.33 for Charolais with a range from 0.18 (REA) to 0.46 (AFAT). The prediction accuracy was slightly higher on average when the data were split by animal's birth year than when the data were split by sire family. These results demonstrate that the genetic relationship or relatedness of selection candidates with the training population has a great impact on the accuracy of predicting genomic breeding values under the density of the marker panel used in this study. © 2014 Her Majesty the Queen in Right of Canada. Animal Genetics © 2014 Stichting International Foundation for Animal Genetics.

  20. Genome-wide association analysis reveals new targets for carotenoid biofortification in maize.

    PubMed

    Suwarno, Willy B; Pixley, Kevin V; Palacios-Rojas, Natalia; Kaeppler, Shawn M; Babu, Raman

    2015-05-01

    Genome-wide association analysis in CIMMYT's association panel revealed new favorable native genomic variations in/nearby important genes such as hydroxylases and CCD1 that have potential for carotenoid biofortification in maize. Genome-wide association studies (GWAS) have been used extensively to identify allelic variation for genes controlling important agronomic and nutritional traits in plants. Provitamin A (proVA) enhancing alleles of lycopene epsilon cyclase (LCYE) and β-carotene hydroxylase 1 (CRTRB1), previously identified through candidate-gene based GWAS, are currently used in CIMMYT's maize breeding program. The objective of this study was to identify genes or genomic regions controlling variation for carotenoid concentrations in grain for CIMMYT's carotenoid association mapping panel of 380 inbred maize lines, using high-density genome-wide platforms with ~476,000 SNP markers. Population structure effects were minimized by adjustments using principal components and kinship matrix with mixed models. Genome-wide linkage disequilibrium (LD) analysis indicated faster LD decay (3.9 kb; r² = 0.1) than commonly reported for temperate germplasm, and therefore the possibility of achieving higher mapping resolution with our mostly tropical diversity panel. GWAS for various carotenoids identified CRTRB1, LCYE and other key genes or genomic regions that govern rate-critical steps in the upstream pathway, such as DXS1, GGPS1, and GGPS2 that are known to play important roles in the accumulation of precursor isoprenoids as well as downstream genes HYD5, CCD1, and ZEP1, which are involved in hydroxylation and carotenoid degradation. SNPs at or near all of these regions were identified and may be useful target regions for carotenoid biofortification breeding efforts in maize; for example a genomic region on chromosome 2 explained ~16% of the phenotypic variance for β-carotene independently of CRTRB1, and a variant of CCD1 that resulted in reduced

  1. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 BC, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 BC, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height.

  2. Genome-wide patterns of selection in 230 ancient Eurasians

    PubMed Central

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  3. Genome-wide association studies of obesity and metabolic syndrome.

    PubMed

    Fall, Tove; Ingelsson, Erik

    2014-01-25

    Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was mapped in 2007 to a gene whose function was unknown at the time. The association with this gene, now known as fat mass and obesity associated (FTO), has been replicated repeatedly in several ethnicities, and the gene affects obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS summarizes genome-wide significant findings for obesity and metabolic syndrome and briefly suggests what is to be expected in the next few years.

  4. Genome wide copy number analysis of single cells

    PubMed Central

    Baslan, Timour; Kendall, Jude; Rodgers, Linda; Cox, Hilary; Riggs, Mike; Stepansky, Asya; Troge, Jennifer; Ravi, Kandasamy; Esposito, Diane; Lakshmi, B.; Wigler, Michael; Navin, Nicholas; Hicks, James

    2016-01-01

    Summary: Copy number variation (CNV) is increasingly recognized as an important contributor to phenotypic variation in health and disease. Most methods for determining CNV rely on admixtures of cells, where information regarding genetic heterogeneity is lost. Here, we present a protocol that allows for the genome-wide copy number analysis of single nuclei isolated from mixed populations of cells. Single nucleus sequencing (SNS) combines flow sorting of single nuclei based on DNA content and whole genome amplification (WGA), followed by next generation sequencing to quantize genomic intervals in a genome-wide manner. Multiplexing of single cells is discussed. Additionally, we outline informatic approaches that correct for biases inherent in the WGA procedure and allow for accurate determination of copy number profiles. Altogether, the protocol takes ~3 days from flow cytometry to sequence-ready DNA libraries. PMID:22555242

  5. Genome-wide scans for loci under selection in humans.

    PubMed

    Ronald, James; Akey, Joshua M

    2005-06-01

    Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity; it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (i.e., local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection.

  6. Genome-wide functional analysis in Candida albicans.

    PubMed

    Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J

    2017-02-08

    Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis have allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.

  7. Genome-wide RNA Tomography in the zebrafish embryo.

    PubMed

    Junker, Jan Philipp; Noël, Emily S; Guryev, Victor; Peterson, Kevin A; Shah, Gopi; Huisken, Jan; McMahon, Andrew P; Berezikov, Eugene; Bakkers, Jeroen; van Oudenaarden, Alexander

    2014-10-23

    Advancing our understanding of embryonic development is heavily dependent on identification of novel pathways or regulators. Although genome-wide techniques such as RNA sequencing are ideally suited for discovering novel candidate genes, they are unable to yield spatially resolved information in embryos or tissues. Microscopy-based approaches, using in situ hybridization, for example, can provide spatial information about gene expression, but are limited to analyzing one or a few genes at a time. Here, we present a method where we combine traditional histological techniques with low-input RNA sequencing and mathematical image reconstruction to generate a high-resolution genome-wide 3D atlas of gene expression in the zebrafish embryo at three developmental stages. Importantly, our technique enables searching for genes that are expressed in specific spatial patterns without manual image annotation. We envision broad applicability of RNA tomography as an accurate and sensitive approach for spatially resolved transcriptomics in whole embryos and dissected organs.

  8. Analysis of Heritability Using Genome-Wide Data.

    PubMed

    Hall, Jacob B; Bush, William S

    2016-10-11

    Most analyses of genome-wide association data consider each variant independently without considering or adjusting for the genetic background present in the rest of the genome. New approaches to genome analysis use representations of genomic sharing to better account for confounding factors like population stratification or to directly approximate heritability through the estimated sharing of individuals in a dataset. These approaches use mixed linear models, which relate genotypic sharing to phenotypic sharing, and rely on the efficient computation of genetic sharing among individuals in a dataset. This unit describes the principles and practical application of mixed models for the analysis of genome-wide association study data. © 2016 by John Wiley & Sons, Inc.
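
    The "genetic sharing among individuals" that such mixed models rely on is often summarized as a genomic relationship matrix (GRM). The following is a minimal sketch of one widely used construction (VanRaden's first method), assuming genotypes coded as 0/1/2 copies of an allele. It is not taken from the unit described above; the function name and the toy data are hypothetical, and allele frequencies are estimated from the sample itself.

```python
def genomic_relationship_matrix(genotypes):
    """Build a GRM from a list of per-individual genotype rows (0/1/2 coding).

    Sketch of VanRaden's first method: G = Z Z' / (2 * sum_j p_j (1 - p_j)),
    where Z holds genotypes centered by twice the sample allele frequency p_j.
    """
    n_individuals = len(genotypes)
    n_markers = len(genotypes[0])

    # Allele frequency per marker, estimated from the sample (an assumption;
    # base-population frequencies are sometimes used instead).
    freqs = [
        sum(row[j] for row in genotypes) / (2.0 * n_individuals)
        for j in range(n_markers)
    ]

    scale = 2.0 * sum(p * (1.0 - p) for p in freqs)
    if scale == 0.0:
        raise ValueError("all markers are monomorphic; GRM is undefined")

    # Center each genotype by its expected value 2p.
    centered = [
        [row[j] - 2.0 * freqs[j] for j in range(n_markers)] for row in genotypes
    ]

    return [
        [
            sum(a * b for a, b in zip(centered[i], centered[k])) / scale
            for k in range(n_individuals)
        ]
        for i in range(n_individuals)
    ]


# Toy data (hypothetical): three individuals, three markers.
toy_genotypes = [
    [0, 1, 2],
    [2, 1, 0],
    [1, 1, 1],
]
grm = genomic_relationship_matrix(toy_genotypes)
```

    In a mixed linear model, a matrix of this kind typically serves as the covariance structure of the random polygenic effect; real analyses would use dedicated software and many thousands of markers rather than this pure-Python loop.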

  9. Genome-Wide Significant Loci: How Important Are They?

    PubMed Central

    Björkegren, Johan L.M.; Kovacic, Jason C.; Dudley, Joel T.; Schadt, Eric E.

    2015-01-01

    Genome-wide association studies (GWAS) have been extensively used to study common complex diseases such as coronary artery disease (CAD), revealing 153 suggestive CAD loci, of which at least 46 have been validated as having genome-wide significance. However, these loci collectively explain <10% of the genetic variance in CAD. Thus, we must address the key question of what factors constitute the remaining 90% of CAD heritability. We review possible limitations of GWAS, and contextually consider some candidate CAD loci identified by this method. Looking ahead, we propose systems genetics as a complementary approach to unlocking the CAD heritability and etiology. Systems genetics builds network models of relevant molecular processes by combining genetic and genomic datasets to ultimately identify key “drivers” of disease. By leveraging systems-based genetic approaches, we can help reveal the full genetic basis of common complex disorders, enabling novel diagnostic and therapeutic opportunities. PMID:25720628

  10. Genome-Wide Association Study Reveals Natural Variations Contributing to Drought Resistance in Crops

    PubMed Central

    Wang, Hongwei; Qin, Feng

    2017-01-01

    Crops are often cultivated in regions where they will face environmental adversities, resulting in substantial yield loss which can ultimately lead to food and societal problems. Thus, significant efforts have been made to breed stress-tolerant cultivars in an attempt to minimize these problems and to produce more stable crop yields across broad geographies. Since stress tolerance is a complex and multi-genic trait, advancements with classical breeding approaches have been challenging. On the other hand, molecular breeding, which is based on transgenics, marker-assisted selection and genome editing technologies, holds great promise to enable farmers to better cope with these challenges. However, identification of the key genetic components underlying the trait is critical and will serve as the foundation for future crop genetic improvement. Recently, genome-wide association studies have made significant contributions to facilitate the discovery of natural variation contributing to stress tolerance in crops. From these studies, the identified loci can serve as targets for genomic selection or editing to enable the molecular design of new cultivars. Here, we summarize research progress on this issue and focus on the genetic basis of drought tolerance as revealed by genome-wide association studies and quantitative trait loci mapping. Although many favorable loci have been identified, elucidation of their molecular mechanisms contributing to increased stress tolerance still remains a challenge. Thus, continuous efforts are still required to functionally dissect this complex trait through comprehensive approaches, such as systems biology studies. It is expected that proper application of the acquired knowledge will enable the development of stress-tolerant cultivars, allowing agricultural production to become more sustainable under dynamic environmental conditions. PMID:28713401

  11. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits

    PubMed Central

    Pecetti, Luciano; Brummer, E. Charles; Palmonari, Alberto; Tava, Aldo

    2017-01-01

    Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3–0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits

  12. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast.

    PubMed

    Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-03-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages.

  13. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast

    PubMed Central

    Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-01-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095

  14. Genome-Wide Association Studies and Liver Disease

    PubMed Central

    Speliotes, Elizabeth K.

    2016-01-01

    Sequencing of the human genome has opened up many opportunities to learn about our own genetic susceptibilities to disease. In this Foreword to this issue of Seminars in Liver Disease, I provide some required background to understanding genome-wide association analyses in general, including a list of terms (Table 1) often used in such studies. Five areas of particular significance are then reviewed in detail in the articles that follow. PMID:26676811

  15. Genome-Wide Profiling of Alternative Translation Initiation Sites.

    PubMed

    Gao, Xiangwei; Wan, Ji; Qian, Shu-Bing

    2016-01-01

    Regulation of translation initiation is a central control point in protein synthesis. Variations of start codon selection contribute to protein diversity and complexity. Systemic mapping of start codon positions and precise measurement of the corresponding initiation rate would transform our understanding of translational control. Here we describe a ribosome profiling approach that enables identification of translation initiation sites on a genome-wide scale. By capturing initiating ribosomes using lactimidomycin, this approach permits qualitative and quantitative analysis of alternative translation initiation.

  16. Genome-wide epigenomic profiling for biomarker discovery.

    PubMed

    Dirks, René A M; Stunnenberg, Hendrik G; Marks, Hendrik

    2016-01-01

    A myriad of diseases is caused or characterized by alteration of epigenetic patterns, including changes in DNA methylation, post-translational histone modifications, or chromatin structure. These changes of the epigenome represent a highly interesting layer of information for disease stratification and for personalized medicine. Traditionally, epigenomic profiling required large amounts of cells, which are rarely available with clinical samples. Also, the cellular heterogeneity complicates analysis when profiling clinical samples for unbiased genome-wide biomarker discovery. Recent years saw great progress in miniaturization of genome-wide epigenomic profiling, enabling large-scale epigenetic biomarker screens for disease diagnosis, prognosis, and stratification on patient-derived samples. All main genome-wide profiling technologies have now been scaled down and/or are compatible with single-cell readout, including: (i) Bisulfite sequencing to determine DNA methylation at base-pair resolution, (ii) ChIP-Seq to identify protein binding sites on the genome, (iii) DNaseI-Seq/ATAC-Seq to profile open chromatin, and (iv) 4C-Seq and HiC-Seq to determine the spatial organization of chromosomes. In this review we provide an overview of current genome-wide epigenomic profiling technologies and main technological advances that allowed miniaturization of these assays down to single-cell level. For each of these technologies we evaluate their application for future biomarker discovery. We will focus on (i) compatibility of these technologies with methods used for clinical sample preservation, including methods used by biobanks that store large numbers of patient samples, and (ii) automation of these technologies for robust sample preparation and increased throughput.

  17. Significance of genome-wide association studies in molecular anthropology.

    PubMed

    Gupta, Vipin; Khadgawat, Rajesh; Sachdeva, Mohinder Pal

    2009-12-01

    The successful advent of a genome-wide approach in association studies raises the hopes of human geneticists for solving a genetic maze of complex traits especially the disorders. This approach, which is replete with the application of cutting-edge technology and supported by big science projects (like Human Genome Project; and even more importantly the International HapMap Project) and various important databases (SNP database, CNV database, etc.), has had unprecedented success in rapidly uncovering many of the genetic determinants of complex disorders. The magnitude of this approach in the genetics of classical anthropological variables like height, skin color, eye color, and other genome diversity projects has certainly expanded the horizons of molecular anthropology. Therefore, in this article we have proposed a genome-wide association approach in molecular anthropological studies by providing lessons from the exemplary study of the Wellcome Trust Case Control Consortium. We have also highlighted the importance and uniqueness of Indian population groups in facilitating the design and finding optimum solutions for other genome-wide association-related challenges.

  18. Voxelwise genome-wide association study (vGWAS).

    PubMed

    Stein, Jason L; Hua, Xue; Lee, Suh; Ho, April J; Leow, Alex D; Toga, Arthur W; Saykin, Andrew J; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J; Craig, David W; Gerber, Jill D; Allen, April N; Corneveaux, Jason J; Dechairo, Bryan M; Potkin, Steven G; Weiner, Michael W; Thompson, Paul

    2010-11-15

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age ± s.d.: 75.52 ± 6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. To our knowledge, this is the first voxelwise genome-wide association study, and it offers a novel method to discover genetic influences on brain structure.
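
    Editorial sketch (not the authors' code): the idea of keeping only the most associated variant at each voxel can be illustrated as below. The correction assumes the N tests at a voxel are independent, in which case the smallest of N uniform p-values has CDF 1 - (1 - p)^N. The function names and toy inputs are hypothetical, and the published method may differ, for example by using an effective number of tests.

```python
import math


def min_p_per_voxel(pvals_by_voxel):
    """For each voxel, return (index of the top SNP, its p-value)."""
    top = []
    for pvals in pvals_by_voxel:
        best = min(range(len(pvals)), key=lambda i: pvals[i])
        top.append((best, pvals[best]))
    return top


def corrected_min_p(p_min, n_tests):
    """P(min of n_tests independent uniform p-values <= p_min).

    Assumption: tests are independent, which real SNPs in linkage
    disequilibrium are not; treat the result as a rough illustration.
    """
    if p_min >= 1.0:
        return 1.0
    # 1 - (1 - p)^n, written with log1p/expm1 to stay accurate for tiny p.
    return -math.expm1(n_tests * math.log1p(-p_min))


# Toy usage: two voxels, a handful of SNP p-values each.
top_hits = min_p_per_voxel([[0.2, 0.01, 0.5], [0.9, 0.8]])
adjusted = [corrected_min_p(p, 3) for _, p in top_hits]
```

    Only one corrected value per voxel is carried forward, which is what reduces the storage and multiple-testing burden relative to keeping every SNP-by-voxel result.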

  19. Genome-wide DNA polymorphism analyses using VariScan

    PubMed Central

    Hutter, Stephan; Vilella, Albert J; Rozas, Julio

    2006-01-01

    Background: DNA sequence polymorphism analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides insight into the functional significance of genomic regions. Recent and ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Currently available methods and software, however, are unsatisfactory for such genome-wide analysis. Results: We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have incorporated a graphical user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion: VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data. PMID:16968531
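
    Editorial sketch (not VariScan code): a sliding-window analysis of the kind the abstract mentions can be illustrated with per-site nucleotide diversity, the mean number of pairwise differences per site. The function names, window settings and toy alignment are hypothetical, and the sketch assumes equal-length aligned sequences with no gaps or missing data.

```python
from itertools import combinations


def pairwise_diversity(seqs, start, end):
    """Mean pairwise differences per site in columns start..end-1."""
    pairs = list(combinations(seqs, 2))
    diffs = sum(
        sum(a != b for a, b in zip(s1[start:end], s2[start:end]))
        for s1, s2 in pairs
    )
    return diffs / (len(pairs) * (end - start))


def sliding_pi(seqs, window, step):
    """Return (window start, diversity) for each full window."""
    length = len(seqs[0])
    return [
        (start, pairwise_diversity(seqs, start, start + window))
        for start in range(0, length - window + 1, step)
    ]


# Toy alignment: variation is confined to the last two columns.
alignment = ["AAAA", "AAAT", "AATT"]
profile = sliding_pi(alignment, window=2, step=2)
```

    Windows whose diversity departs strongly from the genome-wide background are the candidates a tool like this would flag for closer inspection.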

  20. Genome-Wide Detection and Analysis of Multifunctional Genes

    PubMed Central

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  1. Genome-Wide Association Study of Polymorphisms Predisposing to Bronchiolitis

    PubMed Central

    Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti

    2017-01-01

    Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. The genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p < 10⁻⁵) were observed in the GWAS. In the replication population, three SNPs were nominally associated (p < 0.05). Of them, rs269094 was an expression quantitative trait locus (eQTL) for KCND3, previously shown to be associated with occupational asthma. In the additional set of Finnish cases, the association for another SNP (rs9591920) within a noncoding RNA locus was further strengthened. Our results provide a first genome-wide examination of the genetics underlying bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761

  2. Genome-Wide Estimates of Heritability for Social Demographic Outcomes

    PubMed Central

    Domingue, Benjamin W.; Wedow, Robbee; Conley, Dalton; McQueen, Matt; Hoffmann, Thomas J.; Boardman, Jason D.

    2016-01-01

    An increasing number of studies that are widely used in the demographic research community have collected genome-wide data from their respondents. It is therefore important that demographers have a proper understanding of some of the methodological tools needed to analyze such data. Our paper details the underlying methodology behind one of the most common techniques for analyzing genome-wide data, Genome-Wide Complex Trait Analysis (GCTA). GCTA models provide heritability estimates for health, health behaviors, or indicators of attainment using data from unrelated persons. Our goal is to describe this model, to highlight the utility of the model for biodemographic research, and to demonstrate the performance of this approach under modifications of the underlying assumptions. The first set of modifications involves changing the nature of the genetic data used to compute genetic similarities between individuals (the genetic relationship matrix). We then explore the sensitivity of the model to heteroscedastic errors. In general, GCTA estimates are robust to the modifications proposed here, but we also highlight potential limitations of GCTA estimates. PMID:27050030
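
    Editorial sketch (not GCTA code): the genetic relationship matrix mentioned above is commonly computed from standardized allele counts as A_jk = (1/M) Σ_i (x_ij − 2p_i)(x_ik − 2p_i) / (2p_i(1 − p_i)), where x is the 0/1/2 allele count and p_i the allele frequency at SNP i. The function name and toy data are hypothetical; allele frequencies are estimated from the sample itself, monomorphic SNPs are dropped, and the same formula is used for the diagonal, which the software's own estimator may treat differently.

```python
def genetic_relationship_matrix(genotypes):
    """genotypes[j][i] is the allele count (0, 1 or 2) of person j at SNP i."""
    n = len(genotypes)
    m = len(genotypes[0])
    # Sample allele frequency at each SNP.
    freqs = [sum(person[i] for person in genotypes) / (2 * n) for i in range(m)]
    # Monomorphic SNPs carry no information and would divide by zero.
    usable = [i for i in range(m) if 0.0 < freqs[i] < 1.0]
    if not usable:
        raise ValueError("no polymorphic SNPs available")
    grm = [[0.0] * n for _ in range(n)]
    for j in range(n):
        for k in range(j, n):
            total = sum(
                (genotypes[j][i] - 2 * freqs[i])
                * (genotypes[k][i] - 2 * freqs[i])
                / (2 * freqs[i] * (1 - freqs[i]))
                for i in usable
            )
            grm[j][k] = grm[k][j] = total / len(usable)
    return grm


# Toy data: three people, two informative SNPs and one monomorphic SNP.
toy = [[0, 2, 0], [2, 0, 0], [1, 1, 0]]
grm = genetic_relationship_matrix(toy)
```

    Changing how the input genotypes are selected or filtered changes this matrix, which is the kind of modification the abstract describes testing.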

  3. Genome-wide gene-environment interaction analysis for asbestos exposure in lung cancer susceptibility.

    PubMed

    Wei, Sheng; Wang, Li-E; McHugh, Michelle K; Han, Younghun; Xiong, Momiao; Amos, Christopher I; Spitz, Margaret R; Wei, Qingyi

    2012-08-01

    Asbestos exposure is a known risk factor for lung cancer. Although recent genome-wide association studies (GWASs) have identified some novel loci for lung cancer risk, few addressed genome-wide gene-environment interactions. To determine gene-asbestos interactions in lung cancer risk, we conducted genome-wide gene-environment interaction analyses at levels of single nucleotide polymorphisms (SNPs), genes and pathways, using our published Texas lung cancer GWAS dataset. This dataset included 317 498 SNPs from 1154 lung cancer cases and 1137 cancer-free controls. The initial SNP-level P-values for interactions between genetic variants and self-reported asbestos exposure were estimated by unconditional logistic regression models with adjustment for age, sex, smoking status and pack-years. The P-value for the most significant SNP rs13383928 was 2.17 × 10⁻⁶, which did not reach genome-wide statistical significance. Using a versatile gene-based test approach, we found that the top significant gene was C7orf54, located on 7q32.1 (P = 8.90 × 10⁻⁵). Interestingly, most of the other significant genes were located on 11q13. When we used an improved gene-set-enrichment analysis approach, we found that the Fas signaling pathway and the antigen processing and presentation pathway were most significant (nominal P < 0.001; false discovery rate < 0.05) among 250 pathways containing 17 572 genes. We believe that our analysis is a pilot study that first describes the gene-asbestos interaction in lung cancer risk at levels of SNPs, genes and pathways. Our findings suggest that immune function regulation-related pathways may be mechanistically involved in asbestos-associated lung cancer risk.

  4. GW-SEM: A Statistical Package to Conduct Genome-Wide Structural Equation Modeling.

    PubMed

    Verhulst, Brad; Maes, Hermine H; Neale, Michael C

    2017-05-01

    Improving the accuracy of phenotyping through the use of advanced psychometric tools will increase the power to find significant associations with genetic variants and expand the range of possible hypotheses that can be tested on a genome-wide scale. Multivariate methods, such as structural equation modeling (SEM), are valuable in the phenotypic analysis of psychiatric and substance use phenotypes, but these methods have not been integrated into standard genome-wide association analyses because fitting a SEM at each single nucleotide polymorphism (SNP) along the genome was hitherto considered to be too computationally demanding. By developing a method that can efficiently fit SEMs, it is possible to expand the set of models that can be tested. This is particularly necessary in psychiatric and behavioral genetics, where the statistical methods are often handicapped by phenotypes with large components of stochastic variance. Due to the enormous amount of data that genome-wide scans produce, the statistical methods used to analyze the data are relatively elementary, do not directly correspond with the rich theoretical development, and lack the potential to test more complex hypotheses about the measurement of, and interaction between, comorbid traits. In this paper, we present a method to test the association of a SNP with multiple phenotypes or a latent construct on a genome-wide basis using a diagonally weighted least squares (DWLS) estimator for four common SEMs: a one-factor model, a one-factor residuals model, a two-factor model, and a latent growth model. We demonstrate that the DWLS parameters and p-values strongly correspond with the more traditional full information maximum likelihood parameters and p-values. We also present the timing of simulations and power analyses and a comparison with an existing multivariate GWAS software package.

  5. Enhancing the value of the breeding bird survey: reply to Sauer et al. (2005)

    Treesearch

    Charles M. Francis; Jonathan Bart; Erica H. Dunn; Kenneth P. Burnham; C. John Ralph

    2005-01-01

    Bart et al. (2004a) proposed several approaches for enhancing the considerable value of the Breeding Bird Survey (BBS). Sauer et al. (2005) critiqued some of these approaches, and emphasized alternative goals for the survey. We agree with many of the suggestions of Sauer et al. (2005); notably that multispecies, large-scale surveys such as the BBS are most valuable for...

  6. Phenotypic structures and breeding value of open-pollinated corn varietal hybrids

    USDA-ARS's Scientific Manuscript database

    The growing interest in using open-pollinated varieties (OPVs) and varietal hybrids (OPVhs) of corn (Zea mays L.) especially in breeding programs for organic and low-input farming reflects the value of large plasticity levels available in their plant, ear, and kernel traits. We estimated and partiti...

  7. Genetic Correlations Between Carcass Traits And Molecular Breeding Values In Angus Cattle

    USDA-ARS's Scientific Manuscript database

    This research elucidated genetic relationships between carcass traits, ultrasound indicator traits, and their respective molecular breeding values (MBV). Animals whose MBV data were used to estimate (co)variance components were not previously used in development of the MBV. Results are presented fo...

  8. Comparison of Bayesian models to estimate direct genomic values in multi-breed commercial beef cattle

    USDA-ARS's Scientific Manuscript database

    Background Several studies have examined the accuracy of genomic selection both within and across purebred beef or dairy populations. However, the accuracy of direct genomic breeding values (DGVs) has been less well studied in crossbred or admixed cattle populations. We used a population of 3,240 cr...

  9. Disruptive selection without genome-wide evolution across a migratory divide.

    PubMed

    von Rönn, Jan A C; Shafer, Aaron B A; Wolf, Jochen B W

    2016-06-01

    Transcontinental migration is a fascinating example of how animals can respond to climatic oscillation. Yet, quantitative data on fitness components are scarce, and the resulting population genetic consequences are poorly understood. Migratory divides, hybrid zones with a transition in migratory behaviour, provide a natural setting to investigate the micro-evolutionary dynamics induced by migration under sympatric conditions. Here, we studied the effects of migratory programme on survival, trait evolution and genome-wide patterns of population differentiation in a migratory divide of European barn swallows. We sampled a total of 824 individuals from both allopatric European populations wintering in central and southern Africa, respectively, along with two mixed populations from within the migratory divide. While most morphological characters varied by latitude consistent with Bergmann's rule, wing length co-varied with distance to wintering grounds. Survival data collected during a 5-year period provided strong evidence that this covariance is repeatedly generated by disruptive selection against intermediate phenotypes. Yet, selection-induced divergence did not translate into genome-wide genetic differentiation as assessed by microsatellites, mtDNA and >20 000 genome-wide SNP markers; nor did we find evidence of local genomic selection between migratory types. Among breeding populations, a single outlier locus mapped to the BUB1 gene with a role in mitotic and meiotic organization. Overall, this study provides evidence for an adaptive response to variation in migration behaviour continuously eroded by gene flow under current conditions of nonassortative mating. It supports the theoretical prediction that population differentiation is difficult to achieve under conditions of gene flow despite measurable disruptive selection.

  10. Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle

    PubMed Central

    2012-01-01

    Background: Linkage disequilibrium (LD) between genes at linked or independent loci can occur at gametic and zygotic levels, known as gametic LD and zygotic LD, respectively. Gametic LD is well known for its roles in fine-scale mapping of quantitative trait loci, genomic selection and evolutionary inference. Less well studied is zygotic LD and its components, which can also be estimated directly from unphased SNPs. Results: This study was set up to investigate the genome-wide extent and patterns of zygotic LD and its components in a crossbred cattle population using the genomic data from the Illumina BovineSNP50 beadchip. The animal population arose from repeated crossbreeding of multiple breeds and selection for growth and cow reproduction. The study showed similar genomic structures in gametic and zygotic LD, with zygotic LD decaying faster than gametic LD over marker distance. The trigenic and quadrigenic disequilibria were generally two- to three-fold smaller than the usual digenic disequilibria (gametic or composite LD). There was less power for testing these high-order genic disequilibria than for the digenic disequilibria. The power estimates decreased with the distance between markers, though the decay trend is more obvious for the digenic disequilibria than for high-order disequilibria. Conclusions: This study is the first major genome-wide survey of all non-allelic associations between pairs of SNPs in a cattle population. Such analysis allows us to assess the relative importance of gametic LD vs. all other non-allelic genic LDs regardless of whether or not the population is in HWE. The observed predominance of digenic LD (gametic or composite LD) coupled with insignificant high-order trigenic and quadrigenic disequilibria supports the current intensive focus on the use of high-density SNP markers for genome-wide association studies and genomic selection activities in the cattle population. PMID:22827586
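
    Editorial sketch (not the authors' code): one way to see how a digenic disequilibrium can be estimated from unphased SNPs is the composite LD coefficient, which equals half the covariance between the allele counts (0/1/2) at the two loci. The function name and toy genotypes are hypothetical, the population (1/n) covariance is used, and the higher-order trigenic and quadrigenic components discussed in the abstract are not covered.

```python
def composite_ld(counts_a, counts_b):
    """Composite digenic LD from unphased allele counts at two SNPs.

    counts_a[i] and counts_b[i] are the number of copies (0, 1 or 2) of the
    tracked allele that individual i carries at locus A and locus B.
    """
    if len(counts_a) != len(counts_b):
        raise ValueError("both loci must be scored on the same individuals")
    n = len(counts_a)
    mean_a = sum(counts_a) / n
    mean_b = sum(counts_b) / n
    covariance = sum(
        (a - mean_a) * (b - mean_b) for a, b in zip(counts_a, counts_b)
    ) / n
    # Each individual carries two gametes, hence the factor of one half.
    return covariance / 2


# Perfectly concordant genotypes versus a balanced, unassociated design.
linked = composite_ld([0, 1, 2, 1], [0, 1, 2, 1])
unlinked = composite_ld([0, 2, 0, 2], [0, 0, 2, 2])
```

    Because no haplotype phase is needed, this quantity can be computed directly from SNP-chip genotypes.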

  11. Genome Wide Association Mapping for Arabinoxylan Content in a Collection of Tetraploid Wheats.

    PubMed

    Marcotuli, Ilaria; Houston, Kelly; Waugh, Robbie; Fincher, Geoffrey B; Burton, Rachel A; Blanco, Antonio; Gadaleta, Agata

    2015-01-01

    Arabinoxylans (AXs) are major components of plant cell walls in bread wheat and are important in bread-making and starch extraction. Furthermore, arabinoxylans are components of soluble dietary fibre that has potential health-promoting effects in human nutrition. Despite their high value for human health, few studies have been carried out on the genetics of AX content in durum wheat. The genetic variability of AX content was investigated in a set of 104 tetraploid wheat genotypes and regions attributable to AX content were identified through a genome wide association study (GWAS). The amount of arabinoxylan, expressed as percentage (w/w) of the dry weight of the kernel, ranged from 1.8% to 5.5% with a mean value of 4.0%. The GWAS revealed a total of 37 significant marker-trait associations (MTA), identifying 19 quantitative trait loci (QTL) associated with AX content. The highest number of MTAs was identified on chromosome 5A (seven), where three QTL regions were associated with AX content, while the lowest number of MTAs was detected on chromosomes 2B and 4B, where only one MTA identified a single locus. Conservation of synteny between SNP marker sequences and the annotated genes and proteins in Brachypodium distachyon, Oryza sativa and Sorghum bicolor allowed the identification of nine QTL coincident with candidate genes. These included a glycosyl hydrolase GH35, which encodes Gal7 and a glucosyltransferase GT31 on chromosome 1A; a cluster of GT1 genes on chromosome 2B that includes TaUGT1 and cisZog1; a glycosyl hydrolase that encodes a CelC gene on chromosome 3A; Ugt12887 and TaUGT1 genes on chromosome 5A; a (1,3)-β-D-glucan synthase (Gsl12 gene) and a glucosyl hydrolase (Cel8 gene) on chromosome 7A. This study identifies significant MTAs for the AX content in the grain of tetraploid wheat genotypes. We propose that these may be used for molecular breeding of durum wheat varieties with higher soluble fibre content.

  12. Genome Wide Association Mapping for Arabinoxylan Content in a Collection of Tetraploid Wheats

    PubMed Central

    Marcotuli, Ilaria; Houston, Kelly; Waugh, Robbie; Fincher, Geoffrey B.; Burton, Rachel A.; Blanco, Antonio; Gadaleta, Agata

    2015-01-01

    Background: Arabinoxylans (AXs) are major components of plant cell walls in bread wheat and are important in bread-making and starch extraction. Furthermore, arabinoxylans are components of soluble dietary fibre that has potential health-promoting effects in human nutrition. Despite their high value for human health, few studies have been carried out on the genetics of AX content in durum wheat. Results: The genetic variability of AX content was investigated in a set of 104 tetraploid wheat genotypes and regions attributable to AX content were identified through a genome wide association study (GWAS). The amount of arabinoxylan, expressed as percentage (w/w) of the dry weight of the kernel, ranged from 1.8% to 5.5% with a mean value of 4.0%. The GWAS revealed a total of 37 significant marker-trait associations (MTA), identifying 19 quantitative trait loci (QTL) associated with AX content. The highest number of MTAs was identified on chromosome 5A (seven), where three QTL regions were associated with AX content, while the lowest number of MTAs was detected on chromosomes 2B and 4B, where only one MTA identified a single locus. Conservation of synteny between SNP marker sequences and the annotated genes and proteins in Brachypodium distachyon, Oryza sativa and Sorghum bicolor allowed the identification of nine QTL coincident with candidate genes. These included a glycosyl hydrolase GH35, which encodes Gal7 and a glucosyltransferase GT31 on chromosome 1A; a cluster of GT1 genes on chromosome 2B that includes TaUGT1 and cisZog1; a glycosyl hydrolase that encodes a CelC gene on chromosome 3A; Ugt12887 and TaUGT1 genes on chromosome 5A; a (1,3)-β-D-glucan synthase (Gsl12 gene) and a glucosyl hydrolase (Cel8 gene) on chromosome 7A. Conclusions: This study identifies significant MTAs for the AX content in the grain of tetraploid wheat genotypes. We propose that these may be used for molecular breeding of durum wheat varieties with higher soluble fibre content. PMID:26176552

  13. Genome-wide association study identifies candidate markers for bull fertility in Holstein dairy cattle.

    PubMed

    Peñagaricano, F; Weigel, K A; Khatib, H

    2012-07-01

    The decline in the reproductive efficiency of dairy cattle has become a challenging problem worldwide. Female fertility is now taken into account in breeding goals while generally less attention is given to male fertility. The objective of this study was to perform a genome-wide association study in Holstein bulls to identify genetic variants significantly related to sire conception rate (SCR), a new phenotypic evaluation of bull fertility. The analysis included 1755 sires with SCR data and 38,650 single nucleotide polymorphisms (SNPs) spanning the entire bovine genome. Associations between SNPs and SCR were analyzed using a mixed linear model that included a random polygenic effect and SNP genotype either as a linear covariate or as a categorical variable. A multiple testing correction approach was used to account for the correlation between SNPs because of linkage disequilibrium. After genome-wide correction, eight SNPs showed significant association with SCR. Some of these SNPs are located close to or in the middle of genes with functions related to male fertility, such as the sperm acrosome reaction, chromatin remodeling during the spermatogenesis, and the meiotic process during male germ cell maturation. Some SNPs showed marked dominance effects, which provide more evidence for the relevance of non-additive effects in traits closely related to fitness such as fertility. The results could contribute to the identification of genes and pathways associated with male fertility in dairy cattle.

  14. Software for Genome-Wide Association Studies in Autopolyploids and Its Application to Potato.

    PubMed

    Rosyara, Umesh R; De Jong, Walter S; Douches, David S; Endelman, Jeffrey B

    2016-07-01

    Genome-wide association studies (GWAS) are widely used in diploid species to study complex traits in diversity and breeding populations, but GWAS software tailored to autopolyploids is lacking. The objectives of this research were to (i) develop an R package for autopolyploids based on the Q+K mixed model, (ii) validate the software with simulated data, and (iii) analyze a diversity panel of tetraploid potatoes. A unique feature of the R package, called GWASpoly, is its ability to model different types of polyploid gene action, including additive, simplex dominant, and duplex dominant. Using a simulated tetraploid population, we confirmed our hypothesis that statistical power is higher when the assumed gene action in the GWAS model matches the gene action at unobserved quantitative trait loci (QTL). Thirteen traits were analyzed in the Solanaceae Coordinated Agricultural Project (SolCAP) potato diversity panel and, consistent with previous studies, significant QTL for tuber shape and eye depth co-localized on chromosome 10. For the other traits, only marginally significant QTL were detected, most likely due to insufficient statistical power: for simulated traits with a heritability (h²) of 0.3, the median genome-wide power was only 0.01. Our results indicate that both marker density and population size were limiting factors for GWAS with the SolCAP panel. Copyright © 2016 Crop Science Society of America.
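
    The three gene-action models named in this abstract can be illustrated as different codings of allele dosage in a tetraploid. The sketch below is only an illustration under stated assumptions: the function name gene_action_score and the model labels are hypothetical, it is not GWASpoly's implementation, and it assumes the counted allele is the dominant one (the opposite case is the mirror image).

```python
def gene_action_score(dosage: int, model: str, ploidy: int = 4) -> float:
    """Map an allele dosage (0..ploidy) to a marker covariate.

    Hypothetical helper for illustration; assumes the counted allele
    is the dominant one.
    """
    if not 0 <= dosage <= ploidy:
        raise ValueError("dosage must lie between 0 and ploidy")
    if model == "additive":
        # Effect proportional to the number of allele copies.
        return dosage / ploidy
    if model == "simplex_dominant":
        # One copy is enough: all carriers share the same score.
        return 1.0 if dosage >= 1 else 0.0
    if model == "duplex_dominant":
        # At least two copies are needed; simplex behaves like nulliplex.
        return 1.0 if dosage >= 2 else 0.0
    raise ValueError(f"unknown model: {model}")


for name in ("additive", "simplex_dominant", "duplex_dominant"):
    print(name, [gene_action_score(d, name) for d in range(5)])
```

    Under this coding the additive covariate distinguishes all five dosage classes, whereas each dominant model collapses them into two groups, which is why a mismatch between assumed and true gene action can cost power.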

  15. Genome-Wide Divergence in the West-African Malaria Vector Anopheles melas

    PubMed Central

    Deitz, Kevin C.; Athrey, Giridhar A.; Jawara, Musa; Overgaard, Hans J.; Matias, Abrahan; Slotman, Michel A.

    2016-01-01

    Anopheles melas is a member of the recently diverged An. gambiae species complex, a model for speciation studies, and is a locally important malaria vector along the West-African coast where it breeds in brackish water. A recent population genetic study of An. melas revealed species-level genetic differentiation between three population clusters. An. melas West extends from The Gambia to the village of Tiko, Cameroon. The other mainland cluster, An. melas South, extends from the southern Cameroonian village of Ipono to Angola. Bioko Island, Equatorial Guinea An. melas populations are genetically isolated from mainland populations. To examine how genetic differentiation between these An. melas forms is distributed across their genomes, we conducted a genome-wide analysis of genetic differentiation and selection using whole genome sequencing data of pooled individuals (Pool-seq) from a representative population of each cluster. The An. melas forms exhibit high levels of genetic differentiation throughout their genomes, including the presence of numerous fixed differences between clusters. Although the level of divergence between the clusters is on a par with that of other species within the An. gambiae complex, patterns of genome-wide divergence and diversity do not provide evidence for the presence of pre- and/or postmating isolating mechanisms in the form of speciation islands. These results are consistent with an allopatric divergence process with little or no introgression. PMID:27466271

  16. Genome-wide association studies for fatty acid metabolic traits in five divergent pig populations

    PubMed Central

    Zhang, Wanchang; Bin Yang; Zhang, Junjie; Cui, Leilei; Ma, Junwu; Chen, Congying; Ai, Huashui; Xiao, Shijun; Ren, Jun; Huang, Lusheng

    2016-01-01

    Fatty acid composition profiles are important indicators of meat quality and tasting flavor. Metabolic indices of fatty acids are more authentic to reflect meat nutrition and public acceptance. To investigate the genetic mechanism of fatty acid metabolic indices in pork, we conducted genome-wide association studies (GWAS) for 33 fatty acid metabolic traits in five pig populations. We identified a total of 865 single nucleotide polymorphisms (SNPs), corresponding to 11 genome-wide significant loci on nine chromosomes and 12 suggestive loci on nine chromosomes. Our findings not only confirmed seven previously reported QTL with stronger association strength, but also revealed four novel population-specific loci, showing that investigations on intermediate phenotypes like the metabolic traits of fatty acids can increase the statistical power of GWAS for end-point phenotypes. We proposed a list of candidate genes at the identified loci, including three novel genes (FADS2, SREBF1 and PLA2G7). Further, we constructed the functional networks involving these candidate genes and deduced the potential fatty acid metabolic pathway. These findings advance our understanding of the genetic basis of fatty acid composition in pigs. The results from European hybrid commercial pigs can be immediately transited into breeding practice for beneficial fatty acid composition. PMID:27097669

  17. Genome-wide association study of drought-related resistance traits in Aegilops tauschii

    PubMed Central

    Qin, Peng; Lin, Yu; Hu, Yaodong; Liu, Kun; Mao, Shuangshuang; Li, Zhanyi; Wang, Jirui; Liu, Yaxi; Wei, Yuming; Zheng, Youliang

    2016-01-01

    The D-genome progenitor of wheat (Triticum aestivum), Aegilops tauschii, possesses numerous genes for resistance to abiotic stresses, including drought. Therefore, information on the genetic architecture of A. tauschii can aid the development of drought-resistant wheat varieties. Here, we evaluated 13 traits in 373 A. tauschii accessions grown under normal and polyethylene glycol-simulated drought stress conditions and performed a genome-wide association study using 7,185 single nucleotide polymorphism (SNP) markers. We identified 208 and 28 SNPs associated with all traits using the general linear model and mixed linear model, respectively, while both models detected 25 significant SNPs with genome-wide distribution. Public database searches revealed several candidate/flanking genes related to drought resistance that were grouped into three categories according to the type of encoded protein (enzyme, storage protein, and drought-induced protein). This study provided essential information for SNPs and genes related to drought resistance in A. tauschii and wheat, and represents a foundation for breeding drought-resistant wheat cultivars using marker-assisted selection. PMID:27560650

  18. Genome-wide association studies for multiple diseases of the German Shepherd Dog

    PubMed Central

    Tsai, Kate L.; Noorai, Rooksana E.; Starr-Moss, Alison N.; Quignon, Pascale; Rinz, Caitlin J.; Ostrander, Elaine A.; Steiner, Jörg M.; Murphy, Keith E.

    2012-01-01

    The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877

  19. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species.

    PubMed

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-02-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.
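
    The two ways of quoting SSR marker frequency in this abstract (markers per Mb and one marker every so many kb) are reciprocals of each other. A minimal check of that arithmetic, using only the densities given above (the variable names are illustrative):

```python
# Marker densities quoted in the abstract, in markers per Mb.
densities_per_mb = [408.2, 343.8, 356.2]

# 1 Mb = 1000 kb, so the mean spacing in kb is 1000 / density.
spacing_kb = [round(1000.0 / d, 2) for d in densities_per_mb]

print(spacing_kb)
```

    The rounded spacings reproduce the 2.45, 2.91 and 2.81 kb figures stated in the abstract.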

  20. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  1. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  2. Genome-Wide Microsatellite Characterization and Marker Development in the Sequenced Brassica Crop Species

    PubMed Central

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-01-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species. PMID:24130371

  3. Genome-Wide Divergence in the West-African Malaria Vector Anopheles melas.

    PubMed

    Deitz, Kevin C; Athrey, Giridhar A; Jawara, Musa; Overgaard, Hans J; Matias, Abrahan; Slotman, Michel A

    2016-09-08

    Anopheles melas is a member of the recently diverged An. gambiae species complex, a model for speciation studies, and is a locally important malaria vector along the West-African coast where it breeds in brackish water. A recent population genetic study of An. melas revealed species-level genetic differentiation between three population clusters. An. melas West extends from The Gambia to the village of Tiko, Cameroon. The other mainland cluster, An. melas South, extends from the southern Cameroonian village of Ipono to Angola. Bioko Island, Equatorial Guinea An. melas populations are genetically isolated from mainland populations. To examine how genetic differentiation between these An. melas forms is distributed across their genomes, we conducted a genome-wide analysis of genetic differentiation and selection using whole genome sequencing data of pooled individuals (Pool-seq) from a representative population of each cluster. The An. melas forms exhibit high levels of genetic differentiation throughout their genomes, including the presence of numerous fixed differences between clusters. Although the level of divergence between the clusters is on a par with that of other species within the An. gambiae complex, patterns of genome-wide divergence and diversity do not provide evidence for the presence of pre- and/or postmating isolating mechanisms in the form of speciation islands. These results are consistent with an allopatric divergence process with little or no introgression.

  4. Genome-Wide Meta-Analysis of Longitudinal Alcohol Consumption Across Youth and Early Adulthood.

    PubMed

    Adkins, Daniel E; Clark, Shaunna L; Copeland, William E; Kennedy, Martin; Conway, Kevin; Angold, Adrian; Maes, Hermine; Liu, Youfang; Kumar, Gaurav; Erkanli, Alaattin; Patkar, Ashwin A; Silberg, Judy; Brown, Tyson H; Fergusson, David M; Horwood, L John; Eaves, Lindon; van den Oord, Edwin J C G; Sullivan, Patrick F; Costello, E J

    2015-08-01

    The public health burden of alcohol is unevenly distributed across the life course, with levels of use, abuse, and dependence increasing across adolescence and peaking in early adulthood. Here, we leverage this temporal patterning to search for common genetic variants predicting developmental trajectories of alcohol consumption. Comparable psychiatric evaluations measuring alcohol consumption were collected in three longitudinal community samples (N=2,126, obs=12,166). Repeated consumption measurements spanning adolescence and early adulthood were analyzed using linear mixed models, estimating individual consumption trajectories, which were then tested for association with Illumina 660W-Quad genotype data (866,099 SNPs after imputation and QC). Association results were combined across samples using standard meta-analysis methods. Four meta-analysis associations satisfied our pre-determined genome-wide significance criterion (FDR<0.1) and six others met our 'suggestive' criterion (FDR<0.2). Genome-wide significant associations were highly biologically plausible, including associations within GABA transporter 1, SLC6A1 (solute carrier family 6, member 1), and exonic hits in LOC100129340 (mitofusin-1-like). Pathway analyses elaborated single marker results, indicating significantly enriched associations to intuitive biological mechanisms, including neurotransmission, xenobiotic pharmacodynamics, and nuclear hormone receptors (NHR). These findings underscore the value of combining longitudinal behavioral data and genome-wide genotype information in order to study developmental patterns and improve statistical power in genomic studies.
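
    The FDR criteria above (FDR<0.1 and FDR<0.2) are applied to per-SNP q-values. The abstract does not say which FDR procedure was used, so the sketch below assumes the common Benjamini-Hochberg step-up adjustment; the function name bh_qvalues and the example p-values are made up for illustration.

```python
def bh_qvalues(pvals):
    """Benjamini-Hochberg adjusted p-values (q-values), in input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])  # indices by ascending p
    q = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotone q-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        q[i] = running_min
    return q


example_p = [0.01, 0.04, 0.03, 0.005]  # hypothetical p-values
example_q = bh_qvalues(example_p)
significant = [q < 0.1 for q in example_q]  # the 'genome-wide' criterion above
print(example_q, significant)
```

    With real data the same function would be applied to all tested SNPs at once, since each q-value depends on the total number of tests.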

  5. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set

    PubMed Central

    Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori

    2016-01-01

    To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10^-8, the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10^-8 (AFR), 9.26 × 10^-8 (EUR), 1.83 × 10^-7 (AMR), 1.61 × 10^-7 (EAS) and 9.46 × 10^-8 (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10^-8 (ALL) and 4.20 × 10^-8 (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10^-8) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples. PMID:27305981
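
    The comparison drawn in this abstract can be made concrete with a few lines of arithmetic. The sketch below takes the thresholds quoted above and, as an assumption not stated in the abstract, treats each as a Bonferroni-style bound at a family-wise error rate of 0.05, so that 0.05 divided by the threshold gives an implied number of independent tests. The names ALPHA, CONVENTIONAL and implied_tests are illustrative.

```python
ALPHA = 0.05            # assumed family-wise error rate (not stated in the abstract)
CONVENTIONAL = 5.0e-8   # the conventional genome-wide threshold

# Empirical thresholds quoted in the abstract.
empirical = {
    "AFR": 3.24e-8,
    "EUR": 9.26e-8,
    "AMR": 1.83e-7,
    "EAS": 1.61e-7,
    "SAS": 9.46e-8,
}


def implied_tests(threshold: float, alpha: float = ALPHA) -> float:
    """Number of independent tests implied by a Bonferroni-style threshold."""
    return alpha / threshold


# Populations whose empirical threshold is larger (less strict) than 5e-8.
conventional_too_strict = sorted(
    pop for pop, t in empirical.items() if t > CONVENTIONAL
)

for pop, t in empirical.items():
    print(pop, f"{implied_tests(t):,.0f} implied independent tests")
print(conventional_too_strict)
```

    Under that assumption the conventional 5.0 × 10^-8 corresponds to about one million independent tests, and only the AFR threshold falls below it, matching the abstract's conclusion.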

  6. Quality control and quality assurance in genotypic data for genome-wide association studies

    PubMed Central

    Laurie, Cathy C.; Doheny, Kimberly F.; Mirel, Daniel B.; Pugh, Elizabeth W.; Bierut, Laura J.; Bhangale, Tushar; Boehm, Frederick; Caporaso, Neil E.; Cornelis, Marilyn C.; Edenberg, Howard J.; Gabriel, Stacy B.; Harris, Emily L.; Hu, Frank B.; Jacobs, Kevin; Kraft, Peter; Landi, Maria Teresa; Lumley, Thomas; Manolio, Teri A.; McHugh, Caitlin; Painter, Ian; Paschall, Justin; Rice, John P.; Rice, Kenneth M.; Zheng, Xiuwen; Weir, Bruce S.

    2011-01-01

    Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies. This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium (HWE) test p-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis (PCA) to SNP selection. The methods are illustrated with examples from the ‘Gene Environment Association Studies’ (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of genome-wide association studies. PMID:20718045
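
    Item (5) in this abstract relies on a per-SNP Hardy-Weinberg equilibrium test. The abstract does not specify the test, so the sketch below assumes the simple one-degree-of-freedom chi-square version (exact tests are also widely used); hwe_chisq is a hypothetical helper, and the genotype counts are made up.

```python
import math


def hwe_chisq(n_aa: int, n_ab: int, n_bb: int):
    """Chi-square HWE test from genotype counts; returns (statistic, p-value)."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic SNPs).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of chi-square with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


print(hwe_chisq(25, 50, 25))   # counts exactly at HWE proportions
print(hwe_chisq(50, 0, 50))    # no heterozygotes at all
```

    Plotting such p-values against allele frequency across all SNPs is one way to look for the frequency-dependent genotyping artifacts the abstract describes.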

  7. A Genome-Wide Association Study Identifies Genetic Variants Associated with Mathematics Ability.

    PubMed

    Chen, Huan; Gu, Xiao-Hong; Zhou, Yuxi; Ge, Zeng; Wang, Bin; Siok, Wai Ting; Wang, Guoqing; Huen, Michael; Jiang, Yuyang; Tan, Li-Hai; Sun, Yimin

    2017-02-03

    Mathematics ability is a complex cognitive trait with polygenic heritability. The genome-wide association study (GWAS) has been an effective approach to investigate genetic components underlying mathematics ability. Although previous studies reported several candidate genetic variants, none of them exceeded the genome-wide significance threshold in general populations. Herein, we performed GWAS in Chinese elementary school students to identify potential genetic variants associated with mathematics ability. The discovery stage included 494 and 504 individuals from two independent cohorts respectively. The replication stage included another cohort of 599 individuals. In total, 28 of 81 candidate SNPs that met validation criteria were further replicated. Combined meta-analysis of three cohorts identified four SNPs (rs1012694, rs11743006, rs17778739 and rs17777541) of the SPOCK1 gene showing association with mathematics ability (minimum p value 5.67 × 10^-10, maximum β -2.43). The SPOCK1 gene is located on chromosome 5q31.2 and encodes a highly conserved glycoprotein testican-1 which was associated with tumor progression and prognosis as well as neurogenesis. This is the first study to report genome-wide significant association of individual SNPs with mathematics ability in general populations. Our preliminary results further supported the role of SPOCK1 during neurodevelopment. The genetic complexities underlying mathematics ability might help to explain the basis of human cognition and intelligence at the genetic level.

  8. A Genome-Wide Association Study Identifies Genetic Variants Associated with Mathematics Ability

    PubMed Central

    Chen, Huan; Gu, Xiao-hong; Zhou, Yuxi; Ge, Zeng; Wang, Bin; Siok, Wai Ting; Wang, Guoqing; Huen, Michael; Jiang, Yuyang; Tan, Li-Hai; Sun, Yimin

    2017-01-01

    Mathematics ability is a complex cognitive trait with polygenic heritability. The genome-wide association study (GWAS) has been an effective approach to investigate genetic components underlying mathematics ability. Although previous studies reported several candidate genetic variants, none of them exceeded the genome-wide significance threshold in general populations. Herein, we performed GWAS in Chinese elementary school students to identify potential genetic variants associated with mathematics ability. The discovery stage included 494 and 504 individuals from two independent cohorts respectively. The replication stage included another cohort of 599 individuals. In total, 28 of 81 candidate SNPs that met validation criteria were further replicated. Combined meta-analysis of three cohorts identified four SNPs (rs1012694, rs11743006, rs17778739 and rs17777541) of the SPOCK1 gene showing association with mathematics ability (minimum p value 5.67 × 10^-10, maximum β -2.43). The SPOCK1 gene is located on chromosome 5q31.2 and encodes a highly conserved glycoprotein testican-1 which was associated with tumor progression and prognosis as well as neurogenesis. This is the first study to report genome-wide significant association of individual SNPs with mathematics ability in general populations. Our preliminary results further supported the role of SPOCK1 during neurodevelopment. The genetic complexities underlying mathematics ability might help to explain the basis of human cognition and intelligence at the genetic level. PMID:28155865

  9. Genome-wide association study of personality traits in bipolar patients

    PubMed Central

    Alliey-Rodriguez, Ney; Zhang, Dandan; Badner, Judith A.; Lahey, Benjamin B.; Zhang, Xiaotong; Dinwiddie, Stephen; Romanos, Benjamin; Plenys, Natalie; Liu, Chunyu; Gershon, Elliot S.

    2011-01-01

    Objective: A genome-wide association study was carried out on personality traits among bipolar patients as possible endophenotypes for gene discovery in bipolar disorder. Methods: The subscales of Cloninger’s Temperament and Character Inventory (TCI) and the Zuckerman–Kuhlman Personality Questionnaire (ZKPQ) were used as quantitative phenotypes. The genotyping platform was the Affymetrix 6.0 SNP array. The sample consisted of 944 individuals for TCI and 1007 for ZKPQ, all of European ancestry, diagnosed with bipolar disorder by Diagnostic and Statistical Manual of Mental Disorders-IV criteria. Results: Genome-wide significant association was found for two subscales of the TCI, rs10479334 with the ‘Social Acceptance versus Social Intolerance’ subscale (Bonferroni P = 0.014) in an intergenic region, and rs9419788 with the ‘Spiritual Acceptance versus Rational Materialism’ subscale (Bonferroni P = 0.036) in the PLCE1 gene. Although genome-wide significance was not reached for ZKPQ scales, the lowest P values pointed to the genes RXRG for Sensation Seeking, GRM7 and ITK for Neuroticism Anxiety, and SPTLC3 for Aggression Hostility. Conclusion: After correction for the 25 subscales in TCI and four scales plus two subscales in ZKPQ, phenotype-wide significance was not reached. PMID:21368711

  10. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set.

    PubMed

    Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori

    2016-10-01

    To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10^-8, the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10^-8 (AFR), 9.26 × 10^-8 (EUR), 1.83 × 10^-7 (AMR), 1.61 × 10^-7 (EAS) and 9.46 × 10^-8 (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10^-8 (ALL) and 4.20 × 10^-8 (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10^-8) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples.

  11. Meta-analyses of genome-wide linkage scans of anxiety-related phenotypes

    PubMed Central

    Webb, Bradley T; Guo, An-Yuan; Maher, Brion S; Zhao, Zhongming; van den Oord, Edwin J; Kendler, Kenneth S; Riley, Brien P; Gillespie, Nathan A; Prescott, Carol A; Middeldorp, Christel M; Willemsen, Gonneke; de Geus, Eco JC; Hottenga, Jouke-Jan; Boomsma, Dorret I; Slagboom, Eline P; Wray, Naomi R; Montgomery, Grant W; Martin, Nicholas G; Wright, Margie J; Heath, Andrew C; Madden, Pamela A; Gelernter, Joel; Knowles, James A; Hamilton, Steven P; Weissman, Myrna M; Fyer, Abby J; Huezo-Diaz, Patricia; McGuffin, Peter; Farmer, Anne; Craig, Ian W; Lewis, Cathryn; Sham, Pak; Crowe, Raymond R; Flint, Jonathan; Hettema, John M

    2012-01-01

    Genetic factors underlying trait neuroticism, reflecting a tendency towards negative affective states, may overlap genetic susceptibility for anxiety disorders and help explain the extensive comorbidity amongst internalizing disorders. Genome-wide linkage (GWL) data from several studies of neuroticism and anxiety disorders have been published, providing an opportunity to test such hypotheses and identify genomic regions that harbor genes common to these phenotypes. In all, 11 independent GWL studies of either neuroticism (n=8) or anxiety disorders (n=3) were collected, which comprised 5341 families with 15 529 individuals. The rank-based genome scan meta-analysis (GSMA) approach was used to analyze each trait separately and combined, and global correlations between results were examined. False discovery rate (FDR) analysis was performed to test for enrichment of significant effects. Using 10 cM intervals, bins nominally significant for both GSMA statistics, PSR and POR, were found on chromosomes 9, 11, 12, and 14 for neuroticism and on chromosomes 1, 5, 15, and 16 for anxiety disorders. Genome-wide, the results for the two phenotypes were significantly correlated, and a combined analysis identified additional nominally significant bins. Although none reached genome-wide significance, an excess of significant PSR P-values was observed, with 12 bins falling under an FDR threshold of 0.50. As demonstrated by our identification of multiple, consistent signals across the genome, meta-analytically combining existing GWL data is a valuable approach to narrowing down regions relevant for anxiety-related phenotypes. This may prove useful for prioritizing emerging genome-wide association data for anxiety disorders. PMID:22473089

  12. Common genetic variation and survival after colorectal cancer diagnosis: a genome-wide analysis

    PubMed Central

    Phipps, Amanda I.; Passarelli, Michael N.; Chan, Andrew T.; Harrison, Tabitha A.; Jeon, Jihyoun; Hutter, Carolyn M.; Berndt, Sonja I.; Brenner, Hermann; Caan, Bette J.; Campbell, Peter T.; Chang-Claude, Jenny; Chanock, Stephen J.; Cheadle, Jeremy P.; Curtis, Keith R.; Duggan, David; Fisher, David; Fuchs, Charles S.; Gala, Manish; Giovannucci, Edward L.; Hayes, Richard B.; Hoffmeister, Michael; Hsu, Li; Jacobs, Eric J.; Jansen, Lina; Kaplan, Richard; Kap, Elisabeth J.; Maughan, Timothy S.; Potter, John D.; Schoen, Robert E.; Seminara, Daniela; Slattery, Martha L.; West, Hannah; White, Emily; Peters, Ulrike; Newcomb, Polly A.

    2016-01-01

    Genome-wide association studies have identified several germline single nucleotide polymorphisms (SNPs) significantly associated with colorectal cancer (CRC) incidence. Common germline genetic variation may also be related to CRC survival. We used a discovery-based approach to identify SNPs related to survival outcomes after CRC diagnosis. Genome-wide genotyping arrays were conducted for 3494 individuals with invasive CRC enrolled in six prospective cohort studies (median study-specific follow-up = 4.2–8.1 years). In pooled analyses, we used Cox regression to assess SNP-specific associations with CRC-specific and overall survival, with additional analyses stratified by stage at diagnosis. Top findings were followed up in independent studies. A P value threshold of P < 5×10⁻⁸ in analyses combining discovery and follow-up studies was required for genome-wide significance. Among individuals with distant-metastatic CRC, several SNPs at 6p12.1, nearest the ELOVL5 gene, were statistically significantly associated with poorer survival, with the strongest associations noted for rs209489 [hazard ratio (HR) = 1.8, P = 7.6×10⁻¹⁰ and HR = 1.8, P = 3.7×10⁻⁹ for CRC-specific and overall survival, respectively]. No SNPs were statistically significantly associated with survival among all cases combined or in cases without distant-metastases. SNPs in 6p12.1/ELOVL5 were associated with survival outcomes in individuals with distant-metastatic CRC, and merit further follow-up for functional significance. Findings from this genome-wide association study highlight the potential importance of genetic variation in CRC prognosis and provide clues to genomic regions of potential interest. PMID:26586795

  13. Sniffing out significant “Pee values”: genome wide association study of asparagus anosmia

    PubMed Central

    Markt, Sarah C; Nuttall, Elizabeth; Turman, Constance; Sinnott, Jennifer; Rimm, Eric B; Ecsedy, Ethan; Unger, Robert H; Fall, Katja; Finn, Stephen; Jensen, Majken K; Rider, Jennifer R; Kraft, Peter

    2016-01-01

    Objective To determine the inherited factors associated with the ability to smell asparagus metabolites in urine. Design Genome wide association study. Setting Nurses’ Health Study and Health Professionals Follow-up Study cohorts. Participants 6909 men and women of European-American descent with available genetic data from genome wide association studies. Main outcome measure Participants were characterized as asparagus smellers if they strongly agreed with the prompt “after eating asparagus, you notice a strong characteristic odor in your urine,” and anosmic if otherwise. We calculated per-allele estimates of asparagus anosmia for about nine million single nucleotide polymorphisms using logistic regression. P values <5×10⁻⁸ were considered as genome wide significant. Results 58.0% of men (n=1449/2500) and 61.5% of women (n=2712/4409) had anosmia. 871 single nucleotide polymorphisms reached genome wide significance for asparagus anosmia, all in a region on chromosome 1 (1q44: 248139851-248595299) containing multiple genes in the olfactory receptor 2 (OR2) family. Conditional analyses revealed three independent markers associated with asparagus anosmia: rs13373863, rs71538191, and rs6689553. Conclusion A large proportion of people have asparagus anosmia. Genetic variation near multiple olfactory receptor genes is associated with the ability of an individual to smell the metabolites of asparagus in urine. Future replication studies are necessary before considering targeted therapies to help anosmic people discover what they are missing. PMID:27965198

  14. A genome-wide association study of sporadic ALS in a homogenous Irish population.

    PubMed

    Cronin, Simon; Berger, Stephen; Ding, Jinhui; Schymick, Jennifer C; Washecka, Nicole; Hernandez, Dena G; Greenway, Matthew J; Bradley, Daniel G; Traynor, Bryan J; Hardiman, Orla

    2008-03-01

    Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease characterized by progressive limb or bulbar weakness. Efforts to elucidate the disease-associated loci have to date produced conflicting results. One strategy to improve power in genome-wide studies is to genotype a genetically homogenous population. Such a population exhibits extended linkage disequilibrium (LD) and lower allelic heterogeneity to facilitate disease gene mapping. We sought to identify associated variants for ALS in the Irish, a stable population of relatively homogenous genetic background, and to replicate these findings in larger genetically out-bred populations. We conducted a genome-wide association study in 432 Irish individuals using Illumina HumanHap 550K single nucleotide polymorphism chips. We demonstrated extended LD and increased homogeneity in the Irish sample when compared to an out-bred population of mixed European ancestry. The Irish scan identified 35 loci associated with P-values below 0.0001. For replication, we identified seven chromosomal regions commonly associated in a joint analysis of genome-wide data on 958 ALS cases and 932 controls from Ireland and the previously published datasets from the US and The Netherlands. When pooled, the strongest association was a variant in the gene encoding DPP6, a component of type A neuronal transmembrane potassium channels. Further confirmation of the candidate loci is warranted in additional genome-wide datasets. We have made our individual genotyping data publicly available, contributing to a powerful world-wide resource to refine our understanding of the genetics of sporadic ALS.

  15. Genome-wide association study reveals novel variants for growth and egg traits in Dongxiang blue-shelled and White Leghorn chickens.

    PubMed

    Liao, R; Zhang, X; Chen, Q; Wang, Z; Wang, Q; Yang, C; Pan, Y

    2016-10-01

    This study was designed to investigate the genetic basis of growth and egg traits in Dongxiang blue-shelled chickens and White Leghorn chickens. In this study, we employed a reduced representation sequencing approach called genotyping by genome reducing and sequencing to detect genome-wide SNPs in 252 Dongxiang blue-shelled chickens and 252 White Leghorn chickens. The Dongxiang blue-shelled chicken breed has many specific traits and is characterized by blue-shelled eggs, black plumage, black skin, black bone and black organs. The White Leghorn chicken is an egg-type breed with high productivity. As multibreed genome-wide association studies (GWASs) can improve precision due to less linkage disequilibrium across breeds, a multibreed GWAS was performed with 156 575 SNPs to identify the associated variants underlying growth and egg traits within the two chicken breeds. The analysis revealed 32 SNPs exhibiting a significant genome-wide association with growth and egg traits. Some of the significant SNPs are located in genes that are known to impact growth and egg traits, but nearly half of the significant SNPs are located in genes with unclear functions in chickens. To our knowledge, this is the first multibreed genome-wide report for the genetics of growth and egg traits in the Dongxiang blue-shelled and White Leghorn chickens.

  16. Genome-wide associations for water-soluble carbohydrate concentration and relative maturity in wheat using SNP and DArT marker arrays

    USDA-ARS's Scientific Manuscript database

    Improving water-use efficiency by incorporating drought avoidance traits into new wheat varieties is an important objective for wheat breeding in water-limited environments. This study uses genome wide association studies (GWAS) to identify candidate loci for water-soluble carbohydrate accumulation,...

  17. Genome-wide association study identifies candidate loci underlying seven agronomic traits in Middle American diversity panel in common bean (Phaseolus vulgaris L.)

    USDA-ARS's Scientific Manuscript database

    Common bean (Phaseolus vulgaris L.) breeding programs aim to improve both agronomic and seed characteristic traits. However, the genetic architecture of the many traits that affect common bean production is not completely understood. Genome-wide association studies (GWAS) provide an experimental ap...

  18. Genome-wide association study of insect bite hypersensitivity in two horse populations in the Netherlands

    PubMed Central

    2012-01-01

    Background Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Methods Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case–control design. Cases and controls were matched on various factors (e.g. region, sire) to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. Results The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb) were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. Conclusions The genome-wide association study identified several

  19. Genome-wide association study of insect bite hypersensitivity in two horse populations in the Netherlands.

    PubMed

    Schurink, Anouk; Wolc, Anna; Ducro, Bart J; Frankena, Klaas; Garrick, Dorian J; Dekkers, Jack C M; van Arendonk, Johan A M

    2012-10-30

    Insect bite hypersensitivity is a common allergic disease in horse populations worldwide. Insect bite hypersensitivity is affected by both environmental and genetic factors. However, little is known about genes contributing to the genetic variance associated with insect bite hypersensitivity. Therefore, the aim of our study was to identify and quantify genomic associations with insect bite hypersensitivity in Shetland pony mares and Icelandic horses in the Netherlands. Data on 200 Shetland pony mares and 146 Icelandic horses were collected according to a matched case-control design. Cases and controls were matched on various factors (e.g. region, sire) to minimize effects of population stratification. Breed-specific genome-wide association studies were performed using 70 k single nucleotide polymorphisms genotypes. Bayesian variable selection method Bayes-C with a threshold model implemented in GenSel software was applied. A 1 Mb non-overlapping window approach that accumulated contributions of adjacent single nucleotide polymorphisms was used to identify associated genomic regions. The percentage of variance explained by all single nucleotide polymorphisms was 13% in Shetland pony mares and 28% in Icelandic horses. The 20 non-overlapping windows explaining the largest percentages of genetic variance were found on nine chromosomes in Shetland pony mares and on 14 chromosomes in Icelandic horses. Overlap in identified associated genomic regions between breeds would suggest interesting candidate regions to follow-up on. Such regions common to both breeds (within 15 Mb) were found on chromosomes 3, 7, 11, 20 and 23. Positional candidate genes within 2 Mb from the associated windows were identified on chromosome 20 in both breeds. Candidate genes are within the equine lymphocyte antigen class II region, which evokes an immune response by recognizing many foreign molecules. The genome-wide association study identified several genomic regions associated with insect bite
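
    The 1 Mb non-overlapping window approach described in this record can be illustrated with a rough sketch. It is not the GenSel computation: it simply assumes that each SNP already carries a per-SNP variance contribution and adds those contributions within fixed windows. The function name, the input layout and the toy values are hypothetical.

```python
from collections import defaultdict

def window_variance_share(snps, window_bp=1_000_000):
    """Sum per-SNP variance contributions in non-overlapping windows.

    snps: iterable of (chromosome, position_bp, variance_contribution).
    Returns {(chromosome, window_index): percent of total variance}.
    Assumption: contributions are additive across SNPs, which is a
    simplification of what a Bayesian variable selection model reports.
    """
    totals = defaultdict(float)
    for chrom, pos, var in snps:
        # Integer division assigns each SNP to exactly one window.
        totals[(chrom, pos // window_bp)] += var
    grand_total = sum(totals.values())
    if grand_total <= 0:
        raise ValueError("total variance must be positive")
    return {key: 100.0 * value / grand_total for key, value in totals.items()}

# Toy input: three SNPs on chromosome 20 (values are made up).
toy_snps = [("20", 100_000, 0.2), ("20", 900_000, 0.3), ("20", 1_500_000, 0.5)]
shares = window_variance_share(toy_snps)
```

    Ranking the returned windows by percentage would give the kind of "top 20 windows" list the abstract refers to, under the additive assumption stated above.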

  20. Comparison of selective genotyping strategies for prediction of breeding values in a population undergoing selection.

    PubMed

    Boligon, A A; Long, N; Albuquerque, L G; Weigel, K A; Gianola, D; Rosa, G J M

    2012-12-01

    Genomewide marker information can improve the reliability of breeding value predictions for young selection candidates in genomic selection. However, the cost of genotyping limits its use to elite animals, and how such selective genotyping affects predictive ability of genomic selection models is an open question. We performed a simulation study to evaluate the quality of breeding value predictions for selection candidates based on different selective genotyping strategies in a population undergoing selection. The genome consisted of 10 chromosomes of 100 cM each. After 5,000 generations of random mating with a population size of 100 (50 males and 50 females), generation G(0) (reference population) was produced via a full factorial mating between the 50 males and 50 females from generation 5,000. Different levels of selection intensities (animals with the largest yield deviation value) in G(0) or random sampling (no selection) were used to produce offspring of G(0) generation (G(1)). Five genotyping strategies were used to choose 500 animals in G(0) to be genotyped: 1) Random: randomly selected animals, 2) Top: animals with largest yield deviation values, 3) Bottom: animals with lowest yield deviation values, 4) Extreme: animals with the 250 largest and the 250 lowest yield deviation values, and 5) Less Related: less genetically related animals. The number of individuals in G(0) and G(1) was fixed at 2,500 each, and different levels of heritability were considered (0.10, 0.25, and 0.50). Additionally, all 5 selective genotyping strategies (Random, Top, Bottom, Extreme, and Less Related) were applied to an indicator trait in generation G(0), and the results were evaluated for the target trait in generation G(1), with the genetic correlation between the 2 traits set to 0.50. The 5 genotyping strategies applied to individuals in G(0) (reference population) were compared in terms of their ability to predict the genetic values of the animals in G(1) (selection
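
    Four of the five genotyping strategies named in this record depend only on yield deviation values, so they can be sketched in a few lines. This is an illustrative sketch, not the authors' simulation code; the function name and the toy values are hypothetical, and the Less Related strategy is left out because it would need relationship information that the abstract does not detail.

```python
import random

def choose_genotyped(yield_deviation, n, strategy, seed=0):
    """Return indices of the n animals to genotype under one strategy."""
    size = len(yield_deviation)
    if not 0 < n <= size:
        raise ValueError("n must be between 1 and the number of animals")
    # Animal indices sorted from lowest to highest yield deviation.
    order = sorted(range(size), key=lambda i: yield_deviation[i])
    if strategy == "Random":
        return random.Random(seed).sample(range(size), n)
    if strategy == "Top":
        return order[-n:]
    if strategy == "Bottom":
        return order[:n]
    if strategy == "Extreme":
        # Half from the low tail; the rest (at least one) from the high tail.
        low = n // 2
        return order[:low] + order[-(n - low):]
    raise ValueError(f"unsupported strategy: {strategy}")

# Toy yield deviations for six animals (made-up values).
yd = [0.5, -1.0, 2.0, 0.0, 1.5, -2.0]
top = choose_genotyped(yd, 2, "Top")
bottom = choose_genotyped(yd, 2, "Bottom")
extreme = choose_genotyped(yd, 4, "Extreme")
```

    In the study's setting the same call would be made with n = 500 on the 2,500 animals of G(0).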

  1. Genome-wide association studies identified multiple genetic loci for body size at four growth stages in Chinese Holstein cattle

    PubMed Central

    Zhang, Xu; Chu, Qin; Guo, Gang; Dong, Ganghui; Li, Xizhi; Zhang, Qin; Zhang, Shengli; Zhang, Zhiwu

    2017-01-01

    The growth and maturity of cattle body size affect not only feed efficiency, but also productivity and longevity. Dissecting the genetic architecture of body size is critical for cattle breeding to improve both efficiency and productivity. The volume and weight of body size are indicated by several measurements. Among them, Heart Girth (HG) and Hip Height (HH) are the most important traits. They are widely used as predictors of body weight (BW). Few association studies have been conducted for HG and HH in cattle, and these have focused on a single growth stage. In this study, we extended genome-wide association studies to a full spectrum of four growth stages (6-, 12-, 18-, and 24-months after birth) in Chinese Holstein heifers. The whole genomic single nucleotide polymorphisms (SNPs) were obtained from the Illumina BovineSNP50 v2 BeadChip genotyped on 3,325 individuals. Estimated breeding values (EBVs) were derived for both HG and HH at the four different ages and analyzed separately for GWAS by using the Fixed and random model Circuitous Probability Unification (FarmCPU) method. In total, 27 SNPs were identified to be significantly associated with HG and HH at different growth stages. We found 66 candidate genes located nearby the associated SNPs, including nine genes that were known as highly related to development and skeletal and muscular growth. In addition, biological function analysis was performed by Ingenuity Pathway Analysis and an interaction network related to development was obtained, which contained 16 genes out of the 66 candidates. The set of putative genes provided valuable resources and can help elucidate the genomic architecture and mechanisms underlying growth traits in dairy cattle. PMID:28426785

  2. Genome-Wide Linkage and Association Analysis Identifies Major Gene Loci for Guttural Pouch Tympany in Arabian and German Warmblood Horses

    PubMed Central

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553

  3. Genome-wide linkage and association analysis identifies major gene loci for guttural pouch tympany in Arabian and German warmblood horses.

    PubMed

    Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar

    2012-01-01

    Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16-26 Mb and 34-55 Mb and for Arabian on ECA15 at 64-65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT.

  4. A genome-wide association study reveals a QTL influencing caudal supernumerary teats in Holstein cattle.

    PubMed

    Joerg, H; Meili, C; Ruprecht, O; Bangerter, E; Burren, A; Bigler, A

    2014-12-01

    Supernumerary teats represent a common abnormality of the bovine udder. A genome-wide association study was performed based on the proportion of the occurrence of supernumerary teats in the daughters of 1097 Holstein bulls. The heritability of caudal supernumerary teats without mammary gland in this study was 0.604. The largest proportion of the heritability was attributable to BTA 20. The strongest evidence for association was with five SNPs on chromosome 20, referred to as a QTL. The mode of inheritance at this QTL was dominant. These findings reveal that the occurrence of caudal supernumerary teats without mammary gland in Holstein cattle is influenced by a QTL on chromosome 20 and a polygenic part. The data support the high potential of the SNPs in the QTL region as markers for breeding against caudal supernumerary teats. © 2014 Stichting International Foundation for Animal Genetics.

  5. Genome-wide Association Study Identifies Loci for the Polled Phenotype in Yak

    PubMed Central

    Wu, Xiaoyun; Wang, Kun; Ding, Xuezhi; Wang, Mingcheng; Chu, Min; Xie, Xiuyue; Qiu, Qiang; Yan, Ping

    2016-01-01

    The absence of horns, known as the polled phenotype, is an economically important trait in modern yak husbandry, but the genomic structure and genetic basis of this phenotype have yet to be discovered. Here, we conducted a genome-wide association study with a panel of 10 horned and 10 polled yaks using whole genome sequencing. We mapped the POLLED locus to a 200-kb interval, which comprises three protein-coding genes. Further characterization of the candidate region showed recent artificial selection signals resulting from the breeding process. We suggest that expressional variations rather than structural variations in protein probably contribute to the polled phenotype. Our results not only represent the first and important step in establishing the genomic structure of the polled region in yak, but also add to our understanding of the polled trait in bovid species. PMID:27389700

  6. Persistence of accuracy of genomic estimated breeding values over generations in layer chickens.

    PubMed

    Wolc, Anna; Arango, Jesus; Settar, Petek; Fulton, Janet E; O'Sullivan, Neil P; Preisinger, Rudolf; Habier, David; Fernando, Rohan; Garrick, Dorian J; Dekkers, Jack C M

    2011-06-21

    The predictive ability of genomic estimated breeding values (GEBV) originates both from associations between high-density markers and QTL (Quantitative Trait Loci) and from pedigree information. Thus, GEBV are expected to provide more persistent accuracy over successive generations than breeding values estimated using pedigree-based methods. The objective of this study was to evaluate the accuracy of GEBV in a closed population of layer chickens and to quantify their persistence over five successive generations using marker or pedigree information. The training data consisted of 16 traits and 777 genotyped animals from two generations of a brown-egg layer breeding line, 295 of which had individual phenotype records, while others had phenotypes on 2,738 non-genotyped relatives, or similar data accumulated over up to five generations. Validation data included phenotyped and genotyped birds from five subsequent generations (on average 306 birds/generation). Birds were genotyped for 23,356 segregating SNP. Animal models using genomic or pedigree relationship matrices and Bayesian model averaging methods were used for training analyses. Accuracy was evaluated as the correlation between EBV and phenotype in validation divided by the square root of trait heritability. Pedigree relationships in outbred populations are reduced by 50% at each meiosis, therefore accuracy is expected to decrease by the square root of 0.5 every generation, as observed for pedigree-based EBV (Estimated Breeding Values). In contrast the GEBV accuracy was more persistent, although the drop in accuracy was substantial in the first generation. Traits that were considered to be influenced by fewer QTL and to have a higher heritability maintained a higher GEBV accuracy over generations. In conclusion, GEBV capture information beyond pedigree relationships, but retraining every generation is recommended for genomic selection in closed breeding populations.
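
    The two quantities this record relies on, accuracy as the EBV-phenotype correlation divided by the square root of heritability and the expected square-root-of-0.5 decay of pedigree-based accuracy per generation, can be written out as a small sketch. The function names and toy numbers are hypothetical and only illustrate the arithmetic.

```python
import math

def pearson(x, y):
    """Plain Pearson correlation of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def validation_accuracy(ebv, phenotype, heritability):
    """Correlation between EBV and phenotype, scaled by sqrt(h^2)."""
    return pearson(ebv, phenotype) / math.sqrt(heritability)

def expected_pedigree_accuracy(initial_accuracy, generations):
    """Accuracy after repeated meioses, each scaling it by sqrt(0.5)."""
    return initial_accuracy * math.sqrt(0.5) ** generations

# Starting from a hypothetical accuracy of 0.6, two generations later:
two_generations_on = expected_pedigree_accuracy(0.6, 2)
```

    With these definitions, a pedigree-based accuracy of 0.6 would be expected to fall to 0.3 after two generations, which is the kind of decline the GEBV results are being compared against.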

  7. The importance of information on relatives for the prediction of genomic breeding values and the implications for the makeup of reference data sets in livestock breeding schemes.

    PubMed

    Clark, Samuel A; Hickey, John M; Daetwyler, Hans D; van der Werf, Julius H J

    2012-02-09

    The theory of genomic selection is based on the prediction of the effects of genetic markers in linkage disequilibrium with quantitative trait loci. However, genomic selection also relies on relationships between individuals to accurately predict genetic value. This study aimed to examine the importance of information on relatives versus that of unrelated or more distantly related individuals on the estimation of genomic breeding values. Simulated and real data were used to examine the effects of various degrees of relationship on the accuracy of genomic selection. Genomic Best Linear Unbiased Prediction (gBLUP) was compared to two pedigree based BLUP methods, one with a shallow one generation pedigree and the other with a deep ten generation pedigree. The accuracy of estimated breeding values for different groups of selection candidates that had varying degrees of relationships to a reference data set of 1750 animals was investigated. The gBLUP method predicted breeding values more accurately than BLUP. The most accurate breeding values were estimated using gBLUP for closely related animals. Similarly, the pedigree based BLUP methods were also accurate for closely related animals, however when the pedigree based BLUP methods were used to predict unrelated animals, the accuracy was close to zero. In contrast, gBLUP breeding values, for animals that had no pedigree relationship with animals in the reference data set, allowed substantial accuracy. An animal's relationship to the reference data set is an important factor for the accuracy of genomic predictions. Animals that share a close relationship to the reference data set had the highest accuracy from genomic predictions. However a baseline accuracy that is driven by the reference data set size and the overall population effective population size enables gBLUP to estimate a breeding value for unrelated animals within a population (breed), using information previously ignored by pedigree based BLUP methods.

  8. Improved Heritability Estimation from Genome-wide SNPs

    PubMed Central

    Speed, Doug; Hemani, Gibran; Johnson, Michael R.; Balding, David J.

    2012-01-01

    Estimation of narrow-sense heritability, h², from genome-wide SNPs genotyped in unrelated individuals has recently attracted interest and offers several advantages over traditional pedigree-based methods. With the use of this approach, it has been estimated that over half the heritability of human height can be attributed to the ∼300,000 SNPs on a genome-wide genotyping array. In comparison, only 5%–10% can be explained by SNPs reaching genome-wide significance. We investigated via simulation the validity of several key assumptions underpinning the mixed-model analysis used in SNP-based h² estimation. Although we found that the method is reasonably robust to violations of four key assumptions, it can be highly sensitive to uneven linkage disequilibrium (LD) between SNPs: contributions to h² are overestimated from causal variants in regions of high LD and are underestimated in regions of low LD. The overall direction of the bias can be up or down depending on the genetic architecture of the trait, but it can be substantial in realistic scenarios. We propose a modified kinship matrix in which SNPs are weighted according to local LD. We show that this correction greatly reduces the bias and increases the precision of h² estimates. We demonstrate the impact of our method on the first seven diseases studied by the Wellcome Trust Case Control Consortium. Our LD adjustment revises downward the h² estimate for immune-related diseases, as expected because of high LD in the major-histocompatibility region, but increases it for some nonimmune diseases. To calculate our revised kinship matrix, we developed LDAK, software for computing LD-adjusted kinships. PMID:23217325
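
    The idea of a kinship matrix in which SNPs are weighted can be illustrated with a minimal sketch. It assumes genotypes have already been standardized per SNP and that the weights are simply given. How LDAK derives weights from local LD is not described in the abstract and is not attempted here. The function name and toy data are hypothetical.

```python
def weighted_kinship(genotypes, weights):
    """Weighted kinship: K[i][j] = sum_k w_k * x_ik * x_jk / sum_k w_k.

    genotypes: one row per individual, one standardized value per SNP.
    weights:   one non-negative weight per SNP (assumed to be supplied).
    """
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    n = len(genotypes)
    return [
        [
            sum(w * a * b for w, a, b in zip(weights, genotypes[i], genotypes[j])) / total
            for j in range(n)
        ]
        for i in range(n)
    ]

# Two individuals, two SNPs (made-up standardized genotypes).
x = [[1.0, 1.0], [-1.0, 1.0]]
k_equal = weighted_kinship(x, [1.0, 1.0])       # every SNP counts equally
k_first_only = weighted_kinship(x, [1.0, 0.0])  # second SNP down-weighted to zero
```

    Setting all weights to 1 recovers the unweighted form; lowering the weight of a SNP reduces its share of the kinship estimate, which is the lever the LD adjustment uses.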

  9. Genome-Wide Approaches to Drosophila Heart Development

    PubMed Central

    Frasch, Manfred

    2016-01-01

    The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102

  10. Voxelwise genome-wide association study (vGWAS)

    PubMed Central

    Stein, Jason L.; Hua, Xue; Lee, Suh; Ho, April J.; Leow, Alex D.; Toga, Arthur W.; Saykin, Andrew J.; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J.; Craig, David W.; Gerber, Jill D.; Allen, April N.; Corneveaux, Jason J.; DeChairo, Bryan M.; Potkin, Steven G.; Weiner, Michael W.; Thompson, Paul M.

    2010-01-01

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age±s.d.: 75.52±6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure. PMID:20171287
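
    The data-reduction step mentioned in this record, keeping only the most associated variant at each voxel, amounts to taking the smallest p-value per voxel. The sketch below shows only that step and makes no attempt at the study's significance assessment. The function name, voxel labels and SNP identifiers are hypothetical.

```python
def top_variant_per_voxel(p_values):
    """For each voxel, return (snp_id, p) of the smallest p-value.

    p_values: {voxel_id: {snp_id: p_value}}
    """
    return {
        voxel: min(snps.items(), key=lambda item: item[1])
        for voxel, snps in p_values.items()
    }

# Toy input with two voxels and two SNPs (made-up p-values).
toy = {
    "voxel_1": {"snp_a": 0.2, "snp_b": 1e-6},
    "voxel_2": {"snp_a": 0.03, "snp_b": 0.5},
}
best = top_variant_per_voxel(toy)
```

    Because the retained value is a minimum over many tests, it cannot be read as an ordinary p-value and needs its own correction, which is the multiple comparisons problem the abstract says the method addresses.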

  11. Genome-wide association study of conduct disorder symptomatology

    PubMed Central

    Dick, DM; Aliev, F; Krueger, RF; Edwards, A; Agrawal, A; Lynskey, M; Lin, P; Schuckit, M; Hesselbrock, V; Nurnberger, J; Almasy, L; Porjesz, B; Edenberg, HJ; Bucholz, K; Kramer, J; Kuperman, S; Bierut, L

    2013-01-01

    Conduct disorder (CD) is one of the most prevalent childhood psychiatric conditions, and is associated with a number of serious concomitant and future problems. CD symptomatology is known to have a considerable genetic component, with heritability estimates in the range of 50%. Despite this, there is a relative paucity of studies aimed at identifying genes involved in the susceptibility to CD. In this study, we report results from a genome-wide association study of CD symptoms. CD symptoms were retrospectively reported by a psychiatric interview among a sample of cases and controls, in which cases met the criteria for alcohol dependence. Our primary phenotype was the natural log transformation of the number of CD symptoms that were endorsed, with data available for 3963 individuals who were genotyped on the Illumina Human 1M beadchip array. Secondary analyses are presented for case versus control status, in which caseness was established as endorsing three or more CD symptoms (N = 872 with CD and N = 3091 without CD). We find four markers that meet the criteria for genome-wide significance (P < 5 × 10⁻⁸) with the CD symptom count, two of which are located in the gene C1QTNF7 (C1q and tumor necrosis factor-related protein 7). There were six additional SNPs in the gene that yielded converging evidence of association. These data provide the first evidence of a specific gene that is associated with CD symptomatology. None of the top signals resided in traditional candidate genes, underscoring the importance of a genome-wide approach for identifying novel variants involved in this serious childhood disorder. PMID:20585324

  12. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    PubMed

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  13. Genome-wide approaches to defining macrophage identity and function

    PubMed Central

    Fonseca, Gregory J; Seidman, Jason S; Glass, Christopher K

    2016-01-01

    Macrophages play essential roles in the response to injury and infection and contribute to the development and/or homeostasis of the various tissues they reside in. Conversely, macrophages also influence the pathogenesis of metabolic, neurodegenerative, and neoplastic diseases. Mechanisms that contribute to the phenotypic diversity of macrophages in health and disease remain poorly understood. Here we review the recent application of genome-wide approaches to characterize the transcriptomes and epigenetic landscapes of tissue-resident macrophages. These studies are beginning to provide insights into how distinct tissue environments are interpreted by transcriptional regulatory elements to drive specialized programs of gene expression. PMID:28087927

  14. Genome-wide approaches to understanding behaviour in Drosophila melanogaster.

    PubMed

    Neville, Megan; Goodwin, Stephen F

    2012-09-01

    Understanding how an organism exhibits specific behaviours remains a major and important biological question. Studying behaviour in a simple model organism like the fruit fly Drosophila melanogaster has the advantages of advanced molecular genetics approaches along with well-defined anatomy and physiology. With advancements in functional genomic technologies, researchers are now attempting to uncover genes and pathways involved in complex behaviours on a genome-wide scale. A systems-level network approach, which will include genomic approaches, to study behaviour will be key to understanding the regulation and modulation of behaviours and the importance of context in regulating them.

  15. Validating, augmenting and refining genome-wide association signals.

    PubMed

    Ioannidis, John P A; Thomas, Gilles; Daly, Mark J

    2009-05-01

    Studies using genome-wide platforms have yielded an unprecedented number of promising signals of association between genomic variants and human traits. This Review addresses the steps required to validate, augment and refine such signals to identify underlying causal variants for well-defined phenotypes. These steps include: large-scale exact replication across both similar and diverse populations; fine mapping and resequencing; determination of the most informative markers and multiple independent informative loci; incorporation of functional information; and improved phenotype mapping of the implicated genetic effects. Even in cases for which replication proves that an effect exists, confident localization of the causal variant often remains elusive.

  16. Genome-wide association studies and contribution to cardiovascular physiology

    PubMed Central

    Munroe, Patricia B.

    2015-01-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology. PMID:26106147

  17. [Genome-wide association study for adolescent idiopathic scoliosis].

    PubMed

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis (AIS) is a polygenic disease. Genome-wide association studies (GWASs) have been performed for many polygenic diseases. For AIS, we conducted a GWAS and identified the first AIS locus near LBX1. After the discovery, we extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.

  18. [New insight of genome-wide association study (GWAS)].

    PubMed

    Hotta, Kikuko

    2013-02-01

    The number of obese patients is increasing in Japan, due to the westernization of lifestyle. Obesity, especially visceral fat obesity, is important for the development of metabolic syndrome. Genetic factors, as well as environmental factors, are important for the development of obesity. The importance of genetic factors in fat distribution has also been reported. Recent genome-wide association studies (GWASs) have revealed polymorphisms related to obesity and fat distribution. GWASs should lead to a better understanding of the molecular mechanisms underlying the regulation of obesity and body fat distribution.

  19. Evaluation of multiple approaches to identify genome-wide polymorphisms in closely related genotypes of sweet cherry (Prunus avium L.).

    PubMed

    Hewitt, Seanna; Kilian, Benjamin; Hari, Ramyya; Koepke, Tyson; Sharpe, Richard; Dhingra, Amit

    2017-01-01

    Identification of genetic polymorphisms and subsequent development of molecular markers is important for marker assisted breeding of superior cultivars of economically important species. Sweet cherry (Prunus avium L.) is an economically important non-climacteric tree fruit crop in the Rosaceae family and has undergone a genetic bottleneck due to breeding, resulting in limited genetic diversity in the germplasm that is utilized for breeding new cultivars. Therefore, it is critical to recognize the best platforms for identifying genome-wide polymorphisms that can help identify, and consequently preserve, the diversity in a genetically constrained species. For the identification of polymorphisms in five closely related genotypes of sweet cherry, a gel-based approach (TRAP), reduced representation sequencing (TRAPseq), a 6k cherry SNParray, and whole genome sequencing (WGS) approaches were evaluated in the identification of genome-wide polymorphisms in sweet cherry cultivars. All platforms facilitated detection of polymorphisms among the genotypes with variable efficiency. In assessing multiple SNP detection platforms, this study has demonstrated that a combination of appropriate approaches is necessary for efficient polymorphism identification, especially between closely related cultivars of a species. The information generated in this study provides a valuable resource for future genetic and genomic studies in sweet cherry, and the insights gained from the evaluation of multiple approaches can be utilized for other closely related species with limited genetic diversity in the breeding germplasm.

  20. Genome-wide patterns of genetic variation in sweet and grain sorghum (Sorghum bicolor)

    PubMed Central

    2011-01-01

    Background: Sorghum (Sorghum bicolor) is globally produced as a source of food, feed, fiber and fuel. Grain and sweet sorghums differ in a number of important traits, including stem sugar and juice accumulation, plant height as well as grain and biomass production. The first whole genome sequence of a grain sorghum is available, but additional genome sequences are required to study genome-wide and intraspecific variation for dissecting the genetic basis of these important traits and for tailor-designed breeding of this important C4 crop. Results: We resequenced two sweet and one grain sorghum inbred lines, and identified a set of nearly 1,500 genes differentiating sweet and grain sorghum. These genes fall into ten major metabolic pathways involved in sugar and starch metabolisms, lignin and coumarin biosynthesis, nucleic acid metabolism, stress responses and DNA damage repair. In addition, we uncovered 1,057,018 SNPs, 99,948 indels of 1 to 10 bp in length and 16,487 presence/absence variations as well as 17,111 copy number variations. The majority of the large-effect SNPs, indels and presence/absence variations resided in the genes containing leucine rich repeats, PPR repeats and disease resistance R genes possessing diverse biological functions or under diversifying selection, but were absent in genes that are essential for life. Conclusions: This is a first report of the identification of genome-wide patterns of genetic variation in sorghum. High-density SNP and indel markers reported here will be a valuable resource for future gene-phenotype studies and the molecular breeding of this important crop and related species. PMID:22104744

  1. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication

    PubMed Central

    vonHoldt, Bridgett M.; Pollinger, John P.; Lohmueller, Kirk E.; Han, Eunjung; Parker, Heidi G.; Quignon, Pascale; Degenhardt, Jeremiah D.; Boyko, Adam R.; Earl, Dent A.; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C.; Mosher, Dana S.; Spady, Tyrone C.; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G.; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-ping; Bustamante, Carlos D.; Ostrander, Elaine A.; Novembre, John; Wayne, Robert K.

    2010-01-01

    Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity. PMID:20237475

  2. Genome-wide SNP and haplotype analyses reveal a rich history underlying dog domestication.

    PubMed

    Vonholdt, Bridgett M; Pollinger, John P; Lohmueller, Kirk E; Han, Eunjung; Parker, Heidi G; Quignon, Pascale; Degenhardt, Jeremiah D; Boyko, Adam R; Earl, Dent A; Auton, Adam; Reynolds, Andy; Bryc, Kasia; Brisbin, Abra; Knowles, James C; Mosher, Dana S; Spady, Tyrone C; Elkahloun, Abdel; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Greco, Claudia; Randi, Ettore; Bannasch, Danika; Wilton, Alan; Shearman, Jeremy; Musiani, Marco; Cargill, Michelle; Jones, Paul G; Qian, Zuwei; Huang, Wei; Ding, Zhao-Li; Zhang, Ya-Ping; Bustamante, Carlos D; Ostrander, Elaine A; Novembre, John; Wayne, Robert K

    2010-04-08

    Advances in genome technology have facilitated a new understanding of the historical and genetic processes crucial to rapid phenotypic evolution under domestication. To understand the process of dog diversification better, we conducted an extensive genome-wide survey of more than 48,000 single nucleotide polymorphisms in dogs and their wild progenitor, the grey wolf. Here we show that dog breeds share a higher proportion of multi-locus haplotypes unique to grey wolves from the Middle East, indicating that they are a dominant source of genetic diversity for dogs rather than wolves from east Asia, as suggested by mitochondrial DNA sequence data. Furthermore, we find a surprising correspondence between genetic and phenotypic/functional breed groupings but there are exceptions that suggest phenotypic diversification depended in part on the repeated crossing of individuals with novel phenotypes. Our results show that Middle Eastern wolves were a critical source of genome diversity, although interbreeding with local wolf populations clearly occurred elsewhere in the early history of specific lineages. More recently, the evolution of modern dog breeds seems to have been an iterative process that drew on a limited genetic toolkit to create remarkable phenotypic diversity.

  3. Genome-wide analysis reveals adaptation to high altitudes in Tibetan sheep

    PubMed Central

    Wei, Caihong; Wang, Huihua; Liu, Gang; Zhao, Fuping; Kijas, James W.; Ma, Youji; Lu, Jian; Zhang, Li; Cao, Jiaxue; Wu, Mingming; Wang, Guangkai; Liu, Ruizao; Liu, Zhen; Zhang, Shuzhen; Liu, Chousheng; Du, Lixin

    2016-01-01

    Tibetan sheep have lived on the Tibetan Plateau for thousands of years; however, the process and consequences of adaptation to this extreme environment have not been elucidated for important livestock such as sheep. Here, seven sheep breeds, representing both highland and lowland breeds from different areas of China, were genotyped for a genome-wide collection of single-nucleotide polymorphisms (SNPs). The FST and XP-EHH approaches were used to identify regions harbouring local positive selection between these highland and lowland breeds, and 236 genes were identified. We detected selection events spanning genes involved in angiogenesis, energy production and erythropoiesis. In particular, several candidate genes were associated with high-altitude hypoxia, including EPAS1, CRYAA, LONP1, NF1, DPP4, SOD1, PPARG and SOCS2. EPAS1 plays a crucial role in hypoxia adaption; therefore, we investigated the exon sequences of EPAS1 and identified 12 mutations. Analysis of the relationship between blood-related phenotypes and EPAS1 genotypes in additional highland sheep revealed that a homozygous mutation at a relatively conserved site in the EPAS1 3′ untranslated region was associated with increased mean corpuscular haemoglobin concentration and mean corpuscular volume. Taken together, our results provide evidence of the genetic diversity of highland sheep and indicate potential high-altitude hypoxia adaptation mechanisms, including the role of EPAS1 in adaptation. PMID:27230812

  4. Genome-wide association study in German patients with attention deficit/hyperactivity disorder.

    PubMed

    Hinney, Anke; Scherag, André; Jarick, Ivonne; Albayrak, Özgür; Pütter, Carolin; Pechlivanis, Sonali; Dauvermann, Maria R; Beck, Sebastian; Weber, Heike; Scherag, Susann; Nguyen, Trang T; Volckmar, Anna-Lena; Knoll, Nadja; Faraone, Stephen V; Neale, Benjamin M; Franke, Barbara; Cichon, Sven; Hoffmann, Per; Nöthen, Markus M; Schreiber, Stefan; Jöckel, Karl-Heinz; Wichmann, H-Erich; Freitag, Christine; Lempp, Thomas; Meyer, Jobst; Gilsbach, Susanne; Herpertz-Dahlmann, Beate; Sinzig, Judith; Lehmkuhl, Gerd; Renner, Tobias J; Warnke, Andreas; Romanos, Marcel; Lesch, Klaus-Peter; Reif, Andreas; Schimmelmann, Benno G; Hebebrand, Johannes

    2011-12-01

    The heritability of attention deficit hyperactivity disorder (ADHD) is approximately 0.8. Despite several larger scale attempts, genome-wide association studies (GWAS) have not led to the identification of significant results. We performed a GWAS based on 495 German young patients with ADHD (according to DSM-IV criteria; Human660W-Quadv1; Illumina, San Diego, CA) and on 1,300 population-based adult controls (HumanHap550v3; Illumina). Some genes neighboring the single nucleotide polymorphisms (SNPs) with the lowest P-values (best P-value: 8.38 × 10⁻⁷) have potential relevance for ADHD (e.g., glutamate receptor, metabotropic 5 gene, GRM5). After quality control, the 30 independent SNPs with the lowest P-values (P-values ≤ 7.57 × 10⁻⁵) were chosen for confirmation. Genotyping of these SNPs in up to 320 independent German families comprising at least one child with ADHD revealed directionally consistent effect-size point estimates for 19 (10 not consistent) of the SNPs. In silico analyses of the 30 SNPs in the largest meta-analysis so far (2,064 trios, 896 cases, and 2,455 controls) revealed directionally consistent effect-size point estimates for 16 SNPs (11 not consistent). None of the combined analyses revealed a genome-wide significant result. SNPs in previously described autosomal candidate genes did not show significantly lower P-values compared to SNPs within random sets of genes of the same size. We did not find genome-wide significant results in a GWAS of German children with ADHD compared to controls. The second best SNP is located in an intron of GRM5, a gene located within a recently described region with an infrequent copy number variation in patients with ADHD.

  5. Utilizing twins as controls for non-twin case-materials in genome wide association studies.

    PubMed

    Ganna, Andrea; Ortega-Alonso, Alfredo; Havulinna, Aki; Salomaa, Veikko; Kaprio, Jaakko; Pedersen, Nancy L; Sullivan, Patrick F; Ingelsson, Erik; Hultman, Christina M; Magnusson, Patrik K E

    2013-01-01

    Twin registries around the globe have collected DNA samples from large numbers of monozygotic and dizygotic twins. The twin sample collections are frequently used as controls in disease-specific studies together with non-twins. This approach is unbiased under the hypothesis that twins and singletons are comparable in terms of allele frequencies; i.e. there are no genetic variants associated with being a twin per se. To test this hypothesis we performed a genome-wide association study comparing the allele frequency of 572,352 single nucleotide polymorphisms (SNPs) in 1,413 monozygotic (MZ) and 5,451 dizygotic (DZ) twins with 3,720 healthy singletons. Twins and singletons have been genotyped using the same platform. SNPs showing association with being a twin at P-value < 1 × 10⁻⁵ were selected for replication analysis in 1,492 twins (463 MZ and 1,029 DZ) and 1,880 singletons from Finland. No SNPs reached genome-wide significance (P-value < 5 × 10⁻⁸) in the main analysis combining MZ and DZ twins. In a secondary analysis including only DZ twins two SNPs (rs2033541 close to ADAMTSL1 and rs4149283 close to ABCA1) were genome-wide significant after meta-analysis with the Finnish population. The estimated proportion of variance on the liability scale explained by all SNPs was 0.08 (P-value = 0.003) when MZ and DZ were considered together and smaller for MZ (0.06, P-value = 0.10) compared to DZ (0.09, P-value = 0.003) when analyzed separately. In conclusion, twins and singletons can be used in genetic studies together with general population samples without introducing large bias. Further research is needed to explore genetic variances associated with DZ twinning.

  6. Economic values of body weight, reproduction and parasite resistance traits for a Creole goat breeding goal.

    PubMed

    Gunia, M; Mandonnet, N; Arquet, R; Alexandre, G; Gourdine, J-L; Naves, M; Angeon, V; Phocas, F

    2013-01-01

    A specific breeding goal definition was developed for Creole goats in Guadeloupe. This local breed is used for meat production. To ensure a balanced selection outcome, the breeding objective included two production traits, live weight (BW11) and dressing percentage (DP) at 11 months (the mating or selling age), one reproduction trait, fertility (FER), and two traits to assess animal response to parasite infection: packed cell volume (PCV), a resilience trait, and faecal worm egg count (FEC), a resistance trait. A deterministic bio-economic model was developed to calculate the economic values based on the description of the profit of a Guadeloupean goat farm. The farm income came from the sale of animals for meat or as reproducers. The main costs were feeding and treatments against gastro-intestinal parasites. The economic values were 7.69€ per kg for BW11, 1.38€ per % for FER, 3.53€ per % for DP and 3 × 10⁻⁴€ per % for PCV. The economic value for FEC was derived by comparing the expected profit and average FEC in a normal situation and in an extreme situation where parasites had developed resistance to anthelmintics. This method yielded a maximum weighting for FEC, which was -18.85€ per log(eggs per gram). Alternative scenarios were tested to assess the robustness of the economic values to variations in the economic and environmental context. The economic values of PCV and DP were the most stable. Issues involved in paving the way for selective breeding on resistance or resilience to parasites are discussed.

  7. First WNK4-Hypokalemia Animal Model Identified by Genome-Wide Association in Burmese Cats

    PubMed Central

    Gandolfi, Barbara; Gruffydd-Jones, Timothy J.; Malik, Richard; Cortes, Alejandro; Jones, Boyd R.; Helps, Chris R.; Prinzenberg, Eva M.; Erhardt, George; Lyons, Leslie A.

    2012-01-01

    Burmese is an old and popular cat breed; however, several health concerns, such as hypokalemia and a craniofacial defect, are prevalent, endangering the general health of the breed. Hypokalemia, a subnormal serum potassium ion concentration ([K+]), most often occurs as a secondary problem but can occur as a primary problem, such as hypokalaemic periodic paralysis in humans, and as feline hypokalaemic periodic polymyopathy primarily in Burmese. The most characteristic clinical sign of hypokalemia in Burmese is a skeletal muscle weakness that is frequently episodic in nature, either generalized, or sometimes localized to the cervical and thoracic limb girdle muscles. Burmese hypokalemia is suspected to be a single locus autosomal recessive trait. A genome-wide case-control study using the Illumina Infinium Feline 63K iSelect DNA array was performed using 35 cases and 25 controls from the Burmese breed that identified a locus on chromosome E1 associated with hypokalemia. Within approximately 1.2 Mb of the highest associated SNP, two candidate genes were identified, KCNH4 and WNK4. Direct sequencing of the genes revealed a nonsense mutation, producing a premature stop codon within WNK4 (c.2899C>T), leading to a truncated protein that lacks the C-terminal coiled-coil domain and the highly conserved Akt1/SGK phosphorylation site. All cases were homozygous for the mutation. Although the exact mechanism causing hypokalemia has not been determined, extrapolation from the homologous human and mouse genes suggests the mechanism may involve a potassium-losing nephropathy. A genetic test to screen for the genetic defect within the active breeding population has been developed, which should lead to eradication of the mutation and improved general health within the breed. Moreover, the identified mutation may help clarify the role of the protein in K+ regulation and the cat represents the first animal model for WNK4-associated hypokalemia. PMID:23285264

  8. First WNK4-hypokalemia animal model identified by genome-wide association in Burmese cats.

    PubMed

    Gandolfi, Barbara; Gruffydd-Jones, Timothy J; Malik, Richard; Cortes, Alejandro; Jones, Boyd R; Helps, Chris R; Prinzenberg, Eva M; Erhardt, George; Lyons, Leslie A

    2012-01-01

    Burmese is an old and popular cat breed; however, several health concerns, such as hypokalemia and a craniofacial defect, are prevalent, endangering the general health of the breed. Hypokalemia, a subnormal serum potassium ion concentration ([K⁺]), most often occurs as a secondary problem but can occur as a primary problem, such as hypokalaemic periodic paralysis in humans, and as feline hypokalaemic periodic polymyopathy primarily in Burmese. The most characteristic clinical sign of hypokalemia in Burmese is a skeletal muscle weakness that is frequently episodic in nature, either generalized, or sometimes localized to the cervical and thoracic limb girdle muscles. Burmese hypokalemia is suspected to be a single locus autosomal recessive trait. A genome-wide case-control study using the Illumina Infinium Feline 63K iSelect DNA array was performed using 35 cases and 25 controls from the Burmese breed that identified a locus on chromosome E1 associated with hypokalemia. Within approximately 1.2 Mb of the highest associated SNP, two candidate genes were identified, KCNH4 and WNK4. Direct sequencing of the genes revealed a nonsense mutation, producing a premature stop codon within WNK4 (c.2899C>T), leading to a truncated protein that lacks the C-terminal coiled-coil domain and the highly conserved Akt1/SGK phosphorylation site. All cases were homozygous for the mutation. Although the exact mechanism causing hypokalemia has not been determined, extrapolation from the homologous human and mouse genes suggests the mechanism may involve a potassium-losing nephropathy. A genetic test to screen for the genetic defect within the active breeding population has been developed, which should lead to eradication of the mutation and improved general health within the breed. Moreover, the identified mutation may help clarify the role of the protein in K⁺ regulation and the cat represents the first animal model for WNK4-associated hypokalemia.

  9. Additive genetic breeding values correlate with the load of partially deleterious mutations.

    PubMed

    Tomkins, Joseph L; Penrose, Marissa A; Greeff, Johan; LeBas, Natasha R

    2010-05-14

    The mutation-selection-balance model predicts most additive genetic variation to arise from numerous mildly deleterious mutations of small effect. Correspondingly, "good genes" models of sexual selection and recent models for the evolution of sex are built on the assumption that mutational loads and breeding values for fitness-related traits are correlated. In support of this concept, inbreeding depression was negatively genetically correlated with breeding values for traits under natural and sexual selection in the weevil Callosobruchus maculatus. The correlations were stronger in males and strongest for condition. These results confirm the role of existing, partially recessive mutations in maintaining additive genetic variation in outbred populations, reveal the nature of good genes under sexual selection, and show how sexual selection can offset the cost of sex.

  10. A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism

    PubMed Central

    Loukola, Anu; Buchwald, Jadwiga; Gupta, Richa; Palviainen, Teemu; Hällfors, Jenni; Tikkanen, Emmi; Korhonen, Tellervo; Ollikainen, Miina; Sarin, Antti-Pekka; Ripatti, Samuli; Lehtimäki, Terho; Raitakari, Olli; Salomaa, Veikko; Rose, Richard J.; Tyndale, Rachel F.; Kaprio, Jaakko

    2015-01-01

    Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR), an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS) to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70–0.88) was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007), followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL) analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4), the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019) in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31%) in NMR in these study samples. Further, we provide evidence

  11. A Genome-Wide Association Study of a Biomarker of Nicotine Metabolism.

    PubMed

    Loukola, Anu; Buchwald, Jadwiga; Gupta, Richa; Palviainen, Teemu; Hällfors, Jenni; Tikkanen, Emmi; Korhonen, Tellervo; Ollikainen, Miina; Sarin, Antti-Pekka; Ripatti, Samuli; Lehtimäki, Terho; Raitakari, Olli; Salomaa, Veikko; Rose, Richard J; Tyndale, Rachel F; Kaprio, Jaakko

    2015-01-01

    Individuals with fast nicotine metabolism typically smoke more and thus have a greater risk for smoking-induced diseases. Further, the efficacy of smoking cessation pharmacotherapy is dependent on the rate of nicotine metabolism. Our objective was to use nicotine metabolite ratio (NMR), an established biomarker of nicotine metabolism rate, in a genome-wide association study (GWAS) to identify novel genetic variants influencing nicotine metabolism. A heritability estimate of 0.81 (95% CI 0.70-0.88) was obtained for NMR using monozygotic and dizygotic twins of the FinnTwin cohort. We performed a GWAS in cotinine-verified current smokers of three Finnish cohorts (FinnTwin, Young Finns Study, FINRISK2007), followed by a meta-analysis of 1518 subjects, and annotated the genome-wide significant SNPs with methylation quantitative loci (meQTL) analyses. We detected association on 19q13 with 719 SNPs exceeding genome-wide significance within a 4.2 Mb region. The strongest evidence for association emerged for CYP2A6 (min p = 5.77E-86, in intron 4), the main metabolic enzyme for nicotine. Other interesting genes with genome-wide significant signals included CYP2B6, CYP2A7, EGLN2, and NUMBL. Conditional analyses revealed three independent signals on 19q13, all located within or in the immediate vicinity of CYP2A6. A genetic risk score constructed using the independent signals showed association with smoking quantity (p = 0.0019) in two independent Finnish samples. Our meQTL results showed that methylation values of 16 CpG sites within the region are affected by genotypes of the genome-wide significant SNPs, and according to causal inference test, for some of the SNPs the effect on NMR is mediated through methylation. To our knowledge, this is the first GWAS on NMR. Our results enclose three independent novel signals on 19q13.2. The detected CYP2A6 variants explain a strikingly large fraction of variance (up to 31%) in NMR in these study samples. Further, we provide evidence

  12. Genome-wide association study of Tourette Syndrome

    PubMed Central

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p < 5 × 10^-8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p = 1.85 × 10^-6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p = 3.6 × 10^-7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  13. Genome-wide identification of hypoxia-induced enhancer regions

    PubMed Central

    Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.

    2015-01-01

    Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262

  14. A genome-wide association study for malignant mesothelioma risk.

    PubMed

    Cadby, Gemma; Mukherjee, Sutapa; Musk, A W Bill; Reid, Alison; Garlepp, Mike; Dick, Ian; Robinson, Cleo; Hui, Jennie; Fiorito, Giovanni; Guarrera, Simonetta; Beilby, John; Melton, Phillip E; Moses, Eric K; Ugolini, Donatella; Mirabelli, Dario; Bonassi, Stefano; Magnani, Corrado; Dianzani, Irma; Matullo, Giuseppe; Robinson, Bruce; Creaney, Jenette; Palmer, Lyle J

    2013-10-01

    Malignant mesothelioma (MM) is a uniformly fatal tumour of mesothelial cells. MM is caused by exposure to asbestos; however, most individuals with documented asbestos exposure do not develop MM. Although MM appears to aggregate within families, the genetics of MM susceptibility is a relatively unexplored area. The aim of the current study was to identify genetic factors that contribute to MM risk. A genome-wide association analysis of 2,508,203 single nucleotide polymorphisms (SNPs) from 428 MM cases and 1269 controls from Western Australia was performed. Additional genotyping was performed on a sample of 778 asbestos-exposed Western Australian controls. Replication of the most strongly associated SNPs was undertaken in an independent case-control study of 392 asbestos-exposed cases and 367 asbestos-exposed controls from Italy. No SNPs achieved formal genome-wide statistical significance in the Western Australian study. However, suggestive results for MM risk were identified in the SDK1, CRTAM and RASGRF2 genes, and in the 2p12 chromosomal region. These findings were not replicated in the Italian study, although there was some evidence of replication in the region of SDK1. These suggestive associations will be further investigated in sequencing and functional studies. Copyright © 2013. Published by Elsevier Ireland Ltd.

  15. A Genome-Wide Association Study of Aging

    PubMed Central

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W.; Garcia, Melissa E.; Kaplan, Robert C.; Kumari, Meena; Lunetta, Kathryn L.; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J.; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J.; Biffar, Reiner; Buchman, Aron S.; Boerwinkle, Eric; Couper, David; De Jager, Philip L.; Evans, Denis A.; Harris, Tamara B.; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P.; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J.; Lohman, Kurt K.; Lutsey, Pamela L.; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M.; Reiman, Eric M.; Rotter, Jerome I.; Seshadri, Sudha; Shardell, Michelle D.; Smith, Albert V.; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M. Carola; Bandinelli, Stefania; Baumeister, Sebastian E.; Bennett, David A.; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M.; Newman, Anne B.; Tiemeier, Henning; Franceschini, Nora

    2011-01-01

    Human longevity and healthy aging show moderate heritability (20–50%). We conducted a meta-analysis of genome-wide association studies from nine studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for two outcomes: a) all-cause mortality and b) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10^-8). We found fourteen independent SNPs that predicted risk of death, and eight SNPs that predicted event-free survival (p < 10^-5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer’s disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. PMID:21782286
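
    As a rough illustration of the two cutoffs quoted in this abstract: the conventional genome-wide level of 5 × 10^-8 corresponds to a Bonferroni correction of alpha = 0.05 for roughly one million independent tests, and 10^-5 is a commonly used "suggestive" cutoff. The sketch below is not taken from the study; the function name `classify_p_value` and the one-million-test figure are assumptions made for illustration only.

```python
ALPHA = 0.05
# Assumption: about one million independent common-variant tests,
# which is the usual rationale for the 5e-8 convention.
N_INDEPENDENT_TESTS = 1_000_000

GENOME_WIDE_THRESHOLD = ALPHA / N_INDEPENDENT_TESTS  # approximately 5e-8
SUGGESTIVE_THRESHOLD = 1e-5                          # cutoff quoted in the abstract


def classify_p_value(p: float) -> str:
    """Label a single-SNP association p-value against the two cutoffs."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("a p-value must lie in [0, 1]")
    if p < GENOME_WIDE_THRESHOLD:
        return "genome-wide significant"
    if p < SUGGESTIVE_THRESHOLD:
        return "suggestive"
    return "not significant"
```

    Under these cutoffs, a SNP with p = 2 × 10^-6 would be labelled "suggestive", which is the category the fourteen mortality SNPs and eight event-free-survival SNPs fall into.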

  16. Genome-wide patterns of Arabidopsis gene expression in nature.

    PubMed

    Richards, Christina L; Rosas, Ulises; Banta, Joshua; Bhambhra, Naeha; Purugganan, Michael D

    2012-01-01

    Organisms in the wild are subject to multiple, fluctuating environmental factors, and it is in complex natural environments that genetic regulatory networks actually function and evolve. We assessed genome-wide gene expression patterns in the wild in two natural accessions of the model plant Arabidopsis thaliana and examined the nature of transcriptional variation throughout its life cycle and gene expression correlations with natural environmental fluctuations. We grew plants in a natural field environment and measured genome-wide time-series gene expression from the plant shoot every three days, spanning the seedling to reproductive stages. We find that 15,352 genes were expressed in the A. thaliana shoot in the field, and accession and flowering status (vegetative versus flowering) were strong components of transcriptional variation in this plant. We identified between ∼110 and 190 time-varying gene expression clusters in the field, many of which were significantly overrepresented by genes regulated by abiotic and biotic environmental stresses. The two main principal components of vegetative shoot gene expression (PC(veg)) correlate to temperature and precipitation occurrence in the field. The largest PC(veg) axes included thermoregulatory genes while the second major PC(veg) was associated with precipitation and contained drought-responsive genes. By exposing A. thaliana to natural environments in an open field, we provide a framework for further understanding the genetic networks that are deployed in natural environments, and we connect plant molecular genetics in the laboratory to plant organismal ecology in the wild.

  17. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192

  18. Measuring genome-wide nucleosome turnover using CATCH-IT.

    PubMed

    Teves, Sheila S; Deal, Roger B; Henikoff, Steven

    2012-01-01

    The dynamic interplay between DNA-binding proteins and nucleosomes underlies essential nuclear processes such as transcription, replication, and DNA repair. Manifestations of this interplay include the assembly, eviction, and replacement of nucleosomes. Hence, measurements of nucleosome turnover kinetics can lead to insights into the regulation of dynamic chromatin processes. In this chapter, we describe a genome-wide method for measuring nucleosome turnover that uses metabolic labeling followed by capture of newly synthesized histones, which we have termed Covalent Attachment of Tagged Histones to Capture and Identify Turnover (CATCH-IT). Although CATCH-IT can be used with any genome-wide mapping procedure, high-resolution profiling is attainable using paired-end sequencing of native chromatin. Our protocol also includes an efficient Solexa DNA sequencing library preparation protocol that can be used for single base-pair resolution mapping of both nucleosome and subnucleosomal particles. We not only describe the use of these protocols in the context of a Drosophila cell line but also provide the necessary changes for adaptation to other model systems. Copyright © 2012 Elsevier Inc. All rights reserved.

  19. Genome-wide analysis of differential RNA editing in epilepsy.

    PubMed

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M; Petretto, Enrico; Johnson, Michael R

    2017-03-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine-temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including "neuron projection" and "seizures." Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures.

  20. Genome-wide mapping of DNA strand breaks.

    PubMed

    Leduc, Frédéric; Faucher, David; Bikond Nkoma, Geneviève; Grégoire, Marie-Chantal; Arguin, Mélina; Wellinger, Raymund J; Boissonneault, Guylain

    2011-02-25

    Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP), uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL) to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.

  1. A genome-wide association study of aging.

    PubMed

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10^-8). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10^-5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.

  2. Genome-wide scans for footprints of natural selection

    PubMed Central

    Oleksyk, Taras K.; Smith, Michael W.; O'Brien, Stephen J.

    2010-01-01

    Detecting recent selected ‘genomic footprints’ applies directly to the discovery of disease genes and in the imputation of the formative events that molded modern population genetic structure. The imprints of historic selection/adaptation episodes left in human and animal genomes allow one to interpret modern and ancestral gene origins and modifications. Current approaches to reveal selected regions applied in genome-wide selection scans (GWSSs) fall into eight principal categories: (I) phylogenetic footprinting, (II) detecting increased rates of functional mutations, (III) evaluating divergence versus polymorphism, (IV) detecting extended segments of linkage disequilibrium, (V) evaluating local reduction in genetic variation, (VI) detecting changes in the shape of the frequency distribution (spectrum) of genetic variation, (VII) assessing differentiating between populations (FST), and (VIII) detecting excess or decrease in admixture contribution from one population. Here, we review and compare these approaches using available human genome-wide datasets to provide independent verification (or not) of regions found by different methods and using different populations. The lessons learned from GWSSs will be applied to identify genome signatures of historic selective pressures on genes and gene regions in other species with emerging genome sequences. This would offer considerable potential for genome annotation in functional, developmental and evolutionary contexts. PMID:20008396
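
    Category (VII) in this abstract, differentiation between populations (FST), can be made concrete with a small worked example. The sketch below computes Wright's FST at a single biallelic locus as (H_T - H_S) / H_T for two populations. It assumes equal population weights and known allele frequencies; the function name `fst_biallelic` is hypothetical, and the estimators used in real genome-wide scans (which correct for sample size) are not described in the abstract.

```python
def fst_biallelic(p1: float, p2: float) -> float:
    """Wright's F_ST at one biallelic locus for two equally weighted populations.

    p1, p2: frequency of the same allele in population 1 and population 2.
    """
    for p in (p1, p2):
        if not 0.0 <= p <= 1.0:
            raise ValueError("allele frequencies must lie in [0, 1]")

    p_bar = (p1 + p2) / 2.0
    # Expected heterozygosity in the pooled (total) population.
    h_total = 2.0 * p_bar * (1.0 - p_bar)
    # Mean expected heterozygosity within the two subpopulations.
    h_within = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0

    if h_total == 0.0:
        # Locus is monomorphic overall, so there is no differentiation to measure.
        return 0.0
    return (h_total - h_within) / h_total
```

    Identical frequencies give FST = 0 and fixed opposite alleles give FST = 1; a selection scan of this kind would look for loci whose FST is unusually high relative to the genome-wide distribution.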

  3. Genome-Wide Mapping of DNA Methylation in Chicken

    PubMed Central

    Hu, Xiaoxiang; Li, Jinxiu; Du, Zhuo; Chen, Li; Yin, Guangliang; Duan, Jinjie; Zhang, Haichao; Zhao, Yaofeng; Wang, Jun; Li, Ning

    2011-01-01

    Cytosine DNA methylation is an important epigenetic modification, termed the fifth base, that functions in diverse processes. Until now, genome-wide DNA methylation maps of many organisms have been reported, such as human, Arabidopsis, rice and silkworm, but the methylation pattern of birds remains rarely studied. Here we show the genome-wide DNA methylation map of a bird, using the chicken as a model organism and an immunocapturing approach followed by high-throughput sequencing. In both the red jungle fowl and the avian broiler, DNA methylation was described separately for the liver and muscle tissue. Generally, chicken displays a methylation pattern analogous to that of animals and plants. DNA methylation is enriched in the gene body regions and the repetitive sequences, and depleted in the transcription start site (TSS) and the transcription termination site (TTS). Most of the CpG islands in the chicken genome are kept in an unmethylated state. Promoter methylation is negatively correlated with the gene expression level, indicating its suppressive role in regulating gene transcription. This work contributes to our understanding of epigenetics in birds. PMID:21573164

  4. Genome-wide nucleotide-level mammalian ancestor reconstruction.

    PubMed

    Paten, Benedict; Herrero, Javier; Fitzgerald, Stephen; Beal, Kathryn; Flicek, Paul; Holmes, Ian; Birney, Ewan

    2008-11-01

    Recently attention has been turned to the problem of reconstructing complete ancestral sequences from large multiple alignments. Successful generation of these genome-wide reconstructions will facilitate a greater knowledge of the events that have driven evolution. We present a new evolutionary alignment modeler, called "Ortheus," for inferring the evolutionary history of a multiple alignment, in terms of both substitutions and, importantly, insertions and deletions. Based on a multiple sequence probabilistic transducer model of the type proposed by Holmes, Ortheus uses efficient stochastic graph-based dynamic programming methods. Unlike other methods, Ortheus does not rely on a single fixed alignment from which to work. Ortheus is also more scaleable than previous methods while being fast, stable, and open source. Large-scale simulations show that Ortheus performs close to optimally on a deep mammalian phylogeny. Simulations also indicate that significant proportions of errors due to insertions and deletions can be avoided by not assuming a fixed alignment. We additionally use a challenging hold-out cross-validation procedure to test the method; using the reconstructions to predict extant sequence bases, we demonstrate significant improvements over using closest extant neighbor sequences. Accompanying this paper, a new, public, and genome-wide set of Ortheus ancestor alignments provide an intriguing new resource for evolutionary studies in mammals. As a first piece of analysis, we attempt to recover "fossilized" ancestral pseudogenes. We confidently find 31 cases in which the ancestral sequence had a more complete sequence than any of the extant sequences.

  5. Genome-wide analysis of differential RNA editing in epilepsy

    PubMed Central

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R.; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M.; Petretto, Enrico; Johnson, Michael R.

    2017-01-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine–temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including “neuron projection” and “seizures.” Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures. PMID:28250018

  6. Genome-wide association study for cheese yield and curd nutrient recovery in dairy cows.

    PubMed

    Dadousis, C; Biffani, S; Cipolat-Gotet, C; Nicolazzi, E L; Rosa, G J M; Gianola, D; Rossoni, A; Santus, E; Bittante, G; Cecchinato, A

    2017-02-01

    Cheese production and consumption are increasing in many countries worldwide. As a result, interest has increased in strategies for genetic selection of individuals for technological traits of milk related to cheese yield (CY) in dairy cattle breeding. However, little is known about the genetic background of a cow's ability to produce cheese. Recently, a relatively large panel (1,264 cows) of different measures of individual cow CY and milk nutrient and energy recoveries in the cheese (REC) became available. Genetic analyses showed considerable variation for CY and for aptitude to retain high proportions of fat, protein, and water in the coagulum. For the dairy industry, these characteristics are of major economic importance. Nevertheless, use of this knowledge in dairy breeding is hampered by high costs, intense labor requirement, and lack of appropriate technology. However, in the era of genomics, new possibilities are available for animal breeding and genetic improvement. For example, identification of genomic regions involved in cow CY might provide potential for marker-assisted selection. The objective of this study was to perform genome-wide association studies on different CY and REC measures. Milk and DNA samples from 1,152 Italian Brown Swiss cows were used. Three CY traits expressing the weight (wt) of fresh curd (%CYCURD), curd solids (%CYSOLIDS), and curd moisture (%CYWATER) as a percentage of weight of milk processed, and 4 REC (RECFAT, RECPROTEIN, RECSOLIDS, and RECENERGY, calculated as the % ratio between the nutrient in curd and the corresponding nutrient in processed milk) were analyzed. Animals were genotyped with the Illumina BovineSNP50 Bead Chip v.2. Single marker regressions were fitted using the GenABEL R package (genome-wide association using mixed model and regression-genomic control). In total, 103 significant associations (88 single nucleotide polymorphisms) were identified in 10 chromosomes (2, 6, 9, 11, 12, 14, 18, 19, 27, 28). For

  7. Assessment of the value of international genetic evaluations for yield in predicting domestic breeding values for foreign Holstein bulls.

    PubMed

    Nicolazzi, E L; Forabosco, F; Fikse, W F

    2011-05-01

    International genetic evaluations are a valuable source of information for decisions about the importation of (the semen of) foreign bulls. This study analyzed data from 6 countries (Australia, Canada, Italy, France, the Netherlands, and the United States) and compared international evaluations for production traits of foreign bulls (i.e., when no national daughter information was available) to their national breeding values in August 2009, which were based only on domestic daughters' data. A total of 821 bulls with highly reliable estimated breeding values (EBV) for milk, fat, and protein yield were analyzed. No evidence of systematic over- or underestimation was found in most of the countries analyzed. Observed correlations between national and international evaluations were close to 0.9 and, for most countries, generally close to their expected values (calculated from national and international EBV reliabilities). In Italy, however, higher differences between observed and expected correlations and significant mean differences between EBV for more than one trait were observed in bulls progeny-tested in the United States and in other European countries (with differences up to 33.1% of the genetic standard deviation). These results were probably induced by a relatively recent change in the model for national evaluation. The findings in this study reflect a conservative estimate of the real value of international evaluations, as changes in methodologies in either the national or the international evaluations decreased the ability of past international evaluations to predict current national evaluations. Nevertheless, our results indicate that international evaluations based on foreign information for Holstein bulls were reasonably accurate predictors of the future national breeding values based only upon domestic daughters.
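
    The abstract mentions expected correlations "calculated from national and international EBV reliabilities" without giving the formula. One common approximation, shown here only as a hedged sketch, is that if the two evaluations estimate the same true breeding value with independent prediction errors, the expected correlation between the two EBV is the square root of the product of their reliabilities, scaled by the genetic correlation between countries. The function name `expected_ebv_correlation` and the independence assumption are not taken from the study.

```python
import math


def expected_ebv_correlation(rel_national: float,
                             rel_international: float,
                             genetic_correlation: float = 1.0) -> float:
    """Approximate expected correlation between two sets of EBV.

    Assumes (hypothetically) independent prediction errors, so that
    corr(EBV_a, EBV_b) = r_g * sqrt(rel_a * rel_b).
    """
    for rel in (rel_national, rel_international):
        if not 0.0 <= rel <= 1.0:
            raise ValueError("reliabilities must lie in [0, 1]")
    return genetic_correlation * math.sqrt(rel_national * rel_international)
```

    For example, two evaluations that each have reliability 0.90 would give an expected correlation of 0.90 under these assumptions, in line with the "close to 0.9" observed correlations reported for highly reliable bulls.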

  8. Pedigree reconstruction with genome-wide markers in potato

    USDA-ARS's Scientific Manuscript database

    Reliable pedigree information facilitates a scientific approach to breeding, but errors can be introduced in many stages of a breeding program. Our objective was to use single nucleotide polymorphisms (SNPs) to check the pedigree records of elite North American potato germplasm. A population of 635 ...

  9. Genome-wide association studies and prediction of 17 traits related to phenology, biomass and cell wall composition in the energy grass Miscanthus sinensis.

    PubMed

    Slavov, Gancho T; Nipper, Rick; Robson, Paul; Farrar, Kerrie; Allison, Gordon G; Bosch, Maurice; Clifton-Brown, John C; Donnison, Iain S; Jensen, Elaine

    2014-03-01

    • Increasing demands for food and energy require a step change in the effectiveness, speed and flexibility of crop breeding. Therefore, the aim of this study was to assess the potential of genome-wide association studies (GWASs) and genomic selection (i.e. phenotype prediction from a genome-wide set of markers) to guide fundamental plant science and to accelerate breeding in the energy grass Miscanthus. • We generated over 100,000 single-nucleotide variants (SNVs) by sequencing restriction site-associated DNA (RAD) tags in 138 Miscanthus sinensis genotypes, and related SNVs to phenotypic data for 17 traits measured in a field trial. • Confounding by population structure and relatedness was severe in naïve GWAS analyses, but mixed-linear models robustly controlled for these effects and allowed us to detect multiple associations that reached genome-wide significance. Genome-wide prediction accuracies tended to be moderate to high (average of 0.57), but varied dramatically across traits. As expected, predictive abilities increased linearly with the size of the mapping population, but reached a plateau when the number of markers used for prediction exceeded 10,000-20,000, and tended to decline, but remain significant, when cross-validations were performed across subpopulations. • Our results suggest that the immediate implementation of genomic selection in Miscanthus breeding programs may be feasible.

  10. Genome-wide association study of personality traits in the long life family study.

    PubMed

    Bae, Harold T; Sebastiani, Paola; Sun, Jenny X; Andersen, Stacy L; Daw, E Warwick; Terracciano, Antonio; Ferrucci, Luigi; Perls, Thomas T

    2013-01-01

    Personality traits have been shown to be associated with longevity and healthy aging. In order to discover novel genetic modifiers associated with personality traits as related with longevity, we performed a genome-wide association study (GWAS) on personality factors assessed by NEO-five-factor inventory in individuals enrolled in the Long Life Family Study (LLFS), a study of 583 families (N up to 4595) with clustering for longevity in the United States and Denmark. Three SNPs, in almost perfect LD, associated with agreeableness reached genome-wide significance (p < 10(-8)) and replicated in an additional sample of 1279 LLFS subjects, although one (rs9650241) failed to replicate and the other two were not available in two independent replication cohorts, the Baltimore Longitudinal Study of Aging and the New England Centenarian Study. Based on 10,000,000 permutations, the empirical p-value of 2 × 10(-7) was observed for the genome-wide significant SNPs. Seventeen SNPs that reached marginal statistical significance in the two previous GWASs (p-value <10(-4) and 10(-5)), were also marginally significantly associated in this study (p-value <0.05), although none of the associations passed the Bonferroni correction. In addition, we tested age-by-SNP interactions and found some significant associations. Since scores of personality traits in LLFS subjects change in the oldest ages, and genetic factors outweigh environmental factors to achieve extreme ages, these age-by-SNP interactions could be a proxy for complex gene-gene interactions affecting personality traits and longevity.

  11. Genome-wide association study of sepsis in extremely premature infants.

    PubMed

    Srinivasan, Lakshmi; Page, Grier; Kirpalani, Haresh; Murray, Jeffrey C; Das, Abhik; Higgins, Rosemary D; Carlo, Waldemar A; Bell, Edward F; Goldberg, Ronald N; Schibler, Kurt; Sood, Beena G; Stevenson, David K; Stoll, Barbara J; Van Meurs, Krisa P; Johnson, Karen J; Levy, Joshua; McDonald, Scott A; Zaterka-Baxter, Kristin M; Kennedy, Kathleen A; Sánchez, Pablo J; Duara, Shahnaz; Walsh, Michele C; Shankaran, Seetha; Wynn, James L; Cotten, C Michael

    2017-09-01

    To identify genetic variants associated with sepsis (early-onset and late-onset) using a genome-wide association (GWA) analysis in a cohort of extremely premature infants. Previously generated GWA data from the Neonatal Research Network's anonymised genomic database biorepository of extremely premature infants were used for this study. Sepsis was defined as culture-positive early-onset or late-onset sepsis or culture-proven meningitis. Genomic and whole-genome-amplified DNA was genotyped for 1.2 million single-nucleotide polymorphisms (SNPs); 91% of SNPs were successfully genotyped. We imputed 7.2 million additional SNPs. P values and false discovery rates (FDRs) were calculated from multivariate logistic regression analysis adjusting for gender, gestational age and ancestry. Target statistical value was p<10(-5). Secondary analyses assessed associations of SNPs with pathogen type. Pathway analyses were also run on primary and secondary end points. Data from 757 extremely premature infants were included: 351 infants with sepsis and 406 infants without sepsis. No SNPs reached genome-wide significance levels (5×10(-8)); two SNPs in proximity to FOXC2 and FOXL1 genes achieved target levels of significance. In secondary analyses, SNPs for ELMO1, IRAK2 (Gram-positive sepsis), RALA, IMMP2L (Gram-negative sepsis) and PIEZO2 (fungal sepsis) met target significance levels. Pathways associated with sepsis and Gram-negative sepsis included gap junctions, fibroblast growth factor receptors, regulators of cell division and interleukin-1-associated receptor kinase 2 (p values<0.001 and FDR<20%). No SNPs met genome-wide significance in this cohort of extremely low birthweight infants; however, areas of potential association and pathways meriting further study were identified.

  12. Canine hip dysplasia: phenotypic scoring and the role of estimated breeding value analysis.

    PubMed

    Soo, M; Worth, AJ

    2015-03-01

    Canine hip dysplasia (CHD) is a developmental orthopaedic disease of the coxofemoral joints with a multifactorial mode of inheritance. Multiple gene effects are influenced by environmental factors; therefore, it is unlikely that a simple genetic screening test with which to identify susceptible individuals will be developed in the near future. In the absence of feasible methods for objectively quantifying clinical CHD, radiographic techniques have been developed and widely used to identify dogs for breeding which are less affected by the disease. A hip-extended ventrodorsal view of the pelvis has been traditionally used to identify dogs with subluxation and/or osteoarthritis of the coxofemoral joints. More recently, there has been emphasis on the role of coxofemoral joint laxity as a determinant of CHD and methods have been developed to measure passive hip laxity. Though well-established worldwide, the effectiveness of traditional phenotypic scoring schemes in reducing the prevalence of CHD has been variable. The most successful implementation of traditional CHD scoring has occurred in countries or breeding colonies with mandatory scoring and open registries with access to pedigree records. Several commentators have recommended that for quantitative traits like CHD, selection of breeding stock should be based on estimated breeding values (EBV) rather than individual hip score/grade. The EBV is a reflection of the genetic superiority of an animal compared to its counterparts and is calculated from the phenotype of an individual and its relatives and their pedigree relationship. Selecting breeding stock on the basis of a dog's genetic merit, ideally based on a highly predictive phenotype, will confer the breeder with greater selection power, accelerate genetic improvement towards better hip conformation and thus more likely decrease the prevalence of CHD.

  13. Using pooled data to estimate variance components and breeding values for traits affected by social interactions

    PubMed Central

    2013-01-01

    Background Through social interactions, individuals affect one another’s phenotype. In such cases, an individual’s phenotype is affected by the direct (genetic) effect of the individual itself and the indirect (genetic) effects of the group mates. Using data on individual phenotypes, direct and indirect genetic (co)variances can be estimated. Together, they compose the total genetic variance that determines a population’s potential to respond to selection. However, it can be difficult or expensive to obtain individual phenotypes. Phenotypes on traits such as egg production and feed intake are, therefore, often collected on group level. In this study, we investigated whether direct, indirect and total genetic variances, and breeding values can be estimated from pooled data (pooled by group). In addition, we determined the optimal group composition, i.e. the optimal number of families represented in a group to minimise the standard error of the estimates. Methods This study was performed in three steps. First, all research questions were answered by theoretical derivations. Second, a simulation study was conducted to investigate the estimation of variance components and optimal group composition. Third, individual and pooled survival records on 12 944 purebred laying hens were analysed to investigate the estimation of breeding values and response to selection. Results Through theoretical derivations and simulations, we showed that the total genetic variance can be estimated from pooled data, but the underlying direct and indirect genetic (co)variances cannot. Moreover, we showed that the most accurate estimates are obtained when group members belong to the same family. Additional theoretical derivations and data analyses on survival records showed that the total genetic variance and breeding values can be estimated from pooled data. Moreover, the correlation between the estimated total breeding values obtained from individual and pooled data was surprisingly

  14. BlueSNP: R package for highly scalable genome-wide association studies using Hadoop clusters.

    PubMed

    Huang, Hailiang; Tata, Sandeep; Prill, Robert J

    2013-01-01

    Computational workloads for genome-wide association studies (GWAS) are growing in scale and complexity, outpacing the capabilities of single-threaded software designed for personal computers. The BlueSNP R package implements GWAS statistical tests in the R programming language and executes the calculations across computer clusters configured with Apache Hadoop, a de facto standard framework for distributed data processing using the MapReduce formalism. BlueSNP makes computationally intensive analyses, such as estimating empirical p-values via data permutation, and searching for expression quantitative trait loci over thousands of genes, feasible for large genotype-phenotype datasets. http://github.com/ibm-bioinformatics/bluesnp
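
    The idea of an empirical p-value obtained by data permutation can be sketched for a single SNP as follows. This is only an illustrative single-machine sketch in Python, not BlueSNP's R/Hadoop implementation; the function name `permutation_p_value`, the covariance-style test statistic, and the toy data are all assumptions made for the example.

```python
import random
from statistics import mean


def permutation_p_value(genotypes, phenotypes, n_perm=1000, seed=0):
    """Empirical p-value for one SNP (hypothetical helper, not BlueSNP's API).

    genotypes: allele counts (0, 1 or 2) per individual.
    phenotypes: trait values per individual, in the same order.
    """
    rng = random.Random(seed)

    def stat(g, y):
        # Absolute centred cross-product between genotype and phenotype.
        gm, ym = mean(g), mean(y)
        return abs(sum((gi - gm) * (yi - ym) for gi, yi in zip(g, y)))

    observed = stat(genotypes, phenotypes)
    shuffled = list(phenotypes)
    hits = 0
    for _ in range(n_perm):
        # Shuffling phenotypes breaks any genotype-phenotype link.
        rng.shuffle(shuffled)
        if stat(genotypes, shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Toy data (made up): trait rises with allele count.
genotypes = [0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2]
phenotypes = [1.0, 1.1, 0.9, 1.2, 2.0, 2.1, 1.9, 2.2, 3.0, 3.1, 2.9, 3.2]
p = permutation_p_value(genotypes, phenotypes)
```

    Because each SNP needs its own set of permutations, the cost grows with both the number of markers and the number of permutations, which is the kind of workload the abstract describes distributing across a cluster.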

  15. Genome-wide Pleiotropy Between Parkinson Disease and Autoimmune Diseases.

    PubMed

    Witoelar, Aree; Jansen, Iris E; Wang, Yunpeng; Desikan, Rahul S; Gibbs, J Raphael; Blauwendraat, Cornelis; Thompson, Wesley K; Hernandez, Dena G; Djurovic, Srdjan; Schork, Andrew J; Bettella, Francesco; Ellinghaus, David; Franke, Andre; Lie, Benedicte A; McEvoy, Linda K; Karlsen, Tom H; Lesage, Suzanne; Morris, Huw R; Brice, Alexis; Wood, Nicholas W; Heutink, Peter; Hardy, John; Singleton, Andrew B; Dale, Anders M; Gasser, Thomas; Andreassen, Ole A; Sharma, Manu

    2017-07-01

    Recent genome-wide association studies (GWAS) and pathway analyses supported long-standing observations of an association between immune-mediated diseases and Parkinson disease (PD). The post-GWAS era provides an opportunity for cross-phenotype analyses between different complex phenotypes. To test the hypothesis that there are common genetic risk variants conveying risk of both PD and autoimmune diseases (ie, pleiotropy) and to identify new shared genetic variants and their pathways by applying a novel statistical framework in a genome-wide approach. Using the conjunction false discovery rate method, this study analyzed GWAS data from a selection of archetypal autoimmune diseases among 138 511 individuals of European ancestry and systematically investigated pleiotropy between PD and type 1 diabetes, Crohn disease, ulcerative colitis, rheumatoid arthritis, celiac disease, psoriasis, and multiple sclerosis. NeuroX data (6927 PD cases and 6108 controls) were used for replication. The study investigated the biological correlation between the top loci through protein-protein interaction and changes in the gene expression and methylation levels. The dates of the analysis were June 10, 2015, to March 4, 2017. The primary outcome was a list of novel loci and their pathways involved in PD and autoimmune diseases. Genome-wide conjunctional analysis identified 17 novel loci at false discovery rate less than 0.05 with overlap between PD and autoimmune diseases, including known PD loci adjacent to GAK, HLA-DRB5, LRRK2, and MAPT for rheumatoid arthritis, ulcerative colitis and Crohn disease. Replication confirmed the involvement of HLA, LRRK2, MAPT, TRIM10, and SETD1A in PD. Among the novel genes discovered, WNT3, KANSL1, CRHR1, BOLA2, and GUCY1A3 are within a protein-protein interaction network with known PD genes. A subset of novel loci was significantly associated with changes in methylation or expression levels of adjacent genes. The study findings provide novel mechanistic

  16. Genome-wide association study of antisocial personality disorder

    PubMed Central

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10−5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  17. Susceptibility to Childhood Pneumonia: A Genome-Wide Analysis.

    PubMed

    Hayden, Lystra P; Cho, Michael H; McDonald, Merry-Lynn N; Crapo, James D; Beaty, Terri H; Silverman, Edwin K; Hersh, Craig P

    2017-01-01

    Previous studies have indicated that in adult smokers, a history of childhood pneumonia is associated with reduced lung function and chronic obstructive pulmonary disease. There have been few previous investigations using genome-wide association studies to investigate genetic predisposition to pneumonia. This study aims to identify the genetic variants associated with the development of pneumonia during childhood and over the course of the lifetime. Study subjects included current and former smokers with and without chronic obstructive pulmonary disease participating in the COPDGene Study. Pneumonia was defined by subject self-report, with childhood pneumonia categorized as having the first episode at <16 years. Genome-wide association studies for childhood pneumonia (843 cases, 9,091 control subjects) and lifetime pneumonia (3,766 cases, 5,659 control subjects) were performed separately in non-Hispanic whites and African Americans. Non-Hispanic white and African American populations were combined in the meta-analysis. Top genetic variants from childhood pneumonia were assessed in network analysis. No single-nucleotide polymorphisms reached genome-wide significance, although we identified potential regions of interest. In the childhood pneumonia analysis, this included variants in NGR1 (P = 6.3 × 10(-8)), PAK6 (P = 3.3 × 10(-7)), and near MATN1 (P = 2.8 × 10(-7)). In the lifetime pneumonia analysis, this included variants in LOC339862 (P = 8.7 × 10(-7)), RAPGEF2 (P = 8.4 × 10(-7)), PHACTR1 (P = 6.1 × 10(-7)), near PRR27 (P = 4.3 × 10(-7)), and near MCPH1 (P = 2.7 × 10(-7)). Network analysis of the genes associated with childhood pneumonia included top networks related to development, blood vessel morphogenesis, muscle contraction, WNT signaling, DNA damage, apoptosis, inflammation, and immune response (P ≤ 0.05). We have identified genes potentially associated with the risk of pneumonia

  18. Genome-Wide Footprints of Pig Domestication and Selection Revealed through Massive Parallel Sequencing of Pooled DNA

    PubMed Central

    Amaral, Andreia J.; Ferretti, Luca; Megens, Hendrik-Jan; Crooijmans, Richard P. M. A.; Nie, Haisheng; Ramos-Onsins, Sebastian E.; Perez-Enciso, Miguel; Schook, Lawrence B.; Groenen, Martien A. M.

    2011-01-01

    Background Artificial selection has caused rapid evolution in domesticated species. The identification of selection footprints across domesticated genomes can contribute to uncovering the genetic basis of phenotypic diversity. Methodology/Main Findings Genome wide footprints of pig domestication and selection were identified using massive parallel sequencing of pooled reduced representation libraries (RRL) representing ∼2% of the genome from wild boar and four domestic pig breeds (Large White, Landrace, Duroc and Pietrain) which have been under strong selection for muscle development, growth, behavior and coat color. Using specifically developed statistical methods that account for DNA pooling, low mean sequencing depth, and sequencing errors, we provide genome-wide estimates of nucleotide diversity and genetic differentiation in pig. Widespread signals suggestive of positive and balancing selection were found and the strongest signals were observed in Pietrain, one of the breeds most intensively selected for muscle development. Most signals were population-specific but affected genomic regions which harbored genes for common biological categories including coat color, brain development, muscle development, growth, metabolism, olfaction and immunity. Genetic differentiation in regions harboring genes related to muscle development and growth was higher between breeds than between a given breed and the wild boar. Conclusions/Significance These results suggest that although domesticated breeds have experienced similar selective pressures, selection has acted upon different genes. This might reflect the multiple domestication events of European breeds or could be the result of subsequent introgression of Asian alleles. Overall, it was estimated that approximately 7% of the porcine genome has been affected by selection events. This study illustrates that the massive parallel sequencing of genomic pools is a cost-effective approach to identify footprints of selection

  19. Genome-Wide Patterns of Genetic Variation in Two Domestic Chickens

    PubMed Central

    Fan, Wen-Lang; Ng, Chen Siang; Chen, Chih-Feng; Lu, Mei-Yeh Jade; Chen, Yu-Hsiang; Liu, Chia-Jung; Wu, Siao-Man; Chen, Chih-Kuan; Chen, Jiun-Jie; Mao, Chi-Tang; Lai, Yu-Ting; Lo, Wen-Sui; Chang, Wei-Hua; Li, Wen-Hsiung

    2013-01-01

    Domestic chickens are excellent models for investigating the genetic basis of phenotypic diversity, as numerous phenotypic changes in physiology, morphology, and behavior in chickens have been artificially selected. Genomic study is required to study genome-wide patterns of DNA variation for dissecting the genetic basis of phenotypic traits. We sequenced the genomes of the Silkie and the Taiwanese native chicken L2 at ∼23- and 25-fold average coverage depth, respectively, using Illumina sequencing. The reads were mapped onto the chicken reference genome (including 5.1% Ns) to 92.32% genome coverage for the two breeds. Using a stringent filter, we identified ∼7.6 million single-nucleotide polymorphisms (SNPs) and 8,839 copy number variations (CNVs) in the mapped regions; 42% of the SNPs have not been found in other chickens before. Among the 68,906 SNPs annotated in the chicken sequence assembly, 27,852 were nonsynonymous SNPs located in 13,537 genes. We also identified hundreds of shared and divergent structural and copy number variants in intronic and intergenic regions and in coding regions in the two breeds. Functional enrichments of identified genetic variants were discussed. Radical nsSNP-containing immunity genes were enriched in the QTL regions associated with some economic traits for both breeds. Moreover, genetic changes involved in selective sweeps were detected. From the selective sweeps identified in our two breeds, several genes associated with growth, appetite, and metabolic regulation were identified. Our study provides a framework for genetic and genomic research of domestic chickens and facilitates the domestic chicken as an avian model for genomic, biomedical, and evolutionary studies. PMID:23814129

  20. Genome Wide assessment of Parkinson’s disease in a Southern Spanish population

    PubMed Central

    Bandrés-Ciga, S; Price, TR; Barrero, FJ; Escamilla-Sevilla, F; Pelegrina, J; Arepalli, S; Hernández, D; Gutiérrez, B; Cervilla, J; Rivera, M; Rivera, AM; Ding, J; Vives, F; Nalls, MA; Singleton, AB; Durán, R

    2016-01-01

    Here, we set out to study the genetic architecture of Parkinson’s disease (PD) through a Genome-Wide Association Study (GWAS) in a Southern Spanish population. 240 PD cases and 192 controls were genotyped on the NeuroX array. We estimated genetic variation associated with PD risk and age at onset (AAO). Risk profile analyses for PD and AAO were performed using a weighted genetic risk score (GRS). Total heritability was estimated by genome-wide complex trait analysis. Rare variants were screened with single-variant and burden tests. We also screened for variation in known PD genes. Finally, we explored runs of homozygosity and structural genomic variations. We replicate PD association (uncorrected p-value < 0.05) at the following loci: ACMSD/TMEM163, MAPT, STK39, MIR4697 and SREBF/RAI1. Subjects in the highest GRS quintile showed significantly increased risk of PD versus the lowest quintile (OR=3.6, p-value < 4e−7), but no significant difference in AAO. We found evidence of runs of homozygosity in two PD-associated regions: one intersecting the HLA-DQB1 gene in six patients and one control; and another intersecting the GBA-SYT11 gene in one PD case. The GBA N370S and the LRRK2 G2019S variants were found in 8 and 7 cases respectively, replicating previous work. A structural variant was found in one case in the PARK2 gene locus. This current work represents a comprehensive assessment at a genome-wide level characterizing a novel population in PD genetics. PMID:27393345

  1. A Pilot Genome-Wide Association Study Identifies Potential Metabolic Pathways Involved in Tinnitus

    PubMed Central

    Gilles, Annick; Van Camp, Guy; Van de Heyning, Paul; Fransen, Erik

    2017-01-01

    Tinnitus, the perception of an auditory phantom sound in the form of ringing, buzzing, roaring, or hissing in the absence of an external sound source, is perceived by ~15% of the population and 2.5% experiences a severely bothersome tinnitus. The contribution of genes on the development of tinnitus is still under debate. The current manuscript reports a pilot Genome Wide Association Study (GWAS) into tinnitus, in a small cohort of 167 independent tinnitus subjects, and 749 non-tinnitus controls, who were collected as part of a cross-sectional study. After genotyping, imputation, and quality checking, the association between the tinnitus phenotype and 4,000,000 single-nucleotide polymorphisms (SNPs) was tested followed by gene set enrichment analysis. None of the SNPs reached the threshold for genome-wide significance (p < 5.0e–8), with the most significant SNPs, situated outside coding genes, reaching a p-value of 3.4e–7. By using the Genetic Analysis of Complex Traits (GACT) software, the percentage of the variance explained by all SNPs in the GWAS was estimated to be 3.2%, indicating that additive genetic effects explain only a small fraction of the tinnitus phenotype. Despite the lack of genome-wide significant SNPs, which is, at least in part, due to the limited sample size of the current study, evidence was found for a genetic involvement in tinnitus. Gene set enrichment analysis showed several metabolic pathways to be significantly enriched with SNPs having a low p-value in the GWAS. These pathways are involved in oxidative stress, endoplasmic reticulum (ER) stress, and serotonin reception mediated signaling. These results are a promising basis for further research into the genetic basis of tinnitus, including GWAS with larger sample sizes and considering tinnitus subtypes for which a greater genetic contribution is more likely. PMID:28303087

  2. Microfluidics for genome-wide studies involving next generation sequencing

    PubMed Central

    Murphy, Travis W.; Lu, Chang

    2017-01-01

    Next-generation sequencing (NGS) has revolutionized how molecular biology studies are conducted. Its decreasing cost and increasing throughput permit profiling of genomic, transcriptomic, and epigenomic features for a wide range of applications. Microfluidics has been proven to be highly complementary to NGS technology with its unique capabilities for handling small volumes of samples and providing platforms for automation, integration, and multiplexing. In this article, we review recent progress on applying microfluidics to facilitate genome-wide studies. We emphasize several technical aspects of NGS and how they benefit from coupling with microfluidic technology. We also summarize recent efforts on developing microfluidic technology for genomic, transcriptomic, and epigenomic studies, with emphasis on single cell analysis. We envision rapid growth in these directions, driven by the needs for testing scarce primary cell samples from patients in the context of precision medicine. PMID:28396707

  3. Implications of genome-wide association studies in cancer therapeutics

    PubMed Central

    Patel, Jai N; McLeod, Howard L; Innocenti, Federico

    2013-01-01

    Genome wide association studies (GWAS) provide an agnostic approach to identifying potential genetic variants associated with disease susceptibility, prognosis of survival and/or predictive of drug response. Although these techniques are costly and interpretation of study results is challenging, they do allow for a more unbiased interrogation of the entire genome, resulting in the discovery of novel genes and understanding of novel biological associations. This review will focus on the implications of GWAS in cancer therapy, in particular germ-line mutations, including findings from major GWAS which have identified predictive genetic loci for clinical outcome and/or toxicity. Lessons and challenges in cancer GWAS are also discussed, including the need for functional analysis and replication, as well as future perspectives for biological and clinical utility. Given the large heterogeneity in response to cancer therapeutics, novel methods of identifying mechanisms and biology of variable drug response and ultimately treatment individualization will be indispensable. PMID:23701381

  4. High-resolution genome-wide mapping of histone modifications.

    PubMed

    Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji

    2004-08-01

    The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.

  5. Genome-Wide Association Studies of Drug-Resistance Determinants.

    PubMed

    Volkman, Sarah K; Herman, Jonathan; Lukens, Amanda K; Hartl, Daniel L

    2017-03-01

    Population genetic strategies that leverage association, selection, and linkage have identified drug-resistant loci. However, challenges and limitations persist in identifying drug-resistance loci in malaria. In this review we discuss the genetic basis of drug resistance and the use of genome-wide association studies, complemented by selection and linkage studies, to identify and understand mechanisms of drug resistance and response. We also discuss the implications of nongenetic mechanisms of drug resistance recently reported in the literature, and present models of the interplay between nongenetic and genetic processes that contribute to the emergence of drug resistance. Throughout, we examine artemisinin resistance as an example to emphasize challenges in identifying phenotypes suitable for population genetic studies as well as complications due to multiple-factor drug resistance.

  6. Genome-wide measurement of RNA folding energies.

    PubMed

    Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y

    2012-10-26

    RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.

  7. Genome-wide studies of telomere biology in budding yeast

    PubMed Central

    Harari, Yaniv; Kupiec, Martin

    2014-01-01

    Telomeres are specialized DNA-protein structures at the ends of eukaryotic chromosomes. Telomeres are essential for chromosomal stability and integrity, as they prevent chromosome ends from being recognized as double strand breaks. In rapidly proliferating cells, telomeric DNA is synthesized by the enzyme telomerase, which copies a short template sequence within its own RNA moiety, thus helping to solve the “end-replication problem”, in which information is lost at the ends of chromosomes with each DNA replication cycle. The basic mechanisms of telomere length, structure and function maintenance are conserved among eukaryotes. Studies in the yeast Saccharomyces cerevisiae have been instrumental in deciphering the basic aspects of telomere biology. In the last decade, technical advances, such as the availability of mutant collections, have allowed carrying out systematic genome-wide screens for mutants affecting various aspects of telomere biology. In this review we summarize these efforts, and the insights that this Systems Biology approach has produced so far.

  8. [Genome-wide associations for cigarette smoking behavior].

    PubMed

    Strauss, Ewa

    2013-01-01

    Diseases related to tobacco smoking are the second leading cause of death in the world. Despite increasing evidence of genetic determination, the susceptibility genes and loci underlying various aspects of smoking behavior are largely unknown. Genome-wide association studies (GWASs) provided a new conceptual framework in the search for variants underlying common traits/disorders. A massive scan of the genome and a "hypothesis-free" approach enable discovery of new aspects of the genetics of complex traits. In this paper the results of GWASs and GWAS meta-analyses of cigarette smoking behavior and nicotine dependence are reviewed, with particular attention to smoking cessation success and replacement therapy. The results of these studies are discussed in the context of the results of the candidate gene association studies. Studies on the role of the genomic regions, identified in GWASs, in the development of smoking-related diseases are also discussed.

  9. The utility of genome-wide association studies in hepatology.

    PubMed

    Karlsen, Tom H; Melum, Espen; Franke, Andre

    2010-05-01

    Over the last 4 years, more than 450 genome-wide association studies (GWAS) have been successfully performed in a variety of human traits, of which approximately 2% relates to the field of hepatology. Whereas the many robust susceptibility gene findings have provided insight into fundamental physiological aspects of the phenotypes that have been studied, the widespread application has also revealed important limitations of the GWAS design. This review aims to systematically summarize both the strengths and the weaknesses of GWAS, as well as underscore important experiences made in model diseases outside the field of hepatology. By reviewing the GWAS performed in hepatology so far on this broader background, extensions and guidelines for the rational application of the study design in hepatology are proposed.

  10. Implications of genome-wide association studies in cancer therapeutics.

    PubMed

    Patel, Jai N; McLeod, Howard L; Innocenti, Federico

    2013-09-01

    Genome wide association studies (GWAS) provide an agnostic approach to identifying potential genetic variants associated with disease susceptibility, prognosis of survival and/or predictive of drug response. Although these techniques are costly and interpretation of study results is challenging, they do allow for a more unbiased interrogation of the entire genome, resulting in the discovery of novel genes and understanding of novel biological associations. This review will focus on the implications of GWAS in cancer therapy, in particular germ-line mutations, including findings from major GWAS which have identified predictive genetic loci for clinical outcome and/or toxicity. Lessons and challenges in cancer GWAS are also discussed, including the need for functional analysis and replication, as well as future perspectives for biological and clinical utility. Given the large heterogeneity in response to cancer therapeutics, novel methods of identifying mechanisms and biology of variable drug response and ultimately treatment individualization will be indispensable. © 2013 The British Pharmacological Society.

  11. Quality control for genome-wide association studies.

    PubMed

    Gondro, Cedric; Lee, Seung Hwan; Lee, Hak Kyo; Porto-Neto, Laercio R

    2013-01-01

    This chapter overviews the quality control (QC) issues for SNP-based genotyping methods used in genome-wide association studies. The main metrics for evaluating the quality of the genotypes are discussed, followed by a worked example of a QC pipeline starting with raw data and finishing with a fully filtered dataset ready for downstream analysis. The emphasis is on automation of data storage, filtering, and manipulation to ensure data integrity throughout the process and on how to extract a global summary from these high dimensional datasets to allow better-informed downstream analytical decisions. All examples will be run using the R statistical programming language followed by a practical example using a fully automated QC pipeline for the Illumina platform.
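
    The abstract does not list the QC metrics it covers. As a rough illustration only, two filters that are commonly applied per SNP are call rate and minor allele frequency (MAF). The sketch below is in Python rather than the R used in the chapter; the function names, the 0/1/2 genotype coding, the thresholds, and the toy genotypes are all assumptions made for this example.

```python
from typing import Dict, List, Optional

# Assumed coding: 0, 1 or 2 copies of the alternate allele; None = missing call.
Genotype = Optional[int]


def call_rate(genotypes: List[Genotype]) -> float:
    """Fraction of samples with a non-missing genotype call at one SNP."""
    if not genotypes:
        return 0.0
    called = sum(g is not None for g in genotypes)
    return called / len(genotypes)


def minor_allele_frequency(genotypes: List[Genotype]) -> float:
    """Frequency of the rarer allele among called genotypes."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    alt_freq = sum(called) / (2 * len(called))
    return min(alt_freq, 1.0 - alt_freq)


def passes_qc(genotypes: List[Genotype],
              min_call_rate: float = 0.95,   # illustrative threshold
              min_maf: float = 0.01) -> bool:  # illustrative threshold
    """Keep a SNP only if it clears both filters."""
    return (call_rate(genotypes) >= min_call_rate
            and minor_allele_frequency(genotypes) >= min_maf)


# Hypothetical toy data: three SNPs genotyped in ten samples.
snps: Dict[str, List[Genotype]] = {
    "snp_ok": [0, 1, 2, 1, 0, 1, 0, 2, 1, 0],
    "snp_missing": [0, None, None, 1, 0, None, 0, 2, 1, 0],
    "snp_monomorphic": [0] * 10,
}
kept = [name for name, g in snps.items() if passes_qc(g)]
print(kept)
```

    In this toy set only the first SNP survives: the second fails on call rate and the third has no minor allele at all.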

  12. Ultrafast laser nanosurgery in microfluidics for genome-wide screenings

    PubMed Central

    Ben-Yakar, Adela; Bourgeois, Frederic

    2009-01-01

    Summary The use of ultrafast laser pulses in surgery has allowed for unprecedented precision with minimal collateral damage to surrounding tissues. For these reasons, ultrafast laser nanosurgery, as an injury model, has gained tremendous momentum in experimental biology ranging from in-vitro manipulations of subcellular structures to in-vivo studies in whole living organisms. For example, femtosecond laser nanosurgery on model organisms such as the nematode Caenorhabditis elegans (C. elegans) has opened new opportunities for in-vivo nerve regeneration studies. Meanwhile, the development of novel microfluidic devices has brought control of the experimental environment to the level required for precise nanosurgery in various animal models. Merging microfluidics and laser nanosurgery has recently improved the specificity and increased the speed of laser surgeries, enabling fast genome-wide screenings that can more readily decode the genetic map of various biological processes. PMID:19278850

  13. Genome-wide association studies in pharmacogenomics of antidepressants.

    PubMed

    Lin, Eugene; Lane, Hsien-Yuan

    2015-01-01

    Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses because the effectiveness of any particular antidepressant in an individual patient cannot currently be predicted. With recent scientific advances, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms using high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.

  14. Genome-wide linkage in Utah autism pedigrees

    PubMed Central

    Allen-Brady, K; Robison, R; Cannon, D; Varvil, T; Villalobos, M; Pingree, C; Leppert, MF; Miller, J; McMahon, WM; Coon, H

    2014-01-01

    Genetic studies of autism over the past decade suggest a complex landscape of multiple genes. In the face of this heterogeneity, studies that include large extended pedigrees may offer valuable insight, as the relatively few susceptibility genes within single large families may be more easily discerned. This genome-wide screen of 70 families includes 20 large extended pedigrees of 6–9 generations, 6 moderate-sized families of 4–5 generations, and 44 smaller families of 2–3 generations. The Center for Inherited Disease Research (CIDR) provided genotyping using the Illumina Linkage Panel 12, a 6K single nucleotide polymorphism (SNP) platform. Results from 192 subjects with an Autism Spectrum Disorder (ASD), and 461 of their relatives revealed genome-wide significance on chromosome 15q, with three possibly distinct peaks: 15q13.1-q14 (HLOD=4.09 at 29,459,872bp); 15q14-q21.1 (HLOD=3.59 at 36,837,208bp); and 15q21.1-q22.2 (HLOD=5.31 at 55,629,733bp). Two of these peaks replicate previous findings. There were additional suggestive results on chromosomes 2p25.3-p24.1 (HLOD=1.87), 7q31.31-q32.3 (HLOD=1.97), and 13q12.11-q12.3 (HLOD=1.93). Affected subjects in families supporting the linkage peaks found in this study did not reveal strong evidence for distinct phenotypic subgroups. PMID:19455147

  15. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op 't Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; 
Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10^-7) in SOX2OT and rs17030795 (P=5.84×10^-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10^-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10^-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10^-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567
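
    The abstract does not say which test produced the P value for the 76% directional agreement. One common way to reason about such a figure is a one-sided sign test: if there were no true effects, each SNP would agree in direction with probability 0.5. The sketch below assumes, purely for illustration, that the 72 independent SNPs were the ones compared and that 76% rounds to 55 of them; neither count is stated in the record.

```python
from math import comb


def binomial_upper_tail(k: int, n: int, p: float = 0.5) -> float:
    """P(X >= k) for X ~ Binomial(n, p): the one-sided sign-test P value."""
    return sum(comb(n, i) * p**i * (1.0 - p) ** (n - i) for i in range(k, n + 1))


# Assumptions for illustration (not taken from the abstract):
n_snps = 72                        # independent SNPs carried to replication
k_same = round(0.76 * n_snps)      # SNPs with the same effect direction

p_value = binomial_upper_tail(k_same, n_snps)
print(f"{k_same}/{n_snps} concordant, one-sided sign-test P = {p_value:.2e}")
```

    Under these assumptions the tail probability is very small, which is the sense in which the observed concordance is "highly unlikely to be due to chance"; the exact value depends on the counts and test the authors actually used.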

  16. Genome-wide significant loci for addiction and anxiety

    PubMed Central

    Hodgson, K.; Almasy, L.; Knowles, E.E.M.; Kent, J.W.; Curran, J.E.; Dyer, T.D.; Göring, H.H.H.; Olvera, R.L.; Fox, P.T.; Pearlson, G.D.; Krystal, J.H.; Duggirala, R.; Blangero, J.; Glahn, D.C.

    2017-01-01

    Background: Psychiatric comorbidity is common among individuals with addictive disorders, with patients frequently suffering from anxiety disorders. While the genetic architecture of comorbid addictive and anxiety disorders remains unclear, elucidating the genes involved could provide important insights into the underlying etiology. Methods: Here we examine a sample of 1284 Mexican-Americans from randomly selected extended pedigrees. Variance decomposition methods were used to examine the role of genetics in addiction phenotypes (lifetime history of alcohol dependence, drug dependence or chronic smoking) and various forms of clinically relevant anxiety. Genome-wide univariate and bivariate linkage scans were conducted to localize the chromosomal regions influencing these traits. Results: Addiction phenotypes and anxiety were shown to be heritable and univariate genome-wide linkage scans revealed significant quantitative trait loci for drug dependence (14q13.2–q21.2, LOD = 3.322) and a broad anxiety phenotype (12q24.32–q24.33, LOD = 2.918). Significant positive genetic correlations were observed between anxiety and each of the addiction subtypes (ρg = 0.550–0.655) and further investigation with bivariate linkage analyses identified significant pleiotropic signals for alcohol dependence-anxiety (9q33.1–q33.2, LOD = 3.054) and drug dependence-anxiety (18p11.23–p11.22, LOD = 3.425). Conclusions: This study confirms the shared genetic underpinnings of addiction and anxiety and identifies genomic loci involved in the etiology of these comorbid disorders. The linkage signal for anxiety on 12q24 spans the location of TMEM132D, an emerging gene of interest from previous GWAS of anxiety traits, whilst the peak of the bivariate linkage signal identified for anxiety-alcohol on 9q33 coincides with a region where rare CNVs have been associated with psychiatric disorders. Other signals identified implicate novel regions of the genome in addiction genetics. PMID:27318301
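
    For readers less familiar with LOD scores: a LOD is the base-10 logarithm of a likelihood ratio, so 2·ln(10)·LOD is the usual likelihood-ratio statistic. The sketch below converts a univariate LOD into an approximate pointwise P value, assuming the test has a single variance parameter constrained to be non-negative so that the null distribution is a 50:50 mixture of a point mass at zero and a chi-square with one degree of freedom. That assumption, and the function name, are ours; the abstract does not describe how significance was assessed, and bivariate LODs follow a different null distribution.

```python
from math import erfc, log, sqrt


def lod_to_pointwise_p(lod: float) -> float:
    """Approximate pointwise P value for a one-parameter, boundary-constrained LOD.

    Assumes a 50:50 mixture of a point mass at 0 and chi-square(1) under the null.
    """
    if lod <= 0.0:
        return 1.0
    chi2 = 2.0 * log(10.0) * lod            # likelihood-ratio statistic
    chi2_sf = erfc(sqrt(chi2 / 2.0))        # P(chi-square(1) > chi2)
    return 0.5 * chi2_sf


# Univariate LOD scores quoted in the abstract.
for label, lod in [("drug dependence", 3.322), ("broad anxiety", 2.918)]:
    print(f"{label}: LOD = {lod}, pointwise P ~ {lod_to_pointwise_p(lod):.1e}")
```

    These are pointwise values only; genome-wide significance additionally has to account for the many correlated positions tested along the genome.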

  17. Genome-Wide Association Study of Meiotic Recombination Phenotypes

    PubMed Central

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor

    2016-01-01

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454

  18. Genome-Wide Association Study of Meiotic Recombination Phenotypes.

    PubMed

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G; Sherman, Stephanie L; Feingold, Eleanor

    2016-12-07

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events.

  19. A genome-wide association study of attempted suicide

    PubMed Central

    Willour, Virginia L.; Seifuddin, Fayaz; Mahon, Pamela B.; Jancic, Dubravka; Pirooznia, Mehdi; Steele, Jo; Schweizer, Barbara; Goes, Fernando S.; Mondimore, Francis M.; MacKinnon, Dean F.; Perlis, Roy H.; Lee, Phil Hyoun; Huang, Jie; Kelsoe, John R.; Shilling, Paul D.; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W.; Craddock, Nicholas; DePaulo, J. Raymond; Schulze, Thomas G.; McMahon, Francis J.; Zandi, Peter P.; Potash, James B.

    2011-01-01

    The heritable component to attempted and completed suicide is partly related to psychiatric disorders and also partly independent of them. While attempted suicide linkage regions have been identified on 2p11–12 and 6q25–26, there are likely many more such loci, the discovery of which will require a much higher resolution approach, such as the genome-wide association study (GWAS). With this in mind, we conducted an attempted suicide GWAS that compared the single nucleotide polymorphism (SNP) genotypes of 1,201 bipolar (BP) subjects with a history of suicide attempts to the genotypes of 1,497 BP subjects without a history of suicide attempts. 2,507 SNPs with evidence for association at p<0.001 were identified. These associated SNPs were subsequently tested for association in a large and independent BP sample set. None of these SNPs were significantly associated in the replication sample after correcting for multiple testing, but the combined analysis of the two sample sets produced an association signal on 2p25 (rs300774) at the threshold of genome-wide significance (p=5.07 × 10^−8). The associated SNPs on 2p25 fall in a large linkage disequilibrium block containing the ACP1 gene, a gene whose expression is significantly elevated in BP subjects who have completed suicide. Furthermore, the ACP1 protein is a tyrosine phosphatase that influences Wnt signaling, a pathway regulated by lithium, making ACP1 a functional candidate for involvement in the phenotype. Larger GWAS sample sets will be required to confirm the signal on 2p25 and to identify additional genetic risk factors increasing susceptibility for attempted suicide. PMID:21423239

  20. A genome-wide methylation study on obesity

    PubMed Central

    Xu, Xiaojing; Su, Shaoyong; Barnes, Vernon A.; De Miguel, Carmen; Pollock, Jennifer; Ownby, Dennis; Shi, Huidong; Zhu, Haidong; Snieder, Harold; Wang, Xiaoling

    2013-01-01

    Besides differential methylation, DNA methylation variation has recently been proposed and demonstrated to be a potential contributing factor to cancer risk. Here we aim to examine whether differential variability in methylation is also an important feature of obesity, a typical non-malignant common complex disease. We analyzed genome-wide methylation profiles of over 470,000 CpGs in peripheral blood samples from 48 obese and 48 lean African-American youth aged 14–20 years. A substantial number of differentially variable CpG sites (DVCs), using statistics based on variances, as well as a substantial number of differentially methylated CpG sites (DMCs), using statistics based on means, were identified. Similar to the findings in cancers, DVCs generally exhibited an outlier structure and were more variable in cases than in controls. By randomly splitting the current sample into a discovery and validation set, we observed that both the DVCs and DMCs identified from the first set could independently predict obesity status in the second set. Furthermore, both the genes harboring DMCs and the genes harboring DVCs showed significant enrichment of genes identified by genome-wide association studies on obesity and related diseases, such as hypertension, dyslipidemia, type 2 diabetes and certain types of cancers, supporting their roles in the etiology and pathogenesis of obesity. We generalized the recent finding on methylation variability in cancer research to obesity and demonstrated that differential variability is also an important feature of obesity-related methylation changes. Future studies on the epigenetics of obesity will benefit from both statistics based on means and statistics based on variances. PMID:23644594
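
    The distinction between mean-based and variance-based statistics can be made concrete with a toy CpG site. The sketch below is an illustration only: the abstract does not name the specific tests used, so a Welch t statistic stands in for "statistics based on means" and a simple variance ratio for "statistics based on variances", and the methylation values are made up, not data from the study.

```python
from math import sqrt
from statistics import mean, variance
from typing import Sequence


def welch_t(cases: Sequence[float], controls: Sequence[float]) -> float:
    """Welch t statistic: compares group means (a DMC-style statistic)."""
    se = sqrt(variance(cases) / len(cases) + variance(controls) / len(controls))
    return (mean(cases) - mean(controls)) / se


def variance_ratio(cases: Sequence[float], controls: Sequence[float]) -> float:
    """Ratio of sample variances: compares group spread (a DVC-style statistic)."""
    return variance(cases) / variance(controls)


# Hypothetical methylation proportions at one CpG site.
controls = [0.50, 0.52, 0.48, 0.51, 0.49, 0.50]
cases = [0.50, 0.52, 0.48, 0.20, 0.80, 0.50]   # same mean, two outliers

print(f"Welch t        = {welch_t(cases, controls):.3f}")
print(f"variance ratio = {variance_ratio(cases, controls):.1f}")
```

    In this toy site the group means are identical, so a mean-based test sees nothing, while the outlier-driven spread in cases makes the variance ratio large. That is the kind of site a variance-based scan can pick up and a mean-based scan would miss.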

  1. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op t Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O’Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; 
Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2013-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10^−7) in SOX2OT and rs17030795 (P=5.84×10^−6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10^−6) between CUL3 and FAM124B and rs1886797 (P=8.05×10^−6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10^−6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:21079607

  2. A genome-wide association study of anorexia nervosa.

    PubMed

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; 
Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.
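
    The direction-of-effect comparison in this abstract can be illustrated with a one-sided sign (binomial) test. The sketch below is only an illustration: the abstract does not name the test or give the underlying counts, so the function name and the figure of 55 concordant effects out of 72 (about 76%) are assumptions, not values reported by the authors.

```python
from math import comb


def sign_test_p(n_same: int, n_total: int) -> float:
    """One-sided binomial P(X >= n_same) when each direction is equally likely."""
    # Count the outcomes with at least n_same concordant signs, then divide
    # by the total number of equally likely sign patterns (2 ** n_total).
    tail = sum(comb(n_total, k) for k in range(n_same, n_total + 1))
    return tail / 2 ** n_total


# Hypothetical counts chosen to give roughly 76% concordance (assumption).
n_total = 72
n_same = 55
print(f"concordance = {n_same / n_total:.0%}")
print(f"one-sided sign-test P = {sign_test_p(n_same, n_total):.2e}")
```

    With counts in this range the tail probability is very small, which is the sense in which such concordance is unlikely under chance alone.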

  3. Genome-Wide DNA Methylation Scan in Major Depressive Disorder

    PubMed Central

    Irizarry, Rafael A.; Rongione, Michael; Webster, Maree J.; Kaufman, Walter E.; Murakami, Peter; Lessard, Andree; Yolken, Robert H.; Feinberg, Andrew P.; Potash, James B.; Consortium, GenRED

    2012-01-01

    While genome-wide association studies are ongoing to identify sequence variation influencing susceptibility to major depressive disorder (MDD), epigenetic marks, such as DNA methylation, which can be influenced by environment, might also play a role. Here we present the first genome-wide DNA methylation (DNAm) scan in MDD. We compared 39 postmortem frontal cortex MDD samples to 26 controls. DNA was hybridized to our Comprehensive High-throughput Arrays for Relative Methylation (CHARM) platform, covering 3.5 million CpGs. CHARM identified 224 candidate regions with DNAm differences >10%. These regions are highly enriched for neuronal growth and development genes. Ten of 17 regions for which validation was attempted showed true DNAm differences; the greatest were in PRIMA1, with 12–15% increased DNAm in MDD (p = 0.0002–0.0003), and a concomitant decrease in gene expression. These results must be considered pilot data, however, as we could only test replication in a small number of additional brain samples (n = 16), which showed no significant difference in PRIMA1. Because PRIMA1 anchors acetylcholinesterase in neuronal membranes, decreased expression could result in decreased enzyme function and increased cholinergic transmission, consistent with a role in MDD. We observed decreased immunoreactivity for acetylcholinesterase in MDD brain with increased PRIMA1 DNAm, non-significant at p = 0.08. While we cannot draw firm conclusions about PRIMA1 DNAm in MDD, the involvement of neuronal development genes across the set showing differential methylation suggests a role for epigenetics in the illness. Further studies using limbic system brain regions might shed additional light on this role. PMID:22511943

  4. Genome-wide identification of molecular mimicry candidates in parasites.

    PubMed

    Ludin, Philipp; Nilsson, Daniel; Mäser, Pascal

    2011-03-08

    Among the many strategies employed by parasites for immune evasion and host manipulation, one of the most fascinating is molecular mimicry. With genome sequences available for host and parasite, mimicry of linear amino acid epitopes can be investigated by comparative genomics. Here we developed an in silico pipeline for genome-wide identification of molecular mimicry candidate proteins or epitopes. The predicted proteome of a given parasite was broken down into overlapping fragments, each of which was screened for close hits in the human proteome. Control searches were carried out against unrelated, free-living eukaryotes to eliminate the generally conserved proteins, and with randomized versions of the parasite proteins to get an estimate of statistical significance. This simple but computation-intensive approach yielded interesting candidates from human-pathogenic parasites. From Plasmodium falciparum, it returned a 14 amino acid motif in several of the PfEMP1 variants identical to part of the heparin-binding domain in the immunosuppressive serum protein vitronectin. And in Brugia malayi, fragments were detected that matched to periphilin-1, a protein of cell-cell junctions involved in barrier formation. All the results are publicly available by means of mimicDB, a searchable online database for molecular mimicry candidates from pathogens. To our knowledge, this is the first genome-wide survey for molecular mimicry proteins in parasites. The strategy can be adapted to any pair of host and pathogen, once appropriate negative control organisms are chosen. MimicDB provides a host of new starting points to gain insights into the molecular nature of host-pathogen interactions.
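
    The fragment-screening idea described in this abstract can be sketched roughly as follows. This is a simplified illustration, not the authors' pipeline: it uses exact matches where the abstract speaks of "close hits", it omits the randomized-sequence significance estimate, and the function names, the fragment length k, and the toy sequences are all hypothetical.

```python
def fragments(seq: str, k: int) -> set[str]:
    """All overlapping length-k fragments of a protein sequence."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def proteome_fragments(proteome: dict[str, str], k: int) -> set[str]:
    """Union of the length-k fragments over every protein in a proteome."""
    result: set[str] = set()
    for seq in proteome.values():
        result |= fragments(seq, k)
    return result


def mimicry_candidates(parasite, host, control, k):
    """Parasite fragments found in the host but absent from the control.

    Exact matching is an assumption made for brevity; the control proteome
    stands in for the unrelated free-living eukaryotes used to discard
    generally conserved proteins.
    """
    host_frags = proteome_fragments(host, k)
    control_frags = proteome_fragments(control, k)
    hits = {}
    for name, seq in parasite.items():
        shared = (fragments(seq, k) & host_frags) - control_frags
        if shared:
            hits[name] = sorted(shared)
    return hits


# Toy sequences (hypothetical) to exercise the filter.
parasite = {"p1": "MKTAYIAKQRGLVEW", "p2": "GGSSGGSSGG"}
host = {"h1": "PPAYIAKQRPP"}
control = {"c1": "TTIAKQRTT"}
print(mimicry_candidates(parasite, host, control, k=5))
```

    In the toy data the fragment shared with the control proteome is dropped, mirroring the step that removes generally conserved sequences.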

  5. Genome-Wide Identification of Molecular Mimicry Candidates in Parasites

    PubMed Central

    Ludin, Philipp; Nilsson, Daniel; Mäser, Pascal

    2011-01-01

    Among the many strategies employed by parasites for immune evasion and host manipulation, one of the most fascinating is molecular mimicry. With genome sequences available for host and parasite, mimicry of linear amino acid epitopes can be investigated by comparative genomics. Here we developed an in silico pipeline for genome-wide identification of molecular mimicry candidate proteins or epitopes. The predicted proteome of a given parasite was broken down into overlapping fragments, each of which was screened for close hits in the human proteome. Control searches were carried out against unrelated, free-living eukaryotes to eliminate the generally conserved proteins, and with randomized versions of the parasite proteins to get an estimate of statistical significance. This simple but computation-intensive approach yielded interesting candidates from human-pathogenic parasites. From Plasmodium falciparum, it returned a 14 amino acid motif in several of the PfEMP1 variants identical to part of the heparin-binding domain in the immunosuppressive serum protein vitronectin. And in Brugia malayi, fragments were detected that matched to periphilin-1, a protein of cell-cell junctions involved in barrier formation. All the results are publicly available by means of mimicDB, a searchable online database for molecular mimicry candidates from pathogens. To our knowledge, this is the first genome-wide survey for molecular mimicry proteins in parasites. The strategy can be adapted to any pair of host and pathogen, once appropriate negative control organisms are chosen. MimicDB provides a host of new starting points to gain insights into the molecular nature of host-pathogen interactions. PMID:21408160

  6. Genome-wide association study of working memory brain activation.

    PubMed

    Blokland, Gabriëlla A M; Wallace, Angus K; Hansell, Narelle K; Thompson, Paul M; Hickie, Ian B; Montgomery, Grant W; Martin, Nicholas G; McMahon, Katie L; de Zubicaray, Greig I; Wright, Margaret J

    2017-05-01

    In a population-based genome-wide association (GWA) study of n-back working memory task-related brain activation, we extracted the average percent BOLD signal change (2-back minus 0-back) from 46 regions-of-interest (ROIs) in functional MRI scans from 863 healthy twins and siblings. ROIs were obtained by creating spheres around group random effects analysis local maxima, and by thresholding a voxel-based heritability map of working memory brain activation at 50%. Quality control for test-retest reliability and heritability of ROI measures yielded 20 reliable (r>0.7) and heritable (h(2)>20%) ROIs. For GWA analysis, the cohort was divided into a discovery (n=679) and replication (n=97) sample. No variants survived the stringent multiple-testing-corrected genome-wide significance threshold (p<4.5×10(-9)), or were replicated (p<0.0016), but several genes were identified that are worthy of further investigation. A search of 529,379 genomic markers resulted in discovery of 31 independent single nucleotide polymorphisms (SNPs) associated with BOLD signal change at a discovery level of p<1×10(-5). Two SNPs (rs7917410 and rs7672408) were associated at a significance level of p<1×10(-7). Only one, most strongly affecting BOLD signal change in the left supramarginal gyrus (R(2)=5.5%), had multiple SNPs associated at p<1×10(-5) in linkage disequilibrium with it, all located in and around the BANK1 gene. BANK1 encodes a B-cell-specific scaffold protein and has been shown to negatively regulate CD40-mediated AKT activation. AKT is part of the dopamine-signaling pathway, suggesting a mechanism for the involvement of BANK1 in the BOLD response to working memory. Variants identified here may be relevant to (the susceptibility to) common disorders affecting brain function. Copyright © 2016 Elsevier B.V. All rights reserved.
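
    As a worked illustration of multiple-testing thresholds such as those quoted in this abstract, the sketch below computes a Bonferroni-corrected cutoff. It assumes a family-wise alpha of 0.05 and assumes that the replication threshold corresponds to testing the 31 independent SNPs; the abstract states neither, although 0.05/31 does round to the reported 0.0016. The discovery threshold is not reconstructed here because the effective number of tests is not given.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance cutoff that keeps the family-wise error at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumption: alpha = 0.05 spread over 31 independent SNPs taken to replication.
replication_cutoff = bonferroni_threshold(0.05, 31)
print(f"replication cutoff = {replication_cutoff:.4f}")
```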

  7. Genome-wide association study of atypical psychosis.

    PubMed

    Kanazawa, Tetsufumi; Ikeda, Masashi; Glatt, Stephen J; Tsutsumi, Atsushi; Kikuyama, Hiroki; Kawamura, Yoshiya; Nishida, Nao; Miyagawa, Taku; Hashimoto, Ryota; Takeda, Masatoshi; Sasaki, Tsukasa; Tokunaga, Katsushi; Koh, Jun; Iwata, Nakao; Yoneda, Hiroshi

    2013-10-01

    Atypical psychosis with a periodic course of exacerbation and features of major psychiatric disorders [schizophrenia (SZ) and bipolar disorder (BD)] has a long history in clinical psychiatry in Japan. Based upon the new criteria of atypical psychosis, a Genome-Wide Association Study (GWAS) was conducted to identify the risk gene or variants. The relationships between atypical psychosis, SZ and BD were then assessed using independent GWAS data. Forty-seven patients with solid criteria of atypical psychosis and 882 normal controls (NCs) were scanned using an Affymetrix 6.0 chip. GWAS SZ data (560 SZ cases and 548 NCs) and GWAS BD (107 cases with BD type 1 and 107 NCs) were compared using gene-based analysis. The most significant SNPs were detected around the CHN2/CPVL genes (rs245914, P = 1.6 × 10(-7)), COL21A1 gene (rs12196860, P = 2.45 × 10(-7)), and PYGL/TRIM9 genes (rs1959536, P = 7.73 × 10(-7)), although none of the single-nucleotide polymorphisms exhibited genome-wide significance (P = 5 × 10(-8)). One of the highest peaks was detected on the major histocompatibility complex region, where large SZ GWASs have previously disclosed an association. The gene-based analysis suggested significant enrichment between SZ and atypical psychosis (P = 0.01), but not BD. This study provides clues about the types of patient whose diagnosis lies between SZ and BD. Studies with larger samples are required to determine the causal variant.

  8. Genome-Wide Binding Patterns of Thyroid Hormone Receptor Beta

    PubMed Central

    Ayers, Stephen; Switnicki, Michal Piotr; Angajala, Anusha; Lammel, Jan; Arumanayagam, Anithachristy S.; Webb, Paul

    2014-01-01

    Thyroid hormone (TH) receptors (TRs) play central roles in metabolism and are major targets for pharmaceutical intervention. Presently, however, there is limited information about genome wide localizations of TR binding sites. Thus, complexities of TR genomic distribution and links between TRβ binding events and gene regulation are not fully appreciated. Here, we employ a BioChIP approach to capture TR genome-wide binding events in a liver cell line (HepG2). Like other NRs, TRβ appears widely distributed throughout the genome. Nevertheless, there is striking enrichment of TRβ binding sites immediately 5′ and 3′ of transcribed genes and TRβ can be detected near 50% of T3 induced genes. In contrast, no significant enrichment of TRβ is seen at negatively regulated genes or genes that respond to unliganded TRs in this system. Canonical TRE half-sites are present in more than 90% of TRβ peaks and classical TREs are also greatly enriched, but individual TRE organization appears highly variable with diverse half-site orientation and spacing. There is also significant enrichment of binding sites for TR associated transcription factors, including AP-1 and CTCF, near TR peaks. We conclude that T3-dependent gene induction commonly involves proximal TRβ binding events but that far-distant binding events are needed for T3 induction of some genes and that distinct, indirect, mechanisms are often at play in negative regulation and unliganded TR actions. Better understanding of genomic context of TR binding sites will help us determine why TR regulates genes in different ways and determine possibilities for selective modulation of TR action. PMID:24558356

  9. Genome-wide association study for feed efficiency and growth traits in U.S. beef cattle.

    PubMed

    Seabury, Christopher M; Oldeschulte, David L; Saatchi, Mahdi; Beever, Jonathan E; Decker, Jared E; Halley, Yvette A; Bhattarai, Eric K; Molaei, Maral; Freetly, Harvey C; Hansen, Stephanie L; Yampara-Iquise, Helen; Johnson, Kristen A; Kerley, Monty S; Kim, JaeWoo; Loy, Daniel D; Marques, Elisa; Neibergs, Holly L; Schnabel, Robert D; Shike, Daniel W; Spangler, Matthew L; Weaber, Robert L; Garrick, Dorian J; Taylor, Jeremy F

    2017-05-18

    Single nucleotide polymorphism (SNP) arrays for domestic cattle have catalyzed the identification of genetic markers associated with complex traits for inclusion in modern breeding and selection programs. Using actual and imputed Illumina 778K genotypes for 3887 U.S. beef cattle from 3 populations (Angus, Hereford, SimAngus), we performed genome-wide association analyses for feed efficiency and growth traits including average daily gain (ADG), dry matter intake (DMI), mid-test metabolic weight (MMWT), and residual feed intake (RFI), with marker-based heritability estimates produced for all traits and populations. Moderate and/or large-effect QTL were detected for all traits in all populations, as jointly defined by the estimated proportion of variance explained (PVE) by marker effects (PVE ≥ 1.0%) and a nominal P-value threshold (P ≤ 5e-05). Lead SNPs with PVE ≥ 2.0% were considered putative evidence of large-effect QTL (n = 52), whereas those with PVE ≥ 1.0% but < 2.0% were considered putative evidence for moderate-effect QTL (n = 35). Identical or proximal lead SNPs associated with ADG, DMI, MMWT, and RFI collectively supported the potential for either pleiotropic QTL, or independent but proximal causal mutations for multiple traits within and between the analyzed populations. Marker-based heritability estimates for all investigated traits ranged from 0.18 to 0.60 using 778K genotypes, or from 0.17 to 0.57 using 50K genotypes (reduced from Illumina 778K HD to Illumina Bovine SNP50). An investigation to determine if QTL detected by 778K analysis could also be detected using 50K genotypes produced variable results, suggesting that 50K analyses were generally insufficient for QTL detection in these populations, and that relevant breeding or selection programs should be based on higher density analyses (imputed or directly ascertained). Fourteen moderate to large-effect QTL regions which ranged from being physically proximal (lead
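
    The joint PVE and P-value criteria stated in this abstract can be written as a small classifier. This is a minimal sketch of the stated cutoffs only (PVE ≥ 2.0% for large effect, 1.0% ≤ PVE < 2.0% for moderate effect, nominal P ≤ 5e-05); the function name and return labels are hypothetical and are not taken from the study's software.

```python
from typing import Optional


def classify_qtl(pve_percent: float, p_value: float,
                 p_max: float = 5e-05) -> Optional[str]:
    """Label a lead SNP by the abstract's PVE and nominal P-value cutoffs.

    Returns "large", "moderate", or None when either criterion fails.
    """
    if p_value > p_max or pve_percent < 1.0:
        return None
    return "large" if pve_percent >= 2.0 else "moderate"


# Illustrative inputs (hypothetical values, not results from the study).
for pve, p in [(2.4, 1e-06), (1.3, 4e-05), (1.3, 1e-03), (0.6, 1e-08)]:
    print(pve, p, classify_qtl(pve, p))
```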

  10. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    USDA-ARS's Scientific Manuscript database

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  11. Genome-Wide Association Mapping of Seed Coat Color in Brassica napus.

    PubMed

    Wang, Jia; Xian, Xiaohua; Xu, Xinfu; Qu, Cunmin; Lu, Kun; Li, Jiana; Liu, Liezhao

    2017-07-05

    Seed coat color is an extremely important breeding characteristic of Brassica napus. To elucidate the factors affecting the genetic architecture of seed coat color, a genome-wide association study (GWAS) of seed coat color was conducted with a diversity panel comprising 520 B. napus cultivars and inbred lines. In total, 22 single-nucleotide polymorphisms (SNPs) distributed on 7 chromosomes were found to be associated with seed coat color. The most significant SNPs were found in 2014 near Bn-scaff_15763_1-p233999, only 43.42 kb away from BnaC06g17050D, which is orthologous to Arabidopsis thaliana TRANSPARENT TESTA 12 (TT12), an important gene involved in the transportation of proanthocyanidin precursors into the vacuole. Two of eight repeatedly detected SNPs can be identified and digested by restriction enzymes. Candidate gene mining revealed that the relevant regions of significant SNP loci on the A09 and C08 chromosomes are highly homologous. Moreover, a comparison of the GWAS results to those of previous quantitative trait locus (QTL) studies showed that 11 SNPs were located in the confidence intervals of the QTLs identified in previous studies based on linkage analyses or association mapping. Our results provide insights into the genetic basis of seed coat color in B. napus, and the beneficial allele, SNP information, and candidate genes should be useful for selecting yellow seeds in B. napus breeding.

  12. Genome-Wide Association of Stem Water Soluble Carbohydrates in Bread Wheat

    PubMed Central

    Dong, Yan; Liu, Jindong; Zhang, Yan; Geng, Hongwei; Rasheed, Awais; Xiao, Yonggui; Cao, Shuanghe; Fu, Luping; Yan, Jun; Wen, Weie; Zhang, Yong; Jing, Ruilian; Xia, Xianchun; He, Zhonghu

    2016-01-01

    Water soluble carbohydrates (WSC) in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS) using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty-two marker-trait associations (MTAs) distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs), and 11 MTAs were identified in two or more environments. Linear regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content) and unfavorable alleles (decreasing WSC), indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoded proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars. PMID:27802269

  13. Genome-wide association studies dissect the genetic networks underlying agronomical traits in soybean.

    PubMed

    Fang, Chao; Ma, Yanming; Wu, Shiwen; Liu, Zhi; Wang, Zheng; Yang, Rui; Hu, Guanghui; Zhou, Zhengkui; Yu, Hong; Zhang, Min; Pan, Yi; Zhou, Guoan; Ren, Haixiang; Du, Weiguang; Yan, Hongrui; Wang, Yanping; Han, Dezhi; Shen, Yanting; Liu, Shulin; Liu, Tengfei; Zhang, Jixiang; Qin, Hao; Yuan, Jia; Yuan, Xiaohui; Kong, Fanjiang; Liu, Baohui; Li, Jiayang; Zhang, Zhiwu; Wang, Guodong; Zhu, Baoge; Tian, Zhixi

    2017-08-24

    Soybean (Glycine max [L.] Merr.) is one of the most important oil and protein crops. Ever-increasing soybean consumption necessitates the improvement of varieties for more efficient production. However, both correlations among different traits and genetic interactions among genes that affect a single trait pose a challenge to soybean breeding. To understand the genetic networks underlying phenotypic correlations, we collected 809 soybean accessions worldwide and phenotyped them for two years at three locations for 84 agronomic traits. Genome-wide association studies identified 245 significant genetic loci, among which 95 genetically interacted with other loci. We determined that 14 oil synthesis-related genes are responsible for fatty acid accumulation in soybean and function in line with an additive model. Network analyses demonstrated that 51 traits could be linked through the linkage disequilibrium of 115 associated loci and these links reflect phenotypic correlations. We revealed that 23 loci, including the known Dt1, E2, E1, Ln, Dt2, Fan, and Fap loci, as well as 16 undefined associated loci, have pleiotropic effects on different traits. This study provides insights into the genetic correlation among complex traits and will facilitate future soybean functional studies and breeding through molecular design.

  14. Genome-Wide Association of Stem Water Soluble Carbohydrates in Bread Wheat.

    PubMed

    Dong, Yan; Liu, Jindong; Zhang, Yan; Geng, Hongwei; Rasheed, Awais; Xiao, Yonggui; Cao, Shuanghe; Fu, Luping; Yan, Jun; Wen, Weie; Zhang, Yong; Jing, Ruilian; Xia, Xianchun; He, Zhonghu

    2016-01-01

    Water soluble carbohydrates (WSC) in stems play an important role in buffering grain yield in wheat against biotic and abiotic stresses; however, knowledge of genes controlling WSC is very limited. We conducted a genome-wide association study (GWAS) using a high-density 90K SNP array to better understand the genetic basis underlying WSC, and to explore marker-based breeding approaches. WSC was evaluated in an association panel comprising 166 Chinese bread wheat cultivars planted in four environments. Fifty-two marker-trait associations (MTAs) distributed across 23 loci were identified for phenotypic best linear unbiased estimates (BLUEs), and 11 MTAs were identified in two or more environments. Linear regression showed a clear dependence of WSC BLUE scores on numbers of favorable (increasing WSC content) and unfavorable alleles (decreasing WSC), indicating that genotypes with higher numbers of favorable or lower numbers of unfavorable alleles had higher WSC content. In silico analysis of flanking sequences of trait-associated SNPs revealed eight candidate genes related to WSC content grouped into two categories based on the type of encoded proteins, namely, defense response proteins and proteins triggered by environmental stresses. The identified SNPs and candidate genes related to WSC provide opportunities for breeding higher WSC wheat cultivars.

  15. Genome-wide association study for performance traits in chickens using genotype by sequencing approach

    PubMed Central

    Pértille, Fábio; Moreira, Gabriel Costa Monteiro; Zanella, Ricardo; Nunes, José de Ribamar da Silva; Boschiero, Clarissa; Rovadoscki, Gregori Alberto; Mourão, Gerson Barreto; Ledur, Mônica Corrêa; Coutinho, Luiz Lehmann

    2017-01-01

    Performance traits are economically important and are targets for selection in breeding programs, especially in the poultry industry. To identify regions on the chicken genome associated with performance traits, different genomic approaches have been applied in recent years. The aim of this study was the application of the CornellGBS approach (134,528 SNPs generated from a PstI restriction enzyme) on Genome-Wide Association Studies (GWAS) in an outbred F2 chicken population. We have validated 91.7% of these 134,528 SNPs after imputation of missing genotypes. Out of those, 20 SNPs were associated with feed conversion, one was associated with body weight at 35 days of age (P < 7.86E-07) and 93 were suggestively associated with a variety of performance traits (P < 1.57E-05). The majority of these SNPs (86.2%) overlapped with previously mapped QTL for the same performance traits and some of the SNPs also showed novel potential QTL regions. The results obtained in this study suggest future searches for candidate genes and QTL refinements as well as potential use of the SNPs described here in breeding programs. PMID:28181508

  16. Genome-wide association mapping and agronomic impact of cowpea root architecture.

    PubMed

    Burridge, James D; Schneider, Hannah M; Huynh, Bao-Lam; Roberts, Philip A; Bucksch, Alexander; Lynch, Jonathan P

    2017-02-01

    Genetic analysis of data produced by novel root phenotyping tools was used to establish relationships between cowpea root traits and performance indicators as well as between root traits and Striga tolerance. Selection and breeding for better root phenotypes can improve acquisition of soil resources and hence crop production in marginal environments. We hypothesized that biologically relevant variation is measurable in cowpea root architecture. This study implemented manual phenotyping (shovelomics) and automated image phenotyping (DIRT) on a 189-entry diversity panel of cowpea to reveal biologically important variation and genome regions affecting root architecture phenes. Significant variation in root phenes was found and relatively high heritabilities were detected for root traits assessed manually (0.4 for nodulation and 0.8 for number of larger laterals) as well as repeatability traits phenotyped via DIRT (0.5 for a measure of root width and 0.3 for a measure of root tips). Genome-wide association study identified 11 significant quantitative trait loci (QTL) from manually scored root architecture traits and 21 QTL from root architecture traits phenotyped by DIRT image analysis. Subsequent comparisons of results from this root study with other field studies revealed QTL co-localizations between root traits and performance indicators including seed weight per plant, pod number, and Striga (Striga gesnerioides) tolerance. The data suggest selection for root phenotypes could be employed by breeding programs to improve production in multiple constraint environments.

  17. Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries.

    PubMed

    Baurley, James W; Edlund, Christopher K; Pardamean, Carissa I; Conti, David V; Krasnow, Ruth; Javitz, Harold S; Hops, Hyman; Swan, Gary E; Benowitz, Neal L; Bergen, Andrew W

    2016-09-01

    Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3'-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan-continental population biomarkers for nicotine metabolism. This

  18. Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries

    PubMed Central

    Baurley, James W.; Edlund, Christopher K.; Pardamean, Carissa I.; Conti, David V.; Krasnow, Ruth; Javitz, Harold S.; Hops, Hyman; Swan, Gary E.; Benowitz, Neal L.

    2016-01-01

    Introduction: Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3′-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Methods: Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. Results: African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). Conclusions: This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan

  19. Genome-Wide Linkage Analysis Identifies Loci for Physical Appearance Traits in Chickens.

    PubMed

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2015-08-06

    Physical appearance traits, such as feather-crested head, comb size and type, beard, wattles size, and feathered feet, are used to distinguish between breeds of chicken and also may be associated with economic traits. In this study, a genome-wide linkage analysis was used to identify candidate regions and genes for physical appearance traits and to potentially provide further knowledge of the molecular mechanisms that underlie these traits. The linkage analysis was conducted with an F2 population derived from Beijing-You chickens and a commercial broiler line. Single-nucleotide polymorphisms were analyzed using the Illumina 60K Chicken SNP Beadchip. The data were used to map quantitative trait loci and genes for six physical appearance traits. A 10-cM/0.51-Mb region (0.0-10.0 cM/0.00-0.51 Mb) with 1% genome-wide significance level on LGE22C19W28_E50C23 linkage group (LGE22) for crest trait was identified, which is likely very closely linked to HOXC8. A QTL with 5% chromosome-wide significance level for comb weight, which partly overlaps with a region identified in a previous study, was identified at 74 cM/25.55 Mb on chicken (Gallus gallus; GG) chromosome 3 (i.e., GGA3). For beard and wattles traits, an identical region 11 cM/2.23 Mb (0.0-11.0 cM/0.00-2.23 Mb) including WNT3 and GH genes on GGA27 was identified. Two QTL with 1% genome-wide significance level for feathered feet trait, one 9-cM/2.80-Mb (48.0-57.0 cM/13.40-16.20 Mb) region on GGA13, and another 12-cM/1.45-Mb (41.0-53.0 cM/11.37-12.82 Mb) region on GGA15 were identified. These candidate regions and genes provide important genetic information for the physical appearance traits in chicken. Copyright © 2015 Sun et al.

  20. Genome-Wide Linkage Analysis Identifies Loci for Physical Appearance Traits in Chickens

    PubMed Central

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2015-01-01

    Physical appearance traits, such as feather-crested head, comb size and type, beard, wattles size, and feathered feet, are used to distinguish between breeds of chicken and also may be associated with economic traits. In this study, a genome-wide linkage analysis was used to identify candidate regions and genes for physical appearance traits and to potentially provide further knowledge of the molecular mechanisms that underlie these traits. The linkage analysis was conducted with an F2 population derived from Beijing-You chickens and a commercial broiler line. Single-nucleotide polymorphisms were analyzed using the Illumina 60K Chicken SNP Beadchip. The data were used to map quantitative trait loci and genes for six physical appearance traits. A 10-cM/0.51-Mb region (0.0−10.0 cM/0.00−0.51 Mb) with 1% genome-wide significance level on the LGE22C19W28_E50C23 linkage group (LGE22) for the crest trait was identified, which is likely very closely linked to HOXC8. A QTL with 5% chromosome-wide significance level for comb weight, which partly overlaps with a region identified in a previous study, was identified at 74 cM/25.55 Mb on chicken (Gallus gallus; GG) chromosome 3 (i.e., GGA3). For beard and wattles traits, an identical 11-cM/2.23-Mb region (0.0−11.0 cM/0.00−2.23 Mb) including the WNT3 and GH genes on GGA27 was identified. Two QTL with 1% genome-wide significance level for the feathered feet trait, one 9-cM/2.80-Mb (48.0−57.0 cM/13.40−16.20 Mb) region on GGA13, and another 12-cM/1.45-Mb (41.0−53.0 cM/11.37−12.82 Mb) region on GGA15, were identified. These candidate regions and genes provide important genetic information for the physical appearance traits in chicken. PMID:26248982

  1. Genome-wide QTL mapping of nine body composition and bone mineral density traits in pigs.

    PubMed

    Rothammer, Sophie; Kremer, Prisca V; Bernau, Maren; Fernandez-Figares, Ignacio; Pfister-Schär, Jennifer; Medugorac, Ivica; Scholz, Armin M

    2014-10-28

    Since the pig is one of the most important livestock animals worldwide, mapping loci that are associated with economically important traits and/or traits that influence animal welfare is extremely relevant for efficient future pig breeding. Therefore, the purpose of this study was a genome-wide mapping of quantitative trait loci (QTL) associated with nine body composition and bone mineral traits: absolute (Fat, Lean) and percentage (FatPC, LeanPC) fat and lean mass, live weight (Weight), soft tissue X-ray attenuation coefficient (R), absolute (BMC) and percentage (BMCPC) bone mineral content and bone mineral density (BMD). Data on the nine traits investigated were obtained by Dual-energy X-ray absorptiometry for 551 pigs that were between 160 and 200 days old. In addition, all pigs were genotyped using Illumina's PorcineSNP60 Genotyping BeadChip. Based on these data, a genome-wide combined linkage and linkage disequilibrium analysis was conducted. Thus, we used 44 611 sliding windows that each consisted of 20 adjacent single nucleotide polymorphisms (SNPs). For the middle of each sliding window a variance component analysis was carried out using ASReml. The underlying mixed linear model included random QTL and polygenic effects, with fixed effects of sex, housing, season and age. Using a Bonferroni-corrected genome-wide significance threshold of P < 0.001, significant peaks were identified for all traits except BMCPC. Overall, we identified 72 QTL on 16 chromosomes, of which 24 were significantly associated with one trait only and the remaining with more than one trait. For example, a QTL on chromosome 2 included the highest peak across the genome for four traits (Fat, FatPC, LeanPC and R). The nearby gene, ZNF608, is known to be associated with body mass index in humans and involved in starvation in Drosophila, which makes it an extremely good candidate gene for this QTL. Our QTL mapping approach identified 72 QTL, some of which confirmed results of previous

  2. Employing genome-wide SNP discovery and genotyping strategy to extrapolate the natural allelic diversity and domestication patterns in chickpea.

    PubMed

    Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C L L; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K; Parida, Swarup K

    2015-01-01

    important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have got multidimensional applicability with respect to chickpea genomics-assisted breeding.

  3. Genome-wide association study reveals genetic architecture of coleoptile length in wheat.

    PubMed

    Li, Genqiao; Bai, Guihua; Carver, Brett F; Elliott, Norman C; Bennett, Rebecca S; Wu, Yanqi; Hunger, Robert; Bonman, J Michael; Xu, Xiangyang

    2017-02-01

    Eight QTL for coleoptile length were identified in a genome-wide association study on a set of 893 wheat accessions, four of which are novel loci. Wheat cultivars with long coleoptiles are preferred in wheat-growing regions where deep planting is practiced. However, the wide use of gibberellic acid (GA)-insensitive dwarfing genes, Rht-B1b and Rht-D1b, makes it challenging to breed dwarf wheat cultivars with long coleoptiles. To understand the genetic basis of coleoptile length, we performed a genome-wide association study on a set of 893 landraces and historical cultivars using 5011 single nucleotide polymorphism (SNP) markers. Structure analysis revealed four subgroups in the association panel. Association analysis results suggested that Rht-B1b and Rht-D1b genes significantly reduced coleoptile length, and eight additional quantitative trait loci (QTL) for coleoptile length were also identified. These QTL explained 1.45-3.18 and 1.36-3.11% of the phenotypic variation in 2015 and 2016, respectively, and their allelic substitution effects ranged from 0.31 to 1.75 cm in 2015, and 0.63-1.55 cm in 2016. Of the eight QTL, QCL.stars-1BS1, QCL.stars-2DS1, QCL.stars-4BS2, and QCL.stars-5BL1 are likely novel loci for coleoptile length. The favorable alleles in each accession ranged from two to eight with an average of 5.8 at eight loci in the panel, and more favorable alleles were significantly associated with longer coleoptile, suggesting that QTL pyramiding is an effective approach to increase wheat coleoptile length.

  4. Identification of Promising Mutants Associated with Egg Production Traits Revealed by Genome-Wide Association Study

    PubMed Central

    Dou, Taocun; Yi, Guoqiang; Qu, LuJiang; Qu, Liang; Wang, Kehua; Yang, Ning

    2015-01-01

    Egg number (EN), egg laying rate (LR) and age at first egg (AFE) are important production traits related to egg production in the poultry industry. To better understand the genetic architecture of dynamic EN during the whole laying cycle and to provide precise positions of associated variants for EN, LR and AFE, laying records from 21 to 72 weeks of age were collected individually for 1,534 F2 hens produced by reciprocal crosses between White Leghorn and Dongxiang Blue-shelled chicken, and their genotypes were assayed by chicken 600 K Affymetrix high density genotyping arrays. Subsequently, pedigree and SNP-based genetic parameters were estimated and a genome-wide association study (GWAS) was conducted on EN, LR and AFE. The heritability estimates were similar between pedigree and SNP-based estimates, varying from 0.17 to 0.36. In the GWA analysis, we identified nine genome-wide significant loci associated with EN of the laying periods from 21 to 26 weeks, 27 to 36 weeks and 37 to 72 weeks. Analysis of GTF2A1 and CLSPN suggested that they influenced the function of ovary and uterus, and may be considered as relevant candidates. The identified SNP rs314448799 for accumulative EN from 21 to 40 weeks on chromosome 5 created phenotypic differences of 6.86 eggs between two homozygous genotypes, which could be potentially applied to the molecular breeding for EN selection. Moreover, our finding showed that LR was a moderate polygenic trait. The suggestive significant region on chromosome 16 for AFE suggested a relationship between sexual maturity and immunity in the current population. The present study comprehensively evaluates the role of genetic variants in the development of egg laying. The findings will be helpful for investigating the function of causative genes and for future marker-assisted selection and genomic selection in chickens. PMID:26496084

  5. Genome-wide linkage analysis and association study identifies loci for polydactyly in chickens.

    PubMed

    Sun, Yanfa; Liu, Ranran; Zhao, Guiping; Zheng, Maiqing; Sun, Yan; Yu, Xiaoqiong; Li, Peng; Wen, Jie

    2014-04-21

    Polydactyly occurs in some chicken breeds, but the molecular mechanism remains incompletely understood. Combined genome-wide linkage analysis and association study (GWAS) for chicken polydactyly helps identify loci or candidate genes for the trait and potentially provides further mechanistic understanding of this phenotype in chickens and perhaps other species. The linkage analysis and GWAS for polydactyly were conducted using an F2 population derived from Beijing-You chickens and commercial broilers. The results identified two QTLs through linkage analysis and seven single-nucleotide polymorphisms (SNPs) through GWAS, associated with the polydactyly trait. One QTL located at 35 cM on GGA2 was significant at the 1% genome-wide level and another QTL at the 1% chromosome-wide significance level was detected at 39 cM on GGA19. A total of seven SNPs, four of 5% genome-wide significance (P < 2.98 × 10(-6)) and three of suggestive significance (P < 5.96 × 10(-5)), were identified, including two SNPs (GGaluGA132178 and Gga_rs14135036) in the QTL on GGA2. Of the identified SNPs, the eight nearest genes were sonic hedgehog (SHH), limb region 1 homolog (mouse) (LMBR1), dipeptidyl-peptidase 6, transcript variant 3 (DPP6), thyroid-stimulating hormone, beta (TSHB), sal-like 4 (Drosophila) (SALL4), par-6 partitioning defective 6 homolog beta (Caenorhabditis elegans) (PARD6B), coenzyme Q5 (COQ5), and tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, eta polypeptide (YWHAH). The GWAS supports earlier reports of the importance of SHH and LMBR1 as regulating genes for polydactyly in chickens and other species, and identified others, most of which have not previously been associated with limb development. The genes and associated SNPs revealed here provide detailed information for further exploring the molecular and developmental mechanisms underlying polydactyly.
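
    The two thresholds quoted in this abstract differ by a factor of 20, which is what one would expect if they were Bonferroni-style cutoffs of the form 0.05/N (genome-wide significance) and 1/N (suggestive significance) for the same number N of independent tests. The abstract does not state N or the correction method, so the sketch below is only an illustration under that assumption; the function name and the value N = 16,778 are hypothetical, chosen because they reproduce the reported figures to three significant digits.

```python
def bonferroni_thresholds(n_tests, alpha=0.05):
    """Return (genome-wide, suggestive) P-value thresholds for n_tests tests.

    Assumes a plain Bonferroni correction: alpha / n_tests for significance
    and 1 / n_tests (one expected false positive per scan) for suggestive.
    """
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    significant = alpha / n_tests
    suggestive = 1.0 / n_tests
    return significant, suggestive


# Hypothetical number of independent tests; not taken from the abstract.
sig, sug = bonferroni_thresholds(16_778)
print(f"{sig:.2e} {sug:.2e}")  # prints "2.98e-06 5.96e-05"
```

    With alpha fixed at 0.05, the suggestive threshold is always 20 times the significance threshold, regardless of N.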

  6. Genome-wide association mapping for root traits in a panel of rice accessions from Vietnam.

    PubMed

    Phung, Nhung Thi Phuong; Mai, Chung Duc; Hoang, Giang Thi; Truong, Hue Thi Minh; Lavarenne, Jeremy; Gonin, Mathieu; Nguyen, Khanh Le; Ha, Thuy Thi; Do, Vinh Nang; Gantet, Pascal; Courtois, Brigitte

    2016-03-10

    Despite recent sequencing efforts, local genetic resources remain underexploited, even though they carry alleles that can bring agronomic benefits. Taking advantage of the recent genotyping with 22,000 single-nucleotide polymorphism markers of a core collection of 180 Vietnamese rice varieties originating from provinces from North to South Vietnam and from different agrosystems characterized by contrasting water regimes, we have performed a genome-wide association study for different root parameters. Roots contribute to water stress avoidance and are still an underexploited target for breeding because they are difficult to observe. The panel of 180 rice varieties was phenotyped under greenhouse conditions for several root traits in an experimental design with 3 replicates. The phenotyping system consisted of long plastic bags that were filled with sand and supplemented with fertilizer. Root length, root mass in different layers, root thickness, and the number of crown roots, as well as several derived root parameters and shoot traits, were recorded. The results were submitted to association mapping using a mixed model involving structure and kinship to enable the identification of significant associations. The analyses were conducted successively on the whole panel and on its indica (115 accessions) and japonica (64 accessions) subcomponents. The two associations with the highest significance were for root thickness on chromosome 2 and for crown root number on chromosome 11. No common associations were detected between the indica and japonica subpanels, probably because of the distribution of polymorphisms between the subspecies. Based on orthology with Arabidopsis, the possible candidate genes underlying the quantitative trait loci are reviewed. Some of the major quantitative trait loci we detected through this genome-wide association study contain promising candidate genes encoding regulatory elements of known key regulators of root formation and development.

  7. Genome wide association mapping for grain shape traits in indica rice.

    PubMed

    Feng, Yue; Lu, Qing; Zhai, Rongrong; Zhang, Mengchen; Xu, Qun; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Wei, Xinghua

    2016-10-01

    Using genome-wide association mapping, 47 SNPs within 27 significant loci were identified for four grain shape traits, and 424 candidate genes were predicted from a public database. Grain shape is a key determinant of grain yield and quality in rice (Oryza sativa L.). However, our knowledge of genes controlling rice grain shape remains limited. Genome-wide association mapping based on linkage disequilibrium (LD) has recently emerged as an effective approach for identifying genes or quantitative trait loci (QTL) underlying complex traits in plants. In this study, association mapping based on 5291 single nucleotide polymorphisms (SNPs) was conducted to identify significant loci associated with grain shape traits in a global collection of 469 diverse rice accessions. A total of 47 SNPs were located in 27 significant loci for four grain traits, and explained approximately 44.93-65.90% of the phenotypic variation for each trait. In total, 424 candidate genes within a 200 kb extension region (±100 kb of each locus) of these loci were predicted. Of them, the cloned genes GS3 and qSW5 showed very strong effects on grain length and grain width in our study. Compared with previously reported QTLs for grain shape traits, we found 11 novel loci, including 3, 3, 2 and 3 loci for grain length, grain width, grain length-width ratio and thousand grain weight, respectively. Validation of these new loci will be performed in future studies. These results revealed that besides GS3 and qSW5, multiple novel loci and mechanisms were involved in determining rice grain shape. These findings provided valuable information for understanding of the genetic control of grain shape and molecular marker-assisted selection (MAS) breeding in rice.
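
    The candidate-gene step described here (collecting genes within ±100 kb of each significant locus) amounts to an interval-overlap query. The following is a minimal sketch of that idea only; the `Gene` record, the function name, and all coordinates are hypothetical placeholders, and the abstract does not say which database or overlap rule the authors used.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    name: str
    chrom: str
    start: int  # 1-based start position in bp
    end: int    # inclusive end position in bp


def genes_near_locus(genes, chrom, pos, flank=100_000):
    """Names of genes overlapping [pos - flank, pos + flank] on chrom."""
    lo, hi = max(0, pos - flank), pos + flank
    return [g.name for g in genes
            if g.chrom == chrom and g.start <= hi and g.end >= lo]


# Hypothetical annotation; gene names and coordinates are made up.
annotation = [
    Gene("geneA", "chr3", 16_650_000, 16_660_000),  # inside the window
    Gene("geneB", "chr3", 16_820_000, 16_905_000),  # overlaps the window edge
    Gene("geneC", "chr3", 17_200_000, 17_210_000),  # too far away
    Gene("geneD", "chr5", 16_700_000, 16_710_000),  # wrong chromosome
]
hits = genes_near_locus(annotation, "chr3", 16_730_000)
print(hits)  # prints "['geneA', 'geneB']"
```

    Here a gene counts as a candidate if any part of it falls inside the window; requiring the whole gene to lie inside would be a stricter alternative.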

  8. Genome-wide single nucleotide polymorphisms reveal population history and adaptive divergence in wild guppies.

    PubMed

    Willing, Eva-Maria; Bentzen, Paul; van Oosterhout, Cock; Hoffmann, Margarete; Cable, Joanne; Breden, Felix; Weigel, Detlef; Dreyer, Christine

    2010-03-01

    Adaptation of guppies (Poecilia reticulata) to contrasting upland and lowland habitats has been extensively studied with respect to behaviour, morphology and life history traits. Yet population history has not been studied at the whole-genome level. Although single nucleotide polymorphisms (SNPs) are the most abundant form of variation in many genomes and consequently very informative for a genome-wide picture of standing natural variation in populations, genome-wide SNP data are rarely available for wild vertebrates. Here we use genetically mapped SNP markers to comprehensively survey genetic variation within and among naturally occurring guppy populations from a wide geographic range in Trinidad and Venezuela. Results from three different clustering methods, Neighbor-net, principal component analysis (PCA) and Bayesian analysis, show that the population substructure agrees with geographic separation and largely with previously hypothesized patterns of historical colonization. Within major drainages (Caroni, Oropouche and Northern), populations are genetically similar, but those in different geographic regions are highly divergent from one another, with some indications of ancient shared polymorphisms. Clear genomic signatures of a previous introduction experiment were seen, and we detected additional potential admixture events. Headwater populations were significantly less heterozygous than downstream populations. Pairwise F(ST) values revealed marked differences in allele frequencies among populations from different regions, and also among populations within the same region. F(ST) outlier methods indicated some regions of the genome as being under directional selection. Overall, this study demonstrates the power of a genome-wide SNP data set to inform studies on natural variation, adaptation and evolution of wild populations.

  9. Genome-wide interaction studies reveal sex-specific asthma risk alleles

    PubMed Central

    Myers, Rachel A.; Scott, Nicole M.; Gauderman, W. James; Qiu, Weiliang; Mathias, Rasika A.; Romieu, Isabelle; Levin, Albert M.; Pino-Yanes, Maria; Graves, Penelope E.; Villarreal, Albino Barraza; Beaty, Terri H.; Carey, Vincent J.; Croteau-Chonka, Damien C.; del Rio Navarro, Blanca; Edlund, Christopher; Hernandez-Cadena, Leticia; Navarro-Olivos, Efrain; Padhukasahasram, Badri; Salam, Muhammad T.; Torgerson, Dara G.; Van den Berg, David J.; Vora, Hita; Bleecker, Eugene R.; Meyers, Deborah A.; Williams, L. Keoki; Martinez, Fernando D.; Burchard, Esteban G.; Barnes, Kathleen C.; Gilliland, Frank D.; Weiss, Scott T.; London, Stephanie J.; Raby, Benjamin A.; Ober, Carole; Nicolae, Dan L.

    2014-01-01

    Asthma is a complex disease with sex-specific differences in prevalence. Candidate gene studies have suggested that genotype-by-sex interaction effects on asthma risk exist, but this has not yet been explored at a genome-wide level. We aimed to identify sex-specific asthma risk alleles by performing a genome-wide scan for genotype-by-sex interactions in the ethnically diverse participants in the EVE Asthma Genetics Consortium. We performed male- and female-specific genome-wide association studies in 2653 male asthma cases, 2566 female asthma cases and 3830 non-asthma controls from European American, African American, African Caribbean and Latino populations. Association tests were conducted in each study sample, and the results were combined in ancestry-specific and cross-ancestry meta-analyses. Six sex-specific asthma risk loci had P-values < 1 × 10(-6), of which two were male specific and four were female specific; all were ancestry specific. The most significant sex-specific association in European Americans was at the interferon regulatory factor 1 (IRF1) locus on 5q31.1. We also identify a Latino female-specific association in RAP1GAP2. Both of these loci included single-nucleotide polymorphisms that are known expression quantitative trait loci and have been associated with asthma in independent studies. The IRF1 locus is a strong candidate region for male-specific asthma susceptibility due to the association and validation we demonstrate here, the known role of IRF1 in asthma-relevant immune pathways and prior reports of sex-specific differences in interferon responses. PMID:24824216

  10. A genome-wide analysis of gene-caffeine consumption interaction on basal cell carcinoma.

    PubMed

    Li, Xin; Cornelis, Marilyn C; Liang, Liming; Song, Fengju; De Vivo, Immaculata; Giovannucci, Edward; Tang, Jean Y; Han, Jiali

    2016-12-01

    Animal models have suggested that oral or topical administration of caffeine could inhibit ultraviolet-induced carcinogenesis via the ataxia telangiectasia and rad3 (ATR)-related apoptosis. Previous epidemiological studies have demonstrated that increased caffeine consumption is associated with reduced risk of basal cell carcinoma (BCC). To identify common genetic markers that may modify this association, we tested gene-caffeine intake interaction on BCC risk in a genome-wide analysis. We included 3383 BCC cases and 8528 controls of European ancestry from the Nurses' Health Study and Health Professionals Follow-up Study. Single nucleotide polymorphism (SNP) rs142310826 near the NEIL3 gene showed a genome-wide significant interaction with caffeine consumption (P = 1.78 × 10(-8) for interaction) on BCC risk. There was no gender difference for this interaction (P = 0.64 for heterogeneity). NEIL3, a gene belonging to the base excision DNA repair pathway, encodes a DNA glycosylase that recognizes and removes lesions produced by oxidative stress. In addition, we identified several loci with P value for interaction <5 × 10(-7) in gender-specific analyses (P for heterogeneity between genders < 0.001) including those mapping to the genes LRRTM4, ATF3 and DCLRE1C in women and POTEA in men. Finally, we tested the associations between caffeine consumption-related SNPs reported by previous genome-wide association studies and risk of BCC, both individually and jointly, but found no significant association. In sum, we identified a DNA repair gene that could be involved in caffeine-mediated skin tumor inhibition. Further studies are warranted to confirm these findings. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  11. Evaluation of potential power gain with imputed genotypes in genome-wide association studies.

    PubMed

    Becker, Tim; Flaquer, Antonia; Brockschmidt, Felix F; Herold, Christine; Steffens, Michael

    2009-01-01

    With the beginning of the era of genome-wide association studies, methods to obtain 'in silico' genotypes have gained importance. In this context, an evaluation of genome-wide power levels of current marker panels and the power gain achievable with imputed genotypes are of high interest. Power for single-marker analysis of imputed genotypes is evaluated via a simulation study based on HapMap data. Power values for genome-wide significance of marker panels of 1,000,000 SNPs are considered for small effect sizes typical of common diseases and large case-control samples. In order to evaluate the performance of imputing, we consider a method that is conceptually related to previous approaches. We introduce various modifications which together lead to an alternative implementation of the imputation idea. In particular, a Monte-Carlo (MC) simulation method for association testing of imputed markers is introduced. We show that the incorporation of imputed genotypes can lead to a substantial power gain for common disease variants if the training sample is large enough. In addition, we show that the MC approach is valuable for validating association results obtained with imputed genotypes. Our simulation study also shows that even denser marker panels than those currently available are needed when sample size is limited. We thus expect that full genome SNP panels will lead to the identification of additional disease variants in the future. Until then, it is desirable that large and ethnically matched training samples genotyped on dense marker panels are available in each country. (c) 2009 S. Karger AG, Basel.

  12. Family-based genome-wide association scan of attention-deficit/hyperactivity disorder.

    PubMed

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F; Reiersen, Angela M; Wilens, Timothy E; Wozniak, Janet; Neale, Benjamin M; Faraone, Stephen V

    2010-09-01

    Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of DSM-IV-TR ADHD. Families were ascertained at Massachusetts General Hospital (MGH; N = 309 trios), Washington University at St. Louis (WASH-U; N = 272 trios), and University of California at Los Angeles (UCLA; N = 156 trios). Genotyping was conducted with the Illumina Human1M or Human1M-Duo BeadChip platforms. After applying quality control filters, association with ADHD was tested with 835,136 SNPs in 735 DSM-IV ADHD trios from 732 families. Our smallest p value (6.7E-07) did not reach the threshold for genome-wide statistical significance (5.0E-08), but one of the 20 most significant associations was located in a candidate gene of interest for ADHD (SLC9A9, rs9810857, p = 6.4E-6). We also conducted gene-based tests of candidate genes identified in the literature and found additional evidence of association with SLC9A9. We and our colleagues in the Psychiatric GWAS Consortium are working to pool together GWAS samples to establish the large data sets needed to follow-up on these results and to identify genes for ADHD and other disorders. 2010 American Academy of Child and Adolescent Psychiatry. Published by Elsevier Inc. All rights reserved.

  13. Genomic prediction of breeding values for carcass traits in Nellore cattle.

    PubMed

    Fernandes Júnior, Gerardo A; Rosa, Guilherme J M; Valente, Bruno D; Carvalheiro, Roberto; Baldi, Fernando; Garcia, Diogo A; Gordo, Daniel G M; Espigolan, Rafael; Takada, Luciana; Tonussi, Rafael L; de Andrade, Willian B F; Magalhães, Ana F B; Chardulo, Luis A L; Tonhati, Humberto; de Albuquerque, Lucia G

    2016-01-29

    The objective of this study was to evaluate the accuracy of genomic predictions for rib eye area (REA), backfat thickness (BFT), and hot carcass weight (HCW) in Nellore beef cattle from Brazilian commercial herds using different prediction models. Phenotypic data from 1756 Nellore steers from ten commercial herds in Brazil were used. Animals were offspring of 294 sires and 1546 dams, reared on pasture, feedlot finished, and slaughtered at approximately 2 years of age. All animals were genotyped using a 777k Illumina Bovine HD SNP chip. Accuracy of genomic predictions of breeding values was evaluated by using a 5-fold cross-validation scheme and considering three models: Bayesian ridge regression (BRR), Bayes C (BC) and Bayesian Lasso (BL), and two types of response variables: traditional estimated breeding value (EBV), and phenotype adjusted for fixed effects (Y*). The prediction accuracies achieved with the BRR model were equal to 0.25 (BFT), 0.33 (HCW) and 0.36 (REA) when EBV was used as response variable, and 0.21 (BFT), 0.37 (HCW) and 0.46 (REA) when using Y*. Results obtained with the BC and BL models were similar. Accuracies increased for traits with a higher heritability, and using Y* instead of EBV as response variable resulted in higher accuracy when heritability was higher. Our results indicate that the accuracy of genomic prediction of carcass traits in Nellore cattle is moderate to high. Prediction of genomic breeding values from adjusted phenotypes Y* was more accurate than from EBV, especially for highly heritable traits. The three models considered (BRR, BC and BL) led to similar predictive abilities and, thus, either one could be used to implement genomic prediction for carcass traits in Nellore cattle.
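
    The evaluation scheme described in this abstract (5-fold cross-validation of a marker-based prediction model) can be sketched as follows. This is not the authors' pipeline: plain ridge regression with a fixed penalty stands in for the Bayesian models (BRR, BC, BL), accuracy is assumed here to mean the correlation between predicted and observed responses in the held-out fold, and the genotypes and phenotypes are simulated. All function names, parameters, and data are hypothetical.

```python
import numpy as np


def ridge_fit(X, y, lam):
    """Closed-form ridge regression on centered data."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    A = Xc.T @ Xc + lam * np.eye(X.shape[1])
    beta = np.linalg.solve(A, Xc.T @ (y - y_mean))
    return beta, x_mean, y_mean


def cv_accuracy(X, y, lam=1.0, k=5, seed=0):
    """Mean correlation between predictions and observations over k folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    accuracies = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
        beta, x_mean, y_mean = ridge_fit(X[train_idx], y[train_idx], lam)
        predicted = (X[test_idx] - x_mean) @ beta + y_mean
        accuracies.append(np.corrcoef(predicted, y[test_idx])[0, 1])
    return float(np.mean(accuracies))


# Simulated data: 200 animals, 50 SNPs coded 0/1/2, additive effects plus noise.
rng = np.random.default_rng(42)
X = rng.integers(0, 3, size=(200, 50)).astype(float)
effects = rng.normal(0.0, 1.0, size=50)
y = X @ effects + rng.normal(0.0, 3.0, size=200)
acc = cv_accuracy(X, y)
print(f"mean 5-fold accuracy: {acc:.2f}")
```

    Each animal is used for testing exactly once, so the averaged correlation reflects prediction of records the model did not see during fitting.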

  14. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions

    PubMed Central

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile JW; de Moor, Marleen HM; Madden, Pamela AF; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-01-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10(-06), KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene. PMID:23211697

  15. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions.

    PubMed

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile J W; de Moor, Marleen H M; Madden, Pamela A F; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-08-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10(-06), KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene.

  16. Analysis of genome-wide structure, diversity and fine mapping of Mendelian traits in traditional and village chickens

    PubMed Central

    Wragg, D; Mwacharo, J M; Alcalde, J A; Hocking, P M; Hanotte, O

    2012-01-01

    Extensive phenotypic variation is a common feature among village chickens found throughout much of the developing world, and in traditional chicken breeds that have been artificially selected for traits such as plumage variety. We present here an assessment of traditional and village chicken populations, for fine mapping of Mendelian traits using genome-wide single-nucleotide polymorphism (SNP) genotyping while providing information on their genetic structure and diversity. Bayesian clustering analysis reveals two main genetic backgrounds in traditional breeds, Kenyan, Ethiopian and Chilean village chickens. Analysis of linkage disequilibrium (LD) reveals useful LD (r² ≥ 0.3) in both traditional and village chickens at pairwise marker distances of ∼10 Kb; while haplotype block analysis indicates a median block size of 11–12 Kb. Association mapping yielded refined mapping intervals for duplex comb (Gga 2:38.55–38.89 Mb) and rose comb (Gga 7:18.41–22.09 Mb) phenotypes in traditional breeds. Combined mapping information from traditional breeds and Chilean village chicken allows the oocyan phenotype to be fine mapped to two small regions (Gga 1:67.25–67.28 Mb, Gga 1:67.28–67.32 Mb) totalling ∼75 Kb. Mapping the unmapped earlobe pigmentation phenotype supports previous findings that the trait is sex-linked and polygenic. A critical assessment of the number of SNPs required to map simple traits indicates that between 90 and 110K SNPs are required for full genome-wide analysis of haplotype block structure/ancestry, and for association mapping in both traditional and village chickens. Our results demonstrate the importance and uniqueness of phenotypic diversity and genetic structure of traditional chicken breeds for fine-scale mapping of Mendelian traits in the species, with village chicken populations providing further opportunities to enhance mapping resolutions. PMID:22395157

  17. Analysis of genome-wide structure, diversity and fine mapping of Mendelian traits in traditional and village chickens.

    PubMed

    Wragg, D; Mwacharo, J M; Alcalde, J A; Hocking, P M; Hanotte, O

    2012-07-01

    Extensive phenotypic variation is a common feature among village chickens found throughout much of the developing world, and in traditional chicken breeds that have been artificially selected for traits such as plumage variety. We present here an assessment of traditional and village chicken populations, for fine mapping of Mendelian traits using genome-wide single-nucleotide polymorphism (SNP) genotyping while providing information on their genetic structure and diversity. Bayesian clustering analysis reveals two main genetic backgrounds in traditional breeds, Kenyan, Ethiopian and Chilean village chickens. Analysis of linkage disequilibrium (LD) reveals useful LD (r(2) ≥ 0.3) in both traditional and village chickens at pairwise marker distances of ~10 Kb; while haplotype block analysis indicates a median block size of 11-12 Kb. Association mapping yielded refined mapping intervals for duplex comb (Gga 2:38.55-38.89 Mb) and rose comb (Gga 7:18.41-22.09 Mb) phenotypes in traditional breeds. Combined mapping information from traditional breeds and Chilean village chicken allows the oocyan phenotype to be fine mapped to two small regions (Gga 1:67.25-67.28 Mb, Gga 1:67.28-67.32 Mb) totalling ~75 Kb. Mapping the unmapped earlobe pigmentation phenotype supports previous findings that the trait is sex-linked and polygenic. A critical assessment of the number of SNPs required to map simple traits indicates that between 90 and 110K SNPs are required for full genome-wide analysis of haplotype block structure/ancestry, and for association mapping in both traditional and village chickens. Our results demonstrate the importance and uniqueness of phenotypic diversity and genetic structure of traditional chicken breeds for fine-scale mapping of Mendelian traits in the species, with village chicken populations providing further opportunities to enhance mapping resolutions.
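
    The "useful LD" criterion quoted in the abstract above (r² of at least 0.3) can be illustrated with a minimal sketch. It assumes phased haplotypes coded 0/1 at two biallelic SNPs; the function name and the toy haplotypes are hypothetical and are not taken from the study, which does not state how r² was computed.

```python
def ld_r2(hap_a, hap_b):
    """Return r^2 between two biallelic loci from phased haplotypes coded 0/1."""
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype lists must be non-empty and of equal length")
    p_a = sum(hap_a) / n                      # frequency of allele 1 at locus A
    p_b = sum(hap_b) / n                      # frequency of allele 1 at locus B
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a locus is monomorphic")
    return d * d / denom


# Toy haplotypes (hypothetical): eight chromosomes typed at two SNPs.
snp1 = [1, 1, 1, 0, 0, 0, 0, 0]
snp2 = [1, 1, 0, 0, 0, 0, 0, 1]
r2 = ld_r2(snp1, snp2)
is_useful = r2 >= 0.3                         # threshold quoted in the abstract
```

    With unphased genotype data, haplotype frequencies would first have to be estimated, a step this sketch does not cover.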

  18. A Genome-Wide Association Study of Total Bilirubin and Cholelithiasis Risk in Sickle Cell Anemia

    PubMed Central

    Milton, Jacqueline N.; Sebastiani, Paola; Solovieff, Nadia; Hartley, Stephen W.; Bhatnagar, Pallav; Arking, Dan E.; Dworkis, Daniel A.; Casella, James F.; Barron-Casella, Emily; Bean, Christopher J.; Hooper, W. Craig; DeBaun, Michael R.; Garrett, Melanie E.; Soldano, Karen; Telen, Marilyn J.; Ashley-Koch, Allison; Gladwin, Mark T.; Baldwin, Clinton T.; Steinberg, Martin H.; Klings, Elizabeth S.

    2012-01-01

    Serum bilirubin levels have been associated with polymorphisms in the UGT1A1 promoter in normal populations and in patients with hemolytic anemias, including sickle cell anemia. When hemolysis occurs, circulating heme increases, leading to elevated bilirubin levels and an increased incidence of cholelithiasis. We performed the first genome-wide association study (GWAS) of bilirubin levels and cholelithiasis risk in a discovery cohort of 1,117 sickle cell anemia patients. We found 15 single nucleotide polymorphisms (SNPs) associated with total bilirubin levels at the genome-wide significance level (p value < 5×10(-8)). SNPs in UGT1A1, UGT1A3, UGT1A6, UGT1A8 and UGT1A10, different isoforms within the UGT1A locus, were identified (most significant rs887829, p = 9.08×10(-25)). All of these associations were validated in 4 independent sets of sickle cell anemia patients. We tested the association of the 15 SNPs with cholelithiasis in the discovery cohort and found a significant association (most significant p value 1.15×10(-4)). These results confirm that the UGT1A region is the major regulator of bilirubin metabolism in African Americans with sickle cell anemia, similar to what is observed in other ethnicities. PMID:22558097
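
    The genome-wide significance level of 5×10^-8 cited in the abstract above is conventionally read as a Bonferroni correction of a 0.05 family-wise error rate over roughly one million independent tests. The sketch below shows that arithmetic and a simple filter. The helper names and the second SNP entry are hypothetical; only the rs887829 p value comes from the abstract.

```python
import math

GENOME_WIDE_ALPHA = 5e-8  # conventional genome-wide significance level


def bonferroni_threshold(family_alpha, n_tests):
    """Per-test threshold that keeps the family-wise error rate at family_alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return family_alpha / n_tests


def genome_wide_hits(p_values, threshold=GENOME_WIDE_ALPHA):
    """Return the SNP identifiers whose p value falls below the threshold."""
    return sorted(snp for snp, p in p_values.items() if p < threshold)


# rs887829 is reported in the abstract; the second entry is a made-up example
# of a SNP that is suggestive but not genome-wide significant.
p_values = {"rs887829": 9.08e-25, "rs_hypothetical": 3.0e-6}
hits = genome_wide_hits(p_values)
```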

  19. Developments in obesity genetics in the era of genome-wide association studies.

    PubMed

    Day, Felix R; Loos, Ruth J F

    2011-01-01

    Obesity is an important factor contributing to the global burden of morbidity and mortality. By identifying obesity susceptibility genes, scientists aim to elucidate some of its aetiology. Early studies used candidate gene and genome-wide linkage approaches to search for such genes with limited success. However, the advent of genome-wide association studies (GWAS) has dramatically increased the pace of gene discovery. So far, GWAS have identified at least 50 loci robustly associated with body mass index (BMI), waist-to-hip ratio, body fat percentage and extreme obesity. Some of these have been shown to replicate in non-white populations and in children and adolescents. Furthermore, for some loci interaction studies have shown that the BMI-increasing effect is attenuated in physically active individuals. Despite many successful discoveries, the effect sizes of the established loci are small, and combined they explain only a fraction of the inter-individual variation in BMI. The low predictive value means that their value in mainstream health care is limited. However, as most of these newly established loci were not previously linked to obesity, they may provide new insights into body weight regulation. Continued efforts in gene discovery, using a range of approaches, will be needed to increase our understanding of obesity. Copyright © 2011 S. Karger AG, Basel.

  20. Genome-wide association analyses in East Asians identify new susceptibility loci for colorectal cancer

    PubMed Central

    Jia, Wei-Hua; Zhang, Ben; Matsuo, Keitaro; Shin, Aesun; Xiang, Yong-Bing; Jee, Sun Ha; Kim, Dong-Hyun; Ren, Zefang; Cai, Qiuyin; Long, Jirong; Shi, Jiajun; Wen, Wanqing; Yang, Gong; Delahanty, Ryan J.; Ji, Bu-Tian; Pan, Zhi-Zhong; Matsuda, Fumihiko; Gao, Yu-Tang; Oh, Jae Hwan; Ahn, Yoon-Ok; Park, Eun Jung; Li, Hong-Lan; Park, Ji Won; Jo, Jaeseong; Jeong, Jin-Young; Hosono, Satoyo; Casey, Graham; Peters, Ulrike; Shu, Xiao-Ou; Zeng, Yi-Xin; Zheng, Wei

    2013-01-01

    To identify novel genetic factors for colorectal cancer (CRC), we conducted a genome-wide association study in East Asians. By analyzing genome-wide data in 2,098 cases and 5,749 controls, we selected 64 promising SNPs for replication in an independent set of samples including up to 5,358 cases and 5,922 controls. We identified four SNPs with a P-value of 8.58 × 10(-7) to 3.77 × 10(-10) in the combined analysis of all East Asian samples. Three of the four SNPs were replicated in a study conducted among 26,060 European descendants with a combined P-value of 1.22 × 10(-10) for rs647161 (5q31.1), 6.64 × 10(-9) for rs2423279 (20p12.3), and 3.06 × 10(-8) for rs10774214 (12p13.32 near the CCND2 gene), respectively, derived from the meta-analysis of data from both East Asian and European populations. This study identified three new CRC susceptibility loci and provides additional insight into the genetics and biology of CRC. PMID:23263487

  1. A powerful test of independent assortment that determines genome-wide significance quickly and accurately

    PubMed Central

    Stewart, W C L; Hager, V R

    2016-01-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD—a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance—the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  2. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep.

    PubMed

    Mousel, Michelle R; Reynolds, James O; White, Stephen N

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component; therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x10(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection.
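
    The four inheritance models named in the abstract above differ only in how a genotype is turned into regression predictors. The sketch below shows one common coding, assuming genotypes are stored as minor-allele counts (0, 1 or 2). It illustrates the general idea and is not PLINK's internal implementation; the function name is hypothetical.

```python
def encode_genotype(minor_allele_count, model):
    """Map a minor-allele count (0, 1 or 2) to predictor values for a model.

    Returns a tuple because the genotypic model needs two indicator columns.
    """
    g = minor_allele_count
    if g not in (0, 1, 2):
        raise ValueError("minor_allele_count must be 0, 1 or 2")
    if model == "additive":
        return (g,)                           # each copy adds the same effect
    if model == "dominant":
        return (1 if g >= 1 else 0,)          # one copy is enough
    if model == "recessive":
        return (1 if g == 2 else 0,)          # two copies are required
    if model == "genotypic":
        # Separate indicators for heterozygotes and minor-allele homozygotes
        # (two degrees of freedom, no assumed relation between the effects).
        return (1 if g == 1 else 0, 1 if g == 2 else 0)
    raise ValueError("unknown model: " + model)


# Predictor values for a heterozygous animal under each model.
heterozygote = {m: encode_genotype(1, m)
                for m in ("additive", "dominant", "recessive", "genotypic")}
```

    Each coded column would then enter a logistic regression with case status as the response. A SNP that is significant only under one coding, as reported for several markers above, hints at the corresponding mode of inheritance.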

  3. Genome-Wide Association Identifies SLC2A9 and NLN Gene Regions as Associated with Entropion in Domestic Sheep

    PubMed Central

    Mousel, Michelle R.; Reynolds, James O.; White, Stephen N.

    2015-01-01

    Entropion is an inward rolling of the eyelid allowing contact between the eyelashes and cornea that may lead to blindness if not corrected. Although many mammalian species, including humans and dogs, are afflicted by congenital entropion, no specific genes or gene regions related to development of entropion have been reported in any mammalian species to date. Entropion in domestic sheep is known to have a genetic component; therefore, we used domestic sheep as a model system to identify genomic regions containing genes associated with entropion. A genome-wide association was conducted with congenital entropion in 998 Columbia, Polypay, and Rambouillet sheep genotyped with 50,000 SNP markers. Prevalence of entropion was 6.01%, with all breeds represented. Logistic regression was performed in PLINK with additive allelic, recessive, dominant, and genotypic inheritance models. Two genome-wide significant (empirical P<0.05) SNP were identified, specifically markers in SLC2A9 (empirical P = 0.007; genotypic model) and near NLN (empirical P = 0.026; dominance model). Six additional genome-wide suggestive SNP (nominal P<1x10(-5)) were identified including markers in or near PIK3CB (P = 2.22x10(-6); additive model), KCNB1 (P = 2.93x10(-6); dominance model), ZC3H12C (P = 3.25x10(-6); genotypic model), JPH1 (P = 4.68x10(-6); genotypic model), and MYO3B (P = 5.74x10(-6); recessive model). This is the first report of specific gene regions associated with congenital entropion in any mammalian species, to our knowledge. Further, none of these genes have previously been associated with any eyelid traits. These results represent the first genome-wide analysis of gene regions associated with entropion and provide target regions for the development of sheep genetic markers for marker-assisted selection. PMID:26098909

  4. Breast cancer prediction using genome wide single nucleotide polymorphism data

    PubMed Central

    2013-01-01

    Background: This paper introduces and applies a genome wide predictive study to learn a model that predicts whether a new subject will develop breast cancer or not, based on her SNP profile. Results: We first genotyped 696 female subjects (348 breast cancer cases and 348 apparently healthy controls), predominantly of Caucasian origin from Alberta, Canada using Affymetrix Human SNP 6.0 arrays. Then, we applied the EIGENSTRAT population stratification correction method to remove 73 subjects not belonging to the Caucasian population. Then, we filtered out any SNP that had any missing calls, whose genotype frequency deviated from Hardy-Weinberg equilibrium, or whose minor allele frequency was less than 5%. Finally, we applied a combination of the MeanDiff feature selection method and the KNN learning method to this filtered dataset to produce a breast cancer prediction model. LOOCV accuracy of this classifier is 59.55%. Random permutation tests show that this result is significantly better than the baseline accuracy of 51.52%. Sensitivity analysis shows that the classifier is fairly robust to the number of MeanDiff-selected SNPs. External validation on the CGEMS breast cancer dataset, the only other publicly available breast cancer dataset, shows that this combination of MeanDiff and KNN leads to a LOOCV accuracy of 60.25%, which is significantly better than its baseline of 50.06%. We then considered a dozen different combinations of feature selection and learning method, but found that none of these combinations produces a better predictive model than our model. We also considered various biological feature selection methods like selecting SNPs reported in recent genome wide association studies to be associated with breast cancer, selecting SNPs in genes associated with KEGG cancer pathways, or selecting SNPs associated with breast cancer in the F-SNP database to produce predictive models, but again found that none of these models achieved accuracy better than baseline.
Conclusions
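
    The pipeline described in the abstract above (MeanDiff feature selection, a KNN classifier, leave-one-out cross-validation) can be sketched in a few lines. This is an illustration under stated assumptions rather than the authors' implementation: MeanDiff is assumed to rank SNPs by the absolute difference in mean genotype between cases and controls, KNN is assumed to use squared Euclidean distance with a majority vote, and all function names and the toy data are hypothetical.

```python
from collections import Counter


def mean_diff_select(X, y, n_features):
    """Return indices of the SNPs with the largest |mean(cases) - mean(controls)|."""
    cases = [row for row, label in zip(X, y) if label == 1]
    controls = [row for row, label in zip(X, y) if label == 0]
    scores = []
    for j in range(len(X[0])):
        mean_case = sum(row[j] for row in cases) / len(cases)
        mean_ctrl = sum(row[j] for row in controls) / len(controls)
        scores.append((abs(mean_case - mean_ctrl), j))
    scores.sort(key=lambda s: (-s[0], s[1]))  # largest difference first
    return [j for _, j in scores[:n_features]]


def knn_predict(train_X, train_y, query, k):
    """Majority vote among the k training rows closest to the query."""
    dists = sorted(
        (sum((a - b) ** 2 for a, b in zip(row, query)), i)
        for i, row in enumerate(train_X)
    )
    votes = Counter(train_y[i] for _, i in dists[:k])
    return votes.most_common(1)[0][0]


def loocv_accuracy(X, y, n_features, k):
    """Leave-one-out accuracy, re-selecting SNPs inside every fold."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        cols = mean_diff_select(train_X, train_y, n_features)
        reduced = [[row[j] for j in cols] for row in train_X]
        query = [X[i][j] for j in cols]
        if knn_predict(reduced, train_y, query, k) == y[i]:
            correct += 1
    return correct / len(X)


# Toy genotypes (minor-allele counts): SNP 0 separates cases (1) from controls (0).
X = [[2, 1, 0], [2, 0, 1], [2, 1, 1], [0, 1, 0], [0, 0, 1], [0, 1, 1]]
y = [1, 1, 1, 0, 0, 0]
accuracy = loocv_accuracy(X, y, n_features=1, k=1)
```

    Re-running the selection inside each fold keeps the held-out subject from influencing which SNPs are chosen. That is a design choice of this sketch; the abstract does not say how the authors handled it.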

  5. Genome-wide association study of circulating vitamin D levels.

    PubMed

    Ahn, Jiyoung; Yu, Kai; Stolzenberg-Solomon, Rachael; Simon, K Claire; McCullough, Marjorie L; Gallicchio, Lisa; Jacobs, Eric J; Ascherio, Alberto; Helzlsouer, Kathy; Jacobs, Kevin B; Li, Qizhai; Weinstein, Stephanie J; Purdue, Mark; Virtamo, Jarmo; Horst, Ronald; Wheeler, William; Chanock, Stephen; Hunter, David J; Hayes, Richard B; Kraft, Peter; Albanes, Demetrius

    2010-07-01

    The primary circulating form of vitamin D, 25-hydroxy-vitamin D [25(OH)D], is associated with multiple medical outcomes, including rickets, osteoporosis, multiple sclerosis and cancer. In a genome-wide association study (GWAS) of 4501 persons of European ancestry drawn from five cohorts, we identified single-nucleotide polymorphisms (SNPs) in the gene encoding group-specific component (vitamin D binding) protein, GC, on chromosome 4q12-13 that were associated with 25(OH)D concentrations: rs2282679 (P=2.0x10(-30)), in linkage disequilibrium (LD) with rs7041, a non-synonymous SNP (D432E; P=4.1x10(-22)) and rs1155563 (P=3.8x10(-25)). Suggestive signals for association with 25(OH)D were also observed for SNPs in or near three other genes involved in vitamin D synthesis or activation: rs3829251 on chromosome 11q13.4 in NADSYN1 [encoding nicotinamide adenine dinucleotide (NAD) synthetase; P=8.8x10(-7)], which was in high LD with rs1790349, located in DHCR7, the gene encoding 7-dehydrocholesterol reductase that synthesizes cholesterol from 7-dehydrocholesterol; rs6599638 in the region harboring the open-reading frame 88 (C10orf88) on chromosome 10q26.13 in the vicinity of ACADSB (acyl-Coenzyme A dehydrogenase), involved in cholesterol and vitamin D synthesis (P=3.3x10(-7)); and rs2060793 on chromosome 11p15.2 in CYP2R1 (cytochrome P450, family 2, subfamily R, polypeptide 1, encoding a key C-25 hydroxylase that converts vitamin D3 to an active vitamin D receptor ligand; P=1.4x10(-5)). We genotyped SNPs in these four regions in 2221 additional samples and confirmed strong genome-wide significant associations with 25(OH)D through meta-analysis with the GWAS data for GC (P=1.8x10(-49)), NADSYN1/DHCR7 (P=3.4x10(-9)) and CYP2R1 (P=2.9x10(-17)), but not C10orf88 (P=2.4x10(-5)).

  6. Assessment of expected breeding values for fertility traits of Murrah buffaloes under subtropical climate

    PubMed Central

    Dash, Soumya; Chakravarty, A. K.; Singh, Avtar; Shivahre, Pushp Raj; Upadhyay, Arpan; Sah, Vaishali; Singh, K. Mahesh

    2015-01-01

    Aim: The aim of the present study was to assess the influence of temperature and humidity prevalent under subtropical climate on the breeding values for fertility traits viz. service period (SP), pregnancy rate (PR) and conception rate (CR) of Murrah buffaloes in National Dairy Research Institute (NDRI) herd. Materials and Methods: Fertility data on 1379 records of 581 Murrah buffaloes spread over four lactations and climatic parameters viz. dry bulb temperature and relative humidity (RH) spanned over 20 years (1993-2012) were collected from NDRI and Central Soil and Salinity Research Institute, Karnal, India. Monthly average temperature humidity index (THI) values were estimated. Threshold THI value affecting fertility traits was identified by fixed least-squares model analysis. Each year was divided into three zones: non-heat stress, heat stress and critical heat stress. The genetic parameters heritability (h²) and repeatability (r) of each fertility trait were estimated. Genetic evaluation of Murrah buffaloes was performed in each zone with respect to their expected breeding values (EBV) for fertility traits. Results: Effect of THI was found significant (p<0.001) on all fertility traits with threshold THI value identified as 75. Based on THI values, a year was classified into three zones: non-heat stress zone (non-HSZ; THI 56.71-73.21), heat stress zone (HSZ; THI 75.39-81.60) and critical HSZ (THI 80.27-81.60). The EBV for SP, PR and CR were estimated as 138.57 days, 0.362 and 69.02% in non-HSZ, while in HSZ EBV were found as 139.62 days, 0.358 and 68.81%, respectively. In critical HSZ, EBV for SP increased to 140.92 days, while EBV for PR and CR declined to 0.357 and 68.71%. Conclusion: The negative effect of THI was observed on EBV of fertility traits under the non-HSZ and critical HSZ. Thus, the influence of THI should be adjusted before estimating the breeding values for fertility traits in Murrah buffaloes. PMID:27047091

  7. Genetic diversity and genome-wide association analysis of cooking time in dry bean (Phaseolus vulgaris L.).

    PubMed

    Cichy, Karen A; Wiesinger, Jason A; Mendoza, Fernando A

    2015-08-01

    Fivefold diversity for cooking time found in a panel of 206 Phaseolus vulgaris accessions. Fastest accession cooks nearly 20 min faster than average. SNPs associated with cooking time on Pv02, 03, and 06. Dry beans (Phaseolus vulgaris L.) are a nutrient dense food and a dietary staple in parts of Africa and Latin America. One of the major factors that limits greater utilization of beans is their long cooking times compared to other foods. Cooking time is an important trait with implications for gender equity, nutritional value of diets, and energy utilization. Very little is known about the genetic diversity and genomic regions involved in determining cooking time. The objective of this research was to assess cooking time on a panel of 206 P. vulgaris accessions, use genome-wide association analysis (GWAS) to identify genomic regions influencing this trait, and to test the ability to predict cooking time by raw seed characteristics. In this study 5.5-fold variation for cooking time was found and five bean accessions were identified which cook in less than 27 min across 2 years, where the average cooking time was 37 min. One accession, ADP0367, cooked nearly 20 min faster than average. Four of these five accessions showed close phylogenetic relationship based on a NJ tree developed with ~5000 SNP markers, suggesting a potentially similar underlying genetic mechanism. GWAS revealed regions on chromosomes Pv02, Pv03, and Pv06 associated with cooking time. Vis/NIR scanning of raw seed explained 68% of the phenotypic variation for cooking time, suggesting that, with additional experimentation, it may be possible to use this spectroscopy method to non-destructively identify fast cooking lines as part of a breeding program.

  8. Prediction of heterosis using genome-wide SNP-marker data: application to egg production traits in white Leghorn crosses.

    PubMed

    Amuzu-Aweh, E N; Bijma, P; Kinghorn, B P; Vereijken, A; Visscher, J; van Arendonk, J Am; Bovenhuis, H

    2013-12-01

    Prediction of heterosis has a long history with mixed success, partly due to low numbers of genetic markers and/or small data sets. We investigated the prediction of heterosis for egg number, egg weight and survival days in domestic white Leghorns, using ∼400 000 individuals from 47 crosses and allele frequencies on ∼53 000 genome-wide single nucleotide polymorphisms (SNPs). When heterosis is due to dominance, and dominance effects are independent of allele frequencies, heterosis is proportional to the squared difference in allele frequency (SDAF) between parental pure lines (not necessarily homozygous). Under these assumptions, a linear model including regression on SDAF partitions crossbred phenotypes into pure-line values and heterosis, even without pure-line phenotypes. We therefore used models where phenotypes of crossbreds were regressed on the SDAF between parental lines. Accuracy of prediction was determined using leave-one-out cross-validation. SDAF predicted heterosis for egg number and weight with an accuracy of ∼0.5, but did not predict heterosis for survival days. Heterosis predictions allowed preselection of pure lines before field-testing, saving ∼50% of field-testing cost with only 4% loss in heterosis. Accuracies from cross-validation were lower than from the model-fit, suggesting that accuracies previously reported in literature are overestimated. Cross-validation also indicated that dominance cannot fully explain heterosis. Nevertheless, the dominance model had considerable accuracy, clearly greater than that of a general/specific combining ability model. This work also showed that heterosis can be modelled even when pure-line phenotypes are unavailable. We concluded that SDAF is a useful predictor of heterosis in commercial layer breeding.
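
    The central quantity in the abstract above, the squared difference in allele frequency (SDAF) between two parental lines, is simple to compute. The sketch below assumes SDAF is averaged over SNPs and then used as the single predictor in an ordinary least-squares fit of cross means. The study's actual model also includes pure-line effects, which are omitted here, and all names and numbers are hypothetical.

```python
def sdaf(freq_line_a, freq_line_b):
    """Mean squared difference in allele frequency across SNPs for two lines."""
    if not freq_line_a or len(freq_line_a) != len(freq_line_b):
        raise ValueError("frequency lists must be non-empty and of equal length")
    total = sum((p - q) ** 2 for p, q in zip(freq_line_a, freq_line_b))
    return total / len(freq_line_a)


def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


# Hypothetical allele frequencies at two SNPs in two pure lines.
example_sdaf = sdaf([0.2, 0.8], [0.6, 0.4])

# Hypothetical crosses: SDAF of each parental pair and mean egg number.
cross_sdaf = [0.0, 0.1, 0.2]
cross_eggs = [300.0, 302.0, 304.0]
intercept, slope = fit_line(cross_sdaf, cross_eggs)

# Predicted mean for an untested cross whose parental lines have SDAF = 0.15.
predicted = intercept + slope * 0.15
```

    Under the dominance assumption stated in the abstract, the slope carries the heterosis contribution per unit of SDAF, so a candidate cross can be ranked from allele frequencies alone before any field test.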

  9. Genome wide association analysis of cold tolerance at germination in temperate japonica rice (Oryza sativa L.) varieties

    PubMed Central

    Sales, Ester; Viruel, Juan; Domingo, Concha; Marqués, Luis

    2017-01-01

    A pool of 200 traditional, landraces and modern elite and old cultivars of rice, mainly japonica varieties adapted to temperate regions, have been used to perform a genome wide association study to detect chromosome regions associated to low temperature germination (LTG) regulation using a panel of 1672 SNP markers. Phenotyping was performed by determining growth rates when seeds were germinated at 25° and 15°C in order to separate the germination vigorousness from cold tolerance effects. As expected, the ability to produce viable seedlings varied widely among rice cultivars and also depended greatly on temperature. Furthermore, we observed a differential response during seed germination and in coleoptile elongation. Faster development at 15°C was observed in seeds from varieties traditionally used as cold tolerant parents by breeders, along with other potentially useful cultivars, mainly of Italian origin. When phenotypic data were combined with the panel of SNPs for japonica rice cultivars, significant associations were detected for 31 markers: 7 were related to growth rate at 25°C and 24 to growth rates at 15°. Among the latter, some chromosome regions were associated to LTG while others were related to coleoptile elongation. Individual effects of the associated markers were low, but by combining favourable alleles in a linear regression model we estimated that 27 loci significantly explained the observed phenotypic variation. From these, a core panel of 13 markers was selected and, furthermore, two wide regions of chromosomes 3 and 6 were consistently associated to rice LTG. Varieties with higher numbers of favourable alleles for the panels of associated markers significantly correlated with increased phenotypic values at both temperatures, thus corroborating the utility of the tagged markers for marker assisted selection (MAS) when breeding japonica rice for LTG. PMID:28817683

  10. Genome wide association analysis of cold tolerance at germination in temperate japonica rice (Oryza sativa L.) varieties.

    PubMed

    Sales, Ester; Viruel, Juan; Domingo, Concha; Marqués, Luis

    2017-01-01

    A pool of 200 traditional, landraces and modern elite and old cultivars of rice, mainly japonica varieties adapted to temperate regions, have been used to perform a genome wide association study to detect chromosome regions associated to low temperature germination (LTG) regulation using a panel of 1672 SNP markers. Phenotyping was performed by determining growth rates when seeds were germinated at 25° and 15°C in order to separate the germination vigorousness from cold tolerance effects. As expected, the ability to produce viable seedlings varied widely among rice cultivars and also depended greatly on temperature. Furthermore, we observed a differential response during seed germination and in coleoptile elongation. Faster development at 15°C was observed in seeds from varieties traditionally used as cold tolerant parents by breeders, along with other potentially useful cultivars, mainly of Italian origin. When phenotypic data were combined with the panel of SNPs for japonica rice cultivars, significant associations were detected for 31 markers: 7 were related to growth rate at 25°C and 24 to growth rates at 15°. Among the latter, some chromosome regions were associated to LTG while others were related to coleoptile elongation. Individual effects of the associated markers were low, but by combining favourable alleles in a linear regression model we estimated that 27 loci significantly explained the observed phenotypic variation. From these, a core panel of 13 markers was selected and, furthermore, two wide regions of chromosomes 3 and 6 were consistently associated to rice LTG. Varieties with higher numbers of favourable alleles for the panels of associated markers significantly correlated with increased phenotypic values at both temperatures, thus corroborating the utility of the tagged markers for marker assisted selection (MAS) when breeding japonica rice for LTG.

  11. Leading the way: finding genes for neurologic disease in dogs using genome-wide mRNA sequencing.

    PubMed

    Ostrander, Elaine A; Beale, Holly C

    2012-07-10

    Because of dogs' unique population structure, human-like disease biology, and advantageous genomic features, the canine system has risen dramatically in popularity as a tool for discovering disease alleles that have been difficult to find by studying human families or populations. To date, disease studies in dogs have primarily employed either linkage analysis, leveraging the typically large family size, or genome-wide association, which requires only modest-sized case and control groups in dogs. Both have been successful but, like most techniques, each requires a specific combination of time and money, and there are inherent problems associated with each. Here we review the first report of mRNA-Seq in the dog, a study that provides insights into the potential value of applying high-throughput sequencing to the study of genetic diseases in dogs. Forman and colleagues apply high-throughput sequencing to a single case of canine neonatal cerebellar cortical degeneration. This implementation of whole genome mRNA sequencing, the first reported in dog, is additionally unusual due to the analysis: the data was used not to examine transcript levels or annotate genes, but as a form of target capture that revealed the sequence of transcripts of genes associated with ataxia in humans. This approach entails risks. It would fail if, for example, the relevant transcripts were not sufficiently expressed for genotyping or were not associated with ataxia in humans. But here it pays off handsomely, identifying a single frameshift mutation that segregates with the disease. This work sets the stage for similar studies that take advantage of recent advances in genomics while exploiting the historical background of dog breeds to identify disease-causing mutations.

  12. Determination of non-market values to inform conservation strategies for the threatened Alistana-Sanabresa cattle breed.

    PubMed

    Martin-Collado, D; Diaz, C; Drucker, A G; Carabaño, M J; Zander, K K

    2014-08-01

    Livestock breed-related public good functions are often used to justify support for endangered breed conservation despite the fact that little is known about such non-market values. We show how stated preference techniques can be used to assess the non-market values that people place on livestock breeds. Through the application of a case study choice experiment survey in Zamora province, Spain, the total economic value (TEV) of the threatened Alistana-Sanabresa (AS) cattle breed was investigated. An analysis of the relative importance of the non-market components of its TEV and an assessment of the socio-economic variables that influence people's valuation of such components is used to inform conservation strategy design. Overall, the findings reveal that the AS breed had significant non-market values associated with it and that the value that respondents placed on each specific public good function also varied significantly. Functions related with indirect use cultural and existence values were much more highly valued than landscape maintenance values. These high cultural and existence values (totalling over 80% of TEV) suggest that an AS in situ conservation strategy will be required to secure such values. As part of such a strategy, incentive mechanisms will be needed to permit farmers to capture some of these public good values and thus be able to afford to maintain breed population numbers at socially desirable levels. One such mechanism could be related to the development of breed-related agritourism initiatives, with a view to enhancing private good values and providing an important addition to continued direct support. Where linked with cultural dimensions, niche product market development, including through improving AS breed-related product quality and brand recognition may also have a role to play as part of such an overall conservation and use strategy. We conclude that livestock breed conservation strategies with the highest potential to maximise

  13. Genome-wide Association Analysis Tracks Bacterial Leaf Blight Resistance Loci In Rice Diverse Germplasm.

    PubMed

    Dilla-Ermita, Christine Jade; Tandayu, Erwin; Juanillas, Venice Margarette; Detras, Jeffrey; Lozada, Dennis Nicuh; Dwiyanti, Maria Stefanie; Vera Cruz, Casiana; Mbanjo, Edwige Gaby Nkouaya; Ardales, Edna; Diaz, Maria Genaleen; Mendioro, Merlyn; Thomson, Michael J; Kretzschmar, Tobias

    2017-12-01

    A range of resistance loci against different races of Xanthomonas oryzae pv. oryzae (Xoo), the pathogen causing bacterial blight (BB) disease of rice, have been discovered and characterized. Several have been deployed in modern varieties; however, due to the rapid evolution of Xoo, a number have already become ineffective. The continuous "arms race" between Xoo and rice makes it imperative to discover new resistance loci to enable durable deployment of multiple resistance genes in modern breeding lines. Rice diversity panels can be exploited as reservoirs of useful genetic variation for BB resistance. This study was conducted to identify loci associated with BB resistance, new genetic donors and useful molecular markers for marker-assisted breeding. A genome-wide association study (GWAS) of BB resistance using a diverse panel of 285 rice accessions was performed to identify loci that are associated with resistance to nine Xoo strains from the Philippines, representative of eight global races. Single nucleotide polymorphisms (SNPs) associated with differential resistance were identified in the diverse panel and a subset of 198 indica accessions. Strong associations were found for novel SNPs linked with known bacterial blight resistance Xa genes, from which high utility markers for tracking and selection of resistance genes in breeding programs were designed. Furthermore, significant associations of SNPs in chromosomes 6, 9, 11, and 12 did not overlap with known resistance loci and hence might prove to be novel sources of resistance. Detailed analysis revealed haplotypes that correlated with resistance and analysis of putative resistance alleles identified resistant genotypes as potential donors of new resistance genes. The results of the GWAS validated known genes underlying resistance and identified novel loci that provide useful targets for further investigation. SNP markers and genetic donors identified in this study will help plant breeders in

  14. Breeding objectives for pigs in Kenya. II: economic values incorporating risks in different smallholder production systems.

    PubMed

    Mbuthia, Jackson Mwenda; Rewe, Thomas Odiwuor; Kahi, Alexander Kigunzu

    2015-02-01

    This study estimated economic values for production traits (dressing percentage (DP), %; live weight for growers (LWg), kg; live weight for sows (LWs), kg) and functional traits (feed intake for growers (FEEDg), feed intake for sows (FEEDs), preweaning survival rate (PrSR), %; postweaning survival rate (PoSR), %; sow survival rate (SoSR), %; total number of piglets born (TNB) and farrowing interval (FI), days) under different smallholder pig production systems in Kenya. Economic values were estimated considering two production circumstances: fixed-herd and fixed-feed. Under the fixed-herd scenario, economic values were estimated assuming a situation where the herd cannot be increased due to other constraints apart from feed resources. The fixed-feed input scenario assumed that the herd size is restricted by limitation of feed resources available. In addition to the traditional profit model, a risk-rated bio-economic model was used to derive risk-rated economic values. This model accounted for imperfect knowledge concerning risk attitude of farmers and variance of input and output prices. Positive economic values obtained for traits DP, LWg, LWs, PoSR, PrSR, SoSR and TNB indicate that targeting them in improvement would positively impact profitability in pig breeding programmes. Under the fixed-feed basis, the risk-rated economic values for DP, LWg, LWs and SoSR were similar to those obtained under the fixed-herd situation. Accounting for risks in the EVs did not yield errors greater than ±50% in any of the production systems or bases of evaluation, meaning there would be relatively little effect on the real genetic gain of a selection index. Therefore, both traditional and risk-rated models can be satisfactorily used to predict profitability in pig breeding programmes.

  15. Estimation of genetic parameters and breeding values across challenged environments to select for robust pigs.

    PubMed

    Herrero-Medrano, J M; Mathur, P K; ten Napel, J; Rashidi, H; Alexandri, P; Knol, E F; Mulder, H A

    2015-04-01

    Robustness is an important issue in the pig production industry. Since pigs from international breeding organizations have to withstand a variety of environmental challenges, selection of pigs with the inherent ability to sustain their productivity in diverse environments may be an economically feasible approach in the livestock industry. The objective of this study was to estimate genetic parameters and breeding values across different levels of environmental challenge load. The challenge load (CL) was estimated as the reduction in reproductive performance during different weeks of a year using 925,711 farrowing records from farms distributed worldwide. A wide range of levels of challenge, from favorable to unfavorable environments, was observed among farms, with high CL values being associated with confirmed situations of unfavorable environment. Genetic parameters and breeding values were estimated in high- and low-challenge environments using a bivariate analysis, as well as across increasing levels of challenge with a random regression model using Legendre polynomials. Although heritability estimates of number of pigs born alive were slightly higher in environments with extreme CL than in those with intermediate levels of CL, the heritabilities of number of piglet losses increased progressively as CL increased. Genetic correlations among environments with different levels of CL suggest that selection in environments with extremes of low or high CL would result in low response to selection. Therefore, selection programs of breeding organizations that are commonly conducted under favorable environments could have low response to selection in commercial farms that have unfavorable environmental conditions. Sows that had experienced high levels of challenge at least once during their productive life were ranked according to their EBV. The selection of pigs using EBV ignoring environmental challenges or on the basis of records from only favorable environments

  16. Genome-wide association study of circulating retinol levels.

    PubMed

    Mondul, Alison M; Yu, Kai; Wheeler, William; Zhang, Hong; Weinstein, Stephanie J; Major, Jacqueline M; Cornelis, Marilyn C; Männistö, Satu; Hazra, Aditi; Hsing, Ann W; Jacobs, Kevin B; Eliassen, Heather; Tanaka, Toshiko; Reding, Douglas J; Hendrickson, Sara; Ferrucci, Luigi; Virtamo, Jarmo; Hunter, David J; Chanock, Stephen J; Kraft, Peter; Albanes, Demetrius

    2011-12-01

    Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P = 2.30 × 10^-17) and rs10882272 (P = 6.04 × 10^-12). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P = 9.49 × 10^-5), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction = 1.31 × 10^-5). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.

  17. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors, ranging from more than 220 to 470 years of age, with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these results suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  18. A synergistic DNA logic predicts genome-wide chromatin accessibility

    PubMed Central

    Hashimoto, Tatsunori; Sherwood, Richard I.; Kang, Daniel D.; Rajagopal, Nisha; Barkal, Amira A.; Zeng, Haoyang; Emons, Bart J.M.; Srinivasan, Sharanya; Jaakkola, Tommi; Gifford, David K.

    2016-01-01

    Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that an SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types. PMID:27456004

  19. A genome-wide association study in multiple system atrophy

    PubMed Central

    Sailer, Anna; Nalls, Michael A.; Schulte, Claudia; Federoff, Monica; Price, T. Ryan; Lees, Andrew; Ross, Owen A.; Dickson, Dennis W.; Mok, Kin; Mencacci, Niccolo E.; Schottlaender, Lucia; Chelban, Viorica; Ling, Helen; O'Sullivan, Sean S.; Wood, Nicholas W.; Traynor, Bryan J.; Ferrucci, Luigi; Federoff, Howard J.; Mhyre, Timothy R.; Morris, Huw R.; Deuschl, Günther; Quinn, Niall; Widner, Hakan; Albanese, Alberto; Infante, Jon; Bhatia, Kailash P.; Poewe, Werner; Oertel, Wolfgang; Höglinger, Günter U.; Wüllner, Ullrich; Goldwurm, Stefano; Pellecchia, Maria Teresa; Ferreira, Joaquim; Tolosa, Eduardo; Bloem, Bastiaan R.; Rascol, Olivier; Meissner, Wassilios G.; Hardy, John A.; Revesz, Tamas; Holton, Janice L.; Gasser, Thomas; Wenning, Gregor K.; Singleton, Andrew B.

    2016-01-01

    Objective: To identify genetic variants that play a role in the pathogenesis of multiple system atrophy (MSA), we undertook a genome-wide association study (GWAS). Methods: We performed a GWAS with >5 million genotyped and imputed single nucleotide polymorphisms (SNPs) in 918 patients with MSA of European ancestry and 3,864 controls. MSA cases were collected from North American and European centers, one third of which were neuropathologically confirmed. Results: We found no significant loci after stringent multiple testing correction. A number of regions emerged as potentially interesting for follow-up at p < 1 × 10^-6, including SNPs in the genes FBXO47, ELOVL7, EDN1, and MAPT. Contrary to previous reports, we found no association of the genes SNCA and COQ2 with MSA. Conclusions: We present a GWAS in MSA. We have identified several potentially interesting gene loci, including the MAPT locus, whose significance will have to be evaluated in a larger sample set. Common genetic variation in SNCA and COQ2 does not seem to be associated with MSA. In the future, additional samples of well-characterized patients with MSA will need to be collected to perform a larger MSA GWAS, but this initial study forms the basis for these next steps. PMID:27629089

  20. Genome-wide profiling of forum domains in Drosophila melanogaster.

    PubMed

    Tchurikov, Nickolai A; Kretova, Olga V; Sosin, Dmitri V; Zykov, Ivan A; Zhimulev, Igor F; Kravatsky, Yuri V

    2011-05-01

    Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50-200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, the cluster of histone genes, and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to large multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin.

  1. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several e