Science.gov

Sample records for genome wide scan

  1. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A team of international researchers supported by the ... when they divide and grow uncontrollably, develop into melanoma. Also, MITF activity is known to be amplified ...

  2. Genome-wide DNA polymorphism analyses using VariScan

    PubMed Central

    Hutter, Stephan; Vilella, Albert J; Rozas, Julio

    2006-01-01

    Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i) exhaustive population-genetic analyses including those based on the coalescent theory; ii) analysis adapted to the shallow data generated by the high-throughput genome projects; iii) use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv) identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v) visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data. PMID:16968531

  3. Adjusted P values for genome-wide scans.

    PubMed Central

    Lystig, Theodore C

    2003-01-01

    Genome-wide scans for quantitative trait loci (QTL) have traditionally been summarized with plots of logarithm of odds (LOD) scores. A valuable modification is to supplement such plots with an additional vertical axis displaying quantiles of adjusted P values and labeling local maxima of the LOD scores with location-specific adjusted P values. This provides a visible gradation of genome-wide significance for the LOD score curve, instead of the stark dichotomy that a single threshold yields. Adjusted P values give genome-wide significance of individual LOD scores and are obtained through a straightforward modification of the familiar algorithm for generating permutation-based thresholds. PMID:12930772

  4. Genome-wide scans for loci under selection in humans.

    PubMed

    Ronald, James; Akey, Joshua M

    2005-06-01

    Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection.

  5. Heritability and genome-wide linkage scan of subjective happiness.

    PubMed

    Bartels, Meike; Saviouk, Viatcheslav; de Moor, Marleen H M; Willemsen, Gonneke; van Beijsterveldt, Toos C E M; Hottenga, Jouke-Jan; de Geus, Eco J C; Boomsma, Dorret I

    2010-04-01

    Causes of individual differences in happiness, as assessed with the Subjective Happiness Scale, are investigated in a large of sample twins and siblings from the Netherlands Twin Register. Over 12,000 twins and siblings, average age 24.7 years (range 12 to 88), took part in the study. A genetic model with an age by sex design was fitted to the data with structural equation modeling in Mx. The heritability of happiness was estimated at 22% for males and 41% in females. No effect of age was observed. To identify the genomic regions contributing to this heritability, a genome-wide linkage study for happiness was conducted in sibling pairs. A subsample of 1157 offspring from 441 families was genotyped with an average of 371 micro-satellite markers per individual. Phenotype and genotype data were analyzed in MERLIN with multipoint variance component linkage analysis and age and sex as covariates. A linkage signal (logarithm of odds score 2.73, empirical p value 0.095) was obtained at the end of the long arm of chromosome 19 for marker D19S254 at 110 cM. A second suggestive linkage peak was found at the short arm of chromosome 1 (LOD of 2.37) at 153 cM, marker D1S534 (empirical p value of .209). These two regions of interest are not overlapping with the regions found for contrasting phenotypes (such as depression, which is negatively associated with happiness). Further linkage and future association studies are warranted.

  6. A Genome-wide Pleiotropy Scan for Prostate Cancer Risk

    PubMed Central

    Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K

    2014-01-01

    Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did

  7. Genomic Scans across Three Eucalypts Suggest that Adaptation to Aridity is a Genome-Wide Phenomenon

    PubMed Central

    Potts, Brad M.; McLean, Elizabeth H.; Collins, Lesley; Holland, Barbara R.; Prober, Suzanne M.; Stock, William D.; Vaillancourt, René E.; Byrne, Margaret

    2017-01-01

    Widespread species spanning strong environmental (e.g., climatic) gradients frequently display morphological and physiological adaptations to local conditions. Some adaptations are common to different species that occupy similar environments. However, the genomic architecture underlying such convergent traits may not be the same between species. Using genomic data from previous studies of three widespread eucalypt species that grow along rainfall gradients in southern Australia, our probabilistic approach provides evidence that adaptation to aridity is a genome-wide phenomenon, likely to involve multiple and diverse genes, gene families and regulatory regions that affect a multitude of complex genetic and biochemical processes. PMID:28391293

  8. A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies.

    PubMed

    Sun, Yan V; Jacobsen, Douglas M; Turner, Stephen T; Boerwinkle, Eric; Kardia, Sharon L R

    2009-03-15

    In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N=195) using the Affymetrix Human Mapping 100K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R(2)<0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 hours. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4,000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms.

  9. Meta-analyses of genome-wide linkage scans of anxiety-related phenotypes

    PubMed Central

    Webb, Bradley T; Guo, An-Yuan; Maher, Brion S; Zhao, Zhongming; van den Oord, Edwin J; Kendler, Kenneth S; Riley, Brien P; Gillespie, Nathan A; Prescott, Carol A; Middeldorp, Christel M; Willemsen, Gonneke; de Geus, Eco JC; Hottenga, Jouke-Jan; Boomsma, Dorret I; Slagboom, Eline P; Wray, Naomi R; Montgomery, Grant W; Martin, Nicholas G; Wright, Margie J; Heath, Andrew C; Madden, Pamela A; Gelernter, Joel; Knowles, James A; Hamilton, Steven P; Weissman, Myrna M; Fyer, Abby J; Huezo-Diaz, Patricia; McGuffin, Peter; Farmer, Anne; Craig, Ian W; Lewis, Cathryn; Sham, Pak; Crowe, Raymond R; Flint, Jonathan; Hettema, John M

    2012-01-01

    Genetic factors underlying trait neuroticism, reflecting a tendency towards negative affective states, may overlap genetic susceptibility for anxiety disorders and help explain the extensive comorbidity amongst internalizing disorders. Genome-wide linkage (GWL) data from several studies of neuroticism and anxiety disorders have been published, providing an opportunity to test such hypotheses and identify genomic regions that harbor genes common to these phenotypes. In all, 11 independent GWL studies of either neuroticism (n=8) or anxiety disorders (n=3) were collected, which comprised of 5341 families with 15 529 individuals. The rank-based genome scan meta-analysis (GSMA) approach was used to analyze each trait separately and combined, and global correlations between results were examined. False discovery rate (FDR) analysis was performed to test for enrichment of significant effects. Using 10 cM intervals, bins nominally significant for both GSMA statistics, PSR and POR, were found on chromosomes 9, 11, 12, and 14 for neuroticism and on chromosomes 1, 5, 15, and 16 for anxiety disorders. Genome-wide, the results for the two phenotypes were significantly correlated, and a combined analysis identified additional nominally significant bins. Although none reached genome-wide significance, an excess of significant PSRP-values were observed, with 12 bins falling under a FDR threshold of 0.50. As demonstrated by our identification of multiple, consistent signals across the genome, meta-analytically combining existing GWL data is a valuable approach to narrowing down regions relevant for anxiety-related phenotypes. This may prove useful for prioritizing emerging genome-wide association data for anxiety disorders. PMID:22473089

  10. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity.

    PubMed

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P; Nir, Talia M; Toga, Arthur W; Jack, Clifford R; Saykin, Andrew J; Green, Robert C; Weiner, Michael W; Medland, Sarah E; Montgomery, Grant W; Hansell, Narelle K; McMahon, Katie L; de Zubicaray, Greig I; Martin, Nicholas G; Wright, Margaret J; Thompson, Paul M

    2013-03-19

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer's disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain's connectivity pattern, allowing us to discover genetic variants that affect the human brain's wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer's disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases.

  11. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity

    PubMed Central

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick

    2013-01-01

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985

  12. A high-density SNP genome-wide linkage scan in a large autism extended pedigree.

    PubMed

    Allen-Brady, K; Miller, J; Matsunami, N; Stevens, J; Block, H; Farley, M; Krasny, L; Pingree, C; Lainhart, J; Leppert, M; McMahon, W M; Coon, H

    2009-06-01

    We performed a high-density, single nucleotide polymorphism (SNP), genome-wide scan on a six-generation pedigree from Utah with seven affected males, diagnosed with autism spectrum disorder. Using a two-stage linkage design, we first performed a nonparametric analysis on the entire genome using a 10K SNP chip to identify potential regions of interest. To confirm potentially interesting regions, we eliminated SNPs in high linkage disequilibrium (LD) using a principal components analysis (PCA) method and repeated the linkage results. Three regions met genome-wide significance criteria after controlling for LD: 3q13.2-q13.31 (nonparametric linkage (NPL), 5.58), 3q26.31-q27.3 (NPL, 4.85) and 20q11.21-q13.12 (NPL, 5.56). Two regions met suggestive criteria for significance 7p14.1-p11.22 (NPL, 3.18) and 9p24.3 (NPL, 3.44). All five chromosomal regions are consistent with other published findings. Haplotype sharing results showed that five of the affected subjects shared more than a single chromosomal region of interest with other affected subjects. Although no common autism susceptibility genes were found for all seven autism cases, these results suggest that multiple genetic loci within these regions may contribute to the autism phenotype in this family, and further follow-up of these chromosomal regions is warranted.

  13. Genome-wide scans of genetic variants for psychophysiological endophenotypes: A methodological overview

    PubMed Central

    IACONO, WILLIAM. G.; MALONE, STEPHEN. M.; VAIDYANATHAN, UMA; VRIEZE, SCOTT I.

    2014-01-01

    This article provides an introductory overview of the investigative strategy employed to evaluate the genetic basis of 17 endophenotypes examined as part of a 20-year data collection effort from the Minnesota Center for Twin and Family Research. Included are characterization of the study samples, descriptive statistics for key properties of the psychophysiological measures, and rationale behind the steps taken in the molecular genetic study design. The statistical approach included (a) biometric analysis of twin and family data, (b) heritability analysis using 527,829 single nucleotide polymorphisms (SNPs), (c) genome-wide association analysis of these SNPs and 17,601 autosomal genes, (d) follow-up analyses of candidate SNPs and genes hypothesized to have an association with each endophenotype, (e) rare variant analysis of nonsynonymous SNPs in the exome, and (f) whole genome sequencing association analysis using 27 million genetic variants. These methods were used in the accompanying empirical articles comprising this special issue, Genome-Wide Scans of Genetic Variants for Psychophysiological Endophenotypes. PMID:25387703

  14. Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates.

    PubMed

    Gautier, Mathieu

    2015-12-01

    In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (iii) to consider alternative covariate models for analyses of association with population-specific covariables. In particular, the auxiliary variable model allows one to deal with multiple testing issues and, providing the relative marker positions are available, to capture some linkage disequilibrium information. A comprehensive simulation study was carried out to evaluate the performances of these different models. Also, when compared in terms of power, robustness, and computational efficiency to five other state-of-the-art genome-scan methods (BayEnv2, BayScEnv, BayScan, flk, and lfmm), the proposed approaches proved highly effective. For illustration purposes, genotyping data on 18 French cattle breeds were analyzed, leading to the identification of 13 strong signatures of selection. Among these, four (surrounding the KITLG, KIT, EDN3, and ALB genes) contained SNPs strongly associated with the piebald coloration pattern while a fifth (surrounding PLAG1) could be associated to morphological differences across the populations. Finally, analysis of Pool-Seq data from 12 populations of Littorina saxatilis living in two different ecotypes illustrates how the proposed framework might help in addressing relevant ecological issues in nonmodel species. Overall, the proposed methods define a robust Bayesian framework to characterize adaptive genetic differentiation across populations. The BayPass program implementing the different models is available at http://www1.montpellier.inra.fr/CBGP/software/baypass/.

  15. Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates

    PubMed Central

    Gautier, Mathieu

    2015-01-01

    In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (iii) to consider alternative covariate models for analyses of association with population-specific covariables. In particular, the auxiliary variable model allows one to deal with multiple testing issues and, providing the relative marker positions are available, to capture some linkage disequilibrium information. A comprehensive simulation study was carried out to evaluate the performances of these different models. Also, when compared in terms of power, robustness, and computational efficiency to five other state-of-the-art genome-scan methods (BayEnv2, BayScEnv, BayScan, flk, and lfmm), the proposed approaches proved highly effective. For illustration purposes, genotyping data on 18 French cattle breeds were analyzed, leading to the identification of 13 strong signatures of selection. Among these, four (surrounding the KITLG, KIT, EDN3, and ALB genes) contained SNPs strongly associated with the piebald coloration pattern while a fifth (surrounding PLAG1) could be associated to morphological differences across the populations. Finally, analysis of Pool-Seq data from 12 populations of Littorina saxatilis living in two different ecotypes illustrates how the proposed framework might help in addressing relevant ecological issues in nonmodel species. Overall, the proposed methods define a robust Bayesian framework to characterize adaptive genetic differentiation across populations. The BayPass program implementing the different models is available at http://www1.montpellier

  16. Genome-wide association scan in psoriasis: new insights into chronic inflammatory disease.

    PubMed

    Schrodi, Steven J

    2008-09-01

    Evaluation of: Liu Y, Helms C, Liao W et al. A genome-wide association study of psoriasis and psoriatic arthritis identifies new disease loci. PLoS Genet. 4, e1000041 (2008). Genome-wide association scans have delivered on their promise of revealing susceptibility polymorphisms underlying common diseases. This comprehensive psoriasis study by Liu and colleagues reports confirmation of previously identified genes (HLA-C, IL12B and IL23R), identifies several novel psoriasis loci and is the first to report psoriatic arthritis association on a genome-wide scale. Along with other recent studies, this work gives further evidence that IL-23-mediated signaling is a key component of both psoriasis and psoriatic arthritis pathogenesis. Importantly, this study provides evidence of a single-nucleotide polymorphism (SNP), 35 kb upstream of HLA-C, which is stronger than Cw 0602 - the variant traditionally attributed to the MHC-linked psoriasis-susceptibility effect. Within this region, the authors also discovered an independent SNP with very strong predisposing effects. SNPs in the COG6 region and the USP8-TNFAIP8l3 region are among the novel psoriasis associations reported. In addition, a region showing linkage on chromosome 1q demonstrated association in the epidermal differentiation complex. Four SNPs over a 439-kb region on chromosome 4q27, where KIAA1109, ADAD1 and two cytokine-encoding genes (IL2 and IL21) reside, exhibit intriguing correlation with psoriatic arthritis, although the signal strength is moderate. These results, while still preliminary, may substantially expand our knowledge of psoriasis and psoriatic arthritis genetics, opening new avenues of chronic inflammatory disease research.

  17. Genome-wide scans detect adaptation to aridity in a widespread forest tree species.

    PubMed

    Steane, Dorothy A; Potts, Brad M; McLean, Elizabeth; Prober, Suzanne M; Stock, William D; Vaillancourt, René E; Byrne, Margaret

    2014-05-01

    Patterns of adaptive variation within plant species are best studied through common garden experiments, but these are costly and time-consuming, especially for trees that have long generation times. We explored whether genome-wide scanning technology combined with outlier marker detection could be used to detect adaptation to climate and provide an alternative to common garden experiments. As a case study, we sampled nine provenances of the widespread forest tree species, Eucalyptus tricarpa, across an aridity gradient in southeastern Australia. Using a Bayesian analysis, we identified a suite of 94 putatively adaptive (outlying) sequence-tagged markers across the genome. Population-level allele frequencies of these outlier markers were strongly correlated with temperature and moisture availability at the site of origin, and with population differences in functional traits measured in two common gardens. Using the output from a canonical analysis of principal coordinates, we devised a metric that provides a holistic measure of genomic adaptation to aridity that could be used to guide assisted migration or genetic augmentation.

  18. A Genome-wide Scan for Selective Sweeps in Racing Horses

    PubMed Central

    Moon, Sunjin; Lee, Jin Woo; Shin, Donghyun; Shin, Kwang-Yun; Kim, Jun; Choi, Ik-Young; Kim, Jaemin; Kim, Heebal

    2015-01-01

    Using next-generation sequencing, we conducted a genome-wide scan of selective sweeps associated with selection toward genetic improvement in Thoroughbreds. We investigated potential phenotypic consequence of putative candidate loci by candidate gene association mapping for the finishing time in 240 Thoroughbred horses. We found a significant association with the trait for Ral GApase alpha 2 (RALGAP2) that regulates a variety of cellular processes of signal trafficking. Neighboring genes around RALGAP2 included insulinoma-associated 1 (INSM1), pallid (PLDN), and Ras and Rab interactor 2 (RIN2) genes have similar roles in signal trafficking, suggesting that a co-evolving gene cluster located on the chromosome 22 is under strong artificial selection in racehorses. PMID:26333666

  19. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  20. The genetic architecture of seed composition in soybean is refined by genome-wide association scans across multiple populations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean oil and meal are major contributors to world-wide food production. Consequently, the genetic basis for soybean seed composition has been intensely studied using family-based mapping. Population-based mapping approaches, in the form of genome-wide association (GWA) scans, have been able to re...

  1. Suitability of Different Mapping Algorithms for Genome-Wide Polymorphism Scans with Pool-Seq Data

    PubMed Central

    Kofler, Robert; Langmüller, Anna Maria; Nouhaud, Pierre; Otte, Kathrin Anna; Schlötterer, Christian

    2016-01-01

    The cost-effectiveness of sequencing pools of individuals (Pool-Seq) provides the basis for the popularity and widespread use of this method for many research questions, ranging from unraveling the genetic basis of complex traits, to the clonal evolution of cancer cells. Because the accuracy of Pool-Seq could be affected by many potential sources of error, several studies have determined, for example, the influence of sequencing technology, the library preparation protocol, and mapping parameters. Nevertheless, the impact of the mapping tools has not yet been evaluated. Using simulated and real Pool-Seq data, we demonstrate a substantial impact of the mapping tools, leading to characteristic false positives in genome-wide scans. The problem of false positives was particularly pronounced when data with different read lengths and insert sizes were compared. Out of 14 evaluated algorithms novoalign, bwa mem and clc4 are most suitable for mapping Pool-Seq data. Nevertheless, no single algorithm is sufficient for avoiding all false positives. We show that the intersection of the results of two mapping algorithms provides a simple, yet effective, strategy to eliminate false positives. We propose that the implementation of a consistent Pool-Seq bioinformatics pipeline, building on the recommendations of this study, can substantially increase the reliability of Pool-Seq results, in particular when libraries generated with different protocols are being compared. PMID:27613752

  2. Genome-wide Scanning and Characterization of Sorghum bicolor L. Heat Shock Transcription Factors

    PubMed Central

    Nagaraju, M.; Reddy, Palakolanu Sudhakar; Kumar, S. Anil; Srivastava, Rakesh K.; Kishor, P. B. Kavi; Rao, D. Manohar

    2015-01-01

    A genome-wide scanning of Sorghum bicolor resulted in the identification of 25 SbHsf genes. Phylogenetic analysis shows the ortholog genes that are clustered with only rice, representing a common ancestor. Promoter analysis revealed the identification of different cis-acting elements that are responsible for abiotic as well as biotic stresses. Hsf domains like DBD, NLS, NES, and AHA have been analyzed for their sequence similarity and functional characterization. Tissue specific expression patterns of Hsfs in different tissues like mature embryo, seedling, root, and panicle were studied using real-time PCR. While Hsfs4 and 22 are highly expressed in panicle, 4 and 9 are expressed in seedlings. Sorghum plants were exposed to different abiotic stress treatments but no expression of any Hsf was observed when seedlings were treated with ABA. High level expression of Hsf1 was noticed during high temperature as well as cold stresses, 4 and 6 during salt and 5, 6, 10, 13, 19, 23 and 25 during drought stress. This comprehensive analysis of SbHsf genes will provide an insight on how these genes are regulated in different tissues and also under different abiotic stresses and help to determine the functions of Hsfs during drought and temperature stress tolerance. PMID:27006630

  3. Genome-wide linkage scan for the metabolic syndrome: the GENNID study.

    PubMed

    Edwards, Karen L; Hutter, Carolyn M; Wan, Jia Yin; Kim, Helen; Monks, Stephanie A

    2008-07-01

    In the United States, the metabolic syndrome (MetS) constitutes a major public health problem with over 47 million persons meeting clinical criteria for MetS. Numerous studies have suggested genetic susceptibility to MetS. The goals of this study were (i) to identify susceptibility loci for MetS in well-characterized families with type 2 diabetes (T2D) in four ethnic groups and (ii) to determine whether evidence for linkage varies across the four groups. The GENNID study (Genetics of NIDDM) is a multicenter study established by the American Diabetes Association in 1993 and comprises a comprehensive, well-characterized resource of T2D families from four ethnic groups (whites, Mexican Americans, African Americans, and Japanese Americans). Principal component factor analysis (PCFA) was used to define quantitative phenotypes of the MetS. Variance components linkage analysis was conducted using microsatellite markers from a 10-cM genome-wide linkage scan, separately in each of the four ethnic groups. Three quantitative MetS factors were identified by PCFA and used as phenotypes for MetS: (i) a weight/waist factor, (ii) a blood pressure factor, and (iii) a lipid factor. Evidence for linkage to each of these factors was observed. For each ethnic group, our results suggest that several regions harbor susceptibility genes for the MetS. The strongest evidence for linkage for MetS phenotypes was observed on chromosome 2 (2q12.1-2q13) in the white sample and on chromosome 3 (3q26.1-3q29) in the Mexican-American sample. In conclusion, the results suggest that several regions harbor MetS susceptibility genes and that heterogeneity may exist across groups.

  4. A Genome-Wide Scan for Breast Cancer Risk Haplotypes among African American Women

    PubMed Central

    Song, Chi; Chen, Gary K.; Millikan, Robert C.; Ambrosone, Christine B.; John, Esther M.; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J.; Ziegler, Regina G.; Nyante, Sarah; Bandera, Elisa V.; Ingles, Sue A.; Press, Michael F.; Deming, Sandra L.; Rodriguez-Gil, Jorge L.; Chanock, Stephen J.; Wan, Peggy; Sheng, Xin; Pooler, Loreall C.; Van Den Berg, David J.; Le Marchand, Loic; Kolonel, Laurence N.; Henderson, Brian E.; Haiman, Chris A.; Stram, Daniel O.

    2013-01-01

    Genome-wide association studies (GWAS) simultaneously investigating hundreds of thousands of single nucleotide polymorphisms (SNP) have become a powerful tool in the investigation of new disease susceptibility loci. Haplotypes are sometimes thought to be superior to SNPs and are promising in genetic association analyses. The application of genome-wide haplotype analysis, however, is hindered by the complexity of haplotypes themselves and sophistication in computation. We systematically analyzed the haplotype effects for breast cancer risk among 5,761 African American women (3,016 cases and 2,745 controls) using a sliding window approach on the genome-wide scale. Three regions on chromosomes 1, 4 and 18 exhibited moderate haplotype effects. Furthermore, among 21 breast cancer susceptibility loci previously established in European populations, 10p15 and 14q24 are likely to harbor novel haplotype effects. We also proposed a heuristic of determining the significance level and the effective number of independent tests by the permutation analysis on chromosome 22 data. It suggests that the effective number was approximately half of the total (7,794 out of 15,645), thus the half number could serve as a quick reference to evaluating genome-wide significance if a similar sliding window approach of haplotype analysis is adopted in similar populations using similar genotype density. PMID:23468962

  5. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  6. A comparison in association and linkage genome-wide scans for alcoholism susceptibility genes using single-nucleotide polymorphisms.

    PubMed

    Chiu, Yen-Feng; Liu, Su-Yun; Tsai, Ya-Yu

    2005-12-30

    We conducted genome-wide linkage scans using both microsatellite and single-nucleotide polymorphism (SNP) markers. Regions showing the strongest evidence of linkage to alcoholism susceptibility genes were identified. Haplotype analyses using a sliding-window approach for SNPs in these regions were performed. In addition, we performed a genome-wide association scan using SNP data. SNPs in these regions with evidence of association (P genome scans are fairly consistent; however, the peaks of the NPL scores are mostly higher in the SNP-based scan than those using microsatellite markers, which might be located at different regions. Furthermore, SNPs identified from linkage screens were not so strongly associated with alcoholism (the most significant SNP had a p-value of 0.030) as those identified from association genomic screening (the most significant SNP had a p-value of 2.0 x 10(-8)).

  7. A twin study of breastfeeding with a preliminary genome wide association scan

    PubMed Central

    Colodro-Conde, L.; Zhu, G.; Power, R. A.; Henders, A.; Heath, A.C.; Madden, P.A.F.; Montgomery, G.W.; Medland, S. E.; Ordoñana, J.R.; Martin, N.G.

    2015-01-01

    Breastfeeding has been an important survival trait during human history, though it has long been recognised that individuals differ in their exact breastfeeding behaviour. Here our aims were, first, to explore to what extent genetic and environmental influences contributed to the individual differences in breastfeeding behaviour; second, to detect possible genetic variants related to breastfeeding; and lastly, to test if the genetic variants associated with breastfeeding have been previously found to be related with breast size. Data were collected from a large community-based cohort of Australian twins, with 3,364 women for the twin modelling analyses and 1,521 of them included in the genome wide association study. Monozygotic twin correlations (rMZ = .52, 95% CI .46 – .57) were larger than dizygotic twin correlations (rDZ = .35, 95% CI .25 – .43) and the best-fitting model was the one composed by additive genetics and unique environmental factors, explaining 53% and 47% of the variance in breastfeeding behaviour, respectively. No breastfeeding-related genetic variants reached genome-wide significance. The polygenic risk score analyses showed no significant results, suggesting breast size does not influence breastfeeding. This study confers a replication of a previous one exploring the sources of variance of breastfeeding and, to our knowledge, is the first one to conduct a Genome-Wide Association Study on breastfeeding and look at the overlap with variants for breast size. PMID:25475840

  8. Genome-Wide Scans for Delineation of Candidate Genes Regulating Seed-Protein Content in Chickpea

    PubMed Central

    Upadhyaya, Hari D.; Bajaj, Deepak; Narnoliya, Laxmi; Das, Shouvik; Kumar, Vinod; Gowda, C. L. L.; Sharma, Shivali; Tyagi, Akhilesh K.; Parida, Swarup K.

    2016-01-01

    Identification of potential genes/alleles governing complex seed-protein content (SPC) is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study), high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism) discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150–200 kb LD (linkage disequilibrium) decay] was utilized. This led to identification of seven most effective genomic loci (genes) associated [10–20% with 41% combined PVE (phenotypic variation explained)] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line) mapping population (ICC 12299 × ICC 4958) by selective genotyping. The seed-specific expression, including differential up-regulation (>four fold) of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with a high level of contrasting SPC (21–22%) was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait was found to be the most

  9. A genome-wide association scan in admixed Latin Americans identifies loci influencing facial and scalp hair features

    PubMed Central

    Adhikari, Kaustubh; Fontanil, Tania; Cal, Santiago; Mendoza-Revilla, Javier; Fuentes-Guajardo, Macarena; Chacón-Duque, Juan-Camilo; Al-Saadi, Farah; Johansson, Jeanette A.; Quinto-Sanchez, Mirsha; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Barquera Lozano, Rodrigo; Macín Pérez, Gastón; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C.; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M.; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Rothhammer, Francisco; Bedoya, Gabriel; Gonzalez-José, Rolando; Headon, Denis; López-Otín, Carlos; Tobin, Desmond J.; Balding, David; Ruiz-Linares, Andrés

    2016-01-01

    We report a genome-wide association scan in over 6,000 Latin Americans for features of scalp hair (shape, colour, greying, balding) and facial hair (beard thickness, monobrow, eyebrow thickness). We found 18 signals of association reaching genome-wide significance (P values 5 × 10−8 to 3 × 10−119), including 10 novel associations. These include novel loci for scalp hair shape and balding, and the first reported loci for hair greying, monobrow, eyebrow and beard thickness. A newly identified locus influencing hair shape includes a Q30R substitution in the Protease Serine S1 family member 53 (PRSS53). We demonstrate that this enzyme is highly expressed in the hair follicle, especially the inner root sheath, and that the Q30R substitution affects enzyme processing and secretion. The genome regions associated with hair features are enriched for signals of selection, consistent with proposals regarding the evolution of human hair. PMID:26926045

  10. Genome-wide scans provide evidence for positive selection of genes implicated in Lassa fever.

    PubMed

    Andersen, Kristian G; Shylakhter, Ilya; Tabrizi, Shervin; Grossman, Sharon R; Happi, Christian T; Sabeti, Pardis C

    2012-03-19

    Rapidly evolving viruses and other pathogens can have an immense impact on human evolution as natural selection acts to increase the prevalence of genetic variants providing resistance to disease. With the emergence of large datasets of human genetic variation, we can search for signatures of natural selection in the human genome driven by such disease-causing microorganisms. Based on this approach, we have previously hypothesized that Lassa virus (LASV) may have been a driver of natural selection in West African populations where Lassa haemorrhagic fever is endemic. In this study, we provide further evidence for this notion. By applying tests for selection to genome-wide data from the International Haplotype Map Consortium and the 1000 Genomes Consortium, we demonstrate evidence for positive selection in LARGE and interleukin 21 (IL21), two genes implicated in LASV infectivity and immunity. We further localized the signals of selection, using the recently developed composite of multiple signals method, to introns and putative regulatory regions of those genes. Our results suggest that natural selection may have targeted variants giving rise to alternative splicing or differential gene expression of LARGE and IL21. Overall, our study supports the hypothesis that selective pressures imposed by LASV may have led to the emergence of particular alleles conferring resistance to Lassa fever, and opens up new avenues of research pursuit.

  11. A Genome Wide Association Scan of Bovine Tuberculosis Susceptibility in Holstein-Friesian Dairy Cattle

    PubMed Central

    Finlay, Emma K.; Berry, Donagh P.; Wickham, Brian; Gormley, Eamonn P.; Bradley, Daniel G.

    2012-01-01

    Background Bovine tuberculosis is a significant veterinary and financial problem in many parts of the world. Although many factors influence infection and progression of the disease, there is a host genetic component and dissection of this may enlighten on the wider biology of host response to tuberculosis. However, a binary phenotype of presence/absence of infection presents a noisy signal for genomewide association study. Methodology/Principal Findings We calculated a composite phenotype of genetic merit for TB susceptibility based on disease incidence in daughters of elite sires used for artificial insemination in the Irish dairy herd. This robust measure was compared with 44,426 SNP genotypes in the most informative 307 subjects in a genome wide association analysis. Three SNPs in a 65 kb genomic region on BTA 22 were associated (i.e. p<10−5, peaking at position 59588069, p = 4.02×10−6) with tuberculosis susceptibility. Conclusions/Significance A genomic region on BTA 22 was suggestively associated with tuberculosis susceptibility; it contains the taurine transporter gene SLC6A6, or TauT, which is known to function in the immune system but has not previously been investigated for its role in tuberculosis infection. PMID:22355315

  12. A genome-wide scan for tying-up syndrome in Japanese Thoroughbreds.

    PubMed

    Tozaki, T; Hirota, K; Sugita, S; Ishida, N; Miyake, T; Oki, H; Hasegawa, T

    2010-12-01

    Tying-up syndrome, also known as recurrent exertional rhabdomyolysis in Thoroughbreds, is a common muscle disorder for racehorses. In this study, we performed a multipoint linkage analysis using LOKI based on the Bayesian Markov chain Monte Carlo method using 5 half-sib families (51 affected and 277 nonaffected horses in total), and a genome-wide association study (GWAS) using microsatellites (144 affected and 144 nonaffected horses) to map candidate regions for tying-up syndrome in Japanese Thoroughbreds. The linkage analysis identified one strong L-score (82.45) between the loci UCDEQ411 and COR058 (24.9-27.9 Mb) on ECA12. The GWAS identified two suggestive genomic regions on ECA12 (24.9-27.8 Mb) and ECA20 (29.3-33.5 Mb). Based on both results, the genomic region between UCDEQ411 and TKY499 (24.9-27.8 Mb) on ECA12 was the most significant and was considered as a candidate region for tying-up syndrome in Japanese Thoroughbreds.

  13. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    PubMed

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50(®) chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep.

  14. A genome-wide scan for signatures of selection in Chinese indigenous and commercial pig breeds

    PubMed Central

    2014-01-01

    Background Modern breeding and artificial selection play critical roles in pig domestication and shape the genetic variation of different breeds. China has many indigenous pig breeds with various characteristics in morphology and production performance that differ from those of foreign commercial pig breeds. However, the signatures of selection on genes implying for economic traits between Chinese indigenous and commercial pigs have been poorly understood. Results We identified footprints of positive selection at the whole genome level, comprising 44,652 SNPs genotyped in six Chinese indigenous pig breeds, one developed breed and two commercial breeds. An empirical genome-wide distribution of Fst (F-statistics) was constructed based on estimations of Fst for each SNP across these nine breeds. We detected selection at the genome level using the High-Fst outlier method and found that 81 candidate genes show high evidence of positive selection. Furthermore, the results of network analyses showed that the genes that displayed evidence of positive selection were mainly involved in the development of tissues and organs, and the immune response. In addition, we calculated the pairwise Fst between Chinese indigenous and commercial breeds (CHN VS EURO) and between Northern and Southern Chinese indigenous breeds (Northern VS Southern). The IGF1R and ESR1 genes showed evidence of positive selection in the CHN VS EURO and Northern VS Southern groups, respectively. Conclusions In this study, we first identified the genomic regions that showed evidences of selection between Chinese indigenous and commercial pig breeds using the High-Fst outlier method. These regions were found to be involved in the development of tissues and organs, the immune response, growth and litter size. The results of this study provide new insights into understanding the genetic variation and domestication in pigs. PMID:24422716

  15. Genome-wide linkage scan identifies two novel genetic loci for coronary artery disease: in GeneQuest families.

    PubMed

    Gao, Hanxiang; Li, Lin; Rao, Shaoqi; Shen, Gongqing; Xi, Quansheng; Chen, Shenghan; Zhang, Zheng; Wang, Kai; Ellis, Stephen G; Chen, Qiuyun; Topol, Eric J; Wang, Qing K

    2014-01-01

    Coronary artery disease (CAD) is the leading cause of death worldwide. Recent genome-wide association studies (GWAS) identified >50 common variants associated with CAD or its complication myocardial infarction (MI), but collectively they account for <20% of heritability, generating a phenomena of "missing heritability". Rare variants with large effects may account for a large portion of missing heritability. Genome-wide linkage studies of large families and follow-up fine mapping and deep sequencing are particularly effective in identifying rare variants with large effects. Here we show results from a genome-wide linkage scan for CAD in multiplex GeneQuest families with early onset CAD and MI. Whole genome genotyping was carried out with 408 markers that span the human genome by every 10 cM and linkage analyses were performed using the affected relative pair analysis implemented in GENEHUNTER. Affected only nonparametric linkage (NPL) analysis identified two novel CAD loci with highly significant evidence of linkage on chromosome 3p25.1 (peak NPL  = 5.49) and 3q29 (NPL  = 6.84). We also identified four loci with suggestive linkage on 9q22.33, 9q34.11, 17p12, and 21q22.3 (NPL  = 3.18-4.07). These results identify novel loci for CAD and provide a framework for fine mapping and deep sequencing to identify new susceptibility genes and novel variants associated with risk of CAD.

  16. A genome-wide scan for signatures of differential artificial selection in ten cattle breeds

    PubMed Central

    2013-01-01

    Background Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). Results We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. Conclusions These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass

  17. A genome-wide scan for common alleles affecting risk for autism

    PubMed Central

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R.; Correia, Catarina; Abrahams, Brett S.; Sykes, Nuala; Pagnamenta, Alistair T.; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J.; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F.; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R.; Casallo, Guillermo; Casey, Jillian; Chu, Su H.; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L.; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A.; Folstein, Susan E.; Fombonne, Eric; Freitag, Christine M.; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T.; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J.; Hakonarson, Hakon; Heron, Elizabeth A.; Hill, Matthew; Holt, Richard; Howe, Jennifer L.; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M.; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M.; Lamb, Janine A.; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L.; Lionel, Anath C.; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C.; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R.; McConachie, Helen; McDougle, Christopher J.; McGrath, Jane; McMahon, William M.; Melhem, Nadine M.; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J.; Mirza, Ghazala K.; Munson, Jeff; Nelson, Stanley F.; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R.; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L.; Bierut, Laura J.; Rice, John P.; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C.; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P.; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B.; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H.; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L.; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D.; Cantor, Rita M.; Cook, Edwin H.; Coon, Hilary; Cuccaro, Michael L.; Gallagher, Louise; Geschwind, Daniel H.; Gill, Michael; Haines, Jonathan L.; Miller, Judith; Monaco, Anthony P.; Nurnberger, John I.; Paterson, Andrew D.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Sutcliffe, James S.; Szatmari, Peter; Vicente, Astrid M.; Vieland, Veronica J.; Wijsman, Ellen M.; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-01-01

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10−8. When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10−8 threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923

  18. A genome-wide scan for common alleles affecting risk for autism.

    PubMed

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R; Correia, Catarina; Abrahams, Brett S; Sykes, Nuala; Pagnamenta, Alistair T; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R; Casallo, Guillermo; Casey, Jillian; Chu, Su H; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A; Folstein, Susan E; Fombonne, Eric; Freitag, Christine M; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J; Hakonarson, Hakon; Heron, Elizabeth A; Hill, Matthew; Holt, Richard; Howe, Jennifer L; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M; Lamb, Janine A; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L; Lionel, Anath C; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R; McConachie, Helen; McDougle, Christopher J; McGrath, Jane; McMahon, William M; Melhem, Nadine M; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J; Mirza, Ghazala K; Munson, Jeff; Nelson, Stanley F; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L; Bierut, Laura J; Rice, John P; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D; Cantor, Rita M; Cook, Edwin H; Coon, Hilary; Cuccaro, Michael L; Gallagher, Louise; Geschwind, Daniel H; Gill, Michael; Haines, Jonathan L; Miller, Judith; Monaco, Anthony P; Nurnberger, John I; Paterson, Andrew D; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Scherer, Stephen W; Sutcliffe, James S; Szatmari, Peter; Vicente, Astrid M; Vieland, Veronica J; Wijsman, Ellen M; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-10-15

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C.

  19. A genome-wide scan in affected sibling pairs with idiopathic recurrent miscarriage suggests genetic linkage.

    PubMed

    Kolte, A M; Nielsen, H S; Moltke, I; Degn, B; Pedersen, B; Sunde, L; Nielsen, F C; Christiansen, O B

    2011-06-01

    Previously, siblings of patients with idiopathic recurrent miscarriage (IRM) have been shown to have a higher risk of miscarriage. This study comprises two parts: (i) an epidemiological part, in which we introduce data on the frequency of miscarriage among 268 siblings of 244 patients with IRM and (ii) a genetic part presenting data from a genome-wide linkage study of 38 affected sibling pairs with IRM. All IRM patients (probands) had experienced three or more miscarriages and affected siblings two or more miscarriages. The sibling pairs were genotyped by the Affymetrix GeneChip 50K XbaI platform and non-parametric linkage analysis was performed via the software package Merlin. We find that siblings of IRM patients exhibit a higher frequency of miscarriage than population controls regardless of age at the time of pregnancy. We identify chromosomal regions with LOD scores between 2.5 and 3.0 in subgroups of affected sibling pairs. Maximum LOD scores were identified in four occurrences: for rs10514716 (3p14.2) when analyzing sister-pairs only; for rs10511668 (9p22.1) and rs341048 (11q13.4) when only analyzing families where the probands have had four or more miscarriages; and for rs10485275 (6q16.3) when analyzing one sibling pair from each family only. We identify no founder mutations. Concluding, our results imply that IRM patients and their siblings share factors which increase the risk of miscarriage. In this first genome-wide linkage study of affected sibling pairs with IRM, we identify regions on chromosomes 3, 6, 9 and 11 which warrant further investigation in order to elucidate their putative roles in the genesis of IRM.

  20. Genome-wide association Scan of dental caries in the permanent dentition

    PubMed Central

    2012-01-01

    Background Over 90% of adults aged 20 years or older with permanent teeth have suffered from dental caries leading to pain, infection, or even tooth loss. Although caries prevalence has decreased over the past decade, there are still about 23% of dentate adults who have untreated carious lesions in the US. Dental caries is a complex disorder affected by both individual susceptibility and environmental factors. Approximately 35-55% of caries phenotypic variation in the permanent dentition is attributable to genes, though few specific caries genes have been identified. Therefore, we conducted the first genome-wide association study (GWAS) to identify genes affecting susceptibility to caries in adults. Methods Five independent cohorts were included in this study, totaling more than 7000 participants. For each participant, dental caries was assessed and genetic markers (single nucleotide polymorphisms, SNPs) were genotyped or imputed across the entire genome. Due to the heterogeneity among the five cohorts regarding age, genotyping platform, quality of dental caries assessment, and study design, we first conducted genome-wide association (GWA) analyses on each of the five independent cohorts separately. We then performed three meta-analyses to combine results for: (i) the comparatively younger, Appalachian cohorts (N = 1483) with well-assessed caries phenotype, (ii) the comparatively older, non-Appalachian cohorts (N = 5960) with inferior caries phenotypes, and (iii) all five cohorts (N = 7443). Top ranking genetic loci within and across meta-analyses were scrutinized for biologically plausible roles on caries. Results Different sets of genes were nominated across the three meta-analyses, especially between the younger and older age cohorts. In general, we identified several suggestive loci (P-value ≤ 10E-05) within or near genes with plausible biological roles for dental caries, including RPS6KA2 and PTK2B, involved in p38-depenedent MAPK signaling

  1. Genome-Wide Association Scan Meta-Analysis Identifies Three Loci Influencing Adiposity and Fat Distribution

    PubMed Central

    Qi, Lu; Speliotes, Elizabeth K.; Thorleifsson, Gudmar; Willer, Cristen J.; Herrera, Blanca M.; Jackson, Anne U.; Lim, Noha; Scheet, Paul; Soranzo, Nicole; Amin, Najaf; Aulchenko, Yurii S.; Chambers, John C.; Drong, Alexander; Luan, Jian'an; Lyon, Helen N.; Rivadeneira, Fernando; Sanna, Serena; Timpson, Nicholas J.; Zillikens, M. Carola; Zhao, Jing Hua; Almgren, Peter; Bandinelli, Stefania; Bennett, Amanda J.; Bergman, Richard N.; Bonnycastle, Lori L.; Bumpstead, Suzannah J.; Chanock, Stephen J.; Cherkas, Lynn; Chines, Peter; Coin, Lachlan; Cooper, Cyrus; Crawford, Gabriel; Doering, Angela; Dominiczak, Anna; Doney, Alex S. F.; Ebrahim, Shah; Elliott, Paul; Erdos, Michael R.; Estrada, Karol; Ferrucci, Luigi; Fischer, Guido; Forouhi, Nita G.; Gieger, Christian; Grallert, Harald; Groves, Christopher J.; Grundy, Scott; Guiducci, Candace; Hadley, David; Hamsten, Anders; Havulinna, Aki S.; Hofman, Albert; Holle, Rolf; Holloway, John W.; Illig, Thomas; Isomaa, Bo; Jacobs, Leonie C.; Jameson, Karen; Jousilahti, Pekka; Karpe, Fredrik; Kuusisto, Johanna; Laitinen, Jaana; Lathrop, G. Mark; Lawlor, Debbie A.; Mangino, Massimo; McArdle, Wendy L.; Meitinger, Thomas; Morken, Mario A.; Morris, Andrew P.; Munroe, Patricia; Narisu, Narisu; Nordström, Anna; Nordström, Peter; Oostra, Ben A.; Palmer, Colin N. A.; Payne, Felicity; Peden, John F.; Prokopenko, Inga; Renström, Frida; Ruokonen, Aimo; Salomaa, Veikko; Sandhu, Manjinder S.; Scott, Laura J.; Scuteri, Angelo; Silander, Kaisa; Song, Kijoung; Yuan, Xin; Stringham, Heather M.; Swift, Amy J.; Tuomi, Tiinamaija; Uda, Manuela; Vollenweider, Peter; Waeber, Gerard; Wallace, Chris; Walters, G. Bragi; Weedon, Michael N.; Witteman, Jacqueline C. M.; Zhang, Cuilin; Zhang, Weihua; Caulfield, Mark J.; Collins, Francis S.; Davey Smith, George; Day, Ian N. M.; Franks, Paul W.; Hattersley, Andrew T.; Hu, Frank B.; Jarvelin, Marjo-Riitta; Kong, Augustine; Kooner, Jaspal S.; Laakso, Markku; Lakatta, Edward; Mooser, Vincent; Morris, Andrew D.; Peltonen, Leena; Samani, Nilesh J.; Spector, Timothy D.; Strachan, David P.; Tanaka, Toshiko; Tuomilehto, Jaakko; Uitterlinden, André G.; van Duijn, Cornelia M.; Wareham, Nicholas J.; Watkins for the PROCARDIS consortia, Hugh; Waterworth, Dawn M.; Boehnke, Michael; Deloukas, Panos; Groop, Leif; Hunter, David J.; Thorsteinsdottir, Unnur; Schlessinger, David; Wichmann, H.-Erich; Frayling, Timothy M.; Abecasis, Gonçalo R.; Hirschhorn, Joel N.; Loos, Ruth J. F.; Stefansson, Kari; Mohlke, Karen L.; Barroso, Inês; McCarthy for the GIANT consortium, Mark I.

    2009-01-01

    To identify genetic loci influencing central obesity and fat distribution, we performed a meta-analysis of 16 genome-wide association studies (GWAS, N = 38,580) informative for adult waist circumference (WC) and waist–hip ratio (WHR). We selected 26 SNPs for follow-up, for which the evidence of association with measures of central adiposity (WC and/or WHR) was strong and disproportionate to that for overall adiposity or height. Follow-up studies in a maximum of 70,689 individuals identified two loci strongly associated with measures of central adiposity; these map near TFAP2B (WC, P = 1.9×10−11) and MSRA (WC, P = 8.9×10−9). A third locus, near LYPLAL1, was associated with WHR in women only (P = 2.6×10−8). The variants near TFAP2B appear to influence central adiposity through an effect on overall obesity/fat-mass, whereas LYPLAL1 displays a strong female-only association with fat distribution. By focusing on anthropometric measures of central obesity and fat distribution, we have identified three loci implicated in the regulation of human adiposity. PMID:19557161

  2. A genome-wide scan identifies variants in NFIB associated with metastasis in patients with osteosarcoma

    PubMed Central

    Mirabello, Lisa; Koster, Roelof; Moriarity, Branden S.; Spector, Logan G.; Meltzer, Paul S.; Gary, Joy; Machiela, Mitchell J.; Pankratz, Nathan; Panagiotou, Orestis A.; Largaespada, David; Wang, Zhaoming; Gastier-Foster, Julie M.; Gorlick, Richard; Khanna, Chand; de Toledo, Silvia Regina Caminada; Petrilli, Antonio S.; Patiño-Garcia, Ana; Sierrasesúmaga, Luis; Lecanda, Fernando; Andrulis, Irene L.; Wunder, Jay S.; Gokgoz, Nalan; Serra, Massimo; Hattinger, Claudia; Picci, Piero; Scotlandi, Katia; Flanagan, Adrienne M.; Tirabosco, Roberto; Amary, Maria Fernanda; Halai, Dina; Ballinger, Mandy L.; Thomas, David M.; Davis, Sean; Barkauskas, Donald A.; Marina, Neyssa; Helman, Lee; Otto, George M.; Becklin, Kelsie L.; Wolf, Natalie K.; Weg, Madison T.; Tucker, Margaret; Wacholder, Sholom; Fraumeni, Joseph F.; Caporaso, Neil E.; Boland, Joseph F.; Hicks, Belynda D.; Vogt, Aurelie; Burdett, Laurie; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Savage, Sharon A.

    2015-01-01

    Metastasis is the leading cause of death in osteosarcoma patients, the most common pediatric bone malignancy. We conducted a multi-stage genome-wide association study of osteosarcoma metastasis at diagnosis in 935 osteosarcoma patients to determine whether germline genetic variation contributes to risk of metastasis. We identified a SNP, rs7034162, in NFIB significantly associated with metastasis in European osteosarcoma cases, as well as in cases of African and Brazilian ancestry (meta-analysis of all cases: P=1.2×10−9, OR 2.43, 95% CI 1.83–3.24). The risk allele was significantly associated with lowered NFIB expression, which led to increased osteosarcoma cell migration, proliferation, and colony formation. Additionally, a transposon screen in mice identified a significant proportion of osteosarcomas harboring inactivating insertions in Nfib, and had lowered Nfib expression. These data suggest that germline genetic variation at rs7034162 is important in osteosarcoma metastasis, and that NFIB is an osteosarcoma metastasis susceptibility gene. PMID:26084801

  3. Genome-Wide Linkage Scan Identifies Two Novel Genetic Loci for Coronary Artery Disease: In GeneQuest Families

    PubMed Central

    Shen, Gongqing; Xi, Quansheng; Chen, Shenghan; Zhang, Zheng; Wang, Kai; Ellis, Stephen G.; Chen, Qiuyun; Topol, Eric J.; Wang, Qing K.

    2014-01-01

    Coronary artery disease (CAD) is the leading cause of death worldwide. Recent genome-wide association studies (GWAS) identified >50 common variants associated with CAD or its complication myocardial infarction (MI), but collectively they account for <20% of heritability, generating a phenomena of “missing heritability”. Rare variants with large effects may account for a large portion of missing heritability. Genome-wide linkage studies of large families and follow-up fine mapping and deep sequencing are particularly effective in identifying rare variants with large effects. Here we show results from a genome-wide linkage scan for CAD in multiplex GeneQuest families with early onset CAD and MI. Whole genome genotyping was carried out with 408 markers that span the human genome by every 10 cM and linkage analyses were performed using the affected relative pair analysis implemented in GENEHUNTER. Affected only nonparametric linkage (NPL) analysis identified two novel CAD loci with highly significant evidence of linkage on chromosome 3p25.1 (peak NPL  = 5.49) and 3q29 (NPL  = 6.84). We also identified four loci with suggestive linkage on 9q22.33, 9q34.11, 17p12, and 21q22.3 (NPL  = 3.18–4.07). These results identify novel loci for CAD and provide a framework for fine mapping and deep sequencing to identify new susceptibility genes and novel variants associated with risk of CAD. PMID:25485937

  4. A genome-wide scan for selection signatures in Nelore cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Brazilian Nelore cattle have been selected for growth traits over more than four decades. In recent years, reproductive and meat quality traits have become more important because of increasing consumption, exports and consumer demand. The identification of genomic regions altered by artificial selec...

  5. Genome-wide scans for candidate genes involved in the aquatic adaptation of dolphins.

    PubMed

    Sun, Yan-Bo; Zhou, Wei-Ping; Liu, He-Qun; Irwin, David M; Shen, Yong-Yi; Zhang, Ya-Ping

    2013-01-01

    Since their divergence from the terrestrial artiodactyls, cetaceans have fully adapted to an aquatic lifestyle, which represents one of the most dramatic transformations in mammalian evolutionary history. Numerous morphological and physiological characters of cetaceans have been acquired in response to this drastic habitat transition, such as thickened blubber, echolocation, and ability to hold their breath for a long period of time. However, knowledge about the molecular basis underlying these adaptations is still limited. The sequence of the genome of Tursiops truncates provides an opportunity for a comparative genomic analyses to examine the molecular adaptation of this species. Here, we constructed 11,838 high-quality orthologous gene alignments culled from the dolphin and four other terrestrial mammalian genomes and screened for positive selection occurring in the dolphin lineage. In total, 368 (3.1%) of the genes were identified as having undergone positive selection by the branch-site model. Functional characterization of these genes showed that they are significantly enriched in the categories of lipid transport and localization, ATPase activity, sense perception of sound, and muscle contraction, areas that are potentially related to cetacean adaptations. In contrast, we did not find a similar pattern in the cow, a closely related species. We resequenced some of the positively selected sites (PSSs), within the positively selected genes, and showed that most of our identified PSSs (50/52) could be replicated. The results from this study should have important implications for our understanding of cetacean evolution and their adaptations to the aquatic environment.

  6. Genome-Wide Scans for Candidate Genes Involved in the Aquatic Adaptation of Dolphins

    PubMed Central

    Liu, He-Qun; Irwin, David M.; Shen, Yong-Yi; Zhang, Ya-Ping

    2013-01-01

    Since their divergence from the terrestrial artiodactyls, cetaceans have fully adapted to an aquatic lifestyle, which represents one of the most dramatic transformations in mammalian evolutionary history. Numerous morphological and physiological characters of cetaceans have been acquired in response to this drastic habitat transition, such as thickened blubber, echolocation, and ability to hold their breath for a long period of time. However, knowledge about the molecular basis underlying these adaptations is still limited. The sequence of the genome of Tursiops truncates provides an opportunity for a comparative genomic analyses to examine the molecular adaptation of this species. Here, we constructed 11,838 high-quality orthologous gene alignments culled from the dolphin and four other terrestrial mammalian genomes and screened for positive selection occurring in the dolphin lineage. In total, 368 (3.1%) of the genes were identified as having undergone positive selection by the branch-site model. Functional characterization of these genes showed that they are significantly enriched in the categories of lipid transport and localization, ATPase activity, sense perception of sound, and muscle contraction, areas that are potentially related to cetacean adaptations. In contrast, we did not find a similar pattern in the cow, a closely related species. We resequenced some of the positively selected sites (PSSs), within the positively selected genes, and showed that most of our identified PSSs (50/52) could be replicated. The results from this study should have important implications for our understanding of cetacean evolution and their adaptations to the aquatic environment. PMID:23246795

  7. A genome-wide scan for selection signatures in Nellore cattle.

    PubMed

    Somavilla, A L; Sonstegard, T S; Higa, R H; Rosa, A N; Siqueira, F; Silva, L O C; Torres Júnior, R A A; Coutinho, L L; Mudadu, M A; Alencar, M M; Regitano, L C A

    2014-12-01

    Brazilian Nellore cattle (Bos indicus) have been selected for growth traits for over more than four decades. In recent years, reproductive and meat quality traits have become more important because of increasing consumption, exports and consumer demand. The identification of genome regions altered by artificial selection can potentially permit a better understanding of the biology of specific phenotypes that are useful for the development of tools designed to increase selection efficiency. Therefore, the aims of this study were to detect evidence of recent selection signatures in Nellore cattle using extended haplotype homozygosity methodology and BovineHD marker genotypes (>777,000 single nucleotide polymorphisms) as well as to identify corresponding genes underlying these signals. Thirty-one significant regions (P < 0.0001) of possible recent selection signatures were detected, and 19 of these overlapped quantitative trait loci related to reproductive traits, growth, feed efficiency, meat quality, fatty acid profiles and immunity. In addition, 545 genes were identified in regions harboring selection signatures. Within this group, 58 genes were associated with growth, muscle and adipose tissue metabolism, reproductive traits or the immune system. Using relative extended haplotype homozygosity to analyze high-density single nucleotide polymorphism marker data allowed for the identification of regions potentially under artificial selection pressure in the Nellore genome, which might be used to better understand autozygosity and the effects of selection on the Nellore genome.

  8. Using the Gene Ontology to Scan Multi-Level Gene Sets for Associations in Genome Wide Association Studies

    PubMed Central

    Schaid, Daniel J.; Sinnwell, Jason P.; Jenkins, Gregory D.; McDonnell, Shannon K.; Ingle, James N.; Kubo, Michiaki; Goss, Paul E.; Costantino, Joseph P.; Wickerham, D. Lawrence; Weinshilboum, Richard M.

    2011-01-01

    Gene-set analyses have been widely used in gene expression studies, and some of the developed methods have been extended to genome wide association studies (GWAS). Yet, complications due to linkage disequilibrium (LD) among single nucleotide polymorphisms (SNPs), and variable numbers of SNPs per gene and genes per gene-set, have plagued current approaches, often leading to ad hoc “fixes”. To overcome some of the current limitations, we developed a general approach to scan GWAS SNP data for both gene-level and gene-set analyses, building on score statistics for generalized linear models, and taking advantage of the directed acyclic graph structure of the gene ontology when creating gene-sets. However, other types of gene-set structures can be used, such as the popular Kyoto Encyclopedia of Genes and Genomes (KEGG). Our approach combines SNPs into genes, and genes into gene-sets, but assures that positive and negative effects of genes on a trait do not cancel. To control for multiple testing of many gene-sets, we use an efficient computational strategy that accounts for LD and provides accurate step-down adjusted p-values for each gene-set. Application of our methods to two different GWAS provide guidance on the potential strengths and weaknesses of our proposed gene-set analyses. PMID:22161999

  9. A powerful score test to detect positive selection in genome-wide scans

    PubMed Central

    Zhong, Ming; Lange, Kenneth; Papp, Jeanette C; Fan, Ruzong

    2010-01-01

    One of the surest signatures of recent positive selection is a local elevation of advantageous allele frequency and linkage disequilibrium (LD). We proposed to detect such hitchhiking effects by using extended stretches of homozygosity as a surrogate indicator of recent positive selection. An extended haplotype-based homozygosity score test (EHHST) was developed to detect excess homozygosity. The EHHST conditioned on existing LD and it tested the haplotype version of the Hardy–Weinberg equilibrium. Compared with existing popular tests, which usually lack clear distribution, the EHHST is asymptotically normal, which makes analysis and applications easier. In particular, the EHHST facilitates the computation of an asymptotic P-value instead of an empirical P-value, using simulations. We evaluated by simulation that the EHHST led to appropriate false-positive rates, and it had higher or similar power as the existing popular methods. The method was applied to HapMap Phase II data. We were able to replicate previous findings of strong positive selection in 17 autosome genomic regions out of 20 reported candidates. On the basis of high EHHST values and population differentiations, we identified 15 new candidate regions that could undergo recent selection. PMID:20461112

  10. Genome-wide Scan of 29,141 African Americans Finds No Evidence of Directional Selection since Admixture

    PubMed Central

    Bhatia, Gaurav; Tandon, Arti; Patterson, Nick; Aldrich, Melinda C.; Ambrosone, Christine B.; Amos, Christopher; Bandera, Elisa V.; Berndt, Sonja I.; Bernstein, Leslie; Blot, William J.; Bock, Cathryn H.; Caporaso, Neil; Casey, Graham; Deming, Sandra L.; Diver, W. Ryan; Gapstur, Susan M.; Gillanders, Elizabeth M.; Harris, Curtis C.; Henderson, Brian E.; Ingles, Sue A.; Isaacs, William; De Jager, Phillip L.; John, Esther M.; Kittles, Rick A.; Larkin, Emma; McNeill, Lorna H.; Millikan, Robert C.; Murphy, Adam; Neslund-Dudas, Christine; Nyante, Sarah; Press, Michael F.; Rodriguez-Gil, Jorge L.; Rybicki, Benjamin A.; Schwartz, Ann G.; Signorello, Lisa B.; Spitz, Margaret; Strom, Sara S.; Tucker, Margaret A.; Wiencke, John K.; Witte, John S.; Wu, Xifeng; Yamamura, Yuko; Zanetti, Krista A.; Zheng, Wei; Ziegler, Regina G.; Chanock, Stephen J.; Haiman, Christopher A.; Reich, David; Price, Alkes L.

    2014-01-01

    The extent of recent selection in admixed populations is currently an unresolved question. We scanned the genomes of 29,141 African Americans and failed to find any genome-wide-significant deviations in local ancestry, indicating no evidence of selection influencing ancestry after admixture. A recent analysis of data from 1,890 African Americans reported that there was evidence of selection in African Americans after their ancestors left Africa, both before and after admixture. Selection after admixture was reported on the basis of deviations in local ancestry, and selection before admixture was reported on the basis of allele-frequency differences between African Americans and African populations. The local-ancestry deviations reported by the previous study did not replicate in our very large sample, and we show that such deviations were expected purely by chance, given the number of hypotheses tested. We further show that the previous study’s conclusion of selection in African Americans before admixture is also subject to doubt. This is because the FST statistics they used were inflated and because true signals of unusual allele-frequency differences between African Americans and African populations would be best explained by selection that occurred in Africa prior to migration to the Americas. PMID:25242497

  11. Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture.

    PubMed

    Bhatia, Gaurav; Tandon, Arti; Patterson, Nick; Aldrich, Melinda C; Ambrosone, Christine B; Amos, Christopher; Bandera, Elisa V; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Bock, Cathryn H; Caporaso, Neil; Casey, Graham; Deming, Sandra L; Diver, W Ryan; Gapstur, Susan M; Gillanders, Elizabeth M; Harris, Curtis C; Henderson, Brian E; Ingles, Sue A; Isaacs, William; De Jager, Phillip L; John, Esther M; Kittles, Rick A; Larkin, Emma; McNeill, Lorna H; Millikan, Robert C; Murphy, Adam; Neslund-Dudas, Christine; Nyante, Sarah; Press, Michael F; Rodriguez-Gil, Jorge L; Rybicki, Benjamin A; Schwartz, Ann G; Signorello, Lisa B; Spitz, Margaret; Strom, Sara S; Tucker, Margaret A; Wiencke, John K; Witte, John S; Wu, Xifeng; Yamamura, Yuko; Zanetti, Krista A; Zheng, Wei; Ziegler, Regina G; Chanock, Stephen J; Haiman, Christopher A; Reich, David; Price, Alkes L

    2014-10-02

    The extent of recent selection in admixed populations is currently an unresolved question. We scanned the genomes of 29,141 African Americans and failed to find any genome-wide-significant deviations in local ancestry, indicating no evidence of selection influencing ancestry after admixture. A recent analysis of data from 1,890 African Americans reported that there was evidence of selection in African Americans after their ancestors left Africa, both before and after admixture. Selection after admixture was reported on the basis of deviations in local ancestry, and selection before admixture was reported on the basis of allele-frequency differences between African Americans and African populations. The local-ancestry deviations reported by the previous study did not replicate in our very large sample, and we show that such deviations were expected purely by chance, given the number of hypotheses tested. We further show that the previous study's conclusion of selection in African Americans before admixture is also subject to doubt. This is because the FST statistics they used were inflated and because true signals of unusual allele-frequency differences between African Americans and African populations would be best explained by selection that occurred in Africa prior to migration to the Americas.

  12. Wide scanning spherical antenna

    NASA Technical Reports Server (NTRS)

    Shen, Bing (Inventor); Stutzman, Warren L. (Inventor)

    1995-01-01

    A novel method for calculating the surface shapes for subreflectors in a suboptic assembly of a tri-reflector spherical antenna system is introduced, modeled from a generalization of Galindo-Israel's method of solving partial differential equations to correct for spherical aberration and provide uniform feed to aperture mapping. In a first embodiment, the suboptic assembly moves as a single unit to achieve scan while the main reflector remains stationary. A feed horn is tilted during scan to maintain the illuminated area on the main spherical reflector fixed throughout the scan thereby eliminating the need to oversize the main spherical reflector. In an alternate embodiment, both the main spherical reflector and the suboptic assembly are fixed. A flat mirror is used to create a virtual image of the suboptic assembly. Scan is achieved by rotating the mirror about the spherical center of the main reflector. The feed horn is tilted during scan to maintain the illuminated area on the main spherical reflector fixed throughout the scan.

  13. Genome-wide scan identifies TNIP1, PSORS1C1, and RHOB as novel risk loci for systemic sclerosis.

    PubMed

    Allanore, Yannick; Saad, Mohamad; Dieudé, Philippe; Avouac, Jérôme; Distler, Jorg H W; Amouyel, Philippe; Matucci-Cerinic, Marco; Riemekasten, Gabriella; Airo, Paolo; Melchers, Inga; Hachulla, Eric; Cusi, Daniele; Wichmann, H-Erich; Wipff, Julien; Lambert, Jean-Charles; Hunzelmann, Nicolas; Tiev, Kiet; Caramaschi, Paola; Diot, Elisabeth; Kowal-Bielecka, Otylia; Valentini, Gabriele; Mouthon, Luc; Czirják, László; Damjanov, Nemanja; Salvi, Erika; Conti, Costanza; Müller, Martina; Müller-Ladner, Ulf; Riccieri, Valeria; Ruiz, Barbara; Cracowski, Jean-Luc; Letenneur, Luc; Dupuy, Anne Marie; Meyer, Oliver; Kahan, André; Munnich, Arnold; Boileau, Catherine; Martinez, Maria

    2011-07-01

    Systemic sclerosis (SSc) is an orphan, complex, inflammatory disease affecting the immune system and connective tissue. SSc stands out as a severely incapacitating and life-threatening inflammatory rheumatic disease, with a largely unknown pathogenesis. We have designed a two-stage genome-wide association study of SSc using case-control samples from France, Italy, Germany, and Northern Europe. The initial genome-wide scan was conducted in a French post quality-control sample of 564 cases and 1,776 controls, using almost 500 K SNPs. Two SNPs from the MHC region, together with the 6 loci outside MHC having at least one SNP with a P<10(-5) were selected for follow-up analysis. These markers were genotyped in a post-QC replication sample of 1,682 SSc cases and 3,926 controls. The three top SNPs are in strong linkage disequilibrium and located on 6p21, in the HLA-DQB1 gene: rs9275224, P = 9.18×10(-8), OR = 0.69, 95% CI [0.60-0.79]; rs6457617, P = 1.14×10(-7) and rs9275245, P = 1.39×10(-7). Within the MHC region, the next most associated SNP (rs3130573, P = 1.86×10(-5), OR = 1.36 [1.18-1.56]) is located in the PSORS1C1 gene. Outside the MHC region, our GWAS analysis revealed 7 top SNPs (P<10(-5)) that spanned 6 independent genomic regions. Follow-up of the 17 top SNPs in an independent sample of 1,682 SSc and 3,926 controls showed associations at PSORS1C1 (overall P = 5.70×10(-10), OR:1.25), TNIP1 (P = 4.68×10(-9), OR:1.31), and RHOB loci (P = 3.17×10(-6), OR:1.21). Because of its biological relevance, and previous reports of genetic association at this locus with connective tissue disorders, we investigated TNIP1 expression. A markedly reduced expression of the TNIP1 gene and also its protein product were observed both in lesional skin tissue and in cultured dermal fibroblasts from SSc patients. Furthermore, TNIP1 showed in vitro inhibitory effects on inflammatory cytokine-induced collagen production. The genetic signal of

  14. Genome-wide scan for visceral leishmaniasis in mixed-breed dogs identifies candidate genes involved in T helper cells and macrophage signaling

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....

  15. A Genome-Wide Scan of Selective Sweeps and Association Mapping of Fruit Traits Using Microsatellite Markers in Watermelon

    PubMed Central

    Reddy, Umesh K.; Abburi, Lavanya; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Cantrell, Robert; Vajja, Venkata Gopinath; Reddy, Rishi; Tomason, Yan R.; Levi, Amnon; Wehner, Todd C.; Nimmakayala, Padma

    2015-01-01

    Our genetic diversity study uses microsatellites of known map position to estimate genome level population structure and linkage disequilibrium, and to identify genomic regions that have undergone selection during watermelon domestication and improvement. Thirty regions that showed evidence of selective sweep were scanned for the presence of candidate genes using the watermelon genome browser (www.icugi.org). We localized selective sweeps in intergenic regions, close to the promoters, and within the exons and introns of various genes. This study provided an evidence of convergent evolution for the presence of diverse ecotypes with special reference to American and European ecotypes. Our search for location of linked markers in the whole-genome draft sequence revealed that BVWS00358, a GA repeat microsatellite, is the GAGA type transcription factor located in the 5′ untranslated regions of a structure and insertion element that expresses a Cys2His2 Zinc finger motif, with presumed biological processes related to chitin response and transcriptional regulation. In addition, BVWS01708, an ATT repeat microsatellite, located in the promoter of a DTW domain-containing protein (Cla002761); and 2 other simple sequence repeats that association mapping link to fruit length and rind thickness. PMID:25425675

  16. Family based genome-wide copy number scan identifies complex rearrangements at 17q21.31 in dyslexics.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2014-10-01

    Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype.

  17. Genome-wide scan of gastrointestinal nematode resistance in closed Angus population selected for minimized influence of MHC.

    PubMed

    Kim, Eui-Soo; Sonstegard, Tad S; da Silva, Marcos V G B; Gasbarre, Louis C; Van Tassell, Curtis P

    2015-01-01

    Genetic markers associated with parasite indicator traits are ideal targets for study of marker assisted selection aimed at controlling infections that reduce herd use of anthelminthics. For this study, we collected gastrointestinal (GI) nematode fecal egg count (FEC) data from post-weaning animals of an Angus resource population challenged to a 26 week natural exposure on pasture. In all, data from 487 animals was collected over a 16 year period between 1992 and 2007, most of which were selected for a specific DRB1 allele to reduce the influence of potential allelic variant effects of the MHC locus. A genome-wide association study (GWAS) based on BovineSNP50 genotypes revealed six genomic regions located on bovine Chromosomes 3, 5, 8, 15 and 27; which were significantly associated (-log10 p=4.3) with Box-Cox transformed mean FEC (BC-MFEC). DAVID analysis of the genes within the significant genomic regions suggested a correlation between our results and annotation for genes involved in inflammatory response to infection. Furthermore, ROH and selection signature analyses provided strong evidence that the genomic regions associated BC-MFEC have not been affected by local autozygosity or recent experimental selection. These findings provide useful information for parasite resistance prediction for young grazing cattle and suggest new candidate gene targets for development of disease-modifying therapies or future studies of host response to GI parasite infection.

  18. Genome-Wide Scan of Gastrointestinal Nematode Resistance in Closed Angus Population Selected for Minimized Influence of MHC

    PubMed Central

    Kim, Eui-Soo; Sonstegard, Tad S.; da Silva, Marcos V. G. B.; Gasbarre, Louis C.; Van Tassell, Curtis P.

    2015-01-01

    Genetic markers associated with parasite indicator traits are ideal targets for study of marker assisted selection aimed at controlling infections that reduce herd use of anthelminthics. For this study, we collected gastrointestinal (GI) nematode fecal egg count (FEC) data from post-weaning animals of an Angus resource population challenged to a 26 week natural exposure on pasture. In all, data from 487 animals was collected over a 16 year period between 1992 and 2007, most of which were selected for a specific DRB1 allele to reduce the influence of potential allelic variant effects of the MHC locus. A genome-wide association study (GWAS) based on BovineSNP50 genotypes revealed six genomic regions located on bovine Chromosomes 3, 5, 8, 15 and 27; which were significantly associated (-log10 p=4.3) with Box-Cox transformed mean FEC (BC-MFEC). DAVID analysis of the genes within the significant genomic regions suggested a correlation between our results and annotation for genes involved in inflammatory response to infection. Furthermore, ROH and selection signature analyses provided strong evidence that the genomic regions associated BC-MFEC have not been affected by local autozygosity or recent experimental selection. These findings provide useful information for parasite resistance prediction for young grazing cattle and suggest new candidate gene targets for development of disease-modifying therapies or future studies of host response to GI parasite infection. PMID:25803687

  19. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption

    PubMed Central

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C.; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  20. A Genome-Wide Linkage Scan for Cleft Lip and Cleft Palate Identifies a Novel Locus on 8p11-23

    PubMed Central

    Riley, B.M.; Schultz, R.E.; Cooper, M.E.; Goldstein-McHenry, T.; Daack-Hirsch, S.; Lee, K.T.; Dragan, E.; Vieira, A.R.; Lidral, A.C.; Marazita, M.L.; Murray, J.C.

    2008-01-01

    Isolated or nonsyndromic cleft lip and palate (NS CLP) is a complex disorder resulting from multiple genetic and environmental factors. NS CLP has a birth prevalence of 1 per 500 in the Philippines where large families provide an opportunity for gene localization. Genotyping of 392 microsatellite repeat markers at 10 cM intervals over the genome was performed by the Center for Inherited Disease Research (CIDR) on 220 Filipino families with 567 affected and 1,109 unaffected family members genotyped. Among the most statistically significant results from analysis of the genome-wide scan data was a 20 cM region at 8p11-23 in which markers had LODs ≥1.0. This region on 8p11-23 has not been found in any previous genome wide scan nor does it contain any of the candidate genes widely studied in CLP. Fine mapping in 8p11-23 was done in the 220 families plus an additional 51 families, using SNP markers from 10 known genes (FGFR1, NRG1, FZD3, SLC8A1, PPP3CC, EPHX2, BNIP3L, EGR3, PPP2R2A, and NAT1) within the 20 cM region of 8p11-23. Linkage and association analyses of these SNPs yield suggestive results for markers in FGFR1 (recessive multipoint HLOD 1.07) and BAG4 (recessive multipoint HLOD 1.31). PMID:17366557

  1. Evolution of the pygmy phenotype: evidence of positive selection fro genome-wide scans in African, Asian, and Melanesian pygmies.

    PubMed

    Migliano, Andrea Bamberg; Romero, Irene Gallego; Metspalu, Mait; Leavesley, Matthew; Pagani, Luca; Antao, Tiago; Huang, Da-Wei; Sherman, Brad T; Siddle, Katharine; Scholes, Clarissa; Hudjashov, Georgi; Kaitokai, Elton; Babalu, Avis; Belatti, Maggie; Cagan, Alex; Hopkinshaw, Byrony; Shaw, Colin; Nelis, Mari; Metspalu, Ene; Mägi, Reedik; Lempicki, Richard A; Villems, Richard; Lahr, Marta Mirazon; Kivisild, Toomas

    2013-01-01

    Human pygmy populations inhabit different regions of the world, from Africa to Melanesia. In Asia, short-statured populations are often referred to as "negritos." Their short stature has been interpreted as a consequence of thermoregulatory, nutritional, and/or locomotory adaptations to life in tropical forests. A more recent hypothesis proposes that their stature is the outcome of a life history trade-off in high-mortality environments, where early reproduction is favored and, consequently, early sexual maturation and early growth cessation have coevolved. Some serological evidence of deficiencies in the growth hormone/insulin-like growth factor axis have been previously associated with pygmies' short stature. Using genome-wide single-nucleotide polymorphism genotype data, we first tested whether different negrito groups living in the Philippines and Papua New Guinea are closely related and then investigated genomic signals of recent positive selection in African, Asian, and Papuan pygmy populations. We found that negritos in the Philippines and Papua New Guinea are genetically more similar to their nonpygmy neighbors than to one another and have experienced positive selection at different genes. These results indicate that geographically distant pygmy groups are likely to have evolved their short stature independently. We also found that selection on common height variants is unlikely to explain their short stature and that different genes associated with growth, thyroid function, and sexual development are under selection in different pygmy groups.

  2. Genome Wide Association Studies

    NASA Astrophysics Data System (ADS)

    Sebastiani, Paola; Solovieff, Nadia

    The availability of high throughput technology for parallel genotyping has opened the field of genetics to genome-wide association studies (GWAS). These studies generate massive amount of genetic data that challenge investigators with issues related to data management, statistical analysis of large data sets, visualization, and annotation of results. We will review the common approach to analysis of GWAS data and then discuss options to learn more from these data.

  3. Asymmetric Introgression in the Horticultural Living Fossil Cycas Sect. Asiorientales Using a Genome-Wide Scanning Approach

    PubMed Central

    Chiang, Yu-Chung; Huang, Bing-Hong; Chang, Chun-Wen; Wan, Yu-Ting; Lai, Shih-Jie; Huang, Shong; Liao, Pei-Chun

    2013-01-01

    The Asian cycads are mostly allopatric, distributed in small population sizes. Hybridization between allopatric species provides clues in determining the mechanism of species divergence. Horticultural introduction provides the chance of interspecific gene flow between allopatric species. Two allopatrically eastern Asian Cycas sect. Asiorientales species, C. revoluta and C. taitungensis, which are widely distributed in Ryukyus and Fujian Province and endemic to Taiwan, respectively, were planted in eastern Taiwan for horticultural reason. Higher degrees of genetic admixture in cultivated samples than wild populations in both cycad species were detected based on multilocus scans by neutral AFLP markers. Furthermore, bidirectional but asymmetric introgression by horticultural introduction of C. revoluta is evidenced by the reanalyses of species associated loci, which are assumed to be diverged after species divergence. Partial loci introgressed from native cycad to the invaders were also detected at the loci of strong species association. Consistent results tested by all neutral loci, and the species-associated loci, specify the recent introgression from the paradox of sharing of ancestral polymorphisms. Phenomenon of introgression of cultivated cycads implies niche conservation among two geographic-isolated cycads, even though the habitats of the extant wild populations of two species are distinct. PMID:23591840

  4. Asymmetric introgression in the horticultural living fossil cycas sect. Asiorientales using a genome-wide scanning approach.

    PubMed

    Chiang, Yu-Chung; Huang, Bing-Hong; Chang, Chun-Wen; Wan, Yu-Ting; Lai, Shih-Jie; Huang, Shong; Liao, Pei-Chun

    2013-04-15

    The Asian cycads are mostly allopatric, distributed in small population sizes. Hybridization between allopatric species provides clues in determining the mechanism of species divergence. Horticultural introduction provides the chance of interspecific gene flow between allopatric species. Two allopatrically eastern Asian Cycas sect. Asiorientales species, C. revoluta and C. taitungensis, which are widely distributed in Ryukyus and Fujian Province and endemic to Taiwan, respectively, were planted in eastern Taiwan for horticultural reason. Higher degrees of genetic admixture in cultivated samples than wild populations in both cycad species were detected based on multilocus scans by neutral AFLP markers. Furthermore, bidirectional but asymmetric introgression by horticultural introduction of C. revoluta is evidenced by the reanalyses of species associated loci, which are assumed to be diverged after species divergence. Partial loci introgressed from native cycad to the invaders were also detected at the loci of strong species association. Consistent results tested by all neutral loci, and the species-associated loci, specify the recent introgression from the paradox of sharing of ancestral polymorphisms. Phenomenon of introgression of cultivated cycads implies niche conservation among two geographic-isolated cycads, even though the habitats of the extant wild populations of two species are distinct.

  5. Genome-wide SNP-Based Linkage Scan Identifies a Locus on 8q24 for an Age-Related Hearing Impairment Trait

    PubMed Central

    Huyghe, Jeroen R.; Van Laer, Lut; Hendrickx, Jan-Jaap; Fransen, Erik; Demeester, Kelly; Topsakal, Vedat; Kunst, Sylvia; Manninen, Minna; Jensen, Mona; Bonaconsa, Amanda; Mazzoli, Manuela; Baur, Manuela; Hannula, Samuli; Mäki-Torkko, Elina; Espeso, Angeles; Van Eyken, Els; Flaquer, Antonia; Becker, Christian; Stephens, Dafydd; Sorri, Martti; Orzan, Eva; Bille, Michael; Parving, Agnete; Pyykkö, Ilmari; Cremers, Cor W.R.J.; Kremer, Hannie; Van de Heyning, Paul H.; Wienker, Thomas F.; Nürnberg, Peter; Pfister, Markus; Van Camp, Guy

    2008-01-01

    Age-related hearing impairment (ARHI), or presbycusis, is a very common multifactorial disorder. Despite the knowledge that genetics play an important role in the etiology of human ARHI as revealed by heritability studies, to date, its precise genetic determinants remain elusive. Here we report the results of a cross-sectional family-based genetic study employing audiometric data. By using principal component analysis, we were able to reduce the dimensionality of this multivariate phenotype while capturing most of the variation and retaining biologically important features of the audiograms. We conducted a genome-wide association as well as a linkage scan with high-density SNP microarrays. Because of the presence of genetic population substructure, association testing was stratified after which evidence was combined by meta-analysis. No association signals reaching genome-wide significance were detected. Linkage analysis identified a linkage peak on 8q24.13-q24.22 for a trait correlated to audiogram shape. The signal reached genome-wide significance, as assessed by simulations. This finding represents the first locus for an ARHI trait. PMID:18760390

  6. Genome-wide SNP-based linkage scan identifies a locus on 8q24 for an age-related hearing impairment trait.

    PubMed

    Huyghe, Jeroen R; Van Laer, Lut; Hendrickx, Jan-Jaap; Fransen, Erik; Demeester, Kelly; Topsakal, Vedat; Kunst, Sylvia; Manninen, Minna; Jensen, Mona; Bonaconsa, Amanda; Mazzoli, Manuela; Baur, Manuela; Hannula, Samuli; Mäki-Torkko, Elina; Espeso, Angeles; Van Eyken, Els; Flaquer, Antonia; Becker, Christian; Stephens, Dafydd; Sorri, Martti; Orzan, Eva; Bille, Michael; Parving, Agnete; Pyykkö, Ilmari; Cremers, Cor W R J; Kremer, Hannie; Van de Heyning, Paul H; Wienker, Thomas F; Nürnberg, Peter; Pfister, Markus; Van Camp, Guy

    2008-09-01

    Age-related hearing impairment (ARHI), or presbycusis, is a very common multifactorial disorder. Despite the knowledge that genetics play an important role in the etiology of human ARHI as revealed by heritability studies, to date, its precise genetic determinants remain elusive. Here we report the results of a cross-sectional family-based genetic study employing audiometric data. By using principal component analysis, we were able to reduce the dimensionality of this multivariate phenotype while capturing most of the variation and retaining biologically important features of the audiograms. We conducted a genome-wide association as well as a linkage scan with high-density SNP microarrays. Because of the presence of genetic population substructure, association testing was stratified after which evidence was combined by meta-analysis. No association signals reaching genome-wide significance were detected. Linkage analysis identified a linkage peak on 8q24.13-q24.22 for a trait correlated to audiogram shape. The signal reached genome-wide significance, as assessed by simulations. This finding represents the first locus for an ARHI trait.

  7. Genome-wide scan for linkage to schizophrenia in a Spanish-origin cohort from Costa Rica.

    PubMed

    DeLisi, Lynn E; Mesen, Andrea; Rodriguez, Carlos; Bertheau, Arturo; LaPrade, Beatrice; Llach, Michelle; Riondet, Silvina; Razi, Kamran; Relja, Margaret; Byerley, William; Sherrington, Robin

    2002-07-08

    Genetic isolates have been useful cohorts in which to search for genes underlying disorders of unknown pathology. One such cohort is thought to exist in the Central Valley of Costa Rica surrounding the city of San Jose. Previous investigators identified a rare dominant gene for hereditary deafness in this population, and a suggestive linkage of severe bipolar psychosis has been reported in another study. Ninety-nine families with at least one pair of siblings affected with schizophrenia or a schizophrenia-spectrum diagnosis had clinical evaluations and DNA collected for genotyping. The Marshfield Medical Research Foundation (NHLBI) Mammalian Genotyping Service performed all genotyping using 404 short-tandem repeat polymorphic markers (STRPs) spaced on average 10 cM apart. Data were analyzed using the nonparametric program, GeneHunterPlus. The population structure was investigated using the STRUCT program. No region was found with genome-wide significance for linkage. Using a phenotype of schizophrenia plus schizoaffective disorder, the highest maximum likelihood score (MLS) observed was 1.78 (P < 0.004) at 176.6 cM from pter on chromosome 5q, an area previously implicated by some other groups. In addition, five regions on chromosomes 1p, 2p, 2q, 14p, and 8p had MLSs above 1.0. All other regions produced scores below 1.0. Population genetic analysis reveals no evidence for population substructure, for admixture with other populations, such as Amerindians, or for inbreeding in the parental generation. The latter casts some doubt on this population being an isolate, although there was evidence of inbreeding among the offspring.

  8. Detecting Loci under recent positive selection in dairy and beef cattle by combining different genome-wide scan methods

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different tests, it might be a strong evidence o...

  9. A genome-wide SNP scan reveals novel loci for egg production and quality traits in white leghorn and brown-egg dwarf layers.

    PubMed

    Liu, Wenbo; Li, Dongfeng; Liu, Jianfeng; Chen, Sirui; Qu, Lujiang; Zheng, Jiangxia; Xu, Guiyun; Yang, Ning

    2011-01-01

    Availability of the complete genome sequence as well as high-density SNP genotyping platforms allows genome-wide association studies (GWAS) in chickens. A high-density SNP array containing 57,636 markers was employed herein to identify associated variants underlying egg production and quality traits within two lines of chickens, i.e., White Leghorn and brown-egg dwarf layers. For each individual, age at first egg (AFE), first egg weight (FEW), and number of eggs (EN) from 21 to 56 weeks of age were recorded, and egg quality traits including egg weight (EW), eggshell weight (ESW), yolk weight (YW), eggshell thickness (EST), eggshell strength (ESS), albumen height(AH) and Haugh unit(HU) were measured at 40 and 60 weeks of age. A total of 385 White Leghorn females and 361 brown-egg dwarf dams were selected to be genotyped. The genome-wide scan revealed 8 SNPs showing genome-wise significant (P<1.51E-06, Bonferroni correction) association with egg production and quality traits under the Fisher's combined probability method. Some significant SNPs are located in known genes including GRB14 and GALNT1 that can impact development and function of ovary, but more are located in genes with unclear functions in layers, and need to be studied further. Many chromosome-wise significant SNPs were also detected in this study and some of them are located in previously reported QTL regions. Most of loci detected in this study are novel and the follow-up replication studies may be needed to further confirm the functional significance for these newly identified SNPs.

  10. Sex-limited genome-wide linkage scan for body mass index in an unselected sample of 933 Australian twin families.

    PubMed

    Cornes, Belinda K; Medland, Sarah E; Ferreira, Manuel A R; Morley, Katherine I; Duffy, David L; Heijmans, Bastiaan T; Montgomery, Grant W; Martin, Nicholas G

    2005-12-01

    Genes involved in pathways regulating body weight may operate differently in men and women. To determine whether sex-limited genes influence the obesity-related phenotype body mass index (BMI), we have conducted a general nonscalar sex-limited genome-wide linkage scan using variance components analysis in Mx (Neale, 2002). BMI measurements and genotypic data were available for 2053 Australian female and male adult twins and their siblings from 933 families. Clinical measures of BMI were available for 64.4% of these individuals, while only self-reported measures were available for the remaining participants. The mean age of participants was 39.0 years of age (SD 12.1 years). The use of a sex-limited linkage model identified areas on the genome where quantitative trait loci (QTL) effects differ between the sexes, particularly on chromosome 8 and 20, providing us with evidence that some of the genes responsible for BMI may have different effects in men and women. Our highest linkage peak was observed at 12q24 (-log10p = 3.02), which was near the recommended threshold for suggestive linkage (-log10p = 3.13). Previous studies have found evidence for a quantitative trait locus on 12q24 affecting BMI in a wide range of populations, and candidate genes for noninsulin-dependent diabetes mellitus, a consequence of obesity, have also been mapped to this region. We also identified many peaks near a -log10p of 2 (threshold for replicating an existing finding) in many areas across the genome that are within regions previously identified by other studies, as well as in locations that harbor genes known to influence weight regulation.

  11. A genome-wide association scan implicates DCHS2, RUNX2, GLI3, PAX1 and EDAR in human facial variation.

    PubMed

    Adhikari, Kaustubh; Fuentes-Guajardo, Macarena; Quinto-Sánchez, Mirsha; Mendoza-Revilla, Javier; Camilo Chacón-Duque, Juan; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Lozano, Rodrigo Barquera; Pérez, Gastón Macín; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Cheeseman, Michael; Rosique, Javier; Bedoya, Gabriel; Rothhammer, Francisco; Headon, Denis; González-José, Rolando; Balding, David; Ruiz-Linares, Andrés

    2016-05-19

    We report a genome-wide association scan for facial features in ∼6,000 Latin Americans. We evaluated 14 traits on an ordinal scale and found significant association (P values<5 × 10(-8)) at single-nucleotide polymorphisms (SNPs) in four genomic regions for three nose-related traits: columella inclination (4q31), nose bridge breadth (6p21) and nose wing breadth (7p13 and 20p11). In a subsample of ∼3,000 individuals we obtained quantitative traits related to 9 of the ordinal phenotypes and, also, a measure of nasion position. Quantitative analyses confirmed the ordinal-based associations, identified SNPs in 2q12 associated to chin protrusion, and replicated the reported association of nasion position with SNPs in PAX3. Strongest association in 2q12, 4q31, 6p21 and 7p13 was observed for SNPs in the EDAR, DCHS2, RUNX2 and GLI3 genes, respectively. Associated SNPs in 20p11 extend to PAX1. Consistent with the effect of EDAR on chin protrusion, we documented alterations of mandible length in mice with modified Edar funtion.

  12. A genome-wide association scan implicates DCHS2, RUNX2, GLI3, PAX1 and EDAR in human facial variation

    PubMed Central

    Adhikari, Kaustubh; Fuentes-Guajardo, Macarena; Quinto-Sánchez, Mirsha; Mendoza-Revilla, Javier; Camilo Chacón-Duque, Juan; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Lozano, Rodrigo Barquera; Pérez, Gastón Macín; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C.; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M.; Bortolini, Maria- Cátira; Canizales-Quinteros, Samuel; Cheeseman, Michael; Rosique, Javier; Bedoya, Gabriel; Rothhammer, Francisco; Headon, Denis; González-José, Rolando; Balding, David; Ruiz-Linares, Andrés

    2016-01-01

    We report a genome-wide association scan for facial features in ∼6,000 Latin Americans. We evaluated 14 traits on an ordinal scale and found significant association (P values<5 × 10−8) at single-nucleotide polymorphisms (SNPs) in four genomic regions for three nose-related traits: columella inclination (4q31), nose bridge breadth (6p21) and nose wing breadth (7p13 and 20p11). In a subsample of ∼3,000 individuals we obtained quantitative traits related to 9 of the ordinal phenotypes and, also, a measure of nasion position. Quantitative analyses confirmed the ordinal-based associations, identified SNPs in 2q12 associated to chin protrusion, and replicated the reported association of nasion position with SNPs in PAX3. Strongest association in 2q12, 4q31, 6p21 and 7p13 was observed for SNPs in the EDAR, DCHS2, RUNX2 and GLI3 genes, respectively. Associated SNPs in 20p11 extend to PAX1. Consistent with the effect of EDAR on chin protrusion, we documented alterations of mandible length in mice with modified Edar funtion. PMID:27193062

  13. A genome-wide scan for evidence of selection in a maize population under long-term artificial selection for ear number.

    PubMed

    Beissinger, Timothy M; Hirsch, Candice N; Vaillancourt, Brieanne; Deshpande, Shweta; Barry, Kerrie; Buell, C Robin; Kaeppler, Shawn M; Gianola, Daniel; de Leon, Natalia

    2014-03-01

    A genome-wide scan to detect evidence of selection was conducted in the Golden Glow maize long-term selection population. The population had been subjected to selection for increased number of ears per plant for 30 generations, with an empirically estimated effective population size ranging from 384 to 667 individuals and an increase of more than threefold in the number of ears per plant. Allele frequencies at >1.2 million single-nucleotide polymorphism loci were estimated from pooled whole-genome resequencing data, and FST values across sliding windows were employed to assess divergence between the population preselection and the population postselection. Twenty-eight highly divergent regions were identified, with half of these regions providing gene-level resolution on potentially selected variants. Approximately 93% of the divergent regions do not demonstrate a significant decrease in heterozygosity, which suggests that they are not approaching fixation. Also, most regions display a pattern consistent with a soft-sweep model as opposed to a hard-sweep model, suggesting that selection mostly operated on standing genetic variation. For at least 25% of the regions, results suggest that selection operated on variants located outside of currently annotated coding regions. These results provide insights into the underlying genetic effects of long-term artificial selection and identification of putative genetic elements underlying number of ears per plant in maize.

  14. Genetic analysis of albuminuria in the aging mouse and concordance with loci for diabetic nephropathy found in a genome-wide association scan

    PubMed Central

    Tsaih, Shirng-Wern; Pezzolesi, Marcus G.; Yuan, Rong; Warram, James H.; Krolewski, Andrzej S.; Korstanje, Ron

    2009-01-01

    Aging is a physiological process involving both genetic factors and environmental agents that can lead to function loss in organs. In the kidney, aging can cause leakage of proteins in urine, starting with albumin. Discovering molecular mechanisms responsible for albuminuria during aging could offer new perspectives on the etiology of this abnormality. Haplotype association mapping in the mouse is a novel approach which uses the haplotypes of the relatively closely related mouse inbred strains and the variation of the phenotypes among these strains to find associations between haplotypes and phenotype. Albumin-to-creatinine ratios, measures of urinary albumin excretion, were determined in 30 inbred mouse strains at 12, 18, and 24 months of age. To determine genetic loci that are involved in albuminuria, haplotype association mapping was performed for males and females separately at all 3 time points using a set of 63,222 SNPs. One significant and 8 suggestive loci were identified, some of which map to previously identified loci for traits associated with kidney damage in the mouse, but with a much higher resolution, which narrowed the mapped loci. These 9 loci were then investigated in the data of the genome-wide association scan for diabetic nephropathy in human type 1 diabetes. Two of the 9 mouse loci were found to be significantly associated with diabetic nephropathy, suggesting common underlying genes predisposing to kidney disease in mice and humans. PMID:19924099

  15. Genome-wide scan for serum ghrelin detects linkage on chromosome 1p36 in Hispanic children: results from the Viva La Familia study.

    PubMed

    Voruganti, V Saroja; Göring, Harald H H; Diego, Vincent P; Cai, Guowen; Mehta, Nitesh R; Haack, Karin; Cole, Shelley A; Butte, Nancy F; Comuzzie, Anthony G

    2007-10-01

    This study was conducted to investigate genetic influence on serum ghrelin and its relationship with adiposity-related phenotypes in Hispanic children (n=1030) from the Viva La Familia study (VFS). Anthropometric measurements and levels of serum ghrelin were estimated and genetic analyses conducted according to standard procedures. Mean age, body mass index (BMI), and serum ghrelin were 11+/-0.13 y, 25+/-0.24 kg/m2 and 38+/-0.5 ng/mL, respectively. Significant heritabilities (p<0.001) were obtained for BMI, weight, fat mass, percent fat, waist circumference, waist-to-height ratio, and ghrelin. Bivariate analyses of ghrelin with adiposity traits showed significant negative genetic correlations (p<0.0001) with weight, BMI, fat mass, percent fat, waist circumference, and waist-to-height ratio. A genome-wide scan for ghrelin detected significant linkage on chromosome 1p36.2 between STR markers D1S2697 and D1S199 (LOD=3.2). The same region on chromosome 1 was the site of linkage for insulin (LOD=3.3), insulinlike growth factor binding protein 1 (IGFBP1) (LOD=3.4), homeostatic model assessment method (HOMA) (LOD=2.9), and C-peptide (LOD=2.0). Several family-based studies have reported linkages for obesity-related phenotypes in the region of 1p36. These results indicate the importance of this region in relation to adiposity in children from the VFS.

  16. An Islet-Targeted Genome-Wide Association Scan Identifies Novel Genes Implicated in Cytokine-Mediated Islet Stress in Type 2 Diabetes

    PubMed Central

    Sharma, Poonam R.; Mackey, Aaron J.; Dejene, Eden A.; Ramadan, James W.; Langefeld, Carl D.; Palmer, Nicholette D.; Taylor, Kent D.; Wagenknecht, Lynne E.; Watanabe, Richard M.; Rich, Stephen S.

    2015-01-01

    Genome-wide association studies in human type 2 diabetes (T2D) have renewed interest in the pancreatic islet as a contributor to T2D risk. Chronic low-grade inflammation resulting from obesity is a risk factor for T2D and a possible trigger of β-cell failure. In this study, microarray data were collected from mouse islets after overnight treatment with cytokines at concentrations consistent with the chronic low-grade inflammation in T2D. Genes with a cytokine-induced change of >2-fold were then examined for associations between single nucleotide polymorphisms and the acute insulin response to glucose (AIRg) using data from the Genetics Underlying Diabetes in Hispanics (GUARDIAN) Consortium. Significant evidence of association was found between AIRg and single nucleotide polymorphisms in Arap3 (5q31.3), F13a1 (6p25.3), Klhl6 (3q27.1), Nid1 (1q42.3), Pamr1 (11p13), Ripk2 (8q21.3), and Steap4 (7q21.12). To assess the potential relevance to islet function, mouse islets were exposed to conditions modeling low-grade inflammation, mitochondrial stress, endoplasmic reticulum (ER) stress, glucotoxicity, and lipotoxicity. RT-PCR revealed that one or more forms of stress significantly altered expression levels of all genes except Arap3. Thapsigargin-induced ER stress up-regulated both Pamr1 and Klhl6. Three genes confirmed microarray predictions of significant cytokine sensitivity: F13a1 was down-regulated 3.3-fold by cytokines, Ripk2 was up-regulated 1.5- to 3-fold by all stressors, and Steap4 was profoundly cytokine sensitive (167-fold up-regulation). Three genes were thus closely associated with low-grade inflammation in murine islets and also with a marker for islet function (AIRg) in a diabetes-prone human population. This islet-targeted genome-wide association scan identified several previously unrecognized candidate genes related to islet dysfunction during the development of T2D. PMID:26018251

  17. Using an Inbred Horse Breed in a High Density Genome-Wide Scan for Genetic Risk Factors of Insect Bite Hypersensitivity (IBH)

    PubMed Central

    Velie, Brandon D.; Shrestha, Merina; Franҫois, Liesbeth; Schurink, Anouk; Tesfayonas, Yohannes G.; Stinckens, Anneleen; Blott, Sarah; Ducro, Bart J.; Mikko, Sofia; Thomas, Ruth; Swinburne, June E.; Sundqvist, Marie; Eriksson, Susanne; Buys, Nadine; Lindgren, Gabriella

    2016-01-01

    While susceptibility to hypersensitive reactions is a common problem amongst humans and animals alike, the population structure of certain animal species and breeds provides a more advantageous route to better understanding the biology underpinning these conditions. The current study uses Exmoor ponies, a highly inbred breed of horse known to frequently suffer from insect bite hypersensitivity, to identify genomic regions associated with a type I and type IV hypersensitive reaction. A total of 110 cases and 170 controls were genotyped on the 670K Axiom Equine Genotyping Array. Quality control resulted in 452,457 SNPs and 268 individuals being tested for association. Genome-wide association analyses were performed using the GenABEL package in R and resulted in the identification of two regions of interest on Chromosome 8. The first region contained the most significant SNP identified, which was located in an intron of the DCC netrin 1 receptor gene. The second region identified contained multiple top SNPs and encompassed the PIGN, KIAA1468, TNFRSF11A, ZCCHC2, and PHLPP1 genes. Although additional studies will be needed to validate the importance of these regions in horses and the relevance of these regions in other species, the knowledge gained from the current study has the potential to be a step forward in unraveling the complex nature of hypersensitive reactions. PMID:27070818

  18. Using an Inbred Horse Breed in a High Density Genome-Wide Scan for Genetic Risk Factors of Insect Bite Hypersensitivity (IBH).

    PubMed

    Velie, Brandon D; Shrestha, Merina; Franҫois, Liesbeth; Schurink, Anouk; Tesfayonas, Yohannes G; Stinckens, Anneleen; Blott, Sarah; Ducro, Bart J; Mikko, Sofia; Thomas, Ruth; Swinburne, June E; Sundqvist, Marie; Eriksson, Susanne; Buys, Nadine; Lindgren, Gabriella

    2016-01-01

    While susceptibility to hypersensitive reactions is a common problem amongst humans and animals alike, the population structure of certain animal species and breeds provides a more advantageous route to better understanding the biology underpinning these conditions. The current study uses Exmoor ponies, a highly inbred breed of horse known to frequently suffer from insect bite hypersensitivity, to identify genomic regions associated with a type I and type IV hypersensitive reaction. A total of 110 cases and 170 controls were genotyped on the 670K Axiom Equine Genotyping Array. Quality control resulted in 452,457 SNPs and 268 individuals being tested for association. Genome-wide association analyses were performed using the GenABEL package in R and resulted in the identification of two regions of interest on Chromosome 8. The first region contained the most significant SNP identified, which was located in an intron of the DCC netrin 1 receptor gene. The second region identified contained multiple top SNPs and encompassed the PIGN, KIAA1468, TNFRSF11A, ZCCHC2, and PHLPP1 genes. Although additional studies will be needed to validate the importance of these regions in horses and the relevance of these regions in other species, the knowledge gained from the current study has the potential to be a step forward in unraveling the complex nature of hypersensitive reactions.

  19. Multipoint genome-wide linkage scan for nonword repetition in a multigenerational family further supports chromosome 13q as a locus for verbal trait disorders.

    PubMed

    Truong, D T; Shriberg, L D; Smith, S D; Chapman, K L; Scheer-Cohen, A R; DeMille, M M C; Adams, A K; Nato, A Q; Wijsman, E M; Eicher, J D; Gruen, J R

    2016-12-01

    Verbal trait disorders encompass a wide range of conditions and are marked by deficits in five domains that impair a person's ability to communicate: speech, language, reading, spelling, and writing. Nonword repetition is a robust endophenotype for verbal trait disorders that is sensitive to cognitive processes critical to verbal development, including auditory processing, phonological working memory, and motor planning and programming. In the present study, we present a six-generation extended pedigree with a history of verbal trait disorders. Using genome-wide multipoint variance component linkage analysis of nonword repetition, we identified a region spanning chromosome 13q14-q21 with LOD = 4.45 between 52 and 55 cM, spanning approximately 5.5 Mb on chromosome 13. This region overlaps with SLI3, a locus implicated in reading disability in families with a history of specific language impairment. Our study of a large multigenerational family with verbal trait disorders further implicates the SLI3 region in verbal trait disorders. Future studies will further refine the specific causal genetic factors in this locus on chromosome 13q that contribute to language traits.

  20. Genome-wide association scan identifies a colorectal cancer susceptibility locus on 11q23 and replicates risk loci at 8q24 and 18q21

    PubMed Central

    Tenesa, Albert; Farrington, Susan M; Prendergast, James GD; Porteous, Mary E; Walker, Marion; Haq, Naila; Barnetson, Rebecca A; Theodoratou, Evropi; Cetnarskyj, Roseanne; Cartwright, Nicola; Semple, Colin; Clark, Andrew J; Reid, Fiona JL; Smith, Lorna A; Kavoussanakis, Kostas; Koessler, Thibaud; Pharoah, Paul DP; Buch, Stephan; Schafmayer, Clemens; Tepel, Jürgen; Schreiber, Stefan; Völzke, Henry; Schmidt, Carsten O; Hampe, Jochen; Chang-Claude, Jenny; Hoffmeister, Michael; Brenner, Hermann; Wilkening, Stefan; Canzian, Federico; Capella, Gabriel; Moreno, Victor; Deary, Ian J; Starr, John M; Tomlinson, Ian PM; Kemp, Zoe; Howarth, Kimberley; Carvajal-Carmona, Luis; Webb, Emily; Broderick, Peter; Vijayakrishnan, Jayaram; Houlston, Richard S; Rennert, Gad; Ballinger, Dennis; Rozek, Laura; Gruber, Stephen B; Matsuda, Koichi; Kidokoro, Tomohide; Nakamura, Yusuke; Zanke, Brent W; Greenwood, Celia MT; Rangrej, Jagadish; Kustra, Rafal; Montpetit, Alexandre; Hudson, Thomas J; Gallinger, Steven; Campbell, Harry; Dunlop, Malcolm G

    2009-01-01

    In a genome-wide association study to identify loci associated with colorectal cancer (CRC) risk, we genotyped 555,510 SNPs in 1,012 early-onset Scottish CRC cases and 1,012 controls (phase 1.) In phase 2, we genotyped the 15,008 highest-ranked SNPs in 2,057 Scottish cases and 2,111 controls. We then genotyped the five highest-ranked SNPs from the joint phase 1 and 2 analysis in 14,500 cases and 13,294 controls from seven populations, and identified a previously unreported association, rs3802842 on 11q23 (OR = 1.1; P = 5.8 × 10-10), showing population differences in risk. We also replicated and fine-mapped associations at 8q24 (rs7014346; OR = 1.19; P = 8.6 × 10-26) and 18q21 (rs4939827; OR = 1.2; P = 7.8 × 10-28). Risk was greater for rectal than for colon cancer for rs3802842 (P < 0.008) and rs4939827 (P < 0.009). Carrying all six possible risk alleles yielded OR = 2.6 (95% CI = 1.75-3.89) for CRC. These findings extend our understanding of the role of common genetic variation in CRC etiology. PMID:18372901

  1. Genome-wide scan identifies CDH13 as a novel susceptibility locus contributing to blood pressure determination in two European populations.

    PubMed

    Org, Elin; Eyheramendy, Susana; Juhanson, Peeter; Gieger, Christian; Lichtner, Peter; Klopp, Norman; Veldre, Gudrun; Döring, Angela; Viigimaa, Margus; Sõber, Siim; Tomberg, Kärt; Eckstein, Gertrud; Kelgo, Piret; Rebane, Tiina; Shaw-Hawkins, Sue; Howard, Philip; Onipinla, Abiodun; Dobson, Richard J; Newhouse, Stephen J; Brown, Morris; Dominiczak, Anna; Connell, John; Samani, Nilesh; Farrall, Martin; Caulfield, Mark J; Munroe, Patricia B; Illig, Thomas; Wichmann, H-Erich; Meitinger, Thomas; Laan, Maris

    2009-06-15

    Hypertension is a complex disease that affects a large proportion of adult population. Although approximately half of the inter-individual variance in blood pressure (BP) level is heritable, identification of genes responsible for its regulation has remained challenging. Genome-wide association study (GWAS) is a novel approach to search for genetic variants contributing to complex diseases. We conducted GWAS for three BP traits [systolic and diastolic blood pressure (SBP and DBP); hypertension (HYP)] in the Kooperative Gesundheitsforschung in der Region Augsburg (KORA) S3 cohort (n = 1644) recruited from general population in Southern Germany. GWAS with 395,912 single nucleotide polymorphisms (SNPs) identified an association between BP traits and a common variant rs11646213 (T/A) upstream of the CDH13 gene at 16q23.3. The initial associations with HYP and DBP were confirmed in two other European population-based cohorts: KORA S4 (Germans) and HYPEST (Estonians). The associations between rs11646213 and three BP traits were replicated in combined analyses (dominant model: DBP, P = 5.55 x 10(-5), effect -1.40 mmHg; SBP, P = 0.007, effect -1.56 mmHg; HYP, P = 5.30 x 10(-8), OR = 0.67). Carriers of the minor allele A had a decreased risk of hypertension. A non-significant trend for association was also detected with severe family based hypertension in the BRIGHT sample (British). The novel susceptibility locus, CDH13, encodes for an adhesion glycoprotein T-cadherin, a regulator of vascular wall remodeling and angiogenesis. Its function is compatible with the BP biology and may improve the understanding of the pathogenesis of hypertension.

  2. Genome-wide indel/SSR scanning reveals significant loci associated with excellent agronomic traits of a cabbage (Brassica oleracea) elite parental line ‘01–20’

    PubMed Central

    Lv, Honghao; Wang, Qingbiao; Han, Fengqing; Liu, Xing; Fang, Zhiyuan; Yang, Limei; Zhuang, Mu; Liu, Yumei; Li, Zhansheng; Zhang, Yangyong

    2017-01-01

    Elite parental lines are of great significance to crop breeding. To discover unique genomic loci associated with excellent economic traits in the elite cabbage inbred-line ‘01–20’, we performed comparisons of phenotypes as well as whole-genome insertion-deletion/simple sequence repeat loci between ‘01–20’ and each of its five sister lines. ‘01–20’ has a range of excellent agronomic traits, including early-maturing, and improvements in plant type and leaf colour. Eight unique loci were discovered for ‘01–20’ and ‘01-07-258’, another elite line similar to ‘01–20’ at the whole-genome level. In addition, two excellent double-haploid lines derived from a cross of ‘01–20’ also inherited these loci. Based on the quantitative trait locus association results, five of these loci were found to be associated with important agronomic traits, which could explain why the elite parent ‘01–20’ possesses greener outer leaves, a more compact and upright plant-type, rounder head, shorter core length, and better taste. Additionally, some of these loci have clustering effects for quantitative trait loci associated with different traits; therefore, important genes in these regions were analysed. The obtained results should enable marker-assisted multi-trait selection at the whole-genome level in cabbage breeding and provide insights into significant genome loci and their breeding effects. PMID:28164997

  3. Clinical applications of Genome Polymorphism Scans

    PubMed Central

    Weber, James L

    2006-01-01

    Applications of Genome Polymorphism Scans range from the relatively simple such as gender determination and confirmation of biological relationships, to the relatively complex such as determination of autozygosity and propagation of genetic information throughout pedigrees. Unlike nearly all other clinical DNA tests, the Scan is a universal test – it covers all people and all genes. In balance, I argue that the Genome Polymorphism Scan is the most powerful, affordable clinical DNA test available today. Reviewers: This article was reviewed by Scott Weiss (nominated by Neil Smalheiser), Roberta Pagon (nominated by Jerzy Jurka) and Val Sheffield (nominated by Neil Smalheiser). PMID:16756678

  4. Genome-wide association and genomic selection in animal breeding.

    PubMed

    Hayes, Ben; Goddard, Mike

    2010-11-01

    Results from genome-wide association studies in livestock, and humans, has lead to the conclusion that the effect of individual quantitative trait loci (QTL) on complex traits, such as yield, are likely to be small; therefore, a large number of QTL are necessary to explain genetic variation in these traits. Given this genetic architecture, gains from marker-assisted selection (MAS) programs using only a small number of DNA markers to trace a limited number of QTL is likely to be small. This has lead to the development of alternative technology for using the available dense single nucleotide polymorphism (SNP) information, called genomic selection. Genomic selection uses a genome-wide panel of dense markers so that all QTL are likely to be in linkage disequilibrium with at least one SNP. The genomic breeding values are predicted to be the sum of the effect of these SNPs across the entire genome. In dairy cattle breeding, the accuracy of genomic estimated breeding values (GEBV) that can be achieved and the fact that these are available early in life have lead to rapid adoption of the technology. Here, we discuss the design of experiments necessary to achieve accurate prediction of GEBV in future generations in terms of the number of markers necessary and the size of the reference population where marker effects are estimated. We also present a simple method for implementing genomic selection using a genomic relationship matrix. Future challenges discussed include using whole genome sequence data to improve the accuracy of genomic selection and management of inbreeding through genomic relationships.

  5. Detection of quantitative trait loci affecting caffeine metabolism by interval mapping in a genome-wide scan of C3H/HeJ x APN F(2) mice.

    PubMed

    Casley, W L; Menzies, J A; Whitehouse, L W; Moon, T W

    1999-12-01

    Caffeine metabolite ratios have been widely used to measure cytochrome P-450 1A2 activity in humans. Serum paraxanthine/caffeine ratio is one such index of this activity. We had previously demonstrated genetic variation of this trait among inbred mouse strains. In the present study, we have undertaken a genome-wide scan for quantitative trait loci affecting this trait with an interval mapping approach on an F(2) intercross population of acetaminophen nonsusceptible and C3H/HeJ inbred mice. A statistically significant association (log-likelihood ratio = 25.0) between a locus on chromosome 9, which colocalized with the murine Cyp1a2 locus, and the plasma paraxanthine/caffeine ratio was identified. This result suggested the presence of an expression polymorphism affecting this gene. A second locus was identified on chromosome 1 (log-likelihood ratio = 9.7) for which no obvious candidate gene has been identified. The influence of this locus on the paraxanthine/caffeine index was more significant among males (log-likelihood ratio = 6.3) than females (log-likelihood ratio = 3.6). A third locus was identified on chromosome 4 with a less statistically robust association (log-likelihood ratio = 3.4) to the paraxanthine/caffeine phenotype. Collectively, these three loci accounted for 63.2% of the variation observed in the F(2) population for this phenotype. These results demonstrate the potential for genetic variation arising from factors other than CYP1A2 activity to influence the plasma paraxanthine/caffeine ratio in mice. This study demonstrates the utility of quantitative genetics in the analysis of polygenic drug metabolism.

  6. A meta-analysis of genome-wide association scans identifies IL18RAP, PTPN2, TAGAP, and PUS10 as shared risk loci for Crohn's disease and celiac disease.

    PubMed

    Festen, Eleonora A M; Goyette, Philippe; Green, Todd; Boucher, Gabrielle; Beauchamp, Claudine; Trynka, Gosia; Dubois, Patrick C; Lagacé, Caroline; Stokkers, Pieter C F; Hommes, Daan W; Barisani, Donatella; Palmieri, Orazio; Annese, Vito; van Heel, David A; Weersma, Rinse K; Daly, Mark J; Wijmenga, Cisca; Rioux, John D

    2011-01-27

    Crohn's disease (CD) and celiac disease (CelD) are chronic intestinal inflammatory diseases, involving genetic and environmental factors in their pathogenesis. The two diseases can co-occur within families, and studies suggest that CelD patients have a higher risk to develop CD than the general population. These observations suggest that CD and CelD may share common genetic risk loci. Two such shared loci, IL18RAP and PTPN2, have already been identified independently in these two diseases. The aim of our study was to explicitly identify shared risk loci for these diseases by combining results from genome-wide association study (GWAS) datasets of CD and CelD. Specifically, GWAS results from CelD (768 cases, 1,422 controls) and CD (3,230 cases, 4,829 controls) were combined in a meta-analysis. Nine independent regions had nominal association p-value <1.0 x 10⁻⁵ in this meta-analysis and showed evidence of association to the individual diseases in the original scans (p-value < 1 x 10⁻² in CelD and < 1 x 10⁻³ in CD). These include the two previously reported shared loci, IL18RAP and PTPN2, with p-values of 3.37 x 10⁻⁸ and 6.39 x 10⁻⁹, respectively, in the meta-analysis. The other seven had not been reported as shared loci and thus were tested in additional CelD (3,149 cases and 4,714 controls) and CD (1,835 cases and 1,669 controls) cohorts. Two of these loci, TAGAP and PUS10, showed significant evidence of replication (Bonferroni corrected p-values <0.0071) in the combined CelD and CD replication cohorts and were firmly established as shared risk loci of genome-wide significance, with overall combined p-values of 1.55 x 10⁻¹⁰ and 1.38 x 10⁻¹¹ respectively. Through a meta-analysis of GWAS data from CD and CelD, we have identified four shared risk loci: PTPN2, IL18RAP, TAGAP, and PUS10. The combined analysis of the two datasets provided the power, lacking in the individual GWAS for single diseases, to detect shared loci with a relatively small

  7. Enhancer scanning to locate regulatory regions in genomic loci

    PubMed Central

    Buckley, Melissa; Gjyshi, Anxhela; Mendoza-Fandiño, Gustavo; Baskin, Rebekah; Carvalho, Renato S.; Carvalho, Marcelo A.; Woods, Nicholas T.; Monteiro, Alvaro N.A.

    2016-01-01

    The present protocol provides a rapid, streamlined and scalable strategy to systematically scan genomic regions for the presence of transcriptional regulatory regions active in a specific cell type. It creates genomic tiles spanning a region of interest that are subsequently cloned by recombination into a luciferase reporter vector containing the Simian Virus 40 promoter. Tiling clones are transfected into specific cell types to test for the presence of transcriptional regulatory regions. The protocol includes testing of different SNP (single nucleotide polymorphism) alleles to determine their effect on regulatory activity. This procedure provides a systematic framework to identify candidate functional SNPs within a locus during functional analysis of genome-wide association studies. This protocol adapts and combines previous well-established molecular biology methods to provide a streamlined strategy, based on automated primer design and recombinational cloning to rapidly go from a genomic locus to a set of candidate functional SNPs in eight weeks. PMID:26658467

  8. R for genome-wide association studies.

    PubMed

    Gondro, Cedric; Porto-Neto, Laercio R; Lee, Seung Hwan

    2013-01-01

    In recent years R has become de facto statistical programming language of choice for statisticians and it is also arguably the most widely used generic environment for analysis of high-throughput genomic data. In this chapter we discuss some approaches to improve performance of R when working with large SNP datasets.

  9. Genome-Wide Association Studies of Cancer

    PubMed Central

    Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth

    2010-01-01

    Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100

  10. Profiling genome-wide DNA methylation.

    PubMed

    Yong, Wai-Shin; Hsu, Fei-Man; Chen, Pao-Yang

    2016-01-01

    DNA methylation is an epigenetic modification that plays an important role in regulating gene expression and therefore a broad range of biological processes and diseases. DNA methylation is tissue-specific, dynamic, sequence-context-dependent and trans-generationally heritable, and these complex patterns of methylation highlight the significance of profiling DNA methylation to answer biological questions. In this review, we surveyed major methylation assays, along with comparisons and biological examples, to provide an overview of DNA methylation profiling techniques. The advances in microarray and sequencing technologies make genome-wide profiling possible at a single-nucleotide or even a single-cell resolution. These profiling approaches vary in many aspects, such as DNA input, resolution, genomic region coverage, and bioinformatics analysis, and selecting a feasible method requires knowledge of these methods. We first introduce the biological background of DNA methylation and its pattern in plants, animals and fungi. We present an overview of major experimental approaches to profiling genome-wide DNA methylation and hydroxymethylation and then extend to the single-cell methylome. To evaluate these methods, we outline their strengths and weaknesses and perform comparisons across the different platforms. Due to the increasing need to compute high-throughput epigenomic data, we interrogate the computational pipeline for bisulfite sequencing data and also discuss the concept of identifying differentially methylated regions (DMRs). This review summarizes the experimental and computational concepts for profiling genome-wide DNA methylation, followed by biological examples. Overall, this review provides researchers useful guidance for the selection of a profiling method suited to specific research questions.

  11. [Genome-wide associations for cigarette smoking behavior].

    PubMed

    Strauss, Ewa

    2013-01-01

    Diseases related to tobacco smoking are the second leading cause of death in the world. Despite increasing evidence of genetic determination, the susceptibility genes and loci underlying various aspects of smoking behavior are largely unknown. Genome-wide association studies (GWASs) provided a new conceptual framework in the search for variants underlying common traits/disorders. A massive scan of the genome and a "hypothesis-free" approach enable discovery of new aspects of genetics of complex traits. In this paper the results of GWASs and GWAS meta-analyzes of cigarette smoking behavior and nicotine dependence are reviewed with the particular attention to smoking cessation success and the replacement therapy. The results of these studies are discussed in the context of the results of the candidate gene association studies. Studies on the role of the genomic regions, identified in GWASs, in the development of smoking-related diseases are also discussed.

  12. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height.

  13. Genome-wide patterns of selection in 230 ancient Eurasians

    PubMed Central

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  14. Genome-Wide Approaches to Schizophrenia

    PubMed Central

    Duan, Jubao; Sanders, Alan R.; Gejman, Pablo V.

    2010-01-01

    Schizophrenia (SZ) is a common and severe psychiatric disorder with both environmental and genetic risk factors, and a high heritability. After over 20 years of molecular genetics research, new molecular strategies, primarily genome-wide association studies (GWAS), have generated major tangible progress. This new data provides evidence for: 1) A number of chromosomal regions with common polymorphisms showing genome-wide association with SZ (the major histocompatibility complex, MHC, region at 6p22-p21; 18q21.2; and 2q32.1). The associated alleles present small odds ratios (the odds of a risk variant being present in cases versus controls) and suggest causative involvement of gene regulatory mechanisms in SZ. 2) Polygenic inheritance. 3) Involvement of rare (<1%) and large (>100kb) copy number variants (CNVs). 4) A genetic overlap of SZ with autism and with bipolar disorder (BP) challenging the classical clinical classifications. Most new SZ findings (chromosomal regions and genes) have generated new biological leads. These new findings, however, still need to be translated into a better understanding of the underlying biology and into causal mechanisms. Furthermore, a considerable amount of heritability still remains unexplained (missing heritability). Deep resequencing for rare variants and system biology approaches (e.g., integrating DNA sequence and functional data) are expected to further improve our understanding of the genetic architecture of SZ and its underlying biology. PMID:20433910

  15. A genome wide dosage suppressor network reveals genomic robustness

    PubMed Central

    Patra, Biranchi; Kon, Yoshiko; Yadav, Gitanjali; Sevold, Anthony W.; Frumkin, Jesse P.; Vallabhajosyula, Ravishankar R.; Hintze, Arend; Østman, Bjørn; Schossau, Jory; Bhan, Ashish; Marzolf, Bruz; Tamashiro, Jenna K.; Kaur, Amardeep; Baliga, Nitin S.; Grayhack, Elizabeth J.; Adami, Christoph; Galas, David J.; Raval, Alpan; Phizicky, Eric M.; Ray, Animesh

    2017-01-01

    Genomic robustness is the extent to which an organism has evolved to withstand the effects of deleterious mutations. We explored the extent of genomic robustness in budding yeast by genome wide dosage suppressor analysis of 53 conditional lethal mutations in cell division cycle and RNA synthesis related genes, revealing 660 suppressor interactions of which 642 are novel. This collection has several distinctive features, including high co-occurrence of mutant-suppressor pairs within protein modules, highly correlated functions between the pairs and higher diversity of functions among the co-suppressors than previously observed. Dosage suppression of essential genes encoding RNA polymerase subunits and chromosome cohesion complex suggests a surprising degree of functional plasticity of macromolecular complexes, and the existence of numerous degenerate pathways for circumventing the effects of potentially lethal mutations. These results imply that organisms and cancer are likely able to exploit the genomic robustness properties, due the persistence of cryptic gene and pathway functions, to generate variation and adapt to selective pressures. PMID:27899637

  16. Genomic scan for genes predisposing to schizophrenia

    SciTech Connect

    Coon, H.; Jensen. S.; Holik, J.

    1994-03-15

    We initiated a genome-wide search for genes predisposing to schizophrenia by ascertaining 9 families, each containing three to five cases of schizophrenia. The 9 pedigrees were initially genotyped with 329 polymorphic DNA loci distributed throughout the genome. Assuming either autosomal dominant or recessive inheritance, 254 DNA loci yielded lod scores less than -2.0 at {theta} = 0.0, 101 DNA markers gave lod scores less than -2.0 at {theta} = 0.05, while 5 DNA loci produced maximum lod scores greater than 1: D4S35, D14S17, D15S1, D22S84, and D22S55. Of the DNA markers yielding lod scores greater than 1, D4S35 and D22S55 also were suggestive of linkage when the Affected-Pedigree-Member method was used. The families were then genotyped with four highly polymorphic simple sequence repeat markers; possible linkage diminished with DNA markers mapping nearby D4S35, while suggestive evidence of linkage remained with loci in the region of D22S55. Although follow-up investigation of these chromosomal regions may be warranted, our linkage results should be viewed as preliminary observations, as 35 unaffected persons are not past the age of risk. 90 refs., 3 tabs.

  17. Voxelwise genome-wide association study (vGWAS).

    PubMed

    Stein, Jason L; Hua, Xue; Lee, Suh; Ho, April J; Leow, Alex D; Toga, Arthur W; Saykin, Andrew J; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J; Craig, David W; Gerber, Jill D; Allen, April N; Corneveaux, Jason J; Dechairo, Bryan M; Potkin, Steven G; Weiner, Michael W; Thompson, Paul

    2010-11-15

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age+/-s.d.: 75.52+/-6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure.

  18. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  19. Wide field of view multifocal scanning microscopy with sparse sampling

    NASA Astrophysics Data System (ADS)

    Wang, Jie; Wu, Jigang

    2016-02-01

    We propose to use sparsely sampled line scans with a sparsity-based reconstruction method to obtain images in a wide field of view (WFOV) multifocal scanning microscope. In the WFOV microscope, we used a holographically generated irregular focus grid to scan the sample in one dimension and then reconstructed the sample image from line scans by measuring the transmission of the foci through the sample during scanning. The line scans were randomly spaced with average spacing larger than the Nyquist sampling requirement, and the image was recovered with sparsity-based reconstruction techniques. With this scheme, the acquisition data can be significantly reduced and the restriction for equally spaced foci positions can be removed, indicating simpler experimental requirement. We built a prototype system and demonstrated the effectiveness of the reconstruction by recovering microscopic images of a U.S. Air Force target and an onion skin cell microscope slide with 40, 60, and 80% missing data with respect to the Nyquist sampling requirement.

  20. Genome Wide Methylome Alterations in Lung Cancer.

    PubMed

    Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents.

  1. Genome Wide Methylome Alterations in Lung Cancer

    PubMed Central

    Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  2. Genome wide selection in Citrus breeding.

    PubMed

    Gois, I B; Borém, A; Cristofani-Yaly, M; de Resende, M D V; Azevedo, C F; Bastianel, M; Novelli, V M; Machado, M A

    2016-10-17

    Genome wide selection (GWS) is essential for the genetic improvement of perennial species such as Citrus because of its ability to increase gain per unit time and to enable the efficient selection of characteristics with low heritability. This study assessed GWS efficiency in a population of Citrus and compared it with selection based on phenotypic data. A total of 180 individual trees from a cross between Pera sweet orange (Citrus sinensis Osbeck) and Murcott tangor (Citrus sinensis Osbeck x Citrus reticulata Blanco) were evaluated for 10 characteristics related to fruit quality. The hybrids were genotyped using 5287 DArT_seq(TM) (diversity arrays technology) molecular markers and their effects on phenotypes were predicted using the random regression - best linear unbiased predictor (rr-BLUP) method. The predictive ability, prediction bias, and accuracy of GWS were estimated to verify its effectiveness for phenotype prediction. The proportion of genetic variance explained by the markers was also computed. The heritability of the traits, as determined by markers, was 16-28%. The predictive ability of these markers ranged from 0.53 to 0.64, and the regression coefficients between predicted and observed phenotypes were close to unity. Over 35% of the genetic variance was accounted for by the markers. Accuracy estimates with GWS were lower than those obtained by phenotypic analysis; however, GWS was superior in terms of genetic gain per unit time. Thus, GWS may be useful for Citrus breeding as it can predict phenotypes early and accurately, and reduce the length of the selection cycle. This study demonstrates the feasibility of genomic selection in Citrus.

  3. Geodetic Laser Scanning: Refractive Optics Offer Wide Variety of Scan Patterns

    NASA Astrophysics Data System (ADS)

    Carter, W. E.; Shrestha, R. L.; Slatton, C. K.; Shrestha, K. Y.; Cossio, T.

    2005-12-01

    Most commercial geodetic laser mapping instruments use reflective element scanners, often a single nutating or oscillating mirror, and sometimes dual axis units, to create a specific pattern of laser spots on the surface being mapped. The user may be able to set the scanning speed (scan lines per second) and field of coverage (range of scan angles), but the basic pattern of points sampled is fixed. Engineers developing scanners for a surprisingly diverse set of applications, ranging from bar code scanning, to compensating for image motion in astronomical telescopes, to scanning spectrometers, have increasingly turned to refractive scanners-most particularly to scanners that utilize "Risley prisms." Samuel Doty Risley (1845-1920), an ophthalmologist, invented an optometer that contained a pair of thin prisms that rotated in opposite directions about their optical axes to change the convergence of light rays from a single source. He used his optometer measure the visual acuity of patients eyes, as a function of distance. In this original application, both prisms were driven by a common gear assembly, which resulted in a nearly linear scan line. But if the prisms are driven independently in both direction and angular speed, a wide variety of scan patterns can be generated. The University of Florida is developing, a photon counting geodetic laser scanning instrument that will use a Risley prism scanner. The scanner, being built by Sigma Space Inc., will be capable of producing nearly linear scan lines (saw tooth pattern from moving platform), circular scans lines (helical pattern from a moving platform) and any number of rosette scan patterns that are particularly interesting for fixed ground based work. The flexibility provided by the scanner offers the possibility of using the same sensor for airborne and ground based geodetic laser scanning. Examples of the scanner patterns and the initial results from laboratory and early field tests will be presented.

  4. Identification of a Novel Risk Locus for Multiple Sclerosis at 13q31.3 by a Pooled Genome-Wide Scan of 500,000 Single Nucleotide Polymorphisms

    PubMed Central

    Camiña-Tato, Montse; Morcillo, Carlos; Lopez, Cristina; Navarro, Arcadi; Rio, Jordi; Montalban, Xavier; Martin, Roland

    2008-01-01

    Multiple sclerosis is a chronic inflammatory demyelinating disease of the central nervous system with an important genetic component and strongest association driven by the HLA genes. We performed a pooling-based genome-wide association study of 500,000 SNPs in order to find new loci associated with the disease. After applying several criteria, 320 SNPs were selected from the microarrays and individually genotyped in a first and independent Spanish Caucasian replication cohort. The 8 most significant SNPs validated in this cohort were also genotyped in a second US Caucasian replication cohort for confirmation. The most significant association was obtained for SNP rs3129934, which neighbors the HLA-DRB/DQA loci and validates our pooling-based strategy. The second strongest association signal was found for SNP rs1327328, which resides in an unannotated region of chromosome 13 but is in linkage disequilibrium with nearby functional elements that may play important roles in disease susceptibility. This region of chromosome 13 has not been previously identified in MS linkage genome screens and represents a novel risk locus for the disease. PMID:18941528

  5. Genome Scans for Transmission Ratio Distortion Regions in Mice

    PubMed Central

    Casellas, Joaquim; Gularte, Rodrigo J.; Farber, Charles R.; Varona, Luis; Mehrabian, Margarete; Schadt, Eric E.; Lusis, Aldon J.; Attie, Alan D.; Yandell, Brian S.; Medrano, Juan F.

    2012-01-01

    Transmission ratio distortion (TRD) is the departure from the expected genotypic frequencies under Mendelian inheritance. This departure can be due to multiple physiological mechanisms during gametogenesis, fertilization, fetal and embryonic development, and early neonatal life. Although a few TRD loci have been reported in mouse, inheritance patterns have never been evaluated for TRD. In this article, we developed a Bayesian binomial model accounting for additive and dominant deviation TRD mechanisms. Moreover, this model was used to perform genome-wide scans for TRD quantitative trait loci (QTL) on six F2 mouse crosses involving between 296 and 541 mice and between 72 and 1854 genetic markers. Statistical significance of each model was checked at each genetic marker with Bayes factors. Genome scans revealed overdominance TRD QTL located in mouse chromosomes 1, 2, 12, 13, and 14 and additive TRD QTL in mouse chromosomes 2, 3, and 15, although these results did not replicate across mouse crosses. This research contributes new statistical tools for the analysis of specific genetic patterns involved in TRD in F2 populations, our results suggesting a relevant incidence of TRD phenomena in mouse with important implications for both statistical analyses and biological research. PMID:22367040

  6. Genome-wide scan of bipolar disorder and investigation of population stratification effects on linkage: support for susceptibility loci at 4q21, 7q36, 9p21, 12q24, 14q24, and 16p13.

    PubMed

    Cassidy, F; Zhao, C; Badger, J; Claffey, E; Dobrin, S; Roche, S; McKeon, P

    2007-09-05

    Bipolar disorder (BPD) is a complex genetic disorder with cycling symptoms of depression and mania. Despite the extreme complexity of this psychiatric disorder, attempts to localize genes which confer vulnerability to the disorder have had some success. Chromosomal regions including 4p16, 12q24, 18p11, 18q22, and 21q21 have been repeatedly linked to BPD in different populations. Here we present the results of a whole genome scan for linkage to BPD in an Irish population. Our most significant result was at 14q24 which yielded a non-parametric LOD (NPL) score of 3.27 at the D14S588 marker with a nominal P-value of 0.0006 under a narrow (bipolar type I only) model of affection. We previously reported linkage to 14q22-24 in a subset of the families tested in this analysis. We also obtained suggestive evidence for linkage at 4q21, 9p21, 12q24, and 16p13, chromosomal regions that have all been previously linked to BPD. Additionally, we report on a novel approach to linkage analysis, STRUCTURE-Guided Linkage Analysis (SGLA), which is designed to reduce genetic heterogeneity and increase the power to detect linkage. Application of this technique resulted in more highly significant evidence for linkage of BPD to three regions including 16p13, a locus that has been repeatedly linked to numerous psychiatric disorders.

  7. Wide-Angle-Scanning Reflectarray Antennas Actuated by MEMS

    NASA Technical Reports Server (NTRS)

    Fang, Houfei; Huang, John; Thomson, Mark W.

    2009-01-01

    An effort to develop large-aperture, wide-angle-scanning reflectarray antennas for microwave radar and communication systems is underway. In an antenna of this type as envisioned, scanning of the radiated or incident microwave beam would be effected through mechanical rotation of the passive (reflective) patch antenna elements, using microelectromechanical systems (MEMS) stepping rotary actuators typified by piezoelectric micromotors. It is anticipated that the cost, mass, and complexity of such an antenna would be less than, and the reliability greater than, those of an electronically scanned phased-array antenna of comparable beam-scanning capability and angular resolution. In the design and operation of a reflectarray, one seeks to position and orient an array of passive patch elements in a geometric pattern such that, through constructive interference of the reflections from them, they collectively act as an efficient single reflector of radio waves within a desired frequency band. Typically, the patches lie in a common plane and radiation is incident upon them from a feed horn.

  8. Genome-wide linkage scans for type 2 diabetes mellitus in four ethnically diverse populations; significant evidence for linkage on chromosome 4q in African Americans: the Family Investigation of Nephropathy and Diabetes (FIND) Research Group

    PubMed Central

    Malhotra, Alka; Igo, Robert P.; Thameem, Farook; Kao, W.H. Linda; Abboud, Hanna E.; Adler, Sharon G.; Arar, Nedal H.; Bowden, Donald W.; Duggirala, Ravindranath; Freedman, Barry I.; Goddard, Katrina A.B.; Ipp, Eli; Iyengar, Sudha K.; Kimmel, Paul L.; Knowler, William C.; Kohn, Orly; Leehey, David; Meoni, Lucy A.; Nelson, Robert G.; Nicholas, Susanne B.; Parekh, Rulan S.; Rich, Stephen S.; Chen, Yii-Der I.; Saad, Mohammed F.; Scavini, Marina; Schelling, Jeffrey R.; Sedor, John R.; Shah, Vallabh O.; Taylor, Kent D.; Thornley-Brown, Denyse; Zager, Philip G.; Horvath, Amanda; Hanson, Robert L.

    2009-01-01

    Background Previous studies have shown that, in addition to environmental influences, type 2 diabetes mellitus (T2DM) has a strong genetic component. The goal of the current study is to identify regions of linkage for T2DM in ethnically diverse populations. Methods Phenotypic and genotypic data were obtained from African American (AA; total number of individuals (N)=1004), American Indian (AI; N=883), European American (EA; N=537), and Mexican American (MA; N=1634) individuals from the Family Investigation of Nephropathy and Diabetes. Nonparametric linkage analysis, using an average of 4,404 SNPs, was performed in relative pairs affected with T2DM in each ethnic group. In addition, family-based tests were performed to detect association with T2DM. Results Statistically significant evidence for linkage was observed on chromosomes 4q21.1 (LOD=3.13; genome-wide p=0.04) in AA. In addition, a total of eleven regions showed suggestive evidence for linkage (estimated at LOD>1.71), with the highest LOD scores on chromosomes 12q21.31 (LOD=2.02) and 22q12.3 (LOD=2.38) in AA, 2p11.1 (LOD=2.23) in AI, 6p12.3 (LOD=2.77) in EA, and 13q21.1 (LOD=2.24) in MA. While no region overlapped across all ethnic groups, at least five loci showing LOD>1.71 have been identified in previously published studies. Conclusions The results from this study provide evidence for the presence of genes affecting T2DM on chromosomes 4q, 12q, and 22q in AA, 6p in EA, 2p in AI, and 13q in MA. The strong evidence for linkage on chromosome 4q in AA provides important information given the paucity of diabetes genetic studies in this population. PMID:19795399

  9. Water-Immersible MEMS scanning mirror designed for wide-field fast-scanning photoacoustic microscopy

    NASA Astrophysics Data System (ADS)

    Yao, Junjie; Huang, Chih-Hsien; Martel, Catherine; Maslov, Konstantin I.; Wang, Lidai; Yang, Joon-Mo; Gao, Liang; Randolph, Gwendalyn; Zou, Jun; Wang, Lihong V.

    2013-03-01

    By offering images with high spatial resolution and unique optical absorption contrast, optical-resolution photoacoustic microscopy (OR-PAM) has gained increasing attention in biomedical research. Recent developments in OR-PAM have improved its imaging speed, but have sacrificed either the detection sensitivity or field of view or both. We have developed a wide-field fast-scanning OR-PAM by using a water-immersible MEMS scanning mirror (MEMS-ORPAM). Made of silicon with a gold coating, the MEMS mirror plate can reflect both optical and acoustic beams. Because it uses an electromagnetic driving force, the whole MEMS scanning system can be submerged in water. In MEMS-ORPAM, the optical and acoustic beams are confocally configured and simultaneously steered, which ensures uniform detection sensitivity. A B-scan imaging speed as high as 400 Hz can be achieved over a 3 mm scanning range. A diffraction-limited lateral resolution of 2.4 μm in water and a maximum imaging depth of 1.1 mm in soft tissue have been experimentally determined. Using the system, we imaged the flow dynamics of both red blood cells and carbon particles in a mouse ear in vivo. By using Evans blue dye as the contrast agent, we also imaged the flow dynamics of lymphatic vessels in a mouse tail in vivo. The results show that MEMS-OR-PAM could be a powerful tool for studying highly dynamic and time-sensitive biological phenomena.

  10. Genome-Wide Views of Chromatin Structure

    PubMed Central

    Rando, Oliver J.; Chang, Howard Y.

    2010-01-01

    Eukaryotic genomes are packaged into a nucleoprotein complex known as chromatin, which affects most processes that occur on DNA. Along with genetic and biochemical studies of resident chromatin proteins and their modifying enzymes, mapping of chromatin structure in vivo is one of the main pillars in our understanding of how chromatin relates to cellular processes. In this review, we discuss the use of genomic technologies to characterize chromatin structure in vivo, with a focus on data from budding yeast and humans. The picture emerging from these studies is the detailed chromatin structure of a typical gene, where the typical behavior gives insight into the mechanisms and deep rules that establish chromatin structure. Important deviation from the archetype is also observed, usually as a consequence of unique regulatory mechanisms at special genomic loci. Chromatin structure shows substantial conservation from yeast to humans, but mammalian chromatin has additional layers of complexity that likely relate to the requirements of multicellularity such as the need to establish faithful gene regulatory mechanisms for cell differentiation. PMID:19317649

  11. Gene Fusion: A Genome Wide Survey

    NASA Technical Reports Server (NTRS)

    Liang, Ping; Riley, Monica

    2001-01-01

    As a well known fact, organisms form larger and complex multimodular (composite or chimeric) and mostly multi-functional proteins through gene fusion of two or more individual genes which have independent evolution histories and functions. We call each of these components a module. The existence of multimodular proteins may improves the efficiency in gene regulation and in cellular functions, and thus may give the host organism advantages in adaptation to environments. Analysis of all gene fusions in present-day organisms should allow us to examine the patterns of gene fusion in context with cellular functions, to trace back the evolution processes from the ancient smaller and uni-functional proteins to the present-day larger and complex multi-functional proteins, and to estimate the minimal number of ancestor proteins that existed in the last common ancestor for all life on earth. Although many multimodular proteins have been experimentally known, identification of gene fusion events systematically at genome scale had not been possible until recently when large number of completed genome sequences have been becoming available. In addition, technical difficulties for such analysis also exist due to the complexity of this biological and evolutionary process. We report from this study a new strategy to computationally identify multimodular proteins using completed genome sequences and the results surveyed from 22 organisms with the data from over 40 organisms to be presented during the meeting. Additional information is contained in the original extended abstract.

  12. Genome-wide association study of atypical psychosis.

    PubMed

    Kanazawa, Tetsufumi; Ikeda, Masashi; Glatt, Stephen J; Tsutsumi, Atsushi; Kikuyama, Hiroki; Kawamura, Yoshiya; Nishida, Nao; Miyagawa, Taku; Hashimoto, Ryota; Takeda, Masatoshi; Sasaki, Tsukasa; Tokunaga, Katsushi; Koh, Jun; Iwata, Nakao; Yoneda, Hiroshi

    2013-10-01

    Atypical psychosis with a periodic course of exacerbation and features of major psychiatric disorders [schizophrenia (SZ) and bipolar disorder (BD)] has a long history in clinical psychiatry in Japan. Based upon the new criteria of atypical psychosis, a Genome-Wide Association Study (GWAS) was conducted to identify the risk gene or variants. The relationships between atypical psychosis, SZ and BD were then assessed using independent GWAS data. Forty-seven patients with solid criteria of atypical psychosis and 882 normal controls (NCs) were scanned using an Affymetrics 6.0 chip. GWAS SZ data (560 SZ cases and 548 NCs) and GWAS BD (107 cases with BD type 1 and 107 NCs) were compared using gene-based analysis. The most significant SNPs were detected around the CHN2/CPVL genes (rs245914, P = 1.6 × 10(-7)) , COL21A1 gene (rs12196860, P = 2.45 × 10(-7) ), and PYGL/TRIM9 genes (rs1959536, P = 7.73 × 10(-7) ), although none of the single-nucleotide polymorphisms exhibited genome-wide significance (P = 5 × 10(-8) ). One of the highest peaks was detected on the major histocompatibility complex region, where large SZ GWASs have previously disclosed an association. The gene-based analysis suggested significant enrichment between SZ and atypical psychosis (P = 0.01), but not BD. This study provides clues about the types of patient whose diagnosis lies between SZ and BD. Studies with larger samples are required to determine the causal variant.

  13. Ultrahigh vacuum scanning electron microscope system combined with wide-movable scanning tunneling microscope

    SciTech Connect

    Kaneko, A.; Homma, Y.; Hibino, H.; Ogino, T.

    2005-08-15

    A surface analysis system has been newly developed with combination of ultrahigh vacuum scanning electron microscope (SEM) and wide-movable scanning tunneling microscope (STM). The basic performance is experimentally demonstrated. These SEM and STM images are clear enough to obtain details of surface structures. The STM unit moves horizontally over several millimeters by sliding motion of PZT actuators. The motion resolution is proved to be submicrometers. The STM tip mounted on another PZT scanner can be guided to a specific object on the sample surface during SEM observation. In the observation of a Si(111) surface rapidly cooled from high temperature, the STM tip was accurately guided to an isolated atomic step and slightly moved along it during SEM observation. The STM observation shows an asymmetry of the (7x7)-transformed region along the step between the upper and lower terraces. (7x7) bands continuously formed along the edge of terraces, while (7x7) domains distributed on the terraces slightly far from the step. These experiments show the wide-movable STM unit resolves a gap of observation area between SEM and STM and the system enables a specific object found in the SEM image to be observed easily by STM.

  14. Analysis of Heritability Using Genome-Wide Data.

    PubMed

    Hall, Jacob B; Bush, William S

    2016-10-11

    Most analyses of genome-wide association data consider each variant independently without considering or adjusting for the genetic background present in the rest of the genome. New approaches to genome analysis use representations of genomic sharing to better account for confounding factors like population stratification or to directly approximate heritability through the estimated sharing of individuals in a dataset. These approaches use mixed linear models, which relate genotypic sharing to phenotypic sharing, and rely on the efficient computation of genetic sharing among individuals in a dataset. This unit describes the principles and practical application of mixed models for the analysis of genome-wide association study data. © 2016 by John Wiley & Sons, Inc.

  15. A Genome Scan for Positive Selection in Thoroughbred Horses

    PubMed Central

    Gu, Jingjing; Orr, Nick; Park, Stephen D.; Katz, Lisa M.; Sulimova, Galina; MacHugh, David E.; Hill, Emmeline W.

    2009-01-01

    Thoroughbred horses have been selected for exceptional racing performance resulting in system-wide structural and functional adaptations contributing to elite athletic phenotypes. Because selection has been recent and intense in a closed population that stems from a small number of founder animals Thoroughbreds represent a unique population within which to identify genomic contributions to exercise-related traits. Employing a population genetics-based hitchhiking mapping approach we performed a genome scan using 394 autosomal and X chromosome microsatellite loci and identified positively selected loci in the extreme tail-ends of the empirical distributions for (1) deviations from expected heterozygosity (Ewens-Watterson test) in Thoroughbred (n = 112) and (2) global differentiation among four geographically diverse horse populations (FST). We found positively selected genomic regions in Thoroughbred enriched for phosphoinositide-mediated signalling (3.2-fold enrichment; P<0.01), insulin receptor signalling (5.0-fold enrichment; P<0.01) and lipid transport (2.2-fold enrichment; P<0.05) genes. We found a significant overrepresentation of sarcoglycan complex (11.1-fold enrichment; P<0.05) and focal adhesion pathway (1.9-fold enrichment; P<0.01) genes highlighting the role for muscle strength and integrity in the Thoroughbred athletic phenotype. We report for the first time candidate athletic-performance genes within regions targeted by selection in Thoroughbred horses that are principally responsible for fatty acid oxidation, increased insulin sensitivity and muscle strength: ACSS1 (acyl-CoA synthetase short-chain family member 1), ACTA1 (actin, alpha 1, skeletal muscle), ACTN2 (actinin, alpha 2), ADHFE1 (alcohol dehydrogenase, iron containing, 1), MTFR1 (mitochondrial fission regulator 1), PDK4 (pyruvate dehydrogenase kinase, isozyme 4) and TNC (tenascin C). Understanding the genetic basis for exercise adaptation will be crucial for the identification of genes

  16. Genome-wide approaches (GWA) in oral and craniofacial diseases research

    PubMed Central

    Kim, H; Gordon, S; Dionne, R

    2012-01-01

    Underlying molecular genetic mechanisms of diseases can be deciphered with unbiased strategies using recently developed technologies enabling genome-wide scale investigations. These technologies have been applied in scanning for genetic variations, gene expression profiles, and epigenetic changes for oral and craniofacial diseases. However, these approaches as applied to oral and craniofacial conditions are in the initial stages, and challenges remain to be overcome, including analysis of high throughput data and their interpretation. Here, we review methodology and studies using genome-wide approaches in oral and craniofacial diseases and suggest future directions. PMID:22913301

  17. A novel statistic for genome-wide interaction analysis.

    PubMed

    Wu, Xuesen; Dong, Hua; Luo, Li; Zhu, Yun; Peng, Gang; Reveille, John D; Xiong, Momiao

    2010-09-23

    Although great progress in genome-wide association studies (GWAS) has been made, the significant SNP associations identified by GWAS account for only a few percent of the genetic variance, leading many to question where and how we can find the missing heritability. There is increasing interest in genome-wide interaction analysis as a possible source of finding heritability unexplained by current GWAS. However, the existing statistics for testing interaction have low power for genome-wide interaction analysis. To meet challenges raised by genome-wide interactional analysis, we have developed a novel statistic for testing interaction between two loci (either linked or unlinked). The null distribution and the type I error rates of the new statistic for testing interaction are validated using simulations. Extensive power studies show that the developed statistic has much higher power to detect interaction than classical logistic regression. The results identified 44 and 211 pairs of SNPs showing significant evidence of interactions with FDR<0.001 and 0.001genome-wide interaction analysis is a valuable tool for finding remaining missing heritability unexplained by the current GWAS, and the developed novel statistic is able to search significant interaction between SNPs across the genome. Real data analysis showed that the results of genome-wide interaction analysis can be replicated in two independent studies.

  18. Accounting for Linkage Disequilibrium in genome scans for selection without individual genotypes: the local score approach.

    PubMed

    Fariello, María Inés; Boitard, Simon; Mercier, Sabine; Robelin, David; Faraut, Thomas; Arnould, Cécile; Recoquillay, Julien; Bouchez, Olivier; Salin, Gérald; Dehais, Patrice; Gourichon, David; Leroux, Sophie; Pitel, Frédérique; Leterrier, Christine; SanCristobal, Magali

    2017-04-10

    Detecting genomic footprints of selection is an important step in the understanding of evolution. Accounting for linkage disequilibrium in genome scans increases detection power, but haplotype-based methods require individual genotypes and are not applicable on pool-sequenced samples. We propose to take advantage of the local score approach to account for linkage disequilibrium in genome scans for selection, cumulating (possibly small) signals from single markers over a genomic segment, to clearly pinpoint a selection signal. Using computer simulations, we demonstrate that this approach detects selection with higher power than several state-of-the-art single marker, windowing or haplotype-based approaches. We illustrate this on two benchmark data sets including individual genotypes, for which we obtain similar results with the local score and one haplotype-based approach. Finally, we apply the local score approach to Pool-Seq data obtained from a divergent selection experiment on behavior in quail, and obtain precise and biologically coherent selection signals: while competing methods fail to highlight any clear selection signature, our method detects several regions involving genes known to act on social responsiveness or autistic traits. Although we focus here on the detection of positive selection from multiple population data, the local score approach is general and can be applied to other genome scans for selection or other genome-wide analyses such as GWAS. This article is protected by copyright. All rights reserved.

  19. Genome-Wide Association Studies and Liver Disease

    PubMed Central

    Speliotes, Elizabeth K.

    2016-01-01

    Sequencing of the human genome has opened up many opportunities to learn about our own genetic susceptibilities to disease. In this Foreword to this issue of Seminars in Liver Disease, I provide some required background to understanding genome-wide association analyses in general, including a list of terms (Table 1) often used in such studies. Five areas of particular significance are then reviewed in detail in the articles that follow. PMID:26676811

  20. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    PubMed

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  1. Genome-Wide Linkage Scan of Bipolar Disorder in a Colombian Population Isolate Replicates Loci on Chromosomes 7p21–22, 1p31, 16p12 and 21q21–22 and Identifies a Novel Locus on Chromosome 12q

    PubMed Central

    Kremeyer, B.; García, J.; Müller, H.; Burley, M.W.; Herzberg, I.; Parra, M.V.; Duque, C.; Vega, J.; Montoya, P.; López, M.C.; Bedoya, G.; Reus, V.; Palacio, C.; López, C.; Ospina-Duque, J.; Freimer, N.B.; Ruiz-Linares, A.

    2011-01-01

    Background/Aims: Bipolar disorder (BP) is a severe psychiatric illness, characterised by alternating episodes of depression and mania, which ranks among the top ten causes of morbidity and life-long disability world-wide. We have previously performed a whole-genome linkage scan on 6 pedigrees segregating severe BP from the well-characterised population isolate of Antioquia, Colombia. We recently collected genotypes for the same set of 382 autosomal microsatellite markers in 9 additional Antioquian BP pedigrees. Here, we report the analysis of the combined pedigree set. Methods: Linkage analysis using both parametric and nonparametric approaches was conducted for 3 different diagnostic models: severe BP only (BPI); mood disorders (BPI, BPII and major depression); and psychosis (operationally defined by the occurrence of at least 1 episode of hallucinations and/or delusions). Results and Conclusion: For BPI only, the most interesting result was obtained for chromosome 7p21.1–p22.2 under a recessive model of inheritance (heterogeneity LOD score = 2.80), a region that had previously been linked to BP in a study on Portuguese Island families. For both BPI and mood disorders, nonparametric analyses identified a locus on chromosome 12ct–q14 (nonparametric linkage = 2.55 and 2.35, respectively). This locus has not previously been reported as a candidate region for BP. Additional candidate regions were found on chromosomes 1p22–31 (mood disorders) and 21q21–22 (BPI), 2 loci that have repeatedly been implicated in BP susceptibility. Linkage analysis of psychosis as a phenotype identified candidate regions on chromosomes 2q24–31 and 16p12–q12. The finding on chromosome 16p is noteworthy because the same locus has been implicated by genome-wide association analyses of BP. PMID:21071953

  2. Microfluidics for genome-wide studies involving next generation sequencing

    PubMed Central

    Murphy, Travis W.; Lu, Chang

    2017-01-01

    Next-generation sequencing (NGS) has revolutionized how molecular biology studies are conducted. Its decreasing cost and increasing throughput permit profiling of genomic, transcriptomic, and epigenomic features for a wide range of applications. Microfluidics has been proven to be highly complementary to NGS technology with its unique capabilities for handling small volumes of samples and providing platforms for automation, integration, and multiplexing. In this article, we review recent progress on applying microfluidics to facilitate genome-wide studies. We emphasize on several technical aspects of NGS and how they benefit from coupling with microfluidic technology. We also summarize recent efforts on developing microfluidic technology for genomic, transcriptomic, and epigenomic studies, with emphasis on single cell analysis. We envision rapid growth in these directions, driven by the needs for testing scarce primary cell samples from patients in the context of precision medicine.

  3. Genome-Wide Approaches to Drosophila Heart Development

    PubMed Central

    Frasch, Manfred

    2016-01-01

    The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102

  4. Genome-wide association studies in maize: praise and stargaze

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association study (GWAS) has appeared as a widespread strategy in decoding genotype-phenotype associations in many species thanks to technical advances in next-generation sequencing (NGS) applications. Maize is an ideal crop for GWAS and significant progress has been made in the last dec...

  5. A super powerful method for genome wide association study

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...

  6. Genome-wide association study identifies five new schizophrenia loci

    PubMed Central

    2012-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10−11) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10−9), ANK3 (rs10994359, P = 2.5 × 10−8) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10−9). PMID:21926974

  7. Genome-wide association mapping of soybean aphid resistance traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...

  8. A Genome-wide Breast Cancer Scan in African Americans

    DTIC Science & Technology

    2012-06-01

    Harris TB, Henderson BE, Hernandez DG, Hitsman B, Hu JJ, Hunt SC, Ingles SA, John EM, Kittles R, Kolb S, Kolonel LN, Le Marchand L, Liu Y, Lohman...Chanock SJ, Demerath E, Deming SL, Diver WR, Fox C, Harris TB, Hernandez DG, Hu JJ, Ingles SA, John EM, Johnson C, Keating B, Kittles RA, Kolonel...Carolyn M. Hutter • Alicia Young • Charles Kooperberg • Ulrike Peters • Suhn K. Rhie • Peggy Wan • Xin Sheng • Loreall C. Pooler • David J. Van Den

  9. A Genome-Wide Breast Cancer Scan in African Americans

    DTIC Science & Technology

    2011-06-01

    Boggess, J.F., Basil , J., Blank, S.V., Friedman, E., Kaufman, B., Laitman, Y., Milgrom, R., Andrulis, I.L., Glendon, G., Ozcelik, H., Kirchhoff, T... Moore , L.E., Prokhortchouk, E., Wu, X., Kiemeney, L.A., Gaborieau, V., Jacobs, K.B., Chow, W.H., Zaridze, D., Matveev, V., Lubinski, J., Trubicka

  10. Genome-wide identification of hypoxia-induced enhancer regions

    PubMed Central

    Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.

    2015-01-01

    Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262

  11. Validating, augmenting and refining genome-wide association signals.

    PubMed

    Ioannidis, John P A; Thomas, Gilles; Daly, Mark J

    2009-05-01

    Studies using genome-wide platforms have yielded an unprecedented number of promising signals of association between genomic variants and human traits. This Review addresses the steps required to validate, augment and refine such signals to identify underlying causal variants for well-defined phenotypes. These steps include: large-scale exact replication across both similar and diverse populations; fine mapping and resequencing; determination of the most informative markers and multiple independent informative loci; incorporation of functional information; and improved phenotype mapping of the implicated genetic effects. Even in cases for which replication proves that an effect exists, confident localization of the causal variant often remains elusive.

  12. Genome-Wide Estimates of Heritability for Social Demographic Outcomes

    PubMed Central

    Domingue, Benjamin W.; Wedow, Robbee; Conley, Dalton; McQueen, Matt; Hoffmann, Thomas J.; Boardman, Jason D.

    2016-01-01

    An increasing number of studies that are widely used in the demographic research community have collected genome-wide data from their respondents. It is therefore important that demographers have a proper understanding of some of the methodological tools needed to analyze such data. Our paper details the underlying methodology behind one of the most common techniques for analyzing genome-wide data, Genome-Wide Complex Trait Analysis (GCTA). GCTA models provide heritability estimates for health, health behaviors, or indicators of attainment using data from unrelated persons.. Our goal is to describe this model, to highlight the utility of the model for biodemographic research, and to demonstrate the performance of this approach under modifications of the underlying assumptions. The first set of modifications involves changing the nature of the genetic data used to compute genetic similarities between individuals (the genetic relationship matrix). We then explore the sensitivity of the model to heteroscedastic errors. In general, GCTA estimates are robust to the modifications proposed here but we also highlight potential limitations of GCTA estimates. PMID:27050030

  13. High-resolution genome-wide mapping of histone modifications.

    PubMed

    Roh, Tae-young; Ngau, Wing Chi; Cui, Kairong; Landsman, David; Zhao, Keji

    2004-08-01

    The expression patterns of eukaryotic genomes are controlled by their chromatin structure, consisting of nucleosome subunits in which DNA of approximately 146 bp is wrapped around a core of 8 histone molecules. Post-translational histone modifications play an essential role in modifying chromatin structure. Here we apply a combination of SAGE and chromatin immunoprecipitation (ChIP) protocols to determine the distribution of hyperacetylated histones H3 and H4 in the Saccharomyces cerevisiae genome. We call this approach genome-wide mapping technique (GMAT). Using GMAT, we find that the highest acetylation levels are detected in the 5' end of a gene's coding region, but not in the promoter. Furthermore, we show that the histone acetyltransferase, GCN5p, regulates H3 acetylation in the promoter and 5' end of the coding regions. These findings indicate that GMAT should find valuable applications in mapping target sites of chromatin-modifying enzymes.

  14. Genome-Wide Association Studies with a Genomic Relationship Matrix: A Case Study with Wheat and Arabidopsis.

    PubMed

    Gianola, Daniel; Fariello, Maria I; Naya, Hugo; Schön, Chris-Carolin

    2016-10-13

    Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals ( G: ) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G,: provided variance components are unaffected by exclusion of such marker(s) from G: The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G: does matter. Removal of eigenvectors from G: can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions.

  15. Genome-Wide Association Studies with a Genomic Relationship Matrix: A Case Study with Wheat and Arabidopsis

    PubMed Central

    Gianola, Daniel; Fariello, Maria I.; Naya, Hugo; Schön, Chris-Carolin

    2016-01-01

    Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals (G) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G, provided variance components are unaffected by exclusion of such marker(s) from G. The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G does matter. Removal of eigenvectors from G can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions. PMID:27520956

  16. Genome-Wide Binding Patterns of Thyroid Hormone Receptor Beta

    PubMed Central

    Ayers, Stephen; Switnicki, Michal Piotr; Angajala, Anusha; Lammel, Jan; Arumanayagam, Anithachristy S.; Webb, Paul

    2014-01-01

    Thyroid hormone (TH) receptors (TRs) play central roles in metabolism and are major targets for pharmaceutical intervention. Presently, however, there is limited information about genome wide localizations of TR binding sites. Thus, complexities of TR genomic distribution and links between TRβ binding events and gene regulation are not fully appreciated. Here, we employ a BioChIP approach to capture TR genome-wide binding events in a liver cell line (HepG2). Like other NRs, TRβ appears widely distributed throughout the genome. Nevertheless, there is striking enrichment of TRβ binding sites immediately 5′ and 3′ of transcribed genes and TRβ can be detected near 50% of T3 induced genes. In contrast, no significant enrichment of TRβ is seen at negatively regulated genes or genes that respond to unliganded TRs in this system. Canonical TRE half-sites are present in more than 90% of TRβ peaks and classical TREs are also greatly enriched, but individual TRE organization appears highly variable with diverse half-site orientation and spacing. There is also significant enrichment of binding sites for TR associated transcription factors, including AP-1 and CTCF, near TR peaks. We conclude that T3-dependent gene induction commonly involves proximal TRβ binding events but that far-distant binding events are needed for T3 induction of some genes and that distinct, indirect, mechanisms are often at play in negative regulation and unliganded TR actions. Better understanding of genomic context of TR binding sites will help us determine why TR regulates genes in different ways and determine possibilities for selective modulation of TR action. PMID:24558356

  17. Genome-wide mapping of DNA strand breaks.

    PubMed

    Leduc, Frédéric; Faucher, David; Bikond Nkoma, Geneviève; Grégoire, Marie-Chantal; Arguin, Mélina; Wellinger, Raymund J; Boissonneault, Guylain

    2011-02-25

    Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP), uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL) to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.

  18. Genome-wide functional analysis in Candida albicans.

    PubMed

    Motaung, Thabiso E; Ells, Ruan; Pohl, Carolina H; Albertyn, Jacobus; Tsilo, Toi J

    2017-02-08

    Candida albicans is an important etiological agent of superficial and life-threatening infections in individuals with compromised immune systems. To date, we know of several overlapping genetic networks that govern virulence attributes in this fungal pathogen. Classical use of deletion mutants has led to the discovery of numerous virulence factors over the years, and genome-wide functional analysis has propelled gene discovery at an even faster pace. Indeed, a number of recent studies using large-scale genetic screens followed by genome-wide functional analysis has allowed for the unbiased discovery of many new genes involved in C. albicans biology. Here we share our perspectives on the role of these studies in analyzing fundamental aspects of C. albicans virulence properties.

  19. Genome-wide association studies of obesity and metabolic syndrome.

    PubMed

    Fall, Tove; Ingelsson, Erik

    2014-01-25

    Until just a few years ago, the genetic determinants of obesity and metabolic syndrome were largely unknown, with the exception of a few forms of monogenic extreme obesity. Since genome-wide association studies (GWAS) became available, large advances have been made. The first single nucleotide polymorphism robustly associated with increased body mass index (BMI) was in 2007 mapped to a gene with for the time unknown function. This gene, now known as fat mass and obesity associated (FTO) has been repeatedly replicated in several ethnicities and is affecting obesity by regulating appetite. Since the first report from a GWAS of obesity, an increasing number of markers have been shown to be associated with BMI, other measures of obesity or fat distribution and metabolic syndrome. This systematic review of obesity GWAS will summarize genome-wide significant findings for obesity and metabolic syndrome and briefly give a few suggestions of what is to be expected in the next few years.

  20. Genome-wide association study of paliperidone efficacy

    PubMed Central

    Wineinger, Nathan E.; Fu, Dong-Jing; Libiger, Ondrej; Alphs, Larry; Savitz, Adam; Gopal, Srihari; Cohen, Nadine; Schork, Nicholas J.

    2017-01-01

    Objective Clinical response to the atypical antipsychotic paliperidone is known to vary among schizophrenic patients. We carried out a genome-wide association study to identify common genetic variants predictive of paliperidone efficacy. Methods We leveraged a collection of 1390 samples from individuals of European ancestry enrolled in 12 clinical studies investigating the efficacy of the extended-release tablet paliperidone ER (n1=490) and the once-monthly injection paliperidone palmitate (n2=550 and n3=350). We carried out a genome-wide association study using a general linear model (GLM) analysis on three separate cohorts, followed by meta-analysis and using a mixed linear model analysis on all samples. The variations in response explained by each single nucleotide polymorphism (h2SNP) were estimated. Results No SNP passed genome-wide significance in the GLM-based analyses with suggestive signals from rs56240334 [P=7.97×10−8 for change in the Clinical Global Impression Scale-Severity (CGI-S); P=8.72×10−7 for change in the total Positive and Negative Syndrome Scale (PANSS)] in the intron of ADCK1. The mixed linear model-based association P-values for rs56240334 were consistent with the results from GLM-based analyses and the association with change in CGI-S (P=4.26×10−8) reached genome-wide significance (i.e. P<5×10−8). We also found suggestive evidence for a polygenic contribution toward paliperidone treatment response with estimates of heritability, h2SNP, ranging from 0.31 to 0.43 for change in the total PANSS score, the PANSS positive Marder factor score, and CGI-S. Conclusion Genetic variations in the ADCK1 gene may differentially predict paliperidone efficacy in schizophrenic patients. However, this finding should be replicated in additional samples. PMID:27846195

  1. Genome-Wide Profiling of Alternative Translation Initiation Sites.

    PubMed

    Gao, Xiangwei; Wan, Ji; Qian, Shu-Bing

    2016-01-01

    Regulation of translation initiation is a central control point in protein synthesis. Variations of start codon selection contribute to protein diversity and complexity. Systemic mapping of start codon positions and precise measurement of the corresponding initiation rate would transform our understanding of translational control. Here we describe a ribosome profiling approach that enables identification of translation initiation sites on a genome-wide scale. By capturing initiating ribosomes using lactimidomycin, this approach permits qualitative and quantitative analysis of alternative translation initiation.

  2. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use.

  3. Genome-Wide Association Study of Polymorphisms Predisposing to Bronchiolitis

    PubMed Central

    Pasanen, Anu; Karjalainen, Minna K.; Bont, Louis; Piippo-Savolainen, Eija; Ruotsalainen, Marja; Goksör, Emma; Kumawat, Kuldeep; Hodemaekers, Hennie; Nuolivirta, Kirsi; Jartti, Tuomas; Wennergren, Göran; Hallman, Mikko; Rämet, Mika; Korppi, Matti

    2017-01-01

    Bronchiolitis is a major cause of hospitalization among infants. Severe bronchiolitis is associated with later asthma, suggesting a common genetic predisposition. Genetic background of bronchiolitis is not well characterized. To identify polymorphisms associated with bronchiolitis, we conducted a genome-wide association study (GWAS) in which 5,300,000 single nucleotide polymorphisms (SNPs) were tested for association in a Finnish–Swedish population of 217 children hospitalized for bronchiolitis and 778 controls. The most promising SNPs (n = 77) were genotyped in a Dutch replication population of 416 cases and 432 controls. Finally, we used a set of 202 Finnish bronchiolitis cases to further investigate candidate SNPs. We did not detect genome-wide significant associations, but several suggestive association signals (p < 10−5) were observed in the GWAS. In the replication population, three SNPs were nominally associated (p < 0.05). Of them, rs269094 was an expression quantitative trait locus (eQTL) for KCND3, previously shown to be associated with occupational asthma. In the additional set of Finnish cases, the association for another SNP (rs9591920) within a noncoding RNA locus was further strengthened. Our results provide a first genome-wide examination of the genetics underlying bronchiolitis. These preliminary findings require further validation in a larger sample size. PMID:28139761

  4. A powerful test of independent assortment that determines genome-wide significance quickly and accurately

    PubMed Central

    Stewart, W C L; Hager, V R

    2016-01-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD—a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance—the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  5. Genome-wide metabolic (re-) annotation of Kluyveromyces lactis

    PubMed Central

    2012-01-01

    Background Even before having its genome sequence published in 2004, Kluyveromyces lactis had long been considered a model organism for studies in genetics and physiology. Research on Kluyveromyces lactis is quite advanced and this yeast species is one of the few with which it is possible to perform formal genetic analysis. Nevertheless, until now, no complete metabolic functional annotation has been performed to the proteins encoded in the Kluyveromyces lactis genome. Results In this work, a new metabolic genome-wide functional re-annotation of the proteins encoded in the Kluyveromyces lactis genome was performed, resulting in the annotation of 1759 genes with metabolic functions, and the development of a methodology supported by merlin (software developed in-house). The new annotation includes novelties, such as the assignment of transporter superfamily numbers to genes identified as transporter proteins. Thus, the genes annotated with metabolic functions could be exclusively enzymatic (1410 genes), transporter proteins encoding genes (301 genes) or have both metabolic activities (48 genes). The new annotation produced by this work largely surpassed the Kluyveromyces lactis currently available annotations. A comparison with KEGG’s annotation revealed a match with 844 (~90%) of the genes annotated by KEGG, while adding 850 new gene annotations. Moreover, there are 32 genes with annotations different from KEGG. Conclusions The methodology developed throughout this work can be used to re-annotate any yeast or, with a little tweak of the reference organism, the proteins encoded in any sequenced genome. The new annotation provided by this study offers basic knowledge which might be useful for the scientific community working on this model yeast, because new functions have been identified for the so-called metabolic genes. Furthermore, it served as the basis for the reconstruction of a compartmentalized, genome-scale metabolic model of Kluyveromyces lactis, which is

  6. A Pooled Genome-Wide Association Study of Asperger Syndrome.

    PubMed

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision.

  7. Genome-wide association study of Tourette Syndrome

    PubMed Central

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  8. A Genome-Wide Association Study of Aging

    PubMed Central

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W.; Garcia, Melissa E.; Kaplan, Robert C.; Kumari, Meena; Lunetta, Kathryn L.; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J.; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J.; Biffar, Reiner; Buchman, Aron S.; Boerwinkle, Eric; Couper, David; De Jager, Philip L.; Evans, Denis A.; Harris, Tamara B.; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P.; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J.; Lohman, Kurt K.; Lutsey, Pamela L.; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M.; Reiman, Eric M.; Rotter, Jerome I.; Seshadri, Sudha; Shardell, Michelle D.; Smith, Albert V.; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M. Carola; Bandinelli, Stefania; Baumeister, Sebastian E.; Bennett, David A.; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M.; Newman, Anne B.; Tiemeier, Henning; Franceschini, Nora

    2011-01-01

    Human longevity and healthy aging show moderate heritability (20–50%). We conducted a meta-analysis of genome-wide association studies from nine studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for two outcomes: a) all-cause mortality and b) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10−8). We found fourteen independent SNPs that predicted risk of death, and eight SNPs that predicted event-free survival (p < 10−5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer’s disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. PMID:21782286

  9. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192

  10. A genome-wide association study of aging.

    PubMed

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W; Garcia, Melissa E; Kaplan, Robert C; Kumari, Meena; Lunetta, Kathryn L; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J; Biffar, Reiner; Buchman, Aron S; Boerwinkle, Eric; Couper, David; De Jager, Philip L; Evans, Denis A; Harris, Tamara B; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J; Lohman, Kurt K; Lutsey, Pamela L; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M; Reiman, Eric M; Rotter, Jerome I; Seshadri, Sudha; Shardell, Michelle D; Smith, Albert V; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M Carola; Bandinelli, Stefania; Baumeister, Sebastian E; Bennett, David A; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M; Newman, Anne B; Tiemeier, Henning; Franceschini, Nora

    2011-11-01

    Human longevity and healthy aging show moderate heritability (20%-50%). We conducted a meta-analysis of genome-wide association studies from 9 studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for 2 outcomes: (1) all-cause mortality, and (2) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10(-8)). We found 14 independent SNPs that predicted risk of death, and 8 SNPs that predicted event-free survival (p < 10(-5)). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer's disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity.

  11. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  12. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  13. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  14. From Loci to Biology: Functional Genomics of Genome-Wide Association for Coronary Disease

    PubMed Central

    Nurnberg, Sylvia T; Zhang, Hanrui; Hand, Nicholas J; Bauer, Robert C; Saleheen, Danish; Reilly, Muredach P; Rader, Daniel J

    2016-01-01

    Genome-wide association studies (GWAS) have provided a rich collection of ~58 CAD loci that suggest the existence of previously unsuspected new biology relevant to atherosclerosis. However, these studies only identify genomic loci associated with CAD and many questions remain even after a genomic locus is definitively implicated, including the nature of the causal variant(s) and the causal gene(s), as well as the directionality of effect. There are a number of tools that can be employed for investigation of the functional genomics of these loci, and progress has been made on a limited number of novel CAD loci. New biology regarding atherosclerosis and CAD will be learned through the functional genomics of these loci and the hope is that at least some of these new pathways relevant to CAD pathogenesis will yield new therapeutic targets for the prevention and treatment of CAD. PMID:26892960

  15. Breaking RAD: An evaluation of the utility of restriction site associated DNA sequencing for genome scans of adaptation.

    PubMed

    Lowry, David B; Hoban, Sean; Kelley, Joanna L; Lotterhos, Katie E; Reed, Laura K; Antolin, Michael F; Storfer, Andrew

    2016-09-12

    Understanding how and why populations evolve is of fundamental importance to molecular ecology. RADseq (Restriction site-Associated DNA sequencing), a popular reduced representation method, has ushered in a new era of genome-scale research for assessing population structure, hybridization, demographic history, phylogeography, and migration. RADseq has also been widely used to conduct genome scans to detect loci involved in adaptive divergence among natural populations. Here, we examine the capacity of those RADseq-based genome scan studies to detect loci involved in local adaptation. To understand what proportion of the genome is missed by RADseq studies, we developed a simple model using different numbers of RAD-tags, genome sizes, and extents of linkage disequilibrium (length of haplotype blocks). We then surveyed recent studies that have used RADseq for genome scans and found that that the median density of RADseq markers across these studies was one marker per 3.96 megabases. Given that the length of linkage disequilibrium is often orders of magnitude less than a megabase, we conclude that genome scans based on RADseq data alone are unlikely to advance our understanding of molecular ecology or evolutionary genetics for most systems. This article is protected by copyright. All rights reserved.

  16. Implications of genome-wide association studies in cancer therapeutics.

    PubMed

    Patel, Jai N; McLeod, Howard L; Innocenti, Federico

    2013-09-01

    Genome wide association studies (GWAS) provide an agnostic approach to identifying potential genetic variants associated with disease susceptibility, prognosis of survival and/or predictive of drug response. Although these techniques are costly and interpretation of study results is challenging, they do allow for a more unbiased interrogation of the entire genome, resulting in the discovery of novel genes and understanding of novel biological associations. This review will focus on the implications of GWAS in cancer therapy, in particular germ-line mutations, including findings from major GWAS which have identified predictive genetic loci for clinical outcome and/or toxicity. Lessons and challenges in cancer GWAS are also discussed, including the need for functional analysis and replication, as well as future perspectives for biological and clinical utility. Given the large heterogeneity in response to cancer therapeutics, novel methods of identifying mechanisms and biology of variable drug response and ultimately treatment individualization will be indispensable.

  17. Genome-wide association study identifies novel loci predisposing to cutaneous melanoma†

    PubMed Central

    Amos, Christopher I.; Wang, Li-E; Lee, Jeffrey E.; Gershenwald, Jeffrey E.; Chen, Wei V.; Fang, Shenying; Kosoy, Roman; Zhang, Mingfeng; Qureshi, Abrar A.; Vattathil, Selina; Schacherer, Christopher W.; Gardner, Julie M.; Wang, Yuling; Tim Bishop, D.; Barrett, Jennifer H.; MacGregor, Stuart; Hayward, Nicholas K.; Martin, Nicholas G.; Duffy, David L.; Mann, Graham J.; Cust, Anne; Hopper, John; Brown, Kevin M.; Grimm, Elizabeth A.; Xu, Yaji; Han, Younghun; Jing, Kaiyan; McHugh, Caitlin; Laurie, Cathy C.; Doheny, Kim F.; Pugh, Elizabeth W.; Seldin, Michael F.; Han, Jiali; Wei, Qingyi

    2011-01-01

    We performed a multistage genome-wide association study of melanoma. In a discovery cohort of 1804 melanoma cases and 1026 controls, we identified loci at chromosomes 15q13.1 (HERC2/OCA2 region) and 16q24.3 (MC1R) regions that reached genome-wide significance within this study and also found strong evidence for genetic effects on susceptibility to melanoma from markers on chromosome 9p21.3 in the p16/ARF region and on chromosome 1q21.3 (ARNT/LASS2/ANXA9 region). The most significant single-nucleotide polymorphisms (SNPs) in the 15q13.1 locus (rs1129038 and rs12913832) lie within a genomic region that has profound effects on eye and skin color; notably, 50% of variability in eye color is associated with variation in the SNP rs12913832. Because eye and skin colors vary across European populations, we further evaluated the associations of the significant SNPs after carefully adjusting for European substructure. We also evaluated the top 10 most significant SNPs by using data from three other genome-wide scans. Additional in silico data provided replication of the findings from the most significant region on chromosome 1q21.3 rs7412746 (P = 6 × 10−10). Together, these data identified several candidate genes for additional studies to identify causal variants predisposing to increased risk for developing melanoma. PMID:21926416

  18. A genome-wide search for type 2 diabetes susceptibility genes in Utah Caucasians.

    PubMed

    Elbein, S C; Hoffman, M D; Teng, K; Leppert, M F; Hasstedt, S J

    1999-05-01

    Considerable evidence supports a major inherited component of type 2 diabetes. We initially conducted a genome-wide scan with 440 microsatellite markers at 10-cM intervals in 19 multigenerational families of Northern European ancestry with at least two diabetic siblings. Initial two-point analyses of these families directed marker typing of 23 additional families. Subsequently, all available marker data on the total of 42 families were analyzed using both parametric and nonparametric multipoint methods to test for linkage to type 2 diabetes. One locus on chromosome 1q21-1q23 met genome-wide criteria for significant linkage under a model of recessive inheritance with a common diabetes allele (logarithm of odds [LOD] = 4.295). Both pedigree-based nonparametric linkage (NPL) analysis and affected sib pair (MAPMAKER/SIBS) nonparametric methods also showed the highest genome-wide scores at this region, near markers CRP and APOA2, but failed to meet levels of genome-wide significance. The risk of type 2 diabetes to siblings of a diabetic person when compared with the population (lambdaS) was estimated from MAPMAKER/SIBS to be 2.8 in these 42 families. Simulation studies using study data confirmed a genome-wide significance level of P<0.05 (95% CI 0.005-0.0466). However, analysis of 20 similarly ascertained but smaller families failed to confirm this linkage. The LOD score with 50% heterogeneity for all 62 families considered together was only 2.25, with an estimated lambdaS of 1.87. Our data suggest a novel diabetes susceptibility locus near APOA2 on chromosome 1 in a region with many transcribed genes.

  19. Genome-Wide Association Study of Metabolic Syndrome in Koreans

    PubMed Central

    Jeong, Seok Won; Chung, Myungguen; Park, Soo-Jung; Cho, Seong Beom

    2014-01-01

    Metabolic syndrome (METS) is a disorder of energy utilization and storage and increases the risk of developing cardiovascular disease and diabetes. To identify the genetic risk factors of METS, we carried out a genome-wide association study (GWAS) for 2,657 cases and 5,917 controls in Korean populations. As a result, we could identify 2 single nucleotide polymorphisms (SNPs) with genome-wide significance level p-values (<5 × 10-8), 8 SNPs with genome-wide suggestive p-values (5 × 10-8 ≤ p < 1 × 10-5), and 2 SNPs of more functional variants with borderline p-values (5 × 10-5 ≤ p < 1 × 10-4). On the other hand, the multiple correction criteria of conventional GWASs exclude false-positive loci, but simultaneously, they discard many true-positive loci. To reconsider the discarded true-positive loci, we attempted to include the functional variants (nonsynonymous SNPs [nsSNPs] and expression quantitative trait loci [eQTL]) among the top 5,000 SNPs based on the proportion of phenotypic variance explained by genotypic variance. In total, 159 eQTLs and 18 nsSNPs were presented in the top 5,000 SNPs. Although they should be replicated in other independent populations, 6 eQTLs and 2 nsSNP loci were located in the molecular pathways of LPL, APOA5, and CHRM2, which were the significant or suggestive loci in the METS GWAS. Conclusively, our approach using the conventional GWAS, reconsidering functional variants and pathway-based interpretation, suggests a useful method to understand the GWAS results of complex traits and can be expanded in other genomewide association studies. PMID:25705157

  20. Genome-Wide Association of Heroin Dependence in Han Chinese

    PubMed Central

    Coleman, Jonathan R. I.; Ducci, Francesca; Aliev, Fazil; Newhouse, Stephen J.; Liu, Xiehe; Ma, Xiaohong; Wang, Yingcheng; Collier, David A.; Asherson, Philip; Li, Tao; Breen, Gerome

    2016-01-01

    Drug addiction is a costly and recurring healthcare problem, necessitating a need to understand risk factors and mechanisms of addiction, and to identify new biomarkers. To date, genome-wide association studies (GWAS) for heroin addiction have been limited; moreover they have been restricted to examining samples of European and African-American origin due to difficulty of recruiting samples from other populations. This is the first study to test a Han Chinese population; we performed a GWAS on a homogeneous sample of 370 Han Chinese subjects diagnosed with heroin dependence using the DSM-IV criteria and 134 ethnically matched controls. Analysis using the diagnostic criteria of heroin dependence yielded suggestive evidence for association between variants in the genes CCDC42 (coiled coil domain 42; p = 2.8x10-7) and BRSK2 (BR serine/threonine 2; p = 4.110−6). In addition, we found evidence for risk variants within the ARHGEF10 (Rho guanine nucleotide exchange factor 10) gene on chromosome 8 and variants in a region on chromosome 20q13, which is gene-poor but has a concentration of mRNAs and predicted miRNAs. Gene-based association analysis identified genome-wide significant association between variants in CCDC42 and heroin addiction. Additionally, when we investigated shared risk variants between heroin addiction and risk of other addiction-related and psychiatric phenotypes using polygenic risk scores, we found a suggestive relationship with variants predicting tobacco addiction, and a significant relationship with variants predicting schizophrenia. Our genome wide association study of heroin dependence provides data in a novel sample, with functionally plausible results and evidence of genetic data of value to the field. PMID:27936112

  1. Genome-wide Association Study of Obsessive-Compulsive Disorder

    PubMed Central

    Stewart, S Evelyn; Yu, Dongmei; Scharf, Jeremiah M; Neale, Benjamin M; Fagerness, Jesen A; Mathews, Carol A; Arnold, Paul D; Evans, Patrick D; Gamazon, Eric R; Osiecki, Lisa; McGrath, Lauren; Haddad, Stephen; Crane, Jacquelyn; Hezel, Dianne; Illman, Cornelia; Mayerfeld, Catherine; Konkashbaev, Anuar; Liu, Chunyu; Pluzhnikov, Anna; Tikhomirov, Anna; Edlund, Christopher K; Rauch, Scott L; Moessner, Rainald; Falkai, Peter; Maier, Wolfgang; Ruhrmann, Stephan; Grabe, Hans-Jörgen; Lennertz, Leonard; Wagner, Michael; Bellodi, Laura; Cavallini, Maria Cristina; Richter, Margaret A; Cook, Edwin H; Kennedy, James L; Rosenberg, David; Stein, Dan J; Hemmings, Sian MJ; Lochner, Christine; Azzam, Amin; Chavira, Denise A; Fournier, Eduardo; Garrido, Helena; Sheppard, Brooke; Umaña, Paul; Murphy, Dennis L; Wendland, Jens R; Veenstra-VanderWeele, Jeremy; Denys, Damiaan; Blom, Rianne; Deforce, Dieter; Van Nieuwerburgh, Filip; Westenberg, Herman GM; Walitza, Susanne; Egberts, Karin; Renner, Tobias; Miguel, Euripedes Constantino; Cappi, Carolina; Hounie, Ana G; Conceição do Rosário, Maria; Sampaio, Aline S; Vallada, Homero; Nicolini, Humberto; Lanzagorta, Nuria; Camarena, Beatriz; Delorme, Richard; Leboyer, Marion; Pato, Carlos N; Pato, Michele T; Voyiaziakis, Emanuel; Heutink, Peter; Cath, Danielle C; Posthuma, Danielle; Smit, Jan H; Samuels, Jack; Bienvenu, O Joseph; Cullen, Bernadette; Fyer, Abby J; Grados, Marco A; Greenberg, Benjamin D; McCracken, James T; Riddle, Mark A; Wang, Ying; Coric, Vladimir; Leckman, James F; Bloch, Michael; Pittenger, Christopher; Eapen, Valsamma; Black, Donald W; Ophoff, Roel A; Strengman, Eric; Cusi, Daniele; Turiel, Maurizio; Frau, Francesca; Macciardi, Fabio; Gibbs, J Raphael; Cookson, Mark R; Singleton, Andrew; Hardy, John; Crenshaw, Andrew T; Parkin, Melissa A; Mirel, Daniel B; Conti, David V; Purcell, Shaun; Nestadt, Gerald; Hanna, Gregory L; Jenike, Michael A; Knowles, James A; Cox, Nancy; Pauls, David L

    2014-01-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1,465 cases, 5,557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9,657 X-chromosome SNPs. Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two p-values were located within DLGAP1 (p=2.49×10-6 and p=3.44×10-6), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a p-value=3.84 × 10-8. However, when trios were meta-analyzed with the combined case-control samples, the p-value for this variant was 3.62×10-5, losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation-QTLs (p<0.001) and frontal lobe eQTLs (p=0.001) was observed within the top-ranked SNPs (p<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  2. Genome-wide association study of parity in Bangladeshi women.

    PubMed

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10(-7) and number of pregnancies at P <10(-6). This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10(-6). The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10(-5) for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis.

  3. Genome-wide discovery of loci influencing chemotherapy cytotoxicity.

    PubMed

    Watters, James W; Kraja, Aldi; Meucci, Melissa A; Province, Michael A; McLeod, Howard L

    2004-08-10

    Little is known about the heritability of chemotherapy activity or the identity of genes that may enable the individualization of cancer chemotherapy. Although numerous genes are likely to influence chemotherapy response, current candidate gene-based pharmacogenetics approaches require a priori knowledge and the selection of a small number of candidate genes for hypothesis testing. In this study, an ex vivo familial genetics strategy using lymphoblastoid cells derived from Centre d'Etude du Polymorphisme Humain reference pedigrees was used to discover genetic determinants of chemotherapy cytotoxicity. Cytotoxicity to the mechanistically distinct chemotherapy agents 5-fluorouracil and docetaxel were shown to be heritable traits, with heritability values ranging from 0.26 to 0.65 for 5-fluorouracil and 0.21 to 0.70 for docetaxel, varying with dose. Genome-wide linkage analysis was also used to map a quantitative trait locus influencing the cellular effects of 5-fluorouracil to chromosome 9q13-q22 [logarithm of odds (LOD) = 3.44], and two quantitative trait loci influencing the cellular effects of docetaxel to chromosomes 5q11-21 (LOD = 2.21) and 9q13-q22 (LOD = 2.73). Finally, 5-fluorouracil and docetaxel were shown to cause apoptotic cell death involving caspase-3 cleavage in Centre d'Etude du Polymorphisme Humain lymphoblastoid cells. This study identifies genomic regions likely to harbor genes important for chemotherapy cytotoxicity using genome-wide linkage analysis in human pedigrees and provides a widely applicable strategy for pharmacogenomic discovery without the requirement for a priori candidate gene selection.

  4. Genome-wide association studies and contribution to cardiovascular physiology

    PubMed Central

    Munroe, Patricia B.

    2015-01-01

    The study of family pedigrees with rare monogenic cardiovascular disorders has revealed new molecular players in physiological processes. Genome-wide association studies of complex traits with a heritable component may afford a similar and potentially intellectually richer opportunity. In this review we focus on the interpretation of genetic associations and the issue of causality in relation to known and potentially new physiology. We mainly discuss cardiometabolic traits as it reflects our personal interests, but the issues pertain broadly in many other disciplines. We also describe some of the resources that are now available that may expedite follow up of genetic association signals into observations on causal mechanisms and pathophysiology. PMID:26106147

  5. Genome-wide approaches to defining macrophage identity and function

    PubMed Central

    Fonseca, Gregory J; Seidman, Jason S; Glass, Christopher K

    2016-01-01

    Macrophages play essential roles in the response to injury and infection and contribute to the development and/or homeostasis of the various tissues they reside in. Conversely, macrophages also influence the pathogenesis of metabolic, neurodegenerative, and neoplastic diseases. Mechanisms that contribute to the phenotypic diversity of macrophages in health and disease remain poorly understood. Here we review the recent application of genome-wide approaches to characterize the transcriptomes and epigenetic landscapes of tissue-resident macrophages. These studies are beginning to provide insights into how distinct tissue environments are interpreted by transcriptional regulatory elements to drive specialized programs of gene expression. PMID:28087927

  6. [New insight of genome-wide association study (GWAS)].

    PubMed

    Hotta, Kikuko

    2013-02-01

    The number of obese patients is increasing in Japan, due to the westernization of lifestyle. Obesity, especially visceral fat obesity, is important for the development of metabolic syndrome. Genetic factors are important for the development of obesity as well as environmental factors. Importance of genetic factors of fat distribution is also reported. Recent genome-wide association studies (GWASs) have revealed the obesity and fat distribution-related polymorphisms. GWAS will highlight a better understanding of the underlying molecular mechanisms in the regulation of obesity and distribution of body fat.

  7. [Genome-wide association study for adolescent idiopathic scoliosis].

    PubMed

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS.

  8. Genome-wide DNA methylation profile in mungbean

    PubMed Central

    Kang, Yang Jae; Bae, Ahra; Shim, Sangrea; Lee, Taeyoung; Lee, Jayern; Satyawan, Dani; Kim, Moon Young; Lee, Suk-Ha

    2017-01-01

    DNA methylation on cytosine residues is known to affect gene expression and is potentially responsible for the phenotypic variations among different crop cultivars. Here, we present the whole-genome DNA methylation profiles and assess the potential effects of single nucleotide polymorphisms (SNPs) for two mungbean cultivars, Sunhwanogdu (VC1973A) and Kyunggijaerae#5 (V2984). By measuring the DNA methylation levels in leaf tissue with the bisulfite sequencing (BSseq) approach, we show both the frequencies of the various types of DNA methylation and the distribution of weighted gene methylation levels. SNPs that cause nucleotide changes from/to CHH – where C is cytosine and H is any other nucleotide – were found to affect DNA methylation status in VC1973A and V2984. In order to better understand the correlation between gene expression and DNA methylation levels, we surveyed gene expression in leaf tissues of VC1973A and V2984 using RNAseq. Transcript expressions of paralogous genes were controlled by DNA methylation within the VC1973A genome. Moreover, genes that were differentially expressed between the two cultivars showed distinct DNA methylation patterns. Our mungbean genome-wide methylation profiles will be valuable resources for understanding the phenotypic variations between different cultivars, as well as for molecular breeding. PMID:28084412

  9. A new method to scan genomes for introgression in a secondary contact model.

    PubMed

    Geneva, Anthony J; Muirhead, Christina A; Kingan, Sarah B; Garrigan, Daniel

    2015-01-01

    Secondary contact between divergent populations or incipient species may result in the exchange and introgression of genomic material. We develop a simple DNA sequence measure, called Gmin, which is designed to identify genomic regions experiencing introgression in a secondary contact model. Gmin is defined as the ratio of the minimum between-population number of nucleotide differences in a genomic window to the average number of between-population differences. Although it is conceptually simple, one advantage of Gmin is that it is computationally inexpensive relative to model-based methods for detecting gene flow and it scales easily to the level of whole-genome analysis. We compare the sensitivity and specificity of Gmin to those of the widely used index of population differentiation, FST, and suggest a simple statistical test for identifying genomic outliers. Extensive computer simulations demonstrate that Gmin has both greater sensitivity and specificity for detecting recent introgression than does FST. Furthermore, we find that the sensitivity of Gmin is robust with respect to both the population mutation and recombination rates. Finally, a scan of Gmin across the X chromosome of Drosophila melanogaster identifies candidate regions of introgression between sub-Saharan African and cosmopolitan populations that were previously missed by other methods. These results show that Gmin is a biologically straightforward, yet powerful, alternative to FST, as well as to more computationally intensive model-based methods for detecting gene flow.

  10. A New Method to Scan Genomes for Introgression in a Secondary Contact Model

    PubMed Central

    Geneva, Anthony J.; Muirhead, Christina A.; Kingan, Sarah B.; Garrigan, Daniel

    2015-01-01

    Secondary contact between divergent populations or incipient species may result in the exchange and introgression of genomic material. We develop a simple DNA sequence measure, called Gmin, which is designed to identify genomic regions experiencing introgression in a secondary contact model. Gmin is defined as the ratio of the minimum between-population number of nucleotide differences in a genomic window to the average number of between-population differences. Although it is conceptually simple, one advantage of Gmin is that it is computationally inexpensive relative to model-based methods for detecting gene flow and it scales easily to the level of whole-genome analysis. We compare the sensitivity and specificity of Gmin to those of the widely used index of population differentiation, FST, and suggest a simple statistical test for identifying genomic outliers. Extensive computer simulations demonstrate that Gmin has both greater sensitivity and specificity for detecting recent introgression than does FST. Furthermore, we find that the sensitivity of Gmin is robust with respect to both the population mutation and recombination rates. Finally, a scan of Gmin across the X chromosome of Drosophila melanogaster identifies candidate regions of introgression between sub-Saharan African and cosmopolitan populations that were previously missed by other methods. These results show that Gmin is a biologically straightforward, yet powerful, alternative to FST, as well as to more computationally intensive model-based methods for detecting gene flow. PMID:25874895

  11. GW-SEM: A Statistical Package to Conduct Genome-Wide Structural Equation Modeling.

    PubMed

    Verhulst, Brad; Maes, Hermine H; Neale, Michael C

    2017-03-15

    Improving the accuracy of phenotyping through the use of advanced psychometric tools will increase the power to find significant associations with genetic variants and expand the range of possible hypotheses that can be tested on a genome-wide scale. Multivariate methods, such as structural equation modeling (SEM), are valuable in the phenotypic analysis of psychiatric and substance use phenotypes, but these methods have not been integrated into standard genome-wide association analyses because fitting a SEM at each single nucleotide polymorphism (SNP) along the genome was hitherto considered to be too computationally demanding. By developing a method that can efficiently fit SEMs, it is possible to expand the set of models that can be tested. This is particularly necessary in psychiatric and behavioral genetics, where the statistical methods are often handicapped by phenotypes with large components of stochastic variance. Due to the enormous amount of data that genome-wide scans produce, the statistical methods used to analyze the data are relatively elementary and do not directly correspond with the rich theoretical development, and lack the potential to test more complex hypotheses about the measurement of, and interaction between, comorbid traits. In this paper, we present a method to test the association of a SNP with multiple phenotypes or a latent construct on a genome-wide basis using a diagonally weighted least squares (DWLS) estimator for four common SEMs: a one-factor model, a one-factor residuals model, a two-factor model, and a latent growth model. We demonstrate that the DWLS parameters and p-values strongly correspond with the more traditional full information maximum likelihood parameters and p-values. We also present the timing of simulations and power analyses and a comparison with and existing multivariate GWAS software package.

  12. Genome-wide analysis of differential RNA editing in epilepsy

    PubMed Central

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R.; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M.; Petretto, Enrico; Johnson, Michael R.

    2017-01-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine–temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including “neuron projection” and “seizures.” Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures. PMID:28250018

  13. Genome-wide analysis of differential RNA editing in epilepsy.

    PubMed

    Srivastava, Prashant Kumar; Bagnati, Marta; Delahaye-Duriez, Andree; Ko, Jeong-Hun; Rotival, Maxime; Langley, Sarah R; Shkura, Kirill; Mazzuferi, Manuela; Danis, Bénédicte; van Eyll, Jonathan; Foerch, Patrik; Behmoaras, Jacques; Kaminski, Rafal M; Petretto, Enrico; Johnson, Michael R

    2017-03-01

    The recoding of genetic information through RNA editing contributes to proteomic diversity, but the extent and significance of RNA editing in disease is poorly understood. In particular, few studies have investigated the relationship between RNA editing and disease at a genome-wide level. Here, we developed a framework for the genome-wide detection of RNA sites that are differentially edited in disease. Using RNA-sequencing data from 100 hippocampi from mice with epilepsy (pilocarpine-temporal lobe epilepsy model) and 100 healthy control hippocampi, we identified 256 RNA sites (overlapping with 87 genes) that were significantly differentially edited between epileptic cases and controls. The degree of differential RNA editing in epileptic mice correlated with frequency of seizures, and the set of genes differentially RNA-edited between case and control mice were enriched for functional terms highly relevant to epilepsy, including "neuron projection" and "seizures." Genes with differential RNA editing were preferentially enriched for genes with a genetic association to epilepsy. Indeed, we found that they are significantly enriched for genes that harbor nonsynonymous de novo mutations in patients with epileptic encephalopathy and for common susceptibility variants associated with generalized epilepsy. These analyses reveal a functional convergence between genes that are differentially RNA-edited in acquired symptomatic epilepsy and those that contribute risk for genetic epilepsy. Taken together, our results suggest a potential role for RNA editing in the epileptic hippocampus in the occurrence and severity of epileptic seizures.

  14. Meta-analysis of genome-wide association from genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  15. Enhancing genomic prediction with genome-wide association studies in multiparental maize populations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association mapping using dense marker sets has identified some nucleotide variants affecting complex traits which have been validated with fine-mapping and functional analysis. Many sequence variants associated with complex traits in maize have small effects and low repeatability, howev...

  16. Genome-wide association studies and genetic architecture of common human diseases.

    PubMed

    Montgomery, Grant W

    2011-06-03

    Genome-wide association scans provide the first successful method to identify genetic variation contributing to risk for common complex disease. Progress in identifying genes associated with melanoma show complex relationships between genes for pigmentation and the development of melanoma. Novel risk loci account for only a small fraction of the genetic variation contributing to this and many other diseases. Large meta-analyses find additional variants, but there is current debate about the contribution of common polymorphisms, rare polymorphisms or mutations to disease risk.

  17. Genome Scan for Parent-of-Origin QTL Effects on Bovine Growth and Carcass Traits

    PubMed Central

    Imumorin, Ikhide G.; Kim, Eun-Hee; Lee, Yun-Mi; De Koning, Dirk-Jan; van Arendonk, Johan A.; De Donato, Marcos; Taylor, Jeremy F.; Kim, Jong-Joo

    2011-01-01

    Parent-of-origin effects (POE) such as genomic imprinting influence growth and body composition in livestock, rodents, and humans. Here, we report the results of a genome scan to detect quantitative trait loci (QTL) with POE on growth and carcass traits in Angus × Brahman cattle crossbreds. We identified 24 POE–QTL on 15 Bos taurus autosomes (BTAs) of which six were significant at 5% genome-wide (GW) level and 18 at the 5% chromosome-wide (CW) significance level. Six QTL were paternally expressed while 15 were maternally expressed. Three QTL influencing post-weaning growth map to the proximal end of BTA2 (linkage region of 0–9 cM; genomic region of 5.0–10.8 Mb), for which only one imprinted ortholog is known so far in the human and mouse genomes, and therefore may potentially represent a novel imprinted region. The detected QTL individually explained 1.4 ∼ 5.1% of each trait’s phenotypic variance. Comparative in silico analysis of bovine genomic locations show that 32 out of 1,442 known mammalian imprinted genes from human and mouse homologs map to the identified QTL regions. Although several of the 32 genes have been associated with quantitative traits in cattle, only two (GNAS and PEG3) have experimental proof of being imprinted in cattle. These results lend additional support to recent reports that POE on quantitative traits in mammals may be more common than previously thought, and strengthen the need to identify and experimentally validate cattle orthologs of imprinted genes so as to investigate their effects on quantitative traits. PMID:22303340

  18. Identifying modifier loci in existing genome scan data.

    PubMed

    Daw, E W; Lu, Y; Marian, A J; Shete, S

    2008-09-01

    In many genetic disorders in which a primary disease-causing locus has been identified, evidence exists for additional trait variation due to genetic factors. These findings have led to studies seeking secondary 'modifier' loci. Identification of modifier loci provides insight into disease mechanisms and may provide additional screening and treatment targets. We believe that modifier loci can be identified by re-analysis of genome screen data while controlling for primary locus effects. To test this hypothesis, we simulated multiple replicates of typical genome screening data on to two real family structures from a study of hypertrophic cardiomyopathy. With this marker data, we simulated two trait models with characteristics similar to one measure of hypertrophic cardiomyopathy. Both trait models included 3 genes. In the first, the trait was influenced by a primary gene, a secondary 'modifier' gene, and a third very small effect gene. In the second, we modeled an interaction between the first two genes. We examined power and false positive rates to map the secondary locus while controlling for the effect of the primary locus with two types of analyses. First, we examined Monte Carlo Markov chain (MCMC) simultaneous segregation and linkage analysis as implemented in Loki, for which we calculated two scoring statistics. Second, we calculated LOD scores using an individual-specific liability class based on the quantitative trait value. We found that both methods produced scores that are significant on a genome-wide level in some replicates. We conclude that mapping of modifier loci in existing samples is possible with these methods.

  19. How reliable are empirical genomic scans for selective sweeps?

    PubMed Central

    Teshima, Kosuke M.; Coop, Graham; Przeworski, Molly

    2006-01-01

    The beneficial substitution of an allele shapes patterns of genetic variation at linked sites. Thus, in principle, adaptations can be mapped by looking for the signature of directional selection in polymorphism data. In practice, such efforts are hampered by the need for an accurate characterization of the demographic history of the species and of the effects of positive selection. In an attempt to circumvent these difficulties, researchers are increasingly taking a purely empirical approach, in which a large number of genomic regions are ordered by summaries of the polymorphism data, and loci with extreme values are considered to be likely targets of positive selection. We evaluated the reliability of the “empirical” approach, focusing on applications to human data and to maize. To do so, we considered a coalescent model of directional selection in a sensible demographic setting, allowing for selection on standing variation as well as on a new mutation. Our simulations suggest that while empirical approaches will identify several interesting candidates, they will also miss many—in some cases, most—loci of interest. The extent of the trade-off depends on the mode of positive selection and the demographic history of the population. Specifically, the false-discovery rate is higher when directional selection involves a recessive rather than a co-dominant allele, when it acts on a previously neutral rather than a new allele, and when the population has experienced a population bottleneck rather than maintained a constant size. One implication of these results is that, insofar as attributes of the beneficial mutation (e.g., the dominance coefficient) affect the power to detect targets of selection, genomic scans will yield an unrepresentative subset of loci that contribute to adaptations. PMID:16687733

  20. Genome scan for linkage to Gilles de la Tourette syndrome

    SciTech Connect

    Barr, C.L.; Livingston, J.; Williamson, R.

    1994-09-01

    Gilles de la Tourette Syndrome (TS) is a familial, neuropsychiatric disorder characterized by chronic, intermittent motor and vocal tics. In addition to tics, affected individuals frequently display symptoms such as attention-deficit hyperactivity disorder and/or obsessive compulsive disorder. Genetic analyses of family data have suggested that susceptibility to the disorder is most likely due to a single genetic locus with a dominant mode of transmission and reduced penetrance. In the search for genetic linkage for TS, we have collected well-characterized pedigrees with multiple affected individuals on whom extensive diagnostic evaluations have been done. The first stage of our study is to scan the genome systematically using a panel of uniformly spaced (10 to 20 cM), highly polymorphic, microsatellite markers on 5 families segregating TS. To date, 290 markers have been typed and 3,660 non-overlapping cM of the genome have been excluded for possible linkage under the assumption of genetic homogeneity. Because of the possibility of locus heterogeneity overall summed exclusion is not considered tantamount to absolute exclusion of a disease locus in that region. The results from each family are carefully evaluated and a positive lod score in a single family is followed up by typing closely linked markers. Linkage to TS was examined by two-point analysis using the following genetic model: single autosomal dominant gene with gene frequency .003 and maximum penetrance of .99. An age-of-onset correction is included using a linear function increasing from age 2 years to 21 years. A small rate of phenocopies is also incorporated into the model. Only individuals with TS or CMT according to DSM III-R criteria were regarded as affected for the purposes of this summary. Additional markers are being tested to provide coverage at 5 cM intervals. Moreover, we are currently analyzing the data non-parametrically using the Affected-Pedigree-Member Method of linkage analysis.

  1. Genome-scan for IQ discrepancy in autism: evidence for loci on chromosomes 10 and 16.

    PubMed

    Chapman, Nicola H; Estes, Annette; Munson, Jeff; Bernier, Raphael; Webb, Sara J; Rothstein, Joseph H; Minshew, Nancy J; Dawson, Geraldine; Schellenberg, Gerard D; Wijsman, Ellen M

    2011-01-01

    Performance IQ (PIQ) greater than verbal IQ (VIQ) is often observed in studies of the cognitive abilities of autistic individuals. This characteristic is correlated with social and communication impairments, key parts of the autism diagnosis. We present the first genetic analyses of IQ discrepancy (PIQ-VIQ) as an autism-related phenotype. We performed genome-wide joint linkage and segregation analyses on 287 multiplex families, using a Markov chain Monte Carlo approach. Genetic data included a genome-scan of 387 micro-satellite markers in 210 families augmented with additional markers added in a subset of families. Empirical P values were calculated for five interesting regions. Linkage analysis identified five chromosomal regions with substantial regional evidence of linkage; 10p12 [P = 0.001; genome-wide (gw) P = 0.05], 16q23 (P = .015; gw P = 0.53), 2p21 (P = 0.03, gw P = 0.78), 6q25 (P = 0.047, gw P = 0.91) and 15q23-25 (P = 0.053, gw P = 0.93). The location of the chromosome 10 linkage signal coincides with a region noted in a much earlier genome-scan for autism, and the chromosome 16 signal coincides exactly with a linkage signal for non-word repetition in specific language impairment. This study provides strong evidence for a QTL influencing IQ discrepancy in families with autistic individuals on chromosome 10, and suggestive evidence for a QTL on chromosome 16. The location of the chromosome 16 signal suggests a candidate gene, CDH13, a T-cadherin expressed in the brain, which has been implicated in previous SNP studies of autism and ADHD.

  2. Genome scan for nonadditive heterotic trait loci reveals mainly underdominant effects in Saccharomyces cerevisiae.

    PubMed

    Laiba, Efrat; Glikaite, Ilana; Levy, Yael; Pasternak, Zohar; Fridman, Eyal

    2016-04-01

    The overdominant model of heterosis explains the superior phenotype of hybrids by synergistic allelic interaction within heterozygous loci. To map such genetic variation in yeast, we used a population doubling time dataset of Saccharomyces cerevisiae 16 × 16 diallel and searched for major contributing heterotic trait loci (HTL). Heterosis was observed for the majority of hybrids, as they surpassed their best parent growth rate. However, most of the local heterozygous loci identified by genome scan were surprisingly underdominant, i.e., reduced growth. We speculated that in these loci adverse effects on growth resulted from incompatible allelic interactions. To test this assumption, we eliminated these allelic interactions by creating hybrids with local hemizygosity for the underdominant HTLs, as well as for control random loci. Growth of hybrids was indeed elevated for most hemizygous to HTL genes but not for control genes, hence validating the results of our genome scan. Assessing the consequences of local heterozygosity by reciprocal hemizygosity and allele replacement assays revealed the influence of genetic background on the underdominant effects of HTLs. Overall, this genome-wide study on a multi-parental hybrid population provides a strong argument against single gene overdominance as a major contributor to heterosis, and favors the dominance complementation model.

  3. Quality control and quality assurance in genotypic data for genome-wide association studies

    PubMed Central

    Laurie, Cathy C.; Doheny, Kimberly F.; Mirel, Daniel B.; Pugh, Elizabeth W.; Bierut, Laura J.; Bhangale, Tushar; Boehm, Frederick; Caporaso, Neil E.; Cornelis, Marilyn C.; Edenberg, Howard J.; Gabriel, Stacy B.; Harris, Emily L.; Hu, Frank B.; Jacobs, Kevin; Kraft, Peter; Landi, Maria Teresa; Lumley, Thomas; Manolio, Teri A.; McHugh, Caitlin; Painter, Ian; Paschall, Justin; Rice, John P.; Rice, Kenneth M.; Zheng, Xiuwen; Weir, Bruce S.

    2011-01-01

    Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies. This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium (HWE) test p-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis (PCA) to SNP selection. The methods are illustrated with examples from the ‘Gene Environment Association Studies’ (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of genome-wide association studies. PMID:20718045

  4. Genome-wide association study of antisocial personality disorder

    PubMed Central

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53–3.14), P=1.9 × 10-5). Two polymorphisms at 6p21.2 LINC00951–LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37–1.85), P=1.6 × 10−9) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  5. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set

    PubMed Central

    Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori

    2016-01-01

    To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10−8, the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10−8 (AFR), 9.26 × 10−8 (EUR), 1.83 × 10−7 (AMR), 1.61 × 10−7 (EAS) and 9.46 × 10−8 (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10−8 (ALL) and 4.20 × 10−8 (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10−8) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples. PMID:27305981

  6. Empirical estimation of genome-wide significance thresholds based on the 1000 Genomes Project data set.

    PubMed

    Kanai, Masahiro; Tanaka, Toshihiro; Okada, Yukinori

    2016-10-01

    To assess the statistical significance of associations between variants and traits, genome-wide association studies (GWAS) should employ an appropriate threshold that accounts for the massive burden of multiple testing in the study. Although most studies in the current literature commonly set a genome-wide significance threshold at the level of P=5.0 × 10(-8), the adequacy of this value for respective populations has not been fully investigated. To empirically estimate thresholds for different ancestral populations, we conducted GWAS simulations using the 1000 Genomes Phase 3 data set for Africans (AFR), Europeans (EUR), Admixed Americans (AMR), East Asians (EAS) and South Asians (SAS). The estimated empirical genome-wide significance thresholds were Psig=3.24 × 10(-8) (AFR), 9.26 × 10(-8) (EUR), 1.83 × 10(-7) (AMR), 1.61 × 10(-7) (EAS) and 9.46 × 10(-8) (SAS). We additionally conducted trans-ethnic meta-analyses across all populations (ALL) and all populations except for AFR (ΔAFR), which yielded Psig=3.25 × 10(-8) (ALL) and 4.20 × 10(-8) (ΔAFR). Our results indicate that the current threshold (P=5.0 × 10(-8)) is overly stringent for all ancestral populations except for Africans; however, we should employ a more stringent threshold when conducting a meta-analysis, regardless of the presence of African samples.

  7. Genetic Control of Canine Leishmaniasis: Genome-Wide Association Study and Genomic Selection Analysis

    PubMed Central

    Quilez, Javier; Martínez, Verónica; Woolliams, John A.; Sanchez, Armand; Pong-Wong, Ricardo; Kennedy, Lorna J.; Quinnell, Rupert J.; Ollier, William E. R.; Roura, Xavier; Ferrer, Lluís; Altet, Laura; Francino, Olga

    2012-01-01

    Background The current disease model for leishmaniasis suggests that only a proportion of infected individuals develop clinical disease, while others are asymptomatically infected due to immune control of infection. The factors that determine whether individuals progress to clinical disease following Leishmania infection are unclear, although previous studies suggest a role for host genetics. Our hypothesis was that canine leishmaniasis is a complex disease with multiple loci responsible for the progression of the disease from Leishmania infection. Methodology/Principal Findings Genome-wide association and genomic selection approaches were applied to a population-based case-control dataset of 219 dogs from a single breed (Boxer) genotyped for ∼170,000 SNPs. Firstly, we aimed to identify individual disease loci; secondly, we quantified the genetic component of the observed phenotypic variance; and thirdly, we tested whether genome-wide SNP data could accurately predict the disease. Conclusions/Significance We estimated that a substantial proportion of the genome is affecting the trait and that its heritability could be as high as 60%. Using the genome-wide association approach, the strongest associations were on chromosomes 1, 4 and 20, although none of these were statistically significant at a genome-wide level and after correcting for genetic stratification and lifestyle. Amongst these associations, chromosome 4: 61.2–76.9 Mb maps to a locus that has previously been associated with host susceptibility to human and murine leishmaniasis, and genomic selection estimated markers in this region to have the greatest effect on the phenotype. We therefore propose these regions as candidates for replication studies. An important finding of this study was the significant predictive value from using the genomic information. We found that the phenotype could be predicted with an accuracy of ∼0.29 in new samples and that the affection status was correctly predicted in 60

  8. Whole-genome scan for guttural pouch tympany in Arabian and German warmblood horses.

    PubMed

    Zeitz, A; Spötter, A; Blazyczek, I; Diesterbeck, U; Ohnesorge, B; Deegen, E; Distl, O

    2009-12-01

    Equine guttural pouch tympany (GPT) is a hereditary disease in foals of several breeds, including thoroughbreds, Arabian, Quarter and warmblood horses. We performed a whole-genome scan for GPT in 143 horses from five Arabian and five German warmblood families and genotyped 257 microsatellites. Chromosome-wide significant linkage was detected on ECA2 and ECA15 using multipoint non-parametric linkage analyses. Analyses stratified by sex revealed chromosome-wide significant linkage on ECA2 for fillies and chromosome-wide significant linkage on ECA15 for colts. For Arabian colts, the quantitative trait locus (QTL) on ECA15 was genome-wide significant. Haplotypes including two to four microsatellites within the QTL on ECA2 and 15 in fillies and colts, respectively, were significantly associated with GPT for both breeds. Thus, our analysis indicated sex-specific QTL, a fact which is in agreement with a two- to fourfold higher incidence of GPT in females. This is the first report of QTL for equine GPT and a first step towards identifying genes responsible for GPT.

  9. Genome-wide characteristics of de novo mutations in autism

    PubMed Central

    Yuen, Ryan K C; Merico, Daniele; Cao, Hongzhi; Pellecchia, Giovanna; Alipanahi, Babak; Thiruvahindrapuram, Bhooma; Tong, Xin; Sun, Yuhui; Cao, Dandan; Zhang, Tao; Wu, Xueli; Jin, Xin; Zhou, Ze; Liu, Xiaomin; Nalpathamkalam, Thomas; Walker, Susan; Howe, Jennifer L.; Wang, Zhuozhi; MacDonald, Jeffrey R.; Chan, Ada; D’Abate, Lia; Deneault, Eric; Siu, Michelle T.; Tammimies, Kristiina; Uddin, Mohammed; Zarrei, Mehdi; Wang, Mingbang; Li, Yingrui; Wang, Jun; Wang, Jian; Yang, Huanming; Bookman, Matt; Bingham, Jonathan; Gross, Samuel S.; Loy, Dion; Pletcher, Mathew; Marshall, Christian R.; Anagnostou, Evdokia; Zwaigenbaum, Lonnie; Weksberg, Rosanna; Fernandez, Bridget A; Roberts, Wendy; Szatmari, Peter; Glazer, David; Frey, Brendan J.; Ring, Robert H.; Xu, Xun; Scherer, Stephen W.

    2016-01-01

    De novo mutations (DNMs) are important in Autism Spectrum Disorder (ASD), but so far analyses have mainly been on the ~1.5% of the genome encoding genes. Here, we performed whole genome sequencing (WGS) of 200 ASD parent-child trios and characterized germline and somatic DNMs. We confirmed that the majority of germline DNMs (75.6%) originated from the father, and these increased significantly with paternal age only (p=4.2×10−10). However, when clustered DNMs (those within 20kb) were found in ASD, not only did they mostly originate from the mother (p=7.7×10−13), but they could also be found adjacent to de novo copy number variations (CNVs) where the mutation rate was significantly elevated (p=2.4×10−24). By comparing DNMs detected in controls, we found a significant enrichment of predicted damaging DNMs in ASD cases (p=8.0×10−9; OR=1.84), of which 15.6% (p=4.3×10−3) and 22.5% (p=7.0×10−5) were in the non-coding or genic non-coding, respectively. The non-coding elements most enriched for DNM were untranslated regions of genes, boundaries involved in exon-skipping and DNase I hypersensitive regions. Using microarrays and a novel outlier detection test, we also found aberrant methylation profiles in 2/185 (1.1%) of ASD cases. These same individuals carried independently identified DNMs in the ASD risk- and epigenetic- genes DNMT3A and ADNP. Our data begins to characterize different genome-wide DNMs, and highlight the contribution of non-coding variants, to the etiology of ASD. PMID:27525107

  10. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  11. Genome-Wide Analysis of Human Metapneumovirus Evolution.

    PubMed

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs.

  12. Genome-wide Association Studies for Osteoporosis: A 2013 Update

    PubMed Central

    Liu, Yong-Jun; Zhang, Lei; Papasian, Christopher J.

    2014-01-01

    In the past few years, the bone field has witnessed great advances in genome-wide association studies (GWASs) of osteoporosis, with a number of promising genes identified. In particular, meta-analysis of GWASs, aimed at increasing the power of studies by combining the results from different study populations, have led to the identification of novel associations that would not otherwise have been identified in individual GWASs. Recently, the first whole genome sequencing study for osteoporosis and fractures was published, reporting a novel rare nonsense mutation. This review summarizes the important and representative findings published by December 2013. Comments are made on the notable findings and representative studies for their potential influence and implications on our present understanding of the genetics of osteoporosis. Potential limitations of GWASs and their meta-analyses are evaluated, with an emphasis on understanding the reasons for inconsistent results between different studies and clarification of misinterpretation of GWAS meta-analysis results. Implications and challenges of GWAS are also discussed, including the need for multi- and inter-disciplinary studies. PMID:25006567

  13. Genome-Wide Discriminatory Information Patterns of Cytosine DNA Methylation

    PubMed Central

    Sanchez, Robersy; Mackenzie, Sally A.

    2016-01-01

    Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes IR and (2) the uncertainty of not observing a SNP LCR. We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on IR and on LCR, respectively. A statistical-physical relationship between IR and LCR was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251

  14. Genome-wide profiling of forum domains in Drosophila melanogaster

    PubMed Central

    Tchurikov, Nickolai A.; Kretova, Olga V.; Sosin, Dmitri V.; Zykov, Ivan A.; Zhimulev, Igor F.; Kravatsky, Yuri V.

    2011-01-01

    Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50–200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin. PMID:21247882

  15. Genome-wide significant risk associations for mucinous ovarian carcinoma

    PubMed Central

    Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790

  16. Genome-Wide Analysis of Polyadenylation Events in Schmidtea mediterranea

    PubMed Central

    Lakshmanan, Vairavan; Bansal, Dhiru; Kulkarni, Jahnavi; Poduval, Deepak; Krishna, Srikar; Sasidharan, Vidyanand; Anand, Praveen; Seshasayee, Aswin; Palakodeti, Dasaradhi

    2016-01-01

    In eukaryotes, 3′ untranslated regions (UTRs) play important roles in regulating posttranscriptional gene expression. The 3′UTR is defined by regulated cleavage/polyadenylation of the pre-mRNA. The advent of next-generation sequencing technology has now enabled us to identify these events on a genome-wide scale. In this study, we used poly(A)-position profiling by sequencing (3P-Seq) to capture all poly(A) sites across the genome of the freshwater planarian, Schmidtea mediterranea, an ideal model system for exploring the process of regeneration and stem cell function. We identified the 3′UTRs for ∼14,000 transcripts and thus improved the existing gene annotations. We found 97 transcripts, which are polyadenylated within an internal exon, resulting in the shrinking of the ORF and loss of a predicted protein domain. Around 40% of the transcripts in planaria were alternatively polyadenylated (ApA), resulting either in an altered 3′UTR or a change in coding sequence. We identified specific ApA transcript isoforms that were subjected to miRNA mediated gene regulation using degradome sequencing. In this study, we also confirmed a tissue-specific expression pattern for alternate polyadenylated transcripts. The insights from this study highlight the potential role of ApA in regulating the gene expression essential for planarian regeneration. PMID:27489207

  17. Genome-wide profiling of forum domains in Drosophila melanogaster.

    PubMed

    Tchurikov, Nickolai A; Kretova, Olga V; Sosin, Dmitri V; Zykov, Ivan A; Zhimulev, Igor F; Kravatsky, Yuri V

    2011-05-01

    Forum domains are stretches of chromosomal DNA that are excised from eukaryotic chromosomes during their spontaneous non-random fragmentation. Most forum domains are 50-200 kb in length. We mapped forum domain termini using FISH on polytene chromosomes and we performed genome-wide mapping using a Drosophila melanogaster genomic tiling microarray consisting of overlapping 3 kb fragments. We found that forum termini very often correspond to regions of intercalary heterochromatin and regions of late replication in polytene chromosomes. We found that forum domains contain clusters of several or many genes. The largest forum domains correspond to the main clusters of homeotic genes inside BX-C and ANTP-C, cluster of histone genes and clusters of piRNAs. PRE/TRE and transcription factor binding sites often reside inside domains and do not overlap with forum domain termini. We also found that about 20% of forum domain termini correspond to small chromosomal regions where Ago1, Ago2, small RNAs and repressive chromatin structures are detected. Our results indicate that forum domains correspond to big multi-gene chromosomal units, some of which could be coordinately expressed. The data on the global mapping of forum domains revealed a strong correlation between fragmentation sites in chromosomes, particular sets of mobile elements and regions of intercalary heterochromatin.

  18. minotaur: A platform for the analysis and visualization of multivariate results from genome scans with R Shiny.

    PubMed

    Verity, Robert; Collins, Caitlin; Card, Daren C; Schaal, Sara M; Wang, Liuyang; Lotterhos, Katie E

    2017-01-01

    Genome scans are widely used to identify 'outliers' in genomic data: loci with different patterns compared with the rest of the genome due to the action of selection or other nonadaptive forces of evolution. These genomic data sets are often high dimensional, with complex correlation structures among variables, making it a challenge to identify outliers in a robust way. The Mahalanobis distance has been widely used, but has the major limitation of assuming that data follow a simple parametric distribution. Here, we develop three new metrics that can be used to identify outliers in multivariate space, while making no strong assumptions about the distribution of the data. These metrics are implemented in the R package minotaur, which also includes an interactive web-based application for visualizing outliers in high-dimensional data sets. We illustrate how these metrics can be used to identify outliers from simulated genetic data and discuss some of the limitations they may face in application.

  19. Genome-wide measurement of RNA folding energies.

    PubMed

    Wan, Yue; Qu, Kun; Ouyang, Zhengqing; Kertesz, Michael; Li, Jun; Tibshirani, Robert; Makino, Debora L; Nutter, Robert C; Segal, Eran; Chang, Howard Y

    2012-10-26

    RNA structural transitions are important in the function and regulation of RNAs. Here, we reveal a layer of transcriptome organization in the form of RNA folding energies. By probing yeast RNA structures at different temperatures, we obtained relative melting temperatures (Tm) for RNA structures in over 4000 transcripts. Specific signatures of RNA Tm demarcated the polarity of mRNA open reading frames and highlighted numerous candidate regulatory RNA motifs in 3' untranslated regions. RNA Tm distinguished noncoding versus coding RNAs and identified mRNAs with distinct cellular functions. We identified thousands of putative RNA thermometers, and their presence is predictive of the pattern of RNA decay in vivo during heat shock. The exosome complex recognizes unpaired bases during heat shock to degrade these RNAs, coupling intrinsic structural stabilities to gene regulation. Thus, genome-wide structural dynamics of RNA can parse functional elements of the transcriptome and reveal diverse biological insights.

  20. Genome-wide studies of telomere biology in budding yeast

    PubMed Central

    Harari, Yaniv; Kupiec, Martin

    2014-01-01

    Telomeres are specialized DNA-protein structures at the ends of eukaryotic chromosomes. Telomeres are essential for chromosomal stability and integrity, as they prevent chromosome ends from being recognized as double strand breaks. In rapidly proliferating cells, telomeric DNA is synthesized by the enzyme telomerase, which copies a short template sequence within its own RNA moiety, thus helping to solve the “end-replication problem”, in which information is lost at the ends of chromosomes with each DNA replication cycle. The basic mechanisms of telomere length, structure and function maintenance are conserved among eukaryotes. Studies in the yeast Saccharomyces cerevisiae have been instrumental in deciphering the basic aspects of telomere biology. In the last decade, technical advances, such as the availability of mutant collections, have allowed carrying out systematic genome-wide screens for mutants affecting various aspects of telomere biology. In this review we summarize these efforts, and the insights that this Systems Biology approach has produced so far.

  1. Ultrafast laser nanosurgery in microfluidics for genome-wide screenings

    PubMed Central

    Ben-Yakar, Adela; Bourgeois, Frederic

    2009-01-01

    Summary The use of ultrafast laser pulses in surgery has allowed for unprecedented precision with minimal collateral damage to surrounding tissues. For these reasons, ultrafast laser nanosurgery, as an injury model, has gained tremendous momentum in experimental biology ranging from in-vitro manipulations of subcellular structures to in-vivo studies in whole living organisms. For example, femtosecond laser nanosurgery on such model organism as the nematode Caenorhabditis elegans (C. elegans) has opened new opportunities for in-vivo nerve regeneration studies. Meanwhile, the development of novel microfluidic devices has brought the control in experimental environment to the level required for precise nanosurgery in various animal models. Merging microfluidics and laser nanosurgery has recently improved the specificities and increased the speed of laser surgeries enabling fast genome-wide screenings that can more readily decode the genetic map of various biological processes. PMID:19278850

  2. Genome-wide association studies in pharmacogenomics of antidepressants.

    PubMed

    Lin, Eugene; Lane, Hsien-Yuan

    2015-01-01

    Major depressive disorder (MDD) is one of the most common psychiatric disorders worldwide. Doctors must prescribe antidepressants based on educated guesses due to the fact that it is unmanageable to predict the effectiveness of any particular antidepressant in an individual patient. With the recent advent of scientific research, the genome-wide association study (GWAS) is extensively employed to analyze hundreds of thousands of single nucleotide polymorphisms by high-throughput genotyping technologies. In addition to the candidate-gene approach, the GWAS approach has recently been utilized to investigate the determinants of antidepressant response to therapy. In this study, we reviewed GWAS studies, their limitations and future directions with respect to the pharmacogenomics of antidepressants in MDD.

  3. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op t Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O’Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2013-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10−7) in SOX2OT and rs17030795 (P=5.84×10−6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10−6) between CUL3 and FAM124B and rs1886797 (P=8.05×10−6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P= 4×10−6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:21079607

  4. A genome-wide association study of anorexia nervosa.

    PubMed

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field.

  5. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  6. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  7. Genome-Wide Expression Profiling of Complex Regional Pain Syndrome

    PubMed Central

    Jin, Eun-Heui; Zhang, Enji; Ko, Youngkwon; Sim, Woo Seog; Moon, Dong Eon; Yoon, Keon Jung; Hong, Jang Hee; Lee, Won Hyung

    2013-01-01

    Complex regional pain syndrome (CRPS) is a chronic, progressive, and devastating pain syndrome characterized by spontaneous pain, hyperalgesia, allodynia, altered skin temperature, and motor dysfunction. Although previous gene expression profiling studies have been conducted in animal pain models, there genome-wide expression profiling in the whole blood of CRPS patients has not been reported yet. Here, we successfully identified certain pain-related genes through genome-wide expression profiling in the blood from CRPS patients. We found that 80 genes were differentially expressed between 4 CRPS patients (2 CRPS I and 2 CRPS II) and 5 controls (cut-off value: 1.5-fold change and p<0.05). Most of those genes were associated with signal transduction, developmental processes, cell structure and motility, and immunity and defense. The expression levels of major histocompatibility complex class I A subtype (HLA-A29.1), matrix metalloproteinase 9 (MMP9), alanine aminopeptidase N (ANPEP), l-histidine decarboxylase (HDC), granulocyte colony-stimulating factor 3 receptor (G-CSF3R), and signal transducer and activator of transcription 3 (STAT3) genes selected from the microarray were confirmed in 24 CRPS patients and 18 controls by quantitative reverse transcription-polymerase chain reaction (qRT-PCR). We focused on the MMP9 gene that, by qRT-PCR, showed a statistically significant difference in expression in CRPS patients compared to controls with the highest relative fold change (4.0±1.23 times and p = 1.4×10−4). The up-regulation of MMP9 gene in the blood may be related to the pain progression in CRPS patients. Our findings, which offer a valuable contribution to the understanding of the differential gene expression in CRPS may help in the understanding of the pathophysiology of CRPS pain progression. PMID:24244504

  8. Deciphering Genome-Environment-Wide Interactions Using Exposed Subjects Only

    PubMed Central

    Zhao, Lue Ping; Fan, Wenhong; Goodman, Gary; Radich, Jerry; Martin, Paul

    2015-01-01

    The recent successes of genome-wide association studies (GWAS) have renewed interest in genome-environment-wide interaction studies (GEWIS) to discover genetic factors that modulate penetrance of environmental exposures to human diseases. Indeed, gene-environment interactions (GxE), which have not been emphasized in the GWAS era, could be a source contributing to the missing heritability, a major bottleneck limiting continuing GWAS successes. In this manuscript, we describe a design and analytic strategy to focus on GxE using only exposed subjects, dubbed as e-GEWIS. Operationally, an e-GEWIS analysis is equivalent to a GWAS analysis on exposed subjects only, and it has actually been used in some earlier GWAS without being explicitly identified as such. Through both analytics and simulations, e-GEWIS have been shown better efficiency than the usual cross-product-based analysis of GxE interaction with both cases and controls (cc-GEWIS), and they have comparable efficiency to case-only analysis of GxE (c-GEWIS), with potentially smaller sample sizes. The formalization of e-GEWIS here provides a theoretical basis to legitimize this framework for routine investigation of GxE, for more efficient GxE study designs, and for improvement of reproducibility in replicating GEWIS findings. As an illustration, we apply e-GEWIS to a lung cancer GWAS dataset to perform a GEWIS, focusing on gene and smoking interaction. The e-GEWIS analysis successfully uncovered positive genetic associations on chromosome 15 among current smokers, suggesting a gene-smoking interaction. While this signal was detected earlier, the current finding here serves as a positive control in support of this e-GEWIS strategy. PMID:25694100

  9. Genome-Wide Association Study of Meiotic Recombination Phenotypes

    PubMed Central

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G.; Sherman, Stephanie L.; Feingold, Eleanor

    2016-01-01

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9. By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2. This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events. PMID:27733454

  10. Genome-Wide Association Study of Meiotic Recombination Phenotypes.

    PubMed

    Begum, Ferdouse; Chowdhury, Reshmi; Cheung, Vivian G; Sherman, Stephanie L; Feingold, Eleanor

    2016-12-07

    Meiotic recombination is an essential step in gametogenesis, and is one that also generates genetic diversity. Genome-wide association studies (GWAS) and molecular studies have identified genes that influence of human meiotic recombination. RNF212 is associated with total or average number of recombination events, and PRDM9 is associated with the locations of hotspots, or sequences where crossing over appears to cluster. In addition, a common inversion on chromosome 17 is strongly associated with recombination. Other genes have been identified by GWAS, but those results have not been replicated. In this study, using new datasets, we characterized additional recombination phenotypes to uncover novel candidates and further dissect the role of already known loci. We used three datasets totaling 1562 two-generation families, including 3108 parents with 4304 children. We estimated five different recombination phenotypes including two novel phenotypes (average recombination counts within recombination hotspots and outside of hotspots) using dense SNP array genotype data. We then performed gender-specific and combined-sex genome-wide association studies (GWAS) meta-analyses. We replicated associations for several previously reported recombination genes, including RNF212 and PRDM9 By looking specifically at recombination events outside of hotspots, we showed for the first time that PRDM9 has different effects in males and females. We identified several new candidate loci, particularly for recombination events outside of hotspots. These include regions near the genes SPINK6, EVC2, ARHGAP25, and DLGAP2 This study expands our understanding of human meiotic recombination by characterizing additional features that vary across individuals, and identifying regulatory variants influencing the numbers and locations of recombination events.

  11. Genomic resources and their influence on the detection of the signal of positive selection in genome scans.

    PubMed

    Manel, S; Perrier, C; Pratlong, M; Abi-Rached, L; Paganini, J; Pontarotti, P; Aurelle, D

    2016-01-01

    Genome scans represent powerful approaches to investigate the action of natural selection on the genetic variation of natural populations and to better understand local adaptation. This is very useful, for example, in the field of conservation biology and evolutionary biology. Thanks to Next Generation Sequencing, genomic resources are growing exponentially, improving genome scan analyses in non-model species. Thousands of SNPs called using Reduced Representation Sequencing are increasingly used in genome scans. Besides, genome sequences are also becoming increasingly available, allowing better processing of short-read data, offering physical localization of variants, and improving haplotype reconstruction and data imputation. Ultimately, genome sequences are also becoming the raw material for selection inferences. Here, we discuss how the increasing availability of such genomic resources, notably genome sequences, influences the detection of signals of selection. Mainly, increasing data density and having the information of physical linkage data expand genome scans by (i) improving the overall quality of the data, (ii) helping the reconstruction of demographic history for the population studied to decrease false-positive rates and (iii) improving the statistical power of methods to detect the signal of selection. Of particular importance, the availability of a high-quality reference genome can improve the detection of the signal of selection by (i) allowing matching the potential candidate loci to linked coding regions under selection, (ii) rapidly moving the investigation to the gene and function and (iii) ensuring that the highly variable regions of the genomes that include functional genes are also investigated. For all those reasons, using reference genomes in genome scan analyses is highly recommended.

  12. Genome-Wide Association Studies: Contribution of Genomics to Understanding Blood Pressure and Essential Hypertension

    PubMed Central

    2010-01-01

    Contemporary genomic tools now allow the fast and reliable genotyping of hundreds of thousands of variants and permit an unbiased interrogation of the common variability across the human genome. These technical advances have been the basis of numerous recent investigations of genes underlying complex genetic traits, and the results for blood pressure and hypertension have been of particular interest. The pathophysiology of the complex genetic trait blood pressure and hypertension is unclear. The heritability of essential hypertension is high and insights can be gained by finding associated genes. Current genome-wide association studies (GWAS) have identified 10 to 20 loci in or near genes that generally were not expected to be associated with blood pressure or essential hypertension; more significant variants will be discovered when even larger and more refined studies become available. This article gives a short introduction to GWAS and summarizes the current findings for blood pressure and hypertension. PMID:20425154

  13. Genome-wide association study of circulating retinol levels.

    PubMed

    Mondul, Alison M; Yu, Kai; Wheeler, William; Zhang, Hong; Weinstein, Stephanie J; Major, Jacqueline M; Cornelis, Marilyn C; Männistö, Satu; Hazra, Aditi; Hsing, Ann W; Jacobs, Kevin B; Eliassen, Heather; Tanaka, Toshiko; Reding, Douglas J; Hendrickson, Sara; Ferrucci, Luigi; Virtamo, Jarmo; Hunter, David J; Chanock, Stephen J; Kraft, Peter; Albanes, Demetrius

    2011-12-01

    Retinol is one of the most biologically active forms of vitamin A and is hypothesized to influence a wide range of human diseases including asthma, cardiovascular disease, infectious diseases and cancer. We conducted a genome-wide association study of 5006 Caucasian individuals drawn from two cohorts of men: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention (ATBC) Study and the Prostate, Lung, Colorectal, and Ovarian (PLCO) Cancer Screening Trial. We identified two independent single-nucleotide polymorphisms associated with circulating retinol levels, which are located near the transthyretin (TTR) and retinol binding protein 4 (RBP4) genes which encode major carrier proteins of retinol: rs1667255 (P =2.30× 10(-17)) and rs10882272 (P =6.04× 10(-12)). We replicated the association with rs10882272 in RBP4 in independent samples from the Nurses' Health Study and the Invecchiare in Chianti Study (InCHIANTI) that included 3792 women and 504 men (P =9.49× 10(-5)), but found no association for retinol with rs1667255 in TTR among women, thus suggesting evidence for gender dimorphism (P-interaction=1.31× 10(-5)). Discovery of common genetic variants associated with serum retinol levels may provide further insight into the contribution of retinol and other vitamin A compounds to the development of cancer and other complex diseases.

  14. A synergistic DNA logic predicts genome-wide chromatin accessibility

    PubMed Central

    Hashimoto, Tatsunori; Sherwood, Richard I.; Kang, Daniel D.; Rajagopal, Nisha; Barkal, Amira A.; Zeng, Haoyang; Emons, Bart J.M.; Srinivasan, Sharanya; Jaakkola, Tommi; Gifford, David K.

    2016-01-01

    Enhancers and promoters commonly occur in accessible chromatin characterized by depleted nucleosome contact; however, it is unclear how chromatin accessibility is governed. We show that log-additive cis-acting DNA sequence features can predict chromatin accessibility at high spatial resolution. We develop a new type of high-dimensional machine learning model, the Synergistic Chromatin Model (SCM), which when trained with DNase-seq data for a cell type is capable of predicting expected read counts of genome-wide chromatin accessibility at every base from DNA sequence alone, with the highest accuracy at hypersensitive sites shared across cell types. We confirm that a SCM accurately predicts chromatin accessibility for thousands of synthetic DNA sequences using a novel CRISPR-based method of highly efficient site-specific DNA library integration. SCMs are directly interpretable and reveal that a logic based on local, nonspecific synergistic effects, largely among pioneer TFs, is sufficient to predict a large fraction of cellular chromatin accessibility in a wide variety of cell types. PMID:27456004

  15. Genomic Research and Wide Data Sharing: Views of Prospective Participants

    PubMed Central

    Trinidad, Susan Brown; Fullerton, Stephanie M.; Bares, Julie M.; Jarvik, Gail P.; Larson, Eric B.; Burke, Wylie

    2011-01-01

    Purpose Sharing study data within the research community generates tension between two important goods: promoting scientific goals and protecting the privacy interests of study participants. The present study was designed to explore the perceptions, beliefs, and attitudes of research participants and possible future participants regarding genome-wide association studies (GWAS) and repository-based research. Methods Focus group sessions with (1) current research participants, (2) surrogate decision-makers, and (3) three age-defined cohorts (18–34 years, 35–50, >50). Results Participants expressed a variety of opinions about the acceptability of wide sharing of genetic and phenotypic information for research purposes through large, publicly accessible data repositories. Most believed that making de-identified study data available to the research community is a social good that should be pursued. Privacy and confidentiality concerns were common, though they would not necessarily preclude participation. Many participants voiced reservations about sharing data with for-profit organizations. Conclusions Trust is central in participants’ views regarding GWAS data sharing. Further research is needed to develop governance models that enact the values of stewardship. PMID:20535021

  16. Potential assessment of genome-wide association study and genomic selection in Japanese pear Pyrus pyrifolia.

    PubMed

    Iwata, Hiroyoshi; Hayashi, Takeshi; Terakami, Shingo; Takada, Norio; Sawamura, Yutaka; Yamamoto, Toshiya

    2013-03-01

    Although the potential of marker-assisted selection (MAS) in fruit tree breeding has been reported, bi-parental QTL mapping before MAS has hindered the introduction of MAS to fruit tree breeding programs. Genome-wide association studies (GWAS) are an alternative to bi-parental QTL mapping in long-lived perennials. Selection based on genomic predictions of breeding values (genomic selection: GS) is another alternative for MAS. This study examined the potential of GWAS and GS in pear breeding with 76 Japanese pear cultivars to detect significant associations of 162 markers with nine agronomic traits. We applied multilocus Bayesian models accounting for ordinal categorical phenotypes for GWAS and GS model training. Significant associations were detected at harvest time, black spot resistance and the number of spurs and two of the associations were closely linked to known loci. Genome-wide predictions for GS were accurate at the highest level (0.75) in harvest time, at medium levels (0.38-0.61) in resistance to black spot, firmness of flesh, fruit shape in longitudinal section, fruit size, acid content and number of spurs and at low levels (<0.2) in all soluble solid content and vigor of tree. Results suggest the potential of GWAS and GS for use in future breeding programs in Japanese pear.

  17. Breaking RAD: an evaluation of the utility of restriction site-associated DNA sequencing for genome scans of adaptation.

    PubMed

    Lowry, David B; Hoban, Sean; Kelley, Joanna L; Lotterhos, Katie E; Reed, Laura K; Antolin, Michael F; Storfer, Andrew

    2017-03-01

    Understanding how and why populations evolve is of fundamental importance to molecular ecology. Restriction site-associated DNA sequencing (RADseq), a popular reduced representation method, has ushered in a new era of genome-scale research for assessing population structure, hybridization, demographic history, phylogeography and migration. RADseq has also been widely used to conduct genome scans to detect loci involved in adaptive divergence among natural populations. Here, we examine the capacity of those RADseq-based genome scan studies to detect loci involved in local adaptation. To understand what proportion of the genome is missed by RADseq studies, we developed a simple model using different numbers of RAD-tags, genome sizes and extents of linkage disequilibrium (length of haplotype blocks). Under the best-case modelling scenario, we found that RADseq using six- or eight-base pair cutting restriction enzymes would fail to sample many regions of the genome, especially for species with short linkage disequilibrium. We then surveyed recent studies that have used RADseq for genome scans and found that the median density of markers across these studies was 4.08 RAD-tag markers per megabase (one marker per 245 kb). The length of linkage disequilibrium for many species is one to three orders of magnitude less than density of the typical recent RADseq study. Thus, we conclude that genome scans based on RADseq data alone, while useful for studies of neutral genetic variation and genetic population structure, will likely miss many loci under selection in studies of local adaptation.

  18. A genome-wide association study of gestational diabetes mellitus in Korean women.

    PubMed

    Kwak, Soo Heon; Kim, Sung-Hoon; Cho, Young Min; Go, Min Jin; Cho, Yoon Shin; Choi, Sung Hee; Moon, Min Kyong; Jung, Hye Seung; Shin, Hyoung Doo; Kang, Hyun Min; Cho, Nam H; Lee, In Kyu; Kim, Seong Yeon; Han, Bok-Ghee; Jang, Hak C; Park, Kyong Soo

    2012-02-01

    Knowledge regarding the genetic risk loci for gestational diabetes mellitus (GDM) is still limited. In this study, we performed a two-stage genome-wide association analysis in Korean women. In the stage 1 genome scan, 468 women with GDM and 1,242 nondiabetic control women were compared using 2.19 million genotyped or imputed markers. We selected 11 loci for further genotyping in stage 2 samples of 931 case and 783 control subjects. The joint effect of stage 1 plus stage 2 studies was analyzed by meta-analysis. We also investigated the effect of known type 2 diabetes variants in GDM. Two loci known to be associated with type 2 diabetes had a genome-wide significant association with GDM in the joint analysis. rs7754840, a variant in CDKAL1, had the strongest association with GDM (odds ratio 1.518; P=6.65×10(-16)). A variant near MTNR1B, rs10830962, was also significantly associated with the risk of GDM (1.454; P=2.49×10(-13)). We found that there is an excess of association between known type 2 diabetes variants and GDM above what is expected under the null hypothesis. In conclusion, we have confirmed that genetic variants in CDKAL1 and near MTNR1B are strongly associated with GDM in Korean women. There seems to be a shared genetic basis between GDM and type 2 diabetes.

  19. Mosaic paternal genome-wide uniparental isodisomy with down syndrome.

    PubMed

    Darcy, Diana; Atwal, Paldeep Singh; Angell, Cathy; Gadi, Inder; Wallerstein, Robert

    2015-10-01

    We report on a 6-month-old girl with two apparent cell lines; one with trisomy 21, and the other with paternal genome-wide uniparental isodisomy (GWUPiD), identified using single nucleotide polymorphism (SNP) based microarray and microsatellite analysis of polymorphic loci. The patient has Beckwith-Wiedemann syndrome (BWS) due to paternal uniparental disomy (UPD) at chromosome location 11p15 (UPD 11p15), which was confirmed through methylation analysis. Hyperinsulinemic hypoglycemia is present, which is associated with paternal UPD 11p15.5; and she likely has medullary nephrocalcinosis, which is associated with paternal UPD 20, although this was not biochemically confirmed. Angelman syndrome (AS) analysis was negative but this testing is not completely informative; she has no specific features of AS. Clinical features of this patient include: dysmorphic features consistent with trisomy 21, tetralogy of Fallot, hemihypertrophy, swirled skin hyperpigmentation, hepatoblastoma, and Wilms tumor. Her karyotype is 47,XX,+21[19]/46,XX[4], and microarray results suggest that the cell line with trisomy 21 is biparentally inherited and represents 40-50% of the genomic material in the tested specimen. The difference in the level of cytogenetically detected mosaicism versus the level of mosaicism observed via microarray analysis is likely caused by differences in the test methodologies. While a handful of cases of mosaic paternal GWUPiD have been reported, this patient is the only reported case that also involves trisomy 21. Other GWUPiD patients have presented with features associated with multiple imprinted regions, as does our patient.

  20. Genome-wide methylation analyses in glioblastoma multiforme.

    PubMed

    Lai, Rose K; Chen, Yanwen; Guan, Xiaowei; Nousome, Darryl; Sharma, Charu; Canoll, Peter; Bruce, Jeffrey; Sloan, Andrew E; Cortes, Etty; Vonsattel, Jean-Paul; Su, Tao; Delgado-Cruzata, Lissette; Gurvich, Irina; Santella, Regina M; Ostrom, Quinn; Lee, Annette; Gregersen, Peter; Barnholtz-Sloan, Jill

    2014-01-01

    Few studies had investigated genome-wide methylation in glioblastoma multiforme (GBM). Our goals were to study differential methylation across the genome in gene promoters using an array-based method, as well as repetitive elements using surrogate global methylation markers. The discovery sample set for this study consisted of 54 GBM from Columbia University and Case Western Reserve University, and 24 brain controls from the New York Brain Bank. We assembled a validation dataset using methylation data of 162 TCGA GBM and 140 brain controls from dbGAP. HumanMethylation27 Analysis Bead-Chips (Illumina) were used to interrogate 26,486 informative CpG sites in both the discovery and validation datasets. Global methylation levels were assessed by analysis of L1 retrotransposon (LINE1), 5 methyl-deoxycytidine (5m-dC) and 5 hydroxylmethyl-deoxycytidine (5hm-dC) in the discovery dataset. We validated a total of 1548 CpG sites (1307 genes) that were differentially methylated in GBM compared to controls. There were more than twice as many hypomethylated genes as hypermethylated ones. Both the discovery and validation datasets found 5 tumor methylation classes. Pathway analyses showed that the top ten pathways in hypomethylated genes were all related to functions of innate and acquired immunities. Among hypermethylated pathways, transcriptional regulatory network in embryonic stem cells was the most significant. In the study of global methylation markers, 5m-dC level was the best discriminant among methylation classes, whereas in survival analyses, high level of LINE1 methylation was an independent, favorable prognostic factor in the discovery dataset. Based on a pathway approach, hypermethylation in genes that control stem cell differentiation were significant, poor prognostic factors of overall survival in both the discovery and validation datasets. Approaches that targeted these methylated genes may be a future therapeutic goal.

  1. Genome-Wide Architecture of Disease Resistance Genes in Lettuce.

    PubMed

    Christopoulou, Marilena; Wo, Sebastian Reyes-Chin; Kozik, Alex; McHale, Leah K; Truco, Maria-Jose; Wroblewski, Tadeusz; Michelmore, Richard W

    2015-10-08

    Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes.

  2. Genome-wide screening and identification of antigens for rickettsial vaccine development

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...

  3. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Canine Cohort, Canine Intelligence Assessment Regimen, Genome-Wide Single Nucleotide Polymorphism (SNP) Typing, and Unsupervised Classification Algorithm for Genome-Wide Association Data Analysis

    DTIC Science & Technology

    2011-09-01

    were down-selected and successfully genotyped for whole genome (WG) single nucleotide polymorphism (SNP) markers by means of the Affymetrix Canine...SUBJECT TERMS Military working dog genome-wide association study genetic marker intelligence... marker , intelligence, Canine Intelligence Testing Protocol, classification technique, clustering analysis Technical Report: September 2011 2

  4. Hybrid wide-field and scanning microscopy for high-speed 3D imaging.

    PubMed

    Duan, Yubo; Chen, Nanguang

    2015-11-15

    Wide-field optical microscopy is efficient and robust in biological imaging, but it lacks depth sectioning. In contrast, scanning microscopic techniques, such as confocal microscopy and multiphoton microscopy, have been successfully used for three-dimensional (3D) imaging with optical sectioning capability. However, these microscopic techniques are not very suitable for dynamic real-time imaging because they usually take a long time for temporal and spatial scanning. Here, a hybrid imaging technique combining wide-field microscopy and scanning microscopy is proposed to accelerate the image acquisition process while maintaining the 3D optical sectioning capability. The performance was demonstrated by proof-of-concept imaging experiments with fluorescent beads and zebrafish liver.

  5. Wide-field optical coherence tomography angiography enabled by two repeated measurements of B-scans.

    PubMed

    Wang, Ruikang K; Zhang, Anqi; Choi, Woo June; Zhang, Qinqin; Chen, Chieh-Li; Miller, Andrew; Gregori, Giovanni; Rosenfeld, Philip J

    2016-05-15

    Optical coherence tomography angiography (OCTA) has increasingly become clinically important, particularly in ophthalmology. However, the field of view (FOV) for current OCTA imaging is severely limited due to A-scan rates that can be afforded by current clinical systems and, more importantly, the requirement of a repeated scanning protocol. This Letter evaluates the possibility of using only two repeated B-scans for OCTA for the purpose of an increased FOV. The effect of repeated numbers on the OCTA result is discussed through experiments on an animal model in vivo and evaluated using quantitative metrics for image quality. Demonstrated through in vivo imaging of a pathological human eye, we show that optical microangiography-based OCTA with two repeated B-scans can provide wide-field angiography up to 12×12  mm with clinically acceptable image quality.

  6. Genome-wide analysis links NFATC2 with asparaginase hypersensitivity

    PubMed Central

    Fernandez, Christian A.; Smith, Colton; Yang, Wenjian; Mullighan, Charles G.; Qu, Chunxu; Larsen, Eric; Bowman, W. Paul; Liu, Chengcheng; Ramsey, Laura B.; Chang, Tamara; Karol, Seth E.; Loh, Mignon L.; Raetz, Elizabeth A.; Winick, Naomi J.; Hunger, Stephen P.; Carroll, William L.; Jeha, Sima; Pui, Ching-Hon; Evans, William E.; Devidas, Meenakshi

    2015-01-01

    Asparaginase is used to treat acute lymphoblastic leukemia (ALL); however, hypersensitivity reactions can lead to suboptimal asparaginase exposure. Our objective was to use a genome-wide approach to identify loci associated with asparaginase hypersensitivity in children with ALL enrolled on St. Jude Children’s Research Hospital (SJCRH) protocols Total XIIIA (n = 154), Total XV (n = 498), and Total XVI (n = 271), or Children’s Oncology Group protocols POG 9906 (n = 222) and AALL0232 (n = 2163). Germline DNA was genotyped using the Affymetrix 500K, Affymetrix 6.0, or the Illumina Exome BeadChip array. In multivariate logistic regression, the intronic rs6021191 variant in nuclear factor of activated T cells 2 (NFATC2) had the strongest association with hypersensitivity (P = 4.1 × 10−8; odds ratio [OR] = 3.11). RNA-seq data available from 65 SJCRH ALL tumor samples and 52 Yoruba HapMap samples showed that samples carrying the rs6021191 variant had higher NFATC2 expression compared with noncarriers (P = 1.1 × 10−3 and 0.03, respectively). The top ranked nonsynonymous polymorphism was rs17885382 in HLA-DRB1 (P = 3.2 × 10−6; OR = 1.63), which is in near complete linkage disequilibrium with the HLA-DRB1*07:01 allele we previously observed in a candidate gene study. The strongest risk factors for asparaginase allergy are variants within genes regulating the immune response. PMID:25987655

  7. Reconstructing Roma History from Genome-Wide Data

    PubMed Central

    Moorjani, Priya; Patterson, Nick; Loh, Po-Ru; Lipson, Mark; Kisfali, Péter; Melegh, Bela I.; Bonin, Michael; Kádaši, Ľudevít; Rieß, Olaf; Berger, Bonnie; Reich, David; Melegh, Béla

    2013-01-01

    The Roma people, living throughout Europe and West Asia, are a diverse population linked by the Romani language and culture. Previous linguistic and genetic studies have suggested that the Roma migrated into Europe from South Asia about 1,000–1,500 years ago. Genetic inferences about Roma history have mostly focused on the Y chromosome and mitochondrial DNA. To explore what additional information can be learned from genome-wide data, we analyzed data from six Roma groups that we genotyped at hundreds of thousands of single nucleotide polymorphisms (SNPs). We estimate that the Roma harbor about 80% West Eurasian ancestry–derived from a combination of European and South Asian sources–and that the date of admixture of South Asian and European ancestry was about 850 years before present. We provide evidence for Eastern Europe being a major source of European ancestry, and North-west India being a major source of the South Asian ancestry in the Roma. By computing allele sharing as a measure of linkage disequilibrium, we estimate that the migration of Roma out of the Indian subcontinent was accompanied by a severe founder event, which appears to have been followed by a major demographic expansion after the arrival in Europe. PMID:23516520

  8. Identification of differential translation in genome wide studies.

    PubMed

    Larsson, Ola; Sonenberg, Nahum; Nadon, Robert

    2010-12-14

    Regulation of gene expression through translational control is a fundamental mechanism implicated in many biological processes ranging from memory formation to innate immunity and whose dysregulation contributes to human diseases. Genome wide analyses of translational control strive to identify differential translation independent of cytosolic mRNA levels. For this reason, most studies measure genes' translation levels as log ratios (translation levels divided by corresponding cytosolic mRNA levels obtained in parallel). Counterintuitively, arising from a mathematical necessity, these log ratios tend to be highly correlated with the cytosolic mRNA levels. Accordingly, they do not effectively correct for cytosolic mRNA level and generate substantial numbers of biological false positives and false negatives. We show that analysis of partial variance, which produces estimates of translational activity that are independent of cytosolic mRNA levels, is a superior alternative. When combined with a variance shrinkage method for estimating error variance, analysis of partial variance has the additional benefit of having greater statistical power and identifying fewer genes as translationally regulated resulting merely from unrealistically low variance estimates rather than from large changes in translational activity. In contrast to log ratios, this formal analytical approach estimates translation effects in a statistically rigorous manner, eliminates the need for inefficient and error-prone heuristics, and produces results that agree with biological function. The method is applicable to datasets obtained from both the commonly used polysome microarray method and the sequencing-based ribosome profiling method.

  9. Genome-Wide Specific Selection in Three Domestic Sheep Breeds

    PubMed Central

    Cao, Jiaxve; Wu, Mingming; Ma, Xiaomeng; Liu, Zhen; Liu, Ruizao; Zhao, Fuping; Wei, Caihong; Du, Lixin

    2015-01-01

    Background Commercial sheep raised for mutton grow faster than traditional Chinese sheep breeds. Here, we aimed to evaluate genetic selection among three different types of sheep breed: two well-known commercial mutton breeds and one indigenous Chinese breed. Results We first combined locus-specific branch lengths and di statistical methods to detect candidate regions targeted by selection in the three different populations. The results showed that the genetic distances reached at least medium divergence for each pairwise combination. We found these two methods were highly correlated, and identified many growth-related candidate genes undergoing artificial selection. For production traits, APOBR and FTO are associated with body mass index. For meat traits, ALDOA, STK32B and FAM190A are related to marbling. For reproduction traits, CCNB2 and SLC8A3 affect oocyte development. We also found two well-known genes, GHR (which affects meat production and quality) and EDAR (associated with hair thickness) were associated with German mutton merino sheep. Furthermore, four genes (POL, RPL7, MSL1 and SHISA9) were associated with pre-weaning gain in our previous genome-wide association study. Conclusions Our results indicated that combine locus-specific branch lengths and di statistical approaches can reduce the searching ranges for specific selection. And we got many credible candidate genes which not only confirm the results of previous reports, but also provide a suite of novel candidate genes in defined breeds to guide hybridization breeding. PMID:26083354

  10. Genome-Wide Identification of KANADI1 Target Genes

    PubMed Central

    Ott, Felix; Weigel, Detlef; Bowman, John L.; Heisler, Marcus G.; Wenkel, Stephan

    2013-01-01

    Plant organ development and polarity establishment is mediated by the action of several transcription factors. Among these, the KANADI (KAN) subclade of the GARP protein family plays important roles in polarity-associated processes during embryo, shoot and root patterning. In this study, we have identified a set of potential direct target genes of KAN1 through a combination of chromatin immunoprecipitation/DNA sequencing (ChIP-Seq) and genome-wide transcriptional profiling using tiling arrays. Target genes are over-represented for genes involved in the regulation of organ development as well as in the response to auxin. KAN1 affects directly the expression of several genes previously shown to be important in the establishment of polarity during lateral organ and vascular tissue development. We also show that KAN1 controls through its target genes auxin effects on organ development at different levels: transport and its regulation, and signaling. In addition, KAN1 regulates genes involved in the response to abscisic acid, jasmonic acid, brassinosteroids, ethylene, cytokinins and gibberellins. The role of KAN1 in organ polarity is antagonized by HD-ZIPIII transcription factors, including REVOLUTA (REV). A comparison of their target genes reveals that the REV/KAN1 module acts in organ patterning through opposite regulation of shared targets. Evidence of mutual repression between closely related family members is also shown. PMID:24155946

  11. Genome-wide association study of aggressive behaviour in chicken

    PubMed Central

    Li, Zhenhui; Zheng, Ming; Abdalla, Bahareldin Ali; Zhang, Zhe; Xu, Zhenqiang; Ye, Qiao; Xu, Haiping; Luo, Wei; Nie, Qinghua; Zhang, Xiquan

    2016-01-01

    In the poultry industry, aggressive behaviour is a large animal welfare issue all over the world. To date, little is known about the underlying genetics of the aggressive behaviour. Here, we performed a genome-wide association study (GWAS) to explore the genetic mechanism associated with aggressive behaviour in chickens. The GWAS results showed that a total of 33 SNPs were associated with aggressive behaviour traits (P < 4.6E-6). rs312463697 on chromosome 4 was significantly associated with aggression (P = 2.10905E-07), and it was in the intron region of the sortilin-related VPS10 domain containing receptor 2 (SORCS2) gene. In addition, biological function analysis of the nearest 26 genes around the significant SNPs was performed with Ingenuity Pathway Analysis. An interaction network contained 17 genes was obtained and SORCS2 was involved in this network, interacted with nerve growth factor (NGF), nerve growth factor receptor (NGFR), dopa decarboxylase (L-dopa) and dopamine. After knockdown of SORCS2, the mRNA levels of NGF, L-dopa and dopamine receptor genes DRD1, DRD2, DRD3 and DRD4 were significantly decreased (P < 0.05). In summary, our data indicated that SORCS2 might play an important role in chicken aggressive behaviour through the regulation of dopaminergic pathways and NGF. PMID:27485826

  12. Weighted SNP set analysis in genome-wide association study.

    PubMed

    Dai, Hui; Zhao, Yang; Qian, Cheng; Cai, Min; Zhang, Ruyang; Chu, Minjie; Dai, Juncheng; Hu, Zhibin; Shen, Hongbing; Chen, Feng

    2013-01-01

    Genome-wide association studies (GWAS) are popular for identifying genetic variants which are associated with disease risk. Many approaches have been proposed to test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously which considering disadvantages of methods in single locus association analysis. Kernel machine based SNP set analysis is more powerful than single locus analysis, which borrows information from SNPs correlated with causal or tag SNPs. Four types of kernel machine functions and principal component based approach (PCA) were also compared. However, given the loss of power caused by low minor allele frequencies (MAF), we conducted an extension work on PCA and used a new method called weighted PCA (wPCA). Comparative analysis was performed for weighted principal component analysis (wPCA), logistic kernel machine based test (LKM) and principal component analysis (PCA) based on SNP set in the case of different minor allele frequencies (MAF) and linkage disequilibrium (LD) structures. We also applied the three methods to analyze two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, weighted principal component and weighted IBS are more powerful than PCA and other kernel machine functions at different LD structures and different numbers of causal SNPs. Application of the three methods to a real GWAS dataset indicates that wPCA and wIBS have better performance than the linear kernel, IBS kernel and PCA.

  13. Genome-Wide Association Mapping for Phenotypic Plasticity in Rice.

    PubMed

    Kikuchi, Shinji; Bheemanahalli, Raju; Jagadish, Krishna S V; Kumagai, Etsushi; Masuya, Yusuke; Kuroda, Eiki; Raghavan, Chitra; Dingkuhn, Michael; Abe, Akira; Shimono, Hiroyuki

    2017-03-31

    Phenotypic plasticity of plants in response to environmental changes is important for adapting to changing climate. Less attention has been paid to exploring the advantages of phenotypic plasticity in resource-rich environments to enhance the productivity of agricultural crops. Here, we examined genetic variation in phenotypic plasticity in indica rice (Oryza sativa L.) across two diverse panels: (i) a Phenomics of Rice Adaptation and Yield (PRAY) population comprising 301 accessions and (ii) a Multi-parent-Advanced-Generation-Inter-Cross (MAGIC) indica population comprising 151 accessions. Altered planting density was used as a proxy for elevated atmospheric CO2 response. Low planting density significantly increased panicle weight per plant compared with normal density, and the magnitude of the increase ranged from 1.10 to 2.78 times among accessions for the PRAY population and from 1.05 to 2.45 times for the MAGIC population. Genome-wide-association studies revealed three Environmental Responsiveness (ER) candidate alleles (qER1-3) that were associated with relative response of panicle weight to low density. Two of these alleles were tested in 13 genotypes to clarify their biomass responses during vegetative growth under elevated CO2 in Japan. Our study provides evidence for polymorphisms that control rice phenotypic plasticity in environments that are rich in resources such as light and CO2 .

  14. A genome-wide association study in multiple system atrophy

    PubMed Central

    Sailer, Anna; Nalls, Michael A.; Schulte, Claudia; Federoff, Monica; Price, T. Ryan; Lees, Andrew; Ross, Owen A.; Dickson, Dennis W.; Mok, Kin; Mencacci, Niccolo E.; Schottlaender, Lucia; Chelban, Viorica; Ling, Helen; O'Sullivan, Sean S.; Wood, Nicholas W.; Traynor, Bryan J.; Ferrucci, Luigi; Federoff, Howard J.; Mhyre, Timothy R.; Morris, Huw R.; Deuschl, Günther; Quinn, Niall; Widner, Hakan; Albanese, Alberto; Infante, Jon; Bhatia, Kailash P.; Poewe, Werner; Oertel, Wolfgang; Höglinger, Günter U.; Wüllner, Ullrich; Goldwurm, Stefano; Pellecchia, Maria Teresa; Ferreira, Joaquim; Tolosa, Eduardo; Bloem, Bastiaan R.; Rascol, Olivier; Meissner, Wassilios G.; Hardy, John A.; Revesz, Tamas; Holton, Janice L.; Gasser, Thomas; Wenning, Gregor K.; Singleton, Andrew B.

    2016-01-01

    Objective: To identify genetic variants that play a role in the pathogenesis of multiple system atrophy (MSA), we undertook a genome-wide association study (GWAS). Methods: We performed a GWAS with >5 million genotyped and imputed single nucleotide polymorphisms (SNPs) in 918 patients with MSA of European ancestry and 3,864 controls. MSA cases were collected from North American and European centers, one third of which were neuropathologically confirmed. Results: We found no significant loci after stringent multiple testing correction. A number of regions emerged as potentially interesting for follow-up at p < 1 × 10−6, including SNPs in the genes FBXO47, ELOVL7, EDN1, and MAPT. Contrary to previous reports, we found no association of the genes SNCA and COQ2 with MSA. Conclusions: We present a GWAS in MSA. We have identified several potentially interesting gene loci, including the MAPT locus, whose significance will have to be evaluated in a larger sample set. Common genetic variation in SNCA and COQ2 does not seem to be associated with MSA. In the future, additional samples of well-characterized patients with MSA will need to be collected to perform a larger MSA GWAS, but this initial study forms the basis for these next steps. PMID:27629089

  15. Genome-wide transcriptome analysis of human epidermal melanocytes

    PubMed Central

    Haltaufderhyde, Kirk D.; Oancea, Elena

    2015-01-01

    Because human epidermal melanocytes (HEMs) provide critical protection against skin cancer, sunburn, and photoaging, a genome-wide perspective of gene expression in these cells is vital to understanding human skin physiology. In this study we performed high throughput sequencing of HEMs to obtain a complete data set of transcript sizes, abundances, and splicing. As expected, we found that melanocyte specific genes that function in pigmentation were among the highest expressed genes. We analyzed receptor, ion channel and transcription factor gene families to get a better understanding of the cell signalling pathways used by melanocytes. We also performed a comparative transcriptomic analysis of lightly versus darkly pigmented HEMs and found 16 genes differentially expressed in the two pigmentation phenotypes; of those, only one putative melanosomal transporter (SLC45A2) has known function in pigmentation. In addition, we found 166 genes with splice isoforms expressed exclusively in one pigmentation phenotype, 17 of which are genes involved in signal transduction. Our melanocyte transcriptome study provides a comprehensive view and may help identify novel pigmentation genes and potential pharmacological targets. PMID:25451175

  16. Genome-Wide Analysis of DNA Methylation in Human Amnion

    PubMed Central

    Kim, Jinsil; Pitlick, Mitchell M.; Christine, Paul J.; Schaefer, Amanda R.; Saleme, Cesar; Comas, Belén; Cosentino, Viviana; Gadow, Enrique; Murray, Jeffrey C.

    2013-01-01

    The amnion is a specialized tissue in contact with the amniotic fluid, which is in a constantly changing state. To investigate the importance of epigenetic events in this tissue in the physiology and pathophysiology of pregnancy, we performed genome-wide DNA methylation profiling of human amnion from term (with and without labor) and preterm deliveries. Using the Illumina Infinium HumanMethylation27 BeadChip, we identified genes exhibiting differential methylation associated with normal labor and preterm birth. Functional analysis of the differentially methylated genes revealed biologically relevant enriched gene sets. Bisulfite sequencing analysis of the promoter region of the oxytocin receptor (OXTR) gene detected two CpG dinucleotides showing significant methylation differences among the three groups of samples. Hypermethylation of the CpG island of the solute carrier family 30 member 3 (SLC30A3) gene in preterm amnion was confirmed by methylation-specific PCR. This work provides preliminary evidence that DNA methylation changes in the amnion may be at least partially involved in the physiological process of labor and the etiology of preterm birth and suggests that DNA methylation profiles, in combination with other biological data, may provide valuable insight into the mechanisms underlying normal and pathological pregnancies. PMID:23533356

  17. Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome.

    PubMed

    Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed M Vargas; Parker, Brian J; Rasmussen, Morten; Lindgreen, Stinus; Lilje, Berit; Tobin, Desmond J; Kelly, Theresa K; Vang, Søren; Andersson, Robin; Jones, Peter A; Hoover, Cindi A; Tikhonov, Alexei; Prokhortchouk, Egor; Rubin, Edward M; Sandelin, Albin; Gilbert, M Thomas P; Krogh, Anders; Willerslev, Eske; Orlando, Ludovic

    2014-03-01

    Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics.

  18. Genome-wide nucleosome map and cytosine methylation levels of an ancient human genome

    PubMed Central

    Pedersen, Jakob Skou; Valen, Eivind; Velazquez, Amhed M. Vargas; Parker, Brian J.; Rasmussen, Morten; Lindgreen, Stinus; Lilje, Berit; Tobin, Desmond J.; Kelly, Theresa K.; Vang, Søren; Andersson, Robin; Jones, Peter A.; Hoover, Cindi A.; Tikhonov, Alexei; Prokhortchouk, Egor; Rubin, Edward M.; Sandelin, Albin; Gilbert, M. Thomas P.; Krogh, Anders; Willerslev, Eske; Orlando, Ludovic

    2014-01-01

    Epigenetic information is available from contemporary organisms, but is difficult to track back in evolutionary time. Here, we show that genome-wide epigenetic information can be gathered directly from next-generation sequence reads of DNA isolated from ancient remains. Using the genome sequence data generated from hair shafts of a 4000-yr-old Paleo-Eskimo belonging to the Saqqaq culture, we generate the first ancient nucleosome map coupled with a genome-wide survey of cytosine methylation levels. The validity of both nucleosome map and methylation levels were confirmed by the recovery of the expected signals at promoter regions, exon/intron boundaries, and CTCF sites. The top-scoring nucleosome calls revealed distinct DNA positioning biases, attesting to nucleotide-level accuracy. The ancient methylation levels exhibited high conservation over time, clustering closely with modern hair tissues. Using ancient methylation information, we estimated the age at death of the Saqqaq individual and illustrate how epigenetic information can be used to infer ancient gene expression. Similar epigenetic signatures were found in other fossil material, such as 110,000- to 130,000-yr-old bones, supporting the contention that ancient epigenomic information can be reconstructed from a deep past. Our findings lay the foundation for extracting epigenomic information from ancient samples, allowing shifts in epialleles to be tracked through evolutionary time, as well as providing an original window into modern epigenomics. PMID:24299735

  19. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...

  20. Genome scan of M. tuberculosis infection and disease in Ugandans.

    PubMed

    Stein, Catherine M; Zalwango, Sarah; Malone, LaShaunda L; Won, Sungho; Mayanja-Kizza, Harriet; Mugerwa, Roy D; Leontiev, Dmitry V; Thompson, Cheryl L; Cartier, Kevin C; Elston, Robert C; Iyengar, Sudha K; Boom, W Henry; Whalen, Christopher C

    2008-01-01

    Tuberculosis (TB), caused by Mycobacterium tuberculosis (Mtb), is an enduring public health problem globally, particularly in sub-Saharan Africa. Several studies have suggested a role for host genetic susceptibility in increased risk for TB but results across studies have been equivocal. As part of a household contact study of Mtb infection and disease in Kampala, Uganda, we have taken a unique approach to the study of genetic susceptibility to TB, by studying three phenotypes. First, we analyzed culture confirmed TB disease compared to latent Mtb infection (LTBI) or lack of Mtb infection. Second, we analyzed resistance to Mtb infection in the face of continuous exposure, defined by a persistently negative tuberculin skin test (PTST-); this outcome was contrasted to LTBI. Third, we analyzed an intermediate phenotype, tumor necrosis factor-alpha (TNFalpha) expression in response to soluble Mtb ligands enriched with molecules secreted from Mtb (culture filtrate). We conducted a full microsatellite genome scan, using genotypes generated by the Center for Medical Genetics at Marshfield. Multipoint model-free linkage analysis was conducted using an extension of the Haseman-Elston regression model that includes half sibling pairs, and HIV status was included as a covariate in the model. The analysis included 803 individuals from 193 pedigrees, comprising 258 full sibling pairs and 175 half sibling pairs. Suggestive linkage (p<10(-3)) was observed on chromosomes 2q21-2q24 and 5p13-5q22 for PTST-, and on chromosome 7p22-7p21 for TB; these findings for PTST- are novel and the chromosome 7 region contains the IL6 gene. In addition, we replicated recent linkage findings on chromosome 20q13 for TB (p = 0.002). We also observed linkage at the nominal alpha = 0.05 threshold to a number of promising candidate genes, SLC11A1 (PTST- p = 0.02), IL-1 complex (TB p = 0.01), IL12BR2 (TNFalpha p = 0.006), IL12A (TB p = 0.02) and IFNGR2 (TNFalpha p = 0.002). These results confirm not

  1. Genome Scan of M. tuberculosis Infection and Disease in Ugandans

    PubMed Central

    Stein, Catherine M.; Zalwango, Sarah; Malone, LaShaunda L.; Won, Sungho; Mayanja-Kizza, Harriet; Mugerwa, Roy D.; Leontiev, Dmitry V.; Thompson, Cheryl L.; Cartier, Kevin C.; Elston, Robert C.; Iyengar, Sudha K.; Boom, W. Henry; Whalen, Christopher C.

    2008-01-01

    Tuberculosis (TB), caused by Mycobacterium tuberculosis (Mtb), is an enduring public health problem globally, particularly in sub-Saharan Africa. Several studies have suggested a role for host genetic susceptibility in increased risk for TB but results across studies have been equivocal. As part of a household contact study of Mtb infection and disease in Kampala, Uganda, we have taken a unique approach to the study of genetic susceptibility to TB, by studying three phenotypes. First, we analyzed culture confirmed TB disease compared to latent Mtb infection (LTBI) or lack of Mtb infection. Second, we analyzed resistance to Mtb infection in the face of continuous exposure, defined by a persistently negative tuberculin skin test (PTST-); this outcome was contrasted to LTBI. Third, we analyzed an intermediate phenotype, tumor necrosis factor-alpha (TNFα) expression in response to soluble Mtb ligands enriched with molecules secreted from Mtb (culture filtrate). We conducted a full microsatellite genome scan, using genotypes generated by the Center for Medical Genetics at Marshfield. Multipoint model-free linkage analysis was conducted using an extension of the Haseman-Elston regression model that includes half sibling pairs, and HIV status was included as a covariate in the model. The analysis included 803 individuals from 193 pedigrees, comprising 258 full sibling pairs and 175 half sibling pairs. Suggestive linkage (p<10−3) was observed on chromosomes 2q21-2q24 and 5p13-5q22 for PTST-, and on chromosome 7p22-7p21 for TB; these findings for PTST- are novel and the chromosome 7 region contains the IL6 gene. In addition, we replicated recent linkage findings on chromosome 20q13 for TB (p = 0.002). We also observed linkage at the nominal α = 0.05 threshold to a number of promising candidate genes, SLC11A1 (PTST- p = 0.02), IL-1 complex (TB p = 0.01), IL12BR2 (TNFα p = 0.006), IL12A (TB p = 0.02) and IFNGR2 (TNFα p = 0.002). These results

  2. Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence

    Technology Transfer Automated Retrieval System (TEKTRAN)

    An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions fr...

  3. A genome scanning approach to assess the genetic effects of radiation in mice and humans.

    PubMed

    Asakawa, Jun-ichi; Kuick, Rork; Kodaira, Mieko; Nakamura, Nori; Katayama, Hiroaki; Pierce, Donald; Funamoto, Sachiyo; Preston, Dale; Satoh, Chiyoko; Neel, James V; Hanash, Samir

    2004-04-01

    We used Restriction Landmark Genome Scanning (RLGS) to assess, on a genome-wide basis, the mutation induction rate in mouse germ cells after radiation exposure. Analyses of 1,115 autosomal NotI DNA fragments per mouse for reduced spot intensity, indicative of loss of one copy, in 506 progeny derived from X-irradiated spermatogonia (190, 237 and 79 mice in 0-, 3-, and 5-Gy groups, respectively), permitted us to identify 16 mutations affecting 23 fragments in 20 mice. The 16 mutations were composed of eight small changes (1-9 bp) at microsatellite sequences, five large deletions (more than 25 kb), and three insertions of SINE B2 or LINE1 transposable elements. The maximum induction rate of deletion mutations was estimated as (0.17 +/- 0.09) x 10(-5)/locus Gy(-1). The estimate is considerably lower than 1 x 10(-5)/locus Gy(-1), the mean induction rate of deletion mutations at Russell's 7 loci, which assumed that deletion mutations comprise 50% of all mutations. We interpret the results as indicating that the mean induction rate of mutations in the whole genome may be substantially lower than that at the 7 loci. We also demonstrate the applicability of RLGS for detection of human mutations, which allows direct comparisons between the two species.

  4. Phenotype prediction based on genome-wide DNA methylation data

    PubMed Central

    2014-01-01

    Background DNA methylation (DNAm) has important regulatory roles in many biological processes and diseases. It is the only epigenetic mark with a clear mechanism of mitotic inheritance and the only one easily available on a genome scale. Aberrant cytosine-phosphate-guanine (CpG) methylation has been discussed in the context of disease aetiology, especially cancer. CpG hypermethylation of promoter regions is often associated with silencing of tumour suppressor genes and hypomethylation with activation of oncogenes. Supervised principal component analysis (SPCA) is a popular machine learning method. However, in a recent application to phenotype prediction from DNAm data SPCA was inferior to the specific method EVORA. Results We present Model-Selection-SPCA (MS-SPCA), an enhanced version of SPCA. MS-SPCA applies several models that perform well in the training data to the test data and selects the very best models for final prediction based on parameters of the test data. We have applied MS-SPCA for phenotype prediction from genome-wide DNAm data. CpGs used for prediction are selected based on the quantification of three features of their methylation (average methylation difference, methylation variation difference and methylation-age-correlation). We analysed four independent case–control datasets that correspond to different stages of cervical cancer: (i) cases currently cytologically normal, but will later develop neoplastic transformations, (ii, iii) cases showing neoplastic transformations and (iv) cases with confirmed cancer. The first dataset was split into several smaller case–control datasets (samples either Human Papilloma Virus (HPV) positive or negative). We demonstrate that cytology normal HPV+ and HPV- samples contain DNAm patterns which are associated with later neoplastic transformations. We present evidence that DNAm patterns exist in cytology normal HPV- samples that (i) predispose to neoplastic transformations after HPV infection and (ii

  5. Genome-Wide Analysis in Brazilians Reveals Highly Differentiated Native American Genome Regions.

    PubMed

    Mychaleckyj, Josyf C; Havt, Alexandre; Nayak, Uma; Pinkerton, Relana; Farber, Emily; Concannon, Patrick; Lima, Aldo A; Guerrant, Richard L

    2017-03-01

    Despite its population, geographic size, and emerging economic importance, disproportionately little genome-scale research exists into genetic factors that predispose Brazilians to disease, or the population genetics of risk. After identification of suitable proxy populations and careful analysis of tri-continental admixture in 1,538 North-Eastern Brazilians to estimate individual ancestry and ancestral allele frequencies, we computed 400,000 genome-wide locus-specific branch length (LSBL) Fst statistics of Brazilian Amerindian ancestry compared to European and African; and a similar set of differentiation statistics for their Amerindian component compared with the closest Asian 1000 Genomes population (surprisingly, Bengalis in Bangladesh). After ranking SNPs by these statistics, we identified the top 10 highly differentiated SNPs in five genome regions in the LSBL tests of Brazilian Amerindian ancestry compared to European and African; and the top 10 SNPs in eight regions comparing their Amerindian component to the closest Asian 1000 Genomes population. We found SNPs within or proximal to the genes CIITA (rs6498115), SMC6 (rs1834619), and KLHL29 (rs2288697) were most differentiated in the Amerindian-specific branch, while SNPs in the genes ADAMTS9 (rs7631391), DOCK2 (rs77594147), SLC28A1 (rs28649017), ARHGAP5 (rs7151991), and CIITA (rs45601437) were most highly differentiated in the Asian comparison. These genes are known to influence immune function, metabolic and anthropometry traits, and embryonic development. These analyses have identified candidate genes for selection within Amerindian ancestry, and by comparison of the two analyses, those for which the differentiation may have arisen during the migration from Asia to the Americas.

  6. A genome scan conducted in a multigenerational pedigree with convergent strabismus supports a complex genetic determinism.

    PubMed

    Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel

    2013-01-01

    A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree.

  7. Genome-Wide Association Mapping for Intelligence in Military Working Dogs: Development of Advanced Classification Algorithm for Genome-Wide Single Nucleotide Polymorphism (SNP) Data Analysis

    DTIC Science & Technology

    2011-04-01

    distribution unlimited. QC – quality control QTL – quantitative trait loci SNP – single nucleotide polymorphism TE – Tris + EDTA TBE – Tris + Boric Acid + EDTA WGSA – whole genome sampling assay ...canine intelligence testing protocol EDTA – ethylenediaminetetraacetic acid GWAS – genome-wide association study LD – linkage disequilibrium MWD

  8. Genome-Wide Mapping of Loci Explaining Variance in Scrotal Circumference in Nellore Cattle

    PubMed Central

    Utsunomiya, Yuri T.; Carmo, Adriana S.; Neves, Haroldo H. R.; Carvalheiro, Roberto; Matos, Márcia C.; Zavarez, Ludmilla B.; Ito, Pier K. R. K.; Pérez O'Brien, Ana M.; Sölkner, Johann; Porto-Neto, Laercio R.; Schenkel, Flávio S.; McEwan, John; Cole, John B.; da Silva, Marcos V. G. B.; Van Tassell, Curtis P.; Sonstegard, Tad S.; Garcia, José Fernando

    2014-01-01

    The reproductive performance of bulls has a high impact on the beef cattle industry. Scrotal circumference (SC) is the most recorded reproductive trait in beef herds, and is used as a major selection criterion to improve precocity and fertility. The characterization of genomic regions affecting SC can contribute to the identification of diagnostic markers for reproductive performance and uncover molecular mechanisms underlying complex aspects of bovine reproductive biology. In this paper, we report a genome-wide scan for chromosome segments explaining differences in SC, using data of 861 Nellore bulls (Bos indicus) genotyped for over 777,000 single nucleotide polymorphisms. Loci that excel from the genome background were identified on chromosomes 4, 6, 7, 10, 14, 18 and 21. The majority of these regions were previously found to be associated with reproductive and body size traits in cattle. The signal on chromosome 14 replicates the pleiotropic quantitative trait locus encompassing PLAG1 that affects male fertility in cattle and stature in several species. Based on intensive literature mining, SP4, MAGEL2, SH3RF2, PDE5A and SNAI2 are proposed as novel candidate genes for SC, as they affect growth and testicular size in other animal models. These findings contribute to linking reproductive phenotypes to gene functions, and may offer new insights on the molecular biology of male fertility. PMID:24558400

  9. Big data challenges in bone research: genome-wide association studies and next-generation sequencing

    PubMed Central

    Alonso, Nerea; Lucas, Gavin; Hysi, Pirro

    2015-01-01

    Genome-wide association studies (GWAS) have been developed as a practical method to identify genetic loci associated with disease by scanning multiple markers across the genome. Significant advances in the genetics of complex diseases have been made owing to advances in genotyping technologies, the progress of projects such as HapMap and 1000G and the emergence of genetics as a collaborative discipline. Because of its great potential to be used in parallel by multiple collaborators, it is important to adhere to strict protocols assuring data quality and analyses. Quality control analyses must be applied to each sample and each single-nucleotide polymorphism (SNP). The software package PLINK is capable of performing the whole range of necessary quality control tests. Genotype imputation has also been developed to substantially increase the power of GWAS methodology. Imputation permits the investigation of associations at genetic markers that are not directly genotyped. Results of individual GWAS reports can be combined through meta-analysis. Finally, next-generation sequencing (NGS) has gained popularity in recent years through its capacity to analyse a much greater number of markers across the genome. Although NGS platforms are capable of examining a higher number of SNPs compared with GWA studies, the results obtained by NGS require careful interpretation, as their biological correlation is incompletely understood. In this article, we will discuss the basic features of such protocols. PMID:25709812

  10. Multicentric Genome-Wide Association Study for Primary Spontaneous Pneumothorax

    PubMed Central

    Abrantes, Patrícia; Francisco, Vânia; Teixeira, Gilberto; Monteiro, Marta; Neves, João; Norte, Ana; Robalo Cordeiro, Carlos; Moura e Sá, João; Reis, Ernestina; Santos, Patrícia; Oliveira, Manuela; Sousa, Susana; Fradinho, Marta; Malheiro, Filipa; Negrão, Luís

    2016-01-01

    Despite elevated incidence and recurrence rates for Primary Spontaneous Pneumothorax (PSP), little is known about its etiology, and the genetics of idiopathic PSP remains unexplored. To identify genetic variants contributing to sporadic PSP risk, we conducted the first PSP genome-wide association study. Two replicate pools of 92 Portuguese PSP cases and of 129 age- and sex-matched controls were allelotyped in triplicate on the Affymetrix Human SNP Array 6.0 arrays. Markers passing quality control were ranked by relative allele score difference between cases and controls (|RASdiff|), by a novel cluster method and by a combined Z-test. 101 single nucleotide polymorphisms (SNPs) were selected using these three approaches for technical validation by individual genotyping in the discovery dataset. 87 out of 94 successfully tested SNPs were nominally associated in the discovery dataset. Replication of the 87 technically validated SNPs was then carried out in an independent replication dataset of 100 Portuguese cases and 425 controls. The intergenic rs4733649 SNP in chromosome 8 (between LINC00824 and LINC00977) was associated with PSP in the discovery (P = 4.07E-03, ORC[95% CI] = 1.88[1.22–2.89]), replication (P = 1.50E-02, ORC[95% CI] = 1.50[1.08–2.09]) and combined datasets (P = 8.61E-05, ORC[95% CI] = 1.65[1.29–2.13]). This study identified for the first time one genetic risk factor for sporadic PSP, but future studies are warranted to further confirm this finding in other populations and uncover its functional role in PSP pathogenesis. PMID:27203581

  11. Genome-Wide Association Study of Schizophrenia in Japanese Population

    PubMed Central

    Yamada, Kazuo; Iwayama, Yoshimi; Hattori, Eiji; Iwamoto, Kazuya; Toyota, Tomoko; Ohnishi, Tetsuo; Ohba, Hisako; Maekawa, Motoko; Kato, Tadafumi; Yoshikawa, Takeo

    2011-01-01

    Schizophrenia is a devastating neuropsychiatric disorder with genetically complex traits. Genetic variants should explain a considerable portion of the risk for schizophrenia, and genome-wide association study (GWAS) is a potentially powerful tool for identifying the risk variants that underlie the disease. Here, we report the results of a three-stage analysis of three independent cohorts consisting of a total of 2,535 samples from Japanese and Chinese populations for searching schizophrenia susceptibility genes using a GWAS approach. Firstly, we examined 115,770 single nucleotide polymorphisms (SNPs) in 120 patient-parents trio samples from Japanese schizophrenia pedigrees. In stage II, we evaluated 1,632 SNPs (1,159 SNPs of p<0.01 and 473 SNPs of p<0.05 that located in previously reported linkage regions). The second sample consisted of 1,012 case-control samples of Japanese origin. The most significant p value was obtained for the SNP in the ELAVL2 [(embryonic lethal, abnormal vision, Drosophila)-like 2] gene located on 9p21.3 (p = 0.00087). In stage III, we scrutinized the ELAVL2 gene by genotyping gene-centric tagSNPs in the third sample set of 293 family samples (1,163 individuals) of Chinese descent and the SNP in the gene showed a nominal association with schizophrenia in Chinese population (p = 0.026). The current data in Asian population would be helpful for deciphering ethnic diversity of schizophrenia etiology. PMID:21674006

  12. Genome-Wide Association Studies of Multiple Keratinocyte Cancers

    PubMed Central

    Verkouteren, Joris A. C.; Hofman, Albert; Uitterlinden, André G.; Kraft, Peter; Turman, Constance; Han, Jiali; Cho, Eunyoung; Murabito, Joanne M.; Levy, Daniel; Qureshi, Abrar A.; Nijsten, Tamar

    2017-01-01

    There is strong evidence for a role of environmental risk factors involved in susceptibility to develop multiple keratinocyte cancers (mKCs), but whether genes are also involved in mKCs susceptibility has not been thoroughly investigated. We investigated whether single nucleotide polymorphisms (SNPs) are associated with susceptibility for mKCs. A genome-wide association study (GWAS) of 1,666 cases with mKCs and 1,950 cases with single KC (sKCs; controls) from Harvard cohorts (the Nurses' Health Study [NHS], NHS II, and the Health Professionals Follow-Up Study) and the Framingham Heart Study was carried-out using over 8 million SNPs (stage-1). We sought to replicate the most significant statistical associations (p-value≤ 5.5x10-6) in an independent cohort of 574 mKCs and 872 sKCs from the Rotterdam Study. In the discovery stage, 40 SNPs with suggestive associations (p-value ≤5.5x10-6) were identified, with eight independent SNPs tagging all 40 SNPs. The most significant SNP was located at chromosome 9 (rs7468390; p-value = 3.92x10-7). In stage-2, none of these SNPs replicated and only two of them were associated with mKCs in the same direction in the combined meta-analysis. We tested the associations for 19 previously reported basal cell carcinoma-related SNPs (candidate gene association analysis), and found that rs1805007 (MC1R locus) was significantly associated with risk of mKCs (p-value = 2.80x10-4). Although the suggestive SNPs with susceptibility for mKCs were not replicated, we found that previously identified BCC variants may also be associated with mKC, which the most significant association (rs1805007) located at the MC1R gene. PMID:28081215

  13. Genome-wide association study of sleep in Drosophila melanogaster

    PubMed Central

    2013-01-01

    Background Sleep is a highly conserved behavior, yet its duration and pattern vary extensively among species and between individuals within species. The genetic basis of natural variation in sleep remains unknown. Results We used the Drosophila Genetic Reference Panel (DGRP) to perform a genome-wide association (GWA) study of sleep in D. melanogaster. We identified candidate single nucleotide polymorphisms (SNPs) associated with differences in the mean as well as the environmental sensitivity of sleep traits; these SNPs typically had sex-specific or sex-biased effects, and were generally located in non-coding regions. The majority of SNPs (80.3%) affecting sleep were at low frequency and had moderately large effects. Additive models incorporating multiple SNPs explained as much as 55% of the genetic variance for sleep in males and females. Many of these loci are known to interact physically and/or genetically, enabling us to place them in candidate genetic networks. We confirmed the role of seven novel loci on sleep using insertional mutagenesis and RNA interference. Conclusions We identified many SNPs in novel loci that are potentially associated with natural variation in sleep, as well as SNPs within genes previously known to affect Drosophila sleep. Several of the candidate genes have human homologues that were identified in studies of human sleep, suggesting that genes affecting variation in sleep are conserved across species. Our discovery of genetic variants that influence environmental sensitivity to sleep may have a wider application to all GWA studies, because individuals with highly plastic genotypes will not have consistent phenotypes. PMID:23617951

  14. Genome-wide SNP typing reveals signatures of population history.

    PubMed

    Hughes, Austin L; Welch, Robert; Puri, Vinita; Matthews, Casey; Haque, Kashif; Chanock, Stephen J; Yeager, Meredith

    2008-07-01

    Single-nucleotide polymorphism (SNP) arrays have become a popular technology for disease-association studies, but they also have potential for studying the genetic differentiation of human populations. Application of the Affymetrix GeneChip Human Mapping 500K Array Set to a population of 102 individuals representing the major ethnic groups in the United States (African, Asian, European, and Hispanic) revealed patterns of gene diversity and genetic distance that reflected population history. We analyzed allelic frequencies at 388,654 autosomal SNP sites that showed some variation in our study population and 10% or fewer missing values. Despite the small size (23-31 individuals) of each subpopulation, there were no fixed differences at any site between any two subpopulations. As expected from the African origin of modern humans, greater gene diversity was seen in Africans than in either Asians or Europeans, and the genetic distance between the Asian and the European populations was significantly lower than that between either of these two populations and Africans. Principal components analysis applied to a correlation matrix among individuals was able to separate completely the major continental groups of humans (Africans, Asians, and Europeans), while Hispanics overlapped all three of these groups. Genes containing two or more markers with extraordinarily high genetic distance between subpopulations were identified as candidate genes for health differences between subpopulations. The results show that, even with modest sample sizes, genome-wide SNP genotyping technologies have great promise for capturing signatures of gene frequency difference between human subpopulations, with applications in areas as diverse as forensics and the study of ethnic health disparities.

  15. Assessing statistical significance in multivariable genome wide association analysis

    PubMed Central

    Buzdugan, Laura; Kalisch, Markus; Navarro, Arcadi; Schunk, Daniel; Fehr, Ernst; Bühlmann, Peter

    2016-01-01

    Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whether or not a SNP carries any additional information about the phenotype beyond that available by all the other SNPs. This rules out spurious correlations between phenotypes and SNPs that can arise from marginal methods because the ‘spuriously correlated’ SNP merely happens to be correlated with the ‘truly causal’ SNP. In addition, the method offers a data driven approach to identifying and refining groups of SNPs that jointly contain informative signals about the phenotype. We demonstrate the value of our method by applying it to the seven diseases analyzed by the Wellcome Trust Case Control Consortium (WTCCC). We show, in particular, that our method is also capable of finding significant SNPs that were not identified in the original WTCCC study, but were replicated in other independent studies. Availability and implementation: Reproducibility of our research is supported by the open-source Bioconductor package hierGWAS. Contact: peter.buehlmann@stat.math.ethz.ch Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153677

  16. Technical note: Computing strategies in genome-wide selection.

    PubMed

    Legarra, A; Misztal, I

    2008-01-01

    Genome-wide genetic evaluation might involve the computation of BLUP-like estimations, potentially including thousands of covariates (i.e., single-nucleotide polymorphism markers) for each record. This implies dense Henderson's mixed-model equations and considerable computing resources in time and storage, even for a few thousand records. Possible computing options include the type of storage and the solving algorithm. This work evaluated several computing options, including half-stored Cholesky decomposition, Gauss-Seidel, and 3 matrix-free strategies: Gauss-Seidel, Gauss-Seidel with residuals update, and preconditioned conjugate gradients. Matrix-free Gauss-Seidel with residuals update adjusts the residuals after computing the solution for each effect. This avoids adjusting the left-hand side of the equations by all other effects at every step of the algorithm and saves considerable computing time. Any Gauss-Seidel algorithm can easily be extended for variance component estimation by Markov chain-Monte Carlo. Let m and n be the number of records and markers, respectively. Computing time for Cholesky decomposition is proportional to n3. Computing times per round are proportional to mn2 in matrix-free Gauss-Seidel, to n2 for half-stored Gauss-Seidel, and to n and m for the rest of the algorithms. Algorithms were tested on a real mouse data set, which included 1,928 records and 10,946 single-nucleotide polymorphism markers. Computing times were in the order of a few minutes for Gauss-Seidel with residuals update and preconditioned conjugate gradients, more than 1 h for half-stored Gauss-Seidel, 2 h for Cholesky decomposition, and 4 d for matrix-free Gauss-Seidel. Preconditioned conjugate gradients was the fastest. Gauss-Seidel with residuals update would be the method of choice for variance component estimation as well as solving.

  17. Genome-wide analysis highlights genetic dilution in Algerian sheep.

    PubMed

    Gaouar, S B S; Lafri, M; Djaout, A; El-Bouyahiaoui, R; Bouri, A; Bouchatal, A; Maftah, A; Ciani, E; Da Silva, A B

    2017-03-01

    Algeria represents a reservoir of genetic diversity with local sheep breeds adapted to a large range of environments and showing specific features necessary to deal with harsh conditions. This remarkable diversity results from the traditional management of dryland by pastoralists over centuries. Most of these breeds are poorly productive, and the economic pressure leads farmers to realize anarchic cross-breeding (that is, not carried out in the framework of selection plans) with the hope to increase animal's conformation. In this study, eight of the nine local Algerian sheep breeds (D'men, Hamra, Ouled-Djellal, Rembi, Sidaoun, Tazegzawt, Berber and Barbarine) were investigated for the first time by genome-wide single-nucleotide polymorphism genotyping. At an international scale, Algerian sheep occupied an original position shaped by relations with African and European (particularly Italian) breeds. The strong genetic proximity with Caribbean and Brazilian breeds confirmed that the genetic make-up of these American breeds was largely influenced by the Atlantic slave trade. At a national scale, an alarming genetic dilution of the Berber (a primitive breed) and the Rembi was observed, as a consequence of uncontrolled mating practices with Ouled-Djellal. A similar, though less pronounced, phenomenon was also detected for the Barbarine, another ancestral breed. Genetic originality appeared to be better preserved in Tazegzawt, Hamra, D'men and Sidaoun. These breeds should be given high priority in the establishment of conservation plans to halt their progressive loss. For Berber and Barbarine that also occur in the bordering neighbor countries, urgent concerted transnational actions are needed.

  18. Genome accessibility is widely preserved and locally modulated during mitosis.

    PubMed

    Hsiung, Chris C-S; Morrissey, Christapher S; Udugama, Maheshi; Frank, Christopher L; Keller, Cheryl A; Baek, Songjoon; Giardine, Belinda; Crawford, Gregory E; Sung, Myong-Hee; Hardison, Ross C; Blobel, Gerd A

    2015-02-01

    Mitosis entails global alterations to chromosome structure and nuclear architecture, concomitant with transient silencing of transcription. How cells transmit transcriptional states through mitosis remains incompletely understood. While many nuclear factors dissociate from mitotic chromosomes, the observation that certain nuclear factors and chromatin features remain associated with individual loci during mitosis originated the hypothesis that such mitotically retained molecular signatures could provide transcriptional memory through mitosis. To understand the role of chromatin structure in mitotic memory, we performed the first genome-wide comparison of DNase I sensitivity of chromatin in mitosis and interphase, using a murine erythroblast model. Despite chromosome condensation during mitosis visible by microscopy, the landscape of chromatin accessibility at the macromolecular level is largely unaltered. However, mitotic chromatin accessibility is locally dynamic, with individual loci maintaining none, some, or all of their interphase accessibility. Mitotic reduction in accessibility occurs primarily within narrow, highly DNase hypersensitive sites that frequently coincide with transcription factor binding sites, whereas broader domains of moderate accessibility tend to be more stable. In mitosis, proximal promoters generally maintain their accessibility more strongly, whereas distal regulatory elements tend to lose accessibility. Large domains of DNA hypomethylation mark a subset of promoters that retain accessibility during mitosis and across many cell types in interphase. Erythroid transcription factor GATA1 exerts site-specific changes in interphase accessibility that are most pronounced at distal regulatory elements, but has little influence on mitotic accessibility. We conclude that features of open chromatin are remarkably stable through mitosis, but are modulated at the level of individual genes and regulatory elements.

  19. Genome-wide analysis of condensin binding in Caenorhabditis elegans

    PubMed Central

    2013-01-01

    Background Condensins are multi-subunit protein complexes that are essential for chromosome condensation during mitosis and meiosis, and play key roles in transcription regulation during interphase. Metazoans contain two condensins, I and II, which perform different functions and localize to different chromosomal regions. Caenorhabditis elegans contains a third condensin, IDC, that is targeted to and represses transcription of the X chromosome for dosage compensation. Results To understand condensin binding and function, we performed ChIP-seq analysis of C. elegans condensins in mixed developmental stage embryos, which contain predominantly interphase nuclei. Condensins bind to a subset of active promoters, tRNA genes and putative enhancers. Expression analysis in kle-2-mutant larvae suggests that the primary effect of condensin II on transcription is repression. A DNA sequence motif, GCGC, is enriched at condensin II binding sites. A sequence extension of this core motif, AGGG, creates the condensin IDC motif. In addition to differences in recruitment that result in X-enrichment of condensin IDC and condensin II binding to all chromosomes, we provide evidence for a shared recruitment mechanism, as condensin IDC recruiter SDC-2 also recruits condensin II to the condensin IDC recruitment sites on the X. In addition, we found that condensin sites overlap extensively with the cohesin loader SCC-2, and that SDC-2 also recruits SCC-2 to the condensin IDC recruitment sites. Conclusions Our results provide the first genome-wide view of metazoan condensin II binding in interphase, define putative recruitment motifs, and illustrate shared loading mechanisms for condensin IDC and condensin II. PMID:24125077

  20. A genome-wide DNA methylation study in azoospermia.

    PubMed

    Ferfouri, F; Boitrelle, F; Ghout, I; Albert, M; Molina Gomes, D; Wainer, R; Bailly, M; Selva, J; Vialard, F

    2013-11-01

    The objective of this study was to assess genome-wide DNA methylation in testicular tissue from azoospermic patients. A total of 94 azoospermic patients were recruited and classified into three groups: 29 patients presented obstructive azoospermia (OA), 26 displayed non-obstructive azoospermia (NOA) and successful retrieval of spermatozoa by testicular sperm extraction (TESE+) and 39 displayed NOA and failure to retrieve spermatozoa by TESE (TESE-). An Illumina Infinium Human Methylation27 BeadChip DNA methylation array was used to establish a testicular DNA methylation pattern for each type of azoospermic patient. The OA and NOA groups were compared in terms of the relative M-value (the log2 ratio between methylated and non-methylated probe intensities) for each CpG site. We observed significantly different DNA methylation profiles for the NOA and OA groups, with differences at over 9000 of the 27 578 CpG sites; 212 CpG sites had a relative M-value >3. The results highlighted 14 testis-specific genes. Patient clustering with respect to these 212 CpG sites corresponded closely to the clinical classification. The DNA methylation patterns showed that in the NOA group, 78 of the 212 CpG sites were hypomethylated and 134 were hypermethylated (relative to the OA group). On the basis of these DNA methylation profiles, azoospermic patients could be classified as OA or NOA by considering the 212 CpG sites with the greatest methylation differences. Furthermore, we identified genes that may provide insight into the mechanism of idiopathic NOA.

  1. Genome-wide examination of myoblast cell cycle withdrawal duringdifferentiation

    SciTech Connect

    Shen, Xun; Collier, John Michael; Hlaing, Myint; Zhang, Leanne; Delshad, Elizabeth H.; Bristow, James; Bernstein, Harold S.

    2002-12-02

    Skeletal and cardiac myocytes cease division within weeks of birth. Although skeletal muscle retains limited capacity for regeneration through recruitment of satellite cells, resident populations of adult myocardial stem cells have not been identified. Because cell cycle withdrawal accompanies myocyte differentiation, we hypothesized that C2C12 cells, a mouse myoblast cell line previously used to characterize myocyte differentiation, also would provide a model for studying cell cycle withdrawal during differentiation. C2C12 cells were differentiated in culture medium containing horse serum and harvested at various time points to characterize the expression profiles of known cell cycle and myogenic regulatory factors by immunoblot analysis. BrdU incorporation decreased dramatically in confluent cultures 48 hr after addition of horse serum, as cells started to form myotubes. This finding was preceded by up-regulation of MyoD, followed by myogenin, and activation of Bcl-2. Cyclin D1 was expressed in proliferating cultures and became undetectable in cultures containing 40 percent fused myotubes, as levels of p21(WAF1/Cip1) increased and alpha-actin became detectable. Because C2C12 myoblasts withdraw from the cell cycle during myocyte differentiation following a course that recapitulates this process in vivo, we performed a genome-wide screen to identify other gene products involved in this process. Using microarrays containing approximately 10,000 minimally redundant mouse sequences that map to the UniGene database of the National Center for Biotechnology Information, we compared gene expression profiles between proliferating, differentiating, and differentiated C2C12 cells and verified candidate genes demonstrating differential expression by RT-PCR. Cluster analysis of differentially expressed genes revealed groups of gene products involved in cell cycle withdrawal, muscle differentiation, and apoptosis. In addition, we identified several genes, including DDAH2 and Ly

  2. Nightshift work and genome-wide DNA methylation.

    PubMed

    Bhatti, Parveen; Zhang, Yuzheng; Song, Xiaoling; Makar, Karen W; Sather, Cassandra L; Kelsey, Karl T; Houseman, E Andres; Wang, Pei

    2015-02-01

    The negative health effects of shift work, including carcinogenesis, may be mediated by changes in DNA methylation, particularly in the circadian genes. Using the Infinium HumanMethylation450 Bead Array (Illumina, San Diego, CA), we compared genome-wide methylation between 65 actively working dayshift workers and 59 actively working nightshift workers in the healthcare industry. A total of 473 800 loci, including 391 loci across the 12 core circadian genes, were analyzed to identify methylation markers associated with shift work status using linear regression models adjusted for gender, age, body mass index, race, smoking status and leukocyte cell profile as measured by flow cytometry. Analyses at the level of gene, CpG island and gene region were also conducted. To account for multiple comparisons, we controlled the false discovery rate (FDR ≤0.05). Significant differences between nightshift and dayshift workers were found at 16 135 of 473 800 loci, across 3769 of 20 164 genes, across 7173 of 22 721 CpG islands and across 5508 of 51 843 gene regions. For each significant loci, gene, CpG island or gene region, average methylation was consistently found to be decreased among nightshift workers compared to dayshift workers. Twenty-one loci located in the circadian genes were also found to be significantly hypomethylated among nightshift workers. The largest differences were observed for three loci located in the gene body of PER3. A total of nine significant loci were found in the CSNK1E gene, most of which were located in a CpG island and near the transcription start site of the gene. Methylation changes in these circadian genes may lead to altered expression of these genes which has been associated with cancer in previous studies. Gene ontology enrichment analysis revealed that among the significantly hypomethylated genes, processes related to host defense and immunity were represented. Our results indicate that the health effects of shift work may be

  3. Genome-wide transcription analyses in rice using tiling microarrays.

    PubMed

    Li, Lei; Wang, Xiangfeng; Stolc, Viktor; Li, Xueyong; Zhang, Dongfen; Su, Ning; Tongprasit, Waraporn; Li, Songgang; Cheng, Zhukuan; Wang, Jun; Deng, Xing Wang

    2006-01-01

    Sequencing and computational annotation revealed several features, including high gene numbers, unusual composition of the predicted genes and a large number of genes lacking homology to known genes, that distinguish the rice (Oryza sativa) genome from that of other fully sequenced model species. We report here a full-genome transcription analysis of the indica rice subspecies using high-density oligonucleotide tiling microarrays. Our results provided expression data support for the existence of 35,970 (81.9%) annotated gene models and identified 5,464 unique transcribed intergenic regions that share similar compositional properties with the annotated exons and have significant homology to other plant proteins. Elucidating and mapping of all transcribed regions revealed an association between global transcription and cytological chromosome features, and an overall similarity of transcriptional activity between duplicated segments of the genome. Collectively, our results provide the first whole-genome transcription map useful for further understanding the rice genome.

  4. Integrated genome-wide analysis of genomic changes and gene regulation in human adrenocortical tissue samples.

    PubMed

    Gara, Sudheer Kumar; Wang, Yonghong; Patel, Dhaval; Liu-Chittenden, Yi; Jain, Meenu; Boufraqech, Myriem; Zhang, Lisa; Meltzer, Paul S; Kebebew, Electron

    2015-10-30

    To gain insight into the pathogenesis of adrenocortical carcinoma (ACC) and whether there is progression from normal-to-adenoma-to-carcinoma, we performed genome-wide gene expression, gene methylation, microRNA expression and comparative genomic hybridization (CGH) analysis in human adrenocortical tissue (normal, adrenocortical adenomas and ACC) samples. A pairwise comparison of normal, adrenocortical adenomas and ACC gene expression profiles with more than four-fold expression differences and an adjusted P-value < 0.05 revealed no major differences in normal versus adrenocortical adenoma whereas there are 808 and 1085, respectively, dysregulated genes between ACC versus adrenocortical adenoma and ACC versus normal. The majority of the dysregulated genes in ACC were downregulated. By integrating the CGH, gene methylation and expression profiles of potential miRNAs with the gene expression of dysregulated genes, we found that there are higher alterations in ACC versus normal compared to ACC versus adrenocortical adenoma. Importantly, we identified several novel molecular pathways that are associated with dysregulated genes and further experimentally validated that oncostatin m signaling induces caspase 3 dependent apoptosis and suppresses cell proliferation. Finally, we propose that there is higher number of genomic changes from normal-to-adenoma-to-carcinoma and identified oncostatin m signaling as a plausible druggable pathway for therapeutics.

  5. Genome-wide association study identifies multiple susceptibility loci for diffuse large B cell lymphoma.

    PubMed

    Cerhan, James R; Berndt, Sonja I; Vijai, Joseph; Ghesquières, Hervé; McKay, James; Wang, Sophia S; Wang, Zhaoming; Yeager, Meredith; Conde, Lucia; de Bakker, Paul I W; Nieters, Alexandra; Cox, David; Burdett, Laurie; Monnereau, Alain; Flowers, Christopher R; De Roos, Anneclaire J; Brooks-Wilson, Angela R; Lan, Qing; Severi, Gianluca; Melbye, Mads; Gu, Jian; Jackson, Rebecca D; Kane, Eleanor; Teras, Lauren R; Purdue, Mark P; Vajdic, Claire M; Spinelli, John J; Giles, Graham G; Albanes, Demetrius; Kelly, Rachel S; Zucca, Mariagrazia; Bertrand, Kimberly A; Zeleniuch-Jacquotte, Anne; Lawrence, Charles; Hutchinson, Amy; Zhi, Degui; Habermann, Thomas M; Link, Brian K; Novak, Anne J; Dogan, Ahmet; Asmann, Yan W; Liebow, Mark; Thompson, Carrie A; Ansell, Stephen M; Witzig, Thomas E; Weiner, George J; Veron, Amelie S; Zelenika, Diana; Tilly, Hervé; Haioun, Corinne; Molina, Thierry Jo; Hjalgrim, Henrik; Glimelius, Bengt; Adami, Hans-Olov; Bracci, Paige M; Riby, Jacques; Smith, Martyn T; Holly, Elizabeth A; Cozen, Wendy; Hartge, Patricia; Morton, Lindsay M; Severson, Richard K; Tinker, Lesley F; North, Kari E; Becker, Nikolaus; Benavente, Yolanda; Boffetta, Paolo; Brennan, Paul; Foretova, Lenka; Maynadie, Marc; Staines, Anthony; Lightfoot, Tracy; Crouch, Simon; Smith, Alex; Roman, Eve; Diver, W Ryan; Offit, Kenneth; Zelenetz, Andrew; Klein, Robert J; Villano, Danylo J; Zheng, Tongzhang; Zhang, Yawei; Holford, Theodore R; Kricker, Anne; Turner, Jenny; Southey, Melissa C; Clavel, Jacqueline; Virtamo, Jarmo; Weinstein, Stephanie; Riboli, Elio; Vineis, Paolo; Kaaks, Rudolph; Trichopoulos, Dimitrios; Vermeulen, Roel C H; Boeing, Heiner; Tjonneland, Anne; Angelucci, Emanuele; Di Lollo, Simonetta; Rais, Marco; Birmann, Brenda M; Laden, Francine; Giovannucci, Edward; Kraft, Peter; Huang, Jinyan; Ma, Baoshan; Ye, Yuanqing; Chiu, Brian C H; Sampson, Joshua; Liang, Liming; Park, Ju-Hyun; Chung, Charles C; Weisenburger, Dennis D; Chatterjee, Nilanjan; Fraumeni, Joseph F; Slager, Susan L; Wu, Xifeng; de Sanjose, Silvia; Smedby, Karin E; Salles, Gilles; Skibola, Christine F; Rothman, Nathaniel; Chanock, Stephen J

    2014-11-01

    Diffuse large B cell lymphoma (DLBCL) is the most common lymphoma subtype and is clinically aggressive. To identify genetic susceptibility loci for DLBCL, we conducted a meta-analysis of 3 new genome-wide association studies (GWAS) and 1 previous scan, totaling 3,857 cases and 7,666 controls of European ancestry, with additional genotyping of 9 promising SNPs in 1,359 cases and 4,557 controls. In our multi-stage analysis, five independent SNPs in four loci achieved genome-wide significance marked by rs116446171 at 6p25.3 (EXOC2; P = 2.33 × 10(-21)), rs2523607 at 6p21.33 (HLA-B; P = 2.40 × 10(-10)), rs79480871 at 2p23.3 (NCOA1; P = 4.23 × 10(-8)) and two independent SNPs, rs13255292 and rs4733601, at 8q24.21 (PVT1; P = 9.98 × 10(-13) and 3.63 × 10(-11), respectively). These data provide substantial new evidence for genetic susceptibility to this B cell malignancy and point to pathways involved in immune recognition and immune function in the pathogenesis of DLBCL.

  6. Genome-wide mapping of IBD segments in an Ashkenazi PD cohort identifies associated haplotypes.

    PubMed

    Vacic, Vladimir; Ozelius, Laurie J; Clark, Lorraine N; Bar-Shira, Anat; Gana-Weisz, Mali; Gurevich, Tanya; Gusev, Alexander; Kedmi, Merav; Kenny, Eimear E; Liu, Xinmin; Mejia-Santana, Helen; Mirelman, Anat; Raymond, Deborah; Saunders-Pullman, Rachel; Desnick, Robert J; Atzmon, Gil; Burns, Edward R; Ostrer, Harry; Hakonarson, Hakon; Bergman, Aviv; Barzilai, Nir; Darvasi, Ariel; Peter, Inga; Guha, Saurav; Lencz, Todd; Giladi, Nir; Marder, Karen; Pe'er, Itsik; Bressman, Susan B; Orr-Urtreger, Avi

    2014-09-01

    The recent series of large genome-wide association studies in European and Japanese cohorts established that Parkinson disease (PD) has a substantial genetic component. To further investigate the genetic landscape of PD, we performed a genome-wide scan in the largest to date Ashkenazi Jewish cohort of 1130 Parkinson patients and 2611 pooled controls. Motivated by the reduced disease allele heterogeneity and a high degree of identical-by-descent (IBD) haplotype sharing in this founder population, we conducted a haplotype association study based on mapping of shared IBD segments. We observed significant haplotype association signals at three previously implicated Parkinson loci: LRRK2 (OR = 12.05, P = 1.23 × 10(-56)), MAPT (OR = 0.62, P = 1.78 × 10(-11)) and GBA (multiple distinct haplotypes, OR > 8.28, P = 1.13 × 10(-11) and OR = 2.50, P = 1.22 × 10(-9)). In addition, we identified a novel association signal on chr2q14.3 coming from a rare haplotype (OR = 22.58, P = 1.21 × 10(-10)) and replicated it in a secondary cohort of 306 Ashkenazi PD cases and 2583 controls. Our results highlight the power of our haplotype association method, particularly useful in studies of founder populations, and reaffirm the benefits of studying complex diseases in Ashkenazi Jewish cohorts.

  7. Genome-Wide Association Mapping and Genomic Selection for Alfalfa (Medicago sativa) Forage Quality Traits

    PubMed Central

    Pecetti, Luciano; Brummer, E. Charles; Palmonari, Alberto; Tava, Aldo

    2017-01-01

    Genetic progress for forage quality has been poor in alfalfa (Medicago sativa L.), the most-grown forage legume worldwide. This study aimed at exploring opportunities for marker-assisted selection (MAS) and genomic selection of forage quality traits based on breeding values of parent plants. Some 154 genotypes from a broadly-based reference population were genotyped by genotyping-by-sequencing (GBS), and phenotyped for leaf-to-stem ratio, leaf and stem contents of protein, neutral detergent fiber (NDF) and acid detergent lignin (ADL), and leaf and stem NDF digestibility after 24 hours (NDFD), of their dense-planted half-sib progenies in three growing conditions (summer harvest, full irrigation; summer harvest, suspended irrigation; autumn harvest). Trait-marker analyses were performed on progeny values averaged over conditions, owing to modest germplasm × condition interaction. Genomic selection exploited 11,450 polymorphic SNP markers, whereas a subset of 8,494 M. truncatula-aligned markers were used for a genome-wide association study (GWAS). GWAS confirmed the polygenic control of quality traits and, in agreement with phenotypic correlations, indicated substantially different genetic control of a given trait in stems and leaves. It detected several SNPs in different annotated genes that were highly linked to stem protein content. Also, it identified a small genomic region on chromosome 8 with high concentration of annotated genes associated with leaf ADL, including one gene probably involved in the lignin pathway. Three genomic selection models, i.e., Ridge-regression BLUP, Bayes B and Bayesian Lasso, displayed similar prediction accuracy, whereas SVR-lin was less accurate. Accuracy values were moderate (0.3–0.4) for stem NDFD and leaf protein content, modest for leaf ADL and NDFD, and low to very low for the other traits. Along with previous results for the same germplasm set, this study indicates that GBS data can be exploited to improve both quality traits

  8. Genome-Wide Association Study of HIV Whole Genome Sequences Validated using Drug Resistance

    PubMed Central

    Power, Robert A.; Davaniah, Siva; Derache, Anne; Wilkinson, Eduan; Tanser, Frank; Pillay, Deenan; de Oliveira, Tulio

    2016-01-01

    Background Genome-wide association studies (GWAS) have considerably advanced our understanding of human traits and diseases. With the increasing availability of whole genome sequences (WGS) for pathogens, it is important to establish whether GWAS of viral genomes could reveal important biological insights. Here we perform the first proof of concept viral GWAS examining drug resistance (DR), a phenotype with well understood genetics. Method We performed a GWAS of DR in a sample of 343 HIV subtype C patients failing 1st line antiretroviral treatment in rural KwaZulu-Natal, South Africa. The majority and minority variants within each sequence were called using PILON, and GWAS was performed within PLINK. HIV WGS from patients failing on different antiretroviral treatments were compared to sequences derived from individuals naïve to the respective treatment. Results GWAS methodology was validated by identifying five associations on a genetic level that led to amino acid changes known to cause DR. Further, we highlighted the ability of GWAS to identify epistatic effects, identifying two replicable variants within amino acid 68 of the reverse transcriptase protein previously described as potential fitness compensatory mutations. A possible additional DR variant within amino acid 91 of the matrix region of the Gag protein was associated with tenofovir failure, highlighting GWAS’s ability to identify variants outside classical candidate genes. Our results also suggest a polygenic component to DR. Conclusions These results validate the applicability of GWAS to HIV WGS data even in relative small samples, and emphasise how high throughput sequencing can provide novel and clinically relevant insights. Further they suggested that for viruses like HIV, population structure was only minor concern compared to that seen in bacteria or parasite GWAS. Given the small genome length and reduced burden for multiple testing, this makes HIV an ideal candidate for GWAS. PMID:27677172

  9. Prioritizing genomic applications for action by level of evidence: a horizon-scanning method.

    PubMed

    Dotson, W D; Douglas, M P; Kolor, K; Stewart, A C; Bowen, M S; Gwinn, M; Wulf, A; Anders, H M; Chang, C Q; Clyne, M; Lam, T K; Schully, S D; Marrone, M; Feero, W G; Khoury, M J

    2014-04-01

    As evidence accumulates on the use of genomic tests and other health-related applications of genomic technologies, decision makers may increasingly seek support in identifying which applications have sufficiently robust evidence to suggest they might be considered for action. As an interim working process to provide such support, we developed a horizon-scanning method that assigns genomic applications to tiers defined by availability of synthesized evidence. We illustrate an application of the method to pharmacogenomics tests.

  10. Structure-symptom relationship with wide-area ultrasound scanning of knee osteoarthritis

    PubMed Central

    Podlipská, Jana; Koski, Juhani M.; Kaukinen, Päivi; Haapea, Marianne; Tervonen, Osmo; Arokoski, Jari P.; Saarakkala, Simo

    2017-01-01

    The aetiology of knee pain in osteoarthritis (OA) is heterogeneous and its relationship with structural changes and function is unclear. Our goal was to determine the prevalence of wide-area scanned ultrasound-defined knee OA structural features and their association with pain and functional impairment in 79 symptomatic and 63 asymptomatic subjects. All subjects underwent ultrasound knee wide-area scanning and the severity of articular cartilage degeneration, the presence and size of osteophytes, and meniscal extrusion were evaluated. Subjects filled in a self-administrated questionnaire on present knee pain, and Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) on clinical symptoms and function over the past week. Cartilage changes were the most prevalent followed by lateral meniscal extrusion, osteophytes and medial meniscal extrusion. The global femoral cartilage grade associated strongly with pain and the WOMAC index. Site-specifically, early medial cartilage changes and thinning in sulcus and lateral site were associated with symptoms. The presence of femoral lateral osteophytes was also associated with both outcomes. Using the novel wide-area ultrasound scanning technique, we were able to confirm the negative impact of femoral cartilage OA changes on clinical symptoms. Presence, not necessarily size, of lateral femoral osteophytes was also associated with increased pain and disability. PMID:28295049

  11. Adaptive optics scanning laser ophthalmoscope with integrated wide-field retinal imaging and tracking.

    PubMed

    Ferguson, R Daniel; Zhong, Zhangyi; Hammer, Daniel X; Mujat, Mircea; Patel, Ankit H; Deng, Cong; Zou, Weiyao; Burns, Stephen A

    2010-11-01

    We have developed a new, unified implementation of the adaptive optics scanning laser ophthalmoscope (AOSLO) incorporating a wide-field line-scanning ophthalmoscope (LSO) and a closed-loop optical retinal tracker. AOSLO raster scans are deflected by the integrated tracking mirrors so that direct AOSLO stabilization is automatic during tracking. The wide-field imager and large-spherical-mirror optical interface design, as well as a large-stroke deformable mirror (DM), enable the AOSLO image field to be corrected at any retinal coordinates of interest in a field of >25 deg. AO performance was assessed by imaging individuals with a range of refractive errors. In most subjects, image contrast was measurable at spatial frequencies close to the diffraction limit. Closed-loop optical (hardware) tracking performance was assessed by comparing sequential image series with and without stabilization. Though usually better than 10 μm rms, or 0.03 deg, tracking does not yet stabilize to single cone precision but significantly improves average image quality and increases the number of frames that can be successfully aligned by software-based post-processing methods. The new optical interface allows the high-resolution imaging field to be placed anywhere within the wide field without requiring the subject to re-fixate, enabling easier retinal navigation and faster, more efficient AOSLO montage capture and stitching.

  12. Genome-wide synteny through highly sensitive sequence alignment: Satsuma

    PubMed Central

    Grabherr, Manfred G.; Russell, Pamela; Meyer, Miriah; Mauceli, Evan; Alföldi, Jessica; Di Palma, Federica; Lindblad-Toh, Kerstin

    2010-01-01

    Motivation: Comparative genomics heavily relies on alignments of large and often complex DNA sequences. From an engineering perspective, the problem here is to provide maximum sensitivity (to find all there is to find), specificity (to only find real homology) and speed (to accommodate the billions of base pairs of vertebrate genomes). Results: Satsuma addresses all three issues through novel strategies: (i) cross-correlation, implemented via fast Fourier transform; (ii) a match scoring scheme that eliminates almost all false hits; and (iii) an asynchronous ‘battleship’-like search that allows for aligning two entire fish genomes (470 and 217 Mb) in 120 CPU hours using 15 processors on a single machine. Availability: Satsuma is part of the Spines software package, implemented in C++ on Linux. The latest version of Spines can be freely downloaded under the LGPL license from http://www.broadinstitute.org/science/programs/genome-biology/spines/ Contact: grabherr@broadinstitute.org PMID:20208069

  13. High quality genome-wide genotyping from archived dried blood spots without DNA amplification.

    PubMed

    St Julien, Krystal R; Jelliffe-Pawlowski, Laura L; Shaw, Gary M; Stevenson, David K; O'Brodovich, Hugh M; Krasnow, Mark A

    2013-01-01

    Spots of blood are routinely collected from newborn babies onto filter paper called Guthrie cards and used to screen for metabolic and genetic disorders. The archived dried blood spots are an important and precious resource for genomic research. Whole genome amplification of dried blood spot DNA has been used to provide DNA for genome-wide SNP genotyping. Here we describe a 96 well format procedure to extract DNA from a portion of a dried blood spot that provides sufficient unamplified genomic DNA for genome-wide single nucleotide polymorphism (SNP) genotyping. We show that SNP genotyping of the unamplified DNA is more robust than genotyping amplified dried blood spot DNA, is comparable in cost, and can be done with thousands of samples. This procedure can be used for genome-wide association studies and other large-scale genomic analyses that require robust, high-accuracy genotyping of dried blood spot DNA.

  14. Meta-Analysis of Genome-Wide Association Studies of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schafer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J. L.; Langely, Kate; O'Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association studies (GWAS) have not yielded significant results, we conducted a meta-analysis of…

  15. Case-Control Genome-Wide Association Study of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Neale, Benjamin M.; Medland, Sarah; Ripke, Stephan; Anney, Richard J. L.; Asherson, Philip; Buitelaar, Jan; Franke, Barbara; Gill, Michael; Kent, Lindsey; Holmans, Peter; Middleton, Frank; Thapar, Anita; Lesch, Klaus-Peter; Faraone, Stephen V.; Daly, Mark; Nguyen, Thuy Trang; Schafer, Helmut; Steinhausen, Hans-Christoph; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Warnke, Andreas; Walitza, Susanne; Freitag, Christine; Meyer, Jobst; Palmason, Haukur; Rothenberger, Aribert; Hawi, Ziarih; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph

    2010-01-01

    Objective: Although twin and family studies have shown attention-deficit/hyperactivity disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. Thus additional genome-wide association studies (GWAS) are needed. Method: We used case-control analyses of 896 cases…

  16. Genome-Wide Association of the Laboratory-Based Nicotine Metabolite Ratio in Three Ancestries

    PubMed Central

    Baurley, James W.; Edlund, Christopher K.; Pardamean, Carissa I.; Conti, David V.; Krasnow, Ruth; Javitz, Harold S.; Hops, Hyman; Swan, Gary E.; Benowitz, Neal L.

    2016-01-01

    Introduction: Metabolic enzyme variation and other patient and environmental characteristics influence smoking behaviors, treatment success, and risk of related disease. Population-specific variation in metabolic genes contributes to challenges in developing and optimizing pharmacogenetic interventions. We applied a custom genome-wide genotyping array for addiction research (Smokescreen), to three laboratory-based studies of nicotine metabolism with oral or venous administration of labeled nicotine and cotinine, to model nicotine metabolism in multiple populations. The trans-3′-hydroxycotinine/cotinine ratio, the nicotine metabolite ratio (NMR), was the nicotine metabolism measure analyzed. Methods: Three hundred twelve individuals of self-identified European, African, and Asian American ancestry were genotyped and included in ancestry-specific genome-wide association scans (GWAS) and a meta-GWAS analysis of the NMR. We modeled natural-log transformed NMR with covariates: principal components of genetic ancestry, age, sex, body mass index, and smoking status. Results: African and Asian American NMRs were statistically significantly (P values ≤ 5E-5) lower than European American NMRs. Meta-GWAS analysis identified 36 genome-wide significant variants over a 43 kilobase pair region at CYP2A6 with minimum P = 2.46E-18 at rs12459249, proximal to CYP2A6. Additional minima were located in intron 4 (rs56113850, P = 6.61E-18) and in the CYP2A6-CYP2A7 intergenic region (rs34226463, P = 1.45E-12). Most (34/36) genome-wide significant variants suggested reduced CYP2A6 activity; functional mechanisms were identified and tested in knowledge-bases. Conditional analysis resulted in intergenic variants of possible interest (P values < 5E-5). Conclusions: This meta-GWAS of the NMR identifies CYP2A6 variants, replicates the top-ranked single nucleotide polymorphism from a recent Finnish meta-GWAS of the NMR, identifies functional mechanisms, and provides pan

  17. A Practical Genome Scan for Population-Specific Strong Selective Sweeps That Have Reached Fixation

    PubMed Central

    Kimura, Ryosuke; Fujimoto, Akihiro; Tokunaga, Katsushi; Ohashi, Jun

    2007-01-01

    Phenotypic divergences between modern human populations have developed as a result of genetic adaptation to local environments over the past 100,000 years. To identify genes involved in population-specific phenotypes, it is necessary to detect signatures of recent positive selection in the human genome. Although detection of elongated linkage disequilibrium (LD) has been a powerful tool in the field of evolutionary genetics, current LD-based approaches are not applicable to already fixed loci. Here, we report a method of scanning for population-specific strong selective sweeps that have reached fixation. In this method, genome-wide SNP data is used to analyze differences in the haplotype frequency, nucleotide diversity, and LD between populations, using the ratio of haplotype homozygosity between populations. To estimate the detection power of the statistics used in this study, we performed computer simulations and found that these tests are relatively robust against the density of typed SNPs and demographic parameters if the advantageous allele has reached fixation. Therefore, we could determine the threshold for maintaining high detection power, regardless of SNP density and demographic history. When this method was applied to the HapMap data, it was able to identify the candidates of population-specific strong selective sweeps more efficiently than the outlier approach that depends on the empirical distribution. This study, confirming strong positive selection on genes previously reported to be associated with specific phenotypes, also identifies other candidates that are likely to contribute to phenotypic differences between human populations. PMID:17356696

  18. More heritability probably captured by psoriasis genome-wide association study in Han Chinese.

    PubMed

    Jiang, Long; Liu, Lu; Cheng, Yuyan; Lin, Yan; Shen, Changbing; Zhu, Caihong; Yang, Sen; Yin, Xianyong; Zhang, Xuejun

    2015-11-15

    Missing heritability is a common problem in genome-wide association studies in complex diseases/traits. To quantify the unbiased heritability estimate, we applied the phenotype correlation-genotype correlation regression in psoriasis genome-wide association data in Han Chinese which comprises 1139 cases and 1132 controls. We estimated that 45.7% heritability of psoriasis in Han Chinese were captured by common variants (s.e.=12.5%), which reinforced that the majority of psoriasis heritability can be covered by common variants in genome-wide association data (68.2%). The results provided evidence that the heritability covered by psoriasis genome-wide genotyping data was probably underestimated in previous restricted maximum likelihood method. Our study highlights the broad role of common variants in the etiology of psoriasis and sheds light on the possibility to identify more common variants of small effect by increasing the sample size in psoriasis genome-wide association studies.

  19. Genome-wide miRNA seeds prediction in Archaea.

    PubMed

    Wang, Shengqin; Xu, Yuming; Lu, Zuhong

    2014-01-01

    Growing evidence indicates that miRNA genes exist in the archaeal genome, though the functional role of such noncoding RNA remains unclear. Here, we integrated the phylogenetic information of available archaeal genomes to predict miRNA seeds (typically defined as the 2-8 nucleotides of mature miRNAs) on the genomic scale. Finally, we found 2649 candidate seeds with significant conservation signal. Eleven of 29 unique seeds from previous study support our result (P value <0.01), which demonstrates that the pipeline is suitable to predict experimentally detectable miRNA seeds. The statistical significance of the overlap between the detected archaeal seeds and known eukaryotic seeds shows that the miRNA may evolve before the divergence of these two domains of cellular life. In addition, miRNA targets are enriched for genes involved in transcriptional regulation, which is consistent with the situation in eukaryote. Our research will enhance the regulatory network analysis in Archaea.

  20. Genome-wide comparison of medieval and modern Mycobacterium leprae.

    PubMed

    Schuenemann, Verena J; Singh, Pushpendra; Mendum, Thomas A; Krause-Kyora, Ben; Jäger, Günter; Bos, Kirsten I; Herbig, Alexander; Economou, Christos; Benjak, Andrej; Busso, Philippe; Nebel, Almut; Boldsen, Jesper L; Kjellström, Anna; Wu, Huihai; Stewart, Graham R; Taylor, G Michael; Bauer, Peter; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Tucker, Katie; Roffey, Simon; Sow, Samba O; Cole, Stewart T; Nieselt, Kay; Krause, Johannes

    2013-07-12

    Leprosy was endemic in Europe until the Middle Ages. Using DNA array capture, we have obtained genome sequences of Mycobacterium leprae from skeletons of five medieval leprosy cases from the United Kingdom, Sweden, and Denmark. In one case, the DNA was so well preserved that full de novo assembly of the ancient bacterial genome could be achieved through shotgun sequencing alone. The ancient M. leprae sequences were compared with those of 11 modern strains, representing diverse genotypes and geographic origins. The comparisons revealed remarkable genomic conservation during the past 1000 years, a European origin for leprosy in the Americas, and the presence of an M. leprae genotype in medieval Europe now commonly associated with the Middle East. The exceptional preservation of M. leprae biomarkers, both DNA and mycolic acids, in ancient skeletons has major implications for palaeomicrobiology and human pathogen evolution.

  1. Nonlinear analysis and dynamic compensation of stylus scanning measurement with wide range

    NASA Astrophysics Data System (ADS)

    Hui, Heiyang; Liu, Xiaojun; Lu, Wenlong

    2011-12-01

    Surface topography is an important geometrical feature of a workpiece that influences its quality and functions such as friction, wearing, lubrication and sealing. Precision measurement of surface topography is fundamental for product quality characterizing and assurance. Stylus scanning technique is a widely used method for surface topography measurement, and it is also regarded as the international standard method for 2-D surface characterizing. Usually surface topography, including primary profile, waviness and roughness, can be measured precisely and efficiently by this method. However, by stylus scanning method to measure curved surface topography, the nonlinear error is unavoidable because of the difference of horizontal position of the actual measured point from given sampling point and the nonlinear transformation process from vertical displacement of the stylus tip to angle displacement of the stylus arm, and the error increases with the increasing of measuring range. In this paper, a wide range stylus scanning measurement system based on cylindrical grating interference principle is constructed, the originations of the nonlinear error are analyzed, the error model is established and a solution to decrease the nonlinear error is proposed, through which the error of the collected data is dynamically compensated.

  2. Genome-Wide Heterogeneity of Nucleotide Substitution Model Fit

    PubMed Central

    Arbiza, Leonardo; Patricio, Mateus; Dopazo, Hernán; Posada, David

    2011-01-01

    At a genomic scale, the patterns that have shaped molecular evolution are believed to be largely heterogeneous. Consequently, comparative analyses should use appropriate probabilistic substitution models that capture the main features under which different genomic regions have evolved. While efforts have concentrated in the development and understanding of model selection techniques, no descriptions of overall relative substitution model fit at the genome level have been reported. Here, we provide a characterization of best-fit substitution models across three genomic data sets including coding regions from mammals, vertebrates, and Drosophila (24,000 alignments). According to the Akaike Information Criterion (AIC), 82 of 88 models considered were selected as best-fit models at least in one occasion, although with very different frequencies. Most parameter estimates also varied broadly among genes. Patterns found for vertebrates and Drosophila were quite similar and often more complex than those found in mammals. Phylogenetic trees derived from models in the 95% confidence interval set showed much less variance and were significantly closer to the tree estimated under the best-fit model than trees derived from models outside this interval. Although alternative criteria selected simpler models than the AIC, they suggested similar patterns. All together our results show that at a genomic scale, different gene alignments for the same set of taxa are best explained by a large variety of different substitution models and that model choice has implications on different parameter estimates including the inferred phylogenetic trees. After taking into account the differences related to sample size, our results suggest a noticeable diversity in the underlying evolutionary process. All together, we conclude that the use of model selection techniques is important to obtain consistent phylogenetic estimates from real data at a genomic scale. PMID:21824869

  3. Preterm Birth Genome Project (PGP) -- validation of resources for preterm birth genome-wide studies.

    PubMed

    Pennell, Craig E; Vadillo-Ortega, Felipe; Olson, David M; Ha, Eun-Hee; Williams, Scott; Frayling, Tim M; Dolan, Siobhan; Katz, Michael; Merialdi, Mario; Menon, Ramkumar

    2013-01-01

    We determined a series of quality control (QC) analyses to assess the usability of DNA collected and processed from different countries utilizing different DNA extraction techniques prior to genome-wide association studies (GWAS). The quality of DNA collected utilizing four different DNA extraction techniques and the impact of shipping DNA at different temperatures on array performance were evaluated. Fifteen maternal-fetal pairs were used from four countries. DNA was extracted using four approaches: whole blood, blood spots with whole genome amplification (WGA), saliva and buccal swab. Samples were sent to a genotyping facility, either on dry ice or at room temperature and genotyped using Affymetrix SNP array 6.0. QC measured included extraction techniques, effect of shipping temperatures, accuracy and Mendelian concordance. Significantly fewer (50 % ) single nucleotide polymorphisms (SNPs) passed QC metrics for buccal swab DNA (P < 0.0001) due to missing genotype data (P < 0.0001). Whole blood or saliva DNA had the highest call rates (99.2 0.4 % and 99.3 0.2 % , respectively) and Mendelian concordance. Shipment temperature had no effect. DNA from blood or saliva had the highest call rate accuracy, and buccal swabs had the lowest. DNA extracted from blood, saliva and blood spots were found suitable for GWAS in our study.

  4. Exploring Relationships between Host Genome and Microbiome: New Insights from Genome-Wide Association Studies

    PubMed Central

    Abdul-Aziz, Muslihudeen A.; Cooper, Alan; Weyrich, Laura S.

    2016-01-01

    As our understanding of the human microbiome expands, impacts on health and disease continue to be revealed. Alterations in the microbiome can result in dysbiosis, which has now been linked to subsequent autoimmune and metabolic diseases, highlighting the need to identify factors that shape the microbiome. Research has identified that the composition and functions of the human microbiome can be influenced by diet, age, sex, and environment. More recently, studies have explored how human genetic variation may also influence the microbiome. Here, we review several recent analytical advances in this new research area, including those that use genome-wide association studies to examine host genome–microbiome interactions, while controlling for the influence of other factors. We find that current research is limited by small sample sizes, lack of cohort replication, and insufficient confirmatory mechanistic studies. In addition, we discuss the importance of understanding long-term interactions between the host genome and microbiome, as well as the potential impacts of disrupting this relationship, and explore new research avenues that may provide information about the co-evolutionary history of humans and their microorganisms. PMID:27785127

  5. Genome-wide map of regulatory interactions in the human genome

    PubMed Central

    Heidari, Nastaran; Phanstiel, Douglas H.; He, Chao; Grubert, Fabian; Jahanbani, Fereshteh; Kasowski, Maya; Zhang, Michael Q.

    2014-01-01

    Increasing evidence suggests that interactions between regulatory genomic elements play an important role in regulating gene expression. We generated a genome-wide interaction map of regulatory elements in human cells (ENCODE tier 1 cells, K562, GM12878) using Chromatin Interaction Analysis by Paired-End Tag sequencing (ChIA-PET) experiments targeting six broadly distributed factors. Bound regions covered 80% of DNase I hypersensitive sites including 99.7% of TSS and 98% of enhancers. Correlating this map with ChIP-seq and RNA-seq data sets revealed cohesin, CTCF, and ZNF143 as key components of three-dimensional chromatin structure and revealed how the distal chromatin state affects gene transcription. Comparison of interactions between cell types revealed that enhancer–promoter interactions were highly cell-type-specific. Construction and comparison of distal and proximal regulatory networks revealed stark differences in structure and biological function. Proximal binding events are enriched at genes with housekeeping functions, while distal binding events interact with genes involved in dynamic biological processes including response to stimulus. This study reveals new mechanistic and functional insights into regulatory region organization in the nucleus. PMID:25228660

  6. Genome-wide association study of tick resistance in South African Nguni cattle.

    PubMed

    Mapholi, N O; Maiwashe, A; Matika, O; Riggio, V; Bishop, S C; MacNeil, M D; Banga, C; Taylor, J F; Dzama, K

    2016-04-01

    Ticks and tick-borne diseases are among the main causes of economic loss in the South African cattle industry through high morbidity and mortality rates. Concerns of the general public regarding chemical residues may tarnish their perceptions of food safety and environmental health when the husbandry of cattle includes frequent use of acaricides to manage ticks. The primary objective of this study was to identify single nucleotide polymorphism (SNP) markers associated with host resistance to ticks in South African Nguni cattle. Tick count data were collected monthly from 586 Nguni cattle reared in four herds under natural grazing conditions over a period of two years. The counts were recorded for six species of ticks attached in eight anatomical locations on the animals and were summed by species and anatomical location. This gave rise to 63 measured phenotypes or traits, with results for 12 of these traits being reported here. Tick count (x) data were transformed using log10(x+1) and the resulting values were examined for normality. DNA was extracted from hair and blood samples and was genotyped using the Illumina BovineSNP50 assay. After quality control (call rate >90%, minor allele frequency >0.02), 40,436 SNPs were retained for analysis. Genetic parameters were estimated and association analysis for tick resistance was carried out using two approaches: a genome-wide association (GWA) analysis using the GenABEL package and a regional heritability mapping (RHM) analysis. The Bonferroni genome-wide (P<0.05) corrected significance threshold was 1.24×10(-6), with 2.47×10(-5) as the suggestive significance threshold (P<0.10) (i.e., one false positive per genome scan) in the GWA analysis. Likelihood ratio test (LRT) thresholds for genome-wide and suggestive significance were 13.5 and 9.15 for the RHM analysis. Six ixodid tick species were identified, with Amblyomma hebraeum (the vector for Heartwater disease) being the dominant species. Heritability estimates (h(2

  7. Genome-wide Association Studies from the Cancer Genetic Markers of Susceptibility (CGEMS) Initiative | Office of Cancer Genomics

    Cancer.gov

    CGEMS identifies common inherited genetic variations associated with a number of cancers, including breast and prostate. Data from these genome-wide association studies (GWAS) are available through the Division of Cancer Epidemiology & Genetics website.

  8. Genome Wide Characterization of Simple Sequence Repeats in Cucumber

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The whole genome sequence of the cucumber cultivar Gy14 was recently sequenced at 15× coverage with the Roche 454 Titanium technology. The microsatellite DNA sequences (simple sequence repeats, SSRs) in the assembled scaffolds were computationally explored and characterized. A total of 112,073 SSRs ...

  9. Pseudogenes and Their Genome-Wide Prediction in Plants

    PubMed Central

    Xiao, Jin; Sekhwal, Manoj Kumar; Li, Pingchuan; Ragupathy, Raja; Cloutier, Sylvie; Wang, Xiue; You, Frank M.

    2016-01-01

    Pseudogenes are paralogs generated from ancestral functional genes (parents) during genome evolution, which contain critical defects in their sequences, such as lacking a promoter, having a premature stop codon or frameshift mutations. Generally, pseudogenes are functionless, but recent evidence demonstrates that some of them have potential roles in regulation. The majority of pseudogenes are generated from functional progenitor genes either by gene duplication (duplicated pseudogenes) or retro-transposition (processed pseudogenes). Pseudogenes are primarily identified by comparison to their parent genes. Bioinformatics tools for pseudogene prediction have been developed, among which PseudoPipe, PSF and Shiu’s pipeline are publicly available. We compared these three tools using the well-annotated Arabidopsis thaliana genome and its known 924 pseudogenes as a test data set. PseudoPipe and Shiu’s pipeline identified ~80% of A. thaliana pseudogenes, of which 94% were shared, while PSF failed to generate adequate results. A need for improvement of the bioinformatics tools for pseudogene prediction accuracy in plant genomes was thus identified, with the ultimate goal of improving the quality of genome annotation in plants. PMID:27916797

  10. Wide-aperture, line-focused ultrasonic material characterization system based on lateral scanning.

    PubMed

    Titov, Sergey; Maev, Roman; Bogatchenkov, Alexey

    2003-08-01

    We present a new wide-aperture, line-focused ultrasonic material characterization system. The foci of the transmitting and receiving transducers are located in the specimen-immersion liquid interface; and the output voltage V(x,t) of the system is recorded as a function of the lateral position of the receiving transducer. The two-dimensional spectrum of V(x, t) can be expressed as a product of the transfer function of the system and the reflectance function of the interface. In comparison with a system based on scanning in the z direction, the angular resolution of the proposed technique increases with decreasing angle of incidence. There are no geometrical restrictions on the length of the recorded spatial data and the angle of incidence in the case of lateral scanning. The temperature coefficient of the measurement error is low because of the constancy of the propagation distance of ultrasound in the immersion fluid during data acquisition.

  11. Characterizing Surfaces of the Wide Bandgap Semiconductor Ilmenite with Scanning Probe Microcopies

    NASA Technical Reports Server (NTRS)

    Wilkins, R.; Powell, Kirk St. A.

    1997-01-01

    Ilmenite (FeTiO3) is a wide bandgap semiconductor with an energy gap of about 2.5eV. Initial radiation studies indicate that ilmenite has properties suited for radiation tolerant applications, as well as a variety of other electronic applications. Two scanning probe microscopy methods have been used to characterize the surface of samples taken from Czochralski grown single crystals. The two methods, atomic force microscopy (AFM) and scanning tunneling microscopy (STM), are based on different physical principles and therefore provide different information about the samples. AFM provides a direct, three-dimensional image of the surface of the samples, while STM give a convolution of topographic and electronic properties of the surface. We will discuss the differences between the methods and present preliminary data of each method for ilmenite samples.

  12. Wide field fluorescence imaging in narrow passageways using scanning fiber endoscope technology

    NASA Astrophysics Data System (ADS)

    Lee, Cameron M.; Chandler, John E.; Seibel, Eric J.

    2010-02-01

    An ultrathin scanning fiber endoscope (SFE) has been developed for high resolution imaging of regions in the body that are commonly inaccessible. The SFE produces 500 line color images at 30 Hz frame rate while maintaining a 1.2-1.7 mm outer diameter. The distal tip of the SFE houses a 9 mm rigid scan engine attached to a highly flexible tether (minimum bend radius < 8 mm) comprised of optical fibers and electrical wires within a protective sheath. Unlike other ultrathin technologies, the unique characteristics of this system have allowed the SFE to navigate narrow passages without sacrificing image quality. To date, the SFE has been used for in vivo imaging of the bile duct, esophagus and peripheral airways. In this study, the standard SFE operation was tailored to capture wide field fluorescence images and spectra. Green (523 nm) and blue (440 nm) lasers were used as illumination sources, while the white balance gain values were adjusted to accentuate red fluorescence signal. To demonstrate wide field fluorescence imaging of small lumens, the SFE was inserted into a phantom model of a human pancreatobiliary tract and navigated to a custom fluorescent target. Both wide field fluorescence and standard color images of the target were captured to demonstrate multimodal imaging.

  13. Epigenome-Wide Scans Identify Differentially Methylated Regions for Age and Age-Related Phenotypes in a Healthy Ageing Population

    PubMed Central

    Yang, Tsun-Po; Pidsley, Ruth; Nisbet, James; Glass, Daniel; Mangino, Massimo; Zhai, Guangju; Zhang, Feng; Valdes, Ana; Shin, So-Youn; Dempster, Emma L.; Murray, Robin M.; Grundberg, Elin; Hedman, Asa K.; Nica, Alexandra; Small, Kerrin S.; Dermitzakis, Emmanouil T.; McCarthy, Mark I.; Mill, Jonathan; Spector, Tim D.; Deloukas, Panos

    2012-01-01

    Age-related changes in DNA methylation have been implicated in cellular senescence and longevity, yet the causes and functional consequences of these variants remain unclear. To elucidate the role of age-related epigenetic changes in healthy ageing and potential longevity, we tested for association between whole-blood DNA methylation patterns in 172 female twins aged 32 to 80 with age and age-related phenotypes. Twin-based DNA methylation levels at 26,690 CpG-sites showed evidence for mean genome-wide heritability of 18%, which was supported by the identification of 1,537 CpG-sites with methylation QTLs in cis at FDR 5%. We performed genome-wide analyses to discover differentially methylated regions (DMRs) for sixteen age-related phenotypes (ap-DMRs) and chronological age (a-DMRs). Epigenome-wide association scans (EWAS) identified age-related phenotype DMRs (ap-DMRs) associated with LDL (STAT5A), lung function (WT1), and maternal longevity (ARL4A, TBX20). In contrast, EWAS for chronological age identified hundreds of predominantly hyper-methylated age DMRs (490 a-DMRs at FDR 5%), of which only one (TBX20) was also associated with an age-related phenotype. Therefore, the majority of age-related changes in DNA methylation are not associated with phenotypic measures of healthy ageing in later life. We replicated a large proportion of a-DMRs in a sample of 44 younger adult MZ twins aged 20 to 61, suggesting that a-DMRs may initiate at an earlier age. We next explored potential genetic and environmental mechanisms underlying a-DMRs and ap-DMRs. Genome-wide overlap across cis-meQTLs, genotype-phenotype associations, and EWAS ap-DMRs identified CpG-sites that had cis-meQTLs with evidence for genotype–phenotype association, where the CpG-site was also an ap-DMR for the same phenotype. Monozygotic twin methylation difference analyses identified one potential environmentally-mediated ap-DMR associated with total cholesterol and LDL (CSMD1). Our results suggest that in a

  14. Genome-wide and fine-resolution association analysis of malaria in West Africa.

    PubMed

    Jallow, Muminatou; Teo, Yik Ying; Small, Kerrin S; Rockett, Kirk A; Deloukas, Panos; Clark, Taane G; Kivinen, Katja; Bojang, Kalifa A; Conway, David J; Pinder, Margaret; Sirugo, Giorgio; Sisay-Joof, Fatou; Usen, Stanley; Auburn, Sarah; Bumpstead, Suzannah J; Campino, Susana; Coffey, Alison; Dunham, Andrew; Fry, Andrew E; Green, Angela; Gwilliam, Rhian; Hunt, Sarah E; Inouye, Michael; Jeffreys, Anna E; Mendy, Alieu; Palotie, Aarno; Potter, Simon; Ragoussis, Jiannis; Rogers, Jane; Rowlands, Kate; Somaskantharajah, Elilan; Whittaker, Pamela; Widden, Claire; Donnelly, Peter; Howie, Bryan; Marchini, Jonathan; Morris, Andrew; SanJoaquin, Miguel; Achidi, Eric Akum; Agbenyega, Tsiri; Allen, Angela; Amodu, Olukemi; Corran, Patrick; Djimde, Abdoulaye; Dolo, Amagana; Doumbo, Ogobara K; Drakeley, Chris; Dunstan, Sarah; Evans, Jennifer; Farrar, Jeremy; Fernando, Deepika; Hien, Tran Tinh; Horstmann, Rolf D; Ibrahim, Muntaser; Karunaweera, Nadira; Kokwaro, Gilbert; Koram, Kwadwo A; Lemnge, Martha; Makani, Julie; Marsh, Kevin; Michon, Pascal; Modiano, David; Molyneux, Malcolm E; Mueller, Ivo; Parker, Michael; Peshu, Norbert; Plowe, Christopher V; Puijalon, Odile; Reeder, John; Reyburn, Hugh; Riley, Eleanor M; Sakuntabhai, Anavaj; Singhasivanon, Pratap; Sirima, Sodiomon; Tall, Adama; Taylor, Terrie E; Thera, Mahamadou; Troye-Blomberg, Marita; Williams, Thomas N; Wilson, Michael; Kwiatkowski, Dominic P

    2009-06-01

    We report a genome-wide association (GWA) study of severe malaria in The Gambia. The initial GWA scan included 2,500 children genotyped on the Affymetrix 500K GeneChip, and a replication study included 3,400 children. We used this to examine the performance of GWA methods in Africa. We found considerable population stratification, and also that signals of association at known malaria resistance loci were greatly attenuated owing to weak linkage disequilibrium (LD). To investigate possible solutions to the problem of low LD, we focused on the HbS locus, sequencing this region of the genome in 62 Gambian individuals and then using these data to conduct multipoint imputation in the GWA samples. This increased the signal of association, from P = 4 × 10(-7) to P = 4 × 10(-14), with the peak of the signal located precisely at the HbS causal variant. Our findings provide proof of principle that fine-resolution multipoint imputation, based on population-specific sequencing data, can substantially boost authentic GWA signals and enable fine mapping of causal variants in African populations.

  15. A genome scan for quantitative trait loci affecting body conformation traits in Spanish Churra dairy sheep.

    PubMed

    Gutiérrez-Gil, B; Alvarez, L; de la Fuente, L F; Sanchez, J P; San Primitivo, F; Arranz, J J

    2011-08-01

    A genome scan for chromosomal regions influencing body conformation traits was conducted for a population of Spanish Churra dairy sheep following a daughter design. A total of 739 ewes from 11 half-sib sire families were included in the study. The ewes were scored for the 5 linear traits used in the breeding scheme of the Churra breed to assess body conformation: stature, rear legs-rear view, foot angle, rump width, and general appearance. All the animals, including the 11 sires, were genotyped for 181 microsatellite markers evenly distributed across the 26 sheep autosomes. Using the yield deviations of the raw scores adjusted for fixed factors as phenotypic measurements, a quantitative trait loci (QTL) analysis was performed on the basis of a multi-marker regression method. Seven suggestive QTL were identified on chromosomes Ovis aries (OAR)2, OAR5, OAR16, OAR23, and OAR26, but none reached a genome-wise significance level. Putative QTL were identified for all of the traits analyzed, except for general appearance score. The suggestive QTL showing the highest test statistic influenced rear legs-rear view and was localized on OAR16, close to the growth hormone receptor coding gene, GHR. Some of the putative linkage associations reported here are consistent with previously reported QTL in cattle for similar traits. To the best of our knowledge, this study provides the first report of QTL for body conformation traits in dairy sheep; further studies will be needed to confirm and redefine the linkage associations reported herein. It is expected that future genome-wide association analyses of larger families will help identify genes underlying these putative genetic effects and provide useful markers for marker-assisted selection of such functional traits.

  16. A genome-wide association study of brain lesion distribution in multiple sclerosis.

    PubMed

    Gourraud, Pierre-Antoine; Sdika, Michael; Khankhanian, Pouya; Henry, Roland G; Beheshtian, Azadeh; Matthews, Paul M; Hauser, Stephen L; Oksenberg, Jorge R; Pelletier, Daniel; Baranzini, Sergio E

    2013-04-01

    Brain magnetic resonance imaging is widely used as a diagnostic and monitoring tool in multiple sclerosis and provides a non-invasive, sensitive and reproducible way to track the disease. Topological characteristics relating to the distribution and shape of lesions are recognized as important neuroradiological markers in the diagnosis of multiple sclerosis, although these have been much less well characterized quantitatively than have traditional measures such as T2 hyperintense or T1 hypointense lesion volumes. Here, we used voxel-level 3 T magnetic resonance imaging T1-weighted scans to reconstruct the 3D topology of lesions in 284 subjects with multiple sclerosis and tested whether this is a heritable phenotype. To this end, we extracted the genotypes from a published genome-wide association study on these same individuals and searched for genetic associations with lesion load, shape and topological distribution. Lesion probability maps were created to identify frequently affected areas and to assess the overall distribution of T1 lesions in the subject population as a whole. We then developed an original algorithm to cluster adjacent lesional voxels (cluxels) in each subject and tested whether cluxel topology was significantly associated with any single-nucleotide polymorphism in our data set. To focus on patterns of lesion distribution, we computed the first 10 principal components. Although principal component 1 correlated with lesion load, none of the remaining orthogonal components correlated with any other known variable. We then conducted genome-wide association studies on each of these and found 31 significant associations (false discovery rate <0.01) with principal component 8, which represents a mode of variation of lesion topology in the population. The majority of the loci can be linked to genes related to immune cell function and to myelin and neural growth; some (SYK, MYT1L, TRAPPC9, SLITKR6 and RIC3) have been previously associated with the

  17. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions.

    PubMed

    Wang, Yu; Li, Wei; Xia, Yingying; Wang, Chongzhi; Tang, Y Tom; Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2014-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information.

  18. Identifying Human Genome-Wide CNV, LOH and UPD by Targeted Sequencing of Selected Regions

    PubMed Central

    Guo, Wenying; Li, Jinliang; Zhao, Xia; Sun, Yepeng; Hu, Juan; Zhen, Hefu; Zhang, Xiandong; Chen, Chao; Shi, Yujian; Li, Lin; Cao, Hongzhi; Du, Hongli; Li, Jian

    2015-01-01

    Copy-number variations (CNV), loss of heterozygosity (LOH), and uniparental disomy (UPD) are large genomic aberrations leading to many common inherited diseases, cancers, and other complex diseases. An integrated tool to identify these aberrations is essential in understanding diseases and in designing clinical interventions. Previous discovery methods based on whole-genome sequencing (WGS) require very high depth of coverage on the whole genome scale, and are cost-wise inefficient. Another approach, whole exome genome sequencing (WEGS), is limited to discovering variations within exons. Thus, we are lacking efficient methods to detect genomic aberrations on the whole genome scale using next-generation sequencing technology. Here we present a method to identify genome-wide CNV, LOH and UPD for the human genome via selectively sequencing a small portion of genome termed Selected Target Regions (SeTRs). In our experiments, the SeTRs are covered by 99.73%~99.95% with sufficient depth. Our developed bioinformatics pipeline calls genome-wide CNVs with high confidence, revealing 8 credible events of LOH and 3 UPD events larger than 5M from 15 individual samples. We demonstrate that genome-wide CNV, LOH and UPD can be detected using a cost-effective SeTRs sequencing approach, and that LOH and UPD can be identified using just a sample grouping technique, without using a matched sample or familial information. PMID:25919136

  19. No Genome-Wide Protein Sequence Convergence for Echolocation

    PubMed Central

    Zou, Zhengting; Zhang, Jianzhi

    2015-01-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation. PMID:25631925

  20. No genome-wide protein sequence convergence for echolocation.

    PubMed

    Zou, Zhengting; Zhang, Jianzhi

    2015-05-01

    Toothed whales and two groups of bats independently acquired echolocation, the ability to locate and identify objects by reflected sound. Echolocation requires physiologically complex and coordinated vocal, auditory, and neural functions, but the molecular basis of the capacity for echolocation is not well understood. A recent study suggested that convergent amino acid substitutions widespread in the proteins of echolocators underlay the convergent origins of mammalian echolocation. Here, we show that genomic signatures of molecular convergence between echolocating lineages are generally no stronger than those between echolocating and comparable nonecholocating lineages. The same is true for the group of 29 hearing-related proteins claimed to be enriched with molecular convergence. Reexamining the previous selection test reveals several flaws and invalidates the asserted evidence for adaptive convergence. Together, these findings indicate that the reported genomic signatures of convergence largely reflect the background level of sequence convergence unrelated to the origins of echolocation.

  1. Genome-Wide Inference of Ancestral Recombination Graphs

    PubMed Central

    Rasmussen, Matthew D.; Hubisz, Melissa J.; Gronau, Ilan; Siepel, Adam

    2014-01-01

    The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the “ancestral recombination graph” (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explicit ARG inference is rarely used in applied population genomics. Here, we introduce a new algorithm for ARG inference that is efficient enough to apply to dozens of complete mammalian genomes. The key idea of our approach is to sample an ARG of chromosomes conditional on an ARG of chromosomes, an operation we call “threading.” Using techniques based on hidden Markov models, we can perform this threading operation exactly, up to the assumptions of the sequentially Markov coalescent and a discretization of time. An extension allows for threading of subtrees instead of individual sequences. Repeated application of these threading operations results in highly efficient Markov chain Monte Carlo samplers for ARGs. We have implemented these methods in a computer program called ARGweaver. Experiments with simulated data indicate that ARGweaver converges rapidly to the posterior distribution over ARGs and is effective in recovering various features of the ARG for dozens of sequences generated under realistic parameters for human populations. In applications of ARGweaver to 54 human genome sequences from Complete Genomics, we find clear signatures of natural selection, including regions of unusually ancient ancestry associated with balancing selection and reductions in allele age in sites under directional selection. The patterns we observe near protein-coding genes are consistent with a primary influence from background selection rather than hitchhiking, although we cannot rule out a contribution from recurrent selective sweeps. PMID:24831947

  2. Transcriptome-wide investigation of genomic imprinting in chicken.

    PubMed

    Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique

    2014-04-01

    Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken.

  3. Transcriptome-wide investigation of genomic imprinting in chicken

    PubMed Central

    Frésard, Laure; Leroux, Sophie; Servin, Bertrand; Gourichon, David; Dehais, Patrice; Cristobal, Magali San; Marsaud, Nathalie; Vignoles, Florence; Bed'hom, Bertrand; Coville, Jean-Luc; Hormozdiari, Farhad; Beaumont, Catherine; Zerjal, Tatiana; Vignal, Alain; Morisson, Mireille; Lagarrigue, Sandrine; Pitel, Frédérique

    2014-01-01

    Genomic imprinting is an epigenetic mechanism by which alleles of some specific genes are expressed in a parent-of-origin manner. It has been observed in mammals and marsupials, but not in birds. Until now, only a few genes orthologous to mammalian imprinted ones have been analyzed in chicken and did not demonstrate any evidence of imprinting in this species. However, several published observations such as imprinted-like QTL in poultry or reciprocal effects keep the question open. Our main objective was thus to screen the entire chicken genome for parental-allele-specific differential expression on whole embryonic transcriptomes, using high-throughput sequencing. To identify the parental origin of each observed haplotype, two chicken experimental populations were used, as inbred and as genetically distant as possible. Two families were produced from two reciprocal crosses. Transcripts from 20 embryos were sequenced using NGS technology, producing ∼200 Gb of sequences. This allowed the detection of 79 potentially imprinted SNPs, through an analysis method that we validated by detecting imprinting from mouse data already published. However, out of 23 candidates tested by pyrosequencing, none could be confirmed. These results come together, without a priori, with previous statements and phylogenetic considerations assessing the absence of genomic imprinting in chicken. PMID:24452801

  4. A resource of genome-wide single-nucleotide polymorphisms generated by RAD tag sequencing in the critically endangered European eel.

    PubMed

    Pujolar, J M; Jacobsen, M W; Frydenberg, J; Als, T D; Larsen, P F; Maes, G E; Zane, L; Jian, J B; Cheng, L; Hansen, M M

    2013-07-01

    Reduced representation genome sequencing such as restriction-site-associated DNA (RAD) sequencing is finding increased use to identify and genotype large numbers of single-nucleotide polymorphisms (SNPs) in model and nonmodel species. We generated a unique resource of novel SNP markers for the European eel using the RAD sequencing approach that was simultaneously identified and scored in a genome-wide scan of 30 individuals. Whereas genomic resources are increasingly becoming available for this species, including the recent release of a draft genome, no genome-wide set of SNP markers was available until now. The generated SNPs were widely distributed across the eel genome, aligning to 4779 different contigs and 19,703 different scaffolds. Significant variation was identified, with an average nucleotide diversity of 0.00529 across individuals. Results varied widely across the genome, ranging from 0.00048 to 0.00737 per locus. Based on the average nucleotide diversity across all loci, long-term effective population size was estimated to range between 132,000 and 1,320,000, which is much higher than previous estimates based on microsatellite loci. The generated SNP resource consisting of 82,425 loci and 376,918 associated SNPs provides a valuable tool for future population genetics and genomics studies and allows for targeting specific genes and particularly interesting regions of the eel genome.

  5. Genome scans for divergent selection in natural populations of the widespread hardwood species Eucalyptus grandis (Myrtaceae) using microsatellites

    PubMed Central

    Song, Zhijiao; Zhang, Miaomiao; Li, Fagen; Weng, Qijie; Zhou, Chanpin; Li, Mei; Li, Jie; Huang, Huanhua; Mo, Xiaoyong; Gan, Siming

    2016-01-01

    Identification of loci or genes under natural selection is important for both understanding the genetic basis of local adaptation and practical applications, and genome scans provide a powerful means for such identification purposes. In this study, genome-wide simple sequence repeats markers (SSRs) were used to scan for molecular footprints of divergent selection in Eucalyptus grandis, a hardwood species occurring widely in costal areas from 32° S to 16° S in Australia. High population diversity levels and weak population structure were detected with putatively neutral genomic SSRs. Using three FST outlier detection methods, a total of 58 outlying SSRs were collectively identified as loci under divergent selection against three non-correlated climatic variables, namely, mean annual temperature, isothermality and annual precipitation. Using a spatial analysis method, nine significant associations were revealed between FST outlier allele frequencies and climatic variables, involving seven alleles from five SSR loci. Of the five significant SSRs, two (EUCeSSR1044 and Embra394) contained alleles of putative genes with known functional importance for response to climatic factors. Our study presents critical information on the population diversity and structure of the important woody species E. grandis and provides insight into the adaptive responses of perennial trees to climatic variations. PMID:27748400

  6. Genome-wide microsatellite identification in the fungus Anisogramma anomala using Illumina sequencing and genome assembly.

    PubMed

    Cai, Guohong; Leadbetter, Clayton W; Muehlbauer, Megan F; Molnar, Thomas J; Hillman, Bradley I

    2013-01-01

    High-throughput sequencing has been dramatically accelerating the discovery of microsatellite markers (also known as Simple Sequence Repeats). Both 454 and Illumina reads have been used directly in microsatellite discovery and primer design (the "Seq-to-SSR" approach). However, constraints of this approach include: 1) many microsatellite-containing reads do not have sufficient flanking sequences to allow primer design, and 2) difficulties in removing microsatellite loci residing in longer, repetitive regions. In the current study, we applied the novel "Seq-Assembly-SSR" approach to overcome these constraints in Anisogramma anomala. In our approach, Illumina reads were first assembled into a draft genome, and the latter was then used in microsatellite discovery. A. anomala is an obligate biotrophic ascomycete that causes eastern filbert blight disease of commercial European hazelnut. Little is known about its population structure or diversity. Approximately 26 M 146 bp Illumina reads were generated from a paired-end library of a fungal strain from Oregon. The reads were assembled into a draft genome of 333 Mb (excluding gaps), with contig N50 of 10,384 bp and scaffold N50 of 32,987 bp. A bioinformatics pipeline identified 46,677 microsatellite motifs at 44,247 loci, including 2,430 compound loci. Primers were successfully designed for 42,923 loci (97%). After removing 2,886 loci close to assembly gaps and 676 loci in repetitive regions, a genome-wide microsatellite database of 39,361 loci was generated for the fungus. In experimental screening of 236 loci using four geographically representative strains, 228 (96.6%) were successfully amplified and 214 (90.7%) produced single PCR products. Twenty-three (9.7%) were found to be perfect polymorphic loci. A small-scale population study using 11 polymorphic loci revealed considerable gene diversity. Clustering analysis grouped isolates of this fungus into two clades in accordance with their geographic origins. Thus, the

  7. Genome-wide association analysis of autoantibody positivity in type 1 diabetes cases.

    PubMed

    Plagnol, Vincent; Howson, Joanna M M; Smyth, Deborah J; Walker, Neil; Hafler, Jason P; Wallace, Chris; Stevens, Helen; Jackson, Laura; Simmonds, Matthew J; Bingley, Polly J; Gough, Stephen C; Todd, John A

    2011-08-01

    The genetic basis of autoantibody production is largely unknown outside of associations located in the major histocompatibility complex (MHC) human leukocyte antigen (HLA) region. The aim of this study is the discovery of new genetic associations with autoantibody positivity using genome-wide association scan single nucleotide polymorphism (SNP) data in type 1 diabetes (T1D) patients with autoantibody measurements. We measured two anti-islet autoantibodies, glutamate decarboxylase (GADA, n = 2,506), insulinoma-associated antigen 2 (IA-2A, n = 2,498), antibodies to the autoimmune thyroid (Graves') disease (AITD) autoantigen thyroid peroxidase (TPOA, n = 8,300), and antibodies against gastric parietal cells (PCA, n = 4,328) that are associated with autoimmune gastritis. Two loci passed a stringent genome-wide significance level (p<10(-10)): 1q23/FCRL3 with IA-2A and 9q34/ABO with PCA. Eleven of 52 non-MHC T1D loci showed evidence of association with at least one autoantibody at a false discovery rate of 16%: 16p11/IL27-IA-2A, 2q24/IFIH1-IA-2A and PCA, 2q32/STAT4-TPOA, 10p15/IL2RA-GADA, 6q15/BACH2-TPOA, 21q22/UBASH3A-TPOA, 1p13/PTPN22-TPOA, 2q33/CTLA4-TPOA, 4q27/IL2/TPOA, 15q14/RASGRP1/TPOA, and 12q24/SH2B3-GADA and TPOA. Analysis of the TPOA-associated loci in 2,477 cases with Graves' disease identified two new AITD loci (BACH2 and UBASH3A).

  8. Genome-wide interaction study of smoking and bladder cancer risk

    PubMed Central

    Figueroa, Jonine D.; Han, Summer S.; Garcia-Closas, Montserrat; Baris, Dalsu; Jacobs, Eric J.; Kogevinas, Manolis; Schwenn, Molly; Malats, Nuria; Johnson, Alison; Purdue, Mark P.; Caporaso, Neil; Landi, Maria Teresa; Prokunina-Olsson, Ludmila; Wang, Zhaoming; Hutchinson, Amy; Burdette, Laurie; Wheeler, William; Vineis, Paolo; Siddiq, Afshan; Cortessis, Victoria K.; Kooperberg, Charles; Cussenot, Olivier; Benhamou, Simone; Prescott, Jennifer; Porru, Stefano; Bueno-de-Mesquita, H.Bas; Trichopoulos, Dimitrios; Ljungberg, Börje; Clavel-Chapelon, Françoise; Weiderpass, Elisabete; Krogh, Vittorio; Dorronsoro, Miren; Travis, Ruth; Tjønneland, Anne; Brenan, Paul; Chang-Claude, Jenny; Riboli, Elio; Conti, David; Gago-Dominguez, Manuela; Stern, Mariana C.; Pike, Malcolm C.; Van Den Berg, David; Yuan, Jian-Min; Hohensee, Chancellor; Rodabough, Rebecca; Cancel-Tassin, Geraldine; Roupret, Morgan; Comperat, Eva; Chen, Constance; De Vivo, Immaculata; Giovannucci, Edward; Hunter, David J.; Kraft, Peter; Lindstrom, Sara; Carta, Angela; Pavanello, Sofia; Arici, Cecilia; Mastrangelo, Giuseppe; Karagas, Margaret R.; Schned, Alan; Armenti, Karla R.; Hosain, G.M.Monawar; Haiman, Chris A.; Fraumeni, Joseph F.; Chanock, Stephen J.; Chatterjee, Nilanjan; Rothman, Nathaniel; Silverman, Debra T.

    2014-01-01

    Bladder cancer is a complex disease with known environmental and genetic risk factors. We performed a genome-wide interaction study (GWAS) of smoking and bladder cancer risk based on primary scan data from 3002 cases and 4411 controls from the National Cancer Institute Bladder Cancer GWAS. Alternative methods were used to evaluate both additive and multiplicative interactions between individual single nucleotide polymorphisms (SNPs) and smoking exposure. SNPs with interaction P values < 5 × 10− 5 were evaluated further in an independent dataset of 2422 bladder cancer cases and 5751 controls. We identified 10 SNPs that showed association in a consistent manner with the initial dataset and in the combined dataset, providing evidence of interaction with tobacco use. Further, two of these novel SNPs showed strong evidence of association with bladder cancer in tobacco use subgroups that approached genome-wide significance. Specifically, rs1711973 (FOXF2) on 6p25.3 was a susceptibility SNP for never smokers [combined odds ratio (OR) = 1.34, 95% confidence interval (CI) = 1.20–1.50, P value = 5.18 × 10− 7]; and rs12216499 (RSPH3-TAGAP-EZR) on 6q25.3 was a susceptibility SNP for ever smokers (combined OR = 0.75, 95% CI = 0.67–0.84, P value = 6.35 × 10− 7). In our analysis of smoking and bladder cancer, the tests for multiplicative interaction seemed to more commonly identify susceptibility loci with associations in never smokers, whereas the additive interaction analysis identified more loci with associations among smokers—including the known smoking and NAT2 acetylation interaction. Our findings provide additional evidence of gene–environment interactions for tobacco and bladder cancer. PMID:24662972

  9. Genome-wide meta-analysis of common variant differences between men and women

    PubMed Central

    Boraska, Vesna; Jerončić, Ana; Colonna, Vincenza; Southam, Lorraine; Nyholt, Dale R.; William Rayner, Nigel; Perry, John R.B.; Toniolo, Daniela; Albrecht, Eva; Ang, Wei; Bandinelli, Stefania; Barbalic, Maja; Barroso, Inês; Beckmann, Jacques S.; Biffar, Reiner; Boomsma, Dorret; Campbell, Harry; Corre, Tanguy; Erdmann, Jeanette; Esko, Tõnu; Fischer, Krista; Franceschini, Nora; Frayling, Timothy M.; Girotto, Giorgia; Gonzalez, Juan R.; Harris, Tamara B.; Heath, Andrew C.; Heid, Iris M.; Hoffmann, Wolfgang; Hofman, Albert; Horikoshi, Momoko; Hua Zhao, Jing; Jackson, Anne U.; Hottenga, Jouke-Jan; Jula, Antti; Kähönen, Mika; Khaw, Kay-Tee; Kiemeney, Lambertus A.; Klopp, Norman; Kutalik, Zoltán; Lagou, Vasiliki; Launer, Lenore J.; Lehtimäki, Terho; Lemire, Mathieu; Lokki, Marja-Liisa; Loley, Christina; Luan, Jian'an; Mangino, Massimo; Mateo Leach, Irene; Medland, Sarah E.; Mihailov, Evelin; Montgomery, Grant W.; Navis, Gerjan; Newnham, John; Nieminen, Markku S.; Palotie, Aarno; Panoutsopoulou, Kalliope; Peters, Annette; Pirastu, Nicola; Polašek, Ozren; Rehnström, Karola; Ripatti, Samuli; Ritchie, Graham R.S.; Rivadeneira, Fernando; Robino, Antonietta; Samani, Nilesh J.; Shin, So-Youn; Sinisalo, Juha; Smit, Johannes H.; Soranzo, Nicole; Stolk, Lisette; Swinkels, Dorine W.; Tanaka, Toshiko; Teumer, Alexander; Tönjes, Anke; Traglia, Michela; Tuomilehto, Jaakko; Valsesia, Armand; van Gilst, Wiek H.; van Meurs, Joyce B.J.; Smith, Albert Vernon; Viikari, Jorma; Vink, Jacqueline M.; Waeber, Gerard; Warrington, Nicole M.; Widen, Elisabeth; Willemsen, Gonneke; Wright, Alan F.; Zanke, Brent W.; Zgaga, Lina; Boehnke, Michael; d'Adamo, Adamo Pio; de Geus, Eco; Demerath, Ellen W.; den Heijer, Martin; Eriksson, Johan G.; Ferrucci, Luigi; Gieger, Christian; Gudnason, Vilmundur; Hayward, Caroline; Hengstenberg, Christian; Hudson, Thomas J.; Järvelin, Marjo-Riitta; Kogevinas, Manolis; Loos, Ruth J.F.; Martin, Nicholas G.; Metspalu, Andres; Pennell, Craig E.; Penninx, Brenda W.; Perola, Markus; Raitakari, Olli; Salomaa, Veikko; Schreiber, Stefan; Schunkert, Heribert; Spector, Tim D.; Stumvoll, Michael; Uitterlinden, André G.; Ulivi, Sheila; van der Harst, Pim; Vollenweider, Peter; Völzke, Henry; Wareham, Nicholas J.; Wichmann, H.-Erich; Wilson, James F.; Rudan, Igor; Xue, Yali; Zeggini, Eleftheria

    2012-01-01

    The male-to-female sex ratio at birth is constant across world populations with an average of 1.06 (106 male to 100 female live births) for populations of European descent. The sex ratio is considered to be affected by numerous biological and environmental factors and to have a heritable component. The aim of this study was to investigate the presence of common allele modest effects at autosomal and chromosome X variants that could explain the observed sex ratio at birth. We conducted a large-scale genome-wide association scan (GWAS) meta-analysis across 51 studies, comprising overall 114 863 individuals (61 094 women and 53 769 men) of European ancestry and 2 623 828 common (minor allele frequency >0.05) single-nucleotide polymorphisms (SNPs). Allele frequencies were compared between men and women for directly-typed and imputed variants within each study. Forward-time simulations for unlinked, neutral, autosomal, common loci were performed under the demographic model for European populations with a fixed sex ratio and a random mating scheme to assess the probability of detecting significant allele frequency differences. We do not detect any genome-wide significant (P < 5 × 10−8) common SNP differences between men and women in this well-powered meta-analysis. The simulated data provided results entirely consistent with these findings. This large-scale investigation across ∼115 000 individuals shows no detectable contribution from common genetic variants to the observed skew in the sex ratio. The absence of sex-specific differences is useful in guiding genetic association study design, for example when using mixed controls for sex-biased traits. PMID:22843499

  10. Ensembl Genomes 2013: scaling up access to genome-wide data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provi...

  11. A genome-wide map of diversity in Plasmodium falciparum.

    PubMed

    Volkman, Sarah K; Sabeti, Pardis C; DeCaprio, David; Neafsey, Daniel E; Schaffner, Stephen F; Milner, Danny A; Daily, Johanna P; Sarr, Ousmane; Ndiaye, Daouda; Ndir, Omar; Mboup, Soulyemane; Duraisingh, Manoj T; Lukens, Amanda; Derr, Alan; Stange-Thomann, Nicole; Waggoner, Skye; Onofrio, Robert; Ziaugra, Liuda; Mauceli, Evan; Gnerre, Sante; Jaffe, David B; Zainoun, Joanne; Wiegand, Roger C; Birren, Bruce W; Hartl, Daniel L; Galagan, James E; Lander, Eric S; Wirth, Dyann F

    2007-01-01

    Genetic variation allows the malaria parasite Plasmodium falciparum to overcome chemotherapeutic agents, vaccines and vector control strategies and remain a leading cause of global morbidity and mortality. Here we describe an initial survey of genetic variation across the P. falciparum genome. We performed extensive sequencing of 16 geographically diverse parasites and identified 46,937 SNPs, demonstrating rich diversity among P. falciparum parasites (pi = 1.16 x 10(-3)) and strong correlation with gene function. We identified multiple regions with signatures of selective sweeps in drug-resistant parasites, including a previously unidentified 160-kb region with extremely low polymorphism in pyrimethamine-resistant parasites. We further characterized 54 worldwide isolates by genotyping SNPs across 20 genomic regions. These data begin to define population structure among African, Asian and American groups and illustrate the degree of linkage disequilibrium, which extends over relatively short distances in African parasites but over longer distances in Asian parasites. We provide an initial map of genetic diversity in P. falciparum and demonstrate its potential utility in identifying genes subject to recent natural selection and in understanding the population genetics of this parasite.

  12. Genome-wide homozygosity signature and risk of Hodgkin lymphoma

    PubMed Central

    Sud, Amit; Cooke, Rosie; Swerdlow, Anthony J.; Houlston, Richard S.

    2015-01-01

    Recent studies have reported that regions of homozygosity (ROH) in the genome are detectable in outbred populations and can be associated with an increased risk of malignancy. To examine whether homozygosity is associated with an increased risk of developing Hodgkin lymphoma (HL) we analysed 589 HL cases and 5,199 controls genotyped for 484,072 tag single nucleotide polymorphisms (SNPs). Across the genome the cumulative distribution of ROH was not significantly different between cases and controls. Seven ROH at 4q22.3, 4q32.2, 7p12.3–14.1, 7p22.2, 10p11.22–23, 19q13.12-2 and 19p13.2 were associated with HL risk at P < 0.01. Intriguingly 4q22.3 harbours an ROH to which the nuclear factor NF-kappa-B p105 subunit (NFKB1) maps (P = 0.002). The ROH at 19q13.12-2 has previously been implicated in B-cell precursor acute lymphoblastic leukaemia. Aside from these observations which require validation, it is unlikely that levels of measured homozygosity caused by autozygosity, uniparental isodisomy or hemizygosity play a major role in defining HL risk in predominantly outbred populations. PMID:26391888

  13. In Search of Genes Associated with Risk for Psychopathic Tendencies in Children: A Two-Stage Genome-Wide Association Study of Pooled DNA

    ERIC Educational Resources Information Center

    Viding, Essi; Hanscombe, Ken B.; Curtis, Charles J. C.; Davis, Oliver S. P.; Meaburn, Emma L.; Plomin, Robert

    2010-01-01

    Background: Quantitative genetic data from our group indicates that antisocial behaviour (AB) is strongly heritable when coupled with psychopathic, callous-unemotional (CU) personality traits. We have also demonstrated that the genetic influences for AB and CU overlap considerably. We conducted a genome-wide association scan that capitalises on…

  14. Genome-wide linkage using the Social Responsiveness Scale in Utah autism pedigrees

    PubMed Central

    2010-01-01

    Background Autism Spectrum Disorders (ASD) are phenotypically heterogeneous, characterized by impairments in the development of communication and social behaviour and the presence of repetitive behaviour and restricted interests. Dissecting the genetic complexity of ASD may require phenotypic data reflecting more detail than is offered by a categorical clinical diagnosis. Such data are available from the Social Responsiveness Scale (SRS) which is a continuous, quantitative measure of social ability giving scores that range from significant impairment to above average ability. Methods We present genome-wide results for 64 multiplex and extended families ranging from two to nine generations. SRS scores were available from 518 genotyped pedigree subjects, including affected and unaffected relatives. Genotypes from the Illumina 6 k single nucleotide polymorphism panel were provided by the Center for Inherited Disease Research. Quantitative and qualitative analyses were done using MCLINK, a software package that uses Markov chain Monte Carlo (MCMC) methods to perform multilocus linkage analysis on large extended pedigrees. Results When analysed as a qualitative trait, linkage occurred in the same locations as in our previous affected-only genome scan of these families, with findings on chromosomes 7q31.1-q32.3 [heterogeneity logarithm of the odds (HLOD) = 2.91], 15q13.3 (HLOD = 3.64), and 13q12.3 (HLOD = 2.23). Additional positive qualitative results were seen on chromosomes 6 and 10 in regions that may be of interest for other neuropsychiatric disorders. When analysed as a quantitative trait, results replicated a peak found in an independent sample using quantitative SRS scores on chromosome 11p15.1-p15.4 (HLOD = 2.77). Additional positive quantitative results were seen on chromosomes 7, 9, and 19. Conclusions The SRS linkage peaks reported here substantially overlap with peaks found in our previous affected-only genome scan of clinical diagnosis. In addition, we

  15. Genome-wide differentiation of various melon horticultural groups for use in genome wide association study for fruit firmness and construction of a high resolution genetic map

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We generated 13,789 single nucleotide plymorphism (SNP) markers from 97 melon accessions using genotyping by sequencing and anchored them to chromosomes to understand genome-wide fixation index between various melon morphotypes and linkage disequilibrium (LD) decay for inodorus and cantalupensis, th...

  16. Genome-wide searches for bipolar disorder genes.

    PubMed

    Alsabban, Shaza; Rivera, Margarita; McGuffin, Peter

    2011-12-01

    Whole-genome linkage and association studies of bipolar disorder are beginning to provide some compelling evidence for the involvement of several chromosomal regions and susceptibility genes in the pathogenesis of bipolar disorder. Developments in genotyping technology and efforts to combine data from different studies have helped in identifying chromosomes 6q16-q25, 13q, and 16p12 as probable susceptibility loci for bipolar disorder and confirmed CACNA1C and ANK3 as susceptibility genes for bipolar disorder. However, a lack of replication is still apparent in the literature. New studies focusing on copy number variants as well as new analytical approaches utilizing pathway analysis offer a new direction in the study of the genetics of bipolar disorder.

  17. Enhancer function: mechanistic and genome-wide insights come together.

    PubMed

    Plank, Jennifer L; Dean, Ann

    2014-07-03

    Enhancers establish spatial or temporal patterns of gene expression that are critical for development, yet our understanding of how these DNA cis-regulatory elements function from a distance to increase transcription of their target genes and shape the cellular transcriptome has been gleaned primarily from studies of individual genes or gene families. High-throughput sequencing studies place enhancer-gene interactions within the 3D context of chromosome folding, inviting a new look at enhancer function and stimulating provocative new questions. Here, we integrate these whole-genome studies with recent mechanistic studies to illuminate how enhancers physically interact with target genes, how enhancer activity is regulated during development, and the role of noncoding RNAs transcribed from enhancers in their function.

  18. Genome-wide analysis of Polycomb targets in Drosophila

    SciTech Connect

    Schwartz, Yuri B.; Kahn, Tatyana G.; Nix, David A.; Li,Xiao-Yong; Bourgon, Richard; Biggin, Mark; Pirrotta, Vincenzo

    2006-04-01

    Polycomb Group (PcG) complexes are multiprotein assemblages that bind to chromatin and establish chromatin states leading to epigenetic silencing. PcG proteins regulate homeotic genes in flies and vertebrates but little is known about other PcG targets and the role of the PcG in development, differentiation and disease. We have determined the distribution of the PcG proteins PC, E(Z) and PSC and of histone H3K27 trimethylation in the Drosophila genome. At more than 200 PcG target genes, binding sites for the three PcG proteins colocalize to presumptive Polycomb Response Elements (PREs). In contrast, H3 me3K27 forms broad domains including the entire transcription unit and regulatory regions. PcG targets are highly enriched in genes encoding transcription factors but receptors, signaling proteins, morphogens and regulators representing all major developmental pathways are also included.

  19. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome wide SNPs analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signature of positive selection in their genome, with the aim...

  20. Genome-wide Association Analysis of Kernel Weight in Hard Winter Wheat

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Wheat kernel weight is an important and heritable component of wheat grain yield and a key predictor of flour extraction. Genome-wide association analysis was conducted to identify genomic regions associated with kernel weight and kernel weight environmental response in 8 trials of 299 hard winter ...

  1. Implementing Meta-analysis for genome-wide association studies of pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is a critical concern in the meat industry. Implementation of genome-wide association studies (GWA) allows identification of genomic regions that explain a substantial portion of the variation of relevant traits. It is also important to determine the consistency of results of GWA across...

  2. Genome-wide association as a means to understanding the mammary gland

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Next-generation sequencing and related technologies have facilitated the creation of enormous public databases that catalogue genomic variation. These databases have facilitated a variety of approaches to discover new genes that regulate normal biology as well as disease. Genome wide association (...

  3. An Efficient Resampling Method for Assessing Genome-Wide Statistical Significance in Mapping Quantitative Trait Loci

    PubMed Central

    Zou, Fei; Fine, Jason P.; Hu, Jianhua; Lin, D. Y.

    2004-01-01

    Assessing genome-wide statistical significance is an important and difficult problem in multipoint linkage analysis. Due to multiple tests on the same genome, the usual pointwise significance level based on the chi-square approximation is inappropriate. Permutation is widely used to determine genome-wide significance. Theoretical approximations are available for simple experimental crosses. In this article, we propose a resampling procedure to assess the significance of genome-wide QTL mapping for experimental crosses. The proposed method is computationally much less intensive than the permutation procedure (in the order of 102 or higher) and is applicable to complex breeding designs and sophisticated genetic models that cannot be handled by the permutation and theoretical methods. The usefulness of the proposed method is demonstrated through simulation studies and an application to a Drosophila backcross. PMID:15611194

  4. Systems Genetic Validation of the SNP-Metabolite Association in Rice Via Metabolite-Pathway-Based Phenome-Wide Association Scans.

    PubMed

    Lu, Yaping; Liu, Yemao; Niu, Xiaohui; Yang, Qingyong; Hu, Xuehai; Zhang, Hong-Yu; Xia, Jingbo

    2015-01-01

    In the post-GWAS (Genome-Wide Association Scan) era, the interpretation of GWAS results is crucial to screen for highly relevant phenotype-genotype association pairs. Based on the single genotype-phenotype association test and a pathway enrichment analysis, we propose a Metabolite-pathway-based Phenome-Wide Association Scan (M-PheWAS) to analyze the key metabolite-SNP pairs in rice and determine the regulatory relationship by assessing similarities in the changes of enzymes and downstream products in a pathway. Two SNPs, sf0315305925 and sf0315308337, were selected using this approach, and their molecular function and regulatory relationship with Enzyme EC:5.5.1.6 and with flavonoids, a significant downstream regulatory metabolite product, were demonstrated. Moreover, a total of 105 crucial SNPs were screened using M-PheWAS, which may be important for metabolite associations.

  5. Efficient multivariate linear mixed model algorithms for genome-wide association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2014-04-01

    Multivariate linear mixed models (mvLMMs) are powerful tools for testing associations between single-nucleotide polymorphisms and multiple correlated phenotypes while controlling for population stratification in genome-wide association studies. We present efficient algorithms in the genome-wide efficient mixed model association (GEMMA) software for fitting mvLMMs and computing likelihood ratio tests. These algorithms offer improved computation speed, power and P-value calibration over existing methods, and can deal with more than two phenotypes.

  6. Genome-wide association mapping in plants exemplified for root growth in Arabidopsis thaliana.

    PubMed

    Slovak, Radka; Göschl, Christian; Seren, Ümit; Busch, Wolfgang

    2015-01-01

    Genome-wide association (GWA) mapping is a powerful technique to address the molecular basis of genotype to phenotype relationships and to map regulators of biological processes. This chapter presents a protocol for genome-wide association mapping in Arabidopsis thaliana using the user-friendly internet application GWAPP, and provides a specific protocol for acquiring root trait data suitable for GWA studies using the semi-automated, high-throughput phenotyping pipeline BRAT for early root growth.

  7. Genome-wide target site triplication of Alu elements in the human genome.

    PubMed

    Lee, Wooseok; Mun, Seyoung; Kang, Keunsoo; Hennighausen, Lothar; Han, Kyudong

    2015-05-01

    Alu elements are the most successful short interspersed elements in primate genomes and their retrotransposition is a major source of genomic expansion. Alu elements integrate into genomic regions through target-site primed reverse transcription, which generates target site duplications (TSDs). Unexpectedly, we have identified target site triplications (TSTs) at some loci, where two Alu elements in tandem share one direct repeat. Thus, the three copies of the repeat are present. We located 212 TST loci in the human genome and examined 25 putative human-specific TST loci using PCR validation. As a result, 12 human-specific TST loci were identified. These findings suggest that unequal homologous recombination between TSDs can lead to TST. Through this mechanism, the copy number of Alu elements could have increased in primate genomes without new Alu retrotransposition events. This study provides new insight into the augmentation of Alu elements in the primate genome.

  8. Cryogenic Pressure Calibrator for Wide Temperature Electronically Scanned (ESP) Pressure Modules

    NASA Technical Reports Server (NTRS)

    Faulcon, Nettie D.

    2001-01-01

    Electronically scanned pressure (ESP) modules have been developed that can operate in ambient and in cryogenic environments, particularly Langley's National Transonic Facility (NTF). Because they can operate directly in a cryogenic environment, their use eliminates many of the operational problems associated with using conventional modules at low temperatures. To ensure the accuracy of these new instruments, calibration was conducted in a laboratory simulating the environmental conditions of NTF. This paper discusses the calibration process by means of the simulation laboratory, the system inputs and outputs and the analysis of the calibration data. Calibration results of module M4, a wide temperature ESP module with 16 ports and a pressure range of +/- 4 psid are given.

  9. A TECHNIQUE FOR PRIMARY BEAM CALIBRATION OF DRIFT-SCANNING, WIDE-FIELD ANTENNA ELEMENTS

    SciTech Connect

    Pober, Jonathan C.; Parsons, Aaron R.; Jacobs, Daniel C.; Aguirre, James E.; Moore, David F.; Bradley, Richard F.; Parashare, Chaitali R.; Carilli, Chris L.; Gugliucci, Nicole E.

    2012-02-15

    We present a new technique for calibrating the primary beam of a wide-field, drift-scanning antenna element. Drift-scan observing is not compatible with standard beam calibration routines, and the situation is further complicated by difficult-to-parameterize beam shapes and, at low frequencies, the sparsity of accurate source spectra to use as calibrators. We overcome these challenges by building up an interrelated network of source 'crossing points'-locations where the primary beam is sampled by multiple sources. Using the single assumption that a beam has 180 Degree-Sign rotational symmetry, we can achieve significant beam coverage with only a few tens of sources. The resulting network of crossing points allows us to solve for both a beam model and source flux densities referenced to a single calibrator source, circumventing the need for a large sample of well-characterized calibrators. We illustrate the method with actual and simulated observations from the Precision Array for Probing the Epoch of Reionization.

  10. Genome-Wide Transcriptome Analysis of Cadmium Stress in Rice

    PubMed Central

    Oono, Youko; Yazawa, Takayuki; Kanamori, Hiroyuki; Sasaki, Harumi; Mori, Satomi; Handa, Hirokazu; Matsumoto, Takashi

    2016-01-01

    Rice growth is severely affected by toxic concentrations of the nonessential heavy metal cadmium (Cd). To elucidate the molecular basis of the response to Cd stress, we performed mRNA sequencing of rice following our previous study on exposure to high concentrations of Cd (Oono et al., 2014). In this study, rice plants were hydroponically treated with low concentrations of Cd and approximately 211 million sequence reads were mapped onto the IRGSP-1.0 reference rice genome sequence. Many genes, including some identified under high Cd concentration exposure in our previous study, were found to be responsive to low Cd exposure, with an average of about 11,000 transcripts from each condition. However, genes expressed constitutively across the developmental course responded only slightly to low Cd concentrations, in contrast to their clear response to high Cd concentration, which causes fatal damage to rice seedlings according to phenotypic changes. The expression of metal ion transporter genes tended to correlate with Cd concentration, suggesting the potential of the RNA-Seq strategy to reveal novel Cd-responsive transporters by analyzing gene expression under different Cd concentrations. This study could help to develop novel strategies for improving tolerance to Cd exposure in rice and other cereal crops. PMID:27034955

  11. Genome-wide transcription responses to synchrotron microbeam radiotherapy.

    PubMed

    Sprung, Carl N; Yang, Yuqing; Forrester, Helen B; Li, Jason; Zaitseva, Marina; Cann, Leonie; Restall, Tina; Anderson, Robin L; Crosbie, Jeffrey C; Rogers, Peter A W

    2012-10-01

    The majority of cancer patients achieve benefit from radiotherapy. A significant limitation of radiotherapy is its relatively low therapeutic index, defined as the maximum radiation dose that causes acceptable normal tissue damage to the minimum dose required to achieve tumor control. Recently, a new radiotherapy modality using synchrotron-generated X-ray microbeam radiotherapy has been demonstrated in animal models to ablate tumors with concurrent sparing of normal tissue. Very little work has been undertaken into the cellular and molecular mechanisms that differentiate microbeam radiotherapy from broad beam. The purpose of this study was to investigate and compare the whole genome transcriptional response of in vivo microbeam radiotherapy versus broad beam irradiated tumors. We hypothesized that gene expression changes after microbeam radiotherapy are different from those seen after broad beam. We found that in EMT6.5 tumors at 4-48 h postirradiation, microbeam radiotherapy differentially regulates a number of genes, including major histocompatibility complex (MHC) class II antigen gene family members, and other immunity-related genes including Ciita, Ifng, Cxcl1, Cxcl9, Indo and Ubd when compared to broad beam. Our findings demonstrate molecular differences in the tumor response to microbeam versus broad beam irradiation and these differences provide insight into the underlying mechanisms of microbeam radiotherapy and broad beam.

  12. Genome-wide analysis of TCP family in tobacco.

    PubMed

    Chen, L; Chen, Y Q; Ding, A M; Chen, H; Xia, F; Wang, W F; Sun, Y H

    2016-05-23

    The TCP family is a transcription factor family, members of which are extensively involved in plant growth and development as well as in signal transduction in the response against many physiological and biochemical stimuli. In the present study, 61 TCP genes were identified in tobacco (Nicotiana tabacum) genome. Bioinformatic methods were employed for predicting and analyzing the gene structure, gene expression, phylogenetic analysis, and conserved domains of TCP proteins in tobacco. The 61 NtTCP genes were divided into three diverse groups, based on the division of TCP genes in tomato and Arabidopsis, and the results of the conserved domain and sequence analyses further confirmed the classification of the NtTCP genes. The expression pattern of NtTCP also demonstrated that majority of these genes play important roles in all the tissues, while some special genes exercise their functions only in specific tissues. In brief, the comprehensive and thorough study of the TCP family in other plants provides sufficient resources for studying the structure and functions of TCPs in tobacco.

  13. Genome wide distribution of illegitimate recombination events in Kluyveromyces lactis

    PubMed Central

    Kegel, Andreas; Martinez, Paula; Carter, Sidney D.; Åström, Stefan U.

    2006-01-01

    Illegitimate recombination (IR) is the process by which two DNA molecules not sharing homology to each other are joined. In Kluyveromyces lactis, integration of heterologous DNA occurred very frequently therefore constituting an excellent model organism to study IR. IR was completely dependent on the nonhomologous end-joining (NHEJ) pathway for DNA double strand break (DSB) repair and we detected no other pathways capable of mediating IR. NHEJ was very versatile, capable of repairing both blunt and non-complementary ends efficiently. Mapping the locations of genomic IR-events revealed target site preferences, in which intergenic regions (IGRs) and ribosomal DNA were overrepresented six-fold compared to open reading frames (ORFs). The IGR-events occurred predominantly within transcriptional regulatory regions. In a rad52 mutant strain IR still preferentially occurred at IGRs, indicating that DSBs in ORFs were not primarily repaired by homologous recombination (HR). Introduction of ectopic DSBs resulted in the efficient targeting of IR to these sites, strongly suggesting that IR occurred at spontaneous mitotic DSBs. The targeting efficiency was equal when ectopic breaks were introduced in an ORF or an IGR. We propose that spontaneous DSBs arise more frequently in transcriptional regulatory regions and in rDNA and such DSBs can be mapped by analyzing IR target sites. PMID:16549875

  14. Genome-Wide Identification of Francisella tularensis Virulence Determinants▿

    PubMed Central

    Su, Jingliang; Yang, Jun; Zhao, Daimin; Kawula, Thomas H.; Banas, Jeffrey A.; Zhang, Jing-Ren

    2007-01-01

    Francisella tularensis is a gram-negative pathogen that causes life-threatening infections in humans and has potential for use as a biological weapon. The genetic basis of the F. tularensis virulence is poorly understood. This study screened a total of 3,936 transposon mutants of the live vaccine strain for infection in a mouse model of respiratory tularemia by signature-tagged mutagenesis. We identified 341 mutants attenuated for infection in the lungs. The transposon disruptions were mapped to 95 different genes, virtually all of which are also present in the genomes of other F. tularensis strains, including human pathogenic F. tularensis strain Schu S4. A small subset of these attenuated mutants carried insertions in the genes encoding previously known virulence factors, but the majority of the identified genes have not been previously linked to F. tularensis virulence. Among these are genes encoding putative membrane proteins, proteins associated with stress responses, metabolic proteins, transporter proteins, and proteins with unknown functions. Several attenuated mutants contained disruptions in a putative capsule locus which partially resembles the poly-γ-glutamate capsule biosynthesis locus of Bacillus anthracis, the anthrax agent. Deletional mutation analysis confirmed that this locus is essential for F. tularensis virulence. PMID:17420240

  15. Genome wide distribution of illegitimate recombination events in Kluyveromyces lactis.

    PubMed

    Kegel, Andreas; Martinez, Paula; Carter, Sidney D; Aström, Stefan U

    2006-01-01

    Illegitimate recombination (IR) is the process by which two DNA molecules not sharing homology to each other are joined. In Kluyveromyces lactis, integration of heterologous DNA occurred very frequently therefore constituting an excellent model organism to study IR. IR was completely dependent on the nonhomologous end-joining (NHEJ) pathway for DNA double strand break (DSB) repair and we detected no other pathways capable of mediating IR. NHEJ was very versatile, capable of repairing both blunt and non-complementary ends efficiently. Mapping the locations of genomic IR-events revealed target site preferences, in which intergenic regions (IGRs) and ribosomal DNA were overrepresented six-fold compared to open reading frames (ORFs). The IGR-events occurred predominantly within transcriptional regulatory regions. In a rad52 mutant strain IR still preferentially occurred at IGRs, indicating that DSBs in ORFs were not primarily repaired by homologous recombination (HR). Introduction of ectopic DSBs resulted in the efficient targeting of IR to these sites, strongly suggesting that IR occurred at spontaneous mitotic DSBs. The targeting efficiency was equal when ectopic breaks were introduced in an ORF or an IGR. We propose that spontaneous DSBs arise more frequently in transcriptional regulatory regions and in rDNA and such DSBs can be mapped by analyzing IR target sites.

  16. Genome-wide association with delayed puberty in swine.

    PubMed

    Nonneman, D; Lents, C; Rohrer, G; Rempel, L; Vallet, J

    2014-02-01

    An improvement in the proportion of gilts entering the herd that farrow a litter would increase overall herd performance and profitability. A significant proportion (10-30%) of gilts that enter the herd never farrow a litter; reproductive reasons account for approximately a third of gilt removals, with anestrous and failure to conceive the most common reasons for culling. Tools to select gilts for reproductive longevity through genomics or alternative phenotypes would be of great benefit to the producer. Ninety-one gilts that failed to display behavioral estrus by 240 days (cases) and 127 pubertal littermates (controls) were genotyped with the Illumina Porcine SNP60 Beadchip. One hundred and seventy-four SNPs with the most significant associations were genotyped in an additional 86 cases and 103 controls. Twelve of these associations were significant in the final analysis. The most significant (P < 1.5 × 10(-14) ) region associated with failure to attain puberty was on chromosome 4 surrounding the NHLH2 gene. Delayed pubertal development and age at first estrus have been associated with NHLH2 in mice. Because attainment of puberty is a complex trait, identifying genes that affect pubertal age would greatly contribute to our knowledge of reproductive development as well as overall fertility.

  17. Genome-wide identification of RNA editing in hepatocellular carcinoma.

    PubMed

    Kang, Lin; Liu, Xiaoqiao; Gong, Zhoulin; Zheng, Hancheng; Wang, Jun; Li, Yingrui; Yang, Huanming; Hardwick, James; Dai, Hongyue; Poon, Ronnie T P; Lee, Nikki P; Mao, Mao; Peng, Zhiyu; Chen, Ronghua

    2015-02-01

    We did whole-transcriptome sequencing and whole-genome sequencing on nine pairs of Hepatocellular carcinoma (HCC) tumors and matched adjacent tissues to identify RNA editing events. We identified mean 26,982 editing sites with mean 89.5% canonical A→G edits in each sample using an improved bioinformatics pipeline. The editing rate was significantly higher in tumors than adjacent normal tissues. Comparing the difference between tumor and normal tissues of each patient, we found 7 non-synonymous tissue specific editing events including 4 tumor-specific edits and 3 normal-specific edits in the coding region, as well as 292 edits varying in editing degree. The significant expression changes of 150 genes associated with RNA editing were found in tumors, with 3 of the 4 most significant genes being cancer related. Our results show that editing might be related to higher gene expression. These findings indicate that RNA editing modification may play an important role in the development of HCC.

  18. Systematic Genome-wide Screening and Prediction of microRNAs in EBOV During the 2014 Ebolavirus Outbreak

    PubMed Central

    Teng, Yue; Wang, Yuzhuo; Zhang, Xianglilan; Liu, Wenli; Fan, Hang; Yao, Hongwu; Lin, Baihan; Zhu, Ping; Yuan, Wenjun; Tong, Yigang; Cao, Wuchun

    2015-01-01

    Recently, several thousand people have been killed by the Ebolavirus disease (EVD) in West Africa, yet no current antiviral medications and treatments are available. Systematic investigation of ebolavirus whole genomes during the 2014 outbreak may shed light on the underlying mechanisms of EVD development. Here, using the genome-wide screening in ebolavirus genome sequences, we predicted four putative viral microRNA precursors (pre-miRNAs) and seven putative mature microRNAs (miRNAs). Combing bioinformatics analysis and prediction of the potential ebolavirus miRNA target genes, we suggest that two ebolavirus coding possible miRNAs may be silence and down-regulate the target genes NFKBIE and RIPK1, which are the central mediator of the pathways related with host cell defense mechanism. Additionally, the ebolavirus exploits the miRNAs to inhibit the NF-kB and TNF factors to evade the host defense mechanisms that limit replication by killing infected cells, or to conversely trigger apoptosis as a mechanism to increase virus spreading. This is the first study to use the genome-wide scanning to predict microRNAs in the 2014 outbreak EVD and then to apply systematic bioinformatics to analyze their target genes. We revealed a potential mechanism of miRNAs in ebolavirus infection and possible therapeutic targets for Ebola viral infection treatment. PMID:26011078

  19. Genome-wide mining, characterization, and development of microsatellite markers in Marsupenaeus japonicus by genome survey sequencing

    NASA Astrophysics Data System (ADS)

    Lu, Xia; Luan, Sheng; Kong, Jie; Hu, Longyang; Mao, Yong; Zhong, Shengping

    2017-01-01

    The kuruma prawn, Marsupenaeus japonicus, is one of the most cultivated and consumed species of shrimp. However, very few molecular genetic/genomic resources are publically available for it. Thus, the characterization and distribution of simple sequence repeats (SSRs) remains ambiguous and the use of SSR markers in genomic studies and marker-assisted selection is limited. The goal of this study is to characterize and develop genome-wide SSR markers in M. japonicus by genome survey sequencing for application in comparative genomics and breeding. A total of 326 945 perfect SSRs were identified, among which dinucleotide repeats were the most frequent class (44.08%), followed by mononucleotides (29.67%), trinucleotides (18.96%), tetranucleotides (5.66%), hexanucleotides (1.07%), and pentanucleotides (0.56%). In total, 151 541 SSR loci primers were successfully designed. A subset of 30 SSR primer pairs were synthesized and tested in 42 individuals from a wild population, of which 27 loci (90.0%) were successfully amplified with specific products and 24 (80.0%) were polymorphic. For the amplified polymorphic loci, the alleles ranged from 5 to 17 (with an average of 9.63), and the average PIC value was 0.796. A total of 58 256 SSR-containing sequences had significant Gene Ontology annotation; these are good functional molecular marker candidates for association studies and comparative genomic analysis. The newly identified SSRs significantly contribute to the M. japonicus genomic resources and will facilitate a number of genetic and genomic studies, including high density linkage mapping, genome-wide association analysis, marker-aided selection, comparative genomics analysis, population genetics, and evolution.

  20. Genetic dissection of Al tolerance QTLs in the maize genome by high density SNP scan

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Aluminum (Al) toxicity is an important limitation to food security in the tropical and subtropical regions. High Al saturation in acid soils limits root development and its ability to uptake water and nutrients. In this study, we present a genome scan for Al tolerance loci with over 50,000 GBS-based...

  1. TALENs: a widely applicable technology for targeted genome editing.

    PubMed

    Joung, J Keith; Sander, Jeffry D

    2013-01-01

    Engineered nucleases enable the targeted alteration of nearly any gene in a wide range of cell types and organisms. The newly-developed transcription activator-like effector nucleases (TALENs) comprise a nonspecific DNA-cleaving nuclease fused to a DNA-binding domain that can be easily engineered so that TALENs can target essentially any sequence. The capability to quickly and efficiently alter genes using TALENs promises to have profound impacts on biological research and to yield potential therapeutic strategies for genetic diseases.

  2. genipe: an automated genome-wide imputation pipeline with automatic reporting and statistical tools.

    PubMed

    Lemieux Perreault, Louis-Philippe; Legault, Marc-André; Asselin, Géraldine; Dubé, Marie-Pierre

    2016-12-01

    Genotype imputation is now commonly performed following genome-wide genotyping experiments. Imputation increases the density of analyzed genotypes in the dataset, enabling fine-mapping across the genome. However, the process of imputation using the most recent publicly available reference datasets can require considerable computation power and the management of hundreds of large intermediate files. We have developed genipe, a complete genome-wide imputation pipeline which includes automatic reporting, imputed data indexing and management, and a suite of statistical tests for imputed data commonly used in genetic epidemiology (Sequence Kernel Association Test, Cox proportional hazards for survival analysis, and linear mixed models for repeated measurements in longitudinal studies).

  3. A bi-dimensional genome scan for prolificacy traits in pigs shows the existence of multiple epistatic QTL

    PubMed Central

    2009-01-01

    Background Prolificacy is the most important trait influencing the reproductive efficiency of pig production systems. The low heritability and sex-limited expression of prolificacy have hindered to some extent the improvement of this trait through artificial selection. Moreover, the relative contributions of additive, dominant and epistatic QTL to the genetic variance of pig prolificacy remain to be defined. In this work, we have undertaken this issue by performing one-dimensional and bi-dimensional genome scans for number of piglets born alive (NBA) and total number of piglets born (TNB) in a three generation Iberian by Meishan F2 intercross. Results The one-dimensional genome scan for NBA and TNB revealed the existence of two genome-wide highly significant QTL located on SSC13 (P < 0.001) and SSC17 (P < 0.01) with effects on both traits. This relative paucity of significant results contrasted very strongly with the wide array of highly significant epistatic QTL that emerged in the bi-dimensional genome-wide scan analysis. As much as 18 epistatic QTL were found for NBA (four at P < 0.01 and five at P < 0.05) and TNB (three at P < 0.01 and six at P < 0.05), respectively. These epistatic QTL were distributed in multiple genomic regions, which covered 13 of the 18 pig autosomes, and they had small individual effects that ranged between 3 to 4% of the phenotypic variance. Different patterns of interactions (a × a, a × d, d × a and d × d) were found amongst the epistatic QTL pairs identified in the current work. Conclusions The complex inheritance of prolificacy traits in pigs has been evidenced by identifying multiple additive (SSC13 and SSC17), dominant and epistatic QTL in an Iberian × Meishan F2 intercross. Our results demonstrate that a significant fraction of the phenotypic variance of swine prolificacy traits can be attributed to first-order gene-by-gene interactions emphasizing that the phenotypic effects of alleles might be strongly modulated by the

  4. An EST-based genome scan using 454 sequencing in the marine snail Littorina saxatilis.

    PubMed

    Galindo, J; Grahame, J W; Butlin, R K

    2010-09-01

    Genome scans have been used in the studies of ecological speciation to find genomic regions ('outlier loci') showing reduced gene flow between divergent populations/species. High-throughput sequencing ('454') offers new opportunities in this field via transcriptome sequencing. Divergent ecotypes of the marine gastropod Littorina saxatilis represent a good example of incipient ecological speciation. We performed a 454-based genome scan between H and M ecotypes of L. saxatilis from the British Isles using cDNA of pooled individuals. Allele frequencies were calculated for 2454 single nucleotide polymorphisms (SNPs), within 572 contigs, and 7% of loci were detected as outliers. Functional annotation of the contigs containing outlier SNPs showed that they included shell matrix and muscle proteins (lithostathine, mucin, titin), proteins involved in energetic metabolism (arginine kinase, NADH dehydrogenase) and reverse transcriptases. Follow-up investigations into these proteins and unannotated outliers will be a promising route in the study of ecological speciation in L. saxatilis.

  5. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast.

    PubMed

    Oud, Bart; van Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-03-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages.

  6. Genome-wide analytical approaches for reverse metabolic engineering of industrially relevant phenotypes in yeast

    PubMed Central

    Oud, Bart; Maris, Antonius J A; Daran, Jean-Marc; Pronk, Jack T

    2012-01-01

    Successful reverse engineering of mutants that have been obtained by nontargeted strain improvement has long presented a major challenge in yeast biotechnology. This paper reviews the use of genome-wide approaches for analysis of Saccharomyces cerevisiae strains originating from evolutionary engineering or random mutagenesis. On the basis of an evaluation of the strengths and weaknesses of different methods, we conclude that for the initial identification of relevant genetic changes, whole genome sequencing is superior to other analytical techniques, such as transcriptome, metabolome, proteome, or array-based genome analysis. Key advantages of this technique over gene expression analysis include the independency of genome sequences on experimental context and the possibility to directly and precisely reproduce the identified changes in naive strains. The predictive value of genome-wide analysis of strains with industrially relevant characteristics can be further improved by classical genetics or simultaneous analysis of strains derived from parallel, independent strain improvement lineages. PMID:22152095

  7. Systematic, genome-wide, sex-specific linkage of cardiovascular traits in French Canadians.

    PubMed

    Seda, Ondrej; Tremblay, Johanne; Gaudet, Daniel; Brunelle, Pierre-Luc; Gurau, Alexandru; Merlo, Ettore; Pilote, Louise; Orlov, Sergei N; Boulva, Francis; Petrovich, Milan; Kotchen, Theodore A; Cowley, Allen W; Hamet, Pavel

    2008-04-01

    The sexual dimorphism of cardiovascular traits, as well as susceptibility to a variety of related diseases, has long been recognized, yet their sex-specific genomic determinants are largely unknown. We systematically assessed the sex-specific heritability and linkage of 539 hemodynamic, metabolic, anthropometric, and humoral traits in 120 French-Canadian families from the Saguenay-Lac-St-Jean region of Quebec, Canada. We performed multipoint linkage analysis using microsatellite markers followed by peak-wide linkage scan based on Affymetrix Human Mapping 50K Array Xba240 single nucleotide polymorphism genotypes in 3 settings, including the entire sample and then separately in men and women. Nearly one half of the traits were age and sex independent, one quarter were both age and sex dependent, and one eighth were exclusively age or sex dependent. Sex-specific phenotypes are most frequent in heart rate and blood pressure categories, whereas sex- and age-independent determinants are predominant among humoral and biochemical parameters. Twenty sex-specific loci passing multiple testing criteria were corroborated by 2-point single nucleotide polymorphism linkage. Several resting systolic blood pressure measurements showed significant genotype-by-sex interaction, eg, male-specific locus at chromosome 12 (male-female logarithm of odds difference: 4.16; interaction P=0.0002), which was undetectable in the entire population, even after adjustment for sex. Detailed interrogation of this locus revealed a 220-kb block overlapping parts of TAO-kinase 3 and SUDS3 genes. In summary, a large number of complex cardiovascular traits display significant sexual dimorphism, for which we have demonstrated genomic determinants at the haplotype level. Many of these would have been missed in a traditional, sex-adjusted setting.

  8. Genome-Wide Association Study to Identify Common Variants Associated with Brachial Circumference: A Meta-Analysis of 14 Cohorts

    PubMed Central

    Boraska, Vesna; Day-Williams, Aaron; Franklin, Christopher S.; Elliott, Katherine S.; Panoutsopoulou, Kalliope; Tachmazidou, Ioanna; Albrecht, Eva; Bandinelli, Stefania; Beilin, Lawrence J.; Bochud, Murielle; Cadby, Gemma; Ernst, Florian; Evans, David M.; Hayward, Caroline; Hicks, Andrew A.; Huffman, Jennifer; Huth, Cornelia; James, Alan L.; Klopp, Norman; Kolcic, Ivana; Kutalik, Zoltán; Lawlor, Debbie A.; Musk, Arthur W.; Pehlic, Marina; Pennell, Craig E.; Perry, John R. B.; Peters, Annette; Polasek, Ozren; Pourcain, Beate St; Ring, Susan M.; Salvi, Erika; Schipf, Sabine; Staessen, Jan A.; Teumer, Alexander; Timpson, Nicholas; Vitart, Veronique; Warrington, Nicole M.; Yaghootkar, Hanieh; Zemunik, Tatijana; Zgaga, Lina; An, Ping; Anttila, Verneri; Borecki, Ingrid B.; Holmen, Jostein; Ntalla, Ioanna; Palotie, Aarno; Pietiläinen, Kirsi H.; Wedenoja, Juho; Winsvold, Bendik S.; Dedoussis, George V.; Kaprio, Jaakko; Province, Michael A.; Zwart, John-Anker; Burnier, Michel; Campbell, Harry; Cusi, Daniele; Davey Smith, George; Frayling, Timothy M.; Gieger, Christian; Palmer, Lyle J.; Pramstaller, Peter P.; Rudan, Igor; Völzke, Henry; Wichmann, H. -Erich; Wright, Alan F.; Zeggini, Eleftheria

    2012-01-01

    Brachial circumference (BC), also known as upper arm or mid arm circumference, can be used as an indicator of muscle mass and fat tissue, which are distributed differently in men and women. Analysis of anthropometric measures of peripheral fat distribution such as BC could help in understanding the complex pathophysiology behind overweight and obesity. The purpose of this study is to identify genetic variants associated with BC through a large-scale genome-wide association scan (GWAS) meta-analysis. We used fixed-effects meta-analysis to synthesise summary results across 14 GWAS discovery and 4 replication cohorts comprising overall 22,376 individuals (12,031 women and 10,345 men) of European ancestry. Individual analyses were carried out for men, women, and combined across sexes using linear regression and an additive genetic model: adjusted for age and adjusted for age and BMI. We prioritised signals for follow-up in two-stages. We did not detect any signals reaching genome-wide significance. The FTO rs9939609 SNP showed nominal evidence for association (p<0.05) in the age-adjusted strata for men and across both sexes. In this first GWAS meta-analysis for BC to date, we have not identified any genome-wide significant signals and do not observe robust association of previously established obesity loci with BC. Large-scale collaborations will be necessary to achieve higher power to detect loci underlying BC. PMID:22479309

  9. Genome-wide analysis of HPV integration in human cancers reveals recurrent, focal genomic instability

    PubMed Central

    Akagi, Keiko; Li, Jingfeng; Broutian, Tatevik R.; Padilla-Nash, Hesed; Xiao, Weihong; Jiang, Bo; Rocco, James W.; Teknos, Theodoros N.; Kumar, Bhavna; Wangsa, Danny; He, Dandan; Ried, Thomas; Symer, David E.; Gillison, Maura L.

    2014-01-01

    Genomic instability is a hallmark of human cancers, including the 5% caused by human papillomavirus (HPV). Here we report a striking association between HPV integration and adjacent host genomic structural variation in human cancer cell lines and primary tumors. Whole-genome sequencing revealed HPV integrants flanking and bridging extensive host genomic amplifications and rearrangements, including deletions, inversions, and chromosomal translocations. We present a model of “looping” by which HPV integrant-mediated DNA replication and recombination may result in viral–host DNA concatemers, frequently disrupting genes involved in oncogenesis and amplifying HPV oncogenes E6 and E7. Our high-resolution results shed new light on a catastrophic process, distinct from chromothripsis and other mutational processes, by which HPV directly promotes genomic instability. PMID:24201445

  10. SCAN+

    SciTech Connect

    Kenneth Krebs, John Svoboda

    2009-11-01

    SCAN+ is a software application specifically designed to control the positioning of a gamma spectrometer by a two dimensional translation system above spent fuel bundles located in a sealed spent fuel cask. The gamma spectrometer collects gamma spectrum information for the purpose of spent fuel cask fuel loading verification. SCAN+ performs manual and automatic gamma spectrometer positioning functions as-well-as exercising control of the gamma spectrometer data acquisitioning functions. Cask configuration files are used to determine the positions of spent fuel bundles. Cask scanning files are used to determine the desired scan paths for scanning a spent fuel cask allowing for automatic unattended cask scanning that may take several hours.

  11. Genome-wide linkage analysis for human longevity: Genetics of Healthy Ageing Study

    PubMed Central

    Beekman, Marian; Blanché, Hélène; Perola, Markus; Hervonen, Anti; Bezrukov, Vladyslav; Sikora, Ewa; Flachsbart, Frederieke; Christiansen, Lene; De Craen, Anton J.M.; Kirkwood, Tom B.L.; Rea, I. Meave; Poulain, Michel; Robine, Jean-Marie; Stazi, Maria Antonietta; Passarino, Giuseppe; Deiana, Luca; Gonos, Efstathios S.; Valensin, Silvana; Paternoster, Lavinia; Sørensen, Thorkild I.A.; Tan, Qihua; Helmer, Quinta; Van den Akker, Erik B.; Deelen, Joris; Martella, Francesca; Cordell, Heather J.; Ayers, Kristin L.; Vaupel, James W.; Törnwall, Outi; Johnson, Thomas E.; Schreiber, Stefan; Lathrop, Mark; Skytthe, Axel; Westendorp, Rudi G.J.; Christensen, Kaare; Gampe, Jutta; Nebel, Almut; Houwing-Duistermaat, Jeanine J.; Slagboom, P. Eline; Franceschi, Claudio

    2013-01-01

    Summary Clear evidence exists for heritability of human longevity, and much interest is focused on identifying genes associated with longer lives. To identify such longevity alleles, we performed the largest genome-wide linkage scan thus far reported. Linkage analyses included 2118 nonagenarian Caucasian sibling pairs that have been enrolled in fifteen study centers of eleven European countries as part of the Genetics of Healthy Ageing (GEHA) project. In the joint linkage analyses we observed four regions that show linkage with longevity; chromosome 14q11.2 (LOD=3.47), chromosome 17q12-q22 (LOD=2.95), chromosome 19p13.3-p13.11 (LOD=3.76) and chromosome 19q13.11-q13.32 (LOD=3.57). To fine map these regions linked to longevity, we performed association analysis using GWAS data in a subgroup of 1,228 unrelated nonagenarian and 1,907 geographically matched controls. Using a fixed effect meta-analysis approach, rs4420638 at the TOMM40/APOE/APOC1 gene locus showed significant association with longevity (p-value=9.6 × 10−8). By combined modeling of linkage and association we showed that association of longevity with APOEε4 and APOEε2 alleles explain the linkage at 19q13.11-q13.32 with p-value=0.02 and p-value=1.0 × 10−5, respectively. In the largest linkage scan thus far performed for human familial longevity, we confirm that the APOE locus is a longevity gene and that additional longevity loci may be identified at 14q11.2, 17q12-q22 and 19p13.3-p13.11. Since the latter linkage results are not explained by common variants, we suggest that rare variants play an important role in human familial longevity. PMID:23286790

  12. Genome-wide screening for DNA variants associated with reading and language traits

    PubMed Central

    Gialluisi, A; Newbury, D F; Wilcutt, E G; Olson, R K; DeFries, J C; Brandler, W M; Pennington, B F; Smith, S D; Scerri, T S; Simpson, N H; Luciano, M; Evans, D M; Bates, T C; Stein, J F; Talcott, J B; Monaco, A P; Paracchini, S; Francks, C; Fisher, S E

    2014-01-01

    Reading and language abilities are heritable traits that are likely to share some genetic influences with each other. To identify pleiotropic genetic variants affecting these traits, we first performed a genome-wide association scan (GWAS) meta-analysis using three richly characterized datasets comprising individuals with histories of reading or language problems, and their siblings. GWAS was performed in a total of 1862 participants using the first principal component computed from several quantitative measures of reading- and language-related abilities, both before and after adjustment for performance IQ. We identified novel suggestive associations at the SNPs rs59197085 and rs5995177 (uncorrected P ≈ 10–7 for each SNP), located respectively at the CCDC136/FLNC and RBFOX2 genes. Each of these SNPs then showed evidence for effects across multiple reading and language traits in univariate association testing against the individual traits. FLNC encodes a structural protein involved in cytoskeleton remodelling, while RBFOX2 is an important regulator of alternative splicing in neurons. The CCDC136/FLNC locus showed association with a comparable reading/language measure in an independent sample of 6434 participants from the general population, although involving distinct alleles of the associated SNP. Our datasets will form an important part of on-going international efforts to identify genes contributing to reading and language skills. PMID:25065397

  13. A multi-stage genome-wide association study of bladder cancer identifies multiple susceptibility loci

    PubMed Central

    Rothman, Nathaniel; Garcia-Closas, Montserrat; Chatterjee, Nilanjan; Malats, Nuria; Wu, Xifeng; Figueroa, Jonine; Real, Francisco X; Van Den Berg, David; Matullo, Giuseppe; Baris, Dalsu; Thun, Michael; Kiemeney, Lambertus A; Vineis, Paolo; De Vivo, Immaculata; Albanes, Demetrius; Purdue, Mark P; Rafnar, Thorunn; Hildebrandt, Michelle A T; Kiltie, Anne E; Cussenot, Olivier; Golka, Klaus; Kumar, Rajiv; Taylor, Jack A; Mayordomo, Jose I; Jacobs, Kevin B; Kogevinas, Manolis; Hutchinson, Amy; Wang, Zhaoming; Fu, Yi-Ping; Prokunina-Olsson, Ludmila; Burdette, Laurie; Yeager, Meredith; Wheeler, William; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Lloreta, Josep; Johnson, Alison; Schwenn, Molly; Karagas, Margaret R; Schned, Alan; Andriole, Gerald; Grubb, Robert; Black, Amanda; Jacobs, Eric J; Diver, W Ryan; Gapstur, Susan M; Weinstein, Stephanie J; Virtamo, Jarmo; Cortessis, Victoria K; Gago-Dominguez, Manuela; Pike, Malcolm C; Stern, Mariana C; Yuan, Jian-Min; Hunter, David; McGrath, Monica; Dinney, Colin P; Czerniak, Bogdan; Chen, Meng; Yang, Hushan; Vermeulen, Sita H; Aben, Katja K; Witjes, J Alfred; Makkinje, Remco R; Sulem, Patrick; Besenbacher, Soren; Stefansson, Kari; Riboli, Elio; Brennan, Paul; Panico, Salvatore; Navarro, Carmen; Allen, Naomi E; Bueno-de-Mesquita, H Bas; Trichopoulos, Dimitrios; Caporaso, Neil; Landi, Maria Teresa; Canzian, Federico; Ljungberg, Borje; Tjonneland, Anne; Clavel-Chapelon, Francoise; Bishop, David T; Teo, Mark T W; Knowles, Margaret A; Guarrera, Simonetta; Polidoro, Silvia; Ricceri, Fulvio; Sacerdote, Carlotta; Allione, Alessandra; Cancel-Tassin, Geraldine; Selinski, Silvia; Hengstler, Jan G; Dietrich, Holger; Fletcher, Tony; Rudnai, Peter; Gurzau, Eugen; Koppova, Kvetoslava; Bolick, Sophia C E; Godfrey, Ashley; Xu, Zongli; Sanz-Velez, José I; García-Prats, María D; Sanchez, Manuel; Valdivia, Gabriel; Porru, Stefano; Benhamou, Simone; Hoover, Robert N; Fraumeni, Joseph F; Silverman, Debra T; Chanock, Stephen J

    2010-01-01

    We conducted a multi-stage, genome-wide association study (GWAS) of bladder cancer with a primary scan of 589,299 single nucleotide polymorphisms (SNPs) in 3,532 cases and 5,120 controls of European descent (5 studies) followed by a replication strategy, which included 8,381 cases and 48,275 controls (16 studies). In a combined analysis, we identified three new regions associated with bladder cancer on chromosomes 22q13.1, 19q12 and 2q37.1; rs1014971, (P=8×10−12) maps to a non-genic region of chromosome 22q13.1; rs8102137 (P=2×10−11) on 19q12 maps to CCNE1; and rs11892031 (P=1×10−7) maps to the UGT1A cluster on 2q37.1. We confirmed four previous GWAS associations on chromosomes 3q28, 4p16.3, 8q24.21 and 8q24.3, validated previous candidate associations for the GSTM1 deletion (P=4×10−11) and a tag SNP for NAT2 acetylation status (P=4×10−11), as well as demonstrated smoking interactions with both regions. Our findings on common variants associated with bladder cancer risk should provide new insights into mechanisms of carcinogenesis. PMID:20972438

  14. Genome-Wide Association Analysis to Identify Loci for Milk Yield in Gyr Breed

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome scan was conducted to identify QTL affecting milk yield in a Brazilian Gyr population of progeny test bulls (N=319). Data used in this study was derived from traditional genetic evaluation records computed by the Embrapa Dairy Cattleand released in May/2009 (http://www.cnpgl.embrapa.br/nova...

  15. Ensembl Genomes 2013: scaling up access to genome-wide data

    PubMed Central

    Kersey, Paul Julian; Allen, James E.; Christensen, Mikkel; Davis, Paul; Falin, Lee J.; Grabmueller, Christoph; Hughes, Daniel Seth Toney; Humphrey, Jay; Kerhornou, Arnaud; Khobova, Julia; Langridge, Nicholas; McDowall, Mark D.; Maheswari, Uma; Maslen, Gareth; Nuhn, Michael; Ong, Chuang Kee; Paulini, Michael; Pedro, Helder; Toneva, Iliana; Tuli, Mary Ann; Walts, Brandon; Williams, Gareth; Wilson, Derek; Youens-Clark, Ken; Monaco, Marcela K.; Stein, Joshua; Wei, Xuehong; Ware, Doreen; Bolser, Daniel M.; Howe, Kevin Lee; Kulesha, Eugene; Lawson, Daniel; Staines, Daniel Michael

    2014-01-01

    Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future. PMID:24163254

  16. Implementing meta-analysis from genome-wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality plays an important role in the meat processing industry, thus different methodologies have been implemented to elucidate the genetic architecture of traits affecting meat quality. One of the most common and widely used approaches is to perform genome-wide association (GWA) studies. Howe...

  17. PEPIS: A Pipeline for Estimating Epistatic Effects in Quantitative Trait Locus Mapping and Genome-Wide Association Studies.

    PubMed

    Zhang, Wenchao; Dai, Xinbin; Wang, Qishan; Xu, Shizhong; Zhao, Patrick X

    2016-05-01

    The term epistasis refers to interactions between multiple genetic loci. Genetic epistasis is important in regulating biological function and is considered to explain part of the 'missing heritability,' which involves marginal genetic effects that cannot be accounted for in genome-wide association studies. Thus, the study of epistasis is of great interest to geneticists. However, estimating epistatic effects for quantitative traits is challenging due to the large number of interaction effects that must be estimated, thus significantly increasing computing demands. Here, we present a new web server-based tool, the Pipeline for estimating EPIStatic genetic effects (PEPIS), for analyzing polygenic epistatic effects. The PEPIS software package is based on a new linear mixed model that has been used to predict the performance of hybrid rice. The PEPIS includes two main sub-pipelines: the first for kinship matrix calculation, and the second for polygenic component analyses and genome scanning for main and epistatic effects. To accommodate the demand for high-performance computation, the PEPIS utilizes C/C++ for mathematical matrix computing. In addition, the modules for kinship matrix calculations and main and epistatic-effect genome scanning employ parallel computing technology that effectively utilizes multiple computer nodes across our networked cluster, thus significantly improving the computational speed. For example, when analyzing the same immortalized F2 rice population genotypic data examined in a previous study, the PEPIS returned identical results at each analysis step with the original prototype R code, but the computational time was reduced from more than one month to about five minutes. These advances will help overcome the bottleneck frequently encountered in genome wide epistatic genetic effect analysis and enable accommodation of the high computational demand. The PEPIS is publically available at http://bioinfo.noble.org/PolyGenic_QTL/.

  18. PEPIS: A Pipeline for Estimating Epistatic Effects in Quantitative Trait Locus Mapping and Genome-Wide Association Studies

    PubMed Central

    Dai, Xinbin; Wang, Qishan; Xu, Shizhong; Zhao, Patrick X.

    2016-01-01

    The term epistasis refers to interactions between multiple genetic loci. Genetic epistasis is important in regulating biological function and is considered to explain part of the ‘missing heritability,’ which involves marginal genetic effects that cannot be accounted for in genome-wide association studies. Thus, the study of epistasis is of great interest to geneticists. However, estimating epistatic effects for quantitative traits is challenging due to the large number of interaction effects that must be estimated, thus significantly increasing computing demands. Here, we present a new web server-based tool, the Pipeline for estimating EPIStatic genetic effects (PEPIS), for analyzing polygenic epistatic effects. The PEPIS software package is based on a new linear mixed model that has been used to predict the performance of hybrid rice. The PEPIS includes two main sub-pipelines: the first for kinship matrix calculation, and the second for polygenic component analyses and genome scanning for main and epistatic effects. To accommodate the demand for high-performance computation, the PEPIS utilizes C/C++ for mathematical matrix computing. In addition, the modules for kinship matrix calculations and main and epistatic-effect genome scanning employ parallel computing technology that effectively utilizes multiple computer nodes across our networked cluster, thus significantly improving the computational speed. For example, when analyzing the same immortalized F2 rice population genotypic data examined in a previous study, the PEPIS returned identical results at each analysis step with the original prototype R code, but the computational time was reduced from more than one month to about five minutes. These advances will help overcome the bottleneck frequently encountered in genome wide epistatic genetic effect analysis and enable accommodation of the high computational demand. The PEPIS is publically available at http://bioinfo.noble.org/PolyGenic_QTL/. PMID:27224861

  19. From Genome-Wide Association Study to Phenome-Wide Association Study: New Paradigms in Obesity Research.

    PubMed

    Zhang, Y-P; Zhang, Y-Y; Duan, D D

    2016-01-01

    Obesity is a condition in which excess body fat has accumulated over an extent that increases the risk of many chronic diseases. The current clinical classification of obesity is based on measurement of body mass index (BMI), waist-hip ratio, and body fat percentage. However, these measurements do not account for the wide individual variations in fat distribution, degree of fatness or health risks, and genetic variants identified in the genome-wide association studies (GWAS). In this review, we will address this important issue with the introduction of phenome, phenomics, and phenome-wide association study (PheWAS). We will discuss the new paradigm shift from GWAS to PheWAS in obesity research. In the era of precision medicine, phenomics and PheWAS provide the required approaches to better definition and classification of obesity according to the association of obese phenome with their unique molecular makeup, lifestyle, and environmental impact.

  20. Genome-Wide Survey of Large Rare Copy Number Variants in Alzheimer’s Disease Among Caribbean Hispanics

    PubMed Central

    Ghani, Mahdi; Pinto, Dalila; Lee, Joseph H.; Grinberg, Yakov; Sato, Christine; Moreno, Danielle; Scherer, Stephen W.; Mayeux, Richard; St. George-Hyslop, Peter; Rogaeva, Ekaterina

    2012-01-01

    Recently genome-wide association studies have identified significant association between Alzheimer’s disease (AD) and variations in CLU, PICALM, BIN1, CR1, MS4A4/MS4A6E, CD2AP, CD33, EPHA1, and ABCA7. However, the pathogenic variants in these loci have not yet been found. We conducted a genome-wide scan for large copy number variation (CNV) in a dataset of Caribbean Hispanic origin (554 controls and 559 AD cases that were previously investigated in a SNP-based genome-wide association study using Illumina HumanHap 650Y platform). We ran four CNV calling algorithms to obtain high-confidence calls for large CNVs (>100 kb) that were detected by at least two algorithms. Global burden analyses did not reveal significant differences between cases and controls in CNV rate, distribution of deletions or duplications, total or average CNV size; or number of genes affected by CNVs. However, we observed a nominal association between AD and a ∼470 kb duplication on chromosome 15q11.2 (P = 0.037). This duplication, encompassing up to five genes (TUBGCP5, CYFIP1, NIPA2, NIPA1, and WHAMML1) was present in 10 cases (2.6%) and 3 controls (0.8%). The dosage increase of CYFIP1 and NIPA1 genes was further confirmed by quantitative PCR. The current study did not detect CNVs that affect novel AD loci identified by recent genome-wide association studies. However, because the array technology used in our study has limitations in detecting small CNVs, future studies must carefully assess novel AD genes for the presence of disease-related CNVs. PMID:22384383

  1. Genome-wide efficient mixed-model analysis for association studies.

    PubMed

    Zhou, Xiang; Stephens, Matthew

    2012-06-17

    Linear mixed models have attracted considerable attention recently as a powerful and effective tool for accounting for population stratification and relatedness in genetic association tests. However, existing methods for exact computation of standard test statistics are computationally impractical for even moderate-sized genome-wide association studies. To address this issue, several approximate methods have been proposed. Here, we present an efficient exact method, which we refer to as genome-wide efficient mixed-model association (GEMMA), that makes approximations unnecessary in many contexts. This method is approximately n times faster than the widely used exact method known as efficient mixed-model association (EMMA), where n is the sample size, making exact genome-wide association analysis computationally practical for large numbers of individuals.

  2. A reference genome for common bean and genome-wide analysis of dual domestications.

    PubMed

    Schmutz, Jeremy; McClean, Phillip E; Mamidi, Sujan; Wu, G Albert; Cannon, Steven B; Grimwood, Jane; Jenkins, Jerry; Shu, Shengqiang; Song, Qijian; Chavarro, Carolina; Torres-Torres, Mirayda; Geffroy, Valerie; Moghaddam, Samira Mafi; Gao, Dongying; Abernathy, Brian; Barry, Kerrie; Blair, Matthew; Brick, Mark A; Chovatia, Mansi; Gepts, Paul; Goodstein, David M; Gonzales, Michael; Hellsten, Uffe; Hyten, David L; Jia, Gaofeng; Kelly, James D; Kudrna, Dave; Lee, Rian; Richard, Manon M S; Miklas, Phillip N; Osorno, Juan M; Rodrigues, Josiane; Thareau, Vincent; Urrea, Carlos A; Wang, Mei; Yu, Yeisoo; Zhang, Ming; Wing, Rod A; Cregan, Perry B; Rokhsar, Daniel S; Jackson, Scott A

    2014-07-01

    Common bean (Phaseolus vulgaris L.) is the most important grain legume for human consumption and has a role in sustainable agriculture owing to its ability to fix atmospheric nitrogen. We assembled 473 Mb of the 587-Mb genome and genetically anchored 98% of this sequence in 11 chromosome-scale pseudomolecules. We compared the genome for the common bean against the soybean genome to find changes in soybean resulting from polyploidy. Using resequencing of 60 wild individuals and 100 landraces from the genetically differentiated Mesoamerican and Andean gene pools, we confirmed 2 independent domestications from genetic pools that diverged before human colonization. Less than 10% of the 74 Mb of sequence putatively involved in domestication was shared by the two domestication events. We identified a set of genes linked with increased leaf and seed size and combined these results with quantitative trait locus data from Mesoamerican cultivars. Genes affected by domestication may be useful for genomics-enabled crop improvement.

  3. Genic rather than genome-wide differences between sexually deceptive Ophrys orchids with different pollinators.

    PubMed

    Sedeek, Khalid E M; Scopece, Giovanni; Staedler, Yannick M; Schönenberger, Jürg; Cozzolino, Salvatore; Schiestl, Florian P; Schlüter, Philipp M

    2014-12-01

    High pollinator specificity and the potential for simple genetic changes to affect pollinator attraction make sexually deceptive orchids an ideal system for the study of ecological speciation, in which change of flower odour is likely important. This study surveys reproductive barriers and differences in floral phenotypes in a group of four closely related, coflowering sympatric Ophrys species and uses a genotyping-by-sequencing (GBS) approach to obtain information on the proportion of the genome that is differentiated between species. Ophrys species were found to effectively lack postpollination barriers, but are strongly isolated by their different pollinators (floral isolation) and, to a smaller extent, by shifts in flowering time (temporal isolation). Although flower morphology and perhaps labellum coloration may contribute to floral isolation, reproductive barriers may largely be due to differences in flower odour chemistry. GBS revealed shared polymorphism throughout the Ophrys genome, with very little population structure between species. Genome scans for FST outliers identified few markers that are highly differentiated between species and repeatable in several populations. These genome scans also revealed highly differentiated polymorphisms in genes with putative involvement in floral odour production, including a previously identified candidate gene thought to be involved in the biosynthesis of pseudo-pheromones by the orchid flowers. Taken together, these data suggest that ecological speciation associated with different pollinators in sexually deceptive orchids has a genic rather than a genomic basis, placing these species at an early phase of genomic divergence within the 'speciation continuum'.

  4. Reproduction and In-Depth Evaluation of Genome-Wide Association Studies and Genome-Wide Meta-analyses Using Summary Statistics

    PubMed Central

    Niu, Yao-Fang; Ye, Chengyin; He, Ji; Han, Fang; Guo, Long-Biao; Zheng, Hou-Feng; Chen, Guo-Bo

    2017-01-01

    In line with open-source genetics, we report a novel linear regression technique for genome-wide association studies (GWAS), called Open GWAS algoriTHm (OATH). When individual-level data are not available, OATH can not only completely reproduce reported results from an experimental model, but also recover underreported results from other alternative models with a different combination of nuisance parameters using naïve summary statistics (NSS). OATH can also reliably evaluate all reported results in-depth (e.g., p-value variance analysis), as demonstrated for 42 Arabidopsis phenotypes under three magnesium (Mg) conditions. In addition, OATH can be used for consortium-driven genome-wide association meta-analyses (GWAMA), and can greatly improve the flexibility of GWAMA. A prototype of OATH is available in the Genetic Analysis Repository (https://github.com/gc5k/GEAR). PMID:28122950

  5. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development.

    PubMed

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J; Zhou, Xiang; Liu, Bang; Woods, Leah C Solberg; Fuchs, Rita A

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals.

  6. Genome Wide Sampling Sequencing for SNP Genotyping: Methods, Challenges and Future Development

    PubMed Central

    Jiang, Zhihua; Wang, Hongyang; Michal, Jennifer J.; Zhou, Xiang; Liu, Bang; Woods, Leah C. Solberg; Fuchs, Rita A.

    2016-01-01

    Genetic polymorphisms, particularly single nucleotide polymorphisms (SNPs), have been widely used to advance quantitative, functional and evolutionary genomics. Ideally, all genetic variants among individuals should be discovered when next generation sequencing (NGS) technologies and platforms are used for whole genome sequencing or resequencing. In order to improve the cost-effectiveness of the process, however, the research community has mainly focused on developing genome-wide sampling sequencing (GWSS) methods, a collection of reduced genome complexity sequencing, reduced genome representation sequencing and selective genome target sequencing. Here we review the major steps involved in library preparation, the types of adapters used for ligation and the primers designed for amplification of ligated products for sequencing. Unfortunately, currently available GWSS methods have their drawbacks, such as inconsistency in the number of reads per sample library, the number of sites/targets per individual, and the number of reads per site/target, all of which result in missing data. Suggestions are proposed here to improve library construction, genotype calling accuracy, genome-wide marker density and read mapping rate. In brief, optimized GWSS library preparation should generate a unique set of target sites with dense distribution along chromosomes and even coverage per site across all individuals. PMID:26722221

  7. Novel Genetic Analysis for Case-Control Genome-Wide Association Studies: Quantification of Power and Genomic Prediction Accuracy

    PubMed Central

    Lee, Sang Hong; Wray, Naomi R.

    2013-01-01

    Genome-wide association studies (GWAS) are routinely conducted for both quantitative and binary (disease) traits. We present two analytical tools for use in the experimental design of GWAS. Firstly, we present power calculations quantifying power in a unified framework for a range of scenarios. In this context we consider the utility of quantitative scores (e.g. endophenotypes) that may be available on cases only or both cases and controls. Secondly, we consider, the accuracy of prediction of genetic risk from genome-wide SNPs and derive an expression for genomic prediction accuracy using a liability threshold model for disease traits in a case-control design. The expected values based on our derived equations for both power and prediction accuracy agree well with observed estimates from simulations. PMID:23977056

  8. Phenotype-Genotype Integrator (PheGenI): synthesizing genome-wide association study (GWAS) data with existing genomic resources.

    PubMed

    Ramos, Erin M; Hoffman, Douglas; Junkins, Heather A; Maglott, Donna; Phan, Lon; Sherry, Stephen T; Feolo, Mike; Hindorff, Lucia A

    2014-01-01

    Rapidly accumulating data from genome-wide association studies (GWASs) and other large-scale studies are most useful when synthesized with existing databases. To address this opportunity, we developed the Phenotype-Genotype Integrator (PheGenI), a user-friendly web interface that integrates various National Center for Biotechnology Information (NCBI) genomic databases with association data from the National Human Genome Research Institute GWAS Catalog and supports downloads of search results. Here, we describe the rationale for and development of this resource. Integrating over 66,000 association records with extensive single nucleotide polymorphism (SNP), gene, and expression quantitative trait loci data already available from the NCBI, PheGenI enables deeper investigation and interrogation of SNPs associated with a wide range of traits, facilitating the examination of the relationships between genetic variation and human diseases.

  9. Genomics for public health improvement: relevant international ethical and policy issues around genome-wide association studies and biobanks.

    PubMed

    Pang, T

    2013-01-01

    Genome-wide association studies and biobanks are at the forefront of genomics research and possess unprecedented potential to improve public health. However, for public health genomics to ultimately fulfill its potential, technological and scientific advances alone are insufficient. Scientists, ethicists, policy makers, and regulators must work closely together with research participants and communities in order to craft an equitable and just ethical framework, and a sustainable environment for effective policies. Such a framework should be a 'hybrid' form which balances equity and solidarity with entrepreneurship and scientific advances. A good balance between research and policy on one hand, and privacy, protection and trust on the other is the key for public health improvement based on advances in genomics science.

  10. snpGeneSets: An R Package for Genome-Wide Study Annotation

    PubMed Central

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-01-01

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048

  11. snpGeneSets: An R Package for Genome-Wide Study Annotation.

    PubMed

    Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

    2016-12-07

    Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/.

  12. Genome-wide survey and analysis of microsatellites in the Pacific oyster genome: abundance, distribution, and potential for marker development

    NASA Astrophysics Data System (ADS)

    Wang, Jiafeng; Qi, Haigang; Li, Li; Zhang, Guofan

    2014-01-01

    Microsatellites are a ubiquitous component of the eukaryote genome and constitute one of the most popular sources of molecular markers for genetic studies. However, no data are currently available regarding microsatellites across the entire genome in oysters, despite their importance to the aquaculture industry. We present the first genome-wide investigation of microsatellites in the Pacific oyster Crassostrea gigas by analysis of the complete genome, resequencing, and expression data. The Pacific oyster genome is rich in microsatellites. A total of 604 653 repeats were identified, in average of one locus per 815 base pairs (bp). A total of 12 836 genes had coding repeats, and 7 332 were expressed normally, including genes with a wide range of molecular functions. Compared with 20 different species of animals, microsatellites in the oyster genome typically exhibited 1) an intermediate overall frequency; 2) relatively uniform contents of (A)n and (C)n repeats and abundant long (C)n repeats (≥24 bp); 3) large average length of (AG)n repeats; and 4) scarcity of trinucleotide repeats. The microsatellite-flanking regions exhibited a high degree of polymorphism with a heterozygosity rate of around 2.0%, but there was no correlation between heterozygosity and microsatellite abundance. A total of 19 462 polymorphic microsatellites were discovered, and dinucleotide repeats were the most active, with over 26% of loci found to harbor allelic variations. In all, 7 451 loci with high potential for marker development were identified. Better knowledge of the microsatellites in the oyster genome will provide information for the future design of a wide range of molecular markers and contribute to further advancements in the field of oyster genetics, particularly for molecular-based selection and breeding.

  13. Genome-Wide Divergence and Linkage Disequilibrium Analyses for Capsicum baccatum Revealed by Genome-Anchored Single Nucleotide Polymorphisms.

    PubMed

    Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C V Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K

    2016-01-01

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index (FST ) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9-2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers.

  14. Genome-Wide Divergence and Linkage Disequilibrium Analyses for Capsicum baccatum Revealed by Genome-Anchored Single Nucleotide Polymorphisms

    PubMed Central

    Nimmakayala, Padma; Abburi, Venkata L.; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C. V. Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K.

    2016-01-01

    Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index (FST) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9–2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers. PMID:27857720

  15. Applying semantic web technologies for phenome-wide scan using an electronic health record linked Biobank

    PubMed Central

    2012-01-01

    Background The ability to conduct genome-wide association studies (GWAS) has enabled new exploration of how genetic variations contribute to health and disease etiology. However, historically GWAS have been limited by inadequate sample size due to associated costs for genotyping and phenotyping of study subjects. This has prompted several academic medical centers to form “biobanks” where biospecimens linked to personal health information, typically in electronic health records (EHRs), are collected and stored on a large number of subjects. This provides tremendous opportunities to discover novel genotype-phenotype associations and foster hypotheses generation. Results In this work, we study how emerging Semantic Web technologies can be applied in conjunction with clinical and genotype data stored at the Mayo Clinic Biobank to mine the phenotype data for genetic associations. In particular, we demonstrate the role of using Resource Description Framework (RDF) for representing EHR diagnoses and procedure data, and enable federated querying via standardized Web protocols to identify subjects genotyped for Type 2 Diabetes and Hypothyroidism to discover gene-disease associations. Our study highlights the potential of Web-scale data federation techniques to execute complex queries. Conclusions This study demonstrates how Semantic Web technologies can be applied in conjunction with clinical data stored in EHRs to accurately identify subjects with specific diseases and phenotypes, and identify genotype-phenotype associations. PMID:23244446

  16. Development and characterization of genomic and expressed SSRs in citrus by genome-wide analysis.

    PubMed

    Liu, Sheng-Rui; Li, Wen-Yang; Long, Dang; Hu, Chun-Gen; Zhang, Jin-Zhi

    2013-01-01

    Microsatellites or simple sequence repeats (SSRs) are one of the most popular sources of genetic markers and play a significant role in plant genetics and breeding. In this study, we identified citrus SSRs in the genome of Clementine mandarin and analyzed their frequency and distribution in different genomic regions. A total of 80,708 SSRs were detected in the genome with an overall density of 268 SSRs/Mb. While di-nucleotide repeats were the most frequent microsatellites in genomic DNA sequence, tetra-nucleotides, which had more repeat units than any other SSR types, had the highest cumulative sequence length. We identified 6,834 transcripts as containing 8,989 SSRs in 33,929 Clementine mandarin transcripts, among which, tri-nucleotide motifs (36.0%) were the most common, followed by di-nucleotide (26.9%) and hexa-nucleotide motifs (15.1%). The motif AG (16.7%) was most abundant among these SSRs, while motifs AAG (6.6%), AAT (5.0%), and TAG (2.2%) were most common among tri-nucleotides. Functional categorization of transcripts containing SSRs revealed that 5,879 (86.0%) of such transcripts had homology with known proteins, GO and KEGG annotation revealed that transcripts containing SSRs were those implicated in diverse biological processes in plants, including binding, development, transcription, and protein degradation. When 27 genomic and 78 randomly selected SSRs were tested on Clementine mandarin, 95 SSRs revealed polymorphism. These 95 SSRs were further deployed on 18 genotypes of the three generas of Rutaceae for the genetic diversity assessment, genomic SSRs generally show low transferability in comparison to SSRs developed from expressed sequences. These transcript-markers identified in our study may provide a valuable genetic and genomic tool for further genetic research and varietal development in citrus, such as diversity study, QTL mapping, molecular breeding, comparative mapping and other genetic analyses.

  17. Genome-Wide Association Study to Identify Genes Related to Renal Mercury Concentrations in Mice

    PubMed Central

    Alkaissi, Hammoudi; Ekstrand, Jimmy; Jawad, Aksa; Nielsen, Jesper Bo; Havarinasab, Said; Soderkvist, Peter; Hultman, Per

    2016-01-01

    Background: Following human mercury (Hg) exposure, the metal accumulates in considerable concentrations in kidney, liver, and brain. Although the toxicokinetics of Hg have been studied extensively, factors responsible for interindividual variation in humans are largely unknown. Differences in accumulation of renal Hg between inbred mouse strains suggest a genetic interstrain variation regulating retention or/and excretion of Hg. A.SW, DBA/2 and BALB/C mouse strains accumulate higher amounts of Hg than B10.S. Objectives: We aimed to find candidate genes associated with regulation of renal Hg concentrations. Methods: A.SW, B10.S and their F1 and F2 offspring were exposed for 6 weeks to 2.0 mg Hg/L drinking water. Genotyping with microsatellites was conducted on 84 F2 mice for genome-wide scanning with ion pair reverse-phase high-performance liquid chromatography (IP RP HPLC). Quantitative trait loci (QTL) were established. Denaturing HPLC was used to detect single nucleotide polymorphisms for haplotyping and fine mapping in 184 and 32 F2 mice, respectively. Candidate genes (Pprc1, Btrc and Nfkb2) verified by fine mapping and QTL were further investigated by real-time polymerase chain reaction. Genes enhanced by Pprc1 (Nrf1 and Nrf2) were included for gene expression analysis. Results: Renal Hg concentrations differed significantly between A.SW and B10.S mice and between males and females within each strain. QTL analysis showed a peak logarithm of odds ratio score 5.78 on chromosome 19 (p = 0.002). Haplotype and fine mapping associated the Hg accumulation with Pprc1, which encodes PGC-1-related coactivator (PRC), a coactivator for proteins involved in detoxification. Pprc1 and two genes coactivated by Pprc1 (Nrf1 and Nrf2) had significantly lower gene expression in the A.SW strain than in the B10.S strain. Conclusions: This study supports Pprc1 as a key regulator for renal Hg excretion. Citation: Alkaissi H, Ekstrand J, Jawad A, Nielsen JB, Havarinasab S, Soderkvist P

  18. Genome-wide divergence, haplotype distribution and population demographic histories for Gossypium hirsutum and Gossypium barbadense as revealed by genome-anchored SNPs

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using the SNPs distributed genome-wide, we exami...

  19. Bidirectional promoters of insects: genome-wide comparison, evolutionary implication and influence on gene expression.

    PubMed

    Behura, Susanta K; Severson, David W

    2015-01-30

    Bidirectional promoters are widespread in insect genomes. By analyzing 23 insect genomes we show that the frequency of bidirectional gene pairs varies according to genome compactness and density of genes among the species. The density of bidirectional genes expected based on number of genes per megabase of genome explains the observed density suggesting that bidirectional pairing of genes may be due to random event. We identified specific transcription factor binding motifs that are enriched in bidirectional promoters across insect species. Furthermore, we observed that bidirectional promoters may act as transcriptional hotspots in insect genomes where protein coding genes tend to aggregate in significantly biased (p < 0.001) manner compared to unidirectional promoters. Natural selection seems to have an association with the extent of bidirectionality of genes among the species. The rate of non-synonymous-to-synonymous changes (dN/dS) shows a second-order polynomial distribution with bidirectionality between species indicating that bidirectionality is dependent upon evolutionary pressure acting on the genomes. Analysis of genome-wide microarray expression data of multiple insect species suggested that bidirectionality has a similar association with transcriptome variation across species. Furthermore, bidirectional promoters show significant association with correlated expression of the divergent gene pairs depending upon their motif composition. Analysis of gene ontology showed that bidirectional genes tend to have a common association with functions related to "binding" (including ion binding, nucleotide binding and protein binding) across genomes. Such functional constraint of bidirectional genes may explain their widespread persistence in genome of diverse insect species.

  20. A guide to genome-wide association analysis and post-analytic interrogation.

    PubMed

    Reed, Eric; Nunez, Sara; Kulp, David; Qian, Jing; Reilly, Muredach P; Foulkes, Andrea S

    2015-12-10

    This tutorial is a learning resource that outlines the basic process and provides specific software tools for implementing a complete genome-wide association analysis. Approaches to post-analytic visualization and interrogation of potentially novel findings are also presented. Applications are illustrated using the free and open-source R statistical computing and graphics software environment, Bioconductor software for bioinformatics and the UCSC Genome Browser. Complete genome-wide association data on 1401 individuals across 861,473 typed single nucleotide polymorphisms from the PennCATH study of coronary artery disease are used for illustration. All data and code, as well as additional instructional resources, are publicly available through the Open Resources in Statistical Genomics project: http://www.stat-gen.org.

  1. High-resolution, genome-wide mapping of chromatin modifications by GMAT.

    PubMed

    Roh, Tae-Young; Zhao, Keji

    2008-01-01

    One major postgenomic challenge is to characterize the epigenomes that control genome functions. The epigenomes are mainly defined by the specific association of nonhistone proteins with chromatin and the covalent modifications of chromatin, including DNA methylation and posttranslational histone modifications. The in vivo protein-binding and chromatin-modification patterns can be revealed by the chromatin immunoprecipitation assay (ChIP). By combining the ChIP assays and the serial analysis of gene expression (SAGE) protocols, we have developed an unbiased and high-resolution genome-wide mapping technique (GMAT) to determine the genome-wide protein-targeting and chromatin-modification patterns. GMAT has been successfully applied to mapping the target sites of the histone acetyltransferase, Gcn5p, in yeast and to the discovery of the histone acetylation islands as an epigenetic mark for functional regulatory elements in the human genome.

  2. Analysis of Diffusional Solidification in a Wide-Gap Brazing Powder Mixture Using Differential Scanning Calorimetry

    NASA Astrophysics Data System (ADS)

    Corbin, Stephen F.; Murray, D. Clark; Bouthillier, Alain

    2016-12-01

    The diffusional solidification (DS) of a mixed powder system, consisting of pure Ni base metal powder and BNi-2 braze powder, suitable for wide-gap brazing was investigated using differential scanning calorimetry (DSC) and parallel microstructural examination. It was determined that very little interdiffusion between the powders developed in the solid state prior to braze powder melting. Once liquid formed, rapid DS occurred such that, when the powders were loosely mixed together, only 20 to 50 wt pct of the potential liquid fraction actually developed, leading to poor densification. Separating the braze and Ni into a layered structure leads to less DS, increased liquid formation ( i.e., 35 to 80 wt pct of the potential liquid fraction) and improved densification. The rate of isothermal solidification in layered structures consisting of 30 and 40 wt pct BNi-2 braze material was determined using DSC. After 30 minutes of braze time at 1323 K (1050 °C), complete IS occurred, thus avoiding the formation of a continuous network of large borides. The final microstructure of the IS mixtures consisted of a continuous matrix of solid solution Ni, with isolated and dispersed borides.

  3. Obstacle detection and spectral discrimination using multi-wavelength motionless wide angle laser scanning.

    PubMed

    Sahba, Kaveh; Alameh, Kamal E; Smith, Clifton L

    2008-04-14

    Static laser scanning over a wide angle is demonstrated by ranging to 20 laser beams generated by a novel cylindrical quasi-cavity waveguide, using laser triangulation. Baseline distances and outgoing angles unique to each laser beam are calculated by modelling the triangulation arrangement using a system of linear equations and plotting principal rays. The quasi-cavity waveguide, imaging lens and focal plane are also plotted. The system is calibrated by finding optimal values for uncertain instrumental parameters using constrained non-linear optimization. Distances calculated over 5m indoors result in accuracies above 93%. Discrete laser spectroscopy using 640nm and 785nm laser diodes is also demonstrated. Both injected laser beams follow the same optical path through the quasi-cavity waveguide, enabling spectral measurements to be made from the same point on an object for both wavelengths. The reflected red and infrared laser light is digitally recorded by a CCD imager and differences in reflected intensity enable discrimination between various natural objects. This provides more complete information about the perturbing object, including its 3D coordinates as well as limited identification of its surface material.

  4. Mining Gold Dust under the Genome Wide Significance Level: A Two-Stage Approach to Analysis of GWAS

    PubMed Central

    Shi, Gang; Boerwinkle, Eric; Morrison, Alanna C.; Gu, C. Charles; Chakravarti, Aravinda; Rao, DC

    2013-01-01

    We propose a two-stage approach to analyze genome-wide association (GWA) data in order to identify a set of promising single-nucleotide polymorphisms (SNPs). In stage one, we select a list of top signals from single SNP analyses by controlling false discovery rate (FDR). In stage two, we use the least absolute shrinkage and selection operator (LASSO) regression to reduce false positives. The proposed approach was evaluated using simulated quantitative traits based on genome-wide SNP data on 8,861 Caucasian individuals from the Atherosclerosis Risk in Communities (ARIC) Study. Our first stage, targeted at controlling false negatives, yields better power than using Bonferroni corrected significance level. The LASSO regression reduces the number of significant SNPs in stage two: it reduces false positive SNPs and it reduces true positive SNPs also at simulated causal loci due to linkage disequilibrium. Interestingly, the LASSO regression preserves the power from stage one, i.e., the number of causal loci detected from the LASSO regression in stage two is almost the same as in stage one, while reducing false positives further. Real data on systolic blood pressure in the ARIC study was analyzed using our two-stage approach which identified two significant SNPs, one of which was reported to be genome-significant in a meta-analysis containing a much larger sample size. On the other hand, a single SNP association scan did not yield any significant results. PMID:21254218

  5. Mining gold dust under the genome wide significance level: a two-stage approach to analysis of GWAS.

    PubMed

    Shi, Gang; Boerwinkle, Eric; Morrison, Alanna C; Gu, C Charles; Chakravarti, Aravinda; Rao, D C

    2011-02-01

    We propose a two-stage approach to analyze genome-wide association data in order to identify a set of promising single-nucleotide polymorphisms (SNPs). In stage one, we select a list of top signals from single SNP analyses by controlling false discovery rate. In stage two, we use the least absolute shrinkage and selection operator (LASSO) regression to reduce false positives. The proposed approach was evaluated using simulated quantitative traits based on genome-wide SNP data on 8,861 Caucasian individuals from the Atherosclerosis Risk in Communities (ARIC) Study. Our first stage, targeted at controlling false negatives, yields better power than using Bonferroni-corrected significance level. The LASSO regression reduces the number of significant SNPs in stage two: it reduces false-positive SNPs and it reduces true-positive SNPs also at simulated causal loci due to linkage disequilibrium. Interestingly, the LASSO regression preserves the power from stage one, i.e., the number of causal loci detected from the LASSO regression in stage two is almost the same as in stage one, while reducing false positives further. Real data on systolic blood pressure in the ARIC study was analyzed using our two-stage approach which identified two significant SNPs, one of which was reported to be genome-significant in a meta-analysis containing a much larger sample size. On the other hand, a single SNP association scan did not yield any significant results.

  6. A Genome-Wide Investigation of Autozygosity and Breast Cancer Risk

    DTIC Science & Technology

    2011-07-01

    likely to harbor cancer-related genes 6 Early-onset breast cancer GWAS • 3,203 non- Hispanic white participants – 1,647cases , 1,556 controls – From BCFR...cases than in controls, using logistic regression methods. Using genome-wide SNP data (525,000 SNPs) on 1,647 non- Hispanic white, early-onset...specific genomic locations, suggesting these regions harbor important cancer genes (10, 11). Homozygosity mapping is a natural extension of large

  7. A genome-wide perspective of human diversity and its implications in infectious disease.

    PubMed

    Manry, Jérémy; Quintana-Murci, Lluis

    2013-01-01

    Progress in genomic technologies, such as DNA arrays and next-generation sequencing, is allowing systematic characterization of the degree of human genetic variation at the scale of individual genomes. Public efforts, such as the International HapMap Project and the 1000 Genomes Project, have provided a realistic picture of the levels of genetic diversity in individuals and populations. These genomic techniques are also making it possible to evaluate the contribution of host genetic diversity to differences in susceptibility to both rare and common infectious diseases. Recent studies have revealed the power of whole-exome sequencing for dissecting the immunological mechanisms underlying the pathogenesis of severe, rare infectious diseases. Likewise, genome-wide association studies on common viral, bacterial, and parasitic infections have shed light on the host genetic basis of susceptibility to infectious diseases and, in some cases, of disease progression and drug responses.

  8. Variants in the host genome may inhibit tumour growth in devil facial tumours: evidence from genome-wide association.

    PubMed

    Wright, Belinda; Willet, Cali E; Hamede, Rodrigo; Jones, Menna; Belov, Katherine; Wade, Claire M

    2017-03-24

    Devil facial tumour disease (DFTD) has decimated wild populations of Tasmanian devils (Sarcophilus harrisii) due to its ability to avoid immune detection and pass from host to host by biting. A small number of devils have been observed to spontaneously recover from the disease which is otherwise fatal. We have sequenced the genomes of these rare cases and compared them to the genomes of devils who succumbed to the disease. Genome-wide association, based on this limited sampling, highlighted two key genomic regions potentially associated with ability to survive DFTD. Following targeted genotyping in additional samples, both of these loci remain significantly different between cases and controls, with the PAX3 locus retaining significance at the 0.001 level, though genome-wide significance was not achieved. We propose that PAX3 may be involved in a regulatory pathway that influences the slowing of tumour growth and may allow more time for an immune response to be mounted in animals with regressed tumours. This provides an intriguing hypothesis for further research and could provide a novel route of treatment for this devastating disease.

  9. Genome-wide polymorphism and comparative analyses in the white-tailed deer (Odocoileus virginianus): a model for conservation genomics.

    PubMed

    Seabury, Christopher M; Bhattarai, Eric K; Taylor, Jeremy F; Viswanathan, Ganesh G; Cooper, Susan M; Davis, Donald S; Dowd, Scot E; Lockwood, Mitch L; Seabury, Paul M

    2011-01-19

    The white-tailed deer (Odocoileus virginianus) represents one of the most successful and widely distributed large mammal species within North America, yet very little nucleotide sequence information is available. We utilized massively parallel pyrosequencing of a reduced representation library (RRL) and a random shotgun library (RSL) to generate a complete mitochondrial genome sequence and identify a large number of putative single nucleotide polymorphisms (SNPs) distributed throughout the white-tailed deer nuclear and mitochondrial genomes. A SNP validation study designed to test specific classes of putative SNPs provides evidence for as many as 10,476 genome-wide SNPs in the current dataset. Based on cytogenetic evidence for homology between cow (Bos taurus) and white-tailed deer chromosomes, we demonstrate that a divergent genome may be used for estimating the relative distribution and density of de novo sequence contigs as well as putative SNPs for species without draft genome assemblies. Our approach demonstrates that bioinformatic tools developed for model or agriculturally important species may be leveraged to support next-generation research programs for species of biological, ecological and evolutionary importance. We also provide a functional annotation analysis for the de novo sequence contigs assembled from white-tailed deer pyrosequencing reads, a mitochondrial phylogeny involving 13,722 nucleotide positions for 10 unique species of Cervidae, and a median joining haplotype network as a putative representation of mitochondrial evolution in O. virginianus. The results of this study are expected to provide a detailed template enabling genome-wide sequence-based studies of threatened, endangered or conservationally important non-model organisms.

  10. A foundation for provitamin A biofortification of maize: genome-wide association and genomic prediction models of carotenoid levels.

    PubMed

    Owens, Brenda F; Lipka, Alexander E; Magallanes-Lundback, Maria; Tiede, Tyler; Diepenbrock, Christine H; Kandianis, Catherine B; Kim, Eunha; Cepela, Jason; Mateos-Hernandez, Maria; Buell, C Robin; Buckler, Edward S; DellaPenna, Dean; Gore, Michael A; Rocheford, Torbert

    2014-12-01

    Efforts are underway for development of crops with improved levels of provitamin A carotenoids to help combat dietary vitamin A deficiency. As a global staple crop with considerable variation in kernel carotenoid composition, maize (Zea mays L.) could have a widespread impact. We performed a genome-wide association study (GWAS) of quantified seed carotenoids across a panel of maize inbreds ranging from light yellow to dark orange in grain color to identify some of the key genes controlling maize grain carotenoid composition. Significant associations at the genome-wide level were detected within the coding regions of zep1 and lut1, carotenoid biosynthetic genes not previously shown to impact grain carotenoid composition in association studies, as well as within previously associated lcyE and crtRB1 genes. We leveraged existing biochemical and genomic information to identify 58 a priori candidate genes relevant to the biosynthesis and retention of carotenoids in maize to test in a pathway-level analysis. This revealed dxs2 and lut5, genes not previously associated with kernel carotenoids. In genomic prediction models, use of markers that targeted a small set of quantitative trait loci associated with carotenoid levels in prior linkage studies were as effective as genome-wide markers for predicting carotenoid traits. Based on GWAS, pathway-level analysis, and genomic prediction studies, we outline a flexible strategy involving use of a small number of genes that can be selected for rapid conversion of elite white grain germplasm, with minimal amounts of carotenoids, to orange grain versions containing high levels of provitamin A.

  11. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs

    PubMed Central

    Krapohl, E; Plomin, R

    2016-01-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES. PMID:25754083

  12. Genetic link between family socioeconomic status and children's educational achievement estimated from genome-wide SNPs.

    PubMed

    Krapohl, E; Plomin, R

    2016-03-01

    One of the best predictors of children's educational achievement is their family's socioeconomic status (SES), but the degree to which this association is genetically mediated remains unclear. For 3000 UK-representative unrelated children we found that genome-wide single-nucleotide polymorphisms could explain a third of the variance of scores on an age-16 UK national examination of educational achievement and half of the correlation between their scores and family SES. Moreover, genome-wide polygenic scores based on a previously published genome-wide association meta-analysis of total number of years in education accounted for ~3.0% variance in educational achievement and ~2.5% in family SES. This study provides the first molecular evidence for substantial genetic influence on differences in children's educational achievement and its association with family SES.

  13. A Genome-Wide Association Study for Regulators of Micronucleus Formation in Mice

    PubMed Central

    McIntyre, Rebecca E.; Nicod, Jérôme; Robles-Espinoza, Carla Daniela; Maciejowski, John; Cai, Na; Hill, Jennifer; Verstraten, Ruth; Iyer, Vivek; Rust, Alistair G.; Balmus, Gabriel; Mott, Richard; Flint, Jonathan; Adams, David J.

    2016-01-01

    In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males. PMID:27233670

  14. A Genome-Wide Association Study for Regulators of Micronucleus Formation in Mice.

    PubMed

    McIntyre, Rebecca E; Nicod, Jérôme; Robles-Espinoza, Carla Daniela; Maciejowski, John; Cai, Na; Hill, Jennifer; Verstraten, Ruth; Iyer, Vivek; Rust, Alistair G; Balmus, Gabriel; Mott, Richard; Flint, Jonathan; Adams, David J

    2016-08-09

    In mammals the regulation of genomic instability plays a key role in tumor suppression and also controls genome plasticity, which is important for recombination during the processes of immunity and meiosis. Most studies to identify regulators of genomic instability have been performed in cells in culture or in systems that report on gross rearrangements of the genome, yet subtle differences in the level of genomic instability can contribute to whole organism phenotypes such as tumor predisposition. Here we performed a genome-wide association study in a population of 1379 outbred Crl:CFW(SW)-US_P08 mice to dissect the genetic landscape of micronucleus formation, a biomarker of chromosomal breaks, whole chromosome loss, and extranuclear DNA. Variation in micronucleus levels is a complex trait with a genome-wide heritability of 53.1%. We identify seven loci influencing micronucleus formation (false discovery rate <5%), and define candidate genes at each locus. Intriguingly at several loci we find evidence for sexual dimorphism in micronucleus formation, with a locus on chromosome 11 being specific to males.

  15. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    PubMed Central

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a genome-wide association approach. Method The authors administered 4 Internet-based measures of receptive language (vocabulary, semantics, syntax, and pragmatics) to a sample of 2,329 twelve-year-olds for whom DNA and genome-wide genotyping were available. Nearly 700,000 single-nucleotide polymorphisms (SNPs) and 1 million imputed SNPs were included in a genome-wide association analysis of receptive language composite scores. Results No SNP associations met the demanding criterion of genome-wide significance that corrects for multiple testing across the genome (p < 5 × 10–8). The strongest SNP association did not replicate in an additional sample of 2,639 twelve-year-olds. Conclusions These results indicate that individual differences in receptive language ability in the general population do not reflect common genetic variants that account for more than 3% of the phenotypic variance. The search for genetic variants associated with language skill will require larger samples and additional methods to identify and functionally characterize the full spectrum of risk variants. PMID:24687471

  16. A Genetic Linkage Map of the Mouse Using Restriction Landmark Genomic Scanning (Rlgs)

    PubMed Central

    Hayashizaki, Y.; Hirotsune, S.; Okazaki, Y.; Shibata, H.; Akasako, A.; Muramatsu, M.; Kawai, J.; Hirasawa, T.; Watanabe, S.; Shiroishi, T.; Moriwaki, K.; Taylor, B. A.; Matsuda, Y.; Elliott, R. W.; Manly, K. F.; Chapman, V. M.

    1994-01-01

    We have developed a multiplex method of genome analysis, restriction landmark genomic scanning (RLGS) that has been used to construct genetic maps in mice. Restriction landmarks are end-labeled restriction fragments of genomic DNA that are separated by using high resolution, two-dimensional gel electrophoresis identifying as many as two thousand landmark loci in a single gel. Variation for several hundred of these loci has been identified between laboratory strains and between these strains and Mus spretus. The segregation of more than 1100 RLGS loci has been analyxed in recombinant inbred (RI) strains and in two separate interspecific genetic crosses. Genetic maps have been derived that link 1045 RLGS loci to reference loci on all of the autosomes and the X chromosome of the mouse genome. The RLGS method can be applied to genome analysis in many different organisms to identify genomic loci because it used end-labeling of restriction landmarks rather than probe hybridization. Different combinations of restriction enzymes yield different sets of RLGS loci providing expanded power for genetic mapping. PMID:7896102

  17. Genomic consequences of selection and genome-wide association mapping in soybean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Crop improvement always involves selection of specific alleles at genes controlling traits of agronomic importance, likely resulting in detectable signatures within the genome of modern soybean. The identification of these signatures is meaningful from the perspective of evolutionary biology, and fo...

  18. A reference genome for common bean and genome wide analysis of dual domestications

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Common bean (Phaseolus vulgaris) is the single most important grain legume for human consumption and, due to its ability to fix atmospheric nitrogen via symbioses with soil-borne microorganisms, has a valuable place in sustainable agriculture. We assembled 473 Mb of the common bean genome and geneti...

  19. Control selection options for genome-wide association studies in cohorts.

    PubMed

    Wacholder, Sholom; Rotunno, Melissa

    2009-03-01

    Investigators planning studies within cohorts have many options for choosing an efficient sampling design for genome-wide association and other molecular epidemiology studies. Consideration of person-year and proportional hazards analyses of full cohorts may add further insight into ramifications of different designs. Empirical evidence from genome-wide association studies can supplement intuition and simulations in comparing properties of various case-control designs within cohorts. Additional theoretical and empirical work, justification of sampling choice in publications, and consideration of context and scientific aims can improve designs and, thereby, increase the scientific value and cost effectiveness of future studies.

  20. Constitutional mosaic genome-wide uniparental disomy due to diploidisation: an unusual cancer-predisposing mechanism.

    PubMed

    Romanelli, Valeria; Nevado, Julián; Fraga, Mario; Trujillo, Alex Martín; Mori, Maria Ángeles; Fernández, Luis; Pérez de Nanclares, Guiomar; Martínez-Glez, Víctor; Pita, Guillermo; Meneses, Heloisa; Gracia, Ricardo; García-Miñaur, Sixto; García de Miguel, Purificación; Lecumberri, Beatriz; Rodríguez, José Ignacio; González Neira, Anna; Monk, David; Lapunzina, Pablo

    2011-03-01

    Molecular studies in a patient with Beckwith-Wiedemann syndrome phenotype who developed two different tumours revealed an unexpected observation of almost complete loss of heterozygosity of all chromosomes. It is shown, by means of numerous molecular methods, that the absence of maternal contribution in somatic cells is due to high-degree (∼ 85%) genome-wide paternal uniparental disomy (UPD). The observations indicate that the genome-wide UPD results from diploidisation, and have important implications for genetic counselling and tumour surveillance for the growing number of UPD associated imprinting disorders.

  1. Genome-Wide Association Mapping of Root Traits in the Context of Plant Hormone Research.

    PubMed

    Ristova, Daniela; Busch, Wolfgang

    2017-01-01

    Genome-wide association (GWA) mapping is a powerful method for the identification of alleles that underlie quantitative traits. It enables one to understand how genetic variation translates into phenotypic variation. In particular, plant hormone signaling pathways play a key role in shaping phenotypes. This chapter presents a protocol for genome-wide association mapping of root traits of Arabidopsis thaliana in the context of hormone research. We describe a specific protocol for acquiring primary and lateral root trait data that is appropriate for GWA studies using FIJI (ImageJ), and subsequent GWA mapping using a user-friendly Internet application.

  2. Genome-wide DNA methylation analysis using massively parallel sequencing technologies.

    PubMed

    Suzuki, Masako; Greally, John M

    2013-01-01

    "Epigenetics" refers to a heritable change in transcriptional status without alteration in the primary nucleotide sequence. Epigenetics provides an extra layer of transcriptional control and plays a crucial role in normal development, as well as in pathological conditions. DNA methylation is one of the best known and well-studied epigenetic modifications. Genome-wide DNA methylation profiling has become recognized as a biologically and clinically important epigenomic assay. In this review, we discuss the strengths and weaknesses of the protocols for genome-wide DNA methylation profiling using massively parallel sequencing (MPS) techniques. We will also describe recently discovered DNA modifications, and the protocols to detect these modifications.

  3. The development of an Arabidopsis model system for genome-wide analysis of polyploidy effects

    PubMed Central

    JEFFREY CHEN, Z.; WANG, JIANLIN; TIAN, LU; LEE, HYEON-SE; WANG, JIYUAN J.; CHEN, MENG; LEE, JINSUK J.; JOSEFSSON, CAROLINE; MADLUNG, ANDREAS; WATSON, BRIAN; LIPPMAN, ZACH; VAUGHN, MATT; CHRIS PIRES, J.; COLOT, VINCENT; DOERGE, R. W.; MARTIENSSEN, ROBERT A.; COMAI, LUCA; OSBORN, THOMAS C.

    2007-01-01

    Arabidopsis is a model system not only for studying numerous aspects of plant biology, but also for understanding mechanisms of the rapid evolutionary process associated with genome duplication and polyploidization. Although in animals interspecific hybrids are often sterile and aneuploids are related to disease syndromes, both Arabidopsis autopolyploids and allopolyploids occur in nature and can be readily formed in the laboratory, providing an attractive system for comparing changes in gene expression and genome structure among relatively ‘young’ and ‘established’ or ‘ancient’ polyploids. Powerful reverse and forward genetics in Arabidopsis offer an exceptional means by which regulatory mechanisms of gene and genome duplication may be revealed. Moreover, the Arabidopsis genome is completely sequenced; both coding and non-coding sequences are available. We have developed spotted oligo-gene and chromosome microarrays using the complete Arabidopsis genome sequence. The oligo-gene microarray consists of ~26 000 70-mer oligonucleotides that are designed from all annotated genes in Arabidopsis, and the chromosome microarray contains 1 kb genomic tiling fragments amplified from a chromosomal region or the complete sequence of chromosome 4. We have demonstrated the utility of microarrays for genome-wide analysis of changes in gene expression, genome organization and chromatin structure in Arabidopsis polyploids and related species. PMID:18079994

  4. Assignment of the horse grey coat colour gene to ECA25 using whole genome scanning.

    PubMed

    Swinburne, June E; Hopkins, A; Binns, M M

    2002-10-01

    The dominant grey coat colour gene of horses has been mapped using a whole genome scanning approach. Samples from a large half-sibling pedigree of Thoroughbred horses were utilized in order to map the grey coat colour locus, G. Multiplex groups of microsatellite markers were developed and used to efficiently screen the horse genome at a resolution of approximately 22 cM, based on an estimated map length for the horse genome of 2720 cM. The grey gene was assigned to chromosome 25 (ECA25), one of the smaller acrocentric horse chromosomes. Based on the current state of knowledge of conserved synteny and coat colour genetics in other mammalian species, there are no obvious candidate genes for the grey gene in the region.

  5. Genome-wide BAC-end sequencing of Cucumis melo using two BAC libraries

    PubMed Central

    2010-01-01

    Background Although melon (Cucumis melo L.) is an economically important fruit crop, no genome-wide sequence information is openly available at the current time. We therefore sequenced BAC-ends representing a total of 33,024 clones, half of them from a previously described melon BAC library generated with restriction endonucleases and the remainder from a new random-shear BAC library. Results We generated a total of 47,140 high-quality BAC-end sequences (BES), 91.7% of which were paired-BES. Both libraries were assembled independently and then cross-assembled to obtain a final set of 33,372 non-redundant, high-quality sequences. These were grouped into 6,411 contigs (4.5 Mb) and 26,961 non-assembled BES (14.4 Mb), representing ~4.2% of the melon genome. The sequences were used to screen genomic databases, identifying 7,198 simple sequence repeats (corresponding to one microsatellite every 2.6 kb) and 2,484 additional repeats of which 95.9% represented transposable elements. The sequences were also used to screen expressed sequence tag (EST) databases, revealing 11,372 BES that were homologous to ESTs. This suggests that ~30% of the melon genome consists of coding DNA. We observed regions of microsynteny between melon paired-BES and six other dicotyledonous plant genomes. Conclusion The analysis of nearly 50,000 BES from two complementary genomic libraries covered ~4.2% of the melon genome, providing insight into properties such as microsatellite and transposable element distribution, and the percentage of coding DNA. The observed synteny between melon paired-BES and six other plant genomes showed that useful comparative genomic data can be derived through large scale BAC-end sequencing by anchoring a small proportion of the melon genome to other sequenced genomes. PMID:21054843

  6. Genome-wide single nucleotide polymorphisms reveal population history and adaptive divergence in wild guppies.

    PubMed

    Willing, Eva-Maria; Bentzen, Paul; van Oosterhout, Cock; Hoffmann, Margarete; Cable, Joanne; Breden, Felix; Weigel, Detlef; Dreyer, Christine

    2010-03-01

    Adaptation of guppies (Poecilia reticulata) to contrasting upland and lowland habitats has been extensively studied with respect to behaviour, morphology and life history traits. Yet population history has not been studied at the whole-genome level. Although single nucleotide polymorphisms (SNPs) are the most abundant form of variation in many genomes and consequently very informative for a genome-wide picture of standing natural variation in populations, genome-wide SNP data are rarely available for wild vertebrates. Here we use genetically mapped SNP markers to comprehensively survey genetic variation within and among naturally occurring guppy populations from a wide geographic range in Trinidad and Venezuela. Results from three different clustering methods, Neighbor-net, principal component analysis (PCA) and Bayesian analysis show that the population substructure agrees with geographic separation and largely with previously hypothesized patterns of historical colonization. Within major drainages (Caroni, Oropouche and Northern), populations are genetically similar, but those in different geographic regions are highly divergent from one another, with some indications of ancient shared polymorphisms. Clear genomic signatures of a previous introduction experiment were seen, and we detected additional potential admixture events. Headwater populations were significantly less heterozygous than downstream populations. Pairwise F(ST) values revealed marked differences in allele frequencies among populations from different regions, and also among populations within the same region. F(ST) outlier methods indicated some regions of the genome as being under directional selection. Overall, this study demonstrates the power of a genome-wide SNP data set to inform for studies on natural variation, adaptation and evolution of wild populations.

  7. Genome-wide (over)view on the actions of vitamin D

    PubMed Central

    Carlberg, Carsten

    2014-01-01

    For a global understanding of the physiological impact of the nuclear hormone 1α,25-dihydroxyvitamin D3 (1,25(OH)2D3) the analysis of the genome-wide locations of its high affinity receptor, the transcription factor vitamin D receptor (VDR), is essential. Chromatin immunoprecipitation sequencing (ChIP-seq) in GM10855 and GM10861 lymphoblastoid cells, undifferentiated and lipopolysaccharide-differentiated THP-1 monocytes, LS180 colorectal cancer cells and LX2 hepatic stellate cells revealed between 1000 and 13,000 VDR-specific genomic binding sites. The harmonized analysis of these ChIP-seq datasets indicates that the mechanistic basis for the action of the VDR is independent of the cell type. Formaldehyde-assisted isolation of regulatory elements sequencing (FAIRE-seq) data highlight accessible chromatin regions, which are under control of 1,25(OH)2D3. In addition, public data, such as from the ENCODE project, allow to relate the genome-wide actions of VDR and 1,25(OH)2D3 to those of other proteins within the nucleus. For example, locations of the insulator protein CTCF suggest a segregation of the human genome into chromatin domains, of which more than 1000 contain at least one VDR binding site. The integration of all these genome-wide data facilitates the identification of the most important VDR binding sites and associated primary 1,25(OH)2D3 target genes. Expression changes of these key genes can serve as biomarkers for the actions of vitamin D3 and its metabolites in different tissues and cell types of human individuals. Analysis of primary tissues obtained from vitamin D3 intervention studies using such markers indicated a large inter-individual variation for the efficiency of vitamin D3 supplementation. In conclusion, a genome-wide (over)view on the genomic locations of VDR provides a broader basis for addressing vitamin D's role in health and disease. PMID:24808867

  8. Cell Context Dependent p53 Genome-Wide Binding Patterns and Enrichment at Repeats

    DOE PAGES

    Botcheva, Krassimira; McCorkle, Sean R.

    2014-11-21

    The p53 ability to elicit stress specific and cell type specific responses is well recognized, but how that specificity is established remains to be defined. Whether upon activation p53 binds to its genomic targets in a cell type and stress type dependent manner is still an open question. Here we show that the p53 binding to the human genome is selective and cell context-dependent. We mapped the genomic binding sites for the endogenous wild type p53 protein in the human cancer cell line HCT116 and compared them to those we previously determined in the normal cell line IMR90. We reportmore » distinct p53 genome-wide binding landscapes in two different cell lines, analyzed under the same treatment and experimental conditions, using the same ChIP-seq approach. This is evidence for cell context dependent p53 genomic binding. The observed differences affect the p53 binding sites distribution with respect to major genomic and epigenomic elements (promoter regions, CpG islands and repeats). We correlated the high-confidence p53 ChIP-seq peaks positions with the annotated human repeats (UCSC Human Genome Browser) and observed both common and cell line specific trends. In HCT116, the p53 binding was specifically enriched at LINE repeats, compared to IMR90 cells. The p53 genome-wide binding patterns in HCT116 and IMR90 likely reflect the different epigenetic landscapes in these two cell lines, resulting from cancer-associated changes (accumulated in HCT116) superimposed on tissue specific differences (HCT116 has epithelial, while IMR90 has mesenchymal origin). In conclusion, our data support the model for p53 binding to the human genome in a highly selective manner, mobilizing distinct sets of genes, contributing to distinct pathways.« less

  9. Cell Context Dependent p53 Genome-Wide Binding Patterns and Enrichment at Repeats

    SciTech Connect

    Botcheva, Krassimira; McCorkle, Sean R.

    2014-11-21

    The p53 ability to elicit stress specific and cell type specific responses is well recognized, but how that specificity is established remains to be defined. Whether upon activation p53 binds to its genomic targets in a cell type and stress type dependent manner is still an open question. Here we show that the p53 binding to the human genome is selective and cell context-dependent. We mapped the genomic binding sites for the endogenous wild type p53 protein in the human cancer cell line HCT116 and compared them to those we previously determined in the normal cell line IMR90. We report distinct p53 genome-wide binding landscapes in two different cell lines, analyzed under the same treatment and experimental conditions, using the same ChIP-seq approach. This is evidence for cell context dependent p53 genomic binding. The observed differences affect the p53 binding sites distribution with respect to major genomic and epigenomic elements (promoter regions, CpG islands and repeats). We correlated the high-confidence p53 ChIP-seq peaks positions with the annotated human repeats (UCSC Human Genome Browser) and observed both common and cell line specific trends. In HCT116, the p53 binding was specifically enriched at LINE repeats, compared to IMR90 cells. The p53 genome-wide binding patterns in HCT116 and IMR90 likely reflect the different epigenetic landscapes in these two cell lines, resulting from cancer-associated changes (accumulated in HCT116) superimposed on tissue specific differences (HCT116 has epithelial, while IMR90 has mesenchymal origin). In conclusion, our data support the model for p53 binding to the human genome in a highly selective manner, mobilizing distinct sets of genes, contributing to distinct pathways.

  10. Cell context dependent p53 genome-wide binding patterns and enrichment at repeats.

    PubMed

    Botcheva, Krassimira; McCorkle, Sean R

    2014-01-01

    The p53 ability to elicit stress specific and cell type specific responses is well recognized, but how that specificity is established remains to be defined. Whether upon activation p53 binds to its genomic targets in a cell type and stress type dependent manner is still an open question. Here we show that the p53 binding to the human genome is selective and cell context-dependent. We mapped the genomic binding sites for the endogenous wild type p53 protein in the human cancer cell line HCT116 and compared them to those we previously determined in the normal cell line IMR90. We report distinct p53 genome-wide binding landscapes in two different cell lines, analyzed under the same treatment and experimental conditions, using the same ChIP-seq approach. This is evidence for cell context dependent p53 genomic binding. The observed differences affect the p53 binding sites distribution with respect to major genomic and epigenomic elements (promoter regions, CpG islands and repeats). We correlated the high-confidence p53 ChIP-seq peaks positions with the annotated human repeats (UCSC Human Genome Browser) and observed both common and cell line specific trends. In HCT116, the p53 binding was specifically enriched at LINE repeats, compared to IMR90 cells. The p53 genome-wide binding patterns in HCT116 and IMR90 likely reflect the different epigenetic landscapes in these two cell lines, resulting from cancer-associated changes (accumulated in HCT116) superimposed on tissue specific differences (HCT116 has epithelial, while IMR90 has mesenchymal origin). Our data support the model for p53 binding to the human genome in a highly selective manner, mobilizing distinct sets of genes, contributing to distinct pathways.

  11. Cell Context Dependent p53 Genome-Wide Binding Patterns and Enrichment at Repeats

    PubMed Central

    Botcheva, Krassimira; McCorkle, Sean R.

    2014-01-01

    The p53 ability to elicit stress specific and cell type specific responses is well recognized, but how that specificity is established remains to be defined. Whether upon activation p53 binds to its genomic targets in a cell type and stress type dependent manner is still an open question. Here we show that the p53 binding to the human genome is selective and cell context-dependent. We mapped the genomic binding sites for the endogenous wild type p53 protein in the human cancer cell line HCT116 and compared them to those we previously determined in the normal cell line IMR90. We report distinct p53 genome-wide binding landscapes in two different cell lines, analyzed under the same treatment and experimental conditions, using the same ChIP-seq approach. This is evidence for cell context dependent p53 genomic binding. The observed differences affect the p53 binding sites distribution with respect to major genomic and epigenomic elements (promoter regions, CpG islands and repeats). We correlated the high-confidence p53 ChIP-seq peaks positions with the annotated human repeats (UCSC Human Genome Browser) and observed both common and cell line specific trends. In HCT116, the p53 binding was specifically enriched at LINE repeats, compared to IMR90 cells. The p53 genome-wide binding patterns in HCT116 and IMR90 likely reflect the different epigenetic landscapes in these two cell lines, resulting from cancer-associated changes (accumulated in HCT116) superimposed on tissue specific differences (HCT116 has epithelial, while IMR90 has mesenchymal origin). Our data support the model for p53 binding to the human genome in a highly selective manner, mobilizing distinct sets of genes, contributing to distinct pathways. PMID:25415302

  12. Genome scans for Q1 and Q2 on general population replicates using Loki.

    PubMed

    Shmulewitz, D; Heath, S C

    2001-01-01

    The Markov Chain Monte Carlo linkage package Loki was used to perform a genome scan under realistic conditions (using a 10-cM marker map without marker data on unsampled individuals, analyzing each chromosome separately, and without knowing the answers) for traits Q1 and Q2 on general population replicate 1. Using this approach we detected and correctly localized MG1 for Q1 and MG3 for Q2. We then repeated the analyses on replicate 1 and the "best replicate" (42) adding more information (using marker data on everyone, fitting a polygenic effect, and analyzing multiple chromosomes jointly) to see the effect on the detection of trait loci. We found that adding more data often improves the quality of the linkage signal, and reduces the false positive rate, but did not allow the detection of trait loci missed by the initial analysis. We also investigated the convergence of the sampler by repeating one multi-chromosome analysis six times with different random number seeds. We concluded that a strategy of performing a single chromosome scan using a moderate number of sampling iterations, followed by a multi-chromosome analysis of all chromosomes with linkage signals detected in the first scan using a longer sampling run, was an effective way of performing a genome scan on this data set.

  13. Genome-wide meta-analysis of cerebral white matter hyperintensities in patients with stroke

    PubMed Central

    Zhang, Cathy R.; Adib-Samii, Poneh; Devan, William J.; Parsons, Owen E.; Lanfranconi, Silvia; Gregory, Sarah; Cloonan, Lisa; Falcone, Guido J.; Radmanesh, Farid; Fitzpatrick, Kaitlin; Kanakis, Allison; Barrick, Thomas R.; Moynihan, Barry; Lewis, Cathryn M.; Boncoraglio, Giorgio B.; Lemmens, Robin; Thijs, Vincent; Sudlow, Cathie; Wardlaw, Joanna; Rothwell, Peter M.; Meschia, James F.; Worrall, Bradford B.; Levi, Christopher; Bevan, Steve; Furie, Karen L.; Dichgans, Martin; Rosand, Jonathan; Markus, Hugh S.; Rost, Natalia

    2016-01-01

    Objective: For 3,670 stroke patients from the United Kingdom, United States, Australia, Belgium, and Italy, we performed a genome-wide meta-analysis of white matter hyperintensity volumes (WMHV) on data imputed to the 1000 Genomes reference dataset to provide insights into disease mechanisms. Methods: We first sought to identify genetic associations with white matter hyperintensities in a stroke population, and then examined whether genetic loci previously linked to WMHV in community populations are also associated in stroke patients. Having established that genetic associations are shared between the 2 populations, we performed a meta-analysis testing which associations with WMHV in stroke-free populations are associated overall when combined with stroke populations. Results: There were no associations at genome-wide significance with WMHV in stroke patients. All previously reported genome-wide significant associations with WMHV in community populations shared direction of effect in stroke patients. In a meta-analysis of the genome-wide significant and suggestive loci (p < 5 × 10−6) from community populations (15 single nucleotide polymorphisms in total) and from stroke patients, 6 independent loci were associated with WMHV in both populations. Four of these are novel associations at the genome-wide level (rs72934505 [NBEAL1], p = 2.2 × 10−8; rs941898 [EVL], p = 4.0 × 10−8; rs962888 [C1QL1], p = 1.1 × 10−8; rs9515201 [COL4A2], p = 6.9 × 10−9). Conclusions: Genetic associations with WMHV are shared in otherwise healthy individuals and patients with stroke, indicating common genetic susceptibility in cerebral small vessel disease. PMID:26674333

  14. A genome-wide association study identifies multiple loci associated with mathematics ability and disability.

    PubMed

    Docherty, S J; Davis, O S P; Kovas, Y; Meaburn, E L; Dale, P S; Petrill, S A; Schalkwyk, L C; Plomin, R

    2010-03-01

    Numeracy is as important as literacy and exhibits a similar frequency of disability. Although its etiology is relatively poorly understood, quantitative genetic research has demonstrated mathematical ability to be moderately heritable. In this first genome-wide association study (GWAS) of mathematical ability and disability, 10 out of 43 single nucleotide polymorphism (SNP) associations nominated from two high- vs. low-ability (n = 600 10-year-olds each) scans of pooled DNA were validated (P < 0.05) in an individually genotyped sample of (*)2356 individuals spanning the entire distribution of mathematical ability, as assessed by teacher reports and online tests. Although the effects are of the modest sizes now expected for complex traits and require further replication, interesting candidate genes are implicated such as NRCAM which encodes a neuronal cell adhesion molecule. When combined into a set, the 10 SNPs account for 2.9% (F = 56.85; df = 1 and 1881; P = 7.277e-14) of the phenotypic variance. The association is linear across the distribution consistent with a quantitative trait locus (QTL) hypothesis; the third of children in our sample who harbour 10 or more of the 20 risk alleles identified are nearly twice as likely (OR = 1.96; df = 1; P = 3.696e-07) to be in the lowest performing 15% of the distribution. Our results correspond with those of quantitative genetic research in indicating that mathematical ability and disability are influenced by many genes generating small effects across the entire spectrum of ability, implying that more highly powered studies will be needed to detect and replicate these QTL associations.

  15. Genome-Wide Significant Linkage of Schizophrenia-Related Neuroanatomical Trait to 12q24

    PubMed Central

    Sprooten, Emma; Gupta, Cota Navin; Knowles, Emma EM; McKay, D Reese; Mathias, Samuel R; Curran, Joanne E; Kent, Jack W; Carless, Melanie A; Almeida, Marcio A; Dyer, Thomas D; Göring, Harald HH; Olvera, Rene L; Kochunov, Peter; Fox, Peter T; Duggirala, Ravi; Almasy, Laura; Calhoun, Vince D.; Blangero, John; Turner, Jessica A; Glahn, David C

    2015-01-01

    The insula and medial prefrontal cortex (mPFC) share functional, histological, transcriptional and developmental characteristics and they serve higher cognitive functions of theoretical relevance to schizophrenia and related disorders. Meta-analyses and multivariate analysis of structural magnetic resonance imaging (MRI) scans indicate that gray matter density and volume reductions in schizophrenia are the most consistent and pronounced in a network primarily composed of the insula and mPFC. We used source-based morphometry, a multivariate technique optimized for structural MRI, in a large sample of randomly ascertained pedigrees (N = 887) to derive an insula-mPFC component and to investigate its genetic determinants. Firstly, we replicated the insula-mPFC gray matter component as an independent source of gray matter variation in the general population, and verified its relevance to schizophrenia in an independent case-control sample. Secondly, we showed that the neuroanatomical variation defined by this component is largely determined by additive genetic variation (h2 = 0.59), and genome-wide linkage analysis resulted in a significant linkage peak at 12q24 (LOD = 3.76). This region has been of significant interest to psychiatric genetics as it contains the Darier’s disease locus and other proposed susceptibility genes (e.g. DAO, NOS1), and it has been linked to affective disorders and schizophrenia in multiple populations. Thus, in conjunction with previous clinical studies, our data imply that one or more psychiatric risk variants at 12q24 are co-inherited with reductions in mPFC and insula gray matter concentration. PMID:26440917

  16. Genome-wide association analysis of eating disorder-related symptoms, behaviors, and personality traits.

    PubMed

    Boraska, Vesna; Davis, Oliver S P; Cherkas, Lynn F; Helder, Sietske G; Harris, Juliette; Krug, Isabel; Liao, Thomas Pei-Chi; Treasure, Janet; Ntalla, Ioanna; Karhunen, Leila; Keski-Rahkonen, Anna; Christakopoulou, Danai; Raevuori, Anu; Shin, So-Youn; Dedoussis, George V; Kaprio, Jaakko; Soranzo, Nicole; Spector, Tim D; Collier, David A; Zeggini, Eleftheria

    2012-10-01

    Eating disorders (EDs) are common, complex psychiatric disorders thought to be caused by both genetic and environmental factors. They share many symptoms, behaviors, and personality traits, which may have overlapping heritability. The aim of the present study is to perform a genome-wide association scan (GWAS) of six ED phenotypes comprising three symptom traits from the Eating Disorders Inventory 2 [Drive for Thinness (DT), Body Dissatisfaction (BD), and Bulimia], Weight Fluctuation symptom, Breakfast Skipping behavior and Childhood Obsessive-Compulsive Personality Disorder trait (CHIRP). Investigated traits were derived from standardized self-report questionnaires completed by the TwinsUK population-based cohort. We tested 283,744 directly typed SNPs across six phenotypes of interest in the TwinsUK discovery dataset and followed-up signals from various strata using a two-stage replication strategy in two independent cohorts of European ancestry. We meta-analyzed a total of 2,698 individuals for DT, 2,680 for BD, 2,789 (821 cases/1,968 controls) for Bulimia, 1,360 (633 cases/727 controls) for Childhood Obsessive-Compulsive Personality Disorder trait, 2,773 (761 cases/2,012 controls) for Breakfast Skipping, and 2,967 (798 cases/2,169 controls) for Weight Fluctuation symptom. In this GWAS analysis of six ED-related phenotypes, we detected association of eight genetic variants with P < 10(-5) . Genetic variants that showed suggestive evidence of association were previously associated with several psychiatric disorders and ED-related phenotypes. Our study indicates that larger-scale collaborative studies will be needed to achieve the necessary power to detect loci underlying ED-related traits.

  17. Genome-wide association studies of female reproduction in tropically adapted beef cattle.

    PubMed

    Hawken, R J; Zhang, Y D; Fortes, M R S; Collis, E; Barris, W C; Corbet, N J; Williams, P J; Fordyce, G; Holroyd, R G; Walkley, J R W; Barendse, W; Johnston, D J; Prayaga, K C; Tier, B; Reverter, A; Lehnert, S A

    2012-05-01

    The genetics of reproduction is poorly understood because the heritabilities of traits currently recorded are low. To elucidate the genetics underlying reproduction in beef cattle, we performed a genome-wide association study using the bovine SNP50 chip in 2 tropically adapted beef cattle breeds, Brahman and Tropical Composite. Here we present the results for 3 female reproduction traits: 1) age at puberty, defined as age in days at first observed corpus luteum (CL) after frequent ovarian ultrasound scans (AGECL); 2) the postpartum anestrous interval, measured as the number of days from calving to first ovulation postpartum (first rebreeding interval, PPAI); and 3) the occurrence of the first postpartum ovulation before weaning in the first rebreeding period (PW), defined from PPAI. In addition, correlated traits such as BW, height, serum IGF1 concentration, condition score, and fatness were also examined. In the Brahman and Tropical Composite cattle, 169 [false positive rate (FPR) = 0.262] and 84 (FPR = 0.581) SNP, respectively, were significant (P < 0.001) for AGECL. In Brahman, 41% of these significant markers mapped to a single chromosomal region on BTA14. In Tropical Composites, 16% of these significant markers were located on BTA5. For PPAI, 66 (FPR = 0.67) and 113 (FPR = 0.432) SNP were significant (P < 0.001) in Brahman and Tropical Composite, respectively, whereas for PW, 68 (FPR = 0.64) and 113 (FPR = 0.432) SNP were significant (P < 0.01). In Tropical Composites, the largest concentration of PPAI markers were located on BTA5 [19% (PPAI) and 23% (PW)], and BTA16 [17% (PPAI) and 18% (PW)]. In Brahman cattle, the largest concentration of markers for postpartum anestrus was located on BTA3 (14% for PPAI and PW) and BTA14 (17% PPAI). Very few of the significant markers for female reproduction traits for the Brahman and Tropical Composite breeds were located in the same chromosomal regions. However, fatness and BW traits as well as serum IGF1 concentration

  18. Genome-wide microsatellite characterization and marker development in the sequenced Brassica crop species.

    PubMed

    Shi, Jiaqin; Huang, Shunmou; Zhan, Jiepeng; Yu, Jingyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2014-02-01

    Although much research has been conducted, the pattern of microsatellite distribution has remained ambiguous, and the development/utilization of microsatellite markers has still been limited/inefficient in Brassica, due to the lack of genome sequences. In view of this, we conducted genome-wide microsatellite characterization and marker development in three recently sequenced Brassica crops: Brassica rapa, Brassica oleracea and Brassica napus. The analysed microsatellite characteristics of these Brassica species were highly similar or almost identical, which suggests that the pattern of microsatellite distribution is likely conservative in Brassica. The genomic distribution of microsatellites was highly non-uniform and positively or negatively correlated with genes or transposable elements, respectively. Of the total of 115 869, 185 662 and 356 522 simple sequence repeat (SSR) markers developed with high frequencies (408.2, 343.8 and 356.2 per Mb or one every 2.45, 2.91 and 2.81 kb, respectively), most represented new SSR markers, the majority had determined physical positions, and a large number were genic or putative single-locus SSR markers. We also constructed a comprehensive database for the newly developed SSR markers, which was integrated with public Brassica SSR markers and annotated genome components. The genome-wide SSR markers developed in this study provide a useful tool to extend the annotated genome resources of sequenced Brassica species to genetic study/breeding in different Brassica species.

  19. Species-wide genome sequence and nucleotide polymorphisms from the model allopolyploid plant Brassica napus.

    PubMed

    Schmutzer, Thomas; Samans, Birgit; Dyrszka, Emmanuelle; Ulpinnis, Chris; Weise, Stephan; Stengel, Doreen; Colmsee, Christian; Lespinasse, Denis; Micic, Zeljko; Abel, Stefan; Duchscherer, Peter; Breuer, Frank; Abbadi, Amine; Leckband, Gunhild; Snowdon, Rod; Scholz, Uwe

    2015-12-08

    Brassica napus (oilseed rape, canola) is one of the world's most important sources of vegetable oil for human nutrition and biofuel, and also a model species for studies investigating the evolutionary consequences of polyploidisation. Strong bottlenecks during its recent origin from interspecific hybridisation, and subsequently through intensive artificial selection, have severely depleted the genetic diversity available for breeding. On the other hand, high-throughput genome profiling technologies today provide unprecedented scope to identify, characterise and utilise genetic diversity in primary and secondary crop gene pools. Such methods also enable implementation of genomic selection strategies to accelerate breeding progress. The key prerequisite is availability of high-quality sequence data and identification of high-quality, genome-wide sequence polymorphisms representing relevant gene pools. We present comprehensive genome resequencing data from a panel of 52 highly diverse natural and synthetic B. napus accessions, along with a stringently selected panel of 4.3 million high-confidence, genome-wide SNPs. The data is of great interest for genomics-assisted breeding and for evolutionary studies on the origins and consequences in allopolyploidisation in plants.

  20. Mapping the sensory perception of apple using descriptive sensory evaluation in a genome wide association study

    PubMed Central

    Amyotte, Beatrice; Bowen, Amy J.; Banks, Travis; Rajcan, Istvan; Somers, Daryl J.

    2017-01-01

    Breeding apples is a long-term endeavour and it is imperative that new cultivars are selected to have outstanding consumer appeal. This study has taken the approach of merging sensory science with genome wide association analyses in order to map the human perception of apple flavour and texture onto the apple genome. The goal was to identify genomic associations that could be used in breeding apples for improved fruit quality. A collection of 85 apple cultivars was examined over two years through descriptive sensory evaluation by a trained sensory panel. The trained sensory panel scored randomized sliced samples of each apple cultivar for seventeen taste, flavour and texture attributes using controlled sensory evaluation practices. In addition, the apple collection was subjected to genotyping by sequencing for marker discovery. A genome wide association analysis suggested significant genomic associations for several sensory traits including juiciness, crispness, mealiness and fresh green apple flavour. The findings include previously unreported genomic regions that could be used in apple breeding and suggest that similar sensory association mapping methods could be applied in other plants. PMID:28231290

  1. Genome-wide association study of swine farrowing traits. Part I: genetic and genomic parameter estimates.

    PubMed

    Schneider, J F; Rempel, L A; Rohrer, G A

    2012-10-01

    The primary objective of this study was to determine genetic and genomic parameters among swine (Sus scrofa) farrowing traits. Genetic parameters were obtained using MTDFREML. Genomic parameters were obtained using GENSEL. Genetic and residual variances obtained from MTDFREML were used as priors for the Bayes C analysis of GENSEL. Farrowing traits included total number born (TNB), number born alive (NBA), number born dead (NBD), number stillborn (NSB), number of mummies (MUM), litter birth weight (LBW), and average piglet birth weight (ABW). Statistically significant heritabilities included TNB (0.09, P = 0.048), NBA (0.09, P = 0.041), LBW (0.20, P = 0.002), and ABW (0.26, P < 0.0001). Statistically significant genetic correlations included TNB-NBA (0.97, P < 0.0001), TNB-LBW (0.74, P < 0.0001), NBA-LBW (0.56, P < 0.0017), NSB-LBW (0.87, P < 0.0395), and LBW-ABW (0.63, P < 0.0002). Genetic parameters are similar to others found in the literature. The proportion of phenotypic variance explained by genomic markers (GP) generated by GENSEL was TNB (0.04), NBA (0.06), NBD (0.00), NSB (0.01), MUM (0.00), LBW (0.11), and ABW (0.31). Limited information is available in the literature about genomic parameters. Only the GP estimate for NSB is significantly lower than what has been published. The GP estimate for ABW is greater than the estimate for heritability found in this study. Other traits with significant heritability had GP estimates half the value of heritability. This research indicates that significant genetic markers will be found for TNB, NBA, LBW, and ABW that will have either immediate use in industry or provide a roadmap to further research with fine mapping or sequencing of areas of significance. Furthermore, these results indicate that genomic selection implemented at an early age would have similar annual progress as traditional selection, and could be incorporated along with traditional selection procedures to improve genetic progress of litter traits.

  2. Genome Wide Allele Frequency Fingerprints (GWAFFs) of Populations via Genotyping by Sequencing

    PubMed Central

    Byrne, Stephen; Czaban, Adrian; Studer, Bruno; Panitz, Frank; Bendixen, Christian; Asp, Torben

    2013-01-01

    Genotyping-by-Sequencing (GBS) is an excellent tool for characterising genetic variation between plant genomes. To date, its use has been reported only for genotyping of single individuals. However, there are many applications where resolving allele frequencies within populations on a genome-wide scale would be very powerful, examples include the breeding of outbreeding species, varietal protection in outbreeding species, monitoring changes in population allele frequencies. This motivated us to test the potential to use GBS to evaluate allele frequencies within populations. Perennial ryegrass is an outbreeding species, and breeding programs are based upon selection on populations. We tested two restriction enzymes for their efficiency in complexity reduction of the perennial ryegrass genome. The resulting profiles have been termed Genome Wide Allele Frequency Fingerprints (GWAFFs), and we have shown how these fingerprints can be used to distinguish between plant populations. Even at current costs and throughput, using sequencing to directly evaluate populations on a genome-wide scale is viable. GWAFFs should find many applications, from varietal development in outbreeding species right through to playing a role in protecting plant breeders’ rights. PMID:23469194

  3. Challenges and pitfalls in the characterization of anonymous outlier AFLP markers in non-model species: lessons from an ocellated lizard genome scan.

    PubMed

    Nunes, V L; Beaumont, M A; Butlin, R K; Paulo, O S

    2012-12-01

    In the last few years, dozens of studies have documented the detection of loci influenced by selection from genome scans in a wide range of non-model species. Many of those studies used amplified fragment length polymorphism (AFLP) markers, which became popular for being easily applicable to any organism. However, because they are anonymous markers, AFLPs impose many challenges for their isolation and identification. Most recent AFLP genome scans used capillary electrophoresis (CE), which adds even more obstacles to the isolation of bands with a specific size for sequencing. These caveats might explain the extremely low number of studies that moved from the detection of outlier AFLP markers to their actual isolation and characterization. We document our efforts to characterize a set of outlier AFLP markers from a previous genome scan with CE in ocellated lizards (Lacerta lepida). Seven outliers were successfully isolated, cloned and sequenced. Their sequences are noncoding and show internal indels or polymorphic repetitive elements (microsatellites). Three outliers were converted into codominant markers by using specific internal primers to sequence and screen population variability from undigested DNA. Amplification in closely related lizard species was also achieved, revealing remarkable interspecific conservation in outlier loci sequences. We stress the importance of following up AFLP genome scans to validate selection signatures of outlier loci, but also report the main challenges and pitfalls that may be faced during the process.

  4. Challenges and pitfalls in the characterization of anonymous outlier AFLP markers in non-model species: lessons from an ocellated lizard genome scan

    PubMed Central

    Nunes, V L; Beaumont, M A; Butlin, R K; Paulo, O S

    2012-01-01

    In the last few years, dozens of studies have documented the detection of loci influenced by selection from genome scans in a wide range of non-model species. Many of those studies used amplified fragment length polymorphism (AFLP) markers, which became popular for being easily applicable to any organism. However, because they are anonymous markers, AFLPs impose many challenges for their isolation and identification. Most recent AFLP genome scans used capillary electrophoresis (CE), which adds even more obstacles to the isolation of bands with a specific size for sequencing. These caveats might explain the extremely low number of studies that moved from the detection of outlier AFLP markers to their actual isolation and characterization. We document our efforts to characterize a set of outlier AFLP markers from a previous genome scan with CE in ocellated lizards (Lacerta lepida). Seven outliers were successfully isolated, cloned and sequenced. Their sequences are noncoding and show internal indels or polymorphic repetitive elements (microsatellites). Three outliers were converted into codominant markers by using specific internal primers to sequence and screen population variability from undigested DNA. Amplification in closely related lizard species was also achieved, revealing remarkable interspecific conservation in outlier loci sequences. We stress the importance of following up AFLP genome scans to validate selection signatures of outlier loci, but also report the main challenges and pitfalls that may be faced during the process. PMID:22892639

  5. A genome-wide analysis of common fragile sites: What features determine chromosomal instability in the human genome?

    PubMed Central

    Fungtammasan, Arkarachai; Walsh, Erin; Chiaromonte, Francesca; Eckert, Kristin A.; Makova, Kateryna D.

    2012-01-01

    Chromosomal common fragile sites (CFSs) are unstable genomic regions that break under replication stress and are involved in structural variation. They frequently are sites of chromosomal rearrangements in cancer and of viral integration. However, CFSs are undercharacterized at the molecular level and thus difficult to predict computationally. Newly available genome-wide profiling studies provide us with an unprecedented opportunity to associate CFSs with features of their local genomic contexts. Here, we contrasted the genomic landscape of cytogenetically defined aphidicolin-induced CFSs (aCFSs) to that of nonfragile sites, using multiple logistic regression. We also analyzed aCFS breakage frequencies as a function of their genomic landscape, using standard multiple regression. We show that local genomic features are effective predictors both of regions harboring aCFSs (explaining ∼77% of the deviance in logistic regression models) and of aCFS breakage frequencies (explaining ∼45% of the variance in standard regression models). In our optimal models (having highest explanatory power), aCFSs are predominantly located in G-negative chromosomal bands and away from centromeres, are enriched in Alu repeats, and have high DNA flexibility. In alternative models, CpG island density, transcription start site density, H3K4me1 coverage, and mononucleotide microsatellite coverage are significant predictors. Also, aCFSs have high fragility when colocated with evolutionarily conserved chromosomal breakpoints. Our models are predictive of the fragility of aCFSs mapped at a higher resolution. Importantly, the genomic features we identified here as significant predictors of fragility allow us to draw valuable inferences on the molecular mechanisms underlying aCFSs. PMID:22456607

  6. Overlapping chromatin-remodeling systems collaborate genome wide at dynamic chromatin transitions.

    PubMed

    Morris, Stephanie A; Baek, Songjoon; Sung, Myong-Hee; John, Sam; Wiench, Malgorzata; Johnson, Thomas A; Schiltz, R Louis; Hager, Gordon L

    2014-01-01

    ATP-dependent chromatin remodeling is an essential process required for the dynamic organization of chromatin structure. Here we describe the genome-wide location and activity of three remodeler proteins with diverse physiological functions in the mouse genome: Brg1, Chd4 and Snf2h. The localization patterns of all three proteins substantially overlap with one another and with regions of accessible chromatin. Furthermore, using inducible mutant variants, we demonstrate that the catalytic activity of these proteins contributes to the remodeling of chromatin genome wide and that each of these remodelers can independently regulate chromatin reorganization at distinct sites. Many regions require the activity of more than one remodeler to regulate accessibility. These findings provide a dynamic view of chromatin organization and highlight the differential contributions of remodelers to chromatin maintenance in higher eukaryotes.

  7. Genome-wide association study for semen quality traits in German Warmblood stallions.

    PubMed

    Gottschalk, Maren; Metzger, Julia; Martinsson, Gunilla; Sieme, Harald; Distl, Ottmar

    2016-08-01

    We performed a genome-wide association study for semen quality traits in 139 German Warmblood stallions. Stallions were genotyped using the Illumina equine SNP50 Beadchip. Traits analysed were de-regressed estimated breeding values (EBVs) for gel-free volume, sperm concentration, total number of sperm, progressive motility and the total number of progressively motile sperm. The GWAS revealed 29 SNPs on 12 different chromosomes as genome-wide significantly associated with semen quality traits. For ten genomic regions we could retrieve candidate genes influencing stallion fertility. Among the candidate genes, we could find the genes encoding cysteine-rich secretory proteins (CRISP1, CRISP2 and CRISP3). This was the first GWAS in horses performed for semen quality traits.

  8. Genome-wide Association Study Identifies Loci for the Polled Phenotype in Yak

    PubMed Central

    Wu, Xiaoyun; Wang, Kun; Ding, Xuezhi; Wang, Mingcheng; Chu, Min; Xie, Xiuyue; Qiu, Qiang; Yan, Ping

    2016-01-01

    The absence of horns, known as the polled phenotype, is an economically important trait in modern yak husbandry, but the genomic structure and genetic basis of this phenotype have yet to be discovered. Here, we conducted a genome-wide association study with a panel of 10 horned and 10 polled yaks using whole genome sequencing. We mapped the POLLED locus to a 200-kb interval, which comprises three protein-coding genes. Further characterization of the candidate region showed recent artificial selection signals resulting from the breeding process. We suggest that expressional variations rather than structural variations in protein probably contribute to the polled phenotype. Our results not only represent the first and important step in establishing the genomic structure of the polled region in yak, but also add to our understanding of the polled trait in bovid species. PMID:27389700

  9. Genome-wide de Novo Prediction of Proximal and Distal Tissue-Specific Enhancers

    SciTech Connect

    Loots, G G; Ovcharenko, I V

    2005-11-03

    Determining how transcriptional regulatory networks are encoded in the human genome is essential for understanding how cellular processes are directed. Here, we present a novel approach for systematically predicting tissue specific regulatory elements (REs) that blends genome-wide expression profiling, vertebrate genome comparisons, and pattern analysis of transcription factor binding sites. This analysis yields 4,670 candidate REs in the human genome with distinct tissue specificities, the majority of which reside far away from transcription start sites. We identify key transcription factors (TFs) for 34 distinct tissues and demonstrate that tissue-specific gene expression relies on multiple regulatory pathways employing similar, but different cohorts of interacting TFs. The methods and results we describe provide a global view of tissue specific gene regulation in humans, and propose a strategy for deciphering the transcriptional regulatory code in eukaryotes.

  10. Genome-wide analysis of zygotic linkage disequilibrium and its components in crossbred cattle

    PubMed Central

    2012-01-01

    Background Linkage disequilibrium (LD) between genes at linked or independent loci can occur at gametic and zygotic levels known asgametic LD and zygotic LD, respectively. Gametic LD is well known for its roles in fine-scale mapping of quantitative trait loci, genomic selection and evolutionary inference. The less-well studied is the zygotic LD and its components that can be also estimated directly from the unphased SNPs. Results This study was set up to investigate the genome-wide extent and patterns of zygotic LD and its components in a crossbred cattle population using the genomic data from the Illumina BovineSNP50 beadchip. The animal population arose from repeated crossbreeding of multiple breeds and selection for growth and cow reproduction. The study showed that similar genomic structures in gametic and zygotic LD were observed, with zygotic LD decaying faster than gametic LD over marker distance. The trigenic and quadrigenic disequilibria were generally two- to three-fold smaller than the usual digenic disequilibria (gametic or composite LD). There was less power of testing for these high-order genic disequilibria than for the digenic disequilibria. The power estimates decreased with the marker distance between markers though the decay trend is more obvious for the digenic disequilibria than for high-order disequilibria. Conclusions This study is the first major genome-wide survey of all non-allelic associations between pairs of SNPs in a cattle population. Such analysis allows us to assess the relative importance of gametic LD vs. all other non-allelic genic LDs regardless of whether or not the population is in HWE. The observed predominance of digenic LD (gametic or composite LD) coupled with insignificant high-order trigenic and quadrigenic disequilibria supports the current intensive focus on the use of high-density SNP markers for genome-wide association studies and genomic selection activities in the cattle population. PMID:22827586

  11. Genome-Wide Association Study in African-Americans with Systemic Lupus Erythematosus

    DTIC Science & Technology

    2013-09-01

    Americans with Systemic Lupus Erythematosus PRINCIPAL INVESTIGATOR: John Harley, M.D., Ph.D...September 2012 – 31 August 2013 4. TITLE AND SUBTITLE 5a. CONTRACT NUMBER Genome-Wide Association Study in African-Americans with Systemic Lupus ...SUPPLEMENTARY NOTES 14. ABSTRACT Systemic lupus erythematosus ( lupus ) is a potentially deadly systemic autoimmune disease that disproportionately

  12. Genome wide association analysis for seedling response traits to thermal stress in sorghum germplasm

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The sorghum association panel exhibited extensive variation for seedling traits under cold and heat stress. Genome-wide analyses identified thirty single nucleotide polymorphisms (SNPs) that were strongly associated with traits measured at seedling stage under cold stress and tagged genes that act a...

  13. Genome-wide association analysis of age at onset and psychotic symptoms in bipolar disorder.

    PubMed

    Belmonte Mahon, Pamela; Pirooznia, Mehdi; Goes, Fernando S; Seifuddin, Fayaz; Steele, Jo; Lee, Phil Hyoun; Huang, Jie; Hamshere, Marian L; Depaulo, J Raymond; Kelsoe, John R; Rietschel, Marcella; Nöthen, Markus; Cichon, Sven; Gurling, Hugh; Purcell, Shaun; Smoller, Jordan W; Craddock, Nick; Schulze, Thomas G; McMahon, Francis J; Potash, James B; Zandi, Peter P

    2011-04-01

    Genome-wide association studies (GWAS) have identified several susceptibility loci for bipolar disorder (BP), most notably ANK3. However, most of the inherited risk for BP remains unexplained. One reason for the limited success may be the genetic heterogeneity of BP. Clinical sub-phenotypes of BP may identify more etiologically homogeneous subsets of patients, which can be studied with increased power to detect genetic variation. Here, we report on a mega-analysis of two widely studied sub-phenotypes of BP, age at onset and psychotic symptoms, which are familial and clinically significant. We combined data from three GWAS: NIMH Bipolar Disorder Genetic Association Information Network (GAIN-BP), NIMH Bipolar Disorder Genome Study (BiGS), and a German sample. The combined sample consisted of 2,836 BP cases with information on sub-phenotypes and 2,744 controls. Imputation was performed, resulting in 2.3 million SNPs available for analysis. No SNP reached genome-wide significance for either sub-phenotype. In addition, no SNP reached genome-wide significance in a meta-analysis with an independent replication sample. We had 80% power to detect associations with a common SNP at an OR of 1.6 for psychotic symptoms and a mean difference of 1.8 years in age at onset. Age at onset and psychotic symptoms in BP may be influenced by many genes of smaller effect sizes or other variants not measured well by SNP arrays, such as rare alleles.

  14. Genome-wide association study of maize identifies genes affecting leaf architecture

    Technology Transfer Automated Retrieval System (TEKTRAN)

    U.S. maize yield has increased eightfold in the past 80 years with half of the improvement attributed to genetics. Changes in maize leaf angle and size provided a basis for more efficient light capture as plant densities increased. Through a genome wide association study (GWAS) of the maize nested a...

  15. A population structure and genome-wide association analysis on the USDA soybean germplasm collection

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genotype-phenotype associations within the soybean (Glycine max) germplasm collection could provide valuable information on the frequency and distribution of alleles affecting economically important traits. Here we performed a genome-wide association study (GWAS) for seed protein and oil content in ...

  16. Mixed linear model approach adapted for genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called ‘compressed MLM,’ that decrea...

  17. Genome-wide association study of swine farrowing traits. Part II: Bayesian analysis of marker data

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Reproductive efficiency has a great impact on the economic success of pork production. Number born alive (NBA) and average piglet birth weight (ABW) contribute greatly to reproductive efficiency. To better understand the underlying genetics of birth traits, a genome wide association study (GWAS) w...

  18. Genome-wide significant predictors of metabolites in the one-carbon metabolism pathway

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Low plasma B-vitamin levels and elevated homocysteine have been associated with cancer, cardiovascular disease, and neurodegenerative disorders. Common variants in FUT2 on chromosome 19q13 were associated with plasma vitamin B12 levels among women in a genome-wide association study (GWAS) in the Nur...

  19. Software engineering the mixed model for genome-wide association studies on large samples

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Mixed models improve the ability to detect phenotype-genotype associations in the presence of population stratification and multiple levels of relatedness in genome-wide association studies (GWAS), but for large data sets the resource consumption becomes impractical. At the same time, the sample siz...

  20. A genome-wide regulatory framework identifies maize Pericarp Color1 (P1) controlled genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    P1 encodes an R2R3-MYB transcription factor responsible for the accumulation of insecticidal flavones in maize silks and red phlobaphene pigments in pericarps and other floral tissues. Using genome-wide expression analyses (RNA-Seq) in pericarps and silks of plants with contrasting P1 alleles combin...

  1. Genome-wide association mapping of partial resistance to Aphanomyces euteiches in pea

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide association mapping has recently emerged as a valuable approach to refine genetic basis of polygenic resistance to plant diseases, which are increasingly used in integrated strategies for durable crop protection. Aphanomyces euteiches is a soil borne pathogen of pea and other legumes wor...

  2. Genome-wide SNP detection, validation, and development of an 8K SNP array for apple

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide...

  3. Genome-Wide Association Study of Intelligence: Additive Effects of Novel Brain Expressed Genes

    ERIC Educational Resources Information Center

    Loo, Sandra K.; Shtir, Corina; Doyle, Alysa E.; Mick, Eric; McGough, James J.; McCracken, James; Biederman, Joseph; Smalley, Susan L.; Cantor, Rita M.; Faraone, Stephen V.; Nelson, Stanley F.

    2012-01-01

    Objective: The purpose of the present study was to identify common genetic variants that are associated with human intelligence or general cognitive ability. Method: We performed a genome-wide association analysis with a dense set of 1 million single-nucleotide polymorphisms (SNPs) and quantitative intelligence scores within an ancestrally…

  4. Genome-Wide Association Study of Receptive Language Ability of 12-Year-Olds

    ERIC Educational Resources Information Center

    Harlaar, Nicole; Meaburn, Emma L.; Hayiou-Thomas, Marianna E.; Davis, Oliver S. P.; Docherty, Sophia; Hanscombe, Ken B.; Haworth, Claire M. A.; Price, Thomas S.; Trzaskowski, Maciej; Dale, Philip S.; Plomin, Robert

    2014-01-01

    Purpose: Researchers have previously shown that individual differences in measures of receptive language ability at age 12 are highly heritable. In the current study, the authors attempted to identify some of the genes responsible for the heritability of receptive language ability using a "genome-wide association" approach. Method: The…

  5. Genome-wide association of meat quality traits and tenderness in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality has a large impact on consumer preference and perception of eating quality. A genome-wide association was performed for pork quality traits [intramuscular fat (IMF)], slice shear force (SSF), color attributes, purge, cooking loss, and pH] from 531 to 1,237 records on barrows and gilts o...

  6. Methods for meta-analysis of genome-wide association studies

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. For increasing N, results from different GWA can be combined in a meta-analysis (MA-...

  7. Meta-analysis of genome wide association studies for pork quality traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Given the importance of pork quality in the meat processing industry, genome-wide association studies were performed for eight meat quality traits and also, a meta-analysis (MA) of GWA was implemented combining independent results from pig populations. Data from three pig datasets (USMARC, Commercia...

  8. A genome-wide association study platform built on iPlant cyber-infrastructure

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We demonstrated a flexible Genome-Wide Association (GWA) Study (GWAS) platform built upon the iPlant Collaborative Cyber-infrastructure. The platform supports big data management, sharing, and large scale study of both genotype and phenotype data on clusters. End users can add their own analysis too...

  9. A genome-wide SNP panel for genetic diversity, mapping and breeding studies in rice

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide SNP resource was developed for rice using the GoldenGate assay and used to genotype 400 landrace accessions of O. sativa. SNPs were originally discovered using Perlegen re-sequencing technology in 20 diverse landraces of O. sativa as part of OryzaSNP project (http://irfgc.irri.org). An...

  10. Genome-wide association analysis for drought tolerance and associated traits in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for drought tolerance and related traits in a Middle American diversity panel comprised of 96 common bean (Phaseolus vulgaris) genotypes. The panel grown under irrigated and rainfed conditions and single n...

  11. Genome-wide association study of agronomic traits in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) using a global Andean diversity panel (ADP) of 237 genotypes of common bean, Phaseolus vulgaris was conducted to gain insight into the genetic architecture of several agronomic traits controlling phenology, biomass, yield components and seed yield. The panel wa...

  12. Genome-wide CNV analysis reveals variants associated with growth traits in Bos indicus

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Background: Apart from single nucleotide polymorphism (SNP), copy number variation (CNV) is another important type of genetic variation, which may affect growth traits and play key roles for the production of beef cattle. To date, no genome-wide association study (GWAS) for CNV and body traits in be...

  13. CNV-based genome wide association study reveals additional variants contributing to meat quality in swine

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Pork quality is important both to the meat processing industry and consumers’ purchasing attitudes. Copy number variation (CNV) is a burgeoning kind of variant that may influence meat quality. Herein, a genome-wide association study (GWAS) was performed between CNVs and meat quality traits in swine....

  14. Genome wide search for variation associated with micronutrient density of developing rice grains

    Technology Transfer Automated Retrieval System (TEKTRAN)

    "Omic" tools are rapidly being employed to delineate the biological framework controlling phenotypes of interest in crop species. An advanced understanding of the genetic basis for quantitative trait variation has been made possible through genome wide association studies (GWAS) that make use of gen...

  15. Genome-wide association analysis of symbiotic nitrogen fixation in common bean

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide association study (GWAS) was conducted to explore the genetic basis of variation for symbiotic nitrogen fixation (SNF) and related traits in the Andean diversity panel (ADP) comprised of 259 common bean (Phaseolus vulgaris) genotypes. The ADP was evaluated for SNF and related traits in...

  16. A comparison of reflector antenna designs for wide-angle scanning

    NASA Technical Reports Server (NTRS)

    Zimmerman, M.; Lee, S. W.; Houshmand, B.; Rahmat-Samii, Y.; Acosta, R.

    1989-01-01

    Conventional reflector antennas are typically designed for up to + or - 20 beamwidths scan. An attempt was made to stretch this scan range to some + or - 300 beamwidths. Six single and dual reflector antennas were compared. It is found that a symmetrical parabolic reflector with f/D = 2 and a single circular waveguide feed has the minimum scan loss (only 0.6 dB at Theta sub 0 = 8 deg, or a 114 beamwidths scan). The scan is achieved by tilting the parabolic reflector by an angle equal to the half-scan angle. The f/D may be shortened if a cluster 7 to 19 elements instead of one element is used for the feed. The cluster excitation is adjusted for each new beam scan direction to compensate for the imperfect field distribution over the reflector aperture. The antenna can be folded into a Cassegrain configuration except that, due to spillover and blockage considerations, the amount of folding achievable is small.

  17. Genome-wide target specificities of CRISPR RNA-guided programmable deaminases.

    PubMed

    Kim, Daesik; Lim, Kayeong; Kim, Sang-Tae; Yoon, Sun-Heui; Kim, Kyoungmi; Ryu, Seuk-Min; Kim, Jin-So