Science.gov

Sample records for genome wide scan

  1. Genome-Wide Scan Reveals Mutation Associated with Melanoma

    MedlinePlus

    ... 1999 Spotlight on Research 2012 July 2012 (historical) Genome-Wide Scan Reveals Mutation Associated with Melanoma A ... out to see if a technology called whole genome sequencing would help them find other genetic risk ...

  2. A genome-wide scan for preeclampsia in the Netherlands.

    PubMed

    Lachmeijer, A M; Arngrímsson, R; Bastiaans, E J; Frigge, M L; Pals, G; Sigurdardóttir, S; Stéfansson, H; Pálsson, B; Nicolae, D; Kong, A; Aarnoudse, J G; Gulcher, J R; Dekker, G A; ten Kate, L P; Stéfansson, K

    2001-10-01

    Preeclampsia, hallmarked by de novo hypertension and proteinuria in pregnancy, has a familial tendency. Recently, a large Icelandic genome-wide scan provided evidence for a maternal susceptibility locus for preeclampsia on chromosome 2p13 which was confirmed by a genome scan from Australia and New Zealand (NZ). The current study reports on a genome-wide scan of Dutch affected sib-pair families. In total 67 Dutch affected sib-pair families, comprising at least two siblings with proteinuric preeclampsia, eclampsia or HELLP-syndrome, were typed for 293 polymorphic markers throughout the genome and linkage analysis was performed. The highest allele sharing lod score of 1.99 was seen on chromosome 12q at 109.5 cM. Two peaks overlapped in the same regions between the Dutch and Icelandic genome-wide scan at chromosome 3p and chromosome 15q. No overlap was seen on 2p. Re-analysis in 38 families without HELLP-syndrome (preeclampsia families) and 34 families with at least one sibling with HELLP syndrome (HELLP families), revealed two peaks with suggestive evidence for linkage in the non-HELLP families on chromosome 10q (lod score 2.38, D10S1432, 93.9 cM) and 22q (lod score 2.41, D22S685, 32.4 cM). The peak on 12q appeared to be associated with HELLP syndrome; it increased to a lod score of 2.1 in the HELLP families and almost disappeared in the preeclampsia families. A nominal peak on chromosome 11 in the preeclampsia families showed overlap with the second highest peak in the Australian/NZ study. Results from our Dutch genome-wide scan indicate that HELLP syndrome might have a different genetic background than preeclampsia. PMID:11781687

  3. Genome-Wide Scan for Methylation Profiles in Keloids

    PubMed Central

    Jones, Lamont R.; Young, William; Divine, George; Datta, Indrani; Chen, Kang Mei; Ozog, David; Worsham, Maria J.

    2015-01-01

    Keloids are benign fibroproliferative tumors of the skin which commonly occur after injury mainly in darker skinned patients. Medical treatment is fraught with high recurrence rates mainly because of an incomplete understanding of the biological mechanisms that lead to keloids. The purpose of this project was to examine keloid pathogenesis from the epigenome perspective of DNA methylation. Genome-wide profiling used the Infinium HumanMethylation450 BeadChip to interrogate DNA from 6 fresh keloid and 6 normal skin samples from 12 anonymous donors. A 3-tiered approach was used to call out genes most differentially methylated between keloid and normal. When compared to normal, of the 685 differentially methylated CpGs at Tier 3, 510 were hypomethylated and 175 were hypermethylated with 190 CpGs in promoter and 495 in nonpromoter regions. The 190 promoter region CpGs corresponded to 152 genes: 96 (63%) were hypomethylated and 56 (37%) hypermethylated. This exploratory genome-wide scan of the keloid methylome highlights a predominance of hypomethylated genomic landscapes, favoring nonpromoter regions. DNA methylation, as an additional mechanism for gene regulation in keloid pathogenesis, holds potential for novel treatments that reverse deleterious epigenetic changes. As an alternative mechanism for regulating genes, epigenetics may explain why gene mutations alone do not provide definitive mechanisms for keloid formation. PMID:26074660

  4. Genome wide association scan for chronic periodontitis implicates novel locus

    PubMed Central

    2014-01-01

    Background There is evidence for a genetic contribution to chronic periodontitis. In this study, we conducted a genome wide association study among 866 participants of the University of Pittsburgh Dental Registry and DNA Repository, whose periodontal diagnosis ranged from healthy (N = 767) to severe chronic periodontitis (N = 99). Methods Genotypingi of over half-million single nucleotide polymorphisms was determined. Analyses were done twice, first in the complete dataset of all ethnicities, and second including only samples defined as self-reported Whites. From the top 100 results, twenty single nucleotide polymorphisms had consistent results in both analyses (borderline p-values ranging from 1E-05 to 1E-6) and were selected to be tested in two independent datasets derived from 1,460 individuals from Porto Alegre, and 359 from Rio de Janeiro, Brazil. Meta-analyses of the Single nucleotide polymorphisms showing a trend for association in the independent dataset were performed. Results The rs1477403 marker located on 16q22.3 showed suggestive association in the discovery phase and in the Porto Alegre dataset (p = 0.05). The meta-analysis suggested the less common allele decreases the risk of chronic periodontitis. Conclusions Our data offer a clear hypothesis to be independently tested regarding the contribution of the 16q22.3 locus to chronic periodontitis. PMID:25008200

  5. A Genome-wide Pleiotropy Scan for Prostate Cancer Risk

    PubMed Central

    Panagiotou, Orestis A; Travis, Ruth C; Campa, Daniele; Berndt, Sonja I.; Lindstrom, Sara; Kraft, Peter; Schumacher, Fredrick R.; Siddiq, Afshan; Papatheodorou, Stefania I.; Stanford, Janet L.; Albanes, Demetrius; Virtamo, Jarmo; Weinstein, Stephanie J.; Diver, W. Ryan; Gapstur, Susan M.; Stevens, Victoria L.; Boeing, Heiner; Bueno-de-Mesquita, H. Bas; Gurrea, Aurelio Barricarte; Kaaks, Rudolf; Khaw, Kay-Tee; Krogh, Vittorio; Overvad, Kim; Riboli, Elio; Trichopoulos, Dimitrios; Giovannucci, Edward; Stampfer, Meir; Haiman, Christopher; Henderson, Brian; Le Marchand, Loic; Gaziano, J. Michael; Hunter, DavidJ.; Koutros, Stella; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Wacholder, Sholom; Key, Timothy J.; Tsilidis, Konstantinos K

    2014-01-01

    Background No single-nucleotide polymorphisms (SNPs) specific for aggressive prostate cancer have been identified in genome-wide association studies (GWAS). Objective To test if SNPs associated with other traits may also affect the risk of aggressive prostate cancer. Design, setting, and participants SNPs implicated in any phenotype other than prostate cancer (p ≤ 10−7) were identified through the catalog of published GWAS and tested in 2891 aggressive prostate cancer cases and 4592 controls from the Breast and Prostate Cancer Cohort Consortium (BPC3). The 40 most significant SNPs were followed up in 4872 aggressive prostate cancer cases and 24 534 controls from the Prostate Cancer Association Group to Investigate Cancer Associated Alterations in the Genome (PRACTICAL) consortium. Outcome measurements and statistical analysis Odds ratios (ORs) and 95% confidence intervals (CIs) for aggressive prostate cancer were estimated. Results and limitations A total of 4666 SNPs were evaluated by the BPC3. Two signals were seen in regions already reported for prostate cancer risk. rs7014346 at 8q24.21 was marginally associated with aggressive prostate cancer in the BPC3 trial (p = 1.6 × 10-6), whereas after meta-analysis by PRACTICAL the summary OR was 1.21 (95%CI 1.16–1.27; p = 3.22 × 10−18). rs9900242 at 17q24.3 was also marginally associated with aggressive disease in the meta-analysis (OR 0.90, 95% CI 0.86–0.94; p = 2.5 × 10−6). Neither of these SNPs remained statistically significant when conditioning on correlated known prostate cancer SNPs. The meta-analysis by BPC3 and PRACTICAL identified a third promising signal, marked by rs16844874 at 2q34, independent of known prostate cancer loci (OR 1.12,95% CI 1.06–1.19; p = 4.67 × 10−5); it has been shown that SNPs correlated with this signal affect glycine concentrations. The main limitation is the heterogeneity in the definition of aggressive prostate cancer between BPC3 and PRACTICAL. Conclusions We did

  6. Genome-wide scan revealed genetic loci for energy metabolism in Hispanic children and adolescents

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-wide scans were conducted in a search for genetic locations linked to energy expenditure and substrate oxidation in children. Pedigreed data of 1030 Hispanic children and adolescents were from the Viva La Familia Study, which was designed to investigate genetic and environmental risk factors ...

  7. Meta-analysis of genome-wide linkage scans for renal function traits

    PubMed Central

    Rao, Madhumathi; Mottl, Amy K.; Cole, Shelley A.; Umans, Jason G.; Freedman, Barry I.; Bowden, Donald W.; Langefeld, Carl D.; Fox, Caroline S.; Yang, Qiong; Cupples, Adrienne; Iyengar, Sudha K.; Hunt, Steven C.

    2012-01-01

    Background. Several genome scans have explored the linkage of chronic kidney disease phenotypes to chromosomic regions with disparate results. Genome scan meta-analysis (GSMA) is a quantitative method to synthesize linkage results from independent studies and assess their concordance. Methods. We searched PubMed to identify genome linkage analyses of renal function traits in humans, such as estimated glomerular filtration rate (GFR), albuminuria, serum creatinine concentration and creatinine clearance. We contacted authors for numerical data and extracted information from individual studies. We applied the GSMA nonparametric approach to combine results across 14 linkage studies for GFR, 11 linkage studies for albumin creatinine ratio, 11 linkage studies for serum creatinine and 4 linkage studies for creatinine clearance. Results. No chromosomal region reached genome-wide statistical significance in the main analysis which included all scans under each phenotype; however, regions on Chromosomes 7, 10 and 16 reached suggestive significance for linkage to two or more phenotypes. Subgroup analyses by disease status or ethnicity did not yield additional information. Conclusions. While heterogeneity across populations, methodologies and study designs likely explain this lack of agreement, it is possible that linkage scan methodologies lack the resolution for investigating complex traits. Combining family-based linkage studies with genome-wide association studies may be a powerful approach to detect private mutations contributing to complex renal phenotypes. PMID:21622988

  8. A Fast Implementation of a Scan Statistic for Identifying Chromosomal Patterns of Genome Wide Association Studies

    PubMed Central

    Sun, Yan V.; Jacobsen, Douglas M.; Turner, Stephen T.; Boerwinkle, Eric; Kardia, Sharon L.R.

    2009-01-01

    In order to take into account the complex genomic distribution of SNP variations when identifying chromosomal regions with significant SNP effects, a single nucleotide polymorphism (SNP) association scan statistic was developed. To address the computational needs of genome wide association (GWA) studies, a fast Java application, which combines single-locus SNP tests and a scan statistic for identifying chromosomal regions with significant clusters of significant SNP effects, was developed and implemented. To illustrate this application, SNP associations were analyzed in a pharmacogenomic study of the blood pressure lowering effect of thiazide-diuretics (N=195) using the Affymetrix Human Mapping 100K Set. 55,335 tagSNPs (pair-wise linkage disequilibrium R2<0.5) were selected to reduce the frequency correlation between SNPs. A typical workstation can complete the whole genome scan including 10,000 permutation tests within 3 hours. The most significant regions locate on chromosome 3, 6, 13 and 16, two of which contain candidate genes that may be involved in the underlying drug response mechanism. The computational performance of ChromoScan-GWA and its scalability were tested with up to 1,000,000 SNPs and up to 4,000 subjects. Using 10,000 permutations, the computation time grew linearly in these datasets. This scan statistic application provides a robust statistical and computational foundation for identifying genomic regions associated with disease and provides a method to compare GWA results even across different platforms. PMID:20161066

  9. Genome-wide scan of healthy human connectome discovers SPON1 gene variant influencing dementia severity

    PubMed Central

    Jahanshad, Neda; Rajagopalan, Priya; Hua, Xue; Hibar, Derrek P.; Nir, Talia M.; Toga, Arthur W.; Jack, Clifford R.; Saykin, Andrew J.; Green, Robert C.; Weiner, Michael W.; Medland, Sarah E.; Montgomery, Grant W.; Hansell, Narelle K.; McMahon, Katie L.; de Zubicaray, Greig I.; Martin, Nicholas G.; Wright, Margaret J.; Thompson, Paul M.; Weiner, Michael; Aisen, Paul; Weiner, Michael; Aisen, Paul; Petersen, Ronald; Jack, Clifford R.; Jagust, William; Trojanowski, John Q.; Toga, Arthur W.; Beckett, Laurel; Green, Robert C.; Saykin, Andrew J.; Morris, John; Liu, Enchi; Green, Robert C.; Montine, Tom; Petersen, Ronald; Aisen, Paul; Gamst, Anthony; Thomas, Ronald G.; Donohue, Michael; Walter, Sarah; Gessert, Devon; Sather, Tamie; Beckett, Laurel; Harvey, Danielle; Gamst, Anthony; Donohue, Michael; Kornak, John; Jack, Clifford R.; Dale, Anders; Bernstein, Matthew; Felmlee, Joel; Fox, Nick; Thompson, Paul; Schuff, Norbert; Alexander, Gene; DeCarli, Charles; Jagust, William; Bandy, Dan; Koeppe, Robert A.; Foster, Norm; Reiman, Eric M.; Chen, Kewei; Mathis, Chet; Morris, John; Cairns, Nigel J.; Taylor-Reinwald, Lisa; Trojanowki, J.Q.; Shaw, Les; Lee, Virginia M.Y.; Korecka, Magdalena; Toga, Arthur W.; Crawford, Karen; Neu, Scott; Saykin, Andrew J.; Foroud, Tatiana M.; Potkin, Steven; Shen, Li; Khachaturian, Zaven; Frank, Richard; Snyder, Peter J.; Molchan, Susan; Kaye, Jeffrey; Quinn, Joseph; Lind, Betty; Dolen, Sara; Schneider, Lon S.; Pawluczyk, Sonia; Spann, Bryan M.; Brewer, James; Vanderswag, Helen; Heidebrink, Judith L.; Lord, Joanne L.; Petersen, Ronald; Johnson, Kris; Doody, Rachelle S.; Villanueva-Meyer, Javier; Chowdhury, Munir; Stern, Yaakov; Honig, Lawrence S.; Bell, Karen L.; Morris, John C.; Ances, Beau; Carroll, Maria; Leon, Sue; Mintun, Mark A.; Schneider, Stacy; Marson, Daniel; Griffith, Randall; Clark, David; Grossman, Hillel; Mitsis, Effie; Romirowsky, Aliza; deToledo-Morrell, Leyla; Shah, Raj C.; Duara, Ranjan; Varon, Daniel; Roberts, Peggy; Albert, Marilyn; Onyike, Chiadi; Kielb, Stephanie; Rusinek, Henry; de Leon, Mony J.; Glodzik, Lidia; De Santi, Susan; Doraiswamy, P. Murali; Petrella, Jeffrey R.; Coleman, R. Edward; Arnold, Steven E.; Karlawish, Jason H.; Wolk, David; Smith, Charles D.; Jicha, Greg; Hardy, Peter; Lopez, Oscar L.; Oakley, MaryAnn; Simpson, Donna M.; Porsteinsson, Anton P.; Goldstein, Bonnie S.; Martin, Kim; Makino, Kelly M.; Ismail, M. Saleem; Brand, Connie; Mulnard, Ruth A.; Thai, Gaby; Mc-Adams-Ortiz, Catherine; Womack, Kyle; Mathews, Dana; Quiceno, Mary; Diaz-Arrastia, Ramon; King, Richard; Weiner, Myron; Martin-Cook, Kristen; DeVous, Michael; Levey, Allan I.; Lah, James J.; Cellar, Janet S.; Burns, Jeffrey M.; Anderson, Heather S.; Swerdlow, Russell H.; Apostolova, Liana; Lu, Po H.; Bartzokis, George; Silverman, Daniel H.S.; Graff-Radford, Neill R.; Parfitt, Francine; Johnson, Heather; Farlow, Martin R.; Hake, Ann Marie; Matthews, Brandy R.; Herring, Scott; van Dyck, Christopher H.; Carson, Richard E.; MacAvoy, Martha G.; Chertkow, Howard; Bergman, Howard; Hosein, Chris; Black, Sandra; Stefanovic, Bojana; Caldwell, Curtis; Hsiung, Ging-Yuek Robin; Feldman, Howard; Mudge, Benita; Assaly, Michele; Kertesz, Andrew; Rogers, John; Trost, Dick; Bernick, Charles; Munic, Donna; Kerwin, Diana; Mesulam, Marek-Marsel; Lipowski, Kristina; Wu, Chuang-Kuo; Johnson, Nancy; Sadowsky, Carl; Martinez, Walter; Villena, Teresa; Turner, Raymond Scott; Johnson, Kathleen; Reynolds, Brigid; Sperling, Reisa A.; Johnson, Keith A.; Marshall, Gad; Frey, Meghan; Yesavage, Jerome; Taylor, Joy L.; Lane, Barton; Rosen, Allyson; Tinklenberg, Jared; Sabbagh, Marwan; Belden, Christine; Jacobson, Sandra; Kowall, Neil; Killiany, Ronald; Budson, Andrew E.; Norbash, Alexander; Johnson, Patricia Lynn; Obisesan, Thomas O.; Wolday, Saba; Bwayo, Salome K.; Lerner, Alan; Hudson, Leon; Ogrocki, Paula; Fletcher, Evan; Carmichael, Owen; Olichney, John; DeCarli, Charles; Kittur, Smita; Borrie, Michael; Lee, T.-Y.; Bartha, Rob; Johnson, Sterling; Asthana, Sanjay; Carlsson, Cynthia M.; Potkin, Steven G.; Preda, Adrian; Nguyen, Dana; Tariot, Pierre; Fleisher, Adam; Reeder, Stephanie; Bates, Vernice; Capote, Horacio; Rainka, Michelle; Scharre, Douglas W.; Kataki, Maria; Zimmerman, Earl A.; Celmins, Dzintra; Brown, Alice D.; Pearlson, Godfrey D.; Blank, Karen; Anderson, Karen; Saykin, Andrew J.; Santulli, Robert B.; Schwartz, Eben S.; Sink, Kaycee M.; Williamson, Jeff D.; Garg, Pradeep; Watkins, Franklin; Ott, Brian R.; Querfurth, Henry; Tremont, Geoffrey; Salloway, Stephen; Malloy, Paul; Correia, Stephen; Rosen, Howard J.; Miller, Bruce L.; Mintzer, Jacobo; Longmire, Crystal Flynn; Spicer, Kenneth; Finger, Elizabeth; Rachinsky, Irina; Rogers, John; Kertesz, Andrew; Drost, Dick

    2013-01-01

    Aberrant connectivity is implicated in many neurological and psychiatric disorders, including Alzheimer’s disease and schizophrenia. However, other than a few disease-associated candidate genes, we know little about the degree to which genetics play a role in the brain networks; we know even less about specific genes that influence brain connections. Twin and family-based studies can generate estimates of overall genetic influences on a trait, but genome-wide association scans (GWASs) can screen the genome for specific variants influencing the brain or risk for disease. To identify the heritability of various brain connections, we scanned healthy young adult twins with high-field, high-angular resolution diffusion MRI. We adapted GWASs to screen the brain’s connectivity pattern, allowing us to discover genetic variants that affect the human brain’s wiring. The association of connectivity with the SPON1 variant at rs2618516 on chromosome 11 (11p15.2) reached connectome-wide, genome-wide significance after stringent statistical corrections were enforced, and it was replicated in an independent subsample. rs2618516 was shown to affect brain structure in an elderly population with varying degrees of dementia. Older people who carried the connectivity variant had significantly milder clinical dementia scores and lower risk of Alzheimer’s disease. As a posthoc analysis, we conducted GWASs on several organizational and topological network measures derived from the matrices to discover variants in and around genes associated with autism (MACROD2), development (NEDD4), and mental retardation (UBE2A) significantly associated with connectivity. Connectome-wide, genome-wide screening offers substantial promise to discover genes affecting brain connectivity and risk for brain diseases. PMID:23471985

  10. Genome-Wide Association Scan for Variants Associated with Early-Onset Prostate Cancer

    PubMed Central

    Lange, Ethan M.; Johnson, Anna M.; Wang, Yunfei; Zuhlke, Kimberly A.; Lu, Yurong; Ribado, Jessica V.; Keele, Gregory R.; Li, Jin; Duan, Qing; Li, Ge; Gao, Zhengrong; Li, Yun; Xu, Jianfeng; Isaacs, William B.; Zheng, Siqun; Cooney, Kathleen A.

    2014-01-01

    Prostate cancer is the most common non-skin cancer and the second leading cause of cancer related mortality for men in the United States. There is strong empirical and epidemiological evidence supporting a stronger role of genetics in early-onset prostate cancer. We performed a genome-wide association scan for early-onset prostate cancer. Novel aspects of this study include the focus on early-onset disease (defined as men with prostate cancer diagnosed before age 56 years) and use of publically available control genotype data from previous genome-wide association studies. We found genome-wide significant (p<5×10−8) evidence for variants at 8q24 and 11p15 and strong supportive evidence for a number of previously reported loci. We found little evidence for individual or systematic inflated association findings resulting from using public controls, demonstrating the utility of using public control data in large-scale genetic association studies of common variants. Taken together, these results demonstrate the importance of established common genetic variants for early-onset prostate cancer and the power of including early-onset prostate cancer cases in genetic association studies. PMID:24740154

  11. Genome-wide scans of genetic variants for psychophysiological endophenotypes: a methodological overview.

    PubMed

    Iacono, William G; Malone, Stephen M; Vaidyanathan, Uma; Vrieze, Scott I

    2014-12-01

    This article provides an introductory overview of the investigative strategy employed to evaluate the genetic basis of 17 endophenotypes examined as part of a 20-year data collection effort from the Minnesota Center for Twin and Family Research. Included are characterization of the study samples, descriptive statistics for key properties of the psychophysiological measures, and rationale behind the steps taken in the molecular genetic study design. The statistical approach included (a) biometric analysis of twin and family data, (b) heritability analysis using 527,829 single nucleotide polymorphisms (SNPs), (c) genome-wide association analysis of these SNPs and 17,601 autosomal genes, (d) follow-up analyses of candidate SNPs and genes hypothesized to have an association with each endophenotype, (e) rare variant analysis of nonsynonymous SNPs in the exome, and (f) whole genome sequencing association analysis using 27 million genetic variants. These methods were used in the accompanying empirical articles comprising this special issue, Genome-Wide Scans of Genetic Variants for Psychophysiological Endophenotypes. PMID:25387703

  12. Ion Torrent sequencing for conducting genome-wide scans for mutation mapping analysis.

    PubMed

    Damerla, Rama Rao; Chatterjee, Bishwanath; Li, You; Francis, Richard J B; Fatakia, Sarosh N; Lo, Cecilia W

    2014-04-01

    Mutation mapping in mice can be readily accomplished by genome wide segregation analysis of polymorphic DNA markers. In this study, we showed the efficacy of Ion Torrent next generation sequencing for conducting genome-wide scans to map and identify a mutation causing congenital heart disease in a mouse mutant, Bishu, recovered from a mouse mutagenesis screen. The Bishu mutant line generated in a C57BL/6J (B6) background was intercrossed with another inbred strain, C57BL/10J (B10), and the resulting B6/B10 hybrid offspring were intercrossed to generate mutants used for the mapping analysis. For each mutant sample, a panel of 123 B6/B10 polymorphic SNPs distributed throughout the mouse genome was PCR amplified, bar coded, and then pooled to generate a single library used for Ion Torrent sequencing. Sequencing carried out using the 314 chip yielded >600,000 usable reads. These were aligned and mapped using a custom bioinformatics pipeline. Each SNP was sequenced to a depth >500×, allowing accurate automated calling of the B6/B10 genotypes. This analysis mapped the mutation in Bishu to an interval on the proximal region of mouse chromosome 4. This was confirmed by parallel capillary sequencing of the 123 polymorphic SNPs. Further analysis of genes in the map interval identified a splicing mutation in Dnaic1(c.204+1G>A), an intermediate chain dynein, as the disease causing mutation in Bishu. Overall, our experience shows Ion Torrent amplicon sequencing is high throughput and cost effective for conducting genome-wide mapping analysis and is easily scalable for other high volume genotyping analyses. PMID:24306492

  13. Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates.

    PubMed

    Gautier, Mathieu

    2015-12-01

    In population genomics studies, accounting for the neutral covariance structure across population allele frequencies is critical to improve the robustness of genome-wide scan approaches. Elaborating on the BayEnv model, this study investigates several modeling extensions (i) to improve the estimation accuracy of the population covariance matrix and all the related measures, (ii) to identify significantly overly differentiated SNPs based on a calibration procedure of the XtX statistics, and (iii) to consider alternative covariate models for analyses of association with population-specific covariables. In particular, the auxiliary variable model allows one to deal with multiple testing issues and, providing the relative marker positions are available, to capture some linkage disequilibrium information. A comprehensive simulation study was carried out to evaluate the performances of these different models. Also, when compared in terms of power, robustness, and computational efficiency to five other state-of-the-art genome-scan methods (BayEnv2, BayScEnv, BayScan, flk, and lfmm), the proposed approaches proved highly effective. For illustration purposes, genotyping data on 18 French cattle breeds were analyzed, leading to the identification of 13 strong signatures of selection. Among these, four (surrounding the KITLG, KIT, EDN3, and ALB genes) contained SNPs strongly associated with the piebald coloration pattern while a fifth (surrounding PLAG1) could be associated to morphological differences across the populations. Finally, analysis of Pool-Seq data from 12 populations of Littorina saxatilis living in two different ecotypes illustrates how the proposed framework might help in addressing relevant ecological issues in nonmodel species. Overall, the proposed methods define a robust Bayesian framework to characterize adaptive genetic differentiation across populations. The BayPass program implementing the different models is available at http://www1.montpellier

  14. A Genome-wide Scan for Selective Sweeps in Racing Horses

    PubMed Central

    Moon, Sunjin; Lee, Jin Woo; Shin, Donghyun; Shin, Kwang-Yun; Kim, Jun; Choi, Ik-Young; Kim, Jaemin; Kim, Heebal

    2015-01-01

    Using next-generation sequencing, we conducted a genome-wide scan of selective sweeps associated with selection toward genetic improvement in Thoroughbreds. We investigated potential phenotypic consequence of putative candidate loci by candidate gene association mapping for the finishing time in 240 Thoroughbred horses. We found a significant association with the trait for Ral GApase alpha 2 (RALGAP2) that regulates a variety of cellular processes of signal trafficking. Neighboring genes around RALGAP2 included insulinoma-associated 1 (INSM1), pallid (PLDN), and Ras and Rab interactor 2 (RIN2) genes have similar roles in signal trafficking, suggesting that a co-evolving gene cluster located on the chromosome 22 is under strong artificial selection in racehorses. PMID:26333666

  15. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    ERIC Educational Resources Information Center

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2010-01-01

    Objective: Genes likely play a substantial role in the etiology of attention-deficit/hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies (GWAS) have not identified a genome-wide significant association. We have conducted a third, independent, multisite GWAS of…

  16. The genetic architecture of seed composition in soybean is refined by genome-wide association scans across multiple populations

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean oil and meal are major contributors to world-wide food production. Consequently, the genetic basis for soybean seed composition has been intensely studied using family-based mapping. Population-based mapping approaches, in the form of genome-wide association (GWA) scans, have been able to re...

  17. A Genome Wide Linkage Scan for Cleft Lip and Palate and Dental Anomalies

    PubMed Central

    Vieira, Alexandre R.; McHenry, Toby G.; Daack-Hirsch, Sandra; Murray, Jeffrey C.; Marazita, Mary L.

    2008-01-01

    We revisited 46 families with two or more siblings affected with an orofacial cleft that participated in previous genome wide studies and collected complete dental information. Genotypes from 392 microsatellite markers at 10 cM intervals were reanalyzed. We carried out four sets of genome wide analyses. First, we ran the analysis solely on the cleft status. Second, we assigned to any dental anomaly (tooth agenesis, supernumerary teeth, and microdontia) an affection status, and repeated the analysis. Third, we ran only the 19 families where the proband had a cleft with no dental anomalies. Finally, we ran only the 27 families that had a proband with cleft and additional dental anomalies outside the cleft area. Chromosomes (1, 2, 6, 8, 16, and 19) presented regions with LOD scores >2.0. Chromosome 19 has the most compelling results in our study. The LOD scores increased from 3.11 (in the scan of all 46 families with clefts as the only assigned affection status) to 3.91 when the 19 families whose probands present with no additional dental anomalies were studied, suggesting the interval 19p13.12-19q12 may contain a gene that contributes to clefts but not to dental anomalies. On the other hand, we found a LOD score of 3.00 in the 2q22.3 region when dental anomalies data were added to the analysis to define affection status. Our preliminary results support the hypothesis that some loci may contribute to both clefts and congenital dental anomalies. Also, adding dental anomalies information will provide new opportunities to map susceptibility loci for clefts. PMID:18442096

  18. Genome-wide Scanning and Characterization of Sorghum bicolor L. Heat Shock Transcription Factors.

    PubMed

    Nagaraju, M; Reddy, Palakolanu Sudhakar; Kumar, S Anil; Srivastava, Rakesh K; Kishor, P B Kavi; Rao, D Manohar

    2015-08-01

    A genome-wide scanning of Sorghum bicolor resulted in the identification of 25 SbHsf genes. Phylogenetic analysis shows the ortholog genes that are clustered with only rice, representing a common ancestor. Promoter analysis revealed the identification of different cis-acting elements that are responsible for abiotic as well as biotic stresses. Hsf domains like DBD, NLS, NES, and AHA have been analyzed for their sequence similarity and functional characterization. Tissue specific expression patterns of Hsfs in different tissues like mature embryo, seedling, root, and panicle were studied using real-time PCR. While Hsfs4 and 22 are highly expressed in panicle, 4 and 9 are expressed in seedlings. Sorghum plants were exposed to different abiotic stress treatments but no expression of any Hsf was observed when seedlings were treated with ABA. High level expression of Hsf1 was noticed during high temperature as well as cold stresses, 4 and 6 during salt and 5, 6, 10, 13, 19, 23 and 25 during drought stress. This comprehensive analysis of SbHsf genes will provide an insight on how these genes are regulated in different tissues and also under different abiotic stresses and help to determine the functions of Hsfs during drought and temperature stress tolerance. PMID:27006630

  19. A genome-wide linkage scan for iron phenotype quantitative trait loci: the HEIRS Family Study.

    PubMed

    Acton, R T; Snively, B M; Barton, J C; McLaren, C E; Adams, P C; Rich, S S; Eckfeldt, J H; Press, R D; Sholinsky, P; Leiendecker-Foster, C; McLaren, G D; Speechley, M R; Harris, E L; Dawkins, F W; Gordeuk, V R

    2007-06-01

    Iron overload phenotypes in persons with and without hemochromatosis are variable. To investigate this further, probands with hemochromatosis or evidence of elevated iron stores and their family members were recruited for a genome-wide linkage scan to identify potential quantitative trait loci (QTL) that contribute to variation in transferrin saturation (TS), unsaturated iron-binding capacity (UIBC), and serum ferritin (SF). Genotyping utilized 402 microsatellite markers with average spacing of 9 cM. A total of 943 individuals, 64% Caucasian, were evaluated from 174 families. After adjusting for age, gender, and race/ethnicity, there was evidence for linkage of UIBC to chromosome 4q logarithm of the odds (LOD) = 2.08, p = 0.001) and of UIBC (LOD = 9.52), TS (LOD = 4.78), and SF (LOD = 2.75) to the chromosome 6p region containing HFE (each p < 0.0001). After adjustments for HFE genotype and other covariates, there was evidence of linkage of SF to chromosome 16p (LOD = 2.63, p = 0.0007) and of UIBC to chromosome 5q (LOD = 2.12, p = 0.002) and to chromosome 17q (LOD = 2.19, p = 0.002). We conclude that these regions should be considered for fine mapping studies to identify QTL that contribute to variation in SF and UIBC. PMID:17539901

  20. Genome-wide association scan for five major dimensions of personality

    PubMed Central

    Terracciano, Antonio; Sanna, Serena; Uda, Manuela; Deiana, Barbara; Usala, Gianluca; Busonero, Fabio; Maschio, Andrea; Scally, Matthew; Patriciu, Nicholas; Chen, Wei-Min; Distel, Marijn A; Slagboom, Eline P; Boomsma, Dorret I; Villafuerte, Sandra; Śliwerska, Elżbieta; Burmeister, Margit; Amin, Najaf; Janssens, A. Cecile J.W.; van Duijn, Cornelia M.; Schlessinger, David; Abecasis, Gonçalo R.; Costa, Paul T.

    2008-01-01

    Personality traits are summarized by five broad dimensions with pervasive influences on major life outcomes, strong links to psychiatric disorders, and clear heritable components. To identify genetic variants associated with each of the five dimensions of personality we performed a genome wide association (GWA) scan of 3,972 individuals from a genetically isolated population within Sardinia, Italy. Based on analyses of 362,129 single nucleotide polymorphisms (SNPs) we found several strong signals within or near genes previously implicated in psychiatric disorders. They include the association of Neuroticism with SNAP25 (rs362584, P = 5 × 10−5), Extraversion with BDNF and two cadherin genes (CDH13 and CDH23; Ps < 5 × 10−5), Openness with CNTNAP2 (rs10251794, P = 3 × 10−5), Agreeableness with CLOCK (rs6832769, P = 9 × 10−6), and Conscientiousness with DYRK1A (rs2835731, P = 3 × 10−5). Effect sizes were small (less than 1% of variance), and most failed to replicate in the follow-up independent samples (N up to 3,903), though the association between Agreeableness and CLOCK was supported in two of three replication samples (overall P = 2 × 10−5). We infer that a large number of loci may influence personality traits and disorders, requiring larger sample sizes for the GWA approach to identify significant genetic variants. PMID:18957941

  1. Genome-wide Scanning and Characterization of Sorghum bicolor L. Heat Shock Transcription Factors

    PubMed Central

    Nagaraju, M.; Reddy, Palakolanu Sudhakar; Kumar, S. Anil; Srivastava, Rakesh K.; Kishor, P. B. Kavi; Rao, D. Manohar

    2015-01-01

    A genome-wide scanning of Sorghum bicolor resulted in the identification of 25 SbHsf genes. Phylogenetic analysis shows the ortholog genes that are clustered with only rice, representing a common ancestor. Promoter analysis revealed the identification of different cis-acting elements that are responsible for abiotic as well as biotic stresses. Hsf domains like DBD, NLS, NES, and AHA have been analyzed for their sequence similarity and functional characterization. Tissue specific expression patterns of Hsfs in different tissues like mature embryo, seedling, root, and panicle were studied using real-time PCR. While Hsfs4 and 22 are highly expressed in panicle, 4 and 9 are expressed in seedlings. Sorghum plants were exposed to different abiotic stress treatments but no expression of any Hsf was observed when seedlings were treated with ABA. High level expression of Hsf1 was noticed during high temperature as well as cold stresses, 4 and 6 during salt and 5, 6, 10, 13, 19, 23 and 25 during drought stress. This comprehensive analysis of SbHsf genes will provide an insight on how these genes are regulated in different tissues and also under different abiotic stresses and help to determine the functions of Hsfs during drought and temperature stress tolerance. PMID:27006630

  2. Extension of Type 2 Diabetes Genome-Wide Association Scan Results in the Diabetes Prevention Program

    PubMed Central

    Moore, Allan F.; Jablonski, Kathleen A.; McAteer, Jarred B.; Saxena, Richa; Pollin, Toni I.; Franks, Paul W.; Hanson, Robert L.; Shuldiner, Alan R.; Knowler, William C.; Altshuler, David; Florez, Jose C.

    2008-01-01

    OBJECTIVE— Genome-wide association scans (GWASs) have identified novel diabetes-associated genes. We evaluated how these variants impact diabetes incidence, quantitative glycemic traits, and response to preventive interventions in 3,548 subjects at high risk of type 2 diabetes enrolled in the Diabetes Prevention Program (DPP), which examined the effects of lifestyle intervention, metformin, and troglitazone versus placebo. RESEARCH DESIGN AND METHODS— We genotyped selected single nucleotide polymorphisms (SNPs) in or near diabetes-associated loci, including EXT2, CDKAL1, CDKN2A/B, IGF2BP2, HHEX, LOC387761, and SLC30A8 in DPP participants and performed Cox regression analyses using genotype, intervention, and their interactions as predictors of diabetes incidence. We evaluated their effect on insulin resistance and secretion at 1 year. RESULTS— None of the selected SNPs were associated with increased diabetes incidence in this population. After adjustments for ethnicity, baseline insulin secretion was lower in subjects with the risk genotype at HHEX rs1111875 (P = 0.01); there were no significant differences in baseline insulin sensitivity. Both at baseline and at 1 year, subjects with the risk genotype at LOC387761 had paradoxically increased insulin secretion; adjustment for self-reported ethnicity abolished these differences. In ethnicity-adjusted analyses, we noted a nominal differential improvement in β-cell function for carriers of the protective genotype at CDKN2A/B after 1 year of troglitazone treatment (P = 0.01) and possibly lifestyle modification (P = 0.05). CONCLUSIONS— We were unable to replicate the GWAS findings regarding diabetes risk in the DPP. We did observe genotype associations with differences in baseline insulin secretion at the HHEX locus and a possible pharmacogenetic interaction at CDKNA2/B. PMID:18544707

  3. Genome-wide linkage scan of quantitative traits representing symptom dimensions in multiplex schizophrenia families.

    PubMed

    Ryu, Seunghyong; Won, Hong-Hee; Oh, Sohee; Kim, Jong-Won; Park, Taesung; Cho, Eun-Young; Cho, Youngah; Park, Dong Yeon; Lee, Yu-Sang; Kwon, Jun Soo; Hong, Kyung Sue

    2013-12-30

    Symptom dimensions of schizophrenia are likely to be the intermediate phenotypes under the control of disease-susceptibility genes, or separate traits related to disease-modifier genes. This study aimed to identify chromosomal loci linked to symptom dimensions of schizophrenia through genome-wide quantitative trait locus (QTL) linkage analysis. The study subjects consisted of 56 families with 183 members including 123 affected individuals. Symptom evaluations were performed on lifetime basis. Through principal component factor analysis, eight quantitative phenotypes representing symptom dimensions were identified. Genotyping was done for 6008 SNP markers, and genome-wide QTL linkage analysis was performed. No symptom dimension showed a significant linkage attaining genome-wide empirical thresholds. We observed seven regions yielding linkage signals attaining genome-wide empirical thresholds for suggestive linkage (NPL Z score = 2.78-3.49); chromosome 15q26.1 for 'non-paranoid delusion factor', 2p24.3 and 7q31.1 for 'prodromal impairment factor', 1q32.1, 9p21.3, and 9q31.2 for 'negative symptom factor', and 10p13 for 'disorganization factor'. Among these loci, chromosome 2p24.3 and 1q32.1 overlap with susceptibility loci of schizophrenia identified in our previous linkage studies. This study suggests the existence of genetic loci related to various clinical features of schizophrenia. Further genetic analyses for these dimensional phenotypes are warranted. PMID:24035701

  4. Genome Wide Scan for Loci influencing Warner Bratzler Shear Force in Five Bos taurus Breeds

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genetic tests for beef tenderness are currently limited to single nucleotide polymorphisms (SNPs) within µ-calpain (CAPN1) and calpastatin (CAST) and explain little of the phenotypic variation in Warner-Bratzler shear force (WBSF). We performed a genome-wide association study for WBSF by genotyping...

  5. A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM

    PubMed Central

    Weiss, Lauren A.; Arking, Dan E.

    2009-01-01

    Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673

  6. A twin study of breastfeeding with a preliminary genome wide association scan

    PubMed Central

    Colodro-Conde, L.; Zhu, G.; Power, R. A.; Henders, A.; Heath, A.C.; Madden, P.A.F.; Montgomery, G.W.; Medland, S. E.; Ordoñana, J.R.; Martin, N.G.

    2015-01-01

    Breastfeeding has been an important survival trait during human history, though it has long been recognised that individuals differ in their exact breastfeeding behaviour. Here our aims were, first, to explore to what extent genetic and environmental influences contributed to the individual differences in breastfeeding behaviour; second, to detect possible genetic variants related to breastfeeding; and lastly, to test if the genetic variants associated with breastfeeding have been previously found to be related with breast size. Data were collected from a large community-based cohort of Australian twins, with 3,364 women for the twin modelling analyses and 1,521 of them included in the genome wide association study. Monozygotic twin correlations (rMZ = .52, 95% CI .46 – .57) were larger than dizygotic twin correlations (rDZ = .35, 95% CI .25 – .43) and the best-fitting model was the one composed by additive genetics and unique environmental factors, explaining 53% and 47% of the variance in breastfeeding behaviour, respectively. No breastfeeding-related genetic variants reached genome-wide significance. The polygenic risk score analyses showed no significant results, suggesting breast size does not influence breastfeeding. This study confers a replication of a previous one exploring the sources of variance of breastfeeding and, to our knowledge, is the first one to conduct a Genome-Wide Association Study on breastfeeding and look at the overlap with variants for breast size. PMID:25475840

  7. Siblings with Ischemic Stroke Study (SWISS): Results of a Genome-wide Scan for Stroke Loci

    PubMed Central

    Meschia, James F.; Nalls, Michael; Matarin, Mar; Brott, Thomas G.; Brown, Robert D.; Hardy, John; Kissela, Brett; Rich, Stephen S.; Singleton, Andrew; Hernandez, Dena; Ferrucci, Luigi; Pearce, Kerra; Keller, Margaret; Worrall, Bradford B.

    2011-01-01

    Background and Purpose Ischemic stroke has a strong familial component to risk. The Siblings with Ischemic Stroke Study (SWISS) is a genome-wide family-based analysis that included use of imputed genotypes. SWISS was conducted to examine associations between SNPs and risk of stroke and stroke subtypes within pairs. Methods SWISS enrolled 312 probands with ischemic stroke across 70 US and Canadian centers. Affected siblings were ascertained by centers and confirmed by central record review; unaffected siblings were ascertained by telephone contact. Ischemic stroke was subtyped using TOAST criteria. Genotyping was performed using an Illumina 610 quad array (probands) and an Illumina linkage V array (affected siblings). SNPs were imputed using 1000 Genomes Project data and MACH software. Family-based association analyses were conducted using the sibling-transmission disequilibrium test. Results For all pairs, the correlation of age at stroke within pairs of affected siblings was r = 0.83 (95%CI, 0.78 to 0.86; P < 2.2×10−16). The correlation did not differ substantially by subtype. The concordance of stroke subtypes among affected pairs was 33.8% (kappa = 0.13; P = 5.06×10−4) and did not differ by age at stroke in the proband. Although no SNP achieved genome-wide significance for risk of ischemic stroke, there was clustering of the most associated SNPs on chromosomes 3p (NOS1) and 6p. Conclusions Stroke subtype and age at stroke in affected sibling pairs exhibit significant clustering. No individual SNP reached genome-wide significance. However, two promising candidate loci were identified, including one that contains NOS1, though these risk loci warrant further examination in larger sample collections. PMID:21940970

  8. Genome-Wide Scans for Delineation of Candidate Genes Regulating Seed-Protein Content in Chickpea

    PubMed Central

    Upadhyaya, Hari D.; Bajaj, Deepak; Narnoliya, Laxmi; Das, Shouvik; Kumar, Vinod; Gowda, C. L. L.; Sharma, Shivali; Tyagi, Akhilesh K.; Parida, Swarup K.

    2016-01-01

    Identification of potential genes/alleles governing complex seed-protein content (SPC) is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study), high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism) discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150–200 kb LD (linkage disequilibrium) decay] was utilized. This led to identification of seven most effective genomic loci (genes) associated [10–20% with 41% combined PVE (phenotypic variation explained)] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line) mapping population (ICC 12299 × ICC 4958) by selective genotyping. The seed-specific expression, including differential up-regulation (>four fold) of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with a high level of contrasting SPC (21–22%) was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait was found to be the most

  9. Genome-Wide Scans for Delineation of Candidate Genes Regulating Seed-Protein Content in Chickpea.

    PubMed

    Upadhyaya, Hari D; Bajaj, Deepak; Narnoliya, Laxmi; Das, Shouvik; Kumar, Vinod; Gowda, C L L; Sharma, Shivali; Tyagi, Akhilesh K; Parida, Swarup K

    2016-01-01

    Identification of potential genes/alleles governing complex seed-protein content (SPC) is essential in marker-assisted breeding for quality trait improvement of chickpea. Henceforth, the present study utilized an integrated genomics-assisted breeding strategy encompassing trait association analysis, selective genotyping in traditional bi-parental mapping population and differential expression profiling for the first-time to understand the complex genetic architecture of quantitative SPC trait in chickpea. For GWAS (genome-wide association study), high-throughput genotyping information of 16376 genome-based SNPs (single nucleotide polymorphism) discovered from a structured population of 336 sequenced desi and kabuli accessions [with 150-200 kb LD (linkage disequilibrium) decay] was utilized. This led to identification of seven most effective genomic loci (genes) associated [10-20% with 41% combined PVE (phenotypic variation explained)] with SPC trait in chickpea. Regardless of the diverse desi and kabuli genetic backgrounds, a comparable level of association potential of the identified seven genomic loci with SPC trait was observed. Five SPC-associated genes were validated successfully in parental accessions and homozygous individuals of an intra-specific desi RIL (recombinant inbred line) mapping population (ICC 12299 × ICC 4958) by selective genotyping. The seed-specific expression, including differential up-regulation (>four fold) of six SPC-associated genes particularly in accessions, parents and homozygous individuals of the aforementioned mapping population with a high level of contrasting SPC (21-22%) was evident. Collectively, the integrated genomic approach delineated diverse naturally occurring novel functional SNP allelic variants in six potential candidate genes regulating SPC trait in chickpea. Of these, a non-synonymous SNP allele-carrying zinc finger transcription factor gene exhibiting strong association with SPC trait was found to be the most

  10. A genome-wide association scan in admixed Latin Americans identifies loci influencing facial and scalp hair features

    PubMed Central

    Adhikari, Kaustubh; Fontanil, Tania; Cal, Santiago; Mendoza-Revilla, Javier; Fuentes-Guajardo, Macarena; Chacón-Duque, Juan-Camilo; Al-Saadi, Farah; Johansson, Jeanette A.; Quinto-Sanchez, Mirsha; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Barquera Lozano, Rodrigo; Macín Pérez, Gastón; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C.; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M.; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Rothhammer, Francisco; Bedoya, Gabriel; Gonzalez-José, Rolando; Headon, Denis; López-Otín, Carlos; Tobin, Desmond J.; Balding, David; Ruiz-Linares, Andrés

    2016-01-01

    We report a genome-wide association scan in over 6,000 Latin Americans for features of scalp hair (shape, colour, greying, balding) and facial hair (beard thickness, monobrow, eyebrow thickness). We found 18 signals of association reaching genome-wide significance (P values 5 × 10−8 to 3 × 10−119), including 10 novel associations. These include novel loci for scalp hair shape and balding, and the first reported loci for hair greying, monobrow, eyebrow and beard thickness. A newly identified locus influencing hair shape includes a Q30R substitution in the Protease Serine S1 family member 53 (PRSS53). We demonstrate that this enzyme is highly expressed in the hair follicle, especially the inner root sheath, and that the Q30R substitution affects enzyme processing and secretion. The genome regions associated with hair features are enriched for signals of selection, consistent with proposals regarding the evolution of human hair. PMID:26926045

  11. A genome-wide association scan in admixed Latin Americans identifies loci influencing facial and scalp hair features.

    PubMed

    Adhikari, Kaustubh; Fontanil, Tania; Cal, Santiago; Mendoza-Revilla, Javier; Fuentes-Guajardo, Macarena; Chacón-Duque, Juan-Camilo; Al-Saadi, Farah; Johansson, Jeanette A; Quinto-Sanchez, Mirsha; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Barquera Lozano, Rodrigo; Macín Pérez, Gastón; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Rothhammer, Francisco; Bedoya, Gabriel; Gonzalez-José, Rolando; Headon, Denis; López-Otín, Carlos; Tobin, Desmond J; Balding, David; Ruiz-Linares, Andrés

    2016-01-01

    We report a genome-wide association scan in over 6,000 Latin Americans for features of scalp hair (shape, colour, greying, balding) and facial hair (beard thickness, monobrow, eyebrow thickness). We found 18 signals of association reaching genome-wide significance (P values 5 × 10(-8) to 3 × 10(-119)), including 10 novel associations. These include novel loci for scalp hair shape and balding, and the first reported loci for hair greying, monobrow, eyebrow and beard thickness. A newly identified locus influencing hair shape includes a Q30R substitution in the Protease Serine S1 family member 53 (PRSS53). We demonstrate that this enzyme is highly expressed in the hair follicle, especially the inner root sheath, and that the Q30R substitution affects enzyme processing and secretion. The genome regions associated with hair features are enriched for signals of selection, consistent with proposals regarding the evolution of human hair. PMID:26926045

  12. Family-Based Genome-Wide Association Scan of Attention-Deficit/Hyperactivity Disorder

    PubMed Central

    Mick, Eric; Todorov, Alexandre; Smalley, Susan; Hu, Xiaolan; Loo, Sandra; Todd, Richard D.; Biederman, Joseph; Byrne, Deirdre; Dechairo, Bryan; Guiney, Allan; McCracken, James; McGough, James; Nelson, Stanley F.; Reiersen, Angela M.; Wilens, Timothy E.; Wozniak, Janet; Neale, Benjamin M.; Faraone, Stephen V.

    2013-01-01

    Objective . Genes likely play a substantial role in the etiology of attention-deficit hyperactivity disorder (ADHD). However, the genetic architecture of the disorder is unknown, and prior genome-wide association studies have not identified a genome-wide significant association. We have conducted a third, independent multi-site GWAS of DSM-IV-TR ADHD. Method . Families were ascertained at Massachusetts General Hospital (MGH, N=309 trios), Washington University at St Louis (WASH-U, N=272 trios), and University of California at Los Angeles (UCLA, N=156 trios). Genotyping was conducted with the Illumina Human1M or Human1M-Duo BeadChip platforms. After applying quality control filters, association with ADHD was tested with 835,136 SNPs in 735 DSM-IV ADHD trios from 732 families. Results . Our smallest p-value (6.7E-07) did not reach the threshold for genome-wide statistical significance (5.0E-08) but one of the 20 most significant associations was located in a candidate gene of interest for ADHD, (SLC9A9, rs9810857, p=6.4E-6). We also conducted gene-based tests of candidate genes identified in the literature and found additional evidence of association with SLC9A9. Conclusion . We and our colleagues in the Psychiatric GWAS Consortium are working to pool together GWAS samples to establish the large data sets needed to follow-up on these results and to identify genes for ADHD and other disorders. PMID:20732626

  13. Genome-wide scans provide evidence for positive selection of genes implicated in Lassa fever

    PubMed Central

    Andersen, Kristian G.; Shylakhter, Ilya; Tabrizi, Shervin; Grossman, Sharon R.; Happi, Christian T.; Sabeti, Pardis C.

    2012-01-01

    Rapidly evolving viruses and other pathogens can have an immense impact on human evolution as natural selection acts to increase the prevalence of genetic variants providing resistance to disease. With the emergence of large datasets of human genetic variation, we can search for signatures of natural selection in the human genome driven by such disease-causing microorganisms. Based on this approach, we have previously hypothesized that Lassa virus (LASV) may have been a driver of natural selection in West African populations where Lassa haemorrhagic fever is endemic. In this study, we provide further evidence for this notion. By applying tests for selection to genome-wide data from the International Haplotype Map Consortium and the 1000 Genomes Consortium, we demonstrate evidence for positive selection in LARGE and interleukin 21 (IL21), two genes implicated in LASV infectivity and immunity. We further localized the signals of selection, using the recently developed composite of multiple signals method, to introns and putative regulatory regions of those genes. Our results suggest that natural selection may have targeted variants giving rise to alternative splicing or differential gene expression of LARGE and IL21. Overall, our study supports the hypothesis that selective pressures imposed by LASV may have led to the emergence of particular alleles conferring resistance to Lassa fever, and opens up new avenues of research pursuit. PMID:22312054

  14. Genome-wide scans provide evidence for positive selection of genes implicated in Lassa fever.

    PubMed

    Andersen, Kristian G; Shylakhter, Ilya; Tabrizi, Shervin; Grossman, Sharon R; Happi, Christian T; Sabeti, Pardis C

    2012-03-19

    Rapidly evolving viruses and other pathogens can have an immense impact on human evolution as natural selection acts to increase the prevalence of genetic variants providing resistance to disease. With the emergence of large datasets of human genetic variation, we can search for signatures of natural selection in the human genome driven by such disease-causing microorganisms. Based on this approach, we have previously hypothesized that Lassa virus (LASV) may have been a driver of natural selection in West African populations where Lassa haemorrhagic fever is endemic. In this study, we provide further evidence for this notion. By applying tests for selection to genome-wide data from the International Haplotype Map Consortium and the 1000 Genomes Consortium, we demonstrate evidence for positive selection in LARGE and interleukin 21 (IL21), two genes implicated in LASV infectivity and immunity. We further localized the signals of selection, using the recently developed composite of multiple signals method, to introns and putative regulatory regions of those genes. Our results suggest that natural selection may have targeted variants giving rise to alternative splicing or differential gene expression of LARGE and IL21. Overall, our study supports the hypothesis that selective pressures imposed by LASV may have led to the emergence of particular alleles conferring resistance to Lassa fever, and opens up new avenues of research pursuit. PMID:22312054

  15. Genome-wide association scan suggests basis for microtia in Awassi sheep.

    PubMed

    Jawasreh, K; Boettcher, P J; Stella, A

    2016-08-01

    Hereditary underdevelopment of the ear, a condition also known as microtia, has been observed in several sheep breeds as well as in humans and other species. Its genetic basis in sheep is unknown. The Awassi sheep, a breed native to southwest Asia, carries this phenotype and was targeted for molecular characterization via a genome-wide association study. DNA samples were collected from sheep in Jordan. Eight affected and 12 normal individuals were genotyped with the Illumina OvineSNP50(®) chip. Multilocus analyses failed to identify any genotypic association. In contrast, a single-locus analysis revealed a statistically significant association (P = 0.012, genome-wide) with a SNP at basepair 34 647 499 on OAR23. This marker is adjacent to the gene encoding transcription factor GATA-6, which has been shown to play a role in many developmental processes, including chondrogenesis. The lack of extended homozygosity in this region suggests a fairly ancient mutation, and the time of occurrence was estimated to be approximately 3000 years ago. Many of the earless sheep breeds may thus share the causative mutation, especially within the subgroup of fat-tailed, wool sheep. PMID:26990958

  16. A Genome-Wide Linkage Scan for Age at Menarche in Three Populations of European Descent

    PubMed Central

    Anderson, Carl A.; Zhu, Gu; Falchi, Mario; van den Berg, Stéphanie M.; Treloar, Susan A.; Spector, Timothy D.; Martin, Nicholas G.; Boomsma, Dorret I.; Visscher, Peter M.; Montgomery, Grant W.

    2008-01-01

    Context: Age at menarche (AAM) is an important trait both biologically and socially, a clearly defined event in female pubertal development, and has been associated with many clinically significant phenotypes. Objective: The objective of the study was to identify genetic loci influencing variation in AAM in large population-based samples from three countries. Design/Participants: Recalled AAM data were collected from 13,697 individuals and 4,899 pseudoindependent sister-pairs from three different populations (Australia, The Netherlands, and the United Kingdom) by mailed questionnaire or interview. Genome-wide variance components linkage analysis was implemented on each sample individually and in combination. Results: The mean, sd, and heritability of AAM across the three samples was 13.1 yr, 1.5 yr, and 0.69, respectively. No loci were detected that reached genome-wide significance in the combined analysis, but a suggestive locus was detected on chromosome 12 (logarithm of the odds = 2.0). Three loci of suggestive significance were seen in the U.K. sample on chromosomes 1, 4, and 18 (logarithm of the odds = 2.4, 2.2 and 3.2, respectively). Conclusions: There was no evidence for common highly penetrant variants influencing AAM. Linkage and association suggest that one trait locus for AAM is located on chromosome 12, but further studies are required to replicate these results. PMID:18647812

  17. A genome-wide scan for signatures of differential artificial selection in ten cattle breeds

    PubMed Central

    2013-01-01

    Background Since the times of domestication, cattle have been continually shaped by the influence of humans. Relatively recent history, including breed formation and the still enduring enormous improvement of economically important traits, is expected to have left distinctive footprints of selection within the genome. The purpose of this study was to map genome-wide selection signatures in ten cattle breeds and thus improve the understanding of the genome response to strong artificial selection and support the identification of the underlying genetic variants of favoured phenotypes. We analysed 47,651 single nucleotide polymorphisms (SNP) using Cross Population Extended Haplotype Homozygosity (XP-EHH). Results We set the significance thresholds using the maximum XP-EHH values of two essentially artificially unselected breeds and found up to 229 selection signatures per breed. Through a confirmation process we verified selection for three distinct phenotypes typical for one breed (polledness in Galloway, double muscling in Blanc-Bleu Belge and red coat colour in Red Holstein cattle). Moreover, we detected six genes strongly associated with known QTL for beef or dairy traits (TG, ABCG2, DGAT1, GH1, GHR and the Casein Cluster) within selection signatures of at least one breed. A literature search for genes lying in outstanding signatures revealed further promising candidate genes. However, in concordance with previous genome-wide studies, we also detected a substantial number of signatures without any yet known gene content. Conclusions These results show the power of XP-EHH analyses in cattle to discover promising candidate genes and raise the hope of identifying phenotypically important variants in the near future. The finding of plausible functional candidates in some short signatures supports this hope. For instance, MAP2K6 is the only annotated gene of two signatures detected in Galloway and Gelbvieh cattle and is already known to be associated with carcass

  18. A genome-wide scan for common alleles affecting risk for autism.

    PubMed

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R; Correia, Catarina; Abrahams, Brett S; Sykes, Nuala; Pagnamenta, Alistair T; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R; Casallo, Guillermo; Casey, Jillian; Chu, Su H; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A; Folstein, Susan E; Fombonne, Eric; Freitag, Christine M; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J; Hakonarson, Hakon; Heron, Elizabeth A; Hill, Matthew; Holt, Richard; Howe, Jennifer L; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M; Lamb, Janine A; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L; Lionel, Anath C; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R; McConachie, Helen; McDougle, Christopher J; McGrath, Jane; McMahon, William M; Melhem, Nadine M; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J; Mirza, Ghazala K; Munson, Jeff; Nelson, Stanley F; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L; Bierut, Laura J; Rice, John P; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D; Cantor, Rita M; Cook, Edwin H; Coon, Hilary; Cuccaro, Michael L; Gallagher, Louise; Geschwind, Daniel H; Gill, Michael; Haines, Jonathan L; Miller, Judith; Monaco, Anthony P; Nurnberger, John I; Paterson, Andrew D; Pericak-Vance, Margaret A; Schellenberg, Gerard D; Scherer, Stephen W; Sutcliffe, James S; Szatmari, Peter; Vicente, Astrid M; Vieland, Veronica J; Wijsman, Ellen M; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-10-15

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10(-8). When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10(-8) threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923

  19. A genome-wide scan for common alleles affecting risk for autism

    PubMed Central

    Anney, Richard; Klei, Lambertus; Pinto, Dalila; Regan, Regina; Conroy, Judith; Magalhaes, Tiago R.; Correia, Catarina; Abrahams, Brett S.; Sykes, Nuala; Pagnamenta, Alistair T.; Almeida, Joana; Bacchelli, Elena; Bailey, Anthony J.; Baird, Gillian; Battaglia, Agatino; Berney, Tom; Bolshakova, Nadia; Bölte, Sven; Bolton, Patrick F.; Bourgeron, Thomas; Brennan, Sean; Brian, Jessica; Carson, Andrew R.; Casallo, Guillermo; Casey, Jillian; Chu, Su H.; Cochrane, Lynne; Corsello, Christina; Crawford, Emily L.; Crossett, Andrew; Dawson, Geraldine; de Jonge, Maretha; Delorme, Richard; Drmic, Irene; Duketis, Eftichia; Duque, Frederico; Estes, Annette; Farrar, Penny; Fernandez, Bridget A.; Folstein, Susan E.; Fombonne, Eric; Freitag, Christine M.; Gilbert, John; Gillberg, Christopher; Glessner, Joseph T.; Goldberg, Jeremy; Green, Jonathan; Guter, Stephen J.; Hakonarson, Hakon; Heron, Elizabeth A.; Hill, Matthew; Holt, Richard; Howe, Jennifer L.; Hughes, Gillian; Hus, Vanessa; Igliozzi, Roberta; Kim, Cecilia; Klauck, Sabine M.; Kolevzon, Alexander; Korvatska, Olena; Kustanovich, Vlad; Lajonchere, Clara M.; Lamb, Janine A.; Laskawiec, Magdalena; Leboyer, Marion; Le Couteur, Ann; Leventhal, Bennett L.; Lionel, Anath C.; Liu, Xiao-Qing; Lord, Catherine; Lotspeich, Linda; Lund, Sabata C.; Maestrini, Elena; Mahoney, William; Mantoulan, Carine; Marshall, Christian R.; McConachie, Helen; McDougle, Christopher J.; McGrath, Jane; McMahon, William M.; Melhem, Nadine M.; Merikangas, Alison; Migita, Ohsuke; Minshew, Nancy J.; Mirza, Ghazala K.; Munson, Jeff; Nelson, Stanley F.; Noakes, Carolyn; Noor, Abdul; Nygren, Gudrun; Oliveira, Guiomar; Papanikolaou, Katerina; Parr, Jeremy R.; Parrini, Barbara; Paton, Tara; Pickles, Andrew; Piven, Joseph; Posey, David J; Poustka, Annemarie; Poustka, Fritz; Prasad, Aparna; Ragoussis, Jiannis; Renshaw, Katy; Rickaby, Jessica; Roberts, Wendy; Roeder, Kathryn; Roge, Bernadette; Rutter, Michael L.; Bierut, Laura J.; Rice, John P.; Salt, Jeff; Sansom, Katherine; Sato, Daisuke; Segurado, Ricardo; Senman, Lili; Shah, Naisha; Sheffield, Val C.; Soorya, Latha; Sousa, Inês; Stoppioni, Vera; Strawbridge, Christina; Tancredi, Raffaella; Tansey, Katherine; Thiruvahindrapduram, Bhooma; Thompson, Ann P.; Thomson, Susanne; Tryfon, Ana; Tsiantis, John; Van Engeland, Herman; Vincent, John B.; Volkmar, Fred; Wallace, Simon; Wang, Kai; Wang, Zhouzhi; Wassink, Thomas H.; Wing, Kirsty; Wittemeyer, Kerstin; Wood, Shawn; Yaspan, Brian L.; Zurawiecki, Danielle; Zwaigenbaum, Lonnie; Betancur, Catalina; Buxbaum, Joseph D.; Cantor, Rita M.; Cook, Edwin H.; Coon, Hilary; Cuccaro, Michael L.; Gallagher, Louise; Geschwind, Daniel H.; Gill, Michael; Haines, Jonathan L.; Miller, Judith; Monaco, Anthony P.; Nurnberger, John I.; Paterson, Andrew D.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.; Scherer, Stephen W.; Sutcliffe, James S.; Szatmari, Peter; Vicente, Astrid M.; Vieland, Veronica J.; Wijsman, Ellen M.; Devlin, Bernie; Ennis, Sean; Hallmayer, Joachim

    2010-01-01

    Although autism spectrum disorders (ASDs) have a substantial genetic basis, most of the known genetic risk has been traced to rare variants, principally copy number variants (CNVs). To identify common risk variation, the Autism Genome Project (AGP) Consortium genotyped 1558 rigorously defined ASD families for 1 million single-nucleotide polymorphisms (SNPs) and analyzed these SNP genotypes for association with ASD. In one of four primary association analyses, the association signal for marker rs4141463, located within MACROD2, crossed the genome-wide association significance threshold of P < 5 × 10−8. When a smaller replication sample was analyzed, the risk allele at rs4141463 was again over-transmitted; yet, consistent with the winner's curse, its effect size in the replication sample was much smaller; and, for the combined samples, the association signal barely fell below the P < 5 × 10−8 threshold. Exploratory analyses of phenotypic subtypes yielded no significant associations after correction for multiple testing. They did, however, yield strong signals within several genes, KIAA0564, PLD5, POU6F2, ST8SIA2 and TAF1C. PMID:20663923

  20. Multiple type 2 diabetes susceptibility genes following genome-wide association scan in UK samples

    PubMed Central

    Zeggini, Eleftheria; Weedon, Michael N.; Lindgren, Cecilia M.; Frayling, Timothy M.; Elliott, Katherine S.; Lango, Hana; Timpson, Nicholas J.; Perry, John R.B.; Rayner, Nigel W.; Freathy, Rachel M.; Barrett, Jeffrey C.; Shields, Beverley; Morris, Andrew P.; Ellard, Sian; Groves, Christopher J.; Harries, Lorna W.; Marchini, Jonathan L.; Owen, Katharine R.; Knight, Beatrice; Cardon, Lon R.; Walker, Mark; Hitman, Graham A.; Morris, Andrew D.; Doney, Alex S.F.; McCarthy, Mark I.; Hattersley, Andrew T.

    2013-01-01

    The molecular mechanisms involved in the development of type 2 diabetes are poorly understood. Starting from genome-wide genotype data for 1,924 diabetic cases and 2,938 population controls generated by the Wellcome Trust Case Control Consortium, we set out to detect replicated diabetes association signals through analysis of 3,757 additional cases and 5,346 controls, and by integration of our findings with equivalent data from other international consortia. We detected diabetes susceptibility loci in and around the genes CDKAL1, CDKN2A/CDKN2B and IGF2BP2 and confirmed the recently described associations at HHEX/IDE and SLC30A8. Our findings provide insights into the genetic architecture of type 2 diabetes, emphasizing the contribution of multiple variants of modest effect. The regions identified underscore the importance of pathways influencing pancreatic beta cell development and function in the etiology of type 2 diabetes. PMID:17463249

  1. A genome-wide association scan on estrogen receptor-negative breast cancer

    PubMed Central

    2010-01-01

    Introduction Breast cancer is a heterogeneous disease and may be characterized on the basis of whether estrogen receptors (ER) are expressed in the tumour cells. ER status of breast cancer is important clinically, and is used both as a prognostic indicator and treatment predictor. In this study, we focused on identifying genetic markers associated with ER-negative breast cancer risk. Methods We conducted a genome-wide association analysis of 285,984 single nucleotide polymorphisms (SNPs) genotyped in 617 ER-negative breast cancer cases and 4,583 controls. We also conducted a genome-wide pathway analysis on the discovery dataset using permutation-based tests on pre-defined pathways. The extent of shared polygenic variation between ER-negative and ER-positive breast cancers was assessed by relating risk scores, derived using ER-positive breast cancer samples, to disease state in independent, ER-negative breast cancer cases. Results Association with ER-negative breast cancer was not validated for any of the five most strongly associated SNPs followed up in independent studies (1,011 ER-negative breast cancer cases, 7,604 controls). However, an excess of small P-values for SNPs with known regulatory functions in cancer-related pathways was found (global P = 0.052). We found no evidence to suggest that ER-negative breast cancer shares a polygenic basis to disease with ER-positive breast cancer. Conclusions ER-negative breast cancer is a distinct breast cancer subtype that merits independent analyses. Given the clinical importance of this phenotype and the likelihood that genetic effect sizes are small, greater sample sizes and further studies are required to understand the etiology of ER-negative breast cancers. PMID:21062454

  2. Genome-wide association Scan of dental caries in the permanent dentition

    PubMed Central

    2012-01-01

    Background Over 90% of adults aged 20 years or older with permanent teeth have suffered from dental caries leading to pain, infection, or even tooth loss. Although caries prevalence has decreased over the past decade, there are still about 23% of dentate adults who have untreated carious lesions in the US. Dental caries is a complex disorder affected by both individual susceptibility and environmental factors. Approximately 35-55% of caries phenotypic variation in the permanent dentition is attributable to genes, though few specific caries genes have been identified. Therefore, we conducted the first genome-wide association study (GWAS) to identify genes affecting susceptibility to caries in adults. Methods Five independent cohorts were included in this study, totaling more than 7000 participants. For each participant, dental caries was assessed and genetic markers (single nucleotide polymorphisms, SNPs) were genotyped or imputed across the entire genome. Due to the heterogeneity among the five cohorts regarding age, genotyping platform, quality of dental caries assessment, and study design, we first conducted genome-wide association (GWA) analyses on each of the five independent cohorts separately. We then performed three meta-analyses to combine results for: (i) the comparatively younger, Appalachian cohorts (N = 1483) with well-assessed caries phenotype, (ii) the comparatively older, non-Appalachian cohorts (N = 5960) with inferior caries phenotypes, and (iii) all five cohorts (N = 7443). Top ranking genetic loci within and across meta-analyses were scrutinized for biologically plausible roles on caries. Results Different sets of genes were nominated across the three meta-analyses, especially between the younger and older age cohorts. In general, we identified several suggestive loci (P-value ≤ 10E-05) within or near genes with plausible biological roles for dental caries, including RPS6KA2 and PTK2B, involved in p38-depenedent MAPK signaling

  3. Genome-wide association scan of the time to onset of attention deficit hyperactivity disorder.

    PubMed

    Lasky-Su, Jessica; Anney, Richard J L; Neale, Benjamin M; Franke, Barbara; Zhou, Kaixin; Maller, Julian B; Vasquez, Alejandro Arias; Chen, Wai; Asherson, Philip; Buitelaar, Jan; Banaschewski, Tobias; Ebstein, Richard; Gill, Michael; Miranda, Ana; Mulas, Fernando; Oades, Robert D; Roeyers, Herbert; Rothenberger, Aribert; Sergeant, Joseph; Sonuga-Barke, Edmund; Steinhausen, Hans Christoph; Taylor, Eric; Daly, Mark; Laird, Nan; Lange, Christoph; Faraone, Stephen V

    2008-12-01

    A time-to-onset analysis for family-based samples was performed on the genomewide association (GWAS) data for attention deficit hyperactivity disorder (ADHD) to determine if associations exist with the age at onset of ADHD. The initial dataset consisted of 958 parent-offspring trios that were genotyped on the Perlegen 600,000 SNP array. After data cleaning procedures, 429,981 autosomal SNPs and 930 parent-offspring trios were used found suitable for use and a family-based logrank analysis was performed using that age at first ADHD symptoms as the quantitative trait of interest. No SNP achieved genome-wide significance, and the lowest P-values had a magnitude of 10(-7). Several SNPs among a pre-specified list of candidate genes had nominal associations including SLC9A9, DRD1, ADRB2, SLC6A3, NFIL3, ADRB1, SYT1, HTR2A, ARRB2, and CHRNA4. Of these findings SLC9A9 stood out as a promising candidate, with nominally significant SNPs in six distinct regions of the gene. PMID:18937294

  4. A genome-wide scan identifies variants in NFIB associated with metastasis in patients with osteosarcoma

    PubMed Central

    Mirabello, Lisa; Koster, Roelof; Moriarity, Branden S.; Spector, Logan G.; Meltzer, Paul S.; Gary, Joy; Machiela, Mitchell J.; Pankratz, Nathan; Panagiotou, Orestis A.; Largaespada, David; Wang, Zhaoming; Gastier-Foster, Julie M.; Gorlick, Richard; Khanna, Chand; de Toledo, Silvia Regina Caminada; Petrilli, Antonio S.; Patiño-Garcia, Ana; Sierrasesúmaga, Luis; Lecanda, Fernando; Andrulis, Irene L.; Wunder, Jay S.; Gokgoz, Nalan; Serra, Massimo; Hattinger, Claudia; Picci, Piero; Scotlandi, Katia; Flanagan, Adrienne M.; Tirabosco, Roberto; Amary, Maria Fernanda; Halai, Dina; Ballinger, Mandy L.; Thomas, David M.; Davis, Sean; Barkauskas, Donald A.; Marina, Neyssa; Helman, Lee; Otto, George M.; Becklin, Kelsie L.; Wolf, Natalie K.; Weg, Madison T.; Tucker, Margaret; Wacholder, Sholom; Fraumeni, Joseph F.; Caporaso, Neil E.; Boland, Joseph F.; Hicks, Belynda D.; Vogt, Aurelie; Burdett, Laurie; Yeager, Meredith; Hoover, Robert N.; Chanock, Stephen J.; Savage, Sharon A.

    2015-01-01

    Metastasis is the leading cause of death in osteosarcoma patients, the most common pediatric bone malignancy. We conducted a multi-stage genome-wide association study of osteosarcoma metastasis at diagnosis in 935 osteosarcoma patients to determine whether germline genetic variation contributes to risk of metastasis. We identified a SNP, rs7034162, in NFIB significantly associated with metastasis in European osteosarcoma cases, as well as in cases of African and Brazilian ancestry (meta-analysis of all cases: P=1.2×10−9, OR 2.43, 95% CI 1.83–3.24). The risk allele was significantly associated with lowered NFIB expression, which led to increased osteosarcoma cell migration, proliferation, and colony formation. Additionally, a transposon screen in mice identified a significant proportion of osteosarcomas harboring inactivating insertions in Nfib, and had lowered Nfib expression. These data suggest that germline genetic variation at rs7034162 is important in osteosarcoma metastasis, and that NFIB is an osteosarcoma metastasis susceptibility gene. PMID:26084801

  5. A genome-wide scan for selection signatures in Nelore cattle

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Brazilian Nelore cattle have been selected for growth traits over more than four decades. In recent years, reproductive and meat quality traits have become more important because of increasing consumption, exports and consumer demand. The identification of genomic regions altered by artificial selec...

  6. Genome-wide scans for candidate genes involved in the aquatic adaptation of dolphins.

    PubMed

    Sun, Yan-Bo; Zhou, Wei-Ping; Liu, He-Qun; Irwin, David M; Shen, Yong-Yi; Zhang, Ya-Ping

    2013-01-01

    Since their divergence from the terrestrial artiodactyls, cetaceans have fully adapted to an aquatic lifestyle, which represents one of the most dramatic transformations in mammalian evolutionary history. Numerous morphological and physiological characters of cetaceans have been acquired in response to this drastic habitat transition, such as thickened blubber, echolocation, and ability to hold their breath for a long period of time. However, knowledge about the molecular basis underlying these adaptations is still limited. The sequence of the genome of Tursiops truncates provides an opportunity for a comparative genomic analyses to examine the molecular adaptation of this species. Here, we constructed 11,838 high-quality orthologous gene alignments culled from the dolphin and four other terrestrial mammalian genomes and screened for positive selection occurring in the dolphin lineage. In total, 368 (3.1%) of the genes were identified as having undergone positive selection by the branch-site model. Functional characterization of these genes showed that they are significantly enriched in the categories of lipid transport and localization, ATPase activity, sense perception of sound, and muscle contraction, areas that are potentially related to cetacean adaptations. In contrast, we did not find a similar pattern in the cow, a closely related species. We resequenced some of the positively selected sites (PSSs), within the positively selected genes, and showed that most of our identified PSSs (50/52) could be replicated. The results from this study should have important implications for our understanding of cetacean evolution and their adaptations to the aquatic environment. PMID:23246795

  7. Genome wide scan for quantitative trait loci affecting tick resistance in cattle (Bos taurus × Bos indicus)

    PubMed Central

    2010-01-01

    Background In tropical countries, losses caused by bovine tick Rhipicephalus (Boophilus) microplus infestation have a tremendous economic impact on cattle production systems. Genetic variation between Bos taurus and Bos indicus to tick resistance and molecular biology tools might allow for the identification of molecular markers linked to resistance traits that could be used as an auxiliary tool in selection programs. The objective of this work was to identify QTL associated with tick resistance/susceptibility in a bovine F2 population derived from the Gyr (Bos indicus) × Holstein (Bos taurus) cross. Results Through a whole genome scan with microsatellite markers, we were able to map six genomic regions associated with bovine tick resistance. For most QTL, we have found that depending on the tick evaluation season (dry and rainy) different sets of genes could be involved in the resistance mechanism. We identified dry season specific QTL on BTA 2 and 10, rainy season specific QTL on BTA 5, 11 and 27. We also found a highly significant genome wide QTL for both dry and rainy seasons in the central region of BTA 23. Conclusions The experimental F2 population derived from Gyr × Holstein cross successfully allowed the identification of six highly significant QTL associated with tick resistance in cattle. QTL located on BTA 23 might be related with the bovine histocompatibility complex. Further investigation of these QTL will help to isolate candidate genes involved with tick resistance in cattle. PMID:20433753

  8. A genome-wide sib-pair scan for quantitative language traits reveals linkage to chromosomes 10 and 13

    PubMed Central

    Evans, P. D.; Mueller, K. L.; Gamazon, E. R.; Cox, N. J.; Tomblin, J. B.

    2016-01-01

    Although there is considerable evidence that individual differences in language development are highly heritable, there have been few genome-wide scans to locate genes associated with the trait. Previous analyses of language impairment have yielded replicable evidence for linkage to regions on chromosomes 16q, 19q, 13q (within lab) and at 13q (between labs). Here we report the first linkage study to screen the continuum of language ability, from normal to disordered, as found in the general population. 383 children from 147 sib-ships (214 sib-pairs) were genotyped on the Illumina® Linkage IVb Marker Panel using three composite language-related phenotypes and a measure of phonological memory (PM). Two regions (10q23.33; 13q33.3) yielded genome-wide significant peaks for linkage with PM. A peak suggestive of linkage was also found at 17q12 for the overall language composite. This study presents two novel genetic loci for the study of language development and disorders, but fails to replicate findings by previous groups. Possible reasons for this are discussed. PMID:25997078

  9. Genome-wide scan of 29,141 African Americans finds no evidence of directional selection since admixture.

    PubMed

    Bhatia, Gaurav; Tandon, Arti; Patterson, Nick; Aldrich, Melinda C; Ambrosone, Christine B; Amos, Christopher; Bandera, Elisa V; Berndt, Sonja I; Bernstein, Leslie; Blot, William J; Bock, Cathryn H; Caporaso, Neil; Casey, Graham; Deming, Sandra L; Diver, W Ryan; Gapstur, Susan M; Gillanders, Elizabeth M; Harris, Curtis C; Henderson, Brian E; Ingles, Sue A; Isaacs, William; De Jager, Phillip L; John, Esther M; Kittles, Rick A; Larkin, Emma; McNeill, Lorna H; Millikan, Robert C; Murphy, Adam; Neslund-Dudas, Christine; Nyante, Sarah; Press, Michael F; Rodriguez-Gil, Jorge L; Rybicki, Benjamin A; Schwartz, Ann G; Signorello, Lisa B; Spitz, Margaret; Strom, Sara S; Tucker, Margaret A; Wiencke, John K; Witte, John S; Wu, Xifeng; Yamamura, Yuko; Zanetti, Krista A; Zheng, Wei; Ziegler, Regina G; Chanock, Stephen J; Haiman, Christopher A; Reich, David; Price, Alkes L

    2014-10-01

    The extent of recent selection in admixed populations is currently an unresolved question. We scanned the genomes of 29,141 African Americans and failed to find any genome-wide-significant deviations in local ancestry, indicating no evidence of selection influencing ancestry after admixture. A recent analysis of data from 1,890 African Americans reported that there was evidence of selection in African Americans after their ancestors left Africa, both before and after admixture. Selection after admixture was reported on the basis of deviations in local ancestry, and selection before admixture was reported on the basis of allele-frequency differences between African Americans and African populations. The local-ancestry deviations reported by the previous study did not replicate in our very large sample, and we show that such deviations were expected purely by chance, given the number of hypotheses tested. We further show that the previous study's conclusion of selection in African Americans before admixture is also subject to doubt. This is because the FST statistics they used were inflated and because true signals of unusual allele-frequency differences between African Americans and African populations would be best explained by selection that occurred in Africa prior to migration to the Americas. PMID:25242497

  10. The Genetic Architecture of Seed Composition in Soybean Is Refined by Genome-Wide Association Scans Across Multiple Populations

    PubMed Central

    Vaughn, Justin N.; Nelson, Randall L.; Song, Qijian; Cregan, Perry B.; Li, Zenglu

    2014-01-01

    Soybean oil and meal are major contributors to world-wide food production. Consequently, the genetic basis for soybean seed composition has been intensely studied using family-based mapping. Population-based mapping approaches, in the form of genome-wide association (GWA) scans, have been able to resolve loci controlling moderately complex quantitative traits (QTL) in numerous crop species. Yet, it is still unclear how soybean’s unique population history will affect GWA scans. Using one of the populations in this study, we simulated phenotypes resulting from a range of genetic architectures. We found that with a heritability of 0.5, ∼100% and ∼33% of the 4 and 20 simulated QTL can be recovered, respectively, with a false-positive rate of less than ∼6×10−5 per marker tested. Additionally, we demonstrated that combining information from multi-locus mixed models and compressed linear-mixed models improves QTL identification and interpretation. We applied these insights to exploring seed composition in soybean, refining the linkage group I (chromosome 20) protein QTL and identifying additional oil QTL that may allow some decoupling of highly correlated oil and protein phenotypes. Because the value of protein meal is closely related to its essential amino acid profile, we attempted to identify QTL underlying methionine, threonine, cysteine, and lysine content. Multiple QTL were found that have not been observed in family-based mapping studies, and each trait exhibited associations across multiple populations. Chromosomes 1 and 8 contain strong candidate alleles for essential amino acid increases. Overall, we present these and additional data that will be useful in determining breeding strategies for the continued improvement of soybean’s nutrient portfolio. PMID:25246241

  11. Wide scanning spherical antenna

    NASA Technical Reports Server (NTRS)

    Shen, Bing (Inventor); Stutzman, Warren L. (Inventor)

    1995-01-01

    A novel method for calculating the surface shapes for subreflectors in a suboptic assembly of a tri-reflector spherical antenna system is introduced, modeled from a generalization of Galindo-Israel's method of solving partial differential equations to correct for spherical aberration and provide uniform feed to aperture mapping. In a first embodiment, the suboptic assembly moves as a single unit to achieve scan while the main reflector remains stationary. A feed horn is tilted during scan to maintain the illuminated area on the main spherical reflector fixed throughout the scan thereby eliminating the need to oversize the main spherical reflector. In an alternate embodiment, both the main spherical reflector and the suboptic assembly are fixed. A flat mirror is used to create a virtual image of the suboptic assembly. Scan is achieved by rotating the mirror about the spherical center of the main reflector. The feed horn is tilted during scan to maintain the illuminated area on the main spherical reflector fixed throughout the scan.

  12. The first genome-wide scan in a tunisian family with generalized epilepsy with febrile seizure plus (GEFS+).

    PubMed

    Fendri-Kriaa, Nourhene; Louhichi, Nacim; Mkaouar-Rebai, Emna; Chabchoub, Ghazi; Kammoun, Fatma; Salem, Ikhlass Hadj; Rebai, Ahmed; Triki, Chahnez; Fakhfakh, Faiza

    2010-11-01

    Generalized epilepsy with febrile seizure plus (GEFS+) is an autosomal dominant disorder. In the literature, 5 responsible genes were identified and 2 novel susceptibility loci for GEFS+ at 2p24 and 8p23-p21 were reported, indicating the genetic heterogeneity of this disorder. The aim of this report is to identify the responsible loci in a large affected Tunisian family by performing a 10cM density genome-wide scan. The highest multipoint logarithm of odds (LOD) score (1.04) was found for D5S407 in the absence of recombination. Two other interesting regions were found around marker D19S210 (LOD=0.799) and D7S484 (LOD=0.61) markers. To fine map these loci, additional markers in 2 regions on 5q13.3 and 7p14.2 were analyzed and positive LOD scores for both loci were obtained. Sequencing of the Sodium channel subunit beta-1 gene (SCN1B) (19q13.1) showed the absence of any causal mutation. Our findings emphasized the genetic heterogeneity of febrile seizures. PMID:20382841

  13. A genome-wide linkage scan reveals CD53 as an important regulator of innate TNF-α levels

    PubMed Central

    Bos, Steffan D; Lakenberg, Nico; van der Breggen, Ruud; Houwing-Duistermaat, Jeanine J; Kloppenburg, Margreet; de Craen, Anton JM; Beekman, Marian; Meulenbelt, Ingrid; Slagboom, P Eline

    2010-01-01

    Cytokines are major immune system regulators. Previously, innate cytokine profiles determined by lipopolysaccharide stimulation were shown to be highly heritable. To identify regulating genes in innate immunity, we analyzed data from a genome-wide linkage scan using microsatellites in osteoarthritis (OA) patients (The GARP study) and their innate cytokine data on interleukin (IL)-1β, IL-1Ra, IL-10 and tumor necrosis factor (TNF)α. A confirmation cohort consisted of the Leiden 85-Plus study. In this study, a linkage analysis was followed by manual selection of candidate genes in linkage regions showing LOD scores over 2.5. An single-nucleotide polymorphism (SNP) gene tagging method was applied to select SNPs on the basis of the highest level of gene tagging and possible functional effects. QTDT was used to identify the SNPs associated with innate cytokine production. Initial association signals were modeled by a linear mixed model. Through these analyses, we identified 10 putative genes involved in the regulation of TNFα. SNP rs6679497 in gene CD53 showed significant association with TNFα levels (P=0.001). No association of this SNP was observed with OA. A novel gene involved in the innate immune response of TNFα is identified. Genetic variation in this gene may have a role in diseases and disorders in which TNFα is closely involved. PMID:20407468

  14. A GENOME-WIDE SCAN FOR QTL AFFECTING CARCASS TRAITS AT CONSTANT FAT DEPTH IN A HEREFORD X COMPOSITE DOUBLE BACKCROSS POPULATION

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A genome-wide scan for chromosomal regions influencing carcass traits was conducted spanning 2.497 Morgans on 29 bovine autosomes using 170 microsatellite markers. There were 151 progeny from a single Hereford x composite bull produced by backcross matings to both Hereford and composite dams. Cattl...

  15. Genome-wide scan for visceral leishmaniasis in mixed-breed dogs identifies candidate genes involved in T helper cells and macrophage signaling

    Technology Transfer Automated Retrieval System (TEKTRAN)

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2....

  16. High-resolution genome-wide scan of genes, gene-networks and cellular systems impacting the yeast ionome

    Technology Transfer Automated Retrieval System (TEKTRAN)

    To balance the demand for uptake of essential elements with their potential toxicity living cells have complex regulatory mechanisms. Here, we describe a genome-wide screen to identify genes that impact the elemental composition (‘ionome’) of yeast Saccharomyces cerevisiae. Using inductively coupled...

  17. A genome-wide scan of selective sweeps in two broiler chicken lines divergently selected for abdominal fat content

    PubMed Central

    2012-01-01

    Background Genomic regions controlling abdominal fatness (AF) were studied in the Northeast Agricultural University broiler line divergently selected for AF. In this study, the chicken 60KSNP chip and extended haplotype homozygosity (EHH) test were used to detect genome-wide signatures of AF. Results A total of 5357 and 5593 core regions were detected in the lean and fat lines, and 51 and 57 reached a significant level (P<0.01), respectively. A number of genes in the significant core regions, including RB1, BBS7, MAOA, MAOB, EHBP1, LRP2BP, LRP1B, MYO7A, MYO9A and PRPSAP1, were detected. These genes may be important for AF deposition in chickens. Conclusions We provide a genome-wide map of selection signatures in the chicken genome, and make a contribution to the better understanding the mechanisms of selection for AF content in chickens. The selection for low AF in commercial breeding using this information will accelerate the breeding progress. PMID:23241142

  18. A Genome-Wide Scan of Selective Sweeps and Association Mapping of Fruit Traits Using Microsatellite Markers in Watermelon

    PubMed Central

    Reddy, Umesh K.; Abburi, Lavanya; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Cantrell, Robert; Vajja, Venkata Gopinath; Reddy, Rishi; Tomason, Yan R.; Levi, Amnon; Wehner, Todd C.; Nimmakayala, Padma

    2015-01-01

    Our genetic diversity study uses microsatellites of known map position to estimate genome level population structure and linkage disequilibrium, and to identify genomic regions that have undergone selection during watermelon domestication and improvement. Thirty regions that showed evidence of selective sweep were scanned for the presence of candidate genes using the watermelon genome browser (www.icugi.org). We localized selective sweeps in intergenic regions, close to the promoters, and within the exons and introns of various genes. This study provided an evidence of convergent evolution for the presence of diverse ecotypes with special reference to American and European ecotypes. Our search for location of linked markers in the whole-genome draft sequence revealed that BVWS00358, a GA repeat microsatellite, is the GAGA type transcription factor located in the 5′ untranslated regions of a structure and insertion element that expresses a Cys2His2 Zinc finger motif, with presumed biological processes related to chitin response and transcriptional regulation. In addition, BVWS01708, an ATT repeat microsatellite, located in the promoter of a DTW domain-containing protein (Cla002761); and 2 other simple sequence repeats that association mapping link to fruit length and rind thickness. PMID:25425675

  19. A genome-wide scan of selective sweeps and association mapping of fruit traits using microsatellite markers in watermelon.

    PubMed

    Reddy, Umesh K; Abburi, Lavanya; Abburi, Venkata Lakshmi; Saminathan, Thangasamy; Cantrell, Robert; Vajja, Venkata Gopinath; Reddy, Rishi; Tomason, Yan R; Levi, Amnon; Wehner, Todd C; Nimmakayala, Padma

    2015-01-01

    Our genetic diversity study uses microsatellites of known map position to estimate genome level population structure and linkage disequilibrium, and to identify genomic regions that have undergone selection during watermelon domestication and improvement. Thirty regions that showed evidence of selective sweep were scanned for the presence of candidate genes using the watermelon genome browser (www.icugi.org). We localized selective sweeps in intergenic regions, close to the promoters, and within the exons and introns of various genes. This study provided an evidence of convergent evolution for the presence of diverse ecotypes with special reference to American and European ecotypes. Our search for location of linked markers in the whole-genome draft sequence revealed that BVWS00358, a GA repeat microsatellite, is the GAGA type transcription factor located in the 5' untranslated regions of a structure and insertion element that expresses a Cys2His2 Zinc finger motif, with presumed biological processes related to chitin response and transcriptional regulation. In addition, BVWS01708, an ATT repeat microsatellite, located in the promoter of a DTW domain-containing protein (Cla002761); and 2 other simple sequence repeats that association mapping link to fruit length and rind thickness. PMID:25425675

  20. Genome wide instability scanning in chewing-tobacco associated oral cancer using inter simple sequence repeat PCR.

    PubMed

    Rai, Rekha; Kulkarni, Viraj; Saranath, Dhananjaya

    2004-11-01

    Genomic instability plays a major role in cancer, facilitating tumour progression and tumour heterogeneity. Inter simple sequence repeat PCR (ISSR-PCR) is a sensitive tool for detection of whole genome scanning. In fifteen oral cancer patients, using tumor tissue and adjacent normal tissue DNA, we investigated genomic instability regions using ISSR-PCR assay. The genomic fragments were cloned, sequenced and identified. Two-anchored dinucleotide repeat primers, (CA)(8)A/GG and (CA)(8)A/GC/T, were used in the study. About 40-50 fragments were observed on polyacrylamide gel electrophoresis, with 25 distinct fragments of less than 2 kb. The electrophoretic pattern highlighted several distinct fragments in tumor adjacent normal tissues. The distinct fragments of 258, 325, 430, 440, 600 and 900 bp sizes using (CA)(8)A/GG primer, and 300, 475, 675 and 800 bp using (CA)(8)A/GC/T primers, in the normal tissues showed partial (>50%) or complete loss in multiple tumor tissues. These fragments were eluted from the gel, cloned in pMos Blue vector and subjected to nucleotide sequencing. Insilico analysis defined the specific genomic sequences, given as follows: RP11-399D2 () on chromosome (chr)4; RP1-39J2 (), NKp44RG () and RP11-518I13 () on chr6; NC-T-2 () on chr7; RP11-586K2 () and RP11-495O10 () on chr8; RP11-101K10 () on chr9; R-794A8 () on chr14; and RP11-679B19 () on chr16. The sequences of our clones have been submitted to NCBI gene bank, accession numbers to , and . The Genomic Instability Index was calculated and ranged from 6% to 28.5% (median 12%) in the oral cancer samples, excluding one case where genomic instability was not observed. Thus, our results indicate presence of widespread genomic alterations in chewing-tobacco associated oral cancers. PMID:15509495

  1. Genome-Wide Scan of Gastrointestinal Nematode Resistance in Closed Angus Population Selected for Minimized Influence of MHC

    PubMed Central

    Kim, Eui-Soo; Sonstegard, Tad S.; da Silva, Marcos V. G. B.; Gasbarre, Louis C.; Van Tassell, Curtis P.

    2015-01-01

    Genetic markers associated with parasite indicator traits are ideal targets for study of marker assisted selection aimed at controlling infections that reduce herd use of anthelminthics. For this study, we collected gastrointestinal (GI) nematode fecal egg count (FEC) data from post-weaning animals of an Angus resource population challenged to a 26 week natural exposure on pasture. In all, data from 487 animals was collected over a 16 year period between 1992 and 2007, most of which were selected for a specific DRB1 allele to reduce the influence of potential allelic variant effects of the MHC locus. A genome-wide association study (GWAS) based on BovineSNP50 genotypes revealed six genomic regions located on bovine Chromosomes 3, 5, 8, 15 and 27; which were significantly associated (-log10 p=4.3) with Box-Cox transformed mean FEC (BC-MFEC). DAVID analysis of the genes within the significant genomic regions suggested a correlation between our results and annotation for genes involved in inflammatory response to infection. Furthermore, ROH and selection signature analyses provided strong evidence that the genomic regions associated BC-MFEC have not been affected by local autozygosity or recent experimental selection. These findings provide useful information for parasite resistance prediction for young grazing cattle and suggest new candidate gene targets for development of disease-modifying therapies or future studies of host response to GI parasite infection. PMID:25803687

  2. Family based genome-wide copy number scan identifies complex rearrangements at 17q21.31 in dyslexics.

    PubMed

    Veerappa, Avinash M; Saldanha, Marita; Padakannaya, Prakash; Ramachandra, Nallur B

    2014-10-01

    Developmental dyslexia (DD) is a complex heritable disorder with unexpected difficulty in learning to read and spell despite adequate intelligence, education, environment, and normal senses. We performed genome-wide screening for copy number variations (CNVs) in 10 large Indian dyslexic families using Affymetrix Genome-Wide Human SNP Array 6.0. Results revealed the complex genomic rearrangements due to one non-contiguous deletion and five contiguous micro duplications and micro deletions at 17q21.31 region in three dyslexic families. CNVs in this region harbor the genes KIAA1267, LRRC37A, ARL17A/B, NSFP1, and NSF. The CNVs in case 1 and case 2 at this locus were found to be in homozygous state and case 3 was a de novo CNV. These CNVs were found with at least one CNV having a common break and end points in the parents. This cluster of genes containing NSF is implicated in learning, cognition, and memory, though not formally associated with dyslexia. Molecular network analysis of these and other dyslexia related module genes suggests NSF and other genes to be associated with cellular/vesicular membrane fusion and synaptic transmission. Thus, we suggest that NSF in this cluster would be the nearest gene responsible for the learning disability phenotype. PMID:25139666

  3. Genome-wide scans of genetic variants for psychophysiological endophenotypes: introduction to this special issue of Psychophysiology.

    PubMed

    Iacono, William G

    2014-12-01

    This special issue addresses the heritability and molecular genetic basis of 17 putative endophenotypes involving resting EEG power, P300 event-related potential amplitude, electrodermal orienting and habituation, antisaccade eye tracking, and affective modulation of the startle eye blink. These measures were collected from approximately 4,900 twins and parents who provided DNA samples through their participation in the Minnesota Twin Family Study. Included are papers that detail the methodology followed, genome-wide association analyses of single nucleotide polymorphisms and genes, analysis of rare variants in the human exome, and a whole genome sequencing study. Also included are 11 articles by leading experts in psychophysiology and genetics that provide perspective and commentary. A final integrative report summarizes findings and addresses issues raised. This introduction provides an overview of the aims and rationale behind these studies. PMID:25387700

  4. Genome-Wide Linkage Scan for Primary Open Angle Glaucoma: Influences of Ancestry and Age at Diagnosis

    PubMed Central

    Qin, Xuejun; Liu, Yutao; Gibson, Jason R.; Santiago-Turla, Cecilia; Larocque-Abramson, Karen R.; Del Bono, Elizabeth; Challa, Pratap; Herndon, Leon W.; Akafo, Stephen; Wiggs, Janey L.; Schmidt, Silke; Hauser, Michael A.

    2011-01-01

    Primary open-angle glaucoma (POAG) is the most common form of glaucoma and one of the leading causes of vision loss worldwide. The genetic etiology of POAG is complex and poorly understood. The purpose of this work is to identify genomic regions of interest linked to POAG. This study is the largest genetic linkage study of POAG performed to date: genomic DNA samples from 786 subjects (538 Caucasian ancestry, 248 African ancestry) were genotyped using either the Illumina GoldenGate Linkage 4 Panel or the Illumina Infinium Human Linkage-12 Panel. A total of 5233 SNPs was analyzed in 134 multiplex POAG families (89 Caucasian ancestry, 45 African ancestry). Parametric and non-parametric linkage analyses were performed on the overall dataset and within race-specific datasets (Caucasian ancestry and African ancestry). Ordered subset analysis was used to stratify the data on the basis of age of glaucoma diagnosis. Novel linkage regions were identified on chromosomes 1 and 20, and two previously described loci—GLC1D on chromosome 8 and GLC1I on chromosome 15—were replicated. These data will prove valuable in the context of interpreting results from genome-wide association studies for POAG. PMID:21765929

  5. Additive and epistatic genome-wide association for growth and ultrasound scan measures of carcass-related traits in Brahman cattle.

    PubMed

    Ali, A A; Khatkar, M S; Kadarmideen, H N; Thomson, P C

    2015-04-01

    Genome-wide association studies are routinely used to identify genomic regions associated with traits of interest. However, this ignores an important class of genomic associations, that of epistatic interactions. A genome-wide interaction analysis between single nucleotide polymorphisms (SNPs) using highly dense markers can detect epistatic interactions, but is a difficult task due to multiple testing and computational demand. However, It is important for revealing complex trait heredity. This study considers analytical methods that detect statistical interactions between pairs of loci. We investigated a three-stage modelling procedure: (i) a model without the SNP to estimate the variance components; (ii) a model with the SNP using variance component estimates from (i), thus avoiding iteration; and (iii) using the significant SNPs from (ii) for genome-wide epistasis analysis. We fitted these three-stage models to field data for growth and ultrasound measures for subcutaneous fat thickness in Brahman cattle. The study demonstrated the usefulness of modelling epistasis in the analysis of complex traits as it revealed extra sources of genetic variation and identified potential candidate genes affecting the concentration of insulin-like growth factor-1 and ultrasound scan measure of fat depth traits. Information about epistasis can add to our understanding of the complex genetic networks that form the fundamental basis of biological systems. PMID:25754883

  6. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption.

    PubMed

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  7. Non-additive genome-wide association scan reveals a new gene associated with habitual coffee consumption

    PubMed Central

    Pirastu, Nicola; Kooyman, Maarten; Robino, Antonietta; van der Spek, Ashley; Navarini, Luciano; Amin, Najaf; Karssen, Lennart C.; Van Duijn, Cornelia M; Gasparini, Paolo

    2016-01-01

    Coffee is one of the most consumed beverages world-wide and one of the primary sources of caffeine intake. Given its important health and economic impact, the underlying genetics of its consumption has been widely studied. Despite these efforts, much has still to be uncovered. In particular, the use of non-additive genetic models may uncover new information about the genetic variants driving coffee consumption. We have conducted a genome-wide association study in two Italian populations using additive, recessive and dominant models for analysis. This has uncovered a significant association in the PDSS2 gene under the recessive model that has been replicated in an independent cohort from the Netherlands (ERF). The identified gene has been shown to negatively regulate the expression of the caffeine metabolism genes and can thus be linked to coffee consumption. Further bioinformatics analysis of eQTL and histone marks from Roadmap data has evidenced a possible role of the identified SNPs in regulating PDSS2 gene expression through enhancers present in its intron. Our results highlight a novel gene which regulates coffee consumption by regulating the expression of the genes linked to caffeine metabolism. Further studies will be needed to clarify the biological mechanism which links PDSS2 and coffee consumption. PMID:27561104

  8. Evolution of the pygmy phenotype: evidence of positive selection fro genome-wide scans in African, Asian, and Melanesian pygmies.

    PubMed

    Migliano, Andrea Bamberg; Romero, Irene Gallego; Metspalu, Mait; Leavesley, Matthew; Pagani, Luca; Antao, Tiago; Huang, Da-Wei; Sherman, Brad T; Siddle, Katharine; Scholes, Clarissa; Hudjashov, Georgi; Kaitokai, Elton; Babalu, Avis; Belatti, Maggie; Cagan, Alex; Hopkinshaw, Byrony; Shaw, Colin; Nelis, Mari; Metspalu, Ene; Mägi, Reedik; Lempicki, Richard A; Villems, Richard; Lahr, Marta Mirazon; Kivisild, Toomas

    2013-01-01

    Human pygmy populations inhabit different regions of the world, from Africa to Melanesia. In Asia, short-statured populations are often referred to as "negritos." Their short stature has been interpreted as a consequence of thermoregulatory, nutritional, and/or locomotory adaptations to life in tropical forests. A more recent hypothesis proposes that their stature is the outcome of a life history trade-off in high-mortality environments, where early reproduction is favored and, consequently, early sexual maturation and early growth cessation have coevolved. Some serological evidence of deficiencies in the growth hormone/insulin-like growth factor axis have been previously associated with pygmies' short stature. Using genome-wide single-nucleotide polymorphism genotype data, we first tested whether different negrito groups living in the Philippines and Papua New Guinea are closely related and then investigated genomic signals of recent positive selection in African, Asian, and Papuan pygmy populations. We found that negritos in the Philippines and Papua New Guinea are genetically more similar to their nonpygmy neighbors than to one another and have experienced positive selection at different genes. These results indicate that geographically distant pygmy groups are likely to have evolved their short stature independently. We also found that selection on common height variants is unlikely to explain their short stature and that different genes associated with growth, thyroid function, and sexual development are under selection in different pygmy groups. PMID:24297229

  9. A genome-wide "pleiotropy scan" does not identify new susceptibility loci for estrogen receptor negative breast cancer.

    PubMed

    Campa, Daniele; Barrdahl, Myrto; Tsilidis, Konstantinos K; Severi, Gianluca; Diver, W Ryan; Siddiq, Afshan; Chanock, Stephen; Hoover, Robert N; Ziegler, Regina G; Berg, Christine D; Buys, Saundra S; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick R; Le Marchand, Loïc; Flesch-Janys, Dieter; Lindström, Sara; Hunter, David J; Hankinson, Susan E; Willett, Walter C; Kraft, Peter; Cox, David G; Khaw, Kay-Tee; Tjønneland, Anne; Dossus, Laure; Trichopoulos, Dimitrios; Panico, Salvatore; van Gils, Carla H; Weiderpass, Elisabete; Barricarte, Aurelio; Sund, Malin; Gaudet, Mia M; Giles, Graham; Southey, Melissa; Baglietto, Laura; Chang-Claude, Jenny; Kaaks, Rudolf; Canzian, Federico

    2014-01-01

    Approximately 15-30% of all breast cancer tumors are estrogen receptor negative (ER-). Compared with ER-positive (ER+) disease they have an earlier age at onset and worse prognosis. Despite the vast number of risk variants identified for numerous cancer types, only seven loci have been unambiguously identified for ER-negative breast cancer. With the aim of identifying new susceptibility SNPs for this disease we performed a pleiotropic genome-wide association study (GWAS). We selected 3079 SNPs associated with a human complex trait or disease at genome-wide significance level (P<5 × 10(-8)) to perform a secondary analysis of an ER-negative GWAS from the National Cancer Institute's Breast and Prostate Cancer Cohort Consortium (BPC3), including 1998 cases and 2305 controls from prospective studies. We then tested the top ten associations (i.e. with the lowest P-values) using three additional populations with a total sample size of 3509 ER+ cases, 2543 ER- cases and 7031 healthy controls. None of the 3079 selected variants in the BPC3 ER-GWAS were significant at the adjusted threshold. 186 variants were associated with ER- breast cancer risk at a conventional threshold of P<0.05, with P-values ranging from 0.049 to 2.3 × 10(-4). None of the variants reached statistical significance in the replication phase. In conclusion, this study did not identify any novel susceptibility loci for ER-breast cancer using a "pleiotropic approach". PMID:24523857

  10. Genome-wide linkage scan for contraction velocity characteristics of knee musculature in the Leuven Genes for Muscular Strength Study.

    PubMed

    De Mars, Gunther; Windelinckx, An; Huygens, Wim; Peeters, Maarten W; Beunen, Gaston P; Aerssens, Jeroen; Vlietinck, Robert; Thomis, Martine A I

    2008-09-17

    The torque-velocity relationship is known to be affected by ageing, decreasing its protective role in the prevention of falls. Interindividual variability in this torque-velocity relationship is partly determined by genetic factors (h(2): 44-67%). As a first attempt, this genome-wide linkage study aimed to identify chromosomal regions linked to the torque-velocity relationship of the knee flexors and extensors. A selection of 283 informative male siblings (17-36 yr), belonging to 105 families, was used to conduct a genome-wide SNP-based (Illumina Linkage IVb panel) multipoint linkage analysis for the torque-velocity relationship of the knee flexors and extensors. The strongest evidence for linkage was found at 15q23 for the torque-velocity slope of the knee extensors (TVSE). Other interesting linkage regions with LOD scores >2 were found at 7p12.3 [logarithm of the odds ratio (LOD) = 2.03, P = 0.0011] for the torque-velocity ratio of the knee flexors (TVRF), at 2q14.3 (LOD = 2.25, P = 0.0006) for TVSE, and at 4p14 and 18q23 for the torque-velocity ratio of the knee extensors TVRE (LOD = 2.23 and 2.08; P = 0.0007 and 0.001, respectively). We conclude that many small contributing genes are involved in causing variation in the torque-velocity relationship of the knee flexor and extensor muscles. Several earlier reported candidate genes for muscle strength and muscle mass and new candidates are harbored within or in close vicinity of the linkage regions reported in the present study. PMID:18682575

  11. Genome-wide copy number scan identifies IRF6 involvement in Van der Woude syndrome in an Indian family.

    PubMed

    Manjegowda, Dinesh S; Prasad, Manu; Veerappa, Avinash M; Ramachandra, Nallur B

    2014-01-01

    Summary Van der Woude syndrome (VWS) is an autosomal dominant developmental malformation presenting with bilateral lower lip pits related to cleft lip, cleft palate and other malformations. We performed a whole-genome copy number variations (CNVs) scan in an Indian family with members suffering from VWS using 2·6 million combined SNP and CNV markers. We found CNVs affecting IRF6, a known candidate gene for VWS, in all three cases, while none of the non-VWS members showed any CNVs in the IRF6 region. The duplications and deletions of the chromosomal critical region in 1q32-q41 confirm the involvement of CNVs in IRF6 in South Indian VWS patients. Molecular network analysis of these and other cleft lip/palate related module genes suggests that they are associated with cytokine-mediated signalling pathways and response to interferon-gamma mediated signalling pathways. This is a maiden study indicating the involvement of CNVs in IRF6 in causing VWS in the Indian population. PMID:25579819

  12. Detecting Loci under Recent Positive Selection in Dairy and Beef Cattle by Combining Different Genome-Wide Scan Methods

    PubMed Central

    Utsunomiya, Yuri Tani; Pérez O’Brien, Ana Maria; Sonstegard, Tad Stewart; Van Tassell, Curtis Paul; do Carmo, Adriana Santana; Mészáros, Gábor; Sölkner, Johann; Garcia, José Fernando

    2013-01-01

    As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different methodologies, it is strong evidence that the locus has been under past selection. In this paper, a straightforward frequentist approach based on the Stouffer Method to combine P-values across different tests for evidence of recent positive selection in common variations, as well as strategies for extracting biological information from the detected signals, were described and applied to high density single nucleotide polymorphism (SNP) data generated from dairy and beef cattle (taurine and indicine). The ancestral Bovinae allele state of over 440,000 SNP is also reported. Using this combination of methods, highly significant (P<3.17×10−7) population-specific sweeps pointing out to candidate genes and pathways that may be involved in beef and dairy production were identified. The most significant signal was found in the Cornichon homolog 3 gene (CNIH3) in Brown Swiss (P = 3.82×10−12), and may be involved in the regulation of pre-ovulatory luteinizing hormone surge. Other putative pathways under selection are the glucolysis/gluconeogenesis, transcription machinery and chemokine/cytokine activity in Angus; calpain-calpastatin system and ribosome biogenesis in Brown Swiss; and gangliosides deposition in milk fat globules in Gyr. The composite method, combined with the strategies applied to retrieve functional information, may be a useful tool for surveying genome-wide selective sweeps and providing insights in to the source of selection. PMID:23696874

  13. Asymmetric Introgression in the Horticultural Living Fossil Cycas Sect. Asiorientales Using a Genome-Wide Scanning Approach

    PubMed Central

    Chiang, Yu-Chung; Huang, Bing-Hong; Chang, Chun-Wen; Wan, Yu-Ting; Lai, Shih-Jie; Huang, Shong; Liao, Pei-Chun

    2013-01-01

    The Asian cycads are mostly allopatric, distributed in small population sizes. Hybridization between allopatric species provides clues in determining the mechanism of species divergence. Horticultural introduction provides the chance of interspecific gene flow between allopatric species. Two allopatrically eastern Asian Cycas sect. Asiorientales species, C. revoluta and C. taitungensis, which are widely distributed in Ryukyus and Fujian Province and endemic to Taiwan, respectively, were planted in eastern Taiwan for horticultural reason. Higher degrees of genetic admixture in cultivated samples than wild populations in both cycad species were detected based on multilocus scans by neutral AFLP markers. Furthermore, bidirectional but asymmetric introgression by horticultural introduction of C. revoluta is evidenced by the reanalyses of species associated loci, which are assumed to be diverged after species divergence. Partial loci introgressed from native cycad to the invaders were also detected at the loci of strong species association. Consistent results tested by all neutral loci, and the species-associated loci, specify the recent introgression from the paradox of sharing of ancestral polymorphisms. Phenomenon of introgression of cultivated cycads implies niche conservation among two geographic-isolated cycads, even though the habitats of the extant wild populations of two species are distinct. PMID:23591840

  14. A genome-wide scan for common variants affecting the rate of age-related cognitive decline

    PubMed Central

    De Jager, Philip L.; Shulman, Joshua M.; Chibnik, Lori B.; Keenan, Brendan T.; Raj, Towfique; Wilson, Robert S.; Yu, Lei; Leurgans, Sue E.; Tran, Dong; Aubin, Cristin; Anderson, Christopher D.; Biffi, Alessandro; Corneveaux, Jason J.; Huentelman, Matthew J.; Rosand, Jonathan; Daly, Mark J.; Myers, Amanda J.; Reiman, Eric M.; Bennett, David A.; Evans, Denis A.

    2011-01-01

    Age-related cognitive decline is likely promoted by accumulated brain injury due to chronic conditions of aging, including neurodegenerative and vascular disease. Since common neuronal mechanisms may mediate the adaptation to diverse cerebral insults, we hypothesized that susceptibility for age-related cognitive decline may be due in part to a shared genetic network. We have therefore performed a genome-wide association study using a quantitative measure of global cognitive decline slope, based on repeated measures of 17 cognitive tests in 749 subjects from the Religious Orders Study. Top results were evaluated in three independent replication cohorts, consisting of 2,279 additional subjects with repeated cognitive testing. As expected, we find that the Alzheimer’s disease (AD) susceptibility locus, APOE, is strongly associated with rate of cognitive decline (PDISC=5.6×10−9; PJOINT=3.7×10−27). We additionally discover a variant, rs10808746, which shows consistent effects in the replication cohorts and modestly improved evidence of association in the joint analysis (PDISC=6.7×10−5; PREP=9.4×10−3; PJOINT=2.3×10−5). This variant influences the expression of two adjacent genes, PDE7A and MTFR1, which are potential regulators of inflammation and oxidative injury, respectively. Using aggregate measures of genetic risk, we find that known susceptibility loci for cardiovascular disease, type II diabetes, and inflammatory diseases are not significantly associated with cognitive decline in our cohort. Our results suggest that intermediate phenotypes, when coupled with larger sample sizes, may be a useful tool to dissect susceptibility loci for age-related cognitive decline and uncover shared molecular pathways with a role in neuronal injury. PMID:22054870

  15. A Replication Study and Genome-wide Scan of Single Nucleotide Polymorphisms Associated with Pancreatic Cancer Risk and Overall Survival

    PubMed Central

    Willis, Jason A.; Olson, Sara H.; Orlow, Irene; Mukherjee, Semanti; McWilliams, Robert R.; Kurtz, Robert C.; Klein, Robert J.

    2013-01-01

    Purpose To explore the effects of single nucleotide polymorphisms (SNPs) on pancreatic cancer risk and overall survival. Experimental Design The germline DNA of 531 pancreatic cancer cases and 305 healthy controls from a hospital-based study was genotyped at SNPs previously reported to be associated with pancreatic cancer risk or clinical outcome. We analyzed putative risk SNPs for replication of their reported effects on risk and tested for novel effects on overall survival (OS). Similarly, we analyzed putative survival-associated SNPs for replication of their reported effects on OS and tested for novel effects on risk. Lastly, we performed a genome-wide association study of OS using a subset of 252 cases, with two subsequent validation sets of 261 and 572 patients, respectively. Results Among seven risk SNPs analyzed, two (rs505922, rs9543325) were associated with risk (p<0.05). Among 24 survival-associated SNPs analyzed, one (rs9350) was associated with OS (p<0.05). No putative risk SNPs or putative survival-associated SNPs were found to be associated with OS or risk, respectively. Further, our GWAS identified a novel SNP (rs1482426, combined stage 1 and 2 p = 1.7 ×10−6, per-allele HR = 1.74, 95% CI 1.38–2.18) to be putatively associated with OS. Conclusions The effects of SNPs on pancreatic cancer risk and overall survival were replicated in our study, though further work is necessary to understand the functional mechanisms underlying these effects. More importantly, the putative association with OS identified by GWAS suggests that GWAS may be useful in identifying SNPs associated with clinical outcome in pancreatic cancer. PMID:22665904

  16. Detecting Loci under recent positive selection in dairy and beef cattle by combining different genome-wide scan methods

    Technology Transfer Automated Retrieval System (TEKTRAN)

    As the methodologies available for the detection of positive selection from genomic data vary in terms of assumptions and execution, weak correlations are expected among them. However, if there is any given signal that is consistently supported across different tests, it might be a strong evidence o...

  17. A genome-wide association scan implicates DCHS2, RUNX2, GLI3, PAX1 and EDAR in human facial variation

    PubMed Central

    Adhikari, Kaustubh; Fuentes-Guajardo, Macarena; Quinto-Sánchez, Mirsha; Mendoza-Revilla, Javier; Camilo Chacón-Duque, Juan; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Lozano, Rodrigo Barquera; Pérez, Gastón Macín; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C.; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M.; Bortolini, Maria- Cátira; Canizales-Quinteros, Samuel; Cheeseman, Michael; Rosique, Javier; Bedoya, Gabriel; Rothhammer, Francisco; Headon, Denis; González-José, Rolando; Balding, David; Ruiz-Linares, Andrés

    2016-01-01

    We report a genome-wide association scan for facial features in ∼6,000 Latin Americans. We evaluated 14 traits on an ordinal scale and found significant association (P values<5 × 10−8) at single-nucleotide polymorphisms (SNPs) in four genomic regions for three nose-related traits: columella inclination (4q31), nose bridge breadth (6p21) and nose wing breadth (7p13 and 20p11). In a subsample of ∼3,000 individuals we obtained quantitative traits related to 9 of the ordinal phenotypes and, also, a measure of nasion position. Quantitative analyses confirmed the ordinal-based associations, identified SNPs in 2q12 associated to chin protrusion, and replicated the reported association of nasion position with SNPs in PAX3. Strongest association in 2q12, 4q31, 6p21 and 7p13 was observed for SNPs in the EDAR, DCHS2, RUNX2 and GLI3 genes, respectively. Associated SNPs in 20p11 extend to PAX1. Consistent with the effect of EDAR on chin protrusion, we documented alterations of mandible length in mice with modified Edar funtion. PMID:27193062

  18. A genome-wide association scan implicates DCHS2, RUNX2, GLI3, PAX1 and EDAR in human facial variation.

    PubMed

    Adhikari, Kaustubh; Fuentes-Guajardo, Macarena; Quinto-Sánchez, Mirsha; Mendoza-Revilla, Javier; Camilo Chacón-Duque, Juan; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Lozano, Rodrigo Barquera; Pérez, Gastón Macín; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Cheeseman, Michael; Rosique, Javier; Bedoya, Gabriel; Rothhammer, Francisco; Headon, Denis; González-José, Rolando; Balding, David; Ruiz-Linares, Andrés

    2016-01-01

    We report a genome-wide association scan for facial features in ∼6,000 Latin Americans. We evaluated 14 traits on an ordinal scale and found significant association (P values<5 × 10(-8)) at single-nucleotide polymorphisms (SNPs) in four genomic regions for three nose-related traits: columella inclination (4q31), nose bridge breadth (6p21) and nose wing breadth (7p13 and 20p11). In a subsample of ∼3,000 individuals we obtained quantitative traits related to 9 of the ordinal phenotypes and, also, a measure of nasion position. Quantitative analyses confirmed the ordinal-based associations, identified SNPs in 2q12 associated to chin protrusion, and replicated the reported association of nasion position with SNPs in PAX3. Strongest association in 2q12, 4q31, 6p21 and 7p13 was observed for SNPs in the EDAR, DCHS2, RUNX2 and GLI3 genes, respectively. Associated SNPs in 20p11 extend to PAX1. Consistent with the effect of EDAR on chin protrusion, we documented alterations of mandible length in mice with modified Edar funtion. PMID:27193062

  19. Genome-Wide Scan for Visceral Leishmaniasis in Mixed-Breed Dogs Identifies Candidate Genes Involved in T Helper Cells and Macrophage Signaling

    PubMed Central

    Utsunomiya, Yuri T.; Ribeiro, Érica S.; Quintal, Amanda P. N.; Sangalli, Juliano R.; Gazola, Valquiria R.; Paula, Henrique B.; Trinconi, Cristiana M.; Lima, Valéria M. F.; Perri, Silvia H. V.; Taylor, Jeremy F.; Schnabel, Robert D.; Sonstegard, Tad S.; Garcia, José F.; Nunes, Cáris M.

    2015-01-01

    We conducted a genome-wide scan for visceral leishmaniasis in mixed-breed dogs from a highly endemic area in Brazil using 149,648 single nucleotide polymorphism (SNP) markers genotyped in 20 cases and 28 controls. Using a mixed model approach, we found two candidate loci on canine autosomes 1 and 2. The positional association on chromosome 2 mapped to a predicted DNAse sensitive site in CD14+ monocytes that serve as a cis-regulatory element for the expression of interleukin alpha receptors 2 (IL2RA) and 15 (IL15RA). Both interleukins were previously found to lead to protective T helper 1 cell (Th1) response against Leishmania spp. in humans and mice. The associated marker on chromosome 1 was located between two predicted transcription factor binding sites regulating the expression of the transducin-like enhancer of split 1 gene (TLE1), an important player in Notch signaling. This pathway is critical for macrophage activity and CD4+ T cell differentiation into Th1 and T helper 2. Together, these findings suggest that the human and mouse model for protective response against Leishmania spp., which involves Th1 and macrophage modulation by interleukins 2, 15, gamma interferon and Notch signaling, may also hold for the canine model. PMID:26348501

  20. An Islet-Targeted Genome-Wide Association Scan Identifies Novel Genes Implicated in Cytokine-Mediated Islet Stress in Type 2 Diabetes

    PubMed Central

    Sharma, Poonam R.; Mackey, Aaron J.; Dejene, Eden A.; Ramadan, James W.; Langefeld, Carl D.; Palmer, Nicholette D.; Taylor, Kent D.; Wagenknecht, Lynne E.; Watanabe, Richard M.; Rich, Stephen S.

    2015-01-01

    Genome-wide association studies in human type 2 diabetes (T2D) have renewed interest in the pancreatic islet as a contributor to T2D risk. Chronic low-grade inflammation resulting from obesity is a risk factor for T2D and a possible trigger of β-cell failure. In this study, microarray data were collected from mouse islets after overnight treatment with cytokines at concentrations consistent with the chronic low-grade inflammation in T2D. Genes with a cytokine-induced change of >2-fold were then examined for associations between single nucleotide polymorphisms and the acute insulin response to glucose (AIRg) using data from the Genetics Underlying Diabetes in Hispanics (GUARDIAN) Consortium. Significant evidence of association was found between AIRg and single nucleotide polymorphisms in Arap3 (5q31.3), F13a1 (6p25.3), Klhl6 (3q27.1), Nid1 (1q42.3), Pamr1 (11p13), Ripk2 (8q21.3), and Steap4 (7q21.12). To assess the potential relevance to islet function, mouse islets were exposed to conditions modeling low-grade inflammation, mitochondrial stress, endoplasmic reticulum (ER) stress, glucotoxicity, and lipotoxicity. RT-PCR revealed that one or more forms of stress significantly altered expression levels of all genes except Arap3. Thapsigargin-induced ER stress up-regulated both Pamr1 and Klhl6. Three genes confirmed microarray predictions of significant cytokine sensitivity: F13a1 was down-regulated 3.3-fold by cytokines, Ripk2 was up-regulated 1.5- to 3-fold by all stressors, and Steap4 was profoundly cytokine sensitive (167-fold up-regulation). Three genes were thus closely associated with low-grade inflammation in murine islets and also with a marker for islet function (AIRg) in a diabetes-prone human population. This islet-targeted genome-wide association scan identified several previously unrecognized candidate genes related to islet dysfunction during the development of T2D. PMID:26018251

  1. A Genome-Wide Association Scan on the Levels of Markers of Inflammation in Sardinians Reveals Associations That Underpin Its Complex Regulation

    PubMed Central

    Naitza, Silvia; Porcu, Eleonora; Steri, Maristella; Taub, Dennis D.; Mulas, Antonella; Xiao, Xiang; Strait, James; Dei, Mariano; Lai, Sandra; Busonero, Fabio; Maschio, Andrea; Usala, Gianluca; Zoledziewska, Magdalena; Sidore, Carlo; Zara, Ilenia; Pitzalis, Maristella; Loi, Alessia; Virdis, Francesca; Piras, Roberta; Deidda, Francesca; Whalen, Michael B.; Crisponi, Laura; Concas, Antonio; Podda, Carlo; Uzzau, Sergio; Scheet, Paul; Longo, Dan L.; Lakatta, Edward; Abecasis, Gonçalo R.; Cao, Antonio; Schlessinger, David; Uda, Manuela

    2012-01-01

    Identifying the genes that influence levels of pro-inflammatory molecules can help to elucidate the mechanisms underlying this process. We first conducted a two-stage genome-wide association scan (GWAS) for the key inflammatory biomarkers Interleukin-6 (IL-6), the general measure of inflammation erythrocyte sedimentation rate (ESR), monocyte chemotactic protein-1 (MCP-1), and high-sensitivity C-reactive protein (hsCRP) in a large cohort of individuals from the founder population of Sardinia. By analysing 731,213 autosomal or X chromosome SNPs and an additional ∼1.9 million imputed variants in 4,694 individuals, we identified several SNPs associated with the selected quantitative trait loci (QTLs) and replicated all the top signals in an independent sample of 1,392 individuals from the same population. Next, to increase power to detect and resolve associations, we further genotyped the whole cohort (6,145 individuals) for 293,875 variants included on the ImmunoChip and MetaboChip custom arrays. Overall, our combined approach led to the identification of 9 genome-wide significant novel independent signals—5 of which were identified only with the custom arrays—and provided confirmatory evidence for an additional 7. Novel signals include: for IL-6, in the ABO gene (rs657152, p = 2.13×10−29); for ESR, at the HBB (rs4910472, p = 2.31×10−11) and UCN119B/SPPL3 (rs11829037, p = 8.91×10−10) loci; for MCP-1, near its receptor CCR2 (rs17141006, p = 7.53×10−13) and in CADM3 (rs3026968, p = 7.63×10−13); for hsCRP, within the CRP gene (rs3093077, p = 5.73×10−21), near DARC (rs3845624, p = 1.43×10−10), UNC119B/SPPL3 (rs11829037, p = 1.50×10−14), and ICOSLG/AIRE (rs113459440, p = 1.54×10−08) loci. Confirmatory evidence was found for IL-6 in the IL-6R gene (rs4129267); for ESR at CR1 (rs12567990) and TMEM57 (rs10903129); for MCP-1 at DARC (rs12075); and for hsCRP at CRP (rs1205), HNF1A (rs225918), and APOC-I (rs

  2. A genome-wide linkage scan of bipolar disorder in Latino families identifies susceptibility loci at 8q24 and 14q32.

    PubMed

    Gonzalez, Suzanne; Camarillo, Cynthia; Rodriguez, Marco; Ramirez, Mercedes; Zavala, Juan; Armas, Regina; Contreras, Salvador A; Contreras, Javier; Dassori, Albana; Almasy, Laura; Flores, Deborah; Jerez, Alvaro; Raventós, Henriette; Ontiveros, Alfonso; Nicolini, Humberto; Escamilla, Michael

    2014-09-01

    A genome-wide nonparametric linkage screen was performed to localize Bipolar Disorder (BP) susceptibility loci in a sample of 3757 individuals of Latino ancestry. The sample included 963 individuals with BP phenotype (704 relative pairs) from 686 families recruited from the US, Mexico, Costa Rica, and Guatemala. Non-parametric analyses were performed over a 5 cM grid with an average genetic coverage of 0.67 cM. Multipoint analyses were conducted across the genome using non-parametric Kong & Cox LOD scores along with Sall statistics for all relative pairs. Suggestive and significant genome-wide thresholds were calculated based on 1000 simulations. Single-marker association tests in the presence of linkage were performed assuming a multiplicative model with a population prevalence of 2%. We identified two genome-wide significant susceptibly loci for BP at 8q24 and 14q32, and a third suggestive locus at 2q13-q14. Within these three linkage regions, the top associated single marker (rs1847694, P = 2.40 × 10(-5)) is located 195 Kb upstream of DPP10 in Chromosome 2. DPP10 is prominently expressed in brain neuronal populations, where it has been shown to bind and regulate Kv4-mediated A-type potassium channels. Taken together, these results provide additional evidence that 8q24, 14q32, and 2q13-q14 are susceptibly loci for BP and these regions may be involved in the pathogenesis of BP in the Latino population. PMID:25044503

  3. Genome-wide scan with nearly 700 000 SNPs in two Sardinian sub-populations suggests some regions as candidate targets for positive selection

    PubMed Central

    Piras, Ignazio Stefano; De Montis, Antonella; Calò, Carla Maria; Marini, Monica; Atzori, Manuela; Corrias, Laura; Sazzini, Marco; Boattini, Alessio; Vona, Giuseppe; Contu, Licinio

    2012-01-01

    This paper explores the genetic structure and signatures of natural selection in different sub-populations from the Island of Sardinia, exploiting information from nearly 700 000 autosomal SNPs genotyped with the Affymetrix Genome-Wide Human SNP 6.0 Array. The genetic structure of the Sardinian population and its position within the context of other Mediterranean and European human groups were investigated in depth by comparing our data with publicly available data sets. Principal components and admixture analyses suggest a clustering of the examined samples in two significantly differentiated sub-populations (Ogliastra and Southern Sardinia), as confirmed by AMOVA (FST=0.011; P<0.001). Differentiation of these sub-populations was still evident when they were pooled together with supplementary Sardinian samples from HGDP and compared with several other European, North-African and Near Eastern populations, confirming the uniqueness of the Sardinian genetic background. Moreover, by applying several statistical approaches aimed at assessing differences at the SNP level, the highest differentiated genomic regions between Ogliastra and Southern Sardinia were thus investigated via an extended haplotype homozygosity (EHH)-based test to point out potential selective sweeps. Using this approach, 40 genomic regions were detected, with significant differences between Ogliastra and Southern Sardinia. These regions were subsequently investigated using a long-range haplotype test, which found significant REHH values for SNPs rs11070188 and rs11070192 in the Ogliastra sub-population. In the light of these results and the overlap of the different computed statistics, the region encompassing these loci can be considered a strong candidate to have undergone selective pressure in Ogliastra. PMID:22535185

  4. First genome-wide association scan on neurophysiological endophenotypes points to trans-regulation effects on SLC2A3 in dyslexic children.

    PubMed

    Roeske, D; Ludwig, K U; Neuhoff, N; Becker, J; Bartling, J; Bruder, J; Brockschmidt, F F; Warnke, A; Remschmidt, H; Hoffmann, P; Müller-Myhsok, B; Nöthen, M M; Schulte-Körne, G

    2011-01-01

    Dyslexia is one of the most common learning disorders affecting about 5% of all school-aged children. It has been shown that event-related potential measurements reveal differences between dyslexic children and age-matched controls. This holds particularly true for mismatch negativity (MMN), which reflects automatic speech deviance processing and is altered in dyslexic children. We performed a whole-genome association analysis in 200 dyslexic children, focusing on MMN measurements. We identified rs4234898, a marker located on chromosome 4q32.1, to be significantly associated with the late MMN component. This association could be replicated in an independent second sample of 186 dyslexic children, reaching genome-wide significance in the combined sample (P = 5.14e-08). We also found an association between the late MMN component and a two-marker haplotype of rs4234898 and rs11100040, one of its neighboring single nucleotide polymorphisms (SNPs). In the combined sample, this marker combination withstands correction for multiple testing (P = 6.71e-08). Both SNPs lie in a region devoid of any protein-coding genes; however, they both show significant association with mRNA-expression levels of SLC2A3 on chromosome 12, the predominant facilitative glucose transporter in neurons. Our results suggest a possible trans-regulation effect on SLC2A3, which might lead to glucose deficits in dyslexic children and could explain their attenuated MMN in passive listening tasks. PMID:19786962

  5. Using an Inbred Horse Breed in a High Density Genome-Wide Scan for Genetic Risk Factors of Insect Bite Hypersensitivity (IBH)

    PubMed Central

    Velie, Brandon D.; Shrestha, Merina; Franҫois, Liesbeth; Schurink, Anouk; Tesfayonas, Yohannes G.; Stinckens, Anneleen; Blott, Sarah; Ducro, Bart J.; Mikko, Sofia; Thomas, Ruth; Swinburne, June E.; Sundqvist, Marie; Eriksson, Susanne; Buys, Nadine; Lindgren, Gabriella

    2016-01-01

    While susceptibility to hypersensitive reactions is a common problem amongst humans and animals alike, the population structure of certain animal species and breeds provides a more advantageous route to better understanding the biology underpinning these conditions. The current study uses Exmoor ponies, a highly inbred breed of horse known to frequently suffer from insect bite hypersensitivity, to identify genomic regions associated with a type I and type IV hypersensitive reaction. A total of 110 cases and 170 controls were genotyped on the 670K Axiom Equine Genotyping Array. Quality control resulted in 452,457 SNPs and 268 individuals being tested for association. Genome-wide association analyses were performed using the GenABEL package in R and resulted in the identification of two regions of interest on Chromosome 8. The first region contained the most significant SNP identified, which was located in an intron of the DCC netrin 1 receptor gene. The second region identified contained multiple top SNPs and encompassed the PIGN, KIAA1468, TNFRSF11A, ZCCHC2, and PHLPP1 genes. Although additional studies will be needed to validate the importance of these regions in horses and the relevance of these regions in other species, the knowledge gained from the current study has the potential to be a step forward in unraveling the complex nature of hypersensitive reactions. PMID:27070818

  6. Using an Inbred Horse Breed in a High Density Genome-Wide Scan for Genetic Risk Factors of Insect Bite Hypersensitivity (IBH).

    PubMed

    Velie, Brandon D; Shrestha, Merina; Franҫois, Liesbeth; Schurink, Anouk; Tesfayonas, Yohannes G; Stinckens, Anneleen; Blott, Sarah; Ducro, Bart J; Mikko, Sofia; Thomas, Ruth; Swinburne, June E; Sundqvist, Marie; Eriksson, Susanne; Buys, Nadine; Lindgren, Gabriella

    2016-01-01

    While susceptibility to hypersensitive reactions is a common problem amongst humans and animals alike, the population structure of certain animal species and breeds provides a more advantageous route to better understanding the biology underpinning these conditions. The current study uses Exmoor ponies, a highly inbred breed of horse known to frequently suffer from insect bite hypersensitivity, to identify genomic regions associated with a type I and type IV hypersensitive reaction. A total of 110 cases and 170 controls were genotyped on the 670K Axiom Equine Genotyping Array. Quality control resulted in 452,457 SNPs and 268 individuals being tested for association. Genome-wide association analyses were performed using the GenABEL package in R and resulted in the identification of two regions of interest on Chromosome 8. The first region contained the most significant SNP identified, which was located in an intron of the DCC netrin 1 receptor gene. The second region identified contained multiple top SNPs and encompassed the PIGN, KIAA1468, TNFRSF11A, ZCCHC2, and PHLPP1 genes. Although additional studies will be needed to validate the importance of these regions in horses and the relevance of these regions in other species, the knowledge gained from the current study has the potential to be a step forward in unraveling the complex nature of hypersensitive reactions. PMID:27070818

  7. Genome-wide scan of job-related exhaustion with three replication studies implicate a susceptibility variant at the UST gene locus

    PubMed Central

    Sulkava, Sonja; Ollila, Hanna M.; Ahola, Kirsi; Partonen, Timo; Viitasalo, Katriina; Kettunen, Johannes; Lappalainen, Maarit; Kivimäki, Mika; Vahtera, Jussi; Lindström, Jaana; Härmä, Mikko; Puttonen, Sampsa; Salomaa, Veikko; Paunio, Tiina

    2013-01-01

    Job-related exhaustion is the core dimension of burnout, a work-related stress syndrome that has several negative health consequences. In this study, we explored the molecular genetic background of job-related exhaustion. A genome-wide analysis of job-related exhaustion was performed in the GENMETS subcohort (n = 1256) of the Finnish population-based Health 2000 study. Replication analyses included an analysis of the strongest associations in the rest of the Health 2000 sample (n = 1660 workers) and in three independent populations (the FINRISK population cohort, n = 10 753; two occupational cohorts, total n = 1451). Job-related exhaustion was ascertained using a standard self-administered questionnaire (the Maslach Burnout Inventory (MBI)-GS exhaustion scale in the Health 2000 sample and the occupational cohorts) or a single question (FINRISK). A variant located in an intron of UST, uronyl-2-sulfotransferase (rs13219957), gave the strongest statistical evidence in the initial genome-wide study (P = 1.55 × 10−7), and was associated with job-related exhaustion in all the replication sets (P < 0.05; P = 6.75 × 10−7 from the meta-analysis). Consistent with studies of mood disorders, individual common genetic variants did not have any strong effect on job-related exhaustion. However, the nominally significant signals from the allelic variant of UST in four separate samples suggest that this variant might be a weak risk factor for job-related exhaustion. Together with the previously reported associations of other dermatan/chondroitin sulfate genes with mood disorders, these results indicate a potential molecular pathway for stress-related traits and mark a candidate region for further studies of job-related and general exhaustion. PMID:23620144

  8. A genome-wide linkage scan identifies multiple chromosomal regions influencing serum lipid levels in the population on the Samoan islands* s⃞

    PubMed Central

    Åberg, Karolina; Dai, Feng; Sun, Guangyun; Keighley, Ember; Indugula, Subba Rao; Bausserman, Linda; Viali, Satupaitea; Tuitele, John; Deka, Ranjan; Weeks, Daniel E.; McGarvey, Stephen T.

    2008-01-01

    Abnormal lipid levels are important risk factors for cardiovascular diseases. We conducted genome-wide variance component linkage analyses to search for loci influencing total cholesterol (TC), LDL, HDL and triglyceride in families residing in American Samoa and Samoa as well as in a combined sample from the two polities. We adjusted the traits for a number of environmental covariates, such as smoking, alcohol consumption, physical activity, and material lifestyle. We found suggestive univariate linkage with log of the odds (LOD) scores > 3 for LDL on 6p21-p12 (LOD 3.13) in Samoa and on 12q21-q23 (LOD 3.07) in American Samoa. Furthermore, in American Samoa on 12q21, we detected genome-wide linkage (LODeq 3.38) to the bivariate trait TC-LDL. Telomeric of this region, on 12q24, we found suggestive bivariate linkage to TC-HDL (LODeq 3.22) in the combined study sample. In addition, we detected suggestive univariate linkage (LOD 1.9–2.93) on chromosomes 4p-q, 6p, 7q, 9q, 11q, 12q 13q, 15q, 16p, 18q, 19p, 19q and Xq23 and suggestive bivariate linkage (LODeq 2.05–2.62) on chromosomes 6p, 7q, 12p, 12q, and 19p-q. In conclusion, chromosome 6p and 12q may host promising susceptibility loci influencing lipid levels; however, the low degree of overlap between the three study samples strongly encourages further studies of the lipid-related traits. PMID:18594117

  9. Genome-Wide Scan and Test of Candidate Genes in the Snail Biomphalaria glabrata Reveal New Locus Influencing Resistance to Schistosoma mansoni

    PubMed Central

    Tennessen, Jacob A.; Bonner, Kaitlin M.; Bollmann, Stephanie R.; Johnstun, Joel A.; Yeh, Jan-Ying; Marine, Melanie; Tavalire, Hannah F.; Bayne, Christopher J.; Blouin, Michael S.

    2015-01-01

    Background New strategies to combat the global scourge of schistosomiasis may be revealed by increased understanding of the mechanisms by which the obligate snail host can resist the schistosome parasite. However, few molecular markers linked to resistance have been identified and characterized in snails. Methodology/Principal Findings Here we test six independent genetic loci for their influence on resistance to Schistosoma mansoni strain PR1 in the 13-16-R1 strain of the snail Biomphalaria glabrata. We first identify a genomic region, RADres, showing the highest differentiation between susceptible and resistant inbred lines among 1611 informative restriction-site associated DNA (RAD) markers, and show that it significantly influences resistance in an independent set of 439 outbred snails. The additive effect of each RADres resistance allele is 2-fold, similar to that of the previously identified resistance gene sod1. The data fit a model in which both loci contribute independently and additively to resistance, such that the odds of infection in homozygotes for the resistance alleles at both loci (13% infected) is 16-fold lower than the odds of infection in snails without any resistance alleles (70% infected). Genome-wide linkage disequilibrium is high, with both sod1 and RADres residing on haplotype blocks >2Mb, and with other markers in each block also showing significant effects on resistance; thus the causal genes within these blocks remain to be demonstrated. Other candidate loci had no effect on resistance, including the Guadeloupe Resistance Complex and three genes (aif, infPhox, and prx1) with immunological roles and expression patterns tied to resistance, which must therefore be trans-regulated. Conclusions/Significance The loci RADres and sod1 both have strong effects on resistance to S. mansoni. Future approaches to control schistosomiasis may benefit from further efforts to characterize and harness this natural genetic variation. PMID:26372103

  10. A Genome-Wide Scan Reveals Important Roles of DNA Methylation in Human Longevity by Regulating Age-Related Disease Genes

    PubMed Central

    Li, Qi-Gang; Wu, Huan; Luo, Long-Hai; Kong, Qing-Peng

    2015-01-01

    It is recognized that genetic factors contribute to human longevity. Besides the hypothesis of existence of longevity genes, another suggests that a lower frequency of risk alleles decreases the incidence of age-related diseases in the long-lived people. However, the latter finds no support from recent genetic studies. Considering the crucial role of epigenetic modification in gene regulation, we then hypothesize that suppressing disease-related genes in longevity individuals is likely achieved by epigenetic modification, e.g. DNA methylation. To test this hypothesis, we investigated the genome-wide methylation profile in 4 Chinese female centenarians and 4 middle-aged controls using methyl-DNA immunoprecipitation sequencing. 626 differentially methylated regions (DMRs) were observed between both groups. Interestingly, genes with these DMRs were enriched in age-related diseases, including type-2 diabetes, cardiovascular disease, stroke and Alzheimer’s disease. This pattern remains rather stable after including methylomes of two white individuals. Further analyses suggest that the observed DMRs likely have functional roles in regulating disease-associated gene expressions, with some genes [e.g. caspase 3 (CASP3)] being down-regulated whereas the others [i.e. interleukin 1 receptor, type 2 (IL1R2)] up-regulated. Therefore, our study suggests that suppressing the disease-related genes via epigenetic modification is an important contributor to human longevity. PMID:25793257

  11. Genome-Wide Linkage Scan for Prostate Cancer Susceptibility in Finland: Evidence for a Novel Locus on 2q37.3 and confirmation of signal on 17q21-q22

    PubMed Central

    Cropp, Cheryl D.; Simpson, Claire L; Wahlfors, Tiina; Ha, Nati; George, Asha; Jones, MaryPat S.; Harper, Ursula; Ponciano-Jackson, Damaris; Green, Tiffany A.; Tammela, Teuvo L. J.; Bailey-Wilson, Joan; Schleutker, Johanna

    2011-01-01

    Genome-wide linkage studies have been used to localize rare and highly penetrant prostate cancer (PRCA) susceptibility genes. Linkage studies performed in different ethnic backgrounds and populations have been somewhat disparate, resulting in multiple, often irreproducible signals because of genetic heterogeneity and high sporadic background of the disease. Our first genome-wide linkage study and subsequent fine-mapping study of Finnish hereditary prostate cancer (HPC) families gave evidence of linkage to one region. Here, we conducted subsequent scans with microsatellites and SNPs in a total of 69 Finnish HPC families. GENEHUNTER-PLUS was used for parametric and non-parametric analyses. Our microsatellite genome-wide linkage study provided evidence of linkage to 17q12-q23, with a heterogeneity LOD (HLOD) score of 3.14 in a total of 54 of the 69 families. Genome-wide SNP analysis of 59 of the 69 families gave a highest HLOD score of 3.40 at 2q37.3 under a dominant high penetrance model. Analyzing all 69 families by combining microsatellite and SNP maps also yielded HLOD scores of > 3.3 in two regions (2q37.3 and 17q12-q21.3). These significant linkage peaks on chromosome 2 and 17 confirm previous linkage evidence of a locus on 17q from other populations and provide a basis for continued research into genetic factors involved in PRCA. Fine-mapping analysis of these regions is ongoing and candidate genes at linked loci are currently under analysis. PMID:21207418

  12. A Meta-Analysis of Genome-Wide Association Scans Identifies IL18RAP, PTPN2, TAGAP, and PUS10 As Shared Risk Loci for Crohn's Disease and Celiac Disease

    PubMed Central

    Boucher, Gabrielle; Beauchamp, Claudine; Trynka, Gosia; Dubois, Patrick C.; Lagacé, Caroline; Stokkers, Pieter C. F.; Hommes, Daan W.; Barisani, Donatella; Palmieri, Orazio; Annese, Vito; van Heel, David A.; Weersma, Rinse K.; Daly, Mark J.; Wijmenga, Cisca; Rioux, John D.

    2011-01-01

    Crohn's disease (CD) and celiac disease (CelD) are chronic intestinal inflammatory diseases, involving genetic and environmental factors in their pathogenesis. The two diseases can co-occur within families, and studies suggest that CelD patients have a higher risk to develop CD than the general population. These observations suggest that CD and CelD may share common genetic risk loci. Two such shared loci, IL18RAP and PTPN2, have already been identified independently in these two diseases. The aim of our study was to explicitly identify shared risk loci for these diseases by combining results from genome-wide association study (GWAS) datasets of CD and CelD. Specifically, GWAS results from CelD (768 cases, 1,422 controls) and CD (3,230 cases, 4,829 controls) were combined in a meta-analysis. Nine independent regions had nominal association p-value <1.0×10−5 in this meta-analysis and showed evidence of association to the individual diseases in the original scans (p-value <1×10−2 in CelD and <1×10−3 in CD). These include the two previously reported shared loci, IL18RAP and PTPN2, with p-values of 3.37×10−8 and 6.39×10−9, respectively, in the meta-analysis. The other seven had not been reported as shared loci and thus were tested in additional CelD (3,149 cases and 4,714 controls) and CD (1,835 cases and 1,669 controls) cohorts. Two of these loci, TAGAP and PUS10, showed significant evidence of replication (Bonferroni corrected p-values <0.0071) in the combined CelD and CD replication cohorts and were firmly established as shared risk loci of genome-wide significance, with overall combined p-values of 1.55×10−10 and 1.38×10−11 respectively. Through a meta-analysis of GWAS data from CD and CelD, we have identified four shared risk loci: PTPN2, IL18RAP, TAGAP, and PUS10. The combined analysis of the two datasets provided the power, lacking in the individual GWAS for single diseases, to detect shared loci with a relatively small effect. PMID:21298027

  13. Enhancer scanning to locate regulatory regions in genomic loci.

    PubMed

    Buckley, Melissa; Gjyshi, Anxhela; Mendoza-Fandiño, Gustavo; Baskin, Rebekah; Carvalho, Renato S; Carvalho, Marcelo A; Woods, Nicholas T; Monteiro, Alvaro N A

    2016-01-01

    This protocol provides a rapid, streamlined and scalable strategy to systematically scan genomic regions for the presence of transcriptional regulatory regions that are active in a specific cell type. It creates genomic tiles spanning a region of interest that are subsequently cloned by recombination into a luciferase reporter vector containing the simian virus 40 promoter. Tiling clones are transfected into specific cell types to test for the presence of transcriptional regulatory regions. The protocol includes testing of different single-nucleotide polymorphism (SNP) alleles to determine their effect on regulatory activity. This procedure provides a systematic framework for identifying candidate functional SNPs within a locus during functional analysis of genome-wide association studies. This protocol adapts and combines previous well-established molecular biology methods to provide a streamlined strategy, based on automated primer design and recombinational cloning, allowing one to rapidly go from a genomic locus to a set of candidate functional SNPs in 8 weeks. PMID:26658467

  14. Enhancer scanning to locate regulatory regions in genomic loci

    PubMed Central

    Buckley, Melissa; Gjyshi, Anxhela; Mendoza-Fandiño, Gustavo; Baskin, Rebekah; Carvalho, Renato S.; Carvalho, Marcelo A.; Woods, Nicholas T.; Monteiro, Alvaro N.A.

    2016-01-01

    The present protocol provides a rapid, streamlined and scalable strategy to systematically scan genomic regions for the presence of transcriptional regulatory regions active in a specific cell type. It creates genomic tiles spanning a region of interest that are subsequently cloned by recombination into a luciferase reporter vector containing the Simian Virus 40 promoter. Tiling clones are transfected into specific cell types to test for the presence of transcriptional regulatory regions. The protocol includes testing of different SNP (single nucleotide polymorphism) alleles to determine their effect on regulatory activity. This procedure provides a systematic framework to identify candidate functional SNPs within a locus during functional analysis of genome-wide association studies. This protocol adapts and combines previous well-established molecular biology methods to provide a streamlined strategy, based on automated primer design and recombinational cloning to rapidly go from a genomic locus to a set of candidate functional SNPs in eight weeks. PMID:26658467

  15. Phenome-wide analysis of genome-wide polygenic scores.

    PubMed

    Krapohl, E; Euesden, J; Zabaneh, D; Pingault, J-B; Rimfeld, K; von Stumm, S; Dale, P S; Breen, G; O'Reilly, P F; Plomin, R

    2016-09-01

    Genome-wide polygenic scores (GPS), which aggregate the effects of thousands of DNA variants from genome-wide association studies (GWAS), have the potential to make genetic predictions for individuals. We conducted a systematic investigation of associations between GPS and many behavioral traits, the behavioral phenome. For 3152 unrelated 16-year-old individuals representative of the United Kingdom, we created 13 GPS from the largest GWAS for psychiatric disorders (for example, schizophrenia, depression and dementia) and cognitive traits (for example, intelligence, educational attainment and intracranial volume). The behavioral phenome included 50 traits from the domains of psychopathology, personality, cognitive abilities and educational achievement. We examined phenome-wide profiles of associations for the entire distribution of each GPS and for the extremes of the GPS distributions. The cognitive GPS yielded stronger predictive power than the psychiatric GPS in our UK-representative sample of adolescents. For example, education GPS explained variation in adolescents' behavior problems (~0.6%) and in educational achievement (~2%) but psychiatric GPS were associated with neither. Despite the modest effect sizes of current GPS, quantile analyses illustrate the ability to stratify individuals by GPS and opportunities for research. For example, the highest and lowest septiles for the education GPS yielded a 0.5 s.d. difference in mean math grade and a 0.25 s.d. difference in mean behavior problems. We discuss the usefulness and limitations of GPS based on adult GWAS to predict genetic propensities earlier in development. PMID:26303664

  16. Genome-Wide Association Studies of Cancer

    PubMed Central

    Stadler, Zsofia K.; Thom, Peter; Robson, Mark E.; Weitzel, Jeffrey N.; Kauff, Noah D.; Hurley, Karen E.; Devlin, Vincent; Gold, Bert; Klein, Robert J.; Offit, Kenneth

    2010-01-01

    Knowledge of the inherited risk for cancer is an important component of preventive oncology. In addition to well-established syndromes of cancer predisposition, much remains to be discovered about the genetic variation underlying susceptibility to common malignancies. Increased knowledge about the human genome and advances in genotyping technology have made possible genome-wide association studies (GWAS) of human diseases. These studies have identified many important regions of genetic variation associated with an increased risk for human traits and diseases including cancer. Understanding the principles, major findings, and limitations of GWAS is becoming increasingly important for oncologists as dissemination of genomic risk tests directly to consumers is already occurring through commercial companies. GWAS have contributed to our understanding of the genetic basis of cancer and will shed light on biologic pathways and possible new strategies for targeted prevention. To date, however, the clinical utility of GWAS-derived risk markers remains limited. PMID:20585100

  17. Genome-wide Membrane Protein Structure Prediction

    PubMed Central

    Piccoli, Stefano; Suku, Eda; Garonzi, Marianna; Giorgetti, Alejandro

    2013-01-01

    Transmembrane proteins allow cells to extensively communicate with the external world in a very accurate and specific way. They form principal nodes in several signaling pathways and attract large interest in therapeutic intervention, as the majority pharmaceutical compounds target membrane proteins. Thus, according to the current genome annotation methods, a detailed structural/functional characterization at the protein level of each of the elements codified in the genome is also required. The extreme difficulty in obtaining high-resolution three-dimensional structures, calls for computational approaches. Here we review to which extent the efforts made in the last few years, combining the structural characterization of membrane proteins with protein bioinformatics techniques, could help describing membrane proteins at a genome-wide scale. In particular we analyze the use of comparative modeling techniques as a way of overcoming the lack of high-resolution three-dimensional structures in the human membrane proteome. PMID:24403851

  18. Genome-wide gene-based association study.

    PubMed

    Yang, Hsin-Chou; Liang, Yu-Jen; Chung, Chia-Min; Chen, Jia-Wei; Pan, Wen-Harn

    2009-01-01

    Genome-wide association studies, which analyzes hundreds of thousands of single-nucleotide polymorphisms to identify disease susceptibility genes, are challenging because the work involves intensive computation and complex modeling. We propose a two-stage genome-wide association scanning procedure, consisting of a single-locus association scan for the first stage and a gene-based association scan for the second stage. Marginal effects of single-nucleotide polymorphisms are examined by using the exact Armitage trend test or logistic regression, and gene effects are examined by using a p-value combination method. Compared with some existing single-locus and multilocus methods, the proposed method has the following merits: 1) convenient for definition of biologically meaningful regions, 2) powerful for detection of minor-effect genes, 3) helpful for alleviation of a multiple-testing problem, and 4) convenient for result interpretation. The method was applied to study Genetic Analysis Workshop 16 Problem 1 rheumatoid arthritis data, and strong association signals were found. The results show that the human major histocompatibility complex region is the most important genomic region associated with rheumatoid arthritis. Moreover, previously reported genes including PTPN22, C5, and IL2RB were confirmed; novel genes including HLA-DRA, BTNL2, C6orf10, NOTCH4, TAP2, and TNXB were identified by our analysis. PMID:20018002

  19. Genome-wide patterns of selection in 230 ancient Eurasians

    PubMed Central

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R.; Llamas, Bastien; Dryomov, Stanislav; Pickrel, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vayacheslav; Rojo Guerra, Manuel A.; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W.; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2016-01-01

    Ancient DNA makes it possible to directly witness natural selection by analyzing samples from populations before, during and after adaptation events. Here we report the first scan for selection using ancient DNA, capitalizing on the largest genome-wide dataset yet assembled: 230 West Eurasians dating to between 6500 and 1000 BCE, including 163 with newly reported data. The new samples include the first genome-wide data from the Anatolian Neolithic culture whose genetic material we extracted from the DNA-rich petrous bone and who we show were members of the population that was the source of Europe’s first farmers. We also report a complete transect of the steppe region in Samara between 5500 and 1200 BCE that allows us to recognize admixture from at least two external sources into steppe populations during this period. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  20. Genome-wide patterns of selection in 230 ancient Eurasians.

    PubMed

    Mathieson, Iain; Lazaridis, Iosif; Rohland, Nadin; Mallick, Swapan; Patterson, Nick; Roodenberg, Songül Alpaslan; Harney, Eadaoin; Stewardson, Kristin; Fernandes, Daniel; Novak, Mario; Sirak, Kendra; Gamba, Cristina; Jones, Eppie R; Llamas, Bastien; Dryomov, Stanislav; Pickrell, Joseph; Arsuaga, Juan Luís; de Castro, José María Bermúdez; Carbonell, Eudald; Gerritsen, Fokke; Khokhlov, Aleksandr; Kuznetsov, Pavel; Lozano, Marina; Meller, Harald; Mochalov, Oleg; Moiseyev, Vyacheslav; Guerra, Manuel A Rojo; Roodenberg, Jacob; Vergès, Josep Maria; Krause, Johannes; Cooper, Alan; Alt, Kurt W; Brown, Dorcas; Anthony, David; Lalueza-Fox, Carles; Haak, Wolfgang; Pinhasi, Ron; Reich, David

    2015-12-24

    Ancient DNA makes it possible to observe natural selection directly by analysing samples from populations before, during and after adaptation events. Here we report a genome-wide scan for selection using ancient DNA, capitalizing on the largest ancient DNA data set yet assembled: 230 West Eurasians who lived between 6500 and 300 bc, including 163 with newly reported data. The new samples include, to our knowledge, the first genome-wide ancient DNA from Anatolian Neolithic farmers, whose genetic material we obtained by extracting from petrous bones, and who we show were members of the population that was the source of Europe's first farmers. We also report a transect of the steppe region in Samara between 5600 and 300 bc, which allows us to identify admixture into the steppe from at least two external sources. We detect selection at loci associated with diet, pigmentation and immunity, and two independent episodes of selection on height. PMID:26595274

  1. GWIDD: Genome-wide protein docking database

    PubMed Central

    Kundrotas, Petras J.; Zhu, Zhengwei; Vakser, Ilya A.

    2010-01-01

    Structural information on interacting proteins is important for understanding life processes at the molecular level. Genome-wide docking database is an integrated resource for structural studies of protein–protein interactions on the genome scale, which combines the available experimental data with models obtained by docking techniques. Current database version (August 2009) contains 25 559 experimental and modeled 3D structures for 771 organisms spanned over the entire universe of life from viruses to humans. Data are organized in a relational database with user-friendly search interface allowing exploration of the database content by a number of parameters. Search results can be interactively previewed and downloaded as PDB-formatted files, along with the information relevant to the specified interactions. The resource is freely available at http://gwidd.bioinformatics.ku.edu. PMID:19900970

  2. Massively expedited genome-wide heritability analysis (MEGHA).

    PubMed

    Ge, Tian; Nichols, Thomas E; Lee, Phil H; Holmes, Avram J; Roffman, Joshua L; Buckner, Randy L; Sabuncu, Mert R; Smoller, Jordan W

    2015-02-24

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction. PMID:25675487

  3. Massively expedited genome-wide heritability analysis (MEGHA)

    PubMed Central

    Ge, Tian; Nichols, Thomas E.; Lee, Phil H.; Holmes, Avram J.; Roffman, Joshua L.; Buckner, Randy L.; Sabuncu, Mert R.; Smoller, Jordan W.

    2015-01-01

    The discovery and prioritization of heritable phenotypes is a computational challenge in a variety of settings, including neuroimaging genetics and analyses of the vast phenotypic repositories in electronic health record systems and population-based biobanks. Classical estimates of heritability require twin or pedigree data, which can be costly and difficult to acquire. Genome-wide complex trait analysis is an alternative tool to compute heritability estimates from unrelated individuals, using genome-wide data that are increasingly ubiquitous, but is computationally demanding and becomes difficult to apply in evaluating very large numbers of phenotypes. Here we present a fast and accurate statistical method for high-dimensional heritability analysis using genome-wide SNP data from unrelated individuals, termed massively expedited genome-wide heritability analysis (MEGHA) and accompanying nonparametric sampling techniques that enable flexible inferences for arbitrary statistics of interest. MEGHA produces estimates and significance measures of heritability with several orders of magnitude less computational time than existing methods, making heritability-based prioritization of millions of phenotypes based on data from unrelated individuals tractable for the first time to our knowledge. As a demonstration of application, we conducted heritability analyses on global and local morphometric measurements derived from brain structural MRI scans, using genome-wide SNP data from 1,320 unrelated young healthy adults of non-Hispanic European ancestry. We also computed surface maps of heritability for cortical thickness measures and empirically localized cortical regions where thickness measures were significantly heritable. Our analyses demonstrate the unique capability of MEGHA for large-scale heritability-based screening and high-dimensional heritability profile construction. PMID:25675487

  4. Genome-Wide Association Studies: A Primer

    PubMed Central

    Corvin, Aiden; Craddock, Nick; Sullivan, Patrick F.

    2014-01-01

    There have been nearly 400genome-wide association studies published since 2005. The GWAS approach has been exceptionally successful in identifying common genetic variants that predispose to a variety of complex human diseases and biochemical and anthropometric traits. Although this approach is relatively new, there are many excellent reviews of different aspects of the GWAS method. Here, we provide a primer, an annotated overview of the GWAS method with particular reference to psychiatric genetics. We dissect the GWAS methodology into its components and provide a brief description with citations and links to reviews that cover the topic in detail. PMID:19895722

  5. Genome-wide association studies in pharmacogenomics.

    PubMed

    Daly, Ann K

    2010-04-01

    Genome-wide association (GWA) studies for pharmacogenomics-related traits are increasingly being performed to identify loci that affect either drug response or susceptibility to adverse drug reactions. Until now, only the largest effects have been detected, partly because of the challenges of obtaining large numbers of cases for pharmacogenomic studies. Since 2007, a range of pharmacogenomics GWA studies have been published that have identified several interesting and novel associations between drug responses or reactions and clinically relevant loci, showing the value of this approach. PMID:20300088

  6. Genome-wide association studies in neurology

    PubMed Central

    Tan, Meng-Shan; Jiang, Teng

    2014-01-01

    Genome-wide association studies (GWAS) are a powerful tool for understanding the genetic underpinnings of human disease. In this article, we briefly review the role and findings of GWAS in common neurological diseases, including Stroke, Alzheimer’s disease, Parkinson’s disease, epilepsy, multiple sclerosis, migraine, amyotrophic lateral sclerosis, frontotemporal lobar degeneration, restless legs syndrome, intracranial aneurysm, human prion diseases and moyamoya disease. We then discuss the present and future implications of these findings with regards to disease prediction, uncovering basic biology, and the development of potential therapeutic agents. PMID:25568877

  7. Dating the age of admixture via wavelet transform analysis of genome-wide data.

    PubMed

    Pugach, Irina; Matveyev, Rostislav; Wollstein, Andreas; Kayser, Manfred; Stoneking, Mark

    2011-01-01

    We describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixed human populations. The wavelet transform method offers better resolution than existing methods for dating admixture, and can be applied to either SNP or sequence data from humans or other species. PMID:21352535

  8. Dating the age of admixture via wavelet transform analysis of genome-wide data

    PubMed Central

    2011-01-01

    We describe a PCA-based genome scan approach to analyze genome-wide admixture structure, and introduce wavelet transform analysis as a method for estimating the time of admixture. We test the wavelet transform method with simulations and apply it to genome-wide SNP data from eight admixed human populations. The wavelet transform method offers better resolution than existing methods for dating admixture, and can be applied to either SNP or sequence data from humans or other species. PMID:21352535

  9. Genome-wide identification of enhancer elements.

    PubMed

    Tulin, Sarah; Barsi, Julius C; Bocconcelli, Carlo; Smith, Joel

    2016-01-01

    We present a prospective genome-wide regulatory element database for the sea urchin embryo and the modified chromosome capture-related methodology used to create it. The method we developed is termed GRIP-seq for genome-wide regulatory element immunoprecipitation and combines features of chromosome conformation capture, chromatin immunoprecipitation, and paired-end next-generation sequencing with molecular steps that enrich for active cis-regulatory elements associated with basal transcriptional machinery. The first GRIP-seq database, available to the community, comes from S. purpuratus 24 hpf embryos and takes advantage of the extremely well-characterized cis-regulatory elements in this system for validation. In addition, using the GRIP-seq database, we identify and experimentally validate a novel, intronic cis-regulatory element at the onecut locus. We find GRIP-seq signal sensitively identifies active cis-regulatory elements with a high signal-to-noise ratio for both distal and intronic elements. This promising GRIP-seq protocol has the potential to address a rate-limiting step in resolving comprehensive, predictive network models in all systems. PMID:27389984

  10. Genome-wide analysis correlates Ayurveda Prakriti

    PubMed Central

    Govindaraj, Periyasamy; Nizamuddin, Sheikh; Sharath, Anugula; Jyothi, Vuskamalla; Rotti, Harish; Raval, Ritu; Nayak, Jayakrishna; Bhat, Balakrishna K.; Prasanna, B. V.; Shintre, Pooja; Sule, Mayura; Joshi, Kalpana S.; Dedge, Amrish P.; Bharadwaj, Ramachandra; Gangadharan, G. G.; Nair, Sreekumaran; Gopinath, Puthiya M.; Patwardhan, Bhushan; Kondaiah, Paturu; Satyamoorthy, Kapaettu; Valiathan, Marthanda Varma Sankaran; Thangaraj, Kumarasamy

    2015-01-01

    The practice of Ayurveda, the traditional medicine of India, is based on the concept of three major constitutional types (Vata, Pitta and Kapha) defined as “Prakriti”. To the best of our knowledge, no study has convincingly correlated genomic variations with the classification of Prakriti. In the present study, we performed genome-wide SNP (single nucleotide polymorphism) analysis (Affymetrix, 6.0) of 262 well-classified male individuals (after screening 3416 subjects) belonging to three Prakritis. We found 52 SNPs (p ≤ 1 × 10−5) were significantly different between Prakritis, without any confounding effect of stratification, after 106 permutations. Principal component analysis (PCA) of these SNPs classified 262 individuals into their respective groups (Vata, Pitta and Kapha) irrespective of their ancestry, which represent its power in categorization. We further validated our finding with 297 Indian population samples with known ancestry. Subsequently, we found that PGM1 correlates with phenotype of Pitta as described in the ancient text of Caraka Samhita, suggesting that the phenotypic classification of India’s traditional medicine has a genetic basis; and its Prakriti-based practice in vogue for many centuries resonates with personalized medicine. PMID:26511157

  11. Genomic scan for genes predisposing to schizophrenia

    SciTech Connect

    Coon, H.; Jensen. S.; Holik, J.

    1994-03-15

    We initiated a genome-wide search for genes predisposing to schizophrenia by ascertaining 9 families, each containing three to five cases of schizophrenia. The 9 pedigrees were initially genotyped with 329 polymorphic DNA loci distributed throughout the genome. Assuming either autosomal dominant or recessive inheritance, 254 DNA loci yielded lod scores less than -2.0 at {theta} = 0.0, 101 DNA markers gave lod scores less than -2.0 at {theta} = 0.05, while 5 DNA loci produced maximum lod scores greater than 1: D4S35, D14S17, D15S1, D22S84, and D22S55. Of the DNA markers yielding lod scores greater than 1, D4S35 and D22S55 also were suggestive of linkage when the Affected-Pedigree-Member method was used. The families were then genotyped with four highly polymorphic simple sequence repeat markers; possible linkage diminished with DNA markers mapping nearby D4S35, while suggestive evidence of linkage remained with loci in the region of D22S55. Although follow-up investigation of these chromosomal regions may be warranted, our linkage results should be viewed as preliminary observations, as 35 unaffected persons are not past the age of risk. 90 refs., 3 tabs.

  12. Genome-wide signals of positive selection in human evolution

    PubMed Central

    Enard, David; Messer, Philipp W.; Petrov, Dmitri A.

    2014-01-01

    The role of positive selection in human evolution remains controversial. On the one hand, scans for positive selection have identified hundreds of candidate loci, and the genome-wide patterns of polymorphism show signatures consistent with frequent positive selection. On the other hand, recent studies have argued that many of the candidate loci are false positives and that most genome-wide signatures of adaptation are in fact due to reduction of neutral diversity by linked deleterious mutations, known as background selection. Here we analyze human polymorphism data from the 1000 Genomes Project and detect signatures of positive selection once we correct for the effects of background selection. We show that levels of neutral polymorphism are lower near amino acid substitutions, with the strongest reduction observed specifically near functionally consequential amino acid substitutions. Furthermore, amino acid substitutions are associated with signatures of recent adaptation that should not be generated by background selection, such as unusually long and frequent haplotypes and specific distortions in the site frequency spectrum. We use forward simulations to argue that the observed signatures require a high rate of strongly adaptive substitutions near amino acid changes. We further demonstrate that the observed signatures of positive selection correlate better with the presence of regulatory sequences, as predicted by the ENCODE Project Consortium, than with the positions of amino acid substitutions. Our results suggest that adaptation was frequent in human evolution and provide support for the hypothesis of King and Wilson that adaptive divergence is primarily driven by regulatory changes. PMID:24619126

  13. A genome-wide DNA methylation study in colorectal carcinoma

    PubMed Central

    2011-01-01

    Background We performed a genome-wide scan of 27,578 CpG loci covering 14,475 genes to identify differentially methylated loci (DML) in colorectal carcinoma (CRC). Methods We used Illumina's Infinium methylation assay in paired DNA samples extracted from 24 fresh frozen CRC tissues and their corresponding normal colon tissues from 24 consecutive diagnosed patients at a tertiary medical center. Results We found a total of 627 DML in CRC covering 513 genes, of which 535 are novel DML covering 465 genes. We also validated the Illumina Infinium methylation data for top-ranking genes by non-bisulfite conversion q-PCR-based methyl profiler assay in a subset of the same samples. We also carried out integration of genome-wide copy number and expression microarray along with methylation profiling to see the functional effect of methylation. Gene Set Enrichment Analysis (GSEA) showed that among the major "gene sets" that are hypermethylated in CRC are the sets: "inhibition of adenylate cyclase activity by G-protein signaling", "Rac guanyl-nucleotide exchange factor activity", "regulation of retinoic acid receptor signaling pathway" and "estrogen receptor activity". Two-level nested cross validation showed that DML-based predictive models may offer reasonable sensitivity (around 89%), specificity (around 95%), positive predictive value (around 95%) and negative predictive value (around 89%), suggesting that these markers may have potential clinical application. Conclusion Our genome-wide methylation study in CRC clearly supports most of the previous findings; additionally we found a large number of novel DML in CRC tissue. If confirmed in future studies, these findings may lead to identification of genomic markers for potential clinical application. PMID:21699707

  14. Genome Wide Methylome Alterations in Lung Cancer.

    PubMed

    Mullapudi, Nandita; Ye, Bin; Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D; Spivack, Simon D

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)-non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  15. Genome Wide Methylome Alterations in Lung Cancer

    PubMed Central

    Suzuki, Masako; Fazzari, Melissa; Han, Weiguo; Shi, Miao K.; Marquardt, Gaby; Lin, Juan; Wang, Tao; Keller, Steven; Zhu, Changcheng; Locker, Joseph D.; Spivack, Simon D.

    2015-01-01

    Aberrant cytosine 5-methylation underlies many deregulated elements of cancer. Among paired non-small cell lung cancers (NSCLC), we sought to profile DNA 5-methyl-cytosine features which may underlie genome-wide deregulation. In one of the more dense interrogations of the methylome, we sampled 1.2 million CpG sites from twenty-four NSCLC tumor (T)–non-tumor (NT) pairs using a methylation-sensitive restriction enzyme- based HELP-microarray assay. We found 225,350 differentially methylated (DM) sites in adenocarcinomas versus adjacent non-tumor tissue that vary in frequency across genomic compartment, particularly notable in gene bodies (GB; p<2.2E-16). Further, when DM was coupled to differential transcriptome (DE) in the same samples, 37,056 differential loci in adenocarcinoma emerged. Approximately 90% of the DM-DE relationships were non-canonical; for example, promoter DM associated with DE in the same direction. Of the canonical changes noted, promoter (PR) DM loci with reciprocal changes in expression in adenocarcinomas included HBEGF, AGER, PTPRM, DPT, CST1, MELK; DM GB loci with concordant changes in expression included FOXM1, FERMT1, SLC7A5, and FAP genes. IPA analyses showed adenocarcinoma-specific promoter DMxDE overlay identified familiar lung cancer nodes [tP53, Akt] as well as less familiar nodes [HBEGF, NQO1, GRK5, VWF, HPGD, CDH5, CTNNAL1, PTPN13, DACH1, SMAD6, LAMA3, AR]. The unique findings from this study include the discovery of numerous candidate The unique findings from this study include the discovery of numerous candidate methylation sites in both PR and GB regions not previously identified in NSCLC, and many non-canonical relationships to gene expression. These DNA methylation features could potentially be developed as risk or diagnostic biomarkers, or as candidate targets for newer methylation locus-targeted preventive or therapeutic agents. PMID:26683690

  16. Wide field of view multifocal scanning microscopy with sparse sampling

    NASA Astrophysics Data System (ADS)

    Wang, Jie; Wu, Jigang

    2016-02-01

    We propose to use sparsely sampled line scans with a sparsity-based reconstruction method to obtain images in a wide field of view (WFOV) multifocal scanning microscope. In the WFOV microscope, we used a holographically generated irregular focus grid to scan the sample in one dimension and then reconstructed the sample image from line scans by measuring the transmission of the foci through the sample during scanning. The line scans were randomly spaced with average spacing larger than the Nyquist sampling requirement, and the image was recovered with sparsity-based reconstruction techniques. With this scheme, the acquisition data can be significantly reduced and the restriction for equally spaced foci positions can be removed, indicating simpler experimental requirement. We built a prototype system and demonstrated the effectiveness of the reconstruction by recovering microscopic images of a U.S. Air Force target and an onion skin cell microscope slide with 40, 60, and 80% missing data with respect to the Nyquist sampling requirement.

  17. Chronic Periodontitis Genome-wide Association Studies

    PubMed Central

    Rhodin, K.; Divaris, K.; North, K.E.; Barros, S.P.; Moss, K.; Beck, J.D.; Offenbacher, S.

    2014-01-01

    Recent genome-wide association studies (GWAS) of chronic periodontitis (CP) offer rich data sources for the investigation of candidate genes, functional elements, and pathways. We used GWAS data of CP (n = 4,504) and periodontal pathogen colonization (n = 1,020) from a cohort of adult Americans of European descent participating in the Atherosclerosis Risk in Communities study and employed a MAGENTA approach (i.e., meta-analysis gene set enrichment of variant associations) to obtain gene-centric and gene set association results corrected for gene size, number of single-nucleotide polymorphisms, and local linkage disequilibrium characteristics based on the human genome build 18 (National Center for Biotechnology Information build 36). We used the Gene Ontology, Ingenuity, KEGG, Panther, Reactome, and Biocarta databases for gene set enrichment analyses. Six genes showed evidence of statistically significant association: 4 with severe CP (NIN, p = 1.6 × 10−7; ABHD12B, p = 3.6 × 10−7; WHAMM, p = 1.7 × 10−6; AP3B2, p = 2.2 × 10−6) and 2 with high periodontal pathogen colonization (red complex–KCNK1, p = 3.4 × 10−7; Porphyromonas gingivalis–DAB2IP, p = 1.0 × 10−6). Top-ranked genes for moderate CP were HGD (p = 1.4 × 10−5), ZNF675 (p = 1.5 × 10−5), TNFRSF10C (p = 2.0 × 10−5), and EMR1 (p = 2.0 × 10−5). Loci containing NIN, EMR1, KCNK1, and DAB2IP had showed suggestive evidence of association in the earlier single-nucleotide polymorphism–based analyses, whereas WHAMM and AP2B2 emerged as novel candidates. The top gene sets included severe CP (“endoplasmic reticulum membrane,” “cytochrome P450,” “microsome,” and “oxidation reduction”) and moderate CP (“regulation of gene expression,” “zinc ion binding,” “BMP signaling pathway,” and “ruffle”). Gene-centric analyses offer a promising avenue for efficient interrogation of large-scale GWAS data. These results highlight genes in previously identified loci and

  18. Voxelwise genome-wide association study (vGWAS)

    PubMed Central

    Stein, Jason L.; Hua, Xue; Lee, Suh; Ho, April J.; Leow, Alex D.; Toga, Arthur W.; Saykin, Andrew J.; Shen, Li; Foroud, Tatiana; Pankratz, Nathan; Huentelman, Matthew J.; Craig, David W.; Gerber, Jill D.; Allen, April N.; Corneveaux, Jason J.; DeChairo, Bryan M.; Potkin, Steven G.; Weiner, Michael W.; Thompson, Paul M.

    2010-01-01

    The structure of the human brain is highly heritable, and is thought to be influenced by many common genetic variants, many of which are currently unknown. Recent advances in neuroimaging and genetics have allowed collection of both highly detailed structural brain scans and genome-wide genotype information. This wealth of information presents a new opportunity to find the genes influencing brain structure. Here we explore the relation between 448,293 single nucleotide polymorphisms in each of 31,622 voxels of the entire brain across 740 elderly subjects (mean age±s.d.: 75.52±6.82 years; 438 male) including subjects with Alzheimer's disease, Mild Cognitive Impairment, and healthy elderly controls from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We used tensor-based morphometry to measure individual differences in brain structure at the voxel level relative to a study-specific template based on healthy elderly subjects. We then conducted a genome-wide association at each voxel to identify genetic variants of interest. By studying only the most associated variant at each voxel, we developed a novel method to address the multiple comparisons problem and computational burden associated with the unprecedented amount of data. No variant survived the strict significance criterion, but several genes worthy of further exploration were identified, including CSMD2 and CADPS2. These genes have high relevance to brain structure. This is the first voxelwise genome wide association study to our knowledge, and offers a novel method to discover genetic influences on brain structure. PMID:20171287

  19. Genome-wide methylation profiling of schizophrenia.

    PubMed

    Rukova, B; Staneva, R; Hadjidekova, S; Stamenov, G; Milanova; Toncheva, D

    2014-12-01

    Schizophrenia is one of the major psychiatric disorders. It is a disorder of complex inheritance, involving both heritable and environmental factors. DNA methylation is an inheritable epigenetic modification that stably alters gene expression. We reasoned that genetic modifications that are a result of environmental stimuli could also make a contribution. We have performed 26 high-resolution genome-wide methylation array analyses to determine the methylation status of 27,627 CpG islands and compared the data between patients and healthy controls. Methylation profiles of DNAs were analyzed in six pools: 220 schizophrenia patients; 220 age-matched healthy controls; 110 female schizophrenia patients; 110 age-matched healthy females; 110 male schizophrenia patients; 110 age-matched healthy males. We also investigated the methylation status of 20 individual patient DNA samples (eight females and 12 males. We found significant differences in the methylation profile between schizophrenia and control DNA pools. We found new candidate genes that principally participate in apoptosis, synaptic transmission and nervous system development (GABRA2, LIN7B, CASP3). Methylation profiles differed between the genders. In females, the most important genes participate in apoptosis and synaptic transmission (XIAP, GABRD, OXT, KRT7), whereas in the males, the implicated genes in the molecular pathology of the disease were DHX37, MAP2K2, FNDC4 and GIPC1. Data from the individual methylation analyses confirmed, the gender-specific pools results. Our data revealed major differences in methylation profiles between schizophrenia patients and controls and between male and female patients. The dysregulated activity of the candidate genes could play a role in schizophrenia pathogenesis. PMID:25937794

  20. Genome-Wide Methylation Profiling of Schizophrenia

    PubMed Central

    Rukova, B; Staneva, R; Hadjidekova, S; Stamenov, G; Milanova; Toncheva, D

    2014-01-01

    Schizophrenia is one of the major psychiatric disorders. It is a disorder of complex inheritance, involving both heritable and environmental factors. DNA methylation is an inheritable epigenetic modification that stably alters gene expression. We reasoned that genetic modifications that are a result of environmental stimuli could also make a contribution. We have performed 26 high-resolution genome-wide methylation array analyses to determine the methylation status of 27,627 CpG islands and compared the data between patients and healthy controls. Methylation profiles of DNAs were analyzed in six pools: 220 schizophrenia patients; 220 age-matched healthy controls; 110 female schizophrenia patients; 110 age-matched healthy females; 110 male schizophrenia patients; 110 age-matched healthy males. We also investigated the methylation status of 20 individual patient DNA samples (eight females and 12 males. We found significant differences in the methylation profile between schizophrenia and control DNA pools. We found new candidate genes that principally participate in apoptosis, synaptic transmission and nervous system development (GABRA2, LIN7B, CASP3). Methylation profiles differed between the genders. In females, the most important genes participate in apoptosis and synaptic transmission (XIAP, GABRD, OXT, KRT7), whereas in the males, the implicated genes in the molecular pathology of the disease were DHX37, MAP2K2, FNDC4 and GIPC1. Data from the individual methylation analyses confirmed, the gender-specific pools results. Our data revealed major differences in methylation profiles between schizophrenia patients and controls and between male and female patients. The dysregulated activity of the candidate genes could play a role in schizophrenia pathogenesis. PMID:25937794

  1. High-Resolution, Wide-Field-of-View Scanning Telescope

    NASA Technical Reports Server (NTRS)

    Sepulveda, Cesar; Wilson, Robert; Seshadri, Suresh

    2007-01-01

    A proposed telescope would afford high resolution over a narrow field of view (<0.10 ) while scanning over a total field of view nominally 16 wide without need to slew the entire massive telescope structure. The telescope design enables resolution of a 1-m-wide object in a 50- km-wide area of the surface of the Earth as part of a 200-km-wide area field of view monitored from an orbit at an altitude of 700 km. The conceptual design of this telescope could also be adapted to other applications both terrestrial and extraterrestrial in which there are requirements for telescopes that afford both wide- and narrow-field capabilities. In the proposed telescope, the scanning would be effected according to a principle similar to that of the Arecibo radio telescope, in which the primary mirror is stationary with respect to the ground and a receiver is moved across the focal surface of the primary mirror. The proposed telescope would comprise (1) a large spherical primary mirror that would afford high resolution over a narrow field of view and (2) a small displaceable optical relay segment that would be pivoted about the center of an aperture stop to effect the required scanning (see figure). Taken together, both comprise a scanning narrow-angle telescope that does not require slewing the telescope structure. In normal operation, the massive telescope structure would stare at a fixed location on the ground. The inner moveable relay optic would be pivoted to scan the narrower field of view over the wider one, making it possible to retain a fixed telescope orientation, while obtaining high-resolution images over multiple target areas during an interval of 3 to 4 minutes in the intended orbit. The pivoting relay segment of the narrow-angle telescope would include refractive and reflective optical elements, including two aspherical mirrors, to counteract the spherical aberration of the primary mirror. Overall, the combination of the primary mirror and the smaller relay optic

  2. Wide-Angle-Scanning Reflectarray Antennas Actuated by MEMS

    NASA Technical Reports Server (NTRS)

    Fang, Houfei; Huang, John; Thomson, Mark W.

    2009-01-01

    An effort to develop large-aperture, wide-angle-scanning reflectarray antennas for microwave radar and communication systems is underway. In an antenna of this type as envisioned, scanning of the radiated or incident microwave beam would be effected through mechanical rotation of the passive (reflective) patch antenna elements, using microelectromechanical systems (MEMS) stepping rotary actuators typified by piezoelectric micromotors. It is anticipated that the cost, mass, and complexity of such an antenna would be less than, and the reliability greater than, those of an electronically scanned phased-array antenna of comparable beam-scanning capability and angular resolution. In the design and operation of a reflectarray, one seeks to position and orient an array of passive patch elements in a geometric pattern such that, through constructive interference of the reflections from them, they collectively act as an efficient single reflector of radio waves within a desired frequency band. Typically, the patches lie in a common plane and radiation is incident upon them from a feed horn.

  3. A Genome-Wide Landscape of Retrocopies in Primate Genomes

    PubMed Central

    Navarro, Fábio C.P.; Galante, Pedro A.F.

    2015-01-01

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. PMID:26224704

  4. A Genome-Wide Landscape of Retrocopies in Primate Genomes.

    PubMed

    Navarro, Fábio C P; Galante, Pedro A F

    2015-08-01

    Gene duplication is a key factor contributing to phenotype diversity across and within species. Although the availability of complete genomes has led to the extensive study of genomic duplications, the dynamics and variability of gene duplications mediated by retrotransposition are not well understood. Here, we predict mRNA retrotransposition and use comparative genomics to investigate their origin and variability across primates. Analyzing seven anthropoid primate genomes, we found a similar number of mRNA retrotranspositions (∼7,500 retrocopies) in Catarrhini (Old Word Monkeys, including humans), but a surprising large number of retrocopies (∼10,000) in Platyrrhini (New World Monkeys), which may be a by-product of higher long interspersed nuclear element 1 activity in these genomes. By inferring retrocopy orthology, we dated most of the primate retrocopy origins, and estimated a decrease in the fixation rate in recent primate history, implying a smaller number of species-specific retrocopies. Moreover, using RNA-Seq data, we identified approximately 3,600 expressed retrocopies. As expected, most of these retrocopies are located near or within known genes, present tissue-specific and even species-specific expression patterns, and no expression correlation to their parental genes. Taken together, our results provide further evidence that mRNA retrotransposition is an active mechanism in primate evolution and suggest that retrocopies may not only introduce great genetic variability between lineages but also create a large reservoir of potentially functional new genomic loci in primate genomes. PMID:26224704

  5. Gene Fusion: A Genome Wide Survey

    NASA Technical Reports Server (NTRS)

    Liang, Ping; Riley, Monica

    2001-01-01

    As a well known fact, organisms form larger and complex multimodular (composite or chimeric) and mostly multi-functional proteins through gene fusion of two or more individual genes which have independent evolution histories and functions. We call each of these components a module. The existence of multimodular proteins may improves the efficiency in gene regulation and in cellular functions, and thus may give the host organism advantages in adaptation to environments. Analysis of all gene fusions in present-day organisms should allow us to examine the patterns of gene fusion in context with cellular functions, to trace back the evolution processes from the ancient smaller and uni-functional proteins to the present-day larger and complex multi-functional proteins, and to estimate the minimal number of ancestor proteins that existed in the last common ancestor for all life on earth. Although many multimodular proteins have been experimentally known, identification of gene fusion events systematically at genome scale had not been possible until recently when large number of completed genome sequences have been becoming available. In addition, technical difficulties for such analysis also exist due to the complexity of this biological and evolutionary process. We report from this study a new strategy to computationally identify multimodular proteins using completed genome sequences and the results surveyed from 22 organisms with the data from over 40 organisms to be presented during the meeting. Additional information is contained in the original extended abstract.

  6. Analysis of Reflector Antenna Systems for Wide - Scanning

    NASA Astrophysics Data System (ADS)

    Houshmand, Bijan

    1990-01-01

    A near-field Cassegrain reflector (NFCR) is an effective way to magnify a small phased array into a much larger aperture antenna for limited scan applications. Traditionally, the pattern analysis of NFCR is based on a plane wave approach. This approach simplifies the computation tremendously, but fails to provide design information about the most critical component of the whole antenna system, namely, the feed array. Here, each element in the feed array is considered individually and its diffraction pattern from the subreflector is computed by GTD. The field contributions from all elements are superimposed at the curved main reflector surface, and a physical optics integration is performed to obtain the secondary pattern. Beam-waveguide-fed Cassegrain reflector (BFCR) antennas are increasingly being used in space communication applications. Using a shooting and bouncing ray approach based on geometrical optics and aperture integration, the far-field pattern of the BFCR is calculated. This method is computationally efficient and is not restricted by the number of reflecting surfaces in the antenna configuration. The diffraction loss in the beam waveguide structure is calculated separately by the conventional near-field physical optics integration. The segmented mirror antenna is designed for the radiometer application on the planned NASA Earth Science Geostationary Platforms in the 1990s. The antenna consists of two parts: a regular parabolic dish of 5 m in diameter which converts the radiation from feeds into a collimated beam, and a movable mirror that redirects the beam to a prescribed scan direction. The mirror is composed of 28 segmented planar conducting plates, mostly one square meter in size. Based on a physical optics analysis, we have analyzed the secondary pattern of the antenna. For frequencies between 50 and 230 GHz, and for a scan range of +/- 8^circ (270 beamwidths scan at 230 GHz), the worst calculated beam efficiency is 95%. To cover such a wide

  7. Water-Immersible MEMS scanning mirror designed for wide-field fast-scanning photoacoustic microscopy

    NASA Astrophysics Data System (ADS)

    Yao, Junjie; Huang, Chih-Hsien; Martel, Catherine; Maslov, Konstantin I.; Wang, Lidai; Yang, Joon-Mo; Gao, Liang; Randolph, Gwendalyn; Zou, Jun; Wang, Lihong V.

    2013-03-01

    By offering images with high spatial resolution and unique optical absorption contrast, optical-resolution photoacoustic microscopy (OR-PAM) has gained increasing attention in biomedical research. Recent developments in OR-PAM have improved its imaging speed, but have sacrificed either the detection sensitivity or field of view or both. We have developed a wide-field fast-scanning OR-PAM by using a water-immersible MEMS scanning mirror (MEMS-ORPAM). Made of silicon with a gold coating, the MEMS mirror plate can reflect both optical and acoustic beams. Because it uses an electromagnetic driving force, the whole MEMS scanning system can be submerged in water. In MEMS-ORPAM, the optical and acoustic beams are confocally configured and simultaneously steered, which ensures uniform detection sensitivity. A B-scan imaging speed as high as 400 Hz can be achieved over a 3 mm scanning range. A diffraction-limited lateral resolution of 2.4 μm in water and a maximum imaging depth of 1.1 mm in soft tissue have been experimentally determined. Using the system, we imaged the flow dynamics of both red blood cells and carbon particles in a mouse ear in vivo. By using Evans blue dye as the contrast agent, we also imaged the flow dynamics of lymphatic vessels in a mouse tail in vivo. The results show that MEMS-OR-PAM could be a powerful tool for studying highly dynamic and time-sensitive biological phenomena.

  8. Production of high density molecular beams with wide velocity scanning

    NASA Astrophysics Data System (ADS)

    Sheffield, L. S.; Woo, S. O.; Rathnayaka, K. D. D.; Lyuksyutov, I. F.; Herschbach, D. R.

    2016-06-01

    We describe modifications of a pulsed rotating supersonic beam source that improve performance, particularly increasing the beam density and sharpening the pulse profiles. As well as providing the familiar virtues of a supersonic molecular beam (high intensity, narrowed velocity distribution, and drastic cooling of rotation and vibration), the rotating source enables scanning the translational velocity over a wide range. Thereby, beams of any atom or molecule available as a gas can be slowed or speeded. Using Xe beams in the slowing mode, we have obtained lab speeds down to about 40 ± 5 m/s with density near 1011 cm-3 and in the speeding mode lab speeds up to about 660 m/s and density near 1014 cm-3. We discuss some congenial applications. Providing low lab speeds can markedly enhance experiments using electric or magnetic fields to deflect, steer, or further slow polar or paramagnetic molecules. The capability to scan molecular speeds facilitates merging velocities with a codirectional partner beam, enabling study of collisions at very low relative kinetic energies, without requiring either beam to be slow.

  9. Ultrahigh vacuum scanning electron microscope system combined with wide-movable scanning tunneling microscope

    SciTech Connect

    Kaneko, A.; Homma, Y.; Hibino, H.; Ogino, T.

    2005-08-15

    A surface analysis system has been newly developed with combination of ultrahigh vacuum scanning electron microscope (SEM) and wide-movable scanning tunneling microscope (STM). The basic performance is experimentally demonstrated. These SEM and STM images are clear enough to obtain details of surface structures. The STM unit moves horizontally over several millimeters by sliding motion of PZT actuators. The motion resolution is proved to be submicrometers. The STM tip mounted on another PZT scanner can be guided to a specific object on the sample surface during SEM observation. In the observation of a Si(111) surface rapidly cooled from high temperature, the STM tip was accurately guided to an isolated atomic step and slightly moved along it during SEM observation. The STM observation shows an asymmetry of the (7x7)-transformed region along the step between the upper and lower terraces. (7x7) bands continuously formed along the edge of terraces, while (7x7) domains distributed on the terraces slightly far from the step. These experiments show the wide-movable STM unit resolves a gap of observation area between SEM and STM and the system enables a specific object found in the SEM image to be observed easily by STM.

  10. A Genome Scan for Positive Selection in Thoroughbred Horses

    PubMed Central

    Gu, Jingjing; Orr, Nick; Park, Stephen D.; Katz, Lisa M.; Sulimova, Galina; MacHugh, David E.; Hill, Emmeline W.

    2009-01-01

    Thoroughbred horses have been selected for exceptional racing performance resulting in system-wide structural and functional adaptations contributing to elite athletic phenotypes. Because selection has been recent and intense in a closed population that stems from a small number of founder animals Thoroughbreds represent a unique population within which to identify genomic contributions to exercise-related traits. Employing a population genetics-based hitchhiking mapping approach we performed a genome scan using 394 autosomal and X chromosome microsatellite loci and identified positively selected loci in the extreme tail-ends of the empirical distributions for (1) deviations from expected heterozygosity (Ewens-Watterson test) in Thoroughbred (n = 112) and (2) global differentiation among four geographically diverse horse populations (FST). We found positively selected genomic regions in Thoroughbred enriched for phosphoinositide-mediated signalling (3.2-fold enrichment; P<0.01), insulin receptor signalling (5.0-fold enrichment; P<0.01) and lipid transport (2.2-fold enrichment; P<0.05) genes. We found a significant overrepresentation of sarcoglycan complex (11.1-fold enrichment; P<0.05) and focal adhesion pathway (1.9-fold enrichment; P<0.01) genes highlighting the role for muscle strength and integrity in the Thoroughbred athletic phenotype. We report for the first time candidate athletic-performance genes within regions targeted by selection in Thoroughbred horses that are principally responsible for fatty acid oxidation, increased insulin sensitivity and muscle strength: ACSS1 (acyl-CoA synthetase short-chain family member 1), ACTA1 (actin, alpha 1, skeletal muscle), ACTN2 (actinin, alpha 2), ADHFE1 (alcohol dehydrogenase, iron containing, 1), MTFR1 (mitochondrial fission regulator 1), PDK4 (pyruvate dehydrogenase kinase, isozyme 4) and TNC (tenascin C). Understanding the genetic basis for exercise adaptation will be crucial for the identification of genes

  11. Power and sample size estimation for epigenome-wide association scans to detect differential DNA methylation

    PubMed Central

    Tsai, Pei-Chien; Bell, Jordana T

    2015-01-01

    Background: Epigenome-wide association scans (EWAS) are under way for many complex human traits, but EWAS power has not been fully assessed. We investigate power of EWAS to detect differential methylation using case-control and disease-discordant monozygotic (MZ) twin designs with genome-wide DNA methylation arrays. Methods and Results: We performed simulations to estimate power under the case-control and discordant MZ twin EWAS study designs, under a range of epigenetic risk effect sizes and conditions. For example, to detect a 10% mean methylation difference between affected and unaffected subjects at a genome-wide significance threshold of P = 1 × 10−6, 98 MZ twin pairs were required to reach 80% EWAS power, and 112 cases and 112 controls pairs were needed in the case-control design. We also estimated the minimum sample size required to reach 80% EWAS power under both study designs. Our analyses highlighted several factors that significantly influenced EWAS power, including sample size, epigenetic risk effect size, the variance of DNA methylation at the locus of interest and the correlation in DNA methylation patterns within the twin sample. Conclusions: We provide power estimates for array-based DNA methylation EWAS under case-control and disease-discordant MZ twin designs, and explore multiple factors that impact on EWAS power. Our results can help guide EWAS experimental design and interpretation for future epigenetic studies. PMID:25972603

  12. Genome-wide Association Study of Chicken Plumage Pigmentation.

    PubMed

    Park, Mi Na; Choi, Jin Ae; Lee, Kyung-Tai; Lee, Hyun-Jeong; Choi, Bong-Hwan; Kim, Heebal; Kim, Tae-Hun; Cho, Seoae; Lee, Taeheon

    2013-11-01

    To increase plumage color uniformity and understand the genetic background of Korean chickens, we performed a genome-wide association study of different plumage color in Korean native chickens. We analyzed 60K SNP chips on 279 chickens with GEMMA methods for GWAS and estimated the genetic heritability for plumage color. The estimated heritability suggests that plumage coloration is a polygenic trait. We found new loci associated with feather pigmentation at the genome-wide level and from the results infer that there are additional genetic effect for plumage color. The results will be used for selecting and breeding chicken for plumage color uniformity. PMID:25049737

  13. Genome-Wide Significant Loci: How Important Are They?

    PubMed Central

    Björkegren, Johan L.M.; Kovacic, Jason C.; Dudley, Joel T.; Schadt, Eric E.

    2015-01-01

    Genome-wide association studies (GWAS) have been extensively used to study common complex diseases such as coronary artery disease (CAD), revealing 153 suggestive CAD loci, of which at least 46 have been validated as having genome-wide significance. However, these loci collectively explain <10% of the genetic variance in CAD. Thus, we must address the key question of what factors constitute the remaining 90% of CAD heritability. We review possible limitations of GWAS, and contextually consider some candidate CAD loci identified by this method. Looking ahead, we propose systems genetics as a complementary approach to unlocking the CAD heritability and etiology. Systems genetics builds network models of relevant molecular processes by combining genetic and genomic datasets to ultimately identify key “drivers” of disease. By leveraging systems-based genetic approaches, we can help reveal the full genetic basis of common complex disorders, enabling novel diagnostic and therapeutic opportunities. PMID:25720628

  14. Genome-wide footprinting: ready for prime time?

    PubMed

    Sung, Myong-Hee; Baek, Songjoon; Hager, Gordon L

    2016-03-01

    High-throughput sequencing technologies have allowed many gene locus-level molecular biology assays to become genome-wide profiling methods. DNA-cleaving enzymes such as DNase I have been used to probe accessible chromatin. The accessible regions contain functional regulatory sites, including promoters, insulators and enhancers. Deep sequencing of DNase-seq libraries and computational analysis of the cut profiles have been used to infer protein occupancy in the genome at the nucleotide level, a method introduced as 'digital genomic footprinting'. The approach has been proposed as an attractive alternative to the analysis of transcription factors (TFs) by chromatin immunoprecipitation followed by sequencing (ChIP-seq), and in theory it should overcome antibody issues, poor resolution and batch effects. Recent reports point to limitations of the DNase-based genomic footprinting approach and call into question the scope of detectable protein occupancy, especially for TFs with short-lived chromatin binding. The genomics community is grappling with issues concerning the utility of genomic footprinting and is reassessing the proposed approaches in terms of robust deliverables. Here we summarize the consensus as well as different views emerging from recent reports, and we describe the remaining issues and hurdles for genomic footprinting. PMID:26914206

  15. Genome-Wide Approaches to Drosophila Heart Development

    PubMed Central

    Frasch, Manfred

    2016-01-01

    The development of the dorsal vessel in Drosophila is one of the first systems in which key mechanisms regulating cardiogenesis have been defined in great detail at the genetic and molecular level. Due to evolutionary conservation, these findings have also provided major inputs into studies of cardiogenesis in vertebrates. Many of the major components that control Drosophila cardiogenesis were discovered based on candidate gene approaches and their functions were defined by employing the outstanding genetic tools and molecular techniques available in this system. More recently, approaches have been taken that aim to interrogate the entire genome in order to identify novel components and describe genomic features that are pertinent to the regulation of heart development. Apart from classical forward genetic screens, the availability of the thoroughly annotated Drosophila genome sequence made new genome-wide approaches possible, which include the generation of massive numbers of RNA interference (RNAi) reagents that were used in forward genetic screens, as well as studies of the transcriptomes and proteomes of the developing heart under normal and experimentally manipulated conditions. Moreover, genome-wide chromatin immunoprecipitation experiments have been performed with the aim to define the full set of genomic binding sites of the major cardiogenic transcription factors, their relevant target genes, and a more complete picture of the regulatory network that drives cardiogenesis. This review will give an overview on these genome-wide approaches to Drosophila heart development and on computational analyses of the obtained information that ultimately aim to provide a description of this process at the systems level. PMID:27294102

  16. A super powerful method for genome wide association study

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Genome-Wide Association Studies shed light on the identification of genes underlying human diseases and agriculturally important traits. This potential has been shadowed by false positive findings. The Mixed Linear Model (MLM) method is flexible enough to simultaneously incorporate population struct...

  17. Design and bioinformatics analysis of genome-wide CLIP experiments

    PubMed Central

    Wang, Tao; Xiao, Guanghua; Chu, Yongjun; Zhang, Michael Q.; Corey, David R.; Xie, Yang

    2015-01-01

    The past decades have witnessed a surge of discoveries revealing RNA regulation as a central player in cellular processes. RNAs are regulated by RNA-binding proteins (RBPs) at all post-transcriptional stages, including splicing, transportation, stabilization and translation. Defects in the functions of these RBPs underlie a broad spectrum of human pathologies. Systematic identification of RBP functional targets is among the key biomedical research questions and provides a new direction for drug discovery. The advent of cross-linking immunoprecipitation coupled with high-throughput sequencing (genome-wide CLIP) technology has recently enabled the investigation of genome-wide RBP–RNA binding at single base-pair resolution. This technology has evolved through the development of three distinct versions: HITS-CLIP, PAR-CLIP and iCLIP. Meanwhile, numerous bioinformatics pipelines for handling the genome-wide CLIP data have also been developed. In this review, we discuss the genome-wide CLIP technology and focus on bioinformatics analysis. Specifically, we compare the strengths and weaknesses, as well as the scopes, of various bioinformatics tools. To assist readers in choosing optimal procedures for their analysis, we also review experimental design and procedures that affect bioinformatics analyses. PMID:25958398

  18. Genome-wide association study identifies five new schizophrenia loci

    PubMed Central

    2012-01-01

    We examined the role of common genetic variation in schizophrenia in a genome-wide association study of substantial size: a stage 1 discovery sample of 21,856 individuals of European ancestry and a stage 2 replication sample of 29,839 independent subjects. The combined stage 1 and 2 analysis yielded genome-wide significant associations with schizophrenia for seven loci, five of which are new (1p21.3, 2q32.3, 8p23.2, 8q21.3 and 10q24.32-q24.33) and two of which have been previously implicated (6p21.32-p22.1 and 18q21.2). The strongest new finding (P = 1.6 × 10−11) was with rs1625579 within an intron of a putative primary transcript for MIR137 (microRNA 137), a known regulator of neuronal development. Four other schizophrenia loci achieving genome-wide significance contain predicted targets of MIR137, suggesting MIR137-mediated dysregulation as a previously unknown etiologic mechanism in schizophrenia. In a joint analysis with a bipolar disorder sample (16,374 affected individuals and 14,044 controls), three loci reached genome-wide significance: CACNA1C (rs4765905, P = 7.0 × 10−9), ANK3 (rs10994359, P = 2.5 × 10−8) and the ITIH3-ITIH4 region (rs2239547, P = 7.8 × 10−9). PMID:21926974

  19. Genome-wide characterization of maize miRNA genes

    Technology Transfer Automated Retrieval System (TEKTRAN)

    MicroRNAs (miRNAs) are small non-coding RNAs that play essential roles in plant growth and development. We conducted a genome-wide survey of maize miRNA genes, characterizing their structure, expression, and evolution. Computational approaches based on homology and secondary structure modeling ident...

  20. Genome-wide polymorphisms show unexpected targets of natural selection.

    PubMed

    Pespeni, Melissa H; Garfield, David A; Manier, Mollie K; Palumbi, Stephen R

    2012-04-01

    Natural selection can act on all the expressed genes of an individual, leaving signatures of genetic differentiation or diversity at many loci across the genome. New power to assay these genome-wide effects of selection comes from associating multi-locus patterns of polymorphism with gene expression and function. Here, we performed one of the first genome-wide surveys in a marine species, comparing purple sea urchins, Strongylocentrotus purpuratus, from two distant locations along the species' wide latitudinal range. We examined 9112 polymorphic loci from upstream non-coding and coding regions of genes for signatures of selection with respect to gene function and tissue- and ontogenetic gene expression. We found that genetic differentiation (F(ST)) varied significantly across functional gene classes. The strongest enrichment occurred in the upstream regions of E3 ligase genes, enzymes known to regulate protein abundance during development and environmental stress. We found enrichment for high heterozygosity in genes directly involved in immune response, particularly NALP genes, which mediate pro-inflammatory signals during bacterial infection. We also found higher heterozygosity in immune genes in the southern population, where disease incidence and pathogen diversity are greater. Similar to the major histocompatibility complex in mammals, balancing selection may enhance genetic diversity in the innate immune system genes of this invertebrate. Overall, our results show that how genome-wide polymorphism data coupled with growing databases on gene function and expression can combine to detect otherwise hidden signals of selection in natural populations. PMID:21993504

  1. Genome-wide association mapping of soybean aphid resistance traits

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Soybean aphid is the most damaging insect pest of soybean in the Upper Midwest and is primarily controlled by insecticides. Soybean aphid resistance (i.e., Rag genes) has been documented in some soybean lines at chromosomes 6, 7, 13, and 16, but more sources of resistance are needed. Genome-wide ass...

  2. AUTOGSCAN: powerful tools for automated genome-wide linkage and linkage disequilibrium analysis.

    PubMed

    Hiekkalinna, Tero; Terwilliger, Joseph D; Sammalisto, Sampo; Peltonen, Leena; Perola, Markus

    2005-02-01

    Genome-wide linkage analysis using multiple traits and statistical software packages is a tedious process which requires a significant amount of manual file manipulation. Different linkage analysis programs require different input file formats, making the task of analyzing data with multiple methods even more time-consuming. We have developed a software tool, AUTOGSCAN, that automates file formatting, the running of statistical analyses, and the summarizing of resulting statistics for whole genome scans with a push of a button, using several independent, and often idiosyncratic, statistical software packages such as MERLIN, SOLAR and GENEHUNTER. We also describe a program, ANALYZE, designed to run qualitative linkage analysis with several different statistical strategies and programs to efficiently screen for linkage and linkage disequilibrium for a given discrete trait. The ANALYZE program can also be used by AUTOGSCAN in a genome-wide sense. PMID:15836805

  3. Genome-wide identification of hypoxia-induced enhancer regions

    PubMed Central

    Preston, Jessica L.; Randel, Melissa A.; Johnson, Eric A.

    2015-01-01

    Here we present a genome-wide method for de novo identification of enhancer regions. This approach enables massively parallel empirical investigation of DNA sequences that mediate transcriptional activation and provides a platform for discovery of regulatory modules capable of driving context-specific gene expression. The method links fragmented genomic DNA to the transcription of randomer molecule identifiers and measures the functional enhancer activity of the library by massively parallel sequencing. We transfected a Drosophila melanogaster library into S2 cells in normoxia and hypoxia, and assayed 4,599,881 genomic DNA fragments in parallel. The locations of the enhancer regions strongly correlate with genes up-regulated after hypoxia and previously described enhancers. Novel enhancer regions were identified and integrated with RNAseq data and transcription factor motifs to describe the hypoxic response on a genome-wide basis as a complex regulatory network involving multiple stress-response pathways. This work provides a novel method for high-throughput assay of enhancer activity and the genome-scale identification of 31 hypoxia-activated enhancers in Drosophila. PMID:26713262

  4. Genome-Wide Detection and Analysis of Multifunctional Genes

    PubMed Central

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-01-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms—H. sapiens, D. melanogaster, and S. cerevisiae—and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  5. Genome-Wide Detection and Analysis of Multifunctional Genes.

    PubMed

    Pritykin, Yuri; Ghersi, Dario; Singh, Mona

    2015-10-01

    Many genes can play a role in multiple biological processes or molecular functions. Identifying multifunctional genes at the genome-wide level and studying their properties can shed light upon the complexity of molecular events that underpin cellular functioning, thereby leading to a better understanding of the functional landscape of the cell. However, to date, genome-wide analysis of multifunctional genes (and the proteins they encode) has been limited. Here we introduce a computational approach that uses known functional annotations to extract genes playing a role in at least two distinct biological processes. We leverage functional genomics data sets for three organisms--H. sapiens, D. melanogaster, and S. cerevisiae--and show that, as compared to other annotated genes, genes involved in multiple biological processes possess distinct physicochemical properties, are more broadly expressed, tend to be more central in protein interaction networks, tend to be more evolutionarily conserved, and are more likely to be essential. We also find that multifunctional genes are significantly more likely to be involved in human disorders. These same features also hold when multifunctionality is defined with respect to molecular functions instead of biological processes. Our analysis uncovers key features about multifunctional genes, and is a step towards a better genome-wide understanding of gene multifunctionality. PMID:26436655

  6. Genome-wide profiling of alternative splicing in Alzheimer's disease

    PubMed Central

    Lai, Mitchell K.P.; Esiri, Margaret M.; Tan, Michelle G.K.

    2014-01-01

    Alternative splicing is a highly regulated process which generates transcriptome and proteome diversity through the skipping or inclusion of exons within gene loci. Identification of aberrant alternative splicing associated with human diseases has become feasible with the development of new genomic technologies and powerful bioinformatics. We have previously reported genome-wide gene alterations in the neocortex of a well-characterized cohort of Alzheimer's disease (AD) patients and matched elderly controls using a commercial exon microarray platform [1]. Here, we provide detailed description of analyses aimed at identifying differential alternative splicing events associated with AD. PMID:26484111

  7. Brewing yeast genomes and genome-wide expression and proteome profiling during fermentation.

    PubMed

    Smart, Katherine A

    2007-11-01

    The genome structure, ancestry and instability of the brewing yeast strains have received considerable attention. The hybrid nature of brewing lager yeast strains provides adaptive potential but yields genome instability which can adversely affect fermentation performance. The requirement to differentiate between production strains and assess master cultures for genomic instability has led to significant adoption of specialized molecular tool kits by the industry. Furthermore, the development of genome-wide transcriptional and protein expression technologies has generated significant interest from brewers. The opportunity presented to explore, and the concurrent requirement to understand both, the constraints and potential of their strains to generate existing and new products during fermentation is discussed. PMID:17879324

  8. Metabolite-based genome-wide association studies in plants.

    PubMed

    Luo, Jie

    2015-04-01

    The plant metabolome is the readout of plant physiological status and is regarded as the bridge between the genome and the phenome of plants. Unraveling the natural variation and the underlying genetic basis of plant metabolism has received increasing interest from plant biologists. Enabled by the recent advances in high-throughput profiling and genotyping technologies, metabolite-based genome-wide association study (mGWAS) has emerged as a powerful alternative forward genetics strategy to dissect the genetic and biochemical bases of metabolism in model and crop plants. In this review, recent progress and applications of mGWAS in understanding the genetic control of plant metabolism and in interactive functional genomics and metabolomics are presented. Further directions and perspectives of mGWAS in plants are also discussed. PMID:25637954

  9. Genome-Wide Mapping of DNA Strand Breaks

    PubMed Central

    Leduc, Frédéric; Faucher, David; Bikond Nkoma, Geneviève; Grégoire, Marie-Chantal; Arguin, Mélina; Wellinger, Raymund J.; Boissonneault, Guylain

    2011-01-01

    Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed “damaged DNA immunoprecipitation” (dDIP), uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL) to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage. PMID:21364894

  10. Quantitative prediction of genome-wide resource allocation in bacteria.

    PubMed

    Goelzer, Anne; Muntel, Jan; Chubukov, Victor; Jules, Matthieu; Prestel, Eric; Nölker, Rolf; Mariadassou, Mahendra; Aymerich, Stéphane; Hecker, Michael; Noirot, Philippe; Becher, Dörte; Fromion, Vincent

    2015-11-01

    Predicting resource allocation between cell processes is the primary step towards decoding the evolutionary constraints governing bacterial growth under various conditions. Quantitative prediction at genome-scale remains a computational challenge as current methods are limited by the tractability of the problem or by simplifying hypotheses. Here, we show that the constraint-based modeling method Resource Balance Analysis (RBA), calibrated using genome-wide absolute protein quantification data, accurately predicts resource allocation in the model bacterium Bacillus subtilis for a wide range of growth conditions. The regulation of most cellular processes is consistent with the objective of growth rate maximization except for a few suboptimal processes which likely integrate more complex objectives such as coping with stressful conditions and survival. As a proof of principle by using simulations, we illustrated how calibrated RBA could aid rational design of strains for maximizing protein production, offering new opportunities to investigate design principles in prokaryotes and to exploit them for biotechnological applications. PMID:26498510

  11. Unsupervised genome-wide recognition of local relationship patterns

    PubMed Central

    2013-01-01

    Background Phenomena such as incomplete lineage sorting, horizontal gene transfer, gene duplication and subsequent sub- and neo-functionalisation can result in distinct local phylogenetic relationships that are discordant with species phylogeny. In order to assess the possible biological roles for these subdivisions, they must first be identified and characterised, preferably on a large scale and in an automated fashion. Results We developed Saguaro, a combination of a Hidden Markov Model (HMM) and a Self Organising Map (SOM), to characterise local phylogenetic relationships among aligned sequences using cacti, matrices of pair-wise distance measures. While the HMM determines the genomic boundaries from aligned sequences, the SOM hypothesises new cacti in an unsupervised and iterative fashion based on the regions that were modelled least well by existing cacti. After testing the software on simulated data, we demonstrate the utility of Saguaro by testing two different data sets: (i) 181 Dengue virus strains, and (ii) 5 primate genomes. Saguaro identifies regions under lineage-specific constraint for the first set, and genomic segments that we attribute to incomplete lineage sorting in the second dataset. Intriguingly for the primate data, Saguaro also classified an additional ~3% of the genome as most incompatible with the expected species phylogeny. A substantial fraction of these regions was found to overlap genes associated with both the innate and adaptive immune systems. Conclusions Saguaro detects distinct cacti describing local phylogenetic relationships without requiring any a priori hypotheses. We have successfully demonstrated Saguaro’s utility with two contrasting data sets, one containing many members with short sequences (Dengue viral strains: n = 181, genome size = 10,700 nt), and the other with few members but complex genomes (related primate species: n = 5, genome size = 3 Gb), suggesting that the software is applicable to a wide variety of

  12. Accurate Computation of Survival Statistics in Genome-Wide Studies

    PubMed Central

    Vandin, Fabio; Papoutsaki, Alexandra; Raphael, Benjamin J.; Upfal, Eli

    2015-01-01

    A key challenge in genomics is to identify genetic variants that distinguish patients with different survival time following diagnosis or treatment. While the log-rank test is widely used for this purpose, nearly all implementations of the log-rank test rely on an asymptotic approximation that is not appropriate in many genomics applications. This is because: the two populations determined by a genetic variant may have very different sizes; and the evaluation of many possible variants demands highly accurate computation of very small p-values. We demonstrate this problem for cancer genomics data where the standard log-rank test leads to many false positive associations between somatic mutations and survival time. We develop and analyze a novel algorithm, Exact Log-rank Test (ExaLT), that accurately computes the p-value of the log-rank statistic under an exact distribution that is appropriate for any size populations. We demonstrate the advantages of ExaLT on data from published cancer genomics studies, finding significant differences from the reported p-values. We analyze somatic mutations in six cancer types from The Cancer Genome Atlas (TCGA), finding mutations with known association to survival as well as several novel associations. In contrast, standard implementations of the log-rank test report dozens-hundreds of likely false positive associations as more significant than these known associations. PMID:25950620

  13. Assessing Predictive Properties of Genome-Wide Selection in Soybeans

    PubMed Central

    Xavier, Alencar; Muir, William M.; Rainey, Katy Martin

    2016-01-01

    Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set. PMID:27317786

  14. Optical mapping discerns genome wide DNA methylation profiles

    PubMed Central

    Ananiev, Gene E; Goldstein, Steve; Runnheim, Rod; Forrest, Dan K; Zhou, Shiguo; Potamousis, Konstantinos; Churas, Chris P; Bergendahl, Veit; Thomson, James A; Schwartz, David C

    2008-01-01

    Background Methylation of CpG dinucleotides is a fundamental mechanism of epigenetic regulation in eukaryotic genomes. Development of methods for rapid genome wide methylation profiling will greatly facilitate both hypothesis and discovery driven research in the field of epigenetics. In this regard, a single molecule approach to methylation profiling offers several unique advantages that include elimination of chemical DNA modification steps and PCR amplification. Results A single molecule approach is presented for the discernment of methylation profiles, based on optical mapping. We report results from a series of pilot studies demonstrating the capabilities of optical mapping as a platform for methylation profiling of whole genomes. Optical mapping was used to discern the methylation profile from both an engineered and wild type Escherichia coli. Furthermore, the methylation status of selected loci within the genome of human embryonic stem cells was profiled using optical mapping. Conclusion The optical mapping platform effectively detects DNA methylation patterns. Due to single molecule detection, optical mapping offers significant advantages over other technologies. This advantage stems from obviation of DNA modification steps, such as bisulfite treatment, and the ability of the platform to assay repeat dense regions within mammalian genomes inaccessible to techniques using array-hybridization technologies. PMID:18667073

  15. Comparative analysis of methods for genome-wide nucleosome cartography.

    PubMed

    Quintales, Luis; Vázquez, Enrique; Antequera, Francisco

    2015-07-01

    Nucleosomes contribute to compacting the genome into the nucleus and regulate the physical access of regulatory proteins to DNA either directly or through the epigenetic modifications of the histone tails. Precise mapping of nucleosome positioning across the genome is, therefore, essential to understanding the genome regulation. In recent years, several experimental protocols have been developed for this purpose that include the enzymatic digestion, chemical cleavage or immunoprecipitation of chromatin followed by next-generation sequencing of the resulting DNA fragments. Here, we compare the performance and resolution of these methods from the initial biochemical steps through the alignment of the millions of short-sequence reads to a reference genome to the final computational analysis to generate genome-wide maps of nucleosome occupancy. Because of the lack of a unified protocol to process data sets obtained through the different approaches, we have developed a new computational tool (NUCwave), which facilitates their analysis, comparison and assessment and will enable researchers to choose the most suitable method for any particular purpose. NUCwave is freely available at http://nucleosome.usal.es/nucwave along with a step-by-step protocol for its use. PMID:25296770

  16. Assessing Predictive Properties of Genome-Wide Selection in Soybeans.

    PubMed

    Xavier, Alencar; Muir, William M; Rainey, Katy Martin

    2016-01-01

    Many economically important traits in plant breeding have low heritability or are difficult to measure. For these traits, genomic selection has attractive features and may boost genetic gains. Our goal was to evaluate alternative scenarios to implement genomic selection for yield components in soybean (Glycine max L. merr). We used a nested association panel with cross validation to evaluate the impacts of training population size, genotyping density, and prediction model on the accuracy of genomic prediction. Our results indicate that training population size was the factor most relevant to improvement in genome-wide prediction, with greatest improvement observed in training sets up to 2000 individuals. We discuss assumptions that influence the choice of the prediction model. Although alternative models had minor impacts on prediction accuracy, the most robust prediction model was the combination of reproducing kernel Hilbert space regression and BayesB. Higher genotyping density marginally improved accuracy. Our study finds that breeding programs seeking efficient genomic selection in soybeans would best allocate resources by investing in a representative training set. PMID:27317786

  17. Quality Control Procedures for Genome Wide Association Studies

    PubMed Central

    Turner, Stephen; Armstrong, Loren L.; Bradford, Yuki; Carlson, Christopher S.; Crawford, Dana C.; Crenshaw, Andrew T.; de Andrade, Mariza; Doheny, Kimberly F.; Haines, Jonathan L.; Hayes, Geoffrey; Jarvik, Gail; Jiang, Lan; Kullo, Iftikhar J.; Li, Rongling; Ling, Hua; Manolio, Teri A.; Matsumoto, Martha; McCarty, Catherine A.; McDavid, Andrew N.; Mirel, Daniel B.; Paschall, Justin E.; Pugh, Elizabeth W.; Rasmussen, Luke V.; Wilke, Russell A.; Zuvich, Rebecca L.; Ritchie, Marylyn D.

    2011-01-01

    Genome-wide association studies (GWAS) are being conducted at an unprecedented rate in population-based cohorts and have increased our understanding of the pathophysiology of complex disease. The recent application of GWAS to clinic-based cohorts has also yielded genetic predictors of clinical outcomes. Regardless of context, the practical utility of this information will ultimately depend upon the quality of the original data. Quality control (QC) procedures for GWAS are computationally intensive, operationally challenging, and constantly evolving. With each new dataset, new realities are discovered about GWAS data and best practices continue to be developed. The Genomics Workgroup of the National Human Genome Research Institute (NHGRI) funded electronic Medical Records and Genomics (eMERGE) network has invested considerable effort in developing strategies for QC of these data. The lessons learned by this group will be valuable for other investigators dealing with large scale genomic datasets. Here we enumerate some of the challenges in QC of GWAS data and describe the approaches that the eMERGE network is using for quality assurance in GWAS data, thereby minimizing potential bias and error in GWAS results. In this protocol we discuss common issues associated with QC of GWAS data, including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We propose best practices and discuss areas of ongoing and future research. PMID:21234875

  18. Automated quality control for genome wide association studies

    PubMed Central

    Ellingson, Sally R.; Fardo, David W.

    2016-01-01

    This paper provides details on the necessary steps to assess and control data in genome wide association studies (GWAS) using genotype information on a large number of genetic markers for large number of individuals. Due to varied study designs and genotyping platforms between multiple sites/projects as well as potential genotyping errors, it is important to ensure high quality data. Scripts and directions are provided to facilitate others in this process.

  19. Genome-wide association study of schizophrenia in Ashkenazi Jews.

    PubMed

    Goes, Fernando S; McGrath, John; Avramopoulos, Dimitrios; Wolyniec, Paula; Pirooznia, Mehdi; Ruczinski, Ingo; Nestadt, Gerald; Kenny, Eimear E; Vacic, Vladimir; Peters, Inga; Lencz, Todd; Darvasi, Ariel; Mulle, Jennifer G; Warren, Stephen T; Pulver, Ann E

    2015-12-01

    Schizophrenia is a common, clinically heterogeneous disorder associated with lifelong morbidity and early mortality. Several genetic variants associated with schizophrenia have been identified, but the majority of the heritability remains unknown. In this study, we report on a case-control sample of Ashkenazi Jews (AJ), a founder population that may provide additional insights into genetic etiology of schizophrenia. We performed a genome-wide association analysis (GWAS) of 592 cases and 505 controls of AJ ancestry ascertained in the US. Subsequently, we performed a meta-analysis with an Israeli AJ sample of 913 cases and 1640 controls, followed by a meta-analysis and polygenic risk scoring using summary results from Psychiatric GWAS Consortium 2 schizophrenia study. The U.S. AJ sample showed strong evidence of polygenic inheritance (pseudo-R(2) ∼9.7%) and a SNP-heritability estimate of 0.39 (P = 0.00046). We found no genome-wide significant associations in the U.S. sample or in the combined US/Israeli AJ meta-analysis of 1505 cases and 2145 controls. The strongest AJ specific associations (P-values in 10(-6) -10(-7) range) were in the 22q 11.2 deletion region and included the genes TBX1, GLN1, and COMT. Supportive evidence (meta P < 1 × 10(-4) ) was also found for several previously identified genome-wide significant findings, including the HLA region, CNTN4, IMMP2L, and GRIN2A. The meta-analysis of the U.S. sample with the PGC2 results provided initial genome-wide significant evidence for six new loci. Among the novel potential susceptibility genes is PEPD, a gene involved in proline metabolism, which is associated with a Mendelian disorder characterized by developmental delay and cognitive deficits. PMID:26198764

  20. Genome-wide association study of periodontal pathogen colonization.

    PubMed

    Divaris, K; Monda, K L; North, K E; Olshan, A F; Lange, E M; Moss, K; Barros, S P; Beck, J D; Offenbacher, S

    2012-07-01

    Pathological shifts of the human microbiome are characteristic of many diseases, including chronic periodontitis. To date, there is limited evidence on host genetic risk loci associated with periodontal pathogen colonization. We conducted a genome-wide association (GWA) study among 1,020 white participants of the Atherosclerosis Risk in Communities Study, whose periodontal diagnosis ranged from healthy to severe chronic periodontitis, and for whom "checkerboard" DNA-DNA hybridization quantification of 8 periodontal pathogens was performed. We examined 3 traits: "high red" and "high orange" bacterial complexes, and "high" Aggregatibacter actinomycetemcomitans (Aa) colonization. Genotyping was performed on the Affymetrix 6.0 platform. Imputation to 2.5 million markers was based on HapMap II-CEU, and a multiple-test correction was applied (genome-wide threshold of p < 5 × 10(-8)). We detected no genome-wide significant signals. However, 13 loci, including KCNK1, FBXO38, UHRF2, IL33, RUNX2, TRPS1, CAMTA1, and VAMP3, provided suggestive evidence (p < 5 × 10(-6)) of association. All associations reported for "red" and "orange" complex microbiota, but not for Aa, had the same effect direction in a second sample of 123 African-American participants. None of these polymorphisms was associated with periodontitis diagnosis. Investigations replicating these findings may lead to an improved understanding of the complex nature of host-microbiome interactions that characterizes states of health and disease. PMID:22699663

  1. A Genome-Wide Association Study of Depressive Symptoms

    PubMed Central

    Cornelis, Marilyn C.; Amin, Najaf; Bakshis, Erin; Baumert, Jens; Ding, Jingzhong; Liu, Yongmei; Marciante, Kristin; Meirelles, Osorio; Nalls, Michael A.; Sun, Yan V.; Vogelzangs, Nicole; Yu, Lei; Bandinelli, Stefania; Benjamin, Emelia J.; Bennett, David A.; Boomsma, Dorret; Cannas, Alessandra; Coker, Laura H.; de Geus, Eco; De Jager, Philip L.; Diez-Roux, Ana V.; Purcell, Shaun; Hu, Frank B.; Rimma, Eric B.; Hunter, David J.; Jensen, Majken K.; Curhan, Gary; Rice, Kenneth; Penman, Alan D.; Rotter, Jerome I.; Sotoodehnia, Nona; Emeny, Rebecca; Eriksson, Johan G.; Evans, Denis A.; Ferrucci, Luigi; Fornage, Myriam; Gudnason, Vilmundur; Hofman, Albert; Illig, Thomas; Kardia, Sharon; Kelly-Hayes, Margaret; Koenen, Karestan; Kraft, Peter; Kuningas, Maris; Massaro, Joseph M.; Melzer, David; Mulas, Antonella; Mulder, Cornelis L.; Murray, Anna; Oostra, Ben A.; Palotie, Aarno; Penninx, Brenda; Petersmann, Astrid; Pilling, Luke C.; Psaty, Bruce; Rawal, Rajesh; Reiman, Eric M.; Schulz, Andrea; Shulman, Joshua M.; Singleton, Andrew B.; Smith, Albert V.; Sutin, Angelina R.; Uitterlinden, André G.; Völzke, Henry; Widen, Elisabeth; Yaffe, Kristine; Zonderman, Alan B.; Cucca, Francesco; Harris, Tamara; Ladwig, Karl-Heinz; Llewellyn, David J.; Räikkönen, Katri; Tanaka, Toshiko

    2013-01-01

    Background Depression is a heritable trait that exists on a continuum of varying severity and duration. Yet, the search for genetic variants associated with depression has had few successes. We exploit the entire continuum of depression to find common variants for depressive symptoms. Methods In this genome-wide association study, we combined the results of 17 population-based studies assessing depressive symptoms with the Center for Epidemiological Studies Depression Scale. Replication of the independent top hits (p < 1 × 10−5) was performed in five studies assessing depressive symptoms with other instruments. In addition, we performed a combined meta-analysis of all 22 discovery and replication studies. Results The discovery sample comprised 34,549 individuals (mean age of 66.5) and no loci reached genome-wide significance (lowest p = 1.05 × 10−7). Seven independent single nucleotide polymorphisms were considered for replication. In the replication set (n = 16,709), we found suggestive association of one single nucleotide polymorphism with depressive symptoms (rs161645, 5q21, p = 9.19 × 10−3). This 5q21 region reached genome-wide significance (p = 4.78 × 10−8) in the overall meta-analysis combining discovery and replication studies (n = 51,258). Conclusions The results suggest that only a large sample comprising more than 50,000 subjects may be sufficiently powered to detect genes for depressive symptoms. PMID:23290196

  2. Genome-wide association discoveries of alcohol dependence

    PubMed Central

    Zuo, Lingjun; Lu, Lingeng; Tan, Yunlong; Pan, Xinghua; Cai, Yiqiang; Wang, Xiaoping; Hong, Jiang; Zhong, Chunlong; Wang, Fei; Zhang, Xiang-yang; Vanderlinden, Lauren A.; Tabakoff, Boris; Luo, Xingguang

    2014-01-01

    Objective To report the genome-wide significant and/or replicable risk variants for alcohol dependence and explore their potential biological functions. Methods We searched in PubMed for all genome-wide association studies (GWASs) of alcohol dependence. The following three types of the results were extracted: (1) genome-wide significant associations in an individual sample, the combined samples, or the meta-analysis (p<5×10−8); (2) top-ranked associations in an individual sample (p<10−5) that were nominally replicated in other samples (p<0.05); and (3) nominally replicable associations across at least three independent GWAS samples (p<0.05). These results were meta-analyzed. cis-eQTLs in human, RNA expression in rat and mouse brain and bioinformatics properties of all of these risk variants were analyzed. Results The variants located within ADH cluster were significantly associated with alcohol dependence at genome-wide level (p<5×10−8) in at least one sample. Some associations with the ADH cluster were replicable across six independent GWAS samples. The variants located within or near SERINC2, KIAA0040, MREG-PECR or PKNOX2 were significantly associated with alcohol dependence at genome-wide level (p<5×10−8) in meta-analysis or combined samples, and these associations were replicable across at least one sample. The associations with the variants within NRD1, GPD1L-CMTM8 or MAP3K9-PCNX were suggestive (5×10−8

  3. Genome-wide metabolic (re-) annotation of Kluyveromyces lactis

    PubMed Central

    2012-01-01

    Background Even before having its genome sequence published in 2004, Kluyveromyces lactis had long been considered a model organism for studies in genetics and physiology. Research on Kluyveromyces lactis is quite advanced and this yeast species is one of the few with which it is possible to perform formal genetic analysis. Nevertheless, until now, no complete metabolic functional annotation has been performed to the proteins encoded in the Kluyveromyces lactis genome. Results In this work, a new metabolic genome-wide functional re-annotation of the proteins encoded in the Kluyveromyces lactis genome was performed, resulting in the annotation of 1759 genes with metabolic functions, and the development of a methodology supported by merlin (software developed in-house). The new annotation includes novelties, such as the assignment of transporter superfamily numbers to genes identified as transporter proteins. Thus, the genes annotated with metabolic functions could be exclusively enzymatic (1410 genes), transporter proteins encoding genes (301 genes) or have both metabolic activities (48 genes). The new annotation produced by this work largely surpassed the Kluyveromyces lactis currently available annotations. A comparison with KEGG’s annotation revealed a match with 844 (~90%) of the genes annotated by KEGG, while adding 850 new gene annotations. Moreover, there are 32 genes with annotations different from KEGG. Conclusions The methodology developed throughout this work can be used to re-annotate any yeast or, with a little tweak of the reference organism, the proteins encoded in any sequenced genome. The new annotation provided by this study offers basic knowledge which might be useful for the scientific community working on this model yeast, because new functions have been identified for the so-called metabolic genes. Furthermore, it served as the basis for the reconstruction of a compartmentalized, genome-scale metabolic model of Kluyveromyces lactis, which is

  4. Overview of Genomic Insights into Chicken Growth Traits Based on Genome-Wide Association Study and microRNA Regulation

    PubMed Central

    Xu, Zhenqiang; Nie, Qinghua; Zhang, Xiquan

    2013-01-01

    Over the two past decades, a significant number of studies have observed animal growth traits to examine animal genetic mechanisms due to their ease of measurement and high heritability. Chicken which has a significant impact on fundamental biology is a major source of protein worldwide, making it an ideal model for examining animal growth trait development. The genetic mechanisms of chicken growth traits have been studied using quantitative trait loci mapping through genome-scan and candidate gene approaches, genome-wide association studies (GWAS), comparative genomic strategies, microRNA (miRNA) regulation of growth development analysis, and epigenomic analysis. This review focuses on chicken GWAS and miRNA regulation of growth traits. Several recently published GWAS reports showed that most genome-wide significant single nucleotide polymorphisms are located on chromosomes 1 and 4 in chickens. Chicken growth, particularly skeletal muscle growth and development, is greatly regulated by miRNA. Using dwarf and normal chickens, let-7b was found to be involved in determining chicken dwarf phenotypes by regulating growth hormone receptor gene expression. PMID:24082823

  5. Comparative analysis of genome-wide divergence, domestication footprints and genome-wide association study of root traits for Gossypium hirsutum and Gossypium barbadense

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using genome-wide distributed SNPs, we examined ...

  6. A powerful test of independent assortment that determines genome-wide significance quickly and accurately

    PubMed Central

    Stewart, W C L; Hager, V R

    2016-01-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD—a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance—the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  7. A powerful test of independent assortment that determines genome-wide significance quickly and accurately.

    PubMed

    Stewart, W C L; Hager, V R

    2016-08-01

    In the analysis of DNA sequences on related individuals, most methods strive to incorporate as much information as possible, with little or no attention paid to the issue of statistical significance. For example, a modern workstation can easily handle the computations needed to perform a large-scale genome-wide inheritance-by-descent (IBD) scan, but accurate assessment of the significance of that scan is often hindered by inaccurate approximations and computationally intensive simulation. To address these issues, we developed gLOD-a test of co-segregation that, for large samples, models chromosome-specific IBD statistics as a collection of stationary Gaussian processes. With this simple model, the parametric bootstrap yields an accurate and rapid assessment of significance-the genome-wide corrected P-value. Furthermore, we show that (i) under the null hypothesis, the limiting distribution of the gLOD is the standard Gumbel distribution; (ii) our parametric bootstrap simulator is approximately 40 000 times faster than gene-dropping methods, and it is more powerful than methods that approximate the adjusted P-value; and, (iii) the gLOD has the same statistical power as the widely used maximum Kong and Cox LOD. Thus, our approach gives researchers the ability to determine quickly and accurately the significance of most large-scale IBD scans, which may contain multiple traits, thousands of families and tens of thousands of DNA sequences. PMID:27245422

  8. Genome-wide association interaction analysis for Alzheimer's disease

    PubMed Central

    Gusareva, Elena S.; Carrasquillo, Minerva M.; Bellenguez, Céline; Cuyvers, Elise; Colon, Samuel; Graff-Radford, Neill R.; Petersen, Ronald C.; Dickson, Dennis W.; Mahachie Johna, Jestinah M.; Bessonov, Kyrylo; Van Broeckhoven, Christine; Williams, Julie; Amouyel, Philippe; Sleegers, Kristel; Ertekin-Taner, Nilüfer; Lambert, Jean-Charles; Van Steen, Kristel

    2015-01-01

    We propose a minimal protocol for exhaustive genome-wide association interaction analysis that involves screening for epistasis over large-scale genomic data combining strengths of different methods and statistical tools. The different steps of this protocol are illustrated on a real-life data application for Alzheimer's disease (AD) (2259 patients and 6017 controls from France). Particularly, in the exhaustive genome-wide epistasis screening we identified AD-associated interacting SNPs-pair from chromosome 6q11.1 (rs6455128, the KHDRBS2 gene) and 13q12.11 (rs7989332, the CRYL1 gene) (p = 0.006, corrected for multiple testing). A replication analysis in the independent AD cohort from Germany (555 patients and 824 controls) confirmed the discovered epistasis signal (p = 0.036). This signal was also supported by a meta-analysis approach in 5 independent AD cohorts that was applied in the context of epistasis for the first time. Transcriptome analysis revealed negative correlation between expression levels of KHDRBS2 and CRYL1 in both the temporal cortex (β = −0.19, p = 0.0006) and cerebellum (β = −0.23, p < 0.0001) brain regions. This is the first time a replicable epistasis associated with AD was identified using a hypothesis free screening approach. PMID:24958192

  9. A Genome-Wide Association Study of Aging

    PubMed Central

    Walter, Stefan; Atzmon, Gil; Demerath, Ellen W.; Garcia, Melissa E.; Kaplan, Robert C.; Kumari, Meena; Lunetta, Kathryn L.; Milaneschi, Yuri; Tanaka, Toshiko; Tranah, Gregory J.; Völker, Uwe; Yu, Lei; Arnold, Alice; Benjamin, Emelia J.; Biffar, Reiner; Buchman, Aron S.; Boerwinkle, Eric; Couper, David; De Jager, Philip L.; Evans, Denis A.; Harris, Tamara B.; Hoffmann, Wolfgang; Hofman, Albert; Karasik, David; Kiel, Douglas P.; Kocher, Thomas; Kuningas, Maris; Launer, Lenore J.; Lohman, Kurt K.; Lutsey, Pamela L.; Mackenbach, Johan; Marciante, Kristin; Psaty, Bruce M.; Reiman, Eric M.; Rotter, Jerome I.; Seshadri, Sudha; Shardell, Michelle D.; Smith, Albert V.; van Duijn, Cornelia; Walston, Jeremy; Zillikens, M. Carola; Bandinelli, Stefania; Baumeister, Sebastian E.; Bennett, David A.; Ferrucci, Luigi; Gudnason, Vilmundur; Kivimaki, Mika; Liu, Yongmei; Murabito, Joanne M.; Newman, Anne B.; Tiemeier, Henning; Franceschini, Nora

    2011-01-01

    Human longevity and healthy aging show moderate heritability (20–50%). We conducted a meta-analysis of genome-wide association studies from nine studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology Consortium for two outcomes: a) all-cause mortality and b) survival free of major disease or death. No single nucleotide polymorphism (SNP) was a genome-wide significant predictor of either outcome (p < 5 × 10−8). We found fourteen independent SNPs that predicted risk of death, and eight SNPs that predicted event-free survival (p < 10−5). These SNPs are in or near genes that are highly expressed in the brain (HECW2, HIP1, BIN2, GRIA1), genes involved in neural development and function (KCNQ4, LMO4, GRIA1, NETO1) and autophagy (ATG4C), and genes that are associated with risk of various diseases including cancer and Alzheimer’s disease. In addition to considerable overlap between the traits, pathway and network analysis corroborated these findings. These findings indicate that variation in genes involved in neurological processes may be an important factor in regulating aging free of major disease and achieving longevity. PMID:21782286

  10. Consistency of genome-wide associations across major ancestral groups.

    PubMed

    Ntzani, Evangelia E; Liberopoulos, George; Manolio, Teri A; Ioannidis, John P A

    2012-07-01

    It is not well known whether genetic markers identified through genome-wide association studies (GWAS) confer similar or different risks across people of different ancestry. We screened a regularly updated catalog of all published GWAS curated at the NHGRI website for GWAS-identified associations that had reached genome-wide significance (p ≤ 5 × 10(-8)) in at least one major ancestry group (European, Asian, African) and for which replication data were available for comparison in at least two different major ancestry groups. These groups were compared for the correlation between and differences in risk allele frequencies and genetic effects' estimates. Data on 108 eligible GWAS-identified associations with a total of 900 datasets (European, n = 624; Asian, n = 217; African, n = 60) were analyzed. Risk-allele frequencies were modestly correlated between ancestry groups, with >10% absolute differences in 75-89% of the three pairwise comparisons of ancestry groups. Genetic effect (odds ratio) point estimates between ancestry groups correlated modestly (pairwise comparisons' correlation coefficients: 0.20-0.33) and point estimates of risks were opposite in direction or differed more than twofold in 57%, 79%, and 89% of the European versus Asian, European versus African, and Asian versus African comparisons, respectively. The modest correlations, differing risk estimates, and considerable between-association heterogeneity suggest that differential ancestral effects can be anticipated and genomic risk markers may need separate further evaluation in different ancestry groups. PMID:22183176

  11. Genome-wide association study of Tourette's syndrome.

    PubMed

    Scharf, J M; Yu, D; Mathews, C A; Neale, B M; Stewart, S E; Fagerness, J A; Evans, P; Gamazon, E; Edlund, C K; Service, S K; Tikhomirov, A; Osiecki, L; Illmann, C; Pluzhnikov, A; Konkashbaev, A; Davis, L K; Han, B; Crane, J; Moorjani, P; Crenshaw, A T; Parkin, M A; Reus, V I; Lowe, T L; Rangel-Lugo, M; Chouinard, S; Dion, Y; Girard, S; Cath, D C; Smit, J H; King, R A; Fernandez, T V; Leckman, J F; Kidd, K K; Kidd, J R; Pakstis, A J; State, M W; Herrera, L D; Romero, R; Fournier, E; Sandor, P; Barr, C L; Phan, N; Gross-Tsur, V; Benarroch, F; Pollak, Y; Budman, C L; Bruun, R D; Erenberg, G; Naarden, A L; Lee, P C; Weiss, N; Kremeyer, B; Berrío, G B; Campbell, D D; Cardona Silgado, J C; Ochoa, W C; Mesa Restrepo, S C; Muller, H; Valencia Duarte, A V; Lyon, G J; Leppert, M; Morgan, J; Weiss, R; Grados, M A; Anderson, K; Davarya, S; Singer, H; Walkup, J; Jankovic, J; Tischfield, J A; Heiman, G A; Gilbert, D L; Hoekstra, P J; Robertson, M M; Kurlan, R; Liu, C; Gibbs, J R; Singleton, A; Hardy, J; Strengman, E; Ophoff, R A; Wagner, M; Moessner, R; Mirel, D B; Posthuma, D; Sabatti, C; Eskin, E; Conti, D V; Knowles, J A; Ruiz-Linares, A; Rouleau, G A; Purcell, S; Heutink, P; Oostra, B A; McMahon, W M; Freimer, N B; Cox, N J; Pauls, D L

    2013-06-01

    Tourette's syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (P<5 × 10(-8)); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (P=1.85 × 10(-6)). A secondary analysis including an additional 211 cases and 285 controls from two closely related Latin American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (P=3.6 × 10(-7) for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  12. Genome-wide association study of Tourette Syndrome

    PubMed Central

    Scharf, Jeremiah M.; Yu, Dongmei; Mathews, Carol A.; Neale, Benjamin M.; Stewart, S. Evelyn; Fagerness, Jesen A; Evans, Patrick; Gamazon, Eric; Edlund, Christopher K.; Service, Susan; Tikhomirov, Anna; Osiecki, Lisa; Illmann, Cornelia; Pluzhnikov, Anna; Konkashbaev, Anuar; Davis, Lea K; Han, Buhm; Crane, Jacquelyn; Moorjani, Priya; Crenshaw, Andrew T.; Parkin, Melissa A.; Reus, Victor I.; Lowe, Thomas L.; Rangel-Lugo, Martha; Chouinard, Sylvain; Dion, Yves; Girard, Simon; Cath, Danielle C; Smit, Jan H; King, Robert A.; Fernandez, Thomas; Leckman, James F.; Kidd, Kenneth K.; Kidd, Judith R.; Pakstis, Andrew J.; State, Matthew; Herrera, Luis Diego; Romero, Roxana; Fournier, Eduardo; Sandor, Paul; Barr, Cathy L; Phan, Nam; Gross-Tsur, Varda; Benarroch, Fortu; Pollak, Yehuda; Budman, Cathy L.; Bruun, Ruth D.; Erenberg, Gerald; Naarden, Allan L; Lee, Paul C; Weiss, Nicholas; Kremeyer, Barbara; Berrío, Gabriel Bedoya; Campbell, Desmond; Silgado, Julio C. Cardona; Ochoa, William Cornejo; Restrepo, Sandra C. Mesa; Muller, Heike; Duarte, Ana V. Valencia; Lyon, Gholson J; Leppert, Mark; Morgan, Jubel; Weiss, Robert; Grados, Marco A.; Anderson, Kelley; Davarya, Sarah; Singer, Harvey; Walkup, John; Jankovic, Joseph; Tischfield, Jay A.; Heiman, Gary A.; Gilbert, Donald L.; Hoekstra, Pieter J.; Robertson, Mary M.; Kurlan, Roger; Liu, Chunyu; Gibbs, J. Raphael; Singleton, Andrew; Hardy, John; Strengman, Eric; Ophoff, Roel; Wagner, Michael; Moessner, Rainald; Mirel, Daniel B.; Posthuma, Danielle; Sabatti, Chiara; Eskin, Eleazar; Conti, David V.; Knowles, James A.; Ruiz-Linares, Andres; Rouleau, Guy A.; Purcell, Shaun; Heutink, Peter; Oostra, Ben A.; McMahon, William; Freimer, Nelson; Cox, Nancy J.; Pauls, David L.

    2012-01-01

    Tourette Syndrome (TS) is a developmental disorder that has one of the highest familial recurrence rates among neuropsychiatric diseases with complex inheritance. However, the identification of definitive TS susceptibility genes remains elusive. Here, we report the first genome-wide association study (GWAS) of TS in 1285 cases and 4964 ancestry-matched controls of European ancestry, including two European-derived population isolates, Ashkenazi Jews from North America and Israel, and French Canadians from Quebec, Canada. In a primary meta-analysis of GWAS data from these European ancestry samples, no markers achieved a genome-wide threshold of significance (p<5 × 10−8); the top signal was found in rs7868992 on chromosome 9q32 within COL27A1 (p=1.85 × 10−6). A secondary analysis including an additional 211 cases and 285 controls from two closely-related Latin-American population isolates from the Central Valley of Costa Rica and Antioquia, Colombia also identified rs7868992 as the top signal (p=3.6 × 10−7 for the combined sample of 1496 cases and 5249 controls following imputation with 1000 Genomes data). This study lays the groundwork for the eventual identification of common TS susceptibility variants in larger cohorts and helps to provide a more complete understanding of the full genetic architecture of this disorder. PMID:22889924

  13. Genome-wide association analysis identifies three psoriasis susceptibility loci

    PubMed Central

    Stuart, Philip E.; Nair, Rajan P.; Ellinghaus, Eva; Ding, Jun; Tejasvi, Trilokraj; Gudjonsson, Johann E.; Li, Yun; Weidinger, Stephan; Eberlein, Bernadette; Gieger, Christian; Wichmann, H. Erich; Kunz, Manfred; Ike, Robert; Krueger, Gerald G.; Bowcock, Anne M.; Mroweitz, Ulrich; Lim, Henry W.; Voorhees, John J.; Abecasis, Goncalo R.; Weichenthal, Michael; Franke, Andre; Rahman, Proton; Gladman, Dafna D.; Elder, James T.

    2010-01-01

    To identify novel psoriasis susceptibility loci, we carried out a meta-analysis of two recent genome-wide association studies 1,2, yielding a discovery sample of 1,831 cases and 2,546 controls. 102 of the most promising loci in the discovery analysis were followed up in a three-stage replication study using 4,064 cases and 4,685 controls from Michigan, Toronto, Newfoundland, and Germany. Association at a genome-wide level of significance for the combined discovery and replication samples was found for three genomic regions. One contains NOS2 (rs4795067, p = 4 × 10−11), another contains FBXL19 (rs10782001, p = 9 × 10−10), and a third contains PSMA6 and NFKBIA (rs12586317, p = 2 × 10−8). All three loci were also strongly associated with the subphenotypes of psoriatic arthritis and purely cutaneous psoriasis. Finally, we confirmed a recently identified3 association signal near RNF114. PMID:20953189

  14. A Pooled Genome-Wide Association Study of Asperger Syndrome

    PubMed Central

    Warrier, Varun; Chakrabarti, Bhismadev; Murphy, Laura; Chan, Allen; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E.; Baron-Cohen, Simon

    2015-01-01

    Asperger Syndrome (AS) is a neurodevelopmental condition characterized by impairments in social interaction and communication, alongside the presence of unusually repetitive, restricted interests and stereotyped behaviour. Individuals with AS have no delay in cognitive and language development. It is a subset of Autism Spectrum Conditions (ASC), which are highly heritable and has a population prevalence of approximately 1%. Few studies have investigated the genetic basis of AS. To address this gap in the literature, we performed a genome-wide pooled DNA association study to identify candidate loci in 612 individuals (294 cases and 318 controls) of Caucasian ancestry, using the Affymetrix GeneChip Human Mapping version 6.0 array. We identified 11 SNPs that had a p-value below 1x10-5. These SNPs were independently genotyped in the same sample. Three of the SNPs (rs1268055, rs7785891 and rs2782448) were nominally significant, though none remained significant after Bonferroni correction. Two of our top three SNPs (rs7785891 and rs2782448) lie in loci previously implicated in ASC. However, investigation of the three SNPs in the ASC genome-wide association dataset from the Psychiatric Genomics Consortium indicated that these three SNPs were not significantly associated with ASC. The effect sizes of the variants were modest, indicating that our study was not sufficiently powered to identify causal variants with precision. PMID:26176695

  15. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  16. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  17. Genome-wide genetic changes during modern breeding of maize.

    PubMed

    Jiao, Yinping; Zhao, Hainan; Ren, Longhui; Song, Weibin; Zeng, Biao; Guo, Jinjie; Wang, Baobao; Liu, Zhipeng; Chen, Jing; Li, Wei; Zhang, Mei; Xie, Shaojun; Lai, Jinsheng

    2012-07-01

    The success of modern maize breeding has been demonstrated by remarkable increases in productivity over the last four decades. However, the underlying genetic changes correlated with these gains remain largely unknown. We report here the sequencing of 278 temperate maize inbred lines from different stages of breeding history, including deep resequencing of 4 lines with known pedigree information. The results show that modern breeding has introduced highly dynamic genetic changes into the maize genome. Artificial selection has affected thousands of targets, including genes and non-genic regions, leading to a reduction in nucleotide diversity and an increase in the proportion of rare alleles. Genetic changes during breeding happen rapidly, with extensive variation (SNPs, indels and copy-number variants (CNVs)) occurring, even within identity-by-descent regions. Our genome-wide assessment of genetic changes during modern maize breeding provides new strategies as well as practical targets for future crop breeding and biotechnology. PMID:22660547

  18. Integrative analysis of genome-wide RNA interference screens.

    PubMed

    Berndt, Jason D; Biechele, Travis L; Moon, Randall T; Major, Michael B

    2009-01-01

    High-throughput genetic screens have exponentially increased the functional annotation of the genome over the past 10 years. Likewise, genome-scale efforts to map DNA methylation, chromatin state and occupancy, messenger RNA expression patterns, and disease-associated genetic polymorphisms, and proteome-wide efforts to map protein-protein interactions, have also created vast resources of data. An emerging trend involves combining multiple types of data, referred to as integrative screening. Examples include papers that report integrated data generated from large-scale RNA interference screens on the Wnt/beta-catenin pathway with either genotypic or proteomic data in colorectal cancer. These studies demonstrate the power of data integration to generate focused, validated data sets and to identify high-confidence candidate genes for follow-up experiments. We present the ongoing evolution and new strategies for the integrative screening approach with respect to understanding and treating human disease. PMID:19436058

  19. Genome-wide nucleosome positioning during embryonic stem cell development.

    PubMed

    Teif, Vladimir B; Vainshtein, Yevhen; Caudron-Herger, Maïwen; Mallm, Jan-Philipp; Marth, Caroline; Höfer, Thomas; Rippe, Karsten

    2012-11-01

    We determined genome-wide nucleosome occupancies in mouse embryonic stem cells and their neural progenitor and embryonic fibroblast counterparts to assess features associated with nucleosome positioning during lineage commitment. Cell-type- and protein-specific binding preferences of transcription factors to sites with either low (Myc, Klf4 and Zfx) or high (Nanog, Oct4 and Sox2) nucleosome occupancy as well as complex patterns for CTCF were identified. Nucleosome-depleted regions around transcription start and transcription termination sites were broad and more pronounced for active genes, with distinct patterns for promoters classified according to CpG content or histone methylation marks. Throughout the genome, nucleosome occupancy was correlated with certain histone methylation or acetylation modifications. In addition, the average nucleosome repeat length increased during differentiation by 5-7 base pairs, with local variations for specific regions. Our results reveal regulatory mechanisms of cell differentiation that involve nucleosome repositioning. PMID:23085715

  20. Quality control procedures for genome-wide association studies.

    PubMed

    Turner, Stephen; Armstrong, Loren L; Bradford, Yuki; Carlson, Christopher S; Crawford, Dana C; Crenshaw, Andrew T; de Andrade, Mariza; Doheny, Kimberly F; Haines, Jonathan L; Hayes, Geoffrey; Jarvik, Gail; Jiang, Lan; Kullo, Iftikhar J; Li, Rongling; Ling, Hua; Manolio, Teri A; Matsumoto, Martha; McCarty, Catherine A; McDavid, Andrew N; Mirel, Daniel B; Paschall, Justin E; Pugh, Elizabeth W; Rasmussen, Luke V; Wilke, Russell A; Zuvich, Rebecca L; Ritchie, Marylyn D

    2011-01-01

    Genome-wide association studies (GWAS) are being conducted at an unprecedented rate in population-based cohorts and have increased our understanding of the pathophysiology of complex disease. Regardless of context, the practical utility of this information will ultimately depend upon the quality of the original data. Quality control (QC) procedures for GWAS are computationally intensive, operationally challenging, and constantly evolving. Here we enumerate some of the challenges in QC of GWAS data and describe the approaches that the electronic MEdical Records and Genomics (eMERGE) network is using for quality assurance in GWAS data, thereby minimizing potential bias and error in GWAS results. We discuss common issues associated with QC of GWAS data, including data file formats, software packages for data manipulation and analysis, sex chromosome anomalies, sample identity, sample relatedness, population substructure, batch effects, and marker quality. We propose best practices and discuss areas of ongoing and future research. PMID:21234875

  1. Genome-wide DNA methylation profiling in zebrafish.

    PubMed

    Murphy, P J; Cairns, B R

    2016-01-01

    Genomic DNA methylation functions to repress gene expression by interfering with transcription factor binding and/or recruiting repressive chromatin machinery. Recent data support contribution of regulated DNA methylation to embryonic pluripotency, development, and tissue differentiation; this important epigenetic mark is chemically stable yet enzymatically reversible-and heritable through the germline. Importantly, all the major components involved in dynamic DNA methylation are conserved in zebrafish, including the factors that "write, read, and erase" this mark. Therefore, the zebrafish has become an excellent model for studying most biological processes associated with DNA methylation in mammals. Here we briefly review the zebrafish model for studying DNA methylation and describe a series of methods for performing genome-wide DNA methylation analysis. We address and provide methods for methylated DNA immunoprecipitation followed by sequencing (MeDIP-Seq), bisulfite sequencing (BS-Seq), and reduced representation bisulfite sequencing (RRBS-Seq). PMID:27443935

  2. Genome-wide association study identifies novel loci predisposing to cutaneous melanoma.

    PubMed

    Amos, Christopher I; Wang, Li-E; Lee, Jeffrey E; Gershenwald, Jeffrey E; Chen, Wei V; Fang, Shenying; Kosoy, Roman; Zhang, Mingfeng; Qureshi, Abrar A; Vattathil, Selina; Schacherer, Christopher W; Gardner, Julie M; Wang, Yuling; Bishop, D Tim; Barrett, Jennifer H; MacGregor, Stuart; Hayward, Nicholas K; Martin, Nicholas G; Duffy, David L; Mann, Graham J; Cust, Anne; Hopper, John; Brown, Kevin M; Grimm, Elizabeth A; Xu, Yaji; Han, Younghun; Jing, Kaiyan; McHugh, Caitlin; Laurie, Cathy C; Doheny, Kim F; Pugh, Elizabeth W; Seldin, Michael F; Han, Jiali; Wei, Qingyi

    2011-12-15

    We performed a multistage genome-wide association study of melanoma. In a discovery cohort of 1804 melanoma cases and 1026 controls, we identified loci at chromosomes 15q13.1 (HERC2/OCA2 region) and 16q24.3 (MC1R) regions that reached genome-wide significance within this study and also found strong evidence for genetic effects on susceptibility to melanoma from markers on chromosome 9p21.3 in the p16/ARF region and on chromosome 1q21.3 (ARNT/LASS2/ANXA9 region). The most significant single-nucleotide polymorphisms (SNPs) in the 15q13.1 locus (rs1129038 and rs12913832) lie within a genomic region that has profound effects on eye and skin color; notably, 50% of variability in eye color is associated with variation in the SNP rs12913832. Because eye and skin colors vary across European populations, we further evaluated the associations of the significant SNPs after carefully adjusting for European substructure. We also evaluated the top 10 most significant SNPs by using data from three other genome-wide scans. Additional in silico data provided replication of the findings from the most significant region on chromosome 1q21.3 rs7412746 (P = 6 × 10(-10)). Together, these data identified several candidate genes for additional studies to identify causal variants predisposing to increased risk for developing melanoma. PMID:21926416

  3. Genome-wide congealing and rapid transitions across the speciation continuum during speciation with gene flow.

    PubMed

    Feder, Jeffrey L; Nosil, Patrik; Wacholder, Aaron C; Egan, Scott P; Berlocher, Stewart H; Flaxman, Samuel M

    2014-01-01

    Our current understanding of speciation is often based on considering a relatively small number of genes, sometimes in isolation of one another. Here, we describe a possible emergent genome process involving the aggregate effect of many genes contributing to the evolution of reproductive isolation across the speciation continuum. When a threshold number of divergently selected mutations of modest to low fitness effects accumulate between populations diverging with gene flow, nonlinear transitions can occur in which levels of adaptive differentiation, linkage disequilibrium, and reproductive isolation dramatically increase. In effect, the genomes of the populations start to "congeal" into distinct entities representing different species. At this stage, reproductive isolation changes from being a characteristic of specific, divergently selected genes to a property of the genome. We examine conditions conducive to such genome-wide congealing (GWC), describe how to empirically test for GWC, and highlight a putative empirical example involving Rhagoletis fruit flies. We conclude with cautious optimism that the models and concepts discussed here, once extended to large numbers of neutral markers, may provide a framework for integrating information from genome scans, selection experiments, quantitative trait loci mapping, association studies, and natural history to develop a deeper understanding of the genomics of speciation. PMID:25149256

  4. Genome wide association identifies novel loci involved in fungal communication.

    PubMed

    Palma-Guerrero, Javier; Hall, Charles R; Kowbel, David; Welch, Juliet; Taylor, John W; Brem, Rachel B; Glass, N Louise

    2013-01-01

    Understanding how genomes encode complex cellular and organismal behaviors has become the outstanding challenge of modern genetics. Unlike classical screening methods, analysis of genetic variation that occurs naturally in wild populations can enable rapid, genome-scale mapping of genotype to phenotype with a medium-throughput experimental design. Here we describe the results of the first genome-wide association study (GWAS) used to identify novel loci underlying trait variation in a microbial eukaryote, harnessing wild isolates of the filamentous fungus Neurospora crassa. We genotyped each of a population of wild Louisiana strains at 1 million genetic loci genome-wide, and we used these genotypes to map genetic determinants of microbial communication. In N. crassa, germinated asexual spores (germlings) sense the presence of other germlings, grow toward them in a coordinated fashion, and fuse. We evaluated germlings of each strain for their ability to chemically sense, chemotropically seek, and undergo cell fusion, and we subjected these trait measurements to GWAS. This analysis identified one gene, NCU04379 (cse-1, encoding a homolog of a neuronal calcium sensor), at which inheritance was strongly associated with the efficiency of germling communication. Deletion of cse-1 significantly impaired germling communication and fusion, and two genes encoding predicted interaction partners of CSE1 were also required for the communication trait. Additionally, mining our association results for signaling and secretion genes with a potential role in germling communication, we validated six more previously unknown molecular players, including a secreted protease and two other genes whose deletion conferred a novel phenotype of increased communication and multi-germling fusion. Our results establish protein secretion as a linchpin of germling communication in N. crassa and shed light on the regulation of communication molecules in this fungus. Our study demonstrates the power

  5. Identification of Neural Outgrowth Genes using Genome-Wide RNAi

    PubMed Central

    Sepp, Katharine J.; Hong, Pengyu; Lizarraga, Sofia B.; Liu, Judy S.; Mejia, Luis A.; Walsh, Christopher A.; Perrimon, Norbert

    2008-01-01

    While genetic screens have identified many genes essential for neurite outgrowth, they have been limited in their ability to identify neural genes that also have earlier critical roles in the gastrula, or neural genes for which maternally contributed RNA compensates for gene mutations in the zygote. To address this, we developed methods to screen the Drosophila genome using RNA-interference (RNAi) on primary neural cells and present the results of the first full-genome RNAi screen in neurons. We used live-cell imaging and quantitative image analysis to characterize the morphological phenotypes of fluorescently labelled primary neurons and glia in response to RNAi-mediated gene knockdown. From the full genome screen, we focused our analysis on 104 evolutionarily conserved genes that when downregulated by RNAi, have morphological defects such as reduced axon extension, excessive branching, loss of fasciculation, and blebbing. To assist in the phenotypic analysis of the large data sets, we generated image analysis algorithms that could assess the statistical significance of the mutant phenotypes. The algorithms were essential for the analysis of the thousands of images generated by the screening process and will become a valuable tool for future genome-wide screens in primary neurons. Our analysis revealed unexpected, essential roles in neurite outgrowth for genes representing a wide range of functional categories including signalling molecules, enzymes, channels, receptors, and cytoskeletal proteins. We also found that genes known to be involved in protein and vesicle trafficking showed similar RNAi phenotypes. We confirmed phenotypes of the protein trafficking genes Sec61alpha and Ran GTPase using Drosophila embryo and mouse embryonic cerebral cortical neurons, respectively. Collectively, our results showed that RNAi phenotypes in primary neural culture can parallel in vivo phenotypes, and the screening technique can be used to identify many new genes that have

  6. Genome Wide Association Identifies Novel Loci Involved in Fungal Communication

    PubMed Central

    Kowbel, David; Welch, Juliet; Taylor, John W.; Brem, Rachel B.; Glass, N. Louise

    2013-01-01

    Understanding how genomes encode complex cellular and organismal behaviors has become the outstanding challenge of modern genetics. Unlike classical screening methods, analysis of genetic variation that occurs naturally in wild populations can enable rapid, genome-scale mapping of genotype to phenotype with a medium-throughput experimental design. Here we describe the results of the first genome-wide association study (GWAS) used to identify novel loci underlying trait variation in a microbial eukaryote, harnessing wild isolates of the filamentous fungus Neurospora crassa. We genotyped each of a population of wild Louisiana strains at 1 million genetic loci genome-wide, and we used these genotypes to map genetic determinants of microbial communication. In N. crassa, germinated asexual spores (germlings) sense the presence of other germlings, grow toward them in a coordinated fashion, and fuse. We evaluated germlings of each strain for their ability to chemically sense, chemotropically seek, and undergo cell fusion, and we subjected these trait measurements to GWAS. This analysis identified one gene, NCU04379 (cse-1, encoding a homolog of a neuronal calcium sensor), at which inheritance was strongly associated with the efficiency of germling communication. Deletion of cse-1 significantly impaired germling communication and fusion, and two genes encoding predicted interaction partners of CSE1 were also required for the communication trait. Additionally, mining our association results for signaling and secretion genes with a potential role in germling communication, we validated six more previously unknown molecular players, including a secreted protease and two other genes whose deletion conferred a novel phenotype of increased communication and multi-germling fusion. Our results establish protein secretion as a linchpin of germling communication in N. crassa and shed light on the regulation of communication molecules in this fungus. Our study demonstrates the power

  7. Advances in genome-wide DNA methylation analysis

    PubMed Central

    Gupta, Romi; Nagarajan, Arvindhan; Wajapeyee, Narendra

    2013-01-01

    The covalent DNA modification of cytosine at position 5 (5-methylcytosine; 5mC) has emerged as an important epigenetic mark most commonly present in the context of CpG dinucleotides in mammalian cells. In pluripotent stem cells and plants, it is also found in non-CpG and CpNpG contexts, respectively. 5mC has important implications in a diverse set of biological processes, including transcriptional regulation. Aberrant DNA methylation has been shown to be associated with a wide variety of human ailments and thus is the focus of active investigation. Methods used for detecting DNA methylation have revolutionized our understanding of this epigenetic mark and provided new insights into its role in diverse biological functions. Here we describe recent technological advances in genome-wide DNA methylation analysis and discuss their relative utility and drawbacks, providing specific examples from studies that have used these technologies for genome-wide DNA methylation analysis to address important biological questions. Finally, we discuss a newly identified covalent DNA modification, 5-hydroxymethylcytosine (5hmC), and speculate on its possible biological function, as well as describe a new methodology that can distinguish 5hmC from 5mC. PMID:20964631

  8. Genome-wide association study of obsessive-compulsive disorder.

    PubMed

    Stewart, S E; Yu, D; Scharf, J M; Neale, B M; Fagerness, J A; Mathews, C A; Arnold, P D; Evans, P D; Gamazon, E R; Davis, L K; Osiecki, L; McGrath, L; Haddad, S; Crane, J; Hezel, D; Illman, C; Mayerfeld, C; Konkashbaev, A; Liu, C; Pluzhnikov, A; Tikhomirov, A; Edlund, C K; Rauch, S L; Moessner, R; Falkai, P; Maier, W; Ruhrmann, S; Grabe, H-J; Lennertz, L; Wagner, M; Bellodi, L; Cavallini, M C; Richter, M A; Cook, E H; Kennedy, J L; Rosenberg, D; Stein, D J; Hemmings, S M J; Lochner, C; Azzam, A; Chavira, D A; Fournier, E; Garrido, H; Sheppard, B; Umaña, P; Murphy, D L; Wendland, J R; Veenstra-VanderWeele, J; Denys, D; Blom, R; Deforce, D; Van Nieuwerburgh, F; Westenberg, H G M; Walitza, S; Egberts, K; Renner, T; Miguel, E C; Cappi, C; Hounie, A G; Conceição do Rosário, M; Sampaio, A S; Vallada, H; Nicolini, H; Lanzagorta, N; Camarena, B; Delorme, R; Leboyer, M; Pato, C N; Pato, M T; Voyiaziakis, E; Heutink, P; Cath, D C; Posthuma, D; Smit, J H; Samuels, J; Bienvenu, O J; Cullen, B; Fyer, A J; Grados, M A; Greenberg, B D; McCracken, J T; Riddle, M A; Wang, Y; Coric, V; Leckman, J F; Bloch, M; Pittenger, C; Eapen, V; Black, D W; Ophoff, R A; Strengman, E; Cusi, D; Turiel, M; Frau, F; Macciardi, F; Gibbs, J R; Cookson, M R; Singleton, A; Hardy, J; Crenshaw, A T; Parkin, M A; Mirel, D B; Conti, D V; Purcell, S; Nestadt, G; Hanna, G L; Jenike, M A; Knowles, J A; Cox, N; Pauls, D L

    2013-07-01

    Obsessive-compulsive disorder (OCD) is a common, debilitating neuropsychiatric illness with complex genetic etiology. The International OCD Foundation Genetics Collaborative (IOCDF-GC) is a multi-national collaboration established to discover the genetic variation predisposing to OCD. A set of individuals affected with DSM-IV OCD, a subset of their parents, and unselected controls, were genotyped with several different Illumina SNP microarrays. After extensive data cleaning, 1465 cases, 5557 ancestry-matched controls and 400 complete trios remained, with a common set of 469,410 autosomal and 9657 X-chromosome single nucleotide polymorphisms (SNPs). Ancestry-stratified case-control association analyses were conducted for three genetically-defined subpopulations and combined in two meta-analyses, with and without the trio-based analysis. In the case-control analysis, the lowest two P-values were located within DLGAP1 (P=2.49 × 10(-6) and P=3.44 × 10(-6)), a member of the neuronal postsynaptic density complex. In the trio analysis, rs6131295, near BTBD3, exceeded the genome-wide significance threshold with a P-value=3.84 × 10(-8). However, when trios were meta-analyzed with the case-control samples, the P-value for this variant was 3.62 × 10(-5), losing genome-wide significance. Although no SNPs were identified to be associated with OCD at a genome-wide significant level in the combined trio-case-control sample, a significant enrichment of methylation QTLs (P<0.001) and frontal lobe expression quantitative trait loci (eQTLs) (P=0.001) was observed within the top-ranked SNPs (P<0.01) from the trio-case-control analysis, suggesting these top signals may have a broad role in gene expression in the brain, and possibly in the etiology of OCD. PMID:22889921

  9. Genome-Wide Association Study of Metabolic Syndrome in Koreans

    PubMed Central

    Jeong, Seok Won; Chung, Myungguen; Park, Soo-Jung; Cho, Seong Beom

    2014-01-01

    Metabolic syndrome (METS) is a disorder of energy utilization and storage and increases the risk of developing cardiovascular disease and diabetes. To identify the genetic risk factors of METS, we carried out a genome-wide association study (GWAS) for 2,657 cases and 5,917 controls in Korean populations. As a result, we could identify 2 single nucleotide polymorphisms (SNPs) with genome-wide significance level p-values (<5 × 10-8), 8 SNPs with genome-wide suggestive p-values (5 × 10-8 ≤ p < 1 × 10-5), and 2 SNPs of more functional variants with borderline p-values (5 × 10-5 ≤ p < 1 × 10-4). On the other hand, the multiple correction criteria of conventional GWASs exclude false-positive loci, but simultaneously, they discard many true-positive loci. To reconsider the discarded true-positive loci, we attempted to include the functional variants (nonsynonymous SNPs [nsSNPs] and expression quantitative trait loci [eQTL]) among the top 5,000 SNPs based on the proportion of phenotypic variance explained by genotypic variance. In total, 159 eQTLs and 18 nsSNPs were presented in the top 5,000 SNPs. Although they should be replicated in other independent populations, 6 eQTLs and 2 nsSNP loci were located in the molecular pathways of LPL, APOA5, and CHRM2, which were the significant or suggestive loci in the METS GWAS. Conclusively, our approach using the conventional GWAS, reconsidering functional variants and pathway-based interpretation, suggests a useful method to understand the GWAS results of complex traits and can be expanded in other genomewide association studies. PMID:25705157

  10. Genome-Wide Association Study of Parity in Bangladeshi Women

    PubMed Central

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L.; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G.; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10-7 and number of pregnancies at P <10-6. This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10-6. The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10-5 for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis. PMID:25742292

  11. Genome-wide mapping of DNA hydroxymethylation in osteoarthritic chondrocytes

    PubMed Central

    Taylor, Sarah E. B.; Li, Ye Henry; Wong, Wing H.; Bhutani, Nidhi

    2015-01-01

    Objective To examine genome-wide 5hmC distribution in osteoarthritic (OA) and normal chondrocytes to investigate the effect on OA-specific gene expression. Methods Cartilage was obtained from OA patients undergoing total knee arthroplasty or control patients undergoing anterior cruciate ligament reconstruction. Genome-wide sequencing of 5hmC-enriched DNA (5hmC-seq) was performed for a small cohort of normal and OA chondrocytes to identify differentially hydroxymethylated regions (DhMRs) in OA chondrocytes. 5hmC-seq data was intersected with global OA gene expression data to define subsets of genes and pathways potentially affected by increased 5hmC levels in OA chondrocytes. Results 70591 DhMRs were identified in OA chondrocytes compared to normal chondrocytes, 44288 (63%) of which were increased in OA chondrocytes. The majority of DhMRs (66%) were gained in gene bodies. Increased DhMRs were observed in ~50% of genes previously implicated in OA pathology including MMP3, LRP5, GDF5 and COL11A1. Furthermore, analyses of gene expression data revealed gene body gain of 5hmC appears to be preferentially associated with activated but not repressed genes in OA chondrocytes. Conclusion This study provides the first genome-wide profiling of 5hmC distribution in OA chondrocytes. We had previously reported a global increase in 5hmC levels in OA chondrocytes. Gain of 5hmC in the gene body is found to be characteristic of activated genes in OA chondrocytes, highlighting the influence of 5hmC as an epigenetic mark in OA. In addition, this study identifies multiple OA-associated genes that are potentially regulated either singularly by gain of DNA hydroxymethylation or in combination with loss of DNA methylation. PMID:25940674

  12. Genome-wide association study of parity in Bangladeshi women.

    PubMed

    Aschebrook-Kilfoy, Briseis; Argos, Maria; Pierce, Brandon L; Tong, Lin; Jasmine, Farzana; Roy, Shantanu; Parvez, Faruque; Ahmed, Alauddin; Islam, Tariqul; Kibriya, Muhammad G; Ahsan, Habibul

    2015-01-01

    Human fertility is a complex trait determined by gene-environment interactions in which genetic factors represent a significant component. To better understand inter-individual variability in fertility, we performed one of the first genome-wide association studies (GWAS) of common fertility phenotypes, lifetime number of pregnancies and number of children in a developing country population. The fertility phenotype data and DNA samples were obtained at baseline recruitment from individuals participating in a large prospective cohort study in Bangladesh. GWAS analyses of fertility phenotypes were conducted among 1,686 married women. One SNP on chromosome 4 was non-significantly associated with number of children at P <10(-7) and number of pregnancies at P <10(-6). This SNP is located in a region without a gene within 1 Mb. One SNP on chromosome 6 was non-significantly associated with extreme number of children at P <10(-6). The closest gene to this SNP is HDGFL1, a hepatoma-derived growth factor. When we excluded hormonal contraceptive users, a SNP on chromosome 5 was non-significantly associated at P <10(-5) for number of children and number of pregnancies. This SNP is located near C5orf64, an open reading frame, and ZSWIM6, a zinc ion binding gene. We also estimated the heritability of these phenotypes from our genotype data using GCTA (Genome-wide Complex Trait Analysis) for number of children (hg2 = 0.149, SE = 0.24, p-value = 0.265) and number of pregnancies (hg2 = 0.007, SE = 0.22, p-value = 0.487). Our genome-wide association study and heritability estimates of number of pregnancies and number of children in Bangladesh did not confer strong evidence of common variants for parity variation. However, our results suggest that future studies may want to consider the role of 3 notable SNPs in their analysis. PMID:25742292

  13. Refining genome-wide linkage intervals using a meta-analysis of genome-wide association studies identifies loci influencing personality dimensions.

    PubMed

    Amin, Najaf; Hottenga, Jouke-Jan; Hansell, Narelle K; Janssens, A Cecile J W; de Moor, Marleen H M; Madden, Pamela A F; Zorkoltseva, Irina V; Penninx, Brenda W; Terracciano, Antonio; Uda, Manuela; Tanaka, Toshiko; Esko, Tonu; Realo, Anu; Ferrucci, Luigi; Luciano, Michelle; Davies, Gail; Metspalu, Andres; Abecasis, Goncalo R; Deary, Ian J; Raikkonen, Katri; Bierut, Laura J; Costa, Paul T; Saviouk, Viatcheslav; Zhu, Gu; Kirichenko, Anatoly V; Isaacs, Aaron; Aulchenko, Yurii S; Willemsen, Gonneke; Heath, Andrew C; Pergadia, Michele L; Medland, Sarah E; Axenovich, Tatiana I; de Geus, Eco; Montgomery, Grant W; Wright, Margaret J; Oostra, Ben A; Martin, Nicholas G; Boomsma, Dorret I; van Duijn, Cornelia M

    2013-08-01

    Personality traits are complex phenotypes related to psychosomatic health. Individually, various gene finding methods have not achieved much success in finding genetic variants associated with personality traits. We performed a meta-analysis of four genome-wide linkage scans (N=6149 subjects) of five basic personality traits assessed with the NEO Five-Factor Inventory. We compared the significant regions from the meta-analysis of linkage scans with the results of a meta-analysis of genome-wide association studies (GWAS) (N∼17 000). We found significant evidence of linkage of neuroticism to chromosome 3p14 (rs1490265, LOD=4.67) and to chromosome 19q13 (rs628604, LOD=3.55); of extraversion to 14q32 (ATGG002, LOD=3.3); and of agreeableness to 3p25 (rs709160, LOD=3.67) and to two adjacent regions on chromosome 15, including 15q13 (rs970408, LOD=4.07) and 15q14 (rs1055356, LOD=3.52) in the individual scans. In the meta-analysis, we found strong evidence of linkage of extraversion to 4q34, 9q34, 10q24 and 11q22, openness to 2p25, 3q26, 9p21, 11q24, 15q26 and 19q13 and agreeableness to 4q34 and 19p13. Significant evidence of association in the GWAS was detected between openness and rs677035 at 11q24 (P-value=2.6 × 10(-06), KCNJ1). The findings of our linkage meta-analysis and those of the GWAS suggest that 11q24 is a susceptible locus for openness, with KCNJ1 as the possible candidate gene. PMID:23211697

  14. [Genome-wide association study for adolescent idiopathic scoliosis].

    PubMed

    Ogura, Yoji; Kou, Ikuyo; Scoliosis, Japan; Matsumoto, Morio; Watanabe, Kota; Ikegawa, Shiro

    2016-04-01

    Adolescent idiopathic scoliosis(AIS)is a polygenic disease. Genome-wide association studies(GWASs)have been performed for a lot of polygenic diseases. For AIS, we conducted GWAS and identified the first AIS locus near LBX1. After the discovery, we have extended our study by increasing the numbers of subjects and SNPs. In total, our Japanese GWAS has identified four susceptibility genes. GWASs for AIS have also been performed in the USA and China, which identified one and three susceptibility genes, respectively. Here we review GWASs in Japan and abroad and functional analysis to clarify the pathomechanism of AIS. PMID:27013625

  15. [New insight of genome-wide association study (GWAS)].

    PubMed

    Hotta, Kikuko

    2013-02-01

    The number of obese patients is increasing in Japan, due to the westernization of lifestyle. Obesity, especially visceral fat obesity, is important for the development of metabolic syndrome. Genetic factors are important for the development of obesity as well as environmental factors. Importance of genetic factors of fat distribution is also reported. Recent genome-wide association studies (GWASs) have revealed the obesity and fat distribution-related polymorphisms. GWAS will highlight a better understanding of the underlying molecular mechanisms in the regulation of obesity and distribution of body fat. PMID:23631198

  16. Genome-wide association studies in diverse populations

    PubMed Central

    Rosenberg, Noah A; Huang, Lucy; Jewett, Ethan M; Szpiech, Zachary A; Jankovic, Ivana; Boehnke, Michael

    2011-01-01

    Genome-wide association (GWA) studies have identified a large number of single-nucleotide polymorphisms (SNPs) associated with disease phenotypes. As most GWA studies have been performed primarily in populations of European descent, this review examines the issues involved in extending consideration of GWA studies to diverse worldwide populations. Although challenges exist with such issues as imputation, admixture, and replication, investigation of diverse populations in GWA studies has significant potential to advance the project of mapping the genetic determinants of complex diseases for the human population as a whole. PMID:20395969

  17. Genome-Wide Association Studies for Polycystic Ovary Syndrome.

    PubMed

    Liu, Hongbin; Zhao, Han; Chen, Zi-Jiang

    2016-07-01

    Over the past several years, the field of reproductive medicine has witnessed great advances in genome-wide association studies (GWASs) of polycystic ovary syndrome (PCOS), leading to identification of several promising genes involved in hormone action, type 2 diabetes, and cell proliferation. This review summarizes the key findings and discusses their potential implications with regard to genetic mechanisms of PCOS. Limitations of GWAS are evaluated, emphasizing the understanding of the reasons for variability in results between individual studies. Root causes of misinterpretations of GWASs are also addressed. Finally, the impact of GWAS on future directions of multi- and interdisciplinary studies is discussed. PMID:27513023

  18. Heritability and Genome-Wide Association Analyses of Human Gait Suggest Contribution of Common Variants.

    PubMed

    Adams, Hieab H H; Verlinden, Vincentius J A; Callisaya, Michele L; van Duijn, Cornelia M; Hofman, Albert; Thomson, Russell; Uitterlinden, André G; Vernooij, Meike W; van der Geest, Jos N; Srikanth, Velandai; Ikram, M Arfan

    2016-06-01

    Human gait is a complex neurological and musculoskeletal function, of which the genetic basis remains largely unknown. To determine the influence of common genetic variants on gait parameters, we studied 2,946 participants of the Rotterdam Study, a population-based cohort of unrelated elderly individuals. We assessed 30 gait parameters using an electronic walkway, which yielded seven independent gait domains after principal component analysis. Genotypes of participants were imputed to the 1,000 Genomes reference panel for generating genetic relationship matrices to estimate heritability of gait parameters, and for subsequent genome-wide association scans (GWASs) to identify specific variants. Gait domains with the highest age- and sex-adjusted heritability were Variability (h (2) = 61%), Rhythm (37%), and Tandem (32%). For other gait domains, heritability estimates attenuated after adjustment for height and weight. Genome-wide association scans identified a variant on 1p22.3 that was significantly associated with single support time, a variable from the Rhythm domain (rs72953990; N = 2,946; β [SE] = 0.0069 (0.0012), p = 2.30×10(-8)). This variant did not replicate in an independent sample (N = 362; p = .78). In conclusion, human gait has highly heritable components that are explained by common genetic variation, which are partly attributed to height and weight. Collaborative efforts are needed to identify robust single variant associations for the heritable parameters. PMID:26219847

  19. Insights into kidney diseases from genome-wide association studies.

    PubMed

    Wuttke, Matthias; Köttgen, Anna

    2016-09-01

    Over the past decade, genome-wide association studies (GWAS) have considerably improved our understanding of the genetic basis of kidney function and disease. Population-based studies, used to investigate traits that define chronic kidney disease (CKD), have identified >50 genomic regions in which common genetic variants associate with estimated glomerular filtration rate or urinary albumin-to-creatinine ratio. Case-control studies, used to study specific CKD aetiologies, have yielded risk loci for specific kidney diseases such as IgA nephropathy and membranous nephropathy. In this Review, we summarize important findings from GWAS and clinical and experimental follow-up studies. We also compare risk allele frequency, effect sizes, and specificity in GWAS of CKD-defining traits and GWAS of specific CKD aetiologies and the implications for study design. Genomic regions identified in GWAS of CKD-defining traits can contain causal genes for monogenic kidney diseases. Population-based research on kidney function traits can therefore generate insights into more severe forms of kidney diseases. Experimental follow-up studies have begun to identify causal genes and variants, which are potential therapeutic targets, and suggest mechanisms underlying the high allele frequency of causal variants. GWAS are thus a useful approach to advance knowledge in nephrology. PMID:27477491

  20. Genome-wide analysis of epistasis in body mass index using multiple human populations

    PubMed Central

    Wei, Wen-Hua; Hemani, Gib; Gyenesei, Attila; Vitart, Veronique; Navarro, Pau; Hayward, Caroline; Cabrera, Claudia P; Huffman, Jennifer E; Knott, Sara A; Hicks, Andrew A; Rudan, Igor; Pramstaller, Peter P; Wild, Sarah H; Wilson, James F; Campbell, Harry; Hastie, Nicholas D; Wright, Alan F; Haley, Chris S

    2012-01-01

    We surveyed gene–gene interactions (epistasis) in human body mass index (BMI) in four European populations (n<1200) via exhaustive pair-wise genome scans where interactions were computed as F ratios by testing a linear regression model fitting two single-nucleotide polymorphisms (SNPs) with interactions against the one without. Before the association tests, BMI was corrected for sex and age, normalised and adjusted for relatedness. Neither single SNPs nor SNP interactions were genome-wide significant in either cohort based on the consensus threshold (P=5.0E−08) and a Bonferroni corrected threshold (P=1.1E−12), respectively. Next we compared sub genome-wide significant SNP interactions (P<5.0E−08) across cohorts to identify common epistatic signals, where SNPs were annotated to genes to test for gene ontology (GO) enrichment. Among the epistatic genes contributing to the commonly enriched GO terms, 19 were shared across study cohorts of which 15 are previously published genome-wide association loci, including CDH13 (cadherin 13) associated with height and SORCS2 (sortilin-related VPS10 domain containing receptor 2) associated with circulating insulin-like growth factor 1 and binding protein 3. Interactions between the 19 shared epistatic genes and those involving BMI candidate loci (P<5.0E−08) were tested across cohorts and found eight replicated at the SNP level (P<0.05) in at least one cohort, which were further tested and showed limited replication in a separate European population (n>5000). We conclude that genome-wide analysis of epistasis in multiple populations is an effective approach to provide new insights into the genetic regulation of BMI but requires additional efforts to confirm the findings. PMID:22333899

  1. Whole genome SNP scanning of snow sheep (Ovis nivicola).

    PubMed

    Deniskova, T E; Okhlopkov, I M; Sermyagin, A A; Gladyr', E A; Bagirov, V A; Sölkner, J; Mamaev, N V; Brem, G; Zinov'eva, N A

    2016-07-01

    This is the first report performing the whole genome SNP scanning of snow sheep (Ovis nivicola). Samples of snow sheep (n = 18) collected in six different regions of the Republic of Sakha (Yakutia) from 64° to 71° N. For SNP genotyping, we applied Ovine 50K SNP BeadChip (Illumina, United States), designed for domestic sheep. The total number of genotyped SNPs (call rate 90%) was 47796 (88.1% of total SNPs), wherein 1006 SNPs were polymorphic (2.1%). Principal component analysis (PCA) showed the clear differentiation within the species O. nivicola: studied individuals were distributed among five distinct arrays corresponding to the geographical locations of sampling points. Our results demonstrate that the DNA chip designed for domestic sheep can be successfully used to study the allele pool and the genetic structure of snow sheep populations. PMID:27599514

  2. Genome-wide analysis of DNA methylation in hepatoblastoma tissues

    PubMed Central

    Cui, Ximao; Liu, Baihui; Zheng, Shan; Dong, Kuiran; Dong, Rui

    2016-01-01

    DNA methylation has a crucial role in cancer biology. In the present study, a genome-wide analysis of DNA methylation in hepatoblastoma (HB) tissues was performed to verify differential methylation levels between HB and normal tissues. As alpha-fetoprotein (AFP) has a critical role in HB, AFP methylation levels were also detected using pyrosequencing. Normal and HB liver tissue samples (frozen tissue) were obtained from patients with HB. Genome-wide analysis of DNA methylation in these tissues was performed using an Infinium HumanMethylation450 BeadChip, and the results were confirmed with reverse transcription-quantitative polymerase chain reaction. The Infinium HumanMethylation450 BeadChip demonstrated distinctively less methylation in HB tissues than in non-tumor tissues. In addition, methylation enrichment was observed in positions near the transcription start site of AFP, which exhibited lower methylation levels in HB tissues than in non-tumor liver tissues. Lastly, a significant negative correlation was observed between AFP messenger RNA expression and DNA methylation percentage, using linear Pearson's R correlation coefficients. The present results demonstrate differential methylation levels between HB and normal tissues, and imply that aberrant methylation of AFP in HB could reflect HB development. Expansion of these findings could provide useful insight into HB biology. PMID:27446465

  3. Genome-Wide Patterns of Nucleotide Polymorphism in Domesticated Rice

    PubMed Central

    Hernandez, Ryan D; Boyko, Adam; Fledel-Alon, Adi; York, Thomas L; Polato, Nicholas R; Olsen, Kenneth M; Nielsen, Rasmus; McCouch, Susan R; Bustamante, Carlos D; Purugganan, Michael D

    2007-01-01

    Domesticated Asian rice (Oryza sativa) is one of the oldest domesticated crop species in the world, having fed more people than any other plant in human history. We report the patterns of DNA sequence variation in rice and its wild ancestor, O. rufipogon, across 111 randomly chosen gene fragments, and use these to infer the evolutionary dynamics that led to the origins of rice. There is a genome-wide excess of high-frequency derived single nucleotide polymorphisms (SNPs) in O. sativa varieties, a pattern that has not been reported for other crop species. We developed several alternative models to explain contemporary patterns of polymorphisms in rice, including a (i) selectively neutral population bottleneck model, (ii) bottleneck plus migration model, (iii) multiple selective sweeps model, and (iv) bottleneck plus selective sweeps model. We find that a simple bottleneck model, which has been the dominant demographic model for domesticated species, cannot explain the derived nucleotide polymorphism site frequency spectrum in rice. Instead, a bottleneck model that incorporates selective sweeps, or a more complex demographic model that includes subdivision and gene flow, are more plausible explanations for patterns of variation in domesticated rice varieties. If selective sweeps are indeed the explanation for the observed nucleotide data of domesticated rice, it suggests that strong selection can leave its imprint on genome-wide polymorphism patterns, contrary to expectations that selection results only in a local signature of variation. PMID:17907810

  4. Establishing an analytic pipeline for genome-wide DNA methylation.

    PubMed

    Wright, Michelle L; Dozmorov, Mikhail G; Wolen, Aaron R; Jackson-Cook, Colleen; Starkweather, Angela R; Lyon, Debra E; York, Timothy P

    2016-01-01

    The need for research investigating DNA methylation (DNAm) in clinical studies has increased, leading to the evolution of new analytic methods to improve accuracy and reproducibility of the interpretation of results from these studies. The purpose of this article is to provide clinical researchers with a summary of the major data processing steps routinely applied in clinical studies investigating genome-wide DNAm using the Illumina HumanMethylation 450K BeadChip. In most studies, the primary goal of employing DNAm analysis is to identify differential methylation at CpG sites among phenotypic groups. Experimental design considerations are crucial at the onset to minimize bias from factors related to sample processing and avoid confounding experimental variables with non-biological batch effects. Although there are currently no de facto standard methods for analyzing these data, we review the major steps in processing DNAm data recommended by several research studies. We describe several variations available for clinical researchers to process, analyze, and interpret DNAm data. These insights are applicable to most types of genome-wide DNAm array platforms and will be applicable for the next generation of DNAm array technologies (e.g., the 850K array). Selection of the DNAm analytic pipeline followed by investigators should be guided by the research question and supported by recently published methods. PMID:27127542

  5. Measuring genome-wide nucleosome turnover using CATCH-IT.

    PubMed

    Teves, Sheila S; Deal, Roger B; Henikoff, Steven

    2012-01-01

    The dynamic interplay between DNA-binding proteins and nucleosomes underlies essential nuclear processes such as transcription, replication, and DNA repair. Manifestations of this interplay include the assembly, eviction, and replacement of nucleosomes. Hence, measurements of nucleosome turnover kinetics can lead to insights into the regulation of dynamic chromatin processes. In this chapter, we describe a genome-wide method for measuring nucleosome turnover that uses metabolic labeling followed by capture of newly synthesized histones, which we have termed Covalent Attachment of Tagged Histones to Capture and Identify Turnover (CATCH-IT). Although CATCH-IT can be used with any genome-wide mapping procedure, high-resolution profiling is attainable using paired-end sequencing of native chromatin. Our protocol also includes an efficient Solexa DNA sequencing library preparation protocol that can be used for single base-pair resolution mapping of both nucleosome and subnucleosomal particles. We not only describe the use of these protocols in the context of a Drosophila cell line but also provide the necessary changes for adaptation to other model systems. PMID:22929769

  6. Genome-wide signatures of convergent evolution in echolocating mammals

    PubMed Central

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A.; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J.

    2013-01-01

    Evolution is typically thought to proceed through divergence of genes, proteins, and ultimately phenotypes1-3. However, similar traits might also evolve convergently in unrelated taxa due to similar selection pressures4,5. Adaptive phenotypic convergence is widespread in nature, and recent results from a handful of genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level6-9. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution9,10 although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show for the first time that convergence is not a rare process restricted to a handful of loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four new bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Surprisingly we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognised. PMID:24005325

  7. Genome-wide signatures of convergent evolution in echolocating mammals.

    PubMed

    Parker, Joe; Tsagkogeorga, Georgia; Cotton, James A; Liu, Yuan; Provero, Paolo; Stupka, Elia; Rossiter, Stephen J

    2013-10-10

    Evolution is typically thought to proceed through divergence of genes, proteins and ultimately phenotypes. However, similar traits might also evolve convergently in unrelated taxa owing to similar selection pressures. Adaptive phenotypic convergence is widespread in nature, and recent results from several genes have suggested that this phenomenon is powerful enough to also drive recurrent evolution at the sequence level. Where homoplasious substitutions do occur these have long been considered the result of neutral processes. However, recent studies have demonstrated that adaptive convergent sequence evolution can be detected in vertebrates using statistical methods that model parallel evolution, although the extent to which sequence convergence between genera occurs across genomes is unknown. Here we analyse genomic sequence data in mammals that have independently evolved echolocation and show that convergence is not a rare process restricted to several loci but is instead widespread, continuously distributed and commonly driven by natural selection acting on a small number of sites per locus. Systematic analyses of convergent sequence evolution in 805,053 amino acids within 2,326 orthologous coding gene sequences compared across 22 mammals (including four newly sequenced bat genomes) revealed signatures consistent with convergence in nearly 200 loci. Strong and significant support for convergence among bats and the bottlenose dolphin was seen in numerous genes linked to hearing or deafness, consistent with an involvement in echolocation. Unexpectedly, we also found convergence in many genes linked to vision: the convergent signal of many sensory genes was robustly correlated with the strength of natural selection. This first attempt to detect genome-wide convergent sequence evolution across divergent taxa reveals the phenomenon to be much more pervasive than previously recognized. PMID:24005325

  8. Bioinformatics challenges for genome-wide association studies

    PubMed Central

    Moore, Jason H.; Asselbergs, Folkert W.; Williams, Scott M.

    2010-01-01

    Motivation: The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype–phenotype relationship that is characterized by significant heterogeneity and gene–gene and gene–environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods. Contact: jason.h.moore@dartmouth.edu PMID:20053841

  9. Meta-analysis of genome-wide association from genomic prediction models

    Technology Transfer Automated Retrieval System (TEKTRAN)

    A limitation of many genome-wide association studies (GWA) in animal breeding is that there are many loci with small effect sizes; thus, larger sample sizes (N) are required to guarantee suitable power of detection. To increase sample size, results from different GWA can be combined in a meta-analys...

  10. Quantitative trait loci on chromosome 8q24 for pancreatic beta-cell function and 7q11 for insulin sensitivity in obese nondiabetic white and black families: evidence from genome-wide linkage scans in the NHLBI Hypertension Genetic Epidemiology Network (HyperGEN) study.

    PubMed

    An, Ping; Freedman, Barry I; Rich, Stephen S; Mandel, Stephen A; Arnett, Donna K; Myers, Richard H; Chen, Yii-Der I; Hunt, Steven C; Rao, D C

    2006-02-01

    Genome-wide linkage scans were carried out using a multipoint variance components method in white and black families of the NHLBI Hypertension Genetic Epidemiology Network (HyperGEN) study to identify quantitative trait loci (QTLs) for pancreatic beta-cell function and insulin sensitivity estimated through the newly released nonlinear computer version of homeostasis model assessment 2. Participants fasting <8 h, with diagnosed type 2 diabetes, or taking blood glucose or blood lipid-lowering medications were excluded. Both phenotypes were adjusted separately by race and sex for the effects of age, BMI, and field center before linkage scans using 370 microsatellite markers were performed. A total of 685 white families (1,180 sibpairs) and 773 black families (775 sibpairs) were evaluated as well as subsets including 267 obese white families (757 sibpairs) and 427 obese black families (599 sibpairs) identified through tree-linkage analyses using interacting covariates of age, sex, and BMI. For beta-cell function in the obese white families, significant (logarithm of odds [LOD] score >3.6) evidence supporting linkages was detected on chromosome 8q24 at D8S1179 (135 cM, LOD score 4.2, empirical P = 0.002) and at D8S1128 (140 cM, LOD score 3.7, empirical P = 0.003). In addition, two regions supported linkage for insulin sensitivity index in the obese black families on chromosome 7q11 at D7S3046 (79 cM, LOD score 3.0, empirical P = 0.018) and on chromosome 6q26 at D6S1277 (173 cM, LOD score 3.0, empirical P = 0.018). Reducing clinical heterogeneity using obesity data and improved estimates of beta-cell function and insulin sensitivity may have permitted identification of a QTL on chromosome 8q24 for beta-cell function in the presence of estimated insulin resistance and a QTL on chromosome 7q11 for insulin sensitivity. These regions replicate previous reports for type 2 diabetes-associated traits. PMID:16443794

  11. A New Method to Scan Genomes for Introgression in a Secondary Contact Model

    PubMed Central

    Geneva, Anthony J.; Muirhead, Christina A.; Kingan, Sarah B.; Garrigan, Daniel

    2015-01-01

    Secondary contact between divergent populations or incipient species may result in the exchange and introgression of genomic material. We develop a simple DNA sequence measure, called Gmin, which is designed to identify genomic regions experiencing introgression in a secondary contact model. Gmin is defined as the ratio of the minimum between-population number of nucleotide differences in a genomic window to the average number of between-population differences. Although it is conceptually simple, one advantage of Gmin is that it is computationally inexpensive relative to model-based methods for detecting gene flow and it scales easily to the level of whole-genome analysis. We compare the sensitivity and specificity of Gmin to those of the widely used index of population differentiation, FST, and suggest a simple statistical test for identifying genomic outliers. Extensive computer simulations demonstrate that Gmin has both greater sensitivity and specificity for detecting recent introgression than does FST. Furthermore, we find that the sensitivity of Gmin is robust with respect to both the population mutation and recombination rates. Finally, a scan of Gmin across the X chromosome of Drosophila melanogaster identifies candidate regions of introgression between sub-Saharan African and cosmopolitan populations that were previously missed by other methods. These results show that Gmin is a biologically straightforward, yet powerful, alternative to FST, as well as to more computationally intensive model-based methods for detecting gene flow. PMID:25874895

  12. Meta-analysis of genome-wide association studies of attention deficit/hyperactivity disorder

    PubMed Central

    Neale, Benjamin M; Medland, Sarah E.; Ripke, Stephan; Asherson, Philip; Franke, Barbara; Lesch, Klaus-Peter; Faraone, Stephen V.; Nguyen, Thuy Trang; Schäfer, Helmut; Holmans, Peter; Daly, Mark; Steinhausen, Hans-Christoph; Freitag, Christine; Reif, Andreas; Renner, Tobias J.; Romanos, Marcel; Romanos, Jasmin; Walitza, Susanne; Warnke, Andreas; Meyer, Jobst; Palmason, Haukur; Buitelaar, Jan; Vasquez, Alejandro Arias; Lambregts-Rommelse, Nanda; Gill, Michael; Anney, Richard J.L.; Langely, Kate; O’Donovan, Michael; Williams, Nigel; Owen, Michael; Thapar, Anita; Kent, Lindsey; Sergeant, Joseph; Roeyers, Herbert; Mick, Eric; Biederman, Joseph; Doyle, Alysa; Smalley, Susan; Loo, Sandra; Hakonarson, Hakon; Elia, Josephine; Todorov, Alexandre; Miranda, Ana; Mulas, Fernando; Ebstein, Richard P.; Rothenberger, Aribert; Banaschewski, Tobias; Oades, Robert D.; Sonuga-Barke, Edmund; McGough, James; Nisenbaum, Laura; Middleton, Frank; Hu, Xiaolan; Nelson, Stan

    2010-01-01

    Objective Although twin and family studies have shown Attention Deficit/Hyperactivity Disorder (ADHD) to be highly heritable, genetic variants influencing the trait at a genome-wide significant level have yet to be identified. As prior genome-wide association scans (GWAS) have not yielded significant results, we conducted a meta-analysis of existing studies to boost statistical power. Method We used data from four projects: a) the Children’s Hospital of Philadelphia (CHOP), b) phase I of the International Multicenter ADHD Genetics project (IMAGE), c) phase II of IMAGE (IMAGE II), and d) the Pfizer funded study from the University of California, Los Angeles, Washington University and the Massachusetts General Hospital (PUWMa). The final sample size consisted of 2,064 trios, 896 cases and 2,455 controls. For each study, we imputed HapMap SNPs, computed association test statistics and transformed them to Z-scores, and then combined weighted Z-scores in a meta-analysis. Results No genome-wide significant associations were found, although an analysis of candidate genes suggests they may be involved in the disorder. Conclusions Given that ADHD is a highly heritable disorder, our negative results suggest that the effects of common ADHD risk variants must, individually, be very small or that other types of variants, e.g. rare ones, account for much of the disorder’s heritability. PMID:20732625

  13. Genome-wide association study of antisocial personality disorder.

    PubMed

    Rautiainen, M-R; Paunio, T; Repo-Tiihonen, E; Virkkunen, M; Ollila, H M; Sulkava, S; Jolanki, O; Palotie, A; Tiihonen, J

    2016-01-01

    The pathophysiology of antisocial personality disorder (ASPD) remains unclear. Although the most consistent biological finding is reduced grey matter volume in the frontal cortex, about 50% of the total liability to developing ASPD has been attributed to genetic factors. The contributing genes remain largely unknown. Therefore, we sought to study the genetic background of ASPD. We conducted a genome-wide association study (GWAS) and a replication analysis of Finnish criminal offenders fulfilling DSM-IV criteria for ASPD (N=370, N=5850 for controls, GWAS; N=173, N=3766 for controls and replication sample). The GWAS resulted in suggestive associations of two clusters of single-nucleotide polymorphisms at 6p21.2 and at 6p21.32 at the human leukocyte antigen (HLA) region. Imputation of HLA alleles revealed an independent association with DRB1*01:01 (odds ratio (OR)=2.19 (1.53-3.14), P=1.9 × 10(-5)). Two polymorphisms at 6p21.2 LINC00951-LRFN2 gene region were replicated in a separate data set, and rs4714329 reached genome-wide significance (OR=1.59 (1.37-1.85), P=1.6 × 10(-9)) in the meta-analysis. The risk allele also associated with antisocial features in the general population conditioned for severe problems in childhood family (β=0.68, P=0.012). Functional analysis in brain tissue in open access GTEx and Braineac databases revealed eQTL associations of rs4714329 with LINC00951 and LRFN2 in cerebellum. In humans, LINC00951 and LRFN2 are both expressed in the brain, especially in the frontal cortex, which is intriguing considering the role of the frontal cortex in behavior and the neuroanatomical findings of reduced gray matter volume in ASPD. To our knowledge, this is the first study showing genome-wide significant and replicable findings on genetic variants associated with any personality disorder. PMID:27598967

  14. A Genome-Wide Association Study of Optic Disc Parameters

    PubMed Central

    Jansonius, Nomdo M.; de Jong, Paulus T. V. M.; Bergen, Arthur A. B.; Isaacs, Aaron; Amin, Najaf; Aulchenko, Yurii S.; Wolfs, Roger C. W.; Hofman, Albert; Rivadeneira, Fernando; Oostra, Ben A.; Uitterlinden, Andre G.; Hysi, Pirro; Hammond, Christopher J.; Lemij, Hans G.; Vingerling, Johannes R.

    2010-01-01

    The optic nerve head is involved in many ophthalmic disorders, including common diseases such as myopia and open-angle glaucoma. Two of the most important parameters are the size of the optic disc area and the vertical cup-disc ratio (VCDR). Both are highly heritable but genetically largely undetermined. We performed a meta-analysis of genome-wide association (GWA) data to identify genetic variants associated with optic disc area and VCDR. The gene discovery included 7,360 unrelated individuals from the population-based Rotterdam Study I and Rotterdam Study II cohorts. These cohorts revealed two genome-wide significant loci for optic disc area, rs1192415 on chromosome 1p22 (p = 6.72×10−19) within 117 kb of the CDC7 gene and rs1900004 on chromosome 10q21.3-q22.1 (p = 2.67×10−33) within 10 kb of the ATOH7 gene. They revealed two genome-wide significant loci for VCDR, rs1063192 on chromosome 9p21 (p = 6.15×10−11) in the CDKN2B gene and rs10483727 on chromosome 14q22.3-q23 (p = 2.93×10−10) within 40 kbp of the SIX1 gene. Findings were replicated in two independent Dutch cohorts (Rotterdam Study III and Erasmus Rucphen Family study; N = 3,612), and the TwinsUK cohort (N = 843). Meta-analysis with the replication cohorts confirmed the four loci and revealed a third locus at 16q12.1 associated with optic disc area, and four other loci at 11q13, 13q13, 17q23 (borderline significant), and 22q12.1 for VCDR. ATOH7 was also associated with VCDR independent of optic disc area. Three of the loci were marginally associated with open-angle glaucoma. The protein pathways in which the loci of optic disc area are involved overlap with those identified for VCDR, suggesting a common genetic origin. PMID:20548946

  15. A Comprehensive Genome-Wide Map of Autonomously Replicating Sequences in a Naive Genome

    PubMed Central

    Liachko, Ivan; Bhaskar, Anand; Lee, Chanmi; Chung, Shau Chee Claire

    2010-01-01

    Eukaryotic chromosomes initiate DNA synthesis from multiple replication origins. The machinery that initiates DNA synthesis is highly conserved, but the sites where the replication initiation proteins bind have diverged significantly. Functional comparative genomics is an obvious approach to study the evolution of replication origins. However, to date, the Saccharomyces cerevisiae replication origin map is the only genome map available. Using an iterative approach that combines computational prediction and functional validation, we have generated a high-resolution genome-wide map of DNA replication origins in Kluyveromyces lactis. Unlike other yeasts or metazoans, K. lactis autonomously replicating sequences (KlARSs) contain a 50 bp consensus motif suggestive of a dimeric structure. This motif is necessary and largely sufficient for initiation and was used to dependably identify 145 of the up to 156 non-repetitive intergenic ARSs projected for the K. lactis genome. Though similar in genome sizes, K. lactis has half as many ARSs as its distant relative S. cerevisiae. Comparative genomic analysis shows that ARSs in K. lactis and S. cerevisiae preferentially localize to non-syntenic intergenic regions, linking ARSs with loci of accelerated evolutionary change. PMID:20485513

  16. Genome Scan for Parent-of-Origin QTL Effects on Bovine Growth and Carcass Traits.

    PubMed

    Imumorin, Ikhide G; Kim, Eun-Hee; Lee, Yun-Mi; De Koning, Dirk-Jan; van Arendonk, Johan A; De Donato, Marcos; Taylor, Jeremy F; Kim, Jong-Joo

    2011-01-01

    Parent-of-origin effects (POE) such as genomic imprinting influence growth and body composition in livestock, rodents, and humans. Here, we report the results of a genome scan to detect quantitative trait loci (QTL) with POE on growth and carcass traits in Angus × Brahman cattle crossbreds. We identified 24 POE-QTL on 15 Bos taurus autosomes (BTAs) of which six were significant at 5% genome-wide (GW) level and 18 at the 5% chromosome-wide (CW) significance level. Six QTL were paternally expressed while 15 were maternally expressed. Three QTL influencing post-weaning growth map to the proximal end of BTA2 (linkage region of 0-9 cM; genomic region of 5.0-10.8 Mb), for which only one imprinted ortholog is known so far in the human and mouse genomes, and therefore may potentially represent a novel imprinted region. The detected QTL individually explained 1.4 ∼ 5.1% of each trait's phenotypic variance. Comparative in silico analysis of bovine genomic locations show that 32 out of 1,442 known mammalian imprinted genes from human and mouse homologs map to the identified QTL regions. Although several of the 32 genes have been associated with quantitative traits in cattle, only two (GNAS and PEG3) have experimental proof of being imprinted in cattle. These results lend additional support to recent reports that POE on quantitative traits in mammals may be more common than previously thought, and strengthen the need to identify and experimentally validate cattle orthologs of imprinted genes so as to investigate their effects on quantitative traits. PMID:22303340

  17. Genome Scan for Parent-of-Origin QTL Effects on Bovine Growth and Carcass Traits

    PubMed Central

    Imumorin, Ikhide G.; Kim, Eun-Hee; Lee, Yun-Mi; De Koning, Dirk-Jan; van Arendonk, Johan A.; De Donato, Marcos; Taylor, Jeremy F.; Kim, Jong-Joo

    2011-01-01

    Parent-of-origin effects (POE) such as genomic imprinting influence growth and body composition in livestock, rodents, and humans. Here, we report the results of a genome scan to detect quantitative trait loci (QTL) with POE on growth and carcass traits in Angus × Brahman cattle crossbreds. We identified 24 POE–QTL on 15 Bos taurus autosomes (BTAs) of which six were significant at 5% genome-wide (GW) level and 18 at the 5% chromosome-wide (CW) significance level. Six QTL were paternally expressed while 15 were maternally expressed. Three QTL influencing post-weaning growth map to the proximal end of BTA2 (linkage region of 0–9 cM; genomic region of 5.0–10.8 Mb), for which only one imprinted ortholog is known so far in the human and mouse genomes, and therefore may potentially represent a novel imprinted region. The detected QTL individually explained 1.4 ∼ 5.1% of each trait’s phenotypic variance. Comparative in silico analysis of bovine genomic locations show that 32 out of 1,442 known mammalian imprinted genes from human and mouse homologs map to the identified QTL regions. Although several of the 32 genes have been associated with quantitative traits in cattle, only two (GNAS and PEG3) have experimental proof of being imprinted in cattle. These results lend additional support to recent reports that POE on quantitative traits in mammals may be more common than previously thought, and strengthen the need to identify and experimentally validate cattle orthologs of imprinted genes so as to investigate their effects on quantitative traits. PMID:22303340

  18. Genome-wide characteristics of de novo mutations in autism

    PubMed Central

    Yuen, Ryan K C; Merico, Daniele; Cao, Hongzhi; Pellecchia, Giovanna; Alipanahi, Babak; Thiruvahindrapuram, Bhooma; Tong, Xin; Sun, Yuhui; Cao, Dandan; Zhang, Tao; Wu, Xueli; Jin, Xin; Zhou, Ze; Liu, Xiaomin; Nalpathamkalam, Thomas; Walker, Susan; Howe, Jennifer L.; Wang, Zhuozhi; MacDonald, Jeffrey R.; Chan, Ada; D’Abate, Lia; Deneault, Eric; Siu, Michelle T.; Tammimies, Kristiina; Uddin, Mohammed; Zarrei, Mehdi; Wang, Mingbang; Li, Yingrui; Wang, Jun; Wang, Jian; Yang, Huanming; Bookman, Matt; Bingham, Jonathan; Gross, Samuel S.; Loy, Dion; Pletcher, Mathew; Marshall, Christian R.; Anagnostou, Evdokia; Zwaigenbaum, Lonnie; Weksberg, Rosanna; Fernandez, Bridget A; Roberts, Wendy; Szatmari, Peter; Glazer, David; Frey, Brendan J.; Ring, Robert H.; Xu, Xun; Scherer, Stephen W.

    2016-01-01

    De novo mutations (DNMs) are important in Autism Spectrum Disorder (ASD), but so far analyses have mainly been on the ~1.5% of the genome encoding genes. Here, we performed whole genome sequencing (WGS) of 200 ASD parent-child trios and characterized germline and somatic DNMs. We confirmed that the majority of germline DNMs (75.6%) originated from the father, and these increased significantly with paternal age only (p=4.2×10−10). However, when clustered DNMs (those within 20kb) were found in ASD, not only did they mostly originate from the mother (p=7.7×10−13), but they could also be found adjacent to de novo copy number variations (CNVs) where the mutation rate was significantly elevated (p=2.4×10−24). By comparing DNMs detected in controls, we found a significant enrichment of predicted damaging DNMs in ASD cases (p=8.0×10−9; OR=1.84), of which 15.6% (p=4.3×10−3) and 22.5% (p=7.0×10−5) were in the non-coding or genic non-coding, respectively. The non-coding elements most enriched for DNM were untranslated regions of genes, boundaries involved in exon-skipping and DNase I hypersensitive regions. Using microarrays and a novel outlier detection test, we also found aberrant methylation profiles in 2/185 (1.1%) of ASD cases. These same individuals carried independently identified DNMs in the ASD risk- and epigenetic- genes DNMT3A and ADNP. Our data begins to characterize different genome-wide DNMs, and highlight the contribution of non-coding variants, to the etiology of ASD. PMID:27525107

  19. Genome-wide significant risk associations for mucinous ovarian carcinoma

    PubMed Central

    Kelemen, Linda E.; Lawrenson, Kate; Tyrer, Jonathan; Li, Qiyuan; M. Lee, Janet; Seo, Ji-Heui; Phelan, Catherine M.; Beesley, Jonathan; Chen, Xiaoqin; Spindler, Tassja J.; Aben, Katja K.H.; Anton-Culver, Hoda; Antonenkova, Natalia; Baker, Helen; Bandera, Elisa V.; Bean, Yukie; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bruinsma, Fiona; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Chen, Y. Ann; Chen, Zhihua; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas T.; Edwards, Robert P.; Eilber, Ursula; Ekici, Arif B.; Engelholm, Svend Aage; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goode, Ellen L.; Goodman, Marc T.; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hasmad, Hanis Nazihah; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A.T.; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus; Hosono, Satoyo; Iversen, Edwin S.; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kellar, Melissa; Kelley, Joseph L.; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F.A.G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Modugno, Francesmary; Moes-Sosnowska, Joanna; Moysich, Kirsten B.; Narod, Steven A.; Nedergaard, Lotte; Ness, Roberta B.; Nevanlinna, Heli; Azmi, Mat Adenan Noor; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M.; Permuth-Wey, Jennifer; Pike, Malcolm C.; Poole, Elizabeth M.; Ramus, Susan J.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schildkraut, Joellen M.; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Sucheston, Lara; Tangen, Ingvild L.; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J; Tworoger, Shelley S.; van Altena, Anne M.; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A.; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wlodzimierz, Sawicki; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna H.; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A.; Freedman, Matthew L.; Chenevix-Trench, Georgia; Pharoah, Paul D.; Gayther, Simon A.; Berchuck, Andrew

    2015-01-01

    Genome-wide association studies have identified several risk associations for ovarian carcinomas (OC) but not for mucinous ovarian carcinomas (MOC). Genotypes from OC cases and controls were imputed into the 1000 Genomes Project reference panel. Analysis of 1,644 MOC cases and 21,693 controls identified three novel risk associations: rs752590 at 2q13 (P = 3.3 × 10−8), rs711830 at 2q31.1 (P = 7.5 × 10−12) and rs688187 at 19q13.2 (P = 6.8 × 10−13). Expression Quantitative Trait Locus (eQTL) analysis in ovarian and colorectal tumors (which are histologically similar to MOC) identified significant eQTL associations for HOXD9 at 2q31.1 in ovarian (P = 4.95 × 10−4, FDR = 0.003) and colorectal (P = 0.01, FDR = 0.09) tumors, and for PAX8 at 2q13 in colorectal tumors (P = 0.03, FDR = 0.09). Chromosome conformation capture analysis identified interactions between the HOXD9 promoter and risk SNPs at 2q31.1. Overexpressing HOXD9 in MOC cells augmented the neoplastic phenotype. These findings provide the first evidence for MOC susceptibility variants and insights into the underlying biology of the disease. PMID:26075790

  20. cgmisc: enhanced genome-wide association analyses and visualization

    PubMed Central

    Kierczak, Marcin; Jabłońska, Jagoda; Forsberg, Simon K. G.; Bianchi, Matteo; Tengvall, Katarina; Pettersson, Mats; Scholz, Veronika; Meadows, Jennifer R. S.; Jern, Patric; Carlborg, Örjan; Lindblad-Toh, Kerstin

    2015-01-01

    Summary: High-throughput genotyping and sequencing technologies facilitate studies of complex genetic traits and provide new research opportunities. The increasing popularity of genome-wide association studies (GWAS) leads to the discovery of new associated loci and a better understanding of the genetic architecture underlying not only diseases, but also other monogenic and complex phenotypes. Several softwares are available for performing GWAS analyses, R environment being one of them. Results: We present cgmisc, an R package that enables enhanced data analysis and visualization of results from GWAS. The package contains several utilities and modules that complement and enhance the functionality of the existing software. It also provides several tools for advanced visualization of genomic data and utilizes the power of the R language to aid in preparation of publication-quality figures. Some of the package functions are specific for the domestic dog (Canis familiaris) data. Availability and implementation: The package is operating system-independent and is available from: https://github.com/cgmisc-team/cgmisc Contact: marcin.kierczak@imbim.uu.se Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26249815

  1. Realizing privacy preserving genome-wide association studies

    PubMed Central

    Simmons, Sean; Berger, Bonnie

    2016-01-01

    Motivation: As genomics moves into the clinic, there has been much interest in using this medical data for research. At the same time the use of such data raises many privacy concerns. These circumstances have led to the development of various methods to perform genome-wide association studies (GWAS) on patient records while ensuring privacy. In particular, there has been growing interest in applying differentially private techniques to this challenge. Unfortunately, up until now all methods for finding high scoring SNPs in a differentially private manner have had major drawbacks in terms of either accuracy or computational efficiency. Results: Here we overcome these limitations with a substantially modified version of the neighbor distance method for performing differentially private GWAS, and thus are able to produce a more viable mechanism. Specifically, we use input perturbation and an adaptive boundary method to overcome accuracy issues. We also design and implement a convex analysis based algorithm to calculate the neighbor distance for each SNP in constant time, overcoming the major computational bottleneck in the neighbor distance method. It is our hope that methods such as ours will pave the way for more widespread use of patient data in biomedical research. Availability and implementation: A python implementation is available at http://groups.csail.mit.edu/cb/DiffPriv/. Contact: bab@csail.mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:26769317

  2. Genome-Wide Association Studies of the Human Gut Microbiota.

    PubMed

    Davenport, Emily R; Cusanovich, Darren A; Michelini, Katelyn; Barreiro, Luis B; Ober, Carole; Gilad, Yoav

    2015-01-01

    The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both). These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%). For example, we identified an association between a taxon known to affect obesity (genus Akkermansia) and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL) mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10-7). Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut. PMID:26528553

  3. Genome-Wide Discriminatory Information Patterns of Cytosine DNA Methylation.

    PubMed

    Sanchez, Robersy; Mackenzie, Sally A

    2016-01-01

    Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes I R and (2) the uncertainty of not observing a SNP L C R . We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on I R and on LCR, respectively. A statistical-physical relationship between I R and L C R was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251

  4. Genome-Wide Association Studies of the Human Gut Microbiota

    PubMed Central

    Davenport, Emily R.; Cusanovich, Darren A.; Michelini, Katelyn; Barreiro, Luis B.; Ober, Carole; Gilad, Yoav

    2015-01-01

    The bacterial composition of the human fecal microbiome is influenced by many lifestyle factors, notably diet. It is less clear, however, what role host genetics plays in dictating the composition of bacteria living in the gut. In this study, we examined the association of ~200K host genotypes with the relative abundance of fecal bacterial taxa in a founder population, the Hutterites, during two seasons (n = 91 summer, n = 93 winter, n = 57 individuals collected in both). These individuals live and eat communally, minimizing variation due to environmental exposures, including diet, which could potentially mask small genetic effects. Using a GWAS approach that takes into account the relatedness between subjects, we identified at least 8 bacterial taxa whose abundances were associated with single nucleotide polymorphisms in the host genome in each season (at genome-wide FDR of 20%). For example, we identified an association between a taxon known to affect obesity (genus Akkermansia) and a variant near PLD1, a gene previously associated with body mass index. Moreover, we replicate a previously reported association from a quantitative trait locus (QTL) mapping study of fecal microbiome abundance in mice (genus Lactococcus, rs3747113, P = 3.13 x 10−7). Finally, based on the significance distribution of the associated microbiome QTLs in our study with respect to chromatin accessibility profiles, we identified tissues in which host genetic variation may be acting to influence bacterial abundance in the gut. PMID:26528553

  5. Genome-wide association studies in pharmacogenetics research debate

    PubMed Central

    Bailey, Kent R; Cheng, Cheng

    2016-01-01

    Will genome-wide association studies (GWAS) ‘work’ for pharmacogenetics research? This question was the topic of a staged debate, with pro and con sides, aimed to bring out the strengths and weaknesses of GWAS for pharmacogenetics studies. After a full day of seminars at the Fifth Statistical Analysis Workshop of the Pharmacogenetics Research Network, the lively debate was held – appropriately – at Goonies Comedy Club in Rochester (MN, USA). The pro side emphasized that the many GWAS successes for identifying genetic variants associated with disease risk show that it works; that the current genotyping platforms are efficient, with good imputation methods to fill in missing data; that its global assessment is always a success even if no significant associations are detected; and that genetic effects are likely to be large because humans have not evolved in a drug-therapy environment. By contrast, the con side emphasized that we have limited knowledge of the complexity of the genome; limited clinical phenotypes compromise studies; the likely multifactorial nature of drug response clouding the small genetic effects; and limitations of sample size and replication studies in pharmacogenetic studies. Lively and insightful discussions emphasized further research efforts that might benefit GWAS in pharmacogenetics. PMID:20235786

  6. Genome-Wide Analysis of Human Metapneumovirus Evolution

    PubMed Central

    Kim, Jin Il; Park, Sehee; Lee, Ilseob; Park, Kwang Sook; Kwak, Eun Jung; Moon, Kwang Mee; Lee, Chang Kyu; Bae, Joon-Yong; Park, Man-Seong; Song, Ki-Joon

    2016-01-01

    Human metapneumovirus (HMPV) has been described as an important etiologic agent of upper and lower respiratory tract infections, especially in young children and the elderly. Most of school-aged children might be introduced to HMPVs, and exacerbation with other viral or bacterial super-infection is common. However, our understanding of the molecular evolution of HMPVs remains limited. To address the comprehensive evolutionary dynamics of HMPVs, we report a genome-wide analysis of the eight genes (N, P, M, F, M2, SH, G, and L) using 103 complete genome sequences. Phylogenetic reconstruction revealed that the eight genes from one HMPV strain grouped into the same genetic group among the five distinct lineages (A1, A2a, A2b, B1, and B2). A few exceptions of phylogenetic incongruence might suggest past recombination events, and we detected possible recombination breakpoints in the F, SH, and G coding regions. The five genetic lineages of HMPVs shared quite remote common ancestors ranging more than 220 to 470 years of age with the most recent origins for the A2b sublineage. Purifying selection was common, but most protein genes except the F and M2-2 coding regions also appeared to experience episodic diversifying selection. Taken together, these suggest that the five lineages of HMPVs maintain their individual evolutionary dynamics and that recombination and selection forces might work on shaping the genetic diversity of HMPVs. PMID:27046055

  7. Genome-Wide Discriminatory Information Patterns of Cytosine DNA Methylation

    PubMed Central

    Sanchez, Robersy; Mackenzie, Sally A.

    2016-01-01

    Cytosine DNA methylation (CDM) is a highly abundant, heritable but reversible chemical modification to the genome. Herein, a machine learning approach was applied to analyze the accumulation of epigenetic marks in methylomes of 152 ecotypes and 85 silencing mutants of Arabidopsis thaliana. In an information-thermodynamics framework, two measurements were used: (1) the amount of information gained/lost with the CDM changes IR and (2) the uncertainty of not observing a SNP LCR. We hypothesize that epigenetic marks are chromosomal footprints accounting for different ontogenetic and phylogenetic histories of individual populations. A machine learning approach is proposed to verify this hypothesis. Results support the hypothesis by the existence of discriminatory information (DI) patterns of CDM able to discriminate between individuals and between individual subpopulations. The statistical analyses revealed a strong association between the topologies of the structured population of Arabidopsis ecotypes based on IR and on LCR, respectively. A statistical-physical relationship between IR and LCR was also found. Results to date imply that the genome-wide distribution of CDM changes is not only part of the biological signal created by the methylation regulatory machinery, but ensures the stability of the DNA molecule, preserving the integrity of the genetic message under continuous stress from thermal fluctuations in the cell environment. PMID:27322251

  8. Genome-wide linkage-disequilibrium profiles from single individuals.

    PubMed

    Lynch, Michael; Xu, Sen; Maruki, Takahiro; Jiang, Xiaoqian; Pfaffelhuber, Peter; Haubold, Bernhard

    2014-09-01

    Although the analysis of linkage disequilibrium (LD) plays a central role in many areas of population genetics, the sampling variance of LD is known to be very large with high sensitivity to numbers of nucleotide sites and individuals sampled. Here we show that a genome-wide analysis of the distribution of heterozygous sites within a single diploid genome can yield highly informative patterns of LD as a function of physical distance. The proposed statistic, the correlation of zygosity, is closely related to the conventional population-level measure of LD, but is agnostic with respect to allele frequencies and hence likely less prone to outlier artifacts. Application of the method to several vertebrate species leads to the conclusion that >80% of recombination events are typically resolved by gene-conversion-like processes unaccompanied by crossovers, with the average lengths of conversion patches being on the order of one to several kilobases in length. Thus, contrary to common assumptions, the recombination rate between sites does not scale linearly with distance, often even up to distances of 100 kb. In addition, the amount of LD between sites separated by <200 bp is uniformly much greater than can be explained by the conventional neutral model, possibly because of the nonindependent origin of mutations within this spatial scale. These results raise questions about the application of conventional population-genetic interpretations to LD on short spatial scales and also about the use of spatial patterns of LD to infer demographic histories. PMID:24948778

  9. Progress of genome wide association study in domestic animals

    PubMed Central

    2012-01-01

    Domestic animals are invaluable resources for study of the molecular architecture of complex traits. Although the mapping of quantitative trait loci (QTL) responsible for economically important traits in domestic animals has achieved remarkable results in recent decades, not all of the genetic variation in the complex traits has been captured because of the low density of markers used in QTL mapping studies. The genome wide association study (GWAS), which utilizes high-density single-nucleotide polymorphism (SNP), provides a new way to tackle this issue. Encouraging achievements in dissection of the genetic mechanisms of complex diseases in humans have resulted from the use of GWAS. At present, GWAS has been applied to the field of domestic animal breeding and genetics, and some advances have been made. Many genes or markers that affect economic traits of interest in domestic animals have been identified. In this review, advances in the use of GWAS in domestic animals are described. PMID:22958308

  10. Genome-wide association studies in pediatric chronic kidney disease.

    PubMed

    Gupta, Jayanta; Kanetsky, Peter A; Wuttke, Matthias; Köttgen, Anna; Schaefer, Franz; Wong, Craig S

    2016-08-01

    The genome-wide association study (GWAS) has become an established scientific method that provides an unbiased screen for genetic loci potentially associated with phenotypes of clinical interest, such as chronic kidney disease (CKD). Thus, GWAS provides opportunities to gain new perspectives regarding the genetic architecture of CKD progression by identifying new candidate genes and targets for intervention. As such, it has become an important arm of translational science providing a complementary line of investigation to identify novel therapeutics to treat CKD. In this review, we describe the method and the challenges of performing GWAS in the pediatric CKD population. We also provide an overview of successful GWAS for kidney disease, and we discuss the established pediatric CKD cohorts in North America and Europe that are poised to identify genetic risk variants associated with CKD progression. PMID:26490952

  11. Ultrafast laser nanosurgery in microfluidics for genome-wide screenings

    PubMed Central

    Ben-Yakar, Adela; Bourgeois, Frederic

    2009-01-01

    Summary The use of ultrafast laser pulses in surgery has allowed for unprecedented precision with minimal collateral damage to surrounding tissues. For these reasons, ultrafast laser nanosurgery, as an injury model, has gained tremendous momentum in experimental biology ranging from in-vitro manipulations of subcellular structures to in-vivo studies in whole living organisms. For example, femtosecond laser nanosurgery on such model organism as the nematode Caenorhabditis elegans (C. elegans) has opened new opportunities for in-vivo nerve regeneration studies. Meanwhile, the development of novel microfluidic devices has brought the control in experimental environment to the level required for precise nanosurgery in various animal models. Merging microfluidics and laser nanosurgery has recently improved the specificities and increased the speed of laser surgeries enabling fast genome-wide screenings that can more readily decode the genetic map of various biological processes. PMID:19278850

  12. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Landt, Margarita CT Slof-Op t; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O'Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2015-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10-7) in SOX2OT and rs17030795 (P=5.84×10-6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10-6) between CUL3 and FAM124B and rs1886797 (P=8.05×10-6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4×10-6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  13. A genome-wide association study of anorexia nervosa

    PubMed Central

    Boraska, Vesna; Franklin, Christopher S; Floyd, James AB; Thornton, Laura M; Huckins, Laura M; Southam, Lorraine; Rayner, N William; Tachmazidou, Ioanna; Klump, Kelly L; Treasure, Janet; Lewis, Cathryn M; Schmidt, Ulrike; Tozzi, Federica; Kiezebrink, Kirsty; Hebebrand, Johannes; Gorwood, Philip; Adan, Roger AH; Kas, Martien JH; Favaro, Angela; Santonastaso, Paolo; Fernández-Aranda, Fernando; Gratacos, Monica; Rybakowski, Filip; Dmitrzak-Weglarz, Monika; Kaprio, Jaakko; Keski-Rahkonen, Anna; Raevuori, Anu; Van Furth, Eric F; Slof-Op t Landt, Margarita CT; Hudson, James I; Reichborn-Kjennerud, Ted; Knudsen, Gun Peggy S; Monteleone, Palmiero; Kaplan, Allan S; Karwautz, Andreas; Hakonarson, Hakon; Berrettini, Wade H; Guo, Yiran; Li, Dong; Schork, Nicholas J.; Komaki, Gen; Ando, Tetsuya; Inoko, Hidetoshi; Esko, Tõnu; Fischer, Krista; Männik, Katrin; Metspalu, Andres; Baker, Jessica H; Cone, Roger D; Dackor, Jennifer; DeSocio, Janiece E; Hilliard, Christopher E; O’Toole, Julie K; Pantel, Jacques; Szatkiewicz, Jin P; Taico, Chrysecolla; Zerwas, Stephanie; Trace, Sara E; Davis, Oliver SP; Helder, Sietske; Bühren, Katharina; Burghardt, Roland; de Zwaan, Martina; Egberts, Karin; Ehrlich, Stefan; Herpertz-Dahlmann, Beate; Herzog, Wolfgang; Imgart, Hartmut; Scherag, André; Scherag, Susann; Zipfel, Stephan; Boni, Claudette; Ramoz, Nicolas; Versini, Audrey; Brandys, Marek K; Danner, Unna N; de Kovel, Carolien; Hendriks, Judith; Koeleman, Bobby PC; Ophoff, Roel A; Strengman, Eric; van Elburg, Annemarie A; Bruson, Alice; Clementi, Maurizio; Degortes, Daniela; Forzan, Monica; Tenconi, Elena; Docampo, Elisa; Escaramís, Geòrgia; Jiménez-Murcia, Susana; Lissowska, Jolanta; Rajewski, Andrzej; Szeszenia-Dabrowska, Neonila; Slopien, Agnieszka; Hauser, Joanna; Karhunen, Leila; Meulenbelt, Ingrid; Slagboom, P Eline; Tortorella, Alfonso; Maj, Mario; Dedoussis, George; Dikeos, Dimitris; Gonidakis, Fragiskos; Tziouvas, Konstantinos; Tsitsika, Artemis; Papezova, Hana; Slachtova, Lenka; Martaskova, Debora; Kennedy, James L.; Levitan, Robert D.; Yilmaz, Zeynep; Huemer, Julia; Koubek, Doris; Merl, Elisabeth; Wagner, Gudrun; Lichtenstein, Paul; Breen, Gerome; Cohen-Woods, Sarah; Farmer, Anne; McGuffin, Peter; Cichon, Sven; Giegling, Ina; Herms, Stefan; Rujescu, Dan; Schreiber, Stefan; Wichmann, H-Erich; Dina, Christian; Sladek, Rob; Gambaro, Giovanni; Soranzo, Nicole; Julia, Antonio; Marsal, Sara; Rabionet, Raquel; Gaborieau, Valerie; Dick, Danielle M; Palotie, Aarno; Ripatti, Samuli; Widén, Elisabeth; Andreassen, Ole A; Espeseth, Thomas; Lundervold, Astri; Reinvang, Ivar; Steen, Vidar M; Le Hellard, Stephanie; Mattingsdal, Morten; Ntalla, Ioanna; Bencko, Vladimir; Foretova, Lenka; Janout, Vladimir; Navratilova, Marie; Gallinger, Steven; Pinto, Dalila; Scherer, Stephen; Aschauer, Harald; Carlberg, Laura; Schosser, Alexandra; Alfredsson, Lars; Ding, Bo; Klareskog, Lars; Padyukov, Leonid; Finan, Chris; Kalsi, Gursharan; Roberts, Marion; Logan, Darren W; Peltonen, Leena; Ritchie, Graham RS; Barrett, Jeffrey C; Estivill, Xavier; Hinney, Anke; Sullivan, Patrick F; Collier, David A; Zeggini, Eleftheria; Bulik, Cynthia M

    2013-01-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2,907 cases with AN from 14 countries (15 sites) and 14,860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery datasets. Seventy-six (72 independent) SNPs were taken forward for in silico (two datasets) or de novo (13 datasets) replication genotyping in 2,677 independent AN cases and 8,629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication datasets comprised 5,551 AN cases and 21,080 controls. AN subtype analyses (1,606 AN restricting; 1,445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01×10−7) in SOX2OT and rs17030795 (P=5.84×10−6) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76×10−6) between CUL3 and FAM124B and rs1886797 (P=8.05×10−6) near SPATA13. Comparing discovery to replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P= 4×10−6), strongly suggesting that true findings exist but that our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:21079607

  14. A genome-wide association study of anorexia nervosa.

    PubMed

    Boraska, V; Franklin, C S; Floyd, J A B; Thornton, L M; Huckins, L M; Southam, L; Rayner, N W; Tachmazidou, I; Klump, K L; Treasure, J; Lewis, C M; Schmidt, U; Tozzi, F; Kiezebrink, K; Hebebrand, J; Gorwood, P; Adan, R A H; Kas, M J H; Favaro, A; Santonastaso, P; Fernández-Aranda, F; Gratacos, M; Rybakowski, F; Dmitrzak-Weglarz, M; Kaprio, J; Keski-Rahkonen, A; Raevuori, A; Van Furth, E F; Slof-Op 't Landt, M C T; Hudson, J I; Reichborn-Kjennerud, T; Knudsen, G P S; Monteleone, P; Kaplan, A S; Karwautz, A; Hakonarson, H; Berrettini, W H; Guo, Y; Li, D; Schork, N J; Komaki, G; Ando, T; Inoko, H; Esko, T; Fischer, K; Männik, K; Metspalu, A; Baker, J H; Cone, R D; Dackor, J; DeSocio, J E; Hilliard, C E; O'Toole, J K; Pantel, J; Szatkiewicz, J P; Taico, C; Zerwas, S; Trace, S E; Davis, O S P; Helder, S; Bühren, K; Burghardt, R; de Zwaan, M; Egberts, K; Ehrlich, S; Herpertz-Dahlmann, B; Herzog, W; Imgart, H; Scherag, A; Scherag, S; Zipfel, S; Boni, C; Ramoz, N; Versini, A; Brandys, M K; Danner, U N; de Kovel, C; Hendriks, J; Koeleman, B P C; Ophoff, R A; Strengman, E; van Elburg, A A; Bruson, A; Clementi, M; Degortes, D; Forzan, M; Tenconi, E; Docampo, E; Escaramís, G; Jiménez-Murcia, S; Lissowska, J; Rajewski, A; Szeszenia-Dabrowska, N; Slopien, A; Hauser, J; Karhunen, L; Meulenbelt, I; Slagboom, P E; Tortorella, A; Maj, M; Dedoussis, G; Dikeos, D; Gonidakis, F; Tziouvas, K; Tsitsika, A; Papezova, H; Slachtova, L; Martaskova, D; Kennedy, J L; Levitan, R D; Yilmaz, Z; Huemer, J; Koubek, D; Merl, E; Wagner, G; Lichtenstein, P; Breen, G; Cohen-Woods, S; Farmer, A; McGuffin, P; Cichon, S; Giegling, I; Herms, S; Rujescu, D; Schreiber, S; Wichmann, H-E; Dina, C; Sladek, R; Gambaro, G; Soranzo, N; Julia, A; Marsal, S; Rabionet, R; Gaborieau, V; Dick, D M; Palotie, A; Ripatti, S; Widén, E; Andreassen, O A; Espeseth, T; Lundervold, A; Reinvang, I; Steen, V M; Le Hellard, S; Mattingsdal, M; Ntalla, I; Bencko, V; Foretova, L; Janout, V; Navratilova, M; Gallinger, S; Pinto, D; Scherer, S W; Aschauer, H; Carlberg, L; Schosser, A; Alfredsson, L; Ding, B; Klareskog, L; Padyukov, L; Courtet, P; Guillaume, S; Jaussent, I; Finan, C; Kalsi, G; Roberts, M; Logan, D W; Peltonen, L; Ritchie, G R S; Barrett, J C; Estivill, X; Hinney, A; Sullivan, P F; Collier, D A; Zeggini, E; Bulik, C M

    2014-10-01

    Anorexia nervosa (AN) is a complex and heritable eating disorder characterized by dangerously low body weight. Neither candidate gene studies nor an initial genome-wide association study (GWAS) have yielded significant and replicated results. We performed a GWAS in 2907 cases with AN from 14 countries (15 sites) and 14 860 ancestrally matched controls as part of the Genetic Consortium for AN (GCAN) and the Wellcome Trust Case Control Consortium 3 (WTCCC3). Individual association analyses were conducted in each stratum and meta-analyzed across all 15 discovery data sets. Seventy-six (72 independent) single nucleotide polymorphisms were taken forward for in silico (two data sets) or de novo (13 data sets) replication genotyping in 2677 independent AN cases and 8629 European ancestry controls along with 458 AN cases and 421 controls from Japan. The final global meta-analysis across discovery and replication data sets comprised 5551 AN cases and 21 080 controls. AN subtype analyses (1606 AN restricting; 1445 AN binge-purge) were performed. No findings reached genome-wide significance. Two intronic variants were suggestively associated: rs9839776 (P=3.01 × 10(-7)) in SOX2OT and rs17030795 (P=5.84 × 10(-6)) in PPP3CA. Two additional signals were specific to Europeans: rs1523921 (P=5.76 × 10(-)(6)) between CUL3 and FAM124B and rs1886797 (P=8.05 × 10(-)(6)) near SPATA13. Comparing discovery with replication results, 76% of the effects were in the same direction, an observation highly unlikely to be due to chance (P=4 × 10(-6)), strongly suggesting that true findings exist but our sample, the largest yet reported, was underpowered for their detection. The accrual of large genotyped AN case-control samples should be an immediate priority for the field. PMID:24514567

  15. Genome-Wide Association Studies for Comb Traits in Chickens.

    PubMed

    Shen, Manman; Qu, Liang; Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61-0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  16. Deciphering Genome-Environment-Wide Interactions Using Exposed Subjects Only

    PubMed Central

    Zhao, Lue Ping; Fan, Wenhong; Goodman, Gary; Radich, Jerry; Martin, Paul

    2015-01-01

    The recent successes of genome-wide association studies (GWAS) have renewed interest in genome-environment-wide interaction studies (GEWIS) to discover genetic factors that modulate penetrance of environmental exposures to human diseases. Indeed, gene-environment interactions (GxE), which have not been emphasized in the GWAS era, could be a source contributing to the missing heritability, a major bottleneck limiting continuing GWAS successes. In this manuscript, we describe a design and analytic strategy to focus on GxE using only exposed subjects, dubbed as e-GEWIS. Operationally, an e-GEWIS analysis is equivalent to a GWAS analysis on exposed subjects only, and it has actually been used in some earlier GWAS without being explicitly identified as such. Through both analytics and simulations, e-GEWIS have been shown better efficiency than the usual cross-product-based analysis of GxE interaction with both cases and controls (cc-GEWIS), and they have comparable efficiency to case-only analysis of GxE (c-GEWIS), with potentially smaller sample sizes. The formalization of e-GEWIS here provides a theoretical basis to legitimize this framework for routine investigation of GxE, for more efficient GxE study designs, and for improvement of reproducibility in replicating GEWIS findings. As an illustration, we apply e-GEWIS to a lung cancer GWAS dataset to perform a GEWIS, focusing on gene and smoking interaction. The e-GEWIS analysis successfully uncovered positive genetic associations on chromosome 15 among current smokers, suggesting a gene-smoking interaction. While this signal was detected earlier, the current finding here serves as a positive control in support of this e-GEWIS strategy. PMID:25694100

  17. Genome-wide linkage in Utah autism pedigrees.

    PubMed

    Allen-Brady, K; Robison, R; Cannon, D; Varvil, T; Villalobos, M; Pingree, C; Leppert, M F; Miller, J; McMahon, W M; Coon, H

    2010-10-01

    Genetic studies of autism over the past decade suggest a complex landscape of multiple genes. In the face of this heterogeneity, studies that include large extended pedigrees may offer valuable insights, as the relatively few susceptibility genes within single large families may be more easily discerned. This genome-wide screen of 70 families includes 20 large extended pedigrees of 6-9 generations, 6 moderate-sized families of 4-5 generations and 44 smaller families of 2-3 generations. The Center for Inherited Disease Research (CIDR) provided genotyping using the Illumina Linkage Panel 12, a 6K single-nucleotide polymorphism (SNP) platform. Results from 192 subjects with an autism spectrum disorder (ASD) and 461 of their relatives revealed genome-wide significance on chromosome 15q, with three possibly distinct peaks: 15q13.1-q14 (heterogeneity LOD (HLOD)=4.09 at 29 459 872 bp); 15q14-q21.1 (HLOD=3.59 at 36 837 208 bp); and 15q21.1-q22.2 (HLOD=5.31 at 55 629 733 bp). Two of these peaks replicate earlier findings. There were additional suggestive results on chromosomes 2p25.3-p24.1 (HLOD=1.87), 7q31.31-q32.3 (HLOD=1.97) and 13q12.11-q12.3 (HLOD=1.93). Affected subjects in families supporting the linkage peaks found in this study did not reveal strong evidence for distinct phenotypic subgroups. PMID:19455147

  18. Genome-Wide Association Studies for Comb Traits in Chickens

    PubMed Central

    Ma, Meng; Dou, Taocun; Lu, Jian; Guo, Jun; Hu, Yuping; Yi, Guoqiang; Yuan, Jingwei; Sun, Congjiao; Wang, Kehua; Yang, Ning

    2016-01-01

    The comb, as a secondary sexual character, is an important trait in chicken. Indicators of comb length (CL), comb height (CH), and comb weight (CW) are often selected in production. DNA-based marker-assisted selection could help chicken breeders to accelerate genetic improvement for comb or related economic characters by early selection. Although a number of quantitative trait loci (QTL) and candidate genes have been identified with advances in molecular genetics, candidate genes underlying comb traits are limited. The aim of the study was to use genome-wide association (GWA) studies by 600 K Affymetrix chicken SNP arrays to detect genes that are related to comb, using an F2 resource population. For all comb characters, comb exhibited high SNP-based heritability estimates (0.61–0.69). Chromosome 1 explained 20.80% genetic variance, while chromosome 4 explained 6.89%. Independent univariate genome-wide screens for each character identified 127, 197, and 268 novel significant SNPs with CL, CH, and CW, respectively. Three candidate genes, VPS36, AR, and WNT11B, were determined to have a plausible function in all comb characters. These genes are important to the initiation of follicle development, gonadal growth, and dermal development, respectively. The current study provides the first GWA analysis for comb traits. Identification of the genetic basis as well as promising candidate genes will help us understand the underlying genetic architecture of comb development and has practical significance in breeding programs for the selection of comb as an index for sexual maturity or reproduction. PMID:27427764

  19. Systems-Level Analysis of Genome-Wide Association Data

    PubMed Central

    Farber, Charles R.

    2013-01-01

    Genome-wide association studies (GWAS) have emerged as the method of choice for identifying common variants affecting complex disease. In a GWAS, particular attention is placed, for obvious reasons, on single-nucleotide polymorphisms (SNPs) that exceed stringent genome-wide significance thresholds. However, it is expected that many SNPs with only nominal evidence of association (e.g., P < 0.05) truly influence disease. Efforts to extract additional biological information from entire GWAS datasets have primarily focused on pathway-enrichment analyses. However, these methods suffer from a number of limitations and typically fail to lead to testable hypotheses. To evaluate alternative approaches, we performed a systems-level analysis of GWAS data using weighted gene coexpression network analysis. A weighted gene coexpression network was generated for 1918 genes harboring SNPs that displayed nominal evidence of association (P ≤ 0.05) from a GWAS of bone mineral density (BMD) using microarray data on circulating monocytes isolated from individuals with extremely low or high BMD. Thirteen distinct gene modules were identified, each comprising coexpressed and highly interconnected GWAS genes. Through the characterization of module content and topology, we illustrate how network analysis can be used to discover disease-associated subnetworks and characterize novel interactions for genes with a known role in the regulation of BMD. In addition, we provide evidence that network metrics can be used as a prioritizing tool when selecting genes and SNPs for replication studies. Our results highlight the advantages of using systems-level strategies to add value to and inform GWAS. PMID:23316444

  20. The genome-wide structure of the Jewish people.

    PubMed

    Behar, Doron M; Yunusbayev, Bayazit; Metspalu, Mait; Metspalu, Ene; Rosset, Saharon; Parik, Jüri; Rootsi, Siiri; Chaubey, Gyaneshwer; Kutuev, Ildus; Yudkovsky, Guennady; Khusnutdinova, Elza K; Balanovsky, Oleg; Semino, Ornella; Pereira, Luisa; Comas, David; Gurwitz, David; Bonne-Tamir, Batsheva; Parfitt, Tudor; Hammer, Michael F; Skorecki, Karl; Villems, Richard

    2010-07-01

    Contemporary Jews comprise an aggregate of ethno-religious communities whose worldwide members identify with each other through various shared religious, historical and cultural traditions. Historical evidence suggests common origins in the Middle East, followed by migrations leading to the establishment of communities of Jews in Europe, Africa and Asia, in what is termed the Jewish Diaspora. This complex demographic history imposes special challenges in attempting to address the genetic structure of the Jewish people. Although many genetic studies have shed light on Jewish origins and on diseases prevalent among Jewish communities, including studies focusing on uniparentally and biparentally inherited markers, genome-wide patterns of variation across the vast geographic span of Jewish Diaspora communities and their respective neighbours have yet to be addressed. Here we use high-density bead arrays to genotype individuals from 14 Jewish Diaspora communities and compare these patterns of genome-wide diversity with those from 69 Old World non-Jewish populations, of which 25 have not previously been reported. These samples were carefully chosen to provide comprehensive comparisons between Jewish and non-Jewish populations in the Diaspora, as well as with non-Jewish populations from the Middle East and north Africa. Principal component and structure-like analyses identify previously unrecognized genetic substructure within the Middle East. Most Jewish samples form a remarkably tight subcluster that overlies Druze and Cypriot samples but not samples from other Levantine populations or paired Diaspora host populations. In contrast, Ethiopian Jews (Beta Israel) and Indian Jews (Bene Israel and Cochini) cluster with neighbouring autochthonous populations in Ethiopia and western India, respectively, despite a clear paternal link between the Bene Israel and the Levant. These results cast light on the variegated genetic architecture of the Middle East, and trace the origins

  1. Dissecting ancestry genomic background in substance dependence genome-wide association studies

    PubMed Central

    Polimanti, Renato; Yang, Can; Zhao, Hongyu; Gelernter, Joel

    2015-01-01

    Aims To understand the role of ancestral genomic background in substance dependence (SD) genome-wide association studies (GWAS), we analyzed population diversity at genetic loci associated with SD traits and evaluated its effect on GWAS outcomes. Materials & methods We investigated 24 genes with variants associated with SD by GWAS; and 82 loci with putative subordinate roles with respect to SD-associated genes. Results We observed high ancestry-related frequency differences in common functional alleles in GWAS relevant genes and their interactive partners. Common functional alleles with high frequency differences demonstrated significant effects on the GWAS outcomes. Conclusion Population differences in SD GWAS outcomes seem not to be influenced by general variation across the genome, but by ancestry-related local haplotype structures at SD-associated loci. PMID:26267224

  2. Genome wide identification of regulatory motifs in Bacillus subtilis

    PubMed Central

    Mwangi, Michael M; Siggia, Eric D

    2003-01-01

    Background To explain the vastly different phenotypes exhibited by the same organism under different conditions, it is essential that we understand how the organism's genes are coordinately regulated. While there are many excellent tools for predicting sequences encoding proteins or RNA genes, few algorithms exist to predict regulatory sequences on a genome wide scale with no prior information. Results To identify motifs involved in the control of transcription, an algorithm was developed that searches upstream of operons for improbably frequent dimers. The algorithm was applied to the B. subtilis genome, which is predicted to encode for approximately 200 DNA binding proteins. The dimers found to be over-represented could be clustered into 317 distinct groups, each thought to represent a class of motifs uniquely recognized by some transcription factor. For each cluster of dimers, a representative weight matrix was derived and scored over the regions upstream of the operons to predict the sites recognized by the cluster's factor, and a putative regulon of the operons immediately downstream of the sites was inferred. The distribution in number of operons per predicted regulon is comparable to that for well characterized transcription factors. The most highly over-represented dimers matched σA, the T-box, and σW sites. We have evidence to suggest that at least 52 of our clusters of dimers represent actual regulatory motifs, based on the groups' weight matrix matches to experimentally characterized sites, the functional similarity of the component operons of the groups' regulons, and the positional biases of the weight matrix matches. All predictions are assigned a significance value, and thresholds are set to avoid false positives. Where possible, we examine our false negatives, drawing examples from known regulatory motifs and regulons inferred from RNA expression data. Conclusions We have demonstrated that in the case of B. subtilis our algorithm allows for the

  3. Genome scan for nonadditive heterotic trait loci reveals mainly underdominant effects in Saccharomyces cerevisiae.

    PubMed

    Laiba, Efrat; Glikaite, Ilana; Levy, Yael; Pasternak, Zohar; Fridman, Eyal

    2016-04-01

    The overdominant model of heterosis explains the superior phenotype of hybrids by synergistic allelic interaction within heterozygous loci. To map such genetic variation in yeast, we used a population doubling time dataset of Saccharomyces cerevisiae 16 × 16 diallel and searched for major contributing heterotic trait loci (HTL). Heterosis was observed for the majority of hybrids, as they surpassed their best parent growth rate. However, most of the local heterozygous loci identified by genome scan were surprisingly underdominant, i.e., reduced growth. We speculated that in these loci adverse effects on growth resulted from incompatible allelic interactions. To test this assumption, we eliminated these allelic interactions by creating hybrids with local hemizygosity for the underdominant HTLs, as well as for control random loci. Growth of hybrids was indeed elevated for most hemizygous to HTL genes but not for control genes, hence validating the results of our genome scan. Assessing the consequences of local heterozygosity by reciprocal hemizygosity and allele replacement assays revealed the influence of genetic background on the underdominant effects of HTLs. Overall, this genome-wide study on a multi-parental hybrid population provides a strong argument against single gene overdominance as a major contributor to heterosis, and favors the dominance complementation model. PMID:26967146

  4. Genome scan for linkage to Gilles de la Tourette syndrome

    SciTech Connect

    Barr, C.L.; Livingston, J.; Williamson, R.

    1994-09-01

    Gilles de la Tourette Syndrome (TS) is a familial, neuropsychiatric disorder characterized by chronic, intermittent motor and vocal tics. In addition to tics, affected individuals frequently display symptoms such as attention-deficit hyperactivity disorder and/or obsessive compulsive disorder. Genetic analyses of family data have suggested that susceptibility to the disorder is most likely due to a single genetic locus with a dominant mode of transmission and reduced penetrance. In the search for genetic linkage for TS, we have collected well-characterized pedigrees with multiple affected individuals on whom extensive diagnostic evaluations have been done. The first stage of our study is to scan the genome systematically using a panel of uniformly spaced (10 to 20 cM), highly polymorphic, microsatellite markers on 5 families segregating TS. To date, 290 markers have been typed and 3,660 non-overlapping cM of the genome have been excluded for possible linkage under the assumption of genetic homogeneity. Because of the possibility of locus heterogeneity overall summed exclusion is not considered tantamount to absolute exclusion of a disease locus in that region. The results from each family are carefully evaluated and a positive lod score in a single family is followed up by typing closely linked markers. Linkage to TS was examined by two-point analysis using the following genetic model: single autosomal dominant gene with gene frequency .003 and maximum penetrance of .99. An age-of-onset correction is included using a linear function increasing from age 2 years to 21 years. A small rate of phenocopies is also incorporated into the model. Only individuals with TS or CMT according to DSM III-R criteria were regarded as affected for the purposes of this summary. Additional markers are being tested to provide coverage at 5 cM intervals. Moreover, we are currently analyzing the data non-parametrically using the Affected-Pedigree-Member Method of linkage analysis.

  5. Detecting Gene-Environment Interactions for a Quantitative Trait in a Genome-Wide Association Study.

    PubMed

    Zhang, Pingye; Lewinger, Juan Pablo; Conti, David; Morrison, John L; Gauderman, W James

    2016-07-01

    A genome-wide association study (GWAS) typically is focused on detecting marginal genetic effects. However, many complex traits are likely to be the result of the interplay of genes and environmental factors. These SNPs may have a weak marginal effect and thus unlikely to be detected from a scan of marginal effects, but may be detectable in a gene-environment (G × E) interaction analysis. However, a genome-wide interaction scan (GWIS) using a standard test of G × E interaction is known to have low power, particularly when one corrects for testing multiple SNPs. Two 2-step methods for GWIS have been previously proposed, aimed at improving efficiency by prioritizing SNPs most likely to be involved in a G × E interaction using a screening step. For a quantitative trait, these include a method that screens on marginal effects [Kooperberg and Leblanc, 2008] and a method that screens on variance heterogeneity by genotype [Paré et al., 2010] In this paper, we show that the Paré et al. approach has an inflated false-positive rate in the presence of an environmental marginal effect, and we propose an alternative that remains valid. We also propose a novel 2-step approach that combines the two screening approaches, and provide simulations demonstrating that the new method can outperform other GWIS approaches. Application of this method to a G × Hispanic-ethnicity scan for childhood lung function reveals a SNP near the MARCO locus that was not identified by previous marginal-effect scans. PMID:27230133

  6. Genomic Research and Wide Data Sharing: Views of Prospective Participants

    PubMed Central

    Trinidad, Susan Brown; Fullerton, Stephanie M.; Bares, Julie M.; Jarvik, Gail P.; Larson, Eric B.; Burke, Wylie

    2011-01-01

    Purpose Sharing study data within the research community generates tension between two important goods: promoting scientific goals and protecting the privacy interests of study participants. The present study was designed to explore the perceptions, beliefs, and attitudes of research participants and possible future participants regarding genome-wide association studies (GWAS) and repository-based research. Methods Focus group sessions with (1) current research participants, (2) surrogate decision-makers, and (3) three age-defined cohorts (18–34 years, 35–50, >50). Results Participants expressed a variety of opinions about the acceptability of wide sharing of genetic and phenotypic information for research purposes through large, publicly accessible data repositories. Most believed that making de-identified study data available to the research community is a social good that should be pursued. Privacy and confidentiality concerns were common, though they would not necessarily preclude participation. Many participants voiced reservations about sharing data with for-profit organizations. Conclusions Trust is central in participants’ views regarding GWAS data sharing. Further research is needed to develop governance models that enact the values of stewardship. PMID:20535021

  7. Genomic resources and their influence on the detection of the signal of positive selection in genome scans.

    PubMed

    Manel, S; Perrier, C; Pratlong, M; Abi-Rached, L; Paganini, J; Pontarotti, P; Aurelle, D

    2016-01-01

    Genome scans represent powerful approaches to investigate the action of natural selection on the genetic variation of natural populations and to better understand local adaptation. This is very useful, for example, in the field of conservation biology and evolutionary biology. Thanks to Next Generation Sequencing, genomic resources are growing exponentially, improving genome scan analyses in non-model species. Thousands of SNPs called using Reduced Representation Sequencing are increasingly used in genome scans. Besides, genome sequences are also becoming increasingly available, allowing better processing of short-read data, offering physical localization of variants, and improving haplotype reconstruction and data imputation. Ultimately, genome sequences are also becoming the raw material for selection inferences. Here, we discuss how the increasing availability of such genomic resources, notably genome sequences, influences the detection of signals of selection. Mainly, increasing data density and having the information of physical linkage data expand genome scans by (i) improving the overall quality of the data, (ii) helping the reconstruction of demographic history for the population studied to decrease false-positive rates and (iii) improving the statistical power of methods to detect the signal of selection. Of particular importance, the availability of a high-quality reference genome can improve the detection of the signal of selection by (i) allowing matching the potential candidate loci to linked coding regions under selection, (ii) rapidly moving the investigation to the gene and function and (iii) ensuring that the highly variable regions of the genomes that include functional genes are also investigated. For all those reasons, using reference genomes in genome scan analyses is highly recommended. PMID:26562485

  8. A genome-wide linkage analysis of dementia in the Amish

    PubMed Central

    Hahs, Daniel W.; McCauley, Jacob L.; Crunk, Amy E.; McFarland, Lynne L.; Gaskell, Perry C.; Jiang, Lan; Slifer, Susan H.; Vance, Jeffery M.; Scott, William K.; Welsh-Bohmer, Kathleen A.; Johnson, Stephanie R.; Jackson, Charles E.; Pericak-Vance, Margaret A.; Haines, Jonathan L.

    2008-01-01

    Susceptibility genes for Alzheimer's disease are proving to be highly challenging to detect and verify. Population heterogeneity may be a significant confounding factor contributing to this difficulty. To increase the power for disease susceptibility gene detection we conducted a genome-wide genetic linkage screen using individuals from the relatively isolated, genetically homogeneous, Amish population. Our genome linkage analysis used a 407 microsatellite marker map (average density 7 cM) to search for autosomal genes linked to dementia in five Amish families from four Midwestern U.S. counties. Our highest two-point lod score (3.01) was observed at marker D4S1548 on chromosome 4q31. Five other regions (10q22, 3q28, 11p13, 4q28, 19p13) also demonstrated suggestive linkage with markers having two-point lod scores >2.0. While two of these regions are novel (4q31 and 11p13), the other regions lie close to regions identified in previous genome scans in other populations. Our results identify regions of the genome that may harbor genes involved in a subset of dementia patients, in particular the North American Amish community. PMID:16389594

  9. Genome-wide pathway analysis of genome-wide association studies on systemic lupus erythematosus and rheumatoid arthritis.

    PubMed

    Lee, Young Ho; Bae, Sang-Cheol; Choi, Sung Jae; Ji, Jong Dae; Song, Gwan Gyu

    2012-12-01

    The aim of this study was to explore candidate single nucleotide polymorphisms (SNPs) and candidate mechanisms of systemic lupus erythematosus (SLE) and rheumatoid arthritis (RA). Two SLE genome-wide association studies (GWASs) datasets were included in this study. Meta-analysis was conducted using 737,984 SNPs in 1,527 SLE cases and 3,421 controls of European ancestry, and 4,429 SNPs that met a threshold of p < 0.01 in a Korean RA GWAS dataset was used. ICSNPathway (identify candidate causal SNPs and pathways) analysis was applied to the meta-analysis results of the SLE GWAS datasets, and a RA GWAS dataset. The most significant result of SLE GWAS meta-analysis concerned rs2051549 in the human leukocyte antigen (HLA) region (p = 3.36E-22). In the non-HLA region, meta-analysis identified 6 SNPs associated with SLE with genome-wide significance (STAT4, TNPO3, BLK, FAM167A, and IRF5). ICSNPathway identified five candidate causal SNPs and 13 candidate causal pathways. This pathway-based analysis provides three hypotheses of the biological mechanism involved. First, rs8084 and rs7192 → HLA-DRA → bystander B cell activation. Second, rs1800629 → TNF → cytokine network. Third, rs1150752 and rs185819 → TNXB → collagen metabolic process. ICSNPathway analysis identified three candidate causal non-HLA SNPs and four candidate causal pathways involving the PADI4, MTR, PADI2, and TPH2 genes of RA. We identified five candidate SNPs and thirteen pathways, involving bystander B cell activation, cytokine network, and collagen metabolic processing, which may contribute to SLE susceptibility, and we revealed candidate causal non-HLA SNPs, genes, and pathways of RA. PMID:23053960

  10. Genome-wide screening and identification of antigens for rickettsial vaccine development

    Technology Transfer Automated Retrieval System (TEKTRAN)

    The capacity to identify immunogens for vaccine development by genome-wide screening has been markedly enhanced by the availability of complete microbial genome sequences coupled to rapid proteomic and bioinformatic analysis. Critical to this genome-wide screening is in vivo testing in the context o...