Patterns of linkage disequilibrium at PARK16 may explain variances in genetic association studies.
Li, Huihua; Teo, Yik-Ying; Tan, Eng-King
2015-09-01
Reproducing genomewide association studies findings in different populations is challenging, because the reproducibility fundamentally relies on the similar patterns of linkage disequilibrium between the unknown causal variants and the genotyped single-nucleotide polymorphisms (SNPs). The PARK16 locus was reported to alter the risk of Parkinson's disease (PD) in genomewide association studies in Japanese and Caucasians. We evaluated the regional linkage disequilibrium pattern at PARK16 locus in Caucasians, Japanese, and Chinese from HapMap and Chinese, Malays, and Indians from the Singapore Genome Variation Project, using the traditional heatmaps and targeted analysis of PARK16 gene via Monte Carlo simulation through varLD scores of these ethnic groups. One hundred SNPs in Caucasians, 95 SNPs in Chinese, 78 SNPs in Japanese from HapMap, 86 SNPs in Chinese, 99 SNPs in Indians, and 97 SNPs in Malays from the Singapore Genome Variation Project were included. Our targeted analysis showed that the linkage disequilibrium pattern of SNPs close to rs947211 was similar in Caucasians and Asians, including Chinese, Japanese, and Malay (all P > 0.0001), whereas different linkage disequilibrium patterns around rs823128, rs823156, and rs708730 were found between Caucasians and these Asian groups (all P < 0.0001). Our study suggests a higher chance to detect the association between rs947211 and PD in Chinese, Malay, and other Caucasian groups because of the similar linkage disequilibrium pattern around rs947211. The associations between rs823128/rs823156/rs708730 and PD are more likely to be replicated in Chinese and Malay populations. © 2015 International Parkinson and Movement Disorder Society.
Genotyping-by-sequencing enables linkage mapping in three octoploid cultivated strawberry families
Salinas, Natalia; Tennessen, Jacob A.; Zurn, Jason D.; Sargent, Daniel James; Hancock, James; Bassil, Nahla V.
2017-01-01
Genotyping-by-sequencing (GBS) was used to survey genome-wide single-nucleotide polymorphisms (SNPs) in three biparental strawberry (Fragaria × ananassa) populations with the goal of evaluating this technique in a species with a complex octoploid genome. GBS sequence data were aligned to the F. vesca ‘Fvb’ reference genome in order to call SNPs. Numbers of polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30–65 linkage groups were produced from the SNP sets derived from each parent. The linkage groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a given parent aligned to any particular chromosome. A phylogenetic analysis performed using the POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca for each chromosome. Linkage groups that were most similar to a second ancestral species, F. iinumae, were only resolved for Fvb 4. The quantity of missing data and heterogeneity in genome coverage inherent in GBS complicated the analysis, but POLiMAPS resolved F. × ananassa chromosomal regions derived from diploid ancestor F. vesca. PMID:28875078
Exploiting Genome Structure in Association Analysis
Kim, Seyoung
2014-01-01
Abstract A genome-wide association study involves examining a large number of single-nucleotide polymorphisms (SNPs) to identify SNPs that are significantly associated with the given phenotype, while trying to reduce the false positive rate. Although haplotype-based association methods have been proposed to accommodate correlation information across nearby SNPs that are in linkage disequilibrium, none of these methods directly incorporated the structural information such as recombination events along chromosome. In this paper, we propose a new approach called stochastic block lasso for association mapping that exploits prior knowledge on linkage disequilibrium structure in the genome such as recombination rates and distances between adjacent SNPs in order to increase the power of detecting true associations while reducing false positives. Following a typical linear regression framework with the genotypes as inputs and the phenotype as output, our proposed method employs a sparsity-enforcing Laplacian prior for the regression coefficients, augmented by a first-order Markov process along the sequence of SNPs that incorporates the prior information on the linkage disequilibrium structure. The Markov-chain prior models the structural dependencies between a pair of adjacent SNPs, and allows us to look for association SNPs in a coupled manner, combining strength from multiple nearby SNPs. Our results on HapMap-simulated datasets and mouse datasets show that there is a significant advantage in incorporating the prior knowledge on linkage disequilibrium structure for marker identification under whole-genome association. PMID:21548809
Kharrat, Najla; Abdelmouleh, Wafa; Abdelhedi, Rania; Alfadhli, Suad; Rebai, Ahmed
2012-01-01
DNA variations within the Angiotensin-Converting Enzyme (ACE) gene have been shown to be involved in the aetiology of several common diseases and the therapeutic response. This study reports a comparison of haplotype analysis of five SNPs in the ACE gene region using a sample of 100 healthy subjects derived from five different populations (Tunisian, Iranian, Kuwaiti, Bahraini and Indian). Strong linkage disequilibrium was found among all SNPs studied for all populations. Two SNPs (rs1800764 and rs4340) were identified as key SNPs for all populations. These SNPs will be valuable for future effective association studies of the ACE gene polymorphisms in Arab and Asian populations.
Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing
2014-01-01
Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
PRKCA: A Positional Candidate Gene for Body Mass Index and Asthma
Murphy, Amy; Tantisira, Kelan G.; Soto-Quirós, Manuel E.; Avila, Lydiana; Klanderman, Barbara J.; Lake, Stephen; Weiss, Scott T.; Celedón, Juan C.
2009-01-01
Asthma incidence and prevalence are higher in obese individuals. A potential mechanistic basis for this relationship is pleiotropy. We hypothesized that significant linkage and candidate-gene association would be found for body mass index (BMI) in a population ascertained on asthma affection status. Linkage analysis for BMI was performed on 657 subjects in eight Costa Rican families enrolled in a study of asthma. Family-based association studies were conducted for BMI with SNPs within a positional candidate gene, PRKCA. SNPs within PRKCA were also tested for association with asthma. Association studies were conducted in 415 Costa Rican parent-child trios and 493 trios participating in the Childhood Asthma Management Program (CAMP). Although only modest evidence of linkage for BMI was obtained for the whole cohort, significant linkage was noted for BMI in females on chromosome 17q (peak LOD = 3.39). Four SNPs in a candidate gene in this region (PRKCA) had unadjusted association p values < 0.05 for BMI in both cohorts, with the joint p value for two SNPs remaining significant after adjustment for multiple comparisons (rs228883 and rs1005651, joint p values = 9.5 × 10−5 and 5.6 × 10−5). Similarly, eight SNPs had unadjusted association p values < 0.05 for asthma in both populations, with one SNP remaining significant after adjustment for multiple comparisons (rs11079657, joint p value = 2.6 × 10−5). PRKCA is a pleiotropic locus that is associated with both BMI and asthma and that has been identified via linkage analysis of BMI in a population ascertained on asthma. PMID:19576566
Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N
2017-05-12
The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S 1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bin) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
2017-01-01
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
Caucasian Families Exhibit Significant Linkage of Myopia to Chromosome 11p.
Musolf, Anthony M; Simpson, Claire L; Moiz, Bilal A; Long, Kyle A; Portas, Laura; Murgia, Federico; Ciner, Elise B; Stambolian, Dwight; Bailey-Wilson, Joan E
2017-07-01
Myopia is a common visual disorder caused by eye overgrowth, resulting in blurry vision. It affects one in four Americans, and its prevalence is increasing. The genetic mechanisms that underpin myopia are not completely understood. Here, we use genotype data and linkage analyses to identify high-risk genetic loci that are significantly linked to myopia. Individuals from 56 Caucasian families with a history of myopia were genotyped on an exome-based array, and the single nucleotide polymorphism (SNP) data were merged with microsatellite genotype data. Refractive error measures on the samples were converted into binary phenotypes consisting of affected, unaffected, or unknown myopia status. Parametric linkage analyses assuming an autosomal dominant model with 90% penetrance and 10% phenocopy rate were performed. Single variant two-point analyses yielded three significantly linked SNPs at 11p14.1 and 11p11.2; a further 45 SNPs at 11p were found to be suggestive. No other chromosome had any significant SNPs or more than seven suggestive linkages. Two of the significant SNPs were located in BBOX1-AS1 and one in the intergenic region between ORA47 and TRIM49B. Collapsed haplotype pattern two-point analysis and multipoint analyses also yielded multiple suggestively linked genes at 11p. Multipoint analysis also identified suggestive evidence of linkage on 20q13. We identified three genome-wide significant linked variants on 11p for myopia in Caucasians. Although the novel specific signals still need to be replicated, 11p is a promising region that has been identified by other linkage studies with a number of potentially interesting candidate genes. We hope that the identification of these regions on 11p as potential causal regions for myopia will lead to more focus on these regions and maybe possible replication of our specific linkage peaks in other studies. We further plan targeted sequencing on 11p for our most highly linked families to more clearly understand the source of the linkage in this region.
Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D C; Bouchard, Claude
2010-06-01
A genome-wide linkage scan identified a quantitative trait locus for exercise training-induced changes in submaximal exercise (50 W) heart rate (DeltaHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (n=472). To fine-map the region, 1450 tag SNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with DeltaHR50 was observed with 2 single-nucleotide polymorphisms (SNPs) located in the 5' region of the cAMP-responsive element-binding protein 1 (CREB1) gene (rs2253206: P=1.6x10(-5) and rs2360969: P=4.3x10(-5)). The associations remained significant (P=0.01 and P=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analysis identified 9 SNPs that collectively explained 20% of the DeltaHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs approximately 1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, 6 SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: The log of the odds score dropped from 2.10 to 0.41 after adjusting for all 6 SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (P<0.05) lower promoter activity than the minor allele. Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with DeltaHR50 and explains a considerable proportion of the quantitative trait locus variance. However, at least 5 additional SNPs seem to be required to fully account for the original linkage signal.
McCaskie, Pamela A; Carter, Kim W; McCaskie, Simon R; Palmer, Lyle J
2005-01-01
We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes. PMID:16451612
Hellwege, Jacklyn N; Palmer, Nicholette D; Mark Brown, W; Brown, Mark W; Ziegler, Julie T; Sandy An, S; An, Sandy S; Guo, Xiuqing; Ida Chen, Y-D; Chen, Ida Y-D; Taylor, Kent; Hawkins, Gregory A; Ng, Maggie C Y; Speliotes, Elizabeth K; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Wagenknecht, Lynne E; Langefeld, Carl D; Bowden, Donald W
2015-02-01
We previously identified a low-frequency (1.1 %) coding variant (G45R; rs200573126) in the adiponectin gene (ADIPOQ) which was the basis for a multipoint microsatellite linkage signal (LOD = 8.2) for plasma adiponectin levels in Hispanic families. We have empirically evaluated the ability of data from targeted common variants, exome chip genotyping, and genome-wide association study data to detect linkage and association to adiponectin protein levels at this locus. Simple two-point linkage and association analyses were performed in 88 Hispanic families (1,150 individuals) using 10,958 SNPs on chromosome 3. Approaches were compared for their ability to map the functional variant, G45R, which was strongly linked (two-point LOD = 20.98) and powerfully associated (p value = 8.1 × 10(-50)). Over 450 SNPs within a broad 61 Mb interval around rs200573126 showed nominal evidence of linkage (LOD > 3) but only four other SNPs in this region were associated with p values < 1.0 × 10(-4). When G45R was accounted for, the maximum LOD score across the interval dropped to 4.39 and the best p value was 1.1 × 10(-5). Linked and/or associated variants ranged in frequency (0.0018-0.50) and type (coding, non-coding) and had little detectable linkage disequilibrium with rs200573126 (r (2) < 0.20). In addition, the two-point linkage approach empirically outperformed multipoint microsatellite and multipoint SNP analysis. In the absence of data for rs200573126, family-based linkage analysis using a moderately dense SNP dataset, including both common and low-frequency variants, resulted in stronger evidence for an adiponectin locus than association data alone. Thus, linkage analysis can be a useful tool to facilitate identification of high-impact genetic variants.
2011-01-01
Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers. PMID:21767361
Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.
Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M
2001-11-14
The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. To investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Family-based tests of association, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P =.03; SNP 9i, P =.04; and SNP 11, P =.04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P =.11, and SNP 9iii, P =.87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P =.009) and a negative association with another haplotype (P =.007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.
Amundsen, Silja Svanstrøm; Adamovic, Svetlana; Hellqvist, Asa; Nilsson, Staffan; Gudjónsdóttir, Audur H; Ascher, Henry; Ek, Johan; Larsson, Kristina; Wahlström, Jan; Lie, Benedicte A; Sollid, Ludvig M; Naluai, Asa Torinsson
2007-09-01
Celiac disease (CD) is a gluten-induced enteropathy, which results from the interplay between environmental and genetic factors. There is a strong human leukocyte antigen (HLA) association with the disease, and HLA-DQ alleles represent a major genetic risk factor. In addition to HLA-DQ, non-HLA genes appear to be crucial for CD development. Chromosomal region 5q31-33 has demonstrated linkage with CD in several genome-wide studies, including in our Swedish/Norwegian cohort. In a European meta-analysis 5q31-33 was the only region that reached a genome-wide level of significance except for the HLA region. To identify the genetic variant(s) responsible for this linkage signal, we performed a comprehensive single nucleotide polymorphism (SNP) association screen in 97 Swedish/Norwegian multiplex families who demonstrate linkage to the region. We selected tag SNPs from a 16 Mb region representing the 95% confidence interval of the linkage peak. A total of 1,404 SNPs were used for the association analysis. We identified several regions with SNPs demonstrating moderate single- or multipoint associations. However, the isolated association signals appeared insufficient to account for the linkage signal seen in our cohort. Collective effects of multiple risk genes within the region, incomplete genetic coverage or effects related to copy number variation are possible explanations for our findings.
Nguyen, Nguyen H; Rastas, Pasi M A; Premachandra, H K A; Knibb, Wayne
2018-01-01
The genetic resources available for the commercially important fish species Yellowtail kingfish (YTK) ( Seriola lalandi) are relative sparse. To overcome this, we aimed (1) to develop a linkage map for this species, and (2) to identify markers/variants associated with economically important traits in kingfish (with an emphasis on body weight). Genetic and genomic analyses were conducted using 13,898 single nucleotide polymorphisms (SNPs) generated from a new high-throughput genotyping by sequencing platform, Diversity Arrays Technology (DArTseq TM ) in a pedigreed population comprising 752 animals. The linkage analysis enabled to map about 4,000 markers to 24 linkage groups (LGs), with an average density of 3.4 SNPs per cM. The linkage map was integrated into a genome-wide association study (GWAS) and identified six variants/SNPs associated with body weight ( P < 5e -8 ) when a multi-locus mixed model was used. Two out of the six significant markers were mapped to LGs 17 and 23, and collectively they explained 5.8% of the total genetic variance. It is concluded that the newly developed linkage map and the significantly associated markers with body weight provide fundamental information to characterize genetic architecture of growth-related traits in this population of YTK S. lalandi .
SNP Discovery and Linkage Map Construction in Cultivated Tomato
Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi
2010-01-01
Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984
Sillén, Anna; Brohede, Jesper; Forsell, Charlotte; Lilius, Lena; Andrade, Jorge; Odeberg, Jacob; Kimura, Toru; Winblad, Bengt; Graff, Caroline
2011-01-01
We have previously reported the results of an extended genome-wide scan of Swedish Alzheimer disease (AD)-affected families; in this paper, we analyzed a subset of these families with autopsy-confirmed AD. We report the fine-mapping, using both microsatellite markers and single-nucleotide polymorphisms (SNPs), in the observed maximum logarithm of the odds (LOD)-2 unit (LOD(max)-2) region under the identified linkage peak, linkage analysis of the fine-mapping data with additionally analyzed pedigrees, and association analysis of SNPs selected from candidate genes in the linked interval. The subset was made on the criterion of at least one autopsy-confirmed AD case per family, resulting in 24 families. Linkage analysis of a family subset having at least one autopsy-confirmed AD case showed a significant nonparametric single-point LOD score of 4.4 in 8q24. Fine-mapping under the linkage peak with 10 microsatellite markers yielded an increase in the multipoint (mpt) LOD score from 2.1 to 3.0. SNP genotyping was performed on 21 selected candidate transcripts of the LOD(max)-2 region. Both family-based association and linkage analysis were performed on extended material from 30 families, resulting in a suggestive linkage at peak marker rs6577853 (mpt LOD score = 2.4). The 8q24 region has been implicated to be involved in AD etiology. Copyright © 2011 S. Karger AG, Basel.
Jasinska, A.J.; Service, S.; Jawaheer, D.; DeYoung, J.; Levinson, M.; Zhang, Z.; Kremeyer, B.; Muller, H.; Aldana, I.; Garcia, J.; Restrepo, G.; Lopez, C.; Palacio, C.; Duque, C.; Parra, M.; Vega, J.; Ortiz, D.; Bedoya, G.; Mathews, C.; Davanzo, P.; Fournier, E.; Bejarano, J.; Ramirez, M.; Ortiz, C. Araya; Araya, X.; Molina, J.; Sabatti, C.; Reus, V.; Ospina, J.; Macaya, G.; Ruiz-Linares, A.; Freimer, N.B.
2016-01-01
We previously reported linkage of bipolar disorder to 5q33-q34 in families from two closely related population isolates, the Central Valley of Costa Rica (CVCR) and Antioquia, Colombia (CO). Here we present follow up results from fine-scale mapping in large CVCR and CO families segregating severe bipolar disorder, BP-I, and in 343 population trios/duos from CVCR and CO. Employing densely spaced SNPs to fine map the prior linkage peak region increases linkage evidence and clarifies the position of the putative BP-I locus. We performed two-point linkage analysis with 1134 SNPs in an approximately 9 Mb region between markers D5S410 and D5S422. Combining pedigrees from CVCR and CO yields a LOD score of 4.9 at SNP rs10035961. Two other SNPs (rs7721142 and rs1422795) within the same 94 kb region also displayed LOD scores greater than 4. This linkage peak coincides with our prior microsatellite results and suggests a narrowed BP-I susceptibility regions in these families. To investigate if the locus implicated in the familial form of BP-I also contributes to disease risk in the population, we followed up the family results with association analysis in duo and trio samples, obtaining signals within 2 Mb of the peak linkage signal in the pedigrees; rs12523547 and rs267015 (P = 0.00004 and 0.00016, respectively) in the CO sample and rs244960 in the CVCR sample and the combined sample, with P = 0.00032 and 0.00016, respectively. It remains unclear whether these association results reflect the same locus contributing to BP susceptibility within the extended pedigrees. PMID:19319892
Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D.C.; Bouchard, Claude
2011-01-01
Background A genome-wide linkage scan identified a quantitative trait locus (QTL) for exercise training-induced changes in submaximal exercise (50W) heart rate (ΔHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (N=472). Methods and Results To fine map the region, 1,450 tagSNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with ΔHR50 was observed with two SNPs located in the 5′ region of the cAMP responsive element binding protein 1 (CREB1) gene (rs2253206: p=1.6×10−5 and rs2360969: p=4.3×10−5). The associations remained significant (p=0.01 and p=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analyses identified nine SNPs that collectively explained 20% of the ΔHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs ~1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, six SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: the LOD score dropped from 2.10 to 0.41 after adjusting for all six SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (p<0.05) lower promoter activity than the minor allele. Conclusions Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with ΔHR50 and explains considerable proportion of the QTL variance. However, at least five additional SNPs seem to be required to fully account for the original linkage signal. PMID:20407090
Multiple loci on 8q24 associated with prostate cancer susceptibility.
Al Olama, Ali Amin; Kote-Jarai, Zsofia; Giles, Graham G; Guy, Michelle; Morrison, Jonathan; Severi, Gianluca; Leongamornlert, Daniel A; Tymrakiewicz, Malgorzata; Jhavar, Sameer; Saunders, Ed; Hopper, John L; Southey, Melissa C; Muir, Kenneth R; English, Dallas R; Dearnaley, David P; Ardern-Jones, Audrey T; Hall, Amanda L; O'Brien, Lynne T; Wilkinson, Rosemary A; Sawyer, Emma; Lophatananon, Artitaya; Horwich, Alan; Huddart, Robert A; Khoo, Vincent S; Parker, Christopher C; Woodhouse, Christopher J; Thompson, Alan; Christmas, Tim; Ogden, Chris; Cooper, Colin; Donovan, Jenny L; Hamdy, Freddie C; Neal, David E; Eeles, Rosalind A; Easton, Douglas F
2009-10-01
Previous studies have identified multiple loci on 8q24 associated with prostate cancer risk. We performed a comprehensive analysis of SNP associations across 8q24 by genotyping tag SNPs in 5,504 prostate cancer cases and 5,834 controls. We confirmed associations at three previously reported loci and identified additional loci in two other linkage disequilibrium blocks (rs1006908: per-allele OR = 0.87, P = 7.9 x 10(-8); rs620861: OR = 0.90, P = 4.8 x 10(-8)). Eight SNPs in five linkage disequilibrium blocks were independently associated with prostate cancer susceptibility.
Association of Single-Nucleotide Polymorphisms of the Tau Gene With Late-Onset Parkinson Disease
Martin, Eden R.; Scott, William K.; Nance, Martha A.; Watts, Ray L.; Hubble, Jean P.; Koller, William C.; Lyons, Kelly; Pahwa, Rajesh; Stern, Matthew B.; Colcher, Amy; Hiner, Bradley C.; Jankovic, Joseph; Ondo, William G.; Allen, Fred H.; Goetz, Christopher G.; Small, Gary W.; Masterman, Donna; Mastaglia, Frank; Laing, Nigel G.; Stajich, Jeffrey M.; Ribble, Robert C.; Booze, Michael W.; Rogala, Allison; Hauser, Michael A.; Zhang, Fengyu; Gibson, Rachel A.; Middleton, Lefkos T.; Roses, Allen D.; Haines, Jonathan L.; Scott, Burton L.; Pericak-Vance, Margaret A.; Vance, Jeffery M.
2013-01-01
Context The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Objective To investigate whether the tau gene is involved in idiopathic PD. Design, Setting, and Participants Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Main Outcome Measure Family-based tests of association, calculated using asymptotic distributions. Results Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3,9i, 9ii, and 11). Conclusions This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD. PMID:11710889
Combined linkage and association analyses identify a novel locus for obesity near PROX1 in Asians.
Kim, Hyun-Jin; Yoo, Yun Joo; Ju, Young Seok; Lee, Seungbok; Cho, Sung-Il; Sung, Joohon; Kim, Jong-Il; Seo, Jeong-Sun
2013-11-01
Although genome-wide association studies (GWAS) have substantially contributed to understanding the genetic architecture, unidentified variants for complex traits remain an issue. One of the efficient approaches is the improvement of the power of GWAS scan by weighting P values with prior linkage signals. Our objective was to identify the novel candidates for obesity in Asian populations by using genemapping strategies that combine linkage and association analyses. To obtain linkage information for body mass index (BMI) and waist circumference (WC), we performed a multipoint genome-wide linkage study in an isolated Mongolian sample of 1,049 individuals from 74 families. Next, a family-based GWAS, which integrates within- and between-family components, was performed using the genotype data of 756 individuals of the Mongolian sample, and P values for association were weighted using linkage information obtained previously. For both BMI (LOD = 3.3) and WC (LOD = 2.6), the highest linkage peak was discovered at chromosome 10q11.22. In family-based GWAS combined with linkage information, six single-nucleotide polymorphisms (SNPs) for BMI and five SNPs for WC reached a significant level of association (linkage weighted P < 1 × 10(-5) ). Of these, only one of the SNPs associated with WC (rs1704198) was replicated in 327 Korean families comprising 1,301 individuals. This SNP was located in the proximity of the prosperorelated homeobox 1 (PROX1) gene, the function of which was validated previously in a mouse model. Our powerful strategic analysis enabled the discovery of a novel candidate gene, PROX1, associated with WC in an Asian population. Copyright © 2012 The Obesity Society.
Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel
Houle, David; Márquez, Eladio J.
2015-01-01
We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r2 ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epsistatic selection during the inbreeding and maintenance of the DGRP lines. PMID:26068573
Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel.
Houle, David; Márquez, Eladio J
2015-06-10
We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r(2) ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epsistatic selection during the inbreeding and maintenance of the DGRP lines. Copyright © 2015 Houle and Márquez.
Leak, Tennille S.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Keene, Keith L.; Gordon, Candace J.; Hicks, Pamela J.; Freedman, Barry I.; Bowden, Donald W.; Sale, Michèle M.
2009-01-01
Previously we performed a genome scan for type 2 diabetes (T2DM) using 638 African-American (AA) affected sibling pairs from 247 families; non-parametric linkage analysis suggested evidence of linkage at 6q24-27 (LOD 2.26). To comprehensively evaluate this region we performed a 2-stage association study by first constructing a SNP map of 754 SNPs selected from HapMap on the basis of linkage disequilibrium (LD) in 300 AAT2DM-ESRD subjects, 311 AA controls, 43 European American controls and 45 Yoruba Nigerian samples (Set 1). Replication analyses were conducted in an independent population of 283 AA T2DM-ESRD subjects and 282 AA controls (Set 2). In addition, we adjusted for the impact of admixture on association results by using ancestry informative markers (AIMs). In Stage 1, 137 (18.2%) SNPs showed nominal evidence of association (P<0.05) in one or more of tests of association: allelic (n=33), dominant (n=36), additive (n=29), or recessive (n=34) genotypic models, and 2- (n=47) and 3-SNP (n=43) haplotypic analyses. These SNPs were selected for follow-up genotyping. Stage 2 analyses confirmed association with a predicted 2-SNP “risk” haplotype in the PARK2 gene. Also, two intergenic SNPs showed consistent genotypic association with T2DM-ESRD: rs12197043 and rs4897081. Combined analysis of all subjects from both stages revealed nominal associations with 17 SNPs within genes; including suggestive associations in ESR1 and PARK2. This study confirms known diabetic nephropathy loci and identifies potentially novel susceptibility variants located within 6q24-27 in AA. PMID:18560894
Linkage Analysis in Autoimmune Addison's Disease: NFATC1 as a Potential Novel Susceptibility Locus.
Mitchell, Anna L; Bøe Wolff, Anette; MacArthur, Katie; Weaver, Jolanta U; Vaidya, Bijay; Erichsen, Martina M; Darlay, Rebecca; Husebye, Eystein S; Cordell, Heather J; Pearce, Simon H S
2015-01-01
Autoimmune Addison's disease (AAD) is a rare, highly heritable autoimmune endocrinopathy. It is possible that there may be some highly penetrant variants which confer disease susceptibility that have yet to be discovered. DNA samples from 23 multiplex AAD pedigrees from the UK and Norway (50 cases, 67 controls) were genotyped on the Affymetrix SNP 6.0 array. Linkage analysis was performed using Merlin. EMMAX was used to carry out a genome-wide association analysis comparing the familial AAD cases to 2706 UK WTCCC controls. To explore some of the linkage findings further, a replication study was performed by genotyping 64 SNPs in two of the four linked regions (chromosomes 7 and 18), on the Sequenom iPlex platform in three European AAD case-control cohorts (1097 cases, 1117 controls). The data were analysed using a meta-analysis approach. In a parametric analysis, applying a rare dominant model, loci on chromosomes 7, 9 and 18 had LOD scores >2.8. In a non-parametric analysis, a locus corresponding to the HLA region on chromosome 6, known to be associated with AAD, had a LOD score >3.0. In the genome-wide association analysis, a SNP cluster on chromosome 2 and a pair of SNPs on chromosome 6 were associated with AAD (P <5x10-7). A meta-analysis of the replication study data demonstrated that three chromosome 18 SNPs were associated with AAD, including a non-synonymous variant in the NFATC1 gene. This linkage study has implicated a number of novel chromosomal regions in the pathogenesis of AAD in multiplex AAD families and adds further support to the role of HLA in AAD. The genome-wide association analysis has also identified a region of interest on chromosome 2. A replication study has demonstrated that the NFATC1 gene is worthy of future investigation, however each of the regions identified require further, systematic analysis.
Significant association of APOA5 and APOC3 gene polymorphisms with meat quality traits in Kele pigs.
Hui, Y T; Yang, Y Q; Liu, R Y; Zhang, Y Y; Xiang, C J; Liu, Z Z; Ding, Y H; Zhang, Y L; Wang, B R
2013-09-13
Apolipoprotein A5 (APOA5) and C3 (APOC3) genes are involved in the PPAR lipid metabolism pathway and thus associated with elevated triglyceride levels. However, whether APOA5 and APOC3 genetic polymorphisms affect intramuscular fat deposition and other meat quality traits remains unknown in pigs. One hundred and seventy-one Kele pigs were sampled to investigate genetic variants in the APOA5 and APOC3 genes and their association with seven pork quality traits. We identified 5 single nucleotide polymorphisms (SNPs) in the promoter region of the APOA5 gene and 17 SNPs in the APOC3 gene. Linkage disequilibrium analysis revealed 5 complete linkage disequilibria among these 22 SNPs. We found that 10 SNPs were significantly correlated with meat quality traits, including the mutation A5/-769 in the APOA5 gene, which was significantly associated with cooked weight percentage, and 9 SNPs in the APOC3 gene that were significantly associated with drip loss rate, meat color value of longissimus dorsi muscle and shear force. Therefore, these SNP markers will be useful for marker-assisted selection for improved pork quality.
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are ideally suited for the construction of high-resolution genetic maps, studying population evolutionary history and performing genome-wide association mapping experiments. Here we used a genome-wide set of 1536 SNPs to study linkage disequilibrium (LD) and po...
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are the marker of choice for many researchers due to their abundance and the high-throughput methods available for their multiplex analysis. Only recently have SNP markers been available to researchers in soybean [Glycine max (L.) Merr.] with the release of th...
Leak, Tennille S; Langefeld, Carl D; Keene, Keith L; Gallagher, Carla J; Lu, Lingyi; Mychaleckyj, Josyf C; Rich, Stephen S; Freedman, Barry I; Bowden, Donald W; Sale, Michèle M
2010-02-08
Previously we performed a linkage scan of 638 African American affected sibling pairs (ASP) with type 2 diabetes (T2D) enriched for end-stage renal disease (ESRD). Ordered subset linkage analysis (OSA) revealed a linkage peak on chromosome 7p in the subset of families with earlier age of T2D diagnosis. We fine mapped this region by genotyping 11 additional polymorphic markers in the same ASP and investigated a total of 68 single nucleotide polymorphisms (SNPs) in functional candidate genes (GCK1, IL6, IGFBP1 and IGFBP3) for association with age of T2D diagnosis, age of ESRD diagnosis, duration of T2D to onset of ESRD, body mass index (BMI) in African American cases and T2D-ESRD in an African American case-control cohort. OSA of fine mapping markers supported linkage at 28 cM on 7p (near D7S3051) in early-onset T2D families (max. LOD = 3.61, P = 0.002). SNPs in candidate genes and 70 ancestry-informative markers (AIMs) were evaluated in 577 African American T2D-ESRD cases and 596 African American controls. The most significant association was observed between ESRD age of diagnosis and SNP rs730497, located in intron 1 of the GCK1 gene (recessive T2D age-adjusted P = 0.0006). Nominal associations were observed with GCK1 SNPs and T2D age of diagnosis (BMI-adjusted P = 0.014 to 0.032). Also, one IGFBP1 and four IGFBP3 SNPs showed nominal genotypic association with T2D-ESRD (P = 0.002-0.049). After correcting for multiple tests, only rs730497 remanined significant. Variant rs730947 in the GCK1 gene appears to play a role in early ESRD onset in African Americans.
Hüls, Anke; Ickstadt, Katja; Schikowski, Tamara; Krämer, Ursula
2017-06-12
For the analysis of gene-environment (GxE) interactions commonly single nucleotide polymorphisms (SNPs) are used to characterize genetic susceptibility, an approach that mostly lacks power and has poor reproducibility. One promising approach to overcome this problem might be the use of weighted genetic risk scores (GRS), which are defined as weighted sums of risk alleles of gene variants. The gold-standard is to use external weights from published meta-analyses. In this study, we used internal weights from the marginal genetic effects of the SNPs estimated by a multivariate elastic net regression and thereby provided a method that can be used if there are no external weights available. We conducted a simulation study for the detection of GxE interactions and compared power and type I error of single SNPs analyses with Bonferroni correction and corresponding analysis with unweighted and our weighted GRS approach in scenarios with six risk SNPs and an increasing number of highly correlated (up to 210) and noise SNPs (up to 840). Applying weighted GRS increased the power enormously in comparison to the common single SNPs approach (e.g. 94.2% vs. 35.4%, respectively, to detect a weak interaction with an OR ≈ 1.04 for six uncorrelated risk SNPs and n = 700 with a well-controlled type I error). Furthermore, weighted GRS outperformed the unweighted GRS, in particular in the presence of SNPs without any effect on the phenotype (e.g. 90.1% vs. 43.9%, respectively, when 20 noise SNPs were added to the six risk SNPs). This outperforming of the weighted GRS was confirmed in a real data application on lung inflammation in the SALIA cohort (n = 402). However, in scenarios with a high number of noise SNPs (>200 vs. 6 risk SNPs), larger sample sizes are needed to avoid an increased type I error, whereas a high number of correlated SNPs can be handled even in small samples (e.g. n = 400). In conclusion, weighted GRS with weights from the marginal genetic effects of the SNPs estimated by a multivariate elastic net regression were shown to be a powerful tool to detect gene-environment interactions in scenarios of high Linkage disequilibrium and noise.
Radhakrishna, Uppala; Ratnamala, Uppala; Gaines, Mathew; Beiraghi, Soraya; Hutchings, David; Golla, Jeffrey; Husain, Syed A.; Gambhir, Prakash S.; Sheth, Jayesh J.; Sheth, Frenny J.; Chetan, Ghati K.; Naveed, Mohammed; Solanki, Jitendra V.; Patel, Uday C.; Master, Dilipkumar C.; Memon, Rafiq; Antonarakis, Gregory S.; Antonarakis, Stylianos E.; Nath, Swapan K.
2006-01-01
Nonsyndromic cleft lip with or without cleft palate (CL-P) is a common congenital anomaly with incidence ranging from 1 in 300 to 1 in 2,500 live births. We analyzed two Indian pedigrees (UR017 and UR019) with isolated, nonsyndromic CL-P, in which the anomaly segregates as an autosomal dominant trait. The phenotype was variable, ranging from unilateral to bilateral CL-P. A genomewide linkage scan that used ∼10,000 SNPs was performed. Nonparametric linkage (NPL) analysis identified 11 genomic regions (NPL>3.5; P<.005) that could potentially harbor CL-P susceptibility variations. Among those, the most significant evidence was for chromosome 13q33.1-34 at marker rs1830756 (NPL=5.57; P=.00024). This was also supported by parametric linkage; MOD score (LOD scores maximized over genetic model parameters) analysis favored an autosomal dominant model. The maximum LOD score was 4.45, and heterogeneity LOD was 4.45 (α=100%). Haplotype analysis with informative crossovers enabled the mapping of the CL-P locus to a region of ∼20.17 cM (7.42 Mb) between SNPs rs951095 and rs726455. Thus, we have identified a novel genomic region on 13q33.1-34 that harbors a high-risk variant for CL-P in these Indian families. PMID:16909398
DOE Office of Scientific and Technical Information (OSTI.GOV)
Song, Qijian; Jia, Gaofeng; Hyten, David L.
A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of largemore » scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.« less
Song, Qijian; Jia, Gaofeng; Hyten, David L.; ...
2015-08-28
A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of largemore » scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.« less
Song, Qijian; Jia, Gaofeng; Hyten, David L; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G; Osorno, Juan M; Schmutz, Jeremy; Jackson, Scott A; McClean, Phillip E; Cregan, Perry B
2015-08-28
A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad. Copyright © 2015 Song et al.
Nimmakayala, Padma; Abburi, Venkata L.; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C. V. Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K.
2016-01-01
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum, indicating a population bottleneck during domestication of C. baccatum. In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum, 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index (FST) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9–2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers. PMID:27857720
Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Almeida, Aldo; Davenport, Brittany; Davidson, Joshua; Reddy, C V Chandra Mohan; Hankins, Gerald; Ebert, Andreas; Choi, Doil; Stommel, John; Reddy, Umesh K
2016-01-01
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to characterize population structure and species domestication of these two important incompatible cultivated pepper species. Estimated mean nucleotide diversity (π) and Tajima's D across various chromosomes revealed biased distribution toward negative values on all chromosomes (except for chromosome 4) in cultivated C. baccatum , indicating a population bottleneck during domestication of C. baccatum . In contrast, C. annuum chromosomes showed positive π and Tajima's D on all chromosomes except chromosome 8, which may be because of domestication at multiple sites contributing to wider genetic diversity. For C. baccatum , 13,129 SNPs were available, with minor allele frequency (MAF) ≥0.05; PCA of the SNPs revealed 283 C. baccatum accessions grouped into 3 distinct clusters, for strong population structure. The fixation index ( F ST ) between domesticated C. annuum and C. baccatum was 0.78, which indicates genome-wide divergence. We conducted extensive linkage disequilibrium (LD) analysis of C. baccatum var. pendulum cultivars on all adjacent SNP pairs within a chromosome to identify regions of high and low LD interspersed with a genome-wide average LD block size of 99.1 kb. We characterized 1742 haplotypes containing 4420 SNPs (range 9-2 SNPs per haplotype). Genome-wide association study (GWAS) of peduncle length, a trait that differentiates wild and domesticated C. baccatum types, revealed 36 significantly associated genome-wide SNPs. Population structure, identity by state (IBS) and LD patterns across the genome will be of potential use for future GWAS of economically important traits in C. baccatum peppers.
Gueorguiev, Maria; Wiltshire, Steven; Garcia, Edwin A; Mein, Charles; Lecoeur, Cecile; Kristen, Brigitte; Allotey, Rebecca; Hattersley, Andrew T; Walker, Mark; O'rahilly, Stephen; Froguel, Philippe; Grossman, Ashley B; McCarthy, Mark I; Hitman, Graham A; Korbonits, Márta
2007-06-01
Recently, a quantitative trait locus for stature was reported on chromosome 3p26 in patients with type 2 diabetes. Given that ghrelin is a peptide involved in GH release and located on 3p26, we hypothesized that variation within its gene (GHRL) may be responsible for the quantitative trait locus on 3p26. The evidence for linkage around GHRL was refined with the genotyping of an additional four microsatellites (D3S4545, D3S1537, D3S1597, and D3S3611), giving a total of 27 markers, followed by multipoint variance components linkage analysis. Probands from the linkage families were typed for five common single nucleotide polymorphisms (SNPs) within GHRL and tested for association with adult stature using haplotype trend regression. The maximum multipoint evidence for linkage between adult stature and the 27 microsatellites yielded an LOD score of 2.58 (P = 0.0003) between D3S1297 and D3S1304. Five common (frequency of > or =5%) SNPs were typed in the probands [two promoter SNPs (rs27647 and rs26802), two exonic (rs696217 and rs4684677), and one intronic (rs35683)] capturing 80% of the total common variation in GHRL. No association was found between any SNP (or haplotypes thereof) and adult stature. Common genetic variation within GHRL is not responsible for variation in adult stature in this population.
Meta-analysis of the Association Between Variants in SORL1 and Alzheimer Disease
Reitz, Christiane; Cheng, Rong; Rogaeva, Ekaterina; Lee, Joseph H.; Tokuhiro, Shinya; Zou, Fanggeng; Bettens, Karolien; Sleegers, Kristel; Tan, Eng King; Kimura, Ryo; Shibata, Nobuto; Arai, Heii; Kamboh, M. Ilyas; Prince, Jonathan A.; Maier, Wolfgang; Riemenschneider, Matthias; Owen, Michael; Harold, Denise; Hollingworth, Paul; Cellini, Elena; Sorbi, Sandro; Nacmias, Benedetta; Takeda, Masatoshi; Pericak-Vance, Margaret A.; Haines, Jonathan L.; Younkin, Steven; Williams, Julie; van Broeckhoven, Christine; Farrer, Lindsay A.; St George–Hyslop, Peter H.; Mayeux, Richard
2011-01-01
Objective To reexamine the association between the neuronal sortilin-related receptor gene (SORL1) and Alzheimer disease (AD). Design Comprehensive and unbiased meta-analysis of all published and unpublished data from case-control studies for the SORL1 single-nucleotide polymorphisms (SNPs) that had been repeatedly assessed across studies. Setting Academic research institutions in the United States, the Netherlands, Canada, Belgium, the United Kingdom, Singapore, Japan, Sweden, Germany, France, and Italy. Participants All published white and Asian case-control data sets, which included a total of 12 464 cases and 17 929 controls. Main Outcome Measures Alzheimer disease according to the Diagnostic and Statistical Manual of Mental Disorders (Fourth Edition) and the National Institute of Neurological and Communicative Disorders and Stroke and the Alzheimer’s Disease and Related Disorders Association (now known as the Alzheimer’s Association). Results In the white data sets, several markers were associated with AD after correction for multiple testing, including previously reported SNPs 8, 9, and 10 (P<.001). In addition, the C-G-C haplotype at SNPs 8 through 10 was associated with AD risk (P<.001). In the combined Asian data sets, SNPs 19 and 23 through 25 were associated with AD risk (P<.001). The disease-associated alleles at SNPs 8, 9, and 10 (120 873 131-120 886 175 base pairs [bp]; C-G-C alleles), at SNP 19 (120 953 300 bp; G allele), and at SNPs 24 through 25 (120 988 611 bp; T and C alleles) were the same previously reported alleles. The SNPs 4 through 5, 8 through 10, 12, and 19 through 25 belong to distinct linkage disequilibrium blocks. The same alleles at SNPs 8 through 10 (C-G-C), 19 (G), and 24 and 25 (T and C) have also been associated with AD endophenotypes, including white matter hyperintensities and hippocampal atrophy on magnetic resonance imaging, cerebrospinal fluid measures of amyloid β-peptide 42, and full-length SORL1 expression in the human brain. Conclusion This comprehensive meta-analysis provides confirmatory evidence that multiple SORL1 variants in distinct linkage disequilibrium blocks are associated with AD. PMID:21220680
Abnet, Christian C.; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D.; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M.; Liao, Linda M.; Lee, Maxwell P.; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C.; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B.; Giffen, Carol A.; Burdett, Laurie; Fraumeni, Joseph F.; Tucker, Margaret A.; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M.; Yuan, Zhi-Qing; Chanock, Stephen J.; Zhang, Xue-Jun; Taylor, Philip R.; Wang, Li-Dong
2012-01-01
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10−8, and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19–1.40) and P= 7.63 × 10−10. An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants. PMID:22323360
Abnet, Christian C; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M; Liao, Linda M; Lee, Maxwell P; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B; Giffen, Carol A; Burdett, Laurie; Fraumeni, Joseph F; Tucker, Margaret A; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M; Yuan, Zhi-Qing; Chanock, Stephen J; Zhang, Xue-Jun; Taylor, Philip R; Wang, Li-Dong
2012-05-01
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10(-8), and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19-1.40) and P= 7.63 × 10(-10). An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants.
Fine mapping on chromosome 13q32-34 and brain expression analysis implicates MYO16 in schizophrenia.
Rodriguez-Murillo, Laura; Xu, Bin; Roos, J Louw; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria
2014-03-01
We previously reported linkage of schizophrenia and schizoaffective disorder to 13q32-34 in the European descent Afrikaner population from South Africa. The nature of genetic variation underlying linkage peaks in psychiatric disorders remains largely unknown and both rare and common variants may be contributing. Here, we examine the contribution of common variants located under the 13q32-34 linkage region. We used densely spaced SNPs to fine map the linkage peak region using both a discovery sample of 415 families and a meta-analysis incorporating two additional replication family samples. In a second phase of the study, we use one family-based data set with 237 families and independent case-control data sets for fine mapping of the common variant association signal using HapMap SNPs. We report a significant association with a genetic variant (rs9583277) within the gene encoding for the myosin heavy-chain Myr 8 (MYO16), which has been implicated in neuronal phosphoinositide 3-kinase signaling. Follow-up analysis of HapMap variation within MYO16 in a second set of Afrikaner families and additional case-control data sets of European descent highlighted a region across introns 2-6 as the most likely region to harbor common MYO16 risk variants. Expression analysis revealed a significant increase in the level of MYO16 expression in the brains of schizophrenia patients. Our results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia.
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs
Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong
2009-01-01
Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177 had been re-sequenced and analyzed by Perl self-compiled script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could map to the reference watermelon genome, respectively. Comparative assembled genome data analysis provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
Kuhn, Alexandre; Ong, Yao Min; Cheng, Ching-Yu; Wong, Tien Yin; Quake, Stephen R; Burkholder, William F
2014-06-03
Insertions of the human-specific subfamily of LINE-1 (L1) retrotransposon are highly polymorphic across individuals and can critically influence the human transcriptome. We hypothesized that L1 insertions could represent genetic variants determining important human phenotypic traits, and performed an integrated analysis of L1 elements and single nucleotide polymorphisms (SNPs) in several human populations. We found that a large fraction of L1s were in high linkage disequilibrium with their surrounding genomic regions and that they were well tagged by SNPs. However, L1 variants were only partially captured by SNPs on standard SNP arrays, so that their potential phenotypic impact would be frequently missed by SNP array-based genome-wide association studies. We next identified potential phenotypic effects of L1s by looking for signatures of natural selection linked to L1 insertions; significant extended haplotype homozygosity was detected around several L1 insertions. This finding suggests that some of these L1 insertions may have been the target of recent positive selection.
Hsueh, Wen-Chi; Nair, Anup K; Kobes, Sayuko; Chen, Peng; Göring, Harald H H; Pollin, Toni I; Malhotra, Alka; Knowler, William C; Baier, Leslie J; Hanson, Robert L
2017-12-01
Identity-by-descent mapping using empirical estimates of identity-by-descent allele sharing may be useful for studies of complex traits in founder populations, where hidden relationships may augment the inherent genetic information that can be used for localization. Through identity-by-descent mapping, using ≈400 000 single-nucleotide polymorphisms (SNPs), of serum lipid profiles, we identified a major linkage signal for triglycerides in 1007 Pima Indians (LOD=9.23; P =3.5×10 -11 on chromosome 11q). In subsequent fine-mapping and replication association studies in ≈7500 Amerindians, we determined that this signal reflects effects of a loss-of-function Ala43Thr substitution in APOC3 (rs147210663) and 3 established functional SNPs in APOA5 . The association with rs147210663 was particularly strong; each copy of the Thr allele conferred 42% lower triglycerides (β=-0.92±0.059 SD unit; P =9.6×10 -55 in 4668 Pimas and 2793 Southwest Amerindians combined). The Thr allele is extremely rare in most global populations but has a frequency of 2.5% in Pimas. We further demonstrated that 3 APOA5 SNPs with established functional impact could explain the association with the most well-replicated SNP (rs964184) for triglycerides identified by genome-wide association studies. Collectively, these 4 SNPs account for 6.9% of variation in triglycerides in Pimas (and 4.1% in Southwest Amerindians), and their inclusion in the original linkage model reduced the linkage signal to virtually null. APOC3/APOA5 constitutes a major locus for serum triglycerides in Amerindians, especially the Pimas, and these results provide an empirical example for the concept that population-based linkage analysis is a useful strategy to identify complex trait variants. © 2017 American Heart Association, Inc.
Gragnoli, Claudia
2014-09-01
Proteasome modulator 9 (PSMD9) gene single nucleotide polymorphism (SNP) rs1043307/rs2514259 (E197G) is associated with significant clinical response to the anti-depressant desipramine. PSMD9 SNP rs74421874 [intervening sequence (IVS) 3 + nt460 G>A], rs3825172 (IVS3 + nt437 C>T) and rs1043307/rs2514259 (E197G A>G) are all linked to type 2 diabetes (T2D), maturity-onset-diabetes-of the young 3 (MODY3), obesity and waist circumference, hypertension, hypercholesterolemia, T2D-macrovascular and T2D-microvascular disease, T2D-neuropathy, T2D-carpal tunnel syndrome, T2D-nephropathy, T2D-retinopathy, non-diabetic retinopathy and depression. PSMD9 rs149556654 rare SNP (N166S A>G) and the variant S143G A>G also contribute to T2D. PSMD9 is located in the chromosome 12q24 locus, which per se is in linkage with depression, bipolar disorder and anxiety. In the present study, we wanted to determine whether PSMD9 is linked to general anxiety disorder in Italian T2D families. Two-hundred Italian T2D families were phenotyped for generalized anxiety disorder, using the diagnostic criteria of DSM-IV. When the diagnosis was unavailable or unclear, the trait was reported as unknown. The 200 Italians families were tested for the PSMD9 T2D risk SNPs rs74421874 (IVS3 + nt460 G>A), rs3825172 (IVS3 +nt437 T>C) and for the T2D risk and anti-depressant response SNP rs1043307/rs2514259 (E197G A>G) for evidence of linkage with generalized anxiety disorder. Non-parametric linkage analysis was executed via Merlin software. One-thousand simulation tests were performed to exclude results due to random chance. In our study, the PSMD9 gene SNPs rs74421874, rs3825172, and rs1043307/rs2514259 result in linkage to generalized anxiety disorder. This is the first report describing PSMD9 gene SNPs in linkage to generalized anxiety disorder in T2D families. © 2014 Wiley Periodicals, Inc.
Talbert, Matthew E; Langefeld, Carl D; Ziegler, Julie; Mychaleckyj, Josyf C; Haffner, Steven M; Norris, Jill M; Bowden, Donald W
2009-01-01
The SOCS3 gene product participates in the feedback inhibition of a range of cytokine signals. Most notably, SOCS3 inhibits the functioning of leptin and downstream steps in insulin signaling after being expressed by terminal transcription factors, such as STAT3 and c-fos. The SOCS3 gene is located in the chromosome region 17q24–17q25, previously linked to body mass index (BMI), visceral adipose tissue (VAT), and waist circumference (WAIST) in Hispanic families in the Insulin Resistance Atherosclerosis Family Study (IRASFS). A high density map of 1536 single nucleotide polymorphisms (SNPs) was constructed to cover a portion of the 17q linkage interval in DNA samples from 1425 Hispanic subjects from 90 extended families in IRASFS. Analysis of this dense SNP map data revealed evidence of association of rs9914220 (located 10 kb 5’ of the SOCS3 gene) with BMI, VAT, and WAIST (P-value ranging from 0 003 to 0.017). Using a tagging SNP approach, rs9914220 and 22 additional SOCS3 SNPs were genotyped for genetic association analysis with measures of adiposity and glucose homeostasis. The adiposity phenotypes utilized in association analyses included BMI, WAIST, waist to hip ratio (WHR), subcutaneous adipose tissue (SAT), VAT, and visceral to subcutaneous ratio (VSR). Linkage disequilibrium (LD) calculations revealed three haplotype blocks near SOCS3. Haplotype Block 1 (5’ of SOCS3) contained SNPs consistently associated with BMI, WAIST, WHR, and VAT (P-values ranging from 2.00x10−4 to .036). Haplotype Block 3 contained single-SNPs that were associated with most adiposity traits except for VSR (P-values ranging from 0.002 to 0.047). When trait associated SNPs were included in linkage analyses as covariates, a reduction of VAT LOD score from 1.26 to .76 above the SOCS3 locus (110 cM) was observed. Multi-SNP haplotype testing using the quantitative pedigree disequilibrium test (QPDT) was broadly consistent with the single-SNP associations. In conclusion, these results support a role for SOCS3 genetic variants in human obesity. PMID:19083014
Fine Mapping on Chromosome 13q32–34 and Brain Expression Analysis Implicates MYO16 in Schizophrenia
Rodriguez-Murillo, Laura; Xu, Bin; Roos, J Louw; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria
2014-01-01
We previously reported linkage of schizophrenia and schizoaffective disorder to 13q32–34 in the European descent Afrikaner population from South Africa. The nature of genetic variation underlying linkage peaks in psychiatric disorders remains largely unknown and both rare and common variants may be contributing. Here, we examine the contribution of common variants located under the 13q32–34 linkage region. We used densely spaced SNPs to fine map the linkage peak region using both a discovery sample of 415 families and a meta-analysis incorporating two additional replication family samples. In a second phase of the study, we use one family-based data set with 237 families and independent case–control data sets for fine mapping of the common variant association signal using HapMap SNPs. We report a significant association with a genetic variant (rs9583277) within the gene encoding for the myosin heavy-chain Myr 8 (MYO16), which has been implicated in neuronal phosphoinositide 3-kinase signaling. Follow-up analysis of HapMap variation within MYO16 in a second set of Afrikaner families and additional case–control data sets of European descent highlighted a region across introns 2–6 as the most likely region to harbor common MYO16 risk variants. Expression analysis revealed a significant increase in the level of MYO16 expression in the brains of schizophrenia patients. Our results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia. PMID:24141571
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-01-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1–8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. PMID:25762582
Parent-Of-Origin Effects in Autism Identified through Genome-Wide Linkage Analysis of 16,000 SNPs
Fradin, Delphine; Cheslack-Postava, Keely; Ladd-Acosta, Christine; Newschaffer, Craig; Chakravarti, Aravinda; Arking, Dan E.; Feinberg, Andrew; Fallin, M. Daniele
2010-01-01
Background Autism is a common heritable neurodevelopmental disorder with complex etiology. Several genome-wide linkage and association scans have been carried out to identify regions harboring genes related to autism or autism spectrum disorders, with mixed results. Given the overlap in autism features with genetic abnormalities known to be associated with imprinting, one possible reason for lack of consistency would be the influence of parent-of-origin effects that may mask the ability to detect linkage and association. Methods and Findings We have performed a genome-wide linkage scan that accounts for potential parent-of-origin effects using 16,311 SNPs among families from the Autism Genetic Resource Exchange (AGRE) and the National Institute of Mental Health (NIMH) autism repository. We report parametric (GH, Genehunter) and allele-sharing linkage (Aspex) results using a broad spectrum disorder case definition. Paternal-origin genome-wide statistically significant linkage was observed on chromosomes 4 (LODGH = 3.79, empirical p<0.005 and LODAspex = 2.96, p = 0.008), 15 (LODGH = 3.09, empirical p<0.005 and LODAspex = 3.62, empirical p = 0.003) and 20 (LODGH = 3.36, empirical p<0.005 and LODAspex = 3.38, empirical p = 0.006). Conclusions These regions may harbor imprinted sites associated with the development of autism and offer fruitful domains for molecular investigation into the role of epigenetic mechanisms in autism. PMID:20824079
Zhang, Kunlin; Chang, Suhua; Cui, Sijia; Guo, Liyuan; Zhang, Liuyan; Wang, Jing
2011-07-01
Genome-wide association study (GWAS) is widely utilized to identify genes involved in human complex disease or some other trait. One key challenge for GWAS data interpretation is to identify causal SNPs and provide profound evidence on how they affect the trait. Currently, researches are focusing on identification of candidate causal variants from the most significant SNPs of GWAS, while there is lack of support on biological mechanisms as represented by pathways. Although pathway-based analysis (PBA) has been designed to identify disease-related pathways by analyzing the full list of SNPs from GWAS, it does not emphasize on interpreting causal SNPs. To our knowledge, so far there is no web server available to solve the challenge for GWAS data interpretation within one analytical framework. ICSNPathway is developed to identify candidate causal SNPs and their corresponding candidate causal pathways from GWAS by integrating linkage disequilibrium (LD) analysis, functional SNP annotation and PBA. ICSNPathway provides a feasible solution to bridge the gap between GWAS and disease mechanism study by generating hypothesis of SNP → gene → pathway(s). The ICSNPathway server is freely available at http://icsnpathway.psych.ac.cn/.
Shearman, Jeremy R.; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
2015-01-01
Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly. PMID:25831195
Shearman, Jeremy R; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-Areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
2015-01-01
Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly.
Tomato breeding in the genomics era: insights from a SNP array.
Víquez-Zamora, Marcela; Vosman, Ben; van de Geest, Henri; Bovy, Arnaud; Visser, Richard G F; Finkers, Richard; van Heusden, Adriaan W
2013-05-27
The major bottle neck in genetic and linkage studies in tomato has been the lack of a sufficient number of molecular markers. This has radically changed with the application of next generation sequencing and high throughput genotyping. A set of 6000 SNPs was identified and 5528 of them were used to evaluate tomato germplasm at the level of species, varieties and segregating populations. From the 5528 SNPs, 1980 originated from 454-sequencing, 3495 from Illumina Solexa sequencing and 53 were additional known markers. Genotyping different tomato samples allowed the evaluation of the level of heterozygosity and introgressions among commercial varieties. Cherry tomatoes were especially different from round/beefs in chromosomes 4, 5 and 12. We were able to identify a set of 750 unique markers distinguishing S. lycopersicum 'Moneymaker' from all its distantly related wild relatives. Clustering and neighbour joining analysis among varieties and species showed expected grouping patterns, with S. pimpinellifolium as the most closely related to commercial tomatoes earlier results. Our results show that a SNP search in only a few breeding lines already provides generally applicable markers in tomato and its wild relatives. It also shows that the Illumina bead array generated data are highly reproducible. Our SNPs can roughly be divided in two categories: SNPs of which both forms are present in the wild relatives and in domesticated tomatoes (originating from common ancestors) and SNPs unique for the domesticated tomato (originating from after the domestication event). The SNPs can be used for genotyping, identification of varieties, comparison of genetic and physical linkage maps and to confirm (phylogenetic) relations. In the SNPs used for the array there is hardly any overlap with the SolCAP array and it is strongly recommended to combine both SNP sets and to select a core collection of robust SNPs completely covering the entire tomato genome.
The versican gene and the risk of intracranial aneurysms.
Ruigrok, Ynte M; Rinkel, Gabriël J E; Wijmenga, Cisca
2006-09-01
The proteoglycan versican is an excellent candidate gene for intracranial aneurysms (IAs) because it plays an important role in extracellular matrix assembly and is localized in a previously implicated locus for IAs on chromosome 5q. We analyzed all the common variations using 16-tag single nucleotide polymorphisms (SNPs) and haplotypes in the versican gene using a 2-stage genotyping approach. For stage 1, 16 SNPs were genotyped in 307 cases and 639 controls. For stage 2, the two SNPs yielding the most significant associations (P<0.01) were genotyped in a second independent cohort of 310 cases for confirmation of the associations. In stage 1, we found several SNPs in strong linkage disequilibrium and haplotypes constituting these SNPs associated with IAs in the Dutch population (strongest SNP association for rs173686 with odds ratio=1.34, 95% CI=1.09 to 1.65, P=0.004). In stage 2, we confirmed association for the 2 SNPs with the most significant associations (strongest SNP association for rs173686 with odds ratio=1.36, 95% CI=1.11 to 1.67, P=0.003). SNPs in strong linkage disequilibrium and haplotypes constituting these SNPs in the versican gene are associated with IAs suggesting that variation in or near the versican gene plays a role in susceptibility to IAs.
Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel
2013-01-01
A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree. PMID:24376720
Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel
2013-01-01
A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree.
USDA-ARS?s Scientific Manuscript database
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...
Sun, Yujia; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong
2015-06-01
The aim of this study was to examine the association of cofilin2 (CFL2) gene polymorphisms with growth traits in Chinese Qinchuan cattle. Three single nucleotide polymorphisms (SNPs) were identified in the bovine CFL2 gene using DNA sequencing and (forced) PCR-RFLP methods. These polymorphisms included a missense mutation (NC_007319.5: g. C 2213 G) in exon 4, one synonymous mutation (NC_007319.5: g. T 1694 A) in exon 4, and a mutation (NC_007319.5: g. G 1500 A) in intron 2, respectively. In addition, we evaluated the haplotype frequency and linkage disequilibrium coefficient of three sequence variants in 488 individuals in QC cattle. All the three SNPs in QC cattle belonged to an intermediate level of genetic diversity (0.25
Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Quantitative trait locus mapping of deep rooting by linkage and association analysis in rice
Lou, Qiaojun; Chen, Liang; Mei, Hanwei; Wei, Haibin; Feng, Fangjun; Wang, Pei; Xia, Hui; Li, Tiemei; Luo, Lijun
2015-01-01
Deep rooting is a very important trait for plants’ drought avoidance, and it is usually represented by the ratio of deep rooting (RDR). Three sets of rice populations were used to determine the genetic base for RDR. A linkage mapping population with 180 recombinant inbred lines and an association mapping population containing 237 rice varieties were used to identify genes linked to RDR. Six quantitative trait loci (QTLs) of RDR were identified as being located on chromosomes 1, 2, 4, 7, and 10. Using 1 019 883 single-nucleotide polymorphisms (SNPs), a genome-wide association study of the RDR was performed. Forty-eight significant SNPs of the RDR were identified and formed a clear peak on the short arm of chromosome 1 in a Manhattan plot. Compared with the shallow-rooting group and the whole collection, the deep-rooting group had selective sweep regions on chromosomes 1 and 2, especially in the major QTL region on chromosome 2. Seven of the nine candidate SNPs identified by association mapping were verified in two RDR extreme groups. The findings from this study will be beneficial to rice drought-resistance research and breeding. PMID:26022253
Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl
2017-01-01
Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
The search for loci under selection: trends, biases and progress.
Ahrens, Collin W; Rymer, Paul D; Stow, Adam; Bragg, Jason; Dillon, Shannon; Umbers, Kate D L; Dudaniec, Rachael Y
2018-03-01
Detecting genetic variants under selection using F ST outlier analysis (OA) and environmental association analyses (EAAs) are popular approaches that provide insight into the genetic basis of local adaptation. Despite the frequent use of OA and EAA approaches and their increasing attractiveness for detecting signatures of selection, their application to field-based empirical data have not been synthesized. Here, we review 66 empirical studies that use Single Nucleotide Polymorphisms (SNPs) in OA and EAA. We report trends and biases across biological systems, sequencing methods, approaches, parameters, environmental variables and their influence on detecting signatures of selection. We found striking variability in both the use and reporting of environmental data and statistical parameters. For example, linkage disequilibrium among SNPs and numbers of unique SNP associations identified with EAA were rarely reported. The proportion of putatively adaptive SNPs detected varied widely among studies, and decreased with the number of SNPs analysed. We found that genomic sampling effort had a greater impact than biological sampling effort on the proportion of identified SNPs under selection. OA identified a higher proportion of outliers when more individuals were sampled, but this was not the case for EAA. To facilitate repeatability, interpretation and synthesis of studies detecting selection, we recommend that future studies consistently report geographical coordinates, environmental data, model parameters, linkage disequilibrium, and measures of genetic structure. Identifying standards for how OA and EAA studies are designed and reported will aid future transparency and comparability of SNP-based selection studies and help to progress landscape and evolutionary genomics. © 2018 John Wiley & Sons Ltd.
Varela, Miguel A; Curtis, Helen J; Douglas, Andrew GL; Hammond, Suzan M; O'Loughlin, Aisling J; Sobrido, Maria J; Scholefield, Janine; Wood, Matthew JA
2016-01-01
Allele-specific gene therapy aims to silence expression of mutant alleles through targeting of disease-linked single-nucleotide polymorphisms (SNPs). However, SNP linkage to disease varies between populations, making such molecular therapies applicable only to a subset of patients. Moreover, not all SNPs have the molecular features necessary for potent gene silencing. Here we provide knowledge to allow the maximisation of patient coverage by building a comprehensive understanding of SNPs ranked according to their predicted suitability toward allele-specific silencing in 14 repeat expansion diseases: amyotrophic lateral sclerosis and frontotemporal dementia, dentatorubral-pallidoluysian atrophy, myotonic dystrophy 1, myotonic dystrophy 2, Huntington's disease and several spinocerebellar ataxias. Our systematic analysis of DNA sequence variation shows that most annotated SNPs are not suitable for potent allele-specific silencing across populations because of suboptimal sequence features and low variability (>97% in HD). We suggest maximising patient coverage by selecting SNPs with high heterozygosity across populations, and preferentially targeting SNPs that lead to purine:purine mismatches in wild-type alleles to obtain potent allele-specific silencing. We therefore provide fundamental knowledge on strategies for optimising patient coverage of therapeutics for microsatellite expansion disorders by linking analysis of population genetic variation to the selection of molecular targets. PMID:25990798
Varela, Miguel A; Curtis, Helen J; Douglas, Andrew G L; Hammond, Suzan M; O'Loughlin, Aisling J; Sobrido, Maria J; Scholefield, Janine; Wood, Matthew J A
2016-02-01
Allele-specific gene therapy aims to silence expression of mutant alleles through targeting of disease-linked single-nucleotide polymorphisms (SNPs). However, SNP linkage to disease varies between populations, making such molecular therapies applicable only to a subset of patients. Moreover, not all SNPs have the molecular features necessary for potent gene silencing. Here we provide knowledge to allow the maximisation of patient coverage by building a comprehensive understanding of SNPs ranked according to their predicted suitability toward allele-specific silencing in 14 repeat expansion diseases: amyotrophic lateral sclerosis and frontotemporal dementia, dentatorubral-pallidoluysian atrophy, myotonic dystrophy 1, myotonic dystrophy 2, Huntington's disease and several spinocerebellar ataxias. Our systematic analysis of DNA sequence variation shows that most annotated SNPs are not suitable for potent allele-specific silencing across populations because of suboptimal sequence features and low variability (>97% in HD). We suggest maximising patient coverage by selecting SNPs with high heterozygosity across populations, and preferentially targeting SNPs that lead to purine:purine mismatches in wild-type alleles to obtain potent allele-specific silencing. We therefore provide fundamental knowledge on strategies for optimising patient coverage of therapeutics for microsatellite expansion disorders by linking analysis of population genetic variation to the selection of molecular targets.
Population-based case-control study of DRD2 gene polymorphisms and alcoholism.
Bhaskar, L V K S; Thangaraj, K; Non, A L; Singh, Lalji; Rao, V R
2010-10-01
Several independent lines of evidence for genetic contributions to vulnerability to alcoholism exist. Dopamine is thought to play a major role in the mechanism of reward and reinforcement in response to alcohol. D2 dopamine receptor (DRD2) gene has been among the stronger candidate genes implicated in alcoholism. In this study, alcohol use was assessed in 196 randomly selected Kota individuals of Nilgiri Hills, South India. Six DRD2 SNPs were assessed in 81 individuals with alcoholism and 151 controls to evaluate the association between single nucleotide polymorphisms (SNPs) and alcoholism. Of the three models (dominant, recessive, and additive) tested for association between alcoholism and DRD2 SNPs, only the additive model shows association for three loci (rs1116313, TaqID, and rs2734835). Of six studied polymorphisms, five are in strong linkage disequilibrium forming onesingle haplotype block. Though the global haplotype analysis with these five SNPs was not significant, haplotype analysis using all six SNPs yielded a global P value of .033, even after adjusting for age. These findings support the importance of dopamine receptor gene polymorphisms in alcoholism. Further studies to replicate these findings in different populations are needed to confirm these results.
Genetic linkage maps are valuable tools in evolutionary biology; however, their availability for wild populations is extremely limited. Fundulus heteroclitus (Atlantic killifish) is a non-migratory estuarine fish that exhibits high allelic and phenotypic diversity partitioned among subpopulations that reside in disparate environmental conditions. An ideal candidate model organism for studying gene-environment interactions, the molecular toolbox for F. heteroclitus is limited. We identified hundreds of novel microsatellites which, when combined with existing microsatellites and single nucleotide polymorphisms (SNPs), were used to construct the first genetic linkage map for this species. By integrating independent linkage maps from three genetic crosses, we developed a consensus map containing 24 linkage groups, consistent with the number of chromosomes reported for this species. These linkage groups span 2300 centimorgans (cM) of recombinant genomic space, intermediate in size relative to the current linkage maps for the teleosts, medaka and zebrafish. Comparisons between fish genomes support a high degree of synteny between the consensus F. heteroclitus linkage map and the medaka and (to a lesser extent) zebrafish physical genome assemblies.This dataset is associated with the following publication:Waits , E., J. Martinson , B. Rinner, S. Morris, D. Proestou, D. Champlin , and D. Nacci. Genetic linkage map and comparative genome analysis for the estuarine Atlanti
Ma, Yu; Coyne, Clarice J; Grusak, Michael A; Mazourek, Michael; Cheng, Peng; Main, Dorrie; McGee, Rebecca J
2017-02-13
Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a range of nutritional components, such as protein, mineral nutrients, carbohydrates and several vitamins, pea (Pisum sativum L.), one of the oldest domesticated crops in the world, remains behind many other crops in the availability of genomic and genetic resources. To further improve mineral nutrient levels in pea seeds requires the development of genome-wide tools. The objectives of this research were to develop these tools by: identifying genome-wide single nucleotide polymorphisms (SNPs) using genotyping by sequencing (GBS); constructing a high-density linkage map and comparative maps with other legumes, and identifying quantitative trait loci (QTL) for levels of boron, calcium, iron, potassium, magnesium, manganese, molybdenum, phosphorous, sulfur, and zinc in the seed, as well as for seed weight. In this study, 1609 high quality SNPs were found to be polymorphic between 'Kiflica' and 'Aragorn', two parents of an F 6 -derived recombinant inbred line (RIL) population. Mapping 1683 markers including 75 previously published markers and 1608 SNPs developed from the present study generated a linkage map of size 1310.1 cM. Comparative mapping with other legumes demonstrated that the highest level of synteny was observed between pea and the genome of Medicago truncatula. QTL analysis of the RIL population across two locations revealed at least one QTL for each of the mineral nutrient traits. In total, 46 seed mineral concentration QTLs, 37 seed mineral content QTLs, and 6 seed weight QTLs were discovered. The QTLs explained from 2.4% to 43.3% of the phenotypic variance. The genome-wide SNPs and the genetic linkage map developed in this study permitted QTL identification for pea seed mineral nutrients that will serve as important resources to enable marker-assisted selection (MAS) for nutritional quality traits in pea breeding programs.
Whole-genome association studies of alcoholism with loci linked to schizophrenia susceptibility.
Namkung, Junghyun; Kim, Youngchul; Park, Taesung
2005-12-30
Alcoholism is a complex disease. There have been many reports on significant comorbidity between alcoholism and schizophrenia. For the genetic study of complex diseases, association analysis has been recommended because of its higher power than that of the linkage analysis for detecting genes with modest effects on disease. To identify alcoholism susceptibility loci, we performed genome-wide single-nucleotide polymorphisms (SNP) association tests, which yielded 489 significant SNPs at the 1% significance level. The association tests showed that tsc0593964 (P-value 0.000013) on chromosome 7 was most significantly associated with alcoholism. From 489 SNPs, 74 genes were identified. Among these genes, GABRA1 is a member of the same gene family with GABRA2 that was recently reported as alcoholism susceptibility gene. By comparing 74 genes to the published results of various linkage studies of schizophrenia, we identified 13 alcoholism associated genes that were located in the regions reported to be linked to schizophrenia. These 13 identified genes can be important candidate genes to study the genetic mechanism of co-occurrence of both diseases.
Shanker, Jayashree; Perumal, Ganapathy; Rao, Veena S; Khadrinarasimhiah, Natesha B; John, Shibu; Hebbagodi, Sridhara; Mukherjee, Manjari; Kakkar, Vijay V
2008-01-01
Background The APOA1-C3-A5 gene cluster plays an important role in the regulation of lipids. Asian Indians have an increased tendency for abnormal lipid levels and high risk of Coronary Artery Disease (CAD). Therefore, the present study aimed to elucidate the relationship of four single nucleotide polymorphisms (SNPs) in the Apo11q cluster, namely the -75G>A, +83C>T SNPs in the APOA1 gene, the Sac1 SNP in the APOC3 gene and the S19W variant in the APOA5 gene to plasma lipids and CAD in 190 affected sibling pairs (ASPs) belonging to Asian Indian families with a strong CAD history. Methods & results Genotyping and lipid assays were carried out using standard protocols. Plasma lipids showed a strong heritability (h2 48% – 70%; P < 0.0001). A subset of 77 ASPs with positive sign of Logarithm of Odds (LOD) score showed significant linkage to CAD trait by multi-point analysis (LOD score 7.42, P < 0.001) and to Sac1 (LOD score 4.49) and -75G>A (LOD score 2.77) SNPs by single-point analysis (P < 0.001). There was significant proportion of mean allele sharing (pi) for the Sac1 (pi 0.59), -75G>A (pi 0.56) and +83C>T (pi 0.52) (P < 0.001) SNPs, respectively. QTL analysis showed suggestive evidence of linkage of the Sac1 SNP to Total Cholesterol (TC), High Density Lipoprotein-cholesterol (HDL-C) and Apolipoprotein B (ApoB) with LOD scores of 1.42, 1.72 and 1.19, respectively (P < 0.01). The Sac1 and -75G>A SNPs along with hypertension showed maximized correlations with TC, TG and Apo B by association analysis. Conclusion The APOC3-Sac1 SNP is an important genetic variant that is associated with CAD through its interaction with plasma lipids and other standard risk factors among Asian Indians. PMID:18801202
2010-01-01
Background The inability of aspirin (ASA) to adequately suppress platelet aggregation is associated with future risk of coronary artery disease (CAD). Heritability studies of agonist-induced platelet function phenotypes suggest that genetic variation may be responsible for ASA responsiveness. In this study, we leverage independent information from genome-wide linkage and association data to determine loci controlling platelet phenotypes before and after treatment with ASA. Methods Clinical data on 37 agonist-induced platelet function phenotypes were evaluated before and after a 2-week trial of ASA (81 mg/day) in 1231 European American and 846 African American healthy subjects with a family history of premature CAD. Principal component analysis was performed to minimize the number of independent factors underlying the covariance of these various phenotypes. Multi-point sib-pair based linkage analysis was performed using a microsatellite marker set, and single-SNP association tests were performed using markers from the Illumina 1 M genotyping chip from deCODE Genetics, Inc. All analyses were performed separately within each ethnic group. Results Several genomic regions appear to be linked to ASA response factors: a 10 cM region in African Americans on chromosome 5q11.2 had several STRs with suggestive (p-value < 7 × 10-4) and significant (p-value < 2 × 10-5) linkage to post aspirin platelet response to ADP, and ten additional factors had suggestive evidence for linkage (p-value < 7 × 10-4) to thirteen genomic regions. All but one of these factors were aspirin response variables. While the strength of genome-wide SNP association signals for factors showing evidence for linkage is limited, especially at the strict thresholds of genome-wide criteria (N = 9 SNPs for 11 factors), more signals were considered significant when the association signal was weighted by evidence for linkage (N = 30 SNPs). Conclusions Our study supports the hypothesis that platelet phenotypes in response to ASA likely have genetic control and the combined approach of linkage and association offers an alternative approach to prioritizing regions of interest for subsequent follow-up. PMID:20529293
Sasaki, Tomonari; Tahira, Tomoko; Suzuki, Akari; Higasa, Koichiro; Kukita, Yoji; Baba, Shingo; Hayashi, Kenshi
2001-01-01
We show that single-nucleotide polymorphisms (SNPs) of moderate to high heterozygosity (minor allele frequencies >10%) can be efficiently detected, and their allele frequencies accurately estimated, by pooling the DNA samples and applying a capillary-based SSCP analysis. In this method, alleles are separated into peaks, and their frequencies can be reliably and accurately quantified from their peak heights (SD <1.8%). We found that as many as 40% of publicly available SNPs that were analyzed by this method have widely differing allele frequency distributions among groups of different ethnicity (parents of Centre d'Etude Polymorphisme Humaine families vs. Japanese individuals). These results demonstrate the effectiveness of the present pooling method in the reevaluation of candidate SNPs that have been collected by examination of limited numbers of individuals. The method should also serve as a robust quantitative technique for studies in which a precise estimate of SNP allele frequencies is essential—for example, in linkage disequilibrium analysis. PMID:11083945
A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns.
De La Vega, Francisco M; Isaac, Hadar I; Scafe, Charles R
2006-01-01
The design of genetic association studies using single-nucleotide polymorphisms (SNPs) requires the selection of subsets of the variants providing high statistical power at a reasonable cost. SNPs must be selected to maximize the probability that a causative mutation is in linkage disequilibrium (LD) with at least one marker genotyped in the study. The HapMap project performed a genome-wide survey of genetic variation with about a million SNPs typed in four populations, providing a rich resource to inform the design of association studies. A number of strategies have been proposed for the selection of SNPs based on observed LD, including construction of metric LD maps and the selection of haplotype tagging SNPs. Power calculations are important at the study design stage to ensure successful results. Integrating these methods and annotations can be challenging: the algorithms required to implement these methods are complex to deploy, and all the necessary data and annotations are deposited in disparate databases. Here, we present the SNPbrowser Software, a freely available tool to assist in the LD-based selection of markers for association studies. This stand-alone application provides fast query capabilities and swift visualization of SNPs, gene annotations, power, haplotype blocks, and LD map coordinates. Wizards implement several common SNP selection workflows including the selection of optimal subsets of SNPs (e.g. tagging SNPs). Selected SNPs are screened for their conversion potential to either TaqMan SNP Genotyping Assays or the SNPlex Genotyping System, two commercially available genotyping platforms, expediting the set-up of genetic studies with an increased probability of success.
Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila
2017-04-01
Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs, and that they can detect highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations from China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, the result of the combination of this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed demonstrated that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed in all 40 nonbinary SNPs. Thus, we concluded that this new system consisting of new 20 nonbinary SNPs could provide highly informative polymorphic data which would be further used in forensic application and would serve as a potentially valuable supplement to forensic DNA analysis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays.
Apalasamy, Y D; Ming, M F; Rampal, S; Bulgiba, A; Mohamed, Z
2012-12-01
The common variants in the fat mass- and obesity-associated (FTO) gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese) Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D' = 1.0). In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays.
Genetic association of SNPs in the FTO gene and predisposition to obesity in Malaysian Malays
Apalasamy, Y.D.; Ming, M.F.; Rampal, S.; Bulgiba, A.; Mohamed, Z.
2012-01-01
The common variants in the fat mass- and obesity-associated (FTO) gene have been previously found to be associated with obesity in various adult populations. The objective of the present study was to investigate whether the single nucleotide polymorphisms (SNPs) and linkage disequilibrium (LD) blocks in various regions of the FTO gene are associated with predisposition to obesity in Malaysian Malays. Thirty-one FTO SNPs were genotyped in 587 (158 obese and 429 non-obese) Malaysian Malay subjects. Obesity traits and lipid profiles were measured and single-marker association testing, LD testing, and haplotype association analysis were performed. LD analysis of the FTO SNPs revealed the presence of 57 regions with complete LD (D' = 1.0). In addition, we detected the association of rs17817288 with low-density lipoprotein cholesterol. The FTO gene may therefore be involved in lipid metabolism in Malaysian Malays. Two haplotype blocks were present in this region of the FTO gene, but no particular haplotype was found to be significantly associated with an increased risk of obesity in Malaysian Malays. PMID:22911346
Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D
2016-07-07
High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. Copyright © 2016 Tsai et al.
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-04-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1-8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Hong, Yanbin; Pandey, Manish K; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K; Liang, Xuanqiang; Huang, Shangzhi
2015-01-01
The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. Presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than the EST-SNPs with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. Total 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of the total 33 SNPs, nine SNPs showed polymorphism in the mapping population Y13Zh, and seven SNPs were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut.
Evaluation of Bovine High-Density SNP Genotyping Array in Indigenous Dairy Cattle Breeds.
Dash, S; Singh, A; Bhatia, A K; Jayakumar, S; Sharma, A; Singh, S; Ganguly, I; Dixit, S P
2018-04-03
In total 52 samples of Sahiwal ( 19 ), Tharparkar ( 17 ), and Gir ( 16 ) were genotyped by using BovineHD SNP chip to analyze minor allele frequency (MAF), genetic diversity, and linkage disequilibrium among these cattle. The common SNPs of BovineHD and 54K SNP Chips were also extracted and evaluated for their performance. Only 40%-50% SNPs of these arrays was found informative for genetic analysis in these cattle breeds. The overall mean of MAF for SNPs of BovineHD SNPChip was 0.248 ± 0.006, 0.241 ± 0.007, and 0.242 ± 0.009 in Sahiwal, Tharparkar and Gir, respectively, while that for 54K SNPs was on lower side. The average Reynold's genetic distance between breeds ranged from 0.042 to 0.055 based on BovineHD Beadchip, and from 0.052 to 0.084 based on 54K SNP Chip. The estimates of genetic diversity based on HD and 54K chips were almost same and, hence, low density chip seems to be good enough to decipher genetic diversity of these cattle breeds. The linkage disequilibrium started decaying (r 2 < 0.2) at 140 kb inter-marker distance and, hence, a 20K low density customized SNP array from HD chip could be designed for genomic selection in these cattle else the 54K Bead Chip as such will be useful.
Khatkar, Mehar S.; Zenger, Kyall R.; Hobbs, Matthew; Hawken, Rachel J.; Cavanagh, Julie A. L.; Barris, Wes; McClintock, Alexander E.; McClintock, Sara; Thomson, Peter C.; Tier, Bruce; Nicholas, Frank W.; Raadsma, Herman W.
2007-01-01
Analysis of data on 1000 Holstein–Friesian bulls genotyped for 15,036 single-nucleotide polymorphisms (SNPs) has enabled genomewide identification of haplotype blocks and tag SNPs. A final subset of 9195 SNPs in Hardy–Weinberg equilibrium and mapped on autosomes on the bovine sequence assembly (release Btau 3.1) was used in this study. The average intermarker spacing was 251.8 kb. The average minor allele frequency (MAF) was 0.29 (0.05–0.5). Following recent precedents in human HapMap studies, a haplotype block was defined where 95% of combinations of SNPs within a region are in very high linkage disequilibrium. A total of 727 haplotype blocks consisting of ≥3 SNPs were identified. The average block length was 69.7 ± 7.7 kb, which is ∼5–10 times larger than in humans. These blocks comprised a total of 2964 SNPs and covered 50,638 kb of the sequence map, which constitutes 2.18% of the length of all autosomes. A set of tag SNPs, which will be useful for further fine-mapping studies, has been identified. Overall, the results suggest that as many as 75,000–100,000 tag SNPs would be needed to track all important haplotype blocks in the bovine genome. This would require ∼250,000 SNPs in the discovery phase. PMID:17435229
Fine mapping of the chromosome 10q11-q21 linkage region in Alzheimer's disease cases and controls.
Fallin, Margaret Daniele; Szymanski, Megan; Wang, Ruihua; Gherman, Adrian; Bassett, Susan S; Avramopoulos, Dimitrios
2010-07-01
We have previously reported strong linkage on chromosome 10q in pedigrees transmitting Alzheimer's disease through the mother, overlapping with many significant linkage reports including the largest reported study. Here, we report the most comprehensive fine mapping of this region to date. In a sample of 638 late-onset Alzheimer's disease (LOAD) cases and controls including 104 maternal LOAD cases, we genotyped 3,884 single nucleotide polymorphisms (SNPs) covering 15.2 Mb. We then used imputations and publicly available data to generate an extended dataset including 4,329 SNPs for 1,209 AD cases and 839 controls in the same region. Further, we screened eight genes in this region for rare alleles in 283 individuals by nucleotide sequencing, and we tested for possible monoallelic expression as it might underlie our maternal parent of origin linkage. We excluded the possibility of multiple rare coding risk variants for these genes and monoallelic expression when we could test for it. One SNP, rs10824310 in the PRKG1 gene, showed study-wide significant association without a parent of origin effect, but the effect size estimate is not of sufficient magnitude to explain the linkage, and no association is observed in an independent genome-wide association studies (GWAS) report. Further, no causative variants were identified though sequencing. Analysis of cases with maternal disease origin pointed to a few regions of interest that included the genes PRKG1 and PCDH15 and an intergenic interval of 200 Kb. It is likely that non-transcribed rare variants or other mechanisms involving these genomic regions underlie the observed linkage and parent of origin effect. Acquiring additional support and clarifying the mechanisms of such involvement is important for AD and other complex disorder genetics research.
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs
Cheng, Zhongshan; Chu, Hin; Fan, Yanhui; Li, Cun; Song, You-Qiang; Zhou, Jie; Yuen, Kwok-Yung
2015-01-01
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated a LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable; while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found cis-eQTLs, especially the population-shared cis-eQTLs are significantly enriched toward transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline Post-GWAS Explorer for Functional Indels and SNPs (PExFInS) has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets. PMID:26612672
Quantitative trait locus mapping of deep rooting by linkage and association analysis in rice.
Lou, Qiaojun; Chen, Liang; Mei, Hanwei; Wei, Haibin; Feng, Fangjun; Wang, Pei; Xia, Hui; Li, Tiemei; Luo, Lijun
2015-08-01
Deep rooting is a very important trait for plants' drought avoidance, and it is usually represented by the ratio of deep rooting (RDR). Three sets of rice populations were used to determine the genetic base for RDR. A linkage mapping population with 180 recombinant inbred lines and an association mapping population containing 237 rice varieties were used to identify genes linked to RDR. Six quantitative trait loci (QTLs) of RDR were identified as being located on chromosomes 1, 2, 4, 7, and 10. Using 1 019 883 single-nucleotide polymorphisms (SNPs), a genome-wide association study of the RDR was performed. Forty-eight significant SNPs of the RDR were identified and formed a clear peak on the short arm of chromosome 1 in a Manhattan plot. Compared with the shallow-rooting group and the whole collection, the deep-rooting group had selective sweep regions on chromosomes 1 and 2, especially in the major QTL region on chromosome 2. Seven of the nine candidate SNPs identified by association mapping were verified in two RDR extreme groups. The findings from this study will be beneficial to rice drought-resistance research and breeding. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Morgan, A R; Hamilton, G; Turic, D; Jehu, L; Harold, D; Abraham, R; Hollingworth, P; Moskvina, V; Brayne, C; Rubinsztein, D C; Lynch, A; Lawlor, B; Gill, M; O'Donovan, M; Powell, J; Lovestone, S; Williams, J; Owen, M J
2008-09-05
Late-onset Alzheimer's disease (LOAD) is a genetically complex neurodegenerative disorder. Currently, only the epsilon4 allele of the Apolipoprotein E gene has been identified unequivocally as a genetic susceptibility factor for LOAD. Others remain to be found. In 2002 we observed genome-wide significant evidence of linkage to a region on chromosome 10q11.23-q21.3 [Myers et al. (2002) Am J Med Genet 114:235-244]. Our objective in this study was to test every gene within the maximum LOD-1 linkage region, for association with LOAD. We obtained results for 528 SNPs from 67 genes, with an average density of 1 SNP every 10 kb within the genes. We demonstrated nominally significant association with LOAD for 4 SNPs: rs1881747 near DKK1 (P = 0.011, OR = 1.24), rs2279420 in ANK3 (P = 0.022, OR = 0.79), rs2306402 in CTNNA3 (P = 0.024, OR = 1.18), and rs5030882 in CXXC6 (P = 0.046, OR = 1.29) in 1,160 cases and 1,389 controls. These results would not survive correction for multiple testing but warrant attempts at confirmation in independent samples. 2007 Wiley-Liss, Inc.
Lack of association of two chromosome 10q24 SNPs with Alzheimer’s disease
Minster, Ryan L.; DeKosky, Steven T.; Kamboh, M. Ilyas
2006-01-01
Several groups have reported evidence of linkage on chromosome 10 to late-onset Alzheimer’s disease (LOAD). In a recent scan of single nucleotide polymorphisms (SNPs) on chromosome 10, significant associations between the rs498055 and rs4417206 SNPs and risk of LOAD were observed. We examined the association of these two SNPs with LOAD risk in a large Caucasian American cohort comprising about 2,000 cases and controls. Neither SNP revealed significant association with LOAD risk or age-at-onset. PMID:17000046
Genome-wide association study of rust traits in orchardgrass using SLAF-seq technology.
Zeng, Bing; Yan, Haidong; Liu, Xinchun; Zang, Wenjing; Zhang, Ailing; Zhou, Sifan; Huang, Linkai; Liu, Jinping
2017-01-01
While orchardgrass ( Dactylis glomerata L.) is a well-known perennial forage species, rust diseases cause serious reductions in the yield and quality of orchardgrass; however, genetic mechanisms of rust resistance are not well understood in orchardgrass. In this study, a genome-wide association study (GWAS) was performed using specific-locus amplified fragment sequencing (SLAF-seq) technology in orchardgrass. A total of 2,334,889 SLAF tags were generated to produce 2,309,777 SNPs. ADMIXTURE analysis revealed unstructured subpopulations for 33 accessions, indicating that this orchardgrass population could be used for association analysis. Linkage disequilibrium (LD) analysis revealed an average r 2 of 0.4 across all SNP pairs, indicating a high extent of LD in these samples. Through GWAS, a total of 4,604 SNPs were found to be significantly ( P < 0.01) associated with the rust trait. The bulk analysis discovered a number of 5,211 SNPs related to rust trait. Two candidate genes, including cytochrome P450, and prolamin were implicated in disease resistance through prediction of functional genes surrounding each high-quality SNP ( P < 0.01) associated with rust traits based on GWAS analysis and bulk analysis. The large number of SNPs associated with rust traits and these two candidate genes may provide the basis for further research on rust resistance mechanisms and marker-assisted selection (MAS) for rust-resistant lineages.
Polymorphisms within the FANCA gene associate with premature ovarian failure in Korean women.
Pyun, Jung-A; Kim, Sunshin; Cha, Dong Hyun; Kwack, KyuBum
2014-05-01
This study investigated whether polymorphisms within the Fanconi anemia complementation group A (FANCA) gene contribute to the increased risk of premature ovarian failure (POF) in Korean women. Ninety-eight women with POF and 218 controls participated in this study. Genomic DNA from peripheral blood was isolated, and GoldenGate genotyping assay was used to identify single nucleotide polymorphisms (SNPs) within the FANCA gene. Two significant SNPs (rs1006547 and rs2239359; P < 0.05) were identified by logistic regression analysis, but results were insignificant after Bonferroni correction. Six SNPs formed a linkage disequilibrium block, and three main haplotypes were found. Two of three haplotypes (AAAGAA and GGGAGG) distributed highly in the POF group, whereas the remaining haplotype (GGAAGG) distributed highly in the control group by logistic regression analysis (highest odds ratio, 2.515; 95% CI, 1.515-4.175; P = 0.00036). Our observations suggest that genetic variations in the FANCA gene may increase the risk for POF in Korean women.
Robinson, Nicholas; Baranski, Matthew; Mahapatra, Kanta Das; Saha, Jatindra Nath; Das, Sweta; Mishra, Jashobanta; Das, Paramananda; Kent, Matthew; Arnyasi, Mariann; Sahoo, Pramoda Kumar
2014-06-30
Production of carp dominates world aquaculture. More than 1.1 million tonnes of rohu carp, Labeo rohita (Hamilton), were produced in 2010. Aeromonas hydrophila is a bacterial pathogen causing aeromoniasis in rohu, and is a major problem for carp production worldwide. There is a need to better understand the genetic mechanisms affecting resistance to this disease, and to develop tools that can be used with selective breeding to improve resistance. Here we use a 6 K SNP array to genotype 21 full-sibling families of L. rohita that were experimentally challenged intra-peritoneally with a virulent strain of A. hydrophila to scan the genome for quantitative trait loci associated with disease resistance. In all, 3193 SNPs were found to be informative and were used to create a linkage map and to scan for QTL affecting resistance to A. hydrophila. The linkage map consisted of 25 linkage groups, corresponding to the number of haploid chromosomes in L. rohita. Male and female linkage maps were similar in terms of order, coverage (1384 and 1393 cM, respectively) and average interval distances (1.32 and 1.35 cM, respectively). Forty-one percent of the SNPs were annotated with gene identity using BLAST (cut off E-score of 0.001). Twenty-one SNPs mapping to ten linkage groups showed significant associations with the traits hours of survival and dead or alive (P <0.05 after Bonferroni correction). Of the SNPs showing significant or suggestive associations with the traits, several were homologous to genes of known immune function or were in close linkage to such genes. Genes of interest included heat shock proteins (70, 60, 105 and "small heat shock proteins"), mucin (5b precursor and 2), lectin (receptor and CD22), tributyltin-binding protein, major histocompatibility loci (I and II), complement protein component c7-1, perforin 1, ubiquitin (ligase, factor e4b isoform 2 and conjugation enzyme e2 c), proteasome subunit, T-cell antigen receptor and lymphocyte specific protein tyrosine kinase. A panel of markers has been identified that will be validated for use with both genomic and marker-assisted selection to improve resistance of L. rohita to A. hydrophila.
Valdisser, Paula Arielle M R; Pappas, Georgios J; de Menezes, Ivandilson P P; Müller, Bárbara S F; Pereira, Wendell J; Narciso, Marcelo G; Brondani, Claudio; Souza, Thiago L P O; Borba, Tereza C O; Vianello, Rosana P
2016-06-01
Researchers have made great advances into the development and application of genomic approaches for common beans, creating opportunities to driving more real and applicable strategies for sustainable management of the genetic resource towards plant breeding. This work provides useful polymorphic single-nucleotide polymorphisms (SNPs) for high-throughput common bean genotyping developed by RAD (restriction site-associated DNA) sequencing. The RAD tags were generated from DNA pooled from 12 common bean genotypes, including breeding lines of different gene pools and market classes. The aligned sequences identified 23,748 putative RAD-SNPs, of which 3357 were adequate for genotyping; 1032 RAD-SNPs with the highest ADT (assay design tool) score are presented in this article. The RAD-SNPs were structurally annotated in different coding (47.00 %) and non-coding (53.00 %) sequence components of genes. A subset of 384 RAD-SNPs with broad genome distribution was used to genotype a diverse panel of 95 common bean germplasms and revealed a successful amplification rate of 96.6 %, showing 73 % of polymorphic SNPs within the Andean group and 83 % in the Mesoamerican group. A slightly increased He (0.161, n = 21) value was estimated for the Andean gene pool, compared to the Mesoamerican group (0.156, n = 74). For the linkage disequilibrium (LD) analysis, from a group of 580 SNPs (289 RAD-SNPs and 291 BARC-SNPs) genotyped for the same set of genotypes, 70.2 % were in LD, decreasing to 0.10 %in the Andean group and 0.77 % in the Mesoamerican group. Haplotype patterns spanning 310 Mb of the genome (60 %) were characterized in samples from different origins. However, the haplotype frameworks were under-represented for the Andean (7.85 %) and Mesoamerican (5.55 %) gene pools separately. In conclusion, RAD sequencing allowed the discovery of hundreds of useful SNPs for broad genetic analysis of common bean germplasm. From now, this approach provides an excellent panel of molecular tools for whole genome analysis, allowing integrating and better exploring the common bean breeding practices.
Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M; Rodezno, Dayana C; Suarez, Carmen; Amores, Freddy; Feltus, Frank A; Mockaitis, Keithanne; Cornejo, Omar E; Motamayor, Juan C
2017-01-01
Cacao ( Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance.
Troggio, Michela; Šurbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. PMID:23826289
Thameem, Farook; Igo, Robert P; Freedman, Barry I; Langefeld, Carl; Hanson, Robert L; Schelling, Jeffrey R; Elston, Robert C; Duggirala, Ravindranath; Nicholas, Susanne B; Goddard, Katrina A B; Divers, Jasmin; Guo, Xiuqing; Ipp, Eli; Kimmel, Paul L; Meoni, Lucy A; Shah, Vallabh O; Smith, Michael W; Winkler, Cheryl A; Zager, Philip G; Knowler, William C; Nelson, Robert G; Pahl, Madeline V; Parekh, Rulan S; Kao, W H Linda; Rasooly, Rebekah S; Adler, Sharon G; Abboud, Hanna E; Iyengar, Sudha K; Sedor, John R
2013-01-01
Estimated glomerular filtration rate (eGFR), a measure of kidney function, is heritable, suggesting that genes influence renal function. Genes that influence eGFR have been identified through genome-wide association studies. However, family-based linkage approaches may identify loci that explain a larger proportion of the heritability. This study used genome-wide linkage and association scans to identify quantitative trait loci (QTL) that influence eGFR. Genome-wide linkage and sparse association scans of eGFR were performed in families ascertained by probands with advanced diabetic nephropathy (DN) from the multi-ethnic Family Investigation of Nephropathy and Diabetes (FIND) study. This study included 954 African Americans (AA), 781 American Indians (AI), 614 European Americans (EA) and 1,611 Mexican Americans (MA). A total of 3,960 FIND participants were genotyped for 6,000 single nucleotide polymorphisms (SNPs) using the Illumina Linkage IVb panel. GFR was estimated by the Modification of Diet in Renal Disease (MDRD) formula. The non-parametric linkage analysis, accounting for the effects of diabetes duration and BMI, identified the strongest evidence for linkage of eGFR on chromosome 20q11 (log of the odds [LOD] = 3.34; P = 4.4 × 10(-5)) in MA and chromosome 15q12 (LOD = 2.84; P = 1.5 × 10(-4)) in EA. In all subjects, the strongest linkage signal for eGFR was detected on chromosome 10p12 (P = 5.5 × 10(-4)) at 44 cM near marker rs1339048. A subsequent association scan in both ancestry-specific groups and the entire population identified several SNPs significantly associated with eGFR across the genome. The present study describes the localization of QTL influencing eGFR on 20q11 in MA, 15q21 in EA and 10p12 in the combined ethnic groups participating in the FIND study. Identification of causal genes/variants influencing eGFR, within these linkage and association loci, will open new avenues for functional analyses and development of novel diagnostic markers for DN.
Thameem, Farook; Igo, Robert P.; Freedman, Barry I.; Langefeld, Carl; Hanson, Robert L.; Schelling, Jeffrey R.; Elston, Robert C.; Duggirala, Ravindranath; Nicholas, Susanne B.; Goddard, Katrina A. B.; Divers, Jasmin; Guo, Xiuqing; Ipp, Eli; Kimmel, Paul L.; Meoni, Lucy A.; Shah, Vallabh O.; Smith, Michael W.; Winkler, Cheryl A.; Zager, Philip G.; Knowler, William C.; Nelson, Robert G.; Pahl, Madeline V.; Parekh, Rulan S.; Kao, W. H. Linda; Rasooly, Rebekah S.; Adler, Sharon G.; Abboud, Hanna E.; Iyengar, Sudha K.; Sedor, John R.
2013-01-01
Objective Estimated glomerular filtration rate (eGFR), a measure of kidney function, is heritable, suggesting that genes influence renal function. Genes that influence eGFR have been identified through genome-wide association studies. However, family-based linkage approaches may identify loci that explain a larger proportion of the heritability. This study used genome-wide linkage and association scans to identify quantitative trait loci (QTL) that influence eGFR. Methods Genome-wide linkage and sparse association scans of eGFR were performed in families ascertained by probands with advanced diabetic nephropathy (DN) from the multi-ethnic Family Investigation of Nephropathy and Diabetes (FIND) study. This study included 954 African Americans (AA), 781 American Indians (AI), 614 European Americans (EA) and 1,611 Mexican Americans (MA). A total of 3,960 FIND participants were genotyped for 6,000 single nucleotide polymorphisms (SNPs) using the Illumina Linkage IVb panel. GFR was estimated by the Modification of Diet in Renal Disease (MDRD) formula. Results The non-parametric linkage analysis, accounting for the effects of diabetes duration and BMI, identified the strongest evidence for linkage of eGFR on chromosome 20q11 (log of the odds [LOD] = 3.34; P = 4.4×10−5) in MA and chromosome 15q12 (LOD = 2.84; P = 1.5×10−4) in EA. In all subjects, the strongest linkage signal for eGFR was detected on chromosome 10p12 (P = 5.5×10−4) at 44 cM near marker rs1339048. A subsequent association scan in both ancestry-specific groups and the entire population identified several SNPs significantly associated with eGFR across the genome. Conclusion The present study describes the localization of QTL influencing eGFR on 20q11 in MA, 15q21 in EA and 10p12 in the combined ethnic groups participating in the FIND study. Identification of causal genes/variants influencing eGFR, within these linkage and association loci, will open new avenues for functional analyses and development of novel diagnostic markers for DN. PMID:24358131
Linkage and association study of late-onset Alzheimer disease families linked to 9p21.3.
Züchner, S; Gilbert, J R; Martin, E R; Leon-Guerrero, C R; Xu, P-T; Browning, C; Bronson, P G; Whitehead, P; Schmechel, D E; Haines, J L; Pericak-Vance, M A
2008-11-01
A chromosomal locus for late-onset Alzheimer disease (LOAD) has previously been mapped to 9p21.3. The most significant results were reported in a sample of autopsy-confirmed families. Linkage to this locus has been independently confirmed in AD families from a consanguineous Israeli-Arab community. In the present study we analyzed an expanded clinical sample of 674 late-onset AD families, independently ascertained by three different consortia. Sample subsets were stratified by site and autopsy-confirmation. Linkage analysis of a dense array of SNPs across the chromosomal locus revealed the most significant results in the 166 autopsy-confirmed families of the NIMH sample. Peak HLOD scores of 4.95 at D9S741 and 2.81 at the nearby SNP rs2772677 were obtained in a dominant model. The linked region included the cyclin-dependent kinase inhibitor 2A gene (CDKN2A), which has been suggested as an AD candidate gene. By re-sequencing all exons in the vicinity of CDKN2A in 48 AD cases, we identified and genotyped four novel SNPs, including a non-synonymous, a synonymous, and two variations located in untranslated RNA sequences. Family-based allelic and genotypic association analysis yielded significant results in CDKN2A (rs11515: PDT p = 0.003, genotype-PDT p = 0.014). We conclude that CDKN2A is a promising new candidate gene potentially contributing to AD susceptibility on chromosome 9p.
Muchero, Wellington; Diop, Ndeye N; Bhat, Prasanna R; Fenton, Raymond D; Wanamaker, Steve; Pottorff, Marti; Hearne, Sarah; Cisse, Ndiaga; Fatokun, Christian; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J
2009-10-27
Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
Dennis, J; Truong, V; Aïssi, D; Medina-Rivera, A; Blankenberg, S; Germain, M; Lemire, M; Antounians, L; Civelek, M; Schnabel, R; Wells, P; Wilson, M D; Morange, P-E; Trégouët, D-A; Gagnon, F
2016-10-01
Essentials Tissue factor pathway inhibitor (TFPI) regulates the blood coagulation cascade. We replicated previously reported linkage of TFPI plasma levels to the chromosome 2q region. The putative causal locus, rs62187992, was associated with TFPI plasma levels and thrombosis. rs62187992 was marginally associated with TFPI expression in human aortic endothelial cells. Click to hear Ann Gil's presentation on new insights into thrombin activatable fibrinolysis inhibitor SUMMARY: Background Tissue factor pathway inhibitor (TFPI) regulates fibrin clot formation, and low TFPI plasma levels increase the risk of arterial thromboembolism and venous thromboembolism (VTE). TFPI plasma levels are also heritable, and a previous linkage scan implicated the chromosome 2q region, but no specific genes. Objectives To replicate the finding of the linkage region in an independent sample, and to identify the causal locus. Methods We first performed a linkage analysis of microsatellite markers and TFPI plasma levels in 251 individuals from the F5L Family Study, and replicated the finding of the linkage peak on chromosome 2q (LOD = 3.06). We next defined a follow-up region that included 112 603 single nucleotide polymorphisms (SNPs) under the linkage peak, and meta-analyzed associations between these SNPs and TFPI plasma levels across the F5L Family Study and the Marseille Thrombosis Association (MARTHA) Study, a study of 1033 unrelated VTE patients. SNPs with false discovery rate q-values of < 0.10 were tested for association with TFPI plasma levels in 892 patients with coronary artery disease in the AtheroGene Study. Results and Conclusions One SNP, rs62187992, was associated with TFPI plasma levels in all three samples (β = + 0.14 and P = 4.23 × 10 -6 combined; β = + 0.16 and P = 0.02 in the F5L Family Study; β = + 0.13 and P = 6.3 × 10 -4 in the MARTHA Study; β = + 0.17 and P = 0.03 in the AtheroGene Study), and contributed to the linkage peak in the F5L Family Study. rs62187992 was also associated with clinical VTE (odds ratio 0.90, P = 0.03) in the INVENT Consortium of > 7000 cases and their controls, and was marginally associated with TFPI expression (β = + 0.19, P = 0.08) in human aortic endothelial cells, a primary site of TFPI synthesis. The biological mechanisms underlying these associations remain to be elucidated. © 2016 International Society on Thrombosis and Haemostasis.
Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Schlautman, Brandon; Deutsch, Joseph; Salazar, Walter; Hernandez-Ochoa, Miguel; Grygleski, Edward; Steffan, Shawn; Iorizzo, Massimo; Polashock, James; Vorsa, Nicholi; Zalapa, Juan
2016-06-13
The application of genotyping by sequencing (GBS) approaches, combined with data imputation methodologies, is narrowing the genetic knowledge gap between major and understudied, minor crops. GBS is an excellent tool to characterize the genomic structure of recently domesticated (~200 years) and understudied species, such as cranberry (Vaccinium macrocarpon Ait.), by generating large numbers of markers for genomic studies such as genetic mapping. We identified 10842 potentially mappable single nucleotide polymorphisms (SNPs) in a cranberry pseudo-testcross population wherein 5477 SNPs and 211 short sequence repeats (SSRs) were used to construct a high density linkage map in cranberry of which a total of 4849 markers were mapped. Recombination frequency, linkage disequilibrium (LD), and segregation distortion at the genomic level in the parental and integrated linkage maps were characterized for first time in cranberry. SSR markers, used as the backbone in the map, revealed high collinearity with previously published linkage maps. The 4849 point map consisted of twelve linkage groups spanning 1112 cM, which anchored 2381 nuclear scaffolds accounting for ~13 Mb of the estimated 470 Mb cranberry genome. Bin mapping identified 592 and 672 unique bins in the parentals and a total of 1676 unique marker positions in the integrated map. Synteny analyses comparing the order of anchored cranberry scaffolds to their homologous positions in kiwifruit, grape, and coffee genomes provided initial evidence of homology between cranberry and closely related species. GBS data was used to rapidly saturate the cranberry genome with markers in a pseudo-testcross population. Collinearity between the present saturated genetic map and previous cranberry SSR maps suggests that the SNP locations represent accurate marker order and chromosome structure of the cranberry genome. SNPs greatly improved current marker genome coverage, which allowed for genome-wide structure investigations such as segregation distortion, recombination, linkage disequilibrium, and synteny analyses. In the future, GBS can be used to accelerate cranberry molecular breeding through QTL mapping and genome-wide association studies (GWAS).
Grant, Audrey V.; El Baghdadi, Jamila; Sabri, Ayoub; El Azbaoui, Safa; Alaoui-Tahiri, Kebir; Abderrahmani Rhorfi, Ismail; Gharbaoui, Yasser; Abid, Ahmed; Benkirane, Majid; Raharimanga, Vaomalala; Richard, Vincent; Orlova, Marianna; Boland, Anne; Migaud, Mélanie; Okada, Satoshi; Nolan, Daniel K.; Bustamante, Jacinta; Barreiro, Luis B.; Schurr, Erwin; Boisson-Dupuis, Stephanie; Rasolofo, Voahangy; Casanova, Jean-Laurent; Abel, Laurent
2013-01-01
Only a small fraction of individuals infected with Mycobacterium tuberculosis develop clinical tuberculosis (TB) in their lifetime. Genetic epidemiological evidence suggests a genetic determinism of pulmonary TB (PTB), but the molecular basis of genetic predisposition to PTB remains largely unknown. We used a positional-cloning approach to carry out ultrafine linkage-disequilibrium mapping of a previously identified susceptibility locus in chromosomal region 8q12–13 by genotyping 3,216 SNPs in a family-based Moroccan sample including 286 offspring with PTB. We observed 44 PTB-associated SNPs (p < 0.01), which were genotyped in an independent set of 317 cases and 650 controls from Morocco. A single signal, consisting of two correlated SNPs close to TOX, rs1568952 and rs2726600 (combined p = 1.1 × 10−5 and 9.2 × 10−5, respectively), was replicated. Stronger evidence of association was found in individuals who developed PTB before the age of 25 years (combined p for rs1568952 = 4.4 × 10−8; odds ratio of PTB for AA versus AG/GG = 3.09 [1.99–4.78]). The association with rs2726600 (p = 0.04) was subsequently replicated in PTB-affected subjects under 25 years in a study of 243 nuclear families from Madagascar. Stronger evidence of replication in Madagascar was obtained for additional SNPs in strong linkage disequilibrium with the two initial SNPs (p = 0.003 for rs2726597), further confirming the signal. We thus identified around rs1568952 and rs2726600 a cluster of SNPs strongly associated with early-onset PTB in Morocco and Madagascar. SNP rs2726600 is located in a transcription-factor binding site in the 3′ region of TOX, and further functional explorations will focus on CD4 T lymphocytes. PMID:23415668
Zhang, N N; Hu, J W; Liu, H H; Xu, H Y; He, H; Li, L
2015-12-29
Tyrosinase, encoded by the TYR gene, is the rate-limiting enzyme in the production of melanin pigment. In this study, plumage color separation was observed in Cherry Valley duck line D and F1 and F2 hybrid generations of Liancheng white ducks. Gene sequencing and bioinformatic analysis were applied to the 5'-regulatory region of TYR, to explore the connection between TYR sequence variation and duck plumage color. Four SNPs were found in the 5'-regulatory region. The SNPs were in tight linkage and formed three haplotypes. However, the genotype distribution in groups with different plumage color was not significantly different, and there were no changes in the transcription factor binding sites between the different genotypes. In conclusion, these SNP variations may not cause the differences in feather color observed in this test group.
Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai
2014-01-01
The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047
Freedman, Jennifer A; Wang, Yanru; Li, Xuechan; Liu, Hongliang; Moorman, Patricia G; George, Daniel J; Lee, Norman H; Hyslop, Terry; Wei, Qingyi; Patierno, Steven R
2018-05-03
Prostate cancer is a clinically and molecularly heterogeneous disease, with variation in outcomes only partially predicted by grade and stage. Additional tools to distinguish indolent from aggressive disease are needed. Phenotypic characteristics of stemness correlate with poor cancer prognosis. Given this correlation, we identified single nucleotide polymorphisms (SNPs) of stemness-related genes and examined their associations with prostate cancer survival. SNPs within stemness-related genes were analyzed for association with overall survival of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Significant SNPs predicted to be functional were selected for linkage disequilibrium analysis and combined and stratified analyses. Identified SNPs were evaluated for association with gene expression. SNPs of CD44 (rs9666607), ABCC1 (rs35605 and rs212091) and GDF15 (rs1058587) were associated with prostate cancer survival and predicted to be functional. A role for rs9666607 of CD44 and rs35605 of ABCC1 in RNA splicing regulation, rs212091 of ABCC1 in miRNA binding site activity and rs1058587 of GDF15 in causing an amino acid change was predicted. These SNPs represent potential novel prognostic markers for overall survival of prostate cancer and support a contribution of the stemness pathway to prostate cancer patient outcome.
Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay
2013-09-23
Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
SNP-markers in Allium species to facilitate introgression breeding in onion.
Scholten, Olga E; van Kaauwen, Martijn P W; Shahin, Arwa; Hendrickx, Patrick M; Keizer, L C Paul; Burger, Karin; van Heusden, Adriaan W; van der Linden, C Gerard; Vosman, Ben
2016-08-31
Within onion, Allium cepa L., the availability of disease resistance is limited. The identification of sources of resistance in related species, such as Allium roylei and Allium fistulosum, was a first step towards the improvement of onion cultivars by breeding. SNP markers linked to resistance and polymorphic between these related species and onion cultivars are a valuable tool to efficiently introgress disease resistance genes. In this paper we describe the identification and validation of SNP markers valuable for onion breeding. Transcriptome sequencing resulted in 192 million RNA seq reads from the interspecific F1 hybrid between A. roylei and A. fistulosum (RF) and nine onion cultivars. After assembly, reliable SNPs were discovered in about 36 % of the contigs. For genotyping of the interspecific three-way cross population, derived from a cross between an onion cultivar and the RF (CCxRF), 1100 SNPs that are polymorphic in RF and monomorphic in the onion cultivars (RF SNPs) were selected for the development of KASP assays. A molecular linkage map based on 667 RF-SNP markers was constructed for CCxRF. In addition, KASP assays were developed for 1600 onion-SNPs (SNPs polymorphic among onion cultivars). A second linkage map was constructed for an F2 of onion x A. roylei (F2(CxR)) that consisted of 182 onion-SNPs and 119 RF-SNPs, and 76 previously mapped markers. Markers co-segregating in both the F2(CxR) and the CCxRF population were used to assign the linkage groups of RF to onion chromosomes. To validate usefulness of these SNP markers, QTL mapping was applied in the CCxRF population that segregates for resistance to Botrytis squamosa and resulted in a QTL for resistance on chromosome 6 of A. roylei. Our research has more than doubled the publicly available marker sequences of expressed onion genes and two onion-related species. It resulted in a detailed genetic map for the interspecific CCxRF population. This is the first paper that reports the detection of a QTL for resistance to B. squamosa in A. roylei.
Leak, Tennille S.; Keene, Keith L.; Langefeld, Carl D.; Gallagher, Carla J.; Mychaleckyj, Josyf C.; Freedman, Barry I.; Bowden, Donald W.; Rich, Stephen S.; Sale, Michèle M.
2009-01-01
In a genome-wide scan for type 2 diabetes (T2DM) in African American (AA) families, ordered subsets analysis (OSA) provided evidence for linkage to chromosome 20p in a subset with later age at diagnosis (max. LOD 2.57, P = 0.008). The proprotein convertase subtilisin/kexin-type 2 (PCSK2) gene is within the LOD-1 interval of this linkage peak. Twenty-nine single nucleotide polymorphisms (SNPs) were genotyped across this gene in 380 unrelated AA individuals with T2DM and end-stage renal disease (T2DM-ESRD), 278 AA controls, 96 European Americans (EA) and 120 Yoruba Nigerian (YRI) controls. In addition, 22 ancestry-informative markers (AIMs) were genotyped in all AA subjects, 120 YRI, and 96 EA controls. ADMIXMAP was used to model the distributions of admixture and generate score tests of allelic and haplotypic association. Association with T2DM was observed among 4 SNPs: rs2021785 (admixture-adjusted Pa = 0.00014), rs1609659 (Pa = 0.028), rs4814597 (Pa = 0.039) and rs2269023 (Pa = 0.043). None of the PCSK2 SNPs were associated with age at T2DM diagnosis. A variant in the PCKS2 gene, rs2021785, appears to play a role in susceptibility to T2DM in this AA population. PMID:17618154
Zhao, Lan-Juan; Guo, Yan-Fang; Xiong, Dong-Hai; Xiao, Peng; Recker, Robert R; Deng, Hong-Wen
2006-11-01
In light of findings that osteoporosis and obesity may share some common genetic determination and previous reports that RANK (receptor activator of nuclear factor-kappaB) is expressed in skeletal muscles which are important for energy metabolism, we hypothesize that RANK, a gene essential for osteoclastogenesis, is also important for obesity. In order to test the hypothesis with solid data we first performed a linkage analysis around the RANK gene in 4,102 Caucasian subjects from 434 pedigrees, then we genotyped 19 SNPs in or around the RANK gene. A family-based association test (FBAT) was performed with both a quantitative measure of obesity [fat mass, lean mass, body mass index (BMI), and percentage fat mass (PFM)] and a dichotomously defined obesity phenotype-OB (OB if BMI > or = 30 kg/m(2)). In the linkage analysis, an empirical P = 0.004 was achieved at the location of the RANK gene for BMI. Family-based association analysis revealed significant associations of eight SNPs with at least one obesity-related phenotype (P < 0.05). Evidence of association was obtained at SNP10 (P = 0.002) and SNP16 (P = 0.001) with OB; SNP1 with fat mass (P = 0.003); SNP1 (P = 0.003) and SNP7 (P = 0.003) with lean mass; SNP1 (P = 0.002) and SNP7 (P = 0.002) with BMI; SNP1 (P = 0.003), SNP4 (P = 0.007), and SNP7 (P = 0.002) with PFM. In order to deal with the complex multiple testing issues, we performed FBAT multi-marker test (FBAT-MM) to evaluate the association between all the 18 SNPs and each obesity phenotype. The P value is 0.126 for OB, 0.033 for fat mass, 0.021 for lean mass, 0.016 for BMI, and 0.006 for PFM. The haplotype data analyses provide further association evidence. In conclusion, for the first time, our results suggest that RANK is a novel candidate for determination of obesity.
Chen, Qianqian; Wang, Hong; Schwender, Holger; Zhang, Tianxiao; Hetmanski, Jacqueline B; Chou, Yah-Huei Wu; Ye, Xiaoqian; Yeow, Vincent; Chong, Samuel S; Zhang, Bo; Jabs, Ethylin Wang; Parker, Margaret M; Scott, Alan F; Beaty, Terri H
2014-01-01
Non-syndromic cleft lip with or without cleft palate (NSCL/P) is a common disorder with complex etiology. The Bone Morphogenetic Protein 4 gene (BMP4) has been considered a prime candidate gene with evidence accumulated from animal experimental studies, human linkage studies, as well as candidate gene association studies. The aim of the current study is to test for linkage and association between BMP4 and NSCL/P that could be missed in genome-wide association studies (GWAS) when genotypic (G) main effects alone were considered. We performed the analysis considering G and interactions with multiple maternal environmental exposures using additive conditional logistic regression models in 895 Asian and 681 European complete NSCL/P trios. Single nucleotide polymorphisms (SNPs) that passed the quality control criteria among 122 genotyped and 25 imputed single nucleotide variants in and around the gene were used in analysis. Selected maternal environmental exposures during 3 months prior to and through the first trimester of pregnancy included any personal tobacco smoking, any environmental tobacco smoke in home, work place or any nearby places, any alcohol consumption and any use of multivitamin supplements. A novel significant association held for rs7156227 among Asian NSCL/P and non-syndromic cleft lip and palate (NSCLP) trios after Bonferroni correction which was not seen when G main effects alone were considered in either allelic or genotypic transmission disequilibrium tests. Odds ratios for carrying one copy of the minor allele without maternal exposure to any of the four environmental exposures were 0.58 (95%CI = 0.44, 0.75) and 0.54 (95%CI = 0.40, 0.73) for Asian NSCL/P and NSCLP trios, respectively. The Bonferroni P values corrected for the total number of 117 tested SNPs were 0.0051 (asymptotic P = 4.39*10(-5)) and 0.0065 (asymptotic P = 5.54*10(-5)), accordingly. In European trios, no significant association was seen for any SNPs after Bonferroni corrections for the total number of 120 tested SNPs. Our findings add evidence from GWAS to support the role of BMP4 in susceptibility to NSCL/P originally identified in linkage and candidate gene association studies.
Kim, Kyung-Seon; Kim, Ghi-Su; Hwang, Joo-Yeon; Lee, Hye-Ja; Park, Mi-Hyun; Kim, Kwang-joong; Jung, Jongsun; Cha, Hyo-Soung; Shin, Hyoung Doo; Kang, Jong-Ho; Park, Eui Kyun; Kim, Tae-Ho; Hong, Jung-Min; Koh, Jung-Min; Oh, Bermseok; Kimm, Kuchan; Kim, Shin-Yoon; Lee, Jong-Young
2007-01-01
Background Osteoporosis is defined as the loss of bone mineral density that leads to bone fragility with aging. Population-based case-control studies have identified polymorphisms in many candidate genes that have been associated with bone mass maintenance or osteoporotic fracture. To investigate single nucleotide polymorphisms (SNPs) that are associated with osteoporosis, we examined the genetic variation among Koreans by analyzing 81 genes according to their function in bone formation and resorption during bone remodeling. Methods We resequenced all the exons, splice junctions and promoter regions of candidate osteoporosis genes using 24 unrelated Korean individuals. Using the common SNPs from our study and the HapMap database, a statistical analysis of deviation in heterozygosity depicted. Results We identified 942 variants, including 888 SNPs, 43 insertion/deletion polymorphisms, and 11 microsatellite markers. Of the SNPs, 557 (63%) had been previously identified and 331 (37%) were newly discovered in the Korean population. When compared SNPs in the Korean population with those in HapMap database, 1% (or less) of SNPs in the Japanese and Chinese subpopulations and 20% of those in Caucasian and African subpopulations were significantly differentiated from the Hardy-Weinberg expectations. In addition, an analysis of the genetic diversity showed that there were no significant differences among Korean, Han Chinese and Japanese populations, but African and Caucasian populations were significantly differentiated in selected genes. Nevertheless, in the detailed analysis of genetic properties, the LD and Haplotype block patterns among the five sub-populations were substantially different from one another. Conclusion Through the resequencing of 81 osteoporosis candidate genes, 118 unknown SNPs with a minor allele frequency (MAF) > 0.05 were discovered in the Korean population. In addition, using the common SNPs between our study and HapMap, an analysis of genetic diversity and deviation in heterozygosity was performed and the polymorphisms of the above genes among the five populations were substantially differentiated from one another. Further studies of osteoporosis could utilize the polymorphisms identified in our data since they may have important implications for the selection of highly informative SNPs for future association studies. PMID:18036257
Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M.; Rodezno, Dayana C.; Suarez, Carmen; Amores, Freddy; Feltus, Frank A.; Mockaitis, Keithanne; Cornejo, Omar E.; Motamayor, Juan C.
2017-01-01
Cacao (Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance. PMID:29259608
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection.
Shi, Yuan Yuan; Sun, Liang Xian; Huang, Zachary Y; Wu, Xiao Bo; Zhu, Yong Qiang; Zheng, Hua Jun; Zeng, Zhi Jiang
2013-01-01
The Eastern honey bee, Apis cerana Fabricius, is distributed in southern and eastern Asia, from India and China to Korea and Japan and southeast to the Moluccas. This species is also widely kept for honey production besides Apis mellifera. Apis cerana is also a model organism for studying social behavior, caste determination, mating biology, sexual selection, and host-parasite interactions. Few resources are available for molecular research in this species, and a linkage map was never constructed. A linkage map is a prerequisite for quantitative trait loci mapping and for analyzing genome structure. We used the Chinese honey bee, Apis cerana cerana to construct the first linkage map in the Eastern honey bee. F2 workers (N = 103) were genotyped for 126,990 single nucleotide polymorphisms (SNPs). After filtering low quality and those not passing the Mendel test, we obtained 3,000 SNPs, 1,535 of these were informative and used to construct a linkage map. The preliminary map contains 19 linkage groups, we then mapped the 19 linkage groups to 16 chromosomes by comparing the markers to the genome of A. mellfiera. The final map contains 16 linkage groups with a total of 1,535 markers. The total genetic distance is 3,942.7 centimorgans (cM) with the largest linkage group (180 loci) measuring 574.5 cM. Average marker interval for all markers across the 16 linkage groups is 2.6 cM. We constructed a high density linkage map for A. c. cerana with 1,535 markers. Because the map is based on SNP markers, it will enable easier and faster genotyping assays than randomly amplified polymorphic DNA or microsatellite based maps used in A. mellifera.
Association Analysis of WNT10B With Bone Mass and Structure Among Individuals of African Ancestry
Zmuda, Joseph M; Yerges, Laura M; Kammerer, Candace M; Cauley, Jane A; Wang, Xiaojing; Nestlerode, Cara S; Wheeler, Victor W; Patrick, Alan L; Bunker, ClareAnn H; Moffett, Susan P; Ferrell, Robert E
2009-01-01
Wnts comprise a family of secreted growth factors that regulate the development and maintenance of many organs. Recently, Wnt10b was shown to stimulate osteoblastogenesis and bone formation in mice. To evaluate further the role of Wnt10b in bone health in humans, we performed bidirectional sequencing of ∼8 kb of the WNT10B gene region in 192 individuals (96 African, 96 white) to identify single nucleotide polymorphisms (SNPs). We identified 19 SNPs with minor allele frequency (MAF) ≥0.01. Ten of these SNPs were not present in the NCBI dbSNP database (build 127), whereas 10 of the 20 SNPs (50%) reported in dbSNP were not verified. We initially genotyped seven tagging SNPs that captured common (MAF ≥ 0.05) variation in the region with r 2 > 0.80 and a potentially functional SNP in exon 5 in 1035 Afro-Caribbean men ≥40 yr of age. Association analysis showed three SNPs in a 3′ region of linkage disequilibrium that were associated with DXA measures of hip BMD. Associations between two of these three SNPs (rs1051886, rs3741627) with hip BMD were replicated in an additional 980 Afro-Caribbean men (p < 0.05), in the combined sample of 2015 men (p ≤ 0.006), and in 416 individuals ≥18 yr of age (mean, 44 yr) belonging to eight extended, multigenerational Afro-Caribbean families with mean family size >50 (3535 relative pairs; p < 0.05). Further analysis showed that rs1051886 and rs3741627 were associated with cortical cross-sectional area, periosteal circumference, and BMC in the radius, such that individuals with the minor alleles had lower biomechanical indices of long-bone bending strength. This analysis implicates the WNT10B locus as a genetic element in the regulation of bone mass and structural geometry. PMID:19016593
Association analysis of WNT10B with bone mass and structure among individuals of African ancestry.
Zmuda, Joseph M; Yerges, Laura M; Kammerer, Candace M; Cauley, Jane A; Wang, Xiaojing; Nestlerode, Cara S; Wheeler, Victor W; Patrick, Alan L; Bunker, ClareAnn H; Moffett, Susan P; Ferrell, Robert E
2009-03-01
Wnts comprise a family of secreted growth factors that regulate the development and maintenance of many organs. Recently, Wnt10b was shown to stimulate osteoblastogenesis and bone formation in mice. To evaluate further the role of Wnt10b in bone health in humans, we performed bidirectional sequencing of approximately 8 kb of the WNT10B gene region in 192 individuals (96 African, 96 white) to identify single nucleotide polymorphisms (SNPs). We identified 19 SNPs with minor allele frequency (MAF) > or =0.01. Ten of these SNPs were not present in the NCBI dbSNP database (build 127), whereas 10 of the 20 SNPs (50%) reported in dbSNP were not verified. We initially genotyped seven tagging SNPs that captured common (MAF > or = 0.05) variation in the region with r (2) > 0.80 and a potentially functional SNP in exon 5 in 1035 Afro-Caribbean men > or =40 yr of age. Association analysis showed three SNPs in a 3' region of linkage disequilibrium that were associated with DXA measures of hip BMD. Associations between two of these three SNPs (rs1051886, rs3741627) with hip BMD were replicated in an additional 980 Afro-Caribbean men (p < 0.05), in the combined sample of 2015 men (p < or = 0.006), and in 416 individuals > or =18 yr of age (mean, 44 yr) belonging to eight extended, multigenerational Afro-Caribbean families with mean family size >50 (3535 relative pairs; p < 0.05). Further analysis showed that rs1051886 and rs3741627 were associated with cortical cross-sectional area, periosteal circumference, and BMC in the radius, such that individuals with the minor alleles had lower biomechanical indices of long-bone bending strength. This analysis implicates the WNT10B locus as a genetic element in the regulation of bone mass and structural geometry.
USDA-ARS?s Scientific Manuscript database
Large datasets containing single nucleotide polymorphisms (SNPs) are used to analyze genome-wide diversity in a robust collection of cultivars from representative accessions, across the world. The extent of linkage disequilibrium (LD) within a population determines the number of markers required fo...
Rommelse, Nanda N.J.; Arias-Vásquez, Alejandro; Altink, Marieke E.; Buschgens, Cathelijne J.M.; Fliers, Ellen; Asherson, Philip; Faraone, Stephen V.; Buitelaar, Jan K.; Sergeant, Joseph A.; Oosterlaan, Jaap; Franke, Barbara
2008-01-01
ADHD linkage findings have not all been consistently replicated, suggesting that other approaches to linkage analysis in ADHD might be necessary, such as the use of (quantitative) endophenotypes (heritable traits associated with an increased risk for ADHD). Genome-wide linkage analyses were performed in the Dutch subsample of the International Multi-Center ADHD Genetics (IMAGE) study comprising 238 DSM-IV combined-type ADHD probands and their 112 affected and 195 nonaffected siblings. Eight candidate neuropsychological ADHD endophenotypes with heritabilities > 0.2 were used as quantitative traits. In addition, an overall component score of neuropsychological functioning was used. A total of 5407 autosomal single-nucleotide polymorphisms (SNPs) were used to run multipoint regression-based linkage analyses. Two significant genome-wide linkage signals were found, one for Motor Timing on chromosome 2q21.1 (LOD score: 3.944) and one for Digit Span on 13q12.11 (LOD score: 3.959). Ten suggestive linkage signals were found (LOD scores ≥ 2) on chromosomes 2p, 2q, 3p, 4q, 8q, 12p, 12q, 14q, and 17q. The suggestive linkage signal for the component score that was found at 2q14.3 (LOD score: 2.878) overlapped with the region significantly linked to Motor Timing. Endophenotype approaches may increase power to detect susceptibility loci in ADHD and possibly in other complex disorders. PMID:18599010
Ichikawa, Shoji; Koller, Daniel L.; Peacock, Munro; Johnson, Michelle L.; Lai, Dongbing; Hui, Siu L.; Johnston, C. Conrad; Foroud, Tatiana M.; Econs, Michael J.
2007-01-01
Context A major determinant of osteoporotic fractures is peak bone mineral density (BMD), which is a highly heritable trait. Recently, we identified significant linkage for hip BMD in premenopausal sister pairs at chromosome 14q (LOD score = 3.5), where the estrogen receptor β gene (ESR2) is located. Objective The objective of the study was to determine whether ESR2 polymorphisms are associated with normal BMD variation. Design This was a population‐based genetic association study, using 11 single nucleotide polymorphisms (SNPs) distributed across the ESR2 gene. Setting The study was conducted at an academic research laboratory and medical center. Patients and Other Participants A total of 411 healthy men (aged 18–61 yr) and 1291 healthy premenopausal women (aged 20–50 yr) living in Indiana participated in the study. Intervention(s) There were no interventions. Main Outcome Measure(s) The main outcome measures were SNP genotype distributions and their association with BMD at the femoral neck and lumbar spine. Results Significant association of spine BMD was found with three SNPs in men and one SNP in women (P ≤ 0.05). The conditional linkage analysis using the ESR2 haplotypes showed that the ESR2 gene accounts for, at most, 18% of the original linkage. Conclusions ESR2 polymorphisms are significantly associated with bone mass in both men and women. However, the ESR2 gene is not entirely responsible for our original linkage, and an additional gene(s) in chromosome 14q contributes to the determination of BMD. PMID:16118344
Li, Hongkai; Yuan, Zhongshang; Ji, Jiadong; Xu, Jing; Zhang, Tao; Zhang, Xiaoshuai; Xue, Fuzhong
2016-03-09
We propose a novel Markov Blanket-based repeated-fishing strategy (MBRFS) in attempt to increase the power of existing Markov Blanket method (DASSO-MB) and maintain its advantages in omic data analysis. Both simulation and real data analysis were conducted to assess its performances by comparing with other methods including χ(2) test with Bonferroni and B-H adjustment, least absolute shrinkage and selection operator (LASSO) and DASSO-MB. A serious of simulation studies showed that the true discovery rate (TDR) of proposed MBRFS was always close to zero under null hypothesis (odds ratio = 1 for each SNPs) with excellent stability in all three scenarios of independent phenotype-related SNPs without linkage disequilibrium (LD) around them, correlated phenotype-related SNPs without LD around them, and phenotype-related SNPs with strong LD around them. As expected, under different odds ratio and minor allel frequency (MAFs), MBRFS always had the best performances in capturing the true phenotype-related biomarkers with higher matthews correlation coefficience (MCC) for all three scenarios above. More importantly, since proposed MBRFS using the repeated fishing strategy, it still captures more phenotype-related SNPs with minor effects when non-significant phenotype-related SNPs emerged under χ(2) test after Bonferroni multiple correction. The various real omics data analysis, including GWAS data, DNA methylation data, gene expression data and metabolites data, indicated that the proposed MBRFS always detected relatively reasonable biomarkers. Our proposed MBRFS can exactly capture the true phenotype-related biomarkers with the reduction of false negative rate when the phenotype-related biomarkers are independent or correlated, as well as the circumstance that phenotype-related biomarkers are associated with non-phenotype-related ones.
Zhang, Wensheng; Edwards, Andrea; Zhu, Dongxiao; Flemington, Erik K.; Deininger, Prescott; Zhang, Kun
2012-01-01
In metazoans, miRNAs regulate gene expression primarily through binding to target sites in the 3′ UTRs (untranslated regions) of messenger RNAs (mRNAs). Cis-acting variants within, or close to, a gene are crucial in explaining the variability of gene expression measures. Single nucleotide polymorphisms (SNPs) in the 3′ UTRs of genes can affect the base-pairing between miRNAs and mRNAs, and hence disrupt existing target sites (in the reference sequence) or create novel target sites, suggesting a possible mechanism for cis regulation of gene expression. Moreover, because the alleles of different SNPs within a DNA sequence of limited length tend to be in strong linkage disequilibrium (LD), we hypothesize the variants of miRNA target sites caused by SNPs potentially function as bridges linking the documented cis-SNP markers to the expression of the associated genes. A large-scale analysis was herein performed to test this hypothesis. By systematically integrating multiple latest information sources, we found 21 significant gene-level SNP-involved miRNA-mediated post-transcriptional regulation modules (SNP-MPRMs) in the form of SNP-miRNA-mRNA triplets in lymphocyte cell lines for the CEU and YRI populations. Among the cognate genes, six including ALG8, DGKE, GNA12, KLF11, LRPAP1, and MMAB are related to multiple genetic diseases such as depressive disorder and Type-II diabetes. Furthermore, we found that ∼35% of the documented transcript intensity-related cis-SNPs (∼950) in a recent publication are identical to, or in significant linkage disequilibrium (LD) (p<0.01) with, one or multiple SNPs located in miRNA target sites. Based on these associations (or identities), 69 significant exon-level SNP-MPRMs and 12 disease genes were further determined for two populations. These results provide concrete in silico evidence for the proposed hypothesis. The discovered modules warrant additional follow-up in independent laboratory studies. PMID:22348086
NASA Astrophysics Data System (ADS)
Satyanarayana, M.; Kumar, V. Sunil; Gobi, K. Vengatajalabathy
2016-04-01
In this research, silver nanoparticles (SNPs) are prepared on the surface of carbon nanotubes via chitosan, a biopolymer linkage. Here chitosan act as stabilizing agent for nanoparticles and forms a network on the surface of carbon nanotubes. Synthesized silver nanoparticles-MWCNT hybrid composite is characterized by UV-Visible spectroscopy, XRD analysis, and FESEM with EDS to evaluate the structural and chemical properties of the nanocomposite. The electrocatalytic activity of the fabricated SNP-MWCNT hybrid modified glassy carbon electrode has been evaluated by cyclic voltammetry and electrochemical impedance analysis. The silver nanoparticles are of size ˜35 nm and are well distributed on the surface of carbon nanotubes with chitosan linkage. The prepared nanocomposite shows efficient electrocatalytic properties with high active surface area and excellent electron transfer behaviour.
Nimmakayala, Padma; Abburi, Venkata L.; Saminathan, Thangasamy; Alaparthi, Suresh B.; Almeida, Aldo; Davenport, Brittany; Nadimi, Marjan; Davidson, Joshua; Tonapi, Krittika; Yadav, Lav; Malkaram, Sridhar; Vajja, Gopinath; Hankins, Gerald; Harris, Robert; Park, Minkyu; Choi, Doil; Stommel, John; Reddy, Umesh K.
2016-01-01
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes. PMID:27901114
Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Alaparthi, Suresh B; Almeida, Aldo; Davenport, Brittany; Nadimi, Marjan; Davidson, Joshua; Tonapi, Krittika; Yadav, Lav; Malkaram, Sridhar; Vajja, Gopinath; Hankins, Gerald; Harris, Robert; Park, Minkyu; Choi, Doil; Stommel, John; Reddy, Umesh K
2016-11-30
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes.
Haplotypes of CYP3A4 and their close linkage with CYP3A5 haplotypes in a Japanese population.
Fukushima-Uesaka, Hiromi; Saito, Yoshiro; Watanabe, Hidemi; Shiseki, Kisho; Saeki, Mayumi; Nakamura, Takahiro; Kurose, Kouichi; Sai, Kimie; Komamura, Kazuo; Ueno, Kazuyuki; Kamakura, Shiro; Kitakaze, Masafumi; Hanai, Sotaro; Nakajima, Toshiharu; Matsumoto, Kenji; Saito, Hirohisa; Goto, Yu-ichi; Kimura, Hideo; Katoh, Masaaki; Sugai, Kenji; Minami, Narihiro; Shirao, Kuniaki; Tamura, Tomohide; Yamamoto, Noboru; Minami, Hironobu; Ohtsu, Atsushi; Yoshida, Teruhiko; Saijo, Nagahiro; Kitamura, Yutaka; Kamatani, Naoyuki; Ozawa, Shogo; Sawada, Jun-ichi
2004-01-01
In order to identify single nucleotide polymorphisms (SNPs) and haplotype frequencies of CYP3A4 in a Japanese population, the distal enhancer and proximal promoter regions, all exons, and the surrounding introns were sequenced from genomic DNA of 416 Japanese subjects. We found 24 SNPs, including 17 novel ones: two in the distal enhancer, four in the proximal promoter, one in the 5'-untranslated region (UTR), seven in the introns, and three in the 3'-UTR. The most common SNP was c.1026+12G>A (IVS10+12G>A), with a 0.249 frequency. Four non-synonymous SNPs, c.554C>G (p.T185S, CYP3A4(*)16), c.830_831insA (p.E277fsX8, (*)6), c.878T>C (p.L293P, (*)18), and c.1088 C>T (p.T363M, (*)11) were found with frequencies of 0.014, 0.001, 0.028, and 0.002, respectively. No SNP was found in the known nuclear transcriptional factor-binding sites in the enhancer and promoter regions. Using these 24 SNPs, 16 haplotypes were unambiguously identified, and nine haplotypes were inferred by aid of an expectation-maximization-based program. In addition, using data from 186 subjects enabled a close linkage to be found between CYP3A4 and CYP3A5 SNPs, especially among the SNPs at c.1026+12 in CYP3A4 and c.219-237 (IVS3-237, a key SNP site for CYP3A5(*)3), c.865+77 (IVS9+77) and c.1523 in CYP3A5. This result suggested that CYP3A4 and CYP3A5 are within the same gene block. Haplotype analysis between CYP3A4 and CYP3A5 revealed several major haplotype combinations in the CYP3A4-CYP3A5 block. Our findings provide fundamental and useful information for genotyping CYP3A4 (and CYP3A5) in the Japanese, and probably Asian populations. Copyright 2003 Wiley-Liss, Inc.
Genome-wide linkage and association analysis of cardiometabolic phenotypes in Hispanic Americans.
Hellwege, Jacklyn N; Palmer, Nicholette D; Dimitrov, Latchezar; Keaton, Jacob M; Tabb, Keri L; Sajuthi, Satria; Taylor, Kent D; Ng, Maggie C Y; Speliotes, Elizabeth K; Hawkins, Gregory A; Long, Jirong; Ida Chen, Yii-Der; Lorenzo, Carlos; Norris, Jill M; Rotter, Jerome I; Langefeld, Carl D; Wagenknecht, Lynne E; Bowden, Donald W
2017-02-01
Linkage studies of complex genetic diseases have been largely replaced by genome-wide association studies, due in part to limited success in complex trait discovery. However, recent interest in rare and low-frequency variants motivates re-examination of family-based methods. In this study, we investigated the performance of two-point linkage analysis for over 1.6 million single-nucleotide polymorphisms (SNPs) combined with single variant association analysis to identify high impact variants, which are both strongly linked and associated with cardiometabolic traits in up to 1414 Hispanics from the Insulin Resistance Atherosclerosis Family Study (IRASFS). Evaluation of all 50 phenotypes yielded 83 557 000 LOD (logarithm of the odds) scores, with 9214 LOD scores ⩾3.0, 845 ⩾4.0 and 89 ⩾5.0, with a maximal LOD score of 6.49 (rs12956744 in the LAMA1 gene for tumor necrosis factor-α (TNFα) receptor 2). Twenty-seven variants were associated with P<0.005 as well as having an LOD score >4, including variants in the NFIB gene under a linkage peak with TNFα receptor 2 levels on chromosome 9. Linkage regions of interest included a broad peak (31 Mb) on chromosome 1q with acute insulin response (max LOD=5.37). This region was previously documented with type 2 diabetes in family-based studies, providing support for the validity of these results. Overall, we have demonstrated the utility of two-point linkage and association in comprehensive genome-wide array-based SNP genotypes.
Tillmar, Andreas O; Phillips, Chris
2017-01-01
Advances in massively parallel sequencing technology have enabled the combination of a much-expanded number of DNA markers (notably STRs and SNPs in one or combined multiplexes), with the aim of increasing the weight of evidence in forensic casework. However, when data from multiple loci on the same chromosome are used, genetic linkage can affect the final likelihood calculation. In order to study the effect of linkage for different sets of markers we developed the biostatistical tool ILIR, (Impact of Linkage on forensic markers for Identity and Relationship tests). The ILIR tool can be used to study the overall impact of genetic linkage for an arbitrary set of markers used in forensic testing. Application of ILIR can be useful during marker selection and design of new marker panels, as well as being highly relevant for existing marker sets as a way to properly evaluate the effects of linkage on a case-by-case basis. ILIR, implemented via the open source platform R, includes variation and genomic position reference data for over 40 STRs and 140 SNPs, combined with the ability to include additional forensic markers of interest. The use of the software is demonstrated with examples from several different established marker sets (such as the expanded CODIS core loci) including a review of the interpretation of linked genetic data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Zhang, Long-Chao; Li, Na; Liu, Xin; Liang, Jing; Yan, Hua; Zhao, Ke-Bin; Pu, Lei; Shi, Hui-Bi; Zhang, Yue-Bo; Wang, Li-Gang; Wang, Li-Xian
2014-11-04
In pig, limb bone length influences ham yield and body height to a great extent and has important economic implications for pig industry. In this study, an intercross population was constructed between the indigenous Chinese Minzhu pig breed and the western commercial Large White pig breed to examine the genetic basis for variation in limb bone length. The aim of this study was to detect potential genetic variants associated with porcine limb bone length. A total of 571 F2 individuals from a Large White and Minzhu intercross population were genotyped using the Illumina PorcineSNP60K Beadchip, and phenotyped for femur length (FL), humerus length (HL), hipbone length (HIPL), scapula length (SL), tibia length (TL), and ulna length (UL). A genome-wide association study was performed by applying the previously reported approach of genome-wide rapid association using mixed model and regression. Statistical significance of the associations was based on Bonferroni-corrected P-values. A total of 39 significant SNPs were mapped to a 11.93 Mb long region on pig chromosome 7 (SSC7). Linkage analysis of these significant SNPs revealed three haplotype blocks of 495 kb, 376 kb and 492 kb, respectively, in the 11.93 Mb region. Annotation based on the pig reference genome identified 15 genes that were located near or contained the significant SNPs in these linkage disequilibrium intervals. Conditioned analysis revealed that four SNPs, one on SSC2 and three on SSC4, showed significant associations with SL and HL, respectively. Analysis of the 15 annotated genes that were identified in these three haplotype blocks indicated that HMGA1 and PPARD, which are expressed in limbs and influence chondrocyte cell growth and differentiation, could be considered as relevant biological candidates for limb bone length in pig, with potential applications in breeding programs. Our results may also be useful for the study of the mechanisms that underlie human limb length and body height.
Teixeira, Thallita Monteiro; da Silva, Hugo Delleon; Goveia, Rebeca Mota; Ribolla, Paulo Eduardo Martins; Alonso, Diego Peres; Alves, Alessandro Arruda; Melo E Silva, Daniela; Collevatti, Rosane Garcia; Bicudo, Lucilene Arilho; Bérgamo, Nádia Aparecida; de Paula Silveira-Lacerda, Elisângela
2017-12-01
Worldwide, different studies have reported an association of alcohol-use disorder (AUD) with different types of Single Nucleotide Polymorphisms (SNPs) in the genes for aldehyde dehydrogenase (ALDH) and alcohol dehydrogenase (ADH). In Brazil, there is little information about the occurrence of these SNPs in the AUD population and an absence of studies characterizing the population in the Central-West Region of Brazil. Actually, in Brazil, there are more than 4 million people with AUD. Despite the major health hazards of AUD, information on alcohol consumption and its consequences are not well understood. Therefore, it is extremely important to characterize these SNPs for the better understanding of AUD as a genetic disease in the Brazilian population. The present study, unlike other studies in other countries, is done with a subject population that shows a significant amount of racial homogenization. We evaluated the presence of SNPs in the ADH (ADH1B, ADH1C, and ADH4) and ALDH (ALDH2) genes in alcohol users of Goiânia, State of Goiás - Brazil, and then we established a possible relationship with AUD by allelic and genotypic study. This study was conducted with a population of people with AUD (n = 99) from Goiás Alcohol Dependence Recovery Center (GO CEREA) and Psychosocial Care Center for Alcohol and Drugs (CAPS AD), and with a population of people without AUD as controls (n = 100). DNA was extracted from whole-blood samples and the genotyping was performed using TaqMan ® SNP genotyping assays. For characterization and evaluation of SNPs in the population, genotype frequency, allele frequency, haplotype frequency, Hardy-Weinberg equilibrium, and linkage disequilibrium were analyzed. Statistical analyses were calculated by GENEPOP 4.5 and Haploview software. The allele 1 was considered as "wild" (or *1) and allele 2 as mutant (or *2). Significant differences were found for ADH1B*, ADH4*2, and ALDH2*2 SNPs when the genotype and allele frequencies were analyzed. In addition, four haplotypes were observed between ADH1B*2 and ADH1C*2 through linkage disequilibrium analysis. The genetic variants may be associated with protection against AUD in the population studied. Copyright © 2017 Elsevier Inc. All rights reserved.
Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.
Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A
2006-08-01
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Genome-wide association study of the age of onset of childhood asthma.
Forno, Erick; Lasky-Su, Jessica; Himes, Blanca; Howrylak, Judie; Ramsey, Clare; Brehm, John; Klanderman, Barbara; Ziniti, John; Melén, Erik; Pershagen, Goran; Wickman, Magnus; Martinez, Fernando; Mauger, Dave; Sorkness, Christine; Tantisira, Kelan; Raby, Benjamin A; Weiss, Scott T; Celedón, Juan C
2012-07-01
Childhood asthma is a complex disease with known heritability and phenotypic diversity. Although an earlier onset has been associated with more severe disease, there has been no genome-wide association study of the age of onset of asthma in children. We sought to identify genetic variants associated with earlier onset of childhood asthma. We conducted the first genome-wide association study of the age of onset of childhood asthma among participants in the Childhood Asthma Management Program (CAMP) and used 3 independent cohorts from North America, Costa Rica, and Sweden for replication. Two single nucleotide polymorphisms (SNPs) were associated with earlier onset of asthma in the combined analysis of CAMP and the replication cohorts: rs9815663 (Fisher P= 2.31 × 10(-8)) and rs7927044 (P= 6.54 × 10(-9)). Of these 2 SNPs, rs9815663 was also significantly associated with earlier asthma onset in an analysis including only the replication cohorts. Ten SNPs in linkage disequilibrium with rs9815663 were also associated with earlier asthma onset (2.24 × 10(-7)
Mantello, Camila Campos; Cardoso-Silva, Claudio Benicio; da Silva, Carla Cristina; de Souza, Livia Moura; Scaloppi Junior, Erivaldo José; de Souza Gonçalves, Paulo; Vicentini, Renato; de Souza, Anete Pereira
2014-01-01
Hevea brasiliensis (Willd. Ex Adr. Juss.) Muell.-Arg. is the primary source of natural rubber that is native to the Amazon rainforest. The singular properties of natural rubber make it superior to and competitive with synthetic rubber for use in several applications. Here, we performed RNA sequencing (RNA-seq) of H. brasiliensis bark on the Illumina GAIIx platform, which generated 179,326,804 raw reads on the Illumina GAIIx platform. A total of 50,384 contigs that were over 400 bp in size were obtained and subjected to further analyses. A similarity search against the non-redundant (nr) protein database returned 32,018 (63%) positive BLASTx hits. The transcriptome analysis was annotated using the clusters of orthologous groups (COG), gene ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Pfam databases. A search for putative molecular marker was performed to identify simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). In total, 17,927 SSRs and 404,114 SNPs were detected. Finally, we selected sequences that were identified as belonging to the mevalonate (MVA) and 2-C-methyl-D-erythritol 4-phosphate (MEP) pathways, which are involved in rubber biosynthesis, to validate the SNP markers. A total of 78 SNPs were validated in 36 genotypes of H. brasiliensis. This new dataset represents a powerful information source for rubber tree bark genes and will be an important tool for the development of microsatellites and SNP markers for use in future genetic analyses such as genetic linkage mapping, quantitative trait loci identification, investigations of linkage disequilibrium and marker-assisted selection. PMID:25048025
Pombar-Gomez, Maria; Lopez-Lopez, Elixabet; Martin-Guerrero, Idoia; Garcia-Orad Carles, Africa; de Pancorbo, Marian M
2015-05-01
Single nucleotide polymorphisms (SNPs) are an interesting option to facilitate the analysis of highly degraded DNA by allowing the reduction of the size of the DNA amplicons. The SNPforID 52-plex panel is a clear example of the use of non-coding SNPs in forensic genetics. However, nonstop advances in studies of genetic polymorphisms are leading to the discovery of new associations between SNPs and diseases. The aim of this study was to perform a comprehensive review of the state of association between the 52 SNPs in the 52-plex panel and diseases or other traits related to their treatment, such as drug response characters. In order to achieve this goal, we have conducted a bioinformatic search for each SNP included in the panel and the SNPs in linkage disequilibrium (LD) with them in the European population (r (2) > 0.8). A total of 424 SNPs (52 in the panel and 372 in LD) were investigated in PubMed, Scopus, and dbSNP databases. Our results show that three SNPs in the SNPforID 52-plex panel (rs2107612, rs1979255, rs1463729) have been associated with diseases such as hypertension or macular degeneration, as well as drug response. Similarly, three out of the 372 SNPs in LD (rs2107614, r (2) = 0.859; rs765250, r (2) = 0.858; rs11064560, r (2) = 0,887) are also associated with various pathologies. In view of these results, we propose the need for a periodic review of the SNPs used in forensic genetics in order to keep their associations with diseases or related phenotypes updated and to evaluate their continuity in forensic panels for avoiding legal and ethical conflicts.
Hierarchical Naive Bayes for genetic association studies.
Malovini, Alberto; Barbarini, Nicola; Bellazzi, Riccardo; de Michelis, Francesca
2012-01-01
Genome Wide Association Studies represent powerful approaches that aim at disentangling the genetic and molecular mechanisms underlying complex traits. The usual "one-SNP-at-the-time" testing strategy cannot capture the multi-factorial nature of this kind of disorders. We propose a Hierarchical Naïve Bayes classification model for taking into account associations in SNPs data characterized by Linkage Disequilibrium. Validation shows that our model reaches classification performances superior to those obtained by the standard Naïve Bayes classifier for simulated and real datasets. In the Hierarchical Naïve Bayes implemented, the SNPs mapping to the same region of Linkage Disequilibrium are considered as "details" or "replicates" of the locus, each contributing to the overall effect of the region on the phenotype. A latent variable for each block, which models the "population" of correlated SNPs, can be then used to summarize the available information. The classification is thus performed relying on the latent variables conditional probability distributions and on the SNPs data available. The developed methodology has been tested on simulated datasets, each composed by 300 cases, 300 controls and a variable number of SNPs. Our approach has been also applied to two real datasets on the genetic bases of Type 1 Diabetes and Type 2 Diabetes generated by the Wellcome Trust Case Control Consortium. The approach proposed in this paper, called Hierarchical Naïve Bayes, allows dealing with classification of examples for which genetic information of structurally correlated SNPs are available. It improves the Naïve Bayes performances by properly handling the within-loci variability.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Satyanarayana, M.; Kumar, V. Sunil; Gobi, K. Vengatajalabathy, E-mail: drkvgobi@gmail.com, E-mail: satyam.nitw@gmail.com
In this research, silver nanoparticles (SNPs) are prepared on the surface of carbon nanotubes via chitosan, a biopolymer linkage. Here chitosan act as stabilizing agent for nanoparticles and forms a network on the surface of carbon nanotubes. Synthesized silver nanoparticles-MWCNT hybrid composite is characterized by UV-Visible spectroscopy, XRD analysis, and FESEM with EDS to evaluate the structural and chemical properties of the nanocomposite. The electrocatalytic activity of the fabricated SNP-MWCNT hybrid modified glassy carbon electrode has been evaluated by cyclic voltammetry and electrochemical impedance analysis. The silver nanoparticles are of size ∼35 nm and are well distributed on the surface ofmore » carbon nanotubes with chitosan linkage. The prepared nanocomposite shows efficient electrocatalytic properties with high active surface area and excellent electron transfer behaviour.« less
Hewett, Duncan; Samuelsson, Lena; Polding, Joanne; Enlund, Fredrik; Smart, Devi; Cantone, Kathryn; See, Chee Gee; Chadha, Sapna; Inerot, Annica; Enerback, Charlotta; Montgomery, Doug; Christodolou, Chris; Robinson, Phil; Matthews, Paul; Plumpton, Mary; Wahlstrom, Jan; Swanbeck, Gunnar; Martinsson, Tommy; Roses, Allen; Riley, John; Purvis, Ian
2002-03-01
Psoriasis is a chronic inflammatory disease of the skin with both genetic and environmental risk factors. Here we describe the creation of a single-nucleotide polymorphism (SNP) map spanning 900-1200 kb of chromosome 3q21, which had been previously recognized as containing a psoriasis susceptibility locus, PSORS5. We genotyped 644 individuals, from 195 Swedish psoriatic families, for 19 polymorphisms. Linkage disequilibrium (LD) between marker and disease was assessed using the transmission/disequilibrium test (TDT). In the TDT analysis, alleles of three of these SNPs showed significant association with disease (P<0.05). A 160-kb interval encompassing these three SNPs was sequenced, and a coding sequence consisting of 13 exons was identified. The predicted protein shares 30-40% homology with the family of cation/chloride cotransporters. A five-marker haplotype spanning the 3' half of this gene is associated with psoriasis to a P value of 3.8<10(-5). We have called this gene SLC12A8, coding for a member of the solute carrier family 12 proteins. It belongs to a class of genes that were previously unrecognized as playing a role in psoriasis pathogenesis.
Bertolini, F; Galimberti, G; Schiavo, G; Mastrangelo, S; Di Gerlando, R; Strillacci, M G; Bagnato, A; Portolano, B; Fontanesi, L
2018-01-01
Commercial single nucleotide polymorphism (SNP) arrays have been recently developed for several species and can be used to identify informative markers to differentiate breeds or populations for several downstream applications. To identify the most discriminating genetic markers among thousands of genotyped SNPs, a few statistical approaches have been proposed. In this work, we compared several methods of SNPs preselection (Delta, F st and principal component analyses (PCA)) in addition to Random Forest classifications to analyse SNP data from six dairy cattle breeds, including cosmopolitan (Holstein, Brown and Simmental) and autochthonous Italian breeds raised in two different regions and subjected to limited or no breeding programmes (Cinisara, Modicana, raised only in Sicily and Reggiana, raised only in Emilia Romagna). From these classifications, two panels of 96 and 48 SNPs that contain the most discriminant SNPs were created for each preselection method. These panels were evaluated in terms of the ability to discriminate as a whole and breed-by-breed, as well as linkage disequilibrium within each panel. The obtained results showed that for the 48-SNP panel, the error rate increased mainly for autochthonous breeds, probably as a consequence of their admixed origin lower selection pressure and by ascertaining bias in the construction of the SNP chip. The 96-SNP panels were generally more able to discriminate all breeds. The panel derived by PCA-chrom (obtained by a preselection chromosome by chromosome) could identify informative SNPs that were particularly useful for the assignment of minor breeds that reached the lowest value of Out Of Bag error even in the Cinisara, whose value was quite high in all other panels. Moreover, this panel contained also the lowest number of SNPs in linkage disequilibrium. Several selected SNPs are located nearby genes affecting breed-specific phenotypic traits (coat colour and stature) or associated with production traits. In general, our results demonstrated the usefulness of Random Forest in combination to other reduction techniques to identify population informative SNPs.
2013-01-01
Background Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs. Results In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for selection of resistant cultivars. Comparison of sequences underpinning these SNP markers to the M. truncatula genome defined genomic regions containing candidate genes associated with saline stress tolerance. Conclusion The SNP assays and associated genetic linkage maps developed in this study permitted identification of salinity tolerance QTLs and candidate genes. This constitutes an important set of tools for marker-assisted selection (MAS) programs aimed at performance enhancement of field pea cultivars. PMID:24134188
Leonforte, Antonio; Sudheesh, Shimna; Cogan, Noel O I; Salisbury, Philip A; Nicolas, Marc E; Materne, Michael; Forster, John W; Kaur, Sukhjiwan
2013-10-17
Field pea (Pisum sativum L.) is a self-pollinating, diploid, cool-season food legume. Crop production is constrained by multiple biotic and abiotic stress factors, including salinity, that cause reduced growth and yield. Recent advances in genomics have permitted the development of low-cost high-throughput genotyping systems, allowing the construction of saturated genetic linkage maps for identification of quantitative trait loci (QTLs) associated with traits of interest. Genetic markers in close linkage with the relevant genomic regions may then be implemented in varietal improvement programs. In this study, single nucleotide polymorphism (SNP) markers associated with expressed sequence tags (ESTs) were developed and used to generate comprehensive linkage maps for field pea. From a set of 36,188 variant nucleotide positions detected through in silico analysis, 768 were selected for genotyping of a recombinant inbred line (RIL) population. A total of 705 SNPs (91.7%) successfully detected segregating polymorphisms. In addition to SNPs, genomic and EST-derived simple sequence repeats (SSRs) were assigned to the genetic map in order to obtain an evenly distributed genome-wide coverage. Sequences associated with the mapped molecular markers were used for comparative genomic analysis with other legume species. Higher levels of conserved synteny were observed with the genomes of Medicago truncatula Gaertn. and chickpea (Cicer arietinum L.) than with soybean (Glycine max [L.] Merr.), Lotus japonicus L. and pigeon pea (Cajanus cajan [L.] Millsp.). Parents and RIL progeny were screened at the seedling growth stage for responses to salinity stress, imposed by addition of NaCl in the watering solution at a concentration of 18 dS m-1. Salinity-induced symptoms showed normal distribution, and the severity of the symptoms increased over time. QTLs for salinity tolerance were identified on linkage groups Ps III and VII, with flanking SNP markers suitable for selection of resistant cultivars. Comparison of sequences underpinning these SNP markers to the M. truncatula genome defined genomic regions containing candidate genes associated with saline stress tolerance. The SNP assays and associated genetic linkage maps developed in this study permitted identification of salinity tolerance QTLs and candidate genes. This constitutes an important set of tools for marker-assisted selection (MAS) programs aimed at performance enhancement of field pea cultivars.
Ichikawa, Shoji; Koller, Daniel L; Curry, Leah R; Lai, Dongbing; Xuei, Xiaoling; Pugh, Elizabeth W; Tsai, Ya-Yu; Doheny, Kimberly F; Edenberg, Howard J; Hui, Siu L; Foroud, Tatiana; Peacock, Munro; Econs, Michael J
2008-01-01
Osteoporosis is a complex disease with both genetic and environmental risk factors. A major determinant of osteoporotic fractures is peak BMD obtained during young adulthood. We previously reported linkage of chromosome 1q (LOD = 4.3) with variation in spinal areal BMD in healthy premenopausal white women. In this study, we used a two-stage genotyping approach to identify genes in the linked region that contributed to the variation of femoral neck and lumbar spine areal BMD. In the first stage, 654 SNPs across the linked region were genotyped in a sample of 1309 premenopausal white women. The most significant evidence of association for lumbar spine (p = 1.3 × 10−6) was found with rs1127091 in the GATAD2B gene. In the second stage, 52 SNPs around this candidate gene were genotyped in an expanded sample of 1692 white women. Significant evidence of association with spinal BMD (p < 10−5), and to a lesser extent with femoral neck BMD, was observed with eight SNPs within a single 230-kb linkage disequilibrium (LD) block. The most significant SNP (p = 3.4 × 10−7) accounted for >2.5% of the variation in spinal BMD in these women. The 230-kb LD block contains 11 genes, but because of the extensive LD, the specific gene(s) contributing to the variation in BMD could not be determined. In conclusion, the significant association between spinal BMD and SNPs in the 230-kb LD block in chromosome 1q indicates that genetic factor(s) in this block plays an important role in peak spinal BMD in healthy premenopausal white women. PMID:18505370
Oikkonen, J.; Huang, Y.; Onkamo, P.; Ukkola-Vuoti, L.; Raijas, P.; Karma, K.; Vieland, V. J.; Järvelä, I.
2014-01-01
Humans have developed the perception, production and processing of sounds into the art of music. A genetic contribution to these skills of musical aptitude has long been suggested. We performed a genome-wide scan in 76 pedigrees (767 individuals) characterized for the ability to discriminate pitch (SP), duration (ST) and sound patterns (KMT), which are primary capacities for music perception. Using the Bayesian linkage and association approach implemented in program package KELVIN, especially designed for complex pedigrees, several SNPs near genes affecting the functions of the auditory pathway and neurocognitive processes were identified. The strongest association was found at 3q21.3 (rs9854612) with combined SP, ST and KMT test scores (COMB). This region is located a few dozen kilobases upstream of the GATA binding protein 2 (GATA2) gene. GATA2 regulates the development of cochlear hair cells and the inferior colliculus (IC), which are important in tonotopic mapping. The highest probability of linkage was obtained for phenotype SP at 4p14, located next to the region harboring the protocadherin 7 gene, PCDH7. Two SNPs rs13146789 and rs13109270 of PCDH7 showed strong association. PCDH7 has been suggested to play a role in cochlear and amygdaloid complexes. Functional class analysis showed that inner ear and schizophrenia related genes were enriched inside the linked regions. This study is the first to show the importance of auditory pathway genes in musical aptitude. PMID:24614497
Kujur, Alice; Upadhyaya, Hari D.; Shree, Tanima; Bajaj, Deepak; Das, Shouvik; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We discovered 26785 and 16573 high-quality SNPs differentiating two parental genotypes of a RIL mapping population using reference desi and kabuli genome-based GBS assay. Of these, 3625 and 2177 SNPs have been integrated into eight desi and kabuli chromosomes, respectively in order to construct ultra-high density (0.20–0.37 cM) intra-specific chickpea genetic linkage maps. One of these constructed high-resolution genetic map has potential to identify 33 major genomic regions harbouring 35 robust QTLs (PVE: 17.9–39.7%) associated with three agronomic traits, which were mapped within <1 cM mean marker intervals on desi chromosomes. The extended LD (linkage disequilibrium) decay (~15 cM) in chromosomes of genetic maps have encouraged us to use a rapid integrated approach (comparative QTL mapping, QTL-region specific haplotype/LD-based trait association analysis, expression profiling and gene haplotype-based association mapping) rather than a traditional QTL map-based cloning method to narrow-down one major seed weight (SW) robust QTL region. It delineated favourable natural allelic variants and superior haplotype-containing one seed-specific candidate embryo defective gene regulating SW in chickpea. The ultra-high-resolution genetic maps, QTLs/genes and alleles/haplotypes-related genomic information generated and integrated strategy for rapid QTL/gene identification developed have potential to expedite genomics-assisted breeding applications in crop plants, including chickpea for their genetic enhancement. PMID:25942004
Oikkonen, J; Huang, Y; Onkamo, P; Ukkola-Vuoti, L; Raijas, P; Karma, K; Vieland, V J; Järvelä, I
2015-02-01
Humans have developed the perception, production and processing of sounds into the art of music. A genetic contribution to these skills of musical aptitude has long been suggested. We performed a genome-wide scan in 76 pedigrees (767 individuals) characterized for the ability to discriminate pitch (SP), duration (ST) and sound patterns (KMT), which are primary capacities for music perception. Using the Bayesian linkage and association approach implemented in program package KELVIN, especially designed for complex pedigrees, several single nucleotide polymorphisms (SNPs) near genes affecting the functions of the auditory pathway and neurocognitive processes were identified. The strongest association was found at 3q21.3 (rs9854612) with combined SP, ST and KMT test scores (COMB). This region is located a few dozen kilobases upstream of the GATA binding protein 2 (GATA2) gene. GATA2 regulates the development of cochlear hair cells and the inferior colliculus (IC), which are important in tonotopic mapping. The highest probability of linkage was obtained for phenotype SP at 4p14, located next to the region harboring the protocadherin 7 gene, PCDH7. Two SNPs rs13146789 and rs13109270 of PCDH7 showed strong association. PCDH7 has been suggested to play a role in cochlear and amygdaloid complexes. Functional class analysis showed that inner ear and schizophrenia-related genes were enriched inside the linked regions. This study is the first to show the importance of auditory pathway genes in musical aptitude.
Qi, Jie; Liu, Xudong; Liu, Jinxiang; Yu, Haiyang; Wang, Wenji; Wang, Zhigang; Zhang, Quanqi
2014-08-01
Ambient temperature is one of the major abiotic environmental factors determining the main parameters of fish vital activity. HSP70 plays an essential role in heat response. In this investigation, the promoter and structure of Paralichthys olivaceus hsp70 (Pohsp70) gene was cloned and predicted. 2558 bp upstream regulatory region of Pohsp70 was annotated with four potential promoter elements and four putative binding sites of transcription factors heat shock elements (HSE, nGAAn) in the upstream of the transcription start site. In addition, one intron with 454 bp in the 5'-noncoding region was found. Quantitative Real Time PCR analysis indicated that the transcript level of Pohsp70 was raised markedly after 1 h by heat shocked. Furthermore, 25 SNPs were identified in Pohsp70 by resequencing, seven of which was associated with heat resistance. In addition, two of the seven SNPs, namely SNP14 and SNP16, were observed in strong linkage disequilibrium. The haplotype with association analysis showed TAGGAG haplotype was more represented in heat susceptible group while (DEL/T) GAATA haplotype was more frequent in heat resistant group. The heat resistant SNPs and haplotype could be candidate markers potentially serving for selective breeding programs of Japanese flounder aimed at improving anti-stress and production. Copyright © 2014 Elsevier Ltd. All rights reserved.
Liu, Yong-Jun; Rocha-Sanchez, Sonia M S; Liu, Peng-Yuan; Long, Ji-Rong; Lu, Yan; Elze, Leo; Recker, Robert R; Deng, Hong-Wen
2004-04-13
Genetic variations in the leptin receptor (LEPR) gene have been conceived to affect body weight in general populations. In this study, using the tests implemented in the statistical package QTDT, we evaluated association and/or linkage of the LEPR gene with obesity phenotypes in a large sample comprising 1,873 subjects from 405 Caucasian nuclear families. Obesity phenotypes tested include body mass index (BMI), fat mass, percentage fat mass (PFM), and lean mass, with the latter three measured by dual-energy X-ray absorptiometry (DXA). Three single nucleotide polymorphisms (SNPs), namely Lys109Arg (A/G), Lys656Asn (G/C), Pro1019Pro (G/A), in the LEPR gene were analyzed. Significant linkage disequilibrium (0.394 < or = |D'| < or = 0.688, P < 0.001) was observed between pairs of the three SNPs. No significant population stratification was found for any SNP/phenotype. In single-locus analyses, evidence of association was observed for Lys656Asn with lean mass (P = 0.002) and fat mass (P = 0.015). The contribution of this polymorphism to the phenotypic variation of lean mass and fat mass was 2.63% and 1.15%, respectively. Subjects carrying allele G at the Lys656Asn site had, on average, 3.16% higher lean mass and 2.71% higher fat mass than those without it. In the analyses for haplotypes defined by the three SNPs, significant associations were detected between haplotype GCA (P = 0.005) and lean mass. In addition, marginally significant evidence of association was observed for this haplotype with fat mass (P = 0.012). No statistically significant linkage was found, largely due to the limited power of the linkage approach to detect small genetic effects in our data sets. Our results suggest that the LEPR gene polymorphisms contribute to variation in obesity phenotypes.
Levran, Orna; Randesi, Matthew; Peles, Einat; Correa da Rosa, Joel; Ott, Jurg; Rotrosen, John; Adelson, Miriam; Kreek, Mary Jeanne
2016-06-01
This study was designed to determine whether polymorphisms in acetylcholine receptors contribute to opioid dependence and/or cocaine dependence. The sample (n = 1860) was divided by drug and ancestry, and 55 polymorphisms (nine genes) were analyzed. Of the 20 SNPs that showed nominally significant associations, the association of the African-specific CHRM4 SNP rs2229163 (Asn417=) with cocaine dependence survived correction for multiple testing (Pcorrected = 0.047). CHRM4 is located in a region of strong linkage disequilibrium on chromosome 11 that includes genes associated with schizophrenia. CHRM4 SNP rs2229163 is in strong linkage disequilibrium with several African-specific SNPs in DGKZ and AMBRA1. Cholinergic receptors' variants may contribute to drug addiction and have a potential role as pharmacogenetic markers.
Michailidou, S; Tsangaris, G; Fthenakis, G C; Tzora, A; Skoufos, I; Karkabounas, S C; Banos, G; Argiriou, A; Arsenos, G
2018-06-01
In the present study, genome-wide genotyping was applied to characterize the genetic diversity and population structure of three autochthonous Greek breeds: Boutsko, Karagouniko and Chios. Dairy sheep are among the most significant livestock species in Greece numbering approximately 9 million animals which are characterized by large phenotypic variation and reared under various farming systems. A total of 96 animals were genotyped with the Illumina's OvineSNP50K microarray beadchip, to study the population structure of the breeds and develop a specialized panel of single-nucleotide polymorphisms (SNPs), which could distinguish one breed from the others. Quality control on the dataset resulted in 46,125 SNPs, which were used to evaluate the genetic structure of the breeds. Population structure was assessed through principal component analysis (PCA) and admixture analysis, whereas inbreeding was estimated based on runs of homozygosity (ROHs) coefficients, genomic relationship matrix inbreeding coefficients (F GRM ) and patterns of linkage disequilibrium (LD). Associations between SNPs and breeds were analyzed with different inheritance models, to identify SNPs that distinguish among the breeds. Results showed high levels of genetic heterogeneity in the three breeds. Genetic distances among breeds were modest, despite their different ancestries. Chios and Karagouniko breeds were more genetically related to each other compared to Boutsko. Analysis revealed 3802 candidate SNPs that can be used to identify two-breed crosses and purebred animals. The present study provides, for the first time, data on the genetic background of three Greek indigenous dairy sheep breeds as well as a specialized marker panel that can be applied for traceability purposes as well as targeted genetic improvement schemes and conservation programs.
Jannink, Jean-Luc
2010-01-01
Genome-wide association studies (GWAS) may benefit from utilizing haplotype information for making marker-phenotype associations. Several rationales for grouping single nucleotide polymorphisms (SNPs) into haplotype blocks exist, but any advantage may depend on such factors as genetic architecture of traits, patterns of linkage disequilibrium in the study population, and marker density. The objective of this study was to explore the utility of haplotypes for GWAS in barley (Hordeum vulgare) to offer a first detailed look at this approach for identifying agronomically important genes in crops. To accomplish this, we used genotype and phenotype data from the Barley Coordinated Agricultural Project and constructed haplotypes using three different methods. Marker-trait associations were tested by the efficient mixed-model association algorithm (EMMA). When QTL were simulated using single SNPs dropped from the marker dataset, a simple sliding window performed as well or better than single SNPs or the more sophisticated methods of blocking SNPs into haplotypes. Moreover, the haplotype analyses performed better 1) when QTL were simulated as polymorphisms that arose subsequent to marker variants, and 2) in analysis of empirical heading date data. These results demonstrate that the information content of haplotypes is dependent on the particular mutational and recombinational history of the QTL and nearby markers. Analysis of the empirical data also confirmed our intuition that the distribution of QTL alleles in nature is often unlike the distribution of marker variants, and hence utilizing haplotype information could capture associations that would elude single SNPs. We recommend routine use of both single SNP and haplotype markers for GWAS to take advantage of the full information content of the genotype data. PMID:21124933
Genome-wide association study of alcohol dependence
Treutlein, Jens; Cichon, Sven; Ridinger, Monika; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Moessner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Fehr, Christoph; Scherbaum, Norbert; Steffens, Michael; Ludwig, Kerstin U.; Frank, Josef; Wichmann, H.- Erich; Schreiber, Stefan; Dragano, Nico; Sommer, Wolfgang; Leonardi-Essmann, Fernando; Lourdusamy, Anbarasu; Gebicke-Haerter, Peter; Wienker, Thomas F.; Sullivan, Patrick F.; Nöthen, Markus M.; Kiefer, Falk; Spanagel, Rainer; Mann, Karl; Rietschel, Marcella
2014-01-01
Context Identification of genes contributing to alcohol dependence will improve our understanding of the mechanisms underlying this disorder. Objective To identify susceptibility genes for alcohol dependence through a genome-wide association study (GWAS) and follow-up study in a population of German male inpatients with an early age at onset. Design The GWAS included 487 male inpatients with DSM-IV alcohol dependence with an age at onset below 28 years and 1,358 population based control individuals. The follow-up study included 1,024 male inpatients and 996 age-matched male controls. All subjects were of German descent. The GWAS tested 524,396 single nucleotide polymorphisms (SNPs). All SNPs with p<10-4 were subjected to the follow-up study. In addition, nominally significant SNPs from those genes that had also shown expression changes in rat brains after chronic alcohol consumption were selected for the follow-up step. Results The GWAS produced 121 SNPs with nominal p<10-4. These, together with 19 additional SNPs from homologs of rat genes showing differential expression, were genotyped in the follow-up sample. Fifteen SNPs showed significant association with the same allele as in the GWAS. In the combined analysis, two closely linked intergenic SNPs met genome-wide significance (rs7590720 p=9.72×10-9; rs1344694 p=1.69×10-8). They are located on chromosome 2q35, a region which has been implicated in linkage studies for alcohol phenotypes. Nine SNPs were located in genes, including CDH13 and ADH1C genes which have been reported to be associated with alcohol dependence. Conclusion This is the first GWAS and follow-up study to identify a genome-wide significant association in alcohol dependence. Further independent studies are required to confirm these findings. PMID:19581569
A Comprehensive Linkage Map of the Dog Genome
Wong, Aaron K.; Ruhe, Alison L.; Dumont, Beth L.; Robertson, Kathryn R.; Guerrero, Giovanna; Shull, Sheila M.; Ziegle, Janet S.; Millon, Lee V.; Broman, Karl W.; Payseur, Bret A.; Neff, Mark W.
2010-01-01
We have leveraged the reference sequence of a boxer to construct the first complete linkage map for the domestic dog. The new map improves access to the dog's unique biology, from human disease counterparts to fascinating evolutionary adaptations. The map was constructed with ∼3000 microsatellite markers developed from the reference sequence. Familial resources afforded 450 mostly phase-known meioses for map assembly. The genotype data supported a framework map with ∼1500 loci. An additional ∼1500 markers served as map validators, contributing modestly to estimates of recombination rate but supporting the framework content. Data from ∼22,000 SNPs informing on a subset of meioses supported map integrity. The sex-averaged map extended 21 M and revealed marked region- and sex-specific differences in recombination rate. The map will enable empiric coverage estimates and multipoint linkage analysis. Knowledge of the variation in recombination rate will also inform on genomewide patterns of linkage disequilibrium (LD), and thus benefit association, selective sweep, and phylogenetic mapping approaches. The computational and wet-bench strategies can be applied to the reference genome of any nonmodel organism to assemble a de novo linkage map. PMID:19966068
Analysis of genetic polymorphisms in skeletal Class I crowding.
Ting, Tung Yuen; Wong, Ricky Wing Kit; Rabie, A Bakr M
2011-07-01
Dental crowding is a problem for both adolescents and adults in modern society. The purpose of this research was to identify single nucleotide polymorphisms (SNPs) responsible for crowding in subjects with skeletal Class I relationships. The case subjects consisted of healthy Chinese people living in Hong Kong with skeletal Class I relationships and at least 5 mm of crowding in either arch. The control subjects met the same requirements but lacked crowding or spacing. SNP genotyping was performed on the MassARRAY platform. The chi-square test was used to compare genotype and allele type distributions between the case and the control groups. Logistic regression was used to calculate odds ratios with 95% confidence intervals, and the effects of age and sex for each SNP. Analyses of linkage disequilibrium and haplotype associations between SNPs were performed with software. Five SNPs were found to be significantly different in genotype or allele type distributions. SNP rs372024 was significantly associated with crowding (P = 0.004). Two SNPs, rs3764746 and rs3795170, on the EDA gene were found to be associated marginally. SNPs rs1005464 and rs15705 also exhibited marginal association with crowding. The effects of associated SNPs remained significant after adjustments for age and sex factors. This study suggests an association for the genes EDA and XEDAR in dental crowding in the Hong Kong Chinese population. Copyright © 2011 American Association of Orthodontists. Published by Mosby, Inc. All rights reserved.
Association of the FGA and SLC6A4 genes with autistic spectrum disorder in a Korean population.
Ro, Myungja; Won, Seongsik; Kang, Hyunjun; Kim, Su-Yeon; Lee, Seung Ku; Nam, Min; Bang, Hee Jung; Yang, Jae Won; Choi, Kyung-Sik; Kim, Su Kang; Chung, Joo-Ho; Kwack, Kyubum
2013-01-01
Autism spectrum disorder (ASD) is a neurobiological disorder characterized by distinctive impairments in cognitive function, language, and behavior. Linkage and population studies suggest a genetic association between solute carrier family 6 member 4 (SLC6A4) variants and ASD. Logistic regression was used to identify associations between single-nucleotide polymorphisms (SNPs) and ASD with 3 alternative models (additive, dominant, and recessive). Linear regression analysis was performed to determine the influence of SNPs on Childhood Autism Rating Scale (CARS) scores as a quantitative phenotype. In the present study, we examined the associations of SNPs in the SLC6A4 gene and the fibrinogen alpha chain (FGA) gene. Logistic regression analysis showed a significant association between the risk of ASD and rs2070025 and rs2070011 in the FGA gene. The gene-gene interaction between SLC6A4 and FGA was not significantly associated with ASD susceptibility. However, polymorphisms in both SLC6A4 and the FGA gene significantly affected the symptoms of ASD. Our findings indicate that FGA and SLC6A4 gene interactions may contribute to the phenotypes of ASD rather than the incidence of ASD. © 2013 S. Karger AG, Basel.
Perttilä, Julia; Merikanto, Krista; Naukkarinen, Jussi; Surakka, Ida; Martin, Nicolas W; Tanhuanpää, Kimmo; Grimard, Vinciane; Taskinen, Marja-Riitta; Thiele, Christoph; Salomaa, Veikko; Jula, Antti; Perola, Markus; Virtanen, Ismo; Peltonen, Leena; Olkkonen, Vesa M
2009-08-01
Analysis of variants in three genes encoding oxysterol-binding protein (OSBP) homologues (OSBPL2, OSBPL9, OSBPL10) in Finnish families with familial low high-density lipoprotein (HDL) levels (N = 426) or familial combined hyperlipidemia (N = 684) revealed suggestive linkage of OSBPL10 single-nucleotide polymorphisms (SNPs) with extreme end high triglyceride (TG; >90th percentile) trait. Prompted by this initial finding, we carried out association analysis in a metabolic syndrome subcohort (Genmets) of Health2000 examination survey (N = 2,138), revealing association of multiple OSBPL10 SNPs with high serum TG levels (>95th percentile). To investigate whether OSBPL10 could be the gene underlying the observed linkage and association, we carried out functional experiments in the human hepatoma cell line Huh7. Silencing of OSBPL10 increased the incorporation of [(3)H]acetate into cholesterol and both [(3)H]acetate and [(3)H]oleate into triglycerides and enhanced the accumulation of secreted apolipoprotein B100 in growth medium, suggesting that the encoded protein ORP10 suppresses hepatic lipogenesis and very-low-density lipoprotein production. ORP10 was shown to associate dynamically with microtubules, consistent with its involvement in intracellular transport or organelle positioning. The data introduces OSBPL10 as a gene whose variation may contribute to high triglyceride levels in dyslipidemic Finnish subjects and provides evidence for ORP10 as a regulator of cellular lipid metabolism.
An investigation of obesity susceptibility genes in Northern Han Chinese by targeted resequencing.
Wu, Yili; Wang, Weijing; Jiang, Wenjie; Yao, Jie; Zhang, Dongfeng
2017-02-01
Our earlier genome-wide linkage study of body mass index (BMI) showed strong signals from 7q36.3 and 8q21.13. This case-control study set to investigate 2 genomic regions which may harbor variants contributed to development of obesity.We employed targeted resequencing technology to detect single nucleotide polymorphisms (SNPs) in 7q36.3 and 8q21.13 from 16 individuals with obesity. These were compared with 504 East Asians in the 1000 Genomes Project as a reference panel. Linkage disequilibrium (LD) block analysis was performed for the significant SNPs located near the same gene. Genes involved in statistically significant loci were then subject to gene set enrichment analysis (GSEA).The 16 individuals aged between 30 and 60 years with BMI = 33.25 ± 2.22 kg/m. A total of 12,131 genetic variants across all of samples were found. After correcting for multiple testing, 65 SNPs from 25 nearest genes (INSIG1, FABP5, PTPRN2, VIPR2, WDR60, SHH, UBE3C, LMBR1, PAG1, IMPA1, CHMP4, SNX16, BLACE, EN2, CNPY1, LOC100506302, RBM33, LOC389602, LOC285889, LINC01006, NOM1, DNAJB6, LOC101927914, ESYT2, LINC00689) were associated with obesity at significant level q-value ≤ 0.05. LD block analysis showed there were 10 pairs of loci with D' ≥ 0.8 and r ≥ 0.8. GSEA further identified 2 major related gene sets, involving lipid raft and lipid metabolic process, with FDR values <0.12 and <0.4, respectively.Our data are the first documentation of genetic variants in 7q36.3 and 8q21.13 associated with obesity using target capture sequencing and Northern Han Chinese samples. Additional replication and functional studies are merited to validate our findings.
Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe
2010-01-01
Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Ho, Daniel W. H.; Yap, Maurice K. H.; Ng, Po Wah; Fung, Wai Yan; Yip, Shea Ping
2012-01-01
Background Myopia is the most common ocular disorder worldwide and imposes tremendous burden on the society. It is a complex disease. The MYP6 locus at 22 q12 is of particular interest because many studies have detected linkage signals at this interval. The MYP6 locus is likely to contain susceptibility gene(s) for myopia, but none has yet been identified. Methodology/Principal Findings Two independent subject groups of southern Chinese in Hong Kong participated in the study an initial study using a discovery sample set of 342 cases and 342 controls, and a follow-up study using a replication sample set of 316 cases and 313 controls. Cases with high myopia were defined by spherical equivalent ≤ -8 dioptres and emmetropic controls by spherical equivalent within ±1.00 dioptre for both eyes. Manual candidate gene selection from the MYP6 locus was supported by objective in silico prioritization. DNA samples of discovery sample set were genotyped for 178 tagging single nucleotide polymorphisms (SNPs) from 26 genes. For replication, 25 SNPs (tagging or located at predicted transcription factor or microRNA binding sites) from 4 genes were subsequently examined using the replication sample set. Fisher P value was calculated for all SNPs and overall association results were summarized by meta-analysis. Based on initial and replication studies, rs2009066 located in the crystallin beta A4 (CRYBA4) gene was identified to be the most significantly associated with high myopia (initial study: P = 0.02; replication study: P = 1.88e-4; meta-analysis: P = 1.54e-5) among all the SNPs tested. The association result survived correction for multiple comparisons. Under the allelic genetic model for the combined sample set, the odds ratio of the minor allele G was 1.41 (95% confidence intervals, 1.21-1.64). Conclusions/Significance A novel susceptibility gene (CRYBA4) was discovered for high myopia. Our study also signified the potential importance of appropriate gene prioritization in candidate selection. PMID:22792142
Liao, R; Zhang, X; Chen, Q; Wang, Z; Wang, Q; Yang, C; Pan, Y
2016-10-01
This study was designed to investigate the genetic basis of growth and egg traits in Dongxiang blue-shelled chickens and White Leghorn chickens. In this study, we employed a reduced representation sequencing approach called genotyping by genome reducing and sequencing to detect genome-wide SNPs in 252 Dongxiang blue-shelled chickens and 252 White Leghorn chickens. The Dongxiang blue-shelled chicken breed has many specific traits and is characterized by blue-shelled eggs, black plumage, black skin, black bone and black organs. The White Leghorn chicken is an egg-type breed with high productivity. As multibreed genome-wide association studies (GWASs) can improve precision due to less linkage disequilibrium across breeds, a multibreed GWAS was performed with 156 575 SNPs to identify the associated variants underlying growth and egg traits within the two chicken breeds. The analysis revealed 32 SNPs exhibiting a significant genome-wide association with growth and egg traits. Some of the significant SNPs are located in genes that are known to impact growth and egg traits, but nearly half of the significant SNPs are located in genes with unclear functions in chickens. To our knowledge, this is the first multibreed genome-wide report for the genetics of growth and egg traits in the Dongxiang blue-shelled and White Leghorn chickens. © 2016 Stichting International Foundation for Animal Genetics.
Genome-wide QTL analysis for anxiety trait in bipolar disorder type I.
Contreras, J; Hare, E; Chavarría-Soley, G; Raventós, H
2018-07-01
Genetic studies have been consistent that bipolar disorder type I (BPI) runs in families and that this familial aggregation is strongly influenced by genes. In a preliminary study, we proved that anxiety trait meets endophenotype criteria for BPI. We assessed 619 individuals from the Central Valley of Costa Rica (CVCR) who have received evaluation for anxiety following the same methodological procedure used for the initial pilot study. Our goal was to conduct a multipoint quantitative trait linkage analysis to identify quantitative trait loci (QTLs) related to anxiety trait in subjects with BPI. We conducted the statistical analyses using Quantitative Trait Loci method (Variance-components models), implemented in Sequential Oligogenic Linkage Analysis Routines (SOLAR), using 5606 single nucleotide polymorphism (SNPs). We identified a suggestive linkage signal with a LOD score of 2.01 at chromosome 2 (2q13-q14). Since confounding factors such as substance abuse, medical illness and medication history were not assessed in our study, these conclusions should be taken as preliminary. We conclude that region 2q13-q14 may harbor a candidate gene(s) with an important role in the pathophysiology of BPI and anxiety. Published by Elsevier B.V.
Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio)
2014-01-01
Background A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. Results The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C.carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C.carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, with from 53,526 to 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. Conclusions The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species. PMID:24762296
Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio).
Xu, Jian; Zhao, Zixia; Zhang, Xiaofeng; Zheng, Xianhu; Li, Jiongtang; Jiang, Yanliang; Kuang, Youyi; Zhang, Yan; Feng, Jianxin; Li, Chuangju; Yu, Juhua; Li, Qiang; Zhu, Yuanyuan; Liu, Yuanyuan; Xu, Peng; Sun, Xiaowen
2014-04-24
A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C.carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C.carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, with from 53,526 to 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species.
Global genetic differentiation of complex traits shaped by natural selection in humans.
Guo, Jing; Wu, Yang; Zhu, Zhihong; Zheng, Zhili; Trzaskowski, Maciej; Zeng, Jian; Robinson, Matthew R; Visscher, Peter M; Yang, Jian
2018-05-14
There are mean differences in complex traits among global human populations. We hypothesize that part of the phenotypic differentiation is due to natural selection. To address this hypothesis, we assess the differentiation in allele frequencies of trait-associated SNPs among African, Eastern Asian, and European populations for ten complex traits using data of large sample size (up to ~405,000). We show that SNPs associated with height ([Formula: see text]), waist-to-hip ratio ([Formula: see text]), and schizophrenia ([Formula: see text]) are significantly more differentiated among populations than matched "control" SNPs, suggesting that these trait-associated SNPs have undergone natural selection. We further find that SNPs associated with height ([Formula: see text]) and schizophrenia ([Formula: see text]) show significantly higher variance in linkage disequilibrium (LD) scores across populations than control SNPs. Our results support the hypothesis that natural selection has shaped the genetic differentiation of complex traits, such as height and schizophrenia, among worldwide populations.
Philippi, Anne; Tores, Frédéric; Carayol, Jérome; Rousseau, Francis; Letexier, Mélanie; Roschmann, Elke; Lindenbaum, Pierre; Benajjou, Abdel; Fontaine, Karine; Vazart, Céline; Gesnouin, Philippe; Brooks, Peter; Hager, Jörg
2007-01-01
Background Autism is a complex, heterogeneous, behaviorally-defined disorder characterized by disruptions of the nervous system and of other systems such as the pituitary-hypothalamic axis. In a previous genome wide screen, we reported linkage of autism with a 1.2 Megabase interval on chromosome 5q31. For the current study, we hypothesized that 3 of the genes in this region could be involved in the development of autism: 1) paired-like homeodomain transcription factor 1 (PITX1), which is a key regulator of hormones within the pituitary-hypothalamic axis, 2) neurogenin 1, a transcription factor involved in neurogenesis, and 3) histone family member Y (H2AFY), which is involved in X-chromosome inactivation in females and could explain the 4:1 male:female gender distortion present in autism. Methods A total of 276 families from the Autism Genetic Resource Exchange (AGRE) repository composed of 1086 individuals including 530 affected children were included in the study. Single nucleotide polymorphisms tagging the three candidate genes were genotyped on the initial linkage sample of 116 families. A second step of analysis was performed using tightly linked SNPs covering the PITX1 gene. Association was evaluated using the FBAT software version 1.7.3 for single SNP analysis and the HBAT command from the same package for haplotype analysis respectively. Results Association between SNPs and autism was only detected for PITX1. Haplotype analysis within PITX1 showed evidence for overtransmission of the A-C haplotype of markers rs11959298 – rs6596189 (p = 0.0004). Individuals homozygous or heterozygous for the A-C haplotype risk allele were 2.54 and 1.59 fold more likely to be autistic than individuals who were not carrying the allele, respectively. Conclusion Strong and consistent association was observed between a 2 SNPs within PITX1 and autism. Our data suggest that PITX1, a key regulator of hormones within the pituitary-hypothalamic axis, may be implicated in the etiology of autism. PMID:18053270
Philippi, Anne; Tores, Frédéric; Carayol, Jérome; Rousseau, Francis; Letexier, Mélanie; Roschmann, Elke; Lindenbaum, Pierre; Benajjou, Abdel; Fontaine, Karine; Vazart, Céline; Gesnouin, Philippe; Brooks, Peter; Hager, Jörg
2007-12-06
Autism is a complex, heterogeneous, behaviorally-defined disorder characterized by disruptions of the nervous system and of other systems such as the pituitary-hypothalamic axis. In a previous genome wide screen, we reported linkage of autism with a 1.2 Megabase interval on chromosome 5q31. For the current study, we hypothesized that 3 of the genes in this region could be involved in the development of autism: 1) paired-like homeodomain transcription factor 1 (PITX1), which is a key regulator of hormones within the pituitary-hypothalamic axis, 2) neurogenin 1, a transcription factor involved in neurogenesis, and 3) histone family member Y (H2AFY), which is involved in X-chromosome inactivation in females and could explain the 4:1 male:female gender distortion present in autism. A total of 276 families from the Autism Genetic Resource Exchange (AGRE) repository composed of 1086 individuals including 530 affected children were included in the study. Single nucleotide polymorphisms tagging the three candidate genes were genotyped on the initial linkage sample of 116 families. A second step of analysis was performed using tightly linked SNPs covering the PITX1 gene. Association was evaluated using the FBAT software version 1.7.3 for single SNP analysis and the HBAT command from the same package for haplotype analysis respectively. Association between SNPs and autism was only detected for PITX1. Haplotype analysis within PITX1 showed evidence for overtransmission of the A-C haplotype of markers rs11959298 - rs6596189 (p = 0.0004). Individuals homozygous or heterozygous for the A-C haplotype risk allele were 2.54 and 1.59 fold more likely to be autistic than individuals who were not carrying the allele, respectively. Strong and consistent association was observed between a 2 SNPs within PITX1 and autism. Our data suggest that PITX1, a key regulator of hormones within the pituitary-hypothalamic axis, may be implicated in the etiology of autism.
Evidence for rare and common genetic risk variants for schizophrenia at protein kinase C, alpha.
Carroll, L S; Williams, N M; Moskvina, V; Russell, E; Norton, N; Williams, H J; Peirce, T; Georgieva, L; Dwyer, S; Grozeva, D; Greene, E; Farmer, A; McGuffin, P; Morris, D W; Corvin, A; Gill, M; Rujescu, D; Sham, P; Holmans, P; Jones, I; Kirov, G; Craddock, N; O'Donovan, M C; Owen, M J
2010-11-01
We earlier reported a genome-wide significant linkage to schizophrenia at chromosome 17 that was identified in a single pedigree (C702) consisting of six affected, male siblings with DSM-IV schizophrenia and prominent mood symptoms. In this study, we adopted several approaches in an attempt to map the putative disease locus. First, mapping the source of linkage to chromosome 17 in pedigree C702. We refined the linkage region in family C702 to a 21-marker segment spanning 11.7 Mb at 17q23-q24 by genotyping a total of 50 microsatellites across chromosome 17 in the pedigree. Analysis of data from 1028 single nucleotide polymorphisms (SNPs) across the refined linkage region identified a single region of homozygosity present in pedigree C702 but not in 2938 UK controls. This spanned ~432 kb of the gene encoding protein kinase C, alpha (PRKCA), the encoded protein of which has been implicated in the pathogenesis of psychiatric disorders. Analysis of pedigree C702 by oligonucleotide-array comparative genome hybridization excluded the possibility that this region of homozygosity was because of a deletion. Mutation screening of PRKCA identified a rare, four-marker haplotype (C-HAP) in the 3' untranslated region of the gene, which was present in the homozygous state in all six affected members of pedigree C702. No other homozygotes were observed in genotype data for a total of 6597 unrelated Europeans (case N=1755, control N=3580 and parents of probands N=1262). Second, association analysis of C702 alleles at PRKCA. The low-frequency haplotype (C-HAP) showed a trend for association in a study of unrelated schizophrenia cases and controls from the UK (661 cases, 2824 controls, P=0.078 and odd ratio (OR)=1.9) and significant evidence for association when the sample was expanded to include cases with bipolar (N=710) and schizoaffective disorder (N=50) (psychosis sample: 1421 cases, 2824 controls, P=0.037 and OR=1.9). Given that all the affected members of C702 are male, we also undertook sex-specific analyses. This revealed that the association was strongest in males for both schizophrenia (446 male cases, 1421 male controls, P=0.008 and OR=3.9) and in the broader psychosis group (730 male cases, 1421 male controls, P=0.008 and OR=3.6). Analysis of C-HAP in follow-up samples from Ireland and Bulgaria revealed no evidence for association in either the whole sample or in males alone, and meta-analysis of all male psychosis samples yielded no significant evidence of association (969 male cases, 1939 male controls, 311 male probands P=0.304 and OR=1.4). Third, association mapping of the pedigree C702 linkage region. Independent of pedigree C702, genotype data from the Affymetrix 500k GeneChip set were available for 476 patients with schizophrenia and 2938 controls from the United Kingdom. SNPs in PRKCA showed evidence for association with schizophrenia that achieved gene-wide significance (P=0.027). Moreover, the same SNP was the most significantly associated marker out of the 1028 SNPs genotyped across the linkage region (rs873417, allelic P=0.0004). Follow-up genotyping in samples from Ireland, Bulgaria and Germany did not show consistent replication, but meta-analysis of all samples (4116 cases and 6491 controls) remained nominally significant (meta-analysis P=0.026, OR=1.1). We conclude that, although we have obtained convergent lines of evidence implicating both rare and common schizophrenia risk variants at PRKCA, none of these is individually compelling. However, the evidence across all approaches suggests that further study of this locus is warranted.
Groen-Blokhuis, Maria M.; Pourcain, Beate St.; Greven, Corina U.; Pappa, Irene; Tiesler, Carla M.T.; Ang, Wei; Nolte, Ilja M.; Vilor-Tejedor, Natalia; Bacelis, Jonas; Ebejer, Jane L.; Zhao, Huiying; Davies, Gareth E.; Ehli, Erik A.; Evans, David M.; Fedko, Iryna O.; Guxens, Mònica; Hottenga, Jouke-Jan; Hudziak, James J.; Jugessur, Astanand; Kemp, John P.; Krapohl, Eva; Martin, Nicholas G.; Murcia, Mario; Myhre, Ronny; Ormel, Johan; Ring, Susan M.; Standl, Marie; Stergiakouli, Evie; Stoltenberg, Camilla; Thiering, Elisabeth; Timpson, Nicholas J.; Trzaskowski, Maciej; van der Most, Peter J.; Wang, Carol; Nyholt, Dale R.; Medland, Sarah E.; Neale, Benjamin; Jacobsson, Bo; Sunyer, Jordi; Hartman, Catharina A.; Whitehouse, Andrew J.O.; Pennell, Craig E.; Heinrich, Joachim; Plomin, Robert; Smith, George Davey; Tiemeier, Henning; Posthuma, Danielle; Boomsma, Dorret I.
2016-01-01
Objective To elucidate the influence of common genetic variants on childhood attention-deficit/hyperactivity disorder (ADHD) symptoms, to identify genetic variants that explain its high heritability, and to investigate the genetic overlap of ADHD symptom scores with ADHD diagnosis. Method Within the EArly Genetics and Lifecourse Epidemiology (EAGLE) consortium, genome-wide single nucleotide polymorphisms (SNPs) and ADHD symptom scores were available for 17,666 children (< 13 years) from nine population-based cohorts. SNP-based heritability was estimated in data from the three largest cohorts. Meta-analysis based on genome-wide association (GWA) analyses with SNPs was followed by gene-based association tests, and the overlap in results with a meta-analysis in the Psychiatric Genomics Consortium (PGC) case-control ADHD study was investigated. Results SNP-based heritability ranged from 5% to 34%, indicating that variation in common genetic variants influences ADHD symptom scores. The meta-analysis did not detect genome-wide significant SNPs, but three genes, lying close to each other with SNPs in high linkage disequilibrium (LD), showed a gene-wide significant association (p values between 1.46×10-6 and 2.66×10-6). One gene, WASL, is involved in neuronal development. Both SNP- and gene-based analyses indicated overlap with the PGC meta-analysis results with the genetic correlation estimated at 0.96. Conclusion The SNP-based heritability for ADHD symptom scores indicates a polygenic architecture and genes involved in neurite outgrowth are possibly involved. Continuous and dichotomous measures of ADHD appear to assess a genetically common phenotype. A next step is to combine data from population-based and case-control cohorts in genetic association studies to increase sample size and improve statistical power for identifying genetic variants. PMID:27663945
A GENOME-WIDE LINKAGE AND ASSOCIATION SCAN REVEALS NOVEL LOCI FOR AUTISM
Weiss, Lauren A.; Arking, Dan E.
2009-01-01
Summary Although autism is a highly heritable neurodevelopmental disorder, attempts to identify specific susceptibility genes have thus far met with limited success 1. Genome-wide association studies (GWAS) using half a million or more markers, particularly those with very large sample sizes achieved through meta-analysis, have shown great success in mapping genes for other complex genetic traits (http://www.genome.gov/26525384). Consequently, we initiated a linkage and association mapping study using half a million genome-wide SNPs in a common set of 1,031 multiplex autism families (1,553 affected offspring). We identified regions of suggestive and significant linkage on chromosomes 6q27 and 20p13, respectively. Initial analysis did not yield genome-wide significant associations; however, genotyping of top hits in additional families revealed a SNP on chromosome 5p15 (between SEMA5A and TAS2R1) that was significantly associated with autism (P = 2 × 10−7). We also demonstrated that expression of SEMA5A is reduced in brains from autistic patients, further implicating SEMA5A as an autism susceptibility gene. The linkage regions reported here provide targets for rare variation screening while the discovery of a single novel association demonstrates the action of common variants. PMID:19812673
Li, Cong; Sun, Dongxiao; Zhang, Shengli; Yang, Shaohua; Alim, M A; Zhang, Qin; Li, Yanhua; Liu, Lin
2016-07-28
A previous genome-wide association study deduced that one (ARS-BFGL-NGS-39328), two (Hapmap26001-BTC-038813 and Hapmap31284-BTC-039204), two (Hapmap26001-BTC-038813 and BTB-00246150), and one (Hapmap50366-BTA-46960) genome-wide significant single nucleotide polymorphisms (SNPs) associated with milk fatty acids were close to or within the fatty acid synthase (FASN), peroxisome proliferator-activated receptor gamma, coactivator 1 alpha (PPARGC1A), ATP-binding cassette, sub-family G, member 2 (ABCG2) and insulin-like growth factor 1 (IGF1) genes. To further confirm the linkage and reveal the genetic effects of these four candidate genes on milk fatty acid composition, genetic polymorphisms were identified and genotype-phenotype associations were performed in a Chinese Holstein cattle population. Nine SNPs were identified in FASN, among which SNP rs41919985 was predicted to result in an amino acid substitution from threonine (ACC) to alanine (GCC), five SNPs (rs136947640, rs134340637, rs41919992, rs41919984 and rs41919986) were synonymous mutations, and the remaining three (rs41919999, rs132865003 and rs133498277) were found in FASN introns. Only one SNP each was identified for PPARGC1A, ABCG2 and IGF1. Association studies revealed that FASN, PPARGC1A, ABCG2 and IGF1 were mainly associated with medium-chain saturated fatty acids and long-chain unsaturated fatty acids, especially FASN for C10:0, C12:0 and C14:0. Strong linkage disequilibrium was observed among ARS-BFGL-NGS-39328 and rs132865003 and rs134340637 in FASN (D´ > 0.9), and among Hapmap26001-BTC-038813 and Hapmap31284-BTC-039204 and rs109579682 in PPARGC1A (D´ > 0.9). Subsequently, haplotype-based analysis revealed significant associations of the haplotypes encompassing eight FASN SNPs (rs41919999, rs132865003, rs134340637, rs41919992, rs133498277, rs41919984, rs41919985 and rs41919986) with C10:0, C12:0, C14:0, C18:1n9c, saturated fatty acids (SFA) and unsaturated fatty acids (UFA) (P = 0.0204 to P < 0.0001). Our study confirmed the linkage between the significant SNPs in our previous genome-wide association study and variants in FASN and PPARGC1A. SNPs within FASN, PPARGC1A, ABCG2 and IGF1 showed significant genetic effects on milk fatty acid composition in dairy cattle, indicating their potential functions in milk fatty acids synthesis and metabolism. The findings presented here provide evidence for the selection of dairy cows with healthier milk fatty acid composition by marker-assisted breeding or genomic selection schemes, as well as furthering our understanding of technological processing aspects of cows' milk.
Dosage Transmission Disequilibrium Test (dTDT) for Linkage and Association Detection
Zhang, Zhehao; Wang, Jen-Chyong; Howells, William; Lin, Peng; Agrawal, Arpana; Edenberg, Howard J.; Tischfield, Jay A.; Schuckit, Marc A.; Bierut, Laura J.; Goate, Alison; Rice, John P.
2013-01-01
Both linkage and association studies have been successfully applied to identify disease susceptibility genes with genetic markers such as microsatellites and Single Nucleotide Polymorphisms (SNPs). As one of the traditional family-based studies, the Transmission/Disequilibrium Test (TDT) measures the over-transmission of an allele in a trio from its heterozygous parents to the affected offspring and can be potentially useful to identify genetic determinants for complex disorders. However, there is reduced information when complete trio information is unavailable. In this study, we developed a novel approach to “infer” the transmission of SNPs by combining both the linkage and association data, which uses microsatellite markers from families informative for linkage together with SNP markers from the offspring who are genotyped for both linkage and a Genome-Wide Association Study (GWAS). We generalized the traditional TDT to process these inferred dosage probabilities, which we name as the dosage-TDT (dTDT). For evaluation purpose, we developed a simulation procedure to assess its operating characteristics. We applied the dTDT to the simulated data and documented the power of the dTDT under a number of different realistic scenarios. Finally, we applied our methods to a family study of alcohol dependence (COGA) and performed individual genotyping on complete families for the top signals. One SNP (rs4903712 on chromosome 14) remained significant after correcting for multiple testing Methods developed in this study can be adapted to other platforms and will have widespread applicability in genomic research when case-control GWAS data are collected in families with existing linkage data. PMID:23691058
Vaughan, Laura Kelly; Wiener, Howard W.; Aslibekyan, Stella; Allison, David B.; Havel, Peter J.; Stanhope, Kimber L.; O’Brien, Diane M.; Hopkins, Scarlett E.; Lemas, Dominick J.; Boyer, Bert B.; Tiwari, Hemant K.
2015-01-01
Objective To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup’ik people. Material and Methods We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. Results We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). Conclusions This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup’ik people. PMID:25772781
Vaughan, Laura Kelly; Wiener, Howard W; Aslibekyan, Stella; Allison, David B; Havel, Peter J; Stanhope, Kimber L; O'Brien, Diane M; Hopkins, Scarlett E; Lemas, Dominick J; Boyer, Bert B; Tiwari, Hemant K
2015-06-01
To identify novel genetic markers of obesity-related traits and to identify gene-diet interactions with n-3 polyunsaturated fatty acid (n-3 PUFA) intake in Yup'ik people. We measured body composition, plasma adipokines and ghrelin in 982 participants enrolled in the Center for Alaska Native Health Research (CANHR) Study. We conducted a genome-wide SNP linkage scan and targeted association analysis, fitting additional models to investigate putative gene-diet interactions. Finally, we performed bioinformatic analysis to uncover likely candidate genes within the identified linkage peaks. We observed evidence of linkage for all obesity-related traits, replicating previous results and identifying novel regions of interest for adiponectin (10q26.13-2) and thigh circumference (8q21.11-13). Bioinformatic analysis revealed DOCK1, PTPRE (10q26.13-2) and FABP4 (8q21.11-13) as putative candidate genes in the newly identified regions. Targeted SNP analysis under the linkage peaks identified associations between three SNPs and obesity-related traits: rs1007750 on chromosome 8 and thigh circumference (P=0.0005), rs878953 on chromosome 5 and thigh skinfold (P=0.0004), and rs1596854 on chromosome 11 for waist circumference (P=0.0003). Finally, we showed that n-3 PUFA modified the association between obesity related traits and two additional variants (rs2048417 on chromosome 3 for adiponectin, P for interaction=0.0006 and rs730414 on chromosome 11 for percentage body fat, P for interaction=0.0004). This study presents evidence of novel genomic regions and gene-diet interactions that may contribute to the pathophysiology of obesity-related traits among Yup'ik people. Copyright © 2015 Elsevier Inc. All rights reserved.
Gallagher, Carla J; Keene, Keith L; Mychaleckyj, Josyf C; Langefeld, Carl D; Hirschhorn, Joel N; Henderson, Brian E; Gordon, Candace J; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M
2007-03-01
The estrogen receptor-alpha gene (ESR1) was selected as a positional candidate under a type 2 diabetes linkage peak at 6q24-27. A total of 42 ESR1 single nucleotide polymorphisms (SNPs) were genotyped in 380 African-American type 2 diabetic case subjects with end-stage renal disease (ESRD) and 276 African-American control subjects. A total of 22 ancestry informative markers were also genotyped, and the program Admixmap was used to adjust allelic and haplotypic association tests for individual estimates of admixture. The most significant association with type 2 diabetes-ESRD was with rs1033182 in intron 2 (P = 0.013, admixture-adjusted P(a) = 0.021). Genotyping 17 SNPs across a region of ESR1 intron 1-intron 2 in an expanded population of 851 case and 635 control subjects supported association with rs1033182 (P = 0.004, P(a) = 0.027) and with an independent six-SNP haplotype of high linkage disequilibrium spanning 6.4 kb (P < 0.0001, P(a) < 0.0001). The same 17 ESR1 SNPs were genotyped in 300 European-American type 2 diabetes-ESRD case subjects and 310 European-American control subjects. Two intron 2 SNPs, rs2431260 (P = 0.015) and rs1709183 (P = 0.019), and a four-SNP haplotype containing these SNPs (P = 0.033) were associated with type 2 diabetes and/or ESRD. Results suggest that intron 1 and intron 2 of the ESR1 gene may contain functionally important regions related to type 2 diabetes or ESRD risk.
Lawhorn, Collene; Yuferov, Vadim; Randesi, Matthew; Ho, Ann; Morgello, Susan; Kreek, Mary Jeanne; Levran, Orna
2013-01-01
Background Chemokine receptors CCR2 and CCR5 play a key role in immune and inflammatory responses and have been associated with several diseases, including AIDS. In order to comprehend health disparities it is important to understand the nature of genetic variation in specific genes of interest in different populations. Current studies of the CCR2 and CCR5 receptor genes are primarily focused on the CCR5-Δ32, and CCR2-V64I SNPs. Methods Sanger sequencing was used to sequence the regions containing 16 SNPs in the adjacent CCR2 and CCR5 genes (including CCR5-Δ32, and CCR2-V64I) in 249 subjects of African, European and Hispanic ancestry. Linkage disequilibrium (LD) and haplotypes were determined using Haploview. Results The data revealed large differences in allele frequencies of several SNPs and LD patterns among the ethnic groups, including SNPs that were restricted to Africans or Europeans. Seven known CCR5 haplotypes and six novel CCR2 haplotypes were identified. A rare case of an HIV+ subject with the CCR5-Δ32/Δ32 was identified. Conclusions These data demonstrate a LD between CCR2 and CCR5 at several loci and provide new information about CCR2 that contributes to our understanding of its population-specific genetic variability. The data indicate that in addition to CCR5-Δ32 and CCR2-V64I, other SNPs and haplotypes may be important genetic determinants of disease and should be investigated. PMID:24011637
Zhao, Huiying; Nyholt, Dale R; Yang, Yuanhao; Wang, Jihua; Yang, Yuedong
2017-06-14
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous pathway-based methods have been developed. Here we propose a novel method, DGAT-path, to divide all SNPs assigned to genes in each pathway into LD blocks, and to sum the chi-square statistics of LD blocks for assessing the significance of the pathway by permutation tests. The method was proven robust with the type I error rate >1.6 times lower than other methods. Meanwhile, the method displays a higher power and is not biased by the pathway size. The applications to the GWAS summary statistics for schizophrenia and breast cancer indicate that the detected top pathways contain more genes close to associated SNPs than other methods. As a result, the method identified 17 and 12 significant pathways containing 20 and 21 novel associated genes, respectively for two diseases. The method is available online by http://sparks-lab.org/server/DGAT-path .
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
El-Dakdouki, Mohammad H; Puré, Ellen; Huang, Xuefei
2013-05-07
Nanoparticles (NPs) are being extensively studied as carriers for drug delivery, but they often have limited penetration inside tumors. We envision that by targeting an endocytic receptor on the cell surface, the uptake of NPs can be significantly enhanced through receptor mediated endocytosis. In addition, if the receptor is recycled to the cell surface, the NP cargo can be transported out of the cells, which is then taken up by neighboring cells thus enhancing solid tumor penetration. To validate our hypothesis, in the first of two articles, we report the synthesis of doxorubicin (DOX)-loaded, hyaluronan (HA) coated silica nanoparticles (SNPs) containing a highly fluorescent core to target CD44, a receptor expressed on the cancer cell surface. HA was conjugated onto amine-functionalized SNPs prepared through an oil-water microemulsion method. The immobilization of the cytotoxic drug DOX was achieved through an acid sensitive hydrazone linkage. The NPs were fully characterized by transmission electron microscopy (TEM), dynamic light scattering (DLS), zeta potential measurements, thermogravimetric analysis (TGA), UV-vis absorbance, and nuclear magnetic resonance (NMR). Initial biological evaluation experiments demonstrated that compared to ligand-free SNPs, the uptake of HA-SNPs by the CD44-expressing SKOV-3 ovarian cancer cells was significantly enhanced when evaluated in the 2D monolayer cell culture. Mechanistic studies suggested that cellular uptake of HA-SNPs was mainly through CD44 mediated endocytosis. HA-SNPs with immobilized DOX were endocytosed efficiently by the SKOV-3 cells as well. The enhanced tumor penetration and drug delivery properties of HA-SNPs will be evaluated in 3D tumor models in the subsequent paper.
Dvornyk, Volodymyr; Long, Ji-Rong; Xiong, Dong-Hai; Liu, Peng-Yuan; Zhao, Lan-Juan; Shen, Hui; Zhang, Yuan-Yuan; Liu, Yong-Jun; Rocha-Sanchez, Sonia; Xiao, Peng; Recker, Robert R; Deng, Hong-Wen
2004-02-25
Public SNP databases are frequently used to choose SNPs for candidate genes in the association and linkage studies of complex disorders. However, their utility for such studies of diseases with ethnic-dependent background has never been evaluated. To estimate the accuracy and completeness of SNP public databases, we analyzed the allele frequencies of 41 SNPs in 10 candidate genes for obesity and/or osteoporosis in a large American-Caucasian sample (1,873 individuals from 405 nuclear families) by PCR-invader assay. We compared our results with those from the databases and other published studies. Of the 41 SNPs, 8 were monomorphic in our sample. Twelve were reported for the first time for Caucasians and the other 29 SNPs in our sample essentially confirmed the respective allele frequencies for Caucasians in the databases and previous studies. The comparison of our data with other ethnic groups showed significant differentiation between the three major world ethnic groups at some SNPs (Caucasians and Africans differed at 3 of the 18 shared SNPs, and Caucasians and Asians differed at 13 of the 22 shared SNPs). This genetic differentiation may have an important implication for studying the well-known ethnic differences in the prevalence of obesity and osteoporosis, and complex disorders in general. A comparative analysis of the SNP data of the candidate genes obtained in the present study, as well as those retrieved from the public domain, suggests that the databases may currently have serious limitations for studying complex disorders with an ethnic-dependent background due to the incomplete and uneven representation of the candidate SNPs in the databases for the major ethnic groups. This conclusion attests to the imperative necessity of large-scale and accurate characterization of these SNPs in different ethnic groups.
Dvornyk, Volodymyr; Long, Ji-Rong; Xiong, Dong-Hai; Liu, Peng-Yuan; Zhao, Lan-Juan; Shen, Hui; Zhang, Yuan-Yuan; Liu, Yong-Jun; Rocha-Sanchez, Sonia; Xiao, Peng; Recker, Robert R; Deng, Hong-Wen
2004-01-01
Background Public SNP databases are frequently used to choose SNPs for candidate genes in the association and linkage studies of complex disorders. However, their utility for such studies of diseases with ethnic-dependent background has never been evaluated. Results To estimate the accuracy and completeness of SNP public databases, we analyzed the allele frequencies of 41 SNPs in 10 candidate genes for obesity and/or osteoporosis in a large American-Caucasian sample (1,873 individuals from 405 nuclear families) by PCR-invader assay. We compared our results with those from the databases and other published studies. Of the 41 SNPs, 8 were monomorphic in our sample. Twelve were reported for the first time for Caucasians and the other 29 SNPs in our sample essentially confirmed the respective allele frequencies for Caucasians in the databases and previous studies. The comparison of our data with other ethnic groups showed significant differentiation between the three major world ethnic groups at some SNPs (Caucasians and Africans differed at 3 of the 18 shared SNPs, and Caucasians and Asians differed at 13 of the 22 shared SNPs). This genetic differentiation may have an important implication for studying the well-known ethnic differences in the prevalence of obesity and osteoporosis, and complex disorders in general. Conclusion A comparative analysis of the SNP data of the candidate genes obtained in the present study, as well as those retrieved from the public domain, suggests that the databases may currently have serious limitations for studying complex disorders with an ethnic-dependent background due to the incomplete and uneven representation of the candidate SNPs in the databases for the major ethnic groups. This conclusion attests to the imperative necessity of large-scale and accurate characterization of these SNPs in different ethnic groups. PMID:15113403
Identification of genomic regions contributing to etoposide-induced cytotoxicity.
Bleibel, Wasim K; Duan, Shiwei; Huang, R Stephanie; Kistner, Emily O; Shukla, Sunita J; Wu, Xiaolin; Badner, Judith A; Dolan, M Eileen
2009-03-01
Etoposide is routinely used in combination-based chemotherapy for testicular cancer and small-cell lung cancer; however, myelosuppression, therapy-related leukemia and neurotoxicity limit its utility. To determine the genetic contribution to cellular sensitivity to etoposide, we evaluated cell growth inhibition in Centre d' Etude du Polymorphisme Humain lymphoblastoid cell lines from 24 multi-generational pedigrees (321 samples) following treatment with 0.02-2.5 microM etoposide for 72 h. Heritability analysis showed that genetic variation contributes significantly to the cytotoxic phenotypes (h (2) = 0.17-0.25, P = 4.9 x 10(-5)-7.3 x 10(-3)). Whole genome linkage scans uncovered 8 regions with peak LOD scores ranging from 1.57 to 2.55, with the most significant signals being found on chromosome 5 (LOD = 2.55) and chromosome 6 (LOD = 2.52). Linkage-directed association was performed on a subset of HapMap samples within the pedigrees to find 22 SNPs significantly associated with etoposide cytotoxicity at one or more treatment concentrations. UVRAG, a DNA repair gene, SEMA5A, SLC7A6 and PRMT7 are implicated from these unbiased studies. Our findings suggest that susceptibility to etoposide-induced cytotoxicity is heritable and using an integrated genomics approach we identified both genomic regions and SNPs associated with the cytotoxic phenotypes.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Ichikawa, Shoji; Koller, Daniel L.; Curry, Leah R.; Lai, Dongbing; Xuei, Xiaoling; Edenberg, Howard J.; Hui, Siu L.; Peacock, Munro; Foroud, Tatiana; Econs, Michael J.
2010-01-01
Phenotypic variation in bone mineral density (BMD) among healthy adults is influenced by both genetic and environmental factors. Genetic sequence variations in the adenylate cyclase 10 (ADCY10) gene, which is also called soluble adenylate cyclase, have previously been reported to be associated with low spinal BMD in hypercalciuric patients. Since ADCY10 is located in the region linked to spinal BMD in our previous linkage analysis, we tested whether polymorphisms in this gene are also associated with normal BMD variation in healthy adults. Sixteen single nucleotide polymorphisms (SNPs) distributed throughout ADCY10 were genotyped in two healthy groups of American whites: 1,692 premenopausal women and 715 men. Statistical analyses were performed in the two groups to test for association between these SNPs and femoral neck and lumbar spine areal BMD. We observed significant evidence of association (p<0.01) with one SNP each in men and women. Genotypes at these SNPs accounted for less than 1% of hip BMD variation in men, but 1.5% of spinal BMD in women. However, adjacent SNPs did not corroborate the association in either males or females. In conclusion, we found a modest association between an ADCY10 polymorphism and spinal areal BMD in premenopausal white women. PMID:19093065
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro; ...
2017-03-10
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Bassil, Nahla V; Davis, Thomas M; Zhang, Hailong; Ficklin, Stephen; Mittmann, Mike; Webster, Teresa; Mahoney, Lise; Wood, David; Alperin, Elisabeth S; Rosyara, Umesh R; Koehorst-Vanc Putten, Herma; Monfort, Amparo; Sargent, Daniel J; Amaya, Iraida; Denoyes, Beatrice; Bianco, Luca; van Dijk, Thijs; Pirani, Ali; Iezzoni, Amy; Main, Dorrie; Peace, Cameron; Yang, Yilong; Whitaker, Vance; Verma, Sujeet; Bellon, Laurent; Brew, Fiona; Herrera, Raul; van de Weg, Eric
2015-03-07
A high-throughput genotyping platform is needed to enable marker-assisted breeding in the allo-octoploid cultivated strawberry Fragaria × ananassa. Short-read sequences from one diploid and 19 octoploid accessions were aligned to the diploid Fragaria vesca 'Hawaii 4' reference genome to identify single nucleotide polymorphisms (SNPs) and indels for incorporation into a 90 K Affymetrix® Axiom® array. We report the development and preliminary evaluation of this array. About 36 million sequence variants were identified in a 19 member, octoploid germplasm panel. Strategies and filtering pipelines were developed to identify and incorporate markers of several types: di-allelic SNPs (66.6%), multi-allelic SNPs (1.8%), indels (10.1%), and ploidy-reducing "haploSNPs" (11.7%). The remaining SNPs included those discovered in the diploid progenitor F. iinumae (3.9%), and speculative "codon-based" SNPs (5.9%). In genotyping 306 octoploid accessions, SNPs were assigned to six classes with Affymetrix's "SNPolisher" R package. The highest quality classes, PolyHigh Resolution (PHR), No Minor Homozygote (NMH), and Off-Target Variant (OTV) comprised 25%, 38%, and 1% of array markers, respectively. These markers were suitable for genetic studies as demonstrated in the full-sib family 'Holiday' × 'Korona' with the generation of a genetic linkage map consisting of 6,594 PHR SNPs evenly distributed across 28 chromosomes with an average density of approximately one marker per 0.5 cM, thus exceeding our goal of one marker per cM. The Affymetrix IStraw90 Axiom array is the first high-throughput genotyping platform for cultivated strawberry and is commercially available to the worldwide scientific community. The array's high success rate is likely driven by the presence of naturally occurring variation in ploidy level within the nominally octoploid genome, and by effectiveness of the employed array design and ploidy-reducing strategies. This array enables genetic analyses including generation of high-density linkage maps, identification of quantitative trait loci for economically important traits, and genome-wide association studies, thus providing a basis for marker-assisted breeding in this high value crop.
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci
Brorsson, Caroline A.; Pociot, Flemming
2014-01-01
Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376
López, Sonia; Buil, Alfonso; Ordoñez, Jordi; Souto, Juan Carlos; Almasy, Laura; Lathrop, Mark; Blangero, John; Blanco-Vaca, Francisco; Fontcuberta, Jordi; Soria, José Manuel
2009-01-01
Lipoprotein Lp(a) levels are highly heritable and are associated with cardiovascular risk. We performed a genome-wide linkage analysis to delineate the genomic regions that influence the concentration of Lp(a) in families from the Genetic Analysis of Idiopathic Thrombophilia (GAIT) Project. Lp(a) levels were measured in 387 individuals belonging to 21 extended Spanish families. A total of 485 DNA microsatellite markers were genotyped to provide a 7.1 cM genetic map. A variance component linkage method was used to evaluate linkage and to detect quantitative trait loci (QTLs). The main QTL that showed strong evidence of linkage with Lp(a) levels was located at the structural gene for apo(a) on Chromosome 6 (LOD score=13.8). Interestingly, another QTL influencing Lp(a) concentration was located on Chromosome 2 with a LOD score of 2.01. This region contains several candidate genes. One of them is the tissue factor pathway inhibitor (TFPI), which has antithrombotic action and also has the ability to bind lipoproteins. However, quantitative trait association analyses performed with 12 SNPs in TFPI gene revealed no association with Lp(a) levels. Our study confirms previous results on the genetic basis of Lp(a) levels. In addition, we report a new QTL on Chromosome 2 involved in the quantitative variation of Lp(a). These data should serve as the basis for further detection of candidate genes and to elucidate the relationship between the concentration of Lp(a) and cardiovascular risk. PMID:18560444
Verardo, Lucas L; Sevón-Aimonen, Marja-Liisa; Serenius, Timo; Hietakangas, Ville; Uimari, Pekka
2017-02-13
One of the most commonly used quality measurements of pork is pH measured 24 h after slaughter. The most probable mode of inheritance for this trait is oligogenic with several known major genes, such as PRKAG3. In this study, we used whole-genome SNP genotypes of over 700 AI boars; after a quality check, 42,385 SNPs remained for association analysis. All the boars were purebred Finnish Yorkshire. To account for relatedness of the animals, a pedigree-based relationship matrix was used in a mixed linear model to test the effect of SNPs on pH measured from loin. A bioinformatics analysis was performed to identify the most promising genes in the significant regions related to meat quality. Genome-wide association study (GWAS) revealed three significant chromosomal regions: one on chromosome 3 (39.9 Mb-40.1 Mb) and two on chromosome 15 (58.5 Mb-60.5 Mb and 132 Mb-135 Mb including PRKAG3). A conditional analysis with a significant SNP in the PRKAG3 region, MARC0083357, as a covariate in the model retained the significant SNPs on chromosome 3. Even though linkage disequilibrium was relatively high over a long distance between MARC0083357 and other significant SNPs on chromosome 15, some SNPs retained their significance in the conditional analysis, even in the vicinity of PRKAG3. The significant regions harbored several genes, including two genes involved in cyclic AMP (cAMP) signaling: ADCY9 and CREBBP. Based on functional and transcription factor-gene networks, the most promising candidate genes for meat pH are ADCY9, CREBBP, TRAP1, NRG1, PRKAG3, VIL1, TNS1, and IGFBP5, and the key transcription factors related to these genes are HNF4A, PPARG, and Nkx2-5. Based on SNP association, pathway, and transcription factor analysis, we were able to identify several genes with potential to control muscle cell homeostasis and meat quality. The associated SNPs can be used in selection for better pork. We also showed that post-GWAS analysis reveals important information about the genes' potential role on meat quality. The gained information can be used in later functional studies.
Association of OPRD1 polymorphisms with heroin dependence in a large case-control series.
Nelson, Elliot C; Lynskey, Michael T; Heath, Andrew C; Wray, Naomi; Agrawal, Arpana; Shand, Fiona L; Henders, Anjali K; Wallace, Leanne; Todorov, Alexandre A; Schrage, Andrew J; Madden, Pamela A F; Degenhardt, Louisa; Martin, Nicholas G; Montgomery, Grant W
2014-01-01
Genes encoding the opioid receptors (OPRM1, OPRD1 and OPRK1) are obvious candidates for involvement in risk for heroin dependence. Prior association studies commonly had samples of modest size, included limited single nucleotide polymorphism (SNP) coverage of these genes and yielded inconsistent results. Participants for the current investigation included 1459 heroin-dependent cases ascertained from maintenance clinics in New South Wales, Australia, 1495 unrelated individuals selected from an Australian sample of twins and siblings as not meeting DSM-IV criteria for lifetime alcohol or illicit drug dependence (non-dependent controls) and 531 controls ascertained from economically disadvantaged neighborhoods in proximity to the maintenance clinics. A total of 136 OPRM1, OPRD1 and OPRK1 SNPs were genotyped in this sample. After controlling for admixture with principal components analysis, our comparison of cases to non-dependent controls found four OPRD1 SNPs in fairly high linkage disequilibrium for which adjusted P values remained significant (e.g. rs2236857; OR 1.25; P=2.95×10(-4) ) replicating a previously reported association. A post hoc analysis revealed that the two SNP (rs2236857 and rs581111) GA haplotype in OPRD1 is associated with greater risk (OR 1.68; P=1.41×10(-5) ). No OPRM1 or OPRK1 SNPs reached more than nominal significance. Comparisons of cases to neighborhood controls reached only nominal significance. Our results replicate a prior report providing strong evidence implicating OPRD1 SNPs and, in particular, the two SNP (rs2236857 and rs581111) GA haplotype in liability for heroin dependence. Support was not found for similar association involving either OPRM1 or OPRK1 SNPs. © 2012 The Authors, Addiction Biology © 2012 Society for the Study of Addiction.
Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease
Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.
2011-01-01
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713
Linkage disequilibrium between STRPs and SNPs across the human genome.
Payseur, Bret A; Place, Michael; Weber, James L
2008-05-01
Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.
Salazar, Juan A; Pacheco, Igor; Shinya, Paulina; Zapata, Patricio; Silva, Claudia; Aradhya, Mallikarjuna; Velasco, Dianne; Ruiz, David; Martínez-Gómez, Pedro; Infante, Rodrigo
2017-01-01
Marker-assisted selection (MAS) in stone fruit ( Prunus species) breeding is currently difficult to achieve due to the polygenic nature of the most relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for fine mapping using Single Nucleotide Polymorphisms (SNPs) from a reference genome. In this study, GBS was used to genotype 272 seedlings of three F1 Japanese plum ( Prunus salicina Lindl) progenies derived from crossing "98-99" (as a common female parent) with "Angeleno," "September King," and "September Queen" as male parents. Raw sequences were aligned to the Peach genome v1, and 42,909 filtered SNPs were obtained after sequence alignment. In addition, 153 seedlings from the "98-99" × "Angeleno" cross were used to develop a genetic map for each parent. A total of 981 SNPs were mapped (479 for "98-99" and 502 for "Angeleno"), covering a genetic distance of 688.8 and 647.03 cM, respectively. Fifty five seedlings from this progeny were phenotyped for different fruit quality traits including ripening time, fruit weight, fruit shape, chlorophyll index, skin color, flesh color, over color, firmness, and soluble solids content in the years 2015 and 2016. Linkage-based QTL analysis allowed the identification of genomic regions significantly associated with ripening time (LG4 of both parents and both phenotyping years), fruit skin color (LG3 and LG4 of both parents and both years), chlorophyll degradation index (LG3 of both parents in 2015) and fruit weight (LG7 of both parents in 2016). These results represent a promising situation for GBS in the identification of SNP variants associated to fruit quality traits, potentially applicable in breeding programs through MAS, in a highly heterozygous crop species such as Japanese plum.
Baranski, Matthew; Gopikrishna, Gopalapillay; Robinson, Nicholas A.; Katneni, Vinaya Kumar; Shekhar, Mudagandur S.; Shanmugakarthik, Jayakani; Jothivel, Sarangapani; Gopal, Chavali; Ravichandran, Pitchaiyappan; Kent, Matthew; Arnyasi, Mariann; Ponniah, Alphis G.
2014-01-01
Transcriptome sequencing using Illumina RNA-seq was performed on populations of black tiger shrimp from India. Samples were collected from (i) four landing centres around the east coastline (EC) of India, (ii) survivors of a severe WSSV infection during pond culture (SUR) and (iii) the Andaman Islands (AI) in the Bay of Bengal. Equal quantities of purified total RNA from homogenates of hepatopancreas, muscle, nervous tissue, intestinal tract, heart, gonad, gills, pleopod and lymphoid organs were combined to create AI, EC and SUR pools for RNA sequencing. De novo transcriptome assembly resulted in 136,223 contigs (minimum size 100 base pairs, bp) with a total length 61 Mb, an average length of 446 bp and an average coverage of 163× across all pools. Approximately 16% of contigs were annotated with BLAST hit information and gene ontology annotations. A total of 473,620 putative SNPs/indels were identified. An Illumina iSelect genotyping array containing 6,000 SNPs was developed and used to genotype 1024 offspring belonging to seven full-sibling families. A total of 3959 SNPs were mapped to 44 linkage groups. The linkage groups consisted of between 16–129 and 13–130 markers, of length between 139–10.8 and 109.1–10.5 cM and with intervals averaging between 1.2 and 0.9 cM for the female and male maps respectively. The female map was 28% longer than the male map (4060 and 2917 cM respectively) with a 1.6 higher recombination rate observed for female compared to male meioses. This approach has substantially increased expressed sequence and DNA marker resources for tiger shrimp and is a useful resource for QTL mapping and association studies for evolutionarily and commercially important traits. PMID:24465553
Examining ERBB2 as a candidate gene for susceptibility to leprosy (Hansen’s disease) in Brazil
Araújo, Sérgio Ricardo Fernandes; Jamieson, Sarra Elisabeth; Dupnik, Kathryn Margaret; Monteiro, Glória Regina; Nobre, Maurício Lisboa; Dias, Márcia Sousa; Trindade, Pedro Bezerra; Queiroz, Maria do Carmo Palmeira; Gomes, Carlos Eduardo Maia; Blackwell, Jenefer Mary; Jeronimo, Selma Maria Bezerra
2014-01-01
Leprosy remains prevalent in Brazil. ErbB2 is a receptor for leprosy bacilli entering Schwann cells, which mediates Mycobacterium leprae-induced demyelination and the ERBB2 gene lies within a leprosy susceptibility locus on chromosome 17q11-q21. To determine whether polymorphisms at the ERBB2 locus contribute to this linkage peak, three haplotype tagging single nucleotide polymorphisms (tag-SNPs) (rs2517956, rs2952156, rs1058808) were genotyped in 72 families (208 cases; 372 individuals) from the state of Pará (PA). All three tag-SNPs were associated with leprosy per se [best SNP rs2517959 odds ratio (OR) = 2.22; 95% confidence interval (CI) 1.37-3.59; p = 0.001]. Lepromatous (LL) (OR = 3.25; 95% CI 1.37-7.70; p = 0.007) and tuberculoid (TT) (OR = 1.79; 95% CI 1.04-3.05; p = 0.034) leprosy both contributed to the association, which is consistent with the previous linkage to chromosome 17q11-q21 in the population from PA and supports the functional role of ErbB2 in disease pathogenesis. To attempt to replicate these findings, six SNPs (rs2517955, rs2517956, rs1810132, rs2952156, rs1801200, rs1058808) were genotyped in a population-based sample of 570 leprosy cases and 370 controls from the state of Rio Grande do Norte (RN) and the results were analysed using logistic regression analysis. However, none of the associations were replicated in the RN sample, whether analysed for leprosy per se, LL leprosy, TT leprosy, erythema nodosum leprosum or reversal reaction conditions. The role of polymorphisms at ERBB2 in controlling susceptibility to leprosy in Brazil therefore remains unclear. PMID:24676663
Antoni, G; Morange, P-E; Luo, Y; Saut, N; Burgos, G; Heath, S; Germain, M; Biron-Andreani, C; Schved, J-F; Pernod, G; Galan, P; Zelenika, D; Alessi, M-C; Drouet, L; Visvikis-Siest, S; Wells, P S; Lathrop, M; Emmerich, J; Tregouet, D-A; Gagnon, F
2010-12-01
Factor VIII (FVIII) and von Willebrand factor (VWF) are two known quantitative risk factors for venous thromboembolism (VTE). To identify new loci that could contribute to VTE susceptibility and to modulating FVIII and/or VWF levels. A pedigree linkage analysis was first performed in five extended French-Canadian families, including 253 individuals, to identify genomic regions linked to FVIII or VWF levels. Identified regions were further explored using 'in silico' genome-wide association studies (GWAS) data on VTE (419 patients and 1228 controls), and two independent case-control studies (MARTHA and FARIVE) for VTE, gathering 1166 early-onset patients and 1408 healthy individuals. Single nucleotide polymorphisms (SNPs) associated with VTE risk were further investigated in relation to plasma levels of FVIII and VWF in a cohort of 108 healthy nuclear families. Four main linkage regions were identified, among which the well-characterized ABO locus, the recently identified STAB 2 gene, and a third one, on chromosome 6q13-14, harbouring four non-redundant SNPs, associated with VTE at P < 10(-4) in the GWAS dataset. The association of one of these SNPs, rs9363864, with VTE was further replicated in the MARTHA and FARIVE studies. The rs9363864-AA genotype was associated with a lower risk for VTE (OR = 0.58 [0.42-0.80], P = 0.0005) but mainly in non-carriers of the FV Leiden mutation. This genotype was further found to be associated with the lowest levels of FVIII (P = 0.006) and VWF (P = 0.001). The BAI3 locus where the rs9363864 maps is a new candidate for VTE risk. © 2010 International Society on Thrombosis and Haemostasis.
2011-01-01
Background A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS). Results Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm. Conclusions Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection). PMID:21797998
Diaz, Aurora; Fergany, Mohamed; Formisano, Gelsomina; Ziarsolo, Peio; Blanca, José; Fei, Zhanjun; Staub, Jack E; Zalapa, Juan E; Cuevas, Hugo E; Dace, Gayle; Oliver, Marc; Boissot, Nathalie; Dogimont, Catherine; Pitrat, Michel; Hofstede, René; van Koert, Paul; Harel-Beja, Rotem; Tzuri, Galil; Portnoy, Vitaly; Cohen, Shahar; Schaffer, Arthur; Katzir, Nurit; Xu, Yong; Zhang, Haiying; Fukino, Nobuko; Matsumoto, Satoru; Garcia-Mas, Jordi; Monforte, Antonio J
2011-07-28
A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS). Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm. Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection).
Construction of an almond linkage map in an Australian population Nonpareil × Lauranne
2010-01-01
Background Despite a high genetic similarity to peach, almonds (Prunus dulcis) have a fleshless fruit and edible kernel, produced as a crop for human consumption. While the release of peach genome v1.0 provides an excellent opportunity for almond genetic and genomic studies, well-assessed segregating populations and the respective saturated genetic linkage maps lay the foundation for such studies to be completed in almond. Results Using an almond intraspecific cross between 'Nonpareil' and 'Lauranne' (N × L), we constructed a moderately saturated map with SSRs, SNPs, ISSRs and RAPDs. The N × L map covered 591.4 cM of the genome with 157 loci. The average marker distance of the map was 4.0 cM. The map displayed high synteny and colinearity with the Prunus T × E reference map in all eight linkage groups (G1-G8). The positions of 14 mapped gene-anchored SNPs corresponded approximately with the positions of homologous sequences in the peach genome v1.0. Analysis of Mendelian segregation ratios showed that 17.9% of markers had significantly skewed genotype ratios at the level of P < 0.05. Due to the large number of skewed markers in the linkage group 7, the potential existence of deleterious gene(s) was assessed in the group. Integrated maps produced by two different mapping methods using JoinMap® 3 were compared, and their high degree of similarity was evident despite the positional inconsistency of a few markers. Conclusions We presented a moderately saturated Australian almond map, which is highly syntenic and collinear with the Prunus reference map and peach genome V1.0. Therefore, the well-assessed almond population reported here can be used to investigate the traits of interest under Australian growing conditions, and provides more information on the almond genome for the international community. PMID:20932335
Chen, N B; Ma, Y; Yang, T; Lin, F; Fu, W W; Xu, Y J; Li, F; Li, J Y; Gao, S X
2015-08-01
Angiopoietin-like protein 3 (ANGPTL3) is a secreted protein that regulates lipid, glucose and energy metabolism. This study was conducted to better understand the effect of ANGPTL3 on important economic traits in cattle. First, transcript profiles for ANGPTL3 were measured in nine different Jiaxian cattle tissues. Second, polymorphisms were identified in the complete coding region and promoter region of the bovine ANGPTL3 gene in 707 cattle samples. Finally, an association study was carried out utilizing these single nucleotide polymorphisms (SNPs) to determine the effect of these SNPs on the growth and meat quality traits. Quantitative real-time PCR analysis showed that ANGPTL3 was mainly expressed in the liver. The promoter of the bovine ANGPTL3 contained several putative transcription factor binding sites (SF1, HNF-1, LXRα, NFκβ, HNF-3 and C/EBP). In total, four SNPs of the bovine ANGPTL3 gene were identified by direct sequencing. SNP1 (rs469906272: g.-38T>C) was identified in the promoter, SNP2 (rs451104723:g.104A>T) and SNP3 (rs482516226: g.509A>G) were identified in exon 1, and SNP4 (rs477165942: g.8661T>C) was identified in exon 6. Changes in predicted protein structures due to non-synonymous SNPs were analyzed. Haplotype frequencies and linkage disequilibrium were also investigated. Analysis of four SNPs in cattle from different native Chinese breeds (Nanyang (NY) and Jiaxian (JX)) and commercial breeds (Angus (AG), Hereford (HF), Limousin (LM), Luxi (LX), Simmental (ST) and Jinnan (JN)) revealed a significant association with growth traits (including: BW and hipbone width) and meat quality traits (including: Warner-Bratzler shear force and ribeye area). Therefore, implementation of these four mutations in selection indices in the beef industry may be beneficial in selecting individuals with superior growth and meat quality traits.
Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S
2009-03-01
A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct an integrated physical and genetic maps for apple.
Chromosome 17q12 variants contribute to risk of early-onset prostate cancer
Levin, Albert M.; Machiela, Mitchell J.; Zuhlke, Kimberly A.; Ray, Anna M.; Cooney, Kathleen A.; Douglas, Julie A.
2008-01-01
In a recent genome-wide association study by Gudmundsson et al. (2007), two prostate cancer susceptibility loci were identified on chromosome 17q. The first locus, at 17q12, was distinguished by two intronic single nucleotide polymorphisms (SNPs) in the TCF2 gene (rs4430796 and rs7501939). The second locus was in a gene-poor region of 17q24, where the strongest evidence of association was for SNP rs1859962. To determine if these loci were also associated with hereditary prostate cancer, we genotyped them in a family-based association sample of 403 non-Hispanic white families, including 1,015 men with and without prostate cancer. SNPs rs4430796 and rs7501939, which were in strong linkage disequilibrium (r2=0.68), showed the strongest evidence of prostate cancer association. Using a family-based association test, the “A” allele of SNP rs4430796 was over-transmitted to affected men (p=0.006), with an odds ratio of 1.40 (95%CI=1.09–1.81) under an additive genetic model. Notably, rs4430796 was significantly associated with prostate cancer among men diagnosed at an early (<50 years) but not later age (p=0.006 versus p=0.118). Our results confirm the prostate cancer association with SNPs on chromosome 17q12 initially reported by Gudmundsson et al. In addition, our results suggest that the increased risk associated with these SNPs is approximately doubled in individuals predisposed to develop early onset disease. Importantly, these SNPs do not account for a significant portion of our prior prostate cancer linkage evidence on chromosome 17. Thus, there likely exist one or more additional independent prostate cancer susceptibility loci in this region. PMID:18701471
LPA and PLG sequence variation and kringle IV-2 copy number in two populations.
Crawford, Dana C; Peng, Ze; Cheng, Jan-Fang; Boffelli, Dario; Ahearn, Magdalena; Nguyen, Dan; Shaffer, Tristan; Yi, Qian; Livingston, Robert J; Rieder, Mark J; Nickerson, Deborah A
2008-01-01
Lp(a) levels have long been recognized as a potential risk factor for coronary heart disease that is almost completely under genetic control. Much of the genetics impacting Lp(a) levels has been attributed to the highly polymorphic LPA kringle IV-2 copy number variant, and most of the variance in Lp(a) levels in populations of European-descent is inversely correlated with kringle IV copy number. However, less of the variance is explained in African-descent populations for the same structural variation. African-descent populations have, on average, higher levels of Lp(a), suggesting other genetic factors contribute to Lp(a) level variability across populations. To identify potential cis-acting factors, we re-sequenced the gene LPA for single nucleotide polymorphism (SNP) discovery in 23 European-Americans and 24 African-Americans. We also re- sequenced the neighboring gene plasminogen (PLG) and genotyped the kringle IV copy number variant in the same reference samples. These data are the most comprehensive description of sequence variation in LPA and its relationship with the kringle IV copy number variant. With these data, we demonstrate that only a fraction of LPA sequence diversity has been previously documented. Also, we identify several high frequency SNPs present in the African-American sample but absent in the European-American sample. Finally, we show that SNPs within PLG are not in linkage disequilibrium with SNPs in LPA, and we show that kringle IV copy number variation is not in linkage disequilibrium with either LPA or PLG SNPs. Together, these data suggest that LPA SNPs could independently contribute to Lp(a) levels in the general population. Copyright (c) 2008 S. Karger AG, Basel.
Achenbach, Ute; Paulo, Joao; Ilarionova, Evgenyia; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Gebhardt, Christiane
2009-02-01
The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that has been introgressed from the wild potato species Solanum vernei into the Solanum tuberosum tetraploid breeding pool. The major quantitative trait locus (QTL) controlling this nematode resistance maps on potato chromosome V in a hot spot for resistance to various pathogens including nematodes and the oomycete Phytophthora infestans. An unstructured sample of 79 tetraploid, highly heterozygous varieties and breeding clones was selected based on presence (41 genotypes) or absence (38 genotypes) of the HC marker. Testing the clones for resistance to G. pallida confirmed the diagnostic power of the HC marker. The 79 individuals were genotyped for 100 single nucleotide polymorphisms (SNPs) at 10 loci distributed over 38 cM on chromosome V. Forty-five SNPs at six loci spanning 2 cM in the interval between markers GP21-GP179 were associated with resistance to G. pallida. Based on linkage disequilibrium (LD) between SNP markers, six LD groups comprising between 2 and 18 SNPs were identified. The LD groups indicated the existence of multiple alleles at a single resistance locus or at several, physically linked resistance loci. LD group C comprising 18 SNPs corresponded to the 'HC' marker. LD group E included 16 SNPs and showed an association peak, which positioned one nematode resistance locus physically close to the R1 gene family.
Yang, Xueli; Gu, Dongfeng; He, Jiang; Hixson, James E.; Rao, Dabeeru C.; Lu, Fanghong; Mu, Jianjun; Jaquish, Cashell E.; Chen, Jing; Huang, Jianfeng; Shimmin, Lawrence C.; Rice, Treva K.; Chen, Jichun; Wu, Xigui; Liu, Depei; Kelly, Tanika N.
2014-01-01
Background Blood pressure (BP) response to cold pressor test (CPT) is associated with increased risk of cardiovascular disease. We performed a genome-wide linkage scan and regional association analysis to identify genetic determinants of BP response to CPT. Methods and Results A total of 1,961 Chinese participants completed the CPT. Multipoint quantitative trait linkage analysis was performed, followed by single-marker and gene-based analyses of variants in promising linkage regions (logarithm of odds, LOD ≥ 2). A suggestive linkage signal was identified for systolic BP (SBP) response to CPT at 20p13-20p12.3, with a maximum multipoint LOD score of 2.37. Based on regional association analysis with 1,351 SNPs in the linkage region, we found that marker rs2326373 at 20p13 was significantly associated with mean arterial pressure (MAP) responses to CPT (P = 8.8×10−6) after FDR adjustment for multiple comparisons. A similar trend was also observed for SBP response (P = 0.03) and DBP response (P = 4.6×10−5). Results of gene-based analyses showed that variants in genes MCM8 and SLC23A2 were associated with SBP response to CPT (P = 4.0×10−5 and 2.7×10−4, respectively), and variants in genes MCM8 and STK35 were associated with MAP response to CPT (P = 1.5×10−5 and 5.0×10−5, respectively). Conclusions Within a suggestive linkage region on chromosome 20, we identified a novel variant associated with BP responses to CPT. We also found gene-based associations of MCM8, SLC23A2 and STK35 in this region. Further work is warranted to confirm these findings. Clinical Trial Registration URL: http://www.clinicaltrials.gov; Unique identifier: NCT00721721. PMID:25028485
Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S
2018-01-01
Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.), is a major disease in wheat ( Triticum aestivium L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied genome-wide association study (GWAS) on a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of Sr6 gene which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r 2 ) was found between the significant SNPs and the specific SSR marker for the Sr6 gene ( Xcfd43 ). This suggests the significant SNP markers are tagging Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) marker for the Sr6 gene. Novel SNPs for Sr6 gene, an important stem rust resistant gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple.
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C A M; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric
2017-01-01
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs.
Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The genome-wide discovery and high-throughput genotyping of SNPs in chickpea natural germplasm lines is indispensable to extrapolate their natural allelic diversity, domestication, and linkage disequilibrium (LD) patterns leading to the genetic enhancement of this vital legume crop. We discovered 44,844 high-quality SNPs by sequencing of 93 diverse cultivated desi, kabuli, and wild chickpea accessions using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays that were physically mapped across eight chromosomes of desi and kabuli. Of these, 22,542 SNPs were structurally annotated in different coding and non-coding sequence components of genes. Genes with 3296 non-synonymous and 269 regulatory SNPs could functionally differentiate accessions based on their contrasting agronomic traits. A high experimental validation success rate (92%) and reproducibility (100%) along with strong sensitivity (93–96%) and specificity (99%) of GBS-based SNPs was observed. This infers the robustness of GBS as a high-throughput assay for rapid large-scale mining and genotyping of genome-wide SNPs in chickpea with sub-optimal use of resources. With 23,798 genome-wide SNPs, a relatively high intra-specific polymorphic potential (49.5%) and broader molecular diversity (13–89%)/functional allelic diversity (18–77%) was apparent among 93 chickpea accessions, suggesting their tremendous applicability in rapid selection of desirable diverse accessions/inter-specific hybrids in chickpea crossbred varietal improvement program. The genome-wide SNPs revealed complex admixed domestication pattern, extensive LD estimates (0.54–0.68) and extended LD decay (400–500 kb) in a structured population inclusive of 93 accessions. These findings reflect the utility of our identified SNPs for subsequent genome-wide association study (GWAS) and selective sweep-based domestication trait dissection analysis to identify potential genomic loci (gene-associated targets) specifically regulating important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have got multidimensional applicability with respect to chickpea genomics-assisted breeding. PMID:25873920
regSNPs: a strategy for prioritizing regulatory single nucleotide substitutions
Teng, Mingxiang; Ichikawa, Shoji; Padgett, Leah R.; Wang, Yadong; Mort, Matthew; Cooper, David N.; Koller, Daniel L.; Foroud, Tatiana; Edenberg, Howard J.; Econs, Michael J.; Liu, Yunlong
2012-01-01
Motivation: One of the fundamental questions in genetics study is to identify functional DNA variants that are responsible to a disease or phenotype of interest. Results from large-scale genetics studies, such as genome-wide association studies (GWAS), and the availability of high-throughput sequencing technologies provide opportunities in identifying causal variants. Despite the technical advances, informatics methodologies need to be developed to prioritize thousands of variants for potential causative effects. Results: We present regSNPs, an informatics strategy that integrates several established bioinformatics tools, for prioritizing regulatory SNPs, i.e. the SNPs in the promoter regions that potentially affect phenotype through changing transcription of downstream genes. Comparing to existing tools, regSNPs has two distinct features. It considers degenerative features of binding motifs by calculating the differences on the binding affinity caused by the candidate variants and integrates potential phenotypic effects of various transcription factors. When tested by using the disease-causing variants documented in the Human Gene Mutation Database, regSNPs showed mixed performance on various diseases. regSNPs predicted three SNPs that can potentially affect bone density in a region detected in an earlier linkage study. Potential effects of one of the variants were validated using luciferase reporter assay. Contact: yunliu@iupui.edu Supplementary information: Supplementary data are available at Bioinformatics online PMID:22611130
A meta-analysis of genome-wide association studies of asthma in Puerto Ricans.
Yan, Qi; Brehm, John; Pino-Yanes, Maria; Forno, Erick; Lin, Jerome; Oh, Sam S; Acosta-Perez, Edna; Laurie, Cathy C; Cloutier, Michelle M; Raby, Benjamin A; Stilp, Adrienne M; Sofer, Tamar; Hu, Donglei; Huntsman, Scott; Eng, Celeste S; Conomos, Matthew P; Rastogi, Deepa; Rice, Kenneth; Canino, Glorisa; Chen, Wei; Barr, R Graham; Burchard, Esteban G; Celedón, Juan C
2017-05-01
Puerto Ricans are disproportionately affected with asthma in the USA. In this study, we aim to identify genetic variants that confer susceptibility to asthma in Puerto Ricans.We conducted a meta-analysis of genome-wide association studies (GWAS) of asthma in Puerto Ricans, including participants from: the Genetics of Asthma in Latino Americans (GALA) I-II, the Hartford-Puerto Rico Study and the Hispanic Community Health Study. Moreover, we examined whether susceptibility loci identified in previous meta-analyses of GWAS are associated with asthma in Puerto Ricans.The only locus to achieve genome-wide significance was chromosome 17q21, as evidenced by our top single nucleotide polymorphism (SNP), rs907092 (OR 0.71, p=1.2×10 -12 ) at IKZF3 Similar to results in non-Puerto Ricans, SNPs in genes in the same linkage disequilibrium block as IKZF3 ( e.g. ZPBP2 , ORMDL3 and GSDMB ) were significantly associated with asthma in Puerto Ricans. With regard to results from a meta-analysis in Europeans, we replicated findings for rs2305480 at GSDMB , but not for SNPs in any other genes. On the other hand, we replicated results from a meta-analysis of North American populations for SNPs at IL1RL1 , TSLP and GSDMB but not for IL33 Our findings suggest that common variants on chromosome 17q21 have the greatest effects on asthma in Puerto Ricans. Copyright ©ERS 2017.
Levinson, Douglas F; Evgrafov, Oleg V; Knowles, James A; Potash, James B; Weissman, Myrna M; Scheftner, William A; Depaulo, J Raymond; Crowe, Raymond R; Murphy-Eberenz, Kathleen; Marta, Diana H; McInnis, Melvin G; Adams, Philip; Gladis, Madeline; Miller, Erin B; Thomas, Jo; Holmans, Peter
2007-02-01
The authors studied a dense map of single nucleotide polymorphism (SNP) DNA markers on chromosome 15q25-q26 to maximize the informativeness of genetic linkage analyses in a region where they previously reported suggestive evidence for linkage of recurrent early-onset major depressive disorder. In 631 European-ancestry families with multiple cases of recurrent early-onset major depressive disorder, 88 SNPs were genotyped, and multipoint allele-sharing linkage analyses were carried out. Marker-marker linkage disequilibrium was minimized, and a simulation study with founder haplotypes from these families suggested that linkage scores were not inflated by linkage disequilibrium. The dense SNP map increased the information content of the analysis from around 0.7 to over 0.9. The maximum evidence for linkage was the Z likelihood ratio score statistic of Kong and Cox (Z(LR))=4.69 at 109.8 cM. The exact p value was below the genomewide significance threshold. By contrast, in the genome scan with microsatellite markers at 9 cM spacing, the maximum Z(LR) for European-ancestry families was 3.43 (106.53 cM). It was estimated that the linked locus or loci in this region might account for a 20% or less populationwide increase in risk to siblings of cases. This region has produced modestly positive evidence for linkage to depression and related traits in other studies. These results suggest that DNA sequence variations in one or more genes in the 15q25-q26 region can increase susceptibility to major depression and that efforts are warranted to identify these genes.
Leal-Gutiérrez, Joel D.; Elzo, Mauricio A.; Johnson, Dwain D.; Scheffler, Tracy L.; Scheffler, Jason M.; Mateescu, Raluca G.
2018-01-01
Autogenous proteolytic enzymes of the calpain family are implicated in myofibrillar protein degradation. As a result, the μ-calpain gene and its specific inhibitor, calpastatin, have been repeatedly investigated for their association with meat quality traits in cattle; however, no functional mutation has been identified for these two genes. The objectives of this study were: (1) to assess breed composition effect on tenderness; (2) to perform a linkage disequilibrium (LD) analysis in μ-calpain and calpastatin genes as well as an association analyses with tenderness; and (3) to analyze putative functional SNPs inside the significant LD block for an effect on tenderness. Tenderness measurements and genotypes for 16 SNPs in μ-calpain gene and 28 SNPs in calpastatin gene from 673 steers were analyzed. A bioinformatic analysis identified “putative functional SNPs” inside the associated LD block – polymorphisms able to produce a physical and/or chemical change in the DNA, mRNA, or translated protein in silico. Breed composition had a significant (P < 0.0001) effect on tenderness where animals with more than 80% Angus composition had the most tender meat. One 11-kb LD-block and three LD-blocks of 37, 17, and 14 kb in length were identified in the μ-calpain and calpastatin genes, respectively. Out of these, the LD-block 3 in calpastatin, tagged by SNPs located at 7-98566391 and 7-98581038, had a significant effect on tenderness with the TG-CG diplotype being approximately 1 kg more tender than the toughest diplotype, TG-CG. A total of 768 SNPs in the LD-block 3 of calpastatin were included in the bioinformatic analysis, and 28 markers were selected as putative functional SNPs inside the LD-block 3 of calpastatin; however, none of them were polymorphic in this population. Out of 15 initial polymorphisms segregating inside the LD-block 3 of calpastatin in this population, markers ARSUSMARC116, Cast5, rs730723459, and rs210861835 were found to be significantly associated with tenderness. PMID:29520298
Chatsuriyawong, Siriporn; Gozal, David; Kheirandish-Gozal, Leila; Bhattacharjee, Rakesh; Khalyfa, Ahamed A; Wang, Yang; Sukhumsirichart, Wasana; Khalyfa, Abdelnaby
2013-09-06
Obstructive sleep apnea (OSA) is associated with adverse and interdependent cognitive and cardiovascular consequences. Increasing evidence suggests that nitric oxide synthase (NOS) and endothelin family (EDN) genes underlie mechanistic aspects of OSA-associated morbidities. We aimed to identify single nucleotide polymorphisms (SNPs) in the NOS family (3 isoforms), and EDN family (3 isoforms) to identify potential associations of these SNPs in children with OSA. A pediatric community cohort (ages 5-10 years) enriched for snoring underwent overnight polysomnographic (NPSG) and a fasting morning blood draw. The diagnostic criteria for OSA were an obstructive apnea-hypopnea Index (AHI) >2/h total sleep time (TST), snoring during the night, and a nadir oxyhemoglobin saturation <92%. Control children were defined as non-snoring children with AHI <2/h TST (NOSA). Endothelial function was assessed using a modified post-occlusive hyperemic test. The time to peak reperfusion (Tmax) was considered as the indicator for normal endothelial function (NEF; Tmax<45 sec), or ED (Tmax ≥ 45 sec). Genomic DNA from peripheral blood was extracted and allelic frequencies were assessed for, NOS1 (209 SNPs), NOS2 (122 SNPs), NOS3 (50 SNPs), EDN1 (43 SNPs), EDN2 (48 SNPs), EDN3 (14 SNPs), endothelin receptor A, EDNRA, (27 SNPs), and endothelin receptor B, EDNRB (23 SNPs) using a custom SNPs array. The relative frequencies of NOS-1,-2, and -3, and EDN-1,-2,-3,-EDNRA, and-EDNRB genotypes were evaluated in 608 subjects [128 with OSA, and 480 without OSA (NOSA)]. Furthermore, subjects with OSA were divided into 2 subgroups: OSA with normal endothelial function (OSA-NEF), and OSA with endothelial dysfunction (OSA-ED). Linkage disequilibrium was analyzed using Haploview version 4.2 software. For NOSA vs. OSA groups, 15 differentially distributed SNPs for NOS1 gene, and 1 SNP for NOS3 emerged, while 4 SNPs for EDN1 and 1 SNP for both EDN2 and EDN3 were identified. However, in the smaller sub-group for whom endothelial function was available, none of the significant SNPs was retained due to lack of statistical power. Differences in the distribution of polymorphisms among NOS and EDN gene families suggest that these SNPs could play a contributory role in the pathophysiology and risk of OSA-induced cardiovascular morbidity. Thus, analysis of genotype-phenotype interactions in children with OSA may assist in the formulation of categorical risk estimates.
Sánchez, Cecilia Castaño; Smith, Timothy P L; Wiedmann, Ralph T; Vallejo, Roger L; Salem, Mohamed; Yao, Jianbo; Rexroad, Caird E
2009-11-25
To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionary recent genome duplication would be desirable.
Testing of diabetes-associated WFS1 polymorphisms in the Diabetes Prevention Program
Florez, J. C.; Jablonski, K. A.; McAteer, J.; Sandhu, M. S.; Wareham, N. J.; Barroso, I.; Franks, P. W.; Altshuler, D.; Knowler, W. C.
2008-01-01
Aims/hypothesis Wolfram syndrome (diabetes insipidus, diabetes mellitus, optic atrophy and deafness) is caused by mutations in the WFS1 gene. Recently, single nucleotide polymorphisms (SNPs) in WFS1 have been reproducibly associated with type 2 diabetes. We therefore examined the effects of these variants on diabetes incidence and response to interventions in the Diabetes Prevention Program (DPP), in which a lifestyle intervention or metformin treatment was compared with placebo. Methods We genotyped the WFS1 SNPs rs10010131, rs752 854 and rs734312 (H611R) in 3,548 DPP participants and performed Cox regression analysis using genotype, intervention and their interactions as predictors of diabetes incidence. We also evaluated the effect of these SNPs on insulin resistance and beta cell function at 1 year. Results Although none of the three SNPs was associated with diabetes incidence in the overall cohort, white homozygotes for the previously reported protective alleles appeared less likely to develop diabetes in the lifestyle arm. Examination of the publicly available Diabetes Genetics Initiative genome-wide association dataset revealed that rs10012946, which is in strong linkage disequilibrium with the three WFS1 SNPs (r2=0.88–1.0), was associated with type 2 diabetes (allelic odds ratio 0.85, 95% CI 0.75–0.97, p=0.026). In the DPP, we noted a trend towards increased insulin secretion in carriers of the protective variants, although for most SNPs this was seen as compensatory for the diminished insulin sensitivity. Conclusions/interpretation The previously reported protective effect of select WFS1 alleles may be magnified by a lifestyle intervention. These variants appear to confer an improvement in beta cell function. PMID:18060660
Waide, Emily H; Tuggle, Christopher K; Serão, Nick V L; Schroyen, Martine; Hess, Andrew; Rowland, Raymond R R; Lunney, Joan K; Plastow, Graham; Dekkers, Jack C M
2018-02-01
Genomic prediction of the pig's response to the porcine reproductive and respiratory syndrome (PRRS) virus (PRRSV) would be a useful tool in the swine industry. This study investigated the accuracy of genomic prediction based on porcine SNP60 Beadchip data using training and validation datasets from populations with different genetic backgrounds that were challenged with different PRRSV isolates. Genomic prediction accuracy averaged 0.34 for viral load (VL) and 0.23 for weight gain (WG) following experimental PRRSV challenge, which demonstrates that genomic selection could be used to improve response to PRRSV infection. Training on WG data during infection with a less virulent PRRSV, KS06, resulted in poor accuracy of prediction for WG during infection with a more virulent PRRSV, NVSL. Inclusion of single nucleotide polymorphisms (SNPs) that are in linkage disequilibrium with a major quantitative trait locus (QTL) on chromosome 4 was vital for accurate prediction of VL. Overall, SNPs that were significantly associated with either trait in single SNP genome-wide association analysis were unable to predict the phenotypes with an accuracy as high as that obtained by using all genotyped SNPs across the genome. Inclusion of data from close relatives into the training population increased whole genome prediction accuracy by 33% for VL and by 37% for WG but did not affect the accuracy of prediction when using only SNPs in the major QTL region. Results show that genomic prediction of response to PRRSV infection is moderately accurate and, when using all SNPs on the porcine SNP60 Beadchip, is not very sensitive to differences in virulence of the PRRSV in training and validation populations. Including close relatives in the training population increased prediction accuracy when using the whole genome or SNPs other than those near a major QTL.
Hack, Laura M.; Kalsi, Gursharan; Aliev, Fazil; Kuo, Po-Hsiu; Prescott, Carol A.; Patterson, Diana G.; Walsh, Dermot; Dick, Danielle M.; Riley, Brien P.; Kendler, Kenneth S.
2012-01-01
Background Over 50 years of evidence from research has established that the central dopaminergic reward pathway is likely involved in alcohol dependence (AD). Additional evidence supports a role for dopamine (DA) in other disinhibitory psychopathology, which is often comorbid with AD. Family and twin studies demonstrate that a common genetic component accounts for most of the genetic variance in these traits. Thus, DA-related genes represent putative candidates for the genetic risk that underlies not only AD but also behavioral disinhibition. Many linkage and association studies have examined these relationships with inconsistent results, possibly because of low power, poor marker coverage, and/or an inappropriate correction for multiple testing. Methods We conducted an association study on the products encoded by 10 DA-related genes (DRD1-D5, SLC18A2, SLC6A3, DDC, TH, COMT) using a large, ethnically homogeneous sample with severe AD (n = 545) and screened controls (n = 509). We collected genotypes from linkage disequilibrium (LD)-tagging single nucleotide polymorphisms (SNPs) and employed a gene-based method of correction. We tested for association with AD diagnosis in cases and controls and with a variety of alcohol-related traits (including age-at-onset, initial sensitivity, tolerance, maximum daily drinks, and a withdrawal factor score), disinhibitory symptoms, and a disinhibitory factor score in cases only. A total of 135 SNPs were genotyped using the Illumina GoldenGate and Taqman Assays-on-Demand protocols. Results Of the 101 SNPs entered into standard analysis, 6 independent SNPs from 5 DA genes were associated with AD or a quantitative alcohol-related trait. Two SNPs across 2 genes were associated with a disinhibitory symptom count, while 1 SNP in DRD5 was positive for association with the general disinhibitory factor score. Conclusions Our study provides evidence of modest associations between a small number of DA-related genes and AD as well as a range of alcohol-related traits and measures of behavioral disinhibition. While we did conduct gene-based correction for multiple testing, we did not correct for multiple traits because the traits are correlated. However, false-positive findings remain possible, so our results must be interpreted with caution. PMID:21083670
Wu, X; Offenbacher, S; Lόpez, N J; Chen, D; Wang, H-Y; Rogus, J; Zhou, J; Beck, J; Jiang, S; Bao, X; Wilkins, L; Doucette-Stamm, L; Kornman, K
2015-01-01
Background and Objective Genetic markers associated with disease are often non-functional and generally tag one or more functional “causative” variants in linkage disequilibrium. Markers may not show tight linkage to the causative variants across multiple ethnicities due to evolutionary divergence, and therefore may not be informative across different population groups. Validated markers of disease suggest causative variants exist in the gene and, if the causative variants can be identified, it is reasonable to hypothesize that such variants will be informative across diverse populations. The aim of this study was to test that hypothesis using functional Interleukin-1 (IL-1) gene variations across multiple ethnic populations to replace the non-functional markers originally associated with chronic adult periodontitis in Caucasians. Material and Methods Adult chronic periodontitis cases and controls from four ethnic groups (Caucasians, African Americans, Hispanics and Asians) were recruited in the USA, Chile and China. Genotypes of IL1B gene single nucleotide polymorphisms (SNPs), including three functional SNPs (rs16944, rs1143623, rs4848306) in the promoter and one intronic SNP (rs1143633), were determined using a single base extension method or TaqMan 5′ nuclease assay. Logistic regression and other statistical analyses were used to examine the association between moderate to severe periodontitis and IL1B gene variations, including SNPs, haplotypes and composite genotypes. Genotype patterns associated with disease in the discovery study were then evaluated in independent validation studies. Results Significant associations were identified in the discovery study, consisting of Caucasians and African Americans, between moderate to severe adult chronic periodontitis and functional variations in the IL1B gene, including a pattern of four IL1B SNPs (OR = 1.87, p < 0.0001). The association between the disease and this IL1B composite genotype pattern was validated in two additional studies consisting of Hispanics (OR = 1.95, p = 0.04) or Asians (OR = 3.27, p = 0.01). A meta-analysis of the three populations supported the association between the IL-1 genotype pattern and moderate to severe periodontitis (OR 1.95; p < 0.001). Our analysis also demonstrated that IL1B gene variations had added value to conventional risk factors in predicting chronic periodontitis. Conclusion This study validated the influence of IL-1 genetic factors on the severity of chronic periodontitis in four different ethnicities. PMID:24690098
Li, Yaoguo; He, Maoxian
2014-01-01
The pearl oyster, Pinctada fucata (P. fucata), is one of the marine bivalves that is predominantly cultured for pearl production. To obtain more genetic information for breeding purposes, we constructed a high-density linkage map of P. fucata and identified quantitative trait loci (QTL) for growth-related traits. One F1 family, which included the two parents, 48 largest progeny and 50 smallest progeny, was sampled to construct a linkage map using restriction site-associated DNA sequencing (RAD-Seq). With low coverage data, 1956.53 million clean reads and 86,342 candidate RAD loci were generated. A total of 1373 segregating SNPs were used to construct a sex-average linkage map. This spanned 1091.81 centimorgans (cM), with 14 linkage groups and an average marker interval of 1.41 cM. The genetic linkage map coverage, Coa, was 97.24%. Thirty-nine QTL-peak loci, for seven growth-related traits, were identified using the single-marker analysis, nonparametric mapping Kruskal-Wallis (KW) test. Parameters included three for shell height, six for shell length, five for shell width, four for hinge length, 11 for total weight, eight for soft tissue weight and two for shell weight. The QTL peak loci for shell height, shell length and shell weight were all located in linkage group 6. The genotype frequencies of most QTL peak loci showed significant differences between the large subpopulation and the small subpopulation (P<0.05). These results highlight the effectiveness of RAD-Seq as a tool for generation of QTL-targeted and genome-wide marker data in the non-model animal, P. fucata, and its possible utility in marker-assisted selection (MAS). PMID:25369421
Ban, Yoshiyuki; Tozaki, Teruaki; Taniyama, Matsuo; Skrabanek, Luce; Nakano, Yasuko; Ban, Yoshio; Hirano, Tsutomu
2012-01-01
Background The etiology of the autoimmune thyroid diseases (AITDs), Graves' disease (GD) and Hashimoto's thyroiditis (HT), is largely unknown. However, genetic susceptibility is believed to play a major role. Two whole genome scans from Japan and from the US identified a locus on chromosome 8q24 that showed evidence for linkage with AITD and HT. Recent studies have demonstrated an association between thyroglobulin (Tg) polymorphisms and AITD in Caucasians, suggesting that Tg is a susceptibility gene on 8q24. Objectives The objective of the study was to refine Tg association with AITD, by analyzing a panel of 25 SNPs across an extended 260 kb region of the Tg. Methods We studied 458 Japanese AITD patients (287 GD and 171 HT patients) and 221 matched Japanese control subjects in association studies. Case-control association studies were performed using 25 Tg single nucleotide polymorphisms (SNPs) chosen from a database of the Single Nucleotide Polymorphism Database (dbSNP). Haplotype analysis was undertaken using the computer program SNPAlyze version 7.0. Principal Findings and Conclusions In total, 5 SNPs revealed association with GD (P<0.05), with the strongest SNP associations at rs2256366 (P = 0.002) and rs2687836 (P = 0.0077), both located in intron 41 of the Tg gene. Because of the strong LD between these two strongest associated variants, we performed the haplotype analysis, and identified a major protective haplotype for GD (P = 0.001).These results suggested that the Tg gene is involved in susceptibility for GD and AITD in the Japanese. PMID:22662162
Sun, Jiajie; Gao, Yuan; Liu, Dong; Ma, Wei; Xue, Jing; Zhang, Chunlei; Lan, Xianyong; Lei, Chuzhao; Chen, Hong
2012-06-01
The insulin-induced gene 1 (INSIG1) gene encodes a protein that blocks proteolytic activation of sterol regulatory element binding proteins, which are transcription factors that activate genes that regulate cholesterol, fatty acid, and glucose metabolism. However, similar research for the bovine INSIG1 gene is lacking. Therefore, in this study, polymorphisms of the bovine INSIG1 gene were detected in 643 individuals from four cattle breeds by DNA pooling, forced PCR-RFLP, PCR-SSCP, and DNA sequencing methods. Only 10 novel SNPs were identified, which included four mutations in the coding region and the others in the introns. In Nanyang individuals, seven common haplotypes were identified based on four coding region SNPs. The haplotype GACT, with a frequency of 75.4%, was the most prevalent haplotypes and SNPs formed two linkage disequilibrium blocks with strong multi-allelic D' (D' = 1). Additionally, association analysis between mutations of the bovine INSIG1 gene and growth traits in Nanyang cattle at 6, 12, 18, and 24 months old was performed, and the results indicated that the polymorphisms were not significantly associated with body mass.
Identification of genomic regions contributing to etoposide-induced cytotoxicity
Bleibel, Wasim K.; Duan, Shiwei; Huang, R. Stephanie; Kistner, Emily O.; Shukla, Sunita J.; Wu, Xiaolin; Badner, Judith A.
2009-01-01
Etoposide is routinely used in combination based chemotherapy for testicular cancer and small-cell lung cancer; however, myelosuppression, therapy-related leukemia and neurotoxicity limit its utility. To determine the genetic contribution to cellular sensitivity to etoposide, we evaluated cell growth inhibition in Centre d’ Etude du Polymorphisme Humain lymphoblastoid cell lines from 24 multi-generational pedigrees (321 samples) following treatment with 0.02–2.5 µM etoposide for 72 h. Heritability analysis showed that genetic variation contributes significantly to the cytotoxic phenotypes (h2 = 0.17–0.25, P = 4.9 × 10−5−7.3 × 10−3). Whole genome linkage scans uncovered 8 regions with peak LOD scores ranging from 1.57 to 2.55, with the most significant signals being found on chromosome 5 (LOD = 2.55) and chromosome 6 (LOD = 2.52). Linkage-directed association was performed on a subset of HapMap samples within the pedigrees to find 22 SNPs significantly associated with etoposide cytotoxicity at one or more treatment concentrations. UVRAG, a DNA repair gene, SEMA5A, SLC7A6 and PRMT7 are implicated from these unbiased studies. Our findings suggest that susceptibility to etoposide-induced cytotoxicity is heritable and using an integrated genomics approach we identified both genomic regions and SNPs associated with the cytotoxic phenotypes. PMID:19089452
Reddy, Umesh K.; Nimmakayala, Padma; Abburi, Venkata Lakshmi; Reddy, C. V. C. M.; Saminathan, Thangasamy; Percy, Richard G.; Yu, John Z.; Frelichowski, James; Udall, Joshua A.; Page, Justin T.; Zhang, Dong; Shehzad, Tariq; Paterson, Andrew H.
2017-01-01
Use of 10,129 singleton SNPs of known genomic location in tetraploid cotton provided unique opportunities to characterize genome-wide diversity among 440 Gossypium hirsutum and 219 G. barbadense cultivars and landrace accessions of widespread origin. Using the SNPs distributed genome-wide, we examined genetic diversity, haplotype distribution and linkage disequilibrium patterns in the G. hirsutum and G. barbadense genomes to clarify population demographic history. Diversity and identity-by-state analyses have revealed little sharing of alleles between the two cultivated allotetraploid genomes, with a few exceptions that indicated sporadic gene flow. We found a high number of new alleles, representing increased nucleotide diversity, on chromosomes 1 and 2 in cultivated G. hirsutum as compared with low nucleotide diversity on these chromosomes in landrace G. hirsutum. In contrast, G. barbadense chromosomes showed negative Tajima’s D on several chromosomes for both cultivated and landrace types, which indicate that speciation of G. barbadense itself, might have occurred with relatively narrow genetic diversity. The presence of conserved linkage disequilibrium (LD) blocks and haplotypes between G. hirsutum and G. barbadense provides strong evidence for comparable patterns of evolution in their domestication processes. Our study illustrates the potential use of population genetic techniques to identify genomic regions for domestication. PMID:28128280
Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M
2007-01-01
Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
Metzger, Julia; Ohnesorge, Bernhard; Distl, Ottmar
2012-01-01
Equine guttural pouch tympany (GPT) is a hereditary condition affecting foals in their first months of life. Complex segregation analyses in Arabian and German warmblood horses showed the involvement of a major gene as very likely. Genome-wide linkage and association analyses including a high density marker set of single nucleotide polymorphisms (SNPs) were performed to map the genomic region harbouring the potential major gene for GPT. A total of 85 Arabian and 373 German warmblood horses were genotyped on the Illumina equine SNP50 beadchip. Non-parametric multipoint linkage analyses showed genome-wide significance on horse chromosomes (ECA) 3 for German warmblood at 16–26 Mb and 34–55 Mb and for Arabian on ECA15 at 64–65 Mb. Genome-wide association analyses confirmed the linked regions for both breeds. In Arabian, genome-wide association was detected at 64 Mb within the region with the highest linkage peak on ECA15. For German warmblood, signals for genome-wide association were close to the peak region of linkage at 52 Mb on ECA3. The odds ratio for the SNP with the highest genome-wide association was 0.12 for the Arabian. In conclusion, the refinement of the regions with the Illumina equine SNP50 beadchip is an important step to unravel the responsible mutations for GPT. PMID:22848553
Fesinmeyer, Megan D; Meigs, James B; North, Kari E; Schumacher, Fredrick R; Bůžková, Petra; Franceschini, Nora; Haessler, Jeffrey; Goodloe, Robert; Spencer, Kylee L; Voruganti, Venkata Saroja; Howard, Barbara V; Jackson, Rebecca; Kolonel, Laurence N; Liu, Simin; Manson, JoAnn E; Monroe, Kristine R; Mukamal, Kenneth; Dilks, Holli H; Pendergrass, Sarah A; Nato, Andrew; Wan, Peggy; Wilkens, Lynne R; Le Marchand, Loic; Ambite, José Luis; Buyske, Steven; Florez, Jose C; Crawford, Dana C; Hindorff, Lucia A; Haiman, Christopher A; Peters, Ulrike; Pankow, James S
2013-09-25
Multiple genome-wide association studies (GWAS) within European populations have implicated common genetic variants associated with insulin and glucose concentrations. In contrast, few studies have been conducted within minority groups, which carry the highest burden of impaired glucose homeostasis and type 2 diabetes in the U.S. As part of the 'Population Architecture using Genomics and Epidemiology (PAGE) Consortium, we investigated the association of up to 10 GWAS-identified single nucleotide polymorphisms (SNPs) in 8 genetic regions with glucose or insulin concentrations in up to 36,579 non-diabetic subjects including 23,323 European Americans (EA) and 7,526 African Americans (AA), 3,140 Hispanics, 1,779 American Indians (AI), and 811 Asians. We estimated the association between each SNP and fasting glucose or log-transformed fasting insulin, followed by meta-analysis to combine results across PAGE sites. Overall, our results show that 9/9 GWAS SNPs are associated with glucose in EA (p = 0.04 to 9 × 10-15), versus 3/9 in AA (p= 0.03 to 6 × 10-5), 3/4 SNPs in Hispanics, 2/4 SNPs in AI, and 1/2 SNPs in Asians. For insulin we observed a significant association with rs780094/GCKR in EA, Hispanics and AI only. Generalization of results across multiple racial/ethnic groups helps confirm the relevance of some of these loci for glucose and insulin metabolism. Lack of association in non-EA groups may be due to insufficient power, or to unique patterns of linkage disequilibrium.
Ito, Yoshihito; Yamada, Shinnosuke; Takahashi, Nagahide; Saito, Shinichi; Yoshimi, Akira; Inada, Toshiya; Noda, Yukihiro; Ozaki, Norio
2008-10-05
NRG1-ERBB signaling influences the risk for schizophrenia pathology. A recent study has reported that MAGI1, MAGI2, and protein tyrosine phosphatase, receptor-type, Z polypeptide 1 (PTPRZ1; located on 7q31.3) gene products regulate the NRG1-ERBB4 signaling pathway, and PTPRZ1 is associated with schizophrenia in a Caucasian population. By applying a gene-based association concept, we analyzed any association between PTPRZ1 tagging SNPs and schizophrenia in the Japanese population (576 schizophrenics and 768 controls). After linkage disequilibrium analysis, 29 single nucleotide polymorphisms (SNPs) were genotyped using a 5'-exonuclease allelic discrimination assay. We found a significant association of one tagging SNP in a genotype-wise analysis (P = 0.007); however, this might be resulted from type I error due to multiple testing (P = 0.17 after SNPSpD correction). No association was observed between schizophrenic patients and controls in either allelic, genotypic, or haplotypic analyses. Our results therefore suggest that PTPRZ1 is unlikely to be related to the development of schizophrenia in the Japanese population.
Genetic analysis of prolactin gene in Pakistani cattle.
Uddin, Raza Mohy; Babar, Masroor Ellahi; Nadeem, Asif; Hussain, Tanveer; Ahmad, Shakil; Munir, Sadia; Mehboob, Riffat; Ahmad, Fridoon Jawad
2013-10-01
Prolactin (PRL) is a polypeptide hormone, secreted mainly by the anterior pituitary gland. It is involved in many endocrine activities. The key functions of PRL are related to reproduction and lactation in mammals. To ascertain the presence of polymorphisms in the bovine PRL gene (bPRL), the bPRL gene was sequenced. Five mutations were identified in exonic region and eleven in associated intronic regions in 100 cattle from four Pakistani cattle breeds. Haplotype of predicted amino acid changes represent a common alteration at codon 222 from R-Arginine into K-Lysine in all four breeds. Significant statistical variations were observed in the distribution of single nucleotide polymorphism (SNP) in various cattle populations. However, on basis of present study, an association of these SNPs with milk performance traits in four Pakistani cow breeds cannot be truly replicated but at least can be effective DNA markers for some of the breeds studied. Linkage analysis between these SNPs on larger populations can be useful for the association with milk production traits. Furthermore, present study may be used for marker-assisted selection and management in cattle breeding program in local cattle breeds.
Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families
Wittenburg, Dörte; Teuscher, Friedrich; Klosa, Jan; Reinsch, Norbert
2016-01-01
In livestock, current statistical approaches utilize extensive molecular data, e.g., single nucleotide polymorphisms (SNPs), to improve the genetic evaluation of individuals. The number of model parameters increases with the number of SNPs, so the multicollinearity between covariates can affect the results obtained using whole genome regression methods. In this study, dependencies between SNPs due to linkage and linkage disequilibrium among the chromosome segments were explicitly considered in methods used to estimate the effects of SNPs. The population structure affects the extent of such dependencies, so the covariance among SNP genotypes was derived for half-sib families, which are typical in livestock populations. Conditional on the SNP haplotypes of the common parent (sire), the theoretical covariance was determined using the haplotype frequencies of the population from which the individual parent (dam) was derived. The resulting covariance matrix was included in a statistical model for a trait of interest, and this covariance matrix was then used to specify prior assumptions for SNP effects in a Bayesian framework. The approach was applied to one family in simulated scenarios (few and many quantitative trait loci) and using semireal data obtained from dairy cattle to identify genome segments that affect performance traits, as well as to investigate the impact on predictive ability. Compared with a method that does not explicitly consider any of the relationship among predictor variables, the accuracy of genetic value prediction was improved by 10–22%. The results show that the inclusion of dependence is particularly important for genomic inference based on small sample sizes. PMID:27402363
He, Yanxia; Yuan, Wangjun; Dong, Meifang; Han, Yuanji; Shang, Fude
2017-01-01
Osmanthus fragrans is an ornamental plant of substantial commercial value, and no genetic linkage maps of this species have previously been reported. Specific-locus amplified fragment sequencing (SLAF-seq) is a recently developed technology that allows massive single nucleotide polymorphisms (SNPs) to be identified and high-resolution genotyping. In our current research, we generated the first genetic map of O. fragrans using SLAF-seq, which is composed with 206.92 M paired-end reads and 173,537 SLAF markers. Among total 90,715 polymorphic SLAF markers, 15,317 polymorphic SLAFs could be used for genetic map construction. The integrated map contained 14,189 high quality SLAFs that were grouped in 23 genetic linkage groups, with a total length of 2962.46 cM and an average distance of 0.21 cM between two adjacent markers. In addition, 23,664 SNPs were identified from the mapped markers. As far as we know, this is the first of the genetic map of O. fragrans. Our results are further demonstrate that SLAF-seq is a very effective method for developing markers and constructing high-density linkage maps. The SNP markers and the genetic map reported in this study should be valuable resource in future research. PMID:29018460
efficient association study design via power-optimized tag SNP selection
HAN, BUHM; KANG, HYUN MIN; SEO, MYEONG SEONG; ZAITLEN, NOAH; ESKIN, ELEAZAR
2008-01-01
Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SNPs) by selecting a subset of SNPs to be genotyped (tag SNPs). While many current association studies are performed using commercially available high-throughput genotyping products that define a set of tag SNPs, choosing tag SNPs remains an important problem for both custom follow-up studies as well as designing the high-throughput genotyping products themselves. The most widely used tag SNP selection method optimizes over the correlation between SNPs (r2). However, tag SNPs chosen based on an r2 criterion do not necessarily maximize the statistical power of an association study. We propose a study design framework that chooses SNPs to maximize power and efficiently measures the power through empirical simulation. Empirical results based on the HapMap data show that our method gains considerable power over a widely used r2-based method, or equivalently reduces the number of tag SNPs required to attain the desired power of a study. Our power-optimized 100k whole genome tag set provides equivalent power to the Affymetrix 500k chip for the CEU population. For the design of custom follow-up studies, our method provides up to twice the power increase using the same number of tag SNPs as r2-based methods. Our method is publicly available via web server at http://design.cs.ucla.edu. PMID:18702637
Antal, Péter; Kiszel, Petra Sz.; Gézsi, András; Hadadi, Éva; Virág, Viktor; Hajós, Gergely; Millinghoffer, András; Nagy, Adrienne; Kiss, András; Semsei, Ágnes F.; Temesi, Gergely; Melegh, Béla; Kisfali, Péter; Széll, Márta; Bikov, András; Gálffy, Gabriella; Tamási, Lilla; Falus, András; Szalai, Csaba
2012-01-01
Genetic studies indicate high number of potential factors related to asthma. Based on earlier linkage analyses we selected the 11q13 and 14q22 asthma susceptibility regions, for which we designed a partial genome screening study using 145 SNPs in 1201 individuals (436 asthmatic children and 765 controls). The results were evaluated with traditional frequentist methods and we applied a new statistical method, called Bayesian network based Bayesian multilevel analysis of relevance (BN-BMLA). This method uses Bayesian network representation to provide detailed characterization of the relevance of factors, such as joint significance, the type of dependency, and multi-target aspects. We estimated posteriors for these relations within the Bayesian statistical framework, in order to estimate the posteriors whether a variable is directly relevant or its association is only mediated. With frequentist methods one SNP (rs3751464 in the FRMD6 gene) provided evidence for an association with asthma (OR = 1.43(1.2–1.8); p = 3×10−4). The possible role of the FRMD6 gene in asthma was also confirmed in an animal model and human asthmatics. In the BN-BMLA analysis altogether 5 SNPs in 4 genes were found relevant in connection with asthma phenotype: PRPF19 on chromosome 11, and FRMD6, PTGER2 and PTGDR on chromosome 14. In a subsequent step a partial dataset containing rhinitis and further clinical parameters was used, which allowed the analysis of relevance of SNPs for asthma and multiple targets. These analyses suggested that SNPs in the AHNAK and MS4A2 genes were indirectly associated with asthma. This paper indicates that BN-BMLA explores the relevant factors more comprehensively than traditional statistical methods and extends the scope of strong relevance based methods to include partial relevance, global characterization of relevance and multi-target relevance. PMID:22432035
Zhang, Jie; Chen, Yuewen; Shao, Yong; Wu, Qi; Guan, Ming; Zhang, Wei; Wan, Jun; Yu, Bo
2012-01-01
Background. TNFα-induced protein 3 (TNFAIP3) interacting with protein 1 (TNIP1) acts as a negative regulator of NF-κB and plays an important role in maintaining the homeostasis of immune system. A recent genome-wide association study (GWAS) showed that the polymorphism of TNIP1 was associated with the disease risk of SLE in Caucasian. In this study, we investigated whether the association of TNIP1 with SLE was replicated in Chinese population. Methods. The association of TNIP1 SNP rs7708392 (G/C) was determined by high resolution melting (HRM) analysis with unlabeled probe in 285 SLE patients and 336 healthy controls. Results. A new SNP rs79937737 located on 5 bp upstream of rs7708392 was discovered during the HRM analysis. No association of rs7708392 or rs79937737 with the disease risk of SLE was found. Furthermore, rs7708392 and rs79937737 were in weak linkage disequilibrium (LD). Hypotypes analysis of the two SNPs also showed no association with SLE in Chinese population. Conclusions. High resolution melting analysis with unlabeled probes proves to be a powerful and efficient genotyping method for identifying and screening SNPs. No association of rs7708392 or rs79937737 with the disease risk of SLE was observed in Chinese population. PMID:22852072
Johnston, Susan E.; Huisman, Jisca; Ellis, Philip A.; Pemberton, Josephine M.
2017-01-01
High-density linkage maps are an important tool to gain insight into the genetic architecture of traits of evolutionary and economic interest, and provide a resource to characterize variation in recombination landscapes. Here, we used information from the cattle genome and the 50 K Cervine Illumina BeadChip to inform and refine a high-density linkage map in a wild population of red deer (Cervus elaphus). We constructed a predicted linkage map of 38,038 SNPs and a skeleton map of 10,835 SNPs across 34 linkage groups. We identified several chromosomal rearrangements in the deer lineage relative to sheep and cattle, including six chromosome fissions, one fusion, and two large inversions. Otherwise, our findings showed strong concordance with map orders in the cattle genome. The sex-averaged linkage map length was 2739.7 cM and the genome-wide autosomal recombination rate was 1.04 cM/Mb. The female autosomal map length was 1.21 longer than that of males (2767.4 cM vs. 2280.8 cM, respectively). Sex differences in map length were driven by high female recombination rates in peri-centromeric regions, a pattern that is unusual relative to other mammal species. This effect was more pronounced in fission chromosomes that would have had to produce new centromeres. We propose two hypotheses to explain this effect: (1) that this mechanism may have evolved to counteract centromeric drive associated with meiotic asymmetry in oocyte production; and/or (2) that sequence and structural characteristics suppressing recombination in close proximity to the centromere may not have evolved at neo-centromeres. Our study provides insight into how recombination landscapes vary and evolve in mammals, and will provide a valuable resource for studies of evolution, genetic improvement, and population management in red deer and related species. PMID:28667018
Complex Variation in Measures of General Intelligence and Cognitive Change
Rowe, Suzanne J.; Rowlatt, Amy; Davies, Gail; Harris, Sarah E.; Porteous, David J.; Liewald, David C.; McNeill, Geraldine; Starr, John M.
2013-01-01
Combining information from multiple SNPs may capture a greater amount of genetic variation than from the sum of individual SNP effects and help identifying missing heritability. Regions may capture variation from multiple common variants of small effect, multiple rare variants or a combination of both. We describe regional heritability mapping of human cognition. Measures of crystallised (gc) and fluid intelligence (gf) in late adulthood (64–79 years) were available for 1806 individuals genotyped for 549,692 autosomal single nucleotide polymorphisms (SNPs). The same individuals were tested at age 11, enabling us the rare opportunity to measure cognitive change across most of their lifespan. 547,750 SNPs ranked by position are divided into 10, 908 overlapping regions of 101 SNPs to estimate the genetic variance each region explains, an approach that resembles classical linkage methods. We also estimate the genetic variation explained by individual autosomes and by SNPs within genes. Empirical significance thresholds are estimated separately for each trait from whole genome scans of 500 permutated data sets. The 5% significance threshold for the likelihood ratio test of a single region ranged from 17–17.5 for the three traits. This is the equivalent to nominal significance under the expectation of a chi-squared distribution (between 1df and 0) of P<1.44×10−5. These thresholds indicate that the distribution of the likelihood ratio test from this type of variance component analysis should be estimated empirically. Furthermore, we show that estimates of variation explained by these regions can be grossly overestimated. After applying permutation thresholds, a region for gf on chromosome 5 spanning the PRRC1 gene is significant at a genome-wide 10% empirical threshold. Analysis of gene methylation on the temporal cortex provides support for the association of PRRC1 and fluid intelligence (P = 0.004), and provides a prime candidate gene for high throughput sequencing of these uniquely informative cohorts. PMID:24349040
Painter, Jodie N; O'Mara, Tracy A; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S; Kaufmann, Susanne; Hillman, Kristine M; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W; Webb, Penelope M; Scott, Rodney J; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G; Nyholt, Dale R; Henders, Anjali K; Fasching, Peter A; Hein, Alexander; Beckmann, Matthias W; Renner, Stefan P; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C; Goode, Ellen L; Teoman, Attila; Salvesen, Helga B; Trovik, Jone; Njolstad, Tormund S; Werner, Henrica M J; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L; Southey, Melissa C; Ekici, Arif B; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J; Olson, Janet E; Giles, Graham G; Bruinsma, Fiona; Cunningham, Julie M; Fridley, Brooke L; Børresen-Dale, Anne-Lise; Kristensen, Vessela N; Cox, Angela; Swerdlow, Anthony J; Orr, Nicholas; Bolla, Manjeet K; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D; Pharoah, Paul D P; Dunning, Alison M; Tomlinson, Ian; Easton, Douglas F; Edwards, Stacey L; Thompson, Deborah J; Spurdle, Amanda B
2015-03-01
Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10(-14), odds ratio = 0.86, 95% confidence interval = 0.82-0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Painter, Jodie N.; O'Mara, Tracy A.; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A.; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P.; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S.; Kaufmann, Susanne; Hillman, Kristine M.; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma. Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R.; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W.; Webb, Penelope M.; Scott, Rodney J.; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G.; Nyholt, Dale R.; Henders, Anjali K.; Fasching, Peter A.; Hein, Alexander; Beckmann, Matthias W.; Renner, Stefan P.; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C.; Goode, Ellen L.; Teoman, Attila; Salvesen, Helga B.; Trovik, Jone; Njolstad, Tormund S.; Werner, Henrica M.J.; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L.; Southey, Melissa C.; Ekici, Arif B.; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K.; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; Bruinsma, Fiona; Cunningham, Julie M.; Fridley, Brooke L.; Børresen-Dale, Anne-Lise; Kristensen, Vessela N.; Cox, Angela; Swerdlow, Anthony J.; Orr, Nicholas; Bolla, Manjeet K.; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D.; Pharoah, Paul D.P.; Dunning, Alison M.; Tomlinson, Ian; Easton, Douglas F.; Edwards, Stacey L.; Thompson, Deborah J.; Spurdle, Amanda B.
2015-01-01
Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10−14, odds ratio = 0.86, 95% confidence interval = 0.82–0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. PMID:25378557
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).
Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair
2014-02-06
Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)
2014-01-01
Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C. A. M.; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric
2017-01-01
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs. PMID:29176988
Functional Characterization of Schizophrenia-Associated Variation in CACNA1C
Eckart, Nicole; Song, Qifeng; Yang, Rebecca; Wang, Ruihua; Zhu, Heng; McCallion, Andrew S.; Avramopoulos, Dimitrios
2016-01-01
Calcium channel subunits, including CACNA1C, have been associated with multiple psychiatric disorders. Specifically, genome wide association studies (GWAS) have repeatedly identified the single nucleotide polymorphism (SNP) rs1006737 in intron 3 of CACNA1C to be strongly associated with schizophrenia and bipolar disorder. Here, we show that rs1006737 marks a quantitative trait locus for CACNA1C transcript levels. We test 16 SNPs in high linkage disequilibrium with rs1007637 and find one, rs4765905, consistently showing allele-dependent regulatory function in reporter assays. We find allele-specific protein binding for 13 SNPs including rs4765905. Using protein microarrays, we identify several proteins binding ≥3 SNPs, but not control sequences, suggesting possible functional interactions and combinatorial haplotype effects. Finally, using circular chromatin conformation capture, we show interaction of the disease-associated region including the 16 SNPs with the CACNA1C promoter and other potential regulatory regions. Our results elucidate the pathogenic relevance of one of the best-supported risk loci for schizophrenia and bipolar disorder. PMID:27276213
Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin
2010-01-01
Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease. PMID:21203467
Kathrani, Aarti; House, Arthur; Catchpole, Brian; Murphy, Angela; German, Alex; Werling, Dirk; Allenspach, Karin
2010-12-23
Inflammatory bowel disease (IBD) is considered to be the most common cause of vomiting and diarrhoea in dogs, and the German shepherd dog (GSD) is particularly susceptible. The exact aetiology of IBD is unknown, however associations have been identified between specific single-nucleotide polymorphisms (SNPs) in Toll-like receptors (TLRs) and human IBD. However, to date, no genetic studies have been undertaken in canine IBD. The aim of this study was to investigate whether polymorphisms in canine TLR 2, 4 and 5 genes are associated with IBD in GSDs. Mutational analysis of TLR2, TLR4 and TLR5 was performed in 10 unrelated GSDs with IBD. Four non-synonymous SNPs (T23C, G1039A, A1571T and G1807A) were identified in the TLR4 gene, and three non-synonymous SNPs (G22A, C100T and T1844C) were identified in the TLR5 gene. The non-synonymous SNPs identified in TLR4 and TLR5 were evaluated further in a case-control study using a SNaPSHOT multiplex reaction. Sequencing information from 55 unrelated GSDs with IBD were compared to a control group consisting of 61 unrelated GSDs. The G22A SNP in TLR5 was significantly associated with IBD in GSDs, whereas the remaining two SNPs were found to be significantly protective for IBD. Furthermore, the two SNPs in TLR4 (A1571T and G1807A) were in complete linkage disequilibrium, and were also significantly associated with IBD. The TLR5 risk haplotype (ACC) without the two associated TLR4 SNP alleles was significantly associated with IBD, however the presence of the two TLR4 SNP risk alleles without the TLR5 risk haplotype was not statistically associated with IBD. Our study suggests that the three TLR5 SNPs and two TLR4 SNPs; A1571T and G1807A could play a role in the pathogenesis of IBD in GSDs. Further studies are required to confirm the functional importance of these polymorphisms in the pathogenesis of this disease.
2013-01-01
Background The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the genomic architecture of adaptive traits in black spruce, which is a widely distributed conifer of the North American boreal forest. As an alternative to the usual candidate gene approach, a candidate SNP approach was developed for association testing. Results A genetic map containing 231 gene loci was used to identify QTL that were related to budset timing and to tree height assessed over multiple years and sites. Twenty-two unique genomic regions were identified, including 20 that were related to budset timing and 6 that were related to tree height. From results of outlier detection and bulk segregant analysis for adaptive traits using DNA pool sequencing of 434 genes, 52 candidate SNPs were identified and subsequently tested in genetic association studies for budset timing and tree height assessed over multiple years and sites. A total of 34 (65%) SNPs were significantly associated with budset timing, or tree height, or both. Although the percentages of explained variance (PVE) by individual SNPs were small, several significant SNPs were shared between sites and among years. Conclusions The sharing of genomic regions and significant SNPs between budset timing and tree height indicates pleiotropic effects. Significant QTLs and SNPs differed quite greatly among years, suggesting that different sets of genes for the same characters are involved at different stages in the tree’s life history. The functional diversity of genes carrying significant SNPs and low observed PVE further indicated that a large number of polymorphisms are involved in adaptive genetic variation. Accordingly, for undomesticated species such as black spruce with natural populations of large effective size and low linkage disequilibrium, efficient marker systems that are predictive of adaptation should require the survey of large numbers of SNPs. Candidate SNP approaches like the one developed in the present study could contribute to reducing these numbers. PMID:23724860
Family, Leila; Bensen, Jeannette T.; Troester, Melissa A.; Wu, Michael C.; Anders, Carey K.; Olshan, Andrew F.
2015-01-01
DNA damage recognition and repair is a complex system of genes focused on maintaining genomic stability. Recently, there has been a focus on how breast cancer susceptibility relates to genetic variation in the DNA bypass polymerases pathway. Race-stratified and subtype-specific logistic regression models were used to estimate odds ratios (ORs) and 95 % confidence intervals (CIs) for the association between 22 single-nucleotide polymorphisms (SNPs) in seven bypass polymerase genes and breast cancer risk in the Carolina Breast Cancer Study, a population-based, case–control study (1,972 cases and 1,776 controls). We used SNP-set kernel association test (SKAT) to evaluate the multi-gene, multi-locus (combined) SNP effects within bypass polymerase genes. We found similar ORs for breast cancer with three POLQ SNPs (rs487848 AG/AA vs. GG; OR = 1.31, 95 % CI 1.03–1.68 for Whites and OR = 1.22, 95 % CI 1.00–1.49 for African Americans), (rs532411 CT/TT vs. CC; OR = 1.31, 95 % CI 1.02–1.66 for Whites and OR = 1.22, 95 % CI 1.00–1.48 for African Americans), and (rs3218634 CG/CC vs. GG; OR = 1.29, 95 % CI 1.02–1.65 for Whites). These three SNPs are in high linkage disequilibrium in both races. Tumor subtype analysis showed the same SNPs to be associated with increased risk of Luminal breast cancer. SKAT analysis showed no significant combined SNP effects. These results suggest that variants in the POLQ gene may be associated with the risk of Luminal breast cancer. PMID:25417172
Aydin, A Fatih; Vural, Pervin; Oruç, Çoşkun Umut; Doğru-Abbasoğlu, Semra; Özderya, Ayşenur; Karadağ, Berrin; Uysal, Müjdat
2014-07-01
Endothelin1 (EDN1) is well established marker of inflammation. The functions of EDN1 are mediated mainly by endothelin receptors type A (EDNRA). The etiopathogenesis of Hashimoto's thyroiditis (HT) remains still elusive although the role of chronic inflammation and subsequent endothelial dysfunction has been established. This study examined firstly the possible association of EDN1 (G5665Tand T-1370G) and EDNRA (C+70G and G-231A) single nucleotide polymorphisms (SNPs) with the occurrence of HT, and evaluates the relationship between genotypes and clinical/laboratory manifestation of HT. We analyzed genotype and allele distributions of above mentioned polymorphisms in 163 patients with HT and 181 healthy controls by real-time PCR combined with melting curve analysis. No significant associations between HT and variant alleles of EDN1 5665 and -1370, as well as EDNRA +70 and -231 SNPs were found. Haplotype analysis demonstrated that there was a strong (D'=0.76, r(2)=0.487) and weak (D'=0.403, r(2)=0.086) linkage disequilibrium (LD) between EDN1 -1370 and 5665, and between EDNRA -231 and +70 SNPs, respectively. However, haplotype frequencies in patients were similar to those in controls. In addition, it was observed that the EDNRA +70 G allele had protective effect against early (at age before 40 years) disease onset of HT (OR: 0.51, 95% CI=0.32-0.79, p=0.003). Although no significant associations between susceptibility to HT with EDN1 5665 and -1370, as well as with EDNRA+70 and -231 SNPs were found, EDNRA +70 polymorphism was related with decreased risk for early onset HT. Copyright © 2014 Elsevier B.V. All rights reserved.
Ramanathan, Gnanasambandan; Ghosh, Santu; Elumalai, Ramprasad; Periyasamy, Soundararajan; Lakkakula, Bhaskar V K S
2016-06-01
Autosomal dominant polycystic kidney disease (ADPKD) is an inherited systemic disorder, characterized by the fluid filled cysts in the kidneys leading to end stage renal failure in later years of life. Hypertension is one of the major factors independently contributing to the chronic kidney disease (CKD) progression. The renin-angiotensin aldosterone system (RAAS) genes have been extensively studied as hypertension candidate genes. The aim of the present study was to investigate the role of angiotensin converting enzyme tagging - single nucleotide polymorphisms (ACE tag-SNPs) in progression of CKD in patients with ADPKD. m0 ethods: In the present study six ACE tagSNPs (angiotensin converting enzyme tag single nucleotide polymorphisms) and insertion/deletion (I/D) in 102 ADPKD patients and 106 control subjects were investigated. The tagSNPs were genotyped using FRET-based KASPar method and ACE ID by polymerase chain reaction (PCR) and electrophoresis. Genotypes and haplotypes were compared between ADPKD patients and controls. Univariate and multivariate logistic regression analyses were performed to assess the effect of genotypes and hypertension on CKD advancement. Mantel-Haenszel (M-H) stratified analysis was performed to study the relationship between different CKD stages and hypertension and their interaction. All loci were polymorphic and except rs4293 SNP the remaining loci followed Hardy-Weinberg equilibrium. Distribution of ACE genotypes and haplotypes in controls and ADPKD patients was not significant. A significant linkage disequilibrium (LD) was observed between SNPs forming two LD blocks. The univariate analysis revealed that the age, hypertension, family history of diabetes and ACE rs4362 contributed to the advancement of CKD. The results suggest that the ACE genotypes are effect modifiers of the relationship between hypertension and CKD advancement among the ADPKD patients.
Linked genetic variants on chromosome 10 control ear morphology and body mass among dog breeds.
Webster, Matthew T; Kamgari, Nona; Perloski, Michele; Hoeppner, Marc P; Axelsson, Erik; Hedhammar, Åke; Pielberg, Gerli; Lindblad-Toh, Kerstin
2015-06-23
The domestic dog is a rich resource for mapping the genetic components of phenotypic variation due to its unique population history involving strong artificial selection. Genome-wide association studies have revealed a number of chromosomal regions where genetic variation associates with morphological characters that typify dog breeds. A region on chromosome 10 is among those with the highest levels of genetic differentiation between dog breeds and is associated with body mass and ear morphology, a common motif of animal domestication. We characterised variation in this region to uncover haplotype structure and identify candidate functional variants. We first identified SNPs that strongly associate with body mass and ear type by comparing sequence variation in a 3 Mb region between 19 breeds with a variety of phenotypes. We next genotyped a subset of 123 candidate SNPs in 288 samples from 46 breeds to identify the variants most highly associated with phenotype and infer haplotype structure. A cluster of SNPs that associate strongly with the drop ear phenotype is located within a narrow interval downstream of the gene MSRB3, which is involved in human hearing. These SNPs are in strong genetic linkage with another set of variants that correlate with body mass within the gene HMGA2, which affects human height. In addition we find evidence that this region has been under selection during dog domestication, and identify a cluster of SNPs within MSRB3 that are highly differentiated between dogs and wolves. We characterise genetically linked variants that potentially influence ear type and body mass in dog breeds, both key traits that have been modified by selective breeding that may also be important for domestication. The finding that variants on long haplotypes have effects on more than one trait suggests that genetic linkage can be an important determinant of the phenotypic response to selection in domestic animals.
Jones, David B; Jerry, Dean R; Khatkar, Mehar S; Raadsma, Herman W; Zenger, Kyall R
2013-11-20
The silver-lipped pearl oyster, Pinctada maxima, is an important tropical aquaculture species extensively farmed for the highly sought "South Sea" pearls. Traditional breeding programs have been initiated for this species in order to select for improved pearl quality, but many economic traits under selection are complex, polygenic and confounded with environmental factors, limiting the accuracy of selection. The incorporation of a marker-assisted selection (MAS) breeding approach would greatly benefit pearl breeding programs by allowing the direct selection of genes responsible for pearl quality. However, before MAS can be incorporated, substantial genomic resources such as genetic linkage maps need to be generated. The construction of a high-density genetic linkage map for P. maxima is not only essential for unravelling the genomic architecture of complex pearl quality traits, but also provides indispensable information on the genome structure of pearl oysters. A total of 1,189 informative genome-wide single nucleotide polymorphisms (SNPs) were incorporated into linkage map construction. The final linkage map consisted of 887 SNPs in 14 linkage groups, spans a total genetic distance of 831.7 centimorgans (cM), and covers an estimated 96% of the P. maxima genome. Assessment of sex-specific recombination across all linkage groups revealed limited overall heterochiasmy between the sexes (i.e. 1.15:1 F/M map length ratio). However, there were pronounced localised differences throughout the linkage groups, whereby male recombination was suppressed near the centromeres compared to female recombination, but inflated towards telomeric regions. Mean values of LD for adjacent SNP pairs suggest that a higher density of markers will be required for powerful genome-wide association studies. Finally, numerous nacre biomineralization genes were localised providing novel positional information for these genes. This high-density SNP genetic map is the first comprehensive linkage map for any pearl oyster species. It provides an essential genomic tool facilitating studies investigating the genomic architecture of complex trait variation and identifying quantitative trait loci for economically important traits useful in genetic selection programs within the P. maxima pearling industry. Furthermore, this map provides a foundation for further research aiming to improve our understanding of the dynamic process of biomineralization, and pearl oyster evolution and synteny.
Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.
2015-01-01
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960
Obesity-related genetic variants, human pigmentation, and risk of melanoma
Li, Xin; Liang, Liming; Zhang, Mingfeng; Song, Fengju; Nan, Hongmei; Wang, Li-E; Wei, Qingyi; Lee, Jeffrey E.; Amos, Christopher I.; Qureshi, Abrar A.; Han, Jiali
2013-01-01
Previous biological studies showed evidence of a genetic link between obesity and pigmentation in both animal models and humans. Our study investigated the individual and joint associations between obesity-related single nucleotide polymorphisms (SNPs) and both human pigmentation and risk of melanoma. Eight obesity-related SNPs in the FTO, MAP2K5, NEGR1, FLJ35779, ETV5, CADM2, and NUDT3 genes were nominally significantly associated with hair color among 5,876 individuals of European ancestry. The genetic score combining 35 independent obesity-risk loci was significantly associated with darker hair color (beta-coefficient per ten alleles=0.12, P-value=4 10−5). However, single SNPs or genetic scores showed non-significant association with tanning ability. We further examined the SNPs at the FTO locus for their associations with pigmentation and risk of melanoma. Among the 783 SNPs in the FTO gene with imputation R-square quality metric >0.8 using the 1000 genome data set, ten and three independent SNPs were significantly associated with hair color and tanning ability respectively. Moreover, five independent FTO SNPs showed nominally significant association with risk of melanoma in 1,804 cases and 1,026 controls. But none of them was associated with obesity or in linkage disequilibrium with obesity-related variants. FTO locus may confer variation in human pigmentation and risk of melanoma, which may be independent of its effect on obesity. PMID:23539184
Multi-ethnic fine-mapping of 14 central adiposity loci.
Liu, Ching-Ti; Buchkovich, Martin L; Winkler, Thomas W; Heid, Iris M; Borecki, Ingrid B; Fox, Caroline S; Mohlke, Karen L; North, Kari E; Adrienne Cupples, L
2014-09-01
The Genetic Investigation of Anthropometric Traits (GIANT) consortium identified 14 loci in European Ancestry (EA) individuals associated with waist-to-hip ratio (WHR) adjusted for body mass index. These loci are wide and narrowing the signals remains necessary. Twelve of 14 loci identified in GIANT EA samples retained strong associations with WHR in our joint EA/individuals of African Ancestry (AA) analysis (log-Bayes factor >6.1). Trans-ethnic analyses at five loci (TBX15-WARS2, LYPLAL1, ADAMTS9, LY86 and ITPR2-SSPN) substantially narrowed the signals to smaller sets of variants, some of which are in regions that have evidence of regulatory activity. By leveraging varying linkage disequilibrium structures across different populations, single-nucleotide polymorphisms (SNPs) with strong signals and narrower credible sets from trans-ethnic meta-analysis of central obesity provide more precise localizations of potential functional variants and suggest a possible regulatory role. Meta-analysis results for WHR were obtained from 77 167 EA participants from GIANT and 23 564 AA participants from the African Ancestry Anthropometry Genetics Consortium. For fine mapping we interrogated SNPs within ± 250 kb flanking regions of 14 previously reported index SNPs from loci discovered in EA populations by performing trans-ethnic meta-analysis of results from the EA and AA meta-analyses. We applied a Bayesian approach that leverages allelic heterogeneity across populations to combine meta-analysis results and aids in fine-mapping shared variants at these locations. We annotated variants using information from the ENCODE Consortium and Roadmap Epigenomics Project to prioritize variants for possible functionality. Published by Oxford University Press 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Genome-Wide Association Mapping of Crown Rust Resistance in Oat Elite Germplasm.
Klos, Kathy Esvelt; Yimer, Belayneh A; Babiker, Ebrahiem M; Beattie, Aaron D; Bonman, J Michael; Carson, Martin L; Chong, James; Harrison, Stephen A; Ibrahim, Amir M H; Kolb, Frederic L; McCartney, Curt A; McMullen, Michael; Fetch, Jennifer Mitchell; Mohammadi, Mohsen; Murphy, J Paul; Tinker, Nicholas A
2017-07-01
Oat crown rust, caused by f. sp. , is a major constraint to oat ( L.) production in many parts of the world. In this first comprehensive multienvironment genome-wide association map of oat crown rust, we used 2972 single-nucleotide polymorphisms (SNPs) genotyped on 631 oat lines for association mapping of quantitative trait loci (QTL). Seedling reaction to crown rust in these lines was assessed as infection type (IT) with each of 10 crown rust isolates. Adult plant reaction was assessed in the field in a total of 10 location-years as percentage severity (SV) and as infection reaction (IR) in a 0-to-1 scale. Overall, 29 SNPs on 12 linkage groups were predictive of crown rust reaction in at least one experiment at a genome-wide level of statistical significance. The QTL identified here include those in regions previously shown to be linked with seedling resistance genes , , , , , and and also with adult-plant resistance and adaptation-related QTL. In addition, QTL on linkage groups Mrg03, Mrg08, and Mrg23 were identified in regions not previously associated with crown rust resistance. Evaluation of marker genotypes in a set of crown rust differential lines supported as the identity of . The SNPs with rare alleles associated with lower disease scores may be suitable for use in marker-assisted selection of oat lines for crown rust resistance. Copyright © 2017 Crop Science Society of America.
Polymorphisms in IL12A and cockroach allergy in children with asthma.
Pistiner, Michael; Hunninghake, Gary M; Soto-Quiros, Manuel E; Avila, Lydiana; Murphy, Amy; Lasky-Su, Jessica; Schuemann, Brooke; Klanderman, Barbara J; Raby, Benjamin A; Celedón, Juan C
2008-07-31
IL12A has been implicated in T-cell development and may thus influence the development of atopy and allergic diseases. We tested for association between four linkage disequilibrium (LD)-tagging SNPs (rs2243123, rs2243151, rs668998, and rs17826053) in IL12A and asthma and allergy-related (serum total and allergen-specific IgE, and skin test reactivity [STR] to two common allergens) phenotypes in two samples: 417 Costa Rican children with asthma and their parents, and 470 families of 503 white children in the Childhood Asthma Management Program (CAMP). The analysis was conducted using the family-based association test (FBAT) statistic implemented in the PBAT program. Among Costa Rican children with asthma, homozygosity for the minor allele of each of two SNPs in IL12A (rs2243123 and rs2243151) was associated with increased risks of STR to American cockroach (P = 0.03 for both SNPs), STR to German cockroach (P = 0.01 for both SNPs), and having a positive IgE to German cockroach (P < 0.05 for both SNPs). Among children in CAMP, homozygosity for the minor allele of SNP rs2243151 in IL12A was inversely associated with STR to German cockroach (P = 0.03) and homozygosity for the minor allele of SNP rs17826053 in IL12A was associated with increased risks of STR to American cockroach (P = 0.01) and STR to German cockroach (P = 0.007). There was no significant association between any SNP in IL12A and asthma, STR to dust mite, or total IgE in Costa Rica or CAMP. Our findings suggest that variants in IL12A influence cockroach allergy among children with asthma.
Associations of IL6 polymorphisms with lung function decline and COPD
He, Jian-Qing; Foreman, Marilyn G.; Shumansky, Karey; Zhang, Xuekui; Akhabir, Loubna; Sin, Don D; Man, S F Paul; DeMeo, Dawn L.; Litonjua, Augusto A.; Silverman, Edwin K.; Connett, John E; Anthonisen, Nicholas R; Wise, Robert A; Paré, Peter D; Sandford, Andrew J
2010-01-01
Background Interleukin-6 (IL6) is a pleiotropic pro-inflammatory and immunomodulatory cytokine which likely plays an important role in the pathogenesis of COPD. There is a functional single nucleotide polymorphism (SNP), −174G/C, in the promoter region of IL6. We hypothesized that IL6 SNPs influence susceptibility for impaired lung function and COPD in smokers. Methods Seven and 5 SNPs in IL6 were genotyped in two nested case-control samples derived from the Lung Health Study (LHS) based on phenotypes of rate of decline of forced expiratory volume in one second (FEV1) over 5 years and baseline FEV1 at the beginning of the LHS. Serum IL6 concentrations were measured for all subjects. A partially overlapping panel of 9 IL6 SNPs was genotyped in 389 COPD cases from the National Emphysema Treatment Trial (NETT) and 420 controls from the Normative Aging Study (NAS). Results In the LHS, three IL6 SNPs were associated with FEV1 decline (0.023 ≤ P ≤ 0.041 in additive models). Among them the IL6_−174C allele was associated with rapid decline of lung function. The association was more significant in a genotype-based analysis (P = 0.006). In the NETT-NAS study, IL6_−174G/C and four other IL6 SNPs, all of which are in linkage disequilibrium with IL6_−174G/C, were associated with susceptibility to COPD (0.01 ≤ P ≤ 0.04 in additive genetic models). Conclusion Our results suggest that the IL6_−174G/C SNP is associated with rapid decline of FEV1 and susceptibility to COPD in smokers. PMID:19359268
FTO Genetic Variation and Association With Obesity in West Africans and African Americans
Adeyemo, Adebowale; Chen, Guanjie; Zhou, Jie; Shriner, Daniel; Doumatey, Ayo; Huang, Hanxia; Rotimi, Charles
2010-01-01
OBJECTIVE The FTO gene is one of the most consistently replicated loci for obesity. However, data from populations of African ancestry are limited. We evaluated genetic variation in the FTO gene and investigated associations with obesity in West Africans and African Americans. RESEARCH DESIGN AND METHODS The study samples comprised 968 African Americans (59% female, mean age 49 years, mean BMI 30.8 kg/m2) and 517 West Africans (58% female, mean age 54 years, mean BMI 25.5 kg/m2). FTO genetic variation was evaluated by genotyping 262 tag single nucleotide polymorphisms (SNPs) across the entire gene. Association of each SNP with BMI, waist circumference, and percent fat mass was investigated under an additive model. RESULTS As expected, both African-ancestry samples showed weaker linkage disequilibrium (LD) patterns compared with other continental (e.g., European) populations. Several intron 8 SNPs, in addition to intron 1 SNPs, showed significant associations in both study samples. The combined effect size for BMI for the top SNPs from meta-analysis was 0.77 kg/m2 (P = 0.009, rs9932411) and 0.70 kg/m2 (P = 0.006, rs7191513). Two previously reported associations with intron 1 SNPs (rs1121980 and rs7204609, r2 = 0.001) were replicated among the West Africans. CONCLUSIONS The FTO gene shows significant differences in allele frequency and LD patterns in populations of African ancestry compared with other continental populations. Despite these differences, we observed evidence of associations with obesity in African Americans and West Africans, as well as evidence of heterogeneity in association. More studies of FTO in multiple ethnic groups are needed. PMID:20299471
Modeling haplotype block variation using Markov chains.
Greenspan, G; Geiger, D
2006-04-01
Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity.
Modeling Haplotype Block Variation Using Markov Chains
Greenspan, G.; Geiger, D.
2006-01-01
Models of background variation in genomic regions form the basis of linkage disequilibrium mapping methods. In this work we analyze a background model that groups SNPs into haplotype blocks and represents the dependencies between blocks by a Markov chain. We develop an error measure to compare the performance of this model against the common model that assumes that blocks are independent. By examining data from the International Haplotype Mapping project, we show how the Markov model over haplotype blocks is most accurate when representing blocks in strong linkage disequilibrium. This contrasts with the independent model, which is rendered less accurate by linkage disequilibrium. We provide a theoretical explanation for this surprising property of the Markov model and relate its behavior to allele diversity. PMID:16361244
The first report of prion-related protein gene (PRNT) polymorphisms in goat.
Kim, Yong-Chan; Jeong, Byung-Hoon
2017-06-01
Prion protein is encoded by the prion protein gene (PRNP). Polymorphisms of several members of the prion gene family have shown association with prion diseases in several species. Recent studies on a novel member of the prion gene family in rams have shown that prion-related protein gene (PRNT) has a linkage with codon 26 of prion-like protein (PRND). In a previous study, codon 26 polymorphism of PRND has shown connection with PRNP haplotype which is strongly associated with scrapie vulnerability. In addition, the genotype of a single nucleotide polymorphism (SNP) at codon 26 of PRND is related to fertilisation capacity. These findings necessitate studies on the SNP of PRNT gene which is connected with PRND. In goat, several polymorphism studies have been performed for PRNP, PRND, and shadow of prion protein gene (SPRN). However, polymorphism on PRNT has not been reported. Hence, the objective of this study was to determine the genotype and allelic distribution of SNPs of PRNT in 238 Korean native goats and compare PRNT DNA sequences between Korean native goats and several ruminant species. A total of five SNPs, including PRNT c.-114G > T, PRNT c.-58A > G in the upstream of PRNT gene, PRNT c.71C > T (p.Ala24Val) and PRNT c.102G > A in the open reading frame (ORF) and c.321C > T in the downstream of PRNT gene, were found in this study. All five SNPs of caprine PRNT gene in Korean native goat are in complete linkage disequilibrium (LD) with a D' value of 1.0. Interestingly, comparative sequence analysis of the PRNT gene revealed five mismatches between DNA sequences of Korean native goats and those of goats deposited in the GenBank. Korean native black goats also showed 5 mismatches in PRNT ORF with cattle. To the best of our knowledge, this is the first genetic research of the PRNT gene in goat.
Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice
2011-05-05
High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.
Braz, C U; Camargo, G M F; Cardoso, D F; Gil, F M M; Fonseca, P D S; Cyrillo, J N S G; Mercadante, M E Z; Oliveira, H N; Tonhati, H
2015-12-28
The hormone ghrelin is produced in the stomach wall, has an orexigenic function, stimulates growth hormone secretion, and affects the energy balance of the animal. Therefore, the ghrelin gene (GHRL) is considered to be a good candidate marker for the identification of traits of great economic importance in cattle, such as those associated with feed intake, growth, and carcass quality. The use of molecular genetic markers associated with such traits permits the earlier and more accurate identification of superior animals, thus reducing the interval between generations, and increasing the genetic gain. Six SNPs were found in the GHRL gene, located in intron 3, intron 4, and exon 5. The positions of the SNPs on the gene and the substitutions were: g.2184A>G, g.2347T>C, g.4469T>C, g.4548A>G, g.4663T>C, and g.4729T>C (GenBank accession No. JX565585). After analysis of linkage disequilibrium, association tests were performed between four SNPs with the traits year weight for males, yearling weight for females, dry matter intake, loin eye area, and rump fat thickness (P ≤ 0.05). Therefore, GHRL is an important candidate gene that may be used to identify genetic variations that influence traits of economic importance in beef cattle.
KIAA0319 gene polymorphisms are associated with developmental dyslexia in Chinese Uyghur children
Zhao, Hua; Chen, Yun; Zhang, Bao-ping; Zuo, Peng-xiang
2016-01-01
The gene KIAA0319 has been reported to be associated with developmental dyslexia (DD) in previous studies, although the results have not always been consistent. However, few studies have been conducted in Uyghur populations. In the present study, we aimed to investigate the association of KIAA0319 polymorphisms and DD in individuals of Uyghurian descent. We used a custom-by-design 48-Plex SNPscan Kit to genotype 18 single-nucleotide polymorphisms (SNPs) of KIAA0319 in a group of 196 children with dyslexia and 196 controls of Uyghur descent aged 8–12 years. As a result, 7 SNPs (Pmin=0.001) of KIAA0319 had nominal significant differences between the cases and controls under specific genotypic models. The two SNPs rs6935076 (P=0.020 under dominant model; P=0.028 under additive model) and rs3756821 (P=0.021 under additive model) remained significantly associated with dyslexia after Bonferroni correction. Linkage disequilibrium analysis showed three blocks within KIAA0319, and only a 10-SNP haplotype in block 3 was present at significantly different frequencies in the dyslexic children and controls. This study indicated that genetic polymorphisms of KIAA0319 are associated with an increased risk of DD in the Uyghur population. PMID:27098879
Prasad, Pushplata; Kumar, Ashok; Gupta, Rajiva; Juyal, Ramesh C.; B. K., Thelma
2012-01-01
Genome-wide association studies and meta-analysis indicate that several genes/loci are consistently associated with rheumatoid arthritis (RA) in European and Asian populations. To evaluate the transferability status of these findings to an ethnically diverse north Indian population, we performed a replication analysis. We investigated the association of 47 single-nucleotide polymorphisms (SNPs) at 43 of these genes/loci with RA in a north Indian cohort comprising 983 RA cases and 1007 age and gender matched controls. Genotyping was done using Infinium human 660w-quad. Association analysis by chi-square test implemented in plink was carried out in two steps. Firstly, association of the index or surrogate SNP (r2>0.8, calculated from reference GIH Hap-Map population) was tested. In the second step, evidence for allelic/locus heterogeneity at aforementioned genes/loci was assessed for by testing additional flanking SNPs in linkage equilibrium with index/surrogate marker. Of the 44 European specific index SNPs, neither index nor surrogate SNPs were present for nine SNPs in the genotyping array. Of the remaining 35, associations were replicated at seven genes namely PTPN22 (rs1217407, p = 3×10−3); IL2–21 (rs13119723, p = 0.008); HLA-DRB1 (rs660895, p = 2.56×10−5; rs6457617, p = 1.6×10−09; rs13192471, p = 6.7×10−16); TNFA1P3 (rs9321637, p = 0.03); CCL21 (rs13293020, p = 0.01); IL2RA (rs2104286, p = 1.9×10−4) and ZEB1 (rs2793108, p = 0.006). Of the three Asian specific loci tested, rs2977227 in PADI4 showed modest association (p<0.02). Further, of the 140 SNPs (in LE with index/surrogate variant) tested, association was observed at 11 additional genes: PTPRC, AFF3, CD28, CTLA4, PXK, ANKRD55, TAGAP, CCR6, BLK, CD40 and IL2RB. This study indicates limited replication of European and Asian index SNPs and apparent allelic heterogeneity in RA etiology among north Indians warranting independent GWAS in this population. However, replicated associations of HLA-DRB1, PTPN22 (which confer ∼50% of the heritable risk to RA) and IL2RA suggest that cross-ethnicity fine mapping of such loci is apposite for identification of causal variants. PMID:22355377
Association Analysis of the Ephrin-B2 Gene in African-Americans with End-Stage Renal Disease
Hicks, Pamela J.; Staten, Jennifer L.; Palmer, Nicholette D.; Langefeld, Carl D.; Ziegler, Julie T.; Keene, Keith L.; Sale, Michele M.; Bowden, Donald W.; Freedman, Barry I.
2008-01-01
Background Genome scans in African-Americans with end-stage renal disease (ESRD) identified linkage on chromosome 13q33 in the region containing the ephrin-B2 ligand (EFNB2) genes. Interactions between the ephrin-B2 receptor and ephrin-B2 ligand play essential roles in renal angiogenesis, blood vessel maturation, and kidney disease. Methods The EFNB2 gene was evaluated as a positional candidate for non-diabetic and diabetic ESRD susceptibility in 1,071 unrelated African-American subjects; 316 with non-diabetic etiologies of ESRD, 394 with type 2 diabetes-associated ESRD and 361 healthy controls. Single nucleotide polymorphism (SNP) genotyping was performed on the Sequenom Mass Array System. Statistical analyses were computed using Dandelion version 1.26, Snpaddmix version 1.4 and Haploview version 3.32. Results Twenty-eight HapMap tag SNPs were genotyped spanning the 39 kilobases (kb) of the EFNB2 coding region, with average spacing of 1.43 kb. Analysis of 710 ESRD patient samples and 361 controls provided no evidence of single SNP associations in either diabetic or non-diabetic ESRD; although nominal evidence of association with all-cause ESRD was observed with a two SNP (p = 0.022) and three SNP (p = 0.023) haplotype, both containing SNPs rs7490924 and rs2391335 in intron 1. Conclusions Although an attractive positional candidate gene, polymorphisms in the EFNB2 gene do not appear to contribute in a substantial way to non-diabetic, diabetic or all-cause ESRD susceptibility in African-Americans. Additional genes within the chromosome 13q33 linkage interval are likely contributors to African-American non-diabetic ESRD. PMID:18580054
Kongchum, Pawapol; Palti, Yniv; Hallerman, Eric M; Hulata, Gideon; David, Lior
2010-08-01
Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to develop molecular tools for breeding CyHV-3-resistant carp, we have amplified and sequenced 11 candidate genes for viral disease resistance including TLR2, TLR3, TLR4ba, TLR7, TLR9, TLR21, TLR22, MyD88, TRAF6, type I IFN and IL-1beta. For each gene, we initially cloned and sequenced PCR amplicons from 8 to 12 fish (2-3 fish per strain) from the SNP discovery panel. We then identified and evaluated putative SNPs for their polymorphisms in the SNP discovery panel and validated their usefulness for linkage analysis in a full-sib family using the SNaPshot method. Our sequencing results and phylogenetic analyses suggested that TLR3, TLR7 and MyD88 genes are duplicated in the common carp genome. We, therefore, developed locus-specific PCR primers and SNP genotyping assays for the duplicated loci. A total of 48 SNP markers were developed from PCR fragments of the 13 loci (7 single-locus and 3 duplicated genes). Thirty-nine markers were polymorphic with estimated minor allele frequencies of more than 0.1. The utility of the SNP markers was evaluated in one full-sib family and revealed that 20 markers from 9 loci segregated in a disomic and Mendelian pattern and would be useful for linkage analysis. Published by Elsevier Ltd.
Kogelman, Lisette J. A.; Pant, Sameer D.; Fredholm, Merete; Kadarmideen, Haja N.
2014-01-01
Obesity is a complex condition with world-wide exponentially rising prevalence rates, linked with severe diseases like Type 2 Diabetes. Economic and welfare consequences have led to a raised interest in a better understanding of the biological and genetic background. To date, whole genome investigations focusing on single genetic variants have achieved limited success, and the importance of including genetic interactions is becoming evident. Here, the aim was to perform an integrative genomic analysis in an F2 pig resource population that was constructed with an aim to maximize genetic variation of obesity-related phenotypes and genotyped using the 60K SNP chip. Firstly, Genome Wide Association (GWA) analysis was performed on the Obesity Index to locate candidate genomic regions that were further validated using combined Linkage Disequilibrium Linkage Analysis and investigated by evaluation of haplotype blocks. We built Weighted Interaction SNP Hub (WISH) and differentially wired (DW) networks using genotypic correlations amongst obesity-associated SNPs resulting from GWA analysis. GWA results and SNP modules detected by WISH and DW analyses were further investigated by functional enrichment analyses. The functional annotation of SNPs revealed several genes associated with obesity, e.g., NPC2 and OR4D10. Moreover, gene enrichment analyses identified several significantly associated pathways, over and above the GWA study results, that may influence obesity and obesity related diseases, e.g., metabolic processes. WISH networks based on genotypic correlations allowed further identification of various gene ontology terms and pathways related to obesity and related traits, which were not identified by the GWA study. In conclusion, this is the first study to develop a (genetic) obesity index and employ systems genetics in a porcine model to provide important insights into the complex genetic architecture associated with obesity and many biological pathways that underlie it. PMID:25071839
Gene mapping study for constitutive skin color in an isolated Mongolian population.
Paik, Seung Hwan; Kim, Hyun-Jin; Son, Ho-Young; Lee, Seungbok; Im, Sun-Wha; Ju, Young Seok; Yeon, Je Ho; Jo, Seong Jin; Eun, Hee Chul; Seo, Jeong-Sun; Kwon, Oh Sang; Kim, Jong-Il
2012-03-31
To elucidate the genes responsible for constitutive human skin color, we measured the extent of skin pigmentation in the buttock, representative of lifelong non-sun-exposed skin, and conducted a gene mapping study on skin color in an isolated Mongolian population composed of 344 individuals from 59 families who lived in Dashbalbar, Mongolia. The heritability of constitutive skin color was 0.82, indicating significant genetic association on this trait. Through the linkage analysis using 1,039 short tandem repeat (STR) microsatellite markers, we identified a novel genomic region regulating constitutive skin color on 11q24.2 with an logarithm of odds (LOD) score of 3.39. In addition, we also found other candidate regions on 17q23.2, 6q25.1, and 13q33.2 (LOD ≥ 2). Family-based association tests on these regions with suggestive linkage peaks revealed ten and two significant single nucleotide polymorphisms (SNPs) on the linkage regions of chromosome 11 and 17, respectively. We were able to discover four possible candidate genes that would be implicated to regulate human skin color: ETS1, UBASH3B, ASAM, and CLTC.
Gene mapping study for constitutive skin color in an isolated Mongolian population
Paik, Seung Hwan; Kim, Hyun-Jin; Son, Ho-Young; Lee, Seungbok; Im, Sun-Wha; Ju, Young Seok; Yeon, Je Ho; Jo, Seong Jin; Eun, Hee Chul; Seo, Jeong-Sun
2012-01-01
To elucidate the genes responsible for constitutive human skin color, we measured the extent of skin pigmentation in the buttock, representative of lifelong non-sun-exposed skin, and conducted a gene mapping study on skin color in an isolated Mongolian population composed of 344 individuals from 59 families who lived in Dashbalbar, Mongolia. The heritability of constitutive skin color was 0.82, indicating significant genetic association on this trait. Through the linkage analysis using 1,039 short tandem repeat (STR) microsatellite markers, we identified a novel genomic region regulating constitutive skin color on 11q24.2 with an logarithm of odds (LOD) score of 3.39. In addition, we also found other candidate regions on 17q23.2, 6q25.1, and 13q33.2 (LOD ≥ 2). Family-based association tests on these regions with suggestive linkage peaks revealed ten and two significant single nucleotide polymorphisms (SNPs) on the linkage regions of chromosome 11 and 17, respectively. We were able to discover four possible candidate genes that would be implicated to regulate human skin color: ETS1, UBASH3B, ASAM, and CLTC. PMID:22198297
Le Clerc, Sigrid; Coulonges, Cédric; Delaneau, Olivier; Van Manen, Danielle; Herbeck, Joshua T.; Limou, Sophie; An, Ping; Martinson, Jeremy J.; Spadoni, Jean-Louis; Therwath, Amu; Veldink, Jan H.; van den Berg, Leonard H.; Taing, Lieng; Labib, Taoufik; Mellak, Safa; Montes, Matthieu; Delfraissy, Jean-François; Schächter, François; Winkler, Cheryl; Froguel, Philippe; Mullins, James I.; Schuitemaker, Hanneke; Zagury, Jean-François
2011-01-01
Background Seven genome-wide association studies (GWAS) have been published in AIDS and only associations in the HLA region on chromosome 6 and CXCR6 have passed genome-wide significance. Methods We reanalyzed the data from three previously published GWAS, targeting specifically low frequency SNPs (minor allele frequency (MAF)<5%). Two groups composed of 365 slow progressors (SP) and 147 rapid progressors (RP) from Europe and the US were compared with a control group of 1394 seronegative individuals using Eigenstrat corrections. Results Of the 8584 SNPs with MAF<5% in cases and controls (Bonferroni threshold=5.8×10−6), four SNPs showed statistical evidence of association with the SP phenotype. The best result was for HCP5 rs2395029 (p=8.54×10−15, OR=3.41) in the HLA locus, in partial linkage disequilibrium with two additional chromosome 6 associations in C6orf48 (p=3.03×10−10, OR=2.9) and NOTCH4 (9.08×10−07, OR=2.32). The fourth association corresponded to rs2072255 located in RICH2 (p=3.30×10−06, OR=0.43) in chromosome 17. Using HCP5 rs2395029 as a covariate, the C6orf48 and NOTCH4 signals disappeared, but the RICH2 signal still remained significant. Conclusion Besides the already known chromosome 6 associations, the analysis of low frequency SNPs brought up a new association in the RICH2 gene. Interestingly, RICH2 interacts with BST-2 known to be a major restriction factor for HIV-1 infection. Our study has thus identified a new candidate gene for AIDS molecular etiology and confirms the interest of singling out low frequency SNPs in order to exploit GWAS data. PMID:21107268
Mejía-Benítez, Aurora; Klünder-Klünder, Miguel; Yengo, Loic; Meyre, David; Aradillas, Celia; Cruz, Esperanza; Pérez-Luque, Elva; Malacara, Juan Manuel; Garay, Maria Eugenia; Peralta-Romero, Jesús; Flores-Huerta, Samuel; García-Mena, Jaime; Froguel, Philippe; Cruz, Miguel; Bonnefond, Amélie
2013-02-01
Recent genome wide association studies (GWAS) and previous positional linkage studies have identified more than 50 single nucleotide polymorphisms (SNPs) associated with obesity, mostly in Europeans. We aimed to assess the contribution of some of these SNPs to obesity risk and to the variation of related metabolic traits, in Mexican children. The association of six European obesity-related SNPs in or near FTO, NPC1, ENPP1, NEGR1, GNPDA2 and MC4R genes with risk of obesity was tested in 1,463 school-aged Mexican children (N(cases) = 514; N(controls) = 949). We also assessed effects of these SNPs on the variation of body mass index (BMI), fasting serum insulin levels, fasting plasma glucose levels, total cholesterol and triglyceride levels, in a subset of 1,171 nonobese Mexican children. We found a significant effect of GNPDA2 rs10938397 on risk of obesity (odds ratio [OR] = 1.30; P = 1.34 × 10-3). Furthermore, we found nominal associations between obesity risk or BMI variation and the following SNPs: ENPP1 rs7754561, MC4R rs17782313 and NEGR1 rs2815752. Importantly, the at-risk alleles of both MC4R rs17782313 and NPC1 rs1805081 showed significant effect on increased fasting glucose levels (β = 0.36 mmol/L; P = 1.47 × 10(-3)) and decreased fasting serum insulin levels (β = -0.10 μU/mL; P = 1.21 × 10(-3)), respectively. Our present results suggest that some obesity-associated SNPs previously reported in Europeans also associate with risk of obesity, or metabolic quantitative traits, in Mexican children. Importantly, we found new associations between MC4R and fasting glucose levels, and between NPC1 and fasting insulin levels.
Bosch, T M; Doodeman, V D; Smits, P H M; Meijerman, I; Schellens, J H M; Beijnen, J H
2006-01-01
A possible explanation for the wide interindividual variability in toxicity and efficacy of drug therapy is variation in genes encoding drug-metabolizing enzymes and drug transporters. The allelic frequency of these genetic variants, linkage disequilibrium (LD), and haplotype of these polymorphisms are important parameters in determining the genetic differences between patients. The aim of this study was to explore the frequencies of polymorphisms in drug-metabolizing enzymes (CYP1A1, CYP2C9, CYP2C19, CYP3A4, CYP2D6, CYP3A5, DPYD, UGT1A1, GSTM1, GSTP1, GSTT1) and drug transporters (ABCB1[MDR1] and ABCC2[MRP2]), and to investigate the LD and perform haplotype analysis of these polymorphisms in a Dutch population. Blood samples were obtained from 100 healthy volunteers and genomic DNA was isolated and amplified by PCR. The amplification products were sequenced and analyzed for the presence of polymorphisms by sequence alignment. In the study population, we identified 13 new single nucleotide polymorphisms (SNPs) in Caucasians and three new SNPs in non-Caucasians, in addition to previously recognized SNPs. Three of the new SNPs were found within exons, of which two resulted in amino acid changes (A428T in CYP2C9 resulting in the amino acid substitution D143V; and C4461T in ABCC2 in a non-Caucasian producing the amino acid change T1476M). Several LDs and haplotypes were found in the Caucasian individuals. In this Dutch population, the frequencies of 16 new SNPs and those of previously recognized SNPs were determined in genes coding for drug-metabolizing enzymes and drug transporters. Several LDs and haplotypes were also inferred. These data are important for further research to help explain the interindividual pharmacokinetic and pharmacodynamic variability in response to drug therapy.
Rao, Kollu Nageswara; Ritch, Robert; Dorairaj, Syril K; Kaur, Inderjeet; Liebmann, Jeffrey M; Thomas, Ravi; Chakrabarti, Subhabrata
2008-07-09
Single nucleotide polymorphisms (SNPs) in the LOXL1 gene have been implicated in exfoliation syndrome (XFS) and exfoliation glaucoma (XFG). We have shown that these SNPs are not associated with the primary glaucomas such as primary open-angle (POAG) glaucoma and primary angle-closure glaucoma (PACG). To further establish the specificity of LOXL1 SNPs for XFS and XFG, we determined whether these SNPs were involved in pigment dispersion syndrome (PDS) and pigmentary glaucoma (PG). Three SNPs of LOXL1 (rs1048661, rs3825942, and rs2165241) were screened in a cohort of 78 unrelated and clinically well characterized glaucoma cases comprising of PG (n=44) and PDS (n=34) patients as well as 108 ethnically matched normal controls of Caucasian origin. The criteria for diagnosis of PDS/PG were Krukenberg spindle, hyperpigmentation of the trabecular meshwork, and wide open angle. Transillumination defects were detected by infrared pupillography, and the presence of a Zentmayer ring was considered as a confirmatory sign. All three SNPs were genotyped in cases and controls by resequencing the genomic region of LOXL1 harboring these variants and were further confirmed by polymerase chain reaction (PCR)-based restriction digestions. Haplotypes were generated from the genotype data, and the linkage disequilibrium (LD) and haplotype analysis were done with Haploview software that uses the expectation maximization (EM) algorithm. The LOXL1 SNPs showed no significant association with PDS or PG. There was no significant difference in the frequencies of the risk alleles of rs1048661 ('G' allele; p=0.309), rs3825942 ('G' allele' p=0.461), and rs2165241 ('T' allele; p=0.432) between PG/PDS cases and controls. Similarly, there was no involvement of the XFS/XFG-associated haplotypes, 'G-G' (p=0.643; [OR=1.08, 95%CI, 0.59-1.97]) and 'T-G' (p=0.266; [OR=1.35, 95%CI, 0.70-2.60]), with the PDS/PG phenotypes. The risk haplotype 'G-G' was observed in ~55% of the normal controls. There was no involvement of the LOXL1 SNPs in patients with PDS and PG. The results further indicate that the associations of these SNPs are specific to XFS/XFG.
The TERT gene harbors multiple variants associated with pancreatic cancer susceptibility
Campa, Daniele; Rizzato, Cosmeri; Stolzenberg-Solomon, Rachael; Pacetti, Paola; Vodicka, Pavel; Cleary, Sean P.; Capurso, Gabriele; Bueno-de-Mesquita, H. Bas; Werner, Jens; Gazouli, Maria; Butterbach, Katja; Ivanauskas, Audrius; Giese, Nathalia; Petersen, Gloria M.; Fogar, Paola; Wang, Zhaoming; Bassi, Claudio; Ryska, Miroslav; Theodoropoulos, George E.; Kooperberg, Charles; Li, Donghui; Greenhalf, William; Pasquali, Claudio; Hackert, Thilo; Fuchs, Charles S.; Mohelnikova-Duchonova, Beatrice; Sperti, Cosimo; Funel, Niccola; Dieffenbach, Aida Karina; Wareham, Nicholas J.; Buring, Julie; Holcátová, Ivana; Costello, Eithne; Zambon, Carlo-Federico; Kupcinskas, Juozas; Risch, Harvey A.; Kraft, Peter; Bracci, Paige M.; Pezzilli, Raffaele; Olson, Sara H.; Sesso, Howard D.; Hartge, Patricia; Strobel, Oliver; Małecka-Panas, Ewa; Visvanathan, Kala; Arslan, Alan A.; Pedrazzoli, Sergio; Souček, Pavel; Gioffreda, Domenica; Key, Timothy J.; Talar-Wojnarowska, Renata; Scarpa, Aldo; Mambrini, Andrea; Jacobs, Eric J.; Jamroziak, Krzysztof; Klein, Alison; Tavano, Francesca; Bambi, Franco; Landi, Stefano; Austin, Melissa A.; Vodickova, Ludmila; Brenner, Hermann; Chanock, Stephen J.; Fave, Gianfranco Delle; Piepoli, Ada; Cantore, Maurizio; Zheng, Wei; Wolpin, Brian M.; Amundadottir, Laufey T.; Canzian, Federico
2015-01-01
A small number of common susceptibility loci have been identified for pancreatic cancer, one of which is marked by rs401681 in the TERT – CLPTM1L gene region on chr5p15.33. Since this region is characterized by low linkage disequilibrium (LD), we sought to identify additional SNPs could be related to pancreatic cancer risk, independently of rs401681. We performed an in-depth analysis of genetic variability of the telomerase reverse transcriptase (TERT) and the telomerase RNA component (TERC) genes, in 5,550 subjects with pancreatic cancer and 7,585 controls from the PANcreatic Disease ReseArch (PANDoRA) and the PanScan consortia. We identified a significant association between a variant in TERT and pancreatic cancer risk (rs2853677, OR=0.85; 95% CI=0.80–0.90, P=8.3×10−8). Additional analysis adjusting rs2853677 for rs401681 indicated that the two SNPs are independently associated with pancreatic cancer risk, as suggested by the low LD between them (r2=0.07, D´=0.28). Three additional SNPs in TERT reached statistical significance after correction for multiple testing: rs2736100 (P=3.0×10−5), rs4583925 (P=4.0×10−5) and rs2735948 (P=5.0×10−5). In conclusion, we confirmed that the TERT locus is associated with pancreatic cancer risk, possibly through several independent variants. PMID:25940397
Genetic association of ubiquilin with Alzheimer's disease and related quantitative measures.
Kamboh, M I; Minster, R L; Feingold, E; DeKosky, S T
2006-03-01
The gene coding for ubiquilin 1 (UBQLN1) is located near a linkage peak on chromosome 9q22.2 and it also impacts the function of presenilin proteins involved in early-onset Alzheimer's disease (AD). Recently, genetic variation in UBQLN1 has been shown to affect the risk of AD in two independent family-based samples. The purpose of this study was to confirm the reported association in a large case-control sample and to also examine the association of UBQLN1 SNPs with quantitative measures of AD progression, namely age-at-onset (AAO), disease duration and Mini-Mental State Examination (MMSE) score. We examined the associations of three SNPs in the UBQLN1 gene (intron 6/A>C, intron 8/T>C and intron 9/A>G) in up to 978 LOAD cases and 808 controls. All SNPs were in significant linkage disequilibrium (P<0.0001). While modest significant associations were observed in the single-site regression analysis, 3-site haplotype analysis revealed significant associations (P<0.0001 for overall haplotype analysis). One common haplotype (H4) defined by intron 6/A-intron 8/C-intron 9/G alleles was associated with AD risk and one less common haplotype (H5) defined by intron 6/C-intron 8/C-intron 9/A alleles was associated with protection. The adjusted odds ratios with potentially one and two copies of risk haplotype H4 were 1.5 (95% CI: 0.99-2.26; P=0.054) and 3.66 (95% CI: 1.43-9.39; P=0.007), respectively, and odds ratio for haplotype H5 carriers was 0.31 (95% CI: 0.10-0.95; P=0.0398). In addition to disease risk, the homozygosity of the risk haplotype was also associated with older AAO, longer disease duration and lower MMSE score. In summary, our data from a large case-control cohort indicate that genetic variation in the UBQLN1 gene has a modest effect on risk, AAO and disease duration of AD. Our haplotype data suggest the presence of additional putative functional variants either in the UBQLN1 gene or nearby genes and provide strong justification for additional work in this region on chromosome 9.
Yamada, Yoshiji; Kato, Kimihiko; Oguri, Mitsutoshi; Horibe, Hideki; Fujimaki, Tetsuo; Yasukochi, Yoshiki; Takeuchi, Ichiro; Sakuma, Jun
2018-07-01
Given that substantial genetic components have been shown in ischemic stroke, intracerebral hemorrhage (ICH), and subarachnoid hemorrhage (SAH), heritability may be higher in early-onset than late-onset individuals with these conditions. Although genome-wide association studies (GWASs) have identified various genes and loci significantly associated with ischemic stroke, ICH, or intracranial aneurysm mainly in European ancestry populations, genetic variants that contribute to susceptibility to these disorders remain to be identified definitively. We performed exome-wide association studies (EWASs) to identify genetic variants that confer susceptibility to ischemic stroke, ICH, or SAH in early-onset subjects with these conditions. A total of 6,649 individuals aged ≤65 years were examined. For the EWAS of ischemic or hemorrhagic stroke, 6,224 individuals (450 subjects with ischemic stroke, 5,774 controls) or 6,179 individuals (261 subjects with ICH, 176 subjects with SAH, 5,742 controls), respectively, were examined. EWASs were performed with the use of Illumina Human Exome-12 v1.2 DNA Analysis BeadChip or Infinium Exome-24 v1.0 BeadChip. To compensate for multiple comparisons of allele frequencies with ischemic stroke, ICH, or SAH, we applied a false discovery rate (FDR) of <0.05 for statistical significance of association. The association of allele frequencies of 31,245 single nucleotide polymorphisms (SNPs) that passed quality control to ischemic stroke was examined with Fisher's exact test, and 31 SNPs were significantly (FDR <0.05) associated with ischemic stroke. The association of allele frequencies of 31,253 or 30,970 SNPs to ICH or SAH, respectively, was examined with Fisher's exact test, and six or two SNPs were significantly associated with ICH or SAH, respectively. Multivariable logistic regression analysis with adjustment for age, sex, and the prevalence of hypertension and diabetes mellitus revealed that 12 SNPs were significantly [P<0.0004 (0.05/124)] related to ischemic stroke. Similar analysis with adjustment for age, sex, and the prevalence of hypertension revealed that six or two SNPs were significantly [P<0.0016 (0.05/32)] related to ICH or SAH, respectively. After examination of linkage disequilibrium of identified SNPs and results of previous GWASs, we identified HHIPL2, CTNNA3, LOC643770, UTP20 , and TRIB3 as susceptibility loci for ischemic stroke, DNTTIP2 and FAM205A as susceptibility loci for ICH, and FAM160A1 and OR52E4 as such loci for SAH. Therefore, to the best of our knowledge, we have newly identified nine genes that confer susceptibility to early-onset ischemic stroke, ICH, or SAH. Determination of genotypes for the SNPs in these genes may prove informative for assessment of the genetic risk for ischemic stroke, ICH, or SAH in Japanese.
Sato, Youichi; Tajima, Atsushi; Tsunematsu, Kouki; Nozawa, Shiari; Yoshiike, Miki; Koh, Eitetsue; Kanaya, Jiro; Namiki, Mikio; Matsumiya, Kiyomi; Tsujimura, Akira; Komatsu, Kiyoshi; Itoh, Naoki; Eguchi, Jiro; Imoto, Issei; Yamauchi, Aiko; Iwamoto, Teruaki
2015-06-01
Are the four candidate loci (rs7867029, rs12870438, rs7174015 and rs724078) for human male fertility traits, identified in a genome-wide association study (GWAS) of a Hutterite population in the USA, associated with semen quality traits in a Japanese population? The four single nucleotide polymorphisms (SNPs) rs7867029, rs12870438, rs7174015 and rs724078 have no association with semen parameters in a meta-analysis of two Japanese male cohorts. Four (rs7867029, rs12870438, rs7174015 and rs724078) of the SNPs associated with family size or birth rate in the GWAS of a Hutterite population in the USA were associated with semen parameters in ethnically diverse men from Chicago, USA. This is a replication study in a total of 2015 Japanese subjects, including 791 fertile men and 1224 young men from the general population. We performed a replication study in two cohorts to assess whether the SNPs rs7867029, rs12870438, rs7174015 and rs724078 are associated with sperm concentration, semen volume, total sperm numbers, total motile sperm numbers or sperm motility. The rs12870438 SNP was detected by restriction fragment length polymorphism PCR while rs7174015, rs724078 and rs7867029 SNPs were genotyped using TaqMan probes. This study indicated that none of the four SNPs rs7867029, rs12870438, rs7174015 and rs724078 displayed a significant association with semen parameters in the meta-analysis of two Japanese male cohorts. Only four SNPs identified in the Hutterite GWAS were examined for associations with semen quality traits in a Japanese population. In addition, the linkage disequilibrium structures around the testing markers were different between ethnic groups. Locus mapping studies using a set of tagging SNPs across the loci will be necessary in populations with larger sample sizes in order to understand the contribution of specific genes to semen quality. This study was supported in part by the Ministry of Health and Welfare of Japan (1013201) (to T.I.), Grant-in-Aids for Scientific Research (C) (23510242) (to A.Ta.) from the Japan Society for the Promotion of Science, the European Union (BMH4-CT96-0314) (to T.I.), and the Takeda Science Foundation (to A.Ta.). None of the authors has any competing interests to declare. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
IRGM Variants and Susceptibility to Inflammatory Bowel Disease in the German Population
Bues, Stephanie; Stallhofer, Johannes; Fries, Christoph; Olszak, Torsten; Tsekeri, Eleni; Wetzke, Martin; Beigel, Florian; Steib, Christian; Friedrich, Matthias; Göke, Burkhard; Diegelmann, Julia; Czamara, Darina; Brand, Stephan
2013-01-01
Background & Aims Genome-wide association studies identified the autophagy gene IRGM to be strongly associated with Crohn's disease (CD) but its impact in ulcerative colitis (UC), its phenotypic effects and potential epistatic interactions with other IBD susceptibility genes are less clear which we therefore analyzed in this study. Methodology/Principal Findings Genomic DNA from 2060 individuals including 817 CD patients, 283 UC patients, and 961 healthy, unrelated controls (all of Caucasian origin) was analyzed for six IRGM single nucleotide polymorphisms (SNPs) (rs13371189, rs10065172 = p.Leu105Leu, rs4958847, rs1000113, rs11747270, rs931058). In all patients, a detailed genotype-phenotype analysis and testing for epistasis with the three major CD susceptibility genes NOD2, IL23R and ATG16L1 were performed. Our analysis revealed an association of the IRGM SNPs rs13371189 (p = 0.02, OR 1.31 [95% CI 1.05–1.65]), rs10065172 = p.Leu105Leu (p = 0.016, OR 1.33 [95% CI 1.06–1.66]) and rs1000113 (p = 0.047, OR 1.27 [95% CI 1.01–1.61]) with CD susceptibility. There was linkage disequilibrium between these three IRGM SNPs. In UC, several IRGM haplotypes were weakly associated with UC susceptibility (p<0.05). Genotype-phenotype analysis revealed no significant associations with a specific IBD phenotype or ileal CD involvement. There was evidence for weak gene-gene-interaction between several SNPs of the autophagy genes IRGM and ATG16L1 (p<0.05), which, however, did not remain significant after Bonferroni correction. Conclusions/Significance Our results confirm IRGM as susceptibility gene for CD in the German population, supporting a role for the autophagy genes IRGM and ATG16L1 in the pathogenesis of CD. PMID:23365659
Batra, Jyotsna; Lose, Felicity; O'Mara, Tracy; Marquart, Louise; Stephens, Carson; Alexander, Kimberly; Srinivasan, Srilakshmi; Eeles, Rosalind A.; Easton, Douglas F.; Olama, Ali Amin Al; Kote-Jarai, Zsofia; Guy, Michelle; Muir, Kenneth; Lophatananon, Artitaya; Rahman, Aneela A.; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Chambers, Suzanne; Gardiner, Robert A.; Aitken, Joanne; Yaxley, John; Kedda, Mary-Anne
2011-01-01
Background Kallikrein 15 (KLK15)/Prostinogen is a plausible candidate for prostate cancer susceptibility. Elevated KLK15 expression has been reported in prostate cancer and it has been described as an unfavorable prognostic marker for the disease. Objectives We performed a comprehensive analysis of association of variants in the KLK15 gene with prostate cancer risk and aggressiveness by genotyping tagSNPs, as well as putative functional SNPs identified by extensive bioinformatics analysis. Methods and Data Sources Twelve out of 22 SNPs, selected on the basis of linkage disequilibrium pattern, were analyzed in an Australian sample of 1,011 histologically verified prostate cancer cases and 1,405 ethnically matched controls. Replication was sought from two existing genome wide association studies (GWAS): the Cancer Genetic Markers of Susceptibility (CGEMS) project and a UK GWAS study. Results Two KLK15 SNPs, rs2659053 and rs3745522, showed evidence of association (p<0.05) but were not present on the GWAS platforms. KLK15 SNP rs2659056 was found to be associated with prostate cancer aggressiveness and showed evidence of association in a replication cohort of 5,051 patients from the UK, Australia, and the CGEMS dataset of US samples. A highly significant association with Gleason score was observed when the data was combined from these three studies with an Odds Ratio (OR) of 0.85 (95% CI = 0.77–0.93; p = 2.7×10−4). The rs2659056 SNP is predicted to alter binding of the RORalpha transcription factor, which has a role in the control of cell growth and differentiation and has been suggested to control the metastatic behavior of prostate cancer cells. Conclusions Our findings suggest a role for KLK15 genetic variation in the etiology of prostate cancer among men of European ancestry, although further studies in very large sample sets are necessary to confirm effect sizes. PMID:22132073
Genetic factors controlling wool shedding in a composite Easycare sheep flock.
Matika, O; Bishop, S C; Pong-Wong, R; Riggio, V; Headon, D J
2013-12-01
Historically, sheep have been selectively bred for desirable traits including wool characteristics. However, recent moves towards extensive farming and reduced farm labour have seen a renewed interest in Easycare breeds. The aim of this study was to quantify the underlying genetic architecture of wool shedding in an Easycare flock. Wool shedding scores were collected from 565 pedigreed commercial Easycare sheep from 2002 to 2010. The wool scoring system was based on a 10-point (0-9) scale, with score 0 for animals retaining full fleece and 9 for those completely shedding. DNA was sampled from 200 animals of which 48 with extreme phenotypes were genotyped using a 50-k SNP chip. Three genetic analyses were performed: heritability analysis, complex segregation analysis to test for a major gene hypothesis and a genome-wide association study to map regions in the genome affecting the trait. Phenotypes were treated as a continuous or binary variable and categories. High estimates of heritability (0.80 when treated as a continuous, 0.65-0.75 as binary and 0.75 as categories) for shedding were obtained from linear mixed model analyses. Complex segregation analysis gave similar estimates (0.80 ± 0.06) to those above with additional evidence for a major gene with dominance effects. Mixed model association analyses identified four significant (P < 0.05) SNPs. Further analyses of these four SNPs in all 200 animals revealed that one of the SNPs displayed dominance effects similar to those obtained from the complex segregation analyses. In summary, we found strong genetic control for wool shedding, demonstrated the possibility of a single putative dominant gene controlling this trait and identified four SNPs that may be in partial linkage disequilibrium with gene(s) controlling shedding. © 2013 University of Edinburgh, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
Yoneyama, Sachiko; Yao, Jie; Guo, Xiuqing; Fernandez-Rhodes, Lindsay; Lim, Unhee; Boston, Jonathan; Buzková, Petra; Carlson, Christopher S.; Cheng, Iona; Cochran, Barbara; Cooper, Richard; Ehret, Georg; Fornage, Myriam; Gong, Jian; Gross, Myron; Gu, C. Charles; Haessler, Jeff; Haiman, Christopher A.; Henderson, Brian; Hindorff, Lucia A.; Houston, Denise; Irvin, Marguerite R.; Jackson, Rebecca; Kuller, Lew; Leppert, Mark; Lewis, Cora E.; Li, Rongling; Le Marchand, Loic; Matise, Tara C.; Nguyen, Khanh-Dung H.; Chakravarti, Aravinda; Pankow, James S.; Pankratz, Nathan; Pooler, Loreall; Ritchie, Marylyn D.; Bien, Stephanie A.; Wassel, Christina L.; Chen, Yii-Der I.; Taylor, Kent D.; Allison, Matthew; Rotter, Jerome I.; Schreiner, Pamela J.; Schumacher, Fredrick; Wilkens, Lynne; Boerwinkle, Eric; Kooperberg, Charles; Peters, Ulrike; Buyske, Steven; Graff, Mariaelisa; North, Kari E.
2016-01-01
Background/Objectives Central adiposity measures such as waist circumference (WC) and waist-to-hip ratio (WHR) are associated with cardiometabolic disorders independently of BMI and are gaining clinically utility. Several studies report genetic variants associated with central adiposity, but most utilize only European ancestry populations. Understanding whether the genetic associations discovered among mainly European descendants are shared with African ancestry populations will help elucidate the biological underpinnings of abdominal fat deposition. Subjects/Methods To identify the underlying functional genetic determinants of body fat distribution, we conducted an array-wide association meta-analysis among persons of African ancestry across seven studies/consortia participating in the Population Architecture using Genomics and Epidemiology (PAGE) consortium. We used the Metabochip array, designed for fine mapping cardiovascular associated loci, to explore novel array-wide associations with WC and WHR among 15 945 African descendants using all and sex-stratified groups. We further interrogated 17 known WHR regions for African ancestry-specific variants. Results Of the 17 WHR loci, eight SNPs located in four loci were replicated in the sex-combined or sex-stratified meta-analyses. Two of these eight independently associated with WHR after conditioning on the known variant in European descendants (rs12096179 in TBX15-WARS2 and rs2059092 in ADAMTS9). In the fine mapping assessment, the putative functional region was reduced across all four loci but to varying degrees (average 40% drop in number of putative SNPs and 20% drop in genomic region). Similar to previous studies, the significant SNPs in the female stratified analysis were stronger than the significant SNPs from the sex-combined analysis. No novel associations were detected in the array-wide analyses. Conclusions Of 17 previously identified loci, four loci replicated in the African ancestry populations of this study. Utilizing different linkage disequilibrium patterns observed between European and African ancestries, we narrowed the suggestive region containing causative variants for all four loci. PMID:27867202
Micro-RNA Binding Site Polymorphisms in the WFS1 Gene Are Risk Factors of Diabetes Mellitus
Elek, Zsuzsanna; Németh, Nóra; Nagy, Géza; Németh, Helga; Somogyi, Anikó; Hosszufalusi, Nóra; Sasvári-Székely, Mária; Rónai, Zsolt
2015-01-01
The absolute or relative lack of insulin is the key factor in the pathogenesis of diabetes mellitus. Although the connection between loss of function mutations of the WFS1 gene and DIDMOAD-syndrome including diabetes mellitus underpins the significance of wolframin in the pathogenesis, exact role of WFS1 polymorphic variants in the development of type 1 and type 2 diabetes has not been discovered yet. In this analysis, 787 patients with diabetes and 900 healthy people participated. Genotyping of the 7 WFS1 SNPs was carried out by TaqMan assays. Association study was performed by χ 2-test in combination with correction for multiple testing. For functional analysis, the entire 3’ UTR of the WFS1 gene was subcloned in a pMIR-Report plasmid and relative luciferase activities were determined. Linkage disequilibrium analysis showed a generally high LD within the investigated region, however the rs1046322 locus was not in LD with the other SNPs. The two miR-SNPs, rs1046322 and rs9457 showed significant association with T1DM and T2DM, respectively. Haplotype analysis also confirmed the association between the 3’ UTR loci and both disease types. In vitro experiments showed that miR-185 reduces the amount of the resulting protein, and rs9457 miRSNP significantly influences the rate of reduction in a luciferase reporter assay. Genetic variants of the WFS1 gene might contribute to the genetic risk of T1DM and T2DM. Furthermore demonstrating the effect of rs9457 in binding of miR-185, we suggest that the optimal level of wolframin protein, potentially influenced by miR-regulation, is crucial in normal beta cell function. PMID:26426397
Micro-RNA Binding Site Polymorphisms in the WFS1 Gene Are Risk Factors of Diabetes Mellitus.
Elek, Zsuzsanna; Németh, Nóra; Nagy, Géza; Németh, Helga; Somogyi, Anikó; Hosszufalusi, Nóra; Sasvári-Székely, Mária; Rónai, Zsolt
2015-01-01
The absolute or relative lack of insulin is the key factor in the pathogenesis of diabetes mellitus. Although the connection between loss of function mutations of the WFS1 gene and DIDMOAD-syndrome including diabetes mellitus underpins the significance of wolframin in the pathogenesis, exact role of WFS1 polymorphic variants in the development of type 1 and type 2 diabetes has not been discovered yet. In this analysis, 787 patients with diabetes and 900 healthy people participated. Genotyping of the 7 WFS1 SNPs was carried out by TaqMan assays. Association study was performed by χ2-test in combination with correction for multiple testing. For functional analysis, the entire 3' UTR of the WFS1 gene was subcloned in a pMIR-Report plasmid and relative luciferase activities were determined. Linkage disequilibrium analysis showed a generally high LD within the investigated region, however the rs1046322 locus was not in LD with the other SNPs. The two miR-SNPs, rs1046322 and rs9457 showed significant association with T1DM and T2DM, respectively. Haplotype analysis also confirmed the association between the 3' UTR loci and both disease types. In vitro experiments showed that miR-185 reduces the amount of the resulting protein, and rs9457 miRSNP significantly influences the rate of reduction in a luciferase reporter assay. Genetic variants of the WFS1 gene might contribute to the genetic risk of T1DM and T2DM. Furthermore demonstrating the effect of rs9457 in binding of miR-185, we suggest that the optimal level of wolframin protein, potentially influenced by miR-regulation, is crucial in normal beta cell function.
Association of SSTR2 Polymorphisms and Glucose Homeostasis Phenotypes
Sutton, Beth S.; Palmer, Nicholette D.; Langefeld, Carl D.; Xue, Bingzhong; Proctor, Alexandria; Ziegler, Julie T.; Haffner, Steven M.; Norris, Jill M.; Bowden, Donald W.
2009-01-01
OBJECTIVE This study evaluated the influence of somatostatin receptor type 2 (SSTR2) polymorphisms on measures of glucose homeostasis in the Insulin Resistance Atherosclerosis Family Study (IRASFS). SSTR2 is a G-protein–coupled receptor that, in response to somatostatin, mediates inhibition of insulin, glucagon, and growth hormone release and thus may affect glucose homeostasis. RESEARCH DESIGN AND METHODS Ten single nucleotide polymorphisms (SNPs) spanning the gene were chosen using a SNP density selection algorithm and genotyped on 1,425 Hispanic-American individuals from 90 families in the IRASFS. These families comprised two samples (set 1 and set 2), which were analyzed individually and as a combined set. Single SNP tests of association were performed for four glucose homeostasis measures—insulin sensitivity (SI), acute insulin response (AIR), disposition index (DI), and fasting blood glucose (FBG)—using generalized estimating equations. RESULTS The SSTR2 locus was encompassed by a single linkage disequilibrium (LD) block (D′ = 0.91–1.00; r2 = 0.09–0.97) that contained four of the ten SNPs evaluated. Within the SSTR2-containing LD block, evidence of association was observed in each of the two sets and in a combined analysis with decreased SI(βhomozygous = −0.16; Pmeta-analysis = 0.0024–0.0030), decreased DI (βhomozygous = −0.35 to −5.16; Pmeta-analysis = 0.0075–0.027), and increased FBG (βhomozygous = 2.30; Pmeta-analysis = 0.045). SNPs outside the SSTR2-containing LD block were not associated with measures of glucose homeostasis. CONCLUSIONS We observed evidence for association of SSTR2 polymorphisms with measures of glucose homeostasis. Thus, variants in SSTR2 may influence pathways of SIto modulate glucose homeostasis. PMID:19324939
Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J
2012-05-25
A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
[The joint applications of DNA chips and single nucleotide polymorphisms in forensic science].
Bai, Peng; Tian, Li; Zhou, Xue-ping
2005-05-01
DNA chip technology, being a new high-technology, shows its vigorous life and rapid growth. Single Nucleotide Polymorphisms (SNPs) is the most common diversity in the human genome. It provides suitable genetic markers which play a key role in disease linkage study, pharmacogenomics, forensic medicine, population evolution and immigration study. Their advantage such as being analyzed with DNA chips technology, is predicted to play an important role in the field of forensic medicine, especially in paternity test and individual identification. This report mainly reviews the characteristics of DNA chip and SNPs, and their joint applications in the practice of forensic medicine.
Uemoto, Yoshinobu; Ohtake, Tsuyoshi; Sasago, Nanae; Takeda, Masayuki; Abe, Tsuyoshi; Sakuma, Hironori; Kojima, Takatoshi; Sasaki, Shinji
2017-11-13
Umami is a Japanese term for the fifth basic taste and is an important sensory property of beef palatability. Inosine 5'-monophosphate (IMP) contributes to umami taste in beef. Thus, the overall change in concentration of IMP and its degradation products can potentially affect the beef palatability. In this study, we investigated the genetic architecture of IMP and its degradation products in Japanese Black beef. First, we performed genome-wide association study (GWAS), candidate gene analysis, and functional analysis to detect the causal variants that affect IMP, inosine, and hypoxanthine. Second, we evaluated the allele frequencies in the different breeds, the contribution of genetic variance, and the effect on other economical traits using the detected variants. A total of 574 Japanese Black cattle were genotyped using the Illumina BovineSNP50 BeadChip and were then used for GWAS. The results of GWAS showed that the genome-wide significant single nucleotide polymorphisms (SNPs) on BTA9 were detected for IMP, inosine, and hypoxanthine. The ecto-5'-nucleotidase (NT5E) gene, which encodes the enzyme NT5E for the extracellular degradation of IMP to inosine, was located near the significant region on BTA9. The results of candidate gene analysis and functional analysis showed that two non-synonymous SNPs (c.1318C > T and c.1475 T > A) in NT5E affected the amount of IMP and its degradation products in beef by regulating the enzymatic activity of NT5E. The Q haplotype showed a positive effect on IMP and a negative effect on the enzymatic activity of NT5E in IMP degradation. The two SNPs were under perfect linkage disequilibrium in five different breeds, and different haplotype frequencies were seen among breeds. The two SNPs contribute to about half of the total genetic variance in IMP, and the results of genetic relationship between IMP and its degradation products showed that NT5E affected the overall concentration balance of IMP and its degradation products. In addition, the SNPs in NT5E did not have an unfavorable effect on the other economical traits. Based on all the above findings taken together, two non-synonymous SNPs in NT5E would be useful for improving IMP and its degradation products by marker-assisted selection in Japanese Black cattle.
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs
USDA-ARS?s Scientific Manuscript database
Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) has been developed as an affordable application of next-generation sequencing for the purposes of discovering and genotyping SNPs in a variety of crop species and populations. In this study we employed a double restriction enzyme digestion protocol (HindIII and NlaIII)...
Malenfant, René M; Coltman, David W; Davis, Corey S
2015-05-01
Single-nucleotide polymorphisms (SNPs) offer numerous advantages over anonymous markers such as microsatellites, including improved estimation of population parameters, finer-scale resolution of population structure and more precise genomic dissection of quantitative traits. However, many SNPs are needed to equal the resolution of a single microsatellite, and reliable large-scale genotyping of SNPs remains a challenge in nonmodel species. Here, we document the creation of a 9K Illumina Infinium BeadChip for polar bears (Ursus maritimus), which will be used to investigate: (i) the fine-scale population structure among Canadian polar bears and (ii) the genomic architecture of phenotypic traits in the Western Hudson Bay subpopulation. To this end, we used restriction-site associated DNA (RAD) sequencing from 38 bears across their circumpolar range, as well as blood/fat transcriptome sequencing of 10 individuals from Western Hudson Bay. Six-thousand RAD SNPs and 3000 transcriptomic SNPs were selected for the chip, based primarily on genomic spacing and gene function respectively. Of the 9000 SNPs ordered from Illumina, 8042 were successfully printed, and - after genotyping 1450 polar bears - 5441 of these SNPs were found to be well clustered and polymorphic. Using this array, we show rapid linkage disequilibrium decay among polar bears, we demonstrate that in a subsample of 78 individuals, our SNPs detect known genetic structure more clearly than 24 microsatellites genotyped for the same individuals and that these results are not driven by the SNP ascertainment scheme. Here, we present one of the first large-scale genotyping resources designed for a threatened species. © 2014 John Wiley & Sons Ltd.
Bahri, Bochra A; Daverdin, Guillaume; Xu, Xiangyang; Cheng, Jan-Fang; Barry, Kerrie W; Brummer, E Charles; Devos, Katrien M
2018-06-14
Advances in genomic technologies have expanded our ability to accurately and exhaustively detect natural genomic variants that can be applied in crop improvement and to increase our knowledge of plant evolution and adaptation. Switchgrass (Panicum virgatum L.), an allotetraploid (2n = 4× = 36) perennial C4 grass (Poaceae family) native to North America and a feedstock crop for cellulosic biofuel production, has a large potential for genetic improvement due to its high genotypic and phenotypic variation. In this study, we analyzed single nucleotide polymorphism (SNP) variation in 372 switchgrass genotypes belonging to 36 accessions for 12 genes putatively involved in biomass production to investigate signatures of selection that could have led to ecotype differentiation and to population adaptation to geographic zones. A total of 11,682 SNPs were mined from ~ 15 Gb of sequence data, out of which 251 SNPs were retained after filtering. Population structure analysis largely grouped upland accessions into one subpopulation and lowland accessions into two additional subpopulations. The most frequent SNPs were in homozygous state within accessions. Sixty percent of the exonic SNPs were non-synonymous and, of these, 45% led to non-conservative amino acid changes. The non-conservative SNPs were largely in linkage disequilibrium with one haplotype being predominantly present in upland accessions while the other haplotype was commonly present in lowland accessions. Tajima's test of neutrality indicated that PHYB, a gene involved in photoperiod response, was under positive selection in the switchgrass population. PHYB carried a SNP leading to a non-conservative amino acid change in the PAS domain, a region that acts as a sensor for light and oxygen in signal transduction. Several non-conservative SNPs in genes potentially involved in plant architecture and adaptation have been identified and led to population structure and genetic differentiation of ecotypes in switchgrass. We suggest here that PHYB is a key gene involved in switchgrass natural selection. Further analyses are needed to determine whether any of the non-conservative SNPs identified play a role in the differential adaptation of upland and lowland switchgrass.
Chai, H C; Phipps, M E; Othman, I; Tan, L P; Chua, K H
2013-02-01
Human leukocyte antigen (HLA) antigens and genes have long been reported associated with systemic lupus erythematosus (SLE) susceptibility in many populations. With the advance in technologies such as genome-wide association studies, many newly discovered SLE-associated single-nucleotide polymorphisms (SNPs) have been reported in recent years. These include HLA-DRB1/HLA-DQA1 rs9271366 and HLA-DQB1/HLA-DQA2 rs9275328. Our aim was to investigate these SNPs in a Malaysian SLE cohort. SNPs rs9271366 and rs9275328 were screened across 790 Malaysian citizens from three ethnic groups (360 patients and 430 healthy volunteers) by Taqman SNP genotyping assays. Allele and genotyping frequencies, Hardy-Weinberg equilibrium, Fisher's exact test and odds ratio were calculated for each SNP and ethnic group. Linkage disequilibrium and interaction between the two SNPs were also evaluated. The minor allele G and its homozygous genotype GG of HLA-DRB1/HLA-DQA1 rs9271366 significantly increased the SLE susceptibility in Malaysian patients, including those of Malay and Chinese ethnicity (odds ratio (OR) > 1, p < 0.05). As for HLA-DQB1/HLA-DQA2 rs9275328, the minor allele T and the heterozygous genotype CT conferred protective effect to SLE in Malaysians, as well as in Malays and Chinese, by having OR < 1 and p value <0.05. Both SNPs did not show associations to SLE in Indians. D' and r (2) values for the two SNPs in LD analysis were 0.941 and 0.065, respectively, with haplotype GC and AT being significantly associated with SLE (p < 5.0 × 10(-4)) after 10,000 permutations were performed. The MDR test clustered the genotype combinations of GG and CC, and AG and CC of rs9271366 and rs9275328, accordingly, as high-risk group, and the two SNPs interacted redundantly by removing 1.96% of the entropy. Our findings suggest that in addition to some classical HLA variants, rs9271366 and rs9275328 are additional polymorphisms worth considering in the Malaysian and possibly in a larger Asian SLE scenario.
Wang, Yanru; Freedman, Jennifer A; Liu, Hongliang; Moorman, Patricia G; Hyslop, Terry; George, Daniel J; Lee, Norman H; Patierno, Steven R; Wei, Qingyi
2017-08-15
Evidence suggests that cells with a stemness phenotype play a pivotal role in oncogenesis, and prostate cells exhibiting this phenotype have been identified. We used two genome-wide association study (GWAS) datasets of African descendants, from the Multiethnic/Minority Cohort Study of Diet and Cancer (MEC) and the Ghana Prostate Study, and two GWAS datasets of non-Hispanic whites, from the Prostate, Lung, Colorectal and Ovarian (PLCO) Cancer Screening Trial and the Breast and Prostate Cancer Cohort Consortium (BPC3), to analyze the associations between genetic variants of stemness-related genes and racial disparities in susceptibility to prostate cancer. We evaluated associations of single-nucleotide polymorphisms (SNPs) in 25 stemness-related genes with prostate cancer risk in 1,609 cases and 2,550 controls of non-Hispanic whites (4,934 SNPs) and 1,144 cases and 1,116 controls of African descendants (5,448 SNPs) with correction by false discovery rate ≤0.2. We identified 32 SNPs in five genes (TP63, ALDH1A1, WNT1, MET and EGFR) that were significantly associated with prostate cancer risk, of which six SNPs in three genes (TP63, ALDH1A1 and WNT1) and eight EGFR SNPs showed heterogeneity in susceptibility between these two racial groups. In addition, 13 SNPs in MET and one in ALDH1A1 were found only in African descendants. The in silico bioinformatics analyses revealed that EGFR rs2072454 and SNPs in linkage with the identified SNPs in MET and ALDH1A1 (r 2 > 0.6) were predicted to regulate RNA splicing. These variants may serve as novel biomarkers for racial disparities in prostate cancer risk. © 2017 UICC.
Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M
2015-04-09
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.
Greenberg, David A; Zhang, Junying; Shmulewitz, Dvora; Strug, Lisa J; Zimmerman, Regina; Singh, Veena; Marathe, Sudhir
2005-12-30
The Genetic Analysis Workshop 14 simulated dataset was designed 1) To test the ability to find genes related to a complex disease (such as alcoholism). Such a disease may be given a variety of definitions by different investigators, have associated endophenotypes that are common in the general population, and is likely to be not one disease but a heterogeneous collection of clinically similar, but genetically distinct, entities. 2) To observe the effect on genetic analysis and gene discovery of a complex set of gene x gene interactions. 3) To allow comparison of microsatellite vs. large-scale single-nucleotide polymorphism (SNP) data. 4) To allow testing of association to identify the disease gene and the effect of moderate marker x marker linkage disequilibrium. 5) To observe the effect of different ascertainment/disease definition schemes on the analysis. Data was distributed in two forms. Data distributed to participants contained about 1,000 SNPs and 400 microsatellite markers. Internet-obtainable data consisted of a finer 10,000 SNP map, which also contained data on controls. While disease characteristics and parameters were constant, four "studies" used varying ascertainment schemes based on differing beliefs about disease characteristics. One of the studies contained multiplex two- and three-generation pedigrees with at least four affected members. The simulated disease was a psychiatric condition with many associated behaviors (endophenotypes), almost all of which were genetic in origin. The underlying disease model contained four major genes and two modifier genes. The four major genes interacted with each other to produce three different phenotypes, which were themselves heterogeneous. The population parameters were calibrated so that the major genes could be discovered by linkage analysis in most datasets. The association evidence was more difficult to calibrate but was designed to find statistically significant association in 50% of datasets. We also simulated some marker x marker linkage disequilibrium around some of the genes and also in areas without disease genes. We tried two different methods to simulate the linkage disequilibrium.
Genome-wide scan reveals LEMD3 and WIF1 on SSC5 as the candidates for porcine ear size.
Zhang, Longchao; Liang, Jing; Luo, Weizhen; Liu, Xin; Yan, Hua; Zhao, Kebin; Shi, Huibi; Zhang, Yuebo; Wang, Ligang; Wang, Lixian
2014-01-01
The quantitative trait loci (QTL) for porcine ear size was previously reported to mainly focus on SSC5 and SSC7. Recently, a missense mutation, G32E, in PPARD in the QTL interval on SSC7 was identified as the causative mutation for ear size. However, on account of the large interval of QTL, the responsible gene on SSC5 has not been identified. In this study, an intercross population was constructed from the large-eared Minzhu, an indigenous Chinese pig breed, and the Western commercial Large White pig to examine the genetic basis of ear size diversity. A GWAS was performed to detect SNPs significantly associated with ear size. Thirty-five significant SNPs defined a 10.78-Mb (30.14-40.92 Mb) region on SSC5. Further, combining linkage disequilibrium and haplotype sharing analysis, a reduced region of 3.07-Mb was obtained. Finally, by using a selective sweep analysis, a critical region of about 450-kb interval containing two annotated genes LEMD3 and WIF1 was refined in this work. Functional analysis indicated that both represent biological candidates for porcine ear size, with potential application in breeding programs. The two genes could also be used as novel references for further study of the mechanism underlying human microtia.
Genetic association between ghrelin polymorphisms and Alzheimer's disease in a Japanese population.
Shibata, Nobuto; Ohnuma, Tohru; Kuerban, Bolati; Komatsu, Miwa; Arai, Heii
2011-01-01
Ghrelin has been reported to enter the hippocampus and to bind to the neurons of the hippocampal formation. This peptide also affects neuronal glucose uptake and decreases tau hyperphosphorylation. There is increasing evidence suggesting an association between ghrelin and Alzheimer's disease (AD) pathology. The aim of this study was to investigate whether single nucleotide polymorphisms (SNPs) of the ghrelin gene are associated with AD. The SNPs were genotyped using TaqMan technology and were analyzed using a case-control study design. Our case-control dataset consisted of 182 AD patients and 143 age-matched controls. Hardy-Weinberg equilibrium and linkage disequilibrium analyses suggest that the region in and around the gene is highly polymorphic. One SNP, rs4684677 (Leu90Gln), showed a marginal association with age of AD onset. We did not detect any association between the other SNPs of the ghrelin gene and AD. There have been few genetic studies on the relationship between circulating ghrelin and functional SNPs. Further multifactorial studies are needed to clarify the relationship between ghrelin and AD. Copyright © 2011 S. Karger AG, Basel.
High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).
Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C
2016-03-01
Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.
Pereira, Rinaldo Wellerson; Pena, Sérgio D J
2006-01-01
We studied five microsatellites (DXS995, DXS8076, DXS8114, DXS1002 and DXS1050) located in a region of very low recombination rate in the long arm of the human X chromosome (Xq13.3-Xq21.3). No recombination was seen in 291 meioses in CEPH families. To test whether haplotypes composed of the five microsatellites could differentiate among distinct human continental populations, we studied an international panel containing 72 males from Africa, Europe, Asia and the America. Haplotypic diversity was very high within these groups and no haplotypes were shared among them. This led to the hope that we might be able to identify continent-specific lineages. However, in a median joining network there was no clear discrimination of the different continental groups. We then tested whether we could identify X chromosomal lineages from different continental origins in Brazilians. We typed 180 white Brazilians from four different geographical regions and examined their proportions of haplotype sharing with Africans, Asians, Europeans and Amerindians. No phylogeographical patterns emerged from the data. Moreover, there were several instances of the same haplotype being shared by many (and in one instance all) groups, suggesting that recombination might be occurring. We thus studied pairwise the level of linkage disequilibrium (LD) between the microsatellites. No detectable linkage disequilibrium between the most external loci DXS995 and DXS1050 was observed. Thus, even though recombination may be absent on short time spans, as seen in the CEPH pedigrees, on a long term basis it occurs often enough to dissipate all linkage disequilibrium. On the other hand, we observed very strong linkage disequilibrium between the pairs DXS995/DXS8076 and DXS1002/DXS8114, raising the possibility of resequencing the segment between them to identify single nucleotide polymorphisms (SNPs) in their intervals. The combination of X-linked microsatellites and SNPs in strong linkage disequilibrium might provide a powerful new tool to investigate human demographic history.
Greenwood, Tiffany A; Lazzeroni, Laura C; Calkins, Monica E; Freedman, Robert; Green, Michael F; Gur, Raquel E; Gur, Ruben C; Light, Gregory A; Nuechterlein, Keith H; Olincy, Ann; Radant, Allen D; Seidman, Larry J; Siever, Larry J; Silverman, Jeremy M; Stone, William S; Sugar, Catherine A; Swerdlow, Neal R; Tsuang, Debby W; Tsuang, Ming T; Turetsky, Bruce I; Braff, David L
2016-01-01
The Consortium on the Genetics of Schizophrenia Family Study (COGS-1) has previously reported our efforts to characterize the genetic architecture of 12 primary endophenotypes for schizophrenia. We now report the characterization of 13 additional measures derived from the same endophenotype test paradigms in the COGS-1 families. Nine of the measures were found to discriminate between schizophrenia patients and controls, were significantly heritable (31 to 62%), and were sufficiently independent of previously assessed endophenotypes, demonstrating utility as additional endophenotypes. Genotyping via a custom array of 1536 SNPs from 94 candidate genes identified associations for CTNNA2, ERBB4, GRID1, GRID2, GRIK3, GRIK4, GRIN2B, NOS1AP, NRG1, and RELN across multiple endophenotypes. An experiment-wide p value of 0.003 suggested that the associations across all SNPs and endophenotypes collectively exceeded chance. Linkage analyses performed using a genome-wide SNP array further identified significant or suggestive linkage for six of the candidate endophenotypes, with several genes of interest located beneath the linkage peaks (e.g., CSMD1, DISC1, DLGAP2, GRIK2, GRIN3A, and SLC6A3). While the partial convergence of the association and linkage likely reflects differences in density of gene coverage provided by the distinct genotyping platforms, it is also likely an indication of the differential contribution of rare and common variants for some genes and methodological differences in detection ability. Still, many of the genes implicated by COGS through endophenotypes have been identified by independent studies of common, rare, and de novo variation in schizophrenia, all converging on a functional genetic network related to glutamatergic neurotransmission that warrants further investigation. Copyright © 2015 Elsevier B.V. All rights reserved.
Lu, Xia; Luan, Sheng; Hu, Long Yang; Mao, Yong; Tao, Ye; Zhong, Sheng Ping; Kong, Jie
2016-06-01
The Kuruma prawn, Marsupenaeus japonicus, is one of the most promising marine invertebrates in the industry in Asia, Europe and Australia. However, the increasing global temperatures result in considerable economic losses in M. japonicus farming. In the present study, to select genetically improved animals for the sustainable development of the Kuruma prawn industry, a high-resolution genetic linkage map and quantitative trait locus (QTL) identification were performed using the RAD technology. The maternal map contained 5849 SNP markers and spanned 3127.23 cM, with an average marker interval of 0.535 cM. Instead, the paternal map contained 3927 SNP markers and spanned 3326.19 cM, with an average marker interval of 0.847 cM. The consensus map contained 9289 SNP markers and spanned 3610.90 cM, with an average marker interval of 0.388 cM and coverage of 99.06 % of the genome. The markers were grouped into 41 linkage groups in the maps. Significantly, negative correlation was detected between high-temperature tolerance (UTT) and body weight (BW). The QTL mapping revealed 129 significant QTL loci for UTT and four significant QTL loci for BW at the genome-wide significance threshold. Among these QTLs, 129 overlapped with linked SNPs, and the remaining four were located in regions between contiguous SNPs. They explained the total phenotypic variance ranging from 8.9 to 12.4 %. Because of a significantly negative correlation between growth and high-temperature tolerance, we demonstrate that this high-resolution linkage map and QTLs would be useful for further marker-assisted selection in the genetic improvement of M. japonicus.
Xiong, Lan; Dion, Patrick; Montplaisir, Jacques; Levchenko, Anastasia; Thibodeau, Pascale; Karemera, Liliane; Rivière, Jean-Baptiste; St-Onge, Judith; Gaspar, Claudia; Dubé, Marie-Pierre; Desautels, Alex; Turecki, Gustavo; Rouleau, Guy A
2007-10-05
Converging evidence from clinical observations, brain imaging and pathological findings strongly indicate impaired brain iron regulation in restless legs syndrome (RLS). Animal models with mutation in (DMT1) divalent metal transporter 1 gene, an important brain iron transporter, demonstrate a similar iron deficiency profile as found in RLS brain. The human DMT1 gene, mapped to chromosome 12q near the RLS1 locus, qualifies as an excellent functional and possible positional candidate for RLS. DMT1 protein levels were assessed in lymphoblastoid cell lines from RLS patients and controls. Linkage analyses were carried out with markers flanking and within the DMT1 gene. Selected patient samples from RLS families with compatible linkage to the RLS1 locus on 12q were fully sequenced in both the coding regions and the long stretches of UTR sequences. Finally, selected sequence variants were further studied in case/control and family-based association tests. A clinical association of anemia and RLS was further confirmed in this study. There was no detectable difference in DMT1 protein levels between RLS patient lymphoblastoid cell lines and normal controls. Non-parametric linkage analyses failed to identify any significant linkage signals within the DMT1 gene region. Sequencing of selected patients did not detect any sequence variant(s) compatible with DMT1 harboring RLS causative mutation(s). Further studies did not find any association between ten SNPs, spanning the whole DMT1 gene region, and RLS affection status. Finally, two DMT1 intronic SNPs showed positive association with RLS in patients with a history of anemia, when compared to RLS patients without anemia. (c) 2007 Wiley-Liss, Inc.
2014-01-01
Background The accessibility of high-throughput genotyping technologies has contributed greatly to the development of genomic resources in non-model organisms. High-density genotyping arrays have only recently been developed for some economically important species such as conifers. The potential for using genomic technologies in association mapping and breeding depends largely on the genome wide patterns of diversity and linkage disequilibrium in current breeding populations. This study aims to deepen our knowledge regarding these issues in maritime pine, the first species used for reforestation in south western Europe. Results Using a new map merging algorithm, we first established a 1,712 cM composite linkage map (comprising 1,838 SNP markers in 12 linkage groups) by bringing together three already available genetic maps. Using rigorous statistical testing based on kernel density estimation and resampling we identified cold and hot spots of recombination. In parallel, 186 unrelated trees of a mass-selected population were genotyped using a 12k-SNP array. A total of 2,600 informative SNPs allowed to describe historical recombination, genetic diversity and genetic structure of this recently domesticated breeding pool that forms the basis of much of the current and future breeding of this species. We observe very low levels of population genetic structure and find no evidence that artificial selection has caused a reduction in genetic diversity. By combining these two pieces of information, we provided the map position of 1,671 SNPs corresponding to 1,192 different loci. This made it possible to analyze the spatial pattern of genetic diversity (H e ) and long distance linkage disequilibrium (LD) along the chromosomes. We found no particular pattern in the empirical variogram of H e across the 12 linkage groups and, as expected for an outcrossing species with large effective population size, we observed an almost complete lack of long distance LD. Conclusions These results are a stepping stone for the development of strategies for studies in population genomics, association mapping and genomic prediction in this economical and ecologically important forest tree species. PMID:24581176
O'Brien, Timothy D; Jia, Peilin; Caporaso, Neil E; Landi, Maria Teresa; Zhao, Zhongming
2018-02-27
There are two main types of lung cancer: small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). NSCLC has many subtypes, but the two most common are lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC). These subtypes are mainly classified by physiological and pathological characteristics, although there is increasing evidence of genetic and molecular differences as well. Although some work has been done at the somatic level to explore the genetic and biological differences among subtypes, little work has been done that interrogates these differences at the germline level to characterize the unique and shared susceptibility genes for each subtype. We used single-nucleotide polymorphisms (SNPs) from a genome-wide association study (GWAS) of European samples to interrogate the similarity of the subtypes at the SNP, gene, pathway, and regulatory levels. We expanded these genotyped SNPs to include all SNPs in linkage disequilibrium (LD) using data from the 1000 Genomes Project. We mapped these SNPs to several lung tissue expression quantitative trait loci (eQTL) and enhancer datasets to identify regulatory SNPs and their target genes. We used these genes to perform a biological pathway analysis for each subtype. We identified 8295, 8734, and 8361 SNPs with moderate association signals for LUAD, LUSC, and SCLC, respectively. Those SNPs had p < 1 × 10 - 3 in the original GWAS or were within LD (r 2 > 0.8, Europeans) to the genotyped SNPs. We identified 215, 320, and 172 disease-associated genes for LUAD, LUSC, and SCLC, respectively. Only five genes (CHRNA5, IDH3A, PSMA4, RP11-650 L12.2, and TBC1D2B) overlapped all subtypes. Furthermore, we observed only two pathways from the Kyoto Encyclopedia of Genes and Genomes shared by all subtypes. At the regulatory level, only three eQTL target genes and two enhancer target genes overlapped between all subtypes. Our results suggest that the three lung cancer subtypes do not share much genetic signal at the SNP, gene, pathway, or regulatory level, which differs from the common subtype classification based upon histology. However, three (CHRNA5, IDH3A, and PSMA4) of the five genes shared between the subtypes are well-known lung cancer genes that may act as general lung cancer genes regardless of subtype.
Ito, Yoshihito; Nakamura, Yukako; Takahashi, Nagahide; Saito, Shinichi; Aleksic, Branko; Iwata, Nakao; Inada, Toshiya; Ozaki, Norio
2008-06-13
The FXYD domain containing ion transport regulator 6 (FXYD6) gene is located within a region of chromosome 11 (11q23.3) that has been shown by a number of genome scans to be one of the most well-established linkages to schizophrenia. FXYD6 encodes the protein phosphohippolin, which is primarily expressed in the brain. Phosphohippolin modulates the kinetic activity of Na,K-ATPase and has long-term physiological importance in maintaining cation homeostasis. A recent study reported that FXYD6 was associated with schizophrenia in the United Kingdom samples. Applying the gene-based association concept, we carried out an association study regarding FXYD6 and schizophrenia in a Japanese population, with a sample consisting of 2026 subjects (906 schizophrenics and 1120 controls). After linkage disequilibrium analysis, 23 single nucleotide polymorphisms (SNPs) were genotyped using 5'-exonuclease allelic discrimination assay. We found a significant association of two SNPs (rs11216573; genotypic P value: 0.022 and rs555577; genotypic P value: 0.026, allelic P value: 0.011, uncorrected). Nominal P values did not survive correction for multiple testing (rs11216573; genotypic P value: 0.47 and rs555577; genotypic P value: 0.55, allelic P value: 0.24, after SNPSpD correction). No association was observed between schizophrenia patients and controls in allelic, genotypic and haplotypic analyses. Our findings suggest that FXYD6 is unlikely to be related to the development of schizophrenia in a Japanese population.
Sun, Xiaomei; Li, Mingxun; Hao, Dan; Hua, Liushuai; Lan, Xianyong; Lei, Chuzhao; Hu, Shenrong; Qi, Xinglei; Chen, Hong
2015-03-01
Identification of polymorphisms associated with economic traits is important for successful marker-assisted selection in cattle breeding. The family of mammalian sirtuin regulates many biological functions, such as life span extension and energy metabolism. SIRT2, a most abundant sirtuin in adipocytes, acts as a crucial regulator of adipogenic differentiation and plays a key role in controlling adipose tissue function and mass. Here we investigated single nucleotide polymorphisms (SNPs) of bovine SIRT2 in 1226 cattle from five breeds and further evaluated the effects of identified SNPs on economically important traits of Nanyang cattle. Our results revealed four novel SNPs in bovine SIRT2, one was located in intronic region and the other three were synonymous mutations. Linkage disequilibrium and haplotype analyses based on the identified SNPs showed obvious difference between crossbred breed and the other four beef breeds. Association analyses demonstrated that SNPs g.17333C > T and g.17578A > G have a significantly effect on 18-months-old body weight of Nanyang population. Animals with combined genotype TTGG at the above two loci exhibited especially higher body weight. Our data for the first time demonstrated that polymorphisms in bovine SIRT2 are associated with economic traits of Nanyang cattle, which will be helpful for future cattle selection practices.
Larson, Wesley; Palti, Yniv; Gao, G.; Warheit, Kenneth I.; Seeb, James E.
2017-01-01
Natural-origin steelhead trout (Oncorhynchus mykiss (Walbaum, 1792)) in the Pacific Northwest, USA, are threatened by a number of factors including habitat destruction, disease, decline in marine survival, and a potential erosion of genetic viability due to introgression from hatchery strains. Our major goal was to use a recently developed SNP array containing ∼57 000 SNPs to identify a subset of SNPs that differentiate hatchery and natural-origin populations. We analyzed 35 765 polymorphic SNPs in nine populations of steelhead trout sampled from Puget Sound, Washington, USA. We then conducted two outlier tests and found 360 loci that were candidates for divergent selection between hatchery and natural-origin populations (mean FCT = 0.29, maximum = 0.65) and 595 SNPs that were candidates for selection among natural-origin populations (mean FST = 0.25, maximum = 0.51). Comparisons with a linkage map revealed that two chromosomes (Omy05 and Omy25) contained significantly more outliers than other chromosomes, suggesting that regions on Omy05 and Omy25 may be of adaptive significance. Our results highlight several advantages of the 57 000 SNP array as a tool for population and conservation genomics studies.
Genetic Variation in FABP4 and Evaluation of Its Effects on Beef Cattle Fat Content.
Goszczynski, Daniel E; Papaleo-Mazzucco, Juliana; Ripoli, María V; Villarreal, Edgardo L; Rogberg-Muñoz, Andrés; Mezzadra, Carlos A; Melucci, Lilia M; Giovambattista, Guillermo
2017-07-03
FABP4 is a protein primarily expressed in adipocytes and macrophages that plays a key role in fatty acid trafficking and lipid hydrolysis. FABP4 gene polymorphisms have been associated with meat quality traits in cattle, mostly in Asian breeds under feedlot conditions. The objectives of this work were to characterize FABP4 genetic variation in several worldwide cattle breeds and evaluate possible genotype effects on fat content in a pasture-fed crossbred (Angus-Hereford-Limousin) population. We re-sequenced 43 unrelated animals from nine cattle breeds (Angus, Brahman, Creole, Hereford, Holstein, Limousin, Nelore, Shorthorn, and Wagyu) and obtained 22 single nucleotide polymorphisms (SNPs) over 3,164 bp, including four novel polymorphisms. Haplotypes and linkage disequilibrium analyses showed a high variability. Five SNPs were selected to perform validation and association studies in our crossbred population. Four SNPs showed well-balanced allele frequencies (minor frequency > 0.159), and three showed no significant deviations from Hardy-Weinberg proportions. SNPs showed significant effects on backfat thickness and fatty acid composition (P < 0.05). The protein structure of one of the missense SNPs was analyzed to elucidate its possible effect on fat content in our studied population. Our results revealed a possible blockage of the fatty acid binding site by the missense mutation.
Zhernakova, Alexandra; Stahl, Eli A.; Trynka, Gosia; Raychaudhuri, Soumya; Festen, Eleanora A.; Franke, Lude; Westra, Harm-Jan; Fehrmann, Rudolf S. N.; Kurreeman, Fina A. S.; Thomson, Brian; Gupta, Namrata; Romanos, Jihane; McManus, Ross; Ryan, Anthony W.; Turner, Graham; Brouwer, Elisabeth; Posthumus, Marcel D.; Remmers, Elaine F.; Tucci, Francesca; Toes, Rene; Grandone, Elvira; Mazzilli, Maria Cristina; Rybak, Anna; Cukrowska, Bozena; Coenen, Marieke J. H.; Radstake, Timothy R. D. J.; van Riel, Piet L. C. M.; Li, Yonghong; de Bakker, Paul I. W.; Gregersen, Peter K.; Worthington, Jane; Siminovitch, Katherine A.; Klareskog, Lars; Huizinga, Tom W. J.
2011-01-01
Epidemiology and candidate gene studies indicate a shared genetic basis for celiac disease (CD) and rheumatoid arthritis (RA), but the extent of this sharing has not been systematically explored. Previous studies demonstrate that 6 of the established non-HLA CD and RA risk loci (out of 26 loci for each disease) are shared between both diseases. We hypothesized that there are additional shared risk alleles and that combining genome-wide association study (GWAS) data from each disease would increase power to identify these shared risk alleles. We performed a meta-analysis of two published GWAS on CD (4,533 cases and 10,750 controls) and RA (5,539 cases and 17,231 controls). After genotyping the top associated SNPs in 2,169 CD cases and 2,255 controls, and 2,845 RA cases and 4,944 controls, 8 additional SNPs demonstrated P<5×10−8 in a combined analysis of all 50,266 samples, including four SNPs that have not been previously confirmed in either disease: rs10892279 near the DDX6 gene (Pcombined = 1.2×10−12), rs864537 near CD247 (Pcombined = 2.2×10−11), rs2298428 near UBE2L3 (Pcombined = 2.5×10−10), and rs11203203 near UBASH3A (Pcombined = 1.1×10−8). We also confirmed that 4 gene loci previously established in either CD or RA are associated with the other autoimmune disease at combined P<5×10−8 (SH2B3, 8q24, STAT4, and TRAF1-C5). From the 14 shared gene loci, 7 SNPs showed a genome-wide significant effect on expression of one or more transcripts in the linkage disequilibrium (LD) block around the SNP. These associations implicate antigen presentation and T-cell activation as a shared mechanism of disease pathogenesis and underscore the utility of cross-disease meta-analysis for identification of genetic risk factors with pleiotropic effects between two clinically distinct diseases. PMID:21383967
Zhernakova, Alexandra; Stahl, Eli A; Trynka, Gosia; Raychaudhuri, Soumya; Festen, Eleanora A; Franke, Lude; Westra, Harm-Jan; Fehrmann, Rudolf S N; Kurreeman, Fina A S; Thomson, Brian; Gupta, Namrata; Romanos, Jihane; McManus, Ross; Ryan, Anthony W; Turner, Graham; Brouwer, Elisabeth; Posthumus, Marcel D; Remmers, Elaine F; Tucci, Francesca; Toes, Rene; Grandone, Elvira; Mazzilli, Maria Cristina; Rybak, Anna; Cukrowska, Bozena; Coenen, Marieke J H; Radstake, Timothy R D J; van Riel, Piet L C M; Li, Yonghong; de Bakker, Paul I W; Gregersen, Peter K; Worthington, Jane; Siminovitch, Katherine A; Klareskog, Lars; Huizinga, Tom W J; Wijmenga, Cisca; Plenge, Robert M
2011-02-01
Epidemiology and candidate gene studies indicate a shared genetic basis for celiac disease (CD) and rheumatoid arthritis (RA), but the extent of this sharing has not been systematically explored. Previous studies demonstrate that 6 of the established non-HLA CD and RA risk loci (out of 26 loci for each disease) are shared between both diseases. We hypothesized that there are additional shared risk alleles and that combining genome-wide association study (GWAS) data from each disease would increase power to identify these shared risk alleles. We performed a meta-analysis of two published GWAS on CD (4,533 cases and 10,750 controls) and RA (5,539 cases and 17,231 controls). After genotyping the top associated SNPs in 2,169 CD cases and 2,255 controls, and 2,845 RA cases and 4,944 controls, 8 additional SNPs demonstrated P<5 × 10(-8) in a combined analysis of all 50,266 samples, including four SNPs that have not been previously confirmed in either disease: rs10892279 near the DDX6 gene (P(combined) = 1.2 × 10(-12)), rs864537 near CD247 (P(combined) = 2.2 × 10(-11)), rs2298428 near UBE2L3 (P(combined) = 2.5 × 10(-10)), and rs11203203 near UBASH3A (P(combined) = 1.1 × 10(-8)). We also confirmed that 4 gene loci previously established in either CD or RA are associated with the other autoimmune disease at combined P<5 × 10(-8) (SH2B3, 8q24, STAT4, and TRAF1-C5). From the 14 shared gene loci, 7 SNPs showed a genome-wide significant effect on expression of one or more transcripts in the linkage disequilibrium (LD) block around the SNP. These associations implicate antigen presentation and T-cell activation as a shared mechanism of disease pathogenesis and underscore the utility of cross-disease meta-analysis for identification of genetic risk factors with pleiotropic effects between two clinically distinct diseases.
Gan, W; Song, Q; Zhang, N N; Xiong, X P; Wang, D M C; Li, L
2015-06-18
The fat mass and obesity-associated gene (FTO) is an excellent candidate gene that affects energy metabolism. Single nucleotide polymorphisms (SNPs) in FTO are associated with carcass and meat quality traits in pigs, cattle, and rabbits. The aim of this study was to investigate the association between novel SNPs in the FTO coding region and carcass and meat quality traits in 95 crossbred ducks, using DNA sequencing. We found two transitions G/A (SNP 387 and 473) within exon 3. SNP 387 was a synonymous mutation, whereas SNP 473 was a missense mutation. Association analysis suggested that SNP g.387G>A was significantly associated with all of the carcass traits measured, the intramuscular fat content (IMF), cooking yield (CY), pH values 45 min after slaughter (pH45m), drip losses from the breast muscle, and the leg muscle (P < 0.05). For SNP g.473G>A, the genotype AA exhibited greater leg muscle weight than the genotypes GG or AG (P < 0.05). The D value suggested that the two SNPs exhibited strong linkage disequilibrium. Three haplotypes (G1G2, G1A2, and A1A2) were significantly associated with IMF, CY, the a* value, and all of the carcass traits measured (P < 0.05). The results suggest that FTO is a candidate locus that affects carcass and meat quality traits in ducks.
Haplotype analysis of the apolipoprotein gene cluster on human chromosome 11
Olivier, Michael; Wang, Xujing; Cole, Regina; Gau, Brian; Kim, Jessica; Rubin, Edward M.; Pennacchio, Len A.
2009-01-01
Members of the apolipoprotein gene cluster (APOA1/C3/A4/A5) on human chromosome 11q23 play an important role in lipid metabolism. Polymorphisms in both APOA5 and APOC3 are strongly associated with plasma triglyceride concentrations. The close genomic locations of these two genes as well as their functional similarity have hindered efforts to define whether each gene independently influences human triglyceride concentrations. In this study, we examined the linkage disequilibrium and haplotype structure of 49 SNPs in a 150-kb region spanning the gene cluster. We identified a total of five common APOA5 haplotypes with a frequency of greater than 8% in samples of northern European origin. The APOA5 haplotype block did not extend past the 7 SNPs in the gene and was separated from the other apolipoprotein gene in the cluster by a region of significantly increased recombination. Furthermore, one previously identified triglyceride risk haplotype of APOA5 (APOA5*3) showed no association with three APOC3 SNPs previously associated with triglyceride concentrations, in contrast to the other risk haplotype (APOA5*2), which was associated with all three minor APOC3 SNP alleles. These results highlight the complex genetic relationship between APOA5 and APOC3 and support the notion that APOA5 represents an independent risk gene affecting plasma triglyceride concentrations in humans. PMID:15081120
Bian, Yang; De Vries, Brian; Tracy, William F.
2016-01-01
Physiological leaf spotting, or flecking, is a mild-lesion phenotype observed on the leaves of several commonly used maize (Zea mays) inbred lines and has been anecdotally linked to enhanced broad-spectrum disease resistance. Flecking was assessed in the maize nested association mapping (NAM) population, comprising 4,998 recombinant inbred lines from 25 biparental families, and in an association population, comprising 279 diverse maize inbreds. Joint family linkage analysis was conducted with 7,386 markers in the NAM population. Genome-wide association tests were performed with 26.5 million single-nucleotide polymorphisms (SNPs) in the NAM population and with 246,497 SNPs in the association population, resulting in the identification of 18 and three loci associated with variation in flecking, respectively. Many of the candidate genes colocalizing with associated SNPs are similar to genes that function in plant defense response via cell wall modification, salicylic acid- and jasmonic acid-dependent pathways, redox homeostasis, stress response, and vesicle trafficking/remodeling. Significant positive correlations were found between increased flecking, stronger defense response, increased disease resistance, and increased pest resistance. A nonlinear relationship with total kernel weight also was observed whereby lines with relatively high levels of flecking had, on average, lower total kernel weight. We present evidence suggesting that mild flecking could be used as a selection criterion for breeding programs trying to incorporate broad-spectrum disease resistance. PMID:27670817
Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela
2014-01-01
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.
Voruganti, Venkata Saroja; Kent, Jack W.; Debnath, Subrata; Cole, Shelley A.; Haack, Karin; Göring, Harald H. H.; Carless, Melanie A.; Curran, Joanne E.; Johnson, Matthew P.; Almasy, Laura; Dyer, Thomas D.; MacCluer, Jean W.; Moses, Eric K.; Abboud, Hanna E.; Mahaney, Michael C.; Blangero, John; Comuzzie, Anthony G.
2013-01-01
Increased serum uric acid (SUA) is a risk factor for gout and renal and cardiovascular disease (CVD). The purpose of this study was to identify genetic factors that affect the variation in SUA in 632 Mexican Americans participants of the San Antonio Family Heart Study (SAFHS). A genome-wide association (GWA) analysis was performed using the Illumina Human Hap 550K single nucleotide polymorphism (SNP) microarray. We used a linear regression-based association test under an additive model of allelic effect, while accounting for non-independence among family members via a kinship variance component. All analyses were performed in the software package SOLAR. SNPs rs6832439, rs13131257, and rs737267 in solute carrier protein 2 family, member 9 (SLC2A9) were associated with SUA at genome-wide significance (p < 1.3 × 10−7). The minor alleles of these SNPs had frequencies of 36.2, 36.2, and 38.2%, respectively, and were associated with decreasing SUA levels. All of these SNPs were located in introns 3–7 of SLC2A9, the location of the previously reported associations in European populations. When analyzed for association with cardiovascular-renal disease risk factors, conditional on SLC2A9 SNPs strongly associated with SUA, significant associations were found for SLC2A9 SNPs with BMI, body weight, and waist circumference (p < 1.4 × 10−3) and suggestive associations with albumin-creatinine ratio and total antioxidant status (TAS). The SLC2A9 gene encodes an urate transporter that has considerable influence on variation in SUA. In addition to the primary association locus, suggestive evidence (p < 1.9 × 10−6) for joint linkage/association (JLA) was found at a previously-reported urate quantitative trait locus (Logarithm of odds score = 3.6) on 3p26.3. In summary, our GWAS extends and confirms the association of SLC2A9 with SUA for the first time in a Mexican American cohort and also shows for the first time its association with cardiovascular-renal disease risk factors. PMID:24379826
Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela
2014-01-01
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088
Jeske, Y W A; So, A; Kelemen, L; Sukor, N; Willys, C; Bulmer, B; Gordon, R D; Duffy, D; Stowasser, M
2008-04-01
1. There are two types of familial hyperaldosteronism (FH): FH-I and FH-II. FH-I is caused by a hybrid CYP11B1/CYP11B2 gene mutation. The genetic cause of FH-II, which is more common, is unknown. Adrenal hyperplasia and adenomas are features. We previously reported linkage of FH-II to a approximately 5 Mb region on chromosome 7p22. We subsequently reported finding no causative mutations in the retinoblastoma-associated Kruppel-associated box gene (RBaK), a candidate at 7p22 involved in tumorigenesis and cell cycle control. 2. In the current study we investigated RBaK regulatory regions and two other candidate genes: postmeiotic segregation increased 2 (PMS2, involved in DNA mismatch repair and tumour predisposition) and guanine nucleotide-binding protein alpha-12 (GNA12, a transforming oncogene). 3. The GNA12 and PMS2 genes were examined in two affected (A1, A2) and two unaffected (U1, U2) subjects from a large 7p22-linked FH-II family (family 1). No mutations were found. 4. The RBaK and PMS2 distal promoters were sequenced to -2150 bp from the transcription start site for RBaK and-2800 bp for PMS2. Five unreported single nucleotide polymorphisms (SNPs) were found in subjects A1, A2 but not in U1 or U2; A(-2031 bp)T, T(-2030 bp)G, G(-834 bp)C, C(-821 bp)G in RBaK and A(-876 bp)G in PMS2. Additional affected and unaffected subjects from family 1 and from two other 7p22-linked FH-II families and 58 unrelated normotensive control subjects were genotyped for these SNPs. 5. The five novel SNPs were found to be present in a significant proportion of normotensive controls. The four RBaK promoter SNPs were found to be in linkage disequilibrium in the normal population. The RBaK promoter (-)2031T/2030G/834C/821T allele was found to be in linkage disequilibrium with the causative mutation in FH-II family 1, but not in families 2 and 3. The PMS2 promoter (-)876G allele was also found to be linked to affected phenotypes in family 1. 6. The RBaK and PMS2 promoter SNPs alter the binding sites for several transcription factors. Although present in the normal population, it is possible that the RBaK (-)2031T/2030G/834C/821T and PMS2 (-)876G alleles may have functional roles contributing to the FH-II phenotype in family 1.
Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk
Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.
2008-01-01
Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408
The Role of Local Ancestry Adjustment in Association Studies Using Admixed Populations
Zhang, Jianqi; Stram, Daniel O.
2016-01-01
Association analysis using admixed populations imposes challenges and opportunities for disease mapping. By developing some explicit results for the variance of an allele of interest conditional on either local or global ancestry and by simulation of recently admixed genomes we evaluate power and false-positive rates under a variety of scenarios concerning linkage disequilibrium (LD) and the presence of unmeasured variants. Pairwise LD patterns were compared between admixed and nonadmixed populations using the HapMap phase 3 data. Based on the above, we showed that as follows: For causal variants with similar effect size in all populations, power is generally higher in a study using admixed population than using nonadmixed population, especially for highly differentiated SNPs. This gain of power is achieved with adjustment of global ancestry, which completely removes any cross-chromosome inflation of type I error rates, and addresses much of the intrachromosome inflation.If reliably estimated, adjusting for local ancestry precisely recovers the localization that could have been achieved in a stratified analysis of source populations. Improved localization is most evident for highly differentiated SNPs; however, the advantage of higher power is lost on exactly the same differentiated SNPs.In the real admixed populations such as African Americans and Latinos, the expansion of LD is not as dramatic as in our simulation.While adjustment for global ancestry is required prior to announcing a novel association seen in an admixed population, local ancestry adjustment may best be regarded as a localization tool not strictly required for discovery purposes. PMID:25043967
Wu, Zhikun; Wang, Bo; Chen, Xun; Wu, Jiangsheng; King, Graham J; Xiao, Yingjie; Liu, Kede
2016-01-01
High-density genetic markers are the prerequisite for understanding linkage disequilibrium (LD) and genome-wide association studies (GWASs) of complex traits in crops. To evaluate the LD pattern in oilseed rape, we sequenced a previous association panel containing 189 B. napus inbred lines using double-digested restriction-site associated DNA (ddRAD) and genotyped 19,327 RAD tags. A total of 15,921 RAD tags were assigned to a published genetic linkage map and the majority (71.1%) of these tags was uniquely mapped to the draft reference genome "Darmor-bzh." The distance of LD decay was 1,214 kb across the genome at the background level (r2 = 0.26), with the distances of LD decay being 405 kb and 2,111 kb in the A and C subgenomes, respectively. A total of 361 haplotype blocks with length > 100 kb were identified in the entire genome. The association panel could be classified into two groups, P1 and P2, which are essentially consistent with the geographical origins of varieties. A large number of group-specific haplotypes were identified, reflecting that varieties in the P1 and P2 groups experienced distinct selection in breeding programs to adapt their different growth habitats. GWAS repeatedly detected two loci significantly associated with oil content of seeds based on the developed SNPs, suggesting that the high-density SNPs were useful for understanding the genetic determinants of complex traits in GWAS.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Agusa, Tetsuro; Department of Legal Medicine, Shimane University Faculty of Medicine, Enya 89-1, Izumo 693-8501; Iwata, Hisato
To elucidate the role of genetic factors in arsenic (As) metabolism, we studied associations of single nucleotide polymorphisms (SNPs) in As (+ 3 oxidation state) methyltransferase (AS3MT) with the As concentrations in hair and urine, and urinary As profile in residents in the Red River Delta, Vietnam. Concentrations of total As in groundwater were 0.7-502 {mu}g/l. Total As levels in groundwater drastically decreased by using sand filter, indicating that the filter could be effective to remove As from raw groundwater. Concentrations of inorganic As (IAs) in urine and total As in hair of males were higher than those of females.more » A significant positive correlation between monomethylarsonic acid (MMA)/IAs and age in females indicates that older females have higher methylation capacity from IAs to MMA. Body mass index negatively correlated with urinary As concentrations in males. Homozygote for SNPs 4602AA, 35991GG, and 37853GG, which showed strong linkage disequilibrium (LD), had higher percentage (%) of dimethylarsinic acid (DMA) in urine. SNPs 4740 and 12590 had strong LD and associated with urinary %DMA. Although SNPs 6144, 12390, 14215, and 35587 comprised LD cluster, homozygotes in SNPs 12390GG and 35587CC had lower DMA/MMA in urine, suggesting low methylation capacity from MMA to DMA in homo types for these SNPs. SNPs 5913 and 8973 correlated with %MMA and %DMA, respectively. Heterozygote for SNP 14458TC had higher MMA/IAs in urine than TT homozygote, indicating that the heterozygote may have stronger methylation ability of IAs. To our knowledge, this is the first study on the association of genetic factors with As metabolism in Vietnamese.« less
Draft Sequences of the Radish (Raphanus sativus L.) Genome
Kitashiba, Hiroyasu; Li, Feng; Hirakawa, Hideki; Kawanabe, Takahiro; Zou, Zhongwei; Hasegawa, Yoichi; Tonosaki, Kaoru; Shirasawa, Sachiko; Fukushima, Aki; Yokoi, Shuji; Takahata, Yoshihito; Kakizaki, Tomohiro; Ishida, Masahiko; Okamoto, Shunsuke; Sakamoto, Koji; Shirasawa, Kenta; Tabata, Satoshi; Nishio, Takeshi
2014-01-01
Radish (Raphanus sativus L., n = 9) is one of the major vegetables in Asia. Since the genomes of Brassica and related species including radish underwent genome rearrangement, it is quite difficult to perform functional analysis based on the reported genomic sequence of Brassica rapa. Therefore, we performed genome sequencing of radish. Short reads of genomic sequences of 191.1 Gb were obtained by next-generation sequencing (NGS) for a radish inbred line, and 76,592 scaffolds of ≥300 bp were constructed along with the bacterial artificial chromosome-end sequences. Finally, the whole draft genomic sequence of 402 Mb spanning 75.9% of the estimated genomic size and containing 61,572 predicted genes was obtained. Subsequently, 221 single nucleotide polymorphism markers and 768 PCR-RFLP markers were used together with the 746 markers produced in our previous study for the construction of a linkage map. The map was combined further with another radish linkage map constructed mainly with expressed sequence tag-simple sequence repeat markers into a high-density integrated map of 1,166 cM with 2,553 DNA markers. A total of 1,345 scaffolds were assigned to the linkage map, spanning 116.0 Mb. Bulked PCR products amplified by 2,880 primer pairs were sequenced by NGS, and SNPs in eight inbred lines were identified. PMID:24848699
Malhotra, Alka; Igo, Robert P; Thameem, Farook; Kao, W H Linda; Abboud, Hanna E; Adler, Sharon G; Arar, Nedal H; Bowden, Donald W; Duggirala, Ravindranath; Freedman, Barry I; Goddard, Katrina A B; Ipp, Eli; Iyengar, Sudha K; Kimmel, Paul L; Knowler, William C; Kohn, Orly; Leehey, David; Meoni, Lucy A; Nelson, Robert G; Nicholas, Susanne B; Parekh, Rulan S; Rich, Stephen S; Chen, Yii-Der I; Saad, Mohammed F; Scavini, Marina; Schelling, Jeffrey R; Sedor, John R; Shah, Vallabh O; Taylor, Kent D; Thornley-Brown, Denyse; Zager, Philip G; Horvath, Amanda; Hanson, Robert L
2009-11-01
Previous studies have shown that in addition to environmental influences, type 2 diabetes mellitus (T2DM) has a strong genetic component. The goal of the current study is to identify regions of linkage for T2DM in ethnically diverse populations. Phenotypic and genotypic data were obtained from African American (AA; total number of individuals [N] = 1004), American Indian (AI; N = 883), European American (EA; N = 537), and Mexican American (MA; N = 1634) individuals from the Family Investigation of Nephropathy and Diabetes. Non-parametric linkage analysis, using an average of 4404 SNPs, was performed in relative pairs affected with T2DM in each ethnic group. In addition, family-based tests were performed to detect association with T2DM. Statistically significant evidence for linkage was observed on chromosome 4q21.1 (LOD = 3.13; genome-wide p = 0.04) in AA. In addition, a total of 11 regions showed suggestive evidence for linkage (estimated at LOD > 1.71), with the highest LOD scores on chromosomes 12q21.31 (LOD = 2.02) and 22q12.3 (LOD = 2.38) in AA, 2p11.1 (LOD = 2.23) in AI, 6p12.3 (LOD = 2.77) in EA, and 13q21.1 (LOD = . 2.24) in MA. While no region overlapped across all ethnic groups, at least five loci showing LOD > 1.71 have been identified in previously published studies. The results from this study provide evidence for the presence of genes affecting T2DM on chromosomes 4q, 12q, and 22q in AA; 6p in EA; 2p in AI; and 13q in MA. The strong evidence for linkage on chromosome 4q in AA provides important information given the paucity of diabetes genetic studies in this population.
Álvarez, Isabel; Pérez-Pardal, Lucía; Traoré, Amadou; Fernández, Iván; Goyache, Félix
2016-08-01
A panel of 81 Asian, African and European cattle (Bos taurus and B. indicus) was analysed for the whole sequence of the CXCR4 gene (3844bp), a strong candidate for cattle trypanotolerance. Thirty-one polymorphic sites identified gave 31 different haplotypes. Neutrality tests rejected the hypothesis of either positive or purifying selection. Bayesian phylogenetic tree showed differentiation of haplotypes into two clades gathering genetic variability predating domestication. Related with clades definition, linkage disequilibrium analyses suggested the existence of one only linkage block on the CXCR4 gene. Two tag SNPs identified on exon 2 captured 50% of variability. Whatever the analysis carried out, no clear separation between cattle groups was identified. Most haplotypes identified in West African taurine cattle were also found in European cattle and in Asian and West African zebu. West African taurine samples did not carry unique variants on the CXCR4 gene sequence. The current analysis failed in identifying a causal mutation on the CXCR4 gene underlying a previously reported QTL for cattle trypanotolerance on BTA2. Copyright © 2016 Elsevier B.V. All rights reserved.
Mapping QTL influencing gastrointestinal nematode burden in Dutch Holstein-Friesian dairy cattle
Coppieters, Wouter; Mes, Ted HM; Druet, Tom; Farnir, Frédéric; Tamma, Nico; Schrooten, Chris; Cornelissen, Albert WCA; Georges, Michel; Ploeger, Harm W
2009-01-01
Background Parasitic gastroenteritis caused by nematodes is only second to mastitis in terms of health costs to dairy farmers in developed countries. Sustainable control strategies complementing anthelmintics are desired, including selective breeding for enhanced resistance. Results and Conclusion To quantify and characterize the genetic contribution to variation in resistance to gastro-intestinal parasites, we measured the heritability of faecal egg and larval counts in the Dutch Holstein-Friesian dairy cattle population. The heritability of faecal egg counts ranged from 7 to 21% and was generally higher than for larval counts. We performed a whole genome scan in 12 paternal half-daughter groups for a total of 768 cows, corresponding to the ~10% most and least infected daughters within each family (selective genotyping). Two genome-wide significant QTL were identified in an across-family analysis, respectively on chromosomes 9 and 19, coinciding with previous findings in orthologous chromosomal regions in sheep. We identified six more suggestive QTL by within-family analysis. An additional 73 informative SNPs were genotyped on chromosome 19 and the ensuing high density map used in a variance component approach to simultaneously exploit linkage and linkage disequilibrium in an initial inconclusive attempt to refine the QTL map position. PMID:19254385
Li, Feng; Kitashiba, Hiroyasu; Inaba, Kiyofumi; Nishio, Takeshi
2009-01-01
For identification of genes responsible for varietal differences in flowering time and leaf morphological traits, we constructed a linkage map of Brassica rapa DNA markers including 170 EST-based markers, 12 SSR markers, and 59 BAC sequence-based markers, of which 151 are single nucleotide polymorphism (SNP) markers. By BLASTN, 223 markers were shown to have homologous regions in Arabidopsis thaliana, and these homologous loci covered nearly the whole genome of A. thaliana. Synteny analysis between B. rapa and A. thaliana revealed 33 large syntenic regions. Three quantitative trait loci (QTLs) for flowering time were detected. BrFLC1 and BrFLC2 were linked to the QTLs for bolting time, budding time, and flowering time. Three SNPs in the promoter, which may be the cause of low expression of BrFLC2 in the early-flowering parental line, were identified. For leaf lobe depth and leaf hairiness, one major QTL corresponding to a syntenic region containing GIBBERELLIN 20 OXIDASE 3 and one major QTL containing BrGL1, respectively, were detected. Analysis of nucleotide sequences and expression of these genes suggested possible involvement of these genes in leaf morphological traits. PMID:19884167
Yang, Wanling; Wang, Zhanyong; Wang, Lusheng; Sham, Pak-Chung; Huang, Peng; Lau, Yu Lung
2008-12-01
With genotyping of high-density single nucleotide polymorphisms (SNPs) replacing that of microsatellite markers in linkage studies, it becomes possible to accurately determine the genomic regions shared identity by descent (IBD) by family members. In addition to evaluating the likelihood of linkage for a region with the underlining disease (the LOD score approach), an appropriate question to ask is what would be the expected number and sizes of IBD regions among the affecteds, as there could be more than one region reaching the maximum achievable LOD score for a given family. Here, we introduce a computer program to allow the prediction of the total number of IBD regions among family members and their sizes. Reversely, it can be used to predict the portion of the genome that can be excluded from consideration according to the family size and user-defined inheritance mode and penetrance. Such information has implications on the feasibility of conducting linkage analysis on a given family of certain size and structure or on a few small families when interfamily homogeneity can be assumed. It can also help determine the most relevant members to be genotyped for such a study. Simulation results showed that the IBD regions containing true mutations are usually larger than regions IBD due to random chance. We have made use of this feature in our program to allow evaluation of the identified IBD regions based on Bayesian probability calculation and simulation results.
Chromosome 9p21 In Ischemic Stroke: Population Structure and Meta-Analysis
Anderson, CD; Biffi, A; Rost, NS; Cortellini, L; Furie, KL; Rosand, J
2011-01-01
Background and Purpose Sequence variants on chromosome 9p21.3 are implicated in coronary artery disease (CAD) and myocardial infarction (MI), but studies in ischemic stroke have produced inconsistent results. We investigated whether these conflicting findings were due to false positive studies confounded by population stratification, or false negative studies that failed to account for effects specific to certain stroke subtypes. Methods After assessing for population stratification at 9p21.3 using genome-wide data, we meta-analyzed 8 ischemic stroke studies. This analysis focused on two single nucleotide polymorphisms (SNPs), rs1537378 and rs10757278, as these variants are in strong linkage disequilibrium with most SNPs analyzed in prior studies of the region. Results Principal component analysis of the genome-wide data showed no evidence of population stratification at that locus. Meta-analysis confirmed that both rs1537378 and rs10757278 are risk factors for ischemic stroke (odds ratios 1.09, [p = 0.0014], and 1.11, [p = 0.001] respectively). Subtype analysis revealed a substantial increase in the effect of each SNP for risk of large artery (LA) stroke, achieving an effect size similar to that seen in CAD/MI. Conclusions Variants on 9p21.3 are associated with ischemic stroke, and restriction of analysis to LA stroke increases effect size towards that observed in prior association studies of CAD/MI. Previous inconsistent findings are best explained by this subtype-specificity rather than any unmeasured confounding by population stratification. PMID:20395606
de Miguel, Marina; de Maria, Nuria; Guevara, M Angeles; Diaz, Luis; Sáez-Laguna, Enrique; Sánchez-Gómez, David; Chancerel, Emilie; Aranda, Ismael; Collada, Carmen; Plomion, Christophe; Cabezas, José-Antonio; Cervera, María-Teresa
2012-10-04
Pinus pinaster Ait. is a major resin producing species in Spain. Genetic linkage mapping can facilitate marker-assisted selection (MAS) through the identification of Quantitative Trait Loci and selection of allelic variants of interest in breeding populations. In this study, we report annotated genetic linkage maps for two individuals (C14 and C15) belonging to a breeding program aiming to increase resin production. We use different types of DNA markers, including last-generation molecular markers. We obtained 13 and 14 linkage groups for C14 and C15 maps, respectively. A total of 211 and 215 markers were positioned on each map and estimated genome length was between 1,870 and 2,166 cM respectively, which represents near 65% of genome coverage. Comparative mapping with previously developed genetic linkage maps for P. pinaster based on about 60 common markers enabled aligning linkage groups to this reference map. The comparison of our annotated linkage maps and linkage maps reporting QTL information revealed 11 annotated SNPs in candidate genes that co-localized with previously reported QTLs for wood properties and water use efficiency. This study provides genetic linkage maps from a Spanish population that shows high levels of genetic divergence with French populations from which segregating progenies have been previously mapped. These genetic maps will be of interest to construct a reliable consensus linkage map for the species. The importance of developing functional genetic linkage maps is highlighted, especially when working with breeding populations for its future application in MAS for traits of interest.
2012-01-01
Background Pinus pinaster Ait. is a major resin producing species in Spain. Genetic linkage mapping can facilitate marker-assisted selection (MAS) through the identification of Quantitative Trait Loci and selection of allelic variants of interest in breeding populations. In this study, we report annotated genetic linkage maps for two individuals (C14 and C15) belonging to a breeding program aiming to increase resin production. We use different types of DNA markers, including last-generation molecular markers. Results We obtained 13 and 14 linkage groups for C14 and C15 maps, respectively. A total of 211 and 215 markers were positioned on each map and estimated genome length was between 1,870 and 2,166 cM respectively, which represents near 65% of genome coverage. Comparative mapping with previously developed genetic linkage maps for P. pinaster based on about 60 common markers enabled aligning linkage groups to this reference map. The comparison of our annotated linkage maps and linkage maps reporting QTL information revealed 11 annotated SNPs in candidate genes that co-localized with previously reported QTLs for wood properties and water use efficiency. Conclusions This study provides genetic linkage maps from a Spanish population that shows high levels of genetic divergence with French populations from which segregating progenies have been previously mapped. These genetic maps will be of interest to construct a reliable consensus linkage map for the species. The importance of developing functional genetic linkage maps is highlighted, especially when working with breeding populations for its future application in MAS for traits of interest. PMID:23036012
Karjalainen, Minna K.; Huusko, Johanna M.; Ulvila, Johanna; Sotkasiira, Jenni; Luukkonen, Aino; Teramo, Kari; Plunkett, Jevon; Anttila, Verneri; Palotie, Aarno; Haataja, Ritva; Muglia, Louis J.; Hallman, Mikko
2012-01-01
Preterm birth is the major cause of neonatal mortality and morbidity. In many cases, it has severe life-long consequences for the health and neurological development of the newborn child. More than 50% of all preterm births are spontaneous, and currently there is no effective prevention. Several studies suggest that genetic factors play a role in spontaneous preterm birth (SPTB). However, its genetic background is insufficiently characterized. The aim of the present study was to perform a linkage analysis of X chromosomal markers in SPTB in large northern Finnish families with recurrent SPTBs. We found a significant linkage signal (HLOD = 3.72) on chromosome locus Xq13.1 when the studied phenotype was being born preterm. There were no significant linkage signals when the studied phenotype was giving preterm deliveries. Two functional candidate genes, those encoding the androgen receptor (AR) and the interleukin-2 receptor gamma subunit (IL2RG), located near this locus were analyzed as candidates for SPTB in subsequent case-control association analyses. Nine single-nucleotide polymorphisms (SNPs) within these genes and an AR exon-1 CAG repeat, which was previously demonstrated to be functionally significant, were analyzed in mothers with preterm delivery (n = 272) and their offspring (n = 269), and in mothers with exclusively term deliveries (n = 201) and their offspring (n = 199), all originating from northern Finland. A replication study population consisting of individuals born preterm (n = 111) and term (n = 197) from southern Finland was also analyzed. Long AR CAG repeats (≥26) were overrepresented and short repeats (≤19) underrepresented in individuals born preterm compared to those born at term. Thus, our linkage and association results emphasize the role of the fetal genome in genetic predisposition to SPTB and implicate AR as a potential novel fetal susceptibility gene for SPTB. PMID:23227263
A second generation genetic linkage map of Japanese flounder (Paralichthys olivaceus)
2010-01-01
Background Japanese flounder (Paralichthys olivaceus) is one of the most economically important marine species in Northeast Asia. Information on genetic markers associated with quantitative trait loci (QTL) can be used in breeding programs to identify and select individuals carrying desired traits. Commercial production of Japanese flounder could be increased by developing disease-resistant fish and improving commercially important traits. Previous maps have been constructed with AFLP markers and a limited number of microsatellite markers. In this study, improved genetic linkage maps are presented. In contrast with previous studies, these maps were built mainly with a large number of codominant markers so they can potentially be used to analyze different families and populations. Results Sex-specific genetic linkage maps were constructed for the Japanese flounder including a total of 1,375 markers [1,268 microsatellites, 105 single nucleotide polymorphisms (SNPs) and two genes]; 1,167 markers are linked to the male map and 1,067 markers are linked to the female map. The lengths of the male and female maps are 1,147.7 cM and 833.8 cM, respectively. Based on estimations of map lengths, the female and male maps covered 79 and 82% of the genome, respectively. Recombination ratio in the new maps revealed F:M of 1:0.7. All linkage groups in the maps presented large differences in the location of sex-specific recombination hot-spots. Conclusions The improved genetic linkage maps are very useful for QTL analyses and marker-assisted selection (MAS) breeding programs for economically important traits in Japanese flounder. In addition, SNP flanking sequences were blasted against Tetraodon nigroviridis (puffer fish) and Danio rerio (zebrafish), and synteny analysis has been carried out. The ability to detect synteny among species or genera based on homology analysis of SNP flanking sequences may provide opportunities to complement initial QTL experiments with candidate gene approaches from homologous chromosomal locations identified in related model organisms. PMID:20937088
2012-01-01
Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety. PMID:22631220
The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies
Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong
2017-01-01
It is of substantial interest to study the effects of genes, genetic pathways, and networks on the risk of complex diseases. These genetic constructs each contain multiple SNPs, which are often correlated and function jointly, and might be large in number. However, only a sparse subset of SNPs in a genetic construct is generally associated with the disease of interest. In this article, we propose the generalized higher criticism (GHC) to test for the association between an SNP set and a disease outcome. The higher criticism is a test traditionally used in high-dimensional signal detection settings when marginal test statistics are independent and the number of parameters is very large. However, these assumptions do not always hold in genetic association studies, due to linkage disequilibrium among SNPs and the finite number of SNPs in an SNP set in each genetic construct. The proposed GHC overcomes the limitations of the higher criticism by allowing for arbitrary correlation structures among the SNPs in an SNP-set, while performing accurate analytic p-value calculations for any finite number of SNPs in the SNP-set. We obtain the detection boundary of the GHC test. We compared empirically using simulations the power of the GHC method with existing SNP-set tests over a range of genetic regions with varied correlation structures and signal sparsity. We apply the proposed methods to analyze the CGEM breast cancer genome-wide association study. Supplementary materials for this article are available online. PMID:28736464
Novel efficient genome-wide SNP panels for the conservation of the highly endangered Iberian lynx.
Kleinman-Ruiz, Daniel; Martínez-Cruz, Begoña; Soriano, Laura; Lucena-Perez, Maria; Cruz, Fernando; Villanueva, Beatriz; Fernández, Jesús; Godoy, José A
2017-07-21
The Iberian lynx (Lynx pardinus) has been acknowledged as the most endangered felid species in the world. An intense contraction and fragmentation during the twentieth century left less than 100 individuals split in two isolated and genetically eroded populations by 2002. Genetic monitoring and management so far have been based on 36 STRs, but their limited variability and the more complex situation of current populations demand more efficient molecular markers. The recent characterization of the Iberian lynx genome identified more than 1.6 million SNPs, of which 1536 were selected and genotyped in an extended Iberian lynx sample. We validated 1492 SNPs and analysed their heterozygosity, Hardy-Weinberg equilibrium, and linkage disequilibrium. We then selected a panel of 343 minimally linked autosomal SNPs from which we extracted subsets optimized for four different typical tasks in conservation applications: individual identification, parentage assignment, relatedness estimation, and admixture classification, and compared their power to currently used STR panels. We ascribed 21 SNPs to chromosome X based on their segregation patterns, and identified one additional marker that showed significant differentiation between sexes. For all applications considered, panels of autosomal SNPs showed higher power than the currently used STR set with only a very modest increase in the number of markers. These novel panels of highly informative genome-wide SNPs provide more powerful, efficient, and flexible tools for the genetic management and non-invasive monitoring of Iberian lynx populations. This example highlights an important outcome of whole-genome studies in genetically threatened species.
The ankyrin-3 gene is associated with posttraumatic stress disorder and externalizing comorbidity
Logue, Mark W.; Solovieff, Nadia; Leussis, Melanie P.; Wolf, Erika J.; Melista, Efi; Baldwin, Clinton; Koenen, Karestan C.; Petryshen, Tracey; Miller, Mark W.
2013-01-01
Background The ankyrin 3 gene (ANK3) produces the ankyrin G protein that plays an integral role in regulating neuronal activity. Previous studies have linked ANK3 to bipolar disorder and schizophrenia. A recent mouse study suggests that ANK3 may regulate behavioral disinhibition and stress reactivity. This led us to hypothesize that ANK3 might also be associated with stress-related psychopathology such as posttraumatic stress disorder (PTSD), as well as disorders of the externalizing spectrum such as antisocial personality disorder and substance-related disorders that are etiologically linked to impulsivity and temperamental disinhibition. Methods We examined the possibility of association between ANK3 SNPs and both PTSD and externalizing (defined by a factor score representing a composite of adult antisociality and substance abuse) in a cohort of white non-Hispanic combat veterans and their intimate partners (N=554). Initially, we focused on rs9804190— a SNP previously reported to be associated with bipolar disorder, schizophrenia, and ankyrin G expression in brain. Then we examined 358 additional ANK3 SNPs utilizing a multiple-testing correction. Results rs9804190 was associated with both externalizing and PTSD (p=0.028 and p=0.042 respectively). Analysis of other ANK3 SNPs identified several that were more strongly associated with either trait. The most significant association with externalizing was observed at rs1049862 (p=0.00040, pcorrected=0.60). The most significant association with PTSD (p=0.00060, pcorrected=0.045) was found with three SNPs in complete linkage disequilibrium (LD)—rs28932171, rs11599164, and rs17208576. Conclusions These findings support a role of ANK3 in risk of stress-related and externalizing disorders, beyond its previous associations with bipolar disorder and schizophrenia. PMID:23796624
Common variant of ALPK1 is not associated with gout: a replication study.
Chiba, Toshinori; Matsuo, Hirotaka; Sakiyama, Masayuki; Nakayama, Akiyoshi; Shimizu, Seiko; Wakai, Kenji; Suma, Shino; Nakashima, Hiroshi; Sakurai, Yutaka; Shimizu, Toru; Ichida, Kimiyoshi; Shinomiya, Nariyoshi
2015-01-01
Gout is one of the most kinds of common inflammatory arthritis as a consequence of hyperuricemia. Alpha-protein kinase 1 (ALPK1) gene locates in a gout-susceptibility locus on chromosome 4q21-31, and encodes ALPK1 protein which plays a pivotal role in the phosphorylation of myosin 1. In the previous genetic study of Taiwanese populations, 3 single nucleotide polymorphisms (SNPs), rs11726117, rs231247 and rs231253, in ALPK1 gene were reported to have a significant association with gout. However, no replication study has been performed to confirm this association. Therefore, we first conducted a replication study with clinically defined gout patients in a different population. Linkage disequilibrium (LD) analyzes of the 3 SNPs in ALPK1 revealed that these SNPs are in strong LD in a Japanese population. Among the 3 SNPs of ALPK1, rs11726117 (M861T) is the only missense SNP. Therefore, rs11726117 was genotyped in a Japanese population of 903 clinically defined gout cases and 1,302 controls, and was evaluated for a possible association with gout. The minor allele frequencies of rs11726117 were 0.26 and 0.25 in the case and control groups, respectively. The association analysis has not detected a significant association between rs11726117 and gout susceptibility in a Japanese population (p = 0.44). Because ABCG2, a major causative gene for gout, also locates in the gout-susceptibility locus on chromosome 4q, these findings suggest that among genes in a gout-susceptibility locus, not ALPK1 but ABCG2 could be important as a gout-susceptible gene.
2013-01-01
Background Autism spectrum conditions (ASC) are associated with deficits in social interaction and communication, alongside repetitive, restricted, and stereotyped behavior. ASC is highly heritable. The gamma-aminobutyric acid (GABA)-ergic system has been associated consistently with atypicalities in autism, in both genetic association and expression studies. A key component of the GABA-ergic system is encoded by the GABRB3 gene, which has been previously implicated both in ASC and in individual differences in empathy. Methods In this study, 45 genotyped single nucleotide polymorphisms (SNPs) within GABRB3 were tested for association with Asperger syndrome (AS), and related quantitative traits measured through the following tests: the Empathy Quotient (EQ), the Autism Spectrum Quotient (AQ), the Systemizing Quotient-Revised (SQ-R), the Embedded Figures Test (EFT), the Reading the Mind in the Eyes Test (RMET), and the Mental Rotation Test (MRT). Two-loci, three-loci, four-loci haplotype analyses, and one seven-loci haplotype analysis were also performed in the AS case–control sample. Results Three SNPs (rs7180158, rs7165604, rs12593579) were significantly associated with AS, and two SNPs (rs9806546, rs11636966) were significantly associated with EQ. Two SNP-SNP pairs, rs12438141-rs1035751 and rs12438141-rs7179514, showed significant association with variation in the EFT scores. One SNP-SNP pair, rs7174437-rs1863455, was significantly associated with variation in the MRT scores. Additionally, a few haplotypes, including a 19 kb genomic region that formed a linkage disequilibrium (LD) block in our sample and contained several nominally significant SNPs, were found to be significantly associated with AS. Conclusion The current study confirms the role of GABRB3 as an important candidate gene in both ASC and normative variation in related endophenotypes. PMID:24321478
Meldrum, Suzanne J; Li, Yuchun; Zhang, Guicheng; Heaton, Alexandra E M; D'Vaz, Nina; Manz, Judith; Reischl, Eva; Koletzko, Berthold V; Prescott, Susan L; Simmer, Karen
2017-09-19
The enzymes encoded by fatty acid desaturases (FADS) genes determine the desaturation of long-chain polyunsaturated fatty acids (LCPUFA). We investigated if haplotype and single nucleotide polymorphisms (SNPs) in FADS gene cluster can influence LCPUFA status in infants who received either fish oil or placebo supplementation. Children enrolled in the Infant Fish Oil Supplementation Study (IFOS) were randomly allocated to receive either fish oil or placebo from birth to 6 months of age. Blood was collected at 6 months of age for the measurement of fatty acids and for DNA extraction. A total of 276 participant DNA samples underwent genotyping, and 126 erythrocyte and 133 plasma fatty acid measurements were available for analysis. Twenty-two FADS SNPs were selected on the basis of literature and linkage disequilibrium patterns identified from the HapMap data. Haplotype construction was completed using PHASE. For participants allocated to the fish oil group who had two copies of the FADS1 haplotype consisting of SNP minor alleles, DHA levels were significantly higher compared to other haplotypes. This finding was not observed for the placebo group. Furthermore, for members of the fish oil group only, the minor homozygous carriers of all the FADS1 SNPs investigated had significantly higher DHA than other genotypes (rs174545, rs174546, rs174548, rs174553, rs174556, rs174537, rs174448, and rs174455). Overall results of this preliminary study suggest that supplementation with fish oil may only significantly increase DHA in minor allele carriers of FADS1 SNPs. Further research is required to confirm this novel finding.
Singh, Kanhaiya; Kant, Shri; Singh, Vivek Kumar; Agrawal, Neeraj K.; Gupta, Sanjeev K.
2014-01-01
Purpose Persistent inflammation and impaired neovascularization in type 2 diabetes mellitus (T2DM) patients may lead to development of macro- and microvascular complications. Diabetic retinopathy (DR) is one of the secondary microvascular complications of T2DM. Improper activation of the innate immune system may be an important contributor in the pathophysiology of DR. Toll-like receptor 4 (TLR4) is an important mediator of innate immunity, and genetic alterations in TLR4 support inflammation in the hyperglycemic condition. The present work was designed to investigate whether the TLR4 single nucleotide polymorphisms (SNPs) rs4986790, rs4986791, rs10759931, rs1927911, and rs1927914 are associated with DR in a north Indian population. Methods The study group of 698 individuals (128 DR, 250 T2DM, 320 controls) was genotyped by PCR-RFLP. Haplotype and linkage disequilibrium between SNPs were determined using Haploview software. Results Combined risk genotypes of TLR4 SNPs rs10759931 (odds ratio [OR] 1.50, p = 0.05) and rs1927914 (OR 1.48, p = 0.05) were found to be significantly associated with pathogenesis of DR. A total of 14 haplotypes with frequency >1% were obtained using Haploview software. Haplotypes ACATC (37.5%) and ACATT (14.8%) were the two most common haplotypes obtained. Conclusions Results of the present case-control study that included 698 north Indian subjects suggested that TLR4 SNPs rs10759931 and rs1927914 modulate the risk of DR in T2DM cases. Association analysis using haplotypes showed none of the haplotypes were associated with either susceptibility or resistance to DR in a north Indian population. PMID:24883015
Brant, Steven R.; Okou, David T.; Simpson, Claire L.; Cutler, David J.; Haritunians, Talin; Bradfield, Jonathan P.; Chopra, Pankaj; Prince, Jarod; Begum, Ferdouse; Kumar, Archana; Huang, Chengrui; Venkateswaran, Suresh; Datta, Lisa W.; Wei, Zhi; Thomas, Kelly; Herrinton, Lisa J.; Klapproth, Jan-Micheal A.; Quiros, Antonio J.; Seminerio, Jenifer; Liu, Zhenqiu; Alexander, Jonathan S.; Baldassano, Robert N.; Dudley-Brown, Sharon; Cross, Raymond K.; Dassopoulos, Themistocles; Denson, Lee A.; Dhere, Tanvi A.; Dryden, Gerald W.; Hanson, John S.; Hou, Jason K.; Hussain, Sunny Z.; Hyams, Jeffrey S.; Isaacs, Kim L.; Kader, Howard; Kappelman, Michael D.; Katz, Jeffry; Kellermayer, Richard; Kirschner, Barbara S.; Kuemmerle, John F.; Kwon, John H.; Lazarev, Mark; Li, Ellen; Mack, David; Mannon, Peter; Moulton, Dedrick E.; Newberry, Rodney D.; Osuntokun, Bankole O.; Patel, Ashish S.; Saeed, Shehzad A.; Targan, Stephan R.; Valentine, John F.; Wang, Ming-Hsi; Zonca, Martin; Rioux, John D.; Duerr, Richard H.; Silverberg, Mark S.; Cho, Judy H.; Hakonarson, Hakon; Zwick, Michael E.; McGovern, Dermot P.B.; Kugathasan, Subra
2016-01-01
Background & Aims The inflammatory bowel diseases (IBD) ulcerative colitis (UC) and Crohn’s disease (CD) cause significant morbidity and are increasing in prevalence among all populations, including African Americans. More than 200 susceptibility loci have been identified in populations of predominantly European ancestry, but few loci have been associated with IBD in other ethnicities. Methods We performed 2 high-density, genome-wide scans comprising 2345 cases of African Americans with IBD (1646 with CD, 583 with UC, and 116 inflammatory bowel disease unclassified [IBD-U]) and 5002 individuals without IBD (controls, identified from the Health Retirement Study and Kaiser Permanente database). Single-nucleotide polymorphisms (SNPs) associated at P<5.0×10−8 in meta-analysis with a nominal evidence (P<.05) in each scan were considered to have genome-wide significance. Results We detected SNPs at HLA-DRB1, and African-specific SNPs at ZNF649 and LSAMP, with associations of genome-wide significance for UC. We detected SNPs at USP25 with associations of genome-wide significance associations for IBD. No associations of genome-wide significance were detected for CD. In addition, 9 genes previously associated with IBD contained SNPs with significant evidence for replication (P<1.6×10−6): ADCY3, CXCR6, HLA-DRB1 to HLA-DQA1 (genome-wide significance on conditioning), IL12B, PTGER4, and TNC for IBD; IL23R, PTGER4, and SNX20 (in strong linkage disequilibrium with NOD2) for CD; and KCNQ2 (near TNFRSF6B) for UC. Several of these genes, such as TNC (near TNFSF15), CXCR6, and genes associated with IBD at the HLA locus, contained SNPs with unique association patterns with African-specific alleles. Conclusions We performed a genome-wide association study of African Americans with IBD and identified loci associated with CD and UC in only this population; we also replicated loci identified in European populations. The detection of variants associated with IBD risk in only people of African descent demonstrates the importance of studying the genetics of IBD and other complex diseases in populations beyond those of European ancestry. PMID:27693347
A genome-wide association study of seed protein and oil content in soybean
2014-01-01
Background Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content. Results A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r 2 ) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil. Conclusions This research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s). PMID:24382143
A genome-wide association study of seed protein and oil content in soybean.
Hwang, Eun-Young; Song, Qijian; Jia, Gaofeng; Specht, James E; Hyten, David L; Costa, Jose; Cregan, Perry B
2014-01-02
Association analysis is an alternative to conventional family-based methods to detect the location of gene(s) or quantitative trait loci (QTL) and provides relatively high resolution in terms of defining the genome position of a gene or QTL. Seed protein and oil concentration are quantitative traits which are determined by the interaction among many genes with small to moderate genetic effects and their interaction with the environment. In this study, a genome-wide association study (GWAS) was performed to identify quantitative trait loci (QTL) controlling seed protein and oil concentration in 298 soybean germplasm accessions exhibiting a wide range of seed protein and oil content. A total of 55,159 single nucleotide polymorphisms (SNPs) were genotyped using various methods including Illumina Infinium and GoldenGate assays and 31,954 markers with minor allele frequency >0.10 were used to estimate linkage disequilibrium (LD) in heterochromatic and euchromatic regions. In euchromatic regions, the mean LD (r2) rapidly declined to 0.2 within 360 Kbp, whereas the mean LD declined to 0.2 at 9,600 Kbp in heterochromatic regions. The GWAS results identified 40 SNPs in 17 different genomic regions significantly associated with seed protein. Of these, the five SNPs with the highest associations and seven adjacent SNPs were located in the 27.6-30.0 Mbp region of Gm20. A major seed protein QTL has been previously mapped to the same location and potential candidate genes have recently been identified in this region. The GWAS results also detected 25 SNPs in 13 different genomic regions associated with seed oil. Of these markers, seven SNPs had a significant association with both protein and oil. This research indicated that GWAS not only identified most of the previously reported QTL controlling seed protein and oil, but also resulted in narrower genomic regions than the regions reported as containing these QTL. The narrower GWAS-defined genome regions will allow more precise marker-assisted allele selection and will expedite positional cloning of the causal gene(s).
Guan, Yan-Ping; Yang, Xue-Xi; Yao, Guang-Yu; Qiu, Fei; Chen, Jun; Chen, Lu-Jia; Ye, Chang-Sheng; Li, Ming
2014-01-01
Genome-wide association studies (GWAS) have identified various genetic susceptibility loci for breast cancer based mainly on European-ancestry populations. Differing linkage disequilibrium patterns exist between European and Asian populations. Ten SNPs (rs2075555 in COL1A1, rs12652447 in FBXL17, rs10941679 in 5p12/MRPS30, rs11878583 in ZNF577, rs7166081 in SMAD3, rs16917302 in ZNF365, rs311499 in 20q13.3, rs1045485 in CASP8, rs12964873 in CDH1 and rs8170 in 19p13.1) were here genotyped in 1009 Chinese females (487 patients with breast cancer and 522 control subjects) using the Sequenom MassARRAY iPLEX platform. Association analysis based on unconditional logistic regression was carried out to determine the odds ratio (OR) and 95% confidence interval (95% CI) for each SNP. Stratification analyses were carried out based on the estrogen receptor (ER) and progesterone receptor (PR) status. Among the 10 SNPs, rs10941679 showed significant association with breast cancer when differences between the case and control groups in this Han Chinese population were compared (30.09% GG, 45.4% GA and 23.7% AA; P = 0.012). Four SNPs (rs311499, rs1045485, rs12964873 and rs8170) showed no polymorphisms in our study. The remaining five SNPs showed no association with breast cancer in the present population. Immunohistochemical tests showed that rs2075555 was associated with ER status; the AA genotype showed greater association with ER negative than ER positive (OR = 0.54, 95% CI, 0.29-0.99; P = 0.046). AA of rs7166081 was also associated with ER status, but showed a greater association with ER positive than negative (OR = 1.59, 95% CI = 1.04-2.44; P = 0.031). However, no significant associations were found among the SNPs and PR status. In this study using a Han Chinese population, rs10941679 was the only SNP associated with breast cancer risk, indicating a difference between European and Chinese populations in susceptibility loci. Therefore, confirmation studies are necessary before utilization of these loci in Chinese.
The utility of low-density genotyping for imputation in the Thoroughbred horse
2014-01-01
Background Despite the dramatic reduction in the cost of high-density genotyping that has occurred over the last decade, it remains one of the limiting factors for obtaining the large datasets required for genomic studies of disease in the horse. In this study, we investigated the potential for low-density genotyping and subsequent imputation to address this problem. Results Using the haplotype phasing and imputation program, BEAGLE, it is possible to impute genotypes from low- to high-density (50K) in the Thoroughbred horse with reasonable to high accuracy. Analysis of the sources of variation in imputation accuracy revealed dependence both on the minor allele frequency of the single nucleotide polymorphisms (SNPs) being imputed and on the underlying linkage disequilibrium structure. Whereas equidistant spacing of the SNPs on the low-density panel worked well, optimising SNP selection to increase their minor allele frequency was advantageous, even when the panel was subsequently used in a population of different geographical origin. Replacing base pair position with linkage disequilibrium map distance reduced the variation in imputation accuracy across SNPs. Whereas a 1K SNP panel was generally sufficient to ensure that more than 80% of genotypes were correctly imputed, other studies suggest that a 2K to 3K panel is more efficient to minimize the subsequent loss of accuracy in genomic prediction analyses. The relationship between accuracy and genotyping costs for the different low-density panels, suggests that a 2K SNP panel would represent good value for money. Conclusions Low-density genotyping with a 2K SNP panel followed by imputation provides a compromise between cost and accuracy that could promote more widespread genotyping, and hence the use of genomic information in horses. In addition to offering a low cost alternative to high-density genotyping, imputation provides a means to combine datasets from different genotyping platforms, which is becoming necessary since researchers are starting to use the recently developed equine 70K SNP chip. However, more work is needed to evaluate the impact of between-breed differences on imputation accuracy. PMID:24495673
Common variants on chromosome 6p22.1 are associated with schizophrenia
Shi, Jianxin; Levinson, Douglas F.; Duan, Jubao; Sanders, Alan R.; Zheng, Yonglan; Pe'er, Itsik; Dudbridge, Frank; Holmans, Peter A.; Whittemore, Alice S.; Mowry, Bryan J.; Olincy, Ann; Amin, Farooq; Cloninger, C. Robert; Silverman, Jeremy M.; Buccola, Nancy G.; Byerley, William F.; Black, Donald W.; Crowe, Raymond R.; Oksenberg, Jorge R.; Mirel, Daniel B.; Kendler, Kenneth S.; Freedman, Robert; Gejman, Pablo V.
2009-01-01
Schizophrenia, a devastating psychiatric disorder, has a prevalence of 0.5–1%, with high heritability (80–85%) and complex transmission.1 Recent studies implicate rare, large, high-penetrance copy number variants (CNVs) in some cases2, but it is not known what genes or biological mechanisms underlie susceptibility. Here we show that schizophrenia is significantly associated with single nucleotide polymorphisms (SNPs) in the extended Major Histocompatibility Complex (MHC) region on chromosome 6. We carried out a genome-wide association study (GWAS) of common SNPs in the Molecular Genetics of Schizophrenia (MGS) case-control sample, and then a meta-analysis of data from the MGS, International Schizophrenia Consortium (ISC) and SGENE datasets. No MGS finding achieved genome-wide statistical significance. In the meta-analysis of European-ancestry subjects (8,008 cases, 19,077 controls), significant association with schizophrenia was observed in a region of linkage disequilibrium on chromosome 6p22.1 (P = 9.54 × 10−9). This region includes a histone gene cluster and several immunity-related genes, possibly implicating etiologic mechanisms involving chromatin modification, transcriptional regulation, auto-immunity and/or infection. These results demonstrate that common schizophrenia susceptibility alleles can be detected. The characterization of these signals will suggest important directions for research on susceptibility mechanisms. PMID:19571809
Belamkar, Vikas; Farmer, Andrew D.; Weeks, Nathan T.; Kalberer, Scott R.; Blackmon, William J.; Cannon, Steven B.
2016-01-01
For species with potential as new crops, rapid improvement may be facilitated by new genomic methods. Apios (Apios americana Medik.), once a staple food source of Native American Indians, produces protein-rich tubers, tolerates a wide range of soils, and symbiotically fixes nitrogen. We report the first high-quality de novo transcriptome assembly, an expression atlas, and a set of 58,154 SNP and 39,609 gene expression markers (GEMs) for characterization of a breeding collection. Both SNPs and GEMs identify six genotypic clusters in the collection. Transcripts mapped to the Phaseolus vulgaris genome–another phaseoloid legume with the same chromosome number–provide provisional genetic locations for 46,852 SNPs. Linkage disequilibrium decays within 10 kb (based on the provisional genetic locations), consistent with outcrossing reproduction. SNPs and GEMs identify more than 21 marker-trait associations for at least 11 traits. This study demonstrates a holistic approach for mining plant collections to accelerate crop improvement. PMID:27721469
Kernel-Based Measure of Variable Importance for Genetic Association Studies.
Gallego, Vicente; Luz Calle, M; Oller, Ramon
2017-06-17
The identification of genetic variants that are associated with disease risk is an important goal of genetic association studies. Standard approaches perform univariate analysis where each genetic variant, usually Single Nucleotide Polymorphisms (SNPs), is tested for association with disease status. Though many genetic variants have been identified and validated so far using this univariate approach, for most complex diseases a large part of their genetic component is still unknown, the so called missing heritability. We propose a Kernel-based measure of variable importance (KVI) that provides the contribution of a SNP, or a group of SNPs, to the joint genetic effect of a set of genetic variants. KVI can be used for ranking genetic markers individually, sets of markers that form blocks of linkage disequilibrium or sets of genetic variants that lie in a gene or a genetic pathway. We prove that, unlike the univariate analysis, KVI captures the relationship with other genetic variants in the analysis, even when measured at the individual level for each genetic variable separately. This is specially relevant and powerful for detecting genetic interactions. We illustrate the results with data from an Alzheimer's disease study and show through simulations that the rankings based on KVI improve those rankings based on two measures of importance provided by the Random Forest. We also prove with a simulation study that KVI is very powerful for detecting genetic interactions.
ParallABEL: an R library for generalized parallelization of genome-wide association studies.
Sangket, Unitsa; Mahasirimongkol, Surakameth; Chantratita, Wasun; Tandayya, Pichaya; Aulchenko, Yurii S
2010-04-29
Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous acquiring the necessary programming skills to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC) includes 2,062 individuals with 545,080, SNPs' genotyping, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance, and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL.
Palmer, Nicholette D.; Hester, Jessica M.; An, S. Sandy; Adeyemo, Adebowale; Rotimi, Charles; Langefeld, Carl D.; Freedman, Barry I.; Ng, Maggie C.Y.; Bowden, Donald W.
2011-01-01
OBJECTIVE Variation in the transcription factor 7-like 2 (TCF7L2) locus is associated with type 2 diabetes across multiple ethnicities. The aim of this study was to elucidate which variant in TCF7L2 confers diabetes susceptibility in African Americans. RESEARCH DESIGN AND METHODS Through the evaluation of tagging single nucleotide polymorphisms (SNPs), type 2 diabetes susceptibility was limited to a 4.3-kb interval, which contains the YRI (African) linkage disequilibrium (LD) block containing rs7903146. To better define the relationship between type 2 diabetes risk and genetic variation we resequenced this 4.3-kb region in 96 African American DNAs. Thirty-three novel and 13 known SNPs were identified: 20 with minor allele frequencies (MAF) >0.05 and 12 with MAF >0.10. These polymorphisms and the previously identified DG10S478 microsatellite were evaluated in African American type 2 diabetic cases (n = 1,033) and controls (n = 1,106). RESULTS Variants identified from direct sequencing and databases were genotyped or imputed. Fifteen SNPs showed association with type 2 diabetes (P < 0.05) with rs7903146 being the most significant (P = 6.32 × 10−6). Results of imputation, haplotype, and conditional analysis of SNPs were consistent with rs7903146 being the trait-defining SNP. Analysis of the DG10S478 microsatellite, which is outside the 4.3-kb LD block, revealed consistent association of risk allele 8 with type 2 diabetes (odds ratio [OR] = 1.33; P = 0.022) as reported in European populations; however, allele 16 (MAF = 0.016 cases and 0.032 controls) was strongly associated with reduced risk (OR = 0.39; P = 5.02 × 10−5) in contrast with previous studies. CONCLUSIONS In African Americans, these observations suggest that rs7903146 is the trait-defining polymorphism associated with type 2 diabetes risk. Collectively, these results support ethnic differences in type 2 diabetes associations. PMID:20980453
Chu, Winston S.; Das, Swapan Kumar; Wang, Hua; Chan, Juliana C.; Deloukas, Panos; Froguel, Philippe; Baier, Leslie J.; Jia, Weiping; McCarthy, Mark I.; Ng, Maggie C.Y.; Damcott, Coleen; Shuldiner, Alan R.; Zeggini, Eleftheria; Elbein, Steven C.
2009-01-01
Activating transcription factor 6 (ATF6) is located within the region of linkage to type 2 diabetes on chromosome 1q21-q23 and is a key activator of the endoplasmic reticulum stress response. We evaluated 78 single nucleotide polymorphisms (SNPs) spanning >213 kb in 95 people, from which we selected 64 SNPs for evaluation in 191 Caucasian case subjects from Utah and between 165 and 188 control subjects. Six SNPs showed nominal associations with type 2 diabetes (P = 0.001-0.04), including the nonsynonymous SNP rs1058405 (M67V) in exon 3 and rs11579627 in the 3′ flanking region. Only rs1159627 remained significant on permutation testing. The associations were not replicated in 353 African-American case subjects and 182 control subjects, nor were ATF6 SNPs associated with altered insulin secretion or insulin sensitivity in nondiabetic Caucasian individuals. No association with type 2 diabetes was found in a subset of 44 SNPs in Caucasian (n = 2,099), Pima Indian (n = 293), and Chinese (n = 287) samples. Allelic expression imbalance was found in transformed lymphocyte cDNA for 3′ untranslated region variants, thus suggesting cis-acting regulatory variants. ATF6 does not appear to play a major role in type 2 diabetes, but further work is required to identify the cause of the allelic expression imbalance. PMID:17327457
Bekris, Lynn M.; Millard, Steven P.; Galloway, Nichole M.; Vuletic, Simona; Albers, John J.; Li, Ge; Galasko, Douglas R.; DeCarli, Charles; Farlow, Martin R.; Clark, Chris M.; Quinn, Joseph F.; Kaye, Jeffrey A.; Schellenberg, Gerard D.; Tsuang, Debby; Peskind, Elaine R.; Yu, Chang-En
2010-01-01
The ε4 allele of the apolipoprotein E gene (APOE) is associated with increased risk and earlier age at onset in late onset Alzheimer’s disease (AD). Other factors, such as expression level of apolipoprotein E protein (apoE), have been postulated to modify the APOE related risk of developing AD. Multiple loci in and outside of APOE are associated with a high risk of AD. The aim of this exploratory hypothesis generating investigation was to determine if some of these loci predict cerebrospinal fluid (CSF) apoE levels in healthy non-demented subjects. CSF apoE levels were measured from healthy non-demented subjects 21–87 years of age (n = 134). Backward regression models were used to evaluate the influence of 21 SNPs, within and surrounding APOE, on CSF apoE levels while taking into account age, gender, APOE ε4 and correlation between SNPs (linkage disequilibrium). APOE ε4 genotype does not predict CSF apoE levels. Three SNPs within the TOMM40 gene, one APOE promoter SNP and two SNPs within distal APOE enhancer elements (ME1 and BCR) predict CSF apoE levels. Further investigation of the genetic influence of these loci on apoE expression levels in the central nervous system is likely to provide new insight into apoE regulation as well as AD pathogenesis. PMID:18430993
Allelic variation in TLR4 is linked to resistance to Salmonella Enteritidis infection in chickens.
Li, Peng; Wang, Huihua; Zhao, Xingwang; Gou, Zhongyong; Liu, Ranran; Song, Yongmei; Li, Qinghe; Zheng, Maiqing; Cui, Huanxian; Everaert, Nadia; Zhao, Guiping; Wen, Jie
2017-07-01
Salmonella Enteritidis (SE) is a foodborne pathogen that negatively affects both animal and human health. Polymorphisms of the TLR4 gene may affect recognition by Toll-like receptor 4 (TLR4) of bacterial lipopolysaccharide (LPS), leading to differences in host resistance to pathogenic infections. The present study has investigated polymorphic loci of chicken TLR4 (ChTLR4) in ten chicken breeds, electrostatic potentials of mutant structures of TLR4, and a linkage analysis between allelic variation and survival ratio to infection with SE in specific-pathogen-free (SPF) White Leghorns. A total of 19 Single Nucleotide Polymorphisms (SNPs), of which 10 were novel, were found in chicken breeds. Seven newly identified amino acid variants (C68G, G674A, G782A, A896T, T959G, T986A, and A1104C) and previously reported important mutations (G247A, G1028A, C1147T, and A1832G) were demonstrated in the extracellular domain of the ChTLR4 gene. Significant changes in surface electrostatic potential of the ectodomain of TLR4, built by homology modeling, were observed at the Glu83Lys (G247A), Arg298Ser (A896T), Ser368Arg (A1104C), and Gln611Arg (A1832G) substitutions. Linkage analysis showed that one polymorphic locus G247A of TLR4 gene, common in all breeds examined, was significantly associated with increased resistance to SE in SPF White Leghorns chicks (log-rank P-value = 0.04). The genotypes from A1832G SNPs did not show statistically significant survival differences. This study has provided the first direct evidence that G247A substitution in ChTLR4 is associated with increased resistance to Salmonella Enteritidis. © 2017 Poultry Science Association Inc.
Rachakonda, Sivaramakrishna P; Penack, Olaf; Dietrich, Sascha; Blau, Olga; Blau, Igor Wolfgang; Radujkovic, Aleksandar; Isermann, Berend; Ho, Anthony D; Uharek, Lutz; Dreger, Peter; Kumar, Rajiv; Luft, Thomas
2014-10-20
Steroid-refractory graft-versus-host disease (GVHD) is a major and often fatal complication after allogeneic stem-cell transplantation (alloSCT). Although the pathophysiology of steroid refractoriness is not fully understood, evidence is accumulating that endothelial cell stress is involved, and endothelial thrombomodulin (THBD) plays a role in this process. Here we assess whether single-nucleotide polymorphisms (SNPs) within the THBD gene predict outcome after alloSCT. Seven SNPs within the THBD gene were studied (rs1962, rs1042579, rs1042580, rs3176123, rs3176124, rs3176126, and rs3176134) in a training cohort of 306 patients. The relevant genotypes were then validated in an independent cohort (n = 321). In the training cohort, an increased risk of nonrelapse mortality (NRM) was associated with three of seven SNPs tested: rs1962, rs1042579 (in linkage disequilibrium with rs3176123), and rs1042580. When patients were divided into risk groups (one v no high-risk SNP), a strong correlation with NRM was observed (hazard ratio [HR], 2.31; 95% CI, 1.36 to 3.95; P = .002). More specifically, NRM was predicted by THBD SNPs in patients who later developed GVHD (HR, 3.03; 95% CI, 1.61 to 5.68; P < .001) but not in patients without GVHD. In contrast, THBD SNPs did not predict incidence of acute GVHD. Multivariable analyses adjusting for clinical variables confirmed the independent effect of THBD SNPs on NRM. All findings could be reproduced in the validation cohort. THBD SNPs predict mortality of manifest GVHD but not the risk of acquiring GVHD, supporting the hypothesis that endothelial vulnerability contributes to GVHD refractoriness. © 2014 by American Society of Clinical Oncology.
Ozturk, Ayla; Desai, Purnima P; Minster, Ryan L; Dekosky, Steven T; Kamboh, M Ilyas
2005-01-01
Linkage studies suggest the presence of putative risk and/or age-at-onset genes for Alzheimer's disease on Chromosome 10. Recently, a genomic converging approach using a combination of linkage, expression and association studies has reported significant associations of the glutathione S-transferase omega 1 and 2 (GSTO1 and GSTO2) genes and possibly the protease serine 11 (PRSS11) gene on chromosome 10 with age-at-onset, but not risk, for Alzheimer's disease (AD) and Parkinson disease. We investigated the association of the reported three polymorphisms in 990 sporadic late-onset AD cases (26% autopsy confirmed) and 735 controls. In our sample, we found no association either with age-at-onset in AD cases or with disease risk in the case-control cohort. However, haplotype analysis revealed a modest association of one haplotype with AD risk (p = 0.04). Additional markers in these genes need to be screened to explore their role in the etiology of AD.
The extent of linkage disequilibrium in beef cattle breeds using high-density SNP genotypes.
Porto-Neto, Laercio R; Kijas, James W; Reverter, Antonio
2014-03-24
The extent of linkage disequilibrium (LD) between molecular markers impacts genome-wide association studies and implementation of genomic selection. The availability of high-density single nucleotide polymorphism (SNP) genotyping platforms makes it possible to investigate LD at an unprecedented resolution. In this work, we characterised LD decay in breeds of beef cattle of taurine, indicine and composite origins and explored its variation across autosomes and the X chromosome. In each breed, LD decayed rapidly and r2 was less than 0.2 for marker pairs separated by 50 kb. The LD decay curves clustered into three groups of similar LD decay that distinguished the three main cattle types. At short distances between markers (<10 kb), taurine breeds showed higher LD (r2=0.45) than their indicine (r2=0.25) and composite (r2=0.32) counterparts. This higher LD in taurine breeds was attributed to a smaller effective population size and a stronger bottleneck during breed formation. Using all SNPs on only the X chromosome, the three cattle types could still be distinguished. However for taurine breeds, the LD decay on the X chromosome was much faster and the background level much lower than for indicine breeds and composite populations. When using only SNPs that were polymorphic in all breeds, the analysis of the X chromosome mimicked that of the autosomes. The pattern of LD mirrored some aspects of the history of breed populations and showed a sharp decay with increasing physical distance between markers. We conclude that the availability of the HD chip can be used to detect association signals that remained hidden when using lower density genotyping platforms, since LD dropped below 0.2 at distances of 50 kb.
Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.
2015-01-01
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Talmud, Philippa J; Hawe, Emma; Martin, Steve; Olivier, Michael; Miller, George J; Rubin, Edward M; Pennacchio, Len A; Humphries, Steve E
2002-11-15
Since triglycerides (TG) are a major independent risk factor for coronary heart disease, understanding their genetic and environmental determinants is of major importance. Mouse models indicate an inverse relationship between levels of the newly identified apolipoprotein AV (APOAV) and TG concentrations. We have examined the relative influence of human APOA5 variants on plasma lipids, compared to the impact of variation in APOC3 and APOA4 which lie in the same cluster. Single nucleotide polymorphisms (SNPs) in APOA5 (S19W, -1131T>C) and APOA4 (T347S, Q360H) and an APOA4/A5 intergenic T>C SNP were examined in a large study of healthy middle-aged men (n=2808). APOA5 19WW and -1131CC men had 52% and 40% higher TG (P<0.003) compared to common allele homozygotes, respectively, effects which were independent and additive. APOA4 347SS men had 23% lower TG compared to TT men (P<0.002). Haplotype analysis was carried out to identify TG-raising alleles and included, in addition, four previously genotyped APOC3 SNPs (-2845T>G, -482C>T, 1100C>T, and 3238C>G). The major TG-raising alleles were defined by APOA5 W19 and APOC3 -482T. This suggests that the TG-lowering effect of APOA4 S347 might merely reflect the strong negative linkage disequilibrium with the common alleles of these variants. Thus variation in APOA5 is associated with differences in TGs in healthy men, independent of those previously reported for APOC3, while association between APOA4 and TG reflects linkage disequilibrium with these sites. The molecular mechanisms for these effects remain to be determined.
Makina, Sithembile O; Taylor, Jeremy F; van Marle-Köster, Este; Muchadeyi, Farai C; Makgahlela, Mahlako L; MacNeil, Michael D; Maiwashe, Azwihangwisi
2015-01-01
Knowledge on the extent of linkage disequilibrium (LD) in livestock populations is essential to determine the minimum distance between markers required for effective coverage when conducting genome-wide association studies (GWAS). This study evaluated the extent of LD, persistence of allelic phase and effective population size (Ne) for four Sanga cattle breeds in South Africa including the Afrikaner (n = 44), Nguni (n = 54), Drakensberger (n = 47), and Bonsmara breeds (n = 46), using Angus (n = 31) and Holstein (n = 29) as reference populations. We found that moderate LD extends up to inter-marker distances of 40-60 kb in Angus (0.21) and Holstein (0.21) and up to 100 kb in Afrikaner (0.20). This suggests that genomic selection and association studies performed within these breeds using an average inter-marker r (2)≥ 0.20 would require about 30,000-50,000 SNPs. However, r (2)≥ 0.20 extended only up to 10-20 kb in the Nguni and Drakensberger and 20-40 kb in the Bonsmara indicating that 75,000 to 150,000 SNPs would be necessary for GWAS in these breeds. Correlation between alleles at contiguous loci indicated that phase was not strongly preserved between breeds. This suggests the need for breed-specific reference populations in which a much greater density of markers should be scored to identify breed specific haplotypes which may then be imputed into multi-breed commercial populations. Analysis of effective population size based on the extent of LD, revealed Ne = 95 (Nguni), Ne = 87 (Drakensberger), Ne = 77 (Bonsmara), and Ne = 41 (Afrikaner). Results of this study form the basis for implementation of genomic selection programs in the Sanga breeds of South Africa.
Makina, Sithembile O.; Taylor, Jeremy F.; van Marle-Köster, Este; Muchadeyi, Farai C.; Makgahlela, Mahlako L.; MacNeil, Michael D.; Maiwashe, Azwihangwisi
2015-01-01
Knowledge on the extent of linkage disequilibrium (LD) in livestock populations is essential to determine the minimum distance between markers required for effective coverage when conducting genome-wide association studies (GWAS). This study evaluated the extent of LD, persistence of allelic phase and effective population size (Ne) for four Sanga cattle breeds in South Africa including the Afrikaner (n = 44), Nguni (n = 54), Drakensberger (n = 47), and Bonsmara breeds (n = 46), using Angus (n = 31) and Holstein (n = 29) as reference populations. We found that moderate LD extends up to inter-marker distances of 40–60 kb in Angus (0.21) and Holstein (0.21) and up to 100 kb in Afrikaner (0.20). This suggests that genomic selection and association studies performed within these breeds using an average inter-marker r2≥ 0.20 would require about 30,000–50,000 SNPs. However, r2≥ 0.20 extended only up to 10–20 kb in the Nguni and Drakensberger and 20–40 kb in the Bonsmara indicating that 75,000 to 150,000 SNPs would be necessary for GWAS in these breeds. Correlation between alleles at contiguous loci indicated that phase was not strongly preserved between breeds. This suggests the need for breed-specific reference populations in which a much greater density of markers should be scored to identify breed specific haplotypes which may then be imputed into multi-breed commercial populations. Analysis of effective population size based on the extent of LD, revealed Ne = 95 (Nguni), Ne = 87 (Drakensberger), Ne = 77 (Bonsmara), and Ne = 41 (Afrikaner). Results of this study form the basis for implementation of genomic selection programs in the Sanga breeds of South Africa. PMID:26648975
Stahl, Bethany A.; Gross, Joshua B.; Speiser, Daniel I.; Oakley, Todd H.; Patel, Nipam H.; Gould, Douglas B.; Protas, Meredith E.
2015-01-01
Cave animals, compared to surface-dwelling relatives, tend to have reduced eyes and pigment, longer appendages, and enhanced mechanosensory structures. Pressing questions include how certain cave-related traits are gained and lost, and if they originate through the same or different genetic programs in independent lineages. An excellent system for exploring these questions is the isopod, Asellus aquaticus. This species includes multiple cave and surface populations that have numerous morphological differences between them. A key feature is that hybrids between cave and surface individuals are viable, which enables genetic crosses and linkage analyses. Here, we advance this system by analyzing single animal transcriptomes of Asellus aquaticus. We use high throughput sequencing of non-normalized cDNA derived from the head of a surface-dwelling male, the head of a cave-dwelling male, the head of a hybrid male (produced by crossing a surface individual with a cave individual), and a pooled sample of surface embryos and hatchlings. Assembling reads from surface and cave head RNA pools yielded an integrated transcriptome comprised of 23,984 contigs. Using this integrated assembly as a reference transcriptome, we aligned reads from surface-, cave- and hybrid- head tissue and pooled surface embryos and hatchlings. Our approach identified 742 SNPs and placed four new candidate genes to an existing linkage map for A. aquaticus. In addition, we examined SNPs for allele-specific expression differences in the hybrid individual. All of these resources will facilitate identification of genes and associated changes responsible for cave adaptation in A. aquaticus and, in concert with analyses of other species, will inform our understanding of the evolutionary processes accompanying adaptation to the subterranean environment. PMID:26462237
USDA-ARS?s Scientific Manuscript database
In this study, we generated a linkage map containing 1,151,856 high quality SNPs between Mo17 and B73, which were verified in the maize intermated B73'×'Mo17 (IBM) Syn10 population. This resource is an excellent complement to existing maize genetic maps available in an online database (iPlant, http:...
Analysis of Plasmodium falciparum diversity in natural infections by deep sequencing
Manske, Magnus; Miotto, Olivo; Campino, Susana; Auburn, Sarah; Almagro-Garcia, Jacob; Maslen, Gareth; O’Brien, Jack; Djimde, Abdoulaye; Doumbo, Ogobara; Zongo, Issaka; Ouedraogo, Jean-Bosco; Michon, Pascal; Mueller, Ivo; Siba, Peter; Nzila, Alexis; Borrmann, Steffen; Kiara, Steven M.; Marsh, Kevin; Jiang, Hongying; Su, Xin-Zhuan; Amaratunga, Chanaki; Fairhurst, Rick; Socheat, Duong; Nosten, Francois; Imwong, Mallika; White, Nicholas J.; Sanders, Mandy; Anastasi, Elisa; Alcock, Dan; Drury, Eleanor; Oyola, Samuel; Quail, Michael A.; Turner, Daniel J.; Rubio, Valentin Ruano; Jyothi, Dushyanth; Amenga-Etego, Lucas; Hubbart, Christina; Jeffreys, Anna; Rowlands, Kate; Sutherland, Colin; Roper, Cally; Mangano, Valentina; Modiano, David; Tan, John C.; Ferdig, Michael T.; Amambua-Ngwa, Alfred; Conway, David J.; Takala-Harrison, Shannon; Plowe, Christopher V.; Rayner, Julian C.; Rockett, Kirk A.; Clark, Taane G.; Newbold, Chris I.; Berriman, Matthew; MacInnis, Bronwyn; Kwiatkowski, Dominic P.
2013-01-01
Malaria elimination strategies require surveillance of the parasite population for genetic changes that demand a public health response, such as new forms of drug resistance. 1,2 Here we describe methods for large-scale analysis of genetic variation in Plasmodium falciparum by deep sequencing of parasite DNA obtained from the blood of patients with malaria, either directly or after short term culture. Analysis of 86,158 exonic SNPs that passed genotyping quality control in 227 samples from Africa, Asia and Oceania provides genome-wide estimates of allele frequency distribution, population structure and linkage disequilibrium. By comparing the genetic diversity of individual infections with that of the local parasite population, we derive a metric of within-host diversity that is related to the level of inbreeding in the population. An open-access web application has been established for exploration of regional differences in allele frequency and of highly differentiated loci in the P. falciparum genome. PMID:22722859
Identification of Genes Promoting Skin Youthfulness by Genome-Wide Association Study
Chang, Anne L.S.; Atzmon, Gil; Bergman, Aviv; Brugmann, Samantha; Atwood, Scott X; Chang, Howard Y; Barzilai, Nir
2014-01-01
To identify genes that promote facial skin youthfulness (SY), a genome-wide association study on an Ashkenazi Jewish discovery group (n=428) was performed using Affymetrix 6.0 Single-Nucleotide Polymorphism (SNP) Array. After SNP quality controls, 901,470 SNPs remained for analysis. The eigenstrat method showed no stratification. Cases and controls were identified by global facial skin aging severity including intrinsic and extrinsic parameters. Linear regression adjusted for age and gender, with no significant differences in smoking history, body mass index, menopausal status, or personal or family history of centenarians. Six SNPs met the Bonferroni threshold with Pallele<10−8; two of these six had Pgenotype<10−8. Quantitative trait loci mapping confirmed linkage disequilibrium. The six SNPs were interrogated by MassARRAY in a replication group (n=436) with confirmation of rs6975107, an intronic region of KCND2 (potassium voltage-gated channel, Shal-related family member 2) (Pgenotype=0.023). A second replication group (n=371) confirmed rs318125, downstream of DIAPH2 (diaphanous homolog 2 (Drosophila)) (Pallele=0.010, Pgenotype=0.002) and rs7616661, downstream of EDEM1 (ER degradation enhancer, mannosidase α-like 1) (Pgenotype=0.042). DIAPH2 has been associated with premature ovarian insufficiency, an aging phenotype in humans. EDEM1 associates with lifespan in animal models, although not humans. KCND2 is expressed in human skin, but has not been associated with aging. These genes represent new candidate genes to study the molecular basis of healthy skin aging. PMID:24037343
Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs.
Laucou, Valérie; Launay, Amandine; Bacilieri, Roberto; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Bérard, Aurélie; Chauveau, Aurélie; de Andrés, Maria Teresa; Hausmann, Ludger; Ibáñez, Javier; Le Paslier, Marie-Christine; Maghradze, David; Martinez-Zapater, José Miguel; Maul, Erika; Ponnaiah, Maharajah; Töpfer, Reinhard; Péros, Jean-Pierre; Boursiquot, Jean-Michel
2018-01-01
Grapevine is a very important crop species that is mainly cultivated worldwide for fruits, wine and juice. Identification of the genetic bases of performance traits through association mapping studies requires a precise knowledge of the available diversity and how this diversity is structured and varies across the whole genome. An 18k SNP genotyping array was evaluated on a panel of Vitis vinifera cultivars and we obtained a data set with no missing values for a total of 10207 SNPs and 783 different genotypes. The average inter-SNP spacing was ~47 kbp, the mean minor allele frequency (MAF) was 0.23 and the genetic diversity in the sample was high (He = 0.32). Fourteen SNPs, chosen from those with the highest MAF values, were sufficient to identify each genotype in the sample. Parentage analysis revealed 118 full parentages and 490 parent-offspring duos, thus confirming the close pedigree relationships within the cultivated grapevine. Structure analyses also confirmed the main divisions due to an eastern-western gradient and human usage (table vs. wine). Using a multivariate approach, we refined the structure and identified a total of eight clusters. Both the genetic diversity (He, 0.26-0.32) and linkage disequilibrium (LD, 28.8-58.2 kbp) varied between clusters. Despite the short span LD, we also identified some non-recombining haplotype blocks that may complicate association mapping. Finally, we performed a genome-wide association study that confirmed previous works and also identified new regions for important performance traits such as acidity. Taken together, all the results contribute to a better knowledge of the genetics of the cultivated grapevine.
Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs
Launay, Amandine; Bacilieri, Roberto; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Bérard, Aurélie; Chauveau, Aurélie; de Andrés, Maria Teresa; Maghradze, David; Maul, Erika; Ponnaiah, Maharajah; Töpfer, Reinhard; Péros, Jean-Pierre; Boursiquot, Jean-Michel
2018-01-01
Grapevine is a very important crop species that is mainly cultivated worldwide for fruits, wine and juice. Identification of the genetic bases of performance traits through association mapping studies requires a precise knowledge of the available diversity and how this diversity is structured and varies across the whole genome. An 18k SNP genotyping array was evaluated on a panel of Vitis vinifera cultivars and we obtained a data set with no missing values for a total of 10207 SNPs and 783 different genotypes. The average inter-SNP spacing was ~47 kbp, the mean minor allele frequency (MAF) was 0.23 and the genetic diversity in the sample was high (He = 0.32). Fourteen SNPs, chosen from those with the highest MAF values, were sufficient to identify each genotype in the sample. Parentage analysis revealed 118 full parentages and 490 parent-offspring duos, thus confirming the close pedigree relationships within the cultivated grapevine. Structure analyses also confirmed the main divisions due to an eastern-western gradient and human usage (table vs. wine). Using a multivariate approach, we refined the structure and identified a total of eight clusters. Both the genetic diversity (He, 0.26–0.32) and linkage disequilibrium (LD, 28.8–58.2 kbp) varied between clusters. Despite the short span LD, we also identified some non-recombining haplotype blocks that may complicate association mapping. Finally, we performed a genome-wide association study that confirmed previous works and also identified new regions for important performance traits such as acidity. Taken together, all the results contribute to a better knowledge of the genetics of the cultivated grapevine. PMID:29420602
Genetic variation in ORM1-like 3 (ORMDL3) and gasdermin-like (GSDML) and childhood asthma.
Wu, H; Romieu, I; Sienra-Monge, J-J; Li, H; del Rio-Navarro, B E; London, S J
2009-04-01
A genome-wide association study identified ORM1-like 3 (orosomucoid 1-like 3, ORMDL3) as an asthma candidate gene. Single nucleotide polymorphisms (SNPs) in the region including ORMDL3 on chromosome 17q21 were related to childhood asthma risk and ORMDL3 expression levels in Europeans. We examined whether polymorphisms in ORMDL3 and the adjacent gasdermin-like (GSDML) gene associated with asthma in the genome-wide association study are related to childhood asthma and atopy in a Mexico City population. We genotyped rs4378650 in ORMDL3 and rs7216389 in GSDML in 615 nuclear families consisting of asthmatic children aged 4-17 years and their parents. Atopy was determined by skin prick tests to 25 aeroallergens. Individuals carrying the C allele of rs4378650 or the T allele of rs7216389 had increased risk of asthma [relative risk (RR) = 1.73, 95% confidence interval (CI) 1.19-2.53, P = 0.003 for one or two copies of rs4378650 C, and RR = 1.64, 95% CI 1.12-2.38, P = 0.009 for one or two copies of rs7216389 T). Linkage disequilibrium between the two SNPs was high (r(2) = 0.92). Neither of the SNPs was associated with the degree of atopy. A meta-analysis of five published studies on rs7216389 in nine populations gave an odds ratio for asthma of 1.44 (95% CI, 1.35-1.54, P < 0.00001). Our results and the meta-analysis provide evidence to confirm the finding from a recent genome-wide association study that polymorphisms in ORMDL3 and the adjacent GSDML may contribute to childhood asthma.
Genetic variation in ORM1-like 3 (ORMDL3) and gasdermin-like (GSDML) and childhood asthma
Wu, H.; Romieu, I.; Sienra-Monge, J.-J.; Li, H.; del Rio-Navarro, B. E.; London, S. J.
2009-01-01
Background A genome-wide association study identified ORM1-like 3 (orosomucoid 1-like 3, ORMDL3) as an asthma candidate gene. Single nucleotide polymorphisms (SNPs) in the region including ORMDL3 on chromosome 17q21 were related to childhood asthma risk and ORMDL3 expression levels in Europeans. Objective We examined whether polymorphisms in ORMDL3 and the adjacent gasdermin-like (GSDML) gene associated with asthma in the genome-wide association study are related to childhood asthma and atopy in a Mexico City population. Methods We genotyped rs4378650 in ORMDL3 and rs7216389 in GSDML in 615 nuclear families consisting of asthmatic children aged 4–17 years and their parents. Atopy was determined by skin prick tests to 25 aeroallergens. Results Individuals carrying the C allele of rs4378650 or the T allele of rs7216389 had increased risk of asthma [relative risk (RR) = 1.73, 95% con.- dence interval (CI) 1.19–2.53, P = 0.003 for one or two copies of rs4378650 C, and RR = 1.64, 95% CI 1.12–2.38, P = 0.009 for one or two copies of rs7216389 T). Linkage disequilibrium between the two SNPs was high (r2 = 0.92). Neither of the SNPs was associated with the degree of atopy. A meta-analysis of five published studies on rs7216389 in nine populations gave an odds ratio for asthma of 1.44 (95% CI, 1.35–1.54, P < 0.00001). Conclusions Our results and the meta-analysis provide evidence to confirm the finding from a recent genome-wide association study that polymorphisms in ORMDL3 and the adjacent GSDML may contribute to childhood asthma. PMID:19133921
Uncoupling protein 2 gene polymorphisms are associated with obesity
2012-01-01
Background Uncoupling protein 2 (UCP2) gene polymorphisms have been reported as genetic risk factors for obesity and type 2 diabetes mellitus (T2DM). We examined the association of commonly observed UCP2 G(−866)A (rs659366) and Ala55Val (C > T) (rs660339) single nucleotide polymorphisms (SNPs) with obesity, high fasting plasma glucose, and serum lipids in a Balinese population. Methods A total of 603 participants (278 urban and 325 rural subjects) were recruited from Bali Island, Indonesia. Fasting plasma glucose (FPG), triglyceride (TG), high density lipoprotein cholesterol (HDL-C), low density lipoprotein cholesterol (LDL-C) and total cholesterol (TC) were measured. Obesity was determined based on WHO classifications for adult Asians. Participants were genotyped for G(−866)A and Ala55Val polymorphisms of the UCP2 gene. Results Obesity prevalence was higher in urban subjects (51%) as compared to rural subjects (23%). The genotype, minor allele (MAF), and heterozygosity frequencies were similar between urban and rural subjects for both SNPs. All genotype frequencies were in Hardy-Weinberg equilibrium. A combined analysis of genotypes and environment revealed that the urban subjects carrying the A/A genotype of the G(−866)A SNP have higher BMI than the rural subjects with the same genotype. Since the two SNPs showed strong linkage disequilibrium (D’ = 0.946, r2 = 0.657), a haplotype analysis was performed. We found that the AT haplotype was associated with high BMI only when the urban environment was taken into account. Conclusions We have demonstrated the importance of environmental settings in studying the influence of the common UCP2 gene polymorphisms in the development of obesity in a Balinese population. PMID:22533685
Mao, Yongjiang; Zhu, Xiaorui; Xing, Shiyu; Zhang, Meirong; Zhang, Huimin; Wang, Xiaolong; Karrow, Niel; Yang, Liguo; Yang, Zhangping
2015-12-01
Lactoferrin is an iron-binding protein found in cow's milk that plays an important role in preventing mastitis caused by intramammary infection. In this study, 20 Chinese Holstein cows were selected randomly for PCR amplification and sequencing of the bovine lactoferrin gene promoter region and used for SNP discovery in the region between nucleotide positions -461 to -132. Three SNPs (-270T>C, -190G>A and -156A>G) were identified in bovine lactoferrin, then Chinese Holstein cows (n=866) were genotyped using Sequenom MassARRAY (Sequenom Inc., San Diego, CA) based on the previous SNP information in this study, and the associations between SNPs or haplotype and milk somatic cell score (SCS) and production traits were analyzed by the least squares method in the GLM procedure of SAS. SNPs -270T>C and -156A>G showed close linkage disequilibrium (r(2)=0.76). The SNP -190G>A showed a significant association with SCS, and individuals with genotype GG had higher SCS than genotypes AG and AA. Associations were found between the SNPs -270T>C and -190G>A with SCS and the milk composition. The software MatInspector revealed that these SNPs were located within several potential transcription factor binding sites, including NF-κB p50, KLF7 and SP1, and may alter gene expression, but further investigation will be required to elucidate the biological and practical relevance of these SNPs. Copyright © 2015 Elsevier Ltd. All rights reserved.
Kurz, Thorsten; Hoffjan, Sabine; Hayes, M Geoffrey; Schneider, Dan; Nicolae, Raluca; Heinzmann, Andrea; Jerkic, Sylvija P; Parry, Rod; Cox, Nancy J; Deichmann, Klaus A; Ober, Carole
2006-08-01
Genome-wide linkage scans to identify asthma susceptibility loci have revealed many linked regions, including a broad region on chromosome 5p. To identify a 5p-linked asthma or bronchial hyperresponsiveness (BHR) locus. We performed fine mapping and positional candidate studies of this region in the Hutterites and an outbred case-control sample from Germany by genotyping 89 single nucleotide polymorphisms (SNPs) in 22 genes. SNP and haplotype analyses were performed. Three genes in a distal region (zinc finger RNA binding protein [ZFR], natriuretic peptide receptor C, and a disintegrin and metalloproteinase domain with thrombospondin type 1 motif [ADAMTS12]) were associated with BHR, whereas 4 genes in a proximal region (prolactin receptor, IL-7 receptor [IL7R], leukemia inhibitory factor receptor [LIFR], and prostaglandin E4 receptor [PTGER4]) were associated with asthma symptoms in the Hutterites. Furthermore, nearly the entire original linkage signal in the Hutterites was generated by individuals who had the risk-associated alleles in ZFR3, natriuretic peptide receptor C, ADAMTS12, LIFR, and PTGER4. Variation in ADAMTS12, IL7R, and PTGER4 were also associated with asthma in the outbred Germans, and the frequencies of long-range haplotypes composed of SNPs at ZFR, ADAMTS12, IL7R, LIFR, and PTGER4 were significantly different between both the German and Hutterite cases and controls. There is little linkage disequilbrium between alleles in these 2 regions in either population. These results suggest that a broad region on 5p, separated by >9 Mb, harbors at least 2 and possibly 5 asthma or BHR susceptibility loci. These findings are consistent with the hypothesis that regions providing evidence for linkage in multiple populations may, in fact, house more than 1 susceptibility locus, as appears to be the case for the linked region on 5p. Identifying asthma or BHR genes could lead to novel therapeutic approaches.
Population differentiation in allele frequencies of obesity-associated SNPs.
Mao, Linyong; Fang, Yayin; Campbell, Michael; Southerland, William M
2017-11-10
Obesity is emerging as a global health problem, with more than one-third of the world's adult population being overweight or obese. In this study, we investigated worldwide population differentiation in allele frequencies of obesity-associated SNPs (single nucleotide polymorphisms). We collected a total of 225 obesity-associated SNPs from a public database. Their population-level allele frequencies were derived based on the genotype data from 1000 Genomes Project (phase 3). We used hypergeometric model to assess whether the effect allele at a given SNP is significantly enriched or depleted in each of the 26 populations surveyed in the 1000 Genomes Project with respect to the overall pooled population. Our results indicate that 195 out of 225 SNPs (86.7%) possess effect alleles significantly enriched or depleted in at least one of the 26 populations. Populations within the same continental group exhibit similar allele enrichment/depletion patterns whereas inter-continental populations show distinct patterns. Among the 225 SNPs, 15 SNPs cluster in the first intron region of the FTO gene, which is a major gene associated with body-mass index (BMI) and fat mass. African populations exhibit much smaller blocks of LD (linkage disequilibrium) among these15 SNPs while European and Asian populations have larger blocks. To estimate the cumulative effect of all variants associated with obesity, we developed the personal composite genetic risk score for obesity. Our results indicate that the East Asian populations have the lowest averages of the composite risk scores, whereas three European populations have the highest averages. In addition, the population-level average of composite genetic risk scores is significantly correlated (R 2 = 0.35, P = 0.0060) with obesity prevalence. We have detected substantial population differentiation in allele frequencies of obesity-associated SNPs. The results will help elucidate the genetic basis which may contribute to population disparities in obesity prevalence.
Chung, Chia-Min; Wang, Ruey-Yun; Fann, Cathy S. J.; Chen, Jaw-Wen; Jong, Yuh-Shiun; Jou, Yuh-Shan; Yang, Hsin-Chou; Kang, Chih-Sen; Chen, Chien-Chung; Chang, Huan-Cheng; Pan, Wen-Harn
2013-01-01
Angiotensin-converting enzyme (ACE) has been implicated in multiple biological system, particularly cardiovascular diseases. However, findings associating ACE insertion/deletion polymorphism with hypertension or other related traits are inconsistent. Therefore, in a two-stage approach, we aimed to fine-map ACE in order to narrow-down the function-specific locations. We genotyped 31 single nucleotide polymorphisms (SNPs) of ACE from 1168 individuals from 305 young-onset (age ≤40) hypertension pedigrees, and found four linkage disequilibrium (LD) blocks. A tag-SNP, rs1800764 on LD block 2, upstream of and near the ACE promoter, was significantly associated with young-onset hypertension (p = 0.04). Tag-SNPs on all LD blocks were significantly associated with ACE activity (p-value: 10–16 to <10–33). The two regions most associated with ACE activity were found between exon13 and intron18 and between intron 20 and 3′UTR, as revealed by measured haplotype analysis. These two major QTLs of ACE activity and the moderate effect variant upstream of ACE promoter for young-onset hypertension were replicated by another independent association study with 842 subjects. PMID:23469169
De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A
2002-06-01
Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource ofphysically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency ofthe polymorphisms. We also discuss an advanced, high-performance SNP assay chemisty--a new generation of the TaqMan probe-based, 5' nuclease assay-and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.
Pan, Wei; Song, Im-Sook; Shin, Ho-Jung; Kim, Min-Hye; Choi, Yeong-Lim; Lim, Su-Jeong; Kim, Woo-Young; Lee, Sang-Seop; Shin, Jae-Gook
2011-06-01
Genetic variants of Na(+)-taurocholate co-transporting polypeptide (NTCP; SLC10A1) and ileal apical sodium-dependent bile acid transporter (ASBT; SLC10A2), which greatly contribute to bile acid homeostasis, were extensively explored in the Korean population and functional variants of NTCP were compared among Asian populations. From direct DNA sequencing, six SNPs were identified in the SLC10A1 gene and 14 SNPs in the SLC10A2 gene. Three of seven coding variants were non-synonymous SNPs: two variants from SLC10A1 (A64T, S267F) and one from SLC10A2 (A171S). No linkage was analysed in the SLC10A1 gene because of low frequencies of genetic variants, and the SLC10A2 gene was composed of two separated linkage disequilibrium blocks contrary to the white population. The stably transfected NTCP-A64T variant showed significantly decreased uptakes of taurocholate and rosuvastatin compared with wild-type NTCP. The decreased taurocholate uptake and increased rosuvastatin uptake were shown in the NTCP-S267F variant. The allele frequencies of these functional variants were 1.0% and 3.1%, respectively, in a Korean population. However, NTCP-A64T was not found in Chinese and Vietnamese subjects. The frequency distribution of NTCP-S267F in Koreans was significantly lower than those in Chinese and Vietnamese populations. Our data suggest that NTCP-A64T and -S267F variants cause substrate-dependent functional change in vitro, and show ethnic difference in their allelic frequencies among Asian populations although the clinical relevance of these variants is remained to be evaluated.
Song, Bao-Hua; Windsor, Aaron J.; Schmid, Karl J.; Ramos-Onsins, Sebastian; Schranz, M. Eric; Heidel, Andrew J.; Mitchell-Olds, Thomas
2009-01-01
Information about polymorphism, population structure, and linkage disequilibrium (LD) is crucial for association studies of complex trait variation. However, most genomewide studies have focused on model systems, with very few analyses of undisturbed natural populations. Here, we sequenced 86 mapped nuclear loci for a sample of 46 genotypes of Boechera stricta and two individuals of B. holboellii, both wild relatives of Arabidopsis. Isolation by distance was significant across the species range of B. stricta, and three geographic groups were identified by structure analysis, principal coordinates analysis, and distance-based phylogeny analyses. The allele frequency spectrum indicated a genomewide deviation from an equilibrium neutral model, with silent nucleotide diversity averaging 0.004. LD decayed rapidly, declining to background levels in ∼10 kb or less. For tightly linked SNPs separated by <1 kb, LD was dependent on the reference population. LD was lower in the specieswide sample than within populations, suggesting that low levels of LD found in inbreeding species such as B. stricta, Arabidopsis thaliana, and barley may result from broad geographic sampling that spans heterogeneous genetic groups. Finally, analyses also showed that inbreeding B. stricta and A. thaliana have ∼45% higher recombination per kilobase than outcrossing A. lyrata. PMID:19104077
Underwood, S L; Christoforou, A; Thomson, P A; Wray, N R; Tenesa, A; Whittaker, J; Adams, R A; Le Hellard, S; Morris, S W; Blackwood, D H R; Muir, W J; Porteous, D J; Evans, K L
2006-04-01
The orphan G protein-coupled receptor 78 (GPR78) gene lies within a region of chromosome 4p where we have previously shown linkage to bipolar affective disorder (BPAD) in a large Scottish family. GPR78 was screened for single-nucleotide polymorphisms (SNPs) and a linkage disequilibrium map was constructed. Six tagging SNPs were selected and tested for association on a sample of 377 BPAD, 392 schizophrenia (SCZ) and 470 control individuals. Using standard chi(2) statistics and a backwards logistic regression approach to adjust for the effect of sex, SNP rs1282, located approximately 3 kb upstream of the coding region, was identified as a potentially important variant in SCZ (chi(2) P=0.044; LRT P=0.065). When the analysis was restricted to females, the strength of association increased to an uncorrected allele P-value of 0.015 (odds ratios (OR)=1.688, 95% confidence intervals (CI): 1.104-2.581) and uncorrected genotype P-value of 0.015 (OR=5.991, 95% CI: 1.545-23.232). Under the recessive model, the genotype P-value improved further to 0.005 (OR=5.618, 95% CI: 1.460-21.617) and remained significant after correcting for multiple testing (P=0.017). No single-marker association was detected in the SCZ males, in the BPAD individuals or with any other SNP. Haplotype analysis of the case-control samples revealed several global and individual haplotypes, with P-values <0.05, all but one of which contained SNP rs1282. After correcting for multiple testing, two haplotypes remained significant in both the female BPAD individuals (P=0.038 and 0.032) and in the full sample of affected female individuals (P=0.044 and 0.033). Our results provide preliminary evidence for the involvement of GPR78 in susceptibility to BPAD and SCZ in the Scottish population. Molecular Psychiatry (2006) 11, 384-394. doi:10.1038/sj.mp.4001786; published online 3 January 2006.
Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca e; Mundim, Gabriel Borges
2016-01-01
Abstract The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis. PMID:27007903
Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca E; Mundim, Gabriel Borges
2016-03-01
The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis.
Yan, H; Guo, Y; Yang, T-L; Zhao, L-J; Deng, H-W
2012-08-06
The cytochrome P450c17α gene (CYP17) encodes a key biosynthesis enzyme of estrogen, which is critical in regulating adipogenesis and adipocyte development in humans. We therefore hypothesized that CYP17 is a candidate gene for predicting obesity. In order to test this hypothesis, we performed a family-based association test to investigate the relationship between the CYP17 gene and obesity phenotypes in a large sample comprising 1873 subjects from 405 Caucasian nuclear families of European origin recruited by the Osteoporosis Research Center of Creighton University, USA. Both single SNPs and haplotypes were tested for associations with obesity-related phenotypes, including body mass index (BMI) and fat mass. We identified three SNPs to be significantly associated with BMI, including rs3740397, rs6163, and rs619824. We further characterized the linkage disequilibrium structure for CYP17 and found that the whole CYP17 gene was located in a single-linkage disequilibrium block. This block was observed to be significantly associated with BMI. A major haplotype in this block was significantly associated with both BMI and fat mass. In conclusion, we suggest that the CYP17 gene has an effect on obesity in the Caucasian population. Further independent studies will be needed to confirm our findings.
A second generation human haplotype map of over 3.1 million SNPs.
Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John
2007-10-18
We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.
Zhang, Hengyou; Song, Qijian; Griffin, Joshua D; Song, Bao-Hua
2017-12-01
The soybean cyst nematode (SCN) is one of the most destructive pathogens of soybean plants worldwide. Host-plant resistance is an environmentally friendly method to mitigate SCN damage. To date, the resistant soybean cultivars harbor limited genetic variation, and some are losing resistance. Thus, a better understanding of the genetic mechanisms of the SCN resistance, as well as developing diverse resistant soybean cultivars, is urgently needed. In this study, a genome-wide association study (GWAS) was conducted using 1032 wild soybean (Glycine soja) accessions with over 42,000 single-nucleotide polymorphisms (SNPs) to understand the genetic architecture of G. soja resistance to SCN race 1. Ten SNPs were significantly associated with the response to race 1. Three SNPs on chromosome 18 were localized within the previously identified quantitative trait loci (QTLs), and two of which were localized within a strong linkage disequilibrium block encompassing a nucleotide-binding (NB)-ARC disease resistance gene (Glyma.18G102600). Genes encoding methyltransferases, the calcium-dependent signaling protein, the leucine-rich repeat kinase family protein, and the NB-ARC disease resistance protein, were identified as promising candidate genes. The identified SNPs and candidate genes can not only shed light on the molecular mechanisms underlying SCN resistance, but also can facilitate soybean improvement employing wild genetic resources.
A ddRAD Based Linkage Map of the Cultivated Strawberry, Fragaria xananassa
Davik, Jahn; Sargent, Daniel James; Brurberg, May Bente; Lien, Sigbjørn; Kent, Matthew; Alsheikh, Muath
2015-01-01
The cultivated strawberry (Fragaria ×ananassa Duch.) is an allo-octoploid considered difficult to disentangle genetically due to its four relatively similar sub-genomic chromosome sets. This has been alleviated by the recent release of the strawberry IStraw90 whole genome genotyping array. However, array resolution relies on the genotypes used in the array construction and may be of limited general use. SNP detection based on reduced genomic sequencing approaches has the potential of providing better coverage in cases where the studied genotypes are only distantly related from the SNP array’s construction foundation. Here we have used double digest restriction-associated DNA sequencing (ddRAD) to identify SNPs in a 145 seedling F1 hybrid population raised from the cross between the cultivars Sonata (♀) and Babette (♂). A linkage map containing 907 markers which spanned 1,581.5 cM across 31 linkage groups representing the 28 chromosomes of the species. Comparing the physical span of the SNP markers with the F. vesca genome sequence, the linkage groups resolved covered 79% of the estimated 830 Mb of the F. ×ananassa genome. Here, we have developed the first linkage map for F. ×ananassa using ddRAD and show that this technique and other related techniques are useful tools for linkage map development and downstream genetic studies in the octoploid strawberry. PMID:26398886
Haralambieva, Iana H.; Dhiman, Neelam; Ovsyannikova, Inna G.; Vierkant, Robert A.; Pankratz, V. Shane; Jacobson, Robert M.; Poland, Gregory A.
2010-01-01
Interferon (IFN)-induced antiviral genes are crucial players in innate antiviral defense and potential determinants of immune response heterogeneity. We selected 114 candidate SNPs from 12 antiviral genes using an LD tagSNP selection approach and genotyped them in a cohort of 738 schoolchildren immunized with two doses of rubella vaccine. Associations between SNPs/haplotypes and rubella virus-specific immune measures were assessed using linear regression methodologies. We identified 23 significant associations (p<0.05) between polymorphisms within the 2′-5′-oligoadenylate synthetase (OAS) gene cluster, and rubella virus-specific IL-2, IL-10, IL-6 secretion and antibody levels. The minor allele variants of three OAS1 SNPs (rs3741981/Ser162Gly, rs1051042/Thr361Arg, rs2660), located in a linkage disequilibrium block of functional importance, were significantly associated with an increase in rubella virus-specific IL-2/Th1 response (p≤0.024). Seven OAS1 and OAS3 promoter/regulatory SNPs were similarly associated with IL-2 secretion. Importantly, two SNPs (rs3741981 and rs10774670), independently cross-regulated rubella virus-specific IL-10 secretion levels (p≤0.031). Furthermore, both global tests and individual haplotype analyses revealed significant associations between OAS1 haplotypes and rubella virus-specific cytokine secretion. Our results suggest that innate immunity and OAS genetic variations are likely involved in modulating the magnitude and quality of the adaptive immune responses to live attenuated rubella vaccine. PMID:20079393
Singh, Kanhaiya; Agrawal, Neeraj K.; Gupta, Sanjeev K.
2013-01-01
The Toll-Like receptor 4 (TLR4) plays an important role in immunity, tissue repair, and regeneration. The objective of the present work was to evaluate the association of TLR4 single nucleotide polymorphisms (SNPs) rs4986790, rs4986791, rs11536858 (merged into rs10759931), rs1927911, and rs1927914 with increased diabetic foot ulcer (DFU) risk in patients with type 2 diabetes mellitus (T2DM). PCR-RFLP was used for genotyping TLR4 SNPs in 125 T2DM patients with DFU and 130 controls. The haplotypes and linkage disequilibrium between the SNPs were determined using Haploview software. Multivariate linear regression (MLR) and artificial neural network (ANN) modeling was done to observe their predictability for the risk of DFU in T2DM patients. Risk genotypes of all SNPs except rs1927914 were significantly associated with DFU. Haplotype ACATC (P value = 9.3E − 5) showed strong association with DFU risk. Two haplotypes ATATC (P value = 0.0119) and ATGTT (P value = 0.0087) were found to be protective against DFU. In conclusion TLR4 SNPs and their haplotypes may increase the risk of impairment of wound healing in T2DM patients. ANN model (83%) is found to be better than the MLR model (76%) and can be used as a tool for the DFU risk assessment in T2DM patients. PMID:23936790
Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr.
Privé, Florian; Aschard, Hugues; Ziyatdinov, Andrey; Blum, Michael G B
2017-03-30
Genome-wide datasets produced for association studies have dramatically increased in size over the past few years, with modern datasets commonly including millions of variants measured in dozens of thousands of individuals. This increase in data size is a major challenge severely slowing down genomic analyses, leading to some software becoming obsolete and researchers having limited access to diverse analysis tools. Here we present two R packages, bigstatsr and bigsnpr, allowing for the analysis of large scale genomic data to be performed within R. To address large data size, the packages use memory-mapping for accessing data matrices stored on disk instead of in RAM. To perform data pre-processing and data analysis, the packages integrate most of the tools that are commonly used, either through transparent system calls to existing software, or through updated or improved implementation of existing methods. In particular, the packages implement fast and accurate computations of principal component analysis and association studies, functions to remove SNPs in linkage disequilibrium and algorithms to learn polygenic risk scores on millions of SNPs. We illustrate applications of the two R packages by analyzing a case-control genomic dataset for celiac disease, performing an association study and computing Polygenic Risk Scores. Finally, we demonstrate the scalability of the R packages by analyzing a simulated genome-wide dataset including 500,000 individuals and 1 million markers on a single desktop computer. https://privefl.github.io/bigstatsr/ & https://privefl.github.io/bigsnpr/. florian.prive@univ-grenoble-alpes.fr & michael.blum@univ-grenoble-alpes.fr. Supplementary materials are available at Bioinformatics online.
Barbero, Marina M. D.; Oliveira, Henrique N.; de Camargo, Gregório M. F.; Fernandes Júnior, Gerardo A.; Aspilcueta-Borquis, Rusbel R.; Souza, Fabio R. P.; Boligon, Arione A.; Melo, Thaise P.; Regatieri, Inaê C.; Feitosa, Fabieli L. B.; Fonseca, Larissa F. S.; Magalhães, Ana F. B.; Costa, Raphael B.; Albuquerque, Lucia G.
2018-01-01
Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs. PMID:29293544
Zavarella, S; Petrone, A; Zampetti, S; Gueorguiev, M; Spoletini, M; Mein, C A; Leto, G; Korbonits, M; Buzzetti, R
2008-04-01
Previous studies suggested that polymorphisms in the coding region of the preproghrelin were involved in the etiology of obesity and might modulate glucose-induced insulin secretion. We evaluated the association of a new variation, -604C>T, in the promoter region of the ghrelin gene, of Leu72Met (247C>A) and of Gln90Leu (265A>T), all haplotype-tagging single nucleotide polymorphisms (SNPs), with measures of insulin sensitivity in 1420 adult individuals. The three SNPs were genotyped using ABI PRISM 7900 HT Sequence Detection System. We used multiple linear regression analysis for quantitative traits and THESIAS software for haplotype analysis. We observed a protective effect exerted by Met72 variant of Leu72Met SNP on insulin resistance parameters; a significant decreasing trend from Leu/Leu to Leu/Met and to Met/Met homozygous subjects in triglycerides, fasting insulin levels and HOMA-IR index (P=0.02, 0.01 and 0.003, respectively), and, consistently, an increase in ghrelin levels (P=0.003) was found. A significant decrease from CC to TC and to TT genotypes in insulin levels and HOMA-IR index was also detected (P=0.00l for both), but only in subjects homozygous for Leu72, where the protective effect of Met72 was not present. The haplotype analysis results supported the data obtained by the evaluation of each single SNP, showing the highest value of insulin levels and HOMA-IR index in the -604(c)247(c) haplotype intermediate value in -604(T)247(C) and lowest value in -604(C)247(A). Our observations suggest a protective role of the Met72 variant and of -604 T allele in modulating insulin resistance. These SNPs or an unknown functional variant in linkage disequilibrium could increase ghrelin levels and probably insulin sensitivity.
Takada, Luciana; Barbero, Marina M D; Oliveira, Henrique N; de Camargo, Gregório M F; Fernandes Júnior, Gerardo A; Aspilcueta-Borquis, Rusbel R; Souza, Fabio R P; Boligon, Arione A; Melo, Thaise P; Regatieri, Inaê C; Feitosa, Fabieli L B; Fonseca, Larissa F S; Magalhães, Ana F B; Costa, Raphael B; Albuquerque, Lucia G
2018-01-01
Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNP) located in the region of the candidate genes at a distance of up to 5 kb from the boundaries of each gene, were selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. The fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on r2 statistics. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A gene (PAPP-A), Kell blood group complex subunit-related family (XKR4) and mannose-binding lectin genes (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs.
Zhu, Xiao; Kong, Qingming; Xie, Liwei; Chen, Zhihong; Li, Hongmei; Zhu, Zhu; Huang, Yongmei; Lan, Feifei; Luo, Haiqing; Zhan, Jingting; Ding, Hongrong; Lei, Jinli; Xiao, Qin; Fu, Weiming; Fan, Wenguo; Zhang, Jinfang; Luo, Hui
2018-01-01
Previous studies showed that the low expressions of chromodomain-helicase-DNA-binding protein 5 (CHD5) were intensively associated with deteriorative biologic and clinical characteristics as well as outcomes in many tumors. The aim of this study is to determine whether CHD5 single nucleotide polymorphisms (SNPs) contribute to the prognosis of hepatocellular carcima (HCC). The SNPs were selected according to their linkage disequilibrium (LD) in the targeted next-generation sequencing (NGS) and then genotyped with TaqMan probers. We revealed a rare haplotype AG in CHD5 (SNPs: rs12564469-rs9434711) was markedly associated with HCC prognosis. The univariate and multivariate regression analyses revealed the patients with worse overall survival time were those with tumor metastasis and haplotype AG, as well as cirrhosis, poor differentiation and IV-TNM stage. Based on the available public databases, we discovered the significant association between haplotype AG and CHD5 mRNA expressions only existed in Chinese. These data proposed that the potentially genetic haplotype might functionally contribute to HCC prognosis and CHD5 mRNA expressions. PMID:29568352
Influences of APOA5 Variants on Plasma Triglyceride Levels in Uyghur Population
Wang, Yi; Wu, Di; Jin, Li; Wang, Xiaofeng
2014-01-01
Objective Single nucleotide polymorphisms (SNPs) in apolipoprotein A5 (APOA5) gene are associated with triglyceride (TG) levels. However, the minor allele frequencies and linkage disequilibriums (LDs) of the SNPs in addition to their effects on TG levels vary greatly between Caucasians and East Asians. The distributions of the SNPs/haplotypes and their associations with TG levels in Uyghur population, an admixture population of Caucasians and East Asians, have not been reported to date. Here, we performed a cross-sectional study to address these. Methods Genotyping of four SNPs in APOA5 (rs662799, rs3135506, rs2075291, and rs2266788) was performed in 1174 unrelated Uyghur subjects. SNP/haplotype and TG association analyses were conducted. Results The frequencies of the SNPs in Uyghurs were in between those in Caucasians and East Asians. The LD between rs662799 and rs2266788 in Uyghurs was stronger than that in East Asians but weaker than that in Caucasians, and the four SNPs resulted in four haplotypes (TGGT, CGGC, TCGT, and CGTT arranged in the order of rs662799, rs3135506, rs2075291, and rs2266788) representing 99.2% of the population. All the four SNPs were significantly associated with TG levels. Compared with non-carriers, carriers of rs662799-C, rs3135506-C, rs2075291-T, and rs2266788-C alleles had 16.0%, 15.1%, 17.1%, and 12.4% higher TG levels, respectively. When haplotype TGGT was defined as the reference, the haplotypes CGGC, TCGT, and CGTT resulted in 16.1%, 19.0%, and 19.8% higher TG levels, respectively. The proportions of variance in TG explained by APOA5 locus were 2.5%, 0.3%, 0.4%, and 1.9% for single SNP rs662799, rs3135506, rs2075291, and rs2266788, respectively, and 3.0% for the haplotypes constructed by them. Conclusions The association profiles between the SNPs and haplotypes at APOA5 locus and TG levels in this admixture population differed from those in Caucasians and East Asians. The functions of these SNPs and haplotypes need to be elucidated comprehensively. PMID:25313938
A genome-wide association study of seed composition traits in wild soybean (Glycine soja).
Leamy, Larry J; Zhang, Hengyou; Li, Changbao; Chen, Charles Y; Song, Bao-Hua
2017-01-05
Cultivated soybean (Glycine max) is a major agricultural crop that provides a crucial source of edible protein and oil. Decreased amounts of saturated palmitic acid and increased amounts of unsaturated oleic acid in soybean oil are considered optimal for human cardiovascular health and therefore there has considerable interest by breeders in discovering genes affecting the relative concentrations of these fatty acids. Using a genome-wide association (GWA) approach with nearly 30,000 single nucleotide polymorphisms (SNPs), we investigated the genetic basis of protein, oil and all five fatty acid levels in seeds from a sample of 570 wild soybeans (Glycine soja), the progenitor of domesticated soybean, to identify quantitative trait loci (QTLs) affecting these seed composition traits. We discovered 29 SNPs located on ten different chromosomes that are significantly associated with the seven seed composition traits in our wild soybean sample. Eight SNPs co-localized with QTLs previously uncovered in linkage or association mapping studies conducted with cultivated soybean samples, while the remaining SNPs appeared to be in novel locations. Twenty-four of the SNPs significantly associated with fatty acid variation, with the majority located on chromosomes 14 (6 SNPs) and seven (8 SNPs). Two SNPs were common for two or more fatty acids, suggesting loci with pleiotropic effects. We also identified some candidate genes that are involved in fatty acid metabolism and regulation. For each of the seven traits, most of the SNPs produced differences between the average phenotypic values of the two homozygotes of about one-half standard deviation and contributed over 3% of their total variability. This is the first GWA study conducted on seed composition traits solely in wild soybean populations, and a number of QTLs were found that have not been previously discovered. Some of these may be useful to breeders who select for increased protein/oil content or altered fatty acid ratios in the seeds. The results also provide additional insight into the genetic architecture of these traits in a large sample of wild soybean, and suggest some new candidate genes whose molecular effects on these traits need to be further studied.
Miller, Mark W.; Wolf, Erika J.; Sadeh, Naomi; Logue, Mark; Spielberg, Jeffrey M.; Hayes, Jasmeet P.; Sperbeck, Emily; Schichman, Steven A.; Stone, Angie; Carter, Weleetka C.; Humphries, Donald E.; Milberg, William; McGlinchey, Regina
2015-01-01
Oxidative stress has been implicated in many common age-related diseases and is hypothesized to play a role in posttraumatic stress disorder (PTSD)-related neurodegeneration (Miller and Sadeh, 2014). This study examined the influence of the oxidative stress-related genes ALOX 12 and ALOX 15 on the association between PTSD and cortical thickness. Factor analyses were used to identify and compare alternative models of the structure of cortical thickness in a sample of 218 veterans. The best-fitting model was then used for a genetic association analysis in White non-Hispanic participants (n = 146) that examined relationships between 33 single nucleotide polymorphisms (SNPs) spanning the two genes, 8 cortical thickness factors, and each SNP × PTSD interaction. Results identified a novel ALOX12 locus (indicated by two SNPs in perfect linkage disequilibrium: rs1042357 and rs10852889) that moderated the association between PTSD and reduced thickness of the right prefrontal cortex. A whole-cortex vertex-wise analysis showed this effect to be localized to clusters spanning the rostral middle frontal gyrus, superior frontal gyrus, rostral anterior cingulated cortex, and medial orbitofrontal cortex. These findings illustrate a novel factor-analytic approach to neuroimaging-genetic analyses and provide new evidence for the possible involvement of oxidative stress in PTSD-related neurodegeneration. PMID:26372769
Miller, Mark W; Wolf, Erika J; Sadeh, Naomi; Logue, Mark; Spielberg, Jeffrey M; Hayes, Jasmeet P; Sperbeck, Emily; Schichman, Steven A; Stone, Angie; Carter, Weleetka C; Humphries, Donald E; Milberg, William; McGlinchey, Regina
2015-12-01
Oxidative stress has been implicated in many common age-related diseases and is hypothesized to play a role in posttraumatic stress disorder (PTSD)-related neurodegeneration (Miller and Sadeh, 2014). This study examined the influence of the oxidative stress-related genes ALOX 12 and ALOX 15 on the association between PTSD and cortical thickness. Factor analyses were used to identify and compare alternative models of the structure of cortical thickness in a sample of 218 veterans. The best-fitting model was then used for a genetic association analysis in White non-Hispanic participants (n=146) that examined relationships between 33 single nucleotide polymorphisms (SNPs) spanning the two genes, 8 cortical thickness factors, and each SNP×PTSD interaction. Results identified a novel ALOX12 locus (indicated by two SNPs in perfect linkage disequilibrium: rs1042357 and rs10852889) that moderated the association between PTSD and reduced thickness of the right prefrontal cortex. A whole-cortex vertex-wise analysis showed this effect to be localized to clusters spanning the rostral middle frontal gyrus, superior frontal gyrus, rostral anterior cingulate cortex, and medial orbitofrontal cortex. These findings illustrate a novel factor-analytic approach to neuroimaging-genetic analyses and provide new evidence for the possible involvement of oxidative stress in PTSD-related neurodegeneration. Published by Elsevier Ltd.
Genetic dissection of host immune response in pneumonia development and progression.
Smelaya, Tamara V; Belopolskaya, Olesya B; Smirnova, Svetlana V; Kuzovlev, Artem N; Moroz, Viktor V; Golubev, Arkadiy M; Pabalan, Noel A; Salnikova, Lyubov E
2016-10-11
The role of host genetic variation in pneumonia development and outcome is poorly understood. We studied common polymorphisms in the genes of proinflammatory cytokines (IL6 rs1800795, IL8 rs4073, IL1B rs16944), anti-inflammatory cytokines (IL10 rs1800896, IL4 rs2243250, IL13 rs20541) and toll-like receptors (TLR2 rs5743708 and rs4696480, TLR4 rs4986791, TLR9 rs352139, rs5743836 and rs187084) in patients with community-acquired pneumonia (CAP) (390 cases, 203 controls) and nosocomial pneumonia (355 cases, 216 controls). Experimental data were included in a series of 11 meta-analyses and eight subset analyses related to pneumonia susceptibility and outcome. TLR2 rs5743708 minor genotype appeared to be associated with CAP/Legionnaires' disease/pneumococcal disease. In CAP patients, the IL6 rs1800795-C allele was associated with severe sepsis/septic shock/severe systemic inflammatory response, while the IL10 rs1800896-A allele protected against the development of these critical conditions. To contribute to deciphering of the above results, we performed an in silico analysis and a qualitative synthesis of literature data addressing basal and stimulated genotype-specific expression level. This data together with database information on transcription factors' affinity changes caused by SNPs in putative promoter regions, the results of linkage disequilibrium analysis along with SNPs functional annotations supported assumptions about the complexity underlying the revealed associations.
A powerful score-based test statistic for detecting gene-gene co-association.
Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun
2016-01-29
The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.
Khatkar, Mehar S; Nicholas, Frank W; Collins, Andrew R; Zenger, Kyall R; Cavanagh, Julie A L; Barris, Wes; Schnabel, Robert D; Taylor, Jeremy F; Raadsma, Herman W
2008-04-24
The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study on the extent of LD in cattle by analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. We computed three LD estimates, D', Dvol and r2, for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r2 is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r2 than for D' and Dvol. The value of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r2) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. Minor allelic frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning < 1 kb to > 50 Mb, minimum sample sizes of 400 (for D') and 75 (for r2) are required. The bias due to small samples sizes increases with inter-marker interval. LD in cattle is much less extensive than in a mouse population created from crossing inbred lines, and more extensive than in humans. For association mapping in Holstein-Friesian cattle, for a given design, at least one SNP is required for each 40 kb, giving a total requirement of at least 75,000 SNPs for a low power whole-genome scan (median r2 > 0.19) and up to 300,000 markers at 10 kb intervals for a high power genome scan (median r2 > 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r2 a minimum sample of 75 is adequate.
Markunas, Christina A; Johnson, Eric O; Hancock, Dana B
2017-07-01
Genome-wide association study (GWAS)-identified variants are enriched for functional elements. However, we have limited knowledge of how functional enrichment may differ by disease/trait and tissue type. We tested a broad set of eight functional elements for enrichment among GWAS-identified SNPs (p < 5×10 -8 ) from the NHGRI-EBI Catalog across seven disease/trait categories: cancer, cardiovascular disease, diabetes, autoimmune disease, psychiatric disease, neurological disease, and anthropometric traits. SNPs were annotated using HaploReg for the eight functional elements across any tissue: DNase sites, expression quantitative trait loci (eQTL), sequence conservation, enhancers, promoters, missense variants, sequence motifs, and protein binding sites. In addition, tissue-specific annotations were considered for brain vs. blood. Disease/trait SNPs were compared to a control set of 4809 SNPs matched to the GWAS SNPs (N = 1639) on allele frequency, gene density, distance to nearest gene, and linkage disequilibrium at ~3:1 ratio. Enrichment analyses were conducted using logistic regression, with Bonferroni correction. Overall, a significant enrichment was observed for all functional elements, except sequence motifs. Missense SNPs showed the strongest magnitude of enrichment. eQTLs were the only functional element significantly enriched across all diseases/traits. Magnitudes of enrichment were generally similar across diseases/traits, where enrichment was statistically significant. Blood vs. brain tissue effects on enrichment were dependent on disease/trait and functional element (e.g., cardiovascular disease: eQTLs P TissueDifference = 1.28 × 10 -6 vs. enhancers P TissueDifference = 0.94). Identifying disease/trait-relevant functional elements and tissue types could provide new insight into the underlying biology, by guiding a priori GWAS analyses (e.g., brain enhancer elements for psychiatric disease) or facilitating post hoc interpretation.
Behar, Doron M.; Rosset, Saharon; Tzur, Shay; Selig, Sara; Yudkovsky, Guennady; Bercovici, Sivan; Kopp, Jeffrey B.; Winkler, Cheryl A.; Nelson, George W.; Wasser, Walter G.; Skorecki, Karl
2010-01-01
Recent studies identified MYH9 as a major susceptibility gene for common forms of non-diabetic end-stage kidney disease (ESKD). A set of African ancestry DNA sequence variants comprising the E-1 haplotype, was significantly associated with ESKD. In order to determine whether African ancestry variants are also associated with disease susceptibility in admixed populations with differing genomic backgrounds, we genotyped a total of 1425 African and Hispanic American subjects comprising dialysis patients with diabetic and non-diabetic ESKD and controls, using 42 single nucleotide polymorphisms (SNPs) within the MYH9 gene and 40 genome-wide and 38 chromosome 22 ancestry informative markers. Following ancestry correction, logistic regression demonstrated that three of the E-1 SNPs are also associated with non-diabetic ESKD in the new sample sets of both African and Hispanic Americans, with a stronger association in Hispanic Americans. We also identified MYH9 SNPs that are even more powerfully associated with the disease phenotype than the E-1 SNPs. These newly associated SNPs, could be divided into those comprising a haplotype termed S-1 whose association was significant under a recessive or additive inheritance mode (rs5750248, OR 4.21, P < 0.01, Hispanic Americans, recessive), and those comprising a haplotype termed F-1 whose association was significant under a dominant or additive inheritance mode (rs11912763, OR 4.59, P < 0.01, Hispanic Americans, dominant). These findings strengthen the contention that a sequence variant of MYH9, common in populations with varying degrees of African ancestry admixture, and in strong linkage disequilibrium with the associated SNPs and haplotypes reported herein, strongly predisposes to non-diabetic ESKD. PMID:20144966
Behar, Doron M; Rosset, Saharon; Tzur, Shay; Selig, Sara; Yudkovsky, Guennady; Bercovici, Sivan; Kopp, Jeffrey B; Winkler, Cheryl A; Nelson, George W; Wasser, Walter G; Skorecki, Karl
2010-05-01
Recent studies identified MYH9 as a major susceptibility gene for common forms of non-diabetic end-stage kidney disease (ESKD). A set of African ancestry DNA sequence variants comprising the E-1 haplotype, was significantly associated with ESKD. In order to determine whether African ancestry variants are also associated with disease susceptibility in admixed populations with differing genomic backgrounds, we genotyped a total of 1425 African and Hispanic American subjects comprising dialysis patients with diabetic and non-diabetic ESKD and controls, using 42 single nucleotide polymorphisms (SNPs) within the MYH9 gene and 40 genome-wide and 38 chromosome 22 ancestry informative markers. Following ancestry correction, logistic regression demonstrated that three of the E-1 SNPs are also associated with non-diabetic ESKD in the new sample sets of both African and Hispanic Americans, with a stronger association in Hispanic Americans. We also identified MYH9 SNPs that are even more powerfully associated with the disease phenotype than the E-1 SNPs. These newly associated SNPs, could be divided into those comprising a haplotype termed S-1 whose association was significant under a recessive or additive inheritance mode (rs5750248, OR 4.21, P < 0.01, Hispanic Americans, recessive), and those comprising a haplotype termed F-1 whose association was significant under a dominant or additive inheritance mode (rs11912763, OR 4.59, P < 0.01, Hispanic Americans, dominant). These findings strengthen the contention that a sequence variant of MYH9, common in populations with varying degrees of African ancestry admixture, and in strong linkage disequilibrium with the associated SNPs and haplotypes reported herein, strongly predisposes to non-diabetic ESKD.
HapMap tagSNP transferability in multiple populations: general guidelines
Xing, Jinchuan; Witherspoon, David J.; Watkins, W. Scott; Zhang, Yuhua; Tolpinrud, Whitney; Jorde, Lynn B.
2008-01-01
This PDF receipt will only be used as the basis for generating PubMed Central (PMC) documents. PMC documents will be made available for review after conversion (approx. 2–3 weeks time). Any corrections that need to be made will be done at that time. No materials will be released to PMC without the approval of an author. Only the PMC documents will appear on PubMed Central -- this PDF Receipt will not appear on PubMed Central. Linkage disequilibrium (LD) has received much recent attention because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r2 >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations. PMID:18482828
Valdisser, Paula A M R; Pereira, Wendell J; Almeida Filho, Jâneo E; Müller, Bárbara S F; Coelho, Gesimária R C; de Menezes, Ivandilson P P; Vianna, João P G; Zucchi, Maria I; Lanna, Anna C; Coelho, Alexandre S G; de Oliveira, Jaison P; Moraes, Alessandra da Cunha; Brondani, Claudio; Vianello, Rosana P
2017-05-30
Common bean is a legume of social and nutritional importance as a food crop, cultivated worldwide especially in developing countries, accounting for an important source of income for small farmers. The availability of the complete sequences of the two common bean genomes has dramatically accelerated and has enabled new experimental strategies to be applied for genetic research. DArTseq has been widely used as a method of SNP genotyping allowing comprehensive genome coverage with genetic applications in common bean breeding programs. Using this technology, 6286 SNPs (1 SNP/86.5 Kbp) were genotyped in genic (43.3%) and non-genic regions (56.7%). Genetic subdivision associated to the common bean gene pools (K = 2) and related to grain types (K = 3 and K = 5) were reported. A total of 83% and 91% of all SNPs were polymorphic within the Andean and Mesoamerican gene pools, respectively, and 26% were able to differentiate the gene pools. Genetic diversity analysis revealed an average H E of 0.442 for the whole collection, 0.102 for Andean and 0.168 for Mesoamerican gene pools (F ST = 0.747 between gene pools), 0.440 for the group of cultivars and lines, and 0.448 for the group of landrace accessions (F ST = 0.002 between cultivar/line and landrace groups). The SNP effects were predicted with predominance of impact on non-coding regions (77.8%). SNPs under selection were identified within gene pools comparing landrace and cultivar/line germplasm groups (Andean: 18; Mesoamerican: 69) and between the gene pools (59 SNPs), predominantly on chromosomes 1 and 9. The LD extension estimate corrected for population structure and relatedness (r 2 SV ) was ~ 88 kbp, while for the Andean gene pool was ~ 395 kbp, and for the Mesoamerican was ~ 130 kbp. For common bean, DArTseq provides an efficient and cost-effective strategy of generating SNPs for large-scale genome-wide studies. The DArTseq resulted in an operational panel of 560 polymorphic SNPs in linkage equilibrium, providing high genome coverage. This SNP set could be used in genotyping platforms with many applications, such as population genetics, phylogeny relation between common bean varieties and support to molecular breeding approaches.
Brant, Steven R; Okou, David T; Simpson, Claire L; Cutler, David J; Haritunians, Talin; Bradfield, Jonathan P; Chopra, Pankaj; Prince, Jarod; Begum, Ferdouse; Kumar, Archana; Huang, Chengrui; Venkateswaran, Suresh; Datta, Lisa W; Wei, Zhi; Thomas, Kelly; Herrinton, Lisa J; Klapproth, Jan-Micheal A; Quiros, Antonio J; Seminerio, Jenifer; Liu, Zhenqiu; Alexander, Jonathan S; Baldassano, Robert N; Dudley-Brown, Sharon; Cross, Raymond K; Dassopoulos, Themistocles; Denson, Lee A; Dhere, Tanvi A; Dryden, Gerald W; Hanson, John S; Hou, Jason K; Hussain, Sunny Z; Hyams, Jeffrey S; Isaacs, Kim L; Kader, Howard; Kappelman, Michael D; Katz, Jeffry; Kellermayer, Richard; Kirschner, Barbara S; Kuemmerle, John F; Kwon, John H; Lazarev, Mark; Li, Ellen; Mack, David; Mannon, Peter; Moulton, Dedrick E; Newberry, Rodney D; Osuntokun, Bankole O; Patel, Ashish S; Saeed, Shehzad A; Targan, Stephan R; Valentine, John F; Wang, Ming-Hsi; Zonca, Martin; Rioux, John D; Duerr, Richard H; Silverberg, Mark S; Cho, Judy H; Hakonarson, Hakon; Zwick, Michael E; McGovern, Dermot P B; Kugathasan, Subra
2017-01-01
The inflammatory bowel diseases (IBD) ulcerative colitis (UC) and Crohn's disease (CD) cause significant morbidity and are increasing in prevalence among all populations, including African Americans. More than 200 susceptibility loci have been identified in populations of predominantly European ancestry, but few loci have been associated with IBD in other ethnicities. We performed 2 high-density, genome-wide scans comprising 2345 cases of African Americans with IBD (1646 with CD, 583 with UC, and 116 inflammatory bowel disease unclassified) and 5002 individuals without IBD (controls, identified from the Health Retirement Study and Kaiser Permanente database). Single-nucleotide polymorphisms (SNPs) associated at P < 5.0 × 10 -8 in meta-analysis with a nominal evidence (P < .05) in each scan were considered to have genome-wide significance. We detected SNPs at HLA-DRB1, and African-specific SNPs at ZNF649 and LSAMP, with associations of genome-wide significance for UC. We detected SNPs at USP25 with associations of genome-wide significance for IBD. No associations of genome-wide significance were detected for CD. In addition, 9 genes previously associated with IBD contained SNPs with significant evidence for replication (P < 1.6 × 10 -6 ): ADCY3, CXCR6, HLA-DRB1 to HLA-DQA1 (genome-wide significance on conditioning), IL12B,PTGER4, and TNC for IBD; IL23R, PTGER4, and SNX20 (in strong linkage disequilibrium with NOD2) for CD; and KCNQ2 (near TNFRSF6B) for UC. Several of these genes, such as TNC (near TNFSF15), CXCR6, and genes associated with IBD at the HLA locus, contained SNPs with unique association patterns with African-specific alleles. We performed a genome-wide association study of African Americans with IBD and identified loci associated with UC in only this population; we also replicated IBD, CD, and UC loci identified in European populations. The detection of variants associated with IBD risk in only people of African descent demonstrates the importance of studying the genetics of IBD and other complex diseases in populations beyond those of European ancestry. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.
Evaluation of TRAF6 in a large multiancestral lupus cohort.
Namjou, Bahram; Choi, Chan-Bum; Harley, Isaac T W; Alarcón-Riquelme, Marta E; Kelly, Jennifer A; Glenn, Stuart B; Ojwang, Joshua O; Adler, Adam; Kim, Kwangwoo; Gallant, Caroline J; Boackle, Susan A; Criswell, Lindsey A; Kimberly, Robert P; Brown, Elizabeth E; Edberg, Jeffrey; Alarcón, Graciela S; Stevens, Anne M; Jacob, Chaim O; Gilkeson, Gary S; Kamen, Diane L; Tsao, Betty P; Anaya, Juan-Manuel; Kim, Eun-Mi; Park, So-Yeon; Sung, Yoon-Kyoung; Guthridge, Joel M; Merrill, Joan T; Petri, Michelle; Ramsey-Goldman, Rosalind; Vilá, Luis M; Niewold, Timothy B; Martin, Javier; Pons-Estel, Bernardo A; Vyse, Timothy J; Freedman, Barry I; Moser, Kathy L; Gaffney, Patrick M; Williams, Adrienne H; Comeau, Mary E; Reveille, John D; Kang, Changwon; James, Judith A; Scofield, R Hal; Langefeld, Carl D; Kaufman, Kenneth M; Harley, John B; Bae, Sang-Cheol
2012-06-01
Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease with significant immune system aberrations resulting from complex heritable genetics as well as environmental factors. We undertook to study the role of TRAF6 as a candidate gene for SLE, since it plays a major role in several signaling pathways that are important for immunity and organ development. Fifteen single-nucleotide polymorphisms (SNPs) across TRAF6 were evaluated in 7,490 SLE patients and 6,780 control subjects from different ancestries. Population-based case-control association analyses and meta-analyses were performed. P values, false discovery rate q values, and odds ratios (ORs) with 95% confidence intervals (95% CIs) were calculated. Evidence of associations was detected in multiple SNPs. The best overall P values were obtained for SNPs rs5030437 and rs4755453 (P = 7.85 × 10(-5) and P = 4.73 × 10(-5) , respectively) without significant heterogeneity among populations (P = 0.67 and P = 0.50, respectively, in Q statistic). In addition, SNP rs540386, which was previously reported to be associated with rheumatoid arthritis (RA), was found to be in linkage disequilibrium with these 2 SNPs (r(2) = 0.95) and demonstrated evidence of association with SLE in the same direction (meta-analysis P = 9.15 × 10(-4) , OR 0.89 [95% CI 0.83-0.95]). The presence of thrombocytopenia improved the overall results in different populations (meta-analysis P = 1.99 × 10(-6) , OR 0.57 [95% CI 0.45-0.72], for rs5030470). Finally, evidence of family-based association in 34 African American pedigrees with the presence of thrombocytopenia was detected in 1 available SNP (rs5030437) with a Z score magnitude of 2.28 (P = 0.02) under a dominant model. Our data indicate the presence of association of TRAF6 with SLE, consistent with the previous report of association with RA. These data provide further support for the involvement of TRAF6 in the pathogenesis of autoimmunity. Copyright © 2012 by the American College of Rheumatology.
Zhao, Suli; Fang, Fang; Tang, Xianfa; Dou, Jinfa; Wang, Wenjun; Zheng, Xiaodong; Sun, Liangdan; Zhang, Anping
2017-10-01
Vitiligo is an autoimmune disease, characterized by progressive loss of skin pigmentation, which is caused by the interactions of multiple factors, such as heredity, immunity and environment. Recently, a single nucleotide polymorphism (SNP) rs638893 at 11q23.3 region was identified as a risk factor for vitiligo in genome-wide association studies and multiple SNPs in this region have been associated with other autoimmune diseases. This study aims to identify additional susceptibility variants associated with vitiligo at 11q23.3 in the Chinese Han population. We selected and genotyped 26 SNPs at 11q23.3 in an independent cohort including 2924 cases and 4048 controls using the Sequenom MassArray iPLEX ® system. Bonferroni adjustment was used for multiple comparisons and P value <1.92×10 -3 (0.05/26) was considered statistically significant. The A allele of rs613791 and G allele of rs523604 located in CXCR5 were observed to be significantly associated with vitiligo (OR=1.21, 95% CI: 1.11-1.31, P=1.20×10 -5 ; OR=1.14, 95% CI: 1.07-1.23, P=1.90×10 -4 , respectively). The C allele of rs638893 (a previously reported one) located upstream of DDX6 was also significantly associated with vitiligo (OR=1.25, 95% CI: 1.12-1.38, P=3.04×10 -5 ). The genotypes distribution of 3 SNPs also showed significant differences between case and control (rs613791: P=7.00×10 -6 , rs523604: P=4.00×10 -3 , rs638893: P=1.20×10 -5 , respectively). The two newly identified SNPs (rs613791 and rs523604) showed independent associations with vitiligo by linkage disequilibrium analysis and conditional logistic regression. The study identified two new independent signals in the associated locus 11q23.3 for vitiligo. The presence of multiple independent variants emphasizes an important role of this region in disease susceptibility. Copyright © 2017 Japanese Society for Investigative Dermatology. Published by Elsevier B.V. All rights reserved.
Dahmen, Norbert; Beckmann, Lars; Lindström, Sara; Schoof, Nils; Czene, Kamila; Mittelstraß, Kirstin; Illig, Thomas; Seibold, Petra; Behrens, Sabine; Humphreys, Keith; Li, Jingmei; Liu, Jianjun; Olson, Janet E.; Wang, Xianshu; Hankinson, Susan E.; Truong, Thérèse; Menegaux, Florence; dos Santos Silva, Isabel; Johnson, Nichola; Chen, Shou-Tung; Yu, Jyh-Cherng; Ziogas, Argyrios; Kataja, Vesa; Kosma, Veli-Matti; Mannermaa, Arto; Anton-Culver, Hoda; Shen, Chen-Yang; Brauch, Hiltrud; Peto, Julian; Guénel, Pascal; Kraft, Peter; Couch, Fergus J.; Easton, Douglas F.; Hall, Per; Chang-Claude, Jenny
2013-01-01
Menopausal hormone therapy (MHT) is associated with an elevated risk of breast cancer in postmenopausal women. To identify genetic loci that modify breast cancer risk related to MHT use in postmenopausal women, we conducted a two-stage genome-wide association study (GWAS) with replication. In stage I, we performed a case-only GWAS in 731 invasive breast cancer cases from the German case-control study Mammary Carcinoma Risk Factor Investigation (MARIE). The 1,200 single nucleotide polymorphisms (SNPs) showing the lowest P values for interaction with current MHT use (within 6 months prior to breast cancer diagnosis), were carried forward to stage II, involving pooled case-control analyses including additional MARIE subjects (1,375 cases, 1,974 controls) as well as 795 cases and 764 controls of a Swedish case-control study. A joint P value was calculated for a combined analysis of stages I and II. Replication of the most significant interaction of the combined stage I and II was performed using 5,795 cases and 5,390 controls from nine studies of the Breast Cancer Association Consortium (BCAC). The combined stage I and II yielded five SNPs on chromosomes 2, 7, and 18 with joint P values <6 × 10−6 for effect modification of current MHT use. The most significant interaction was observed for rs6707272 (P = 3 × 10−7) on chromosome 2 but was not replicated in the BCAC studies (P = 0.21). The potentially modifying SNPs are in strong linkage disequilibrium with SNPs in TRIP12 and DNER on chromosome 2 and SETBP1 on chromosome 18, previously linked to carcinogenesis. However, none of the interaction effects reached genome-wide significance. The inability to replicate the top SNP × MHT interaction may be due to limited power of the replication phase. Our study, however, suggests that there are unlikely to be SNPs that interact strongly enough with MHT use to be clinically significant in European women. PMID:23423446
Grams, Vanessa; Wellmann, Robin; Preuß, Siegfried; Grashorn, Michael A; Kjaer, Jörgen B; Bessei, Werner; Bennewitz, Jörn
2015-09-30
Feather pecking (FP) in laying hens is a well-known and multi-factorial behaviour with a genetic background. In a selection experiment, two lines were developed for 11 generations for high (HFP) and low (LFP) feather pecking, respectively. Starting with the second generation of selection, there was a constant difference in mean number of FP bouts between both lines. We used the data from this experiment to perform a quantitative genetic analysis and to map selection signatures. Pedigree and phenotypic data were available for the last six generations of both lines. Univariate quantitative genetic analyses were conducted using mixed linear and generalized mixed linear models assuming a Poisson distribution. Selection signatures were mapped using 33,228 single nucleotide polymorphisms (SNPs) genotyped on 41 HFP and 34 LFP individuals of generation 11. For each SNP, we estimated Wright's fixation index (FST). We tested the null hypothesis that FST is driven purely by genetic drift against the alternative hypothesis that it is driven by genetic drift and selection. The mixed linear model failed to analyze the LFP data because of the large number of 0s in the observation vector. The Poisson model fitted the data well and revealed a small but continuous genetic trend in both lines. Most of the 17 genome-wide significant SNPs were located on chromosomes 3 and 4. Thirteen clusters with at least two significant SNPs within an interval of 3 Mb maximum were identified. Two clusters were mapped on chromosomes 3, 4, 8 and 19. Of the 17 genome-wide significant SNPs, 12 were located within the identified clusters. This indicates a non-random distribution of significant SNPs and points to the presence of selection sweeps. Data on FP should be analysed using generalised linear mixed models assuming a Poisson distribution, especially if the number of FP bouts is small and the distribution is heavily peaked at 0. The FST-based approach was suitable to map selection signatures that need to be confirmed by linkage or association mapping.
Han, Bo; Liang, Weijun; Liu, Lin; Li, Yanhua; Sun, Dongxiao
2017-05-19
Our previous RNA-sequencing study revealed that the ATF3 and CDKN1A genes were remarkably differentially expressed between the mammary glands of lactating Holstein cows with extremely high and low milk protein and fat percentage so that both of them were considered as candidates for milk composition. Herein, we further verified whether these genes have genetic effects on milk production traits in a Chinese Holstein cow population. By re-sequencing the entire coding and regulatory regions, we identified four SNPs in 5'promoter region, two in exons, seven in 3' un-translated region (UTR), and six in 3'flanking region of ATF3 gene, and one SNP in exon 5, two in 3'UTR, and two in 3'flanking region of CDKN1A gene. Of these, only the SNP, c.271C > T (rs442346530), in exon 5 of CDKN1A gene was predicted to result in an amino acid replacement from arginine to tryptophan. Subsequent genotype-phenotype association analysis revealed that 19 SNPs in ATF3 and 5 SNPs in CDKN1A were evidently associated with 305-days milk yield, fat yield, protein yield, or protein percentage (P = < 0.0001 ~ 0.0494). Whilst, no significant SNPs in ATF3 gene were associated with fat percentage in both first and second lactations (P > 0.05), and only two SNPs of CDKN1A gene, c.271C > T (P = 0.0377) and c.*654C > T (P = 0.0144), were markedly associated with fat percentage in the first lactation. Further, linkage disequilibrium (LD) analyses were conducted among the identified SNPs in ATF3 and/or CDKN1A genes to further confirm the association results. We also observed that the four SNPs, g.72834301C > A, g.72834229C > A, g.72833969A > G, and g.72833562G > T altered the specific transcription factor (TF) binding sites in ATF3 promoter, and one SNP, c.271C > T, changed the CDKN1A protein secondary structure, suggesting they might be the promising potential functional mutations. Our findings first profiled the genetic effects of ATF3 and CDKN1A genes for milk production traits in dairy cattle and will be available for marker-assisted breeding in dairy cattle.
Sequences associated with human iris pigmentation.
Frudakis, Tony; Thomas, Matthew; Gaskin, Zach; Venkateswarlu, K; Chandra, K Suresh; Ginjupalli, Siva; Gunturi, Sitaram; Natrajan, Sivamani; Ponnuswamy, Viswanathan K; Ponnuswamy, K N
2003-01-01
To determine whether and how common polymorphisms are associated with natural distributions of iris colors, we surveyed 851 individuals of mainly European descent at 335 SNP loci in 13 pigmentation genes and 419 other SNPs distributed throughout the genome and known or thought to be informative for certain elements of population structure. We identified numerous SNPs, haplotypes, and diplotypes (diploid pairs of haplotypes) within the OCA2, MYO5A, TYRP1, AIM, DCT, and TYR genes and the CYP1A2-15q22-ter, CYP1B1-2p21, CYP2C8-10q23, CYP2C9-10q24, and MAOA-Xp11.4 regions as significantly associated with iris colors. Half of the associated SNPs were located on chromosome 15, which corresponds with results that others have previously obtained from linkage analysis. We identified 5 additional genes (ASIP, MC1R, POMC, and SILV) and one additional region (GSTT2-22q11.23) with haplotype and/or diplotypes, but not individual SNP alleles associated with iris colors. For most of the genes, multilocus gene-wise genotype sequences were more strongly associated with iris colors than were haplotypes or SNP alleles. Diplotypes for these genes explain 15% of iris color variation. Apart from representing the first comprehensive candidate gene study for variable iris pigmentation and constituting a first step toward developing a classification model for the inference of iris color from DNA, our results suggest that cryptic population structure might serve as a leverage tool for complex trait gene mapping if genomes are screened with the appropriate ancestry informative markers. PMID:14704187
Comprehensive genotyping of the USA national maize inbred seed bank
2013-01-01
Background Genotyping by sequencing, a new low-cost, high-throughput sequencing technology was used to genotype 2,815 maize inbred accessions, preserved mostly at the National Plant Germplasm System in the USA. The collection includes inbred lines from breeding programs all over the world. Results The method produced 681,257 single-nucleotide polymorphism (SNP) markers distributed across the entire genome, with the ability to detect rare alleles at high confidence levels. More than half of the SNPs in the collection are rare. Although most rare alleles have been incorporated into public temperate breeding programs, only a modest amount of the available diversity is present in the commercial germplasm. Analysis of genetic distances shows population stratification, including a small number of large clusters centered on key lines. Nevertheless, an average fixation index of 0.06 indicates moderate differentiation between the three major maize subpopulations. Linkage disequilibrium (LD) decays very rapidly, but the extent of LD is highly dependent on the particular group of germplasm and region of the genome. The utility of these data for performing genome-wide association studies was tested with two simply inherited traits and one complex trait. We identified trait associations at SNPs very close to known candidate genes for kernel color, sweet corn, and flowering time; however, results suggest that more SNPs are needed to better explore the genetic architecture of complex traits. Conclusions The genotypic information described here allows this publicly available panel to be exploited by researchers facing the challenges of sustainable agriculture through better knowledge of the nature of genetic diversity. PMID:23759205
Nagaie, Satoshi; Ogishima, Soichi; Nakaya, Jun; Tanaka, Hiroshi
2015-01-01
Genome-wide association studies (GWAS) and linkage analysis has identified many single nucleotide polymorphisms (SNPs) related to disease. There are many unknown SNPs whose minor allele frequencies (MAFs) as low as 0.005 having intermediate effects with odds ratio between 1.5~3.0. Low frequency variants having intermediate effects on disease pathogenesis are believed to have complex interactions with environmental factors called gene-environment interactions (GxE). Hence, we describe a model using 3D Manhattan plot called GxE landscape plot to visualize the association of p-values for gene-environment interactions (GxE). We used the Gene-Environment iNteraction Simulator 2 (GENS2) program to simulate interactions between two genetic loci and one environmental factor in this exercise. The dataset used for training contains disease status, gender, 20 environmental exposures and 100 genotypes for 170 subjects, and p-values were calculated by Cochran-Mantel-Haenszel chi-squared test on known data. Subsequently, we created a 3D GxE landscape plot of negative logarithm of the association of p-values for all the possible combinations of genetic and environmental factors with their hierarchical clustering. Thus, the GxE landscape plot is a valuable model to predict association of p-values for GxE and similarity among genotypes and environments in the context of disease pathogenesis. GxE - Gene-environment interactions, GWAS - Genome-wide association study, MAFs - Minor allele frequencies, SNPs - Single nucleotide polymorphisms, EWAS - Environment-wide association study, FDR - False discovery rate, JPT+CHB - HapMap population of Japanese in Tokyo, Japan - Han Chinese in Beijing.
Dubey, Amit; Biswas, Sanjay Kumar; Sinha, Ekata; Chakma, Joy Kumar; Kamal, Raj; Arora, Mamta; Sagar, Harish; Natarajan, Mohan; Bhagyawant, Sameer S; Mohanty, Keshar Kunja
2017-07-01
The pathogen Mycobacterium leprae causes leprosy that affects mainly skin and nerves. Polymorphisms of certain genes are substantiated to be associated with the susceptibility/resistance to leprosy. The present investigation addressed the association of Nitric Oxide Synthase2 gene polymorphisms and leprosy in a population from northern part of India. A total of 323 leprosy cases and 288 healthy controls were genotyped for four NOS2 promoter variants (rs1800482, rs2779249, rs8078340 and rs2301369) using FRET technology in Real Time PCR. None of these SNPs in promoter sites was associated with susceptibility/resistance to leprosy. NOS2 rs1800482 was found to be monomorphic with GG genotype. However, NOS2-1026T allele was observed to be in higher frequency with leprosy cases (BL and LL) who were not suffering from any reactional episodes compared to cases with ENL reaction {OR=0.30, 95% CI (0.10-0.86), p=0.024}. NOS2-1026GT genotype was more prevalent in cases without reaction (BT, BB and BL) compared to RR reactional patients {OR=0.38, 95% CI (0.17-0.86), p=0.02}. Although haplotype analysis revealed that no haplotype was associated with leprosy susceptibility/resistance with statistical significance, GTG haplotype was noted to be more frequent in healthy controls. These SNPs are observed to be in linkage disequilibrium. Although, these SNPs are not likely to influence leprosy vulnerability, -1026G>T SNP was indicated to have noteworthy role in leprosy reactions. Copyright © 2017 Elsevier B.V. All rights reserved.
Adaptations to Climate-Mediated Selective Pressures in Sheep
Lv, Feng-Hua; Agha, Saif; Kantanen, Juha; Colli, Licia; Stucki, Sylvie; Kijas, James W.; Joost, Stéphane; Li, Meng-Hua; Ajmone Marsan, Paolo
2014-01-01
Following domestication, sheep (Ovis aries) have become essential farmed animals across the world through adaptation to a diverse range of environments and varied production systems. Climate-mediated selective pressure has shaped phenotypic variation and has left genetic “footprints” in the genome of breeds raised in different agroecological zones. Unlike numerous studies that have searched for evidence of selection using only population genetics data, here, we conducted an integrated coanalysis of environmental data with single nucleotide polymorphism (SNP) variation. By examining 49,034 SNPs from 32 old, autochthonous sheep breeds that are adapted to a spectrum of different regional climates, we identified 230 SNPs with evidence for selection that is likely due to climate-mediated pressure. Among them, 189 (82%) showed significant correlation (P ≤ 0.05) between allele frequency and climatic variables in a larger set of native populations from a worldwide range of geographic areas and climates. Gene ontology analysis of genes colocated with significant SNPs identified 17 candidates related to GTPase regulator and peptide receptor activities in the biological processes of energy metabolism and endocrine and autoimmune regulation. We also observed high linkage disequilibrium and significant extended haplotype homozygosity for the core haplotype TBC1D12-CH1 of TBC1D12. The global frequency distribution of the core haplotype and allele OAR22_18929579-A showed an apparent geographic pattern and significant (P ≤ 0.05) correlations with climatic variation. Our results imply that adaptations to local climates have shaped the spatial distribution of some variants that are candidates to underpin adaptive variation in sheep. PMID:25249477
CYP3A4 and CYP3A5 genotyping by Pyrosequencing
Garsa, Adam A; McLeod, Howard L; Marsh, Sharon
2005-01-01
Background Human cytochrome P450 3A enzymes, particularly CYP3A4 and CYP3A5, play an important role in drug metabolism. CYP3A expression exhibits substantial interindividual variation, much of which may result from genetic variation. This study describes Pyrosequencing assays for key SNPs in CYP3A4 (CYP3A4*1B, CYP3A4*2, and CYP3A4*3) and CYP3A5 (CYP3A5*3C and CYP3A5*6). Methods Genotyping of 95 healthy European and 95 healthy African volunteers was performed using Pyrosequencing. Linkage disequilibrium, haplotype inference, Hardy-Weinberg equilibrium, and tag SNPs were also determined for these samples. Results CYP3A4*1B allele frequencies were 4% in Europeans and 82% in Africans. The CYP3A4*2 allele was found in neither population sample. CYP3A4*3 had an allele frequency of 2% in Europeans and 0% in Africans. The frequency of CYP3A5*3C was 94% in Europeans and 12% in Africans. No CYP3A5*6 variants were found in the European samples, but this allele had a frequency of 16% in the African samples. Allele frequencies and haplotypes show interethnic variation, highlighting the need to analyze clinically relevant SNPs and haplotypes in a variety of ethnic groups. Conclusion Pyrosequencing is a versatile technique that could improve the efficiency of SNP analysis for pharmacogenomic research with the ultimate goal of pre-screening patients for individual therapy selection. PMID:15882469
Comprehensive genotyping of the USA national maize inbred seed bank.
Romay, Maria C; Millard, Mark J; Glaubitz, Jeffrey C; Peiffer, Jason A; Swarts, Kelly L; Casstevens, Terry M; Elshire, Robert J; Acharya, Charlotte B; Mitchell, Sharon E; Flint-Garcia, Sherry A; McMullen, Michael D; Holland, James B; Buckler, Edward S; Gardner, Candice A
2013-06-11
Genotyping by sequencing, a new low-cost, high-throughput sequencing technology was used to genotype 2,815 maize inbred accessions, preserved mostly at the National Plant Germplasm System in the USA. The collection includes inbred lines from breeding programs all over the world. The method produced 681,257 single-nucleotide polymorphism (SNP) markers distributed across the entire genome, with the ability to detect rare alleles at high confidence levels. More than half of the SNPs in the collection are rare. Although most rare alleles have been incorporated into public temperate breeding programs, only a modest amount of the available diversity is present in the commercial germplasm. Analysis of genetic distances shows population stratification, including a small number of large clusters centered on key lines. Nevertheless, an average fixation index of 0.06 indicates moderate differentiation between the three major maize subpopulations. Linkage disequilibrium (LD) decays very rapidly, but the extent of LD is highly dependent on the particular group of germplasm and region of the genome. The utility of these data for performing genome-wide association studies was tested with two simply inherited traits and one complex trait. We identified trait associations at SNPs very close to known candidate genes for kernel color, sweet corn, and flowering time; however, results suggest that more SNPs are needed to better explore the genetic architecture of complex traits. The genotypic information described here allows this publicly available panel to be exploited by researchers facing the challenges of sustainable agriculture through better knowledge of the nature of genetic diversity.
Below, Jennifer E.; Parra, Esteban J.; Gamazon, Eric R.; Torres, Jason; Krithika, S.; Candille, Sophie; Lu, Yingchang; Manichakul, Ani; Peralta-Romero, Jesus; Duan, Qing; Li, Yun; Morris, Andrew P.; Gottesman, Omri; Bottinger, Erwin; Wang, Xin-Qun; Taylor, Kent D.; Ida Chen, Y.-D.; Rotter, Jerome I.; Rich, Stephen S.; Loos, Ruth J. F.; Tang, Hua; Cox, Nancy J.; Cruz, Miguel; Hanis, Craig L.; Valladares-Salgado, Adan
2016-01-01
We performed genome-wide meta-analysis of lipid traits on three samples of Mexican and Mexican American ancestry comprising 4,383 individuals, and followed up significant and highly suggestive associations in three additional Hispanic samples comprising 7,876 individuals. Genome-wide significant signals were observed in or near CELSR2, ZNF259/APOA5, KANK2/DOCK6 and NCAN/MAU2 for total cholesterol, LPL, ABCA1, ZNF259/APOA5, LIPC and CETP for HDL cholesterol, CELSR2, APOB and NCAN/MAU2 for LDL cholesterol, and GCKR, TRIB1, ZNF259/APOA5 and NCAN/MAU2 for triglycerides. Linkage disequilibrium and conditional analyses indicate that signals observed at ABCA1 and LIPC for HDL cholesterol and NCAN/MAU2 for triglycerides are independent of previously reported lead SNP associations. Analyses of lead SNPs from the European Global Lipids Genetics Consortium (GLGC) dataset in our Hispanic samples show remarkable concordance of direction of effects as well as strong correlation in effect sizes. A meta-analysis of the European GLGC and our Hispanic datasets identified five novel regions reaching genome-wide significance: two for total cholesterol (FN1 and SAMM50), two for HDL cholesterol (LOC100996634 and COPB1) and one for LDL cholesterol (LINC00324/CTC1/PFAS). The top meta-analysis signals were found to be enriched for SNPs associated with gene expression in a tissue-specific fashion, suggesting an enrichment of tissue-specific function in lipid-associated loci. PMID:26780889
In search of causal variants: refining disease association signals using cross-population contrasts.
Saccone, Nancy L; Saccone, Scott F; Goate, Alison M; Grucza, Richard A; Hinrichs, Anthony L; Rice, John P; Bierut, Laura J
2008-08-29
Genome-wide association (GWA) using large numbers of single nucleotide polymorphisms (SNPs) is now a powerful, state-of-the-art approach to mapping human disease genes. When a GWA study detects association between a SNP and the disease, this signal usually represents association with a set of several highly correlated SNPs in strong linkage disequilibrium. The challenge we address is to distinguish among these correlated loci to highlight potential functional variants and prioritize them for follow-up. We implemented a systematic method for testing association across diverse population samples having differing histories and LD patterns, using a logistic regression framework. The hypothesis is that important underlying biological mechanisms are shared across human populations, and we can filter correlated variants by testing for heterogeneity of genetic effects in different population samples. This approach formalizes the descriptive comparison of p-values that has typified similar cross-population fine-mapping studies to date. We applied this method to correlated SNPs in the cholinergic nicotinic receptor gene cluster CHRNA5-CHRNA3-CHRNB4, in a case-control study of cocaine dependence composed of 504 European-American and 583 African-American samples. Of the 10 SNPs genotyped in the r2 > or = 0.8 bin for rs16969968, three demonstrated significant cross-population heterogeneity and are filtered from priority follow-up; the remaining SNPs include rs16969968 (heterogeneity p = 0.75). Though the power to filter out rs16969968 is reduced due to the difference in allele frequency in the two groups, the results nevertheless focus attention on a smaller group of SNPs that includes the non-synonymous SNP rs16969968, which retains a similar effect size (odds ratio) across both population samples. Filtering out SNPs that demonstrate cross-population heterogeneity enriches for variants more likely to be important and causative. Our approach provides an important and effective tool to help interpret results from the many GWA studies now underway.
Shete, Sanjay; Lau, Ching C; Houlston, Richard S; Claus, Elizabeth B; Barnholtz-Sloan, Jill; Lai, Rose; Il’yasova, Dora; Schildkraut, Joellen; Sadetzki, Siegal; Johansen, Christoffer; Bernstein, Jonine L; Olson, Sara H; Jenkins, Robert B; Yang, Ping; Vick, Nicholas A; Wrensch, Margaret; Davis, Faith G; McCarthy, Bridget J; Leung, Eastwood Hon-chiu; Davis, Caleb; Cheng, Rita; Hosking, Fay J; Armstrong, Georgina N; Liu, Yanhong; Yu, Robert K; Henriksson, Roger; Consortium, The Gliogene; Melin, Beatrice S; Bondy, Melissa L
2011-01-01
Gliomas, which generally have a poor prognosis, are the most common primary malignant brain tumors in adults. Recent genome-wide association studies have demonstrated that inherited susceptibility plays a role in the development of glioma. Although first-degree relatives of patients exhibit a two-fold increased risk of glioma, the search for susceptibility loci in familial forms of the disease has been challenging because the disease is relatively rare, fatal, and heterogeneous, making it difficult to collect sufficient biosamples from families for statistical power. To address this challenge, the Genetic Epidemiology of Glioma International Consortium (Gliogene) was formed to collect DNA samples from families with two or more cases of histologically confirmed glioma. In this study, we present results obtained from 46 U.S. families in which multipoint linkage analyses were undertaken using nonparametric (model-free) methods. After removal of high linkage disequilibrium SNPs, we obtained a maximum nonparametric linkage score (NPL) of 3.39 (P=0.0005) at 17q12–21.32 and the Z-score of 4.20 (P=0.000007). To replicate our findings, we genotyped 29 independent U.S. families and obtained a maximum NPL score of 1.26 (P=0.008) and the Z-score of 1.47 (P=0.035). Accounting for the genetic heterogeneity using the ordered subset analysis approach, the combined analyses of 75 families resulted in a maximum NPL score of 3.81 (P=0.00001). The genomic regions we have implicated in this study may offer novel insights into glioma susceptibility, focusing future work to identify genes that cause familial glioma. PMID:22037877
Kos, Mark Z.; Glahn, David C.; Carless, Melanie A.; Olvera, Rene; McKay, D. Reese; Quillen, Ellen E.; Gelernter, Joel; Chen, Xiang-Ding; Deng, Hong-Wen; Kent, Jack W.; Dyer, Thomas D.; Göring, Harald H.H.; Curran, Joanne E.; Duggirala, Ravi; Blangero, John; Almasy, Laura
2014-01-01
Linkage studies of alcoholism have implicated several chromosome regions, leading to the successful identification of susceptibility genes, including ADH4 and GABRA2 on chromosome 4. Quantitative endophenotypes that are potentially closer to gene action than clinical endpoints offer a means of obtaining more refined linkage signals of genes that predispose alcohol use disorders (AUD). In this study we examine a self-reported measure of the maximum number of drinks consumed in a 24-hour period (abbreviated Max Drinks), a significantly heritable phenotype (h2 = 0.32 ± 0.05; P = 4.61 × 10−14) with a strong genetic correlation with AUD (ρg = 0.99 ± 0.13) for the San Antonio Family Study (n = 1,203). Genome-wide SNPs were analyzed using variance components linkage methods in the program SOLAR, revealing a novel, genome-wide significant QTL (LOD = 4.17; P = 5.85 × 10−6) for Max Drinks at chromosome 6p22.3, a region with a number of compelling candidate genes implicated in neuronal function and psychiatric illness. Joint analysis of Max Drinks and AUD status shows that the QTL has a significant non-zero effect on diagnosis (P = 4.04 × 10−3), accounting for 8.6% of the total variation. Significant SNP associations for Max Drinks were also identified at the linkage region, including one, rs7761213 (P = 2.14 × 10−4), obtained for an independent sample of Chinese families. Thus, our study identifies a potential risk locus for AUD at 6p22.3, with significant pleiotropic effects on the heaviness of alcohol consumption that may not be population specific. PMID:24692236
Pemberton, T J; Jakobsson, M; Conrad, D F; Coop, G; Wall, J D; Pritchard, J K; Patel, P I; Rosenberg, N A
2008-07-01
When performing association studies in populations that have not been the focus of large-scale investigations of haplotype variation, it is often helpful to rely on genomic databases in other populations for study design and analysis - such as in the selection of tag SNPs and in the imputation of missing genotypes. One way of improving the use of these databases is to rely on a mixture of database samples that is similar to the population of interest, rather than using the single most similar database sample. We demonstrate the effectiveness of the mixture approach in the application of African, European, and East Asian HapMap samples for tag SNP selection in populations from India, a genetically intermediate region underrepresented in genomic studies of haplotype variation.
Bassi, Denis; Briñez, Boris; Rosa, Juliana Santa; Oblessuc, Paula Rodrigues; Almeida, Caléo Panhoca de; Nucci, Stella Maris; Silva, Larissa Chariel Domingos da; Chiorato, Alisson Fernando; Vianello, Rosana Pereira; Camargo, Luis Eduardo Aranha; Blair, Matthew Wohlgemuth; Benchimol-Reis, Luciana Lasry
2017-01-01
Angular leaf spot (ALS) and powdery mildew (PWM) are two important fungi diseases causing significant yield losses in common beans. In this study, a new genetic linkage map was constructed using single sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs), in a segregating population derived from the AND 277 x SEA 5 cross, with 105 recombinant inbred lines. Phenotypic evaluations were performed in the greenhouse to identify quantitative trait loci (QTLs) associated with resistance by means of the composite interval mapping analysis. Four QTLs were identified for ALS resistance. The QTL ALS11AS, linked on the SNP BAR 5054, mapped on chromosome Pv11, showed the greatest effect (R2 = 26.5%) on ALS phenotypic variance. For PWM resistance, two QTLs were detected, PWM2AS and PWM11AS, on Pv2 and Pv11, explaining 7% and 66% of the phenotypic variation, respectively. Both QTLs on Pv11 were mapped on the same genomic region, suggesting that it is a pleiotropic region. The present study resulted in the identification of new markers closely linked to ALS and PWM QTLs, which can be used for marker-assisted selection, fine mapping and positional cloning.
PHIP - a novel candidate breast cancer susceptibility locus on 6q14.1
Jiao, Xiang; Aravidis, Christos; Marikkannu, Rajeshwari; Rantala, Johanna; Picelli, Simone; Adamovic, Tatjana; Liu, Tao; Maguire, Paula; Kremeyer, Barbara; Luo, Liping; von Holst, Susanna; Kontham, Vinaykumar; Thutkawkorapin, Jessada; Margolin, Sara; Du, Quan; Lundin, Johanna; Michailidou, Kyriaki; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Lush, Michael; Ambrosone, Christine B.; Andrulis, Irene L.; Anton-Culver, Hoda; Antonenkova, Natalia N.; Arndt, Volker; Beckmann, Matthias W.; Blomqvist, Carl; Blot, William; Boeckx, Bram; Bojesen, Stig E.; Bonanni, Bernardo; Brand, Judith S.; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Cai, Qiuyin; Chang-Claude, Jenny; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Deming-Halverson, Sandra L.; Devilee, Peter; dos-Santos-Silva, Isabel; Dörk, Thilo; Eriksson, Mikael; Fasching, Peter A.; Figueroa, Jonine; Flesch-Janys, Dieter; Flyger, Henrik; Gabrielson, Marike; García-Closas, Montserrat; Giles, Graham G.; González-Neira, Anna; Guénel, Pascal; Guo, Qi; Gündert, Melanie; Haiman, Christopher A.; Hallberg, Emily; Hamann, Ute; Harrington, Patricia; Hooning, Maartje J.; Hopper, John L.; Huang, Guanmengqian; Jakubowska, Anna; Jones, Michael E.; Kerin, Michael J.; Kosma, Veli-Matti; Kristensen, Vessela N.; Lambrechts, Diether; Le Marchand, Loic; Lubinski, Jan; Mannermaa, Arto; Martens, John W.M.; Meindl, Alfons; Milne, Roger L.; Mulligan, Anna Marie; Neuhausen, Susan L.; Nevanlinna, Heli; Peto, Julian; Pylkäs, Katri; Radice, Paolo; Rhenius, Valerie; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Seynaeve, Caroline; Shah, Mitul; Simard, Jacques; Southey, Melissa C.; Swerdlow, Anthony J.; Truong, Thérèse; Wendt, Camilla; Winqvist, Robert; Zheng, Wei; Benitez, Javier; Dunning, Alison M.; Pharoah, Paul D.P.; Easton, Douglas F.; Czene, Kamila; Hall, Per; Lindblom, Annika
2017-01-01
Most non-BRCA1/2 breast cancer families have no identified genetic cause. We used linkage and haplotype analyses in familial and sporadic breast cancer cases to identify a susceptibility locus on chromosome 6q. Two independent genome-wide linkage analysis studies suggested a 3 Mb locus on chromosome 6q and two unrelated Swedish families with a LOD >2 together seemed to share a haplotype in 6q14.1. We hypothesized that this region harbored a rare high-risk founder allele contributing to breast cancer in these two families. Sequencing of DNA and RNA from the two families did not detect any pathogenic mutations. Finally, 29 SNPs in the region were analyzed in 44,214 cases and 43,532 controls from BCAC, and the original haplotypes in the two families were suggested as low-risk alleles for European and Swedish women specifically. There was also some support for one additional independent moderate-risk allele in Swedish familial samples. The results were consistent with our previous findings in familial breast cancer and supported a breast cancer susceptibility locus at 6q14.1 around the PHIP gene. PMID:29262523
ParallABEL: an R library for generalized parallelization of genome-wide association studies
2010-01-01
Background Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous acquiring the necessary programming skills to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Results Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC) includes 2,062 individuals with 545,080, SNPs' genotyping, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Conclusions Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance, and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL. PMID:20429914
Buzanskas, Marcos Eli; Ventura, Ricardo Vieira; Seleguim Chud, Tatiane Cristina; Bernardes, Priscila Arrigucci; Santos, Daniel Jordan de Abreu; Regitano, Luciana Correia de Almeida; de Alencar, Maurício Mello; Mudadu, Maurício de Alvarenga; Zanella, Ricardo; da Silva, Marcos Vinícius Gualberto Barbosa; Li, Changxi; Schenkel, Flavio Schramm; Munari, Danísio Prado
2017-01-01
The aim of this study was to evaluate the level of introgression of breeds in the Canchim (CA: 62.5% Charolais—37.5% Zebu) and MA genetic group (MA: 65.6% Charolais—34.4% Zebu) cattle using genomic information on Charolais (CH), Nelore (NE), and Indubrasil (IB) breeds. The number of animals used was 395 (CA and MA), 763 (NE), 338 (CH), and 37 (IB). The Bovine50SNP BeadChip from Illumina panel was used to estimate the levels of introgression of breeds considering the Maximum likelihood, Bayesian, and Single Regression method. After genotype quality control, 32,308 SNPs were considered in the analysis. Furthermore, three thresholds to prune out SNPs in linkage disequilibrium higher than 0.10, 0.05, and 0.01 were considered, resulting in 15,286, 7,652, and 1,582 SNPs, respectively. For k = 2, the proportion of taurine and indicine varied from the expected proportion based on pedigree for all methods studied. For k = 3, the Regression method was able to differentiate the animals in three main clusters assigned to each purebred breed, showing more reasonable according to its biological viewpoint. Analyzing the data considering k = 2 seems to be more appropriate for Canchim-MA animals due to its biological interpretation. The usage of 32,308 SNPs in the analyses resulted in similar findings between the estimated and expected breed proportions. Using the Regression approach, a contribution of Indubrasil was observed in Canchim-MA when k = 3 was considered. Genetic parameter estimation could account for this breed composition information as a source of variation in order to improve the accuracy of genetic models. Our findings may help assemble appropriate reference populations for genomic prediction for Canchim-MA in order to improve prediction accuracy. Using the information on the level of introgression in each individual could also be useful in breeding or crossing design to improve individual heterosis in crossbred cattle. PMID:28182737
Buzanskas, Marcos Eli; Ventura, Ricardo Vieira; Seleguim Chud, Tatiane Cristina; Bernardes, Priscila Arrigucci; Santos, Daniel Jordan de Abreu; Regitano, Luciana Correia de Almeida; Alencar, Maurício Mello de; Mudadu, Maurício de Alvarenga; Zanella, Ricardo; da Silva, Marcos Vinícius Gualberto Barbosa; Li, Changxi; Schenkel, Flavio Schramm; Munari, Danísio Prado
2017-01-01
The aim of this study was to evaluate the level of introgression of breeds in the Canchim (CA: 62.5% Charolais-37.5% Zebu) and MA genetic group (MA: 65.6% Charolais-34.4% Zebu) cattle using genomic information on Charolais (CH), Nelore (NE), and Indubrasil (IB) breeds. The number of animals used was 395 (CA and MA), 763 (NE), 338 (CH), and 37 (IB). The Bovine50SNP BeadChip from Illumina panel was used to estimate the levels of introgression of breeds considering the Maximum likelihood, Bayesian, and Single Regression method. After genotype quality control, 32,308 SNPs were considered in the analysis. Furthermore, three thresholds to prune out SNPs in linkage disequilibrium higher than 0.10, 0.05, and 0.01 were considered, resulting in 15,286, 7,652, and 1,582 SNPs, respectively. For k = 2, the proportion of taurine and indicine varied from the expected proportion based on pedigree for all methods studied. For k = 3, the Regression method was able to differentiate the animals in three main clusters assigned to each purebred breed, showing more reasonable according to its biological viewpoint. Analyzing the data considering k = 2 seems to be more appropriate for Canchim-MA animals due to its biological interpretation. The usage of 32,308 SNPs in the analyses resulted in similar findings between the estimated and expected breed proportions. Using the Regression approach, a contribution of Indubrasil was observed in Canchim-MA when k = 3 was considered. Genetic parameter estimation could account for this breed composition information as a source of variation in order to improve the accuracy of genetic models. Our findings may help assemble appropriate reference populations for genomic prediction for Canchim-MA in order to improve prediction accuracy. Using the information on the level of introgression in each individual could also be useful in breeding or crossing design to improve individual heterosis in crossbred cattle.
Smith, C T; Dang, L C; Buckholtz, J W; Tetreault, A M; Cowan, R L; Kessler, R M; Zald, D H
2017-04-11
Dopamine function is broadly implicated in multiple neuropsychiatric conditions believed to have a genetic basis. Although a few positron emission tomography (PET) studies have investigated the impact of single-nucleotide polymorphisms (SNPs) in the dopamine D2 receptor gene (DRD2) on D2/3 receptor availability (binding potential, BP ND ), these studies have often been limited by small sample size. Furthermore, the most commonly studied SNP in D2/3 BP ND (Taq1A) is not located in the DRD2 gene itself, suggesting that its linkage with other DRD2 SNPs may explain previous PET findings. Here, in the largest PET genetic study to date (n=84), we tested for effects of the C957T and -141C Ins/Del SNPs (located within DRD2) as well as Taq1A on BP ND of the high-affinity D2 receptor tracer 18 F-Fallypride. In a whole-brain voxelwise analysis, we found a positive linear effect of C957T T allele status on striatal BP ND bilaterally. The multilocus genetic scores containing C957T and one or both of the other SNPs produced qualitatively similar striatal results to C957T alone. The number of C957T T alleles predicted BP ND in anatomically defined putamen and ventral striatum (but not caudate) regions of interest, suggesting some regional specificity of effects in the striatum. By contrast, no significant effects arose in cortical regions. Taken together, our data support the critical role of C957T in striatal D2/3 receptor availability. This work has implications for a number of psychiatric conditions in which dopamine signaling and variation in C957T status have been implicated, including schizophrenia and substance use disorders.
Yoshikawa, Munemitsu; Yamashiro, Kenji; Miyake, Masahiro; Oishi, Maho; Akagi-Kurashige, Yumiko; Kumagai, Kyoko; Nakata, Isao; Nakanishi, Hideo; Oishi, Akio; Gotoh, Norimoto; Yamada, Ryo; Matsuda, Fumihiko; Yoshimura, Nagahisa
2014-10-21
We investigated the association between refractive error in a Japanese population and myopia-related genes identified in two recent large-scale genome-wide association studies. Single-nucleotide polymorphisms (SNPs) in 51 genes that were reported by the Consortium for Refractive Error and Myopia and/or the 23andMe database were genotyped in 3712 healthy Japanese volunteers from the Nagahama Study using HumanHap610K Quad, HumanOmni2.5M, and/or HumanExome Arrays. To evaluate the association between refractive error and recently identified myopia-related genes, we used three approaches to perform quantitative trait locus analyses of mean refractive error in both eyes of the participants: per-SNP, gene-based top-SNP, and gene-based all-SNP analyses. Association plots of successfully replicated genes also were investigated. In our per-SNP analysis, eight myopia gene associations were replicated successfully: GJD2, RASGRF1, BICC1, KCNQ5, CD55, CYP26A1, LRRC4C, and B4GALNT2.Seven additional gene associations were replicated in our gene-based analyses: GRIA4, BMP2, QKI, BMP4, SFRP1, SH3GL2, and EHBP1L1. The signal strength of the reported SNPs and their tagging SNPs increased after considering different linkage disequilibrium patterns across ethnicities. Although two previous studies suggested strong associations between PRSS56, LAMA2, TOX, and RDH5 and myopia, we could not replicate these results. Our results confirmed the significance of the myopia-related genes reported previously and suggested that gene-based replication analyses are more effective than per-SNP analyses. Our comparison with two previous studies suggested that BMP3 SNPs cause myopia primarily in Caucasian populations, while they may exhibit protective effects in Asian populations. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
Zhou, Fei; Wang, Yanru; Liu, Hongliang; Ready, Neal; Han, Younghun; Hung, Rayjean J; Brhane, Yonathan; McLaughlin, John; Brennan, Paul; Bickeböller, Heike; Rosenberger, Albert; Houlston, Richard S; Caporaso, Neil; Landi, Maria Teresa; Brüske, Irene; Risch, Angela; Ye, Yuanqing; Wu, Xifeng; Christiani, David C; Goodman, Gary; Chen, Chu; Amos, Christopher I; Wei, Qingyi
2017-04-01
mRNA degradation is an important regulatory step for controlling gene expression and cell functions. Genetic abnormalities involved in mRNA degradation genes were found to be associated with cancer risks. Therefore, we systematically investigated the roles of genetic variants in the general mRNA degradation pathway in lung cancer risk. Meta-analyses were conducted using summary data from six lung cancer genome-wide association studies (GWASs) from the Transdisciplinary Research in Cancer of the Lung and additional two GWASs from Harvard University and deCODE in the International Lung Cancer Consortium. Expression quantitative trait loci analysis (eQTL) was used for in silico functional validation of the identified significant susceptibility loci. This pathway-based analysis included 6816 single nucleotide polymorphisms (SNP) in 68 genes in 14 463 lung cancer cases and 44 188 controls. In the single-locus analysis, we found that 20 SNPs were associated with lung cancer risk with a false discovery rate threshold of <0.05. Among the 11 newly identified SNPs in CNOT6, which were in high linkage disequilibrium, the rs2453176 with a RegulomDB score "1f" was chosen as the tagSNP for further analysis. We found that the rs2453176 T allele was significantly associated with lung cancer risk (odds ratio = 1.11, 95% confidence interval = 1.04-1.18) in the eight GWASs. In the eQTL analysis, we found that levels of CNOT6 mRNA expression were significantly correlated with the rs2453176 T allele, which provided additional biological basis for the observed positive association. The CNOT6 rs2453176 SNP may be a new functional susceptible locus for lung cancer risk. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Examination of Association to Autism of Common Genetic Variation in Genes Related to Dopamine
Anderson, B.M.; Schnetz-Boutaud, N.; Bartlett, J.; Wright, H.H.; Abramson, R.K.; Cuccaro, M.L.; Gilbert, J.R.; Pericak-Vance, M.A.; Haines, J.L.
2010-01-01
Autism is a severe neurodevelopmental disorder characterized by a triad of complications. Autistic individuals display significant disturbances in language and reciprocal social interactions, combined with repetitive and stereotypic behaviors. Prevalence studies suggest that autism is more common than originally believed, with recent estimates citing a rate of one in 150. Although this genomic approach has yielded multiple suggestive regions, a specific risk locus has yet to be identified and widely confirmed. Because many etiologies have been suggested for this complex syndrome, we hypothesize that one of the difficulties in identifying autism genes is that multiple genetic variants may be required to significantly increase the risk of developing autism. Thus we took the alternative approach of examining 14 prominent dopamine pathway candidate genes for detailed study by genotyping 28 SNPs. Although we did observe a nominally significant association for rs2239535 (p=.008) on chromosome 20, single locus analysis did not reveal any results as significant after correction for multiple comparisons. No significant interaction was identified when Multifactor Dimensionality Reduction (MDR) was employed to test specifically for multilocus effects. Although genome-wide linkage scans in autism have provided support for linkage to various loci along the dopamine pathway, our study does not provide strong evidence of linkage or association to any specific gene or combination of genes within the pathway. These results demonstrate that common genetic variation within the tested genes located within this pathway at most play a minor to moderate role in overall autism pathogenesis. PMID:19360691
Portis, Ezio; Scaglione, Davide; Acquadro, Alberto; Mauromicale, Giovanni; Mauro, Rosario; Knapp, Steven J; Lanteri, Sergio
2012-05-23
The Asteraceae species Cynara cardunculus (2n = 2x = 34) includes the two fully cross-compatible domesticated taxa globe artichoke (var. scolymus L.) and cultivated cardoon (var. altilis DC). As both are out-pollinators and suffer from marked inbreeding depression, linkage analysis has focussed on the use of a two way pseudo-test cross approach. A set of 172 microsatellite (SSR) loci derived from expressed sequence tag DNA sequence were integrated into the reference C. cardunculus genetic maps, based on segregation among the F1 progeny of a cross between a globe artichoke and a cultivated cardoon. The resulting maps each detected 17 major linkage groups, corresponding to the species' haploid chromosome number. A consensus map based on 66 co-dominant shared loci (64 SSRs and two SNPs) assembled 694 loci, with a mean inter-marker spacing of 2.5 cM. When the maps were used to elucidate the pattern of inheritance of head production earliness, a key commercial trait, seven regions were shown to harbour relevant quantitative trait loci (QTL). Together, these QTL accounted for up to 74% of the overall phenotypic variance. The newly developed consensus as well as the parental genetic maps can accelerate the process of tagging and eventually isolating the genes underlying earliness in both the domesticated C. cardunculus forms. The largest single effect mapped to the same linkage group in each parental maps, and explained about one half of the phenotypic variance, thus representing a good candidate for marker assisted selection.
Qiu, Gao-Feng; Xiong, Liang-Wei; Han, Zhi-Ke; Liu, Zhi-Qiang; Feng, Jian-Bin; Wu, Xu-Gan; Yan, Yin-Long; Shen, Hong; Huang, Long; Chen, Li
2017-01-01
The Chinese mitten crab Eriocheir sinensis is the most economically important cultivated crab species in China, and its genome has a high number of chromosomes (2n = 146). To obtain sufficient markers for construction of a dense genetic map for this species, we employed the recently developed specific-locus amplified fragment sequencing (SLAF-seq) method for large-scale SNPs screening and genotyping in a F1 full-sib family of 149 individuals. SLAF-seq generated 127,677 polymorphic SNP markers, of which 20,803 valid markers were assigned into five segregation types and were used together with previous SSR markers for linkage map construction. The final integrated genetic map included 17,680 SNP and 629 SSR markers on the 73 linkage groups (LG), and spanned 14,894.9 cM with an average marker interval of 0.81 cM. QTL mapping localized three significant growth-related QTL to a 1.2 cM region in LG53 as well as 146 sex-linked markers in LG48. Genome-wide QTL-association analysis further identified four growth-related QTL genes named LNX2, PAK2, FMRFamide and octopamine receptors. These genes are involved in a variety of different signaling pathways including cell proliferation and growth. The map and SNP markers described here will be a valuable resource for the E. sinensis genome project and selective breeding programs. PMID:28045132
Chromatin remodeling gene EZH2 involved in the genetic etiology of autism in Chinese Han population.
Li, Jun; You, Yang; Yue, Weihua; Yu, Hao; Lu, Tianlan; Wu, Zhiliu; Jia, Meixiang; Ruan, Yanyan; Liu, Jing; Zhang, Dai; Wang, Lifang
2016-01-01
Autism spectrum disorder (ASD) is a group of severe neurodevelopmental disorders. Epigenetic factors play a critical role in the etiology of ASD. Enhancer of zest homolog 2 (EZH2), which encodes a histone methyltransferase, plays an important role in the process of chromatin remodeling during neurodevelopment. Further, EZH2 is located in chromosome 7q35-36, which is one of the linkage regions for autism. However, the genetic relationship between autism and EZH2 remains unclear. To investigate the association between EZH2 and autism in Chinese Han population, we performed a family-based association study between autism and three tagged single nucleotide polymorphisms (SNPs) that covered 95.4% of the whole region of EZH2. In the discovery cohort of 239 trios, two SNPs (rs740949 and rs6464926) showed a significant association with autism. To decrease false positive results, we expanded the sample size to 427 trios. A SNP (rs6464926) was significantly associated with autism even after Bonferroni correction (p=0.008). Haplotype G-T (rs740949 and rs6464926) was a risk factor for autism (Z=2.655, p=0.008, Global p=0.024). In silico function prediction for SNPs indicated that these two SNPs might be regulatory SNPs. Expression pattern of EZH2 showed that it is highly expressed in human embryonic brains. In conclusion, our findings demonstrate that EZH2 might contribute to the genetic etiology of autism in Chinese Han population. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Engelman, Corinne D; Meyers, Kristin J; Ziegler, Julie T; Taylor, Kent D; Palmer, Nicholette D; Haffner, Steven M; Fingerlin, Tasha E; Wagenknecht, Lynne E; Rotter, Jerome I; Bowden, Donald W; Langefeld, Carl D; Norris, Jill M
2010-10-01
Vitamin D deficiency is associated with many adverse health outcomes. There are several well established environmental predictors of vitamin D concentrations, yet studies of the genetic determinants of vitamin D concentrations are in their infancy. Our objective was to conduct a pilot genome-wide association (GWA) study of 25-hydroxyvitamin D (25[OH]D) and 1,25-dihydroxyvitamin D (1,25[OH](2)D) concentrations in a subset of 229 Hispanic subjects, followed by replication genotyping of 50 single nucleotide polymorphisms (SNPs) in the entire sample of 1190 Hispanics from San Antonio, Texas and San Luis Valley, Colorado. Of the 309,200 SNPs that met all quality control criteria, three SNPs in high linkage disequilibrium (LD) with each other were significantly associated with 1,25[OH](2)D (rs6680429, rs9970802, and rs10889028) at a Bonferroni corrected P-value threshold of 1.62 × 10(-7), however none met the threshold for 25[OH]D. Of the 50 SNPs selected for replication genotyping, five for 25[OH]D (rs2806508, rs10141935, rs4778359, rs1507023, and rs9937918) and eight for 1,25[OH](2)D (rs6680429, rs1348864, rs4559029, rs12667374, rs7781309, rs10505337, rs2486443, and rs2154175) were replicated in the entire sample of Hispanics (P<0.01). In conclusion, we identified several SNPs that were associated with vitamin D metabolite concentrations in Hispanics. These candidate polymorphisms merit further investigation in independent populations and other ethnicities. Copyright © 2010 Elsevier Ltd. All rights reserved.
Ulloa, Mauricio; Hulse-Kemp, Amanda M; De Santiago, Luis M; Stelly, David M; Burke, John J
2017-01-01
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, eg, upland cotton ( Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F 2 , a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD) 1 arisen from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the A t and D t subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the D t subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral A t -subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum . However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalance SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid A t and D t subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G hirsutum with a core of reproducible mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
HTRA1 promoter polymorphism predisposes Japanese to age-related macular degeneration.
Yoshida, Tsunehiko; DeWan, Andrew; Zhang, Hong; Sakamoto, Ryosuke; Okamoto, Haru; Minami, Masayoshi; Obazawa, Minoru; Mizota, Atsushi; Tanaka, Minoru; Saito, Yoshihiro; Takagi, Ikue; Hoh, Josephine; Iwata, Takeshi
2007-04-04
To study the effect of candidate single nucleotide polymorphisms (SNPs) on chromosome 10q26, recently shown to be associated with wet age-related macular degeneration (AMD) in Chinese and Caucasian cohorts, in a Japanese cohort. Using genomic DNA isolated from peripheral blood of wet AMD cases and age-matched controls, we genotyped two SNPs, rs10490924, and rs11200638, on chromosome 10q26, 6.6 kb and 512 bp upstream of the HTRA1 gene, respectively, using temperature gradient capillary electrophoresis (TGCE) and direct sequencing. Association tests were performed for individual SNPs and jointly with SNP complement factor H (CFH) Y402H. The two SNPs, rs10490924 and rs11200638, are in complete linkage disequilibrium (D'=1). Previous sequence comparisons among seventeen species revealed that the genomic region containing rs11200638 was highly conserved while the region surrounding rs10490924 was not. The allelic association test for rs11200638 yielded a p-value <10(-11). SNP rs11200638 conferred disease risk in an autosomal recessive fashion: Odds ratio was 10.1 (95% CI 4.36, 23.06), adjusted for SNP CFH 402, for those carrying two copies of the risk allele, whereas indistinguishable from unity if carrying only one risk allele. The HTRA1 promoter polymorphism, rs11200638, is a strong candidate with a functional consequence that predisposes Japanese to develop neovascular AMD.
Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner
2015-07-01
Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R² suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, and no non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui
2017-01-01
Flax is an important crop for oil and fiber, however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported in here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
NASA Astrophysics Data System (ADS)
Liu, Siwei; Li, Qi; Yu, Hong; Kong, Lingfeng
2017-02-01
Glycogen is important not only for the energy supplementary of oysters, but also for human consumption. High glycogen content can improve the stress survival of oyster. A key enzyme in glycogenesis is glycogen synthase that is encoded by glycogen synthase gene GYS. In this study, the relationship between single nucleotide polymorphisms (SNPs) in coding regions of Crassostrea gigas GYS (Cg-GYS) and individual glycogen content was investigated with 321 individuals from five full-sib families. Single-strand conformation polymorphism (SSCP) procedure was combined with sequencing to confirm individual SNP genotypes of Cg-GYS. Least-square analysis of variance was performed to assess the relationship of variation in glycogen content of C. gigas with single SNP genotype and SNP haplotype. As a consequence, six SNPs were found in coding regions to be significantly associated with glycogen content ( P < 0.01), from which we constructed four main haplotypes due to linkage disequilibrium. Furthermore, the most effective haplotype H2 (GAGGAT) had extremely significant relationship with high glycogen content ( P < 0.0001). These findings revealed the potential influence of Cg-GYS polymorphism on the glycogen content and provided molecular biological information for the selective breeding of good quality traits of C. gigas.
Cheng, Yunyun; Liu, Songcai; Su, Dan; Lu, Chao; Zhang, Xin; Wu, Qingyan; Li, Siming; Fu, Haoyu; Yu, Hao; Hao, Linlin
2016-03-01
Growth hormone (GH) has been considered as a candidate gene for growth and body size in pigs. In this study, polymorphisms of the GH1 gene were evaluated for associations with body size traits in 190 pig individuals. Seventeen single-nucleotide polymorphisms (SNPs) were identified in GH1 gene of the large pig breeds and miniature pig breeds using direct sequencing and genotyped by allele-specific PCR approach. Notably, six (g.237A>G, g.283T>C, g.309A>G, g.318A>G, g.540A>G and g.544A>G) of them were significantly associated with body size, of which three loci (g.283T>C, g.309A>G, g.318A>G) located in the signal-peptide coding region of GH1 gene compose a CGG haplotype for large pigs and TAA haplotype for miniature pigs (P <0.001), two loci (g.540A>G and g.544A>G) located in the second intron of GH1 gene compose a GG haplotype for large pigs and AA haplotype for miniature pigs (P < 0.001). Our results demonstrate that these SNPs in GH1 gene are associated with the body size of pigs providing genetic basis for pig breeding with the improved economic benefits.
Liu, Xiaoxi; Kawamura, Yoshiya; Shimada, Takafumi; Otowa, Takeshi; Koishi, Shinko; Sugiyama, Toshiro; Nishida, Hisami; Hashimoto, Ohiko; Nakagami, Ryoichi; Tochigi, Mamoru; Umekage, Tadashi; Kano, Yukiko; Miyagawa, Taku; Kato, Nobumasa; Tokunaga, Katsushi; Sasaki, Tsukasa
2010-03-01
The oxytocin receptor (OXTR) gene, which is located on chromosome 3p25.3, has been implicated as a candidate gene for susceptibility of autism spectrum disorder (ASD). Positive associations between OXTR and ASD have been reported in earlier studies. However, the results were inconsistent and demand further studies. In this study, we investigated the associations between OXTR and ASD in a Japanese population by analyzing 11 single-nucleotide polymorphisms (SNPs) using both family-based association test (FBAT) and population-based case-control test. No significant signal was detected in the FBAT test. However, significant differences were observed in allelic frequencies of four SNPs, including rs2254298 between patients and controls. The risk allele of rs2254298 was 'A', which was consistent with the previous study in Chinese, and not with the observations in Caucasian. The difference in the risk allele of this SNP in previous studies might be attributable to an ethnic difference in the linkage disequilibrium structure between the Asians and Caucasians. In addition, haplotype analysis exhibits a significant association between a five-SNP haplotype and ASD, including rs22542898. In conclusion, our study might support that OXTR has a significant role in conferring the risk of ASD in the Japanese population.
Rao, Shuquan; Ghani, Mahdi; Guo, Zhiyun; Deming, Yuetiva; Wang, Kesheng; Sims, Rebecca; Mao, Canquan; Yao, Yao; Cruchaga, Carlos; Stephan, Dietrich A; Rogaeva, Ekaterina
2018-06-01
Although multiple susceptibility loci for late-onset Alzheimer's disease (LOAD) have been identified, a large portion of the genetic risk for this disease remains unexplained. LOAD risk may be associated with single-nucleotide polymorphisms responsible for changes in gene expression (eSNPs). To detect eSNPs associated with LOAD, we integrated data from LOAD genome-wide association studies and expression quantitative trait loci using Sherlock (a Bayesian statistical method). We identified a cis-regulatory eSNP (rs2927438) located on chromosome 19q13.32, for which subsequent analyses confirmed the association with both LOAD risk and the expression level of several nearby genes. Importantly, rs2927438 may represent an APOE-independent LOAD eSNP according to the weak linkage disequilibrium of rs2927438 with the 2 polymorphisms (rs7412 and rs429358) defining the APOE-ε2, -ε3, and -ε4 alleles. Furthermore, rs2927438 does not influence chromatin interaction events at the APOE locus or cis-regulation of APOE expression. Further exploratory analysis revealed that rs2927438 is significantly associated with tau levels in the cerebrospinal fluid. Our findings suggest that rs2927438 may confer APOE-independent risk for LOAD. Copyright © 2017 Elsevier Inc. All rights reserved.
Zhou, Fei; Wang, Yanru; Liu, Hongliang; Ready, Neal; Han, Younghun; Hung, Rayjean J.; Brhane, Yonathan; McLaughlin, John; Brennan, Paul; Bickeböller, Heike; Rosenberger, Albert; Houlston, Richard S.; Caporaso, Neil; Landi, Maria Teresa; Brüske, Irene; Risch, Angela; Ye, Yuanqing; Wu, Xifeng; Christiani, David C.; Goodman, Gary; Chen, Chu; Amos, Christopher I.; Qingyi, Wei
2017-01-01
Purpose mRNA degradation is an important regulatory step for controlling gene expression and cell functions. Genetic abnormalities of the genes involved in mRNA degradation were found to be associated with cancer risks. Therefore, we systematically investigated the roles of genetic variants of genes in the general mRNA degradation pathway in lung cancer risk. Experimental design Meta-analyses were conducted in six lung cancer genome-wide association studies (GWASs) from the Transdisciplinary Research in Cancer of the Lung and additional two GWASs from Harvard University and deCODE in the International Lung Cancer Consortium. Expression quantitative trait loci analysis (eQTL) was used for in silico functional validation of the identified significant susceptibility loci. Results This pathway-based analysis included 4,603 single nucleotide polymorphisms (SNP) in 68 genes in 14,463 lung cancer cases and 44,188 controls, of which 20 SNPs were found to be associated with lung cancer risk with a false discovery rate threshold of <0.05. Among the 11 newly identified SNPs in CNOT6, which were in high linkage disequilibrium, the rs2453176 with a RegulomDB score “1f” was chosen as the tag SNP for further analysis. We found that the rs2453176 T allele was significantly associated with lung cancer risk (odds ratio=1.11, 95% confidence interval=1.04–1.18, P=0.001) in the eight GWASs. In the eQTL analysis, we found that levels of CNOT6 mRNA expression were significantly correlated with the rs2453176 T allele, which provided additional biological basis for the observed positive association. Conclusion The CNOT6 rs2453176 SNP may be a new functional susceptible locus for lung cancer risk. PMID:27805284
Tran, C.; Gagnon, F.; Wigg, K.G.; Feng, Y.; Gomez, L.; Cate-Carter, T.D.; Kerr, E.N.; Field, L.L.; Kaplan, B.J.; Lovett, M.W.; Barr, C.L.
2017-01-01
Reading disabilities (RD) have a significant genetic basis and have shown linkage to multiple regions including chromosome 15q. Dyslexia susceptibility 1 candidate gene 1 (DYX1C1) on chromosome 15q21 was originally proposed as a candidate gene with two potentially functional polymorphisms at the −3G/A and 1249G/T positions showing association with RD. However, subsequent studies have yielded mixed results. We performed a literature review and meta-analysis of the −3G/A and 1249G/T polymorphisms, including new unpublished data from two family-based samples. Ten markers in DYX1C1 were genotyped in the two independently ascertained samples. Single marker and −3G/A:1249G/T haplotype analyses were performed for RD in both samples, and quantitative trait analyses using standardized reading-related measures was performed in one of the samples. For the meta-analysis, we used a random-effects model to summarize studies that tested for association between −3G/A or 1249G/T and RD. No significant association was found between the DYX1C1 SNPs and RD or any of the reading-related measures tested after correction for the number of tests performed. The previously reported risk haplotype (−3A:1249T) was not biased in transmission. A total of 9 and 10 study samples were included in the meta-analysis of the −3G/A and 1249G/T polymorphisms, respectively. Neither polymorphism reached statistical significance, but the heterogeneity for the 1249G/T polymorphism was high. The results of this study do not provide evidence for association between the putatively functional SNPs −3G/A and 1249G/T and RD. PMID:23341075
High-Density SNP Genotyping to Define β-Globin Locus Haplotypes
Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.
2014-01-01
Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352
Hughes, Travis; Kim-Howard, Xana; Kelly, Jennifer A; Kaufman, Kenneth M; Langefeld, Carl D; Ziegler, Julie; Sanchez, Elena; Kimberly, Robert P; Edberg, Jeffrey C; Ramsey-Goldman, Rosalind; Petri, Michelle; Reveille, John D; Martín, Javier; Brown, Elizabeth E; Vilá, Luis M; Alarcón, Graciela S; James, Judith A; Gilkeson, Gary S; Moser, Kathy L; Gaffney, Patrick M; Merrill, Joan T; Vyse, Timothy J; Alarcón-Riquelme, Marta E; Nath, Swapan K; Harley, John B; Sawalha, Amr H
2011-06-01
Genetic association of the IL2/IL21 region at chromosome 4q27 has previously been reported in lupus and a number of autoimmune and inflammatory diseases. This study was undertaken to determine whether this genetic effect could be localized, using a very large cohort of lupus patients and controls. We genotyped 45 tag single-nucleotide polymorphisms (SNPs) across the IL2/IL21 locus in 2 large independent lupus sample sets. We studied a set of subjects of European descent consisting of 4,248 lupus patients and 3,818 healthy controls, and an African American set of 1,569 patients and 1,893 healthy controls. Imputation in 3,004 additional controls from the Wellcome Trust Case Control Consortium was also performed. Genetic association between the genotyped markers was determined, and pairwise conditional analysis was performed to localize the independent genetic effect in the IL2/IL21 locus in lupus. We established and confirmed the genetic association between IL2/IL21 and lupus. Using conditional analysis and transethnic mapping, we localized the genetic effect in this locus to 2 SNPs in high linkage disequilibrium: rs907715 located within IL21 (odds ratio 1.16 [95% confidence interval 1.10-1.22], P=2.17×10(-8)) and rs6835457 located in the 3'-untranslated flanking region of IL21 (odds ratio 1.11 [95% confidence interval 1.05-1.17], P=9.35×10(-5)). Our findings establish the genetic association between lupus and IL2/IL21 with a genome-wide level of significance. Further, our findings indicate that this genetic association within the IL2/IL21 linkage disequilibrium block is localized to IL21. If other autoimmune IL2/IL21 genetic associations are similarly localized, then the IL21 risk alleles would be predicted to operate by a fundamental mechanism that influences the course of a number of autoimmune disease processes. Copyright © 2011 by the American College of Rheumatology.
Nouri, Kazem; Haslinger, Peter; Szabo, Ladislaus; Sator, Michael; Schreiber, Martin; Schneeberger, Christian; Pietrowski, Detlef
2014-01-01
Ovarian hyperstimulation syndrome (OHSS) is the most serious complication of IVF/ICSI therapy. The pathophysiology and etiology of the disease is still not fully clarified. To assess whether polymorphisms of the VEGF/VEGF-receptor system contribute to the occurrence of ovarian hyperstimulation syndrome (OHSS), we performed a retrospective analysis of 116 OHSS patients, and 124 female controls. The following SNPs were genotyped: Rs2071559 (VEGFR2-604); rs2305948 (VEGFR2-1192); rs1870377 (VEGFR2-1719); rs2010963 (VEGF-405); and rs111458691 (VEGFR1-519). Odds ratios (ORs) were estimated with a 95% confidence interval (CI). Linkage disequilibrium (LD) analysis was performed in the three loci of the VEGFR2 gene. We found an overrepresentation of the T allele of the VEGFR1-519 polymorphism in OHSS patients (P = 0.02, OR: 3.62, CI: 1.16 - 11.27). By genotype modeling, we found that polymorphism of VEGFR1-519 and VEGF-405 showed significant differences in patients and controls (p = 0.02, OR: 3.79 CI: 1.98 - 11.97 and p = 0.000005, OR: 0.29, CI: 0.17 - 0.50). LD analysis revealed significant linkage disequilibrium in VEGFR2. Polymorphisms in the VEGFR2 gene and in the VEGF gene are associated with the occurrence of OHSS. This strengthens the evidence for an important role of the VEGF/VEGF- receptor system in the occurrence of OHSS.
Kang, Ho-Jin; Song, Im-Sook; Shin, Ho Jung; Kim, Woo-Young; Lee, Choong-Hee; Shim, Joo-Cheol; Zhou, Hong-Hao; Lee, Sang Seop; Shin, Jae-Gook
2007-04-01
Genetic variants of three human organic cation transporter genes (hOCTs) were extensively explored in a Korean population. The functional changes of hOCT2 variants were evaluated in vitro, and those genetic polymorphisms of hOCTs were compared among different ethnic populations. From direct DNA sequencing, 7 of 13 coding variants were nonsynonymous single-nucleotide polymorphisms (SNPs), including four variants from hOCT1 (F160L, P283L, P341L, and M408V) and three from hOCT2 (T199I, T201M, and A270S), whereas 6 were synonymous SNPs. The linkage disequilibrium analysis presented for three independent LD blocks for each hOCT gene showed no significant linkage among all three hOCT genes. The transporter activities of MDCK cells that overexpress the hOCT2-T199I, -T201M, and -A270S variants showed significantly decreased uptake of [(3)H]methyl-4-phenylpyridinium acetate (MPP(+)) or [(14)C]tetraethylammonium compared with those cells that overexpress wild-type hOCT2, and the estimated kinetic parameters of these variants for [(3)H]MPP(+) uptake in oocytes showed a 2- to 5-fold increase in K(m) values and a 10- to 20-fold decrease in V(max) values. The allele frequencies of the five functional variants hOCT1-P283L, -P341L, and hOCT2-T199I, -T201M, and -A270S were 1.3, 17, 0.7, 0.7, and 11%, respectively, in a Korean population; the frequency distributions of these variants were not significantly different from those of Chinese and Vietnamese populations. These findings suggest that genetic variants of hOCTs are not linked among three genes in a Korean population, and several of the hOCT genetic variants cause decreased transport activity in vitro compared with the wild type, although the clinical relevance of these variants remains to be evaluated.
Extent of Linkage Disequilibrium in the Domestic Cat, Felis silvestris catus, and Its Breeds
Alhaddad, Hasan; Khan, Razib; Grahn, Robert A.; Gandolfi, Barbara; Mullikin, James C.; Cole, Shelley A.; Gruffydd-Jones, Timothy J.; Häggström, Jens; Lohi, Hannes; Longeri, Maria; Lyons, Leslie A.
2013-01-01
Domestic cats have a unique breeding history and can be used as models for human hereditary and infectious diseases. In the current era of genome-wide association studies, insights regarding linkage disequilibrium (LD) are essential for efficient association studies. The objective of this study is to investigate the extent of LD in the domestic cat, Felis silvestris catus, particularly within its breeds. A custom illumina GoldenGate Assay consisting of 1536 single nucleotide polymorphisms (SNPs) equally divided over ten 1 Mb chromosomal regions was developed, and genotyped across 18 globally recognized cat breeds and two distinct random bred populations. The pair-wise LD descriptive measure (r 2) was calculated between the SNPs in each region and within each population independently. LD decay was estimated by determining the non-linear least-squares of all pair-wise estimates as a function of distance using established models. The point of 50% decay of r2 was used to compare the extent of LD between breeds. The longest extent of LD was observed in the Burmese breed, where the distance at which r2 ≈ 0.25 was ∼380 kb, comparable to several horse and dog breeds. The shortest extent of LD was found in the Siberian breed, with an r2 ≈ 0.25 at approximately 17 kb, comparable to random bred cats and human populations. A comprehensive haplotype analysis was also conducted. The haplotype structure of each region within each breed mirrored the LD estimates. The LD of cat breeds largely reflects the breeds’ population history and breeding strategies. Understanding LD in diverse populations will contribute to an efficient use of the newly developed SNP array for the cat in the design of genome-wide association studies, as well as to the interpretation of results for the fine mapping of disease and phenotypic traits. PMID:23308248
Kawasaki, Eiji; Awata, Takuya; Ikegami, Hiroshi; Kobayashi, Tetsuro; Maruyama, Taro; Nakanishi, Koji; Shimada, Akira; Uga, Miho; Uga, Mho; Kurihara, Susumu; Kawabata, Yumiko; Tanaka, Shoichiro; Kanazawa, Yasuhiko; Lee, Inkyu; Eguchi, Katsumi
2006-03-15
The protein tyrosine phosphatase, nonreceptor 22 gene (PTPN22) maps to human chromosome 1p13.3-p13.1 and encodes an important negative regulator of T-cell activation, lymphoid-specific phosphatase (Lyp). Recently, the minor allele of a single-nucleotide polymorphism (SNP) at nucleotide position 1858 (rs2476601, +1858C > T) was found to be associated with type 1 diabetes. However, the degree of the association is variable among ethnic populations, suggesting the presence of other disease-associated variants in PTPN22. To examine this possibility, we carried out a systemic search for PTPN22 using direct sequencing of PCR-amplified products in the Japanese population. Association and linkage studies were also conducted in 1,690 Japanese samples, 180 Korean samples, and 472 Caucasian samples from 95 nuclear families. We identified five novel SNPs, but not the +1858C > T SNP. Of these two frequent SNPs, -1123G > C, and +2740C > T were in strong linkage disequilibrium (LD), and the -1123G > C promoter SNP was associated with acute-onset but not slow-onset type 1 diabetes in the Japanese population (odds ratio [OR] = 1.42, 95% CI = 1.07-1.89, P = 0.015). This association was observed also in Korean patients with type 1 diabetes (Mantel-Haenszel chi2= 6.543, P = 0.0105, combined OR = 1.41 95% CI = 1.09-1.82). Furthermore, the affected family-based control (AFBAC) association test and the transmission disequilibrium analysis of multiplex families of European descent from the British Diabetes Association (BDA) Warren Repository indicated that the association was stronger in -1123G > C compared to +1858C > T. In conclusion, the type 1 diabetes association with PTPN22 is confirmed, but it cannot be attributed solely to the +1858C > T variant. The promoter -1123G > C SNP is a more likely causative variant in PTPN22. 2006 Wiley-Liss, Inc.
Extent of linkage disequilibrium in the domestic cat, Felis silvestris catus, and its breeds.
Alhaddad, Hasan; Khan, Razib; Grahn, Robert A; Gandolfi, Barbara; Mullikin, James C; Cole, Shelley A; Gruffydd-Jones, Timothy J; Häggström, Jens; Lohi, Hannes; Longeri, Maria; Lyons, Leslie A
2013-01-01
Domestic cats have a unique breeding history and can be used as models for human hereditary and infectious diseases. In the current era of genome-wide association studies, insights regarding linkage disequilibrium (LD) are essential for efficient association studies. The objective of this study is to investigate the extent of LD in the domestic cat, Felis silvestris catus, particularly within its breeds. A custom illumina GoldenGate Assay consisting of 1536 single nucleotide polymorphisms (SNPs) equally divided over ten 1 Mb chromosomal regions was developed, and genotyped across 18 globally recognized cat breeds and two distinct random bred populations. The pair-wise LD descriptive measure (r(2)) was calculated between the SNPs in each region and within each population independently. LD decay was estimated by determining the non-linear least-squares of all pair-wise estimates as a function of distance using established models. The point of 50% decay of r(2) was used to compare the extent of LD between breeds. The longest extent of LD was observed in the Burmese breed, where the distance at which r(2) ≈ 0.25 was ∼380 kb, comparable to several horse and dog breeds. The shortest extent of LD was found in the Siberian breed, with an r(2) ≈ 0.25 at approximately 17 kb, comparable to random bred cats and human populations. A comprehensive haplotype analysis was also conducted. The haplotype structure of each region within each breed mirrored the LD estimates. The LD of cat breeds largely reflects the breeds' population history and breeding strategies. Understanding LD in diverse populations will contribute to an efficient use of the newly developed SNP array for the cat in the design of genome-wide association studies, as well as to the interpretation of results for the fine mapping of disease and phenotypic traits.
Polymorphisms of pesticide-metabolizing genes in children living in intensive farming communities.
Gómez-Martín, Antonio; Hernández, Antonio F; Martínez-González, Luis Javier; González-Alzaga, Beatriz; Rodríguez-Barranco, Miguel; López-Flores, Inmaculada; Aguilar-Garduno, Clemente; Lacasana, Marina
2015-11-01
Polymorphisms in genes encoding xenobiotic-metabolizing enzymes (XME) are important parameters accounting for the wide inter-individual variability to environmental exposures. Paraoxonase-1 (PON1), butyrylcholinesterase (BChE) and Cytochrome-P450 constitute major classes of XME involved in the detoxification of pesticide chemicals, in particular organophosphates. This study explored the allelic frequency, linkage disequilibrium and haplotype analysis of ten common polymorphic variants of seven key genes involved in organophosphate metabolism (BCHE-K, BCHE-A, PON1 Q192R, PON1 L55M, PON1 -108C/T, CYP2C19 G681A, CYP2D6 G1846A, CYP3AP1 -44G/A, GSTM1∗0 and GSTT1∗0) in a children population living near an intensive agriculture area in Spain. It was hypothesized that individuals with unfavorable combinations of gene variants will be more susceptible to adverse effects from organophosphate exposure. Genomic DNA from 496 healthy children was isolated and amplified by PCR. Hydrolysis probes were used for the detection of eight specific SNPs and two copy number variants (CNVs) by using TaqMan® Assay-based real-time PCR. Frequencies of SNPs and CNVs in the target genes were in Hardy-Weinberg equilibrium and broadly consistent with European populations. Linkage disequilibrium was found between the three PON1 genetic polymorphisms studied and between BCHE-K and BCHE-A. The adverse genotype combination (unusual BCHE variants, PON1 55MM/-108TT and null genotype for both GSTM1 and GSTT1) potentially conferring a greater genetic risk from exposure to organophosphates was observed in 0.2% of our study population. This information allows broadening our knowledge about differential susceptibility toward environmental toxicants and may be helpful for further research to understand the inter-individual toxicokinetic variability in response to organophosphate pesticides exposure. Copyright © 2015 Elsevier Ltd. All rights reserved.
Takeshima, Shin-Nosuke; Sasaki, Shinji; Meripet, Polat; Sugimoto, Yoshikazu; Aida, Yoko
2017-04-04
Bovine leukemia virus (BLV) is the causative agent of enzootic bovine leukosis, a malignant B cell lymphoma that has spread worldwide and causes serious problems for the cattle industry. The BLV proviral load, which represents the BLV genome integrated into host genome, is a useful index for estimating disease progression and transmission risk. Here, we conducted a genome-wide association study to identify single nucleotide polymorphisms (SNPs) associated with BLV proviral load in Japanese Black cattle. The study examined 93 cattle with a high proviral load and 266 with a low proviral load. Three SNPs showed a significant association with proviral load. One SNP was detected in the CNTN3 gene on chromosome 22, and two (which were not in linkage disequilibrium) were detected in the bovine major histocompatibility complex region on chromosome 23. These results suggest that polymorphisms in the major histocompatibility complex region affect proviral load. This is the first report to detect SNPs associated with BLV proviral load in Japanese Black cattle using whole genome association study, and understanding host factors may provide important clues for controlling the spread of BLV in Japanese Black cattle.
Hill, W D; Davies, G; Harris, S E; Hagenaars, S P; Liewald, D C; Penke, L; Gale, C R; Deary, I J
2016-12-13
Differences in general cognitive function have been shown to be partly heritable and to show genetic correlations with several psychiatric and physical disease states. However, to date, few single-nucleotide polymorphisms (SNPs) have demonstrated genome-wide significance, hampering efforts aimed at determining which genetic variants are most important for cognitive function and which regions drive the genetic associations between cognitive function and disease states. Here, we combine multiple large genome-wide association study (GWAS) data sets, from the CHARGE cognitive consortium (n=53 949) and UK Biobank (n=36 035), to partition the genome into 52 functional annotations and an additional 10 annotations describing tissue-specific histone marks. Using stratified linkage disequilibrium score regression we show that, in two measures of cognitive function, SNPs associated with cognitive function cluster in regions of the genome that are under evolutionary negative selective pressure. These conserved regions contained ~2.6% of the SNPs from each GWAS but accounted for ~40% of the SNP-based heritability. The results suggest that the search for causal variants associated with cognitive function, and those variants that exert a pleiotropic effect between cognitive function and health, will be facilitated by examining these enriched regions.
Hill, W D; Davies, G; Harris, S E; Hagenaars, S P; Davies, Gail; Deary, Ian J; Debette, Stephanie; Verbaas, Carla I; Bressler, Jan; Schuur, Maaike; Smith, Albert V; Bis, Joshua C; Bennett, David A; Ikram, M Arfan; Launer, Lenore J; Fitzpatrick, Annette L; Seshadri, Sudha; van Duijn, Cornelia M; Mosley Jr, Thomas H; Liewald, D C; Penke, L; Gale, C R; Deary, I J
2016-01-01
Differences in general cognitive function have been shown to be partly heritable and to show genetic correlations with several psychiatric and physical disease states. However, to date, few single-nucleotide polymorphisms (SNPs) have demonstrated genome-wide significance, hampering efforts aimed at determining which genetic variants are most important for cognitive function and which regions drive the genetic associations between cognitive function and disease states. Here, we combine multiple large genome-wide association study (GWAS) data sets, from the CHARGE cognitive consortium (n=53 949) and UK Biobank (n=36 035), to partition the genome into 52 functional annotations and an additional 10 annotations describing tissue-specific histone marks. Using stratified linkage disequilibrium score regression we show that, in two measures of cognitive function, SNPs associated with cognitive function cluster in regions of the genome that are under evolutionary negative selective pressure. These conserved regions contained ~2.6% of the SNPs from each GWAS but accounted for ~40% of the SNP-based heritability. The results suggest that the search for causal variants associated with cognitive function, and those variants that exert a pleiotropic effect between cognitive function and health, will be facilitated by examining these enriched regions. PMID:27959336
The α‐synuclein gene in multiple system atrophy
Ozawa, T; Healy, D G; Abou‐Sleiman, P M; Ahmadi, K R; Quinn, N; Lees, A J; Shaw, K; Wullner, U; Berciano, J; Moller, J C; Kamm, C; Burk, K; Josephs, K A; Barone, P; Tolosa, E; Goldstein, D B; Wenning, G; Geser, F; Holton, J L; Gasser, T; Revesz, T; Wood, N W
2006-01-01
Background The formation of α‐synuclein aggregates may be a critical event in the pathogenesis of multiple system atrophy (MSA). However, the role of this gene in the aetiology of MSA is unknown and untested. Method The linkage disequilibrium (LD) structure of the α‐synuclein gene was established and LD patterns were used to identify a set of tagging single nucleotide polymorphisms (SNPs) that represent 95% of the haplotype diversity across the entire gene. The effect of polymorphisms on the pathological expression of MSA in pathologically confirmed cases was also evaluated. Results and conclusion In 253 Gilman probable or definite MSA patients, 457 possible, probable, and definite MSA cases and 1472 controls, a frequency difference for the individual tagging SNPs or tag‐defined haplotypes was not detected. No effect was observed of polymorphisms on the pathological expression of MSA in pathologically confirmed cases. PMID:16543523
Population Structure of Hispanics in the United States: The Multi-Ethnic Study of Atherosclerosis
Manichaikul, Ani; Palmas, Walter; Rodriguez, Carlos J.; Peralta, Carmen A.; Divers, Jasmin; Guo, Xiuqing; Chen, Wei-Min; Wong, Quenna; Williams, Kayleen; Kerr, Kathleen F.; Taylor, Kent D.; Tsai, Michael Y.; Goodarzi, Mark O.; Sale, Michèle M.; Diez-Roux, Ana V.; Rich, Stephen S.; Rotter, Jerome I.; Mychaleckyj, Josyf C.
2012-01-01
Using ∼60,000 SNPs selected for minimal linkage disequilibrium, we perform population structure analysis of 1,374 unrelated Hispanic individuals from the Multi-Ethnic Study of Atherosclerosis (MESA), with self-identification corresponding to Central America (n = 93), Cuba (n = 50), the Dominican Republic (n = 203), Mexico (n = 708), Puerto Rico (n = 192), and South America (n = 111). By projection of principal components (PCs) of ancestry to samples from the HapMap phase III and the Human Genome Diversity Panel (HGDP), we show the first two PCs quantify the Caucasian, African, and Native American origins, while the third and fourth PCs bring out an axis that aligns with known South-to-North geographic location of HGDP Native American samples and further separates MESA Mexican versus Central/South American samples along the same axis. Using k-means clustering computed from the first four PCs, we define four subgroups of the MESA Hispanic cohort that show close agreement with self-identification, labeling the clusters as primarily Dominican/Cuban, Mexican, Central/South American, and Puerto Rican. To demonstrate our recommendations for genetic analysis in the MESA Hispanic cohort, we present pooled and stratified association analysis of triglycerides for selected SNPs in the LPL and TRIB1 gene regions, previously reported in GWAS of triglycerides in Caucasians but as yet unconfirmed in Hispanic populations. We report statistically significant evidence for genetic association in both genes, and we further demonstrate the importance of considering population substructure and genetic heterogeneity in genetic association studies performed in the United States Hispanic population. PMID:22511882
Huson, Heather J; vonHoldt, Bridgett M; Rimbault, Maud; Byers, Alexandra M; Runstadler, Jonathan A; Parker, Heidi G; Ostrander, Elaine A
2012-02-01
Alaskan sled dogs are a genetically distinct population shaped by generations of selective interbreeding with purebred dogs to create a group of high-performance athletes. As a result of selective breeding strategies, sled dogs present a unique opportunity to employ admixture-mapping techniques to investigate how breed composition and trait selection impact genomic structure. We used admixture mapping to investigate genetic ancestry across the genomes of two classes of sled dogs, sprint and long-distance racers, and combined that with genome-wide association studies (GWAS) to identify regions that correlate with performance-enhancing traits. The sled dog genome is enhanced by differential contributions from four non-admixed breeds (Alaskan Malamute, Siberian Husky, German Shorthaired Pointer, and Borzoi). A principal components analysis (PCA) of 115,000 genome-wide SNPs clearly resolved the sprint and distance populations as distinct genetic groups, with longer blocks of linkage disequilibrium (LD) observed in the distance versus sprint dogs (7.5-10 and 2.5-3.75 kb, respectively). Furthermore, we identified eight regions with the genomic signal from either a selective sweep or an association analysis, corroborated by an excess of ancestry when comparing sprint and distance dogs. A comparison of elite and poor-performing sled dogs identified a single region significantly associated with heat tolerance. Within the region we identified seven SNPs within the myosin heavy chain 9 gene (MYH9) that were significantly associated with heat tolerance in sprint dogs, two of which correspond to conserved promoter and enhancer regions in the human ortholog.
Common Variants in the MKL1 Gene Confer Risk of Schizophrenia
Luo, Xiong-jian; Huang, Liang; van den Oord, Edwin J.; Aberg, Karolina A.; Gan, Lin; Zhao, Zhongming; Yao, Yong-Gang
2015-01-01
Genome-wide association studies (GWAS) of schizophrenia have identified multiple risk variants with robust association signals for schizophrenia. However, these variants could explain only a small proportion of schizophrenia heritability. Furthermore, the effect size of these risk variants is relatively small (eg, most of them had an OR less than 1.2), suggesting that additional risk variants may be detected when increasing sample size in analysis. Here, we report the identification of a genome-wide significant schizophrenia risk locus at 22q13.1 by combining 2 large-scale schizophrenia cohort studies. Our meta-analysis revealed that 7 single nucleotide polymorphism (SNPs) on chromosome 22q13.1 reached the genome-wide significance level (P < 5.0×10–8) in the combined samples (a total of 38441 individuals). Among them, SNP rs6001946 had the most significant association with schizophrenia (P = 2.04×10–8). Interestingly, all 7 SNPs are in high linkage disequilibrium and located in the MKL1 gene. Expression analysis showed that MKL1 is highly expressed in human and mouse brains. We further investigated functional links between MKL1 and proteins encoded by other schizophrenia susceptibility genes in the whole human protein interaction network. We found that MKL1 physically interacts with GSK3B, a protein encoded by a well-characterized schizophrenia susceptibility gene. Collectively, our results revealed that genetic variants in MKL1 might confer risk to schizophrenia. Further investigation of the roles of MKL1 in the pathogenesis of schizophrenia is warranted. PMID:25380769
van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul
2017-08-07
Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.
PAX6 Haplotypes Are Associated with High Myopia in Han Chinese
Jiang, Bo; Yap, Maurice K. H.; Leung, Kim Hung; Ng, Po Wah; Fung, Wai Yan; Lam, Wai Wa; Gu, Yang-shun; Yip, Shea Ping
2011-01-01
Background The paired box 6 (PAX6) gene is considered as a master gene for eye development. Linkage of myopia to the PAX6 region on chromosome 11p13 was shown in several studies, but the results for association between myopia and PAX6 were inconsistent so far. Methodology/Principal Findings We genotyped 16 single nucleotide polymorphisms (SNPs) in the PAX6 gene and its regulatory regions in an initial study for 300 high myopia cases and 300 controls (Group 1), and successfully replicated the positive results with another independent group of 299 high myopia cases and 299 controls (Group 2). Five SNPs were genotyped in the replication study. The spherical equivalent of subjects with high myopia was ≤−8.0 dioptres. The PLINK package was used for genetic data analysis. No association was found between each of the SNPs and high myopia. However, exhaustive sliding-window haplotype analysis highlighted an important role for rs12421026 because haplotypes containing this SNP were found to be associated with high myopia. The most significant results were given by the 4-SNP haplotype window consisting of rs2071754, rs3026393, rs1506 and rs12421026 (P = 3.54×10−10, 4.06×10−11 and 1.56×10−18 for Group 1, Group 2 and Combined Group, respectively) and the 3-SNP haplotype window composed of rs3026393, rs1506 and rs12421026 (P = 5.48×10−10, 7.93×10−12 and 6.28×10−23 for the three respective groups). The results remained significant after correction for multiple comparisons by permutations. The associated haplotyes found in a previous study were also successfully replicated in this study. Conclusions/Significance PAX6 haplotypes are associated with susceptibility to the development of high myopia in Chinese. The PAX6 locus plays a role in high myopia. PMID:21589860
Voruganti, V Saroja; Higgins, Paul B; Ebbesson, Sven O E; Kennish, John; Göring, Harald H H; Haack, Karin; Laston, Sandra; Drigalenko, Eugene; Wenger, Charlotte R; Harris, William S; Fabsitz, Richard R; Devereux, Richard B; Maccluer, Jean W; Curran, Joanne E; Carless, Melanie A; Johnson, Matthew P; Moses, Eric K; Blangero, John; Umans, Jason G; Howard, Barbara V; Cole, Shelley A; Comuzzie, Anthony Gean
2012-01-01
The delta-5 and delta-6 desaturases (D5D and D6D), encoded by fatty acid desaturase 1 (FADS1) and 2 (FADS2) genes, respectively, are rate-limiting enzymes in the metabolism of ω-3 and ω-6 fatty acids. The objective of this study was to identify genes influencing variation in estimated D5D and D6D activities in plasma and erythrocytes in Alaskan Eskimos (n = 761) participating in the genetics of coronary artery disease in Alaska Natives (GOCADAN) study. Desaturase activity was estimated by product: precursor ratio of polyunsaturated fatty acids. We found evidence of linkage for estimated erythrocyte D5D (eD5D) on chromosome 11q12-q13 (logarithm of odds score = 3.5). The confidence interval contains candidate genes FADS1, FADS2, 7-dehydrocholesterol reductase (DHCR7), and carnitine palmitoyl transferase 1A, liver (CPT1A). Measured genotype analysis found association between CPT1A, FADS1, and FADS2 single-nucleotide polymorphisms (SNPs) and estimated eD5D activity (p-values between 10(-28) and 10(-5)). A Bayesian quantitative trait nucleotide analysis showed that rs3019594 in CPT1A, rs174541 in FADS1, and rs174568 in FADS2 had posterior probabilities > 0.8, thereby demonstrating significant statistical support for a functional effect on eD5D activity. Highly significant associations of FADS1, FADS2, and CPT1A transcripts with their respective SNPs (p-values between 10(-75) and 10(-7)) in Mexican Americans of the San Antonio Family Heart Study corroborated our results. These findings strongly suggest a functional role for FADS1, FADS2, and CPT1A SNPs in the variation in eD5D activity.
Provenzani, Alessio; Notarbartolo, Monica; Labbozzetta, Manuela; Poma, Paola; Biondi, Filippo; Sanguedolce, Rosario; Vizzini, Giovanni; Palazzo, Ugo; Polidori, Piera; Triolo, Fabio; Gridelli, Bruno; D'Alessandro, Natale
2009-01-01
Tacrolimus is a substrate of cytochrome P-450 (CYP) 3A enzyme and of the drug transporter ABCB1. We have investigated the effects of possible relevant CYP3A5 and ABCB1 single nucleotide polymorphisms (SNPs) present in both donors and recipients on tacrolimus blood levels achieved in a population of 32 Caucasian liver transplant patients. At 1, 3 and 6 months after transplantation, tacrolimus doses (mg/kg/day) and trough blood levels (C(0)) were determined. Polymerase chain reaction followed by restriction fragment length polymorphism analysis was used for genotyping CYP3A5*3 [6986A>G] as well as ABCB1 at exons 21 [2677G>T] and 26 [3435C>T]. 87.5% of the population showed a CYP3A5*3/*3 genotype. For the ABCB1 SNPs, in the case of 3435C>T the total frequency observed for the allelic variant was 50%. For the 2677G>T, the total frequency of the allelic variant was 12.5%, lower than in other Caucasian populations and without any significant linkage with 3435C>T. At 3 and 6 months after transplantation, tacrolimus dose requirements were significantly higher in patients receiving a liver with one copy of the *1 allele compared to those homozygous for the *3 allele (0.111+/-0.057 vs. 0.057+/-0.030 [P<0.05] at 3 month and 0.086+/-0.051 vs. 0.044+/-0.025 [P<0.05] at 6 month). For the recipients' genotypes, the presence of at least one *1 copy tended, though not statistically significantly, to increase tacrolimus doses. With regard to the ABCB1 SNPs, they did not show any influence on tacrolimus dosing requirements. Pharmacogenetic analysis of CYP3A5 in the donor could contribute to determine the appropriate initial dosage of tacrolimus in liver transplant patients.
Norton, Gareth J.; Douglas, Alex; Lahner, Brett; Yakubova, Elena; Guerinot, Mary Lou; Pinson, Shannon R. M.; Tarpley, Lee; Eizenga, Georgia C.; McGrath, Steve P.; Zhao, Fang-Jie; Islam, M. Rafiqul; Islam, Shofiqul; Duan, Guilan; Zhu, Yongguan; Salt, David E.; Meharg, Andrew A.; Price, Adam H.
2014-01-01
The mineral concentrations in cereals are important for human health, especially for individuals who consume a cereal subsistence diet. A number of elements, such as zinc, are required within the diet, while some elements are toxic to humans, for example arsenic. In this study we carry out genome-wide association (GWA) mapping of grain concentrations of arsenic, copper, molybdenum and zinc in brown rice using an established rice diversity panel of ∼300 accessions and 36.9 k single nucleotide polymorphisms (SNPs). The study was performed across five environments: one field site in Bangladesh, one in China and two in the US, with one of the US sites repeated over two years. GWA mapping on the whole dataset and on separate subpopulations of rice revealed a large number of loci significantly associated with variation in grain arsenic, copper, molybdenum and zinc. Seventeen of these loci were detected in data obtained from grain cultivated in more than one field location, and six co-localise with previously identified quantitative trait loci. Additionally, a number of candidate genes for the uptake or transport of these elements were located near significantly associated SNPs (within 200 kb, the estimated global linkage disequilibrium previously employed in this rice panel). This analysis highlights a number of genomic regions and candidate genes for further analysis as well as the challenges faced when mapping environmentally-variable traits in a highly genetically structured diversity panel. PMID:24586963
Apolipoprotein A-V: a potential modulator of plasma triglyceride levels in Turks.
Hodoglugil, Ugur; Tanyolaç, Sinan; Williamson, David W; Huang, Yadong; Mahley, Robert W
2006-01-01
The apolipoprotein A-V gene (APOA5) plays an important role in determining plasma triglyceride levels. We studied the effects of APOA5 polymorphisms on plasma triglyceride levels in Turks, a population with low levels of HDL cholesterol and a high prevalence of coronary artery disease. We found 15 polymorphisms, three of which were novel. Seven haplotype-tagging single nucleotide polymorphisms (SNPs) were chosen and genotyped in approximately 3,000 subjects. The rare alleles of the -1464T>C, -1131T>C, S19W, and 1259T>C SNPs were significantly associated with increased triglyceride levels (19-86 mg/dl; P < 0.05) and had clear gene-dose effects. Haplotype analysis of the nine common APOA5 haplotypes revealed significant effects on triglyceride levels (P < 0.001). Detailed analysis of haplotypes clearly showed that the -1464T>C polymorphism had no effect by itself but was a marker for the -1131T>C, S19W, and 1259T>C polymorphisms. The -1131T>C and 1259T>C polymorphisms were in a strong but incomplete linkage disequilibrium and appeared to have independent effects. Thus, the APOA5 -1131T>C, S19W, and 1259T>C rare alleles were associated with significant increases in plasma triglyceride levels. At least one of these alleles was present in approximately 40% of the Turks. Similar associations were observed for -1131T>C and S19W in white Americans living in San Francisco, California.
Genetic dissection of host immune response in pneumonia development and progression
Smelaya, Tamara V.; Belopolskaya, Olesya B.; Smirnova, Svetlana V.; Kuzovlev, Artem N.; Moroz, Viktor V.; Golubev, Arkadiy M.; Pabalan, Noel A.; Salnikova, Lyubov E.
2016-01-01
The role of host genetic variation in pneumonia development and outcome is poorly understood. We studied common polymorphisms in the genes of proinflammatory cytokines (IL6 rs1800795, IL8 rs4073, IL1B rs16944), anti-inflammatory cytokines (IL10 rs1800896, IL4 rs2243250, IL13 rs20541) and toll-like receptors (TLR2 rs5743708 and rs4696480, TLR4 rs4986791, TLR9 rs352139, rs5743836 and rs187084) in patients with community-acquired pneumonia (CAP) (390 cases, 203 controls) and nosocomial pneumonia (355 cases, 216 controls). Experimental data were included in a series of 11 meta-analyses and eight subset analyses related to pneumonia susceptibility and outcome. TLR2 rs5743708 minor genotype appeared to be associated with CAP/Legionnaires’ disease/pneumococcal disease. In CAP patients, the IL6 rs1800795-C allele was associated with severe sepsis/septic shock/severe systemic inflammatory response, while the IL10 rs1800896-A allele protected against the development of these critical conditions. To contribute to deciphering of the above results, we performed an in silico analysis and a qualitative synthesis of literature data addressing basal and stimulated genotype-specific expression level. This data together with database information on transcription factors’ affinity changes caused by SNPs in putative promoter regions, the results of linkage disequilibrium analysis along with SNPs functional annotations supported assumptions about the complexity underlying the revealed associations. PMID:27725770
Morgan, A R; Turic, D; Jehu, L; Hamilton, G; Hollingworth, P; Moskvina, V; Jones, L; Lovestone, S; Brayne, C; Rubinsztein, D C; Lawlor, B; Gill, M; O'Donovan, M C; Owen, M J; Williams, J
2007-09-05
Late-onset Alzheimer's disease (LOAD) is a common neurodegenerative disorder, with a complex etiology. APOE is the only confirmed susceptibility gene for LOAD. Others remain yet to be found. Evidence from linkage studies suggests that a gene (or genes) conferring susceptibility for LOAD resides on chromosome 10. We studied 23 positional/functional candidate genes from our linkage region on chromosome 10 (APBB1IP, ALOX5, AD037, SLC18A3, DKK1, ZWINT, ANK3, UBE2D1, CDC2, SIRT1, JDP1, NET7, SUPV3L1, NEN3, SAR1, SGPL1, SEC24C, CAMK2G, PP3CB, SNCG, CH25H, PLCE1, ANXV111) in the MRC genetic resource for LOAD. These candidates were screened for sequence polymorphisms in a sample of 14 LOAD subjects and detected polymorphisms tested for association with LOAD in a three-stage design involving two stages of genotyping pooled DNA samples followed by a third stage in which markers showing evidence for association in the first stages were subjected to individual genotyping. One hundred and twenty polymorphisms were identified and tested in stage 1 (4 case + 4 control pools totaling 366 case and 366 control individuals). Single nucleotide polymorphisms (SNPs) showing evidence of association with LOAD were then studied in stage 2 (8 case + 4 control pools totaling 1,001 case and 1,001 control individuals). Five SNPs, in four genes, showed evidence for association (P < 0.1) at stage 2 and were individually genotyped in the complete dataset, comprising 1,160 LOAD cases and 1,389 normal controls. Two SNPs in SGPL1 demonstrated marginal evidence of association, with uncorrected P values of 0.042 and 0.056, suggesting that variation in SGPL1 may confer susceptibility to LOAD. Copyright 2007 Wiley-Liss, Inc.
Schoeps, Anja; Rudolph, Anja; Seibold, Petra; Dunning, Alison M.; Milne, Roger L.; Bojesen, Stig E.; Swerdlow, Anthony; Andrulis, Irene; Brenner, Hermann; Behrens, Sabine; Orr, Nicholas; Jones, Michael; Ashworth, Alan; Li, Jingmei; Cramp, Helen; Connley, Dan; Czene, Kamila; Darabi, Hatef; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Knight, Julia; Glendon, Gord; Mulligan, Anna M.; Dumont, Martine; Severi, Gianluca; Baglietto, Laura; Olson, Janet; Vachon, Celine; Purrington, Kristen; Moisse, Matthieu; Neven, Patrick; Wildiers, Hans; Spurdle, Amanda; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana M.; Hamann, Ute; Ko, Yon-Dschun; Dieffenbach, Aida K.; Arndt, Volker; Stegmaier, Christa; Malats, Núria; Arias Perez, JoséI.; Benítez, Javier; Flyger, Henrik; Nordestgaard, Børge G.; Truong, Théresè; Cordina-Duverger, Emilie; Menegaux, Florence; Silva, Isabel dos Santos; Fletcher, Olivia; Johnson, Nichola; Häberle, Lothar; Beckmann, Matthias W.; Ekici, Arif B.; Braaf, Linde; Atsma, Femke; van den Broek, Alexandra J.; Makalic, Enes; Schmidt, Daniel F.; Southey, Melissa C.; Cox, Angela; Simard, Jacques; Giles, Graham G.; Lambrechts, Diether; Mannermaa, Arto; Brauch, Hiltrud; Guénel, Pascal; Peto, Julian; Fasching, Peter A.; Hopper, John; Flesch-Janys, Dieter; Couch, Fergus; Chenevix-Trench, Georgia; Pharoah, Paul D. P.; Garcia-Closas, Montserrat; Schmidt, Marjanka K.; Hall, Per; Easton, Douglas F.; Chang-Claude, Jenny
2014-01-01
Genes that alter disease risk only in combination with certain environmental exposures may not be detected in genetic association analysis. By using methods accounting for gene-environment (G × E) interaction, we aimed to identify novel genetic loci associated with breast cancer risk. Up to 34,475 cases and 34,786 controls of European ancestry from up to 23 studies in the Breast Cancer Association Consortium were included. Overall, 71,527 single nucleotide polymorphisms (SNPs), enriched for association with breast cancer, were tested for interaction with 10 environmental risk factors using three recently proposed hybrid methods and a joint test of association and interaction. Analyses were adjusted for age, study, population stratification, and confounding factors as applicable. Three SNPs in two independent loci showed statistically significant association: SNPs rs10483028 and rs2242714 in perfect linkage disequilibrium on chromosome 21 and rs12197388 in ARID1B on chromosome 6. While rs12197388 was identified using the joint test with parity and with age at menarche (P-values = 3 × 10−07), the variants on chromosome 21 q22.12, which showed interaction with adult body mass index (BMI) in 8,891 postmenopausal women, were identified by all methods applied. SNP rs10483028 was associated with breast cancer in women with a BMI below 25 kg/m2 (OR = 1.26, 95% CI 1.15–1.38) but not in women with a BMI of 30 kg/m2 or higher (OR = 0.89, 95% CI 0.72–1.11, P for interaction = 3.2 × 10−05). Our findings confirm comparable power of the recent methods for detecting G × E interaction and the utility of using G × E interaction analyses to identify new susceptibility loci. PMID:24248812
Association of TUSC1 and DPF3 gene polymorphisms with male infertility.
Sato, Youichi; Hasegawa, Chise; Tajima, Atsushi; Nozawa, Shiari; Yoshiike, Miki; Koh, Eitetsue; Kanaya, Jiro; Namiki, Mikio; Matsumiya, Kiyomi; Tsujimura, Akira; Komatsu, Kiyoshi; Itoh, Naoki; Eguchi, Jiro; Yamauchi, Aiko; Iwamoto, Teruaki
2018-02-01
Recently, genome-wide association studies of a Hutterite population in the USA revealed that five single nucleotide polymorphisms (SNPs) with a significant association with sperm quality and/or function in ethnically diverse men from Chicago were significantly correlated with family size. Of these, three SNPs (rs7867029, rs7174015, and rs12870438) were found to be significantly associated with the risk of azoospermia and/or oligozoospermia in a Japanese population. In this study, we investigated whether the rs10966811 (located in an intergenic region between the TUSC1 and IZUMO3 genes) and rs10129954 (located in the DPF3 gene) SNPs, previously related to family size, are associated with male infertility. In addition, we performed association analysis between rs12348 in TUSC1 and rs2772579 in IZUMO3 and male infertility. We genotyped 145 patients with infertility (including 83 patients with azoospermia and 62 with oligozoospermia) and 713 fertile controls by PCR-RFLP technique for polymorphism. Because rs10966811 has no restriction sites, the SNP rs12376894 with strong linkage disequilibrium was selected as an alternative to rs10966811. There was a statistically significant association between rs12376894 proxy SNP of rs10966811 and oligozoospermia. Also, a statistically significant association between rs10129954 and azoospermia, and oligozoospermia was observed. When we assessed the relationship between rs12348 in TUSC1 and rs2772579 in IZUMO3 and male infertility traits, we found that rs12348 in TUSC1 was significantly associated with azoospermia and oligozoospermia, but rs2772579 in IZUMO3 was not associated with male infertility. We found that the polymorphisms in TUSC1 and DPF3 displayed strong associations with male infertility.
Huang, Yong-Zhen; Zhan, Zhao-Yang; Sun, Yu-Jia; Wang, Jing; Li, Ming-Xun; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Chen, Hong
2013-06-01
Muscle growth is a complex phenomenon regulated by many factors, whereby net growth results from the combined action of synthesis and turnover. Insulin-like growth factor 2 (IGF2) is a fetal growth and differentiation factor that plays an important role in muscle growth and in myoblast proliferation and differentiation; Zinc finger, BED-type containing 6 (ZBED6) is a novel transcription factor that was identified and shown to act as a repressor of IGF2 transcription in skeletal muscle. In this study, a total of seven single nucleotide polymorphisms (SNPs) were identified, four SNPs in intron 8 of IGF2 and one promoter SNP and two missense mutations in the coding region of ZBED6, two of which were in complete linkage disequilibrium (LD) in the bovine IGF2. The 58 haplotypes were inferred in 1522 individuals representing four purebred cattle breeds from China. The seven SNPs, 79 and 66 combined diplotypes were revealed for association with body mass in Nanyang and Jiaxian cattle populations at five different ages (P < 0.05 or 0.01). The mutant-type variants and haplotype 58 (likely in LD with the beneficial quantitative trait nucleotide allele) was superior for body mass; the heterozygote diplotype of the most common haplotypes 58 was associated with higher body mass compared to either heterozygote or homozygote. The statistical analyses indicated that the mutant-type variants and haplotypes are significantly associated with body mass in study cattle populations at different ages. These data demonstrate that variants and haplotypes are associated with growth traits, and these results may provide important biological insights into the phenotypic differentiation that is associated with adaptation and specialization of cattle breeds.
Kalantari, Naser; Keshavarz Mohammadi, Nastaran; Izadi, Pantea; Doaei, Saeid; Gholamalizadeh, Maryam; Eini-Zinab, Hassan; Salonurmi, Tuire; Rafieifar, Shahram; Janipoor, Reza; Azizi Tabesh, Ghasem
2018-01-01
Single-nucleotide polymorphisms (SNPs), which are located in the first intron of the FTO gene, are reported to be associated with body weight and the body mass index (BMI). However, their effects on anthropometric measurements in adolescents are poorly understood. This study aimed to investigate the association of three adjacent polymorphisms (rs9930506, rs9930501, & rs9932754) in the FTO gene with anthropometric indices in Iranian adolescent males. The participants comprised a total of 237 adolescent males who were recruited randomly from two high schools in Tehran, Iran. The DNA samples were genotyped for the FTO gene polymorphisms by DNA sequencing. BMI, body fat percentage (BF%), and body muscle percentage (BM%) were determined using a validated bioelectrical impedance analysis scale. The association of the FTO polymorphisms with weight, height, BMI, BF%, and BM% was investigated. A haplotype of rs9930506, rs9930501, and rs9932754 (GGT) in the first intron of the FTO with complete linkage disequilibrium (LD) was found to be significantly associated with higher weight (OR = 1.32), BMI (OR = 5.36) and BF% (OR = 1.46), and lower BM% (OR = 3.59) (all P<0.001). None of the students with GGC genotypes were underweight, while all of the students with AAT genotypes had high muscle mass. A haplotype in the first intron of the FTO gene had a strong association with obesity indices in Iranian adolescent males. The FTO gene polymorphisms might have greater effects on anthropometric indices than what was previously imagined. Moreover, we suggested that the FTO gene exerted their effects on anthropometric measurements through haplotypes (and not single SNPs).
Wang, Yunpeng; Thompson, Wesley K.; Schork, Andrew J.; Holland, Dominic; Chen, Chi-Hua; Bettella, Francesco; Desikan, Rahul S.; Li, Wen; Witoelar, Aree; Zuber, Verena; Devor, Anna; Nöthen, Markus M.; Rietschel, Marcella; Chen, Qiang; Werge, Thomas; Cichon, Sven; Weinberger, Daniel R.; Djurovic, Srdjan; O’Donovan, Michael; Visscher, Peter M.; Andreassen, Ole A.; Dale, Anders M.
2016-01-01
Most of the genetic architecture of schizophrenia (SCZ) has not yet been identified. Here, we apply a novel statistical algorithm called Covariate-Modulated Mixture Modeling (CM3), which incorporates auxiliary information (heterozygosity, total linkage disequilibrium, genomic annotations, pleiotropy) for each single nucleotide polymorphism (SNP) to enable more accurate estimation of replication probabilities, conditional on the observed test statistic (“z-score”) of the SNP. We use a multiple logistic regression on z-scores to combine information from auxiliary information to derive a “relative enrichment score” for each SNP. For each stratum of these relative enrichment scores, we obtain nonparametric estimates of posterior expected test statistics and replication probabilities as a function of discovery z-scores, using a resampling-based approach that repeatedly and randomly partitions meta-analysis sub-studies into training and replication samples. We fit a scale mixture of two Gaussians model to each stratum, obtaining parameter estimates that minimize the sum of squared differences of the scale-mixture model with the stratified nonparametric estimates. We apply this approach to the recent genome-wide association study (GWAS) of SCZ (n = 82,315), obtaining a good fit between the model-based and observed effect sizes and replication probabilities. We observed that SNPs with low enrichment scores replicate with a lower probability than SNPs with high enrichment scores even when both they are genome-wide significant (p < 5x10-8). There were 693 and 219 independent loci with model-based replication rates ≥80% and ≥90%, respectively. Compared to analyses not incorporating relative enrichment scores, CM3 increased out-of-sample yield for SNPs that replicate at a given rate. This demonstrates that replication probabilities can be more accurately estimated using prior enrichment information with CM3. PMID:26808560
Wang, Mengmeng; Xin, Lihong; Cai, Guoqi; Zhang, Xu; Yang, Xiao; Li, Xiaona; Xia, Qing; Wang, Li; Xu, Shengqian; Xu, Jianhua; Shuai, Zongwen; Ding, Changhai; Pan, Faming
2017-01-01
Previous studies have found the association between rs10865331 in 2p15 area and ankylosing spondylitis (AS). This study aimed to identify additional functional genetic variants in 2p15 region associated with AS susceptibility. We used next generation sequencing (NGS) in 100 AS cases and 100 healthy controls to screen AS susceptible genetic variants, and validated these variants in 620 cases and 620 controls by using imLDRTM technique for single nucleotide polymorphism (SNP) genotyping. Totally, we identified 12 SNPs that might confer susceptibility to AS. Of those SNPs, three (rs14170, rs2123111 and rs1729674) were nominally associated (P<0.05) with AS, but were no longer statistically significant after Bonferroni correction. After stratified by gender, another two SNPs (rs11428092 and rs10208769 in USP34) were associated with AS in males but not females, though this was not statistically significant after Bonferroni correction. In addition, rs1729674, rs14170, rs2123111 and rs10208769 were in strong linkage disequilibrium (LD) and were further enrolled in haplotype analysis. A novel haplotype TAGA was found to be associated with a decreased risk of AS (odds ratio (OR) (95% confidence interval (CI)) = 0.832 (0.705-0.982)). Beyond that, we also demonstrated a strong relationship between rs10865331 and AS susceptibility (OR (95% CI) = 1.303(1.111-1.526)). rs14170 and rs2123111 inUSP34 and rs1729674 in C2orf74 may be associated with AS susceptibility in Han Chinese population. USP34 and C2orf74 in 2p15 region may be AS novel susceptibility genes.
Malmberg, M Michelle; Shi, Fan; Spangenberg, German C; Daetwyler, Hans D; Cogan, Noel O I
2018-01-01
Intensive breeding of Brassica napus has resulted in relatively low diversity, such that B. napus would benefit from germplasm improvement schemes that sustain diversity. As such, samples representative of global germplasm pools need to be assessed for existing population structure, diversity and linkage disequilibrium (LD). Complexity reduction genotyping-by-sequencing (GBS) methods, including GBS-transcriptomics (GBS-t), enable cost-effective screening of a large number of samples, while whole genome re-sequencing (WGR) delivers the ability to generate large numbers of unbiased genomic single nucleotide polymorphisms (SNPs), and identify structural variants (SVs). Furthermore, the development of genomic tools based on whole genomes representative of global oilseed diversity and orientated by the reference genome has substantial industry relevance and will be highly beneficial for canola breeding. As recent studies have focused on European and Chinese varieties, a global diversity panel as well as a substantial number of Australian spring types were included in this study. Focusing on industry relevance, 633 varieties were initially genotyped using GBS-t to examine population structure using 61,037 SNPs. Subsequently, 149 samples representative of global diversity were selected for WGR and both data sets used for a side-by-side evaluation of diversity and LD. The WGR data was further used to develop genomic resources consisting of a list of 4,029,750 high-confidence SNPs annotated using SnpEff, and SVs in the form of 10,976 deletions and 2,556 insertions. These resources form the basis of a reliable and repeatable system allowing greater integration between canola genomics studies, with a strong focus on breeding germplasm and industry applicability.
Jia, Jing; Wei, Yi-Liang; Qin, Cui-Jiao; Hu, Lan; Wan, Li-Hua; Li, Cai-Xia
2014-01-01
Inferring the ancestral origin of DNA samples can be helpful in correcting population stratification in disease association studies or guiding crime investigations. Populations throughout the world vary in appearance features and biological characteristics. Based on this idea, we performed a genome-wide scan for SNPs within genes that are related to physical and biological traits. Using the HapMap database, we screened 52 genes and their flanking regions. Thirty-five SNPs that displayed highly contrasting allele frequencies (F(st)>0.3, linkage disequilibrium r(2)<0.2, and Hardy-Weinberg equilibrium P>0.001) among Africans, Europeans, and East Asians were selected and validated. A multiplexed assay was developed to genotype these 35 SNPs in 357 individuals from 10 populations worldwide. This panel provided accurate estimates of individual ancestry proportions with balanced discriminatory power among the three continental ancestries: Africans, Europeans, and East Asians. It also proved very effective in evaluating admixed populations living in joint regions of continents (e.g., Uyghurs and Indians) and discriminating some subpopulations within each of the three continents. Structure analysis was performed to establish and evaluate the panel of ancestry-informative markers, and the components of each population were also described to indicate the structural composition. The 21 population structures in our study are consistent with geographic patterns, and individuals were properly assigned to their original ancestral populations with proportion analyses and random match probability calculations. Thus, the panel and its population information will be useful resources to minimize the effects of population stratification in association analyses and to assign the most likely origin of an unknown DNA contributor in forensic investigations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Broekgaarden, Colette; Bucher, Johan; Bac-Molenaar, Johanna; Keurentjes, Joost J. B.; Kruijer, Willem; Voorrips, Roeland E.; Vosman, Ben
2015-01-01
Plants have evolved a variety of ways to defend themselves against biotic attackers. This has resulted in the presence of substantial variation in defense mechanisms among plants, even within a species. Genome-wide association (GWA) mapping is a useful tool to study the genetic architecture of traits, but has so far only had limited exploitation in studies of plant defense. Here, we study the genetic architecture of defense against the phloem-feeding insect cabbage whitefly (Aleyrodes proletella) in Arabidopsis thaliana. We determined whitefly performance, i.e. the survival and reproduction of whitefly females, on 360 worldwide selected natural accessions and subsequently performed GWA mapping using 214,051 SNPs. Substantial variation for whitefly adult survival and oviposition rate (number of eggs laid per female per day) was observed between the accessions. We identified 39 candidate SNPs for either whitefly adult survival or oviposition rate, all with relatively small effects, underpinning the complex architecture of defense traits. Among the corresponding candidate genes, i.e. genes in linkage disequilibrium (LD) with candidate SNPs, none have previously been identified as a gene playing a role in the interaction between plants and phloem-feeding insects. Whitefly performance on knock-out mutants of a number of candidate genes was significantly affected, validating the potential of GWA mapping for novel gene discovery in plant-insect interactions. Our results show that GWA analysis is a very useful tool to gain insight into the genetic architecture of plant defense against herbivorous insects, i.e. we identified and validated several genes affecting whitefly performance that have not previously been related to plant defense against herbivorous insects. PMID:26699853
Empirical Distributions of F ST from Large-Scale Human Polymorphism Data
Elhaik, Eran
2012-01-01
Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s F ST that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-F ST may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchically F ST analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even a lesser genetic variation (1%) is found between intra-continental populations. Second, the global F ST distribution closely follows an exponential distribution. Third, although the overall F ST distribution is similarly shaped (inverse J), F ST distributions varies markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-F ST of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F ST distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection. PMID:23185452
Dai, Yaoyao; Zhang, Xia; Pan, Hongqiu; Tang, Shaowen; Shen, Hongbing; Wang, Jianming
2011-10-22
Recently, one genome-wide association study identified a susceptibility locus of rs4331426 on chromosome 18q11.2 for tuberculosis in the African population. To validate the significance of this susceptibility locus in other areas, we conducted a case-control study in the Chinese population. The present study consisted of 578 cases and 756 controls. The SNP rs4331426 and other six tag SNPs in the 100 Kbp up and down stream of rs4331426 on chromosome 18q11.2 were genotyped by using the Taqman-based allelic discrimination system. As compared with the findings from the African population, genetic variation of the SNP rs4331426 was rare among the Chinese. No significant differences were observed in genotypes or allele frequencies of the tag SNPs between cases and controls either before or after adjusting for age, sex, education, smoking, and drinking history. However, we observed strong linkage disequilibrium of SNPs. Constructed haplotypes within this block were linked the altered risks of tuberculosis. For example, in comparison with the common haplotype AA(rs8087945-rs12456774), haplotypes AG(rs8087945-rs12456774) and GA(rs8087945-rs12456774) were associated with a decreased risk of tuberculosis, with the adjusted odds ratio(95% confidence interval) of 0.34(0.27-0.42) and 0.22(0.16-0.29), respectively. Susceptibility locus of rs4331426 discovered in the African population could not be validated in the Chinese population. None of genetic polymorphisms we genotyped were related to tuberculosis in the single-point analysis. However, haplotypes on chromosome 18q11.2 might contribute to an individual's susceptibility. More work is necessary to identify the true causative variants of tuberculosis.
Li, M-H; Tiirikka, T; Kantanen, J
2014-01-01
In sheep, coat colour (and pattern) is one of the important traits of great biological, economic and social importance. However, the genetics of sheep coat colour has not yet been fully clarified. We conducted a genome-wide association study of sheep coat colours by genotyping 47 303 single-nucleotide polymorphisms (SNPs) in the Finnsheep population in Finland. We identified 35 SNPs associated with all the coat colours studied, which cover genomic regions encompassing three known pigmentation genes (TYRP1, ASIP and MITF) in sheep. Eighteen of these associations were confirmed in further tests between white versus non-white individuals, but none of the 35 associations were significant in the analysis of only non-white colours. Across the tests, the s66432.1 in ASIP showed significant association (P=4.2 × 10−11 for all the colours; P=2.3 × 10−11 for white versus non-white colours) with the variation in coat colours and strong linkage disequilibrium with other significant variants surrounding the ASIP gene. The signals detected around the ASIP gene were explained by differences in white versus non-white alleles. Further, a genome scan for selection for white coat pigmentation identified a strong and striking selection signal spanning ASIP. Our study identified the main candidate gene for the coat colour variation between white and non-white as ASIP, an autosomal gene that has been directly implicated in the pathway regulating melanogenesis. Together with ASIP, the two other newly identified genes (TYRP1 and MITF) in the Finnsheep, bordering associated SNPs, represent a new resource for enriching sheep coat-colour genetics and breeding. PMID:24022497
Empirical distributions of F(ST) from large-scale human polymorphism data.
Elhaik, Eran
2012-01-01
Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright's F(ST) that apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high-F(ST) may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchically F(ST) analysis showed that only a paucity (12%) of the total genetic variation is distributed between continental populations and even a lesser genetic variation (1%) is found between intra-continental populations. Second, the global F(ST) distribution closely follows an exponential distribution. Third, although the overall F(ST) distribution is similarly shaped (inverse J), F(ST) distributions varies markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean-F(ST) of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F(ST) distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection.
Rice, Treva K; Sarzynski, Mark A; Sung, Yun Ju; Argyropoulos, George; Stütz, Adrian M; Teran-Garcia, Margarita; Rao, D C; Bouchard, Claude; Rankinen, Tuomo
2012-08-01
Although regular exercise improves submaximal aerobic capacity, there is large variability in its response to exercise training. While this variation is thought to be partly due to genetic differences, relatively little is known about the causal genes. Submaximal aerobic capacity traits in the current report include the responses of oxygen consumption (ΔVO(2)60), power output (ΔWORK60), and cardiac output (ΔQ60) at 60% of VO2max to a standardized 20-week endurance exercise training program. Genome-wide linkage analysis in 475 HERITAGE Family Study Caucasians identified a locus on chromosome 13q for ΔVO(2)60 (LOD = 3.11). Follow-up fine mapping involved a dense marker panel of over 1,800 single-nucleotide polymorphisms (SNPs) in a 7.9-Mb region (21.1-29.1 Mb from p-terminus). Single-SNP analyses found 14 SNPs moderately associated with both ΔVO(2)60 at P ≤ 0.005 and the correlated traits of ΔWORK60 and ΔQ60 at P < 0.05. Haplotype analyses provided several strong signals (P < 1.0 × 10(-5)) for ΔVO(2)60. Overall, association analyses narrowed the target region and included potential biological candidate genes (MIPEP and SGCG). Consistent with maximal heritability estimates of 23%, up to 20% of the phenotypic variance in ΔVO(2)60 was accounted for by these SNPs. These results implicate candidate genes on chromosome 13q12 for the ability to improve submaximal exercise capacity in response to regular exercise. Submaximal exercise at 60% of maximal capacity is an exercise intensity that falls well within the range recommended in the Physical Activity Guidelines for Americans and thus has potential public health relevance.
Rice, Treva K.; Sarzynski, Mark A.; Sung, Yun Ju; Argyropoulos, George; Stütz, Adrian M.; Teran-Garcia, Margarita; Rao, D. C.; Bouchard, Claude
2014-01-01
Although regular exercise improves submaximal aerobic capacity, there is large variability in its response to exercise training. While this variation is thought to be partly due to genetic differences, relatively little is known about the causal genes. Submaximal aerobic capacity traits in the current report include the responses of oxygen consumption (ΔVO260), power output (ΔWORK60), and cardiac output (ΔQ60) at 60% of VO2max to a standardized 20-week endurance exercise training program. Genome-wide linkage analysis in 475 HERITAGE Family Study Caucasians identified a locus on chromosome 13q for ΔVO260 (LOD = 3.11). Follow-up fine mapping involved a dense marker panel of over 1,800 single-nucleotide polymorphisms (SNPs) in a 7.9-Mb region (21.1–29.1 Mb from p-terminus). Single-SNP analyses found 14 SNPs moderately associated with both ΔVO260 at P ≤ 0.005 and the correlated traits of ΔWORK60 and ΔQ60 at P < 0.05. Haplotype analyses provided several strong signals (P<1.0 × 10−5) for ΔVO260. Overall, association analyses narrowed the target region and included potential biological candidate genes (MIPEP and SGCG). Consistent with maximal heritability estimates of 23%, up to 20% of the phenotypic variance in ΔVO260 was accounted for by these SNPs. These results implicate candidate genes on chromosome 13q12 for the ability to improve submaximal exercise capacity in response to regular exercise. Submaximal exercise at 60% of maximal capacity is an exercise intensity that falls well within the range recommended in the Physical Activity Guidelines for Americans and thus has potential public health relevance. PMID:22170014
Adaptations to climate-mediated selective pressures in sheep.
Lv, Feng-Hua; Agha, Saif; Kantanen, Juha; Colli, Licia; Stucki, Sylvie; Kijas, James W; Joost, Stéphane; Li, Meng-Hua; Ajmone Marsan, Paolo
2014-12-01
Following domestication, sheep (Ovis aries) have become essential farmed animals across the world through adaptation to a diverse range of environments and varied production systems. Climate-mediated selective pressure has shaped phenotypic variation and has left genetic "footprints" in the genome of breeds raised in different agroecological zones. Unlike numerous studies that have searched for evidence of selection using only population genetics data, here, we conducted an integrated coanalysis of environmental data with single nucleotide polymorphism (SNP) variation. By examining 49,034 SNPs from 32 old, autochthonous sheep breeds that are adapted to a spectrum of different regional climates, we identified 230 SNPs with evidence for selection that is likely due to climate-mediated pressure. Among them, 189 (82%) showed significant correlation (P ≤ 0.05) between allele frequency and climatic variables in a larger set of native populations from a worldwide range of geographic areas and climates. Gene ontology analysis of genes colocated with significant SNPs identified 17 candidates related to GTPase regulator and peptide receptor activities in the biological processes of energy metabolism and endocrine and autoimmune regulation. We also observed high linkage disequilibrium and significant extended haplotype homozygosity for the core haplotype TBC1D12-CH1 of TBC1D12. The global frequency distribution of the core haplotype and allele OAR22_18929579-A showed an apparent geographic pattern and significant (P ≤ 0.05) correlations with climatic variation. Our results imply that adaptations to local climates have shaped the spatial distribution of some variants that are candidates to underpin adaptive variation in sheep. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome-wide association mapping of partial resistance to Aphanomyces euteiches in pea.
Desgroux, Aurore; L'Anthoëne, Virginie; Roux-Duparque, Martine; Rivière, Jean-Philippe; Aubert, Grégoire; Tayeh, Nadim; Moussart, Anne; Mangin, Pierre; Vetel, Pierrick; Piriou, Christophe; McGee, Rebecca J; Coyne, Clarice J; Burstin, Judith; Baranger, Alain; Manzanares-Dauleux, Maria; Bourion, Virginie; Pilet-Nayel, Marie-Laure
2016-02-20
Genome-wide association (GWA) mapping has recently emerged as a valuable approach for refining the genetic basis of polygenic resistance to plant diseases, which are increasingly used in integrated strategies for durable crop protection. Aphanomyces euteiches is a soil-borne pathogen of pea and other legumes worldwide, which causes yield-damaging root rot. Linkage mapping studies reported quantitative trait loci (QTL) controlling resistance to A. euteiches in pea. However the confidence intervals (CIs) of these QTL remained large and were often linked to undesirable alleles, which limited their application in breeding. The aim of this study was to use a GWA approach to validate and refine CIs of the previously reported Aphanomyces resistance QTL, as well as identify new resistance loci. A pea-Aphanomyces collection of 175 pea lines, enriched in germplasm derived from previously studied resistant sources, was evaluated for resistance to A. euteiches in field infested nurseries in nine environments and with two strains in climatic chambers. The collection was genotyped using 13,204 SNPs from the recently developed GenoPea Infinium® BeadChip. GWA analysis detected a total of 52 QTL of small size-intervals associated with resistance to A. euteiches, using the recently developed Multi-Locus Mixed Model. The analysis validated six of the seven previously reported main Aphanomyces resistance QTL and detected novel resistance loci. It also provided marker haplotypes at 14 consistent QTL regions associated with increased resistance and highlighted accumulation of favourable haplotypes in the most resistant lines. Previous linkages between resistance alleles and undesired late-flowering alleles for dry pea breeding were mostly confirmed, but the linkage between loci controlling resistance and coloured flowers was broken due to the high resolution of the analysis. A high proportion of the putative candidate genes underlying resistance loci encoded stress-related proteins and others suggested that the QTL are involved in diverse functions. This study provides valuable markers, marker haplotypes and germplasm lines to increase levels of partial resistance to A. euteiches in pea breeding.
Comparative analysis of methods for detecting interacting loci
2011-01-01
Background Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and experimental protocols were used in the papers introducing these methods and in subsequent studies. Moreover, there have been very few studies strictly focused on comparison of existing methods. Given the importance of detecting gene-gene and gene-environment interactions, a rigorous, comprehensive comparison of performance and limitations of available interaction detection methods is warranted. Results We report a comparison of eight representative methods, of which seven were specifically designed to detect interactions among single nucleotide polymorphisms (SNPs), with the last a popular main-effect testing method used as a baseline for performance evaluation. The selected methods, multifactor dimensionality reduction (MDR), full interaction model (FIM), information gain (IG), Bayesian epistasis association mapping (BEAM), SNP harvester (SH), maximum entropy conditional probability modeling (MECPM), logistic regression with an interaction term (LRIT), and logistic regression (LR) were compared on a large number of simulated data sets, each, consistent with complex disease models, embedding multiple sets of interacting SNPs, under different interaction models. The assessment criteria included several relevant detection power measures, family-wise type I error rate, and computational complexity. There are several important results from this study. First, while some SNPs in interactions with strong effects are successfully detected, most of the methods miss many interacting SNPs at an acceptable rate of false positives. In this study, the best-performing method was MECPM. Second, the statistical significance assessment criteria, used by some of the methods to control the type I error rate, are quite conservative, thereby limiting their power and making it difficult to fairly compare them. Third, as expected, power varies for different models and as a function of penetrance, minor allele frequency, linkage disequilibrium and marginal effects. Fourth, the analytical relationships between power and these factors are derived, aiding in the interpretation of the study results. Fifth, for these methods the magnitude of the main effect influences the power of the tests. Sixth, most methods can detect some ground-truth SNPs but have modest power to detect the whole set of interacting SNPs. Conclusion This comparison study provides new insights into the strengths and limitations of current methods for detecting interacting loci. This study, along with freely available simulation tools we provide, should help support development of improved methods. The simulation tools are available at: http://code.google.com/p/simulation-tool-bmc-ms9169818735220977/downloads/list. PMID:21729295
Comparative analysis of methods for detecting interacting loci.
Chen, Li; Yu, Guoqiang; Langefeld, Carl D; Miller, David J; Guy, Richard T; Raghuram, Jayaram; Yuan, Xiguo; Herrington, David M; Wang, Yue
2011-07-05
Interactions among genetic loci are believed to play an important role in disease risk. While many methods have been proposed for detecting such interactions, their relative performance remains largely unclear, mainly because different data sources, detection performance criteria, and experimental protocols were used in the papers introducing these methods and in subsequent studies. Moreover, there have been very few studies strictly focused on comparison of existing methods. Given the importance of detecting gene-gene and gene-environment interactions, a rigorous, comprehensive comparison of performance and limitations of available interaction detection methods is warranted. We report a comparison of eight representative methods, of which seven were specifically designed to detect interactions among single nucleotide polymorphisms (SNPs), with the last a popular main-effect testing method used as a baseline for performance evaluation. The selected methods, multifactor dimensionality reduction (MDR), full interaction model (FIM), information gain (IG), Bayesian epistasis association mapping (BEAM), SNP harvester (SH), maximum entropy conditional probability modeling (MECPM), logistic regression with an interaction term (LRIT), and logistic regression (LR) were compared on a large number of simulated data sets, each, consistent with complex disease models, embedding multiple sets of interacting SNPs, under different interaction models. The assessment criteria included several relevant detection power measures, family-wise type I error rate, and computational complexity. There are several important results from this study. First, while some SNPs in interactions with strong effects are successfully detected, most of the methods miss many interacting SNPs at an acceptable rate of false positives. In this study, the best-performing method was MECPM. Second, the statistical significance assessment criteria, used by some of the methods to control the type I error rate, are quite conservative, thereby limiting their power and making it difficult to fairly compare them. Third, as expected, power varies for different models and as a function of penetrance, minor allele frequency, linkage disequilibrium and marginal effects. Fourth, the analytical relationships between power and these factors are derived, aiding in the interpretation of the study results. Fifth, for these methods the magnitude of the main effect influences the power of the tests. Sixth, most methods can detect some ground-truth SNPs but have modest power to detect the whole set of interacting SNPs. This comparison study provides new insights into the strengths and limitations of current methods for detecting interacting loci. This study, along with freely available simulation tools we provide, should help support development of improved methods. The simulation tools are available at: http://code.google.com/p/simulation-tool-bmc-ms9169818735220977/downloads/list.
Gao, J B; Li, Y K; Yang, N; Ma, X H; Adoligbe, C; Jiang, B J; Fu, C Z; Cheng, G; Zan, L S
2013-02-28
The aim of this study was to determine whether single nucleotide polymorphisms (SNPs) of bovine Dickkopf homolog 4 (DKK4) are associated with body measurement traits in Qinchuan cattle. By using PCR-SSCP technology and DNA sequencing, we discovered 5 DKK4 SNPs in Qingchuan cattle, including -65G>A and -77G>T in the 5'-untranslated region, 1532C>G and 1533T>C in exon 2, and 2088C>T in exon 3. The sequencing map showed that 1532C>G and 1533T>C were in close linkage disequilibrium and were treated as 1532C>G-1533T>C in this study. Allele frequencies were calculated and analyzed by the chi-square test, which showed that -65G>A and 1532C>G-1533T>C were in Hardy-Weinberg equilibrium (P > 0.05), whereas -77G>T and 2088C>T were not in all 633 tested Qinchuan cattle individuals (P < 0.01). Gene heterozygosity (HE), effective allele number (NE), and polymorphism information content (PIC) were 0.407, 1.686, and 0.324 at -65G>A; 0.472, 1.894, and 0.361 at -77G>T; 0.476, 1.908, and 0.363 at 1532C>G-1533T>C; and 0.218, 1.279, and 0.195 at 2088C>T. We also evaluated the potential association of these SNPs with body measurement traits in all 633 individuals; the results suggest that several SNPs in Qinchuan cattle DKK4 were significantly associated with body length, hip height, rump length, hip width, heart girth, and pin bone width (P < 0.05 and P < 0.01). These results suggest that bovine DKK4 could be used as candidate gene for Qinchuan cattle breeding.
Ken-Dror, Gie; Goldbourt, Uri; Dankner, Rachel
2010-05-01
Several polymorphisms in the ApoA5 gene emerged as important candidate genes in triglyceride metabolism. The aim of this study was to determine the associations between ApoA5 polymorphisms, plasma triglyceride concentrations and the presence of cardiovascular disease (CVD) in three ethnic origins. Genotypes for 15 single nucleotide polymorphisms (SNPs) were determined in 659 older adults (mean age 71+/-7 years) who immigrated to Israel or whose ancestors originated from East Europe (Ashkenazi), North Africa, Asia (Sephardic) or Yemen (Yemenite). The minor alleles of the four common SNPs (rs662799, rs651821, rs2072560 and rs2266788) are associated with an increase of 27-38% in triglyceride concentration among Ashkenazi and Yemenite Jews compared with the major alleles, but not among those of Sephardic origin. Conversely, among the Sephardic group, the presence of the minor allele in SNP rs3135506 compared with the major allele was associated with an increase of 34% in triglyceride concentration. The four SNPs were in significant linkage disequilibrium (D'=0.96-0.99), resulting in three haplotypes H1, H2 and H3, representing 98-99% of the population. Haplotype H2 was significantly associated with triglyceride concentration among Ashkenazi and Yemenite but not among Sephardic Jews. Conversely, haplotype H3 was associated with triglyceride concentration in Sephardic but not in Ashkenazi and Yemenite Jews. Ashkenazi carriers of H2 haplotype had a CVD odds ratio of 2.19 (95% CI: 1.05-4.58) compared with H1 (the most frequent), after adjustment for all other risk factors. These results suggest that different SNPs in ApoA5 polymorphisms may be associated with triglyceride concentration and CVD in each of these ethnic origins.
Sabar, Muhammad Farooq; Ghani, Muhammad Usman; Shahid, Mariam; Sumrin, Aleena; Ali, Amjad; Akram, Muhammad; Tariq, Muhammad Akram; Bano, Iqbal
2016-01-01
A disintegrin and metalloproteinase 33 (ADAM33) gene has been considered as an asthma susceptibility gene due to its possible role in airway remodeling, abnormal cell proliferation, and differentiation. Association of this gene with asthma has been reported in several genetic studies on various populations. The current study aims to evaluate the association of ADAM33 gene polymorphisms with the risk of asthma in the Punjabi population of Pakistan. A total of 101 asthma patients and 102 age-matched healthy controls from Lahore, a city in Punjab, were recruited. ADAM33 single nucleotide polymorphisms (SNPs) T + 1[rs2280089], T2[rs2280090], T1[rs2280091], ST + 5[rs597980], ST + 4[rs44707], S2[rs528557], Q - 1[rs612709], and F + 1[rs511898] were genotyped in both patients and controls using single base extension and capillary electrophoresis-based genetic analyzer. The basic allelic and genotypic model was analyzed for association of the SNPs with asthma using SHEsis software. Haploview software was used to calculate pairwise linkage disequilibrium (LD) among six of the genotyped SNPs. Of the 8 SNPs genotyped, only S2[rs528557] showed significant association with asthma (Allele p = 0.0189, Genotype p = 0.021). SNPs T + 1[rs2280089], T2[rs2280090], T1[rs2280091], ST + 4[rs44707], S2[rs528557], and Q - 1[rs612709] were found to be in moderate to strong LD. The significantly higher frequency of haplotype "AAGTCG" in healthy controls suggests a protective effect against asthma risk in the studied population (p = 0.0059). These findings suggest that genetic variants of ADAM33 gene may play important roles in asthma susceptibility in the Punjabi population of Pakistan.
Liu, Jing; Wang, Zheng; He, Guanglin; Zhao, Xueying; Wang, Mengge; Luo, Tao; Li, Chengtao; Hou, Yiping
2018-07-01
Massively parallel sequencing (MPS) technologies can sequence many targeted regions of multiple samples simultaneously and are gaining great interest in the forensic community. The Precision ID Identity Panel contains 90 autosomal SNPs and 34 upper Y-Clade SNPs, which was designed with small amplicons and optimized for forensic degraded or challenging samples. Here, 184 unrelated individuals from three East Asian minority ethnicities (Tibetan, Uygur and Hui) were analyzed using the Precision ID Identity Panel and the Ion PGM System. The sequencing performance and corresponding forensic statistical parameters of this MPS-SNP panel were investigated. The inter-population relationships and substructures among three investigated populations and 30 worldwide populations were further investigated using PCA, MDS, cladogram and STRUCTURE. No significant deviation from Hardy-Weinberg equilibrium (HWE) and Linkage Disequilibrium (LD) tests was observed across all 90 autosomal SNPs. The combined matching probability (CMP) for Tibetan, Uygur and Hui were 2.5880 × 10 -33 , 1.7480 × 10 -35 and 4.6326 × 10 -34 respectively, and the combined power of exclusion (CPE) were 0.999999386152271, 0.999999607712827 and 0.999999696360182 respectively. For 34 Y-SNPs, only 16 haplogroups were obtained, but the haplogroup distributions differ among the three populations. Tibetans from the Sino-Tibetan population and Hui with multiple ethnicities with an admixture population have genetic affinity with East Asian populations, while Uygurs of a Eurasian admixture population have similar genetic components to the South Asian populations and are distributed between East Asian and European populations. The aforementioned results suggest that the Precision ID Identity Panel is informative and polymorphic in three investigated populations and could be used as an effective tool for human forensics. Copyright © 2018 Elsevier B.V. All rights reserved.
Müller, Bárbara S F; Neves, Leandro G; de Almeida Filho, Janeo E; Resende, Márcio F R; Muñoz, Patricio R; Dos Santos, Paulo E T; Filho, Estefano Paludzyszyn; Kirst, Matias; Grattapaglia, Dario
2017-07-11
The advent of high-throughput genotyping technologies coupled to genomic prediction methods established a new paradigm to integrate genomics and breeding. We carried out whole-genome prediction and contrasted it to a genome-wide association study (GWAS) for growth traits in breeding populations of Eucalyptus benthamii (n =505) and Eucalyptus pellita (n =732). Both species are of increasing commercial interest for the development of germplasm adapted to environmental stresses. Predictive ability reached 0.16 in E. benthamii and 0.44 in E. pellita for diameter growth. Predictive abilities using either Genomic BLUP or different Bayesian methods were similar, suggesting that growth adequately fits the infinitesimal model. Genomic prediction models using ~5000-10,000 SNPs provided predictive abilities equivalent to using all 13,787 and 19,506 SNPs genotyped in the E. benthamii and E. pellita populations, respectively. No difference was detected in predictive ability when different sets of SNPs were utilized, based on position (equidistantly genome-wide, inside genes, linkage disequilibrium pruned or on single chromosomes), as long as the total number of SNPs used was above ~5000. Predictive abilities obtained by removing relatedness between training and validation sets fell near zero for E. benthamii and were halved for E. pellita. These results corroborate the current view that relatedness is the main driver of genomic prediction, although some short-range historical linkage disequilibrium (LD) was likely captured for E. pellita. A GWAS identified only one significant association for volume growth in E. pellita, illustrating the fact that while genome-wide regression is able to account for large proportions of the heritability, very little or none of it is captured into significant associations using GWAS in breeding populations of the size evaluated in this study. This study provides further experimental data supporting positive prospects of using genome-wide data to capture large proportions of trait heritability and predict growth traits in trees with accuracies equal or better than those attainable by phenotypic selection. Additionally, our results document the superiority of the whole-genome regression approach in accounting for large proportions of the heritability of complex traits such as growth in contrast to the limited value of the local GWAS approach toward breeding applications in forest trees.
Gutierrez, Alejandro P; Turner, Frances; Gharbi, Karim; Talbot, Richard; Lowe, Natalie R; Peñaloza, Carolina; McCullough, Mark; Prodöhl, Paulo A; Bean, Tim P; Houston, Ross D
2017-07-05
SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster ( Crassostrea gigas ) and European flat oyster ( Ostrea edulis ), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples ( n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families ( n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs. Copyright © 2017 Gutierrez et al.
Genome wide association study (GWAS) for grain yield in rice cultivated under water deficit.
Pantalião, Gabriel Feresin; Narciso, Marcelo; Guimarães, Cléber; Castro, Adriano; Colombari, José Manoel; Breseghello, Flavio; Rodrigues, Luana; Vianello, Rosana Pereira; Borba, Tereza Oliveira; Brondani, Claudio
2016-12-01
The identification of rice drought tolerant materials is crucial for the development of best performing cultivars for the upland cultivation system. This study aimed to identify markers and candidate genes associated with drought tolerance by Genome Wide Association Study analysis, in order to develop tools for use in rice breeding programs. This analysis was made with 175 upland rice accessions (Oryza sativa), evaluated in experiments with and without water restriction, and 150,325 SNPs. Thirteen SNP markers associated with yield under drought conditions were identified. Through stepwise regression analysis, eight SNP markers were selected and validated in silico, and when tested by PCR, two out of the eight SNP markers were able to identify a group of rice genotypes with higher productivity under drought. These results are encouraging for deriving markers for the routine analysis of marker assisted selection. From the drought experiment, including the genes inherited in linkage blocks, 50 genes were identified, from which 30 were annotated, and 10 were previously related to drought and/or abiotic stress tolerance, such as the transcription factors WRKY and Apetala2, and protein kinases.
Hodoğlugil, Uğur; Williamson, David W.; Yu, Yi; Farrer, Lindsay A.; Mahley, Robert W.
2011-01-01
Summary We narrowed chromosome 15q21-23 linkage to plasma high density lipoprotein cholesterol (HDL-C) levels in atherogenic dyslipidemic Turkish families by fine mapping, then focused on glucuronic acid epimerase (GLCE), a heparan sulfate proteoglycan (HSPG) biosynthesis enzyme. HSPGs participate in lipid metabolism along with apolipoprotein (apo) E. Of 31 SNPs in the GLCE locus, nine analyzed by haplotype were associated with plasma HDL-C and triglyceride levels (permuted p = 0.006 and 0.013, respectively) in families. Of five tagging GLCE SNPs in two cohorts of unrelated subjects, three (rs16952868, rs11631403, rs3865014) were associated with triglyceride and HDL-C levels in males (non-permuted p < 0.05). The association was stronger in APOE 2/3 subjects (apoE2 has reduced binding to HSPGs) and reached multiple-testing significance (p < 0.05) in both males and females (n = 2612). Similar results were obtained in the second cohort (n = 1164). Interestingly, at the GLCE locus, bounded by recombination hotspots, Turks had a minor allele frequency of SNPs resembling Chinese more than European ancestry; adjoining regions on chromosome 15 resembled the European pattern. Studies of glce+/–apoe–/– mice fed a chow or high-fat diet supported a role for GLCE in lipid metabolism. Thus, SNPs in GLCE are associated with triglyceride and HDL-C levels in Turks, and mouse studies support a role for glce in lipid metabolism. PMID:21488854
2011-01-01
Introduction The Toll-like receptor 7 (TLR7) gene, encoded on human chromosome Xp22.3, is crucial for type I interferon production. A recent multicenter study in East Asian populations, comprising Chinese, Korean and Japanese participants, identified an association of a TLR7 single-nucleotide polymorphism (SNP) located in the 3' untranslated region (3' UTR), rs3853839, with systemic lupus erythematosus (SLE), especially in males, although some difference was observed among the tested populations. To test whether additional polymorphisms contribute to SLE in Japanese, we systematically analyzed the association of TLR7 with SLE in a Japanese female population. Methods A case-control association study was conducted on eight tag SNPs in the TLR7 region, including rs3853839, in 344 Japanese females with SLE and 274 healthy female controls. Results In addition to rs3853839, two SNPs in intron 2, rs179019 and rs179010, which were in moderate linkage disequilibrium with each other (r2 = 0.53), showed an association with SLE (rs179019: P = 0.016, odds ratio (OR) 2.02, 95% confidence interval (95% CI) 1.15 to 3.54; rs179010: P = 0.018, OR 1.75, 95% CI 1.10 to 2.80 (both under the recessive model)). Conditional logistic regression analysis revealed that the association of the intronic SNPs and the 3' UTR SNP remained significant after we adjusted them for each other. When only the patients and controls carrying the risk genotypes at the 3' UTR SNPpositionwere analyzed, the risk of SLE was significantly increased when the individuals also carried the risk genotypes at both of the intronic SNPs (P = 0.0043, OR 2.45, 95% CI 1.31 to 4.60). Furthermore, the haplotype containing the intronic risk alleles in addition to the 3' UTR risk allele was associated with SLE under the recessive model (P = 0.016, OR 2.37, 95% CI 1.17 to 4.80), but other haplotypes were not associated with SLE. Conclusions The TLR7 intronic SNPs rs179019 and rs179010 are associated with SLE independently of the 3' UTR SNP rs3853839 in Japanese women. Our findings support a role of TLR7 in predisposition for SLE in Asian populations. PMID:21396113
Sung, Yun J; Gu, C Charles; Tiwari, Hemant K; Arnett, Donna K; Broeckel, Ulrich; Rao, Dabeeru C
2012-07-01
Genotype imputation provides imputation of untyped single nucleotide polymorphisms (SNPs) that are present on a reference panel such as those from the HapMap Project. It is popular for increasing statistical power and comparing results across studies using different platforms. Imputation for African American populations is challenging because their linkage disequilibrium blocks are shorter and also because no ideal reference panel is available due to admixture. In this paper, we evaluated three imputation strategies for African Americans. The intersection strategy used a combined panel consisting of SNPs polymorphic in both CEU and YRI. The union strategy used a panel consisting of SNPs polymorphic in either CEU or YRI. The merge strategy merged results from two separate imputations, one using CEU and the other using YRI. Because recent investigators are increasingly using the data from the 1000 Genomes (1KG) Project for genotype imputation, we evaluated both 1KG-based imputations and HapMap-based imputations. We used 23,707 SNPs from chromosomes 21 and 22 on Affymetrix SNP Array 6.0 genotyped for 1,075 HyperGEN African Americans. We found that 1KG-based imputations provided a substantially larger number of variants than HapMap-based imputations, about three times as many common variants and eight times as many rare and low-frequency variants. This higher yield is expected because the 1KG panel includes more SNPs. Accuracy rates using 1KG data were slightly lower than those using HapMap data before filtering, but slightly higher after filtering. The union strategy provided the highest imputation yield with next highest accuracy. The intersection strategy provided the lowest imputation yield but the highest accuracy. The merge strategy provided the lowest imputation accuracy. We observed that SNPs polymorphic only in CEU had much lower accuracy, reducing the accuracy of the union strategy. Our findings suggest that 1KG-based imputations can facilitate discovery of significant associations for SNPs across the whole MAF spectrum. Because the 1KG Project is still under way, we expect that later versions will provide better imputation performance. © 2012 Wiley Periodicals, Inc.
Uemoto, Yoshinobu; Sasaki, Shinji; Kojima, Takatoshi; Sugimoto, Yoshikazu; Watanabe, Toshio
2015-11-19
Genetic variance that is not captured by single nucleotide polymorphisms (SNPs) is due to imperfect linkage disequilibrium (LD) between SNPs and quantitative trait loci (QTLs), and the extent of LD between SNPs and QTLs depends on different minor allele frequencies (MAF) between them. To evaluate the impact of MAF of QTLs on genomic evaluation, we performed a simulation study using real cattle genotype data. In total, 1368 Japanese Black cattle and 592,034 SNPs (Illumina BovineHD BeadChip) were used. We simulated phenotypes using real genotypes under different scenarios, varying the MAF categories, QTL heritability, number of QTLs, and distribution of QTL effect. After generating true breeding values and phenotypes, QTL heritability was estimated and the prediction accuracy of genomic estimated breeding value (GEBV) was assessed under different SNP densities, prediction models, and population size by a reference-test validation design. The extent of LD between SNPs and QTLs in this population was higher in the QTLs with high MAF than in those with low MAF. The effect of MAF of QTLs depended on the genetic architecture, evaluation strategy, and population size in genomic evaluation. In genetic architecture, genomic evaluation was affected by the MAF of QTLs combined with the QTL heritability and the distribution of QTL effect. The number of QTL was not affected on genomic evaluation if the number of QTL was more than 50. In the evaluation strategy, we showed that different SNP densities and prediction models affect the heritability estimation and genomic prediction and that this depends on the MAF of QTLs. In addition, accurate QTL heritability and GEBV were obtained using denser SNP information and the prediction model accounted for the SNPs with low and high MAFs. In population size, a large sample size is needed to increase the accuracy of GEBV. The MAF of QTL had an impact on heritability estimation and prediction accuracy. Most genetic variance can be captured using denser SNPs and the prediction model accounted for MAF, but a large sample size is needed to increase the accuracy of GEBV under all QTL MAF categories.
Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M; Kenney, P Brett; Vallejo, Roger L; Palti, Yniv; Leeds, Timothy D
2016-01-01
Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0-1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (intron and exons). Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout.
Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M.; Kenney, P. Brett; Vallejo, Roger L.; Palti, Yniv; Leeds, Timothy D.
2016-01-01
Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0–1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (intron and exons). Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout. PMID:27920797
Nieuwenhuis, Maartje A.; Siedlinski, Matteusz; van den Berge, Maarten; Granell, Raquel; Li, Xingnan; Niens, Marijke; van der Vlies, Pieter; Altmüller, Janine; Nürnberg, Peter; Kerkhof, Marjan; van Schayck, Onno C.; Riemersma, Ronald A.; van der Molen, Thys; de Monchy, Jan G.; Bossé, Yohan; Sandford, Andrew; Bruijnzeel-Koomen, Carla A.; van Wijk, Roy G.; ten Hacken, Nick H.; Timens, Wim; Boezen, H. Marike; Henderson, John; Kabesch, Michael; Vonk, Judith M.; Postma, Dirkje S.; Koppelman, Gerard H.
2016-01-01
Background Genome wide association studies (GWAS) of asthma have identified single nucleotide polymorphisms (SNPs) that modestly increase the risk for asthma. This could be due to phenotypic heterogeneity of asthma. Bronchial hyperresponsiveness (BHR) is a phenotypic hallmark of asthma. We aim to identify susceptibility genes for asthma combined with BHR and analyse the presence of cis-eQTLs among replicated SNPs. Secondly, we compare the genetic association of SNPs previously associated with (doctor diagnosed) asthma to our GWAS of asthma with BHR. Methods A GWAS was performed in 920 asthmatics with BHR and 980 controls. Top SNPs of our GWAS were analysed in four replication cohorts and lung cis-eQTL analysis was performed on replicated SNPs. We investigated association of SNPs previously associated with asthma in our data. Results 368 SNPs were followed up for replication. Six SNPs in genes encoding ABI3BP, NAF1, MICA and the 17q21 locus replicated in one or more cohorts, with one locus (17q21) achieving genome wide significance after meta-analysis. Five out of 6 replicated SNPs regulated 35 gene transcripts in whole lung. Eight of 20 asthma associated SNPs from previous GWAS were significantly associated with asthma and BHR. Three SNPs, in IL-33 and GSDMB, showed larger effect sizes in our data compared to published literature. Conclusions Combining GWAS with subsequent lung eQTL analysis revealed disease associated SNPs regulating lung mRNA expression levels of potential new asthma genes. Adding BHR to the asthma definition does not lead to an overall larger genetic effect size than analysing (doctor’s diagnosed) asthma. PMID:27439200
Kawakami, Takeshi; Backström, Niclas; Burri, Reto; Husby, Arild; Olason, Pall; Rice, Amber M; Ålund, Murielle; Qvarnström, Anna; Ellegren, Hans
2014-01-01
With the access to draft genome sequence assemblies and whole-genome resequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single-nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system. PMID:24784959
2012-01-01
Background The Asteraceae species Cynara cardunculus (2n = 2x = 34) includes the two fully cross-compatible domesticated taxa globe artichoke (var. scolymus L.) and cultivated cardoon (var. altilis DC). As both are out-pollinators and suffer from marked inbreeding depression, linkage analysis has focussed on the use of a two way pseudo-test cross approach. Results A set of 172 microsatellite (SSR) loci derived from expressed sequence tag DNA sequence were integrated into the reference C. cardunculus genetic maps, based on segregation among the F1 progeny of a cross between a globe artichoke and a cultivated cardoon. The resulting maps each detected 17 major linkage groups, corresponding to the species’ haploid chromosome number. A consensus map based on 66 co-dominant shared loci (64 SSRs and two SNPs) assembled 694 loci, with a mean inter-marker spacing of 2.5 cM. When the maps were used to elucidate the pattern of inheritance of head production earliness, a key commercial trait, seven regions were shown to harbour relevant quantitative trait loci (QTL). Together, these QTL accounted for up to 74% of the overall phenotypic variance. Conclusion The newly developed consensus as well as the parental genetic maps can accelerate the process of tagging and eventually isolating the genes underlying earliness in both the domesticated C. cardunculus forms. The largest single effect mapped to the same linkage group in each parental maps, and explained about one half of the phenotypic variance, thus representing a good candidate for marker assisted selection. PMID:22621324
Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W
2014-09-01
A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.
Bassi, Denis; Briñez, Boris; Rosa, Juliana Santa; Oblessuc, Paula Rodrigues; de Almeida, Caléo Panhoca; Nucci, Stella Maris; da Silva, Larissa Chariel Domingos; Chiorato, Alisson Fernando; Vianello, Rosana Pereira; Camargo, Luis Eduardo Aranha; Blair, Matthew Wohlgemuth; Benchimol-Reis, Luciana Lasry
2017-01-01
Abstract Angular leaf spot (ALS) and powdery mildew (PWM) are two important fungi diseases causing significant yield losses in common beans. In this study, a new genetic linkage map was constructed using single sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs), in a segregating population derived from the AND 277 x SEA 5 cross, with 105 recombinant inbred lines. Phenotypic evaluations were performed in the greenhouse to identify quantitative trait loci (QTLs) associated with resistance by means of the composite interval mapping analysis. Four QTLs were identified for ALS resistance. The QTL ALS11AS, linked on the SNP BAR 5054, mapped on chromosome Pv11, showed the greatest effect (R2 = 26.5%) on ALS phenotypic variance. For PWM resistance, two QTLs were detected, PWM2AS and PWM11AS, on Pv2 and Pv11, explaining 7% and 66% of the phenotypic variation, respectively. Both QTLs on Pv11 were mapped on the same genomic region, suggesting that it is a pleiotropic region. The present study resulted in the identification of new markers closely linked to ALS and PWM QTLs, which can be used for marker-assisted selection, fine mapping and positional cloning. PMID:28222201
Shared heritability of attention-deficit/hyperactivity disorder and autism spectrum disorder.
Rommelse, Nanda N J; Franke, Barbara; Geurts, Hilde M; Hartman, Catharina A; Buitelaar, Jan K
2010-03-01
Attention-deficit/hyperactivity disorder (ADHD) and autism spectrum disorder (ASD) are both highly heritable neurodevelopmental disorders. Evidence indicates both disorders co-occur with a high frequency, in 20-50% of children with ADHD meeting criteria for ASD and in 30-80% of ASD children meeting criteria for ADHD. This review will provide an overview on all available studies [family based, twin, candidate gene, linkage, and genome wide association (GWA) studies] shedding light on the role of shared genetic underpinnings of ADHD and ASD. It is concluded that family and twin studies do provide support for the hypothesis that ADHD and ASD originate from partly similar familial/genetic factors. Only a few candidate gene studies, linkage studies and GWA studies have specifically addressed this co-occurrence, pinpointing to some promising pleiotropic genes, loci and single nucleotide polymorphisms (SNPs), but the research field is in urgent need for better designed and powered studies to tackle this complex issue. We propose that future studies examining shared familial etiological factors for ADHD and ASD use a family-based design in which the same phenotypic (ADHD and ASD), candidate endophenotypic, and environmental measurements are obtained from all family members. Multivariate multi-level models are probably best suited for the statistical analysis.
Zhu, Xiaofeng; Yan, Denise; Cooper, Richard S.; Luke, Amy; Ikeda, Morna A.; Chang, Yen-Pei C.; Weder, Alan; Chakravarti, Aravinda
2003-01-01
Association studies of candidate genes with complex traits have generally used one or a few single nucleotide polymorphisms (SNPs), although variation in the extent of linkage disequilibrium (LD) within genes markedly influences the sensitivity and precision of association studies. The extent of LD and the underlying haplotype structure for most candidate genes are still unavailable. We sampled 193 blacks (African-Americans) and 160 whites (European-Americans) and estimated the intragenic LD and the haplotype structure in four genes of the renin–angiotensin system. We genotyped 25 SNPs, with all but one of the pairs spaced between 1 and 20 kb, thus providing resolution at small scale. The pattern of LD within a gene was very heterogeneous. Using a robust method to define haplotype blocks, blocks of limited haplotype diversity were identified at each locus; between these blocks, LD was lost owing to the history of recombination events. As anticipated, there was less LD among blacks, the number of haplotypes was substantially larger, and shorter haplotype segments were found, compared with whites. These findings have implications for candidate-gene association studies and indicate that variation between populations of European and African origin in haplotype diversity is characteristic of most genes. [The sequence data described in this paper are available in GenBank under the following accession nos: AGT, MIM 106150; Renin, MIM 179820; ACE, MIM 106180; Angiotensin receptor I, MIM 106165. Supplementary material is available online at http://www.genome.org.] PMID:12566395
Association of the oxytocin receptor gene (OXTR) in Caucasian children and adolescents with autism.
Jacob, Suma; Brune, Camille W; Carter, C S; Leventhal, Bennett L; Lord, Catherine; Cook, Edwin H
2007-04-24
The oxytocin receptor gene (OXTR) has been studied in autism because of the role of oxytocin (OT) in social cognition. Linkage has also been demonstrated to the region of OXTR in a large sample. Two single nucleotide polymorphisms (SNPs) and a haplotype constructed from them in OXTR have been associated with autism in the Chinese Han population. We tested whether these associations replicated in a Caucasian sample with strictly defined autistic disorder. We genotyped the two previously associated SNPs (rs2254298, rs53576) in 57 Caucasian autism trios. Probands met clinical, ADI-R, and ADOS criteria for autistic disorder. Significant association was detected at rs2254298 (p=0.03) but not rs53576. For rs2254298, overtransmission of the G allele to probands with autistic disorder was found which contrasts with the overtransmission of A previously reported in the Chinese Han sample. In both samples, G was more frequent than A. However, in our Caucasian autism trios and the CEU Caucasian HapMap samples the frequency of A was less than that reported in the Chinese Han and Chinese in Bejing HapMap samples. The haplotype test of association did not reveal excess transmission from parents to affected offspring. These findings provide support for association of OXTR with autism in a Caucasian population. Overtransmission of different alleles in different populations may be due to a different pattern of linkage disequilibrium between the marker rs2254298 and an as yet undetermined susceptibility variant in OXTR.