CsSNP: A Web-Based Tool for the Detecting of Comparative Segments SNPs.
Wang, Yi; Wang, Shuangshuang; Zhou, Dongjie; Yang, Shuai; Xu, Yongchao; Yang, Chao; Yang, Long
2016-07-01
SNP (single nucleotide polymorphism) is a popular tool for the study of genetic diversity, evolution, and other areas. Therefore, it is necessary to develop a convenient, utility, robust, rapid, and open source detecting-SNP tool for all researchers. Since the detection of SNPs needs special software and series steps including alignment, detection, analysis and present, the study of SNPs is limited for nonprofessional users. CsSNP (Comparative segments SNP, http://biodb.sdau.edu.cn/cssnp/ ) is a freely available web tool based on the Blat, Blast, and Perl programs to detect comparative segments SNPs and to show the detail information of SNPs. The results are filtered and presented in the statistics figure and a Gbrowse map. This platform contains the reference genomic sequences and coding sequences of 60 plant species, and also provides new opportunities for the users to detect SNPs easily. CsSNP is provided a convenient tool for nonprofessional users to find comparative segments SNPs in their own sequences, and give the users the information and the analysis of SNPs, and display these data in a dynamic map. It provides a new method to detect SNPs and may accelerate related studies.
Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S
2009-03-01
A genome-wide BAC physical map of the apple, Malus x domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct an integrated physical and genetic maps for apple.
2011-01-01
Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers. PMID:21767361
Localization of Allotetraploid Gossypium SNPs Using Physical Mapping Resources
USDA-ARS?s Scientific Manuscript database
Recent efforts in Gossypium SNP development have produced thousands of putative SNPs for G. barbadense, G. mustelinum, and G. tomentosum relative to G. hirsutum. Here we report on current efforts to localize putative SNPs using physical mapping resources. Recent advances in physical mapping resour...
Shetty, Priya B.; Tang, Hua; Feng, Tao; Tayo, Bamidele; Morrison, Alanna C.; Kardia, Sharon L.R.; Hanis, Craig L.; Arnett, Donna K.; Hunt, Steven C.; Boerwinkle, Eric; Rao, D.C.; Cooper, R.S.; Risch, Neil; Zhu, Xiaofeng
2015-01-01
Background Admixture mapping of lipids was followed-up by family-based association analysis to identify variants for cardiovascular disease in African-Americans. Methods and Results The present study conducted admixture mapping analysis for total cholesterol, high-density lipoprotein cholesterol (HDL-C), low-density lipoprotein cholesterol (LDL-C) and triglycerides. The analysis was performed in 1,905 unrelated African-American subjects from the National Heart, Lung and Blood Institute’s Family Blood Pressure Program. Regions showing admixture evidence were followed-up with family-based association analysis in 3,556 African-American subjects from the FBPP. The admixture mapping and family-based association analyses were adjusted for age, age2, sex, body-mass-index, and genome-wide mean ancestry to minimize the confounding due to population stratification. Regions that were suggestive of local ancestry association evidence were found on chromosomes 7 (LDL-C), 8 (HDL-C), 14 (triglycerides) and 19 (total cholesterol and triglycerides). In the fine-mapping analysis, 52,939 SNPs were tested and 11 SNPs (8 independent SNPs) showed nominal significant association with HDL-C (2 SNPs), LDL-C (4 SNPs) and triglycerides (5 SNPs). The family data was used in the fine-mapping to identify SNPs that showed novel associations with lipids and regions including genes with known associations for cardiovascular disease. Conclusions This study identified regions on chromosomes 7, 8, 14 and 19 and 11 SNPs from the fine-mapping analysis that were associated with HDL-C, LDL-C and triglycerides for further studies of cardiovascular disease in African-Americans. PMID:25552592
Aromatase Inhibitor-Associated Bone Fractures: A Case-Cohort GWAS and Functional Genomics
Liu, Mohan; Goss, Paul E.; Ingle, James N.; Kubo, Michiaki; Furukawa, Yoichi; Batzler, Anthony; Jenkins, Gregory D.; Carlson, Erin E.; Nakamura, Yusuke; Schaid, Daniel J.; Chapman, Judy-Anne W.; Shepherd, Lois E.; Ellis, Matthew J.; Khosla, Sundeep; Wang, Liewei
2014-01-01
Bone fractures are a major consequence of osteoporosis. There is a direct relationship between serum estrogen concentrations and osteoporosis risk. Aromatase inhibitors (AIs) greatly decrease serum estrogen levels in postmenopausal women, and increased incidence of fractures is a side effect of AI therapy. We performed a discovery case-cohort genome-wide association study (GWAS) using samples from 1071 patients, 231 cases and 840 controls, enrolled in the MA.27 breast cancer AI trial to identify genetic factors involved in AI-related fractures, followed by functional genomic validation. Association analyses identified 20 GWAS single nucleotide polymorphism (SNP) signals with P < 5E-06. After removal of signals in gene deserts and those composed entirely of imputed SNPs, we applied a functional validation “decision cascade” that resulted in validation of the CTSZ-SLMO2-ATP5E, TRAM2-TMEM14A, and MAP4K4 genes. These genes all displayed estradiol (E2)-dependent induction in human fetal osteoblasts transfected with estrogen receptor-α, and their knockdown altered the expression of known osteoporosis-related genes. These same genes also displayed SNP-dependent variation in E2 induction that paralleled the SNP-dependent induction of known osteoporosis genes, such as osteoprotegerin. In summary, our case-cohort GWAS identified SNPs in or near CTSZ-SLMO2-ATP5E, TRAM2-TMEM14A, and MAP4K4 that were associated with risk for bone fracture in estrogen receptor-positive breast cancer patients treated with AIs. These genes displayed E2-dependent induction, their knockdown altered the expression of genes related to osteoporosis, and they displayed SNP genotype-dependent variation in E2 induction. These observations may lead to the identification of novel mechanisms associated with fracture risk in postmenopausal women treated with AIs. PMID:25148458
Khatkar, Mehar S.; Zenger, Kyall R.; Hobbs, Matthew; Hawken, Rachel J.; Cavanagh, Julie A. L.; Barris, Wes; McClintock, Alexander E.; McClintock, Sara; Thomson, Peter C.; Tier, Bruce; Nicholas, Frank W.; Raadsma, Herman W.
2007-01-01
Analysis of data on 1000 Holstein–Friesian bulls genotyped for 15,036 single-nucleotide polymorphisms (SNPs) has enabled genomewide identification of haplotype blocks and tag SNPs. A final subset of 9195 SNPs in Hardy–Weinberg equilibrium and mapped on autosomes on the bovine sequence assembly (release Btau 3.1) was used in this study. The average intermarker spacing was 251.8 kb. The average minor allele frequency (MAF) was 0.29 (0.05–0.5). Following recent precedents in human HapMap studies, a haplotype block was defined where 95% of combinations of SNPs within a region are in very high linkage disequilibrium. A total of 727 haplotype blocks consisting of ≥3 SNPs were identified. The average block length was 69.7 ± 7.7 kb, which is ∼5–10 times larger than in humans. These blocks comprised a total of 2964 SNPs and covered 50,638 kb of the sequence map, which constitutes 2.18% of the length of all autosomes. A set of tag SNPs, which will be useful for further fine-mapping studies, has been identified. Overall, the results suggest that as many as 75,000–100,000 tag SNPs would be needed to track all important haplotype blocks in the bovine genome. This would require ∼250,000 SNPs in the discovery phase. PMID:17435229
Yokoyama, Eiji; Hirai, Shinichiro; Ishige, Taichiro; Murakami, Satoshi
2018-01-02
Seventeen clusters of Shiga toxin-producing Escherichia coli O157:H7/- (O157) strains, determined by cluster analysis of pulsed-field gel electrophoresis patterns, were analyzed using whole genome sequence (WGS) data to investigate this pathogen's molecular epidemiology. The 17 clusters included 136 strains containing strains from nine outbreaks, with each outbreak caused by a single source contaminated with the organism, as shown by epidemiological contact surveys. WGS data of these strains were used to identify single nucleotide polymorphisms (SNPs) by two methods: short read data were directly mapped to a reference genome (mapping derived SNPs) and common SNPs between the mapping derived SNPs and SNPs in assembled data of short read data (common SNPs). Among both SNPs, those that were detected in genes with a gap were excluded to remove ambiguous SNPs from further analysis. The effectiveness of both SNPs was investigated among all the concatenated SNPs that were detected (whole SNP set); SNPs were divided into three categories based on the genes in which they were located (i.e., backbone SNP set, O-island SNP set, and mobile element SNP set); and SNPs in non-coding regions (intergenic region SNP set). When SNPs from strains isolated from the nine single source derived outbreaks were analyzed using an unweighted pair group method with arithmetic mean tree (UPGMA) and a minimum spanning tree (MST), the maximum pair-wise distances of the backbone SNP set of the mapping derived SNPs were significantly smaller than those of the whole and intergenic region SNP set on both UPGMAs and MSTs. This significant difference was also observed when the backbone SNP set of the common SNPs were examined (Steel-Dwass test, P≤0.01). When the maximum pair-wise distances were compared between the mapping derived and common SNPs, significant differences were observed in those of the whole, mobile element, and intergenic region SNP set (Wilcoxon signed rank test, P≤0.01). When all the strains included in one complex on an MST or one cluster on a UPGMA were designated as the same genotype, the values of the Hunter-Gaston Discriminatory Power Index for the backbone SNP set of the mapping derived and common SNPs were higher than those of other SNP sets. In contrast, the mobile element SNP set could not robustly subdivide lineage I strains of tested O157 strains using both the mapping derived and common SNPs. These results suggested that the backbone SNP set were the most effective for analysis of WGS data for O157 in enabling an appropriation of its molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.
Shetty, Priya B; Tang, Hua; Feng, Tao; Tayo, Bamidele; Morrison, Alanna C; Kardia, Sharon L R; Hanis, Craig L; Arnett, Donna K; Hunt, Steven C; Boerwinkle, Eric; Rao, Dabeeru C; Cooper, Richard S; Risch, Neil; Zhu, Xiaofeng
2015-02-01
Admixture mapping of lipids was followed-up by family-based association analysis to identify variants for cardiovascular disease in African Americans. The present study conducted admixture mapping analysis for total cholesterol, high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and triglycerides. The analysis was performed in 1905 unrelated African American subjects from the National Heart, Lung and Blood Institute's Family Blood Pressure Program (FBPP). Regions showing admixture evidence were followed-up with family-based association analysis in 3556 African American subjects from the FBPP. The admixture mapping and family-based association analyses were adjusted for age, age(2), sex, body mass index, and genome-wide mean ancestry to minimize the confounding caused by population stratification. Regions that were suggestive of local ancestry association evidence were found on chromosomes 7 (low-density lipoprotein cholesterol), 8 (high-density lipoprotein cholesterol), 14 (triglycerides), and 19 (total cholesterol and triglycerides). In the fine-mapping analysis, 52 939 single-nucleotide polymorphisms (SNPs) were tested and 11 SNPs (8 independent SNPs) showed nominal significant association with high-density lipoprotein cholesterol (2 SNPs), low-density lipoprotein cholesterol (4 SNPs), and triglycerides (5 SNPs). The family data were used in the fine-mapping to identify SNPs that showed novel associations with lipids and regions, including genes with known associations for cardiovascular disease. This study identified regions on chromosomes 7, 8, 14, and 19 and 11 SNPs from the fine-mapping analysis that were associated with high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, and triglycerides for further studies of cardiovascular disease in African Americans. © 2014 American Heart Association, Inc.
Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs
Adoue, Veronique; Schiavi, Alicia; Light, Nicholas; Almlöf, Jonas Carlsson; Lundmark, Per; Ge, Bing; Kwan, Tony; Caron, Maxime; Rönnblom, Lars; Wang, Chuan; Chen, Shu-Huang; Goodall, Alison H; Cambien, Francois; Deloukas, Panos; Ouwehand, Willem H; Syvänen, Ann-Christine; Pastinen, Tomi
2014-01-01
Most complex disease-associated genetic variants are located in non-coding regions and are therefore thought to be regulatory in nature. Association mapping of differential allelic expression (AE) is a powerful method to identify SNPs with direct cis-regulatory impact (cis-rSNPs). We used AE mapping to identify cis-rSNPs regulating gene expression in 55 and 63 HapMap lymphoblastoid cell lines from a Caucasian and an African population, respectively, 70 fibroblast cell lines, and 188 purified monocyte samples and found 40–60% of these cis-rSNPs to be shared across cell types. We uncover a new class of cis-rSNPs, which disrupt footprint-derived de novo motifs that are predominantly bound by repressive factors and are implicated in disease susceptibility through overlaps with GWAS SNPs. Finally, we provide the proof-of-principle for a new approach for genome-wide functional validation of transcription factor–SNP interactions. By perturbing NFκB action in lymphoblasts, we identified 489 cis-regulated transcripts with altered AE after NFκB perturbation. Altogether, we perform a comprehensive analysis of cis-variation in four cell populations and provide new tools for the identification of functional variants associated to complex diseases. PMID:25326100
Linkage maps of the Atlantic salmon (Salmo salar) genome derived from RAD sequencing
2014-01-01
Background Genetic linkage maps are useful tools for mapping quantitative trait loci (QTL) influencing variation in traits of interest in a population. Genotyping-by-sequencing approaches such as Restriction-site Associated DNA sequencing (RAD-Seq) now enable the rapid discovery and genotyping of genome-wide SNP markers suitable for the development of dense SNP linkage maps, including in non-model organisms such as Atlantic salmon (Salmo salar). This paper describes the development and characterisation of a high density SNP linkage map based on SbfI RAD-Seq SNP markers from two Atlantic salmon reference families. Results Approximately 6,000 SNPs were assigned to 29 linkage groups, utilising markers from known genomic locations as anchors. Linkage maps were then constructed for the four mapping parents separately. Overall map lengths were comparable between male and female parents, but the distribution of the SNPs showed sex-specific patterns with a greater degree of clustering of sire-segregating SNPs to single chromosome regions. The maps were integrated with the Atlantic salmon draft reference genome contigs, allowing the unique assignment of ~4,000 contigs to a linkage group. 112 genome contigs mapped to two or more linkage groups, highlighting regions of putative homeology within the salmon genome. A comparative genomics analysis with the stickleback reference genome identified putative genes closely linked to approximately half of the ordered SNPs and demonstrated blocks of orthology between the Atlantic salmon and stickleback genomes. A subset of 47 RAD-Seq SNPs were successfully validated using a high-throughput genotyping assay, with a correspondence of 97% between the two assays. Conclusions This Atlantic salmon RAD-Seq linkage map is a resource for salmonid genomics research as genotyping-by-sequencing becomes increasingly common. This is aided by the integration of the SbfI RAD-Seq SNPs with existing reference maps and the draft reference genome, as well as the identification of putative genes proximal to the SNPs. Differences in the distribution of recombination events between the sexes is evident, and regions of homeology have been identified which are reflective of the recent salmonid whole genome duplication. PMID:24571138
Almlöf, Jonas Carlsson; Lundmark, Per; Lundmark, Anders; Ge, Bing; Maouche, Seraya; Göring, Harald H. H.; Liljedahl, Ulrika; Enström, Camilla; Brocheton, Jessy; Proust, Carole; Godefroy, Tiphaine; Sambrook, Jennifer G.; Jolley, Jennifer; Crisp-Hihn, Abigail; Foad, Nicola; Lloyd-Jones, Heather; Stephens, Jonathan; Gwilliam, Rhian; Rice, Catherine M.; Hengstenberg, Christian; Samani, Nilesh J.; Erdmann, Jeanette; Schunkert, Heribert; Pastinen, Tomi; Deloukas, Panos; Goodall, Alison H.; Ouwehand, Willem H.; Cambien, François; Syvänen, Ann-Christine
2012-01-01
A large number of genome-wide association studies have been performed during the past five years to identify associations between SNPs and human complex diseases and traits. The assignment of a functional role for the identified disease-associated SNP is not straight-forward. Genome-wide expression quantitative trait locus (eQTL) analysis is frequently used as the initial step to define a function while allele-specific gene expression (ASE) analysis has not yet gained a wide-spread use in disease mapping studies. We compared the power to identify cis-acting regulatory SNPs (cis-rSNPs) by genome-wide allele-specific gene expression (ASE) analysis with that of traditional expression quantitative trait locus (eQTL) mapping. Our study included 395 healthy blood donors for whom global gene expression profiles in circulating monocytes were determined by Illumina BeadArrays. ASE was assessed in a subset of these monocytes from 188 donors by quantitative genotyping of mRNA using a genome-wide panel of SNP markers. The performance of the two methods for detecting cis-rSNPs was evaluated by comparing associations between SNP genotypes and gene expression levels in sample sets of varying size. We found that up to 8-fold more samples are required for eQTL mapping to reach the same statistical power as that obtained by ASE analysis for the same rSNPs. The performance of ASE is insensitive to SNPs with low minor allele frequencies and detects a larger number of significantly associated rSNPs using the same sample size as eQTL mapping. An unequivocal conclusion from our comparison is that ASE analysis is more sensitive for detecting cis-rSNPs than standard eQTL mapping. Our study shows the potential of ASE mapping in tissue samples and primary cells which are difficult to obtain in large numbers. PMID:23300628
Development of a set of SNP markers present in expressed genes of the apple.
Chagné, David; Gasic, Ksenija; Crowhurst, Ross N; Han, Yuepeng; Bassett, Heather C; Bowatte, Deepa R; Lawrence, Timothy J; Rikkerink, Erik H A; Gardiner, Susan E; Korban, Schuyler S
2008-11-01
Molecular markers associated with gene coding regions are useful tools for bridging functional and structural genomics. Due to their high abundance in plant genomes, single nucleotide polymorphisms (SNPs) are present within virtually all genomic regions, including most coding sequences. The objective of this study was to develop a set of SNPs for the apple by taking advantage of the wealth of genomics resources available for the apple, including a large collection of expressed sequenced tags (ESTs). Using bioinformatics tools, a search for SNPs within an EST database of approximately 350,000 sequences developed from a variety of apple accessions was conducted. This resulted in the identification of a total of 71,482 putative SNPs. As the apple genome is reported to be an ancient polyploid, attempts were made to verify whether those SNPs detected in silico were attributable either to allelic polymorphisms or to gene duplication or paralogous or homeologous sequence variations. To this end, a set of 464 PCR primer pairs was designed, PCR was amplified using two subsets of plants, and the PCR products were sequenced. The SNPs retrieved from these sequences were then mapped onto apple genetic maps, including a newly constructed map of a Royal Gala x A689-24 cross and a Malling 9 x Robusta 5, map using a bin mapping strategy. The SNP genotyping was performed using the high-resolution melting (HRM) technique. A total of 93 new markers containing 210 coding SNPs were successfully mapped. This new set of SNP markers for the apple offers new opportunities for understanding the genetic control of important horticultural traits using quantitative trait loci (QTL) or linkage disequilibrium analysis. These also serve as useful markers for aligning physical and genetic maps, and as potential transferable markers across the Rosaceae family.
Hong, Kyung-Won; Min, Haesook; Heo, Byeong-Mun; Joo, Seong Eun; Kim, Sung Soo; Kim, Yeonjung
2012-06-01
Increased pulse pressure (PP) and decreased mean arterial pressure (MAP) are strong prognostic predictors of adverse cardiovascular events. Recently, the International Consortium for Blood Pressure Genome-Wide Association Studies (ICBP-GWAS) reported eight loci that influenced PP and MAP. The ICBP-GWAS examined 51 cohorts--comprising 122 671 individuals of European ancestry--and identified eight SNPs: five that governed PP and three that controlled MAP. Six of these loci were novel. To replicate these newly identified loci and examine genetic architecture of PP and MAP between European and Asian populations, we conducted a meta-analysis of the eight SNPs combining data from ICBP and general population-based Korean cohorts. Two SNPs (rs13002573 (FIGN) and rs871606 (CHIC2)) for PP and two SNPs (rs1446468 (FIGN) and rs319690 (MAP4)) for MAP were replicated in Koreans. Although our GWAS only found moderate association, we believe that the findings promote us to propose that a similar genetic architecture governs PP and MAP in Asians and Europeans. However, further studies will be needed to confirm the possibility using other Asian population.
Construction of an almond linkage map in an Australian population Nonpareil × Lauranne
2010-01-01
Background Despite a high genetic similarity to peach, almonds (Prunus dulcis) have a fleshless fruit and edible kernel, produced as a crop for human consumption. While the release of peach genome v1.0 provides an excellent opportunity for almond genetic and genomic studies, well-assessed segregating populations and the respective saturated genetic linkage maps lay the foundation for such studies to be completed in almond. Results Using an almond intraspecific cross between 'Nonpareil' and 'Lauranne' (N × L), we constructed a moderately saturated map with SSRs, SNPs, ISSRs and RAPDs. The N × L map covered 591.4 cM of the genome with 157 loci. The average marker distance of the map was 4.0 cM. The map displayed high synteny and colinearity with the Prunus T × E reference map in all eight linkage groups (G1-G8). The positions of 14 mapped gene-anchored SNPs corresponded approximately with the positions of homologous sequences in the peach genome v1.0. Analysis of Mendelian segregation ratios showed that 17.9% of markers had significantly skewed genotype ratios at the level of P < 0.05. Due to the large number of skewed markers in the linkage group 7, the potential existence of deleterious gene(s) was assessed in the group. Integrated maps produced by two different mapping methods using JoinMap® 3 were compared, and their high degree of similarity was evident despite the positional inconsistency of a few markers. Conclusions We presented a moderately saturated Australian almond map, which is highly syntenic and collinear with the Prunus reference map and peach genome V1.0. Therefore, the well-assessed almond population reported here can be used to investigate the traits of interest under Australian growing conditions, and provides more information on the almond genome for the international community. PMID:20932335
Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.
2015-01-01
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960
SEAN: SNP prediction and display program utilizing EST sequence clusters.
Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek
2006-02-15
SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.
HapMap tagSNP transferability in multiple populations: general guidelines
Xing, Jinchuan; Witherspoon, David J.; Watkins, W. Scott; Zhang, Yuhua; Tolpinrud, Whitney; Jorde, Lynn B.
2008-01-01
This PDF receipt will only be used as the basis for generating PubMed Central (PMC) documents. PMC documents will be made available for review after conversion (approx. 2–3 weeks time). Any corrections that need to be made will be done at that time. No materials will be released to PMC without the approval of an author. Only the PMC documents will appear on PubMed Central -- this PDF Receipt will not appear on PubMed Central. Linkage disequilibrium (LD) has received much recent attention because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r2 >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations. PMID:18482828
Hong, Yanbin; Pandey, Manish K; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K; Liang, Xuanqiang; Huang, Shangzhi
2015-01-01
The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. Presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than the EST-SNPs with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. Total 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of the total 33 SNPs, nine SNPs showed polymorphism in the mapping population Y13Zh, and seven SNPs were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut.
Montanari, Sara; Saeed, Munazza; Knäbel, Mareike; Kim, YoonKyeong; Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E; Crowhurst, Ross N; Chagné, David
2013-01-01
We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear ('Old Home'×'Louise Bon Jersey') and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality.
Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M
2015-04-09
A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.
A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns.
De La Vega, Francisco M; Isaac, Hadar I; Scafe, Charles R
2006-01-01
The design of genetic association studies using single-nucleotide polymorphisms (SNPs) requires the selection of subsets of the variants providing high statistical power at a reasonable cost. SNPs must be selected to maximize the probability that a causative mutation is in linkage disequilibrium (LD) with at least one marker genotyped in the study. The HapMap project performed a genome-wide survey of genetic variation with about a million SNPs typed in four populations, providing a rich resource to inform the design of association studies. A number of strategies have been proposed for the selection of SNPs based on observed LD, including construction of metric LD maps and the selection of haplotype tagging SNPs. Power calculations are important at the study design stage to ensure successful results. Integrating these methods and annotations can be challenging: the algorithms required to implement these methods are complex to deploy, and all the necessary data and annotations are deposited in disparate databases. Here, we present the SNPbrowser Software, a freely available tool to assist in the LD-based selection of markers for association studies. This stand-alone application provides fast query capabilities and swift visualization of SNPs, gene annotations, power, haplotype blocks, and LD map coordinates. Wizards implement several common SNP selection workflows including the selection of optimal subsets of SNPs (e.g. tagging SNPs). Selected SNPs are screened for their conversion potential to either TaqMan SNP Genotyping Assays or the SNPlex Genotyping System, two commercially available genotyping platforms, expediting the set-up of genetic studies with an increased probability of success.
USDA-ARS?s Scientific Manuscript database
Using next generation sequencing technology the International Swine SNP Consortium has identified 500,000 SNPs and used these to design an Illumina Infinium iSelect™ SNP BeadChip with a selection of 60,218 SNPs. The selected SNPs include previously validated SNPs and SNPs identified de novo using se...
Marker-assisted backcross approach for important agronomic traits of sorghum
USDA-ARS?s Scientific Manuscript database
Sequencing technologies are useful for identification of thousands of single nucleotide polymorphisms (SNPs) in a cost effective manner. QTL mapping, association mapping and Mutmap approaches provide opportunities for use of such SNPs to associate and identify genes that control important agronomic ...
Ribeiro, Antonio; Golicz, Agnieszka; Hackett, Christine Anne; Milne, Iain; Stephen, Gordon; Marshall, David; Flavell, Andrew J; Bayer, Micha
2015-11-11
Single Nucleotide Polymorphisms (SNPs) are widely used molecular markers, and their use has increased massively since the inception of Next Generation Sequencing (NGS) technologies, which allow detection of large numbers of SNPs at low cost. However, both NGS data and their analysis are error-prone, which can lead to the generation of false positive (FP) SNPs. We explored the relationship between FP SNPs and seven factors involved in mapping-based variant calling - quality of the reference sequence, read length, choice of mapper and variant caller, mapping stringency and filtering of SNPs by read mapping quality and read depth. This resulted in 576 possible factor level combinations. We used error- and variant-free simulated reads to ensure that every SNP found was indeed a false positive. The variation in the number of FP SNPs generated ranged from 0 to 36,621 for the 120 million base pairs (Mbp) genome. All of the experimental factors tested had statistically significant effects on the number of FP SNPs generated and there was a considerable amount of interaction between the different factors. Using a fragmented reference sequence led to a dramatic increase in the number of FP SNPs generated, as did relaxed read mapping and a lack of SNP filtering. The choice of reference assembler, mapper and variant caller also significantly affected the outcome. The effect of read length was more complex and suggests a possible interaction between mapping specificity and the potential for contributing more false positives as read length increases. The choice of tools and parameters involved in variant calling can have a dramatic effect on the number of FP SNPs produced, with particularly poor combinations of software and/or parameter settings yielding tens of thousands in this experiment. Between-factor interactions make simple recommendations difficult for a SNP discovery pipeline but the quality of the reference sequence is clearly of paramount importance. Our findings are also a stark reminder that it can be unwise to use the relaxed mismatch settings provided as defaults by some read mappers when reads are being mapped to a relatively unfinished reference sequence from e.g. a non-model organism in its early stages of genomic exploration.
Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E.; Crowhurst, Ross N.; Chagné, David
2013-01-01
We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear (‘Old Home’בLouise Bon Jersey’) and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality. PMID:24155917
Huang, Shunmou; Yang, Hongli; Zhan, Gaomiao; Wang, Xinfa; Liu, Guihua; Wang, Hanzhong
2012-01-01
Background Single nucleotide polymorphisms (SNPs) are an important class of genetic marker for target gene mapping. As of yet, there is no rapid and effective method to identify SNPs linked with agronomic traits in rapeseed and other crop species. Methodology/Principal Findings We demonstrate a novel method for identifying SNP markers in rapeseed by deep sequencing a representative library and performing bulk segregant analysis. With this method, SNPs associated with rapeseed pod shatter-resistance were discovered. Firstly, a reduced representation of the rapeseed genome was used. Genomic fragments ranging from 450–550 bp were prepared from the susceptible bulk (ten F2 plants with the silique shattering resistance index, SSRI <0.10) and the resistance bulk (ten F2 plants with SSRI >0.90), and also Solexa sequencing-produced 90 bp reads. Approximately 50 million of these sequence reads were assembled into contigs to a depth of 20-fold coverage. Secondly, 60,396 ‘simple SNPs’ were identified, and the statistical significance was evaluated using Fisher's exact test. There were 70 associated SNPs whose –log10 p value over 16 were selected to be further analyzed. The distribution of these SNPs appeared a tight cluster, which consisted of 14 associated SNPs within a 396 kb region on chromosome A09. Our evidence indicates that this region contains a major quantitative trait locus (QTL). Finally, two associated SNPs from this region were mapped on a major QTL region. Conclusions/Significance 70 associated SNPs were discovered and a major QTL for rapeseed pod shatter-resistance was found on chromosome A09 using our novel method. The associated SNP markers were used for mapping of the QTL, and may be useful for improving pod shatter-resistance in rapeseed through marker-assisted selection and map-based cloning. This approach will accelerate the discovery of major QTLs and the cloning of functional genes for important agronomic traits in rapeseed and other crop species. PMID:22529909
Association of Common Genetic Variants in the MAP4K4 Locus with Prediabetic Traits in Humans
Ketterer, Caroline; Heni, Martin; Machicao, Fausto; Guilherme, Adilson; Grallert, Harald; Schulze, Matthias B.; Boeing, Heiner; Stefan, Norbert; Fritsche, Andreas; Czech, Michael P.; Häring, Hans-Ulrich
2012-01-01
Mitogen-activated protein kinase kinase kinase kinase 4 (MAP4K4) is expressed in all diabetes-relevant tissues and mediates cytokine-induced insulin resistance. We investigated whether common single nucleotide polymorphisms (SNPs) in the MAP4K4 locus associate with glucose intolerance, insulin resistance, impaired insulin release, or elevated plasma cytokines. The best hit was tested for association with type 2 diabetes. Subjects (N = 1,769) were recruited from the Tübingen Family (TÜF) study for type 2 diabetes and genotyped for tagging SNPs. In a subgroup, cytokines were measured. Association with type 2 diabetes was tested in a prospective case-cohort study (N = 2,971) derived from the EPIC-Potsdam study. Three SNPs (rs6543087, rs17801985, rs1003376) revealed nominal and two SNPs (rs11674694, rs11678405) significant associations with 2-hour glucose levels. SNPs rs6543087 and rs11674694 were also nominally associated with decreased insulin sensitivity. Another two SNPs (rs2236936, rs2236935) showed associations with reduced insulin release, driven by effects in lean subjects only. Three SNPs (rs11674694, rs13003883, rs2236936) revealed nominal associations with IL-6 levels. SNP rs11674694 was significantly associated with type 2 diabetes. In conclusion, common variation in MAP4K4 is associated with insulin resistance and β-cell dysfunction, possibly via this gene’s role in inflammatory signalling. This variation’s impact on insulin sensitivity may be more important since its effect on insulin release vanishes with increasing BMI. PMID:23094072
SNP discovery by high-throughput sequencing in soybean
2010-01-01
Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD), have been widely applied to identify genome-wide sequence variations. However, it is still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 K bp. Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in a same genetic population, which can be applied to other crops. PMID:20701770
The Population Genomics of Repeated Evolution in the Blind Cavefish Astyanax mexicanus
Bradic, Martina; Teotónio, Henrique; Borowsky, Richard L.
2013-01-01
Distinct populations of Astyanax mexicanus cavefish offer striking examples of repeatable convergence or parallelism in their independent evolutions from surface to cave phenotypes. However, the extent to which the repeatability of evolution occurred at the genetic level remains poorly understood. To address this, we first characterized the genetic diversity of 518 single-nucleotide polymorphisms (SNPs), obtained through RAD tag sequencing and distributed throughout the genome, in seven cave and three groups of surface populations. The cave populations represented two distinct lineages (old and new). Thirty-one SNPs were significantly differentiated between surface and old cave populations, two SNPs were differentiated between surface and new cave populations, and 44 SNPs were significantly differentiated in both old and new cave populations. In addition, we determined whether these SNPs map to the same locations of previously described quantitative trait loci (QTL) between surface and cave populations. A total of 25 differentiated SNPs co-map with several QTL, such as one containing a fibroblast growth factor gene (Fgf8) involved in eye development and lens size. Further, the identity of many SNPs that co-mapped with QTL was the same in independently derived cave populations. These conclusions were further confirmed by haplotype analyses of SNPs within QTL regions. Our findings indicate that the repeatability of evolution at the genetic level is substantial, suggesting that ancestral standing genetic variation significantly contributed to the population genetic variability used in adaptation to the cave environment. PMID:23927992
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are ideally suited for the construction of high-resolution genetic maps, studying population evolutionary history and performing genome-wide association mapping experiments. Here we used a genome-wide set of 1536 SNPs to study linkage disequilibrium (LD) and po...
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Shirasawa, Kenta; Tanaka, Masaru; Takahata, Yasuhiro; ...
2017-03-10
Sweetpotato (Ipomoea batatas) is an autohexaploid species with 90 chromosomes (2n = 6x = 90) and a basic chromosome number of 15, and is therefore regarded as one of the most challenging species for high-density genetic map construction. Here, we used single nucleotide polymorphisms (SNPs) identified by double-digest restriction site-associated DNA sequencing based on next-generation sequencing technology to construct a map for sweetpotato. We then aligned the sequence reads onto the reference genome sequence of I. trifida, a likely diploid ancestor of sweetpotato, to detect SNPs. In addition, to simplify analysis of the complex genetic mode of autohexaploidy, we usedmore » an S1 mapping population derived from self-pollination of a single parent. As a result, 28,087 double-simplex SNPs showing a Mendelian segregation ratio in the S1 progeny could be mapped onto 96 linkage groups (LGs), covering a total distance of 33,020.4 cM. Based on the positions of the SNPs on the I. trifida genome, the LGs were classified into 15 groups, each with roughly six LGs and six small extra groups. The molecular genetic techniques used in this study are applicable to high-density mapping of other polyploid plant species, including important crops.« less
Roshandel, Delnaz; Thomson, Wendy; Pye, Stephen R.; Boonen, Steven; Borghs, Herman; Vanderschueren, Dirk; Huhtaniemi, Ilpo T.; Adams, Judith E.; Ward, Kate A.; Bartfai, Gyorgy; Casanueva, Felipe F.; Finn, Joseph D.; Forti, Gianni; Giwercman, Aleksander; Han, Thang S.; Kula, Krzysztof; Lean, Michael E.; Pendleton, Neil; Punab, Margus; Wu, Frederick C.
2011-01-01
Introduction In this study, we aimed to investigate the association between single nucleotide polymorphisms (SNPs) within two genes involved in the NF-κB cascade (GPR177 and MAP3K14) and bone mineral density (BMD) assessed at different skeletal sites, radial geometric parameters and bone turnover. Methods Ten GPR177 SNPs previously associated with BMD with genome-wide significance and twelve tag SNPs (r2≥0.8) within MAP3K14 (±10 kb) were genotyped in 2359 men aged 40–79 years recruited from 8 centres for participation in the European Male Aging Study (EMAS). Measurement of bone turnover markers (PINP and CTX-I) in the serum and quantitative ultrasound (QUS) at the calcaneus were performed in all centres. Dual energy X-ray absorptiometry (DXA), at the lumbar spine and hip, and peripheral quantitative computed tomography (pQCT), at the distal and midshaft radius, were performed in a subsample (2 centres). Linear regression was used to test for association between the SNPs and bone measures under an additive genetic model adjusting for study centre. Results We validated the associations between SNPs in GPR177 and BMDa previously reported and also observed evidence of pleiotrophic effects on density and geometry. Rs2772300 in GPR177 was associated with increased total hip and LS BMDa, increased total and cortical vBMD at the radius and increased cortical area, thickness and stress strain index. We also found evidence of association with BMDa, vBMD, geometric parameters and CTX-I for SNPs in MAP3K14. None of the GPR177 and MAP3K14 SNPs were associated with calcaneal estimated BMD measured by QUS. Conclusion Our findings suggest that SNPs in GPR177 and MAP3K14 involved in the NF-κB signalling pathway influence bone mineral density, geometry and turnover in a population-based cohort of middle aged and elderly men. This adds to the understanding of the role of genetic variation in this pathway in determining bone health. PMID:22132199
Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl
2017-01-01
Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
HapMap filter 1.0: a tool to preprocess the HapMap genotypic data for association studies.
Zhang, Wei; Duan, Shiwei; Dolan, M Eileen
2008-05-13
The International HapMap Project provides a resource of genotypic data on single nucleotide polymorphisms (SNPs), which can be used in various association studies to identify the genetic determinants for phenotypic variations. Prior to the association studies, the HapMap dataset should be preprocessed in order to reduce the computation time and control the multiple testing problem. The less informative SNPs including those with very low genotyping rate and SNPs with rare minor allele frequencies to some extent in one or more population are removed. Some research designs only use SNPs in a subset of HapMap cell lines. Although the HapMap website and other association software packages have provided some basic tools for optimizing these datasets, a fast and user-friendly program to generate the output for filtered genotypic data would be beneficial for association studies. Here, we present a flexible, straight-forward bioinformatics program that can be useful in preparing the HapMap genotypic data for association studies by specifying cell lines and two common filtering criteria: minor allele frequencies and genotyping rate. The software was developed for Microsoft Windows and written in C++. The Windows executable and source code in Microsoft Visual C++ are available at Google Code (http://hapmap-filter-v1.googlecode.com/) or upon request. Their distribution is subject to GNU General Public License v3.
Roorkiwal, Manish; Jain, Ankit; Kale, Sandip M; Doddamani, Dadakhalandar; Chitikineni, Annapurna; Thudi, Mahendar; Varshney, Rajeev K
2018-04-01
To accelerate genomics research and molecular breeding applications in chickpea, a high-throughput SNP genotyping platform 'Axiom ® CicerSNP Array' has been designed, developed and validated. Screening of whole-genome resequencing data from 429 chickpea lines identified 4.9 million SNPs, from which a subset of 70 463 high-quality nonredundant SNPs was selected using different stringent filter criteria. This was further narrowed down to 61 174 SNPs based on p-convert score ≥0.3, of which 50 590 SNPs could be tiled on array. Among these tiled SNPs, a total of 11 245 SNPs (22.23%) were from the coding regions of 3673 different genes. The developed Axiom ® CicerSNP Array was used for genotyping two recombinant inbred line populations, namely ICCRIL03 (ICC 4958 × ICC 1882) and ICCRIL04 (ICC 283 × ICC 8261). Genotyping data reflected high success and polymorphic rate, with 15 140 (29.93%; ICCRIL03) and 20 018 (39.57%; ICCRIL04) polymorphic SNPs. High-density genetic maps comprising 13 679 SNPs spanning 1033.67 cM and 7769 SNPs spanning 1076.35 cM were developed for ICCRIL03 and ICCRIL04 populations, respectively. QTL analysis using multilocation, multiseason phenotyping data on these RILs identified 70 (ICCRIL03) and 120 (ICCRIL04) main-effect QTLs on genetic map. Higher precision and potential of this array is expected to advance chickpea genetics and breeding applications. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Muchero, Wellington; Diop, Ndeye N; Bhat, Prasanna R; Fenton, Raymond D; Wanamaker, Steve; Pottorff, Marti; Hearne, Sarah; Cisse, Ndiaga; Fatokun, Christian; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J
2009-10-27
Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
Mapping a Mutation in "Caenorhabditis elegans" Using a Polymerase Chain Reaction-Based Approach
ERIC Educational Resources Information Center
Myers, Edith M.
2014-01-01
Many single nucleotide polymorphisms (SNPs) have been identified within the "Caenorhabditis elegans" genome. SNPs present in the genomes of two isogenic "C. elegans" strains have been routinely used as a tool in forward genetics to map a mutation to a particular chromosome. This article describes a laboratory exercise in which…
Novel and efficient tag SNPs selection algorithms.
Chen, Wen-Pei; Hung, Che-Lun; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2014-01-01
SNPs are the most abundant forms of genetic variations amongst species; the association studies between complex diseases and SNPs or haplotypes have received great attention. However, these studies are restricted by the cost of genotyping all SNPs; thus, it is necessary to find smaller subsets, or tag SNPs, representing the rest of the SNPs. In fact, the existing tag SNP selection algorithms are notoriously time-consuming. An efficient algorithm for tag SNP selection was presented, which was applied to analyze the HapMap YRI data. The experimental results show that the proposed algorithm can achieve better performance than the existing tag SNP selection algorithms; in most cases, this proposed algorithm is at least ten times faster than the existing methods. In many cases, when the redundant ratio of the block is high, the proposed algorithm can even be thousands times faster than the previously known methods. Tools and web services for haplotype block analysis integrated by hadoop MapReduce framework are also developed using the proposed algorithm as computation kernels.
SNP-VISTA: An interactive SNP visualization tool
Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L
2005-01-01
Background Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID:16336665
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-01-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1–8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. PMID:25762582
Jasinska, A.J.; Service, S.; Jawaheer, D.; DeYoung, J.; Levinson, M.; Zhang, Z.; Kremeyer, B.; Muller, H.; Aldana, I.; Garcia, J.; Restrepo, G.; Lopez, C.; Palacio, C.; Duque, C.; Parra, M.; Vega, J.; Ortiz, D.; Bedoya, G.; Mathews, C.; Davanzo, P.; Fournier, E.; Bejarano, J.; Ramirez, M.; Ortiz, C. Araya; Araya, X.; Molina, J.; Sabatti, C.; Reus, V.; Ospina, J.; Macaya, G.; Ruiz-Linares, A.; Freimer, N.B.
2016-01-01
We previously reported linkage of bipolar disorder to 5q33-q34 in families from two closely related population isolates, the Central Valley of Costa Rica (CVCR) and Antioquia, Colombia (CO). Here we present follow up results from fine-scale mapping in large CVCR and CO families segregating severe bipolar disorder, BP-I, and in 343 population trios/duos from CVCR and CO. Employing densely spaced SNPs to fine map the prior linkage peak region increases linkage evidence and clarifies the position of the putative BP-I locus. We performed two-point linkage analysis with 1134 SNPs in an approximately 9 Mb region between markers D5S410 and D5S422. Combining pedigrees from CVCR and CO yields a LOD score of 4.9 at SNP rs10035961. Two other SNPs (rs7721142 and rs1422795) within the same 94 kb region also displayed LOD scores greater than 4. This linkage peak coincides with our prior microsatellite results and suggests a narrowed BP-I susceptibility regions in these families. To investigate if the locus implicated in the familial form of BP-I also contributes to disease risk in the population, we followed up the family results with association analysis in duo and trio samples, obtaining signals within 2 Mb of the peak linkage signal in the pedigrees; rs12523547 and rs267015 (P = 0.00004 and 0.00016, respectively) in the CO sample and rs244960 in the CVCR sample and the combined sample, with P = 0.00032 and 0.00016, respectively. It remains unclear whether these association results reflect the same locus contributing to BP susceptibility within the extended pedigrees. PMID:19319892
Fast and robust group-wise eQTL mapping using sparse graphical models.
Cheng, Wei; Shi, Yu; Zhang, Xiang; Wang, Wei
2015-01-16
Genome-wide expression quantitative trait loci (eQTL) studies have emerged as a powerful tool to understand the genetic basis of gene expression and complex traits. The traditional eQTL methods focus on testing the associations between individual single-nucleotide polymorphisms (SNPs) and gene expression traits. A major drawback of this approach is that it cannot model the joint effect of a set of SNPs on a set of genes, which may correspond to hidden biological pathways. We introduce a new approach to identify novel group-wise associations between sets of SNPs and sets of genes. Such associations are captured by hidden variables connecting SNPs and genes. Our model is a linear-Gaussian model and uses two types of hidden variables. One captures the set associations between SNPs and genes, and the other captures confounders. We develop an efficient optimization procedure which makes this approach suitable for large scale studies. Extensive experimental evaluations on both simulated and real datasets demonstrate that the proposed methods can effectively capture both individual and group-wise signals that cannot be identified by the state-of-the-art eQTL mapping methods. Considering group-wise associations significantly improves the accuracy of eQTL mapping, and the successful multi-layer regression model opens a new approach to understand how multiple SNPs interact with each other to jointly affect the expression level of a group of genes.
Zhu, Qian-Hao; Spriggs, Andrew; Taylor, Jennifer M.; Llewellyn, Danny; Wilson, Iain
2014-01-01
Varietal single nucleotide polymorphisms (SNPs) are the differences within one of the two subgenomes between different tetraploid cotton varieties and have not been practically used in cotton genetics and breeding because they are difficult to identify due to low genetic diversity and very high sequence identity between homeologous genes in cotton. We have used transcriptome and restriction site−associated DNA sequencing to identify varietal SNPs among 18 G. hirsutum varieties based on the rationale that varietal SNPs can be more confidently called when flanked by subgenome-specific SNPs. Using transcriptome data, we successfully identified 37,413 varietal SNPs and, of these, 22,121 did not have an additional varietal SNP within their 20-bp flanking regions so can be used in most SNP genotyping assays. From restriction site−associated DNA sequencing data, we identified an additional 3090 varietal SNPs between two of the varieties. Of the 1583 successful SNP assays achieved using different genotyping platforms, 1363 were verified. Many of the SNPs behaved as dominant markers because of coamplification from homeologous loci, but the number of SNPs acting as codominant markers increased when one or more subgenome-specific SNP(s) were incorporated in their assay primers, giving them greater utility for breeding applications. A G. hirsutum genetic map with 1244 SNP markers was constructed covering 5557.42 centiMorgan and used to map qualitative and quantitative traits. This collection of G. hirsutum varietal SNPs complements existing intra-specific SNPs and provides the cotton community with a valuable marker resource applicable to genetic analyses and breeding programs. PMID:25106949
Linear reduction method for predictive and informative tag SNP selection.
He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander
2005-01-01
Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.
Sung, Yun J; Gu, C Charles; Tiwari, Hemant K; Arnett, Donna K; Broeckel, Ulrich; Rao, Dabeeru C
2012-07-01
Genotype imputation provides imputation of untyped single nucleotide polymorphisms (SNPs) that are present on a reference panel such as those from the HapMap Project. It is popular for increasing statistical power and comparing results across studies using different platforms. Imputation for African American populations is challenging because their linkage disequilibrium blocks are shorter and also because no ideal reference panel is available due to admixture. In this paper, we evaluated three imputation strategies for African Americans. The intersection strategy used a combined panel consisting of SNPs polymorphic in both CEU and YRI. The union strategy used a panel consisting of SNPs polymorphic in either CEU or YRI. The merge strategy merged results from two separate imputations, one using CEU and the other using YRI. Because recent investigators are increasingly using the data from the 1000 Genomes (1KG) Project for genotype imputation, we evaluated both 1KG-based imputations and HapMap-based imputations. We used 23,707 SNPs from chromosomes 21 and 22 on Affymetrix SNP Array 6.0 genotyped for 1,075 HyperGEN African Americans. We found that 1KG-based imputations provided a substantially larger number of variants than HapMap-based imputations, about three times as many common variants and eight times as many rare and low-frequency variants. This higher yield is expected because the 1KG panel includes more SNPs. Accuracy rates using 1KG data were slightly lower than those using HapMap data before filtering, but slightly higher after filtering. The union strategy provided the highest imputation yield with next highest accuracy. The intersection strategy provided the lowest imputation yield but the highest accuracy. The merge strategy provided the lowest imputation accuracy. We observed that SNPs polymorphic only in CEU had much lower accuracy, reducing the accuracy of the union strategy. Our findings suggest that 1KG-based imputations can facilitate discovery of significant associations for SNPs across the whole MAF spectrum. Because the 1KG Project is still under way, we expect that later versions will provide better imputation performance. © 2012 Wiley Periodicals, Inc.
SNP-markers in Allium species to facilitate introgression breeding in onion.
Scholten, Olga E; van Kaauwen, Martijn P W; Shahin, Arwa; Hendrickx, Patrick M; Keizer, L C Paul; Burger, Karin; van Heusden, Adriaan W; van der Linden, C Gerard; Vosman, Ben
2016-08-31
Within onion, Allium cepa L., the availability of disease resistance is limited. The identification of sources of resistance in related species, such as Allium roylei and Allium fistulosum, was a first step towards the improvement of onion cultivars by breeding. SNP markers linked to resistance and polymorphic between these related species and onion cultivars are a valuable tool to efficiently introgress disease resistance genes. In this paper we describe the identification and validation of SNP markers valuable for onion breeding. Transcriptome sequencing resulted in 192 million RNA seq reads from the interspecific F1 hybrid between A. roylei and A. fistulosum (RF) and nine onion cultivars. After assembly, reliable SNPs were discovered in about 36 % of the contigs. For genotyping of the interspecific three-way cross population, derived from a cross between an onion cultivar and the RF (CCxRF), 1100 SNPs that are polymorphic in RF and monomorphic in the onion cultivars (RF SNPs) were selected for the development of KASP assays. A molecular linkage map based on 667 RF-SNP markers was constructed for CCxRF. In addition, KASP assays were developed for 1600 onion-SNPs (SNPs polymorphic among onion cultivars). A second linkage map was constructed for an F2 of onion x A. roylei (F2(CxR)) that consisted of 182 onion-SNPs and 119 RF-SNPs, and 76 previously mapped markers. Markers co-segregating in both the F2(CxR) and the CCxRF population were used to assign the linkage groups of RF to onion chromosomes. To validate usefulness of these SNP markers, QTL mapping was applied in the CCxRF population that segregates for resistance to Botrytis squamosa and resulted in a QTL for resistance on chromosome 6 of A. roylei. Our research has more than doubled the publicly available marker sequences of expressed onion genes and two onion-related species. It resulted in a detailed genetic map for the interspecific CCxRF population. This is the first paper that reports the detection of a QTL for resistance to B. squamosa in A. roylei.
selectSNP – An R package for selecting SNPs optimal for genetic evaluation
USDA-ARS?s Scientific Manuscript database
There has been a huge increase in the number of SNPs in the public repositories. This has made it a challenge to design low and medium density SNP panels, which requires careful selection of available SNPs considering many criteria, such as map position, allelic frequency, possible biological functi...
Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed
2017-08-07
Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.
Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Carlson, Christopher S.; Casey, Graham; Chang-Claude, Jenny; Conti, David V.; Curtis, Keith R.; Duggan, David; Gallinger, Steven; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Jenkins, Mark A.; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M.; Newcomb, Polly A.; Nickerson, Deborah A.; Potter, John D.; Schoen, Robert E.; Schumacher, Fredrick R.; Seminara, Daniela; Slattery, Martha L.; Hsu, Li; Chan, Andrew T.; White, Emily; Berndt, Sonja I.; Peters, Ulrike
2016-01-01
Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672
USDA-ARS?s Scientific Manuscript database
Single-nucleotide polymorphisms (SNPs) are the most common genetic markers in Theobroma cacao, occurring approximately once in every 200 nucleotides. SNPs, like microsatellites, are co-dominant and PCR-based, but they have several advantages over microsatellites. They are unambiguous, so that a SN...
Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N
2017-05-12
The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S 1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bin) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai
2014-01-01
The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047
Exploiting Genome Structure in Association Analysis
Kim, Seyoung
2014-01-01
Abstract A genome-wide association study involves examining a large number of single-nucleotide polymorphisms (SNPs) to identify SNPs that are significantly associated with the given phenotype, while trying to reduce the false positive rate. Although haplotype-based association methods have been proposed to accommodate correlation information across nearby SNPs that are in linkage disequilibrium, none of these methods directly incorporated the structural information such as recombination events along chromosome. In this paper, we propose a new approach called stochastic block lasso for association mapping that exploits prior knowledge on linkage disequilibrium structure in the genome such as recombination rates and distances between adjacent SNPs in order to increase the power of detecting true associations while reducing false positives. Following a typical linear regression framework with the genotypes as inputs and the phenotype as output, our proposed method employs a sparsity-enforcing Laplacian prior for the regression coefficients, augmented by a first-order Markov process along the sequence of SNPs that incorporates the prior information on the linkage disequilibrium structure. The Markov-chain prior models the structural dependencies between a pair of adjacent SNPs, and allows us to look for association SNPs in a coupled manner, combining strength from multiple nearby SNPs. Our results on HapMap-simulated datasets and mouse datasets show that there is a significant advantage in incorporating the prior knowledge on linkage disequilibrium structure for marker identification under whole-genome association. PMID:21548809
Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation.
Ghoussaini, Maya; Edwards, Stacey L; Michailidou, Kyriaki; Nord, Silje; Cowper-Sal Lari, Richard; Desai, Kinjal; Kar, Siddhartha; Hillman, Kristine M; Kaufmann, Susanne; Glubb, Dylan M; Beesley, Jonathan; Dennis, Joe; Bolla, Manjeet K; Wang, Qin; Dicks, Ed; Guo, Qi; Schmidt, Marjanka K; Shah, Mitul; Luben, Robert; Brown, Judith; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Klevebring, Daniel; Bojesen, Stig E; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Lambrechts, Diether; Thienpont, Bernard; Neven, Patrick; Wildiers, Hans; Broeks, Annegien; Van't Veer, Laura J; Th Rutgers, Emiel J; Couch, Fergus J; Olson, Janet E; Hallberg, Emily; Vachon, Celine; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Peto, Julian; Dos-Santos-Silva, Isabel; Gibson, Lorna; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K; Noh, Dong-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Burwinkel, Barbara; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Benitez, Javier; Zamora, M Pilar; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Qiuyin; Cox, Angela; Cross, Simon S; Reed, Malcolm W R; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Tchatchou, Sandrine; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Lindblom, Annika; Margolin, Sara; Teo, Soo Hwang; Yip, Cheng Har; Lee, Daphne S C; Wong, Tien Y; Hooning, Maartje J; Martens, John W M; Collée, J Margriet; van Deurzen, Carolien H M; Hopper, John L; Southey, Melissa C; Tsimiklis, Helen; Kapuscinski, Miroslav K; Shen, Chen-Yang; Wu, Pei-Ei; Yu, Jyh-Cherng; Chen, Shou-Tung; Alnæs, Grethe Grenaker; Borresen-Dale, Anne-Lise; Giles, Graham G; Milne, Roger L; McLean, Catriona; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Hartman, Mikael; Miao, Hui; Buhari, Shaik Ahmad Bin Syed; Teo, Yik Ying; Fasching, Peter A; Haeberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J; Lissowska, Jolanta; Simard, Jacques; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Brauch, Hiltrud; Brüning, Thomas; Koto, Yon-Dschun; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Volorio, Sara; Dörk, Thilo; Bogdanova, Natalia V; Helbig, Sonja; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Van Asperen, Christi J; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Slager, Susan; Toland, Amanda E; Ambrosone, Christine B; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Hamann, Ute; Torres, Diana; Zheng, Wei; Long, Jirong; Anton-Culver, Hoda; Neuhausen, Susan L; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Maranian, Mel; Healey, Catherine S; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Alvarez, Nuria; Herrero, Daniel; Tessier, Daniel C; Vincent, Daniel; Bacot, Francois; de Santiago, Ines; Carroll, Jason; Caldas, Carlos; Brown, Melissa A; Lupien, Mathieu; Kristensen, Vessela N; Pharoah, Paul D P; Chenevix-Trench, Georgia; French, Juliet D; Easton, Douglas F; Dunning, Alison M
2014-09-23
GWAS have identified a breast cancer susceptibility locus on 2q35. Here we report the fine mapping of this locus using data from 101,943 subjects from 50 case-control studies. We genotype 276 SNPs using the 'iCOGS' genotyping array and impute genotypes for a further 1,284 using 1000 Genomes Project data. All but two, strongly correlated SNPs (rs4442975 G/T and rs6721996 G/A) are excluded as candidate causal variants at odds against >100:1. The best functional candidate, rs4442975, is associated with oestrogen receptor positive (ER+) disease with an odds ratio (OR) in Europeans of 0.85 (95% confidence interval=0.84-0.87; P=1.7 × 10(-43)) per t-allele. This SNP flanks a transcriptional enhancer that physically interacts with the promoter of IGFBP5 (encoding insulin-like growth factor-binding protein 5) and displays allele-specific gene expression, FOXA1 binding and chromatin looping. Evidence suggests that the g-allele confers increased breast cancer susceptibility through relative downregulation of IGFBP5, a gene with known roles in breast cell biology.
High-throughput gene mapping in Caenorhabditis elegans.
Swan, Kathryn A; Curtis, Damian E; McKusick, Kathleen B; Voinov, Alexander V; Mapa, Felipa A; Cancilla, Michael R
2002-07-01
Positional cloning of mutations in model genetic systems is a powerful method for the identification of targets of medical and agricultural importance. To facilitate the high-throughput mapping of mutations in Caenorhabditis elegans, we have identified a further 9602 putative new single nucleotide polymorphisms (SNPs) between two C. elegans strains, Bristol N2 and the Hawaiian mapping strain CB4856, by sequencing inserts from a CB4856 genomic DNA library and using an informatics pipeline to compare sequences with the canonical N2 genomic sequence. When combined with data from other laboratories, our marker set of 17,189 SNPs provides even coverage of the complete worm genome. To date, we have confirmed >1099 evenly spaced SNPs (one every 91 +/- 56 kb) across the six chromosomes and validated the utility of our SNP marker set and new fluorescence polarization-based genotyping methods for systematic and high-throughput identification of genes in C. elegans by cloning several proprietary genes. We illustrate our approach by recombination mapping and confirmation of the mutation in the cloned gene, dpy-18.
Shao, Changwei; Niu, Yongchao; Rastas, Pasi; Liu, Yang; Xie, Zhiyuan; Li, Hengde; Wang, Lei; Jiang, Yong; Tai, Shuaishuai; Tian, Yongsheng; Sakamoto, Takashi; Chen, Songlin
2015-04-01
High-resolution genetic maps are essential for fine mapping of complex traits, genome assembly, and comparative genomic analysis. Single-nucleotide polymorphisms (SNPs) are the primary molecular markers used for genetic map construction. In this study, we identified 13,362 SNPs evenly distributed across the Japanese flounder (Paralichthys olivaceus) genome. Of these SNPs, 12,712 high-confidence SNPs were subjected to high-throughput genotyping and assigned to 24 consensus linkage groups (LGs). The total length of the genetic linkage map was 3,497.29 cM with an average distance of 0.47 cM between loci, thereby representing the densest genetic map currently reported for Japanese flounder. Nine positive quantitative trait loci (QTLs) forming two main clusters for Vibrio anguillarum disease resistance were detected. All QTLs could explain 5.1-8.38% of the total phenotypic variation. Synteny analysis of the QTL regions on the genome assembly revealed 12 immune-related genes, among them 4 genes strongly associated with V. anguillarum disease resistance. In addition, 246 genome assembly scaffolds with an average size of 21.79 Mb were anchored onto the LGs; these scaffolds, comprising 522.99 Mb, represented 95.78% of assembled genomic sequences. The mapped assembly scaffolds in Japanese flounder were used for genome synteny analyses against zebrafish (Danio rerio) and medaka (Oryzias latipes). Flounder and medaka were found to possess almost one-to-one synteny, whereas flounder and zebrafish exhibited a multi-syntenic correspondence. The newly developed high-resolution genetic map, which will facilitate QTL mapping, scaffold assembly, and genome synteny analysis of Japanese flounder, marks a milestone in the ongoing genome project for this species. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
SNP Discovery and Linkage Map Construction in Cultivated Tomato
Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi
2010-01-01
Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984
Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data
Degner, Jacob F.; Marioni, John C.; Pai, Athma A.; Pickrell, Joseph K.; Nkadori, Everlyne; Gilad, Yoav; Pritchard, Jonathan K.
2009-01-01
Motivation: Next-generation sequencing has become an important tool for genome-wide quantification of DNA and RNA. However, a major technical hurdle lies in the need to map short sequence reads back to their correct locations in a reference genome. Here, we investigate the impact of SNP variation on the reliability of read-mapping in the context of detecting allele-specific expression (ASE). Results: We generated 16 million 35 bp reads from mRNA of each of two HapMap Yoruba individuals. When we mapped these reads to the human genome we found that, at heterozygous SNPs, there was a significant bias toward higher mapping rates of the allele in the reference sequence, compared with the alternative allele. Masking known SNP positions in the genome sequence eliminated the reference bias but, surprisingly, did not lead to more reliable results overall. We find that even after masking, ∼5–10% of SNPs still have an inherent bias toward more effective mapping of one allele. Filtering out inherently biased SNPs removes 40% of the top signals of ASE. The remaining SNPs showing ASE are enriched in genes previously known to harbor cis-regulatory variation or known to show uniparental imprinting. Our results have implications for a variety of applications involving detection of alternate alleles from short-read sequence data. Availability: Scripts, written in Perl and R, for simulating short reads, masking SNP variation in a reference genome and analyzing the simulation output are available upon request from JFD. Raw short read data were deposited in GEO (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE18156. Contact: jdegner@uchicago.edu; marioni@uchicago.edu; gilad@uchicago.edu; pritch@uchicago.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19808877
Mercenaro, Luca; Nieddu, Giovanni; Porceddu, Andrea; Pezzotti, Mario; Camiolo, Salvatore
2017-01-01
The genetic diversity among grapevine (Vitis vinifera L.) cultivars that underlies differences in agronomic performance and wine quality reflects the accumulation of single nucleotide polymorphisms (SNPs) and small indels as well as larger genomic variations. A combination of high throughput sequencing and mapping against the grapevine reference genome allows the creation of comprehensive sequence variation maps. We used next generation sequencing and bioinformatics to generate an inventory of SNPs and small indels in four widely cultivated Sardinian grape cultivars (Bovale sardo, Cannonau, Carignano and Vermentino). More than 3,200,000 SNPs were identified with high statistical confidence. Some of the SNPs caused the appearance of premature stop codons and thus identified putative pseudogenes. The analysis of SNP distribution along chromosomes led to the identification of large genomic regions with uninterrupted series of homozygous SNPs. We used a digital comparative genomic hybridization approach to identify 6526 genomic regions with significant differences in copy number among the four cultivars compared to the reference sequence, including 81 regions shared between all four cultivars and 4953 specific to single cultivars (representing 1.2 and 75.9% of total copy number variation, respectively). Reads mapping at a distance that was not compatible with the insert size were used to identify a dataset of putative large deletions with cultivar Cannonau revealing the highest number. The analysis of genes mapping to these regions provided a list of candidates that may explain some of the phenotypic differences among the Bovale sardo, Cannonau, Carignano and Vermentino cultivars. PMID:28775732
A genetic variation map for chicken with 2.8 million single nucleotide polymorphisms
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wong, G K; Hillier, L; Brandstrom, M
2005-02-20
We describe a genetic variation map for the chicken genome containing 2.8 million single nucleotide polymorphisms (SNPs), based on a comparison of the sequences of 3 domestic chickens (broiler, layer, Silkie) to their wild ancestor Red Jungle Fowl (RJF). Subsequent experiments indicate that at least 90% are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about 5 SNP/kb for almost every possible comparison between RJF and domestic lines, between two different domestic lines, and within domestic lines--contrary to the idea that domestic animals are highly inbred relative to theirmore » wild ancestors. In fact, most of the SNPs originated prior to domestication, and there is little to no evidence of selective sweeps for adaptive alleles on length scales of greater than 100 kb.« less
LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures.
Ryan, Michael; Diekhans, Mark; Lien, Stephanie; Liu, Yun; Karchin, Rachel
2009-06-01
LS-SNP/PDB is a new WWW resource for genome-wide annotation of human non-synonymous (amino acid changing) SNPs. It serves high-quality protein graphics rendered with UCSF Chimera molecular visualization software. The system is kept up-to-date by an automated, high-throughput build pipeline that systematically maps human nsSNPs onto Protein Data Bank structures and annotates several biologically relevant features. LS-SNP/PDB is available at (http://ls-snp.icm.jhu.edu/ls-snp-pdb) and via links from protein data bank (PDB) biology and chemistry tabs, UCSC Genome Browser Gene Details and SNP Details pages and PharmGKB Gene Variants Downloads/Cross-References pages.
Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci
Brorsson, Caroline A.; Pociot, Flemming
2014-01-01
Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376
Kim, Kyung-Seon; Kim, Ghi-Su; Hwang, Joo-Yeon; Lee, Hye-Ja; Park, Mi-Hyun; Kim, Kwang-joong; Jung, Jongsun; Cha, Hyo-Soung; Shin, Hyoung Doo; Kang, Jong-Ho; Park, Eui Kyun; Kim, Tae-Ho; Hong, Jung-Min; Koh, Jung-Min; Oh, Bermseok; Kimm, Kuchan; Kim, Shin-Yoon; Lee, Jong-Young
2007-01-01
Background Osteoporosis is defined as the loss of bone mineral density that leads to bone fragility with aging. Population-based case-control studies have identified polymorphisms in many candidate genes that have been associated with bone mass maintenance or osteoporotic fracture. To investigate single nucleotide polymorphisms (SNPs) that are associated with osteoporosis, we examined the genetic variation among Koreans by analyzing 81 genes according to their function in bone formation and resorption during bone remodeling. Methods We resequenced all the exons, splice junctions and promoter regions of candidate osteoporosis genes using 24 unrelated Korean individuals. Using the common SNPs from our study and the HapMap database, a statistical analysis of deviation in heterozygosity depicted. Results We identified 942 variants, including 888 SNPs, 43 insertion/deletion polymorphisms, and 11 microsatellite markers. Of the SNPs, 557 (63%) had been previously identified and 331 (37%) were newly discovered in the Korean population. When compared SNPs in the Korean population with those in HapMap database, 1% (or less) of SNPs in the Japanese and Chinese subpopulations and 20% of those in Caucasian and African subpopulations were significantly differentiated from the Hardy-Weinberg expectations. In addition, an analysis of the genetic diversity showed that there were no significant differences among Korean, Han Chinese and Japanese populations, but African and Caucasian populations were significantly differentiated in selected genes. Nevertheless, in the detailed analysis of genetic properties, the LD and Haplotype block patterns among the five sub-populations were substantially different from one another. Conclusion Through the resequencing of 81 osteoporosis candidate genes, 118 unknown SNPs with a minor allele frequency (MAF) > 0.05 were discovered in the Korean population. In addition, using the common SNPs between our study and HapMap, an analysis of genetic diversity and deviation in heterozygosity was performed and the polymorphisms of the above genes among the five populations were substantially differentiated from one another. Further studies of osteoporosis could utilize the polymorphisms identified in our data since they may have important implications for the selection of highly informative SNPs for future association studies. PMID:18036257
Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D C; Bouchard, Claude
2010-06-01
A genome-wide linkage scan identified a quantitative trait locus for exercise training-induced changes in submaximal exercise (50 W) heart rate (DeltaHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (n=472). To fine-map the region, 1450 tag SNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with DeltaHR50 was observed with 2 single-nucleotide polymorphisms (SNPs) located in the 5' region of the cAMP-responsive element-binding protein 1 (CREB1) gene (rs2253206: P=1.6x10(-5) and rs2360969: P=4.3x10(-5)). The associations remained significant (P=0.01 and P=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analysis identified 9 SNPs that collectively explained 20% of the DeltaHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs approximately 1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, 6 SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: The log of the odds score dropped from 2.10 to 0.41 after adjusting for all 6 SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (P<0.05) lower promoter activity than the minor allele. Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with DeltaHR50 and explains a considerable proportion of the quantitative trait locus variance. However, at least 5 additional SNPs seem to be required to fully account for the original linkage signal.
Rankinen, Tuomo; Argyropoulos, George; Rice, Treva; Rao, D.C.; Bouchard, Claude
2011-01-01
Background A genome-wide linkage scan identified a quantitative trait locus (QTL) for exercise training-induced changes in submaximal exercise (50W) heart rate (ΔHR50) on chromosome 2q33.3-q34 in the HERITAGE Family Study (N=472). Methods and Results To fine map the region, 1,450 tagSNPs were genotyped between 205 and 215 Mb on chromosome 2. The strongest evidence of association with ΔHR50 was observed with two SNPs located in the 5′ region of the cAMP responsive element binding protein 1 (CREB1) gene (rs2253206: p=1.6×10−5 and rs2360969: p=4.3×10−5). The associations remained significant (p=0.01 and p=0.023, respectively) after accounting for multiple testing. Regression modeling of the 39 most significant SNPs in the single-SNP analyses identified nine SNPs that collectively explained 20% of the ΔHR50 variance. CREB1 SNP rs2253206 had the strongest effect (5.45% of variance), followed by SNPs in the FASTKD2 (3.1%), MAP2 (2.6%), SPAG16 (2.1%), ERBB4 (3 SNPs ~1.4% each), IKZF2 (1.4%), and PARD3B (1.0%) loci. In conditional linkage analysis, six SNPs from the final regression model (CREB1, FASTKD2, MAP2, ERBB4, IKZF2, and PARD3B) accounted for the original linkage signal: the LOD score dropped from 2.10 to 0.41 after adjusting for all six SNPs. Functional studies revealed that the common allele of rs2253206 exhibits significantly (p<0.05) lower promoter activity than the minor allele. Conclusions Our data suggest that functional DNA sequence variation in the CREB1 locus is strongly associated with ΔHR50 and explains considerable proportion of the QTL variance. However, at least five additional SNPs seem to be required to fully account for the original linkage signal. PMID:20407090
Das, Shouvik; Singh, Mohar; Srivastava, Rishi; Bajaj, Deepak; Saxena, Maneesha S.; Rana, Jai C.; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2016-01-01
The present study used a whole-genome, NGS resequencing-based mQTL-seq (multiple QTL-seq) strategy in two inter-specific mapping populations (Pusa 1103 × ILWC 46 and Pusa 256 × ILWC 46) to scan the major genomic region(s) underlying QTL(s) governing pod number trait in chickpea. Essentially, the whole-genome resequencing of low and high pod number-containing parental accessions and homozygous individuals (constituting bulks) from each of these two mapping populations discovered >8 million high-quality homozygous SNPs with respect to the reference kabuli chickpea. The functional significance of the physically mapped SNPs was apparent from the identified 2,264 non-synonymous and 23,550 regulatory SNPs, with 8–10% of these SNPs-carrying genes corresponding to transcription factors and disease resistance-related proteins. The utilization of these mined SNPs in Δ (SNP index)-led QTL-seq analysis and their correlation between two mapping populations based on mQTL-seq, narrowed down two (CaqaPN4.1: 867.8 kb and CaqaPN4.2: 1.8 Mb) major genomic regions harbouring robust pod number QTLs into the high-resolution short QTL intervals (CaqbPN4.1: 637.5 kb and CaqbPN4.2: 1.28 Mb) on chickpea chromosome 4. The integration of mQTL-seq-derived one novel robust QTL with QTL region-specific association analysis delineated the regulatory (C/T) and coding (C/A) SNPs-containing one pentatricopeptide repeat (PPR) gene at a major QTL region regulating pod number in chickpea. This target gene exhibited anther, mature pollen and pod-specific expression, including pronounced higher up-regulated (∼3.5-folds) transcript expression in high pod number-containing parental accessions and homozygous individuals of two mapping populations especially during pollen and pod development. The proposed mQTL-seq-driven combinatorial strategy has profound efficacy in rapid genome-wide scanning of potential candidate gene(s) underlying trait-associated high-resolution robust QTL(s), thereby expediting genomics-assisted breeding and genetic enhancement of crop plants, including chickpea. PMID:26685680
Genomic Regions Associated with Root Traits under Drought Stress in Tropical Maize (Zea mays L.)
Zaidi, P. H.; Krishna, Girish; Krishnamurthy, L.; Gajanan, S.; Babu, Raman; Zerka, M.; Vinayan, M. T.; Vivek, B. S.
2016-01-01
An association mapping panel, named as CIMMYT Asia association mapping (CAAM) panel, involving 396 diverse tropical maize lines were phenotyped for various structural and functional traits of roots under drought and well-watered conditions. The experiment was conducted during Kharif (summer-rainy) season of 2012 and 2013 in root phenotyping facility at CIMMYT-Hyderabad, India. The CAAM panel was genotyped to generate 955, 690 SNPs through GBS v2.7 using Illumina Hi-seq 2000/2500 at Institute for Genomic Diversity, Cornell University, Ithaca, NY, USA. GWAS analysis was carried out using 331,390 SNPs filtered from the entire set of SNPs revealed a total of 50 and 67 SNPs significantly associated for root functional (transpiration efficiency, flowering period water use) and structural traits (rooting depth, root dry weight, root length, root volume, root surface area and root length density), respectively. In addition to this, 37 SNPs were identified for grain yield and shoot biomass under well-watered and drought stress. Though many SNPs were found to have significant association with the traits under study, SNPs that were common for more than one trait were discussed in detail. A total 18 SNPs were found to have common association with more than one trait, out of which 12 SNPs were found within or near the various gene functional regions. In this study we attempted to identify the trait specific maize lines based on the presence of favorable alleles for the SNPs associated with multiple traits. Two SNPs S3_128533512 and S7_151238865 were associated with transpiration efficiency, shoot biomass and grain yield under well-watered condition. Based on favorable allele for these SNPs seven inbred lines were identified. Similarly, four lines were identified for transpiration efficiency and shoot biomass under drought stress based on the presence of favorable allele for the common SNPs S1_211520521, S2_20017716, S3_57210184 and S7_130878458 and three lines were identified for flowering period water-use, transpiration efficiency, root dry weight and root volume based on the presence of favorable allele for the common SNPs S3_162065732 and S3_225760139. PMID:27768702
Nguyen, Nguyen H; Rastas, Pasi M A; Premachandra, H K A; Knibb, Wayne
2018-01-01
The genetic resources available for the commercially important fish species Yellowtail kingfish (YTK) ( Seriola lalandi) are relative sparse. To overcome this, we aimed (1) to develop a linkage map for this species, and (2) to identify markers/variants associated with economically important traits in kingfish (with an emphasis on body weight). Genetic and genomic analyses were conducted using 13,898 single nucleotide polymorphisms (SNPs) generated from a new high-throughput genotyping by sequencing platform, Diversity Arrays Technology (DArTseq TM ) in a pedigreed population comprising 752 animals. The linkage analysis enabled to map about 4,000 markers to 24 linkage groups (LGs), with an average density of 3.4 SNPs per cM. The linkage map was integrated into a genome-wide association study (GWAS) and identified six variants/SNPs associated with body weight ( P < 5e -8 ) when a multi-locus mixed model was used. Two out of the six significant markers were mapped to LGs 17 and 23, and collectively they explained 5.8% of the total genetic variance. It is concluded that the newly developed linkage map and the significantly associated markers with body weight provide fundamental information to characterize genetic architecture of growth-related traits in this population of YTK S. lalandi .
High-density genetic map of Miscanthus sinensis reveals inheritance of zebra stripe
DOE Office of Scientific and Technical Information (OSTI.GOV)
Liu, Siyao; Clark, Lindsay V.; Swaminathan, Kankshita
Miscanthus is a perennial C4 grass that has recently become an important bioenergy crop. The efficiency of breeding improved Miscanthus biomass cultivars could be greatly increased by marker-assisted selection. Thus, a high-density genetic map is critical to Miscanthus improvement. In this study, a mapping population of 261 F1 progeny was developed from a cross between two diploid M. sinensis cultivars, ‘Strictus’ and ‘Kaskade’. High-density genetic maps for the two parents were produced with 3044 newly developed single nucleotide polymorphisms (SNPs) obtained from restriction site-associated DNA sequencing, and 138 previously mapped GoldenGate SNPs. The female parent (‘Strictus’) map spanned 1599 cM,more » with 1989 SNPs on 19 linkage groups, and an average intermarker spacing of 0.8 cM. The length of the male parent (‘Kaskade’) map was 1612 cM, with 1821 SNPs, and an average intermarker spacing of 0.9 cM. The utility of the map was confirmed by locating quantitative trait loci (QTL) for the zebra-striped trait, which was segregating in this population. Three QTL for zebra-striped presence/absence (zb1, zb2 on LG 7, and zb3 on LG 10) and three for zebra-striped intensity (zbi1, zbi2, zbi3 on LGs 7, 10, 3) were identified. Each allele that caused striping was recessive. Incomplete penetrance was observed for each zb QTL, but penetrance was greatest when two or more zb QTL were homozygous for the causative alleles. Similarly, the intensity of striping was greatest when two or more zbi QTL were homozygous for alleles that conferred the trait. Comparative mapping indicated putative correspondence between zb3 and/or zbi2 on LG 10 to previously sequenced genes conferring zebra stripe in maize and rice. These results demonstrate that the new map is useful for identifying marker–trait associations. The mapped markers will become a valuable community resource, facilitating comparisons among studies and the breeding of Miscanthus.« less
High-density genetic map of Miscanthus sinensis reveals inheritance of zebra stripe
Liu, Siyao; Clark, Lindsay V.; Swaminathan, Kankshita; ...
2015-05-06
Miscanthus is a perennial C4 grass that has recently become an important bioenergy crop. The efficiency of breeding improved Miscanthus biomass cultivars could be greatly increased by marker-assisted selection. Thus, a high-density genetic map is critical to Miscanthus improvement. In this study, a mapping population of 261 F1 progeny was developed from a cross between two diploid M. sinensis cultivars, ‘Strictus’ and ‘Kaskade’. High-density genetic maps for the two parents were produced with 3044 newly developed single nucleotide polymorphisms (SNPs) obtained from restriction site-associated DNA sequencing, and 138 previously mapped GoldenGate SNPs. The female parent (‘Strictus’) map spanned 1599 cM,more » with 1989 SNPs on 19 linkage groups, and an average intermarker spacing of 0.8 cM. The length of the male parent (‘Kaskade’) map was 1612 cM, with 1821 SNPs, and an average intermarker spacing of 0.9 cM. The utility of the map was confirmed by locating quantitative trait loci (QTL) for the zebra-striped trait, which was segregating in this population. Three QTL for zebra-striped presence/absence (zb1, zb2 on LG 7, and zb3 on LG 10) and three for zebra-striped intensity (zbi1, zbi2, zbi3 on LGs 7, 10, 3) were identified. Each allele that caused striping was recessive. Incomplete penetrance was observed for each zb QTL, but penetrance was greatest when two or more zb QTL were homozygous for the causative alleles. Similarly, the intensity of striping was greatest when two or more zbi QTL were homozygous for alleles that conferred the trait. Comparative mapping indicated putative correspondence between zb3 and/or zbi2 on LG 10 to previously sequenced genes conferring zebra stripe in maize and rice. These results demonstrate that the new map is useful for identifying marker–trait associations. The mapped markers will become a valuable community resource, facilitating comparisons among studies and the breeding of Miscanthus.« less
Qiu, Chao; Chang, Ranran; Yang, Jie; Ge, Shengju; Xiong, Liu; Zhao, Mei; Li, Man; Sun, Qingjie
2017-04-15
Essential oils (EOs), including menthone, oregano, cinnamon, lavender, and citral, are natural products that have antimicrobial and antioxidant activities. However, extremely low water solubility, and easy degradation by heat, restrict their application. The aim of this work was to evaluate the enhancement in antioxidative and antimicrobial activities of EOs encapsulated in starch nanoparticles (SNPs) prepared by short glucan chains. For the first time, we have successfully fabricated menthone-loaded SNPs (SNPs-M) at different complexation temperatures (30, 60, and 90°C) by an in situ nanoprecipitation method. The SNPs-M displayed spherical shapes, and the particle sizes ranged from 93 to 113nm. The encapsulation efficiency (EE) of SNPs-M increased significantly with an increase in complexation temperature, and the maximum EE was 86.6%. The SNPs-M formed at 90°C had high crystallization and thermal stability. The durations of the antioxidant and antimicrobial activities of EOs was extended by their encapsulation in the SNPs. Copyright © 2016 Elsevier Ltd. All rights reserved.
Bhattarai, Dinesh; Chen, Xing; Ur Rehman, Zia; Hao, Xingjie; Ullah, Farman; Dad, Rahim; Talpur, Hira Sajjad; Kadariya, Ishwari; Cui, Lu; Fan, Mingxia; Zhang, Shujun
2017-02-01
The objective of the studies presented in this Research Communication was to investigate the association of single nucleotide polymorphisms present in the MAP4K4 gene with different milk traits in dairy cows. Based on previous QTL fine mapping results on bovine chromosome 11, the MAP4K4 gene was selected as a candidate gene to evaluate its effect on somatic cell count and milk traits in ChineseHolstein cows. Milk production traits including milk yield, fat percentage, and protein percentage of each cow were collected using 305 d lactation records. Association between MAP4K4 genotype and different traits and Somatic Cell Score (SCS) was performed using General Linear Regression Model of R. Two SNPs at exon 18 (c.2061T > G and c.2196T > C) with genotype TT in both SNPs were found significantly higher for somatic SCS. We found the significant effect of exon 18 (c.2061T > G) on protein percentage, milk yield and SCS. We identified SNPs at different location of MAP4K4 gene of the cattle and several of them were significantly associated with the somatic cell score and other different milk traits. Thus, MAP4K4 gene could be a useful candidate gene for selection of dairy cattle against mastitis and the identified polymorphisms might potentially be strong genetic markers.
Patterns of linkage disequilibrium at PARK16 may explain variances in genetic association studies.
Li, Huihua; Teo, Yik-Ying; Tan, Eng-King
2015-09-01
Reproducing genomewide association studies findings in different populations is challenging, because the reproducibility fundamentally relies on the similar patterns of linkage disequilibrium between the unknown causal variants and the genotyped single-nucleotide polymorphisms (SNPs). The PARK16 locus was reported to alter the risk of Parkinson's disease (PD) in genomewide association studies in Japanese and Caucasians. We evaluated the regional linkage disequilibrium pattern at PARK16 locus in Caucasians, Japanese, and Chinese from HapMap and Chinese, Malays, and Indians from the Singapore Genome Variation Project, using the traditional heatmaps and targeted analysis of PARK16 gene via Monte Carlo simulation through varLD scores of these ethnic groups. One hundred SNPs in Caucasians, 95 SNPs in Chinese, 78 SNPs in Japanese from HapMap, 86 SNPs in Chinese, 99 SNPs in Indians, and 97 SNPs in Malays from the Singapore Genome Variation Project were included. Our targeted analysis showed that the linkage disequilibrium pattern of SNPs close to rs947211 was similar in Caucasians and Asians, including Chinese, Japanese, and Malay (all P > 0.0001), whereas different linkage disequilibrium patterns around rs823128, rs823156, and rs708730 were found between Caucasians and these Asian groups (all P < 0.0001). Our study suggests a higher chance to detect the association between rs947211 and PD in Chinese, Malay, and other Caucasian groups because of the similar linkage disequilibrium pattern around rs947211. The associations between rs823128/rs823156/rs708730 and PD are more likely to be replicated in Chinese and Malay populations. © 2015 International Parkinson and Movement Disorder Society.
Fu, Junling; Li, Ge; Li, Lujiao; Yin, Jinhua; Cheng, Hong; Han, Lanwen; Zhang, Qian; Li, Naishi; Xiao, Xinhua; Grant, Struan F.A.; Li, Mingyao; Gao, Shan; Mi, Jie; Li, Ming
2017-01-01
Genome-wide association studies have identified multiple variants associated with adult obesity, mostly in European-ancestry populations. We aimed to systematically assess the contribution of key loci, which had been previously shown to be associated in East Asian adults, to childhood obesity, related adipokine profiles and metabolic traits in a Chinese pediatric population. Twelve single-nucleotide polymorphisms (SNPs) plus metabolic profiles and levels of five adipokines (leptin, adiponectin, resistin, fibroblast growth factor 21 and retinol binding protein 4) were evaluated in 3,506 Chinese children and adolescents aged 6-18. After correction for multiple comparisons, six of these SNPs were robustly associated with childhood obesity: FTO-rs1558902 (P=5.6×10−5), MC4R-rs2331841 (P=4.4×10−4), GNPDA2-rs16858082 (P = 3.4×10−4), PCSK1-rs261967 (P = 0.001), SEC16B-rs516636 (P = 0.004) and MAP2K5-rs4776970 (P = 0.004), with odds ratios ranging from 1.211 to 1.421; while ITIH4-rs2535633 and BDNF-rs2030323 yielded nominal association with the same trait (P < 0.05). Moreover, the risk alleles of six SNPs displayed significant (P < 0.004) or nominal (P < 0.05) association with leptin levels, namely at in/near PCSK1, MC4R, FTO, MAP2K5, GNPDA2 and BDNF plus their cumulative genetic score yielded stronger association with increased leptin levels (P = 6.2×10−11). Our results reveal that key obesity-associated loci previously reported in Europeans, but also associated with East Asian adults, are also associated with obesity and/or metabolic quantitative traits in Chinese children. These associations coincide with six brain-expressed loci that correlate with leptin levels, thus may point to an important neuronal influence on body weight regulation in the pediatric setting. PMID:29212175
Fu, Junling; Li, Ge; Li, Lujiao; Yin, Jinhua; Cheng, Hong; Han, Lanwen; Zhang, Qian; Li, Naishi; Xiao, Xinhua; Grant, Struan F A; Li, Mingyao; Gao, Shan; Mi, Jie; Li, Ming
2017-11-07
Genome-wide association studies have identified multiple variants associated with adult obesity, mostly in European-ancestry populations. We aimed to systematically assess the contribution of key loci, which had been previously shown to be associated in East Asian adults, to childhood obesity, related adipokine profiles and metabolic traits in a Chinese pediatric population. Twelve single-nucleotide polymorphisms (SNPs) plus metabolic profiles and levels of five adipokines (leptin, adiponectin, resistin, fibroblast growth factor 21 and retinol binding protein 4) were evaluated in 3,506 Chinese children and adolescents aged 6-18. After correction for multiple comparisons, six of these SNPs were robustly associated with childhood obesity: FTO -rs1558902 ( P =5.6×10 -5 ), MC4R -rs2331841 ( P =4.4×10 -4 ), GNPDA2 -rs16858082 ( P = 3.4×10 -4 ), PCSK1 -rs261967 ( P = 0.001), SEC16B -rs516636 ( P = 0.004) and MAP2K5 -rs4776970 ( P = 0.004), with odds ratios ranging from 1.211 to 1.421; while ITIH4 -rs2535633 and BDNF -rs2030323 yielded nominal association with the same trait ( P < 0.05). Moreover, the risk alleles of six SNPs displayed significant ( P < 0.004) or nominal ( P < 0.05) association with leptin levels, namely at in/near PCSK1, MC4R, FTO, MAP2K5, GNPDA2 and BDNF plus their cumulative genetic score yielded stronger association with increased leptin levels ( P = 6.2×10 -11 ). Our results reveal that key obesity-associated loci previously reported in Europeans, but also associated with East Asian adults, are also associated with obesity and/or metabolic quantitative traits in Chinese children. These associations coincide with six brain-expressed loci that correlate with leptin levels, thus may point to an important neuronal influence on body weight regulation in the pediatric setting.
Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.
2015-01-01
Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132
Chopra, Rupali; Ali, Shafat; Srivastava, Amit K; Aggarwal, Shweta; Kumar, Bhupender; Manvati, Siddharth; Kalaiarasan, Ponnusamy; Jena, Mamta; Garg, Vijay K; Bhattacharya, Sambit N; Bamezai, Rameshwar N K
2013-01-01
Leprosy is a chronic infectious disease caused by Mycobacterium Leprae, where the host genetic background plays an important role toward the disease pathogenesis. Various studies have identified a number of human genes in association with leprosy or its clinical forms. However, non-replication of results has hinted at the heterogeneity among associations between different population groups, which could be due to differently evolved LD structures and differential frequencies of SNPs within the studied regions of the genome. A need for systematic and saturated mapping of the associated regions with the disease is warranted to unravel the observed heterogeneity in different populations. Mapping of the PARK2 and PACRG gene regulatory region with 96 SNPs, with a resolution of 1 SNP per 1 Kb for PARK2 gene regulatory region in a North Indian population, showed an involvement of 11 SNPs in determining the susceptibility towards leprosy. The association was replicated in a geographically distinct and unrelated population from Orissa in eastern India. In vitro reporter assays revealed that the two significantly associated SNPs, located 63.8 kb upstream of PARK2 gene and represented in a single BIN of 8 SNPs, influenced the gene expression. A comparison of BINs between Indian and Vietnamese populations revealed differences in the BIN structures, explaining the heterogeneity and also the reason for non-replication of the associated genomic region in different populations.
Baeza-Richer, Carlos; Arroyo-Pardo, Eduardo; Blanco-Rojo, Ruth; Toxqui, Laura; Remacha, Angel; Vaquero, M Pilar; López-Parra, Ana M
2015-12-01
Numerous studies associate genetic markers with iron- and erythrocyte-related parameters, but few relate them to iron-clinical phenotypes. Novel SNP rs1375515, located in a subunit of the calcium channel gene CACNA2D3, is associated with a higher risk of anaemia. The aim of this study is to further investigate the association of this SNP with iron-related parameters and iron-clinical phenotypes, and to explore the potential role of calcium channel subunit region in iron regulation. Furthermore, we aim to replicate the association of other SNPs reported previously in our population. We tested 45 SNPs selected via systematic review and fine mapping of CACNA2D3 region, with haematological and biochemical traits in 358 women of reproductive age. Multivariate analyses include back-step logistic regression and decision trees. The results replicate the association of SNPs with iron-related traits, and also confirm the protective effect of both A allele of rs1800562 (HFE) and G allele of rs4895441 (HBS1L-MYB). The risk of developing anaemia is increased in reproductive age women carriers of A allele of rs1868505 (CACNA2D3) and/or T allele of rs13194491 (HIST1H2BJ). Association of SNPs from fine mapping with ferritin and serum iron suggests that calcium channels could be a potential pathway for iron uptake in physiological conditions. Copyright © 2015 Elsevier Inc. All rights reserved.
Chopra, Rupali; Aggarwal, Shweta; Kumar, Bhupender; Manvati, Siddharth; Kalaiarasan, Ponnusamy; Jena, Mamta; Garg, Vijay K.; Bhattacharya, Sambit N.; Bamezai, Rameshwar N. K.
2013-01-01
Leprosy is a chronic infectious disease caused by Mycobacterium Leprae, where the host genetic background plays an important role toward the disease pathogenesis. Various studies have identified a number of human genes in association with leprosy or its clinical forms. However, non-replication of results has hinted at the heterogeneity among associations between different population groups, which could be due to differently evolved LD structures and differential frequencies of SNPs within the studied regions of the genome. A need for systematic and saturated mapping of the associated regions with the disease is warranted to unravel the observed heterogeneity in different populations. Mapping of the PARK2 and PACRG gene regulatory region with 96 SNPs, with a resolution of 1 SNP per 1 Kb for PARK2 gene regulatory region in a North Indian population, showed an involvement of 11 SNPs in determining the susceptibility towards leprosy. The association was replicated in a geographically distinct and unrelated population from Orissa in eastern India. In vitro reporter assays revealed that the two significantly associated SNPs, located 63.8 kb upstream of PARK2 gene and represented in a single BIN of 8 SNPs, influenced the gene expression. A comparison of BINs between Indian and Vietnamese populations revealed differences in the BIN structures, explaining the heterogeneity and also the reason for non-replication of the associated genomic region in different populations. PMID:23861666
Buchanan, Carrie C; Torstenson, Eric S; Bush, William S; Ritchie, Marylyn D
2012-01-01
Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects.
Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D
2016-07-07
High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. Copyright © 2016 Tsai et al.
Kote-Jarai, Zsofia; Saunders, Edward J.; Leongamornlert, Daniel A.; Tymrakiewicz, Malgorzata; Dadaev, Tokhir; Jugurnauth-Little, Sarah; Ross-Adams, Helen; Al Olama, Ali Amin; Benlloch, Sara; Halim, Silvia; Russel, Roslin; Dunning, Alison M.; Luccarini, Craig; Dennis, Joe; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Muir, Ken; Giles, Graham G.; Severi, Gianluca; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A.; Schumacher, Fredrick; Henderson, Brian E.; Le Marchand, Loic; Lindstrom, Sara; Kraft, Peter; Hunter, David J.; Gapstur, Susan; Chanock, Stephen; Berndt, Sonja I.; Albanes, Demetrius; Andriole, Gerald; Schleutker, Johanna; Weischer, Maren; Canzian, Federico; Riboli, Elio; Key, Tim J.; Travis, Ruth C.; Campa, Daniele; Ingles, Sue A.; John, Esther M.; Hayes, Richard B.; Pharoah, Paul; Khaw, Kay-Tee; Stanford, Janet L.; Ostrander, Elaine A.; Signorello, Lisa B.; Thibodeau, Stephen N.; Schaid, Dan; Maier, Christiane; Vogel, Walther; Kibel, Adam S.; Cybulski, Cezary; Lubinski, Jan; Cannon-Albright, Lisa; Brenner, Hermann; Park, Jong Y.; Kaneva, Radka; Batra, Jyotsna; Spurdle, Amanda; Clements, Judith A.; Teixeira, Manuel R.; Govindasami, Koveela; Guy, Michelle; Wilkinson, Rosemary A.; Sawyer, Emma J.; Morgan, Angela; Dicks, Ed; Baynes, Caroline; Conroy, Don; Bojesen, Stig E.; Kaaks, Rudolf; Vincent, Daniel; Bacot, François; Tessier, Daniel C.; Easton, Douglas F.; Eeles, Rosalind A.
2013-01-01
Associations between single nucleotide polymorphisms (SNPs) at 5p15 and multiple cancer types have been reported. We have previously shown evidence for a strong association between prostate cancer (PrCa) risk and rs2242652 at 5p15, intronic in the telomerase reverse transcriptase (TERT) gene that encodes TERT. To comprehensively evaluate the association between genetic variation across this region and PrCa, we performed a fine-mapping analysis by genotyping 134 SNPs using a custom Illumina iSelect array or Sequenom MassArray iPlex, followed by imputation of 1094 SNPs in 22 301 PrCa cases and 22 320 controls in The PRACTICAL consortium. Multiple stepwise logistic regression analysis identified four signals in the promoter or intronic regions of TERT that independently associated with PrCa risk. Gene expression analysis of normal prostate tissue showed evidence that SNPs within one of these regions also associated with TERT expression, providing a potential mechanism for predisposition to disease. PMID:23535824
DOE Office of Scientific and Technical Information (OSTI.GOV)
Urano, Tomohiko; Inoue, Satoshi, E-mail: INOUE-GER@h.u-tokyo.ac.jp; Department of Anti-Aging Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo 113-8655
Highlights: • Single-nucleotide polymorphisms (SNPs) associated with osteoporosis were identified. • SNPs mapped close to or within VDR and ESR1 are associated with bone mineral density. • WNT signaling pathway plays a pivotal role in regulating bone mineral density. • Genetic studies will be useful for identification of new therapeutic targets. - Abstract: Osteoporosis is a skeletal disease characterized by low bone mineral density (BMD) and microarchitectural deterioration of bone tissue, which increases susceptibility to fractures. BMD is a complex quantitative trait with normal distribution and seems to be genetically controlled (in 50–90% of the cases), according to studies onmore » twins and families. Over the last 20 years, candidate gene approach and genome-wide association studies (GWAS) have identified single-nucleotide polymorphisms (SNPs) that are associated with low BMD, osteoporosis, and osteoporotic fractures. These SNPs have been mapped close to or within genes including those encoding nuclear receptors and WNT-β-catenin signaling proteins. Understanding the genetics of osteoporosis will help identify novel candidates for diagnostic and therapeutic targets.« less
SNP Discovery for mapping alien introgressions in wheat
2014-01-01
Background Monitoring alien introgressions in crop plants is difficult due to the lack of genetic and molecular mapping information on the wild crop relatives. The tertiary gene pool of wheat is a very important source of genetic variability for wheat improvement against biotic and abiotic stresses. By exploring the 5Mg short arm (5MgS) of Aegilops geniculata, we can apply chromosome genomics for the discovery of SNP markers and their use for monitoring alien introgressions in wheat (Triticum aestivum L). Results The short arm of chromosome 5Mg of Ae. geniculata Roth (syn. Ae. ovata L.; 2n = 4x = 28, UgUgMgMg) was flow-sorted from a wheat line in which it is maintained as a telocentric chromosome. DNA of the sorted arm was amplified and sequenced using an Illumina Hiseq 2000 with ~45x coverage. The sequence data was used for SNP discovery against wheat homoeologous group-5 assemblies. A total of 2,178 unique, 5MgS-specific SNPs were discovered. Randomly selected samples of 59 5MgS-specific SNPs were tested (44 by KASPar assay and 15 by Sanger sequencing) and 84% were validated. Of the selected SNPs, 97% mapped to a chromosome 5Mg addition to wheat (the source of t5MgS), and 94% to 5Mg introgressed from a different accession of Ae. geniculata substituting for chromosome 5D of wheat. The validated SNPs also identified chromosome segments of 5MgS origin in a set of T5D-5Mg translocation lines; eight SNPs (25%) mapped to TA5601 [T5DL · 5DS-5MgS(0.75)] and three (8%) to TA5602 [T5DL · 5DS-5MgS (0.95)]. SNPs (gsnp_5ms83 and gsnp_5ms94), tagging chromosome T5DL · 5DS-5MgS(0.95) with the smallest introgression carrying resistance to leaf rust (Lr57) and stripe rust (Yr40), were validated in two released germplasm lines with Lr57 and Yr40 genes. Conclusion This approach should be widely applicable for the identification of species/genome-specific SNPs. The development of a large number of SNP markers will facilitate the precise introgression and monitoring of alien segments in crop breeding programs and further enable mapping and cloning novel genes from the wild relatives of crop plants. PMID:24716476
Sato, Youichi; Tajima, Atsushi; Tsunematsu, Kouki; Nozawa, Shiari; Yoshiike, Miki; Koh, Eitetsue; Kanaya, Jiro; Namiki, Mikio; Matsumiya, Kiyomi; Tsujimura, Akira; Komatsu, Kiyoshi; Itoh, Naoki; Eguchi, Jiro; Imoto, Issei; Yamauchi, Aiko; Iwamoto, Teruaki
2015-06-01
Are the four candidate loci (rs7867029, rs12870438, rs7174015 and rs724078) for human male fertility traits, identified in a genome-wide association study (GWAS) of a Hutterite population in the USA, associated with semen quality traits in a Japanese population? The four single nucleotide polymorphisms (SNPs) rs7867029, rs12870438, rs7174015 and rs724078 have no association with semen parameters in a meta-analysis of two Japanese male cohorts. Four (rs7867029, rs12870438, rs7174015 and rs724078) of the SNPs associated with family size or birth rate in the GWAS of a Hutterite population in the USA were associated with semen parameters in ethnically diverse men from Chicago, USA. This is a replication study in a total of 2015 Japanese subjects, including 791 fertile men and 1224 young men from the general population. We performed a replication study in two cohorts to assess whether the SNPs rs7867029, rs12870438, rs7174015 and rs724078 are associated with sperm concentration, semen volume, total sperm numbers, total motile sperm numbers or sperm motility. The rs12870438 SNP was detected by restriction fragment length polymorphism PCR while rs7174015, rs724078 and rs7867029 SNPs were genotyped using TaqMan probes. This study indicated that none of the four SNPs rs7867029, rs12870438, rs7174015 and rs724078 displayed a significant association with semen parameters in the meta-analysis of two Japanese male cohorts. Only four SNPs identified in the Hutterite GWAS were examined for associations with semen quality traits in a Japanese population. In addition, the linkage disequilibrium structures around the testing markers were different between ethnic groups. Locus mapping studies using a set of tagging SNPs across the loci will be necessary in populations with larger sample sizes in order to understand the contribution of specific genes to semen quality. This study was supported in part by the Ministry of Health and Welfare of Japan (1013201) (to T.I.), Grant-in-Aids for Scientific Research (C) (23510242) (to A.Ta.) from the Japan Society for the Promotion of Science, the European Union (BMH4-CT96-0314) (to T.I.), and the Takeda Science Foundation (to A.Ta.). None of the authors has any competing interests to declare. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.
Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej
2005-06-15
The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. http://www.salilab.org/LS-SNP CONTACT: rachelk@salilab.org http://salilab.org/LS-SNP/supp-info.pdf.
Evidence that breast cancer risk at the 2q35 locus is mediated through IGFBP5 regulation
Ghoussaini, Maya; Edwards, Stacey L.; Michailidou, Kyriaki; Nord, Silje; Cowper-Sal·lari, Richard; Desai, Kinjal; Kar, Siddhartha; Hillman, Kristine M.; Kaufmann, Susanne; Glubb, Dylan M.; Beesley, Jonathan; Dennis, Joe; Bolla, Manjeet K.; Wang, Qin; Dicks, Ed; Guo, Qi; Schmidt, Marjanka K.; Shah, Mitul; Luben, Robert; Brown, Judith; Czene, Kamila; Darabi, Hatef; Eriksson, Mikael; Klevebring, Daniel; Bojesen, Stig E.; Nordestgaard, Børge G.; Nielsen, Sune F.; Flyger, Henrik; Lambrechts, Diether; Thienpont, Bernard; Neven, Patrick; Wildiers, Hans; Broeks, Annegien; Van’t Veer, Laura J.; Th Rutgers, Emiel J.; Couch, Fergus J.; Olson, Janet E.; Hallberg, Emily; Vachon, Celine; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Peto, Julian; dos-Santos-Silva, Isabel; Gibson, Lorna; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Hall, Per; Li, Jingmei; Liu, Jianjun; Humphreys, Keith; Kang, Daehee; Choi, Ji-Yeob; Park, Sue K.; Noh, Dong-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Yatabe, Yasushi; Guénel, Pascal; Truong, Thérèse; Menegaux, Florence; Sanchez, Marie; Burwinkel, Barbara; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Wu, Anna H.; Tseng, Chiu-chen; Van Den Berg, David; Stram, Daniel O.; Benitez, Javier; Zamora, M. Pilar; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Qiuyin; Cox, Angela; Cross, Simon S.; Reed, Malcolm W. R.; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Tchatchou, Sandrine; Sawyer, Elinor J.; Tomlinson, Ian; Kerin, Michael J.; Miller, Nicola; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Lindblom, Annika; Margolin, Sara; TEO, Soo Hwang; YIP, Cheng Har; Lee, Daphne S. C.; Wong, Tien Y.; Hooning, Maartje J.; Martens, John W. M.; Collée, J. Margriet; van Deurzen, Carolien H. M.; Hopper, John L.; Southey, Melissa C.; Tsimiklis, Helen; Kapuscinski, Miroslav K.; Shen, Chen-Yang; Wu, Pei-Ei; Yu, Jyh-Cherng; Chen, Shou-Tung; Alnæs, Grethe Grenaker; Borresen-Dale, Anne-Lise; Giles, Graham G.; Milne, Roger L.; McLean, Catriona; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Hartman, Mikael; Miao, Hui; Buhari, Shaik Ahmad Bin Syed; Teo, Yik Ying; Fasching, Peter A.; Haeberle, Lothar; Ekici, Arif B.; Beckmann, Matthias W.; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Schoemaker, Minouk J.; García-Closas, Montserrat; Figueroa, Jonine; Chanock, Stephen J.; Lissowska, Jolanta; Simard, Jacques; Goldberg, Mark S.; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Brauch, Hiltrud; Brüning, Thomas; Koto, Yon-Dschun; Radice, Paolo; Peterlongo, Paolo; Bonanni, Bernardo; Volorio, Sara; Dörk, Thilo; Bogdanova, Natalia V.; Helbig, Sonja; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Devilee, Peter; Tollenaar, Robert A. E. M.; Seynaeve, Caroline; Van Asperen, Christi J.; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Slager, Susan; Toland, Amanda E.; Ambrosone, Christine B.; Yannoukakos, Drakoulis; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Hamann, Ute; Torres, Diana; Zheng, Wei; Long, Jirong; Anton-Culver, Hoda; Neuhausen, Susan L.; Luccarini, Craig; Baynes, Caroline; Ahmed, Shahana; Maranian, Mel; Healey, Catherine S.; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Álvarez, Nuria; Herrero, Daniel; Tessier, Daniel C.; Vincent, Daniel; Bacot, Francois; de Santiago, Ines; Carroll, Jason; Caldas, Carlos; Brown, Melissa A.; Lupien, Mathieu; Kristensen, Vessela N.; Pharoah, Paul D P; Chenevix-Trench, Georgia; French, Juliet D; Easton, Douglas F.; Dunning, Alison M.; Chenevix-Trench, Georgia; Webb, Penny; Bowtell, David; De Fazio, Anna
2014-01-01
GWAS have identified a breast cancer susceptibility locus on 2q35. Here we report the fine mapping of this locus using data from 101,943 subjects from 50 case-control studies. We genotype 276 SNPs using the ‘iCOGS’ genotyping array and impute genotypes for a further 1,284 using 1000 Genomes Project data. All but two, strongly correlated SNPs (rs4442975 G/T and rs6721996 G/A) are excluded as candidate causal variants at odds against >100:1. The best functional candidate, rs4442975, is associated with oestrogen receptor positive (ER+) disease with an odds ratio (OR) in Europeans of 0.85 (95% confidence interval=0.84−0.87; P=1.7 × 10−43) per t-allele. This SNP flanks a transcriptional enhancer that physically interacts with the promoter of IGFBP5 (encoding insulin-like growth factor-binding protein 5) and displays allele-specific gene expression, FOXA1 binding and chromatin looping. Evidence suggests that the g-allele confers increased breast cancer susceptibility through relative downregulation of IGFBP5, a gene with known roles in breast cell biology. PMID:25248036
Reddy, Umesh K; Almeida, Aldo; Abburi, Venkata L; Alaparthi, Suresh Babu; Unselt, Desiree; Hankins, Gerald; Park, Minkyu; Choi, Doil; Nimmakayala, Padma
2014-01-01
Pepper (Capsicum annuum L.) is an economically important crop with added nutritional value. Production of capsaicin is an important quantitative trait with high environmental variance, so the development of markers regulating capsaicinoid accumulation is important for pepper breeding programs. In this study, we performed association mapping at the gene level to identify single nucleotide polymorphisms (SNPs) associated with capsaicin pathway metabolites in a diverse Capsicum annuum collection during two seasons. The genes Pun1, CCR, KAS and HCT were sequenced and matched with the whole-genome sequence draft of pepper to identify SNP locations and for further characterization. The identified SNPs for each gene underwent candidate gene association mapping. Association mapping results revealed Pun1 as a key regulator of major metabolites in the capsaicin pathway mainly affecting capsaicinoids and precursors for acyl moieties of capsaicinoids. Six different SNPs in the promoter sequence of Pun1 were found associated with capsaicin in plants from both seasons. Our results support that CCR is an important control point for the flux of p-coumaric acid to specific biosynthesis pathways. KAS was found to regulate the major precursors for acyl moieties of capsaicinoids and may play a key role in capsaicinoid production. Candidate gene association mapping of Pun1 suggested that the accumulation of capsaicinoids depends on the expression of Pun1, as revealed by the most important associated SNPs found in the promoter region of Pun1.
Abburi, Venkata L.; Alaparthi, Suresh Babu; Unselt, Desiree; Hankins, Gerald; Park, Minkyu; Choi, Doil
2014-01-01
Pepper (Capsicum annuum L.) is an economically important crop with added nutritional value. Production of capsaicin is an important quantitative trait with high environmental variance, so the development of markers regulating capsaicinoid accumulation is important for pepper breeding programs. In this study, we performed association mapping at the gene level to identify single nucleotide polymorphisms (SNPs) associated with capsaicin pathway metabolites in a diverse Capsicum annuum collection during two seasons. The genes Pun1, CCR, KAS and HCT were sequenced and matched with the whole-genome sequence draft of pepper to identify SNP locations and for further characterization. The identified SNPs for each gene underwent candidate gene association mapping. Association mapping results revealed Pun1 as a key regulator of major metabolites in the capsaicin pathway mainly affecting capsaicinoids and precursors for acyl moieties of capsaicinoids. Six different SNPs in the promoter sequence of Pun1 were found associated with capsaicin in plants from both seasons. Our results support that CCR is an important control point for the flux of p-coumaric acid to specific biosynthesis pathways. KAS was found to regulate the major precursors for acyl moieties of capsaicinoids and may play a key role in capsaicinoid production. Candidate gene association mapping of Pun1 suggested that the accumulation of capsaicinoids depends on the expression of Pun1, as revealed by the most important associated SNPs found in the promoter region of Pun1. PMID:24475113
A second generation human haplotype map of over 3.1 million SNPs.
Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John
2007-10-18
We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r2 of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r2 of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.
Rashid, Zerka; Singh, Pradeep Kumar; Vemuri, Hindu; Zaidi, Pervez Haider; Prasanna, Boddupalli Maruthi; Nair, Sudha Krishnan
2018-01-10
Globally, downy mildews are among the important foliar diseases of maize that cause significant yield losses. We conducted a genome-wide association study for sorghum downy mildew (SDM; Peronosclerospora sorghi) resistance in a panel of 368 inbred lines adapted to the Asian tropics. High density SNPs from Genotyping-by-sequencing were used in GWAS after controlling for population structure and kinship in the panel using a single locus mixed model. The study identified a set of 26 SNPs that were significantly associated with SDM resistance, with Bonferroni corrected P values ≤ 0.05. Among all the identified SNPs, the minor alleles were found to be favorable to SDM resistance in the mapping panel. Trend regression analysis with 16 independent genetic variants including 12 SNPs and four haplotype blocks identified SNP S2_6154311 on chromosome 2 with P value 2.61E-24 and contributing 26.7% of the phenotypic variation. Six of the SNPs/haplotypes were within the same chromosomal bins as the QTLs for SDM resistance mapped in previous studies. Apart from this, eight novel genomic regions for SDM resistance were identified in this study; they need further validation before being applied in the breeding pipeline. Ten SNPs identified in this study were co-located in reported mildew resistance genes.
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
2017-01-01
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
Li, Feng; Chen, Biyun; Xu, Kun; Wu, Jinfeng; Song, Weilin; Bancroft, Ian; Harper, Andrea L.; Trick, Martin; Liu, Shengyi; Gao, Guizhen; Wang, Nian; Yan, Guixin; Qiao, Jiangwei; Li, Jun; Li, Hao; Xiao, Xin; Zhang, Tianyao; Wu, Xiaoming
2014-01-01
Association mapping can quickly and efficiently dissect complex agronomic traits. Rapeseed is one of the most economically important polyploid oil crops, although its genome sequence is not yet published. In this study, a recently developed 60K Brassica Infinium® SNP array was used to analyse an association panel with 472 accessions. The single-nucleotide polymorphisms (SNPs) of the array were in silico mapped using ‘pseudomolecules’ representative of the genome of rapeseed to establish their hypothetical order and to perform association mapping of seed weight and seed quality. As a result, two significant associations on A8 and C3 of Brassica napus were detected for erucic acid content, and the peak SNPs were found to be only 233 and 128 kb away from the key genes BnaA.FAE1 and BnaC.FAE1. BnaA.FAE1 was also identified to be significantly associated with the oil content. Orthologues of Arabidopsis thaliana HAG1 were identified close to four clusters of SNPs associated with glucosinolate content on A9, C2, C7 and C9. For seed weight, we detected two association signals on A7 and A9, which were consistent with previous studies of quantitative trait loci mapping. The results indicate that our association mapping approach is suitable for fine mapping of the complex traits in rapeseed. PMID:24510440
Fine Mapping Causal Variants with an Approximate Bayesian Method Using Marginal Test Statistics
Chen, Wenan; Larrabee, Beth R.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Haralambieva, Iana H.; Poland, Gregory A.; Schaid, Daniel J.
2015-01-01
Two recently developed fine-mapping methods, CAVIAR and PAINTOR, demonstrate better performance over other fine-mapping methods. They also have the advantage of using only the marginal test statistics and the correlation among SNPs. Both methods leverage the fact that the marginal test statistics asymptotically follow a multivariate normal distribution and are likelihood based. However, their relationship with Bayesian fine mapping, such as BIMBAM, is not clear. In this study, we first show that CAVIAR and BIMBAM are actually approximately equivalent to each other. This leads to a fine-mapping method using marginal test statistics in the Bayesian framework, which we call CAVIAR Bayes factor (CAVIARBF). Another advantage of the Bayesian framework is that it can answer both association and fine-mapping questions. We also used simulations to compare CAVIARBF with other methods under different numbers of causal variants. The results showed that both CAVIARBF and BIMBAM have better performance than PAINTOR and other methods. Compared to BIMBAM, CAVIARBF has the advantage of using only marginal test statistics and takes about one-quarter to one-fifth of the running time. We applied different methods on two independent cohorts of the same phenotype. Results showed that CAVIARBF, BIMBAM, and PAINTOR selected the same top 3 SNPs; however, CAVIARBF and BIMBAM had better consistency in selecting the top 10 ranked SNPs between the two cohorts. Software is available at https://bitbucket.org/Wenan/caviarbf. PMID:25948564
Genome-wide SNP identification and QTL mapping for black rot resistance in cabbage.
Lee, Jonghoon; Izzah, Nur Kholilatul; Jayakodi, Murukarthick; Perumal, Sampath; Joh, Ho Jun; Lee, Hyeon Ju; Lee, Sang-Choon; Park, Jee Young; Yang, Ki-Woung; Nou, Il-Sup; Seo, Joodeok; Yoo, Jaeheung; Suh, Youngdeok; Ahn, Kyounggu; Lee, Ji Hyun; Choi, Gyung Ja; Yu, Yeisoo; Kim, Heebal; Yang, Tae-Jin
2015-02-03
Black rot is a destructive bacterial disease causing large yield and quality losses in Brassica oleracea. To detect quantitative trait loci (QTL) for black rot resistance, we performed whole-genome resequencing of two cabbage parental lines and genome-wide SNP identification using the recently published B. oleracea genome sequences as reference. Approximately 11.5 Gb of sequencing data was produced from each parental line. Reference genome-guided mapping and SNP calling revealed 674,521 SNPs between the two cabbage lines, with an average of one SNP per 662.5 bp. Among 167 dCAPS markers derived from candidate SNPs, 117 (70.1%) were validated as bona fide SNPs showing polymorphism between the parental lines. We then improved the resolution of a previous genetic map by adding 103 markers including 87 SNP-based dCAPS markers. The new map composed of 368 markers and covers 1467.3 cM with an average interval of 3.88 cM between adjacent markers. We evaluated black rot resistance in the mapping population in three independent inoculation tests using F2:3 progenies and identified one major QTL and three minor QTLs. We report successful utilization of whole-genome resequencing for large-scale SNP identification and development of molecular markers for genetic map construction. In addition, we identified novel QTLs for black rot resistance. The high-density genetic map will promote QTL analysis for other important agricultural traits and marker-assisted breeding of B. oleracea.
Quantitative trait locus mapping of deep rooting by linkage and association analysis in rice
Lou, Qiaojun; Chen, Liang; Mei, Hanwei; Wei, Haibin; Feng, Fangjun; Wang, Pei; Xia, Hui; Li, Tiemei; Luo, Lijun
2015-01-01
Deep rooting is a very important trait for plants’ drought avoidance, and it is usually represented by the ratio of deep rooting (RDR). Three sets of rice populations were used to determine the genetic base for RDR. A linkage mapping population with 180 recombinant inbred lines and an association mapping population containing 237 rice varieties were used to identify genes linked to RDR. Six quantitative trait loci (QTLs) of RDR were identified as being located on chromosomes 1, 2, 4, 7, and 10. Using 1 019 883 single-nucleotide polymorphisms (SNPs), a genome-wide association study of the RDR was performed. Forty-eight significant SNPs of the RDR were identified and formed a clear peak on the short arm of chromosome 1 in a Manhattan plot. Compared with the shallow-rooting group and the whole collection, the deep-rooting group had selective sweep regions on chromosomes 1 and 2, especially in the major QTL region on chromosome 2. Seven of the nine candidate SNPs identified by association mapping were verified in two RDR extreme groups. The findings from this study will be beneficial to rice drought-resistance research and breeding. PMID:26022253
Genotyping-by-sequencing enables linkage mapping in three octoploid cultivated strawberry families
Salinas, Natalia; Tennessen, Jacob A.; Zurn, Jason D.; Sargent, Daniel James; Hancock, James; Bassil, Nahla V.
2017-01-01
Genotyping-by-sequencing (GBS) was used to survey genome-wide single-nucleotide polymorphisms (SNPs) in three biparental strawberry (Fragaria × ananassa) populations with the goal of evaluating this technique in a species with a complex octoploid genome. GBS sequence data were aligned to the F. vesca ‘Fvb’ reference genome in order to call SNPs. Numbers of polymorphic SNPs per population ranged from 1,163 to 3,190. Linkage maps consisting of 30–65 linkage groups were produced from the SNP sets derived from each parent. The linkage groups covered 99% of the Fvb reference genome, with three to seven linkage groups from a given parent aligned to any particular chromosome. A phylogenetic analysis performed using the POLiMAPS pipeline revealed linkage groups that were most similar to ancestral species F. vesca for each chromosome. Linkage groups that were most similar to a second ancestral species, F. iinumae, were only resolved for Fvb 4. The quantity of missing data and heterogeneity in genome coverage inherent in GBS complicated the analysis, but POLiMAPS resolved F. × ananassa chromosomal regions derived from diploid ancestor F. vesca. PMID:28875078
Buchanan, Carrie C; Torstenson, Eric S; Bush, William S
2012-01-01
Background Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. Methods To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Results Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Conclusions Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects. PMID:22319179
Martin, Eden R.; Lai, Eric H.; Gilbert, John R.; Rogala, Allison R.; Afshari, A. J.; Riley, John; Finch, K. L.; Stevens, J. F.; Livak, K. J.; Slotterbeck, Brandon D.; Slifer, Susan H.; Warren, Liling L.; Conneally, P. Michael; Schmechel, Donald E.; Purvis, Ian; Pericak-Vance, Margaret A.; Roses, Allen D.; Vance, Jeffery M.
2000-01-01
There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P⩽.05) was identified for 7 of 13 SNPs, including the APOE-4 polymorphism, spanning 40 kb on either side of APOE. As expected, very strong evidence for association with AD was seen for the APOE-4 polymorphism, as well as for two other SNPs that lie <16 kb from APOE. Haplotype analysis using family data increased significance over that seen in single-locus tests for some of the markers, and, for these data, improved localization of the gene. Our results demonstrate that associations can be detected at SNPs near a complex disease gene. We found that a high density of markers will be necessary in order to have a good chance of including SNPs with detectable levels of allelic association with the disease mutation, and statistical analysis based on haplotypes can provide additional information with respect to tests of significance and fine localization of complex disease genes. PMID:10869235
Martin, E R; Lai, E H; Gilbert, J R; Rogala, A R; Afshari, A J; Riley, J; Finch, K L; Stevens, J F; Livak, K J; Slotterbeck, B D; Slifer, S H; Warren, L L; Conneally, P M; Schmechel, D E; Purvis, I; Pericak-Vance, M A; Roses, A D; Vance, J M
2000-08-01
There has been great interest in the prospects of using single-nucleotide polymorphisms (SNPs) in the search for complex disease genes, and several initiatives devoted to the identification and mapping of SNPs throughout the human genome are currently underway. However, actual data investigating the use of SNPs for identification of complex disease genes are scarce. To begin to look at issues surrounding the use of SNPs in complex disease studies, we have initiated a collaborative SNP mapping study around APOE, the well-established susceptibility gene for late-onset Alzheimer disease (AD). Sixty SNPs in a 1.5-Mb region surrounding APOE were genotyped in samples of unrelated cases of AD, in controls, and in families with AD. Standard tests were conducted to look for association of SNP alleles with AD, in cases and controls. We also used family-based association analyses, including recently developed methods to look for haplotype association. Evidence of association (P=.05) was identified for 7 of 13 SNPs, including the APOE-4 polymorphism, spanning 40 kb on either side of APOE. As expected, very strong evidence for association with AD was seen for the APOE-4 polymorphism, as well as for two other SNPs that lie <16 kb from APOE. Haplotype analysis using family data increased significance over that seen in single-locus tests for some of the markers, and, for these data, improved localization of the gene. Our results demonstrate that associations can be detected at SNPs near a complex disease gene. We found that a high density of markers will be necessary in order to have a good chance of including SNPs with detectable levels of allelic association with the disease mutation, and statistical analysis based on haplotypes can provide additional information with respect to tests of significance and fine localization of complex disease genes.
Molecular mechanisms underlying variations in lung function: a systems genetics analysis
Obeidat, Ma’en; Hao, Ke; Bossé, Yohan; Nickle, David C; Nie, Yunlong; Postma, Dirkje S; Laviolette, Michel; Sandford, Andrew J; Daley, Denise D; Hogg, James C; Elliott, W Mark; Fishbane, Nick; Timens, Wim; Hysi, Pirro G; Kaprio, Jaakko; Wilson, James F; Hui, Jennie; Rawal, Rajesh; Schulz, Holger; Stubbe, Beate; Hayward, Caroline; Polasek, Ozren; Järvelin, Marjo-Riitta; Zhao, Jing Hua; Jarvis, Deborah; Kähönen, Mika; Franceschini, Nora; North, Kari E; Loth, Daan W; Brusselle, Guy G; Smith, Albert Vernon; Gudnason, Vilmundur; Bartz, Traci M; Wilk, Jemma B; O’Connor, George T; Cassano, Patricia A; Tang, Wenbo; Wain, Louise V; Artigas, María Soler; Gharib, Sina A; Strachan, David P; Sin, Don D; Tobin, Martin D; London, Stephanie J; Hall, Ian P; Paré, Peter D
2016-01-01
Summary Background Lung function measures reflect the physiological state of the lung, and are essential to the diagnosis of chronic obstructive pulmonary disease (COPD). The SpiroMeta-CHARGE consortium undertook the largest genome-wide association study (GWAS) so far (n=48 201) for forced expiratory volume in 1 s (FEV1) and the ratio of FEV1 to forced vital capacity (FEV1/FVC) in the general population. The lung expression quantitative trait loci (eQTLs) study mapped the genetic architecture of gene expression in lung tissue from 1111 individuals. We used a systems genetics approach to identify single nucleotide polymorphisms (SNPs) associated with lung function that act as eQTLs and change the level of expression of their target genes in lung tissue; termed eSNPs. Methods The SpiroMeta-CHARGE GWAS results were integrated with lung eQTLs to map eSNPs and the genes and pathways underlying the associations in lung tissue. For comparison, a similar analysis was done in peripheral blood. The lung mRNA expression levels of the eSNP-regulated genes were tested for associations with lung function measures in 727 individuals. Additional analyses identified the pleiotropic effects of eSNPs from the published GWAS catalogue, and mapped enrichment in regulatory regions from the ENCODE project. Finally, the Connectivity Map database was used to identify potential therapeutics in silico that could reverse the COPD lung tissue gene signature. Findings SNPs associated with lung function measures were more likely to be eQTLs and vice versa. The integration mapped the specific genes underlying the GWAS signals in lung tissue. The eSNP-regulated genes were enriched for developmental and inflammatory pathways; by comparison, SNPs associated with lung function that were eQTLs in blood, but not in lung, were only involved in inflammatory pathways. Lung function eSNPs were enriched for regulatory elements and were over-represented among genes showing differential expression during fetal lung development. An mRNA gene expression signature for COPD was identified in lung tissue and compared with the Connectivity Map. This in-silico drug repurposing approach suggested several compounds that reverse the COPD gene expression signature, including a nicotine receptor antagonist. These findings represent novel therapeutic pathways for COPD. Interpretation The system genetics approach identified lung tissue genes driving the variation in lung function and susceptibility to COPD. The identification of these genes and the pathways in which they are enriched is essential to understand the pathophysiology of airway obstruction and to identify novel therapeutic targets and biomarkers for COPD, including drugs that reverse the COPD gene signature in silico. Funding The research reported in this article was not specifically funded by any agency. See Acknowledgments for a full list of funders of the lung eQTL study and the Spiro-Meta CHARGE GWAS. PMID:26404118
Ma, Yu; Coyne, Clarice J; Grusak, Michael A; Mazourek, Michael; Cheng, Peng; Main, Dorrie; McGee, Rebecca J
2017-02-13
Marker-assisted breeding is now routinely used in major crops to facilitate more efficient cultivar improvement. This has been significantly enabled by the use of next-generation sequencing technology to identify loci and markers associated with traits of interest. While rich in a range of nutritional components, such as protein, mineral nutrients, carbohydrates and several vitamins, pea (Pisum sativum L.), one of the oldest domesticated crops in the world, remains behind many other crops in the availability of genomic and genetic resources. To further improve mineral nutrient levels in pea seeds requires the development of genome-wide tools. The objectives of this research were to develop these tools by: identifying genome-wide single nucleotide polymorphisms (SNPs) using genotyping by sequencing (GBS); constructing a high-density linkage map and comparative maps with other legumes, and identifying quantitative trait loci (QTL) for levels of boron, calcium, iron, potassium, magnesium, manganese, molybdenum, phosphorous, sulfur, and zinc in the seed, as well as for seed weight. In this study, 1609 high quality SNPs were found to be polymorphic between 'Kiflica' and 'Aragorn', two parents of an F 6 -derived recombinant inbred line (RIL) population. Mapping 1683 markers including 75 previously published markers and 1608 SNPs developed from the present study generated a linkage map of size 1310.1 cM. Comparative mapping with other legumes demonstrated that the highest level of synteny was observed between pea and the genome of Medicago truncatula. QTL analysis of the RIL population across two locations revealed at least one QTL for each of the mineral nutrient traits. In total, 46 seed mineral concentration QTLs, 37 seed mineral content QTLs, and 6 seed weight QTLs were discovered. The QTLs explained from 2.4% to 43.3% of the phenotypic variance. The genome-wide SNPs and the genetic linkage map developed in this study permitted QTL identification for pea seed mineral nutrients that will serve as important resources to enable marker-assisted selection (MAS) for nutritional quality traits in pea breeding programs.
Fine-Scale Map of Encyclopedia of DNA Elements Regions in the Korean Population
Yoo, Yeon-Kyeong; Ke, Xiayi; Hong, Sungwoo; Jang, Hye-Yoon; Park, Kyunghee; Kim, Sook; Ahn, TaeJin; Lee, Yeun-Du; Song, Okryeol; Rho, Na-Young; Lee, Moon Sue; Lee, Yeon-Su; Kim, Jaeheup; Kim, Young J.; Yang, Jun-Mo; Song, Kyuyoung; Kimm, Kyuchan; Weir, Bruce; Cardon, Lon R.; Lee, Jong-Eun; Hwang, Jung-Joo
2006-01-01
The International HapMap Project aims to generate detailed human genome variation maps by densely genotyping single-nucleotide polymorphisms (SNPs) in CEPH, Chinese, Japanese, and Yoruba samples. This will undoubtedly become an important facility for genetic studies of diseases and complex traits in the four populations. To address how the genetic information contained in such variation maps is transferable to other populations, the Korean government, industries, and academics have launched the Korean HapMap project to genotype high-density Encyclopedia of DNA Elements (ENCODE) regions in 90 Korean individuals. Here we show that the LD pattern, block structure, haplotype diversity, and recombination rate are highly concordant between Korean and the two HapMap Asian samples, particularly Japanese. The availability of information from both Chinese and Japanese samples helps to predict more accurately the possible performance of HapMap markers in Korean disease-gene studies. Tagging SNPs selected from the two HapMap Asian maps, especially the Japanese map, were shown to be very effective for Korean samples. These results demonstrate that the HapMap variation maps are robust in related populations and will serve as an important resource for the studies of the Korean population in particular. PMID:16702437
Grams, Vanessa; Wellmann, Robin; Preuß, Siegfried; Grashorn, Michael A; Kjaer, Jörgen B; Bessei, Werner; Bennewitz, Jörn
2015-09-30
Feather pecking (FP) in laying hens is a well-known and multi-factorial behaviour with a genetic background. In a selection experiment, two lines were developed for 11 generations for high (HFP) and low (LFP) feather pecking, respectively. Starting with the second generation of selection, there was a constant difference in mean number of FP bouts between both lines. We used the data from this experiment to perform a quantitative genetic analysis and to map selection signatures. Pedigree and phenotypic data were available for the last six generations of both lines. Univariate quantitative genetic analyses were conducted using mixed linear and generalized mixed linear models assuming a Poisson distribution. Selection signatures were mapped using 33,228 single nucleotide polymorphisms (SNPs) genotyped on 41 HFP and 34 LFP individuals of generation 11. For each SNP, we estimated Wright's fixation index (FST). We tested the null hypothesis that FST is driven purely by genetic drift against the alternative hypothesis that it is driven by genetic drift and selection. The mixed linear model failed to analyze the LFP data because of the large number of 0s in the observation vector. The Poisson model fitted the data well and revealed a small but continuous genetic trend in both lines. Most of the 17 genome-wide significant SNPs were located on chromosomes 3 and 4. Thirteen clusters with at least two significant SNPs within an interval of 3 Mb maximum were identified. Two clusters were mapped on chromosomes 3, 4, 8 and 19. Of the 17 genome-wide significant SNPs, 12 were located within the identified clusters. This indicates a non-random distribution of significant SNPs and points to the presence of selection sweeps. Data on FP should be analysed using generalised linear mixed models assuming a Poisson distribution, especially if the number of FP bouts is small and the distribution is heavily peaked at 0. The FST-based approach was suitable to map selection signatures that need to be confirmed by linkage or association mapping.
Baranski, Matthew; Gopikrishna, Gopalapillay; Robinson, Nicholas A.; Katneni, Vinaya Kumar; Shekhar, Mudagandur S.; Shanmugakarthik, Jayakani; Jothivel, Sarangapani; Gopal, Chavali; Ravichandran, Pitchaiyappan; Kent, Matthew; Arnyasi, Mariann; Ponniah, Alphis G.
2014-01-01
Transcriptome sequencing using Illumina RNA-seq was performed on populations of black tiger shrimp from India. Samples were collected from (i) four landing centres around the east coastline (EC) of India, (ii) survivors of a severe WSSV infection during pond culture (SUR) and (iii) the Andaman Islands (AI) in the Bay of Bengal. Equal quantities of purified total RNA from homogenates of hepatopancreas, muscle, nervous tissue, intestinal tract, heart, gonad, gills, pleopod and lymphoid organs were combined to create AI, EC and SUR pools for RNA sequencing. De novo transcriptome assembly resulted in 136,223 contigs (minimum size 100 base pairs, bp) with a total length 61 Mb, an average length of 446 bp and an average coverage of 163× across all pools. Approximately 16% of contigs were annotated with BLAST hit information and gene ontology annotations. A total of 473,620 putative SNPs/indels were identified. An Illumina iSelect genotyping array containing 6,000 SNPs was developed and used to genotype 1024 offspring belonging to seven full-sibling families. A total of 3959 SNPs were mapped to 44 linkage groups. The linkage groups consisted of between 16–129 and 13–130 markers, of length between 139–10.8 and 109.1–10.5 cM and with intervals averaging between 1.2 and 0.9 cM for the female and male maps respectively. The female map was 28% longer than the male map (4060 and 2917 cM respectively) with a 1.6 higher recombination rate observed for female compared to male meioses. This approach has substantially increased expressed sequence and DNA marker resources for tiger shrimp and is a useful resource for QTL mapping and association studies for evolutionarily and commercially important traits. PMID:24465553
Integrating common and rare genetic variation in diverse human populations.
Altshuler, David M; Gibbs, Richard A; Peltonen, Leena; Altshuler, David M; Gibbs, Richard A; Peltonen, Leena; Dermitzakis, Emmanouil; Schaffner, Stephen F; Yu, Fuli; Peltonen, Leena; Dermitzakis, Emmanouil; Bonnen, Penelope E; Altshuler, David M; Gibbs, Richard A; de Bakker, Paul I W; Deloukas, Panos; Gabriel, Stacey B; Gwilliam, Rhian; Hunt, Sarah; Inouye, Michael; Jia, Xiaoming; Palotie, Aarno; Parkin, Melissa; Whittaker, Pamela; Yu, Fuli; Chang, Kyle; Hawes, Alicia; Lewis, Lora R; Ren, Yanru; Wheeler, David; Gibbs, Richard A; Muzny, Donna Marie; Barnes, Chris; Darvishi, Katayoon; Hurles, Matthew; Korn, Joshua M; Kristiansson, Kati; Lee, Charles; McCarrol, Steven A; Nemesh, James; Dermitzakis, Emmanouil; Keinan, Alon; Montgomery, Stephen B; Pollack, Samuela; Price, Alkes L; Soranzo, Nicole; Bonnen, Penelope E; Gibbs, Richard A; Gonzaga-Jauregui, Claudia; Keinan, Alon; Price, Alkes L; Yu, Fuli; Anttila, Verneri; Brodeur, Wendy; Daly, Mark J; Leslie, Stephen; McVean, Gil; Moutsianas, Loukas; Nguyen, Huy; Schaffner, Stephen F; Zhang, Qingrun; Ghori, Mohammed J R; McGinnis, Ralph; McLaren, William; Pollack, Samuela; Price, Alkes L; Schaffner, Stephen F; Takeuchi, Fumihiko; Grossman, Sharon R; Shlyakhter, Ilya; Hostetter, Elizabeth B; Sabeti, Pardis C; Adebamowo, Clement A; Foster, Morris W; Gordon, Deborah R; Licinio, Julio; Manca, Maria Cristina; Marshall, Patricia A; Matsuda, Ichiro; Ngare, Duncan; Wang, Vivian Ota; Reddy, Deepa; Rotimi, Charles N; Royal, Charmaine D; Sharp, Richard R; Zeng, Changqing; Brooks, Lisa D; McEwen, Jean E
2010-09-02
Despite great progress in identifying genetic variants that influence human disease, most inherited risk remains unexplained. A more complete understanding requires genome-wide studies that fully examine less common alleles in populations with a wide range of ancestry. To inform the design and interpretation of such studies, we genotyped 1.6 million common single nucleotide polymorphisms (SNPs) in 1,184 reference individuals from 11 global populations, and sequenced ten 100-kilobase regions in 692 of these individuals. This integrated data set of common and rare alleles, called 'HapMap 3', includes both SNPs and copy number polymorphisms (CNPs). We characterized population-specific differences among low-frequency variants, measured the improvement in imputation accuracy afforded by the larger reference panel, especially in imputing SNPs with a minor allele frequency of
Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Schlautman, Brandon; Deutsch, Joseph; Salazar, Walter; Hernandez-Ochoa, Miguel; Grygleski, Edward; Steffan, Shawn; Iorizzo, Massimo; Polashock, James; Vorsa, Nicholi; Zalapa, Juan
2016-06-13
The application of genotyping by sequencing (GBS) approaches, combined with data imputation methodologies, is narrowing the genetic knowledge gap between major and understudied, minor crops. GBS is an excellent tool to characterize the genomic structure of recently domesticated (~200 years) and understudied species, such as cranberry (Vaccinium macrocarpon Ait.), by generating large numbers of markers for genomic studies such as genetic mapping. We identified 10842 potentially mappable single nucleotide polymorphisms (SNPs) in a cranberry pseudo-testcross population wherein 5477 SNPs and 211 short sequence repeats (SSRs) were used to construct a high density linkage map in cranberry of which a total of 4849 markers were mapped. Recombination frequency, linkage disequilibrium (LD), and segregation distortion at the genomic level in the parental and integrated linkage maps were characterized for first time in cranberry. SSR markers, used as the backbone in the map, revealed high collinearity with previously published linkage maps. The 4849 point map consisted of twelve linkage groups spanning 1112 cM, which anchored 2381 nuclear scaffolds accounting for ~13 Mb of the estimated 470 Mb cranberry genome. Bin mapping identified 592 and 672 unique bins in the parentals and a total of 1676 unique marker positions in the integrated map. Synteny analyses comparing the order of anchored cranberry scaffolds to their homologous positions in kiwifruit, grape, and coffee genomes provided initial evidence of homology between cranberry and closely related species. GBS data was used to rapidly saturate the cranberry genome with markers in a pseudo-testcross population. Collinearity between the present saturated genetic map and previous cranberry SSR maps suggests that the SNP locations represent accurate marker order and chromosome structure of the cranberry genome. SNPs greatly improved current marker genome coverage, which allowed for genome-wide structure investigations such as segregation distortion, recombination, linkage disequilibrium, and synteny analyses. In the future, GBS can be used to accelerate cranberry molecular breeding through QTL mapping and genome-wide association studies (GWAS).
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs
Cheng, Zhongshan; Chu, Hin; Fan, Yanhui; Li, Cun; Song, You-Qiang; Zhou, Jie; Yuen, Kwok-Yung
2015-01-01
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated a LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable; while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found cis-eQTLs, especially the population-shared cis-eQTLs are significantly enriched toward transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline Post-GWAS Explorer for Functional Indels and SNPs (PExFInS) has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets. PMID:26612672
Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M; Rodezno, Dayana C; Suarez, Carmen; Amores, Freddy; Feltus, Frank A; Mockaitis, Keithanne; Cornejo, Omar E; Motamayor, Juan C
2017-01-01
Cacao ( Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance.
Fine Mapping Causal Variants with an Approximate Bayesian Method Using Marginal Test Statistics.
Chen, Wenan; Larrabee, Beth R; Ovsyannikova, Inna G; Kennedy, Richard B; Haralambieva, Iana H; Poland, Gregory A; Schaid, Daniel J
2015-07-01
Two recently developed fine-mapping methods, CAVIAR and PAINTOR, demonstrate better performance over other fine-mapping methods. They also have the advantage of using only the marginal test statistics and the correlation among SNPs. Both methods leverage the fact that the marginal test statistics asymptotically follow a multivariate normal distribution and are likelihood based. However, their relationship with Bayesian fine mapping, such as BIMBAM, is not clear. In this study, we first show that CAVIAR and BIMBAM are actually approximately equivalent to each other. This leads to a fine-mapping method using marginal test statistics in the Bayesian framework, which we call CAVIAR Bayes factor (CAVIARBF). Another advantage of the Bayesian framework is that it can answer both association and fine-mapping questions. We also used simulations to compare CAVIARBF with other methods under different numbers of causal variants. The results showed that both CAVIARBF and BIMBAM have better performance than PAINTOR and other methods. Compared to BIMBAM, CAVIARBF has the advantage of using only marginal test statistics and takes about one-quarter to one-fifth of the running time. We applied different methods on two independent cohorts of the same phenotype. Results showed that CAVIARBF, BIMBAM, and PAINTOR selected the same top 3 SNPs; however, CAVIARBF and BIMBAM had better consistency in selecting the top 10 ranked SNPs between the two cohorts. Software is available at https://bitbucket.org/Wenan/caviarbf. Copyright © 2015 by the Genetics Society of America.
Guinan, Eva C; Palmer, Christine D; Mancuso, Christy J; Brennan, Lisa; Stoler-Barak, Liat; Kalish, Leslie A; Suter, Eugenie E; Gallington, Leighanne C; Huhtelin, David P; Mansilla, Maria; Schumann, Ralf R; Murray, Jeffrey C; Weiss, Jerrold; Levy, Ofer
2014-10-01
Hematopoietic cell transplant (HCT) is a life-saving therapy for many malignant and non-malignant bone marrow diseases. Associated morbidities are often due to transplant-related toxicities and infections, exacerbated by regimen-induced immune suppression and systemic incursion of bacterial products. Patients undergoing myeloablative conditioning for HCT become endotoxemic and display blood/plasma changes consistent with lipopolysaccharide (LPS)-induced systemic innate immune activation. Herein, we addressed whether patients scheduled for HCT display differences in recognition/response to LPS ex vivo traceable to specific single nucleotide polymorphisms (SNPs). Two SNPs of LPS binding protein (LBP) were associated with changes in plasma LBP levels, with one LBP SNP also associating with differences in efficiency of extraction and transfer of endotoxin to myeloid differentiation factor-2 (MD-2), a step needed for activation of TLR4. None of the examined SNPs of CD14, bactericidal/permeability-increasing protein (BPI), TLR4 or MD-2 were associated with corresponding protein plasma levels or endotoxin delivery to MD-2, but CD14 and BPI SNPs significantly associated with differences in LPS-induced TNF-α release ex vivo and infection frequency, respectively. These findings suggest that specific LBP, CD14 and BPI SNPs might be contributory assessments in studies where clinical outcome may be affected by host response to endotoxin and bacterial infection. © The Author(s) 2013 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
A High-Density Admixture Map for Disease Gene Discovery in African Americans
Smith, Michael W. ; Patterson, Nick ; Lautenberger, James A. ; Truelove, Ann L. ; McDonald, Gavin J. ; Waliszewska, Alicja ; Kessing, Bailey D. ; Malasky, Michael J. ; Scafe, Charles ; Le, Ernest ; De Jager, Philip L. ; Mignault, Andre A. ; Yi, Zeng ; de Thé, Guy ; Essex, Myron ; Sankalé, Jean-Louis ; Moore, Jason H. ; Poku, Kwabena ; Phair, John P. ; Goedert, James J. ; Vlahov, David ; Williams, Scott M. ; Tishkoff, Sarah A. ; Winkler, Cheryl A. ; De La Vega, Francisco M. ; Woodage, Trevor ; Sninsky, John J. ; Hafler, David A. ; Altshuler, David ; Gilbert, Dennis A. ; O’Brien, Stephen J. ; Reich, David
2004-01-01
Admixture mapping (also known as “mapping by admixture linkage disequilibrium,” or MALD) provides a way of localizing genes that cause disease, in admixed ethnic groups such as African Americans, with ∼100 times fewer markers than are required for whole-genome haplotype scans. However, it has not been possible to perform powerful scans with admixture mapping because the method requires a dense map of validated markers known to have large frequency differences between Europeans and Africans. To create such a map, we screened through databases containing ∼450,000 single-nucleotide polymorphisms (SNPs) for which frequencies had been estimated in African and European population samples. We experimentally confirmed the frequencies of the most promising SNPs in a multiethnic panel of unrelated samples and identified 3,011 as a MALD map (1.2 cM average spacing). We estimate that this map is ∼70% informative in differentiating African versus European origins of chromosomal segments. This map provides a practical and powerful tool, which is freely available without restriction, for screening for disease genes in African American patient cohorts. The map is especially appropriate for those diseases that differ in incidence between the parental African and European populations. PMID:15088270
DOE Office of Scientific and Technical Information (OSTI.GOV)
Song, Qijian; Jia, Gaofeng; Hyten, David L.
A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of largemore » scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.« less
Song, Qijian; Jia, Gaofeng; Hyten, David L.; ...
2015-08-28
A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of largemore » scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.« less
Leak, Tennille S.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Keene, Keith L.; Gordon, Candace J.; Hicks, Pamela J.; Freedman, Barry I.; Bowden, Donald W.; Sale, Michèle M.
2009-01-01
Previously we performed a genome scan for type 2 diabetes (T2DM) using 638 African-American (AA) affected sibling pairs from 247 families; non-parametric linkage analysis suggested evidence of linkage at 6q24-27 (LOD 2.26). To comprehensively evaluate this region we performed a 2-stage association study by first constructing a SNP map of 754 SNPs selected from HapMap on the basis of linkage disequilibrium (LD) in 300 AAT2DM-ESRD subjects, 311 AA controls, 43 European American controls and 45 Yoruba Nigerian samples (Set 1). Replication analyses were conducted in an independent population of 283 AA T2DM-ESRD subjects and 282 AA controls (Set 2). In addition, we adjusted for the impact of admixture on association results by using ancestry informative markers (AIMs). In Stage 1, 137 (18.2%) SNPs showed nominal evidence of association (P<0.05) in one or more of tests of association: allelic (n=33), dominant (n=36), additive (n=29), or recessive (n=34) genotypic models, and 2- (n=47) and 3-SNP (n=43) haplotypic analyses. These SNPs were selected for follow-up genotyping. Stage 2 analyses confirmed association with a predicted 2-SNP “risk” haplotype in the PARK2 gene. Also, two intergenic SNPs showed consistent genotypic association with T2DM-ESRD: rs12197043 and rs4897081. Combined analysis of all subjects from both stages revealed nominal associations with 17 SNPs within genes; including suggestive associations in ESR1 and PARK2. This study confirms known diabetic nephropathy loci and identifies potentially novel susceptibility variants located within 6q24-27 in AA. PMID:18560894
Humble, E; Martinez-Barrio, A; Forcada, J; Trathan, P N; Thorne, M A S; Hoffmann, M; Wolf, J B W; Hoffman, J I
2016-07-01
Custom genotyping arrays provide a flexible and accurate means of genotyping single nucleotide polymorphisms (SNPs) in a large number of individuals of essentially any organism. However, validation rates, defined as the proportion of putative SNPs that are verified to be polymorphic in a population, are often very low. A number of potential causes of assay failure have been identified, but none have been explored systematically. In particular, as SNPs are often developed from transcriptomes, parameters relating to the genomic context are rarely taken into account. Here, we assembled a draft Antarctic fur seal (Arctocephalus gazella) genome (assembly size: 2.41 Gb; scaffold/contig N50 : 3.1 Mb/27.5 kb). We then used this resource to map the probe sequences of 144 putative SNPs genotyped in 480 individuals. The number of probe-to-genome mappings and alignment length together explained almost a third of the variation in validation success, indicating that sequence uniqueness and proximity to intron-exon boundaries play an important role. The same pattern was found after mapping the probe sequences to the Walrus and Weddell seal genomes, suggesting that the genomes of species divergent by as much as 23 million years can hold information relevant to SNP validation outcomes. Additionally, reanalysis of genotyping data from seven previous studies found the same two variables to be significantly associated with SNP validation success across a variety of taxa. Finally, our study reveals considerable scope for validation rates to be improved, either by simply filtering for SNPs whose flanking sequences align uniquely and completely to a reference genome, or through predictive modelling. © 2015 John Wiley & Sons Ltd.
Song, Qijian; Jia, Gaofeng; Hyten, David L; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G; Osorno, Juan M; Schmutz, Jeremy; Jackson, Scott A; McClean, Phillip E; Cregan, Perry B
2015-08-28
A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad. Copyright © 2015 Song et al.
Quantitative trait locus mapping of deep rooting by linkage and association analysis in rice.
Lou, Qiaojun; Chen, Liang; Mei, Hanwei; Wei, Haibin; Feng, Fangjun; Wang, Pei; Xia, Hui; Li, Tiemei; Luo, Lijun
2015-08-01
Deep rooting is a very important trait for plants' drought avoidance, and it is usually represented by the ratio of deep rooting (RDR). Three sets of rice populations were used to determine the genetic base for RDR. A linkage mapping population with 180 recombinant inbred lines and an association mapping population containing 237 rice varieties were used to identify genes linked to RDR. Six quantitative trait loci (QTLs) of RDR were identified as being located on chromosomes 1, 2, 4, 7, and 10. Using 1 019 883 single-nucleotide polymorphisms (SNPs), a genome-wide association study of the RDR was performed. Forty-eight significant SNPs of the RDR were identified and formed a clear peak on the short arm of chromosome 1 in a Manhattan plot. Compared with the shallow-rooting group and the whole collection, the deep-rooting group had selective sweep regions on chromosomes 1 and 2, especially in the major QTL region on chromosome 2. Seven of the nine candidate SNPs identified by association mapping were verified in two RDR extreme groups. The findings from this study will be beneficial to rice drought-resistance research and breeding. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology.
Berdugo-Cely, Jhon; Valbuena, Raúl Iván; Sánchez-Betancourt, Erika; Barrero, Luz Stella; Yockteng, Roxana
2017-01-01
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs.
Berdugo-Cely, Jhon; Valbuena, Raúl Iván; Sánchez-Betancourt, Erika; Barrero, Luz Stella
2017-01-01
The potato (Solanum tuberosum L.) is the fourth most important crop food in the world and Colombia has one of the most important collections of potato germplasm in the world (the Colombian Central Collection-CCC). Little is known about its potential as a source of genetic diversity for molecular breeding programs. In this study, we analyzed 809 Andigenum group accessions from the CCC using 5968 SNPs to determine: 1) the genetic diversity and population structure of the Andigenum germplasm and 2) the usefulness of this collection to map qualitative traits across the potato genome. The genetic structure analysis based on principal components, cluster analyses, and Bayesian inference revealed that the CCC can be subdivided into two main groups associated with their ploidy level: Phureja (diploid) and Andigena (tetraploid). The Andigena population was more genetically diverse but less genetically substructured than the Phureja population (three vs. five subpopulations, respectively). The association mapping analysis of qualitative morphological data using 4666 SNPs showed 23 markers significantly associated with nine morphological traits. The present study showed that the CCC is a highly diverse germplasm collection genetically and phenotypically, useful to implement association mapping in order to identify genes related to traits of interest and to assist future potato genetic breeding programs. PMID:28257509
USDA-ARS?s Scientific Manuscript database
Genotyping-by-sequencing (GBS) provides an opportunity for fast and inexpensive generation of unbiased SNPs. However, due to its low coverage, GBS SNPs have a higher proportion of missing data and genotyping error associated with heterozygote undercalling than traditional genotyping platforms. These...
Systematic Search for Gene-Gene Interaction Effect on Prostate Cancer Risk
2011-07-01
PLEASE DO NOT RETURN YOUR FORM TO THE ABOVE ADDRESS. 1. REPORT DATE (DD-MM-YYYY) 2 . REPORT TYPE 3. DATES COVERED (From - To) 4. TITLE AND SUBTITLE 5a...identify SNPs in the genome that interact to have stronger effects on PCa risk in the CGEMS GWAS data, 2 ) confirm the gene-gene interaction effect on PCa...for pairs of SNPs implicated in Aim 2 among the remaining 1,893 cases and 781 controls in CAPS, and 4) fine map the genomic regions where SNPs have
Genetic factors controlling wool shedding in a composite Easycare sheep flock.
Matika, O; Bishop, S C; Pong-Wong, R; Riggio, V; Headon, D J
2013-12-01
Historically, sheep have been selectively bred for desirable traits including wool characteristics. However, recent moves towards extensive farming and reduced farm labour have seen a renewed interest in Easycare breeds. The aim of this study was to quantify the underlying genetic architecture of wool shedding in an Easycare flock. Wool shedding scores were collected from 565 pedigreed commercial Easycare sheep from 2002 to 2010. The wool scoring system was based on a 10-point (0-9) scale, with score 0 for animals retaining full fleece and 9 for those completely shedding. DNA was sampled from 200 animals of which 48 with extreme phenotypes were genotyped using a 50-k SNP chip. Three genetic analyses were performed: heritability analysis, complex segregation analysis to test for a major gene hypothesis and a genome-wide association study to map regions in the genome affecting the trait. Phenotypes were treated as a continuous or binary variable and categories. High estimates of heritability (0.80 when treated as a continuous, 0.65-0.75 as binary and 0.75 as categories) for shedding were obtained from linear mixed model analyses. Complex segregation analysis gave similar estimates (0.80 ± 0.06) to those above with additional evidence for a major gene with dominance effects. Mixed model association analyses identified four significant (P < 0.05) SNPs. Further analyses of these four SNPs in all 200 animals revealed that one of the SNPs displayed dominance effects similar to those obtained from the complex segregation analyses. In summary, we found strong genetic control for wool shedding, demonstrated the possibility of a single putative dominant gene controlling this trait and identified four SNPs that may be in partial linkage disequilibrium with gene(s) controlling shedding. © 2013 University of Edinburgh, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.
Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D
2013-02-28
Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron
2012-01-01
Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation
2013-01-01
Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M.; Rodezno, Dayana C.; Suarez, Carmen; Amores, Freddy; Feltus, Frank A.; Mockaitis, Keithanne; Cornejo, Omar E.; Motamayor, Juan C.
2017-01-01
Cacao (Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance. PMID:29259608
Sakai, Hiroaki; Kanamori, Hiroyuki; Arai-Kichise, Yuko; Shibata-Hatta, Mari; Ebana, Kaworu; Oono, Youko; Kurita, Kanako; Fujisawa, Hiroko; Katagiri, Satoshi; Mukai, Yoshiyuki; Hamada, Masao; Itoh, Takeshi; Matsumoto, Takashi; Katayose, Yuichi; Wakasa, Kyo; Yano, Masahiro; Wu, Jianzhong
2014-01-01
Having a deep genetic structure evolved during its domestication and adaptation, the Asian cultivated rice (Oryza sativa) displays considerable physiological and morphological variations. Here, we describe deep whole-genome sequencing of the aus rice cultivar Kasalath by using the advanced next-generation sequencing (NGS) technologies to gain a better understanding of the sequence and structural changes among highly differentiated cultivars. The de novo assembled Kasalath sequences represented 91.1% (330.55 Mb) of the genome and contained 35 139 expressed loci annotated by RNA-Seq analysis. We detected 2 787 250 single-nucleotide polymorphisms (SNPs) and 7393 large insertion/deletion (indel) sites (>100 bp) between Kasalath and Nipponbare, and 2 216 251 SNPs and 3780 large indels between Kasalath and 93-11. Extensive comparison of the gene contents among these cultivars revealed similar rates of gene gain and loss. We detected at least 7.39 Mb of inserted sequences and 40.75 Mb of unmapped sequences in the Kasalath genome in comparison with the Nipponbare reference genome. Mapping of the publicly available NGS short reads from 50 rice accessions proved the necessity and the value of using the Kasalath whole-genome sequence as an additional reference to capture the sequence polymorphisms that cannot be discovered by using the Nipponbare sequence alone. PMID:24578372
Medaka: a promising model animal for comparative population genomics
Matsumoto, Yoshifumi; Oota, Hiroki; Asaoka, Yoichi; Nishina, Hiroshi; Watanabe, Koji; Bujnicki, Janusz M; Oda, Shoji; Kawamura, Shoji; Mitani, Hiroshi
2009-01-01
Background Within-species genome diversity has been best studied in humans. The international HapMap project has revealed a tremendous amount of single-nucleotide polymorphisms (SNPs) among humans, many of which show signals of positive selection during human evolution. In most of the cases, however, functional differences between the alleles remain experimentally unverified due to the inherent difficulty of human genetic studies. It would therefore be highly useful to have a vertebrate model with the following characteristics: (1) high within-species genetic diversity, (2) a variety of gene-manipulation protocols already developed, and (3) a completely sequenced genome. Medaka (Oryzias latipes) and its congeneric species, tiny fresh-water teleosts distributed broadly in East and Southeast Asia, meet these criteria. Findings Using Oryzias species from 27 local populations, we conducted a simple screening of nonsynonymous SNPs for 11 genes with apparent orthology between medaka and humans. We found medaka SNPs for which the same sites in human orthologs are known to be highly differentiated among the HapMap populations. Importantly, some of these SNPs show signals of positive selection. Conclusion These results indicate that medaka is a promising model system for comparative population genomics exploring the functional and adaptive significance of allelic differentiations. PMID:19426554
Pavy, Nathalie; Parsons, Lee S; Paule, Charles; MacKay, John; Bousquet, Jean
2006-01-01
Background High-throughput genotyping technologies represent a highly efficient way to accelerate genetic mapping and enable association studies. As a first step toward this goal, we aimed to develop a resource of candidate Single Nucleotide Polymorphisms (SNP) in white spruce (Picea glauca [Moench] Voss), a softwood tree of major economic importance. Results A white spruce SNP resource encompassing 12,264 SNPs was constructed from a set of 6,459 contigs derived from Expressed Sequence Tags (EST) and by using the bayesian-based statistical software PolyBayes. Several parameters influencing the SNP prediction were analysed including the a priori expected polymorphism, the probability score (PSNP), and the contig depth and length. SNP detection in 3' and 5' reads from the same clones revealed a level of inconsistency between overlapping sequences as low as 1%. A subset of 245 predicted SNPs were verified through the independent resequencing of genomic DNA of a genotype also used to prepare cDNA libraries. The validation rate reached a maximum of 85% for SNPs predicted with either PSNP ≥ 0.95 or ≥ 0.99. A total of 9,310 SNPs were detected by using PSNP ≥ 0.95 as a criterion. The SNPs were distributed among 3,590 contigs encompassing an array of broad functional categories, with an overall frequency of 1 SNP per 700 nucleotide sites. Experimental and statistical approaches were used to evaluate the proportion of paralogous SNPs, with estimates in the range of 8 to 12%. The 3,789 coding SNPs identified through coding region annotation and ORF prediction, were distributed into 39% nonsynonymous and 61% synonymous substitutions. Overall, there were 0.9 SNP per 1,000 nonsynonymous sites and 5.2 SNPs per 1,000 synonymous sites, for a genome-wide nonsynonymous to synonymous substitution rate ratio (Ka/Ks) of 0.17. Conclusion We integrated the SNP data in the ForestTreeDB database along with functional annotations to provide a tool facilitating the choice of candidate genes for mapping purposes or association studies. PMID:16824208
Improved methods for multi-trait fine mapping of pleiotropic risk loci.
Kichaev, Gleb; Roytman, Megan; Johnson, Ruth; Eskin, Eleazar; Lindström, Sara; Kraft, Peter; Pasaniuc, Bogdan
2017-01-15
Genome-wide association studies (GWAS) have identified thousands of regions in the genome that contain genetic variants that increase risk for complex traits and diseases. However, the variants uncovered in GWAS are typically not biologically causal, but rather, correlated to the true causal variant through linkage disequilibrium (LD). To discern the true causal variant(s), a variety of statistical fine-mapping methods have been proposed to prioritize variants for functional validation. In this work we introduce a new approach, fastPAINTOR, that leverages evidence across correlated traits, as well as functional annotation data, to improve fine-mapping accuracy at pleiotropic risk loci. To improve computational efficiency, we describe an new importance sampling scheme to perform model inference. First, we demonstrate in simulations that by leveraging functional annotation data, fastPAINTOR increases fine-mapping resolution relative to existing methods. Next, we show that jointly modeling pleiotropic risk regions improves fine-mapping resolution compared to standard single trait and pleiotropic fine mapping strategies. We report a reduction in the number of SNPs required for follow-up in order to capture 90% of the causal variants from 23 SNPs per locus using a single trait to 12 SNPs when fine-mapping two traits simultaneously. Finally, we analyze summary association data from a large-scale GWAS of lipids and show that these improvements are largely sustained in real data. The fastPAINTOR framework is implemented in the PAINTOR v3.0 package which is publicly available to the research community http://bogdan.bioinformatics.ucla.edu/software/paintor CONTACT: gkichaev@ucla.eduSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Wen, Weie; He, Zhonghu; Gao, Fengmei; Liu, Jindong; Jin, Hui; Zhai, Shengnan; Qu, Yanying; Xia, Xianchun
2017-01-01
A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Compared with the previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) of the SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic position on the previously 40K SNP consensus map with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provided a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array. Moreover, it can be used as a reference for quantitative trait loci (QTL) mapping to facilitate exploitation of genes and QTL in wheat breeding. PMID:28848588
Linkage disequilibrium between STRPs and SNPs across the human genome.
Payseur, Bret A; Place, Michael; Weber, James L
2008-05-01
Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs
USDA-ARS?s Scientific Manuscript database
Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
Langlois, Christine; Abadi, Arkan; Peralta-Romero, Jesus; Alyass, Akram; Suarez, Fernando; Gomez-Zamudio, Jaime; Burguete-Garcia, Ana I.; Yazdi, Fereshteh T.; Cruz, Miguel; Meyre, David
2016-01-01
Genome wide association studies (GWAS) have identified single-nucleotide polymorphisms (SNPs) that are associated with fasting plasma glucose (FPG) in adult European populations. The contribution of these SNPs to FPG in non-Europeans and children is unclear. We studied the association of 15 GWAS SNPs and a genotype score (GS) with FPG and 7 metabolic traits in 1,421 Mexican children and adolescents from Mexico City. Genotyping of the 15 SNPs was performed using TaqMan Open Array. We used multivariate linear regression models adjusted for age, sex, body mass index standard deviation score, and recruitment center. We identified significant associations between 3 SNPs (G6PC2 (rs560887), GCKR (rs1260326), MTNR1B (rs10830963)), the GS and FPG level. The FPG risk alleles of 11 out of the 15 SNPs (73.3%) displayed significant or non-significant beta values for FPG directionally consistent with those reported in adult European GWAS. The risk allele frequencies for 11 of 15 (73.3%) SNPs differed significantly in Mexican children and adolescents compared to European adults from the 1000G Project, but no significant enrichment in FPG risk alleles was observed in the Mexican population. Our data support a partial transferability of European GWAS FPG association signals in children and adolescents from the admixed Mexican population. PMID:27782183
Guo, Yinshan; Xing, Huiyang; Zhao, Yuhui; Liu, Zhendong; Li, Kun; Guo, Xiuwu
2017-01-01
Genetic maps are important tools in plant genomics and breeding. We report a large-scale discovery of single nucleotide polymorphisms (SNPs) using the specific length amplified fragment sequencing (SLAF-seq) technique for the construction of high-density genetic maps for two elite wine grape cultivars, ‘Chardonnay’ and ‘Beibinghong’, and their 130 F1 plants. A total of 372.53 M paired-end reads were obtained after preprocessing. The average sequencing depth was 33.81 for ‘Chardonnay’ (the female parent), 48.20 for ‘Beibinghong’ (the male parent), and 12.66 for the F1 offspring. We detected 202,349 high-quality SLAFs of which 144,972 were polymorphic; 10,042 SNPs were used to construct a genetic map that spanned 1,969.95 cM, with an average genetic distance of 0.23 cM between adjacent markers. This genetic map contains the largest molecular marker number of the grape maps so far reported. We thus demonstrate that SLAF-seq is a promising strategy for the construction of high-density genetic maps; the map that we report here is a good potential resource for QTL mapping of genes linked to major economic and agronomic traits, map-based cloning, and marker-assisted selection of grape. PMID:28746364
Brookes, Keeley J; McConnell, George; Williams, Kirsty; Chaudhury, Sultan; Madhan, Gaganjit; Patel, Tulsi; Turley, Christopher; Guetta-Baranes, Tamar; Bras, Jose; Guerreiro, Rita; Hardy, John; Francis, Paul T; Morgan, Kevin
2018-06-08
The Brains for Dementia Research project is a recently established longitudinal cohort which aims to provide brain tissue for research purposes from neuropathologically defined samples. Here we present the findings from our analysis on the 19 established GWAS index SNPs for Alzheimer's disease, in order to demonstrate if the BDR sample also displays association to these variants. A highly significant association of the APOEɛ4 allele was identified (p = 3.99×10-12). Association tests for the 19 GWAS SNPs found that although no SNPs survive multiple testing, nominal significant findings were detected and concordance with the Lambert et al. GWAS meta-analysis was observed.
Sierra, Beatriz; Triska, Petr; Soares, Pedro; Garcia, Gissel; Perez, Ana B; Aguirre, Eglys; Oliveira, Marisa; Cavadas, Bruno; Regnault, Béatrice; Alvarez, Mayling; Ruiz, Didye; Samuels, David C; Sakuntabhai, Anavaj; Pereira, Luisa; Guzman, Maria G
2017-02-01
Ethnic groups can display differential genetic susceptibility to infectious diseases. The arthropod-born viral dengue disease is one such disease, with empirical and limited genetic evidence showing that African ancestry may be protective against the haemorrhagic phenotype. Global ancestry analysis based on high-throughput genotyping in admixed populations can be used to test this hypothesis, while admixture mapping can map candidate protective genes. A Cuban dengue fever cohort was genotyped using a 2.5 million SNP chip. Global ancestry was ascertained through ADMIXTURE and used in a fine-matched corrected association study, while local ancestry was inferred by the RFMix algorithm. The expression of candidate genes was evaluated by RT-PCR in a Cuban dengue patient cohort and gene set enrichment analysis was performed in a Thai dengue transcriptome. OSBPL10 and RXRA candidate genes were identified, with most significant SNPs placed in inferred weak enhancers, promoters and lncRNAs. OSBPL10 had significantly lower expression in Africans than Europeans, while for RXRA several SNPs may differentially regulate its transcription between Africans and Europeans. Their expression was confirmed to change through dengue disease progression in Cuban patients and to vary with disease severity in a Thai transcriptome dataset. These genes interact in the LXR/RXR activation pathway that integrates lipid metabolism and immune functions, being a key player in dengue virus entrance into cells, its replication therein and in cytokine production. Knockdown of OSBPL10 expression in THP-1 cells by two shRNAs followed by DENV2 infection tests led to a significant reduction in DENV replication, being a direct functional proof that the lower OSBPL10 expression profile in Africans protects this ancestry against dengue disease.
Soares, Pedro; Garcia, Gissel; Perez, Ana B.; Aguirre, Eglys; Cavadas, Bruno; Regnault, Béatrice; Alvarez, Mayling; Ruiz, Didye; Guzman, Maria G.
2017-01-01
Ethnic groups can display differential genetic susceptibility to infectious diseases. The arthropod-born viral dengue disease is one such disease, with empirical and limited genetic evidence showing that African ancestry may be protective against the haemorrhagic phenotype. Global ancestry analysis based on high-throughput genotyping in admixed populations can be used to test this hypothesis, while admixture mapping can map candidate protective genes. A Cuban dengue fever cohort was genotyped using a 2.5 million SNP chip. Global ancestry was ascertained through ADMIXTURE and used in a fine-matched corrected association study, while local ancestry was inferred by the RFMix algorithm. The expression of candidate genes was evaluated by RT-PCR in a Cuban dengue patient cohort and gene set enrichment analysis was performed in a Thai dengue transcriptome. OSBPL10 and RXRA candidate genes were identified, with most significant SNPs placed in inferred weak enhancers, promoters and lncRNAs. OSBPL10 had significantly lower expression in Africans than Europeans, while for RXRA several SNPs may differentially regulate its transcription between Africans and Europeans. Their expression was confirmed to change through dengue disease progression in Cuban patients and to vary with disease severity in a Thai transcriptome dataset. These genes interact in the LXR/RXR activation pathway that integrates lipid metabolism and immune functions, being a key player in dengue virus entrance into cells, its replication therein and in cytokine production. Knockdown of OSBPL10 expression in THP-1 cells by two shRNAs followed by DENV2 infection tests led to a significant reduction in DENV replication, being a direct functional proof that the lower OSBPL10 expression profile in Africans protects this ancestry against dengue disease. PMID:28241052
Shearman, Jeremy R.; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
2015-01-01
Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly. PMID:25831195
Shearman, Jeremy R; Sangsrakru, Duangjai; Jomchai, Nukoon; Ruang-Areerate, Panthita; Sonthirod, Chutima; Naktang, Chaiwat; Theerawattanasuk, Kanikar; Tragoonrung, Somvong; Tangphatsornruang, Sithichoke
2015-01-01
Hevea brasiliensis, or rubber tree, is an important crop species that accounts for the majority of natural latex production. The rubber tree nuclear genome consists of 18 chromosomes and is roughly 2.15 Gb. The current rubber tree reference genome assembly consists of 1,150,326 scaffolds ranging from 200 to 531,465 bp and totalling 1.1 Gb. Only 143 scaffolds, totalling 7.6 Mb, have been placed into linkage groups. We have performed RNA-seq on 6 varieties of rubber tree to identify SNPs and InDels and used this information to perform target sequence enrichment and high throughput sequencing to genotype a set of SNPs in 149 rubber tree offspring from a cross between RRIM 600 and RRII 105 rubber tree varieties. We used this information to generate a linkage map allowing for the anchoring of 24,424 contigs from 3,009 scaffolds, totalling 115 Mb or 10.4% of the published sequence, into 18 linkage groups. Each linkage group contains between 319 and 1367 SNPs, or 60 to 194 non-redundant marker positions, and ranges from 156 to 336 cM in length. This linkage map includes 20,143 of the 69,300 predicted genes from rubber tree and will be useful for mapping studies and improving the reference genome assembly.
Klaften, Matthias; Hrabé de Angelis, Martin
2005-07-01
Genome-wide mapping in the identification of novel candidate genes has always been the standard method in genetics and genomics to correlate a clinically interesting phenotypic trait with a genotype. However, the performance of a mapping experiment using classical microsatellite approaches can be very time consuming. The high-throughput analysis of single-nucleotide polymorphisms (SNPs) has the potential of being the successor of microsatellite analysis routinely used for these mapping approaches, where one of the major obstacles is the design of the appropriate SNP marker set itself. Here we report on ARTS, an advanced retrieval tool for SNPs, which allows researchers to comb freely the public mouse dbSNP database for multiple reference and test strains. Several filters can be applied in order to improve the sensitivity and the specificity of the search results. By employing the panel generator function of this program, it is possible to abbreviate the extraction of reliable sequence data for a large marker panel including several different mouse strains from days to minutes. The concept of ARTS is easily adaptable to other species for which SNP databases are available, making it a versatile tool for the use of SNPs as markers for genotyping. The web interface is accessible at http://andromeda.gsf.de/arts.
Jia, Jing; Wei, Yi-Liang; Qin, Cui-Jiao; Hu, Lan; Wan, Li-Hua; Li, Cai-Xia
2014-01-01
Inferring the ancestral origin of DNA samples can be helpful in correcting population stratification in disease association studies or guiding crime investigations. Populations throughout the world vary in appearance features and biological characteristics. Based on this idea, we performed a genome-wide scan for SNPs within genes that are related to physical and biological traits. Using the HapMap database, we screened 52 genes and their flanking regions. Thirty-five SNPs that displayed highly contrasting allele frequencies (F(st)>0.3, linkage disequilibrium r(2)<0.2, and Hardy-Weinberg equilibrium P>0.001) among Africans, Europeans, and East Asians were selected and validated. A multiplexed assay was developed to genotype these 35 SNPs in 357 individuals from 10 populations worldwide. This panel provided accurate estimates of individual ancestry proportions with balanced discriminatory power among the three continental ancestries: Africans, Europeans, and East Asians. It also proved very effective in evaluating admixed populations living in joint regions of continents (e.g., Uyghurs and Indians) and discriminating some subpopulations within each of the three continents. Structure analysis was performed to establish and evaluate the panel of ancestry-informative markers, and the components of each population were also described to indicate the structural composition. The 21 population structures in our study are consistent with geographic patterns, and individuals were properly assigned to their original ancestral populations with proportion analyses and random match probability calculations. Thus, the panel and its population information will be useful resources to minimize the effects of population stratification in association analyses and to assign the most likely origin of an unknown DNA contributor in forensic investigations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Fong, Cristian; Menzel, Stephan; Lizarralde, María Alejandra; Barreto, Guillermo
2015-01-01
Fetal hemoglobin is an important factor in modulating the severity of sickle cell anemia. Its level in peripheral blood underlies strong genetic determination. Associated loci with increased levels of fetal hemoglobin display population-specific allele frequencies. We investigated the presence and effect of known common genetic variants promoting fetal hemoglobin persistence (rs11886868, rs9399137, rs4895441, and rs7482144) in 60 Colombian patients with sickle cell anemia. Four single nucleotide polymorphisms (SNP) were genotyped by restriction fragment length polymorphisms (RFLP) and the use of the TaqMan procedure. Fetal hemoglobin (HbF) from these patients was quantified using the oxyhemoglobin alkaline denaturation technique. Genotype frequencies were compared with frequencies reported in global reference populations. We detected genetic variants in the four SNPs, reported to be associated with higher HbF levels for all four SNPs in the Colombian patients. Genetic association between SNPs and HbF levels did not reach statistical significance. The frequency of these variants reflected the specific ethnic make-up of our patient population: A high prevalence of rs7482144-'A' reflects the West-African origin of the sickle cell mutation, while high frequencies of rs4895441-'G' and rs11886868-'C' point to a significant influence of an Amerindian ethnic background in the Colombian sickle cell disease population. These results showed that in the sickle cell disease population in Colombia there is not a unique genetic background, but two (African and Amerindian). This unique genetic situation will provide opportunities for a further study of these loci, such as fine-mapping and molecular-biological investigation. Colombian patients are expected to yield a distinctive insight into the effect of modifier loci in sickle cell disease.
Muralidharan, Niveditha; Gulati, Reena; Misra, Durga Prasanna; Negi, Vir S
2018-02-01
The aim of the study was to look for any association of MTR 2756A>G and MTRR 66A>G gene polymorphisms with clinical phenotype, methotrexate (MTX) treatment response, and MTX-induced adverse events in South Indian Tamil patients with rheumatoid arthritis (RA). A total of 335 patients with RA were investigated. MTR 2756A>G gene polymorphism was analyzed by PCR-RFLP, and MTRR 66A>G SNP was analyzed by TaqMan 5' nuclease assay. The allele frequencies were compared with HapMap groups. MTR 2756G allele was found to be associated with risk of developing RA. The allele frequencies of MTR 2756A>G and MTRR 66A>G SNPs in controls differed significantly when compared with HapMap groups. Neither of the SNPs influenced the MTX treatment outcome and adverse effects. Neither of the SNPs seems to be associated with MTX treatment outcome and adverse events in South Indian Tamil patients with RA.
He, Yanxia; Yuan, Wangjun; Dong, Meifang; Han, Yuanji; Shang, Fude
2017-01-01
Osmanthus fragrans is an ornamental plant of substantial commercial value, and no genetic linkage maps of this species have previously been reported. Specific-locus amplified fragment sequencing (SLAF-seq) is a recently developed technology that allows massive single nucleotide polymorphisms (SNPs) to be identified and high-resolution genotyping. In our current research, we generated the first genetic map of O. fragrans using SLAF-seq, which is composed with 206.92 M paired-end reads and 173,537 SLAF markers. Among total 90,715 polymorphic SLAF markers, 15,317 polymorphic SLAFs could be used for genetic map construction. The integrated map contained 14,189 high quality SLAFs that were grouped in 23 genetic linkage groups, with a total length of 2962.46 cM and an average distance of 0.21 cM between two adjacent markers. In addition, 23,664 SNPs were identified from the mapped markers. As far as we know, this is the first of the genetic map of O. fragrans. Our results are further demonstrate that SLAF-seq is a very effective method for developing markers and constructing high-density linkage maps. The SNP markers and the genetic map reported in this study should be valuable resource in future research. PMID:29018460
Bajaj, Deepak; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We identified 82489 high-quality genome-wide SNPs from 93 wild and cultivated Cicer accessions through integrated reference genome- and de novo-based GBS assays. High intra- and inter-specific polymorphic potential (66–85%) and broader natural allelic diversity (6–64%) detected by genome-wide SNPs among accessions signify their efficacy for monitoring introgression and transferring target trait-regulating genomic (gene) regions/allelic variants from wild to cultivated Cicer gene pools for genetic improvement. The population-specific assignment of wild Cicer accessions pertaining to the primary gene pool are more influenced by geographical origin/phenotypic characteristics than species/gene-pools of origination. The functional significance of allelic variants (non-synonymous and regulatory SNPs) scanned from transcription factors and stress-responsive genes in differentiating wild accessions (with potential known sources of yield-contributing and stress tolerance traits) from cultivated desi and kabuli accessions, fine-mapping/map-based cloning of QTLs and determination of LD patterns across wild and cultivated gene-pools are suitably elucidated. The correlation between phenotypic (agromorphological traits) and molecular diversity-based admixed domestication patterns within six structured populations of wild and cultivated accessions via genome-wide SNPs was apparent. This suggests utility of whole genome SNPs as a potential resource for identifying naturally selected trait-regulating genomic targets/functional allelic variants adaptive to diverse agroclimatic regions for genetic enhancement of cultivated gene-pools. PMID:26208313
Ricaño-Ponce, Isis; Zhernakova, Daria V; Deelen, Patrick; Luo, Oscar; Li, Xingwang; Isaacs, Aaron; Karjalainen, Juha; Di Tommaso, Jennifer; Borek, Zuzanna Agnieszka; Zorro, Maria M; Gutierrez-Achury, Javier; Uitterlinden, Andre G; Hofman, Albert; van Meurs, Joyce; Netea, Mihai G; Jonkers, Iris H; Withoff, Sebo; van Duijn, Cornelia M; Li, Yang; Ruan, Yijun; Franke, Lude; Wijmenga, Cisca; Kumar, Vinod
2016-04-01
Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.
Vallée Marcotte, Bastien; Cormier, Hubert; Guénard, Frédéric; Rudkowska, Iwona; Lemieux, Simone; Couture, Patrick; Vohl, Marie-Claude
2016-01-01
A recent genome-wide association study (GWAS) by our group identified 13 loci associated with the plasma triglyceride (TG) response to omega-3 (n-3) fatty acid (FA) supplementation. This study aimed to test whether single-nucleotide polymorphisms (SNPs) within the IQCJ, NXPH1, PHF17 and MYB genes are associated with the plasma TG response to an n-3 FA supplementation. A total of 208 subjects followed a 6-week n-3 FA supplementation of 5 g/day of fish oil (1.9-2.2 g of eicosapentaenoic acid and 1.1 g of docosahexaenoic acid). Measurements of plasma lipids were made before and after the supplementation. Sixty-seven tagged SNPs were selected to increase the density of markers near GWAS hits. In a repeated model, independent effects of the genotype and the gene-supplementation interaction were associated with plasma TG. Genotype effects were observed with two SNPs of NXPH1, and gene-diet interactions were observed with ten SNPs of IQCJ, four SNPs of NXPH1 and three SNPs of MYB. Positive and negative responders showed different genotype frequencies with nine SNPs of IQCJ, two SNPs of NXPH1 and two SNPs of MYB. Fine mapping in GWAS-associated loci allowed the identification of SNPs partly explaining the large interindividual variability observed in plasma TG levels in response to an n-3 FA supplementation. © 2016 S. Karger AG, Basel.
Genome-wide association mapping of crown rust resistance in oat elite germplasm
USDA-ARS?s Scientific Manuscript database
Oat crown rust, caused by Puccinia coronata f. sp. avenae, is a major constraint to oat production in many parts of the world. In this first comprehensive multi-environment genome-wide association map of oat crown rust, we used 2,972 SNPs genotyped on 631 oat lines for association mapping of quantit...
In search of causal variants: refining disease association signals using cross-population contrasts.
Saccone, Nancy L; Saccone, Scott F; Goate, Alison M; Grucza, Richard A; Hinrichs, Anthony L; Rice, John P; Bierut, Laura J
2008-08-29
Genome-wide association (GWA) using large numbers of single nucleotide polymorphisms (SNPs) is now a powerful, state-of-the-art approach to mapping human disease genes. When a GWA study detects association between a SNP and the disease, this signal usually represents association with a set of several highly correlated SNPs in strong linkage disequilibrium. The challenge we address is to distinguish among these correlated loci to highlight potential functional variants and prioritize them for follow-up. We implemented a systematic method for testing association across diverse population samples having differing histories and LD patterns, using a logistic regression framework. The hypothesis is that important underlying biological mechanisms are shared across human populations, and we can filter correlated variants by testing for heterogeneity of genetic effects in different population samples. This approach formalizes the descriptive comparison of p-values that has typified similar cross-population fine-mapping studies to date. We applied this method to correlated SNPs in the cholinergic nicotinic receptor gene cluster CHRNA5-CHRNA3-CHRNB4, in a case-control study of cocaine dependence composed of 504 European-American and 583 African-American samples. Of the 10 SNPs genotyped in the r2 > or = 0.8 bin for rs16969968, three demonstrated significant cross-population heterogeneity and are filtered from priority follow-up; the remaining SNPs include rs16969968 (heterogeneity p = 0.75). Though the power to filter out rs16969968 is reduced due to the difference in allele frequency in the two groups, the results nevertheless focus attention on a smaller group of SNPs that includes the non-synonymous SNP rs16969968, which retains a similar effect size (odds ratio) across both population samples. Filtering out SNPs that demonstrate cross-population heterogeneity enriches for variants more likely to be important and causative. Our approach provides an important and effective tool to help interpret results from the many GWA studies now underway.
De La Vega, Francisco M; Dailey, David; Ziegle, Janet; Williams, Julie; Madden, Dawn; Gilbert, Dennis A
2002-06-01
Since public and private efforts announced the first draft of the human genome last year, researchers have reported great numbers of single nucleotide polymorphisms (SNPs). We believe that the availability of well-mapped, quality SNP markers constitutes the gateway to a revolution in genetics and personalized medicine that will lead to better diagnosis and treatment of common complex disorders. A new generation of tools and public SNP resources for pharmacogenomic and genetic studies--specifically for candidate-gene, candidate-region, and whole-genome association studies--will form part of the new scientific landscape. This will only be possible through the greater accessibility of SNP resources and superior high-throughput instrumentation-assay systems that enable affordable, highly productive large-scale genetic studies. We are contributing to this effort by developing a high-quality linkage disequilibrium SNP marker map and an accompanying set of ready-to-use, validated SNP assays across every gene in the human genome. This effort incorporates both the public sequence and SNP data sources, and Celera Genomics' human genome assembly and enormous resource ofphysically mapped SNPs (approximately 4,000,000 unique records). This article discusses our approach and methodology for designing the map, choosing quality SNPs, designing and validating these assays, and obtaining population frequency ofthe polymorphisms. We also discuss an advanced, high-performance SNP assay chemisty--a new generation of the TaqMan probe-based, 5' nuclease assay-and high-throughput instrumentation-software system for large-scale genotyping. We provide the new SNP map and validation information, validated SNP assays and reagents, and instrumentation systems as a novel resource for genetic discoveries.
Wang, Lu-Yong; Fasulo, D
2006-01-01
Genome-wide association study for complex diseases will generate massive amount of single nucleotide polymorphisms (SNPs) data. Univariate statistical test (i.e. Fisher exact test) was used to single out non-associated SNPs. However, the disease-susceptible SNPs may have little marginal effects in population and are unlikely to retain after the univariate tests. Also, model-based methods are impractical for large-scale dataset. Moreover, genetic heterogeneity makes the traditional methods harder to identify the genetic causes of diseases. A more recent random forest method provides a more robust method for screening the SNPs in thousands scale. However, for more large-scale data, i.e., Affymetrix Human Mapping 100K GeneChip data, a faster screening method is required to screening SNPs in whole-genome large scale association analysis with genetic heterogeneity. We propose a boosting-based method for rapid screening in large-scale analysis of complex traits in the presence of genetic heterogeneity. It provides a relatively fast and fairly good tool for screening and limiting the candidate SNPs for further more complex computational modeling task.
High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).
Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C
2016-03-01
Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.
Painter, Jodie N; O'Mara, Tracy A; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S; Kaufmann, Susanne; Hillman, Kristine M; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W; Webb, Penelope M; Scott, Rodney J; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G; Nyholt, Dale R; Henders, Anjali K; Fasching, Peter A; Hein, Alexander; Beckmann, Matthias W; Renner, Stefan P; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C; Goode, Ellen L; Teoman, Attila; Salvesen, Helga B; Trovik, Jone; Njolstad, Tormund S; Werner, Henrica M J; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L; Southey, Melissa C; Ekici, Arif B; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J; Olson, Janet E; Giles, Graham G; Bruinsma, Fiona; Cunningham, Julie M; Fridley, Brooke L; Børresen-Dale, Anne-Lise; Kristensen, Vessela N; Cox, Angela; Swerdlow, Anthony J; Orr, Nicholas; Bolla, Manjeet K; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D; Pharoah, Paul D P; Dunning, Alison M; Tomlinson, Ian; Easton, Douglas F; Edwards, Stacey L; Thompson, Deborah J; Spurdle, Amanda B
2015-03-01
Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10(-14), odds ratio = 0.86, 95% confidence interval = 0.82-0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Painter, Jodie N.; O'Mara, Tracy A.; Batra, Jyotsna; Cheng, Timothy; Lose, Felicity A.; Dennis, Joe; Michailidou, Kyriaki; Tyrer, Jonathan P.; Ahmed, Shahana; Ferguson, Kaltin; Healey, Catherine S.; Kaufmann, Susanne; Hillman, Kristine M.; Walpole, Carina; Moya, Leire; Pollock, Pamela; Jones, Angela; Howarth, Kimberley; Martin, Lynn; Gorman, Maggie; Hodgson, Shirley; De Polanco, Ma. Magdalena Echeverry; Sans, Monica; Carracedo, Angel; Castellvi-Bel, Sergi; Rojas-Martinez, Augusto; Santos, Erika; Teixeira, Manuel R.; Carvajal-Carmona, Luis; Shu, Xiao-Ou; Long, Jirong; Zheng, Wei; Xiang, Yong-Bing; Montgomery, Grant W.; Webb, Penelope M.; Scott, Rodney J.; McEvoy, Mark; Attia, John; Holliday, Elizabeth; Martin, Nicholas G.; Nyholt, Dale R.; Henders, Anjali K.; Fasching, Peter A.; Hein, Alexander; Beckmann, Matthias W.; Renner, Stefan P.; Dörk, Thilo; Hillemanns, Peter; Dürst, Matthias; Runnebaum, Ingo; Lambrechts, Diether; Coenegrachts, Lieve; Schrauwen, Stefanie; Amant, Frederic; Winterhoff, Boris; Dowdy, Sean C.; Goode, Ellen L.; Teoman, Attila; Salvesen, Helga B.; Trovik, Jone; Njolstad, Tormund S.; Werner, Henrica M.J.; Ashton, Katie; Proietto, Tony; Otton, Geoffrey; Tzortzatos, Gerasimos; Mints, Miriam; Tham, Emma; Hall, Per; Czene, Kamila; Liu, Jianjun; Li, Jingmei; Hopper, John L.; Southey, Melissa C.; Ekici, Arif B.; Ruebner, Matthias; Johnson, Nicola; Peto, Julian; Burwinkel, Barbara; Marme, Frederik; Brenner, Hermann; Dieffenbach, Aida K.; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Depreeuw, Jeroen; Moisse, Matthieu; Chang-Claude, Jenny; Rudolph, Anja; Couch, Fergus J.; Olson, Janet E.; Giles, Graham G.; Bruinsma, Fiona; Cunningham, Julie M.; Fridley, Brooke L.; Børresen-Dale, Anne-Lise; Kristensen, Vessela N.; Cox, Angela; Swerdlow, Anthony J.; Orr, Nicholas; Bolla, Manjeet K.; Wang, Qin; Weber, Rachel Palmieri; Chen, Zhihua; Shah, Mitul; French, Juliet D.; Pharoah, Paul D.P.; Dunning, Alison M.; Tomlinson, Ian; Easton, Douglas F.; Edwards, Stacey L.; Thompson, Deborah J.; Spurdle, Amanda B.
2015-01-01
Common variants in the hepatocyte nuclear factor 1 homeobox B (HNF1B) gene are associated with the risk of Type II diabetes and multiple cancers. Evidence to date indicates that cancer risk may be mediated via genetic or epigenetic effects on HNF1B gene expression. We previously found single-nucleotide polymorphisms (SNPs) at the HNF1B locus to be associated with endometrial cancer, and now report extensive fine-mapping and in silico and laboratory analyses of this locus. Analysis of 1184 genotyped and imputed SNPs in 6608 Caucasian cases and 37 925 controls, and 895 Asian cases and 1968 controls, revealed the best signal of association for SNP rs11263763 (P = 8.4 × 10−14, odds ratio = 0.86, 95% confidence interval = 0.82–0.89), located within HNF1B intron 1. Haplotype analysis and conditional analyses provide no evidence of further independent endometrial cancer risk variants at this locus. SNP rs11263763 genotype was associated with HNF1B mRNA expression but not with HNF1B methylation in endometrial tumor samples from The Cancer Genome Atlas. Genetic analyses prioritized rs11263763 and four other SNPs in high-to-moderate linkage disequilibrium as the most likely causal SNPs. Three of these SNPs map to the extended HNF1B promoter based on chromatin marks extending from the minimal promoter region. Reporter assays demonstrated that this extended region reduces activity in combination with the minimal HNF1B promoter, and that the minor alleles of rs11263763 or rs8064454 are associated with decreased HNF1B promoter activity. Our findings provide evidence for a single signal associated with endometrial cancer risk at the HNF1B locus, and that risk is likely mediated via altered HNF1B gene expression. PMID:25378557
Bassil, Nahla V; Davis, Thomas M; Zhang, Hailong; Ficklin, Stephen; Mittmann, Mike; Webster, Teresa; Mahoney, Lise; Wood, David; Alperin, Elisabeth S; Rosyara, Umesh R; Koehorst-Vanc Putten, Herma; Monfort, Amparo; Sargent, Daniel J; Amaya, Iraida; Denoyes, Beatrice; Bianco, Luca; van Dijk, Thijs; Pirani, Ali; Iezzoni, Amy; Main, Dorrie; Peace, Cameron; Yang, Yilong; Whitaker, Vance; Verma, Sujeet; Bellon, Laurent; Brew, Fiona; Herrera, Raul; van de Weg, Eric
2015-03-07
A high-throughput genotyping platform is needed to enable marker-assisted breeding in the allo-octoploid cultivated strawberry Fragaria × ananassa. Short-read sequences from one diploid and 19 octoploid accessions were aligned to the diploid Fragaria vesca 'Hawaii 4' reference genome to identify single nucleotide polymorphisms (SNPs) and indels for incorporation into a 90 K Affymetrix® Axiom® array. We report the development and preliminary evaluation of this array. About 36 million sequence variants were identified in a 19 member, octoploid germplasm panel. Strategies and filtering pipelines were developed to identify and incorporate markers of several types: di-allelic SNPs (66.6%), multi-allelic SNPs (1.8%), indels (10.1%), and ploidy-reducing "haploSNPs" (11.7%). The remaining SNPs included those discovered in the diploid progenitor F. iinumae (3.9%), and speculative "codon-based" SNPs (5.9%). In genotyping 306 octoploid accessions, SNPs were assigned to six classes with Affymetrix's "SNPolisher" R package. The highest quality classes, PolyHigh Resolution (PHR), No Minor Homozygote (NMH), and Off-Target Variant (OTV) comprised 25%, 38%, and 1% of array markers, respectively. These markers were suitable for genetic studies as demonstrated in the full-sib family 'Holiday' × 'Korona' with the generation of a genetic linkage map consisting of 6,594 PHR SNPs evenly distributed across 28 chromosomes with an average density of approximately one marker per 0.5 cM, thus exceeding our goal of one marker per cM. The Affymetrix IStraw90 Axiom array is the first high-throughput genotyping platform for cultivated strawberry and is commercially available to the worldwide scientific community. The array's high success rate is likely driven by the presence of naturally occurring variation in ploidy level within the nominally octoploid genome, and by effectiveness of the employed array design and ploidy-reducing strategies. This array enables genetic analyses including generation of high-density linkage maps, identification of quantitative trait loci for economically important traits, and genome-wide association studies, thus providing a basis for marker-assisted breeding in this high value crop.
2013-01-01
Background The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the genomic architecture of adaptive traits in black spruce, which is a widely distributed conifer of the North American boreal forest. As an alternative to the usual candidate gene approach, a candidate SNP approach was developed for association testing. Results A genetic map containing 231 gene loci was used to identify QTL that were related to budset timing and to tree height assessed over multiple years and sites. Twenty-two unique genomic regions were identified, including 20 that were related to budset timing and 6 that were related to tree height. From results of outlier detection and bulk segregant analysis for adaptive traits using DNA pool sequencing of 434 genes, 52 candidate SNPs were identified and subsequently tested in genetic association studies for budset timing and tree height assessed over multiple years and sites. A total of 34 (65%) SNPs were significantly associated with budset timing, or tree height, or both. Although the percentages of explained variance (PVE) by individual SNPs were small, several significant SNPs were shared between sites and among years. Conclusions The sharing of genomic regions and significant SNPs between budset timing and tree height indicates pleiotropic effects. Significant QTLs and SNPs differed quite greatly among years, suggesting that different sets of genes for the same characters are involved at different stages in the tree’s life history. The functional diversity of genes carrying significant SNPs and low observed PVE further indicated that a large number of polymorphisms are involved in adaptive genetic variation. Accordingly, for undomesticated species such as black spruce with natural populations of large effective size and low linkage disequilibrium, efficient marker systems that are predictive of adaptation should require the survey of large numbers of SNPs. Candidate SNP approaches like the one developed in the present study could contribute to reducing these numbers. PMID:23724860
Cloud computing-based TagSNP selection algorithm for human genome data.
Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2015-01-05
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used.
Allegre, Mathilde; Argout, Xavier; Boccara, Michel; Fouet, Olivier; Roguet, Yolande; Bérard, Aurélie; Thévenin, Jean Marc; Chauveau, Aurélie; Rivallan, Ronan; Clement, Didier; Courtois, Brigitte; Gramacho, Karina; Boland-Augé, Anne; Tahi, Mathias; Umaharan, Pathmanathan; Brunel, Dominique; Lanaud, Claire
2012-01-01
Theobroma cacao is an economically important tree of several tropical countries. Its genetic improvement is essential to provide protection against major diseases and improve chocolate quality. We discovered and mapped new expressed sequence tag-single nucleotide polymorphism (EST-SNP) and simple sequence repeat (SSR) markers and constructed a high-density genetic map. By screening 149 650 ESTs, 5246 SNPs were detected in silico, of which 1536 corresponded to genes with a putative function, while 851 had a clear polymorphic pattern across a collection of genetic resources. In addition, 409 new SSR markers were detected on the Criollo genome. Lastly, 681 new EST-SNPs and 163 new SSRs were added to the pre-existing 418 co-dominant markers to construct a large consensus genetic map. This high-density map and the set of new genetic markers identified in this study are a milestone in cocoa genomics and for marker-assisted breeding. The data are available at http://tropgenedb.cirad.fr. PMID:22210604
HapZipper: sharing HapMap populations just got easier.
Chanda, Pritam; Elhaik, Eran; Bader, Joel S
2012-11-01
The rapidly growing amount of genomic sequence data being generated and made publicly available necessitate the development of new data storage and archiving methods. The vast amount of data being shared and manipulated also create new challenges for network resources. Thus, developing advanced data compression techniques is becoming an integral part of data production and analysis. The HapMap project is one of the largest public resources of human single-nucleotide polymorphisms (SNPs), characterizing over 3 million SNPs genotyped in over 1000 individuals. The standard format and biological properties of HapMap data suggest that a dedicated genetic compression method can outperform generic compression tools. We propose a compression methodology for genetic data by introducing HapZipper, a lossless compression tool tailored to compress HapMap data beyond benchmarks defined by generic tools such as gzip, bzip2 and lzma. We demonstrate the usefulness of HapZipper by compressing HapMap 3 populations to <5% of their original sizes. HapZipper is freely downloadable from https://bitbucket.org/pchanda/hapzipper/downloads/HapZipper.tar.bz2.
Cloud Computing-Based TagSNP Selection Algorithm for Human Genome Data
Hung, Che-Lun; Chen, Wen-Pei; Hua, Guan-Jie; Zheng, Huiru; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2015-01-01
Single nucleotide polymorphisms (SNPs) play a fundamental role in human genetic variation and are used in medical diagnostics, phylogeny construction, and drug design. They provide the highest-resolution genetic fingerprint for identifying disease associations and human features. Haplotypes are regions of linked genetic variants that are closely spaced on the genome and tend to be inherited together. Genetics research has revealed SNPs within certain haplotype blocks that introduce few distinct common haplotypes into most of the population. Haplotype block structures are used in association-based methods to map disease genes. In this paper, we propose an efficient algorithm for identifying haplotype blocks in the genome. In chromosomal haplotype data retrieved from the HapMap project website, the proposed algorithm identified longer haplotype blocks than an existing algorithm. To enhance its performance, we extended the proposed algorithm into a parallel algorithm that copies data in parallel via the Hadoop MapReduce framework. The proposed MapReduce-paralleled combinatorial algorithm performed well on real-world data obtained from the HapMap dataset; the improvement in computational efficiency was proportional to the number of processors used. PMID:25569088
2011-01-01
Background A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS). Results Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm. Conclusions Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection). PMID:21797998
Diaz, Aurora; Fergany, Mohamed; Formisano, Gelsomina; Ziarsolo, Peio; Blanca, José; Fei, Zhanjun; Staub, Jack E; Zalapa, Juan E; Cuevas, Hugo E; Dace, Gayle; Oliver, Marc; Boissot, Nathalie; Dogimont, Catherine; Pitrat, Michel; Hofstede, René; van Koert, Paul; Harel-Beja, Rotem; Tzuri, Galil; Portnoy, Vitaly; Cohen, Shahar; Schaffer, Arthur; Katzir, Nurit; Xu, Yong; Zhang, Haiying; Fukino, Nobuko; Matsumoto, Satoru; Garcia-Mas, Jordi; Monforte, Antonio J
2011-07-28
A number of molecular marker linkage maps have been developed for melon (Cucumis melo L.) over the last two decades. However, these maps were constructed using different marker sets, thus, making comparative analysis among maps difficult. In order to solve this problem, a consensus genetic map in melon was constructed using primarily highly transferable anchor markers that have broad potential use for mapping, synteny, and comparative quantitative trait loci (QTL) analysis, increasing breeding effectiveness and efficiency via marker-assisted selection (MAS). Under the framework of the International Cucurbit Genomics Initiative (ICuGI, http://www.icugi.org), an integrated genetic map has been constructed by merging data from eight independent mapping experiments using a genetically diverse array of parental lines. The consensus map spans 1150 cM across the 12 melon linkage groups and is composed of 1592 markers (640 SSRs, 330 SNPs, 252 AFLPs, 239 RFLPs, 89 RAPDs, 15 IMAs, 16 indels and 11 morphological traits) with a mean marker density of 0.72 cM/marker. One hundred and ninety-six of these markers (157 SSRs, 32 SNPs, 6 indels and 1 RAPD) were newly developed, mapped or provided by industry representatives as released markers, including 27 SNPs and 5 indels from genes involved in the organic acid metabolism and transport, and 58 EST-SSRs. Additionally, 85 of 822 SSR markers contributed by Syngenta Seeds were included in the integrated map. In addition, 370 QTL controlling 62 traits from 18 previously reported mapping experiments using genetically diverse parental genotypes were also integrated into the consensus map. Some QTL associated with economically important traits detected in separate studies mapped to similar genomic positions. For example, independently identified QTL controlling fruit shape were mapped on similar genomic positions, suggesting that such QTL are possibly responsible for the phenotypic variability observed for this trait in a broad array of melon germplasm. Even though relatively unsaturated genetic maps in a diverse set of melon market types have been published, the integrated saturated map presented herein should be considered the initial reference map for melon. Most of the mapped markers contained in the reference map are polymorphic in diverse collection of germplasm, and thus are potentially transferrable to a broad array of genetic experimentation (e.g., integration of physical and genetic maps, colinearity analysis, map-based gene cloning, epistasis dissection, and marker-assisted selection).
GrigoraSNPs: Optimized Analysis of SNPs for DNA Forensics.
Ricke, Darrell O; Shcherbina, Anna; Michaleas, Adam; Fremont-Smith, Philip
2018-04-16
High-throughput sequencing (HTS) of single nucleotide polymorphisms (SNPs) enables additional DNA forensic capabilities not attainable using traditional STR panels. However, the inclusion of sets of loci selected for mixture analysis, extended kinship, phenotype, biogeographic ancestry prediction, etc., can result in large panel sizes that are difficult to analyze in a rapid fashion. GrigoraSNP was developed to address the allele-calling bottleneck that was encountered when analyzing SNP panels with more than 5000 loci using HTS. GrigoraSNPs uses a MapReduce parallel data processing on multiple computational threads plus a novel locus-identification hashing strategy leveraging target sequence tags. This tool optimizes the SNP calling module of the DNA analysis pipeline with runtimes that scale linearly with the number of HTS reads. Results are compared with SNP analysis pipelines implemented with SAMtools and GATK. GrigoraSNPs removes a computational bottleneck for processing forensic samples with large HTS SNP panels. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.
Yi, Liuxi; Gao, Fengyun; Siqin, Bateer; Zhou, Yu; Li, Qiang; Zhao, Xiaoqing; Jia, Xiaoyun; Zhang, Hui
2017-01-01
Flax is an important crop for oil and fiber, however, no high-density genetic maps have been reported for this species. Specific length amplified fragment sequencing (SLAF-seq) is a high-resolution strategy for large scale de novo discovery and genotyping of single nucleotide polymorphisms. In this study, SLAF-seq was employed to develop SNP markers in an F2 population to construct a high-density genetic map for flax. In total, 196.29 million paired-end reads were obtained. The average sequencing depth was 25.08 in male parent, 32.17 in the female parent, and 9.64 in each F2 progeny. In total, 389,288 polymorphic SLAFs were detected, from which 260,380 polymorphic SNPs were developed. After filtering, 4,638 SNPs were found suitable for genetic map construction. The final genetic map included 4,145 SNP markers on 15 linkage groups and was 2,632.94 cM in length, with an average distance of 0.64 cM between adjacent markers. To our knowledge, this map is the densest SNP-based genetic map for flax. The SNP markers and genetic map reported in here will serve as a foundation for the fine mapping of quantitative trait loci (QTLs), map-based gene cloning and marker assisted selection (MAS) for flax.
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs
Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong
2009-01-01
Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
Everett, Meredith V; Seeb, James E
2014-01-01
Understanding how organisms interact with their environments is increasingly important for conservation efforts in many species, especially in light of highly anticipated climate changes. One method for understanding this relationship is to use genetic maps and QTL mapping to detect genomic regions linked to phenotypic traits of importance for adaptation. We used high-throughput genotyping by sequencing (GBS) to both detect and map thousands of SNPs in haploid Chinook salmon (Oncorhynchus tshawytscha). We next applied this map to detect QTL related to temperature tolerance and body size in families of diploid Chinook salmon. Using these techniques, we mapped 3534 SNPs in 34 linkage groups which is consistent with the haploid chromosome number for Chinook salmon. We successfully detected three QTL for temperature tolerance and one QTL for body size at the experiment-wide level, as well as additional QTL significant at the chromosome-wide level. The use of haploids coupled with GBS provides a robust pathway to rapidly develop genomic resources in nonmodel organisms; these QTL represent preliminary progress toward linking traits of conservation interest to regions in the Chinook salmon genome. PMID:24822082
Epicuticular waxes and thrips resistance in onion
USDA-ARS?s Scientific Manuscript database
Next-generation sequencing of normalized cDNAs from two inbred lines of onion revealed over 3000 well supported single nucleotide polymorphisms (SNPs), of which over 800 have been mapped. This SNP-based map was used to identify quantitative trait loci (QTL) controlling the amounts and types of epicu...
High-Throughput Gene Mapping in Caenorhabditis elegans
Swan, Kathryn A.; Curtis, Damian E.; McKusick, Kathleen B.; Voinov, Alexander V.; Mapa, Felipa A.; Cancilla, Michael R.
2002-01-01
Positional cloning of mutations in model genetic systems is a powerful method for the identification of targets of medical and agricultural importance. To facilitate the high-throughput mapping of mutations in Caenorhabditis elegans, we have identified a further 9602 putative new single nucleotide polymorphisms (SNPs) between two C. elegans strains, Bristol N2 and the Hawaiian mapping strain CB4856, by sequencing inserts from a CB4856 genomic DNA library and using an informatics pipeline to compare sequences with the canonical N2 genomic sequence. When combined with data from other laboratories, our marker set of 17,189 SNPs provides even coverage of the complete worm genome. To date, we have confirmed >1099 evenly spaced SNPs (one every 91 ± 56 kb) across the six chromosomes and validated the utility of our SNP marker set and new fluorescence polarization-based genotyping methods for systematic and high-throughput identification of genes in C. elegans by cloning several proprietary genes. We illustrate our approach by recombination mapping and confirmation of the mutation in the cloned gene, dpy-18. [The sequence data described in this paper have been submitted to the NCBI dbSNP data library under accession nos. 4388625–4389689 and GenBank dbSTS under accession nos. 973810–974874. The following individuals and institutions kindly provided reagents, samples, or unpublished information as indicated in the paper: The C. elegans Sequencing Consortium and The Caenorhabditis Genetics Center.] PMID:12097347
Hsueh, Wen-Chi; Nair, Anup K; Kobes, Sayuko; Chen, Peng; Göring, Harald H H; Pollin, Toni I; Malhotra, Alka; Knowler, William C; Baier, Leslie J; Hanson, Robert L
2017-12-01
Identity-by-descent mapping using empirical estimates of identity-by-descent allele sharing may be useful for studies of complex traits in founder populations, where hidden relationships may augment the inherent genetic information that can be used for localization. Through identity-by-descent mapping, using ≈400 000 single-nucleotide polymorphisms (SNPs), of serum lipid profiles, we identified a major linkage signal for triglycerides in 1007 Pima Indians (LOD=9.23; P =3.5×10 -11 on chromosome 11q). In subsequent fine-mapping and replication association studies in ≈7500 Amerindians, we determined that this signal reflects effects of a loss-of-function Ala43Thr substitution in APOC3 (rs147210663) and 3 established functional SNPs in APOA5 . The association with rs147210663 was particularly strong; each copy of the Thr allele conferred 42% lower triglycerides (β=-0.92±0.059 SD unit; P =9.6×10 -55 in 4668 Pimas and 2793 Southwest Amerindians combined). The Thr allele is extremely rare in most global populations but has a frequency of 2.5% in Pimas. We further demonstrated that 3 APOA5 SNPs with established functional impact could explain the association with the most well-replicated SNP (rs964184) for triglycerides identified by genome-wide association studies. Collectively, these 4 SNPs account for 6.9% of variation in triglycerides in Pimas (and 4.1% in Southwest Amerindians), and their inclusion in the original linkage model reduced the linkage signal to virtually null. APOC3/APOA5 constitutes a major locus for serum triglycerides in Amerindians, especially the Pimas, and these results provide an empirical example for the concept that population-based linkage analysis is a useful strategy to identify complex trait variants. © 2017 American Heart Association, Inc.
Feature co-localization landscape of the human genome
Ng, Siu-Kin; Hu, Taobo; Long, Xi; Chan, Cheuk-Hin; Tsang, Shui-Ying; Xue, Hong
2016-01-01
Although feature co-localizations could serve as useful guide-posts to genome architecture, a comprehensive and quantitative feature co-localization map of the human genome has been lacking. Herein we show that, in contrast to the conventional bipartite division of genomic sequences into genic and inter-genic regions, pairwise co-localizations of forty-two genomic features in the twenty-two autosomes based on 50-kb to 2,000-kb sequence windows indicate a tripartite zonal architecture comprising Genic zones enriched with gene-related features and Alu-elements; Proximal zones enriched with MIR- and L2-elements, transcription-factor-binding-sites (TFBSs), and conserved-indels (CIDs); and Distal zones enriched with L1-elements. Co-localizations between single-nucleotide-polymorphisms (SNPs) and copy-number-variations (CNVs) reveal a fraction of sequence windows displaying steeply enhanced levels of SNPs, CNVs and recombination rates that point to active adaptive evolution in such pathways as immune response, sensory perceptions, and cognition. The strongest positive co-localization observed between TFBSs and CIDs suggests a regulatory role of CIDs in cooperation with TFBSs. The positive co-localizations of cancer somatic CNVs (CNVT) with all Proximal zone and most Genic zone features, in contrast to the distinctly more restricted co-localizations exhibited by germline CNVs (CNVG), reveal disparate distributions of CNVTs and CNVGs indicative of dissimilarity in their underlying mechanisms. PMID:26854351
efficient association study design via power-optimized tag SNP selection
HAN, BUHM; KANG, HYUN MIN; SEO, MYEONG SEONG; ZAITLEN, NOAH; ESKIN, ELEAZAR
2008-01-01
Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SNPs) by selecting a subset of SNPs to be genotyped (tag SNPs). While many current association studies are performed using commercially available high-throughput genotyping products that define a set of tag SNPs, choosing tag SNPs remains an important problem for both custom follow-up studies as well as designing the high-throughput genotyping products themselves. The most widely used tag SNP selection method optimizes over the correlation between SNPs (r2). However, tag SNPs chosen based on an r2 criterion do not necessarily maximize the statistical power of an association study. We propose a study design framework that chooses SNPs to maximize power and efficiently measures the power through empirical simulation. Empirical results based on the HapMap data show that our method gains considerable power over a widely used r2-based method, or equivalently reduces the number of tag SNPs required to attain the desired power of a study. Our power-optimized 100k whole genome tag set provides equivalent power to the Affymetrix 500k chip for the CEU population. For the design of custom follow-up studies, our method provides up to twice the power increase using the same number of tag SNPs as r2-based methods. Our method is publicly available via web server at http://design.cs.ucla.edu. PMID:18702637
Troggio, Michela; Šurbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James
2013-01-01
High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. PMID:23826289
Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Alonso, M. Rosario; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Benitez, Javier; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M.; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F.; Fasching, Peter A.; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G.; Goldberg, Mark S.; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A.; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L.; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L.; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C.; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H.; Tessier, Daniel C.; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M.; Vincent, Daniel; Winqvist, Robert; Wu, Anna H.; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D. P.; Hall, Per; Edwards, Stacey L.; Simard, Jacques; French, Juliet D.; Chenevix-Trench, Georgia; Dunning, Alison M.
2016-01-01
Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90–0.94; P = 8.96 × 10−15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10−09, r2 = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10−11, r2 = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus. PMID:27600471
Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Alonso, M Rosario; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Benitez, Javier; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F; Fasching, Peter A; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G; Goldberg, Mark S; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H; Tessier, Daniel C; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M; Vincent, Daniel; Winqvist, Robert; Wu, Anna H; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D P; Hall, Per; Edwards, Stacey L; Simard, Jacques; French, Juliet D; Chenevix-Trench, Georgia; Dunning, Alison M
2016-09-07
Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10(-15))) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10(-09), r(2) = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10(-11), r(2) = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.
Development and Implementation of High-Throughput SNP Genotyping in Barley
USDA-ARS?s Scientific Manuscript database
Approximately 22,000 SNPs were identified from barley ESTs and sequenced amplicons; 4,596 of them were tested for performance in three pilot phase Illumina GoldenGate assays. Pilot phase data from three barley doubled haploid mapping populations supported the production of an initial consensus map, ...
N, Nagasundaram; Zhu, Hailong; Liu, Jiming; V, Karthick; C, George Priya Doss; Chakraborty, Chiranjib; Chen, Luonan
2015-01-01
The cyclin-dependent kinase 4 (CDK4)-cyclin D1 complex plays a crucial role in the transition from the G1 phase to S phase of the cell cycle. Among the CDKs, CDK4 is one of the genes most frequently affected by somatic genetic variations that are associated with various forms of cancer. Thus, because the abnormal function of the CDK4-cyclin D1 protein complex might play a vital role in causing cancer, CDK4 can be considered a genetically validated therapeutic target. In this study, we used a systematic, integrated computational approach to identify deleterious nsSNPs and predict their effects on protein-protein (CDK4-cyclin D1) and protein-ligand (CDK4-flavopiridol) interactions. This analysis resulted in the identification of possible inhibitors of mutant CDK4 proteins that bind the conformations induced by deleterious nsSNPs. Using computational prediction methods, we identified five nsSNPs as highly deleterious: R24C, Y180H, A205T, R210P, and R246C. From molecular docking and molecular dynamic studies, we observed that these deleterious nsSNPs affected CDK4-cyclin D1 and CDK4-flavopiridol interactions. Furthermore, in a virtual screening approach, the drug 5_7_DIHYDROXY_ 2_ (3_4_5_TRI HYDROXYPHENYL) _4H_CHROMEN_ 4_ONE displayed good binding affinity for proteins with the mutations R24C or R246C, the drug diosmin displayed good binding affinity for the protein with the mutation Y180H, and the drug rutin displayed good binding affinity for proteins with the mutations A205T and R210P. Overall, this computational investigation of the CDK4 gene highlights the link between genetic variation and biological phenomena in human cancer and aids in the discovery of molecularly targeted therapies for personalized treatment. PMID:26252490
Mutation Scanning in Wheat by Exon Capture and Next-Generation Sequencing.
King, Robert; Bird, Nicholas; Ramirez-Gonzalez, Ricardo; Coghill, Jane A; Patil, Archana; Hassani-Pak, Keywan; Uauy, Cristobal; Phillips, Andrew L
2015-01-01
Targeted Induced Local Lesions in Genomes (TILLING) is a reverse genetics approach to identify novel sequence variation in genomes, with the aims of investigating gene function and/or developing useful alleles for breeding. Despite recent advances in wheat genomics, most current TILLING methods are low to medium in throughput, being based on PCR amplification of the target genes. We performed a pilot-scale evaluation of TILLING in wheat by next-generation sequencing through exon capture. An oligonucleotide-based enrichment array covering ~2 Mbp of wheat coding sequence was used to carry out exon capture and sequencing on three mutagenised lines of wheat containing previously-identified mutations in the TaGA20ox1 homoeologous genes. After testing different mapping algorithms and settings, candidate SNPs were identified by mapping to the IWGSC wheat Chromosome Survey Sequences. Where sequence data for all three homoeologues were found in the reference, mutant calls were unambiguous; however, where the reference lacked one or two of the homoeologues, captured reads from these genes were mis-mapped to other homoeologues, resulting either in dilution of the variant allele frequency or assignment of mutations to the wrong homoeologue. Competitive PCR assays were used to validate the putative SNPs and estimate cut-off levels for SNP filtering. At least 464 high-confidence SNPs were detected across the three mutagenized lines, including the three known alleles in TaGA20ox1, indicating a mutation rate of ~35 SNPs per Mb, similar to that estimated by PCR-based TILLING. This demonstrates the feasibility of using exon capture for genome re-sequencing as a method of mutation detection in polyploid wheat, but accurate mutation calling will require an improved genomic reference with more comprehensive coverage of homoeologues.
USDA-ARS?s Scientific Manuscript database
In this study, we generated a linkage map containing 1,151,856 high quality SNPs between Mo17 and B73, which were verified in the maize intermated B73'×'Mo17 (IBM) Syn10 population. This resource is an excellent complement to existing maize genetic maps available in an online database (iPlant, http:...
Holliday, Jason A; Wang, Tongli; Aitken, Sally
2012-09-01
Climate is the primary driver of the distribution of tree species worldwide, and the potential for adaptive evolution will be an important factor determining the response of forests to anthropogenic climate change. Although association mapping has the potential to improve our understanding of the genomic underpinnings of climatically relevant traits, the utility of adaptive polymorphisms uncovered by such studies would be greatly enhanced by the development of integrated models that account for the phenotypic effects of multiple single-nucleotide polymorphisms (SNPs) and their interactions simultaneously. We previously reported the results of association mapping in the widespread conifer Sitka spruce (Picea sitchensis). In the current study we used the recursive partitioning algorithm 'Random Forest' to identify optimized combinations of SNPs to predict adaptive phenotypes. After adjusting for population structure, we were able to explain 37% and 30% of the phenotypic variation, respectively, in two locally adaptive traits--autumn budset timing and cold hardiness. For each trait, the leading five SNPs captured much of the phenotypic variation. To determine the role of epistasis in shaping these phenotypes, we also used a novel approach to quantify the strength and direction of pairwise interactions between SNPs and found such interactions to be common. Our results demonstrate the power of Random Forest to identify subsets of markers that are most important to climatic adaptation, and suggest that interactions among these loci may be widespread.
Kang, Eun Yong; Martin, Lisa J.; Mangul, Serghei; Isvilanonda, Warin; Zou, Jennifer; Ben-David, Eyal; Han, Buhm; Lusis, Aldons J.; Shifman, Sagiv; Eskin, Eleazar
2016-01-01
The study of the genetics of gene expression is of considerable importance to understanding the nature of common, complex diseases. The most widely applied approach to identifying relationships between genetic variation and gene expression is the expression quantitative trait loci (eQTL) approach. Here, we increased the computational power of eQTL with an alternative and complementary approach based on analyzing allele specific expression (ASE). We designed a novel analytical method to identify cis-acting regulatory variants based on genome sequencing and measurements of ASE from RNA-sequencing (RNA-seq) data. We evaluated the power and resolution of our method using simulated data. We then applied the method to map regulatory variants affecting gene expression in lymphoblastoid cell lines (LCLs) from 77 unrelated northern and western European individuals (CEU), which were part of the HapMap project. A total of 2309 SNPs were identified as being associated with ASE patterns. The SNPs associated with ASE were enriched within promoter regions and were significantly more likely to signal strong evidence for a regulatory role. Finally, among the candidate regulatory SNPs, we identified 108 SNPs that were previously associated with human immune diseases. With further improvements in quantifying ASE from RNA-seq, the application of our method to other datasets is expected to accelerate our understanding of the biological basis of common diseases. PMID:27765809
Saxena, Rachit K.; Varma Penmetsa, R.; Upadhyaya, Hari D.; Kumar, Ashish; Carrasquilla-Garcia, Noelia; Schlueter, Jessica A.; Farmer, Andrew; Whaley, Adam M.; Sarma, Birinchi K.; May, Gregory D.; Cook, Douglas R.; Varshney, Rajeev K.
2012-01-01
Single-nucleotide polymorphisms (SNPs, >2000) were discovered by using RNA-seq and allele-specific sequencing approaches in pigeonpea (Cajanus cajan). For making the SNP genotyping cost-effective, successful competitive allele-specific polymerase chain reaction (KASPar) assays were developed for 1616 SNPs and referred to as PKAMs (pigeonpea KASPar assay markers). Screening of PKAMs on 24 genotypes [23 from cultivated species and 1 wild species (Cajanus scarabaeoides)] defined a set of 1154 polymorphic markers (77.4%) with a polymorphism information content (PIC) value from 0.04 to 0.38. One thousand and ninety-four PKAMs showed polymorphisms between parental lines of the reference mapping population (C. cajan ICP 28 × C. scarabaeoides ICPW 94). By using high-quality marker genotyping data on 167 F2 lines from the population, a comprehensive genetic map comprising 875 PKAMs with an average inter-marker distance of 1.11 cM was developed. Previously mapped 35 simple sequence repeat markers were integrated into the PKAM map and an integrated genetic map of 996.21 cM was constructed. Mapped PKAMs showed a higher degree of synteny with the genome of Glycine max followed by Medicago truncatula and Lotus japonicus and least with Vigna unguiculata. These PKAMs will be useful for genetics research and breeding applications in pigeonpea and for utilizing genome information from other legume species. PMID:23103470
Johnson, Eric O; Hancock, Dana B; Levy, Joshua L; Gaddis, Nathan C; Saccone, Nancy L; Bierut, Laura J; Page, Grier P
2013-05-01
A great promise of publicly sharing genome-wide association data is the potential to create composite sets of controls. However, studies often use different genotyping arrays, and imputation to a common set of SNPs has shown substantial bias: a problem which has no broadly applicable solution. Based on the idea that using differing genotyped SNP sets as inputs creates differential imputation errors and thus bias in the composite set of controls, we examined the degree to which each of the following occurs: (1) imputation based on the union of genotyped SNPs (i.e., SNPs available on one or more arrays) results in bias, as evidenced by spurious associations (type 1 error) between imputed genotypes and arbitrarily assigned case/control status; (2) imputation based on the intersection of genotyped SNPs (i.e., SNPs available on all arrays) does not evidence such bias; and (3) imputation quality varies by the size of the intersection of genotyped SNP sets. Imputations were conducted in European Americans and African Americans with reference to HapMap phase II and III data. Imputation based on the union of genotyped SNPs across the Illumina 1M and 550v3 arrays showed spurious associations for 0.2 % of SNPs: ~2,000 false positives per million SNPs imputed. Biases remained problematic for very similar arrays (550v1 vs. 550v3) and were substantial for dissimilar arrays (Illumina 1M vs. Affymetrix 6.0). In all instances, imputing based on the intersection of genotyped SNPs (as few as 30 % of the total SNPs genotyped) eliminated such bias while still achieving good imputation quality.
Abnet, Christian C.; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D.; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M.; Liao, Linda M.; Lee, Maxwell P.; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C.; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B.; Giffen, Carol A.; Burdett, Laurie; Fraumeni, Joseph F.; Tucker, Margaret A.; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M.; Yuan, Zhi-Qing; Chanock, Stephen J.; Zhang, Xue-Jun; Taylor, Philip R.; Wang, Li-Dong
2012-01-01
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10−8, and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19–1.40) and P= 7.63 × 10−10. An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants. PMID:22323360
Abnet, Christian C; Wang, Zhaoming; Song, Xin; Hu, Nan; Zhou, Fu-You; Freedman, Neal D; Li, Xue-Min; Yu, Kai; Shu, Xiao-Ou; Yuan, Jian-Min; Zheng, Wei; Dawsey, Sanford M; Liao, Linda M; Lee, Maxwell P; Ding, Ti; Qiao, You-Lin; Gao, Yu-Tang; Koh, Woon-Puay; Xiang, Yong-Bing; Tang, Ze-Zhong; Fan, Jin-Hu; Chung, Charles C; Wang, Chaoyu; Wheeler, William; Yeager, Meredith; Yuenger, Jeff; Hutchinson, Amy; Jacobs, Kevin B; Giffen, Carol A; Burdett, Laurie; Fraumeni, Joseph F; Tucker, Margaret A; Chow, Wong-Ho; Zhao, Xue-Ke; Li, Jiang-Man; Li, Ai-Li; Sun, Liang-Dan; Wei, Wu; Li, Ji-Lin; Zhang, Peng; Li, Hong-Lei; Cui, Wen-Yan; Wang, Wei-Peng; Liu, Zhi-Cai; Yang, Xia; Fu, Wen-Jing; Cui, Ji-Li; Lin, Hong-Li; Zhu, Wen-Liang; Liu, Min; Chen, Xi; Chen, Jie; Guo, Li; Han, Jing-Jing; Zhou, Sheng-Li; Huang, Jia; Wu, Yue; Yuan, Chao; Huang, Jing; Ji, Ai-Fang; Kul, Jian-Wei; Fan, Zhong-Min; Wang, Jian-Po; Zhang, Dong-Yun; Zhang, Lian-Qun; Zhang, Wei; Chen, Yuan-Fang; Ren, Jing-Li; Li, Xiu-Min; Dong, Jin-Cheng; Xing, Guo-Lan; Guo, Zhi-Gang; Yang, Jian-Xue; Mao, Yi-Ming; Yuan, Yuan; Guo, Er-Tao; Zhang, Wei; Hou, Zhi-Chao; Liu, Jing; Li, Yan; Tang, Sa; Chang, Jia; Peng, Xiu-Qin; Han, Min; Yin, Wan-Li; Liu, Ya-Li; Hu, Yan-Long; Liu, Yu; Yang, Liu-Qin; Zhu, Fu-Guo; Yang, Xiu-Feng; Feng, Xiao-Shan; Wang, Zhou; Li, Yin; Gao, She-Gan; Liu, Hai-Lin; Yuan, Ling; Jin, Yan; Zhang, Yan-Rui; Sheyhidin, Ilyar; Li, Feng; Chen, Bao-Ping; Ren, Shu-Wei; Liu, Bin; Li, Dan; Zhang, Gao-Fu; Yue, Wen-Bin; Feng, Chang-Wei; Qige, Qirenwang; Zhao, Jian-Ting; Yang, Wen-Jun; Lei, Guang-Yan; Chen, Long-Qi; Li, En-Min; Xu, Li-Yan; Wu, Zhi-Yong; Bao, Zhi-Qin; Chen, Ji-Li; Li, Xian-Chang; Zhuang, Xiang; Zhou, Ying-Fa; Zuo, Xian-Bo; Dong, Zi-Ming; Wang, Lu-Wen; Fan, Xue-Pin; Wang, Jin; Zhou, Qi; Ma, Guo-Shun; Zhang, Qin-Xian; Liu, Hai; Jian, Xin-Ying; Lian, Sin-Yong; Wang, Jin-Sheng; Chang, Fu-Bao; Lu, Chang-Dong; Miao, Jian-Jun; Chen, Zhi-Guo; Wang, Ran; Guo, Ming; Fan, Zeng-Lin; Tao, Ping; Liu, Tai-Jing; Wei, Jin-Chang; Kong, Qing-Peng; Fan, Lei; Wang, Xian-Zeng; Gao, Fu-Sheng; Wang, Tian-Yun; Xie, Dong; Wang, Li; Chen, Shu-Qing; Yang, Wan-Cai; Hong, Jun-Yan; Wang, Liang; Qiu, Song-Liang; Goldstein, Alisa M; Yuan, Zhi-Qing; Chanock, Stephen J; Zhang, Xue-Jun; Taylor, Philip R; Wang, Li-Dong
2012-05-01
Genome-wide association studies have identified susceptibility loci for esophageal squamous cell carcinoma (ESCC). We conducted a meta-analysis of all single-nucleotide polymorphisms (SNPs) that showed nominally significant P-values in two previously published genome-wide scans that included a total of 2961 ESCC cases and 3400 controls. The meta-analysis revealed five SNPs at 2q33 with P< 5 × 10(-8), and the strongest signal was rs13016963, with a combined odds ratio (95% confidence interval) of 1.29 (1.19-1.40) and P= 7.63 × 10(-10). An imputation analysis of 4304 SNPs at 2q33 suggested a single association signal, and the strongest imputed SNP associations were similar to those from the genotyped SNPs. We conducted an ancestral recombination graph analysis with 53 SNPs to identify one or more haplotypes that harbor the variants directly responsible for the detected association signal. This showed that the five SNPs exist in a single haplotype along with 45 imputed SNPs in strong linkage disequilibrium, and the strongest candidate was rs10201587, one of the genotyped SNPs. Our meta-analysis found genome-wide significant SNPs at 2q33 that map to the CASP8/ALS2CR12/TRAK2 gene region. Variants in CASP8 have been extensively studied across a spectrum of cancers with mixed results. The locus we identified appears to be distinct from the widely studied rs3834129 and rs1045485 SNPs in CASP8. Future studies of esophageal and other cancers should focus on comprehensive sequencing of this 2q33 locus and functional analysis of rs13016963 and rs10201587 and other strongly correlated variants.
Genome-wide association study of sporadic brain arteriovenous malformations.
Weinsheimer, Shantel; Bendjilali, Nasrine; Nelson, Jeffrey; Guo, Diana E; Zaroff, Jonathan G; Sidney, Stephen; McCulloch, Charles E; Al-Shahi Salman, Rustam; Berg, Jonathan N; Koeleman, Bobby P C; Simon, Matthias; Bostroem, Azize; Fontanella, Marco; Sturiale, Carmelo L; Pola, Roberto; Puca, Alfredo; Lawton, Michael T; Young, William L; Pawlikowska, Ludmila; Klijn, Catharina J M; Kim, Helen
2016-09-01
The pathogenesis of sporadic brain arteriovenous malformations (BAVMs) remains unknown, but studies suggest a genetic component. We estimated the heritability of sporadic BAVM and performed a genome-wide association study (GWAS) to investigate association of common single nucleotide polymorphisms (SNPs) with risk of sporadic BAVM in the international, multicentre Genetics of Arteriovenous Malformation (GEN-AVM) consortium. The Caucasian discovery cohort included 515 BAVM cases and 1191 controls genotyped using Affymetrix genome-wide SNP arrays. Genotype data were imputed to 1000 Genomes Project data, and well-imputed SNPs (>0.01 minor allele frequency) were analysed for association with BAVM. 57 top BAVM-associated SNPs (51 SNPs with p<10(-05) or p<10(-04) in candidate pathway genes, and 6 candidate BAVM SNPs) were tested in a replication cohort including 608 BAVM cases and 744 controls. The estimated heritability of BAVM was 17.6% (SE 8.9%, age and sex-adjusted p=0.015). None of the SNPs were significantly associated with BAVM in the replication cohort after correction for multiple testing. 6 SNPs had a nominal p<0.1 in the replication cohort and map to introns in EGFEM1P, SP4 and CDKAL1 or near JAG1 and BNC2. Of the 6 candidate SNPs, 2 in ACVRL1 and MMP3 had a nominal p<0.05 in the replication cohort. We performed the first GWAS of sporadic BAVM in the largest BAVM cohort assembled to date. No GWAS SNPs were replicated, suggesting that common SNPs do not contribute strongly to BAVM susceptibility. However, heritability estimates suggest a modest but significant genetic contribution. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp
2017-01-04
SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Singh, Vikas K; Khan, Aamir W; Saxena, Rachit K; Kumar, Vinay; Kale, Sandip M; Sinha, Pallavi; Chitikineni, Annapurna; Pazhamala, Lekha T; Garg, Vanika; Sharma, Mamta; Sameer Kumar, Chanda Venkata; Parupalli, Swathi; Vechalapu, Suryanarayana; Patil, Suyash; Muniswamy, Sonnappa; Ghanta, Anuradha; Yamini, Kalinati Narasimhan; Dharmaraj, Pallavi Subbanna; Varshney, Rajeev K
2016-05-01
To map resistance genes for Fusarium wilt (FW) and sterility mosaic disease (SMD) in pigeonpea, sequencing-based bulked segregant analysis (Seq-BSA) was used. Resistant (R) and susceptible (S) bulks from the extreme recombinant inbred lines of ICPL 20096 × ICPL 332 were sequenced. Subsequently, SNP index was calculated between R- and S-bulks with the help of draft genome sequence and reference-guided assembly of ICPL 20096 (resistant parent). Seq-BSA has provided seven candidate SNPs for FW and SMD resistance in pigeonpea. In parallel, four additional genotypes were re-sequenced and their combined analysis with R- and S-bulks has provided a total of 8362 nonsynonymous (ns) SNPs. Of 8362 nsSNPs, 60 were found within the 2-Mb flanking regions of seven candidate SNPs identified through Seq-BSA. Haplotype analysis narrowed down to eight nsSNPs in seven genes. These eight nsSNPs were further validated by re-sequencing 11 genotypes that are resistant and susceptible to FW and SMD. This analysis revealed association of four candidate nsSNPs in four genes with FW resistance and four candidate nsSNPs in three genes with SMD resistance. Further, In silico protein analysis and expression profiling identified two most promising candidate genes namely C.cajan_01839 for SMD resistance and C.cajan_03203 for FW resistance. Identified candidate genomic regions/SNPs will be useful for genomics-assisted breeding in pigeonpea. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Kujur, Alice; Upadhyaya, Hari D.; Shree, Tanima; Bajaj, Deepak; Das, Shouvik; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
We discovered 26785 and 16573 high-quality SNPs differentiating two parental genotypes of a RIL mapping population using reference desi and kabuli genome-based GBS assay. Of these, 3625 and 2177 SNPs have been integrated into eight desi and kabuli chromosomes, respectively in order to construct ultra-high density (0.20–0.37 cM) intra-specific chickpea genetic linkage maps. One of these constructed high-resolution genetic map has potential to identify 33 major genomic regions harbouring 35 robust QTLs (PVE: 17.9–39.7%) associated with three agronomic traits, which were mapped within <1 cM mean marker intervals on desi chromosomes. The extended LD (linkage disequilibrium) decay (~15 cM) in chromosomes of genetic maps have encouraged us to use a rapid integrated approach (comparative QTL mapping, QTL-region specific haplotype/LD-based trait association analysis, expression profiling and gene haplotype-based association mapping) rather than a traditional QTL map-based cloning method to narrow-down one major seed weight (SW) robust QTL region. It delineated favourable natural allelic variants and superior haplotype-containing one seed-specific candidate embryo defective gene regulating SW in chickpea. The ultra-high-resolution genetic maps, QTLs/genes and alleles/haplotypes-related genomic information generated and integrated strategy for rapid QTL/gene identification developed have potential to expedite genomics-assisted breeding applications in crop plants, including chickpea for their genetic enhancement. PMID:25942004
Sánchez, Cecilia Castaño; Smith, Timothy P L; Wiedmann, Ralph T; Vallejo, Roger L; Salem, Mohamed; Yao, Jianbo; Rexroad, Caird E
2009-11-25
To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionary recent genome duplication would be desirable.
Human growth hormone (GH1) gene polymorphism map in a normal-statured adult population
Esteban, Cristina; Audí, Laura; Carrascosa, Antonio; Fernández-Cancio, Mónica; Pérez-Arroyo, Annalisa; Ulied, Angels; Andaluz, Pilar; Arjona, Rosa; Albisu, Marian; Clemente, María; Gussinyé, Miquel; Yeste, Diego
2007-01-01
Objective GH1 gene presents a complex map of single nucleotide polymorphisms (SNPs) in the entire promoter, coding and noncoding regions. The aim of the study was to establish the complete map of GH1 gene SNPs in our control normal population and to analyse its association with adult height. Design, subjects and measurements A systematic GH1 gene analysis was designed in a control population of 307 adults of both sexes with height normally distributed within normal range for the same population: −2 standard deviation scores (SDS) to +2 SDS. An analysis was performed on individual and combined genotype associations with adult height. Results Twenty-five SNPs presented a frequency over 1%: 11 in the promoter (P1 to P11), three in the 5′UTR region (P12 to P14), one in exon 1 (P15), three in intron 1 (P16 to P18), two in intron 2 (P19 and P20), two in exon 4 (P21 and P22) and three in intron 4 (P23 to P25). Twenty-nine additional changes with frequencies under 1% were found in 29 subjects. P8, P19, P20 and P25 had not been previously described. P6, P12, P17 and P25 accounted for 6·2% of the variation in adult height (P = 0·0007) in this population with genotypes A/G at P6, G/G at P6 and A/G at P12 decreasing height SDS (−0·063 ± 0·031, −0·693 ± 0·350 and −0·489 ± 0·265, Mean ± SE) and genotypes A/T at P17 and T/G at P25 increasing height SDS (+1·094 ± 0·456 and +1·184 ± 0·432). Conclusions This study established the GH1 gene sequence variation map in a normal adult height control population confirming the high density of SNPs in a relatively small gene. Our study shows that the more frequent SNPs did not significantly contribute to height determination, while only one promoter and two intronic SNPs contributed significantly to it. Studies in larger populations will have to confirm the associations and in vitro functional studies will elucidate the mechanisms involved. Systematic GH1 gene analysis in patients with growth delay and suspected GH deficiency/insufficiency will clarify whether different SNP frequencies and/or the presence of different sequence changes may be associated with phenotypes in them. PMID:17223997
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple.
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C A M; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric
2017-01-01
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs.
Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; International Wheat Genome Sequencing Consortium; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown-Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming-Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard
2014-01-01
High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker–trait associations in mapping experiments. We developed a genotyping array including about 90 000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence–absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat. PMID:24646323
Genome-Wide Association Mapping of Crown Rust Resistance in Oat Elite Germplasm.
Klos, Kathy Esvelt; Yimer, Belayneh A; Babiker, Ebrahiem M; Beattie, Aaron D; Bonman, J Michael; Carson, Martin L; Chong, James; Harrison, Stephen A; Ibrahim, Amir M H; Kolb, Frederic L; McCartney, Curt A; McMullen, Michael; Fetch, Jennifer Mitchell; Mohammadi, Mohsen; Murphy, J Paul; Tinker, Nicholas A
2017-07-01
Oat crown rust, caused by f. sp. , is a major constraint to oat ( L.) production in many parts of the world. In this first comprehensive multienvironment genome-wide association map of oat crown rust, we used 2972 single-nucleotide polymorphisms (SNPs) genotyped on 631 oat lines for association mapping of quantitative trait loci (QTL). Seedling reaction to crown rust in these lines was assessed as infection type (IT) with each of 10 crown rust isolates. Adult plant reaction was assessed in the field in a total of 10 location-years as percentage severity (SV) and as infection reaction (IR) in a 0-to-1 scale. Overall, 29 SNPs on 12 linkage groups were predictive of crown rust reaction in at least one experiment at a genome-wide level of statistical significance. The QTL identified here include those in regions previously shown to be linked with seedling resistance genes , , , , , and and also with adult-plant resistance and adaptation-related QTL. In addition, QTL on linkage groups Mrg03, Mrg08, and Mrg23 were identified in regions not previously associated with crown rust resistance. Evaluation of marker genotypes in a set of crown rust differential lines supported as the identity of . The SNPs with rare alleles associated with lower disease scores may be suitable for use in marker-assisted selection of oat lines for crown rust resistance. Copyright © 2017 Crop Science Society of America.
Patil, Gunvant; Vuong, Tri D; Kale, Sandip; Valliyodan, Babu; Deshmukh, Rupesh; Zhu, Chengsong; Wu, Xiaolei; Bai, Yonghe; Yungbluth, Dennis; Lu, Fang; Kumpatla, Siva; Shannon, J Grover; Varshney, Rajeev K; Nguyen, Henry T
2018-04-04
The cultivated [Glycine max (L) Merr.] and wild [Glycine soja Siebold & Zucc.] soybean species comprise wide variation in seed composition traits. Compared to wild soybean, cultivated soybean contains low protein, high oil, and high sucrose. In this study, an interspecific population was derived from a cross between G. max (Williams 82) and G. soja (PI 483460B). This recombinant inbred line (RIL) population of 188 lines was sequenced at 0.3× depth. Based on 91 342 single nucleotide polymorphisms (SNPs), recombination events in RILs were defined, and a high-resolution bin map was developed (4070 bins). In addition to bin mapping, quantitative trait loci (QTL) analysis for protein, oil, and sucrose was performed using 3343 polymorphic SNPs (3K-SNP), derived from Illumina Infinium BeadChip sequencing platform. The QTL regions from both platforms were compared, and a significant concordance was observed between bin and 3K-SNP markers. Importantly, the bin map derived from next-generation sequencing technology enhanced mapping resolution (from 1325 to 50 Kb). A total of five, nine, and four QTLs were identified for protein, oil, and sucrose content, respectively, and some of the QTLs coincided with soybean domestication-related genomic loci. The major QTL for protein and oil were mapped on Chr. 20 (qPro_20) and suggested negative correlation between oil and protein. In terms of sucrose content, a novel and major QTL were identified on Chr. 8 (qSuc_08) and harbours putative genes involved in sugar transport. In addition, genome-wide association using 91 342 SNPs confirmed the genomic loci derived from QTL mapping. A QTL-based haplotype using whole-genome resequencing of 106 diverse soybean lines identified unique allelic variation in wild soybean that could be utilized to widen the genetic base in cultivated soybean. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Pang, Erli; Wu, Xiaomei; Lin, Kui
2016-06-01
Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.
Obesity-related genetic variants, human pigmentation, and risk of melanoma
Li, Xin; Liang, Liming; Zhang, Mingfeng; Song, Fengju; Nan, Hongmei; Wang, Li-E; Wei, Qingyi; Lee, Jeffrey E.; Amos, Christopher I.; Qureshi, Abrar A.; Han, Jiali
2013-01-01
Previous biological studies showed evidence of a genetic link between obesity and pigmentation in both animal models and humans. Our study investigated the individual and joint associations between obesity-related single nucleotide polymorphisms (SNPs) and both human pigmentation and risk of melanoma. Eight obesity-related SNPs in the FTO, MAP2K5, NEGR1, FLJ35779, ETV5, CADM2, and NUDT3 genes were nominally significantly associated with hair color among 5,876 individuals of European ancestry. The genetic score combining 35 independent obesity-risk loci was significantly associated with darker hair color (beta-coefficient per ten alleles=0.12, P-value=4 10−5). However, single SNPs or genetic scores showed non-significant association with tanning ability. We further examined the SNPs at the FTO locus for their associations with pigmentation and risk of melanoma. Among the 783 SNPs in the FTO gene with imputation R-square quality metric >0.8 using the 1000 genome data set, ten and three independent SNPs were significantly associated with hair color and tanning ability respectively. Moreover, five independent FTO SNPs showed nominally significant association with risk of melanoma in 1,804 cases and 1,026 controls. But none of them was associated with obesity or in linkage disequilibrium with obesity-related variants. FTO locus may confer variation in human pigmentation and risk of melanoma, which may be independent of its effect on obesity. PMID:23539184
Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay
2013-09-23
Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.
Genetics of Sputum Gene Expression in Chronic Obstructive Pulmonary Disease
Qiu, Weiliang; Cho, Michael H.; Riley, John H.; Anderson, Wayne H.; Singh, Dave; Bakke, Per; Gulsvik, Amund; Litonjua, Augusto A.; Lomas, David A.; Crapo, James D.; Beaty, Terri H.; Celli, Bartolome R.; Rennard, Stephen; Tal-Singer, Ruth; Fox, Steven M.; Silverman, Edwin K.; Hersh, Craig P.
2011-01-01
Previous expression quantitative trait loci (eQTL) studies have performed genetic association studies for gene expression, but most of these studies examined lymphoblastoid cell lines from non-diseased individuals. We examined the genetics of gene expression in a relevant disease tissue from chronic obstructive pulmonary disease (COPD) patients to identify functional effects of known susceptibility genes and to find novel disease genes. By combining gene expression profiling on induced sputum samples from 131 COPD cases from the ECLIPSE Study with genomewide single nucleotide polymorphism (SNP) data, we found 4315 significant cis-eQTL SNP-probe set associations (3309 unique SNPs). The 3309 SNPs were tested for association with COPD in a genomewide association study (GWAS) dataset, which included 2940 COPD cases and 1380 controls. Adjusting for 3309 tests (p<1.5e-5), the two SNPs which were significantly associated with COPD were located in two separate genes in a known COPD locus on chromosome 15: CHRNA5 and IREB2. Detailed analysis of chromosome 15 demonstrated additional eQTLs for IREB2 mapping to that gene. eQTL SNPs for CHRNA5 mapped to multiple linkage disequilibrium (LD) bins. The eQTLs for IREB2 and CHRNA5 were not in LD. Seventy-four additional eQTL SNPs were associated with COPD at p<0.01. These were genotyped in two COPD populations, finding replicated associations with a SNP in PSORS1C1, in the HLA-C region on chromosome 6. Integrative analysis of GWAS and gene expression data from relevant tissue from diseased subjects has located potential functional variants in two known COPD genes and has identified a novel COPD susceptibility locus. PMID:21949713
Thirunavukkarasu, Nepolean; Hossain, Firoz; Arora, Kanika; Sharma, Rinku; Shiriga, Kaliyugam; Mittal, Swati; Mohan, Sweta; Namratha, Pottekatt Mohanlal; Dogga, Sreelatha; Rani, Tikka Shobha; Katragadda, Sumalini; Rathore, Abhishek; Shah, Trushar; Mohapatra, Trilochan; Gupta, Hari Shankar
2014-12-24
Earlier studies were focused on the genetics of temperate and tropical maize under drought. We identified genetic loci and their association with functional mechanisms in 240 accessions of subtropical maize using a high-density marker set under water stress. Out of 61 significant SNPs (11 were false-discovery-rate-corrected associations), identified across agronomic traits, models, and locations by subjecting the accessions to water stress at flowering stage, 48% were associated with drought-tolerant genes. Maize gene models revealed that SNPs mapped for agronomic traits were in fact associated with number of functional traits as follows: stomatal closure, 28; flowering, 15; root development, 5; detoxification, 4; and reduced water potential, 2. Interactions of these SNPS through the functional traits could lead to drought tolerance. The SNPs associated with ABA-dependent signalling pathways played a major role in the plant's response to stress by regulating a series of functions including flowering, root development, auxin metabolism, guard cell functions, and scavenging reactive oxygen species (ROS). ABA signalling genes regulate flowering through epigenetic changes in stress-responsive genes. ROS generated by ABA signalling are reduced by the interplay between ethylene, ABA, and detoxification signalling transductions. Integration of ABA-signalling genes with auxin-inducible genes regulates root development which in turn, maintains the water balance by regulating electrochemical gradient in plant. Several genes are directly or indirectly involved in the functioning of agronomic traits related to water stress. Genes involved in these crucial biological functions interacted significantly in order to maintain the primary as well as exclusive functions related to coping with water stress. SNPs associated with drought-tolerant genes involved in strategic biological functions will be useful to understand the mechanisms of drought tolerance in subtropical maize.
Garcia-Rios, A; Perez-Martinez, P; Fuentes, F; Mata, P; Lopez-Miranda, J; Alonso, R; Rodriguez, F; Garcia-Olid, A; Ruano, J; Ordovas, JM; Perez-Jimenez, F
2010-01-01
Objective To investigate the association of four common single nucleotide polymorphisms (SNPs) at ABCG5 (i7892A>G, i18429C>T, Gln604GluC>G, i11836G>A) and five at ABCG8 (5U145T>G, Tyr54CysA>G, Asp19HisG>C, i14222T>C, and Thr400LysG>T) with plasma lipids concentrations and to explore the interaction between those SNPs and smoking in patients with FH. Methods and Results ABCG5/G8 SNPs were genotyped in 500 subjects with genetic diagnosis of FH. Carriers of the minor A allele at the ABCG5_i11836G>A SNP displayed significantly higher HDL-C concentrations (P=0.023) than G/G subjects. In addition, carriers of the minor G allele at the ABCG5_Gln604GluC>G SNP had significantly lower VLDL-C (P=0.011) and lower TG (P=0.017) concentrations than homozygous C/C. Interestingly, a significant gene-smoking interaction was found, in which carriers of the minor alleles at ABCG5 (i7892A>G, i18429C>T, i11836G>A) SNPs displayed significantly lower HDL-C, higher TC and higher TG respectively, only in smokers. On the other hand, non-smokers carriers of the minor alleles at ABCG5 (i18429C>T and Gln604GluC>G) SNPs had significantly lower TG concentrations (P=0.012 and P=0.035) compared with homozygous for the major allele. Conclusions Our data support the notion that ABCG5/G8 genetic variants modulate plasma lipids concentrations in patients with FH and confirm that this effect could be influenced by smoking. Therefore, these results suggest that gene-environmental interactions can affect the clinical phenotype of FH. PMID:20172523
Quero-García, José; Guzmán, Alejandra; Mansur, Levi; Gratacós, Eduardo; Silva, Herman; Rosyara, Umesh R.; Iezzoni, Amy; Meisel, Lee A.; Dirlewanger, Elisabeth
2013-01-01
Despite the agronomical importance and high synteny with other Prunus species, breeding improvements for cherry have been slow compared to other temperate fruits, such as apple or peach. However, the recent release of the peach genome v1.0 by the International Peach Genome Initiative and the sequencing of cherry accessions to identify Single Nucleotide Polymorphisms (SNPs) provide an excellent basis for the advancement of cherry genetic and genomic studies. The availability of dense genetic linkage maps in phenotyped segregating progenies would be a valuable tool for breeders and geneticists. Using two sweet cherry (Prunus avium L.) intra-specific progenies derived from crosses between ‘Black Tartarian’ × ‘Kordia’ (BT×K) and ‘Regina’ × ‘Lapins’(R×L), high-density genetic maps of the four parental lines and the two segregating populations were constructed. For BT×K and R×L, 89 and 121 F1 plants were used for linkage mapping, respectively. A total of 5,696 SNP markers were tested in each progeny. As a result of these analyses, 723 and 687 markers were mapped into eight linkage groups (LGs) in BT×K and R×L, respectively. The resulting maps spanned 752.9 and 639.9 cM with an average distance of 1.1 and 0.9 cM between adjacent markers in BT×K and R×L, respectively. The maps displayed high synteny and co-linearity between each other, with the Prunus bin map, and with the peach genome v1.0 for all eight LGs (LG1–LG8). These maps provide a useful tool for investigating traits of interest in sweet cherry and represent a qualitative advance in the understanding of the cherry genome and its synteny with other members of the Rosaceae family. PMID:23382953
NASA Technical Reports Server (NTRS)
Baty, D. L.; Wempe, T. E.; Huff, E. M.
1973-01-01
Six airline pilots participated in a fixed-base simulator study to determine the effects of two Horizontal Situation Display (HSD/map) panel locations relative to the Vertical Situation Display (VSD), and of three map orientations on manual piloting performance. Pilot comments and opinions were formally obtained. Significant performance differences were found between wind conditions, and among pilots, but not between map locations and orientations. The results also illustrate the potential tracking accuracy of such a display. Recommendations concerning display location and map orientation are made.
Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe
2010-01-01
Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Aurora-A as a Modifier of Breast Cancer Risk in BRCA 1/2 Mutation Carriers
2007-06-01
Dieter Schaefer, Institute of Human Genetics, University of Frankfurt, Frankfurt, Germany; Norbert Arnold, University of Schleswig- Holstein , Campus...Intron 2 Opossum Mouse Rat Cow Dog Intron 1 Figure 3 | The FGFR2 locus. a, Map of the whole FGFR2 gene, viewed relative to common SNPs on HapMap
Robinson, Nicholas; Baranski, Matthew; Mahapatra, Kanta Das; Saha, Jatindra Nath; Das, Sweta; Mishra, Jashobanta; Das, Paramananda; Kent, Matthew; Arnyasi, Mariann; Sahoo, Pramoda Kumar
2014-06-30
Production of carp dominates world aquaculture. More than 1.1 million tonnes of rohu carp, Labeo rohita (Hamilton), were produced in 2010. Aeromonas hydrophila is a bacterial pathogen causing aeromoniasis in rohu, and is a major problem for carp production worldwide. There is a need to better understand the genetic mechanisms affecting resistance to this disease, and to develop tools that can be used with selective breeding to improve resistance. Here we use a 6 K SNP array to genotype 21 full-sibling families of L. rohita that were experimentally challenged intra-peritoneally with a virulent strain of A. hydrophila to scan the genome for quantitative trait loci associated with disease resistance. In all, 3193 SNPs were found to be informative and were used to create a linkage map and to scan for QTL affecting resistance to A. hydrophila. The linkage map consisted of 25 linkage groups, corresponding to the number of haploid chromosomes in L. rohita. Male and female linkage maps were similar in terms of order, coverage (1384 and 1393 cM, respectively) and average interval distances (1.32 and 1.35 cM, respectively). Forty-one percent of the SNPs were annotated with gene identity using BLAST (cut off E-score of 0.001). Twenty-one SNPs mapping to ten linkage groups showed significant associations with the traits hours of survival and dead or alive (P <0.05 after Bonferroni correction). Of the SNPs showing significant or suggestive associations with the traits, several were homologous to genes of known immune function or were in close linkage to such genes. Genes of interest included heat shock proteins (70, 60, 105 and "small heat shock proteins"), mucin (5b precursor and 2), lectin (receptor and CD22), tributyltin-binding protein, major histocompatibility loci (I and II), complement protein component c7-1, perforin 1, ubiquitin (ligase, factor e4b isoform 2 and conjugation enzyme e2 c), proteasome subunit, T-cell antigen receptor and lymphocyte specific protein tyrosine kinase. A panel of markers has been identified that will be validated for use with both genomic and marker-assisted selection to improve resistance of L. rohita to A. hydrophila.
Zambonelli, Paolo; Davoli, Roberta; Bigi, Mila; Braglia, Silvia; De Paolis, Luigi Francesco; Buttazzoni, Luca; Gallo, Maurizio; Russo, Vincenzo
2013-10-08
The pH is an important parameter influencing technological quality of pig meat, a trait affected by environmental and genetic factors. Several quantitative trait loci associated to meat pH are described on PigQTL database but only two genes influencing this parameter have been so far detected: Ryanodine receptor 1 and Protein kinase, AMP-activated, gamma 3 non-catalytic subunit. To search for genes influencing meat pH we analyzed genomic regions with quantitative effect on this trait in order to detect SNPs to use for an association study. The expressed sequences mapping on porcine chromosomes 1, 2, 3 in regions associated to pork pH were searched in silico to find SNPs. 356 out of 617 detected SNPs were used to genotype Italian Large White pigs and to perform an association analysis with meat pH values recorded in semimembranosus muscle at about 1 hour (pH1) and 24 hours (pHu) post mortem.The results of the analysis showed that 5 markers mapping on chromosomes 1 or 3 were associated with pH1 and 10 markers mapping on chromosomes 1 or 2 were associated with pHu. After False Discovery Rate correction only one SNP mapping on chromosome 2 was confirmed to be associated to pHu. This polymorphism was located in the 3'UTR of two partly overlapping genes, Deoxyhypusine synthase (DHPS) and WD repeat domain 83 (WDR83). The overlapping of the 3'UTRs allows the co-regulation of mRNAs stability by a cis-natural antisense transcript method of regulation. DHPS catalyzes the first step in hypusine formation, a unique amino acid formed by the posttranslational modification of the protein eukaryotic translation initiation factor 5A in a specific lysine residue. WDR83 has an important role in the modulation of a cascade of genes involved in cellular hypoxia defense by intensifying the glycolytic pathway and, theoretically, the meat pH value. The involvement of the SNP detected in the DHPS/WDR83 genes on meat pH phenotypic variability and their functional role are suggestive of molecular and biological processes related to glycolysis increase during post-mortem phase. This finding, after validation, can be applied to identify new biomarkers to be used to improve pig meat quality.
Re-sequencing and genetic variation identification of a rice line with ideal plant architecture.
Li, Shuangcheng; Xie, Kailong; Li, Wenbo; Zou, Ting; Ren, Yun; Wang, Shiquan; Deng, Qiming; Zheng, Aiping; Zhu, Jun; Liu, Huainian; Wang, Lingxia; Ai, Peng; Gao, Fengyan; Huang, Bin; Cao, Xuemei; Li, Ping
2012-12-01
The ideal plant architecture (IPA) includes several important characteristics such as low tiller numbers, few or no unproductive tillers, more grains per panicle, and thick and sturdy stems. We have developed an indica restorer line 7302R that displays the IPA phenotype in terms of tiller number, grain number, and stem strength. However, its mechanism had to be clarified. We performed re-sequencing and genome-wide variation analysis of 7302R using the Solexa sequencing technology. With the genomic sequence of the indica cultivar 9311 as reference, 307 627 SNPs, 57 372 InDels, and 3 096 SVs were identified in the 7302R genome. The 7302R-specific variations were investigated via the synteny analysis of all the SNPs of 7302R with those of the previous sequenced none-IPA-type lines IR24, MH63, and SH527. Moreover, we found 178 168 7302R-specific SNPs across the whole genome and 30 239 SNPs in the predicted mRNA regions, among which 8 517 were Non-syn CDS. In addition, 263 large-effect SNPs that were expected to affect the integrity of encoded proteins were identified from the 7302R-specific SNPs. SNPs of several important previously cloned rice genes were also identified by aligning the 7302R sequence with other sequence lines. Our results provided several candidates account for the IPA phenotype of 7302R. These results therefore lay the groundwork for long-term efforts to uncover important genes and alleles for rice plant architecture construction, also offer useful data resources for future genetic and genomic studies in rice.
van Eck, Herman J; Vos, Peter G; Valkonen, Jari P T; Uitdewilligen, Jan G A M L; Lensing, Hellen; de Vetten, Nick; Visser, Richard G F
2017-03-01
The method of graphical genotyping is applied to a panel of tetraploid potato cultivars to visualize haplotype sharing. The method allowed to map genes involved in virus and nematode resistance. The physical coordinates of the amount of linkage drag surrounding these genes are easily interpretable. Graphical genotyping is a visually attractive and easily interpretable method to represent genetic marker data. In this paper, the method is extended from diploids to a panel of tetraploid potato cultivars. Application of filters to select a subset of SNPs allows one to visualize haplotype sharing between individuals that also share a specific locus. The method is illustrated with cultivars resistant to Potato virus Y (PVY), while simultaneously selecting for the absence of the SNPs in susceptible clones. SNP data will then merge into an image which displays the coordinates of a distal genomic region on the northern arm of chromosome 11 where a specific haplotype is introgressed from the wild potato species S. stoloniferum (CPC 2093) carrying a gene (Ny (o,n)sto ) conferring resistance to two PVY strains, PVY O and PVY NTN . Graphical genotyping was also successful in showing the haplotypes on chromosome 12 carrying Ry-f sto , another resistance gene derived from S. stoloniferum conferring broad-spectrum resistance to PVY, as well as chromosome 5 haplotypes from S. vernei, with the Gpa5 locus involved in resistance against Globodera pallida cyst nematodes. The image also shows shortening of linkage drag by meiotic recombination of the introgression segment in more recent breeding material. Identity-by-descent was found to be a requirement for using graphical genotyping, which is proposed as a non-statistical alternative method for gene discovery, as compared with genome-wide association studies. The potential and limitations of the method are discussed.
Improving mapping and SNP-calling performance in multiplexed targeted next-generation sequencing
2012-01-01
Background Compared to classical genotyping, targeted next-generation sequencing (tNGS) can be custom-designed to interrogate entire genomic regions of interest, in order to detect novel as well as known variants. To bring down the per-sample cost, one approach is to pool barcoded NGS libraries before sample enrichment. Still, we lack a complete understanding of how this multiplexed tNGS approach and the varying performance of the ever-evolving analytical tools can affect the quality of variant discovery. Therefore, we evaluated the impact of different software tools and analytical approaches on the discovery of single nucleotide polymorphisms (SNPs) in multiplexed tNGS data. To generate our own test model, we combined a sequence capture method with NGS in three experimental stages of increasing complexity (E. coli genes, multiplexed E. coli, and multiplexed HapMap BRCA1/2 regions). Results We successfully enriched barcoded NGS libraries instead of genomic DNA, achieving reproducible coverage profiles (Pearson correlation coefficients of up to 0.99) across multiplexed samples, with <10% strand bias. However, the SNP calling quality was substantially affected by the choice of tools and mapping strategy. With the aim of reducing computational requirements, we compared conventional whole-genome mapping and SNP-calling with a new faster approach: target-region mapping with subsequent ‘read-backmapping’ to the whole genome to reduce the false detection rate. Consequently, we developed a combined mapping pipeline, which includes standard tools (BWA, SAMtools, etc.), and tested it on public HiSeq2000 exome data from the 1000 Genomes Project. Our pipeline saved 12 hours of run time per Hiseq2000 exome sample and detected ~5% more SNPs than the conventional whole genome approach. This suggests that more potential novel SNPs may be discovered using both approaches than with just the conventional approach. Conclusions We recommend applying our general ‘two-step’ mapping approach for more efficient SNP discovery in tNGS. Our study has also shown the benefit of computing inter-sample SNP-concordances and inspecting read alignments in order to attain more confident results. PMID:22913592
Shi, Yuan Yuan; Sun, Liang Xian; Huang, Zachary Y; Wu, Xiao Bo; Zhu, Yong Qiang; Zheng, Hua Jun; Zeng, Zhi Jiang
2013-01-01
The Eastern honey bee, Apis cerana Fabricius, is distributed in southern and eastern Asia, from India and China to Korea and Japan and southeast to the Moluccas. This species is also widely kept for honey production besides Apis mellifera. Apis cerana is also a model organism for studying social behavior, caste determination, mating biology, sexual selection, and host-parasite interactions. Few resources are available for molecular research in this species, and a linkage map was never constructed. A linkage map is a prerequisite for quantitative trait loci mapping and for analyzing genome structure. We used the Chinese honey bee, Apis cerana cerana to construct the first linkage map in the Eastern honey bee. F2 workers (N = 103) were genotyped for 126,990 single nucleotide polymorphisms (SNPs). After filtering low quality and those not passing the Mendel test, we obtained 3,000 SNPs, 1,535 of these were informative and used to construct a linkage map. The preliminary map contains 19 linkage groups, we then mapped the 19 linkage groups to 16 chromosomes by comparing the markers to the genome of A. mellfiera. The final map contains 16 linkage groups with a total of 1,535 markers. The total genetic distance is 3,942.7 centimorgans (cM) with the largest linkage group (180 loci) measuring 574.5 cM. Average marker interval for all markers across the 16 linkage groups is 2.6 cM. We constructed a high density linkage map for A. c. cerana with 1,535 markers. Because the map is based on SNP markers, it will enable easier and faster genotyping assays than randomly amplified polymorphic DNA or microsatellite based maps used in A. mellifera.
Broekgaarden, Colette; Bucher, Johan; Bac-Molenaar, Johanna; Keurentjes, Joost J. B.; Kruijer, Willem; Voorrips, Roeland E.; Vosman, Ben
2015-01-01
Plants have evolved a variety of ways to defend themselves against biotic attackers. This has resulted in the presence of substantial variation in defense mechanisms among plants, even within a species. Genome-wide association (GWA) mapping is a useful tool to study the genetic architecture of traits, but has so far only had limited exploitation in studies of plant defense. Here, we study the genetic architecture of defense against the phloem-feeding insect cabbage whitefly (Aleyrodes proletella) in Arabidopsis thaliana. We determined whitefly performance, i.e. the survival and reproduction of whitefly females, on 360 worldwide selected natural accessions and subsequently performed GWA mapping using 214,051 SNPs. Substantial variation for whitefly adult survival and oviposition rate (number of eggs laid per female per day) was observed between the accessions. We identified 39 candidate SNPs for either whitefly adult survival or oviposition rate, all with relatively small effects, underpinning the complex architecture of defense traits. Among the corresponding candidate genes, i.e. genes in linkage disequilibrium (LD) with candidate SNPs, none have previously been identified as a gene playing a role in the interaction between plants and phloem-feeding insects. Whitefly performance on knock-out mutants of a number of candidate genes was significantly affected, validating the potential of GWA mapping for novel gene discovery in plant-insect interactions. Our results show that GWA analysis is a very useful tool to gain insight into the genetic architecture of plant defense against herbivorous insects, i.e. we identified and validated several genes affecting whitefly performance that have not previously been related to plant defense against herbivorous insects. PMID:26699853
Genome wide association mapping for grain shape traits in indica rice.
Feng, Yue; Lu, Qing; Zhai, Rongrong; Zhang, Mengchen; Xu, Qun; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Wei, Xinghua
2016-10-01
Using genome-wide association mapping, 47 SNPs within 27 significant loci were identified for four grain shape traits, and 424 candidate genes were predicted from public database. Grain shape is a key determinant of grain yield and quality in rice (Oryza sativa L.). However, our knowledge of genes controlling rice grain shape remains limited. Genome-wide association mapping based on linkage disequilibrium (LD) has recently emerged as an effective approach for identifying genes or quantitative trait loci (QTL) underlying complex traits in plants. In this study, association mapping based on 5291 single nucleotide polymorphisms (SNPs) was conducted to identify significant loci associated with grain shape traits in a global collection of 469 diverse rice accessions. A total of 47 SNPs were located in 27 significant loci for four grain traits, and explained ~44.93-65.90 % of the phenotypic variation for each trait. In total, 424 candidate genes within a 200 kb extension region (±100 kb of each locus) of these loci were predicted. Of them, the cloned genes GS3 and qSW5 showed very strong effects on grain length and grain width in our study. Comparing with previously reported QTLs for grain shape traits, we found 11 novel loci, including 3, 3, 2 and 3 loci for grain length, grain width, grain length-width ratio and thousand grain weight, respectively. Validation of these new loci would be performed in the future studies. These results revealed that besides GS3 and qSW5, multiple novel loci and mechanisms were involved in determining rice grain shape. These findings provided valuable information for understanding of the genetic control of grain shape and molecular marker assistant selection (MAS) breeding in rice.
Zhong, Xiaoxiao; Li, Qi; Yu, Hong; Kong, Lingfeng
2014-01-01
Oysters, with high levels of phenotypic plasticity and wide geographic distribution, are a challenging group for taxonomists and phylogenetics. Our study is intended to generate new EST-SNP markers and to evaluate their potential for cross-species utilization in phylogenetic study of the genus Crassostrea. In the study, 57 novel SNPs were developed from an EST database of C. gigas by the HRM (high-resolution melting) method. Transferability of 377 SNPs developed for C. gigas was examined on four other Crassostrea species: C. sikamea, C. angulata, C. hongkongensis and C. ariakensis. Among the 377 primer pairs tested, 311 (82.5%) primers showed amplification in C. sikamea, 353 (93.6%) in C. angulata, 254 (67.4%) in C. hongkongensis and 253 (67.1%) in C. ariakensis. A total of 214 SNPs were found to be transferable to all four species. Phylogenetic analyses showed that C. hongkongensis was a sister species of C. ariakensis and that this clade was sister to the clade containing C. sikamea, C. angulata and C. gigas. Within this clade, C. gigas and C. angulata had the closest relationship, with C. sikamea being the sister group. In addition, we detected eight SNPs as potentially being under selection by two outlier tests (fdist and hierarchical methods). The SNPs studied here should be useful for genetic diversity, comparative mapping and phylogenetic studies across species in Crassostrea and the candidate outlier SNPs are worth exploring in more detail regarding association genetics and functional studies.
Melzer, Nina; Wittenburg, Dörte; Repsilber, Dirk
2013-01-01
In this study the benefit of metabolome level analysis for the prediction of genetic value of three traditional milk traits was investigated. Our proposed approach consists of three steps: First, milk metabolite profiles are used to predict three traditional milk traits of 1,305 Holstein cows. Two regression methods, both enabling variable selection, are applied to identify important milk metabolites in this step. Second, the prediction of these important milk metabolite from single nucleotide polymorphisms (SNPs) enables the detection of SNPs with significant genetic effects. Finally, these SNPs are used to predict milk traits. The observed precision of predicted genetic values was compared to the results observed for the classical genotype-phenotype prediction using all SNPs or a reduced SNP subset (reduced classical approach). To enable a comparison between SNP subsets, a special invariable evaluation design was implemented. SNPs close to or within known quantitative trait loci (QTL) were determined. This enabled us to determine if detected important SNP subsets were enriched in these regions. The results show that our approach can lead to genetic value prediction, but requires less than 1% of the total amount of (40,317) SNPs., significantly more important SNPs in known QTL regions were detected using our approach compared to the reduced classical approach. Concluding, our approach allows a deeper insight into the associations between the different levels of the genotype-phenotype map (genotype-metabolome, metabolome-phenotype, genotype-phenotype). PMID:23990900
Chen, Zhanghua; Pereira, Mark A.; Seielstad, Mark; Koh, Woon-Puay; Tai, E. Shyong; Teo, Yik-Ying; Liu, Jianjun; Hsu, Chris; Wang, Renwei; Odegaard, Andrew O.; Thyagarajan, Bharat; Koratkar, Revati; Yuan, Jian-Min; Gross, Myron D.; Stram, Daniel O.
2014-01-01
Background Genome-wide association studies (GWAS) have identified genetic factors in type 2 diabetes (T2D), mostly among individuals of European ancestry. We tested whether previously identified T2D-associated single nucleotide polymorphisms (SNPs) replicate and whether SNPs in regions near known T2D SNPs were associated with T2D within the Singapore Chinese Health Study. Methods 2338 cases and 2339 T2D controls from the Singapore Chinese Health Study were genotyped for 507,509 SNPs. Imputation extended the genotyped SNPs to 7,514,461 with high estimated certainty (r2>0.8). Replication of known index SNP associations in T2D was attempted. Risk scores were computed as the sum of index risk alleles. SNPs in regions ±100 kb around each index were tested for associations with T2D in conditional fine-mapping analysis. Results Of 69 index SNPs, 20 were genotyped directly and genotypes at 35 others were well imputed. Among the 55 SNPs with data, disease associations were replicated (at p<0.05) for 15 SNPs, while 32 more were directionally consistent with previous reports. Risk score was a significant predictor with a 2.03 fold higher risk CI (1.69–2.44) of T2D comparing the highest to lowest quintile of risk allele burden (p = 5.72×10−14). Two improved SNPs around index rs10923931 and 5 new candidate SNPs around indices rs10965250 and rs1111875 passed simple Bonferroni corrections for significance in conditional analysis. Nonetheless, only a small fraction (2.3% on the disease liability scale) of T2D burden in Singapore is explained by these SNPs. Conclusions While diabetes risk in Singapore Chinese involves genetic variants, most disease risk remains unexplained. Further genetic work is ongoing in the Singapore Chinese population to identify unique common variants not already seen in earlier studies. However rapid increases in T2D risk have occurred in recent decades in this population, indicating that dynamic environmental influences and possibly gene by environment interactions complicate the genetic architecture of this disease. PMID:24520337
Chen, Zhanghua; Pereira, Mark A; Seielstad, Mark; Koh, Woon-Puay; Tai, E Shyong; Teo, Yik-Ying; Liu, Jianjun; Hsu, Chris; Wang, Renwei; Odegaard, Andrew O; Thyagarajan, Bharat; Koratkar, Revati; Yuan, Jian-Min; Gross, Myron D; Stram, Daniel O
2014-01-01
Genome-wide association studies (GWAS) have identified genetic factors in type 2 diabetes (T2D), mostly among individuals of European ancestry. We tested whether previously identified T2D-associated single nucleotide polymorphisms (SNPs) replicate and whether SNPs in regions near known T2D SNPs were associated with T2D within the Singapore Chinese Health Study. 2338 cases and 2339 T2D controls from the Singapore Chinese Health Study were genotyped for 507,509 SNPs. Imputation extended the genotyped SNPs to 7,514,461 with high estimated certainty (r(2)>0.8). Replication of known index SNP associations in T2D was attempted. Risk scores were computed as the sum of index risk alleles. SNPs in regions ± 100 kb around each index were tested for associations with T2D in conditional fine-mapping analysis. Of 69 index SNPs, 20 were genotyped directly and genotypes at 35 others were well imputed. Among the 55 SNPs with data, disease associations were replicated (at p<0.05) for 15 SNPs, while 32 more were directionally consistent with previous reports. Risk score was a significant predictor with a 2.03 fold higher risk CI (1.69-2.44) of T2D comparing the highest to lowest quintile of risk allele burden (p = 5.72 × 10(-14)). Two improved SNPs around index rs10923931 and 5 new candidate SNPs around indices rs10965250 and rs1111875 passed simple Bonferroni corrections for significance in conditional analysis. Nonetheless, only a small fraction (2.3% on the disease liability scale) of T2D burden in Singapore is explained by these SNPs. While diabetes risk in Singapore Chinese involves genetic variants, most disease risk remains unexplained. Further genetic work is ongoing in the Singapore Chinese population to identify unique common variants not already seen in earlier studies. However rapid increases in T2D risk have occurred in recent decades in this population, indicating that dynamic environmental influences and possibly gene by environment interactions complicate the genetic architecture of this disease.
Hand, Brian K.; Hether, Tyler D; Kovach, Ryan P.; Muhlfeld, Clint C.; Amish, Stephen J.; Boyer, Matthew C.; O’Rourke, Sean M.; Miller, Michael R.; Lowe, Winsor H.; Hohenlohe, Paul A.; Luikart, Gordon
2015-01-01
Invasive hybridization and introgression pose a serious threat to the persistence of many native species. Understanding the effects of hybridization on native populations (e.g., fitness consequences) requires numerous species-diagnostic loci distributed genome-wide. Here we used RAD sequencing to discover thousands of single-nucleotide polymorphisms (SNPs) that are diagnostic between rainbow trout (RBT, Oncorhynchus mykiss), the world’s most widely introduced fish, and native westslope cutthroat trout (WCT, O. clarkii lewisi) in the northern Rocky Mountains, USA. We advanced previous work that identified 4,914 species-diagnostic loci by using longer sequence reads (100 bp vs. 60 bp) and a larger set of individuals (n = 84). We sequenced RAD libraries for individuals from diverse sampling sources, including native populations of WCT and hatchery broodstocks of WCT and RBT. We also took advantage of a newly released reference genome assembly for RBT to align our RAD loci. In total, we discovered 16,788 putatively diagnostic SNPs, 10,267 of which we mapped to anchored chromosome locations on the RBT genome. A small portion of previously discovered putative diagnostic loci (325 of 4,914) were no longer diagnostic (i.e., fixed between species) based on our wider survey of non-hybridized RBT and WCT individuals. Our study suggests that RAD loci mapped to a draft genome assembly could provide the marker density required to identify genes and chromosomal regions influencing selection in admixed populations of conservation concern and evolutionary interest.
Genetic variants associated with celiac disease and the risk for coronary artery disease.
Jansen, Henning; Willenborg, Christina; Schlesinger, Sabrina; Ferrario, Paola G; König, Inke R; Erdmann, Jeanette; Samani, Nilesh J; Lieb, Wolfgang; Schunkert, Heribert
2015-10-01
Epidemiological evidence suggests that patients with celiac disease are at increased risk for coronary artery disease (CAD). Genetic-epidemiological analyses identified many single nucleotide polymorphisms (SNPs) associated with celiac disease. If there is a causal relation between celiac disease and CAD, one might expect that risk alleles primarily associated with celiac disease also increase the risk of CAD. In this study we identified from literature 41 SNPs that have been previously described to be genome-wide associated with celiac disease (p < 5 × 10(-08)). These SNPs were evaluated for their association with CAD in the Coronary ARtery DIsease Genome-wide Replication and Meta-analysis (CARDIoGRAM) dataset, a meta-analysis comprising genome-wide SNP association data from 22,233 CAD cases and 64,762 controls. 24 out of 41 (58.5 %) risk alleles for celiac disease displayed a positive association with CAD (CAD-OR range 1.001-1.081). The remaining risk alleles for celiac disease (n = 16) revealed CAD-ORs of ≤1.0 (range 0.951-1.0). The proportion of CAD associated alleles was greater but did not differ significantly from the proportion of 50 % expected by chance (p = 0.069). One SNP (rs653178 at the SH2B3/ATXN2 locus) displayed study-wise statistically significant association with CAD with directionality consistent effects on celiac disease and CAD. However, the effect of this locus is most likely driven by pleiotropic effects on multiple other diseases. In conclusion, this genetically based approach provided no convincing evidence that SNPs associated with celiac disease contribute to the risk of CAD. Hence, common non-genetic factors may play a more important role explaining the coincidence of these two complex disease conditions.
Genome-wide association study identifies novel breast cancer susceptibility loci
Easton, Douglas F.; Pooley, Karen A.; Dunning, Alison M.; Pharoah, Paul D. P.; Thompson, Deborah; Ballinger, Dennis G.; Struewing, Jeffery P.; Morrison, Jonathan; Field, Helen; Luben, Robert; Wareham, Nicholas; Ahmed, Shahana; Healey, Catherine S.; Bowman, Richard; Meyer, Kerstin B.; Haiman, Christopher A.; Kolonel, Laurence K.; Henderson, Brian E.; Marchand, Loic Le; Brennan, Paul; Sangrajrang, Suleeporn; Gaborieau, Valerie; Odefrey, Fabrice; Shen, Chen-Yang; Wu, Pei-Ei; Wang, Hui-Chun; Eccles, Diana; Evans, D. Gareth; Peto, Julian; Fletcher, Olivia; Johnson, Nichola; Seal, Sheila; Stratton, Michael R.; Rahman, Nazneen; Chenevix-Trench, Georgia; Bojesen, Stig E.; Nordestgaard, Børge G.; Axelsson, Christen K.; Garcia-Closas, Montserrat; Brinton, Louise; Chanock, Stephen; Lissowska, Jolanta; Peplonska, Beata; Nevanlinna, Heli; Fagerholm, Rainer; Eerola, Hannaleena; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Ahn, Sei-Hyun; Hunter, David J.; Hankinson, Susan E.; Cox, David G.; Hall, Per; Wedren, Sara; Liu, Jianjun; Low, Yen-Ling; Bogdanova, Natalia; Schürmann, Peter; Dörk, Thilo; Tollenaar, Rob A. E. M.; Jacobi, Catharina E.; Devilee, Peter; Klijn, Jan G. M.; Sigurdson, Alice J.; Doody, Michele M.; Alexander, Bruce H.; Zhang, Jinghui; Cox, Angela; Brock, Ian W.; MacPherson, Gordon; Reed, Malcolm W. R.; Couch, Fergus J.; Goode, Ellen L.; Olson, Janet E.; Meijers-Heijboer, Hanne; van den Ouweland, Ans; Uitterlinden, André; Rivadeneira, Fernando; Milne, Roger L.; Ribas, Gloria; Gonzalez-Neira, Anna; Benitez, Javier; Hopper, John L.; McCredie, Margaret; Southey, Melissa; Giles, Graham G.; Schroen, Chris; Justenhoven, Christina; Brauch, Hiltrud; Hamann, Ute; Ko, Yon-Dschun; Spurdle, Amanda B.; Beesley, Jonathan; Chen, Xiaoqing; Mannermaa, Arto; Kosma, Veli-Matti; Kataja, Vesa; Hartikainen, Jaana; Day, Nicholas E.; Cox, David R.; Ponder, Bruce A. J.; Luccarini, Craig; Conroy, Don; Shah, Mitul; Munday, Hannah; Jordan, Clare; Perkins, Barbara; West, Judy; Redman, Karen; Driver, Kristy; Aghmesheh, Morteza; Amor, David; Andrews, Lesley; Antill, Yoland; Armes, Jane; Armitage, Shane; Arnold, Leanne; Balleine, Rosemary; Begley, Glenn; Beilby, John; Bennett, Ian; Bennett, Barbara; Berry, Geoffrey; Blackburn, Anneke; Brennan, Meagan; Brown, Melissa; Buckley, Michael; Burke, Jo; Butow, Phyllis; Byron, Keith; Callen, David; Campbell, Ian; Chenevix-Trench, Georgia; Clarke, Christine; Colley, Alison; Cotton, Dick; Cui, Jisheng; Culling, Bronwyn; Cummings, Margaret; Dawson, Sarah-Jane; Dixon, Joanne; Dobrovic, Alexander; Dudding, Tracy; Edkins, Ted; Eisenbruch, Maurice; Farshid, Gelareh; Fawcett, Susan; Field, Michael; Firgaira, Frank; Fleming, Jean; Forbes, John; Friedlander, Michael; Gaff, Clara; Gardner, Mac; Gattas, Mike; George, Peter; Giles, Graham; Gill, Grantley; Goldblatt, Jack; Greening, Sian; Grist, Scott; Haan, Eric; Harris, Marion; Hart, Stewart; Hayward, Nick; Hopper, John; Humphrey, Evelyn; Jenkins, Mark; Jones, Alison; Kefford, Rick; Kirk, Judy; Kollias, James; Kovalenko, Sergey; Lakhani, Sunil; Leary, Jennifer; Lim, Jacqueline; Lindeman, Geoff; Lipton, Lara; Lobb, Liz; Maclurcan, Mariette; Mann, Graham; Marsh, Deborah; McCredie, Margaret; McKay, Michael; McLachlan, Sue Anne; Meiser, Bettina; Milne, Roger; Mitchell, Gillian; Newman, Beth; O'Loughlin, Imelda; Osborne, Richard; Peters, Lester; Phillips, Kelly; Price, Melanie; Reeve, Jeanne; Reeve, Tony; Richards, Robert; Rinehart, Gina; Robinson, Bridget; Rudzki, Barney; Salisbury, Elizabeth; Sambrook, Joe; Saunders, Christobel; Scott, Clare; Scott, Elizabeth; Scott, Rodney; Seshadri, Ram; Shelling, Andrew; Southey, Melissa; Spurdle, Amanda; Suthers, Graeme; Taylor, Donna; Tennant, Christopher; Thorne, Heather; Townshend, Sharron; Tucker, Kathy; Tyler, Janet; Venter, Deon; Visvader, Jane; Walpole, Ian; Ward, Robin; Waring, Paul; Warner, Bev; Warren, Graham; Watson, Elizabeth; Williams, Rachael; Wilson, Judy; Winship, Ingrid; Young, Mary Ann; Bowtell, David; Green, Adele; deFazio, Anna; Chenevix-Trench, Georgia; Gertig, Dorota; Webb, Penny
2009-01-01
Breast cancer exhibits familial aggregation, consistent with variation in genetic susceptibility to the disease. Known susceptibility genes account for less than 25% of the familial risk of breast cancer, and the residual genetic variance is likely to be due to variants conferring more moderate risks. To identify further susceptibility alleles, we conducted a two-stage genome-wide association study in 4,398 breast cancer cases and 4,316 controls, followed by a third stage in which 30 single nucleotide polymorphisms (SNPs) were tested for confirmation in 21,860 cases and 22,578 controls from 22 studies. We used 227,876 SNPs that were estimated to correlate with 77% of known common SNPs in Europeans at r2>0.5. SNPs in five novel independent loci exhibited strong and consistent evidence of association with breast cancer (P<10−7). Four of these contain plausible causative genes (FGFR2, TNRC9, MAP3K1 and LSP1). At the second stage, 1,792 SNPs were significant at the P<0.05 level compared with an estimated 1,343 that would be expected by chance, indicating that many additional common susceptibility alleles may be identifiable by this approach. PMID:17529967
O'Donnell, Peter H.; Gamazon, Eric; Zhang, Wei; Stark, Amy L.; Kistner-Griffin, Emily O.; Huang, R. Stephanie; Dolan, M. Eileen
2010-01-01
Objectives Clinical studies show that Asians (ASN) are more susceptible to toxicities associated with platinum-containing regimens. We hypothesized that studying ASN as an `enriched phenotype' population could enable the discovery of novel genetic determinants of platinum susceptibility. Methods Using well-genotyped lymphoblastoid cell lines from the HapMap, we determined cisplatin and carboplatin cytotoxicity phenotypes (IC50s) for ASN, Caucasians (CEU), and Africans (YRI). IC50s were used in genome-wide association studies. Results ASN were most sensitive to platinums, corroborating clinical findings. ASN genome-wide association studies produced 479 single-nucleotide polymorphisms (SNPs) associating with cisplatin susceptibility and 199 with carboplatin susceptibility (P<10−4). Considering only the most significant variants (P< 9.99 × 10−6), backwards elimination was then used to identify reduced-model SNPs, which robustly described the drug phenotypes within ASN. These SNPs comprised highly descriptive genetic signatures of susceptibility, with 12 SNPs explaining more than 95% of the susceptibility phenotype variation for cisplatin, and eight SNPs approximately 75% for carboplatin. To determine the possible function of these variants in ASN, the SNPs were tested for association with differential expression of target genes. SNPs were highly associated with the expression of multiple target genes, and notably, the histone H3 family was implicated for both drugs, suggesting a platinum-class mechanism. Histone H3 has repeatedly been described as regulating the formation of platinum-DNA adducts, but this is the first evidence that specific genetic variants might mediate these interactions in a pharmacogenetic manner. Finally, to determine whether any ASN-identified SNPs might also be important in other human populations, we interrogated all 479/199 SNPs for association with platinum susceptibility in an independent combined CEU/YRI population. Three unique SNPs for cisplatin and 10 for carboplatin replicated in CEU/YRI. Conclusion Enriched `platinum susceptible' populations can be used to discover novel genetic determinants governing interindividual platinum chemotherapy susceptibility. PMID:20393316
2013-01-01
Background SNPs&GO is a method for the prediction of deleterious Single Amino acid Polymorphisms (SAPs) using protein functional annotation. In this work, we present the web server implementation of SNPs&GO (WS-SNPs&GO). The server is based on Support Vector Machines (SVM) and for a given protein, its input comprises: the sequence and/or its three-dimensional structure (when available), a set of target variations and its functional Gene Ontology (GO) terms. The output of the server provides, for each protein variation, the probabilities to be associated to human diseases. Results The server consists of two main components, including updated versions of the sequence-based SNPs&GO (recently scored as one of the best algorithms for predicting deleterious SAPs) and of the structure-based SNPs&GO3d programs. Sequence and structure based algorithms are extensively tested on a large set of annotated variations extracted from the SwissVar database. Selecting a balanced dataset with more than 38,000 SAPs, the sequence-based approach achieves 81% overall accuracy, 0.61 correlation coefficient and an Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve of 0.88. For the subset of ~6,600 variations mapped on protein structures available at the Protein Data Bank (PDB), the structure-based method scores with 84% overall accuracy, 0.68 correlation coefficient, and 0.91 AUC. When tested on a new blind set of variations, the results of the server are 79% and 83% overall accuracy for the sequence-based and structure-based inputs, respectively. Conclusions WS-SNPs&GO is a valuable tool that includes in a unique framework information derived from protein sequence, structure, evolutionary profile, and protein function. WS-SNPs&GO is freely available at http://snps.biofold.org/snps-and-go. PMID:23819482
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are the marker of choice for many researchers due to their abundance and the high-throughput methods available for their multiplex analysis. Only recently have SNP markers been available to researchers in soybean [Glycine max (L.) Merr.] with the release of th...
USDA-ARS?s Scientific Manuscript database
Chromosome 5 of onion carries major quantitative trait loci (QTL) of interest to breeders that control dry-matter content, pungency and storability of bulbs, amounts and types of epicuticular waxes, and resistances to abiotic factors. SNPs, SSRs and RFLPs in expressed regions of the onion genome hav...
Huo, Dezheng
2013-01-01
Numerous single nucleotide polymorphisms (SNPs) associated with breast cancer susceptibility have been identified by genome-wide association studies (GWAS). However, these SNPs were primarily discovered and validated in women of European and Asian ancestry. Because linkage disequilibrium is ancestry-dependent and heterogeneous among racial/ethnic populations, we evaluated common genetic variants at 22 GWAS-identified breast cancer susceptibility loci in a pooled sample of 1502 breast cancer cases and 1378 controls of African ancestry. None of the 22 GWAS index SNPs could be validated, challenging the direct generalizability of breast cancer risk variants identified in Caucasians or Asians to other populations. Novel breast cancer risk variants for women of African ancestry were identified in regions including 5p12 (odds ratio [OR] = 1.40, 95% confidence interval [CI] = 1.11–1.76; P = 0.004), 5q11.2 (OR = 1.22, 95% CI = 1.09–1.36; P = 0.00053) and 10p15.1 (OR = 1.22, 95% CI = 1.08–1.38; P = 0.0015). We also found positive association signals in three regions (6q25.1, 10q26.13 and 16q12.1–q12.2) previously confirmed by fine mapping in women of African ancestry. In addition, polygenic model indicated that eight best markers in this study, compared with 22 GWAS-identified SNPs, could better predict breast cancer risk in women of African ancestry (per-allele OR = 1.21, 95% CI = 1.16–1.27; P = 9.7 × 10–16). Our results demonstrate that fine mapping is a powerful approach to better characterize the breast cancer risk alleles in diverse populations. Future studies and new GWAS in women of African ancestry hold promise to discover additional variants for breast cancer susceptibility with clinical implications throughout the African diaspora. PMID:23475944
Yoneyama, Sachiko; Yao, Jie; Guo, Xiuqing; Fernandez-Rhodes, Lindsay; Lim, Unhee; Boston, Jonathan; Buzková, Petra; Carlson, Christopher S.; Cheng, Iona; Cochran, Barbara; Cooper, Richard; Ehret, Georg; Fornage, Myriam; Gong, Jian; Gross, Myron; Gu, C. Charles; Haessler, Jeff; Haiman, Christopher A.; Henderson, Brian; Hindorff, Lucia A.; Houston, Denise; Irvin, Marguerite R.; Jackson, Rebecca; Kuller, Lew; Leppert, Mark; Lewis, Cora E.; Li, Rongling; Le Marchand, Loic; Matise, Tara C.; Nguyen, Khanh-Dung H.; Chakravarti, Aravinda; Pankow, James S.; Pankratz, Nathan; Pooler, Loreall; Ritchie, Marylyn D.; Bien, Stephanie A.; Wassel, Christina L.; Chen, Yii-Der I.; Taylor, Kent D.; Allison, Matthew; Rotter, Jerome I.; Schreiner, Pamela J.; Schumacher, Fredrick; Wilkens, Lynne; Boerwinkle, Eric; Kooperberg, Charles; Peters, Ulrike; Buyske, Steven; Graff, Mariaelisa; North, Kari E.
2016-01-01
Background/Objectives Central adiposity measures such as waist circumference (WC) and waist-to-hip ratio (WHR) are associated with cardiometabolic disorders independently of BMI and are gaining clinically utility. Several studies report genetic variants associated with central adiposity, but most utilize only European ancestry populations. Understanding whether the genetic associations discovered among mainly European descendants are shared with African ancestry populations will help elucidate the biological underpinnings of abdominal fat deposition. Subjects/Methods To identify the underlying functional genetic determinants of body fat distribution, we conducted an array-wide association meta-analysis among persons of African ancestry across seven studies/consortia participating in the Population Architecture using Genomics and Epidemiology (PAGE) consortium. We used the Metabochip array, designed for fine mapping cardiovascular associated loci, to explore novel array-wide associations with WC and WHR among 15 945 African descendants using all and sex-stratified groups. We further interrogated 17 known WHR regions for African ancestry-specific variants. Results Of the 17 WHR loci, eight SNPs located in four loci were replicated in the sex-combined or sex-stratified meta-analyses. Two of these eight independently associated with WHR after conditioning on the known variant in European descendants (rs12096179 in TBX15-WARS2 and rs2059092 in ADAMTS9). In the fine mapping assessment, the putative functional region was reduced across all four loci but to varying degrees (average 40% drop in number of putative SNPs and 20% drop in genomic region). Similar to previous studies, the significant SNPs in the female stratified analysis were stronger than the significant SNPs from the sex-combined analysis. No novel associations were detected in the array-wide analyses. Conclusions Of 17 previously identified loci, four loci replicated in the African ancestry populations of this study. Utilizing different linkage disequilibrium patterns observed between European and African ancestries, we narrowed the suggestive region containing causative variants for all four loci. PMID:27867202
Stefanaki, Irene; Panagiotou, Orestis A; Kodela, Elisavet; Gogas, Helen; Kypreou, Katerina P; Chatzinasiou, Foteini; Nikolaou, Vasiliki; Plaka, Michaela; Kalfa, Iro; Antoniou, Christina; Ioannidis, John P A; Evangelou, Evangelos; Stratigos, Alexander J
2013-01-01
Genetic association studies have revealed numerous polymorphisms conferring susceptibility to melanoma. We aimed to replicate previously discovered melanoma-associated single-nucleotide polymorphisms (SNPs) in a Greek case-control population, and examine their predictive value. Based on a field synopsis of genetic variants of melanoma (MelGene), we genotyped 284 patients and 284 controls at 34 melanoma-associated SNPs of which 19 derived from GWAS. We tested each one of the 33 SNPs passing quality control for association with melanoma both with and without accounting for the presence of well-established phenotypic risk factors. We compared the risk allele frequencies between the Greek population and the HapMap CEU sample. Finally, we evaluated the predictive ability of the replicated SNPs. Risk allele frequencies were significantly lower compared to the HapMap CEU for eight SNPs (rs16891982--SLC45A2, rs12203592--IRF4, rs258322--CDK10, rs1805007--MC1R, rs1805008--MC1R, rs910873--PIGU, rs17305573--PIGU, and rs1885120--MTAP) and higher for one SNP (rs6001027--PLA2G6) indicating a different profile of genetic susceptibility in the studied population. Previously identified effect estimates modestly correlated with those found in our population (r = 0.72, P<0.0001). The strongest associations were observed for rs401681-T in CLPTM1L (odds ratio [OR] 1.60, 95% CI 1.22-2.10; P = 0.001), rs16891982-C in SCL45A2 (OR 0.51, 95% CI 0.34-0.76; P = 0.001), and rs1805007-T in MC1R (OR 4.38, 95% CI 2.03-9.43; P = 2×10⁻⁵). Nominally statistically significant associations were seen also for another 5 variants (rs258322-T in CDK10, rs1805005-T in MC1R, rs1885120-C in MYH7B, rs2218220-T in MTAP and rs4911442-G in the ASIP region). The addition of all SNPs with nominal significance to a clinical non-genetic model did not substantially improve melanoma risk prediction (AUC for clinical model 83.3% versus 83.9%, p = 0.66). Overall, our study has validated genetic variants that are likely to contribute to melanoma susceptibility in the Greek population.
A study on aircraft map display location and orientation
NASA Technical Reports Server (NTRS)
Baty, D. L.; Wempe, T.; Huff, E.
1973-01-01
Six airline pilots participated in a fixed-based simulator study to determine the effects of two horizontal situation display (HSD/map) panel locations relative to the vertical situation display (VSD), and of three map orientations on manual piloting performance. Pilot comments and opinions were formally obtained. Significant performance differences were found between wind conditions and among pilots but not between map locations and orientations. The results also illustrate the potential tracking accuracy of such a display. Recommendations concerning display location and map orientation are made.
A study on aircraft map display location and orientation
NASA Technical Reports Server (NTRS)
Baty, D. L.; Wempe, T. E.; Huff, E. M.
1974-01-01
Six airline pilots participated in a fixed-base simulator study to determine the effects of two horizontal situation display (HSD/map) panel locations relative to the vertical situation display (VSD), and of three map orientations on manual piloting performance. Pilot comments and opinions were formally obtained. Significant performance differences were found between wind conditions and among pilots, but not between map locations and orientations. The results also illustrate the potential tracking accuracy of such a display. Recommendations concerning display location and map orientation are made.
Ojeda, Diego A; Forero, Diego A
2014-10-01
Non-synonymous single nucleotide polymorphisms (nsSNPs) in brain-expressed genes represent interesting candidates for genetic research in neuropsychiatric disorders. To study novel nsSNPs in brain-expressed genes in a sample of Colombian subjects. We applied an approach based on in silico mining of available genomic data to identify and select novel nsSNPs in brain-expressed genes. We developed novel genotyping assays, based in allele-specific PCR methods, for these nsSNPs and genotyped them in 171 Colombian subjects. Five common nsSNPs (rs6855837; p.Leu395Ile, rs2305160; p.Thr394Ala, rs10503929; p.Met289Thr, rs2270641; p.Thr4Pro and rs3822659; p.Ser735Ala) were studied, located in the CLOCK, NPAS2, NRG1, SLC18A1 and WWC1 genes. We reported allele and genotype frequencies in a sample of South American healthy subjects. There is previous experimental evidence, arising from genome-wide expression and association studies, for the involvement of these genes in several neuropsychiatric disorders and endophenotypes, such as schizophrenia, mood disorders or memory performance. Frequencies for these nsSNPSs in the Colombian samples varied in comparison to different HapMap populations. Future study of these nsSNPs in brain-expressed genes, a synaptogenomics approach, will be important for a better understanding of neuropsychiatric diseases and endophenotypes in different populations.
Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple
Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C. A. M.; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric
2017-01-01
Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs. PMID:29176988
Hajiloo, Mohsen; Sapkota, Yadav; Mackey, John R; Robson, Paula; Greiner, Russell; Damaraju, Sambasivarao
2013-02-22
Population stratification is a systematic difference in allele frequencies between subpopulations. This can lead to spurious association findings in the case-control genome wide association studies (GWASs) used to identify single nucleotide polymorphisms (SNPs) associated with disease-linked phenotypes. Methods such as self-declared ancestry, ancestry informative markers, genomic control, structured association, and principal component analysis are used to assess and correct population stratification but each has limitations. We provide an alternative technique to address population stratification. We propose a novel machine learning method, ETHNOPRED, which uses the genotype and ethnicity data from the HapMap project to learn ensembles of disjoint decision trees, capable of accurately predicting an individual's continental and sub-continental ancestry. To predict an individual's continental ancestry, ETHNOPRED produced an ensemble of 3 decision trees involving a total of 10 SNPs, with 10-fold cross validation accuracy of 100% using HapMap II dataset. We extended this model to involve 29 disjoint decision trees over 149 SNPs, and showed that this ensemble has an accuracy of ≥ 99.9%, even if some of those 149 SNP values were missing. On an independent dataset, predominantly of Caucasian origin, our continental classifier showed 96.8% accuracy and improved genomic control's λ from 1.22 to 1.11. We next used the HapMap III dataset to learn classifiers to distinguish European subpopulations (North-Western vs. Southern), East Asian subpopulations (Chinese vs. Japanese), African subpopulations (Eastern vs. Western), North American subpopulations (European vs. Chinese vs. African vs. Mexican vs. Indian), and Kenyan subpopulations (Luhya vs. Maasai). In these cases, ETHNOPRED produced ensembles of 3, 39, 21, 11, and 25 disjoint decision trees, respectively involving 31, 502, 526, 242 and 271 SNPs, with 10-fold cross validation accuracy of 86.5% ± 2.4%, 95.6% ± 3.9%, 95.6% ± 2.1%, 98.3% ± 2.0%, and 95.9% ± 1.5%. However, ETHNOPRED was unable to produce a classifier that can accurately distinguish Chinese in Beijing vs. Chinese in Denver. ETHNOPRED is a novel technique for producing classifiers that can identify an individual's continental and sub-continental heritage, based on a small number of SNPs. We show that its learned classifiers are simple, cost-efficient, accurate, transparent, flexible, fast, applicable to large scale GWASs, and robust to missing values.
Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology.
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N; Kumar, Dibyendu
2017-01-01
RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. The RNA-seq experiment on bovine liver produced 107,114,4072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indel per young bull. Following the removal of the indel mutations, a total of 195,3804, 152,7120 and 205,3184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASPTM) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASPTM SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments.
The UCSC genome browser and associated tools
Haussler, David; Kent, W. James
2013-01-01
The UCSC Genome Browser (http://genome.ucsc.edu) is a graphical viewer for genomic data now in its 13th year. Since the early days of the Human Genome Project, it has presented an integrated view of genomic data of many kinds. Now home to assemblies for 58 organisms, the Browser presents visualization of annotations mapped to genomic coordinates. The ability to juxtapose annotations of many types facilitates inquiry-driven data mining. Gene predictions, mRNA alignments, epigenomic data from the ENCODE project, conservation scores from vertebrate whole-genome alignments and variation data may be viewed at any scale from a single base to an entire chromosome. The Browser also includes many other widely used tools, including BLAT, which is useful for alignments from high-throughput sequencing experiments. Private data uploaded as Custom Tracks and Data Hubs in many formats may be displayed alongside the rich compendium of precomputed data in the UCSC database. The Table Browser is a full-featured graphical interface, which allows querying, filtering and intersection of data tables. The Saved Session feature allows users to store and share customized views, enhancing the utility of the system for organizing multiple trains of thought. Binary Alignment/Map (BAM), Variant Call Format and the Personal Genome Single Nucleotide Polymorphisms (SNPs) data formats are useful for visualizing a large sequencing experiment (whole-genome or whole-exome), where the differences between the data set and the reference assembly may be displayed graphically. Support for high-throughput sequencing extends to compact, indexed data formats, such as BAM, bigBed and bigWig, allowing rapid visualization of large datasets from RNA-seq and ChIP-seq experiments via local hosting. PMID:22908213
The UCSC genome browser and associated tools.
Kuhn, Robert M; Haussler, David; Kent, W James
2013-03-01
The UCSC Genome Browser (http://genome.ucsc.edu) is a graphical viewer for genomic data now in its 13th year. Since the early days of the Human Genome Project, it has presented an integrated view of genomic data of many kinds. Now home to assemblies for 58 organisms, the Browser presents visualization of annotations mapped to genomic coordinates. The ability to juxtapose annotations of many types facilitates inquiry-driven data mining. Gene predictions, mRNA alignments, epigenomic data from the ENCODE project, conservation scores from vertebrate whole-genome alignments and variation data may be viewed at any scale from a single base to an entire chromosome. The Browser also includes many other widely used tools, including BLAT, which is useful for alignments from high-throughput sequencing experiments. Private data uploaded as Custom Tracks and Data Hubs in many formats may be displayed alongside the rich compendium of precomputed data in the UCSC database. The Table Browser is a full-featured graphical interface, which allows querying, filtering and intersection of data tables. The Saved Session feature allows users to store and share customized views, enhancing the utility of the system for organizing multiple trains of thought. Binary Alignment/Map (BAM), Variant Call Format and the Personal Genome Single Nucleotide Polymorphisms (SNPs) data formats are useful for visualizing a large sequencing experiment (whole-genome or whole-exome), where the differences between the data set and the reference assembly may be displayed graphically. Support for high-throughput sequencing extends to compact, indexed data formats, such as BAM, bigBed and bigWig, allowing rapid visualization of large datasets from RNA-seq and ChIP-seq experiments via local hosting.
Genome sequence, comparative analysis and haplotype structure of the domestic dog.
Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S
2005-12-08
Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.
Korinsak, Siripar; Tangphatsornruang, Sithichoke; Pootakham, Wirulda; Wanchana, Samart; Plabpla, Anucha; Jantasuriyarat, Chatchawan; Patarapuwadol, Sujin; Vanavichit, Apichart; Toojinda, Theerayut
2018-05-15
Magnaporthe oryzae is a fungal pathogen causing blast disease in many plant species. In this study, seventy three isolates of M. oryzae collected from rice (Oryza sativa) in 1996-2014 were genotyped using a genotyping-by-sequencing approach to detect genetic variation. An association study was performed to identify single nucleotide polymorphisms (SNPs) associated with virulence genes using 831 selected SNP and infection phenotypes on local and improved rice varieties. Population structure analysis revealed eight subpopulations. The division into eight groups was not related to the degree of virulence. Association mapping showed five SNPs associated with fungal virulence on chromosome 1, 2, 3, 4 and 7. The SNP on chromosome 1 was associated with virulence against RD6-Pi7 and IRBL7-M which might be linked to the previously reported AvrPi7. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Travel guidance system for vehicles
DOE Office of Scientific and Technical Information (OSTI.GOV)
Takanabe, K.; Yamamoto, M.; Ito, K.
1987-02-24
A travel guidance system is described for vehicles including: a heading sensor for detecting a direction of movement of a vehicle; a distance sensor for detecting a distance traveled by the vehicle; a map data storage medium preliminarily storing map data; a control unit for receiving a heading signal from the heading sensor and a distance signal from the distance sensor to successively compute a present position of the vehicle and for generating video signals corresponding to display data including map data from the map data storage medium and data of the present position; and a display having first andmore » second display portions and responsive to the video signals from the control unit to display on the first display portion a map and a present portion mark, in which: the map data storage medium comprises means for preliminarily storing administrative division name data and landmark data; and the control unit comprises: landmark display means for: (1) determining a landmark closest to the present position, (2) causing a position of the landmark to be displayed on the map and (3) retrieving a landmark massage concerning the landmark from the storage medium to cause the display to display the landmark message on the second display portion; division name display means for retrieving the name of an administrative division to which the present position belongs from the storage medium and causing the display to display a division name message on the second display portion; and selection means for selectively actuating at least one of the landmark display means and the division name display means.« less
Jia, Ming; Zhu, Meiling; Zhou, Fei; Wang, Mengyun; Sun, Menghong; Yang, Yajun; Wang, Xiaofeng; Wang, Jiucun; Jin, Li; Xiang, Jiaqing; Zhang, Yawei; Chang, Jianhua; Wei, Qingyi
2017-02-15
The JNK and p38α pathways play an important role in carcinogenesis. Therefore, we hypothesize that single nucleotide polymorphisms (SNPs) of genes involved in these pathways are associated with risk of lung cancer. We first selected and genotyped 11 independent SNPs of the JNK and p38α pathway-related genes in a discovery set of 1,002 non-small cell lung cancer (NSCLC) cases and 1,025 cancer-free controls of Eastern Chinese. Then, we validated those significant SNPs in a replication set of 1,333 NSCLC cases and 1,339 cancer-free controls of Eastern Chinese. Multifactor dimensionality reduction (MDR) and classification and regression tree (CART) analyses were used to identify interactions between significant SNPs and other covariates. In both discovery and replication as well as their pooled analysis, carriers of GADD45G rs8252T variant genotypes had a significantly lower risk of NSCLC (adjusted OR = 0.81 and 0.79, 95% CI = 0.72-0.92 and 0.64-0.99 and p = 0.001 and 0.040 for dominant and recessive genetic models, respectively) and carriers of MAP2K7 rs3679T variant genotypes had an increased risk of NSCLC (adjusted OR = 1.19 and 1.29, 95% CI = 1.05-1.34 and 1.09-1.54 and p = 0.005 and 0.004 for dominant and recessive genetic models, respectively). Furthermore, rs8252 variant CT/TT carriers showed significantly higher levels of GADD45G mRNA expression than CC carriers in the target tissues. We observed some evidence of interactions between rs8252 genotypes and sex in NSCLC risk. These results indicate that GADD45G rs8252 and MAP2K7 rs3679 SNPs may be susceptibility biomarkers for NSCLC in Eastern Chinese populations. © 2016 UICC.
Genome-wide DNA polymorphisms in two cultivars of mei (Prunus mume sieb. et zucc.).
Sun, Lidan; Zhang, Qixiang; Xu, Zongda; Yang, Weiru; Guo, Yu; Lu, Jiuxing; Pan, Huitang; Cheng, Tangren; Cai, Ming
2013-10-06
Mei (Prunus mume Sieb. et Zucc.) is a famous ornamental plant and fruit crop grown in East Asian countries. Limited genetic resources, especially molecular markers, have hindered the progress of mei breeding projects. Here, we performed low-depth whole-genome sequencing of Prunus mume 'Fenban' and Prunus mume 'Kouzi Yudie' to identify high-quality polymorphic markers between the two cultivars on a large scale. A total of 1464.1 Mb and 1422.1 Mb of 'Fenban' and 'Kouzi Yudie' sequencing data were uniquely mapped to the mei reference genome with about 6-fold coverage, respectively. We detected a large number of putative polymorphic markers from the 196.9 Mb of sequencing data shared by the two cultivars, which together contained 200,627 SNPs, 4,900 InDels, and 7,063 SSRs. Among these markers, 38,773 SNPs, 174 InDels, and 418 SSRs were distributed in the 22.4 Mb CDS region, and 63.0% of these marker-containing CDS sequences were assigned to GO terms. Subsequently, 670 selected SNPs were validated using an Agilent's SureSelect solution phase hybridization assay. A subset of 599 SNPs was used to assess the genetic similarity of a panel of mei germplasm samples and a plum (P. salicina) cultivar, producing a set of informative diversity data. We also analyzed the frequency and distribution of detected InDels and SSRs in mei genome and validated their usefulness as DNA markers. These markers were successfully amplified in the cultivars and in their segregating progeny. A large set of high-quality polymorphic SNPs, InDels, and SSRs were identified in parallel between 'Fenban' and 'Kouzi Yudie' using low-depth whole-genome sequencing. The study presents extensive data on these polymorphic markers, which can be useful for constructing high-resolution genetic maps, performing genome-wide association studies, and designing genomic selection strategies in mei.
Keel, B N; Nonneman, D J; Rohrer, G A
2017-08-01
Genetic variants detected from sequence have been used to successfully identify causal variants and map complex traits in several organisms. High and moderate impact variants, those expected to alter or disrupt the protein coded by a gene and those that regulate protein production, likely have a more significant effect on phenotypic variation than do other types of genetic variants. Hence, a comprehensive list of these functional variants would be of considerable interest in swine genomic studies, particularly those targeting fertility and production traits. Whole-genome sequence was obtained from 72 of the founders of an intensely phenotyped experimental swine herd at the U.S. Meat Animal Research Center (USMARC). These animals included all 24 of the founding boars (12 Duroc and 12 Landrace) and 48 Yorkshire-Landrace composite sows. Sequence reads were mapped to the Sscrofa10.2 genome build, resulting in a mean of 6.1 fold (×) coverage per genome. A total of 22 342 915 high confidence SNPs were identified from the sequenced genomes. These included 21 million previously reported SNPs and 79% of the 62 163 SNPs on the PorcineSNP60 BeadChip assay. Variation was detected in the coding sequence or untranslated regions (UTRs) of 87.8% of the genes in the porcine genome: loss-of-function variants were predicted in 504 genes, 10 202 genes contained nonsynonymous variants, 10 773 had variation in UTRs and 13 010 genes contained synonymous variants. Approximately 139 000 SNPs were classified as loss-of-function, nonsynonymous or regulatory, which suggests that over 99% of the variation detected in our pigs could potentially be ignored, allowing us to focus on a much smaller number of functional SNPs during future analyses. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.
Fine mapping of chromosome 15q25.1 lung cancer susceptibility in African-Americans.
Hansen, Helen M; Xiao, Yuanyuan; Rice, Terri; Bracci, Paige M; Wrensch, Margaret R; Sison, Jennette D; Chang, Jeffery S; Smirnov, Ivan V; Patoka, Joseph; Seldin, Michael F; Quesenberry, Charles P; Kelsey, Karl T; Wiencke, John K
2010-09-15
Several genome-wide association studies identified the chr15q25.1 region, which includes three nicotinic cholinergic receptor genes (CHRNA5-B4) and the cell proliferation gene (PSMA4), for its association with lung cancer risk in Caucasians. A haplotype and its tagging single nucleotide polymorphisms (SNPs) encompassing six genes from IREB2 to CHRNB4 were most strongly associated with lung cancer risk (OR = 1.3; P < 10(-20)). In order to narrow the region of association and identify potential causal variations, we performed a fine-mapping study using 77 SNPs in a 194 kb segment of the 15q25.1 region in a sample of 448 African-American lung cancer cases and 611 controls. Four regions, two SNPs and two distinct haplotypes from sliding window analyses, were associated with lung cancer. CHRNA5 rs17486278 G had OR = 1.28, 95% CI 1.07-1.54 and P = 0.008, whereas CHRNB4 rs7178270 G had OR = 0.78, 95% CI 0.66-0.94 and P = 0.008 for lung cancer risk. Lung cancer associations remained significant after pack-year adjustment. Rs7178270 decreased lung cancer risk in women but not in men; gender interaction P = 0.009. For two SNPs (rs7168796 A/G and rs7164594 A/G) upstream of PSMA4, lung cancer risks for people with haplotypes GG and AA were reduced compared with those with AG (OR = 0.56, 95% CI 0.38-0.82; P = 0.003 and OR = 0.73, 95% CI 0.59-0.90, P = 0.004, respectively). A four-SNP haplotype spanning CHRNA5 (rs11637635 C, rs17408276 T, rs16969968 G) and CHRNA3 (rs578776 G) was associated with increased lung cancer risk (P = 0.002). The identified regions contain SNPs predicted to affect gene regulation. There are multiple lung cancer risk loci in the 15q25.1 region in African-Americans.
McClure, Matthew; Kim, Euisoo; Bickhart, Derek; Null, Daniel; Cooper, Tabatha; Cole, John; Wiggans, George; Ajmone-Marsan, Paolo; Colli, Licia; Santus, Enrico; Liu, George E.; Schroeder, Steve; Matukumalli, Lakshmi; Van Tassell, Curt; Sonstegard, Tad
2013-01-01
Bovine Progressive Degenerative Myeloencephalopathy (Weaver Syndrome) is a recessive neurological disease that has been observed in the Brown Swiss cattle breed since the 1970’s in North America and Europe. Bilateral hind leg weakness and ataxia appear in afflicted animals at 6 to 18 months of age, and slowly progresses to total loss of hind limb control by 3 to 4 years of age. While Weaver has previously been mapped to Bos taurus autosome (BTA) 4∶46–56 Mb and a diagnostic test based on the 6 microsatellite (MS) markers is commercially available, neither the causative gene nor mutation has been identified; therefore misdiagnosis can occur due to recombination between the diagnostic MS markers and the causative mutation. Analysis of 34,980 BTA 4 SNPs genotypes derived from the Illumina BovineHD assay for 20 Brown Swiss Weaver carriers and 49 homozygous normal bulls refined the Weaver locus to 48–53 Mb. Genotyping of 153 SNPs, identified from whole genome sequencing of 10 normal and 10 carrier animals, across a validation set of 841 animals resulted in the identification of 41 diagnostic SNPs that were concordant with the disease. Except for one intergenic SNP all are associated with genes expressed in nervous tissues: 37 distal to NRCAM, one non-synonymous (serine to asparagine) in PNPLA8, one synonymous and one non-synonymous (lysine to glutamic acid) in CTTNBP2. Haplotype and imputation analyses of 7,458 Brown Swiss animals with Illumina BovineSNP50 data and the 41 diagnostic SNPs resulted in the identification of only one haplotype concordant with the Weaver phenotype. Use of this haplotype and the diagnostic SNPs more accurately identifies Weaver carriers in both Brown Swiss purebred and influenced herds. PMID:23527149
Sharp, Robert C.; Beg, Shazia A.; Naser, Saleh A.
2018-01-01
A shared genetic pre-disposition, chronic inflammation, and treatment with similar biologics between Rheumatoid arthritis (RA) and Crohn's disease (CD) have intrigued us to investigate whether the two disorders share trigger association or possible causation. We hypothesized earlier that Single Nucleotide Polymorphisms (SNPs) in the negative regulators Protein Tyrosine Phosphatase Non-receptor type 2 and 22 (PTPN2/22) lead to a dysregulated immune response, susceptibility to environmental triggers, and continued apoptosis as seen in chronic inflammation in RA and CD. To test the hypothesis, peripheral leukocytes samples from 132 consented subjects were genotyped for 9 SNPs in PTPN2/22 using TaqMan™ genotyping. The effect of the SNPs on PTPN2/22 and IFN-γ expression was determined using real time PCR. T-cell proliferation and response to phytohematoagglutonin (PHA) mitogen and mycobacterial antigens were determined by BrdU proliferation assay. Blood samples were also analyzed for the Mycobacterium avium subspecies paratuberculosis (MAP) IS900 gene by nPCR. Out of 9 SNPs examined, heterozygous (TC) or minor (CC) alleles of PTPN2:rs478582 occurred in 79% RA compared to 60% healthy controls (p-values ≤ 0.05; OR = 2.28). Similarly, heterozygous (GA) or minor (AA) alleles of PTPN22:rs2476601 occurred in 29% RA compared to 6% healthy controls (p-values ≤ 0.05; OR = 5.90). PTPN2/22 expression in RA was decreased by 1.2-fold compared to healthy controls. PTPN2:rs478582 upregulated IFN-γ in RA by 1.5-fold. Combined PTPN2:rs478582 and PTPN22:rs2476601 increased T-cell proliferation by 2.7-fold when treated with PHA. Surprisingly, MAP DNA was detected in 34% of RA samples compared to 8% healthy controls, (p-values ≤ 0.05, OR = 5.74). RA samples with PTPN2:rs478582 and/or PTPN22:rs2476601 were more positive for MAP than samples without polymorphisms. Combined occurrence of PTPN2:rs478582 and PTPN22:rs2476601 in association with the presence of MAP has significantly increased T-cell response and elevated IFN-γ expression in RA samples. The data suggest that genetic polymorphisms may play vital role in T-cell regulation, susceptibility to mycobacteria and ultimately response to treatment. This is the first study to report the detection of MAP DNA in the blood of RA patients; further studies are needed using larger number of samples. PMID:29423382
Aoun, Meriem; Breiland, Matthew; Kathryn Turner, M; Loladze, Alexander; Chao, Shiaoman; Xu, Steven S; Ammar, Karim; Anderson, James A; Kolmer, James A; Acevedo, Maricelis
2016-11-01
Leaf rust (caused by Erikss. []) is increasingly impacting durum wheat ( L. var. ) production with the recent appearance of races with virulence to widely grown cultivars in many durum producing areas worldwide. A highly virulent race on durum wheat was recently detected in Kansas. This race may spread to the northern Great Plains, where most of the US durum wheat is produced. The objective of this study was to identify sources of resistance to several races from the United States and Mexico at seedling stage in the greenhouse and at adult stage in field experiments. Genome-wide association study (GWAS) was used to identify single-nucleotide polymorphism (SNP) markers associated with leaf rust response in a worldwide durum wheat collection of 496 accessions. Thirteen accessions were resistant across all experiments. Association mapping revealed 88 significant SNPs associated with leaf rust response. Of these, 33 SNPs were located on chromosomes 2A and 2B, and 55 SNPs were distributed across all other chromosomes except for 1B and 7B. Twenty markers were associated with leaf rust response at seedling stage, while 68 markers were associated with leaf rust response at adult plant stage. The current study identified a total of 14 previously uncharacterized loci associated with leaf rust response in durum wheat. The discovery of these loci through association mapping (AM) is a significant step in identifying useful sources of resistance that can be used to broaden the relatively narrow leaf rust resistance spectrum in durum wheat germplasm. Copyright © 2016 Crop Science Society of America.
Li, Lun; Long, Yan; Zhang, Libin; Dalton-Morgan, Jessica; Batley, Jacqueline; Yu, Longjiang; Meng, Jinling; Li, Maoteng
2015-01-01
The prediction of the flowering time (FT) trait in Brassica napus based on genome-wide markers and the detection of underlying genetic factors is important not only for oilseed producers around the world but also for the other crop industry in the rotation system in China. In previous studies the low density and mixture of biomarkers used obstructed genomic selection in B. napus and comprehensive mapping of FT related loci. In this study, a high-density genome-wide SNP set was genotyped from a double-haploid population of B. napus. We first performed genomic prediction of FT traits in B. napus using SNPs across the genome under ten environments of three geographic regions via eight existing genomic predictive models. The results showed that all the models achieved comparably high accuracies, verifying the feasibility of genomic prediction in B. napus. Next, we performed a large-scale mapping of FT related loci among three regions, and found 437 associated SNPs, some of which represented known FT genes, such as AP1 and PHYE. The genes tagged by the associated SNPs were enriched in biological processes involved in the formation of flowers. Epistasis analysis showed that significant interactions were found between detected loci, even among some known FT related genes. All the results showed that our large scale and high-density genotype data are of great practical and scientific values for B. napus. To our best knowledge, this is the first evaluation of genomic selection models in B. napus based on a high-density SNP dataset and large-scale mapping of FT loci.
Pan, Lei; Wang, Nian; Wu, Zhihua; Guo, Rui; Yu, Xiaolu; Zheng, Yu; Xia, Qiuju; Gui, Songtao; Chen, Chanyou
2017-01-01
Cowpea [Vigna unguiculata (L.) Walp.] is an annual legume of economic importance and widely grown in the semi-arid tropics. However, high-density genetic maps of cowpea are still lacking. Here, we identified 34,868 SNPs (single nucleotide polymorphisms) that were distributed in the cowpea genome based on the RAD sequencing (restriction-site associated DNA sequencing) technique using a population of 170 individuals (two cowpea parents and 168 F2:3 progenies). Of these, 17,996 reliable SNPs were allotted to 11 consensus linkage groups (LGs). The length of the genetic map was 1,194.25 cM in total with a mean distance of 0.066 cM/SNP marker locus. Using this map and the F2:3 population, combined with the CIM (composite interval mapping) method, eleven quantitative trait loci (QTL) of yield-related trait were detected on seven LGs (LG4, 5, 6, 7, 9, 10, and 11) in cowpea. These QTL explained 0.05–17.32% of the total phenotypic variation. Among these, four QTL were for pod length, four QTL for thousand-grain weight (TGW), two QTL for grain number per pod, and one QTL for carpopodium length. Our results will provide a foundation for understanding genes related to grain yield in the cowpea and genus Vigna. PMID:28936219
Convergent evidence from systematic analysis of GWAS revealed genetic basis of esophageal cancer.
Gao, Xue-Xin; Gao, Lei; Wang, Jiu-Qiang; Qu, Su-Su; Qu, Yue; Sun, Hong-Lei; Liu, Si-Dang; Shang, Ying-Li
2016-07-12
Recent genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with risk of esophageal cancer (EC). However, investigation of genetic basis from the perspective of systematic biology and integrative genomics remains scarce.In this study, we explored genetic basis of EC based on GWAS data and implemented a series of bioinformatics methods including functional annotation, expression quantitative trait loci (eQTL) analysis, pathway enrichment analysis and pathway grouped network analysis.Two hundred and thirteen risk SNPs were identified, in which 44 SNPs were found to have significantly differential gene expression in esophageal tissues by eQTL analysis. By pathway enrichment analysis, 170 risk genes mapped by risk SNPs were enriched into 38 significant GO terms and 17 significant KEGG pathways, which were significantly grouped into 9 sub-networks by pathway grouped network analysis. The 9 groups of interconnected pathways were mainly involved with muscle cell proliferation, cellular response to interleukin-6, cell adhesion molecules, and ethanol oxidation, which might participate in the development of EC.Our findings provide genetic evidence and new insight for exploring the molecular mechanisms of EC.
Rathinasabapathi, Pasupathi; Purushothaman, Natarajan; Parani, Madasamy
2016-05-01
Although rice genome was sequenced in the year 2002, efforts in resequencing the large number of available accessions, landraces, traditional cultivars, and improved varieties of this important food crop are limited. We have initiated resequencing of the traditional cultivars from India. Kavuni is an important traditional rice cultivar from South India that attracts premium price for its nutritional and therapeutic properties. Whole-genome sequencing of Kavuni using Illumina platform and SNPs analysis using Nipponbare reference genome identified 1 150 711 SNPs of which 377 381 SNPs were located in the genic regions. Non-synonymous SNPs (62 708) were distributed in 19 251 genes, and their number varied between 1 and 115 per gene. Large-effect DNA polymorphisms (7769) were present in 3475 genes. Pathway mapping of these polymorphisms revealed the involvement of genes related to carbohydrate metabolism, translation, protein-folding, and cell death. Analysis of the starch biosynthesis related genes revealed that the granule-bound starch synthase I gene had T/G SNPs at the first intron/exon junction and a two-nucleotide combination, which were reported to favour high amylose content and low glycemic index. The present study provided a valuable genomics resource to study the rice varieties with nutritional and medicinal properties.
Larson, Wesley; Palti, Yniv; Gao, G.; Warheit, Kenneth I.; Seeb, James E.
2017-01-01
Natural-origin steelhead trout (Oncorhynchus mykiss (Walbaum, 1792)) in the Pacific Northwest, USA, are threatened by a number of factors including habitat destruction, disease, decline in marine survival, and a potential erosion of genetic viability due to introgression from hatchery strains. Our major goal was to use a recently developed SNP array containing ∼57 000 SNPs to identify a subset of SNPs that differentiate hatchery and natural-origin populations. We analyzed 35 765 polymorphic SNPs in nine populations of steelhead trout sampled from Puget Sound, Washington, USA. We then conducted two outlier tests and found 360 loci that were candidates for divergent selection between hatchery and natural-origin populations (mean FCT = 0.29, maximum = 0.65) and 595 SNPs that were candidates for selection among natural-origin populations (mean FST = 0.25, maximum = 0.51). Comparisons with a linkage map revealed that two chromosomes (Omy05 and Omy25) contained significantly more outliers than other chromosomes, suggesting that regions on Omy05 and Omy25 may be of adaptive significance. Our results highlight several advantages of the 57 000 SNP array as a tool for population and conservation genomics studies.
Aokic, Jun-ya; Kawase, Junya; Hamada, Kazuhisa; Fujimoto, Hiroshi; Yamamoto, Ikki; Usuki, Hironori
2018-01-01
Greater amberjack (Seriola dumerili) is distributed in tropical and temperate waters worldwide and is an important aquaculture fish. We carried out de novo sequencing of the greater amberjack genome to construct a reference genome sequence to identify single nucleotide polymorphisms (SNPs) for breeding amberjack by marker-assisted or gene-assisted selection as well as to identify functional genes for biological traits. We obtained 200 times coverage and constructed a high-quality genome assembly using next generation sequencing technology. The assembled sequences were aligned onto a yellowtail (Seriola quinqueradiata) radiation hybrid (RH) physical map by sequence homology. A total of 215 of the longest amberjack sequences, with a total length of 622.8 Mbp (92% of the total length of the genome scaffolds), were lined up on the yellowtail RH map. We resequenced the whole genomes of 20 greater amberjacks and mapped the resulting sequences onto the reference genome sequence. About 186,000 nonredundant SNPs were successfully ordered on the reference genome. Further, we found differences in the genome structural variations between two greater amberjack populations using BreakDancer. We also analyzed the greater amberjack transcriptome and mapped the annotated sequences onto the reference genome sequence. PMID:29785397
Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis
2015-09-01
The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbps were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.
Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The genome-wide discovery and high-throughput genotyping of SNPs in chickpea natural germplasm lines is indispensable to extrapolate their natural allelic diversity, domestication, and linkage disequilibrium (LD) patterns leading to the genetic enhancement of this vital legume crop. We discovered 44,844 high-quality SNPs by sequencing of 93 diverse cultivated desi, kabuli, and wild chickpea accessions using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays that were physically mapped across eight chromosomes of desi and kabuli. Of these, 22,542 SNPs were structurally annotated in different coding and non-coding sequence components of genes. Genes with 3296 non-synonymous and 269 regulatory SNPs could functionally differentiate accessions based on their contrasting agronomic traits. A high experimental validation success rate (92%) and reproducibility (100%) along with strong sensitivity (93–96%) and specificity (99%) of GBS-based SNPs was observed. This infers the robustness of GBS as a high-throughput assay for rapid large-scale mining and genotyping of genome-wide SNPs in chickpea with sub-optimal use of resources. With 23,798 genome-wide SNPs, a relatively high intra-specific polymorphic potential (49.5%) and broader molecular diversity (13–89%)/functional allelic diversity (18–77%) was apparent among 93 chickpea accessions, suggesting their tremendous applicability in rapid selection of desirable diverse accessions/inter-specific hybrids in chickpea crossbred varietal improvement program. The genome-wide SNPs revealed complex admixed domestication pattern, extensive LD estimates (0.54–0.68) and extended LD decay (400–500 kb) in a structured population inclusive of 93 accessions. These findings reflect the utility of our identified SNPs for subsequent genome-wide association study (GWAS) and selective sweep-based domestication trait dissection analysis to identify potential genomic loci (gene-associated targets) specifically regulating important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have got multidimensional applicability with respect to chickpea genomics-assisted breeding. PMID:25873920
Functional evaluation of genetic variants associated with endometriosis near GREB1.
Fung, Jenny N; Holdsworth-Carson, Sarah J; Sapkota, Yadav; Zhao, Zhen Zhen; Jones, Lincoln; Girling, Jane E; Paiva, Premila; Healey, Martin; Nyholt, Dale R; Rogers, Peter A W; Montgomery, Grant W
2015-05-01
Do DNA variants in the growth regulation by estrogen in breast cancer 1 (GREB1) region regulate endometrial GREB1 expression and increase the risk of developing endometriosis in women? We identified new single nucleotide polymorphisms (SNPs) with strong association with endometriosis at the GREB1 locus although we did not detect altered GREB1 expression in endometriosis patients with defined genotypes. Genome-wide association studies have identified the GREB1 region on chromosome 2p25.1 for increasing endometriosis risk. The differential expression of GREB1 has also been reported by others in association with endometriosis disease phenotype. Fine mapping studies comprehensively evaluated SNPs within the GREB1 region in a large-scale data set (>2500 cases and >4000 controls). Publicly available bioinformatics tools were employed to functionally annotate SNPs showing the strongest association signal with endometriosis risk. Endometrial GREB1 mRNA and protein expression was studied with respect to phases of the menstrual cycle (n = 2-45 per cycle stage) and expression quantitative trait loci (eQTL) analysis for significant SNPs were undertaken for GREB1 [mRNA (n = 94) and protein (n = 44) in endometrium]. Participants in this study are females who provided blood and/or endometrial tissue samples in a hospital setting. The key SNPs were genotyped using Sequenom MassARRAY. The functional roles and regulatory annotations for identified SNPs are predicted by various publicly available bioinformatics tools. Endometrial GREB1 expression work employed qRT-PCR, western blotting and immunohistochemistry studies. Fine mapping results identified a number of SNPs showing stronger association (0.004 < P < 0.032) with endometriosis risk than the original GWAS SNP (rs13394619) (P = 0.034). Some of these SNPs were predicted to have functional roles, for example, interaction with transcription factor motifs. The haplotype (a combination of alleles) formed by the risk alleles from two common SNPs showed significant association (P = 0.026) with endometriosis and epistasis analysis showed no evidence for interaction between the two SNPs, suggesting an additive effect of SNPs on endometriosis risk. In normal human endometrium, GREB1 protein expression was altered depending on the cycle stage (significantly different in late proliferative versus late secretory, P < 0.05) and cell type (glandular epithelium, not stromal cells). However, GREB1 expression in endometriosis cases versus controls and eQTL analyses did not reveal any significant changes. In silico prediction tools are generally based on cell lines different to our tissue and disease of interest. Functional annotations drawn from these analyses should be considered with this limitation in mind. We identified cell-specific and hormone-specific changes in GREB1 protein expression. The lack of a significant difference observed following our GREB1 expression studies may be the result of moderate power on mixed cell populations in the endometrial tissue samples. This study further implicates the GREB1 region on chromosome 2p25.1 and the GREB1 gene with involvement in endometriosis risk. More detailed functional studies are required to determine the role of the novel GREB1 transcripts in endometriosis pathophysiology. Funding for this work was provided by NHMRC Project Grants APP1012245, APP1026033, APP1049472 and APP1046880. There are no competing interests. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Ulloa, Mauricio; Hulse-Kemp, Amanda M; De Santiago, Luis M; Stelly, David M; Burke, John J
2017-01-01
High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, eg, upland cotton ( Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F 2 , a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD) 1 arisen from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the A t and D t subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the D t subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral A t -subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum . However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalance SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid A t and D t subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G hirsutum with a core of reproducible mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Lu, Xia; Luan, Sheng; Hu, Long Yang; Mao, Yong; Tao, Ye; Zhong, Sheng Ping; Kong, Jie
2016-06-01
The Kuruma prawn, Marsupenaeus japonicus, is one of the most promising marine invertebrates in the industry in Asia, Europe and Australia. However, the increasing global temperatures result in considerable economic losses in M. japonicus farming. In the present study, to select genetically improved animals for the sustainable development of the Kuruma prawn industry, a high-resolution genetic linkage map and quantitative trait locus (QTL) identification were performed using the RAD technology. The maternal map contained 5849 SNP markers and spanned 3127.23 cM, with an average marker interval of 0.535 cM. Instead, the paternal map contained 3927 SNP markers and spanned 3326.19 cM, with an average marker interval of 0.847 cM. The consensus map contained 9289 SNP markers and spanned 3610.90 cM, with an average marker interval of 0.388 cM and coverage of 99.06 % of the genome. The markers were grouped into 41 linkage groups in the maps. Significantly, negative correlation was detected between high-temperature tolerance (UTT) and body weight (BW). The QTL mapping revealed 129 significant QTL loci for UTT and four significant QTL loci for BW at the genome-wide significance threshold. Among these QTLs, 129 overlapped with linked SNPs, and the remaining four were located in regions between contiguous SNPs. They explained the total phenotypic variance ranging from 8.9 to 12.4 %. Because of a significantly negative correlation between growth and high-temperature tolerance, we demonstrate that this high-resolution linkage map and QTLs would be useful for further marker-assisted selection in the genetic improvement of M. japonicus.
Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.
2015-01-01
High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Prasad, Pushplata; Kumar, Ashok; Gupta, Rajiva; Juyal, Ramesh C.; B. K., Thelma
2012-01-01
Genome-wide association studies and meta-analysis indicate that several genes/loci are consistently associated with rheumatoid arthritis (RA) in European and Asian populations. To evaluate the transferability status of these findings to an ethnically diverse north Indian population, we performed a replication analysis. We investigated the association of 47 single-nucleotide polymorphisms (SNPs) at 43 of these genes/loci with RA in a north Indian cohort comprising 983 RA cases and 1007 age and gender matched controls. Genotyping was done using Infinium human 660w-quad. Association analysis by chi-square test implemented in plink was carried out in two steps. Firstly, association of the index or surrogate SNP (r2>0.8, calculated from reference GIH Hap-Map population) was tested. In the second step, evidence for allelic/locus heterogeneity at aforementioned genes/loci was assessed for by testing additional flanking SNPs in linkage equilibrium with index/surrogate marker. Of the 44 European specific index SNPs, neither index nor surrogate SNPs were present for nine SNPs in the genotyping array. Of the remaining 35, associations were replicated at seven genes namely PTPN22 (rs1217407, p = 3×10−3); IL2–21 (rs13119723, p = 0.008); HLA-DRB1 (rs660895, p = 2.56×10−5; rs6457617, p = 1.6×10−09; rs13192471, p = 6.7×10−16); TNFA1P3 (rs9321637, p = 0.03); CCL21 (rs13293020, p = 0.01); IL2RA (rs2104286, p = 1.9×10−4) and ZEB1 (rs2793108, p = 0.006). Of the three Asian specific loci tested, rs2977227 in PADI4 showed modest association (p<0.02). Further, of the 140 SNPs (in LE with index/surrogate variant) tested, association was observed at 11 additional genes: PTPRC, AFF3, CD28, CTLA4, PXK, ANKRD55, TAGAP, CCR6, BLK, CD40 and IL2RB. This study indicates limited replication of European and Asian index SNPs and apparent allelic heterogeneity in RA etiology among north Indians warranting independent GWAS in this population. However, replicated associations of HLA-DRB1, PTPN22 (which confer ∼50% of the heritable risk to RA) and IL2RA suggest that cross-ethnicity fine mapping of such loci is apposite for identification of causal variants. PMID:22355377
Krishnan, Michelle L.; Wang, Zi; Aljabar, Paul; Ball, Gareth; Mirza, Ghazala; Saxena, Alka; Counsell, Serena J.; Hajnal, Joseph V.; Montana, Giovanni
2017-01-01
Preterm infants show abnormal structural and functional brain development, and have a high risk of long-term neurocognitive problems. The molecular and cellular mechanisms involved are poorly understood, but novel methods now make it possible to address them by examining the relationship between common genetic variability and brain endophenotype. We addressed the hypothesis that variability in the Peroxisome Proliferator Activated Receptor (PPAR) pathway would be related to brain development. We employed machine learning in an unsupervised, unbiased, combined analysis of whole-brain diffusion tractography together with genomewide, single-nucleotide polymorphism (SNP)-based genotypes from a cohort of 272 preterm infants, using Sparse Reduced Rank Regression (sRRR) and correcting for ethnicity and age at birth and imaging. Empirical selection frequencies for SNPs associated with cerebral connectivity ranged from 0.663 to zero, with multiple highly selected SNPs mapping to genes for PPARG (six SNPs), ITGA6 (four SNPs), and FXR1 (two SNPs). SNPs in PPARG were significantly overrepresented (ranked 7–11 and 67 of 556,000 SNPs; P < 2.2 × 10−7), and were mostly in introns or regulatory regions with predicted effects including protein coding and nonsense-mediated decay. Edge-centric graph-theoretic analysis showed that highly selected white-matter tracts were consistent across the group and important for information transfer (P < 2.2 × 10−17); they most often connected to the insula (P < 6 × 10−17). These results suggest that the inhibited brain development seen in humans exposed to the stress of a premature extrauterine environment is modulated by genetic factors, and that PPARG signaling has a previously unrecognized role in cerebral development. PMID:29229843
Genetic polymorphisms associated with increased risk of developing chronic myelogenous leukemia
Bruzzoni-Giovanelli, Heriberto; González, Juan R.; Sigaux, François; Villoutreix, Bruno O.; Cayuela, Jean Michel; Guilhot, Joëlle; Preudhomme, Claude; Guilhot, François; Poyet, Jean-Luc; Rousselot, Philippe
2015-01-01
Little is known about inherited factors associated with the risk of developing chronic myelogenous leukemia (CML). We used a dedicated DNA chip containing 16 561 single nucleotide polymorphisms (SNPs) covering 1 916 candidate genes to analyze 437 CML patients and 1 144 healthy control individuals. Single SNP association analysis identified 139 SNPs that passed multiple comparisons (1% false discovery rate). The HDAC9, AVEN, SEMA3C, IKBKB, GSTA3, RIPK1 and FGF2 genes were each represented by three SNPs, the PSM family by four SNPs and the SLC15A1 gene by six. Haplotype analysis showed that certain combinations of rare alleles of these genes increased the risk of developing CML by more than two or three-fold. A classification tree model identified five SNPs belonging to the genes PSMB10, TNFRSF10D, PSMB2, PPARD and CYP26B1, which were associated with CML predisposition. A CML-risk-allele score was created using these five SNPs. This score was accurate for discriminating CML status (AUC: 0.61, 95%CI: 0.58–0.64). Interestingly, the score was associated with age at diagnosis and the average number of risk alleles was significantly higher in younger patients. The risk-allele score showed the same distribution in the general population (HapMap CEU samples) as in our control individuals and was associated with differential gene expression patterns of two genes (VAPA and TDRKH). In conclusion, we describe haplotypes and a genetic score that are significantly associated with a predisposition to develop CML. The SNPs identified will also serve to drive fundamental research on the putative role of these genes in CML development. PMID:26474455
2012-01-01
Background Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice. Results Of the 384 putative SNPs assayed, we successfully validated and genotyped 362 (94.3%). Of these 325 (84.6%) showed polymorphism among the 91 rice genotypes examined. Physical distribution, degree of allele sharing, admixtures and introgression, and amino acid replacement of SNPs in 263 abiotic and 62 biotic stress-responsive genes provided clues for identification and targeted mapping of trait-associated genomic regions. We assessed the functional and adaptive significance of validated SNPs in a set of contrasting drought tolerant upland and sensitive lowland rice genotypes by correlating their allelic variation with amino acid sequence alterations in catalytic domains and three-dimensional secondary protein structure encoded by stress-responsive genes. We found a strong genetic association among SNPs in the nine stress-responsive genes with upland and lowland ecological adaptation. Higher nucleotide diversity was observed in indica accessions compared with other rice sub-populations based on different population genetic parameters. The inferred ancestry of 16% among rice genotypes was derived from admixed populations with the maximum between upland aus and wild Oryza species. Conclusions SNPs validated in biotic and abiotic stress-responsive rice genes can be used in association analyses to identify candidate genes and develop functional markers for stress tolerance in rice. PMID:22921105
Paziewska, Agnieszka; Cukrowska, Bozena; Dabrowska, Michalina; Goryca, Krzysztof; Piatkowska, Magdalena; Kluska, Anna; Mikula, Michal; Karczmarski, Jakub; Oralewska, Beata; Rybak, Anna; Socha, Jerzy; Balabas, Aneta; Zeber-Lubecka, Natalia; Ambrozkiewicz, Filip; Konopka, Ewa; Trojanowska, Ilona; Zagroba, Malgorzata; Szperl, Malgorzata; Ostrowski, Jerzy
2015-01-01
Assessment of non-HLA variants alongside standard HLA testing was previously shown to improve the identification of potential coeliac disease (CD) patients. We intended to identify new genetic variants associated with CD in the Polish population that would improve CD risk prediction when used alongside HLA haplotype analysis. DNA samples of 336 CD and 264 unrelated healthy controls were used to create DNA pools for a genome wide association study (GWAS). GWAS findings were validated with individual HLA tag single nucleotide polymorphism (SNP) typing of 473 patients and 714 healthy controls. Association analysis using four HLA-tagging SNPs showed that, as was found in other populations, positive predicting genotypes (HLA-DQ2.5/DQ2.5, HLA-DQ2.5/DQ2.2, and HLA-DQ2.5/DQ8) were found at higher frequencies in CD patients than in healthy control individuals in the Polish population. Both CD-associated SNPs discovered by GWAS were found in the CD susceptibility region, confirming the previously-determined association of the major histocompatibility (MHC) region with CD pathogenesis. The two most significant SNPs from the GWAS were rs9272346 (HLA-dependent; localized within 1 Kb of DQA1) and rs3130484 (HLA-independent; mapped to MSH5). Specificity of CD prediction using the four HLA-tagging SNPs achieved 92.9%, but sensitivity was only 45.5%. However, when a testing combination of the HLA-tagging SNPs and the MSH5 SNP was used, specificity decreased to 80%, and sensitivity increased to 74%. This study confirmed that improvement of CD risk prediction sensitivity could be achieved by including non-HLA SNPs alongside HLA SNPs in genetic testing.
Use of a draft genome of coffee (Coffea arabica) to identify SNPs associated with caffeine content.
Tran, Hue T M; Ramaraj, Thiruvarangan; Furtado, Agnelo; Lee, Leonard Slade; Henry, Robert J
2018-03-07
Arabica coffee (Coffea arabica) has a small gene pool limiting genetic improvement. Selection for caffeine content within this gene pool would be assisted by identification of the genes controlling this important trait. Sequencing of DNA bulks from 18 genotypes with extreme high- or low-caffeine content from a population of 232 genotypes was used to identify linked polymorphisms. To obtain a reference genome, a whole genome assembly of arabica coffee (variety K7) was achieved by sequencing using short read (Illumina) and long-read (PacBio) technology. Assembly was performed using a range of assembly tools resulting in 76 409 scaffolds with a scaffold N50 of 54 544 bp and a total scaffold length of 1448 Mb. Validation of the genome assembly using different tools showed high completeness of the genome. More than 99% of transcriptome sequences mapped to the C. arabica draft genome, and 89% of BUSCOs were present. The assembled genome annotated using AUGUSTUS yielded 99 829 gene models. Using the draft arabica genome as reference in mapping and variant calling allowed the detection of 1444 nonsynonymous single nucleotide polymorphisms (SNPs) associated with caffeine content. Based on Kyoto Encyclopaedia of Genes and Genomes pathway-based analysis, 65 caffeine-associated SNPs were discovered, among which 11 SNPs were associated with genes encoding enzymes involved in the conversion of substrates, which participate in the caffeine biosynthesis pathways. This analysis demonstrated the complex genetic control of this key trait in coffee. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Cheng, Timothy H T; Gorman, Maggie; Martin, Lynn; Barclay, Ella; Casey, Graham; Saunders, Brian; Thomas, Huw; Clark, Sue; Tomlinson, Ian
2015-02-01
The presence of multiple (5-100) colorectal adenomas suggests an inherited predisposition, but the genetic aetiology of this phenotype is undetermined if patients test negative for Mendelian polyposis syndromes such as familial adenomatous polyposis (FAP) and MUTYH-associated polyposis (MAP). We investigated whether 18 common colorectal cancer (CRC) predisposition single-nucleotide polymorphisms (SNPs) could help to explain some cases with multiple adenomas who phenocopied FAP or MAP, but had no pathogenic APC or MUTYH variant. No multiple adenoma case had an outlying number of CRC SNP risk alleles, but multiple adenoma patients did have a significantly higher number of risk alleles than population controls (P=5.7 × 10(-7)). The association was stronger in those with ≥10 adenomas. The CRC SNPs accounted for 4.3% of the variation in multiple adenoma risk, with three SNPs (rs6983267, rs10795668, rs3802842) explaining 3.0% of the variation. In FAP patients, the CRC risk score did not differ significantly from the controls, as we expected given the overwhelming effect of pathogenic germline APC variants on the phenotype of these cases. More unexpectedly, we found no evidence that the CRC SNPs act as modifier genes for the number of colorectal adenomas in FAP patients. In conclusion, common colorectal tumour risk alleles contribute to the development of multiple adenomas in patients without pathogenic germline APC or MUTYH variants. This phenotype may have 'polygenic' or monogenic origins. The risk of CRC in relatives of multiple adenoma cases is probably much lower for cases with polygenic disease, and this should be taken into account when counselling such patients.
Genome-wide association study with the risk of schizophrenia in a Korean population.
Kim, Lyoung Hyo; Park, Byung Lae; Cheong, Hyun Sub; Namgoong, Suhg; Kim, Ji On; Kim, Jeong-Hyun; Shin, Joong-Gon; Park, Chul Soo; Kim, Bong-Jo; Kim, Jae Won; Choi, Ihn-Geun; Hwang, Jaeuk; Shin, Hyoung Doo; Woo, Sung-Il
2016-03-01
Schizophrenia is regarded as a multifactorial and polygenic brain disorder that is attributed to different combinations of genetic and environmental risk factors. Recently, several genome-wide association studies (GWASs) of schizophrenia have identified numerous risk factors, but the replication results remain controversial and ambiguous. To identify schizophrenia susceptibility loci in the Korean population, we performed a GWAS using the Illumina HumanOmni1-Quad V1.0 Microarray. We genotyped 1,140,419 single nucleotide polymorphisms (SNPs) in 350 Korea schizophrenia patients and 700 control subjects, and approximately 620,001 autosomal SNPs were passed our quality control. In the case-control analysis, the rs9607195 A>G on intergenic area 250 kb away from the ISX gene and the rs12738007 A>G on the intron of the MECR gene were the most strongly associated SNPs with the risk of schizophrenia (P = 6.2 × 10(-8) , OR = 0.50 and P = 3.7 × 10(-7) , OR = 2.39, respectively). In subsequent fine-mapping analysis, 6 SNPs of MECR were genotyped with 310 schizophrenia patients and 604 control subjects. The association of the MECR rs12738007, a top ranked-SNP in GWAS, was replicated (P = 1.5 × 10(-2) , OR = 1.53 in fine mapping analysis, P = 1.5 × 10(-6) , OR = 1.90 in combined analysis). The identification of putative schizophrenia susceptibility loci could provide new insights into genetic factors related with schizophrenia and clues for the development of diagnosis strategies. © 2015 Wiley Periodicals, Inc.
Freua, Mateus Castelani; Santana, Miguel Henrique de Almeida; Ventura, Ricardo Vieira; Tedeschi, Luis Orlindo; Ferraz, José Bento Sterman
2017-08-01
The interplay between dynamic models of biological systems and genomics is based on the assumption that genetic variation of the complex trait (i.e., outcome of model behavior) arises from component traits (i.e., model parameters) in lower hierarchical levels. In order to provide a proof of concept of this statement for a cattle growth model, we ask whether model parameters map genomic regions that harbor quantitative trait loci (QTLs) already described for the complex trait. We conducted a genome-wide association study (GWAS) with a Bayesian hierarchical LASSO method in two parameters of the Davis Growth Model, a system of three ordinary differential equations describing DNA accretion, protein synthesis and degradation, and fat synthesis. Phenotypic and genotypic data were available for 893 Nellore (Bos indicus) cattle. Computed values for parameter k 1 (DNA accretion rate) ranged from 0.005 ± 0.003 and for α (constant for energy for maintenance requirement) 0.134 ± 0.024. The expected biological interpretation of the parameters is confirmed by QTLs mapped for k 1 and α. QTLs within genomic regions mapped for k 1 are expected to be correlated with the DNA pool: body size and weight. Single nucleotide polymorphisms (SNPs) which were significant for α mapped QTLs that had already been associated with residual feed intake, feed conversion ratio, average daily gain (ADG), body weight, and also dry matter intake. SNPs identified for k 1 were able to additionally explain 2.2% of the phenotypic variability of the complex ADG, even when SNPs for k 1 did not match the genomic regions associated with ADG. Although improvements are needed, our findings suggest that genomic analysis on component traits may help to uncover the genetic basis of more complex traits, particularly when lower biological hierarchies are mechanistically described by mathematical simulation models.
Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N.; Kumar, Dibyendu
2017-01-01
Background RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. Results The RNA-seq experiment on bovine liver produced 107,114,4072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indel per young bull. Following the removal of the indel mutations, a total of 195,3804, 152,7120 and 205,3184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASPTM) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. Conclusions This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASPTM SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments. PMID:28234981
Osorio-Guarín, Jaime A; Enciso-Rodríguez, Felix E; González, Carolina; Fernández-Pozo, Noé; Mueller, Lukas A; Barrero, Luz Stella
2016-03-18
Vascular wilt caused by Fusarium oxysporum is the most important disease in cape gooseberry (Physalis peruviana L.) in Colombia. The development of resistant cultivars is considered one of the most cost-effective means to reduce the impact of this disease. In order to do so, it is necessary to provide breeders with molecular markers and promising germplasm for introgression of different resistance loci as part of breeding schemes. Here we described an association mapping study in cape gooseberry with the goal to: (i) select promising materials for use in plant breeding and (ii) identify SNPs associated with the cape gooseberry resistance response to the F. oxysporum pathogen under greenhouse conditions, as potential markers for cape gooseberry breeding. We found a total of 21 accessions with different resistance responses within a diversity panel of 100 cape gooseberry accessions. A total of 60,663 SNPs were also identified within the same panel by means of GBS (Genotyping By Sequencing). Model-based population structure and neighbor-joining analyses showed three populations comprising the cape gooseberry panel. After correction for population structure and kinship, we identified SNPs markers associated with the resistance response against F. oxysporum. The identification of markers was based on common tags using the reference genomes of tomato and potato as well as the root/stem transcriptome of cape gooseberry. By comparing their location with the tomato genome, 16 SNPs were found in genes involved in defense/resistance response to pathogens, likewise when compared with the genome of potato, 12 markers were related. The work presented herein provides the first association mapping study in cape gooseberry showing both the identification of promising accessions with resistance response phenotypes and the identification of a set of SNP markers mapped to defense/resistance response genes of reference genomes. Thus, the work also provides new knowledge on candidate genes involved in the P. peruviana - F. oxysporum pathosystem as a foundation for further validation in marker-assisted selection. The results have important implications for conservation and breeding strategies in cape gooseberry.
Single nucleotide polymorphisms for feed efficiency and performance in crossbred beef cattle
2014-01-01
Background This study was conducted to: (1) identify new SNPs for residual feed intake (RFI) and performance traits within candidate genes identified in a genome wide association study (GWAS); (2) estimate the proportion of variation in RFI explained by the detected SNPs; (3) estimate the effects of detected SNPs on carcass traits to avoid undesirable correlated effects on these economically important traits when selecting for feed efficiency; and (4) map the genes to biological mechanisms and pathways. A total number of 339 SNPs corresponding to 180 genes were tested for association with phenotypes using a single locus regression (SLRM) and genotypic model on 726 and 990 crossbred animals for feed efficiency and carcass traits, respectively. Results Strong evidence of associations for RFI were located on chromosomes 8, 15, 16, 18, 19, 21, and 28. The strongest association with RFI (P = 0.0017) was found with a newly discovered SNP located on BTA 8 within the ELP3 gene. SNPs rs41820824 and rs41821600 on BTA 16 within the gene HMCN1 were strongly associated with RFI (P = 0.0064 and P = 0.0033, respectively). A SNP located on BTA 18 within the ZNF423 gene provided strong evidence for association with RFI (P = 0.0028). Genomic estimated breeding values (GEBV) from 98 significant SNPs were moderately correlated (0.47) to the estimated breeding values (EBVs) from a mixed animal model. The significant (P < 0.05) SNPs (98) explained 26% of the genetic variance for RFI. In silico functional analysis for the genes suggested 35 and 39 biological processes and pathways, respectively for feed efficiency traits. Conclusions This study identified several positional and functional candidate genes involved in important biological mechanisms associated with feed efficiency and performance. Significant SNPs should be validated in other populations to establish their potential utilization in genetic improvement programs. PMID:24476087
Tomato breeding in the genomics era: insights from a SNP array.
Víquez-Zamora, Marcela; Vosman, Ben; van de Geest, Henri; Bovy, Arnaud; Visser, Richard G F; Finkers, Richard; van Heusden, Adriaan W
2013-05-27
The major bottle neck in genetic and linkage studies in tomato has been the lack of a sufficient number of molecular markers. This has radically changed with the application of next generation sequencing and high throughput genotyping. A set of 6000 SNPs was identified and 5528 of them were used to evaluate tomato germplasm at the level of species, varieties and segregating populations. From the 5528 SNPs, 1980 originated from 454-sequencing, 3495 from Illumina Solexa sequencing and 53 were additional known markers. Genotyping different tomato samples allowed the evaluation of the level of heterozygosity and introgressions among commercial varieties. Cherry tomatoes were especially different from round/beefs in chromosomes 4, 5 and 12. We were able to identify a set of 750 unique markers distinguishing S. lycopersicum 'Moneymaker' from all its distantly related wild relatives. Clustering and neighbour joining analysis among varieties and species showed expected grouping patterns, with S. pimpinellifolium as the most closely related to commercial tomatoes earlier results. Our results show that a SNP search in only a few breeding lines already provides generally applicable markers in tomato and its wild relatives. It also shows that the Illumina bead array generated data are highly reproducible. Our SNPs can roughly be divided in two categories: SNPs of which both forms are present in the wild relatives and in domesticated tomatoes (originating from common ancestors) and SNPs unique for the domesticated tomato (originating from after the domestication event). The SNPs can be used for genotyping, identification of varieties, comparison of genetic and physical linkage maps and to confirm (phylogenetic) relations. In the SNPs used for the array there is hardly any overlap with the SolCAP array and it is strongly recommended to combine both SNP sets and to select a core collection of robust SNPs completely covering the entire tomato genome.
Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M
2015-12-01
Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ) that is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close, or are linked by expression quantitative trait locus to lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, that SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.
Nucleotide diversity maps reveal variation in diversity among wheat genomes and chromosomes
2010-01-01
Background A genome-wide assessment of nucleotide diversity in a polyploid species must minimize the inclusion of homoeologous sequences into diversity estimates and reliably allocate individual haplotypes into their respective genomes. The same requirements complicate the development and deployment of single nucleotide polymorphism (SNP) markers in polyploid species. We report here a strategy that satisfies these requirements and deploy it in the sequencing of genes in cultivated hexaploid wheat (Triticum aestivum, genomes AABBDD) and wild tetraploid wheat (Triticum turgidum ssp. dicoccoides, genomes AABB) from the putative site of wheat domestication in Turkey. Data are used to assess the distribution of diversity among and within wheat genomes and to develop a panel of SNP markers for polyploid wheat. Results Nucleotide diversity was estimated in 2114 wheat genes and was similar between the A and B genomes and reduced in the D genome. Within a genome, diversity was diminished on some chromosomes. Low diversity was always accompanied by an excess of rare alleles. A total of 5,471 SNPs was discovered in 1791 wheat genes. Totals of 1,271, 1,218, and 2,203 SNPs were discovered in 488, 463, and 641 genes of wheat putative diploid ancestors, T. urartu, Aegilops speltoides, and Ae. tauschii, respectively. A public database containing genome-specific primers, SNPs, and other information was constructed. A total of 987 genes with nucleotide diversity estimated in one or more of the wheat genomes was placed on an Ae. tauschii genetic map, and the map was superimposed on wheat deletion-bin maps. The agreement between the maps was assessed. Conclusions In a young polyploid, exemplified by T. aestivum, ancestral species are the primary source of genetic diversity. Low effective recombination due to self-pollination and a genetic mechanism precluding homoeologous chromosome pairing during polyploid meiosis can lead to the loss of diversity from large chromosomal regions. The net effect of these factors in T. aestivum is large variation in diversity among genomes and chromosomes, which impacts the development of SNP markers and their practical utility. Accumulation of new mutations in older polyploid species, such as wild emmer, results in increased diversity and its more uniform distribution across the genome. PMID:21156062
Schönhals, Elske Maria; Ding, Jia; Ritter, Enrique; Paulo, Maria João; Cara, Nicolás; Tacke, Ekhard; Hofferbert, Hans-Reinhard; Lübeck, Jens; Strahwald, Josef; Gebhardt, Christiane
2017-08-22
Tuber yield and starch content of the cultivated potato are complex traits of decisive importance for breeding improved varieties. Natural variation of tuber yield and starch content depends on the environment and on multiple, mostly unknown genetic factors. Dissection and molecular identification of the genes and their natural allelic variants controlling these complex traits will lead to the development of diagnostic DNA-based markers, by which precision and efficiency of selection can be increased (precision breeding). Three case-control populations were assembled from tetraploid potato cultivars based on maximizing the differences between high and low tuber yield (TY), starch content (TSC) and starch yield (TSY, arithmetic product of TY and TSC). The case-control populations were genotyped by restriction-site associated DNA sequencing (RADseq) and the 8.3 k SolCAP SNP genotyping array. The allele frequencies of single nucleotide polymorphisms (SNPs) were compared between cases and controls. RADseq identified, depending on data filtering criteria, between 6664 and 450 genes with one or more differential SNPs for one, two or all three traits. Differential SNPs in 275 genes were detected using the SolCAP array. A genome wide association study using the SolCAP array on an independent, unselected population identified SNPs associated with tuber starch content in 117 genes. Physical mapping of the genes containing differential or associated SNPs, and comparisons between the two genome wide genotyping methods and two different populations identified genome segments on all twelve potato chromosomes harboring one or more quantitative trait loci (QTL) for TY, TSC and TSY. Several hundred genes control tuber yield and starch content in potato. They are unequally distributed on all potato chromosomes, forming clusters between 0.5-4 Mbp width. The largest fraction of these genes had unknown function, followed by genes with putative signalling and regulatory functions. The genetic control of tuber yield and starch content is interlinked. Most differential SNPs affecting both traits had antagonistic effects: The allele increasing TY decreased TSC and vice versa. Exceptions were 89 SNP alleles which had synergistic effects on TY, TSC and TSY. These and the corresponding genes are primary targets for developing diagnostic markers.
Genome-wide Association Study Implicates PARD3B-based AIDS Restriction
Nelson, George W.; Lautenberger, James A.; Chinn, Leslie; McIntosh, Carl; Johnson, Randall C.; Sezgin, Efe; Kessing, Bailey; Malasky, Michael; Hendrickson, Sher L.; Pontius, Joan; Tang, Minzhong; An, Ping; Winkler, Cheryl A.; Limou, Sophie; Le Clerc, Sigrid; Delaneau, Olivier; Zagury, Jean-François; Schuitemaker, Hanneke; van Manen, Daniëlle; Bream, Jay H.; Gomperts, Edward D.; Buchbinder, Susan; Goedert, James J.; Kirk, Gregory D.; O'Brien, Stephen J.
2011-01-01
Background. Host genetic variation influences human immunodeficiency virus (HIV) infection and progression to AIDS. Here we used clinically well-characterized subjects from 5 pretreatment HIV/AIDS cohorts for a genome-wide association study to identify gene associations with rate of AIDS progression. Methods. European American HIV seroconverters (n = 755) were interrogated for single-nucleotide polymorphisms (SNPs) (n = 700,022) associated with progression to AIDS 1987 (Cox proportional hazards regression analysis, co-dominant model). Results. Association with slower progression was observed for SNPs in the gene PARD3B. One of these, rs11884476, reached genome-wide significance (relative hazard = 0.3; P =3. 370 × 10−9) after statistical correction for 700,022 SNPs and contributes 4.52% of the overall variance in AIDS progression in this study. Nine of the top-ranked SNPs define a PARD3B haplotype that also displays significant association with progression to AIDS (hazard ratio, 0.3; P = 3.220 × 10−8). One of these SNPs, rs10185378, is a predicted exonic splicing enhancer; significant alteration in the expression profile of PARD3B splicing transcripts was observed in B cell lines with alternate rs10185378 genotypes. This SNP was typed in European cohorts of rapid progressors and was found to be protective for AIDS 1993 definition (odds ratio, 0.43, P = .025). Conclusions. These observations suggest a potential unsuspected pathway of host genetic influence on the dynamics of AIDS progression. PMID:21502085
Wenzel, Marius A; Douglas, Alex; James, Marianne C; Redpath, Steve M; Piertney, Stuart B
2016-01-01
Landscape genomics promises to provide novel insights into how neutral and adaptive processes shape genome-wide variation within and among populations. However, there has been little emphasis on examining whether individual-based phenotype-genotype relationships derived from approaches such as genome-wide association (GWAS) manifest themselves as a population-level signature of selection in a landscape context. The two may prove irreconcilable as individual-level patterns become diluted by high levels of gene flow and complex phenotypic or environmental heterogeneity. We illustrate this issue with a case study that examines the role of the highly prevalent gastrointestinal nematode Trichostrongylus tenuis in shaping genomic signatures of selection in red grouse (Lagopus lagopus scotica). Individual-level GWAS involving 384 SNPs has previously identified five SNPs that explain variation in T. tenuis burden. Here, we examine whether these same SNPs display population-level relationships between T. tenuis burden and genetic structure across a small-scale landscape of 21 sites with heterogeneous parasite pressure. Moreover, we identify adaptive SNPs showing signatures of directional selection using F(ST) outlier analysis and relate population- and individual-level patterns of multilocus neutral and adaptive genetic structure to T. tenuis burden. The five candidate SNPs for parasite-driven selection were neither associated with T. tenuis burden on a population level, nor under directional selection. Similarly, there was no evidence of parasite-driven selection in SNPs identified as candidates for directional selection. We discuss these results in the context of red grouse ecology and highlight the broader consequences for the utility of landscape genomics approaches for identifying signatures of selection. © 2015 John Wiley & Sons Ltd.
Angulo, Jenniffer; Pino, Karla; Echeverría-Chagas, Natalia; Marco, Claudia; Martínez-Valdebenito, Constanza; Galeno, Héctor; Villagra, Eliecer; Vera, Lilian; Lagos, Natalia; Becerra, Natalia; Mora, Judith; Bermúdez, Andrea; Cárcamo, Marcela; Díaz, Janepsy; Miquel, Juan Francisco; Ferrés, Marcela; López-Lastra, Marcelo
2015-01-01
Background. Andes virus (ANDV) is the sole etiologic agent of hantavirus cardiopulmonary syndrome (HCPS) in Chile, with a fatality rate of about 35%. Individual host factors affecting ANDV infection outcome are poorly understood. In this case-control genetic association analysis, we explored the link between single-nucleotide polymorphisms (SNPs) rs12979860, rs8099917 and rs1800629 and the clinical outcome of ANDV-induced disease. The SNPs rs12979860 and rs8099917 are known to play a role in the differential expression of the interleukin 28B gene (IL28B), whereas SNP rs1800629 is implicated in the expression of tumor necrosis factor α gene (TNF-α). Methods. A total of 238 samples from confirmed ANDV-infected patients collected between 2006 and 2014, and categorized according to the severity of the disease, were genotyped for SNPs rs12979860, rs8099917, and rs1800629. Results. Analysis of IL28B SNPs rs12979860 and rs8099917 revealed a link between homozygosity of the minor alleles (TT and GG, respectively), displaying a mild disease progression, whereas heterozygosity or homozygosity for the major alleles (CT/CC and TG/TT, respectively) in both IL28B SNPs is associated with severe disease. No association with the clinical outcome of HCPS was observed for TNF-α SNP rs1800629 (TNF −308G>A). Conclusions. The IL28B SNPs rs12979860 and rs8099917, but not TNF-α SNP rs1800629, are associated with the clinical outcome of ANDV-induced disease, suggesting a possible link between IL28B expression and ANDV pathogenesis. PMID:26394672
Angulo, Jenniffer; Pino, Karla; Echeverría-Chagas, Natalia; Marco, Claudia; Martínez-Valdebenito, Constanza; Galeno, Héctor; Villagra, Eliecer; Vera, Lilian; Lagos, Natalia; Becerra, Natalia; Mora, Judith; Bermúdez, Andrea; Cárcamo, Marcela; Díaz, Janepsy; Miquel, Juan Francisco; Ferrés, Marcela; López-Lastra, Marcelo
2015-12-15
Andes virus (ANDV) is the sole etiologic agent of hantavirus cardiopulmonary syndrome (HCPS) in Chile, with a fatality rate of about 35%. Individual host factors affecting ANDV infection outcome are poorly understood. In this case-control genetic association analysis, we explored the link between single-nucleotide polymorphisms (SNPs) rs12979860, rs8099917 and rs1800629 and the clinical outcome of ANDV-induced disease. The SNPs rs12979860 and rs8099917 are known to play a role in the differential expression of the interleukin 28B gene (IL28B), whereas SNP rs1800629 is implicated in the expression of tumor necrosis factor α gene (TNF-α). A total of 238 samples from confirmed ANDV-infected patients collected between 2006 and 2014, and categorized according to the severity of the disease, were genotyped for SNPs rs12979860, rs8099917, and rs1800629. Analysis of IL28B SNPs rs12979860 and rs8099917 revealed a link between homozygosity of the minor alleles (TT and GG, respectively), displaying a mild disease progression, whereas heterozygosity or homozygosity for the major alleles (CT/CC and TG/TT, respectively) in both IL28B SNPs is associated with severe disease. No association with the clinical outcome of HCPS was observed for TNF-α SNP rs1800629 (TNF -308G>A). The IL28B SNPs rs12979860 and rs8099917, but not TNF-α SNP rs1800629, are associated with the clinical outcome of ANDV-induced disease, suggesting a possible link between IL28B expression and ANDV pathogenesis. © The Author 2015. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick
2013-01-01
Background and Aims Despite differences in morphology, the genera representing ‘true citrus fruit trees’ are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial ‘species’ of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between ‘true citrus fruit trees’ were clarified. Methods Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. Key Results A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Conclusions Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis. PMID:23104641
Sacco, James C; Trepanier, Lauren A
2010-01-01
NADH cytochrome b5 reductase (b5R) and cytochrome b5 (b5) catalyze the reduction of sulfamethoxazole hydroxylamine (SMX-HA), which can contribute to sulfonamide hypersensitivity, to the parent drug sulfamethoxazole. Variability in hydroxylamine reduction could thus play a role in adverse drug reactions. The aim of this study was to characterize variability in SMX-HA reduction in 111 human livers, and investigate its association with single nucleotide polymorphisms (SNPs) in b5 and b5R cDNA. Liver microsomes were assayed for SMX-HA reduction activity, and b5 and b5R expression was semiquantified by immunoblotting. The coding regions of the b5 (CYB5A) and b5R (CYB5R3) genes were resequenced. Hepatic SMX-HA reduction displayed a 19-fold range of individual variability (0.06-1.11 nmol/min/mg protein), and a 17-fold range in efficiency (Vmax/Km) among outliers. SMX-HA reduction was positively correlated with b5 and b5R protein content (P<0.0001, r=0.42; P=0.01, r=0.23, respectively), and expression of both proteins correlated with one another (P<0.0001; r=0.74). A novel cSNP in CYB5A (S5A) was associated with very low activity and protein expression. Two novel CYB5R3 SNPs, R59H and R297H, displayed atypical SMX-HA reduction kinetics and decreased SMX-HA reduction efficiency. These studies indicate that although novel cSNPs in CYB5A and CYB5R3 are associated with significantly altered protein expression and/or hydroxylamine reduction activities, these low-frequency cSNPs seem to only minimally impact overall observed phenotypic variability. Work is underway to characterize polymorphisms in other regions of these genes to further account for individual variability in hydroxylamine reduction.
Garcia-Lor, Andres; Curk, Franck; Snoussi-Trifa, Hager; Morillon, Raphael; Ancillo, Gema; Luro, François; Navarro, Luis; Ollitrault, Patrick
2013-01-01
Despite differences in morphology, the genera representing 'true citrus fruit trees' are sexually compatible, and their phylogenetic relationships remain unclear. Most of the important commercial 'species' of Citrus are believed to be of interspecific origin. By studying polymorphisms of 27 nuclear genes, the average molecular differentiation between species was estimated and some phylogenetic relationships between 'true citrus fruit trees' were clarified. Sanger sequencing of PCR-amplified fragments from 18 genes involved in metabolite biosynthesis pathways and nine putative genes for salt tolerance was performed for 45 genotypes of Citrus and relatives of Citrus to mine single nucleotide polymorphisms (SNPs) and indel polymorphisms. Fifty nuclear simple sequence repeats (SSRs) were also analysed. A total of 16 238 kb of DNA was sequenced for each genotype, and 1097 single nucleotide polymorphisms (SNPs) and 50 indels were identified. These polymorphisms were more valuable than SSRs for inter-taxon differentiation. Nuclear phylogenetic analysis revealed that Citrus reticulata and Fortunella form a cluster that is differentiated from the clade that includes three other basic taxa of cultivated citrus (C. maxima, C. medica and C. micrantha). These results confirm the taxonomic subdivision between the subgenera Metacitrus and Archicitrus. A few genes displayed positive selection patterns within or between species, but most of them displayed neutral patterns. The phylogenetic inheritance patterns of the analysed genes were inferred for commercial Citrus spp. Numerous molecular polymorphisms (SNPs and indels), which are potentially useful for the analysis of interspecific genetic structures, have been identified. The nuclear phylogenetic network for Citrus and its sexually compatible relatives was consistent with the geographical origins of these genera. The positive selection observed for a few genes will help further works to analyse the molecular basis of the variability of the associated traits. This study presents new insights into the origin of C. sinensis.
Hewett, Duncan; Samuelsson, Lena; Polding, Joanne; Enlund, Fredrik; Smart, Devi; Cantone, Kathryn; See, Chee Gee; Chadha, Sapna; Inerot, Annica; Enerback, Charlotta; Montgomery, Doug; Christodolou, Chris; Robinson, Phil; Matthews, Paul; Plumpton, Mary; Wahlstrom, Jan; Swanbeck, Gunnar; Martinsson, Tommy; Roses, Allen; Riley, John; Purvis, Ian
2002-03-01
Psoriasis is a chronic inflammatory disease of the skin with both genetic and environmental risk factors. Here we describe the creation of a single-nucleotide polymorphism (SNP) map spanning 900-1200 kb of chromosome 3q21, which had been previously recognized as containing a psoriasis susceptibility locus, PSORS5. We genotyped 644 individuals, from 195 Swedish psoriatic families, for 19 polymorphisms. Linkage disequilibrium (LD) between marker and disease was assessed using the transmission/disequilibrium test (TDT). In the TDT analysis, alleles of three of these SNPs showed significant association with disease (P<0.05). A 160-kb interval encompassing these three SNPs was sequenced, and a coding sequence consisting of 13 exons was identified. The predicted protein shares 30-40% homology with the family of cation/chloride cotransporters. A five-marker haplotype spanning the 3' half of this gene is associated with psoriasis to a P value of 3.8<10(-5). We have called this gene SLC12A8, coding for a member of the solute carrier family 12 proteins. It belongs to a class of genes that were previously unrecognized as playing a role in psoriasis pathogenesis.
IntegratedMap: a Web interface for integrating genetic map data.
Yang, Hongyu; Wang, Hongyu; Gingle, Alan R
2005-05-01
IntegratedMap is a Web application and database schema for storing and interactively displaying genetic map data. Its Web interface includes a menu for direct chromosome/linkage group selection, a search form for selection based on mapped object location and linkage group displays. An overview display provides convenient access to the full range of mapped and anchored object types with genetic locus details, such as numbers, types and names of mapped/anchored objects displayed in a compact scrollable list box that automatically updates based on selected map location and object type. Also, multilinkage group and localized map views are available along with links that can be configured for integration with other Web resources. IntegratedMap is implemented in C#/ASP.NET and the package, including a MySQL schema creation script, is available from http://cggc.agtec.uga.edu/Data/download.asp
Multi-ethnic fine-mapping of 14 central adiposity loci.
Liu, Ching-Ti; Buchkovich, Martin L; Winkler, Thomas W; Heid, Iris M; Borecki, Ingrid B; Fox, Caroline S; Mohlke, Karen L; North, Kari E; Adrienne Cupples, L
2014-09-01
The Genetic Investigation of Anthropometric Traits (GIANT) consortium identified 14 loci in European Ancestry (EA) individuals associated with waist-to-hip ratio (WHR) adjusted for body mass index. These loci are wide and narrowing the signals remains necessary. Twelve of 14 loci identified in GIANT EA samples retained strong associations with WHR in our joint EA/individuals of African Ancestry (AA) analysis (log-Bayes factor >6.1). Trans-ethnic analyses at five loci (TBX15-WARS2, LYPLAL1, ADAMTS9, LY86 and ITPR2-SSPN) substantially narrowed the signals to smaller sets of variants, some of which are in regions that have evidence of regulatory activity. By leveraging varying linkage disequilibrium structures across different populations, single-nucleotide polymorphisms (SNPs) with strong signals and narrower credible sets from trans-ethnic meta-analysis of central obesity provide more precise localizations of potential functional variants and suggest a possible regulatory role. Meta-analysis results for WHR were obtained from 77 167 EA participants from GIANT and 23 564 AA participants from the African Ancestry Anthropometry Genetics Consortium. For fine mapping we interrogated SNPs within ± 250 kb flanking regions of 14 previously reported index SNPs from loci discovered in EA populations by performing trans-ethnic meta-analysis of results from the EA and AA meta-analyses. We applied a Bayesian approach that leverages allelic heterogeneity across populations to combine meta-analysis results and aids in fine-mapping shared variants at these locations. We annotated variants using information from the ENCODE Consortium and Roadmap Epigenomics Project to prioritize variants for possible functionality. Published by Oxford University Press 2014. This work is written by (a) US Government employee(s) and is in the public domain in the US.
Spencer, Amy V; Cox, Angela; Lin, Wei-Yu; Easton, Douglas F; Michailidou, Kyriaki; Walters, Kevin
2016-04-01
There is a large amount of functional genetic data available, which can be used to inform fine-mapping association studies (in diseases with well-characterised disease pathways). Single nucleotide polymorphism (SNP) prioritization via Bayes factors is attractive because prior information can inform the effect size or the prior probability of causal association. This approach requires the specification of the effect size. If the information needed to estimate a priori the probability density for the effect sizes for causal SNPs in a genomic region isn't consistent or isn't available, then specifying a prior variance for the effect sizes is challenging. We propose both an empirical method to estimate this prior variance, and a coherent approach to using SNP-level functional data, to inform the prior probability of causal association. Through simulation we show that when ranking SNPs by our empirical Bayes factor in a fine-mapping study, the causal SNP rank is generally as high or higher than the rank using Bayes factors with other plausible values of the prior variance. Importantly, we also show that assigning SNP-specific prior probabilities of association based on expert prior functional knowledge of the disease mechanism can lead to improved causal SNPs ranks compared to ranking with identical prior probabilities of association. We demonstrate the use of our methods by applying the methods to the fine mapping of the CASP8 region of chromosome 2 using genotype data from the Collaborative Oncological Gene-Environment Study (COGS) Consortium. The data we analysed included approximately 46,000 breast cancer case and 43,000 healthy control samples. © 2016 The Authors. *Genetic Epidemiology published by Wiley Periodicals, Inc.
Loci influencing blood pressure identified using a cardiovascular gene-centric array.
Ganesh, Santhi K; Tragante, Vinicius; Guo, Wei; Guo, Yiran; Lanktree, Matthew B; Smith, Erin N; Johnson, Toby; Castillo, Berta Almoguera; Barnard, John; Baumert, Jens; Chang, Yen-Pei Christy; Elbers, Clara C; Farrall, Martin; Fischer, Mary E; Franceschini, Nora; Gaunt, Tom R; Gho, Johannes M I H; Gieger, Christian; Gong, Yan; Isaacs, Aaron; Kleber, Marcus E; Mateo Leach, Irene; McDonough, Caitrin W; Meijs, Matthijs F L; Mellander, Olle; Molony, Cliona M; Nolte, Ilja M; Padmanabhan, Sandosh; Price, Tom S; Rajagopalan, Ramakrishnan; Shaffer, Jonathan; Shah, Sonia; Shen, Haiqing; Soranzo, Nicole; van der Most, Peter J; Van Iperen, Erik P A; Van Setten, Jessica; Van Setten, Jessic A; Vonk, Judith M; Zhang, Li; Beitelshees, Amber L; Berenson, Gerald S; Bhatt, Deepak L; Boer, Jolanda M A; Boerwinkle, Eric; Burkley, Ben; Burt, Amber; Chakravarti, Aravinda; Chen, Wei; Cooper-Dehoff, Rhonda M; Curtis, Sean P; Dreisbach, Albert; Duggan, David; Ehret, Georg B; Fabsitz, Richard R; Fornage, Myriam; Fox, Ervin; Furlong, Clement E; Gansevoort, Ron T; Hofker, Marten H; Hovingh, G Kees; Kirkland, Susan A; Kottke-Marchant, Kandice; Kutlar, Abdullah; Lacroix, Andrea Z; Langaee, Taimour Y; Li, Yun R; Lin, Honghuang; Liu, Kiang; Maiwald, Steffi; Malik, Rainer; Murugesan, Gurunathan; Newton-Cheh, Christopher; O'Connell, Jeffery R; Onland-Moret, N Charlotte; Ouwehand, Willem H; Palmas, Walter; Penninx, Brenda W; Pepine, Carl J; Pettinger, Mary; Polak, Joseph F; Ramachandran, Vasan S; Ranchalis, Jane; Redline, Susan; Ridker, Paul M; Rose, Lynda M; Scharnag, Hubert; Schork, Nicholas J; Shimbo, Daichi; Shuldiner, Alan R; Srinivasan, Sathanur R; Stolk, Ronald P; Taylor, Herman A; Thorand, Barbara; Trip, Mieke D; van Duijn, Cornelia M; Verschuren, W Monique; Wijmenga, Cisca; Winkelmann, Bernhard R; Wyatt, Sharon; Young, J Hunter; Boehm, Bernhard O; Caulfield, Mark J; Chasman, Daniel I; Davidson, Karina W; Doevendans, Pieter A; Fitzgerald, Garret A; Gums, John G; Hakonarson, Hakon; Hillege, Hans L; Illig, Thomas; Jarvik, Gail P; Johnson, Julie A; Kastelein, John J P; Koenig, Wolfgang; März, Winfried; Mitchell, Braxton D; Murray, Sarah S; Oldehinkel, Albertine J; Rader, Daniel J; Reilly, Muredach P; Reiner, Alex P; Schadt, Eric E; Silverstein, Roy L; Snieder, Harold; Stanton, Alice V; Uitterlinden, André G; van der Harst, Pim; van der Schouw, Yvonne T; Samani, Nilesh J; Johnson, Andrew D; Munroe, Patricia B; de Bakker, Paul I W; Zhu, Xiaofeng; Levy, Daniel; Keating, Brendan J; Asselbergs, Folkert W
2013-04-15
Blood pressure (BP) is a heritable determinant of risk for cardiovascular disease (CVD). To investigate genetic associations with systolic BP (SBP), diastolic BP (DBP), mean arterial pressure (MAP) and pulse pressure (PP), we genotyped ∼50 000 single-nucleotide polymorphisms (SNPs) that capture variation in ∼2100 candidate genes for cardiovascular phenotypes in 61 619 individuals of European ancestry from cohort studies in the USA and Europe. We identified novel associations between rs347591 and SBP (chromosome 3p25.3, in an intron of HRH1) and between rs2169137 and DBP (chromosome1q32.1 in an intron of MDM4) and between rs2014408 and SBP (chromosome 11p15 in an intron of SOX6), previously reported to be associated with MAP. We also confirmed 10 previously known loci associated with SBP, DBP, MAP or PP (ADRB1, ATP2B1, SH2B3/ATXN2, CSK, CYP17A1, FURIN, HFE, LSP1, MTHFR, SOX6) at array-wide significance (P < 2.4 × 10(-6)). We then replicated these associations in an independent set of 65 886 individuals of European ancestry. The findings from expression QTL (eQTL) analysis showed associations of SNPs in the MDM4 region with MDM4 expression. We did not find any evidence of association of the two novel SNPs in MDM4 and HRH1 with sequelae of high BP including coronary artery disease (CAD), left ventricular hypertrophy (LVH) or stroke. In summary, we identified two novel loci associated with BP and confirmed multiple previously reported associations. Our findings extend our understanding of genes involved in BP regulation, some of which may eventually provide new targets for therapeutic intervention.
Dhanapal, Arun Prabhu; Ray, Jeffery D; Singh, Shardendu K; Hoyos-Villegas, Valerio; Smith, James R; Purcell, Larry C; Fritschi, Felix B
2016-08-04
Chlorophyll is a major component of chloroplasts and a better understanding of the genetic basis of chlorophyll in soybean [Glycine max (L.) Merr.] might contribute to improving photosynthetic capacity and yield in regions with adverse environmental conditions. A collection of 332 diverse soybean genotypes were grown in 2 years (2009 and 2010) and chlorophyll a (eChl_A), chlorophyll b (eChl_B), and total chlorophyll (eChl_T) content as well as chlorophyll a/b ratio (eChl_R) in leaf tissues were determined by extraction and spectrometric determination. Total chlorophyll was also derived from canopy spectral reflectance measurements using a model of wavelet transformed spectra (tChl_T) as well as with a spectral reflectance index (iChl_T). A genome-wide associating mapping approach was employed using 31,253 single nucleotide polymorphisms (SNPs) to identify loci associated with the extract based eChl_A, eChl_B, eChl_R and eChl_T measurements and the two canopy spectral reflectance-based methods (tChl_T and iChl_T). A total of 23 (14 loci), 15 (7 loci) and 14 SNPs (10 loci) showed significant association with eChl_A, eChl_B and eChl_R respectively. A total of 52 unique SNPs were significantly associated with total chlorophyll content based on at least one of the three approaches (eChl_T, tChl_T and iChl_T) and likely tagged 27 putative loci for total chlorophyll content, four of which were indicated by all three approaches. Results presented here show that markers for chlorophyll traits can be identified in soybean using both extract-based and canopy spectral reflectance-based phenotypes, and confirm that high-throughput phenotyping-amenable canopy spectral reflectance measurements can be used for association mapping.
Pecavar, Verena; Blaschitz, Marion; Hufnagl, Peter; Zeinzinger, Josef; Fiedler, Anita; Allerberger, Franz; Maass, Matthias; Indra, Alexander
2012-06-01
Clostridium difficile, a Gram-positive, spore-forming, anaerobic bacterium, is the main causative agent of hospital-acquired diarrhoea worldwide. In addition to metronidazole and vancomycin, rifaximin, a rifamycin derivative, is a promising antibiotic for the treatment of recurring C. difficile infections (CDI). However, exposure of C. difficile to this antibiotic has led to the development of rifaximin-resistance due to point mutations in the β-subunit of the RNA polymerase (rpoB) gene. In the present study, 348 C. difficile strains with known PCR-ribotypes were investigated for respective single nucleotide polymorphisms (SNPs) within the proposed rpoB hot-spot region by using high-resolution melting (HRM) analysis. This method allows the detection of SNPs by comparing the altered melting behaviour of dsDNA with that of wild-type DNA. Discrimination between wild-type and mutant strains was enhanced by creating heteroduplexes by mixing sample DNA with wild-type DNA, leading to characteristic melting curve shapes from samples containing SNPs in the respective rpoB section. In the present study, we were able to identify 16 different rpoB sequence-types (ST) by sequencing analysis of a 325 bp fragment. The 16 PCR STs displayed a total of 24 different SNPs. Fifteen of these 24 SNPs were located within the proposed 151 bp SNP hot-spot region, resulting in 11 different HRM curve profiles (CP). Eleven SNPs (seven of which were within the proposed hot-spot region) led to amino acid substitutions associated with reduced susceptibility to rifaximin and 13 SNPs (eight of which were within the hot-spot region) were synonymous. This investigation clearly demonstrates that HRM analysis of the proposed SNP hot-spot region in the rpoB gene of C. difficile is a fast and cost-effective method for the identification of C. difficile samples with reduced susceptibility to rifaximin and even allows simultaneous SNP subtyping of the respective C. difficile isolates.
2013-01-01
Background Population stratification is a systematic difference in allele frequencies between subpopulations. This can lead to spurious association findings in the case–control genome wide association studies (GWASs) used to identify single nucleotide polymorphisms (SNPs) associated with disease-linked phenotypes. Methods such as self-declared ancestry, ancestry informative markers, genomic control, structured association, and principal component analysis are used to assess and correct population stratification but each has limitations. We provide an alternative technique to address population stratification. Results We propose a novel machine learning method, ETHNOPRED, which uses the genotype and ethnicity data from the HapMap project to learn ensembles of disjoint decision trees, capable of accurately predicting an individual’s continental and sub-continental ancestry. To predict an individual’s continental ancestry, ETHNOPRED produced an ensemble of 3 decision trees involving a total of 10 SNPs, with 10-fold cross validation accuracy of 100% using HapMap II dataset. We extended this model to involve 29 disjoint decision trees over 149 SNPs, and showed that this ensemble has an accuracy of ≥ 99.9%, even if some of those 149 SNP values were missing. On an independent dataset, predominantly of Caucasian origin, our continental classifier showed 96.8% accuracy and improved genomic control’s λ from 1.22 to 1.11. We next used the HapMap III dataset to learn classifiers to distinguish European subpopulations (North-Western vs. Southern), East Asian subpopulations (Chinese vs. Japanese), African subpopulations (Eastern vs. Western), North American subpopulations (European vs. Chinese vs. African vs. Mexican vs. Indian), and Kenyan subpopulations (Luhya vs. Maasai). In these cases, ETHNOPRED produced ensembles of 3, 39, 21, 11, and 25 disjoint decision trees, respectively involving 31, 502, 526, 242 and 271 SNPs, with 10-fold cross validation accuracy of 86.5% ± 2.4%, 95.6% ± 3.9%, 95.6% ± 2.1%, 98.3% ± 2.0%, and 95.9% ± 1.5%. However, ETHNOPRED was unable to produce a classifier that can accurately distinguish Chinese in Beijing vs. Chinese in Denver. Conclusions ETHNOPRED is a novel technique for producing classifiers that can identify an individual’s continental and sub-continental heritage, based on a small number of SNPs. We show that its learned classifiers are simple, cost-efficient, accurate, transparent, flexible, fast, applicable to large scale GWASs, and robust to missing values. PMID:23432980
Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan
2017-01-01
The American cranberry (Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. PMID:28250016
snpGeneSets: An R Package for Genome-Wide Study Annotation
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
2016-01-01
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
Han, Summer S.; Yeager, Meredith; Moore, Lee E.; Wei, Ming-Hui; Pfeiffer, Ruth; Toure, Ousmane; Purdue, Mark P.; Johansson, Mattias; Scelo, Ghislaine; Chung, Charles C.; Gaborieau, Valerie; Zaridze, David; Schwartz, Kendra; Szeszenia-Dabrowska, Neonilia; Davis, Faith; Bencko, Vladimir; Colt, Joanne S.; Janout, Vladimir; Matveev, Vsevolod; Foretova, Lenka; Mates, Dana; Navratilova, M.; Boffetta, Paolo; Berg, Christine D.; Grubb, Robert L.; Stevens, Victoria L.; Thun, Michael J.; Diver, W. Ryan; Gapstur, Susan M.; Albanes, Demetrius; Weinstein, Stephanie J.; Virtamo, Jarmo; Burdett, Laurie; Brisuda, Antonin; McKay, James D.; Fraumeni, Joseph F.; Chatterjee, Nilanjan; Rosenberg, Philip S.; Rothman, Nathaniel; Brennan, Paul; Chow, Wong-Ho; Tucker, Margaret A.; Chanock, Stephen J.; Toro, Jorge R.
2012-01-01
In follow-up of a recent genome-wide association study (GWAS) that identified a locus in chromosome 2p21 associated with risk for renal cell carcinoma (RCC), we conducted a fine mapping analysis of a 120 kb region that includes EPAS1. We genotyped 59 tagged common single-nucleotide polymorphisms (SNPs) in 2278 RCC and 3719 controls of European background and observed a novel signal for rs9679290 [P = 5.75 × 10−8, per-allele odds ratio (OR) = 1.27, 95% confidence interval (CI): 1.17–1.39]. Imputation of common SNPs surrounding rs9679290 using HapMap 3 and 1000 Genomes data yielded two additional signals, rs4953346 (P = 4.09 × 10−14) and rs12617313 (P = 7.48 × 10−12), both highly correlated with rs9679290 (r2 > 0.95), but interestingly not correlated with the two SNPs reported in the GWAS: rs11894252 and rs7579899 (r2 < 0.1 with rs9679290). Genotype analysis of rs12617313 confirmed an association with RCC risk (P = 1.72 × 10−9, per-allele OR = 1.28, 95% CI: 1.18–1.39) In conclusion, we report that chromosome 2p21 harbors a complex genetic architecture for common RCC risk variants. PMID:22113997
Ge, Bing; Tayo, Bamidele; Mathias, Rasika A.; Ding, Jingzhong; Nalls, Michael A.; Adeyemo, Adebowale; Adoue, Véronique; Ambrosone, Christine B.; Atwood, Larry; Bandera, Elisa V.; Becker, Lewis C.; Berndt, Sonja I.; Bernstein, Leslie; Blot, William J.; Boerwinkle, Eric; Britton, Angela; Casey, Graham; Chanock, Stephen J.; Demerath, Ellen; Deming, Sandra L.; Diver, W. Ryan; Fox, Caroline; Harris, Tamara B.; Hernandez, Dena G.; Hu, Jennifer J.; Ingles, Sue A.; John, Esther M.; Johnson, Craig; Keating, Brendan; Kittles, Rick A.; Kolonel, Laurence N.; Kritchevsky, Stephen B.; Le Marchand, Loic; Lohman, Kurt; Liu, Jiankang; Millikan, Robert C.; Murphy, Adam; Musani, Solomon; Neslund-Dudas, Christine; North, Kari E.; Nyante, Sarah; Ogunniyi, Adesola; Ostrander, Elaine A.; Papanicolaou, George; Patel, Sanjay; Pettaway, Curtis A.; Press, Michael F.; Redline, Susan; Rodriguez-Gil, Jorge L.; Rotimi, Charles; Rybicki, Benjamin A.; Salako, Babatunde; Schreiner, Pamela J.; Signorello, Lisa B.; Singleton, Andrew B.; Stanford, Janet L.; Stram, Alex H.; Stram, Daniel O.; Strom, Sara S.; Suktitipat, Bhoom; Thun, Michael J.; Witte, John S.; Yanek, Lisa R.; Ziegler, Regina G.; Zheng, Wei; Zhu, Xiaofeng; Zmuda, Joseph M.; Zonderman, Alan B.; Evans, Michele K.; Liu, Yongmei; Becker, Diane M.; Cooper, Richard S.; Pastinen, Tomi; Henderson, Brian E.; Hirschhorn, Joel N.; Lettre, Guillaume; Haiman, Christopher A.
2011-01-01
Adult height is a classic polygenic trait of high heritability (h 2 ∼0.8). More than 180 single nucleotide polymorphisms (SNPs), identified mostly in populations of European descent, are associated with height. These variants convey modest effects and explain ∼10% of the variance in height. Discovery efforts in other populations, while limited, have revealed loci for height not previously implicated in individuals of European ancestry. Here, we performed a meta-analysis of genome-wide association (GWA) results for adult height in 20,427 individuals of African ancestry with replication in up to 16,436 African Americans. We found two novel height loci (Xp22-rs12393627, P = 3.4×10−12 and 2p14-rs4315565, P = 1.2×10−8). As a group, height associations discovered in European-ancestry samples replicate in individuals of African ancestry (P = 1.7×10−4 for overall replication). Fine-mapping of the European height loci in African-ancestry individuals showed an enrichment of SNPs that are associated with expression of nearby genes when compared to the index European height SNPs (P<0.01). Our results highlight the utility of genetic studies in non-European populations to understand the etiology of complex human diseases and traits. PMID:21998595
Loci and candidate genes conferring resistance to soybean cyst nematode HG type 2.5.7.
Zhao, Xue; Teng, Weili; Li, Yinghui; Liu, Dongyuan; Cao, Guanglu; Li, Dongmei; Qiu, Lijuan; Zheng, Hongkun; Han, Yingpeng; Li, Wenbin
2017-06-14
Soybean (Glycine max L. Merr.) cyst nematode (SCN, Heterodera glycines I,) is a major pest of soybean worldwide. The most effective strategy to control this pest involves the use of resistant cultivars. The aim of the present study was to investigate the genome-wide genetic architecture of resistance to SCN HG Type 2.5.7 (race 1) in landrace and elite cultivated soybeans. A total of 200 diverse soybean accessions were screened for resistance to SCN HG Type 2.5.7 and genotyped through sequencing using the Specific Locus Amplified Fragment Sequencing (SLAF-seq) approach with a 6.14-fold average sequencing depth. A total of 33,194 SNPs were identified with minor allele frequencies (MAF) over 4%, covering 97% of all the genotypes. Genome-wide association mapping (GWAS) revealed thirteen SNPs associated with resistance to SCN HG Type 2.5.7. These SNPs were distributed on five chromosomes (Chr), including Chr7, 8, 14, 15 and 18. Four SNPs were novel resistance loci and nine SNPs were located near known QTL. A total of 30 genes were identified as candidate genes underlying SCN resistance. A total of sixteen novel soybean accessions were identified with significant resistance to HG Type 2.5.7. The beneficial alleles and candidate genes identified by GWAS might be valuable for improving marker-assisted breeding efficiency and exploring the molecular mechanisms underlying SCN resistance.
Fine mapping on chromosome 13q32-34 and brain expression analysis implicates MYO16 in schizophrenia.
Rodriguez-Murillo, Laura; Xu, Bin; Roos, J Louw; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria
2014-03-01
We previously reported linkage of schizophrenia and schizoaffective disorder to 13q32-34 in the European descent Afrikaner population from South Africa. The nature of genetic variation underlying linkage peaks in psychiatric disorders remains largely unknown and both rare and common variants may be contributing. Here, we examine the contribution of common variants located under the 13q32-34 linkage region. We used densely spaced SNPs to fine map the linkage peak region using both a discovery sample of 415 families and a meta-analysis incorporating two additional replication family samples. In a second phase of the study, we use one family-based data set with 237 families and independent case-control data sets for fine mapping of the common variant association signal using HapMap SNPs. We report a significant association with a genetic variant (rs9583277) within the gene encoding for the myosin heavy-chain Myr 8 (MYO16), which has been implicated in neuronal phosphoinositide 3-kinase signaling. Follow-up analysis of HapMap variation within MYO16 in a second set of Afrikaner families and additional case-control data sets of European descent highlighted a region across introns 2-6 as the most likely region to harbor common MYO16 risk variants. Expression analysis revealed a significant increase in the level of MYO16 expression in the brains of schizophrenia patients. Our results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia.
Kader, A Karim; Sun, Jielin; Isaacs, Sarah D; Wiley, Kathleen E; Yan, Guifang; Kim, Seong-Tae; Fedor, Helen; DeMarzo, Angelo M; Epstein, Jonathan I; Walsh, Patrick C; Partin, Alan W; Trock, Bruce; Zheng, S Lilly; Xu, Jianfeng; Isaacs, William
2009-08-01
More than a dozen single nucleotide polymorphisms (SNPs) have been associated with prostate cancer (PCa) risk from genome-wide association studies (GWAS). Their association with PCa aggressiveness and clinicopathologic variables is inconclusive. Twenty PCa risk SNPs implicated in GWAS and fine mapping studies were evaluated in 5,895 PCa cases treated by radical prostatectomy at Johns Hopkins Hospital, where each tumor was uniformly graded and staged using the same protocol. For 18 of the 20 SNPs examined, no statistically significant differences (P > 0.05) were observed in risk allele frequencies between patients with more aggressive (Gleason scores > or =4 + 3, or stage > or =T3b, or N+) or less aggressive disease (Gleason scores < or =3 + 4, and stage < or =T2, and N0). For the two SNPs that had significant differences between more and less aggressive disease rs2735839 in KLK3 (P = 8.4 x 10(-7)) and rs10993994 in MSMB (P = 0.046), the alleles that are associated with increased risk for PCa were more frequent in patients with less aggressive disease. Since these SNPs are known to be associated with PSA levels in men without PCa diagnoses, these latter associations may reflect the enrichment of low grade, low stage cases diagnosed by contemporary disease screening with PSA. The vast majority of PCa risk-associated SNPs are not associated with aggressiveness and clinicopathologic variables of PCa. Correspondingly, they have minimal utility in predicting the risk for developing more or less aggressive forms of PCa.
Interleukin 18 receptor 1 gene polymorphisms are associated with asthma.
Zhu, Guohua; Whyte, Moira K B; Vestbo, Jorgen; Carlsen, Karin; Carlsen, Kai-Håkon; Lenney, Warren; Silverman, Michael; Helms, Peter; Pillai, Sreekumar G
2008-09-01
The interleukin 18 receptor (IL18R1) gene is a strong candidate gene for asthma. It has been implicated in the pathophysiology of asthma and maps to an asthma susceptibility locus on chromosome 2q12. The possibility of association between polymorphisms in IL18R1 and asthma was examined by genotyping seven SNPs in 294, 342 and 100 families from Denmark, United Kingdom and Norway and conducting family-based association analyses for asthma, atopic asthma and bronchial hyper-reactivity (BHR) phenotypes. Three SNPs in IL18R1 were associated with asthma (0.01131 < or = P < or = 0.01377), five with atopic asthma (0.00066 < or = P < or = 0.00405) and two with BHR (0.01450 < or = P < or = 0.03203) in the Danish population; two SNPs were associated with atopic asthma (0.00397 < or = P < or = 0.01481) and four with BHR (0.00435 < or = P < or = 0.03544) in the UK population; four SNPs showed associations with asthma (0.00015 < or = P < or = 0.03062), two with atopic asthma (0.01269 < or = P < or = 0.04042) and three with BHR (0.00259 < or = P < or = 0.01401) in the Norwegian population; five SNPs showed associations with asthma (0.00005 < or = P < or = 0.03744), five with atopic asthma (0.00001 < or = P < or = 0.04491) and three with BHR (0.03568 < or = P < or = 0.04778) in the combined population. Three intronic SNPs (rs1420099, rs1362348 and rs1974675) showed replicated association for at least one asthma-related phenotype. These results demonstrate significant association between polymorphisms in IL18R1 and asthma.
Kader, A. Karim; Sun, Jielin; Isaacs, Sarah D.; Wiley, Kathleen E.; Yan, Guifang; Kim, Seong-Tae; Fedor, Helen; DeMarzo, Angelo M.; Epstein, Jonathan I.; Walsh, Patrick C.; Partin, Alan W.; Trock, Bruce; Zheng, S. Lilly; Xu, Jianfeng; Isaacs, William
2009-01-01
Background More than a dozen single nucleotide polymorphisms (SNPs) have been associated with prostate cancer (PCa) risk from genome-wide association studies (GWAS). Their association with PCa aggressiveness and clinicopathologic variables is inconclusive. Methods Twenty PCa risk SNPs implicated in GWAS and fine mapping studies were evaluated in 5,895 PCa cases treated by radical prostatectomy at Johns Hopkins Hospital, where each tumor was uniformly graded and staged using the same protocol. Results For 18 of the 20 SNPs examined, no statistically significant differences (P > 0.05) were observed in risk allele frequencies between patients with more aggressive (Gleason Scores ≥ 4+3, or stage ≥ T3b, or N+) or less aggressive disease (Gleason Scores ≤ 3+4, and stage ≤ T2, and N0). For the two SNPs that had significant differences between more and less aggressive disease (rs2735839 in KLK3 (P = 8.4 × 10−7) and rs10993994 in MSMB (P = 0.046), the alleles that are associated with increased risk for PCa were more frequent in patients with less aggressive disease. Since these SNPs are known to be associated with PSA levels in men without PCa diagnoses, these latter associations may reflect the enrichment of low grade, low stage cases diagnosed by contemporary disease screening with PSA. Conclusions The vast majority of PCa risk-associated SNPs are not associated with aggressiveness and clinicopathologic variables of PCa. Correspondingly, they have minimal utility in predicting the risk for developing more or less aggressive forms of PCa. PMID:19434657
Hierarchical Naive Bayes for genetic association studies.
Malovini, Alberto; Barbarini, Nicola; Bellazzi, Riccardo; de Michelis, Francesca
2012-01-01
Genome Wide Association Studies represent powerful approaches that aim at disentangling the genetic and molecular mechanisms underlying complex traits. The usual "one-SNP-at-the-time" testing strategy cannot capture the multi-factorial nature of this kind of disorders. We propose a Hierarchical Naïve Bayes classification model for taking into account associations in SNPs data characterized by Linkage Disequilibrium. Validation shows that our model reaches classification performances superior to those obtained by the standard Naïve Bayes classifier for simulated and real datasets. In the Hierarchical Naïve Bayes implemented, the SNPs mapping to the same region of Linkage Disequilibrium are considered as "details" or "replicates" of the locus, each contributing to the overall effect of the region on the phenotype. A latent variable for each block, which models the "population" of correlated SNPs, can be then used to summarize the available information. The classification is thus performed relying on the latent variables conditional probability distributions and on the SNPs data available. The developed methodology has been tested on simulated datasets, each composed by 300 cases, 300 controls and a variable number of SNPs. Our approach has been also applied to two real datasets on the genetic bases of Type 1 Diabetes and Type 2 Diabetes generated by the Wellcome Trust Case Control Consortium. The approach proposed in this paper, called Hierarchical Naïve Bayes, allows dealing with classification of examples for which genetic information of structurally correlated SNPs are available. It improves the Naïve Bayes performances by properly handling the within-loci variability.
Sakai, Hiroaki; Lee, Sung Shin; Tanaka, Tsuyoshi; Numa, Hisataka; Kim, Jungsok; Kawahara, Yoshihiro; Wakimoto, Hironobu; Yang, Ching-chia; Iwamoto, Masao; Abe, Takashi; Yamada, Yuko; Muto, Akira; Inokuchi, Hachiro; Ikemura, Toshimichi; Matsumoto, Takashi; Sasaki, Takuji; Itoh, Takeshi
2013-02-01
The Rice Annotation Project Database (RAP-DB, http://rapdb.dna.affrc.go.jp/) has been providing a comprehensive set of gene annotations for the genome sequence of rice, Oryza sativa (japonica group) cv. Nipponbare. Since the first release in 2005, RAP-DB has been updated several times along with the genome assembly updates. Here, we present our newest RAP-DB based on the latest genome assembly, Os-Nipponbare-Reference-IRGSP-1.0 (IRGSP-1.0), which was released in 2011. We detected 37,869 loci by mapping transcript and protein sequences of 150 monocot species. To provide plant researchers with highly reliable and up to date rice gene annotations, we have been incorporating literature-based manually curated data, and 1,626 loci currently incorporate literature-based annotation data, including commonly used gene names or gene symbols. Transcriptional activities are shown at the nucleotide level by mapping RNA-Seq reads derived from 27 samples. We also mapped the Illumina reads of a Japanese leading japonica cultivar, Koshihikari, and a Chinese indica cultivar, Guangluai-4, to the genome and show alignments together with the single nucleotide polymorphisms (SNPs) and gene functional annotations through a newly developed browser, Short-Read Assembly Browser (S-RAB). We have developed two satellite databases, Plant Gene Family Database (PGFD) and Integrative Database of Cereal Gene Phylogeny (IDCGP), which display gene family and homologous gene relationships among diverse plant species. RAP-DB and the satellite databases offer simple and user-friendly web interfaces, enabling plant and genome researchers to access the data easily and facilitating a broad range of plant research topics.
Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I
2016-08-26
Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. They also highlight possible differences between alternative genotyping technologies that could be explored in future studies of non-model organisms.
A genome-wide association study of seed composition traits in wild soybean (Glycine soja).
Leamy, Larry J; Zhang, Hengyou; Li, Changbao; Chen, Charles Y; Song, Bao-Hua
2017-01-05
Cultivated soybean (Glycine max) is a major agricultural crop that provides a crucial source of edible protein and oil. Decreased amounts of saturated palmitic acid and increased amounts of unsaturated oleic acid in soybean oil are considered optimal for human cardiovascular health and therefore there has considerable interest by breeders in discovering genes affecting the relative concentrations of these fatty acids. Using a genome-wide association (GWA) approach with nearly 30,000 single nucleotide polymorphisms (SNPs), we investigated the genetic basis of protein, oil and all five fatty acid levels in seeds from a sample of 570 wild soybeans (Glycine soja), the progenitor of domesticated soybean, to identify quantitative trait loci (QTLs) affecting these seed composition traits. We discovered 29 SNPs located on ten different chromosomes that are significantly associated with the seven seed composition traits in our wild soybean sample. Eight SNPs co-localized with QTLs previously uncovered in linkage or association mapping studies conducted with cultivated soybean samples, while the remaining SNPs appeared to be in novel locations. Twenty-four of the SNPs significantly associated with fatty acid variation, with the majority located on chromosomes 14 (6 SNPs) and seven (8 SNPs). Two SNPs were common for two or more fatty acids, suggesting loci with pleiotropic effects. We also identified some candidate genes that are involved in fatty acid metabolism and regulation. For each of the seven traits, most of the SNPs produced differences between the average phenotypic values of the two homozygotes of about one-half standard deviation and contributed over 3% of their total variability. This is the first GWA study conducted on seed composition traits solely in wild soybean populations, and a number of QTLs were found that have not been previously discovered. Some of these may be useful to breeders who select for increased protein/oil content or altered fatty acid ratios in the seeds. The results also provide additional insight into the genetic architecture of these traits in a large sample of wild soybean, and suggest some new candidate genes whose molecular effects on these traits need to be further studied.
de la Garza Puentes, Andrea; Montes Goyanes, Rosa; Chisaguano Tonato, Aida Maribel; Torres-Espínola, Francisco José; Arias García, Miriam; de Almeida, Leonor; Bonilla Aguirre, María; Guerendiain, Marcela; Castellote Bargalló, Ana Isabel; Segura Moreno, Maite; García-Valdés, Luz; Campoy, Cristina; Lopez-Sabater, M. Carmen
2017-01-01
Single nucleotide polymorphisms (SNPs) in the genes encoding the fatty acid desaturase (FADS) and elongase (ELOVL) enzymes affect long-chain polyunsaturated fatty acid (LC-PUFA) production. We aimed to determine if these SNPs are associated with body mass index (BMI) or affect fatty acids (FAs) in pregnant women. Participants (n = 180) from the PREOBE cohort were grouped according to pre-pregnancy BMI: normal-weight (BMI = 18.5–24.9, n = 88) and overweight/obese (BMI≥25, n = 92). Plasma samples were analyzed at 24 weeks of gestation to measure FA levels in the phospholipid fraction. Selected SNPs were genotyped (7 in FADS1, 5 in FADS2, 3 in ELOVL2 and 2 in ELOVL5). Minor allele carriers of rs174545, rs174546, rs174548 and rs174553 (FADS1), and rs1535 and rs174583 (FADS2) were nominally associated with an increased risk of having a BMI≥25. Only for the normal-weight group, minor allele carriers of rs174537, rs174545, rs174546, and rs174553 (FADS1) were negatively associated with AA:DGLA index. Normal-weight women who were minor allele carriers of FADS SNPs had lower levels of AA, AA:DGLA and AA:LA indexes, and higher levels of DGLA, compared to major homozygotes. Among minor allele carriers of FADS2 and ELOVL2 SNPs, overweight/obese women showed higher DHA:EPA index than the normal-weight group; however, they did not present higher DHA concentrations than the normal-weight women. In conclusion, minor allele carriers of FADS SNPs have an increased risk of obesity. Maternal weight changes the effect of genotype on FA levels. Only in the normal-weight group, minor allele carriers of FADS SNPs displayed reduced enzymatic activity and FA levels. This suggests that women with a BMI≥25 are less affected by FADS genetic variants in this regard. In the presence of FADS2 and ELOVL2 SNPs, overweight/obese women showed higher n-3 LC-PUFA production indexes than women with normal weight, but this was not enough to obtain a higher n-3 LC-PUFA concentration. PMID:28598979
Rostoks, Nils; Ramsay, Luke; MacKenzie, Katrin; Cardle, Linda; Bhat, Prasanna R.; Roose, Mikeal L.; Svensson, Jan T.; Stein, Nils; Varshney, Rajeev K.; Marshall, David F.; Graner, Andreas; Close, Timothy J.; Waugh, Robbie
2006-01-01
Genomewide association studies depend on the extent of linkage disequilibrium (LD), the number and distribution of markers, and the underlying structure in populations under study. Outbreeding species generally exhibit limited LD, and consequently, a very large number of markers are required for effective whole-genome association genetic scans. In contrast, several of the world's major food crops are self-fertilizing inbreeding species with narrow genetic bases and theoretically extensive LD. Together these are predicted to result in a combination of low resolution and a high frequency of spurious associations in LD-based studies. However, inbred elite plant varieties represent a unique human-induced pseudooutbreeding population that has been subjected to strong selection for advantageous alleles. By assaying 1,524 genomewide SNPs we demonstrate that, after accounting for population substructure, the level of LD exhibited in elite northwest European barley, a typical inbred cereal crop, can be effectively exploited to map traits by using whole-genome association scans with several hundred to thousands of biallelic SNPs. PMID:17085595
Jia, Guanqing; Huang, Xuehui; Zhi, Hui; Zhao, Yan; Zhao, Qiang; Li, Wenjun; Chai, Yang; Yang, Lifang; Liu, Kunyan; Lu, Hengyun; Zhu, Chuanrang; Lu, Yiqi; Zhou, Congcong; Fan, Danlin; Weng, Qijun; Guo, Yunli; Huang, Tao; Zhang, Lei; Lu, Tingting; Feng, Qi; Hao, Hangfei; Liu, Hongkuan; Lu, Ping; Zhang, Ning; Li, Yuhui; Guo, Erhu; Wang, Shujun; Wang, Suying; Liu, Jinrong; Zhang, Wenfei; Chen, Guoqiu; Zhang, Baojin; Li, Wei; Wang, Yongfang; Li, Haiquan; Zhao, Baohua; Li, Jiayang; Diao, Xianmin; Han, Bin
2013-08-01
Foxtail millet (Setaria italica) is an important grain crop that is grown in arid regions. Here we sequenced 916 diverse foxtail millet varieties, identified 2.58 million SNPs and used 0.8 million common SNPs to construct a haplotype map of the foxtail millet genome. We classified the foxtail millet varieties into two divergent groups that are strongly correlated with early and late flowering times. We phenotyped the 916 varieties under five different environments and identified 512 loci associated with 47 agronomic traits by genome-wide association studies. We performed a de novo assembly of deeply sequenced genomes of a Setaria viridis accession (the wild progenitor of S. italica) and an S. italica variety and identified complex interspecies and intraspecies variants. We also identified 36 selective sweeps that seem to have occurred during modern breeding. This study provides fundamental resources for genetics research and genetic improvement in foxtail millet.
Genome-wide association analysis of ischemic stroke in young adults.
Cheng, Yu-Ching; O'Connell, Jeffrey R; Cole, John W; Stine, O Colin; Dueker, Nicole; McArdle, Patrick F; Sparks, Mary J; Shen, Jess; Laurie, Cathy C; Nelson, Sarah; Doheny, Kimberly F; Ling, Hua; Pugh, Elizabeth W; Brott, Thomas G; Brown, Robert D; Meschia, James F; Nalls, Michael; Rich, Stephen S; Worrall, Bradford; Anderson, Christopher D; Biffi, Alessandro; Cortellini, Lynelle; Furie, Karen L; Rost, Natalia S; Rosand, Jonathan; Manolio, Teri A; Kittner, Steven J; Mitchell, Braxton D
2011-11-01
Ischemic stroke (IS) is among the leading causes of death in Western countries. There is a significant genetic component to IS susceptibility, especially among young adults. To date, research to identify genetic loci predisposing to stroke has met only with limited success. We performed a genome-wide association (GWA) analysis of early-onset IS to identify potential stroke susceptibility loci. The GWA analysis was conducted by genotyping 1 million SNPs in a biracial population of 889 IS cases and 927 controls, ages 15-49 years. Genotypes were imputed using the HapMap3 reference panel to provide 1.4 million SNPs for analysis. Logistic regression models adjusting for age, recruitment stages, and population structure were used to determine the association of IS with individual SNPs. Although no single SNP reached genome-wide significance (P < 5 × 10(-8)), we identified two SNPs in chromosome 2q23.3, rs2304556 (in FMNL2; P = 1.2 × 10(-7)) and rs1986743 (in ARL6IP6; P = 2.7 × 10(-7)), strongly associated with early-onset stroke. These data suggest that a novel locus on human chromosome 2q23.3 may be associated with IS susceptibility among young adults.
Peng, Fred Y; Yang, Rong-Cai
2017-06-20
The resistance to leaf rust (Lr) caused by Puccinia triticina in wheat (Triticum aestivum L.) has been well studied over the past decades with over 70 Lr genes being mapped on different chromosomes and numerous QTLs (quantitative trait loci) being detected or mapped using DNA markers. Such resistance is often divided into race-specific and race-nonspecific resistance. The race-nonspecific resistance can be further divided into resistance to most or all races of the same pathogen and resistance to multiple pathogens. At the molecular level, these three types of resistance may cover across the whole spectrum of pathogen specificities that are controlled by genes encoding different protein families in wheat. The objective of this study is to predict and analyze genes in three such families: NBS-LRR (nucleotide-binding sites and leucine-rich repeats or NLR), START (Steroidogenic Acute Regulatory protein [STaR] related lipid-transfer) and ABC (ATP-Binding Cassette) transporter. The focus of the analysis is on the patterns of relationships between these protein-coding genes within the gene families and QTLs detected for leaf rust resistance. We predicted 526 ABC, 1117 NLR and 144 START genes in the hexaploid wheat genome through a domain analysis of wheat proteome. Of the 1809 SNPs from leaf rust resistance QTLs in seedling and adult stages of wheat, 126 SNPs were found within coding regions of these genes or their neighborhood (5 Kb upstream from transcription start site [TSS] or downstream from transcription termination site [TTS] of the genes). Forty-three of these SNPs for adult resistance and 18 SNPs for seedling resistance reside within coding or neighboring regions of the ABC genes whereas 14 SNPs for adult resistance and 29 SNPs for seedling resistance reside within coding or neighboring regions of the NLR gene. Moreover, we found 17 nonsynonymous SNPs for adult resistance and five SNPs for seedling resistance in the ABC genes, and five nonsynonymous SNPs for adult resistance and six SNPs for seedling resistance in the NLR genes. Most of these coding SNPs were predicted to alter encoded amino acids and such information may serve as a starting point towards more thorough molecular and functional characterization of the designated Lr genes. Using the primer sequences of 99 known non-SNP markers from leaf rust resistance QTLs, we found candidate genes closely linked to these markers, including Lr34 with distances to its two gene-specific markers being 1212 bases (to cssfr1) and 2189 bases (to cssfr2). This study represents a comprehensive analysis of ABC, NLR and START genes in the hexaploid wheat genome and their physical relationships with QTLs for leaf rust resistance at seedling and adult stages. Our analysis suggests that the ABC (and START) genes are more likely to be co-located with QTLs for race-nonspecific, adult resistance whereas the NLR genes are more likely to be co-located with QTLs for race-specific resistance that would be often expressed at the seedling stage. Though our analysis was hampered by inaccurate or unknown physical positions of numerous QTLs due to the incomplete assembly of the complex hexaploid wheat genome that is currently available, the observed associations between (i) QTLs for race-specific resistance and NLR genes and (ii) QTLs for nonspecific resistance and ABC genes will help discover SNP variants for leaf rust resistance at seedling and adult stages. The genes containing nonsynonymous SNPs are promising candidates that can be investigated in future studies as potential new sources of leaf rust resistance in wheat breeding.
PanGEA: identification of allele specific gene expression using the 454 technology.
Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian
2009-05-14
Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occuring in the 454 technology To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: http://www.kofler.or.at/bioinformatics/PanGEA
PanGEA: Identification of allele specific gene expression using the 454 technology
Kofler, Robert; Teixeira Torres, Tatiana; Lelley, Tamas; Schlötterer, Christian
2009-01-01
Background Next generation sequencing technologies hold great potential for many biological questions. While mainly used for genomic sequencing, they are also very promising for gene expression profiling. Sequencing of cDNA does not only provide an estimate of the absolute expression level, it can also be used for the identification of allele specific gene expression. Results We developed PanGEA, a tool which enables a fast and user-friendly analysis of allele specific gene expression using the 454 technology. PanGEA allows mapping of 454-ESTs to genes or whole genomes, displaying gene expression profiles, identification of SNPs and the quantification of allele specific gene expression. The intuitive GUI of PanGEA facilitates a flexible and interactive analysis of the data. PanGEA additionally implements a modification of the Smith-Waterman algorithm which deals with incorrect estimates of homopolymer length as occuring in the 454 technology Conclusion To our knowledge, PanGEA is the first tool which facilitates the identification of allele specific gene expression. PanGEA is distributed under the Mozilla Public License and available at: PMID:19442283
Liu, Shi; Gao, Peng; Zhu, Qianglong; Luan, Feishi; Davis, Angela R.; Wang, Xiaolu
2016-01-01
Cleaved amplified polymorphic sequence (CAPS) markers are useful tools for detecting single nucleotide polymorphisms (SNPs). This study detected and converted SNP sites into CAPS markers based on high-throughput re-sequencing data in watermelon, for linkage map construction and quantitative trait locus (QTL) analysis. Two inbred lines, Cream of Saskatchewan (COS) and LSW-177 had been re-sequenced and analyzed by Perl self-compiled script for CAPS marker development. 88.7% and 78.5% of the assembled sequences of the two parental materials could map to the reference watermelon genome, respectively. Comparative assembled genome data analysis provided 225,693 and 19,268 SNPs and indels between the two materials. 532 pairs of CAPS markers were designed with 16 restriction enzymes, among which 271 pairs of primers gave distinct bands of the expected length and polymorphic bands, via PCR and enzyme digestion, with a polymorphic rate of 50.94%. Using the new CAPS markers, an initial CAPS-based genetic linkage map was constructed with the F2 population, spanning 1836.51 cM with 11 linkage groups and 301 markers. 12 QTLs were detected related to fruit flesh color, length, width, shape index, and brix content. These newly CAPS markers will be a valuable resource for breeding programs and genetic studies of watermelon. PMID:27162496
Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela
2014-01-01
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.
Talbert, Matthew E; Langefeld, Carl D; Ziegler, Julie; Mychaleckyj, Josyf C; Haffner, Steven M; Norris, Jill M; Bowden, Donald W
2009-01-01
The SOCS3 gene product participates in the feedback inhibition of a range of cytokine signals. Most notably, SOCS3 inhibits the functioning of leptin and downstream steps in insulin signaling after being expressed by terminal transcription factors, such as STAT3 and c-fos. The SOCS3 gene is located in the chromosome region 17q24–17q25, previously linked to body mass index (BMI), visceral adipose tissue (VAT), and waist circumference (WAIST) in Hispanic families in the Insulin Resistance Atherosclerosis Family Study (IRASFS). A high density map of 1536 single nucleotide polymorphisms (SNPs) was constructed to cover a portion of the 17q linkage interval in DNA samples from 1425 Hispanic subjects from 90 extended families in IRASFS. Analysis of this dense SNP map data revealed evidence of association of rs9914220 (located 10 kb 5’ of the SOCS3 gene) with BMI, VAT, and WAIST (P-value ranging from 0 003 to 0.017). Using a tagging SNP approach, rs9914220 and 22 additional SOCS3 SNPs were genotyped for genetic association analysis with measures of adiposity and glucose homeostasis. The adiposity phenotypes utilized in association analyses included BMI, WAIST, waist to hip ratio (WHR), subcutaneous adipose tissue (SAT), VAT, and visceral to subcutaneous ratio (VSR). Linkage disequilibrium (LD) calculations revealed three haplotype blocks near SOCS3. Haplotype Block 1 (5’ of SOCS3) contained SNPs consistently associated with BMI, WAIST, WHR, and VAT (P-values ranging from 2.00x10−4 to .036). Haplotype Block 3 contained single-SNPs that were associated with most adiposity traits except for VSR (P-values ranging from 0.002 to 0.047). When trait associated SNPs were included in linkage analyses as covariates, a reduction of VAT LOD score from 1.26 to .76 above the SOCS3 locus (110 cM) was observed. Multi-SNP haplotype testing using the quantitative pedigree disequilibrium test (QPDT) was broadly consistent with the single-SNP associations. In conclusion, these results support a role for SOCS3 genetic variants in human obesity. PMID:19083014
Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela
2014-01-01
High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088
Ahlstrom, Christina; Barkema, Herman W; Stevenson, Karen; Zadoks, Ruth N; Biek, Roman; Kao, Rowland; Trewby, Hannah; Haupstein, Deb; Kelton, David F; Fecteau, Gilles; Labrecque, Olivia; Keefe, Greg P; McKenna, Shawn L B; De Buck, Jeroen
2015-03-08
Mycobacterium avium subsp. paratuberculosis (MAP), the causative bacterium of Johne's disease in dairy cattle, is widespread in the Canadian dairy industry and has significant economic and animal welfare implications. An understanding of the population dynamics of MAP can be used to identify introduction events, improve control efforts and target transmission pathways, although this requires an adequate understanding of MAP diversity and distribution between herds and across the country. Whole genome sequencing (WGS) offers a detailed assessment of the SNP-level diversity and genetic relationship of isolates, whereas several molecular typing techniques used to investigate the molecular epidemiology of MAP, such as variable number of tandem repeat (VNTR) typing, target relatively unstable repetitive elements in the genome that may be too unpredictable to draw accurate conclusions. The objective of this study was to evaluate the diversity of bovine MAP isolates in Canadian dairy herds using WGS and then determine if VNTR typing can distinguish truly related and unrelated isolates. Phylogenetic analysis based on 3,039 SNPs identified through WGS of 124 MAP isolates identified eight genetically distinct subtypes in dairy herds from seven Canadian provinces, with the dominant type including over 80% of MAP isolates. VNTR typing of 527 MAP isolates identified 12 types, including "bison type" isolates, from seven different herds. At a national level, MAP isolates differed from each other by 1-2 to 239-240 SNPs, regardless of whether they belonged to the same or different VNTR types. A herd-level analysis of MAP isolates demonstrated that VNTR typing may both over-estimate and under-estimate the relatedness of MAP isolates found within a single herd. The presence of multiple MAP subtypes in Canada suggests multiple introductions into the country including what has now become one dominant type, an important finding for Johne's disease control. VNTR typing often failed to identify closely and distantly related isolates, limiting the applicability of using this typing scheme to study the molecular epidemiology of MAP at a national and herd-level.
Johnston, Susan E.; Huisman, Jisca; Ellis, Philip A.; Pemberton, Josephine M.
2017-01-01
High-density linkage maps are an important tool to gain insight into the genetic architecture of traits of evolutionary and economic interest, and provide a resource to characterize variation in recombination landscapes. Here, we used information from the cattle genome and the 50 K Cervine Illumina BeadChip to inform and refine a high-density linkage map in a wild population of red deer (Cervus elaphus). We constructed a predicted linkage map of 38,038 SNPs and a skeleton map of 10,835 SNPs across 34 linkage groups. We identified several chromosomal rearrangements in the deer lineage relative to sheep and cattle, including six chromosome fissions, one fusion, and two large inversions. Otherwise, our findings showed strong concordance with map orders in the cattle genome. The sex-averaged linkage map length was 2739.7 cM and the genome-wide autosomal recombination rate was 1.04 cM/Mb. The female autosomal map length was 1.21 longer than that of males (2767.4 cM vs. 2280.8 cM, respectively). Sex differences in map length were driven by high female recombination rates in peri-centromeric regions, a pattern that is unusual relative to other mammal species. This effect was more pronounced in fission chromosomes that would have had to produce new centromeres. We propose two hypotheses to explain this effect: (1) that this mechanism may have evolved to counteract centromeric drive associated with meiotic asymmetry in oocyte production; and/or (2) that sequence and structural characteristics suppressing recombination in close proximity to the centromere may not have evolved at neo-centromeres. Our study provides insight into how recombination landscapes vary and evolve in mammals, and will provide a valuable resource for studies of evolution, genetic improvement, and population management in red deer and related species. PMID:28667018
Determining minimal display element requirements for surface map displays
DOT National Transportation Integrated Search
2003-04-14
There is a great deal of interest in developing electronic surface map displays to enhance safety and reduce incidents and incursions on or near the airport surface. There is a lack of research, however, detailing the minimal display elements require...
Single Nucleotide Polymorphism Discovery in Bovine Pituitary Gland Using RNA-Seq Technology
Pareek, Chandra Shekhar; Smoczyński, Rafał; Kadarmideen, Haja N.; Dziuba, Piotr; Błaszczyk, Paweł; Sikora, Marcin; Walendzik, Paulina; Grzybowski, Tomasz; Pierzchała, Mariusz; Horbańczuk, Jarosław; Szostak, Agnieszka; Ogluszka, Magdalena; Zwierzchowski, Lech; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Wąsowicz, Krzysztof; Gelfand, Brian; Feng, Yaping; Kumar, Dibyendu
2016-01-01
Examination of bovine pituitary gland transcriptome by strand-specific RNA-seq allows detection of putative single nucleotide polymorphisms (SNPs) within potential candidate genes (CGs) or QTLs regions as well as to understand the genomics variations that contribute to economic trait. Here we report a breed-specific model to successfully perform the detection of SNPs in the pituitary gland of young growing bulls representing Polish Holstein-Friesian (HF), Polish Red, and Hereford breeds at three developmental ages viz., six months, nine months, and twelve months. A total of 18 bovine pituitary gland polyA transcriptome libraries were prepared and sequenced using the Illumina NextSeq 500 platform. Sequenced FastQ databases of all 18 young bulls were submitted to NCBI-SRA database with NCBI-SRA accession numbers SRS1296732. For the investigated young bulls, a total of 113,882,3098 raw paired-end reads with a length of 156 bases were obtained, resulting in an approximately 63 million paired-end reads per library. Breed-wise, a total of 515.38, 215.39, and 408.04 million paired-end reads were obtained for Polish HF, Polish Red, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed 93.04%, 94.39%, and 83.46% of the mapped sequencing reads were properly paired to the Polish HF, Polish Red, and Hereford breeds, respectively. Constructed breed-specific SNP-db of three cattle breeds yielded at 13,775,885 SNPs. On an average 765,326 breed-specific SNPs per young bull were identified. Using two stringent filtering parameters, i.e., a minimum 10 SNP reads per base with an accuracy ≥ 90% and a minimum 10 SNP reads per base with an accuracy = 100%, SNP-db records were trimmed to construct a highly reliable SNP-db. This resulted in a reduction of 95,7% and 96,4% cut-off mark of constructed raw SNP-db. Finally, SNP discoveries using RNA-Seq data were validated by KASP™ SNP genotyping assay. The comprehensive QTLs/CGs analysis of 76 QTLs/CGs with RNA-seq data identified KCNIP4, CCSER1, DPP6, MAP3K5 and GHR CGs with highest SNPs hit loci in all three breeds and developmental ages. However, CAST CG with more than 100 SNPs hits were observed only in Polish HF and Hereford breeds.These findings are important for identification and construction of novel tissue specific SNP-db and breed specific SNP-db dataset by screening of putative SNPs according to QTL db and candidate genes for bovine growth and reproduction traits, one can develop genomic selection strategies for growth and reproductive traits. PMID:27606429
Single Nucleotide Polymorphism Discovery in Bovine Pituitary Gland Using RNA-Seq Technology.
Pareek, Chandra Shekhar; Smoczyński, Rafał; Kadarmideen, Haja N; Dziuba, Piotr; Błaszczyk, Paweł; Sikora, Marcin; Walendzik, Paulina; Grzybowski, Tomasz; Pierzchała, Mariusz; Horbańczuk, Jarosław; Szostak, Agnieszka; Ogluszka, Magdalena; Zwierzchowski, Lech; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Wąsowicz, Krzysztof; Gelfand, Brian; Feng, Yaping; Kumar, Dibyendu
2016-01-01
Examination of bovine pituitary gland transcriptome by strand-specific RNA-seq allows detection of putative single nucleotide polymorphisms (SNPs) within potential candidate genes (CGs) or QTLs regions as well as to understand the genomics variations that contribute to economic trait. Here we report a breed-specific model to successfully perform the detection of SNPs in the pituitary gland of young growing bulls representing Polish Holstein-Friesian (HF), Polish Red, and Hereford breeds at three developmental ages viz., six months, nine months, and twelve months. A total of 18 bovine pituitary gland polyA transcriptome libraries were prepared and sequenced using the Illumina NextSeq 500 platform. Sequenced FastQ databases of all 18 young bulls were submitted to NCBI-SRA database with NCBI-SRA accession numbers SRS1296732. For the investigated young bulls, a total of 113,882,3098 raw paired-end reads with a length of 156 bases were obtained, resulting in an approximately 63 million paired-end reads per library. Breed-wise, a total of 515.38, 215.39, and 408.04 million paired-end reads were obtained for Polish HF, Polish Red, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed 93.04%, 94.39%, and 83.46% of the mapped sequencing reads were properly paired to the Polish HF, Polish Red, and Hereford breeds, respectively. Constructed breed-specific SNP-db of three cattle breeds yielded at 13,775,885 SNPs. On an average 765,326 breed-specific SNPs per young bull were identified. Using two stringent filtering parameters, i.e., a minimum 10 SNP reads per base with an accuracy ≥ 90% and a minimum 10 SNP reads per base with an accuracy = 100%, SNP-db records were trimmed to construct a highly reliable SNP-db. This resulted in a reduction of 95,7% and 96,4% cut-off mark of constructed raw SNP-db. Finally, SNP discoveries using RNA-Seq data were validated by KASP™ SNP genotyping assay. The comprehensive QTLs/CGs analysis of 76 QTLs/CGs with RNA-seq data identified KCNIP4, CCSER1, DPP6, MAP3K5 and GHR CGs with highest SNPs hit loci in all three breeds and developmental ages. However, CAST CG with more than 100 SNPs hits were observed only in Polish HF and Hereford breeds.These findings are important for identification and construction of novel tissue specific SNP-db and breed specific SNP-db dataset by screening of putative SNPs according to QTL db and candidate genes for bovine growth and reproduction traits, one can develop genomic selection strategies for growth and reproductive traits.
Yang, Shangbin; Xu, LiHui; Wu, Haifeng M.
2010-01-01
Warfarin exhibits significant interindividual variability in dosing requirements. Different drug responses are partly attributed to the single nucleotide polymorphisms (SNPs) that influence either drug action or drug metabolism. Rapid genotyping of these SNPs helps clinicians to choose appropriate initial doses to quickly achieve anticoagulation effects and to prevent complications. We report a novel application of surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) in the rapid genotyping of SNPs that impact warfarin efficacy. The SNPs were first amplified by PCR and then underwent single base extension to generate the specific SNP product. Next, genetic variants displaying different masses were bound to Q10 anionic proteinChips and then genotyped by using SELDI-TOF MS in a multiplex fashion. SELDI-TOF MS offered unique properties of on-chip sample enrichment and clean-ups, which streamlined the testing procedures and eliminated many tedious experimental steps required by the conventional MS-based method. The turn-around time for genotyping three known warfarin-related SNPs, CYP2C9*2, CYP2C9*3, and VKORC1 3673G>A by SELDI-TOF MS was less than 5 hours. The analytical accuracy of this method was confirmed both by bidirectional DNA sequencing and by comparing the genotype results (n = 189) obtained by SELDI-TOF MS to reports from a clinical reference laboratory. This new multiplex genotyping method provides an excellent clinical laboratory platform to promote personalized medicine in warfarin therapy. PMID:20075209
Yang, Shangbin; Xu, LiHui; Wu, Haifeng M
2010-03-01
Warfarin exhibits significant interindividual variability in dosing requirements. Different drug responses are partly attributed to the single nucleotide polymorphisms (SNPs) that influence either drug action or drug metabolism. Rapid genotyping of these SNPs helps clinicians to choose appropriate initial doses to quickly achieve anticoagulation effects and to prevent complications. We report a novel application of surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) in the rapid genotyping of SNPs that impact warfarin efficacy. The SNPs were first amplified by PCR and then underwent single base extension to generate the specific SNP product. Next, genetic variants displaying different masses were bound to Q10 anionic proteinChips and then genotyped by using SELDI-TOF MS in a multiplex fashion. SELDI-TOF MS offered unique properties of on-chip sample enrichment and clean-ups, which streamlined the testing procedures and eliminated many tedious experimental steps required by the conventional MS-based method. The turn-around time for genotyping three known warfarin-related SNPs, CYP2C9*2, CYP2C9*3, and VKORC1 3673G>A by SELDI-TOF MS was less than 5 hours. The analytical accuracy of this method was confirmed both by bidirectional DNA sequencing and by comparing the genotype results (n = 189) obtained by SELDI-TOF MS to reports from a clinical reference laboratory. This new multiplex genotyping method provides an excellent clinical laboratory platform to promote personalized medicine in warfarin therapy.
NASA Technical Reports Server (NTRS)
Aretz, Anthony J.
1990-01-01
This paper presents a cognitive model of a pilot's navigation task and describes an experiment comparing a visual momentum map display to the traditional track-up and north-up approaches. The data show the advantage to a track-up map is its congruence with the ego-centered forward view; however, the development of survey knowledge is hindered by the inconsistency of the rotating display. The stable alignment of a north-up map aids the acquisition of survey knowledge, but there is a cost associated with the mental rotation of the display to a track-up alignment for ego-centered tasks. The results also show that visual momentum can be used to reduce the mental rotation costs of a north-up display.
EvoSNP-DB: A database of genetic diversity in East Asian populations.
Kim, Young Uk; Kim, Young Jin; Lee, Jong-Young; Park, Kiejung
2013-08-01
Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/].
Belamkar, Vikas; Farmer, Andrew D.; Weeks, Nathan T.; Kalberer, Scott R.; Blackmon, William J.; Cannon, Steven B.
2016-01-01
For species with potential as new crops, rapid improvement may be facilitated by new genomic methods. Apios (Apios americana Medik.), once a staple food source of Native American Indians, produces protein-rich tubers, tolerates a wide range of soils, and symbiotically fixes nitrogen. We report the first high-quality de novo transcriptome assembly, an expression atlas, and a set of 58,154 SNP and 39,609 gene expression markers (GEMs) for characterization of a breeding collection. Both SNPs and GEMs identify six genotypic clusters in the collection. Transcripts mapped to the Phaseolus vulgaris genome–another phaseoloid legume with the same chromosome number–provide provisional genetic locations for 46,852 SNPs. Linkage disequilibrium decays within 10 kb (based on the provisional genetic locations), consistent with outcrossing reproduction. SNPs and GEMs identify more than 21 marker-trait associations for at least 11 traits. This study demonstrates a holistic approach for mining plant collections to accelerate crop improvement. PMID:27721469
Dissection of genetic architecture of grain chalk using NIR spectroscopy
USDA-ARS?s Scientific Manuscript database
Chalk is a major quality characteristic that causes grain breakage during milling and loss of crop value. In this study, we sought to elucidate the quantitatively inherited grain chalk trait in rice and to conduct genome-wide association mapping to identify SNPs and candidate genes associated with ...
USDA-ARS?s Scientific Manuscript database
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of domesticated and wild C. annuum to generate 66,960 SNPs using genotyping by sequencing. Principal component analysis and identity by state...
Ojeda, Dario I; Dhillon, Braham; Tsui, Clement K M; Hamelin, Richard C
2014-03-01
Single-nucleotide polymorphisms (SNPs) are rapidly becoming the standard markers in population genomics studies; however, their use in nonmodel organisms is limited due to the lack of cost-effective approaches to uncover genome-wide variation, and the large number of individuals needed in the screening process to reduce ascertainment bias. To discover SNPs for population genomics studies in the fungal symbionts of the mountain pine beetle (MPB), we developed a road map to discover SNPs and to produce a genotyping platform. We undertook a whole-genome sequencing approach of Leptographium longiclavatum in combination with available genomics resources of another MPB symbiont, Grosmannia clavigera. We sequenced 71 individuals pooled into four groups using the Illumina sequencing technology. We generated between 27 and 30 million reads of 75 bp that resulted in a total of 1, 181 contigs longer than 2 kb and an assembled genome size of 28.9 Mb (N50 = 48 kb, average depth = 125x). A total of 9052 proteins were annotated, and between 9531 and 17,266 SNPs were identified in the four pools. A subset of 206 genes (containing 574 SNPs, 11% false positives) was used to develop a genotyping platform for this species. Using this roadmap, we developed a genotyping assay with a total of 147 SNPs located in 121 genes using the Illumina(®) Sequenom iPLEX Gold. Our preliminary genotyping (success rate = 85%) of 304 individuals from 36 populations supports the utility of this approach for population genomics studies in other MPB fungal symbionts and other fungal nonmodel species. © 2013 John Wiley & Sons Ltd.
Katz, Lee S.; Sharma, Nitya V.; Harcourt, Brian H.; Thomas, Jennifer Dolan; Wang, Xin; Mayer, Leonard W.; Jordan, I. King
2011-01-01
Neisseria meningitidis is one of the main agents of bacterial meningitis, causing substantial morbidity and mortality worldwide. However, most of the time N. meningitidis is carried as a commensal not associated with invasive disease. The genomic basis of the difference between disease-associated and carried isolates of N. meningitidis may provide critical insight into mechanisms of virulence, yet it has remained elusive. Here, we have taken a comparative genomics approach to interrogate the difference between disease-associated and carried isolates of N. meningitidis at the level of individual nucleotide variations (i.e., single nucleotide polymorphisms [SNPs]). We aligned complete genome sequences of 8 disease-associated and 4 carried isolates of N. meningitidis to search for SNPs that show mutually exclusive patterns of variation between the two groups. We found 63 SNPs that distinguish the 8 disease-associated genomes from the 4 carried genomes of N. meningitidis, which is far more than can be expected by chance alone given the level of nucleotide variation among the genomes. The putative list of SNPs that discriminate between disease-associated and carriage genomes may be expected to change with increased sampling or changes in the identities of the isolates being compared. Nevertheless, we show that these discriminating SNPs are more likely to reflect phenotypic differences than shared evolutionary history. Discriminating SNPs were mapped to genes, and the functions of the genes were evaluated for possible connections to virulence mechanisms. A number of overrepresented functional categories related to virulence were uncovered among SNP-associated genes, including genes related to the category “symbiosis, encompassing mutualism through parasitism.” PMID:21622743
Delaney, Jessica T; Jeff, Janina M; Brown, Nancy J; Pretorius, Mias; Okafor, Henry E; Darbar, Dawood; Roden, Dan M; Crawford, Dana C
2012-01-01
Despite a greater burden of risk factors, atrial fibrillation (AF) is less common among African Americans than European-descent populations. Genome-wide association studies (GWAS) for AF in European-descent populations have identified three predominant genomic regions associated with increased risk (1q21, 4q25, and 16q22). The contribution of these loci to AF risk in African American is unknown. We studied 73 African Americans with AF from the Vanderbilt-Meharry AF registry and 71 African American controls, with no history of AF including after cardiac surgery. Tests of association were performed for 148 SNPs across the three regions associated with AF, and 22 SNPs were significantly associated with AF (P<0.05). The SNPs with the strongest associations in African Americans were both different from the index SNPs identified in European-descent populations and independent from the index European-descent population SNPs (r(2)<0.40 in HapMap CEU): 1q21 rs4845396 (odds ratio [OR] 0.30, 95% confidence interval [CI] 0.13-0.67, P = 0.003), 4q25 rs4631108 (OR 3.43, 95% CI 1.59-7.42, P = 0.002), and 16q22 rs16971547 (OR 8.1, 95% CI 1.46-45.4, P = 0.016). Estimates of European ancestry were similar among cases (23.6%) and controls (23.8%). Accordingly, the probability of having two copies of the European derived chromosomes at each region did not differ between cases and controls. Variable European admixture at known AF loci does not explain decreased AF susceptibility in African Americans. These data support the role of 1q21, 4q25, and 16q22 variants in AF risk for African Americans, although the index SNPs differ from those identified in European-descent populations.
He, Jun; Xu, Jiaqi; Wu, Xiao-Lin; Bauck, Stewart; Lee, Jungjae; Morota, Gota; Kachman, Stephen D; Spangler, Matthew L
2018-04-01
SNP chips are commonly used for genotyping animals in genomic selection but strategies for selecting low-density (LD) SNPs for imputation-mediated genomic selection have not been addressed adequately. The main purpose of the present study was to compare the performance of eight LD (6K) SNP panels, each selected by a different strategy exploiting a combination of three major factors: evenly-spaced SNPs, increased minor allele frequencies, and SNP-trait associations either for single traits independently or for all the three traits jointly. The imputation accuracies from 6K to 80K SNP genotypes were between 96.2 and 98.2%. Genomic prediction accuracies obtained using imputed 80K genotypes were between 0.817 and 0.821 for daughter pregnancy rate, between 0.838 and 0.844 for fat yield, and between 0.850 and 0.863 for milk yield. The two SNP panels optimized on the three major factors had the highest genomic prediction accuracy (0.821-0.863), and these accuracies were very close to those obtained using observed 80K genotypes (0.825-0.868). Further exploration of the underlying relationships showed that genomic prediction accuracies did not respond linearly to imputation accuracies, but were significantly affected by genotype (imputation) errors of SNPs in association with the traits to be predicted. SNPs optimal for map coverage and MAF were favorable for obtaining accurate imputation of genotypes whereas trait-associated SNPs improved genomic prediction accuracies. Thus, optimal LD SNP panels were the ones that combined both strengths. The present results have practical implications on the design of LD SNP chips for imputation-enabled genomic prediction.
Schlautman, Brandon; Covarrubias-Pazaran, Giovanny; Diaz-Garcia, Luis; Iorizzo, Massimo; Polashock, James; Grygleski, Edward; Vorsa, Nicholi; Zalapa, Juan
2017-04-03
The American cranberry ( Vaccinium macrocarpon Ait.) is a recently domesticated, economically important, fruit crop with limited molecular resources. New genetic resources could accelerate genetic gain in cranberry through characterization of its genomic structure and by enabling molecular-assisted breeding strategies. To increase the availability of cranberry genomic resources, genotyping-by-sequencing (GBS) was used to discover and genotype thousands of single nucleotide polymorphisms (SNPs) within three interrelated cranberry full-sib populations. Additional simple sequence repeat (SSR) loci were added to the SNP datasets and used to construct bin maps for the parents of the populations, which were then merged to create the first high-density cranberry composite map containing 6073 markers (5437 SNPs and 636 SSRs) on 12 linkage groups (LGs) spanning 1124 cM. Interestingly, higher rates of recombination were observed in maternal than paternal gametes. The large number of markers in common (mean of 57.3) and the high degree of observed collinearity (mean Pair-wise Spearman rank correlations >0.99) between the LGs of the parental maps demonstrates the utility of GBS in cranberry for identifying polymorphic SNP loci that are transferable between pedigrees and populations in future trait-association studies. Furthermore, the high-density of markers anchored within the component maps allowed identification of segregation distortion regions, placement of centromeres on each of the 12 LGs, and anchoring of genomic scaffolds. Collectively, the results represent an important contribution to the current understanding of cranberry genomic structure and to the availability of molecular tools for future genetic research and breeding efforts in cranberry. Copyright © 2017 Schlautman et al.
Alcina, Antonio; Fedetz, María; Ndagire, Dorothy; Fernández, Oscar; Leyva, Laura; Guerrero, Miguel; Abad-Grau, María M.; Arnal, Carmen; Delgado, Concepción; Lucas, Miguel; Izquierdo, Guillermo; Matesanz, Fuencisla
2009-01-01
Background IL-2 receptor (IL2R) alpha is the specific component of the high affinity IL2R system involved in the immune response and in the control of autoimmunity. Methods and Results Here we perform a replication and fine mapping of the IL2RA gene region analyzing 3 SNPs previously associated with multiple sclerosis (MS) and 5 SNPs associated with type 1 diabetes (T1D) in a collection of 798 MS patients and 927 matched Caucasian controls from the south of Spain. We observed association with MS in 6 of 8 SNPs. The rs1570538, at the 3′- UTR extreme of the gene, previously reported to have a weak association with MS, is replicated here (P = 0.032). The most associated T1D SNP (rs41295061) was not associated with MS in the present study. However, the rs35285258, belonging to another independent group of SNPs associated with T1D, showed the maximal association in this study but different risk allele. We replicated the association of only one (rs2104286) of the two IL2RA SNPs identified in the recently performed genome-wide association study of MS. Conclusions These findings confirm and extend the association of this gene with MS and reveal a genetic heterogeneity of the associated polymorphisms and risk alleles between MS and T1D suggesting different immunopathological roles of IL2RA in these two diseases. PMID:19125193
Pettengill, James; Strain, Errol; Allard, Marc W.; Ahmed, Rafiq; Zhao, Shaohua; Brown, Eric W.
2014-01-01
Phage typing has been used for the epidemiological surveillance of Salmonella enterica serovar Enteritidis for over 2 decades. However, knowledge of the genetic and evolutionary relationships between phage types is very limited, making differences difficult to interpret. Here, single nucleotide polymorphisms (SNPs) identified from whole-genome comparisons were used to determine the relationships between some S. Enteritidis phage types (PTs) commonly associated with food-borne outbreaks in the United States. Emphasis was placed on the predominant phage types PT8, PT13a, and PT13 in North America. With >89,400 bp surveyed across 98 S. Enteritidis isolates representing 14 distinct phage types, 55 informative SNPs were discovered within 23 chromosomally anchored loci. To maximize the discriminatory and evolutionary partitioning of these highly homogeneous strains, sequences comprising informative SNPs were concatenated into a single combined data matrix and subjected to phylogenetic analysis. The resultant phylogeny allocated most S. Enteritidis isolates into two distinct clades (clades I and II) and four subclades. Synapomorphic (shared and derived) sets of SNPs capable of distinguishing individual clades/subclades were identified. However, individual phage types appeared to be evolutionarily disjunct when mapped to this phylogeny, suggesting that phage typing may not be valid for making phylogenetic inferences. Furthermore, the set of SNPs identified here represents useful genetic markers for strain differentiation of more clonal S. Enteritidis strains and provides core genotypic markers for future development of a SNP typing scheme with S. Enteritidis. PMID:24574287
High throughput SNP discovery and genotyping in hexaploid wheat.
Rimbert, Hélène; Darrier, Benoît; Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre; Paux, Etienne
2018-01-01
Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.
Evaluation of iconic versus F-map microburst displays
NASA Technical Reports Server (NTRS)
Salzberger, Mark; Hansman, R. John; Wanke, Craig
1994-01-01
Previous studies have shown graphical presentation methods of hazardous wind shear to be superior to textual or audible warnings alone. Positional information and the strength of the hazard were observed to be and were cited by pilots as the most important factors in a display. In this experiment the use of the three different graphical presentations of hazardous wind shear are examined. Airborne predictive detectors of wind shear enable the dissemination of varying levels of information. The effectiveness of iconic and mapping display modes of different complexities are addressed through simulation and analysis. Different positional and time-varying situations are presented in a 'part-task' Boeing 767 simulator using data from actual microburst events. Experienced airline pilots fly approach profiles using both iconic and F-map wind shear alerting displays. Microburst accompanied each event is also shown to the pilot. Mapping display types are expected to be found exceptionally efficient at conveying location comparison information while iconic displays simplify the threat recognition process. Preliminary results from the simulator study are presented. Recommendations concerning the suitability of multilevel iconic and mapping displays are made. Situational problems with current display prototypes are also addressed.
Bajaj, Deepak; Das, Shouvik; Upadhyaya, Hari D.; Ranjan, Rajeev; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. Laxmipathi; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.
2015-01-01
The study identified 9045 high-quality SNPs employing both genome-wide GBS- and candidate gene-based SNP genotyping assays in 172, including 93 cultivated (desi and kabuli) and 79 wild chickpea accessions. The GWAS in a structured population of 93 sequenced accessions detected 15 major genomic loci exhibiting significant association with seed coat color. Five seed color-associated major genomic loci underlying robust QTLs mapped on a high-density intra-specific genetic linkage map were validated by QTL mapping. The integration of association and QTL mapping with gene haplotype-specific LD mapping and transcript profiling identified novel allelic variants (non-synonymous SNPs) and haplotypes in a MATE secondary transporter gene regulating light/yellow brown and beige seed coat color differentiation in chickpea. The down-regulation and decreased transcript expression of beige seed coat color-associated MATE gene haplotype was correlated with reduced proanthocyanidins accumulation in the mature seed coats of beige than light/yellow brown seed colored desi and kabuli accessions for their coloration/pigmentation. This seed color-regulating MATE gene revealed strong purifying selection pressure primarily in LB/YB seed colored desi and wild Cicer reticulatum accessions compared with the BE seed colored kabuli accessions. The functionally relevant molecular tags identified have potential to decipher the complex transcriptional regulatory gene function of seed coat coloration and for understanding the selective sweep-based seed color trait evolutionary pattern in cultivated and wild accessions during chickpea domestication. The genome-wide integrated approach employed will expedite marker-assisted genetic enhancement for developing cultivars with desirable seed coat color types in chickpea. PMID:26635822
Fine Mapping on Chromosome 13q32–34 and Brain Expression Analysis Implicates MYO16 in Schizophrenia
Rodriguez-Murillo, Laura; Xu, Bin; Roos, J Louw; Abecasis, Gonçalo R; Gogos, Joseph A; Karayiorgou, Maria
2014-01-01
We previously reported linkage of schizophrenia and schizoaffective disorder to 13q32–34 in the European descent Afrikaner population from South Africa. The nature of genetic variation underlying linkage peaks in psychiatric disorders remains largely unknown and both rare and common variants may be contributing. Here, we examine the contribution of common variants located under the 13q32–34 linkage region. We used densely spaced SNPs to fine map the linkage peak region using both a discovery sample of 415 families and a meta-analysis incorporating two additional replication family samples. In a second phase of the study, we use one family-based data set with 237 families and independent case–control data sets for fine mapping of the common variant association signal using HapMap SNPs. We report a significant association with a genetic variant (rs9583277) within the gene encoding for the myosin heavy-chain Myr 8 (MYO16), which has been implicated in neuronal phosphoinositide 3-kinase signaling. Follow-up analysis of HapMap variation within MYO16 in a second set of Afrikaner families and additional case–control data sets of European descent highlighted a region across introns 2–6 as the most likely region to harbor common MYO16 risk variants. Expression analysis revealed a significant increase in the level of MYO16 expression in the brains of schizophrenia patients. Our results suggest that common variation within MYO16 may contribute to the genetic liability to schizophrenia. PMID:24141571
Randhawa, April Kaur; Horne, David J.; Adams, Mark D.; Shey, Muki; Barnholtz-Sloan, Jill; Mayanja-Kizza, Harriet; Kaplan, Gilla; Hanekom, Willem A.; Boom, W. Henry; Hawn, Thomas R.; Stein, Catherine M.
2012-01-01
Genetic epidemiological studies of complex diseases often rely on data from the International HapMap Consortium for identification of single nucleotide polymorphisms (SNPs), particularly those that tag haplotypes. However, little is known about the relevance of the African populations used to collect HapMap data for study populations conducted elsewhere in Africa. Toll-like receptor (TLR) genes play a key role in susceptibility to various infectious diseases, including tuberculosis. We conducted full-exon sequencing in samples obtained from Uganda (n = 48) and South Africa (n = 48), in four genes in the TLR pathway: TLR2, TLR4, TLR6, and TIRAP. We identified one novel TIRAP SNP (with minor allele frequency [MAF] 3.2%) and a novel TLR6 SNP (MAF 8%) in the Ugandan population, and a TLR6 SNP that is unique to the South African population (MAF 14%). These SNPs were also not present in the 1000 Genomes data. Genotype and haplotype frequencies and linkage disequilibrium patterns in Uganda and South Africa were similar to African populations in the HapMap datasets. Multidimensional scaling analysis of polymorphisms in all four genes suggested broad overlap of all of the examined African populations. Based on these data, we propose that there is enough similarity among African populations represented in the HapMap database to justify initial SNP selection for genetic epidemiological studies in Uganda and South Africa. We also discovered three novel polymorphisms that appear to be population-specific and would only be detected by sequencing efforts. PMID:23112821
Howie, Bryan N.; Donnelly, Peter; Marchini, Jonathan
2009-01-01
Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%–20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions. PMID:19543373
Chopra, Ratan; Burow, Gloria; Farmer, Andrew; Mudge, Joann; Simpson, Charles E; Wilkins, Thea A; Baring, Michael R; Puppala, Naveen; Chamberlin, Kelly D; Burow, Mark D
2015-06-01
Single-nucleotide polymorphisms, which can be identified in the thousands or millions from comparisons of transcriptome or genome sequences, are ideally suited for making high-resolution genetic maps, investigating population evolutionary history, and discovering marker-trait linkages. Despite significant results from their use in human genetics, progress in identification and use in plants, and particularly polyploid plants, has lagged. As part of a long-term project to identify and use SNPs suitable for these purposes in cultivated peanut, which is tetraploid, we generated transcriptome sequences of four peanut cultivars, namely OLin, New Mexico Valencia C, Tamrun OL07 and Jupiter, which represent the four major market classes of peanut grown in the world, and which are important economically to the US southwest peanut growing region. CopyDNA libraries of each genotype were used to generate 2 × 54 paired-end reads using an Illumina GAIIx sequencer. Raw reads were mapped to a custom reference consisting of Tifrunner 454 sequences plus peanut ESTs in GenBank, compromising 43,108 contigs; 263,840 SNP and indel variants were identified among four genotypes compared to the reference. A subset of 6 variants was assayed across 24 genotypes representing four market types using KASP chemistry to assess the criteria for SNP selection. Results demonstrated that transcriptome sequencing can identify SNPs usable as selectable DNA-based markers in complex polyploid species such as peanut. Criteria for effective use of SNPs as markers are discussed in this context.
Evaluating the association of common APOA2 variants with type 2 diabetes
Duesing, Konsta; Charpentier, Guillaume; Marre, Michel; Tichet, Jean; Hercberg, Serge; Balkau, Beverley; Froguel, Philippe; Gibson, Fernando
2009-01-01
Background APOA2 is a positional and biological candidate gene for type 2 diabetes at the chromosome 1q21-q24 susceptibility locus. The aim of this study was to examine if HapMap phase II tag SNPs in APOA2 are associated with type 2 diabetes and quantitative traits in French Caucasian subjects. Methods We genotyped the three HapMap phase II tagging SNPs (rs6413453, rs5085 and rs5082) required to capture the common variation spanning the APOA2 locus in our type 2 diabetes case-control cohort comprising 3,093 French Caucasian subjects. The association between these variants and quantitative traits was also examined in the normoglycaemic adults of the control cohort. In addition, meta-analysis of publicly available whole genome association data was performed. Results None of the APOA2 tag SNPs were associated with type 2 diabetes in the French Caucasian case-control cohort (rs6413453, P = 0.619; rs5085, P = 0.245; rs5082, P = 0.591). However, rs5082 was marginally associated with total cholesterol levels (P = 0.026) and waist-to-hip ratio (P = 0.029). The meta-analysis of data from 12,387 subjects confirmed our finding that common variation at the APOA2 locus is not associated with type 2 diabetes. Conclusion The available data does not support a role for common variants in APOA2 on type 2 diabetes susceptibility or related quantitative traits in Northern Europeans. PMID:19216768
Evaluating the association of common APOA2 variants with type 2 diabetes.
Duesing, Konsta; Charpentier, Guillaume; Marre, Michel; Tichet, Jean; Hercberg, Serge; Balkau, Beverley; Froguel, Philippe; Gibson, Fernando
2009-02-13
APOA2 is a positional and biological candidate gene for type 2 diabetes at the chromosome 1q21-q24 susceptibility locus. The aim of this study was to examine if HapMap phase II tag SNPs in APOA2 are associated with type 2 diabetes and quantitative traits in French Caucasian subjects. We genotyped the three HapMap phase II tagging SNPs (rs6413453, rs5085 and rs5082) required to capture the common variation spanning the APOA2 locus in our type 2 diabetes case-control cohort comprising 3,093 French Caucasian subjects. The association between these variants and quantitative traits was also examined in the normoglycaemic adults of the control cohort. In addition, meta-analysis of publicly available whole genome association data was performed. None of the APOA2 tag SNPs were associated with type 2 diabetes in the French Caucasian case-control cohort (rs6413453, P = 0.619; rs5085, P = 0.245; rs5082, P = 0.591). However, rs5082 was marginally associated with total cholesterol levels (P = 0.026) and waist-to-hip ratio (P = 0.029). The meta-analysis of data from 12,387 subjects confirmed our finding that common variation at the APOA2 locus is not associated with type 2 diabetes. The available data does not support a role for common variants in APOA2 on type 2 diabetes susceptibility or related quantitative traits in Northern Europeans.
Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca e; Mundim, Gabriel Borges
2016-01-01
Abstract The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis. PMID:27007903
Paes, Geísa Pinheiro; Viana, José Marcelo Soriano; Silva, Fabyano Fonseca E; Mundim, Gabriel Borges
2016-03-01
The objectives of this study were to assess linkage disequilibrium (LD) and selection-induced changes in single nucleotide polymorphism (SNP) frequency, and to perform association mapping in popcorn chromosome regions containing quantitative trait loci (QTLs) for quality traits. Seven tropical and two temperate popcorn populations were genotyped for 96 SNPs chosen in chromosome regions containing QTLs for quality traits. The populations were phenotyped for expansion volume, 100-kernel weight, kernel sphericity, and kernel density. The LD statistics were the difference between the observed and expected haplotype frequencies (D), the proportion of D relative to the expected maximum value in the population, and the square of the correlation between the values of alleles at two loci. Association mapping was based on least squares and Bayesian approaches. In the tropical populations, D-values greater than 0.10 were observed for SNPs separated by 100-150 Mb, while most of the D-values in the temperate populations were less than 0.05. Selection for expansion volume indirectly led to increase in LD values, population differentiation, and significant changes in SNP frequency. Some associations were observed for expansion volume and the other quality traits. The candidate genes are involved with starch, storage protein, lipid, and cell wall polysaccharides synthesis.
Hu, Yingyan; Hong, Wu; Smith, Alicia; Yu, Shunying; Li, Zezhi; Wang, Dongxiang; Yuan, Chengmei; Cao, Lan; Wu, Zhiguo; Huang, Jia; Fralick, Drew; Phillips, Michael Robert; Fang, Yiru
2017-11-01
Recent research findings suggest that BDNF and BDNF signaling pathways participate in the development of major depressive disorder. Mitogen-activated extracellular signal-regulated kinase (MEK) is the most important kinase in the extracellular signal-regulated kinase pathway, and the extracellular signal-regulated kinase pathway is the key signaling pathway of BDNF, so it may play a role in development of depressive disorder. The aim of this study is to investigate the association between polymorphisms of the MAP2K1 (also known as MEK) gene and depressive disorder. Three single nucleotide polymorphisms (SNPs), were significantly associated with depressive disorder: rs1549854 (p = 0.006), rs1432441 (p = 0.025), and rs7182853 (p = 0.039). When subdividing the sample by gender, two of the SNPs remained statistically associated with depressive disorder in females: rs1549854 (p = 0.013) and rs1432441 (p = 0.04). The rs1549854 and rs1432441 polymorphisms of the MAP2K1 gene may be associated with major depressive disorder, especially in females. This study is the first to report that the MAP2K1 gene may be a genetic marker for depressive disorder. Copyright © 2017 Elsevier B.V. All rights reserved.
Non-additive genetic variation in growth, carcass and fertility traits of beef cattle.
Bolormaa, Sunduimijid; Pryce, Jennie E; Zhang, Yuandan; Reverter, Antonio; Barendse, William; Hayes, Ben J; Goddard, Michael E
2015-04-02
A better understanding of non-additive variance could lead to increased knowledge on the genetic control and physiology of quantitative traits, and to improved prediction of the genetic value and phenotype of individuals. Genome-wide panels of single nucleotide polymorphisms (SNPs) have been mainly used to map additive effects for quantitative traits, but they can also be used to investigate non-additive effects. We estimated dominance and epistatic effects of SNPs on various traits in beef cattle and the variance explained by dominance, and quantified the increase in accuracy of phenotype prediction by including dominance deviations in its estimation. Genotype data (729 068 real or imputed SNPs) and phenotypes on up to 16 traits of 10 191 individuals from Bos taurus, Bos indicus and composite breeds were used. A genome-wide association study was performed by fitting the additive and dominance effects of single SNPs. The dominance variance was estimated by fitting a dominance relationship matrix constructed from the 729 068 SNPs. The accuracy of predicted phenotypic values was evaluated by best linear unbiased prediction using the additive and dominance relationship matrices. Epistatic interactions (additive × additive) were tested between each of the 28 SNPs that are known to have additive effects on multiple traits, and each of the other remaining 729 067 SNPs. The number of significant dominance effects was greater than expected by chance and most of them were in the direction that is presumed to increase fitness and in the opposite direction to inbreeding depression. Estimates of dominance variance explained by SNPs varied widely between traits, but had large standard errors. The median dominance variance across the 16 traits was equal to 5% of the phenotypic variance. Including a dominance deviation in the prediction did not significantly increase its accuracy for any of the phenotypes. The number of additive × additive epistatic effects that were statistically significant was greater than expected by chance. Significant dominance and epistatic effects occur for growth, carcass and fertility traits in beef cattle but they are difficult to estimate precisely and including them in phenotype prediction does not increase its accuracy.
WASP: a Web-based Allele-Specific PCR assay designing tool for detecting SNPs and mutations
Wangkumhang, Pongsakorn; Chaichoompu, Kridsadakorn; Ngamphiw, Chumpol; Ruangrit, Uttapong; Chanprasert, Juntima; Assawamakin, Anunchai; Tongsima, Sissades
2007-01-01
Background Allele-specific (AS) Polymerase Chain Reaction is a convenient and inexpensive method for genotyping Single Nucleotide Polymorphisms (SNPs) and mutations. It is applied in many recent studies including population genetics, molecular genetics and pharmacogenomics. Using known AS primer design tools to create primers leads to cumbersome process to inexperience users since information about SNP/mutation must be acquired from public databases prior to the design. Furthermore, most of these tools do not offer the mismatch enhancement to designed primers. The available web applications do not provide user-friendly graphical input interface and intuitive visualization of their primer results. Results This work presents a web-based AS primer design application called WASP. This tool can efficiently design AS primers for human SNPs as well as mutations. To assist scientists with collecting necessary information about target polymorphisms, this tool provides a local SNP database containing over 10 million SNPs of various populations from public domain databases, namely NCBI dbSNP, HapMap and JSNP respectively. This database is tightly integrated with the tool so that users can perform the design for existing SNPs without going off the site. To guarantee specificity of AS primers, the proposed system incorporates a primer specificity enhancement technique widely used in experiment protocol. In particular, WASP makes use of different destabilizing effects by introducing one deliberate 'mismatch' at the penultimate (second to last of the 3'-end) base of AS primers to improve the resulting AS primers. Furthermore, WASP offers graphical user interface through scalable vector graphic (SVG) draw that allow users to select SNPs and graphically visualize designed primers and their conditions. Conclusion WASP offers a tool for designing AS primers for both SNPs and mutations. By integrating the database for known SNPs (using gene ID or rs number), this tool facilitates the awkward process of getting flanking sequences and other related information from public SNP databases. It takes into account the underlying destabilizing effect to ensure the effectiveness of designed primers. With user-friendly SVG interface, WASP intuitively presents resulting designed primers, which assist users to export or to make further adjustment to the design. This software can be freely accessed at . PMID:17697334
Zhao, Huiying; Nyholt, Dale R; Yang, Yuanhao; Wang, Jihua; Yang, Yuedong
2017-06-14
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous pathway-based methods have been developed. Here we propose a novel method, DGAT-path, to divide all SNPs assigned to genes in each pathway into LD blocks, and to sum the chi-square statistics of LD blocks for assessing the significance of the pathway by permutation tests. The method was proven robust with the type I error rate >1.6 times lower than other methods. Meanwhile, the method displays a higher power and is not biased by the pathway size. The applications to the GWAS summary statistics for schizophrenia and breast cancer indicate that the detected top pathways contain more genes close to associated SNPs than other methods. As a result, the method identified 17 and 12 significant pathways containing 20 and 21 novel associated genes, respectively for two diseases. The method is available online by http://sparks-lab.org/server/DGAT-path .
The research of selection model based on LOD in multi-scale display of electronic map
NASA Astrophysics Data System (ADS)
Zhang, Jinming; You, Xiong; Liu, Yingzhen
2008-10-01
This paper proposes a selection model based on LOD to aid the display of electronic map. The ratio of display scale to map scale is regarded as a LOD operator. The categorization rule, classification rule, elementary rule and spatial geometry character rule of LOD operator setting are also concluded.
Nimmakayala, Padma; Abburi, Venkata L.; Saminathan, Thangasamy; Alaparthi, Suresh B.; Almeida, Aldo; Davenport, Brittany; Nadimi, Marjan; Davidson, Joshua; Tonapi, Krittika; Yadav, Lav; Malkaram, Sridhar; Vajja, Gopinath; Hankins, Gerald; Harris, Robert; Park, Minkyu; Choi, Doil; Stommel, John; Reddy, Umesh K.
2016-01-01
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes. PMID:27901114
Nimmakayala, Padma; Abburi, Venkata L; Saminathan, Thangasamy; Alaparthi, Suresh B; Almeida, Aldo; Davenport, Brittany; Nadimi, Marjan; Davidson, Joshua; Tonapi, Krittika; Yadav, Lav; Malkaram, Sridhar; Vajja, Gopinath; Hankins, Gerald; Harris, Robert; Park, Minkyu; Choi, Doil; Stommel, John; Reddy, Umesh K
2016-11-30
Accumulated capsaicinoid content and increased fruit size are traits resulting from Capsicum annuum domestication. In this study, we used a diverse collection of C. annuum to generate 66,960 SNPs using genotyping by sequencing. The study identified 1189 haplotypes containing 3413 SNPs. Length of individual linkage disequilibrium (LD) blocks varied along chromosomes, with regions of high and low LD interspersed with an average LD of 139 kb. Principal component analysis (PCA), Bayesian model based population structure analysis and an Euclidean tree built based on identity by state (IBS) indices revealed that the clustering pattern of diverse accessions are in agreement with capsaicin content (CA) and fruit weight (FW) classifications indicating the importance of these traits in shaping modern pepper genome. PCA and IBS were used in a mixed linear model of capsaicin and dihydrocapsaicin content and fruit weight to reduce spurious associations because of confounding effects of subpopulations in genome-wide association study (GWAS). Our GWAS results showed SNPs in Ankyrin-like protein, IKI3 family protein, ABC transporter G family and pentatricopeptide repeat protein are the major markers for capsaicinoids and of 16 SNPs strongly associated with FW in both years of the study, 7 are located in known fruit weight controlling genes.
T cells are influenced by a long non-coding RNA in the autoimmune associated PTPN2 locus.
Houtman, Miranda; Shchetynsky, Klementy; Chemin, Karine; Hensvold, Aase Haj; Ramsköld, Daniel; Tandre, Karolina; Eloranta, Maija-Leena; Rönnblom, Lars; Uebe, Steffen; Catrina, Anca Irinel; Malmström, Vivianne; Padyukov, Leonid
2018-06-01
Non-coding SNPs in the protein tyrosine phosphatase non-receptor type 2 (PTPN2) locus have been linked with several autoimmune diseases, including rheumatoid arthritis, type I diabetes, and inflammatory bowel disease. However, the functional consequences of these SNPs are poorly characterized. Herein, we show in blood cells that SNPs in the PTPN2 locus are highly correlated with DNA methylation levels at four CpG sites downstream of PTPN2 and expression levels of the long non-coding RNA (lncRNA) LINC01882 downstream of these CpG sites. We observed that LINC01882 is mainly expressed in T cells and that anti-CD3/CD28 activated naïve CD4 + T cells downregulate the expression of LINC01882. RNA sequencing analysis of LINC01882 knockdown in Jurkat T cells, using a combination of antisense oligonucleotides and RNA interference, revealed the upregulation of the transcription factor ZEB1 and kinase MAP2K4, both involved in IL-2 regulation. Overall, our data suggests the involvement of LINC01882 in T cell activation and hints towards an auxiliary role of these non-coding SNPs in autoimmunity associated with the PTPN2 locus. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Common variants in Mendelian kidney disease genes and their association with renal function.
Parsa, Afshin; Fuchsberger, Christian; Köttgen, Anna; O'Seaghdha, Conall M; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M; Borecki, Ingrid; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Bochud, Murielle; Heid, Iris M; Siscovick, David S; Fox, Caroline S; Kao, W Linda; Böger, Carsten A
2013-12-01
Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research.
Sharma, A; Dang, C G; Kim, K S; Kim, J J; Lee, H K; Kim, H C; Yeon, S H; Kang, H S; Lee, S H
2014-12-01
The objective of this study was to validate the association of significant SNPs identified from a previous genome-wide association study with carcass weight (CWT) in a commercial Hanwoo population. We genotyped 13 SNPs located on BTA14 in 867 steers from Korea Hanwoo feedlot bulls. Of these 13 SNPs, five SNPs, namely rs29021868, rs110061498, rs109546980, rs42404006 and rs42303720, were found to be significantly associated (P < 0.001) with CWT. These five significant markers spanned the 24.3 to 29.4 Mb region of BTA14. The most significant marker (rs29021868) for CWT in this study had a 13.07 kg allele substitution effect and accounted for 2.4% of the additive genetic variance in the commercial Hanwoo population. The SNP marker rs109546980 was found to be significantly associated with both CWT (P < 0.001) and eye muscle area (P < 0.001) and could potentially be exploited for marker-assisted selection in Hanwoo cattle. We also genotyped the ss319607402 variation, which maps to intron2 of PLAG1 gene and which is already reported to be associated with height, to identify any significant association with carcass weight; however, no such association was observed in this Hanwoo commercial population. © 2014 Stichting International Foundation for Animal Genetics.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.
Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart
2016-01-01
Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications
Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart
2016-01-01
Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal. PMID:27583971
Robinson, Nicholas A; Gopikrishna, Gopalapillay; Baranski, Matthew; Katneni, Vinaya Kumar; Shekhar, Mudagandur S; Shanmugakarthik, Jayakani; Jothivel, Sarangapani; Gopal, Chavali; Ravichandran, Pitchaiyappan; Gitterle, Thomas; Ponniah, Alphis G
2014-08-28
Shrimp culture is a fast growing aquaculture sector, but in recent years there has been a shift away from tiger shrimp Penaeus monodon to other species. This is largely due to the susceptibility of P. monodon to white spot syndrome virus disease (Whispovirus sp.) which has impacted production around the world. As female penaeid shrimp grow more rapidly than males, mono-sex production would be advantageous, however little is known about genes controlling or markers associated with sex determination in shrimp. In this study, a mapped set of 3959 transcribed single nucleotide polymorphisms were used to scan the P. monodon genome for loci associated with resistance to white-spot syndrome virus and sex in seven full-sibling tiger shrimp families challenged with white spot syndrome virus. Linkage groups 2, 3, 5, 6, 17, 18, 19, 22, 27 and 43 were found to contain quantitative trait loci significantly associated with hours of survival after white spot syndrome virus infection (P < 0.05 after Bonferroni correction). Nine QTL were significantly associated with hours of survival. Of the SNPs mapping to these and other regions with suggestive associations, many were found to occur in transcripts showing homology to genes with putative immune functions of interest, including genes affecting the action of the ubiquitin-proteasome pathway, lymphocyte-cell function, heat shock proteins, the TOLL pathway, protein kinase signal transduction pathways, mRNA binding proteins, lectins and genes affecting the development and differentiation of the immune system (eg. RUNT protein 1A). Several SNPs significantly associated with sex were mapped to linkage group 30, the strongest associations (P < 0.001 after Bonferroni correction) for 3 SNPs located in a 0.8 cM stretch between positions 43.5 and 44.3 cM where the feminisation gene (FEM-1, affecting sexual differentiation in Caenorhabditis elegans) mapped. The markers for disease resistance and sexual differentiation identified by this study could be useful for marker assisted selection to improve resistance to WSSV and for identifying homogametic female individuals for mono-sex (all female) production. The genes with putative functions affecting immunity and sexual differentiation that were found to closely map to these loci provide leads about the mechanisms affecting these important economic traits in shrimp.
Leak, Tennille S; Langefeld, Carl D; Keene, Keith L; Gallagher, Carla J; Lu, Lingyi; Mychaleckyj, Josyf C; Rich, Stephen S; Freedman, Barry I; Bowden, Donald W; Sale, Michèle M
2010-02-08
Previously we performed a linkage scan of 638 African American affected sibling pairs (ASP) with type 2 diabetes (T2D) enriched for end-stage renal disease (ESRD). Ordered subset linkage analysis (OSA) revealed a linkage peak on chromosome 7p in the subset of families with earlier age of T2D diagnosis. We fine mapped this region by genotyping 11 additional polymorphic markers in the same ASP and investigated a total of 68 single nucleotide polymorphisms (SNPs) in functional candidate genes (GCK1, IL6, IGFBP1 and IGFBP3) for association with age of T2D diagnosis, age of ESRD diagnosis, duration of T2D to onset of ESRD, body mass index (BMI) in African American cases and T2D-ESRD in an African American case-control cohort. OSA of fine mapping markers supported linkage at 28 cM on 7p (near D7S3051) in early-onset T2D families (max. LOD = 3.61, P = 0.002). SNPs in candidate genes and 70 ancestry-informative markers (AIMs) were evaluated in 577 African American T2D-ESRD cases and 596 African American controls. The most significant association was observed between ESRD age of diagnosis and SNP rs730497, located in intron 1 of the GCK1 gene (recessive T2D age-adjusted P = 0.0006). Nominal associations were observed with GCK1 SNPs and T2D age of diagnosis (BMI-adjusted P = 0.014 to 0.032). Also, one IGFBP1 and four IGFBP3 SNPs showed nominal genotypic association with T2D-ESRD (P = 0.002-0.049). After correcting for multiple tests, only rs730497 remanined significant. Variant rs730947 in the GCK1 gene appears to play a role in early ESRD onset in African Americans.
Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification
Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei
2013-01-01
Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724
Bellone, Rebecca R; Forsyth, George; Leeb, Tosso; Archer, Sheila; Sigurdsson, Snaevar; Imsland, Freyja; Mauceli, Evan; Engensteiner, Martina; Bailey, Ernest; Sandmeyer, Lynne; Grahn, Bruce; Lindblad-Toh, Kerstin; Wade, Claire M
2010-05-01
Leopard Complex spotting occurs in several breeds of horses and is caused by an incompletely dominant allele (LP). Homozygosity for LP is also associated with congenital stationary night blindness (CSNB) in Appaloosa horses. Previously, LP was mapped to a 6 cm region on ECA1 containing the candidate gene TRPM1 (Transient Receptor Potential Cation Channel, Subfamily M, Member 1) and decreased expression of this gene, measured by qRT-PCR, was identified as the likely cause of both spotting and ocular phenotypes. This study describes investigations for a mutation causing or associated with the Leopard Complex and CSNB phenotype in horses. Re-sequencing of the gene and associated splice sites within the 105 624 bp genomic region of TRPM1 led to the discovery of 18 SNPs. Most of the SNPs did not have a predictive value for the presence of LP. However, one SNP (ECA1:108,249,293 C>T) found within intron 11 had a strong (P < 0.0005), but not complete, association with LP and CSNB and thus is a good marker but unlikely to be causative. To further localize the association, 70 SNPs spanning over two Mb including the TRPM1 gene were genotyped in 192 horses from three different breeds segregating for LP. A single 173 kb haplotype associated with LP and CSNB (ECA1: 108,197,355- 108,370,150) was identified. Illumina sequencing of 300 kb surrounding this haplotype revealed 57 SNP variants. Based on their localization within expressed sequences or regions of high sequence conservation across mammals, six of these SNPs were considered to be the most likely candidate mutations. While the precise function of TRPM1 remains to be elucidated, this work solidifies its functional role in both pigmentation and night vision. Further, this work has identified several potential regulatory elements of the TRPM1 gene that should be investigated further in this and other species.
Baron-Cohen, Simon; Murphy, Laura; Chakrabarti, Bhismadev; Craig, Ian; Mallya, Uma; Lakatošová, Silvia; Rehnstrom, Karola; Peltonen, Leena; Wheelwright, Sally; Allison, Carrie; Fisher, Simon E; Warrier, Varun
2014-01-01
Mathematical ability is heritable, but few studies have directly investigated its molecular genetic basis. Here we aimed to identify specific genetic contributions to variation in mathematical ability. We carried out a genome wide association scan using pooled DNA in two groups of U.K. samples, based on end of secondary/high school national academic exam achievement: high (n = 419) versus low (n = 183) mathematical ability while controlling for their verbal ability. Significant differences in allele frequencies between these groups were searched for in 906,600 SNPs using the Affymetrix GeneChip Human Mapping version 6.0 array. After meeting a threshold of p<1.5×10(-5), 12 SNPs from the pooled association analysis were individually genotyped in 542 of the participants and analyzed to validate the initial associations (lowest p-value 1.14 ×10(-6)). In this analysis, one of the SNPs (rs789859) showed significant association after Bonferroni correction, and four (rs10873824, rs4144887, rs12130910 rs2809115) were nominally significant (lowest p-value 3.278 × 10(-4)). Three of the SNPs of interest are located within, or near to, known genes (FAM43A, SFT2D1, C14orf64). The SNP that showed the strongest association, rs789859, is located in a region on chromosome 3q29 that has been previously linked to learning difficulties and autism. rs789859 lies 1.3 kbp downstream of LSG1, and 700 bp upstream of FAM43A, mapping within the potential promoter/regulatory region of the latter. To our knowledge, this is only the second study to investigate the association of genetic variants with mathematical ability, and it highlights a number of interesting markers for future study.
Norton, Gareth J.; Douglas, Alex; Lahner, Brett; Yakubova, Elena; Guerinot, Mary Lou; Pinson, Shannon R. M.; Tarpley, Lee; Eizenga, Georgia C.; McGrath, Steve P.; Zhao, Fang-Jie; Islam, M. Rafiqul; Islam, Shofiqul; Duan, Guilan; Zhu, Yongguan; Salt, David E.; Meharg, Andrew A.; Price, Adam H.
2014-01-01
The mineral concentrations in cereals are important for human health, especially for individuals who consume a cereal subsistence diet. A number of elements, such as zinc, are required within the diet, while some elements are toxic to humans, for example arsenic. In this study we carry out genome-wide association (GWA) mapping of grain concentrations of arsenic, copper, molybdenum and zinc in brown rice using an established rice diversity panel of ∼300 accessions and 36.9 k single nucleotide polymorphisms (SNPs). The study was performed across five environments: one field site in Bangladesh, one in China and two in the US, with one of the US sites repeated over two years. GWA mapping on the whole dataset and on separate subpopulations of rice revealed a large number of loci significantly associated with variation in grain arsenic, copper, molybdenum and zinc. Seventeen of these loci were detected in data obtained from grain cultivated in more than one field location, and six co-localise with previously identified quantitative trait loci. Additionally, a number of candidate genes for the uptake or transport of these elements were located near significantly associated SNPs (within 200 kb, the estimated global linkage disequilibrium previously employed in this rice panel). This analysis highlights a number of genomic regions and candidate genes for further analysis as well as the challenges faced when mapping environmentally-variable traits in a highly genetically structured diversity panel. PMID:24586963
Lalusin, Antonio; Borromeo, Teresita; Gregorio, Glenn; Hernandez, Jose; Virk, Parminder; Collard, Bertrand; McCouch, Susan R.
2015-01-01
Genome-wide association mapping studies (GWAS) are frequently used to detect QTL in diverse collections of crop germplasm, based on historic recombination events and linkage disequilibrium across the genome. Generally, diversity panels genotyped with high density SNP panels are utilized in order to assay a wide range of alleles and haplotypes and to monitor recombination breakpoints across the genome. By contrast, GWAS have not generally been performed in breeding populations. In this study we performed association mapping for 19 agronomic traits including yield and yield components in a breeding population of elite irrigated tropical rice breeding lines so that the results would be more directly applicable to breeding than those from a diversity panel. The population was genotyped with 71,710 SNPs using genotyping-by-sequencing (GBS), and GWAS performed with the explicit goal of expediting selection in the breeding program. Using this breeding panel we identified 52 QTL for 11 agronomic traits, including large effect QTLs for flowering time and grain length/grain width/grain-length-breadth ratio. We also identified haplotypes that can be used to select plants in our population for short stature (plant height), early flowering time, and high yield, and thus demonstrate the utility of association mapping in breeding populations for informing breeding decisions. We conclude by exploring how the newly identified significant SNPs and insights into the genetic architecture of these quantitative traits can be leveraged to build genomic-assisted selection models. PMID:25785447
Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J
2012-05-25
A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
Liu, Hu; Dong, Xiaoyan; Sun, Yan
2016-01-15
A series of highly charged nanoscale chelators were fabricated by grafting of poly(glycidyl methacrylate-iminodiacetic acid) (pGI) chains with iminodiacetic acid (IDA) chelating group on silica nanoparticles (SNPs) via atom transfer radical polymerization (ATRP). The nanoscale chelators, denoted as SNPs-pGI, possessed a nickel ion chelating capacity as high as 2800 μmol/g, 50 times higher than the IDA-modified Sepharose FF (IDA-Sepharose) resin reported in literature and offered a high affinity binding capacity for hexahistidine-tagged enhanced green fluorescence protein (6 × His-EGFP) after nickel ion loading. More importantly, the anionic SNPs-pGI of high charge densities displayed much better performance than IDA-Sepharose in facilitating the refolding of like-charged 6 × His-EGFP from inclusion bodies (IBs). For example, for 0.2mg/mL 6 × His-EGFP IB refolding, addition of 6.2 μL/mL SNPs-pGI with the highest charge density led to a refolding yield of 90%, over 43% higher than that obtained with 460 μL/mL IDA-Sepharose. It is notable that the much higher efficiency of the nanoscale chelator was obtained with a chelator consumption corresponding to only 1.4% of IDA-Sepharose. Moreover, the highly charged SNPs-pGI could efficiently facilitate the refolding of 6 × His-EGFP at higher IB concentrations (0.4 and 0.8 mg/mL). After refolding, nickel ions addition led to the recovery of the refolded 6 × His-EGFP with high yield (80%), purity (96%) and enrichment ratio (1.8). All the results suggest that the SNPs-pGI of high charge densities were promising for cost-effective recovery of His-tagged proteins expressed as IBs with the integrative like-charge facilitated refolding and metal-chelate affinity purification strategy. Copyright © 2015 Elsevier B.V. All rights reserved.
Stone, B N; Griesinger, G L; Modelevsky, J L
1984-01-01
We describe an interactive computational tool, PLASMAP, which allows the user to electronically store, retrieve, and display circular restriction maps. PLASMAP permits users to construct libraries of plasmid restriction maps as a set of files which may be edited in the laboratory at any time. The display feature of PLASMAP quickly generates device-independent, artist-quality, full-color or monochrome, hard copies or CRT screens of complex, conventional circular restriction maps. PMID:6320096
USDA-ARS?s Scientific Manuscript database
Genotyping by sequencing (GBS) has been developed as an affordable application of next-generation sequencing for the purposes of discovering and genotyping SNPs in a variety of crop species and populations. In this study we employed a double restriction enzyme digestion protocol (HindIII and NlaIII)...
USDA-ARS?s Scientific Manuscript database
Cowpea aphid (CPA; Aphis craccivora) is a destructive insect pest of cowpea, as well as other legume crops including alfalfa, beans, chickpea, lentils, lupins and peanuts. The utilization of aphid resistance in cowpea breeding is one of the most efficient and environmental friendly methods to contro...
Development and use of molecular markers: past and present.
Grover, Atul; Sharma, P C
2016-01-01
Molecular markers, due to their stability, cost-effectiveness and ease of use provide an immensely popular tool for a variety of applications including genome mapping, gene tagging, genetic diversity diversity, phylogenetic analysis and forensic investigations. In the last three decades, a number of molecular marker techniques have been developed and exploited worldwide in different systems. However, only a handful of these techniques, namely RFLPs, RAPDs, AFLPs, ISSRs, SSRs and SNPs have received global acceptance. A recent revolution in DNA sequencing techniques has taken the discovery and application of molecular markers to high-throughput and ultrahigh-throughput levels. Although, the choice of marker will obviously depend on the targeted use, microsatellites, SNPs and genotyping by sequencing (GBS) largely fulfill most of the user requirements. Further, modern transcriptomic and functional markers will lead the ventures onto high-density genetic map construction, identification of QTLs, breeding and conservation strategies in times to come in combination with other high throughput techniques. This review presents an overview of different marker technologies and their variants with a comparative account of their characteristic features and applications.
Vongpaisarnsin, Kornkiat; Listman, Jennifer Beth; Malison, Robert T; Gelernter, Joel
2015-01-01
The main purpose of this work was to identify a set of AIMs that stratify the genetic structure and diversity of the Thai population from a high-throughput autosomal genome-wide association study. In this study, more than one million SNPs from the International HapMap database and the Thai depression genome-wide association study have been examined to identify ancestry informative markers (AIMs) that distinguish between Thai populations. An efficient strategy is proposed to identify and characterize such SNPs and to test high-resolution SNP data from international HapMap populations. The best AIMs are identified to stratify the population and to infer genetic ancestry structure. A total of 124 AIMs were clearly clustered geographically across the continent, whereas only 89 AIMs stratified the Thai population from East Asian populations. Finally, a set of 273 AIMs was able to distinguish northern from southern Thai subpopulations. These markers will be of particular value in identifying the ethnic origins in regions where matching by self-reports is unavailable or unreliable, which usually occurs in real forensic cases. PMID:25759192
Vongpaisarnsin, Kornkiat; Listman, Jennifer Beth; Malison, Robert T; Gelernter, Joel
2015-07-01
The main purpose of this work was to identify a set of AIMs that stratify the genetic structure and diversity of the Thai population from a high-throughput autosomal genome-wide association study. In this study, more than one million SNPs from the international HapMap database and the Thai depression genome-wide association study have been examined to identify ancestry informative markers (AIMs) that distinguish between Thai populations. An efficient strategy is proposed to identify and characterize such SNPs and to test high-resolution SNP data from international HapMap populations. The best AIMs are identified to stratify the population and to infer genetic ancestry structure. A total of 124 AIMs were clearly clustered geographically across the continent, whereas only 89 AIMs stratified the Thai population from East Asian populations. Finally, a set of 273 AIMs was able to distinguish northern from southern Thai subpopulations. These markers will be of particular value in identifying the ethnic origins in regions where matching by self-reports is unavailable or unreliable, which usually occurs in real forensic cases. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Cloete, Ruben; Akurugu, Wisdom A; Werely, Cedric J; van Helden, Paul D; Christoffels, Alan
2017-08-01
The human arylamine N-acetyltransferase 1 (NAT1) enzyme plays a vital role in determining the duration of action of amine-containing drugs such as para-aminobenzoic acid (PABA) by influencing the balance between detoxification and metabolic activation of these drugs. Recently, four novel single nucleotide polymorphisms (SNPs) were identified within a South African mixed ancestry population. Modeling the effects of these SNPs within the structural protein was done to assess possible structure and function changes in the enzyme. The use of molecular dynamics simulations and stability predictions indicated less thermodynamically stable protein structures containing E264K and V231G, while the N245I change showed a stabilizing effect. Coincidently the N245I change displayed a similar free energy landscape profile to the known R64W amino acid substitution (slow acetylator), while the R242M displayed a similar profile to the published variant, I263V (proposed fast acetylator), and the wild type protein structure. Similarly, principal component analysis indicated that two amino acid substitutions (E264K and V231G) occupied less conformational clusters of folded states as compared to the WT and were found to be destabilizing (may affect protein function). However, two of the four novel SNPs that result in amino acid changes: (V231G and N245I) were predicted by both SIFT and POLYPHEN-2 algorithms to affect NAT1 protein function, while two other SNPs that result in R242M and E264K substitutions showed contradictory results based on SIFT and POLYPHEN-2 analysis. In conclusion, the structural methods were able to verify that two non-synonymous substitutions (E264K and V231G) can destabilize the protein structure, and are in agreement with mCSM predictions, and should therefore be experimentally tested for NAT1 activity. These findings could inform a strategy of incorporating genotypic data (i.e., functional SNP alleles) with phenotypic information (slow or fast acetylator) to better prescribe effective treatment using drugs metabolized by NAT1. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Kang, Yang Jae; Ahn, Yul-Kyun; Kim, Ki-Taek; Jun, Tae-Hwan
2016-10-28
Bacterial wilt (BW) is a widespread plant disease that affects a broad range of dicot and monocot hosts and is particularly harmful for solanaceous plants, such as pepper, tomato, and eggplant. The pathogen responsible for BW is the soil-borne bacterium, Ralstonia solanacearum, which can adapt to diverse temperature conditions and is found in climates ranging from tropical to temperate. Resistance to BW has been detected in some pepper plant lines; however, the genomic loci and alleles that mediate this are poorly studied in this species. We resequenced the pepper cultivars YCM344 and Taean, which are parental recombinant inbred lines (RIL) that display differential resistance phenotypes against BW, with YCM344 being highly resistant to infection with this pathogen. We identified novel single nucleotide polymorphisms (SNPs) and insertions/deletions (Indels) that are only present in both parental lines, as compared to the reference genome and further determined variations that distinguish these two cultivars from one another. We then identified potentially informative SNPs that were found in genes related to those that have been previously associated with disease resistance, such as the R genes and stress response genes. Moreover, via comparative analysis, we identified SNPs located in genomic regions that have homology to known resistance genes in the tomato genomes. From our SNP profiling in both parental lines, we could identify SNPs that are potentially responsible for BW resistance, and practically, these may be used as markers for assisted breeding schemes using these populations. We predict that our analyses will be valuable for both better understanding the YCM334/Taean-derived populations, as well as for enhancing our knowledge of critical SNPs present in the pepper genome.
OxfordGrid: a web interface for pairwise comparative map views.
Yang, Hongyu; Gingle, Alan R
2005-12-01
OxfordGrid is a web application and database schema for storing and interactively displaying genetic map data in a comparative, dot-plot, fashion. Its display is composed of a matrix of cells, each representing a pairwise comparison of mapped probe data for two linkage groups or chromosomes. These are arranged along the axes with one forming grid columns and the other grid rows with the degree and pattern of synteny/colinearity between the two linkage groups manifested in the cell's dot density and structure. A mouse click over the selected grid cell launches an image map-based display for the selected cell. Both individual and linear groups of mapped probes can be selected and displayed. Also, configurable links can be used to access other web resources for mapped probe information. OxfordGrid is implemented in C#/ASP.NET and the package, including MySQL schema creation scripts, is available at ftp://cggc.agtec.uga.edu/OxfordGrid/.
Displays, instruments, and the multi-dimensional world of cartography
NASA Technical Reports Server (NTRS)
Mccleary, George F., Jr.
1989-01-01
Cartographers are creators and purveyors of maps. Maps are representations of space, geographical images of the environment. Maps organize spatial information for convenience, particularly for use in performing tasks which involve the environment. There are many different kinds of maps, and there are as many different uses of maps as there are spatial problems to be solved. Maps and the display instrument dichotomy are examined. Also examined are the categories of map use along with the characteristics of maps.
Sajuthi, Satria P.; Sharma, Neeraj K.; Chou, Jeff W.; Palmer, Nicholette D.; McWilliams, David R.; Beal, John; Comeau, Mary E.; Ma, Lijun; Calles-Escandon, Jorge; Demons, Jamehl; Rogers, Samantha; Cherry, Kristina; Menon, Lata; Kouba, Ethel; Davis, Donna; Burris, Marcie; Byerly, Sara J.; Ng, Maggie C.Y.; Maruthur, Nisa M.; Patel, Sanjay R.; Bielak, Lawrence F.; Lange, Leslie; Guo, Xiuqing; Sale, Michèle M.; Chan, Kei Hang; Monda, Keri L.; Chen, Gary K.; Taylor, Kira; Palmer, Cameron; Edwards, Todd L; North, Kari E.; Haiman, Christopher A.; Bowden, Donald W.; Freedman, Barry I.; Langefeld, Carl D.; Das, Swapan K.
2016-01-01
Relative to European Americans, type 2 diabetes (T2D) is more prevalent in African Americans (AAs). Genetic variation may modulate transcript abundance in insulin-responsive tissues and contribute to risk; yet published studies identifying expression quantitative trait loci (eQTLs) in African ancestry populations are restricted to blood cells. This study aims to develop a map of genetically regulated transcripts expressed in tissues important for glucose homeostasis in AAs, critical for identifying the genetic etiology of T2D and related traits. Quantitative measures of adipose and muscle gene expression, and genotypic data were integrated in 260 non-diabetic AAs to identify expression regulatory variants. Their roles in genetic susceptibility to T2D, and related metabolic phenotypes were evaluated by mining GWAS datasets. eQTL analysis identified 1,971 and 2,078 cis-eGenes in adipose and muscle, respectively. Cis-eQTLs for 885 transcripts including top cis-eGenes CHURC1, USMG5, and ERAP2, were identified in both tissues. 62.1% of top cis-eSNPs were within ±50kb of transcription start sites and cis-eGenes were enriched for mitochondrial transcripts. Mining GWAS databases revealed association of cis-eSNPs for more than 50 genes with T2D (e.g. PIK3C2A, RBMS1, UFSP1), gluco-metabolic phenotypes, (e.g. INPP5E, SNX17, ERAP2, FN3KRP), and obesity (e.g. POMC, CPEB4). Integration of GWAS meta-analysis data from AA cohorts revealed the most significant association for cis-eSNPs of ATP5SL and MCCC1 genes, with T2D and BMI, respectively. This study developed the first comprehensive map of adipose and muscle tissue eQTLs in AAs (publically accessible at https://mdsetaa.phs.wakehealth.edu) and identified genetically-regulated transcripts for delineating genetic causes of T2D, and related metabolic phenotypes. PMID:27193597
Salazar, Juan A; Pacheco, Igor; Shinya, Paulina; Zapata, Patricio; Silva, Claudia; Aradhya, Mallikarjuna; Velasco, Dianne; Ruiz, David; Martínez-Gómez, Pedro; Infante, Rodrigo
2017-01-01
Marker-assisted selection (MAS) in stone fruit ( Prunus species) breeding is currently difficult to achieve due to the polygenic nature of the most relevant agronomic traits linked to fruit quality. Genotyping by sequencing (GBS), however, provides a large quantity of useful data suitable for fine mapping using Single Nucleotide Polymorphisms (SNPs) from a reference genome. In this study, GBS was used to genotype 272 seedlings of three F1 Japanese plum ( Prunus salicina Lindl) progenies derived from crossing "98-99" (as a common female parent) with "Angeleno," "September King," and "September Queen" as male parents. Raw sequences were aligned to the Peach genome v1, and 42,909 filtered SNPs were obtained after sequence alignment. In addition, 153 seedlings from the "98-99" × "Angeleno" cross were used to develop a genetic map for each parent. A total of 981 SNPs were mapped (479 for "98-99" and 502 for "Angeleno"), covering a genetic distance of 688.8 and 647.03 cM, respectively. Fifty five seedlings from this progeny were phenotyped for different fruit quality traits including ripening time, fruit weight, fruit shape, chlorophyll index, skin color, flesh color, over color, firmness, and soluble solids content in the years 2015 and 2016. Linkage-based QTL analysis allowed the identification of genomic regions significantly associated with ripening time (LG4 of both parents and both phenotyping years), fruit skin color (LG3 and LG4 of both parents and both years), chlorophyll degradation index (LG3 of both parents in 2015) and fruit weight (LG7 of both parents in 2016). These results represent a promising situation for GBS in the identification of SNP variants associated to fruit quality traits, potentially applicable in breeding programs through MAS, in a highly heterozygous crop species such as Japanese plum.
Cell Specific eQTL Analysis without Sorting Cells
Esko, Tõnu; Peters, Marjolein J.; Schurmann, Claudia; Schramm, Katharina; Kettunen, Johannes; Yaghootkar, Hanieh; Fairfax, Benjamin P.; Andiappan, Anand Kumar; Li, Yang; Fu, Jingyuan; Karjalainen, Juha; Platteel, Mathieu; Visschedijk, Marijn; Weersma, Rinse K.; Kasela, Silva; Milani, Lili; Tserel, Liina; Peterson, Pärt; Reinmaa, Eva; Hofman, Albert; Uitterlinden, André G.; Rivadeneira, Fernando; Homuth, Georg; Petersmann, Astrid; Lorbeer, Roberto; Prokisch, Holger; Meitinger, Thomas; Herder, Christian; Roden, Michael; Grallert, Harald; Ripatti, Samuli; Perola, Markus; Wood, Andrew R.; Melzer, David; Ferrucci, Luigi; Singleton, Andrew B.; Hernandez, Dena G.; Knight, Julian C.; Melchiotti, Rossella; Lee, Bernett; Poidinger, Michael; Zolezzi, Francesca; Larbi, Anis; Wang, De Yun; van den Berg, Leonard H.; Veldink, Jan H.; Rotzschke, Olaf; Makino, Seiko; Salomaa, Veikko; Strauch, Konstantin; Völker, Uwe; van Meurs, Joyce B. J.; Metspalu, Andres; Wijmenga, Cisca; Jansen, Ritsert C.; Franke, Lude
2015-01-01
The functional consequences of trait associated SNPs are often investigated using expression quantitative trait locus (eQTL) mapping. While trait-associated variants may operate in a cell-type specific manner, eQTL datasets for such cell-types may not always be available. We performed a genome-environment interaction (GxE) meta-analysis on data from 5,683 samples to infer the cell type specificity of whole blood cis-eQTLs. We demonstrate that this method is able to predict neutrophil and lymphocyte specific cis-eQTLs and replicate these predictions in independent cell-type specific datasets. Finally, we show that SNPs associated with Crohn’s disease preferentially affect gene expression within neutrophils, including the archetypal NOD2 locus. PMID:25955312
ComVisMD - compact visualization of multidimensional data: experimenting with cricket players data
NASA Astrophysics Data System (ADS)
Dandin, Shridhar B.; Ducassé, Mireille
2018-03-01
Database information is multidimensional and often displayed in tabular format (row/column display). Presented in aggregated form, multidimensional data can be used to analyze the records or objects. Online Analytical database Processing (OLAP) proposes mechanisms to display multidimensional data in aggregated forms. A choropleth map is a thematic map in which areas are colored in proportion to the measurement of a statistical variable being displayed, such as population density. They are used mostly for compact graphical representation of geographical information. We propose a system, ComVisMD inspired by choropleth map and the OLAP cube to visualize multidimensional data in a compact way. ComVisMD displays multidimensional data like OLAP Cube, where we are mapping an attribute a (first dimension, e.g. year started playing cricket) in vertical direction, object coloring based on b (second dimension, e.g. batting average), mapping varying-size circles based on attribute c (third dimension, e.g. highest score), mapping numbers based on attribute d (fourth dimension, e.g. matches played). We illustrate our approach on cricket players data, namely on two tables Country and Player. They have a large number of rows and columns: 246 rows and 17 columns for players of one country. ComVisMD’s visualization reduces the size of the tabular display by a factor of about 4, allowing users to grasp more information at a time than the bare table display.
Win, Khin Thanda; Vegas, Juan; Zhang, Chunying; Song, Kihwan; Lee, Sanghyeob
2017-01-01
QTL mapping using NGS-assisted BSA was successfully applied to an F 2 population for downy mildew resistance in cucumber. QTLs detected by NGS-assisted BSA were confirmed by conventional QTL analysis. Downy mildew (DM), caused by Pseudoperonospora cubensis, is one of the most destructive foliar diseases in cucumber. QTL mapping is a fundamental approach for understanding the genetic inheritance of DM resistance in cucumber. Recently, many studies have reported that a combination of bulked segregant analysis (BSA) and next-generation sequencing (NGS) can be a rapid and cost-effective way of mapping QTLs. In this study, we applied NGS-assisted BSA to QTL mapping of DM resistance in cucumber and confirmed the results by conventional QTL analysis. By sequencing two DNA pools each consisting of ten individuals showing high resistance and susceptibility to DM from a F 2 population, we identified single nucleotide polymorphisms (SNPs) between the two pools. We employed a statistical method for QTL mapping based on these SNPs. Five QTLs, dm2.2, dm4.1, dm5.1, dm5.2, and dm6.1, were detected and dm2.2 showed the largest effect on DM resistance. Conventional QTL analysis using the F 2 confirmed dm2.2 (R 2 = 10.8-24 %) and dm5.2 (R 2 = 14-27.2 %) as major QTLs and dm4.1 (R 2 = 8 %) as two minor QTLs, but could not detect dm5.1 and dm6.1. A new QTL on chromosome 2, dm2.1 (R 2 = 28.2 %) was detected by the conventional QTL method using an F 3 population. This study demonstrated the effectiveness of NGS-assisted BSA for mapping QTLs conferring DM resistance in cucumber and revealed the unique genetic inheritance of DM resistance in this population through two distinct major QTLs on chromosome 2 that mainly harbor DM resistance.
A survey of the population genetic variation in the human kinome.
Zhang, Wei; Catenacci, Daniel V T; Duan, Shiwei; Ratain, Mark J
2009-08-01
Protein kinases are key regulators of various biological processes, such as control of cell growth, metabolism, differentiation and apoptosis. Therefore, protein kinases have been an important class of targets for anticancer drugs. Health-related disparities such as differential drug response have been observed between human populations. A survey of the human kinases and their ligand genes for those containing population-specific genetic variants could provide new insights into the mechanisms of these health disparities and suggest novel targets for ethnicity-specific personalized medicine. Using the International HapMap Project genotypic data on single-nucleotide polymorphisms (SNPs), the protein kinase complement of the human genome (kinome) and some experimentally verified ligand genes were scanned for the existence of population-specific SNPs (eSNPs). In general, protein kinases were found to contain a much higher proportion of eSNPs than the whole genome background, indicating a stronger pressure for adaptation in individual populations. In contrast, the proportion of ligand genes containing eSNPs was not different from that of the whole genome background. Although with some important limitations, our results suggest that human kinases are more likely to be under recent positive selection than ligands. Our findings suggest that the health-related disparities associated with kinase signaling pathways are more likely to be driven by the genetic variation in the kinase genes than their cognate ligands. Illustrating the role of molecular evolution in the genetic variation of the human kinome could provide a promising route to understand the ethnic differences in cancer and facilitate the realization of ethnicity-based individualized medicine.
Leak, T. S.; Perlegas, P.S.; Smith, S.G.; Keene, K.L.; Hicks, P.J.; Langefeld, C.D.; Mychaleckyj, J.C.; Rich, S.S.; Kirk, J.K.; Freedman, B.I.; Bowden, D.W.; Sale, M.M.
2009-01-01
Variants in the engulfment and cell motility 1 (ELMO1) gene are associated with nephropathy due to type 2 diabetes mellitus (T2DM) in a Japanese cohort. We comprehensively evaluated this gene in African American (AA) T2DM patients with end-stage renal disease (ESRD). Three hundred nine HapMap tagging SNPs and 9 reportedly associated SNPs were genotyped in 577 AA T2DM-ESRD patients and 596 AA non-diabetic controls, plus 43 non-diabetic European American controls and 45 Yoruba Nigerian samples for admixture adjustment. Replication analyses were conducted in 558 AAs with T2DM-ESRD and 564 controls without diabetes. Extension analyses included 328 AA with T2DM lacking nephropathy and 326 with non-diabetic ESRD. The original and replication analyses confirmed association with four SNPs in intron 13 (permutation p-values for combined analyses = 0.001-0.003), one in intron 1 (P=0.004) and one in intron 5 (P=0.002) with T2DM-associated ESRD. In a subsequent combined analysis of all 1,135 T2DM-ESRD cases and 1,160 controls, an additional 7 intron 13 SNPs produced evidence of association (P = 3.5×10-5 – P=0.05). No associations were seen with these SNPs in those with T2DM lacking nephropathy or with ESRD due to non-diabetic causes. Variants in intron 13 of the ELMO1 gene appear to confer risk for diabetic nephropathy in AA. PMID:19183347
Sabourin, Jeremy; Nobel, Andrew B.; Valdar, William
2014-01-01
Genomewide association studies sometimes identify loci at which both the number and identities of the underlying causal variants are ambiguous. In such cases, statistical methods that model effects of multiple SNPs simultaneously can help disentangle the observed patterns of association and provide information about how those SNPs could be prioritized for follow-up studies. Current multi-SNP methods, however, tend to assume that SNP effects are well captured by additive genetics; yet when genetic dominance is present, this assumption translates to reduced power and faulty prioritizations. We describe a statistical procedure for prioritizing SNPs at GWAS loci that efficiently models both additive and dominance effects. Our method, LLARRMA-dawg, combines a group LASSO procedure for sparse modeling of multiple SNP effects with a resampling procedure based on fractional observation weights; it estimates for each SNP the robustness of association with the phenotype both to sampling variation and to competing explanations from other SNPs. In producing a SNP prioritization that best identifies underlying true signals, we show that: our method easily outperforms a single marker analysis; when additive-only signals are present, our joint model for additive and dominance is equivalent to or only slightly less powerful than modeling additive-only effects; and, when dominance signals are present, even in combination with substantial additive effects, our joint model is unequivocally more powerful than a model assuming additivity. We also describe how performance can be improved through calibrated randomized penalization, and discuss how dominance in ungenotyped SNPs can be incorporated through either heterozygote dosage or multiple imputation. PMID:25417853
High throughput SNP discovery and genotyping in hexaploid wheat
Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre
2018-01-01
Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research. PMID:29293495
Khatkar, Mehar S; Nicholas, Frank W; Collins, Andrew R; Zenger, Kyall R; Cavanagh, Julie A L; Barris, Wes; Schnabel, Robert D; Taylor, Jeremy F; Raadsma, Herman W
2008-04-24
The extent of linkage disequilibrium (LD) within a population determines the number of markers that will be required for successful association mapping and marker-assisted selection. Most studies on LD in cattle reported to date are based on microsatellite markers or small numbers of single nucleotide polymorphisms (SNPs) covering one or only a few chromosomes. This is the first comprehensive study on the extent of LD in cattle by analyzing data on 1,546 Holstein-Friesian bulls genotyped for 15,036 SNP markers covering all regions of all autosomes. Furthermore, most studies in cattle have used relatively small sample sizes and, consequently, may have had biased estimates of measures commonly used to describe LD. We examine minimum sample sizes required to estimate LD without bias and loss in accuracy. Finally, relatively little information is available on comparative LD structures including other mammalian species such as human and mouse, and we compare LD structure in cattle with public-domain data from both human and mouse. We computed three LD estimates, D', Dvol and r2, for 1,566,890 syntenic SNP pairs and a sample of 365,400 non-syntenic pairs. Mean D' is 0.189 among syntenic SNPs, and 0.105 among non-syntenic SNPs; mean r2 is 0.024 among syntenic SNPs and 0.0032 among non-syntenic SNPs. All three measures of LD for syntenic pairs decline with distance; the decline is much steeper for r2 than for D' and Dvol. The value of D' and Dvol are quite similar. Significant LD in cattle extends to 40 kb (when estimated as r2) and 8.2 Mb (when estimated as D'). The mean values for LD at large physical distances are close to those for non-syntenic SNPs. Minor allelic frequency threshold affects the distribution and extent of LD. For unbiased and accurate estimates of LD across marker intervals spanning < 1 kb to > 50 Mb, minimum sample sizes of 400 (for D') and 75 (for r2) are required. The bias due to small samples sizes increases with inter-marker interval. LD in cattle is much less extensive than in a mouse population created from crossing inbred lines, and more extensive than in humans. For association mapping in Holstein-Friesian cattle, for a given design, at least one SNP is required for each 40 kb, giving a total requirement of at least 75,000 SNPs for a low power whole-genome scan (median r2 > 0.19) and up to 300,000 markers at 10 kb intervals for a high power genome scan (median r2 > 0.62). For estimation of LD by D' and Dvol with sufficient precision, a sample size of at least 400 is required, whereas for r2 a minimum sample of 75 is adequate.
Transcriptome sequencing for high throughput SNP development and genetic mapping in Pea
2014-01-01
Background Pea has a complex genome of 4.3 Gb for which only limited genomic resources are available to date. Although SNP markers are now highly valuable for research and modern breeding, only a few are described and used in pea for genetic diversity and linkage analysis. Results We developed a large resource by cDNA sequencing of 8 genotypes representative of modern breeding material using the Roche 454 technology, combining both long reads (400 bp) and high coverage (3.8 million reads, reaching a total of 1,369 megabases). Sequencing data were assembled and generated a 68 K unigene set, from which 41 K were annotated from their best blast hit against the model species Medicago truncatula. Annotated contigs showed an even distribution along M. truncatula pseudochromosomes, suggesting a good representation of the pea genome. 10 K pea contigs were found to be polymorphic among the genetic material surveyed, corresponding to 35 K SNPs. We validated a subset of 1538 SNPs through the GoldenGate assay, proving their ability to structure a diversity panel of breeding germplasm. Among them, 1340 were genetically mapped and used to build a new consensus map comprising a total of 2070 markers. Based on blast analysis, we could establish 1252 bridges between our pea consensus map and the pseudochromosomes of M. truncatula, which provides new insight on synteny between the two species. Conclusions Our approach created significant new resources in pea, i.e. the most comprehensive genetic map to date tightly linked to the model species M. truncatula and a large SNP resource for both academic research and breeding. PMID:24521263
Schönhals, E M; Ortega, F; Barandalla, L; Aragones, A; Ruiz de Galarreta, J I; Liao, J-C; Sanetomo, R; Walkemeier, B; Tacke, E; Ritter, E; Gebhardt, C
2016-04-01
SNPs in candidate genes Pain - 1, InvCD141 (invertases), SSIV (starch synthase), StCDF1 (transcription factor), LapN (leucine aminopeptidase), and cytoplasm type are associated with potato tuber yield, starch content and/or starch yield. Tuber yield (TY), starch content (TSC), and starch yield (TSY) are complex characters of high importance for the potato crop in general and for industrial starch production in particular. DNA markers associated with superior alleles of genes that control the natural variation of TY, TSC, and TSY could increase precision and speed of breeding new cultivars optimized for potato starch production. Diagnostic DNA markers are identified by association mapping in populations of tetraploid potato varieties and advanced breeding clones. A novel association mapping population of 282 genotypes including varieties, breeding clones and Andean landraces was assembled and field evaluated in Northern Spain for TY, TSC, TSY, tuber number (TN) and tuber weight (TW). The landraces had lower mean values of TY, TW, TN, and TSY. The population was genotyped for 183 microsatellite alleles, 221 single nucleotide polymorphisms (SNPs) in fourteen candidate genes and eight known diagnostic markers for TSC and TSY. Association test statistics including kinship and population structure reproduced five known marker-trait associations of candidate genes and discovered new ones, particularly for tuber yield and starch yield. The inclusion of landraces increased the number of detected marker-trait associations. Integration of the present association mapping results with previous QTL linkage mapping studies for TY, TSC, TSY, TW, TN, and tuberization revealed some hot spots of QTL for these traits in the potato genome. The genomic positions of markers linked or associated with QTL for complex tuber traits suggest high multiplicity and genome wide distribution of the underlying genes.
Characterization of recombination features and the genetic basis in multiple cattle breeds.
Shen, Botong; Jiang, Jicai; Seroussi, Eyal; Liu, George E; Ma, Li
2018-04-27
Crossover generated by meiotic recombination is a fundamental event that facilitates meiosis and sexual reproduction. Comparative studies have shown wide variation in recombination rate among species, but the characterization of recombination features between cattle breeds has not yet been performed. Cattle populations in North America count millions, and the dairy industry has genotyped millions of individuals with pedigree information that provide a unique opportunity to study breed-level variations in recombination. Based on large pedigrees of Jersey, Ayrshire and Brown Swiss cattle with genotype data, we identified over 3.4 million maternal and paternal crossover events from 161,309 three-generation families. We constructed six breed- and sex-specific genome-wide recombination maps using 58,982 autosomal SNPs for two sexes in the three dairy cattle breeds. A comparative analysis of the six recombination maps revealed similar global recombination patterns between cattle breeds but with significant differences between sexes. We confirmed that male recombination map is 10% longer than the female map in all three cattle breeds, consistent with previously reported results in Holstein cattle. When comparing recombination hotspot regions between cattle breeds, we found that 30% and 10% of the hotspots were shared between breeds in males and females, respectively, with each breed exhibiting some breed-specific hotspots. Finally, our multiple-breed GWAS found that SNPs in eight loci affected recombination rate and that the PRDM9 gene associated with hotspot usage in multiple cattle breeds, indicating a shared genetic basis for recombination across dairy cattle breeds. Collectively, our results generated breed- and sex-specific recombination maps for multiple cattle breeds, provided a comprehensive characterization and comparison of recombination patterns between breeds, and expanded our understanding of the breed-level variations in recombination features within an important livestock species.
Cockpit display of traffic information (CDTI) and airport moving map industry survey
DOT National Transportation Integrated Search
2016-08-01
This document provides an overview of Cockpit Display of Traffic Information (CDTI) products as of May 2016, including those with airport moving map functionality, and airport moving map applications without traffic depiction. This document updates a...
User Guide for the Anvil Threat Cooridor Forecast Tool V2.4 for AWIPS
NASA Technical Reports Server (NTRS)
Barett, Joe H., III; Bauman, William H., III
2008-01-01
The Anvil Tool GUI allows users to select a Data Type, toggle the map refresh on/off, place labels, and choose the Profiler Type (source of the KSC 50 MHz profiler data), the Date- Time of the data, the Center of Plot, and the Station (location of the RAOB or 50 MHz profiler). If the Data Type is Models, the user selects a Fcst Hour (forecast hour) instead of Station. There are menus for User Profiles, Circle Label Options, and Frame Label Options. Labels can be placed near the center circle of the plot and/or at a specified distance and direction from the center of the circle (Center of Plot). The default selection for the map refresh is "ON". When the user creates a new Anvil Tool map with Refresh Map "ON, the plot is automatically displayed in the AWIPS frame. If another Anvil Tool map is already displayed and the user does not change the existing map number shown at the bottom of the GUI, the new Anvil Tool map will overwrite the old one. If the user turns the Refresh Map "OFF", the new Anvil Tool map is created but not automatically displayed. The user can still display the Anvil Tool map through the Maps dropdown menu* as shown in Figure 4.
Color stability of ground beef packaged in a low carbon monoxide atmosphere or vacuum.
Jeong, Jong Youn; Claus, James R
2011-01-01
Ground beef was either packaged in an atmosphere of 0.4% CO, 30% CO₂, and 69.6% N₂ (CO-MAP) or vacuum. After storage (48 h, 2-3°C), packages of CO-MAP and vacuum were opened and overwrapped with polyvinyl chloride. Other CO-MAP and vacuum packages were left intact. Packages were initially displayed for 7 days (2-3°C). Intact packages were further displayed up to 35 days before being opened and displayed (1 or 3 days). Intact CO-MAP packaged ground beef was always more red than intact vacuum-packaged ground beef. Color was relatively stable for both types of intact packages over 35 days of display. Upon opening CO-MAP packaged ground beef, the red color decreased slower than in ground beef from vacuum packages. Published by Elsevier Ltd.
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao
Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos
2015-01-01
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980
Namgoong, Suhg; Cheong, Hyun Sub; Kim, Ji On; Kim, Lyoung Hyo; Na, Han Sung; Koh, In Song; Chung, Myeon Woo; Shin, Hyoung Doo
2015-11-01
Organic anion-transporting polypeptide (OATP; gene symbol, SLCO) transporters are generally involved in the uptake of multiple drugs and their metabolites at most epithelial barriers. The pattern of single-nucleotide polymorphisms (SNPs) in these transporters may be determinants of interindividual variability in drug disposition and response. The objective of this study was to define the distribution of SNPs of three SLCO genes, SLCO1B1, SLCO1B3, and SLCO2B1, in a Korean population and other ethnic groups. The study was screened using the Illumina GoldenGate assay for genomic DNA from 450 interethnic subjects, including 11 pharmacogenetic core variants and 76 HapMap tagging SNPs. The genotype distribution of the Korean population was similar to East Asian populations, but significantly different from African American and European American cohorts. These interethnic differences will be useful information for prospective studies, including genetic association and pharmacogenetic studies of drug metabolism by SLCO families. Copyright © 2015 Elsevier B.V. All rights reserved.
Genetic ancestry is associated with colorectal adenomas and adenocarcinomas in Latino populations.
Hernandez-Suarez, Gustavo; Sanabria, Maria Carolina; Serrano, Marta; Herran, Oscar F; Perez, Jesus; Plata, Jose L; Zabaleta, Jovanny; Tenesa, Albert
2014-10-01
Colorectal cancer rates in Latin American countries are less than half of those observed in the United States. Latin Americans are the resultant of generations of an admixture of Native American, European, and African individuals. The potential role of genetic admixture in colorectal carcinogenesis has not been examined. We evaluate the association of genetic ancestry with colorectal neoplasms in 190 adenocarcinomas, 113 sporadic adenomas and 243 age- and sex-matched controls enrolled in a multicentric case-control study in Colombia. Individual ancestral genetic fractions were estimated using the STRUCTURE software, based on allele frequencies and assuming three distinct population origins. We used the Illumina Cancer Panel to genotype 1,421 sparse single-nucleotide polymorphisms (SNPs), and Northern and Western European ancestry, LWJ and Han Chinese in Beijing, China populations from the HapMap project as references. A total of 678 autosomal SNPs overlapped with the HapMap data set SNPs and were used for ancestry estimations. African mean ancestry fraction was higher in adenomas (0.13, 95% confidence interval (95% CI)=0.11-0.15) and cancer cases (0.14, 95% CI=0.12-0.16) compared with controls (0.11, 95% CI=0.10-0.12). Conditional logistic regression analysis, controlling for known risk factors, showed a positive association of African ancestry per 10% increase with both colorectal adenoma (odds ratio (OR)=1.12, 95% CI=0.97-1.30) and adenocarcinoma (OR=1.19, 95% CI=1.05-1.35). In conclusion, increased African ancestry (or variants linked to it) contributes to the increased susceptibility of colorectal cancer in admixed Latin American population.
HapMap-based study of CIP2A gene polymorphisms and HCC susceptibility
LI, YUCHUN; WANG, KAIJUAN; DAI, LIPING; WANG, PENG; SONG, CHUNHUA; SHI, JIANXIANG; REN, PENGFEI; YE, HUA; ZHANG, JIANYING
2012-01-01
CIP2A is a human oncoprotein that inhibits PP2A and stabilizes c-myc in human malignancies. Autoantibodies to CIP2A protein have been reported to be present in higher levels in sera from patients with hepatocellular carcinoma (HCC) than in sera of healthy individuals. The CIP2A gene has been demonstrated as a potential cancer susceptibility gene. To elucidate whether common CIP2A variants are associated with HCC susceptibility, we conducted a case-control study comprising 233 cases of HCC and 280 controls matched on age, gender and ethnicity in the Chinese Han population. Two haplotype-tagging single nucleotide polymorphisms (htSNPs) (rs2278911 and rs4855656) from the HapMap database were analyzed, which provide an almost complete coverage of the genetic variations in the CIP2A gene. We found that neither of these htSNPs and haplotypes were associated with the risk of HCC. However, an interaction was observed between hepatitis virus B and C infection (HBV and HCV) and the C carriers (TC or CC) of rs2278911 on HCC risk (OR=12.35; 95% CI, 4.93–19.87). No such association was found for rs4855656. Our study also demonstrated that two htSNPs (rs2278911 and rs4855656) in the CIP2A gene are not associated with the risk of HCC. HBV and HCV infection was found to exert a synergistic effect on the risk of HCC in individuals with the C carriers (TC or CC) of rs2278911 in the Chinese Han population. PMID:22844383
The OncoArray Consortium: A Network for Understanding the Genetic Architecture of Common Cancers.
Amos, Christopher I; Dennis, Joe; Wang, Zhaoming; Byun, Jinyoung; Schumacher, Fredrick R; Gayther, Simon A; Casey, Graham; Hunter, David J; Sellers, Thomas A; Gruber, Stephen B; Dunning, Alison M; Michailidou, Kyriaki; Fachal, Laura; Doheny, Kimberly; Spurdle, Amanda B; Li, Yafang; Xiao, Xiangjun; Romm, Jane; Pugh, Elizabeth; Coetzee, Gerhard A; Hazelett, Dennis J; Bojesen, Stig E; Caga-Anan, Charlisse; Haiman, Christopher A; Kamal, Ahsan; Luccarini, Craig; Tessier, Daniel; Vincent, Daniel; Bacot, François; Van Den Berg, David J; Nelson, Stefanie; Demetriades, Stephen; Goldgar, David E; Couch, Fergus J; Forman, Judith L; Giles, Graham G; Conti, David V; Bickeböller, Heike; Risch, Angela; Waldenberger, Melanie; Brüske-Hohlfeld, Irene; Hicks, Belynda D; Ling, Hua; McGuffog, Lesley; Lee, Andrew; Kuchenbaecker, Karoline; Soucy, Penny; Manz, Judith; Cunningham, Julie M; Butterbach, Katja; Kote-Jarai, Zsofia; Kraft, Peter; FitzGerald, Liesel; Lindström, Sara; Adams, Marcia; McKay, James D; Phelan, Catherine M; Benlloch, Sara; Kelemen, Linda E; Brennan, Paul; Riggan, Marjorie; O'Mara, Tracy A; Shen, Hongbing; Shi, Yongyong; Thompson, Deborah J; Goodman, Marc T; Nielsen, Sune F; Berchuck, Andrew; Laboissiere, Sylvie; Schmit, Stephanie L; Shelford, Tameka; Edlund, Christopher K; Taylor, Jack A; Field, John K; Park, Sue K; Offit, Kenneth; Thomassen, Mads; Schmutzler, Rita; Ottini, Laura; Hung, Rayjean J; Marchini, Jonathan; Amin Al Olama, Ali; Peters, Ulrike; Eeles, Rosalind A; Seldin, Michael F; Gillanders, Elizabeth; Seminara, Daniela; Antoniou, Antonis C; Pharoah, Paul D P; Chenevix-Trench, Georgia; Chanock, Stephen J; Simard, Jacques; Easton, Douglas F
2017-01-01
Common cancers develop through a multistep process often including inherited susceptibility. Collaboration among multiple institutions, and funding from multiple sources, has allowed the development of an inexpensive genotyping microarray, the OncoArray. The array includes a genome-wide backbone, comprising 230,000 SNPs tagging most common genetic variants, together with dense mapping of known susceptibility regions, rare variants from sequencing experiments, pharmacogenetic markers, and cancer-related traits. The OncoArray can be genotyped using a novel technology developed by Illumina to facilitate efficient genotyping. The consortium developed standard approaches for selecting SNPs for study, for quality control of markers, and for ancestry analysis. The array was genotyped at selected sites and with prespecified replicate samples to permit evaluation of genotyping accuracy among centers and by ethnic background. The OncoArray consortium genotyped 447,705 samples. A total of 494,763 SNPs passed quality control steps with a sample success rate of 97% of the samples. Participating sites performed ancestry analysis using a common set of markers and a scoring algorithm based on principal components analysis. Results from these analyses will enable researchers to identify new susceptibility loci, perform fine-mapping of new or known loci associated with either single or multiple cancers, assess the degree of overlap in cancer causation and pleiotropic effects of loci that have been identified for disease-specific risk, and jointly model genetic, environmental, and lifestyle-related exposures. Ongoing analyses will shed light on etiology and risk assessment for many types of cancer. Cancer Epidemiol Biomarkers Prev; 26(1); 126-35. ©2016 AACR. ©2016 American Association for Cancer Research.
The Role of Local Ancestry Adjustment in Association Studies Using Admixed Populations
Zhang, Jianqi; Stram, Daniel O.
2016-01-01
Association analysis using admixed populations imposes challenges and opportunities for disease mapping. By developing some explicit results for the variance of an allele of interest conditional on either local or global ancestry and by simulation of recently admixed genomes we evaluate power and false-positive rates under a variety of scenarios concerning linkage disequilibrium (LD) and the presence of unmeasured variants. Pairwise LD patterns were compared between admixed and nonadmixed populations using the HapMap phase 3 data. Based on the above, we showed that as follows: For causal variants with similar effect size in all populations, power is generally higher in a study using admixed population than using nonadmixed population, especially for highly differentiated SNPs. This gain of power is achieved with adjustment of global ancestry, which completely removes any cross-chromosome inflation of type I error rates, and addresses much of the intrachromosome inflation.If reliably estimated, adjusting for local ancestry precisely recovers the localization that could have been achieved in a stratified analysis of source populations. Improved localization is most evident for highly differentiated SNPs; however, the advantage of higher power is lost on exactly the same differentiated SNPs.In the real admixed populations such as African Americans and Latinos, the expansion of LD is not as dramatic as in our simulation.While adjustment for global ancestry is required prior to announcing a novel association seen in an admixed population, local ancestry adjustment may best be regarded as a localization tool not strictly required for discovery purposes. PMID:25043967
The OncoArray Consortium: a Network for Understanding the Genetic Architecture of Common Cancers
Amos, Christopher I.; Dennis, Joe; Wang, Zhaoming; Byun, Jinyoung; Schumacher, Fredrick R.; Gayther, Simon A.; Casey, Graham; Hunter, David J.; Sellers, Thomas A.; Gruber, Stephen B.; Dunning, Alison M.; Michailidou, Kyriaki; Fachal, Laura; Doheny, Kimberly; Spurdle, Amanda B.; Li, Yafang; Xiao, Xiangjun; Romm, Jane; Pugh, Elizabeth; Coetzee, Gerhard A.; Hazelett, Dennis J.; Bojesen, Stig E.; Caga-Anan, Charlisse; Haiman, Christopher A.; Kamal, Ahsan; Luccarini, Craig; Tessier, Daniel; Vincent, Daniel; Bacot, François; Van Den Berg, David J.; Nelson, Stefanie; Demetriades, Stephen; Goldgar, David E.; Couch, Fergus J.; Forman, Judith L.; Giles, Graham G.; Conti, David V.; Bickeböller, Heike; Risch, Angela; Waldenberger, Melanie; Brüske, Irene; Hicks, Belynda D.; Ling, Hua; McGuffog, Lesley; Lee, Andrew; Kuchenbaecker, Karoline B.; Soucy, Penny; Manz, Judith; Cunningham, Julie M.; Butterbach, Katja; Kote-Jarai, Zsofia; Kraft, Peter; FitzGerald, Liesel M.; Lindström, Sara; Adams, Marcia; McKay, James D.; Phelan, Catherine M.; Benlloch, Sara; Kelemen, Linda E.; Brennan, Paul; Riggan, Marjorie; O’Mara, Tracy A.; Shen, Hongbin; Shi, Yongyong; Thompson, Deborah J.; Goodman, Marc T.; Nielsen, Sune F.; Berchuck, Andrew; Laboissiere, Sylvie; Schmit, Stephanie L.; Shelford, Tameka; Edlund, Christopher K.; Taylor, Jack A.; Field, John K.; Park, Sue K.; Offit, Kenneth; Thomassen, Mads; Schmutzler, Rita; Ottini, Laura; Hung, Rayjean J.; Marchini, Jonathan; Al Olama, Ali Amin; Peters, Ulrike; Eeles, Rosalind A.; Seldin, Michael F.; Gillanders, Elizabeth; Seminara, Daniela; Antoniou, Antonis C.; Pharoah, Paul D.; Chenevix-Trench, Georgia; Chanock, Stephen J.; Simard, Jacques; Easton, Douglas F.
2016-01-01
Background Common cancers develop through a multistep process often including inherited susceptibility. Collaboration among multiple institutions, and funding from multiple sources, has allowed the development of an inexpensive genotyping microarray, the OncoArray. The array includes a genome-wide backbone, comprising 230,000 SNPs tagging most common genetic variants, together with dense mapping of known susceptibility regions, rare variants from sequencing experiments, pharmacogenetic markers and cancer related traits. Methods The OncoArray can be genotyped using a novel technology developed by Illumina to facilitate efficient genotyping. The consortium developed standard approaches for selecting SNPs for study, for quality control of markers and for ancestry analysis. The array was genotyped at selected sites and with prespecified replicate samples to permit evaluation of genotyping accuracy among centers and by ethnic background. Results The OncoArray consortium genotyped 447,705 samples. A total of 494,763 SNPs passed quality control steps with a sample success rate of 97% of the samples. Participating sites performed ancestry analysis using a common set of markers and a scoring algorithm based on principal components analysis. Conclusions Results from these analyses will enable researchers to identify new susceptibility loci, perform fine mapping of new or known loci associated with either single or multiple cancers, assess the degree of overlap in cancer causation and pleiotropic effects of loci that have been identified for disease-specific risk, and jointly model genetic, environmental and lifestyle related exposures. Impact Ongoing analyses will shed light on etiology and risk assessment for many types of cancer. PMID:27697780
SNP identification in FBXO32 gene and their associations with growth traits in cattle.
Wang, Ailan; Zhang, Ya; Li, Mijie; Lan, Xianyong; Wang, Juqiang; Chen, Hong
2013-02-15
The F-box protein 32 (FBXO32), also known as Atrogin-1, is one of the four subunits of the ubiquitin protein ligase complex. FBXO32 has been previously shown to be involved in regulation of initiation and development of muscle mass. In the present study, we investigated the polymorphism of FBXO32 gene in 1313 cattle from seven bovine breeds using DNA sequencing, polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and PCR-based amplification-created restriction site (PCR-ACRS) methods. Four novel single nucleotide polymorphisms (SNPs) were identified within bovine FBXO32, and were deposited in the GenBank database. The association studies between these four SNPs and growth traits were performed in NanYang cattle. Notably, the SNPs ss411628932 and ss411628936 were shown to be significantly associated with body length of 24-month-old NanYang cattle. Based on the above four SNPs, 16 haplotypes were identified. The main haplotype was AATA, which occurred at a frequency of more than 40%. Additionally, phylogenetic analysis showed that geographical distance was essential to gene flow among seven cattle breeds. Indigenous bovine breeds displayed genetic difference in comparison to hybrid bovine breeds that have foreign origins. We herein describe for the first time a comprehensive study on the variability of bovine FBXO32 gene that is predictive of genetic potential for body length phenotype. Copyright © 2012 Elsevier B.V. All rights reserved.
Control-display mapping in brain-computer interfaces.
Thurlings, Marieke E; van Erp, Jan B F; Brouwer, Anne-Marie; Blankertz, Benjamin; Werkhoven, Peter
2012-01-01
Event-related potential (ERP) based brain-computer interfaces (BCIs) employ differences in brain responses to attended and ignored stimuli. When using a tactile ERP-BCI for navigation, mapping is required between navigation directions on a visual display and unambiguously corresponding tactile stimuli (tactors) from a tactile control device: control-display mapping (CDM). We investigated the effect of congruent (both display and control horizontal or both vertical) and incongruent (vertical display, horizontal control) CDMs on task performance, the ERP and potential BCI performance. Ten participants attended to a target (determined via CDM), in a stream of sequentially vibrating tactors. We show that congruent CDM yields best task performance, enhanced the P300 and results in increased estimated BCI performance. This suggests a reduced availability of attentional resources when operating an ERP-BCI with incongruent CDM. Additionally, we found an enhanced N2 for incongruent CDM, which indicates a conflict between visual display and tactile control orientations. Incongruency in control-display mapping reduces task performance. In this study, brain responses, task and system performance are related to (in)congruent mapping of command options and the corresponding stimuli in a brain-computer interface (BCI). Directional congruency reduces task errors, increases available attentional resources, improves BCI performance and thus facilitates human-computer interaction.
USDA-ARS?s Scientific Manuscript database
The objective of this study was to analyze genetic mutations in the swine odorant binding protein (OBP), map them on the swine genome, and to investigate the association between genetic variants and growth traits. The tissue sample of intramuscular fat from 10 individuals for each growth stage (0, 9...
USDA-ARS?s Scientific Manuscript database
Cowpea (Vigna unguiculata (L) Walp.) is an important legume and the antioxidants in cowpea seeds have been recognized as health-promoting compounds for human. The objectives of this study were to analyze the population structure of cowpea collections using single nucleotide polymorphism (SNP) and to...
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to de...
SNP identification, genetic mapping and tissue expression of the rainbow trout TLR9 gene
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) in Toll-like receptor (TLR) genes have been reported to be associated with disease resistance in human and livestock. A number of TLR genes have been identified in rainbow trout including TLR2, TLR3, TLR5, TLR20, TLR22 and TLR23. The rainbow trout (Oncorhynch...
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such ...
Otto, Lars-Gernot; Mondal, Prodyut; Brassac, Jonathan; Preiss, Susanne; Degenhardt, Jörg; He, Sang; Reif, Jochen Christoph; Sharbel, Timothy Francis
2017-08-10
Chamomile (Matricaria recutita L.) has a long history of use in herbal medicine with various applications, and the flower heads contain numerous secondary metabolites which are medicinally active. In the major crop plants, next generation sequencing (NGS) approaches are intensely applied to exploit genetic resources, to develop genomic resources and to enhance breeding. Here, genotyping-by-sequencing (GBS) has been used in the non-model medicinal plant chamomile to evaluate the genetic structure of the cultivated varieties/populations, and to perform genome wide association study (GWAS) focusing on genes with large effect on flowering time and the medicinally important alpha-bisabolol content. GBS analysis allowed the identification of 6495 high-quality SNP-markers in our panel of 91 M. recutita plants from 33 origins (2-4 genotypes each) and 4 M. discoidea plants as outgroup, grown in the greenhouse in Gatersleben, Germany. M. recutita proved to be clearly distinct from the outgroup, as was demonstrated by different cluster and principal coordinate analyses using the SNP-markers. Chamomile genotypes from the same origin were mostly genetically similar. Model-based cluster analysis revealed one large group of tetraploid genotypes with low genetic differentiation including 39 plants from 14 origins. Tetraploids tended to display lower genetic diversity than diploids, probably reflecting their origin by artificial polyploidisation from only a limited set of genetic backgrounds. Analyses of flowering time demonstrated that diploids generally flowered earlier than tetraploids, and the analysis of alpha-bisabolol identified several tetraploid genotypes with a high content. GWAS identified highly significant (P < 0.01) SNPs for flowering time (9) and alpha-bisabolol (71). One sequence harbouring SNPs associated with flowering time was described to play a role in self-pollination in Arabidopsis thaliana, whereas four sequences harbouring SNPs associated with alpha-bisabolol were identified to be involved in plant biotic and abiotic stress response in various plants species. The first genomic resource for future applications to enhance breeding in chamomile was created, andanalyses of diversity will facilitate the exploitation of these genetic resources. The GWAS data pave the way for future research towards the genetics underlying important traits in chamomile, the identification of marker-trait associations, and development of reliable markers for practical breeding.
2014-01-01
Background Numerous quantitative trait loci (QTL) have been detected in pigs over the past 20 years using microsatellite markers. However, due to the low density of these markers, the accuracy of QTL location has generally been poor. Since 2009, the dense genome coverage provided by the Illumina PorcineSNP60 BeadChip has made it possible to more accurately map QTL using genome-wide association studies (GWAS). Our objective was to perform high-density GWAS in order to identify genomic regions and corresponding haplotypes associated with production traits in a French Large White population of pigs. Methods Animals (385 Large White pigs from 106 sires) were genotyped using the PorcineSNP60 BeadChip and evaluated for 19 traits related to feed intake, growth, carcass composition and meat quality. Of the 64 432 SNPs on the chip, 44 412 were used for GWAS with an animal mixed model that included a regression coefficient for the tested SNPs and a genomic kinship matrix. SNP haplotype effects in QTL regions were then tested for association with phenotypes following phase reconstruction based on the Sscrofa10.2 pig genome assembly. Results Twenty-three QTL regions were identified on autosomes and their effects ranged from 0.25 to 0.75 phenotypic standard deviation units for feed intake and feed efficiency (four QTL), carcass (12 QTL) and meat quality traits (seven QTL). The 10 most significant QTL regions had effects on carcass (chromosomes 7, 10, 16, 17 and 18) and meat quality traits (two regions on chromosome 1 and one region on chromosomes 8, 9 and 13). Thirteen of the 23 QTL regions had not been previously described. A haplotype block of 183 kb on chromosome 1 (six SNPs) was identified and displayed three distinct haplotypes with significant (0.0001 < P < 0.03) associations with all evaluated meat quality traits. Conclusions GWAS analyses with the PorcineSNP60 BeadChip enabled the detection of 23 QTL regions that affect feed consumption, carcass and meat quality traits in a LW population, of which 13 were novel QTL. The proportionally larger number of QTL found for meat quality traits suggests a specific opportunity for improving these traits in the pig by genomic selection. PMID:24528607
Topographical Hill Shading Map Production Based Tianditu (map World)
NASA Astrophysics Data System (ADS)
Wang, C.; Zha, Z.; Tang, D.; Yang, J.
2018-04-01
TIANDITU (Map World) is the public version of National Platform for Common Geospatial Information Service, and the terrain service is an important channel for users on the platform. With the development of TIANDITU, topographical hill shading map production for providing and updating global terrain map on line becomes necessary for the characters of strong intuition, three-dimensional sense and aesthetic effect. As such, the terrain service of TIANDITU focuses on displaying the different scales of topographical data globally. And this paper mainly aims to research the method of topographical hill shading map production globally using DEM (Digital Elevation Model) data between the displaying scales about 1 : 140,000,000 to 1 : 4,000,000, corresponded the display level from 2 to 7 on TIANDITU website.
Randall, Joshua C; Winkler, Thomas W; Kutalik, Zoltán; Berndt, Sonja I; Jackson, Anne U; Monda, Keri L; Kilpeläinen, Tuomas O; Esko, Tõnu; Mägi, Reedik; Li, Shengxu; Workalemahu, Tsegaselassie; Feitosa, Mary F; Croteau-Chonka, Damien C; Day, Felix R; Fall, Tove; Ferreira, Teresa; Gustafsson, Stefan; Locke, Adam E; Mathieson, Iain; Scherag, Andre; Vedantam, Sailaja; Wood, Andrew R; Liang, Liming; Steinthorsdottir, Valgerdur; Thorleifsson, Gudmar; Dermitzakis, Emmanouil T; Dimas, Antigone S; Karpe, Fredrik; Min, Josine L; Nicholson, George; Clegg, Deborah J; Person, Thomas; Krohn, Jon P; Bauer, Sabrina; Buechler, Christa; Eisinger, Kristina; Bonnefond, Amélie; Froguel, Philippe; Hottenga, Jouke-Jan; Prokopenko, Inga; Waite, Lindsay L; Harris, Tamara B; Smith, Albert Vernon; Shuldiner, Alan R; McArdle, Wendy L; Caulfield, Mark J; Munroe, Patricia B; Grönberg, Henrik; Chen, Yii-Der Ida; Li, Guo; Beckmann, Jacques S; Johnson, Toby; Thorsteinsdottir, Unnur; Teder-Laving, Maris; Khaw, Kay-Tee; Wareham, Nicholas J; Zhao, Jing Hua; Amin, Najaf; Oostra, Ben A; Kraja, Aldi T; Province, Michael A; Cupples, L Adrienne; Heard-Costa, Nancy L; Kaprio, Jaakko; Ripatti, Samuli; Surakka, Ida; Collins, Francis S; Saramies, Jouko; Tuomilehto, Jaakko; Jula, Antti; Salomaa, Veikko; Erdmann, Jeanette; Hengstenberg, Christian; Loley, Christina; Schunkert, Heribert; Lamina, Claudia; Wichmann, H Erich; Albrecht, Eva; Gieger, Christian; Hicks, Andrew A; Johansson, Asa; Pramstaller, Peter P; Kathiresan, Sekar; Speliotes, Elizabeth K; Penninx, Brenda; Hartikainen, Anna-Liisa; Jarvelin, Marjo-Riitta; Gyllensten, Ulf; Boomsma, Dorret I; Campbell, Harry; Wilson, James F; Chanock, Stephen J; Farrall, Martin; Goel, Anuj; Medina-Gomez, Carolina; Rivadeneira, Fernando; Estrada, Karol; Uitterlinden, André G; Hofman, Albert; Zillikens, M Carola; den Heijer, Martin; Kiemeney, Lambertus A; Maschio, Andrea; Hall, Per; Tyrer, Jonathan; Teumer, Alexander; Völzke, Henry; Kovacs, Peter; Tönjes, Anke; Mangino, Massimo; Spector, Tim D; Hayward, Caroline; Rudan, Igor; Hall, Alistair S; Samani, Nilesh J; Attwood, Antony Paul; Sambrook, Jennifer G; Hung, Joseph; Palmer, Lyle J; Lokki, Marja-Liisa; Sinisalo, Juha; Boucher, Gabrielle; Huikuri, Heikki; Lorentzon, Mattias; Ohlsson, Claes; Eklund, Niina; Eriksson, Johan G; Barlassina, Cristina; Rivolta, Carlo; Nolte, Ilja M; Snieder, Harold; Van der Klauw, Melanie M; Van Vliet-Ostaptchouk, Jana V; Gejman, Pablo V; Shi, Jianxin; Jacobs, Kevin B; Wang, Zhaoming; Bakker, Stephan J L; Mateo Leach, Irene; Navis, Gerjan; van der Harst, Pim; Martin, Nicholas G; Medland, Sarah E; Montgomery, Grant W; Yang, Jian; Chasman, Daniel I; Ridker, Paul M; Rose, Lynda M; Lehtimäki, Terho; Raitakari, Olli; Absher, Devin; Iribarren, Carlos; Basart, Hanneke; Hovingh, Kees G; Hyppönen, Elina; Power, Chris; Anderson, Denise; Beilby, John P; Hui, Jennie; Jolley, Jennifer; Sager, Hendrik; Bornstein, Stefan R; Schwarz, Peter E H; Kristiansson, Kati; Perola, Markus; Lindström, Jaana; Swift, Amy J; Uusitupa, Matti; Atalay, Mustafa; Lakka, Timo A; Rauramaa, Rainer; Bolton, Jennifer L; Fowkes, Gerry; Fraser, Ross M; Price, Jackie F; Fischer, Krista; Krjutå Kov, Kaarel; Metspalu, Andres; Mihailov, Evelin; Langenberg, Claudia; Luan, Jian'an; Ong, Ken K; Chines, Peter S; Keinanen-Kiukaanniemi, Sirkka M; Saaristo, Timo E; Edkins, Sarah; Franks, Paul W; Hallmans, Göran; Shungin, Dmitry; Morris, Andrew David; Palmer, Colin N A; Erbel, Raimund; Moebus, Susanne; Nöthen, Markus M; Pechlivanis, Sonali; Hveem, Kristian; Narisu, Narisu; Hamsten, Anders; Humphries, Steve E; Strawbridge, Rona J; Tremoli, Elena; Grallert, Harald; Thorand, Barbara; Illig, Thomas; Koenig, Wolfgang; Müller-Nurasyid, Martina; Peters, Annette; Boehm, Bernhard O; Kleber, Marcus E; März, Winfried; Winkelmann, Bernhard R; Kuusisto, Johanna; Laakso, Markku; Arveiler, Dominique; Cesana, Giancarlo; Kuulasmaa, Kari; Virtamo, Jarmo; Yarnell, John W G; Kuh, Diana; Wong, Andrew; Lind, Lars; de Faire, Ulf; Gigante, Bruna; Magnusson, Patrik K E; Pedersen, Nancy L; Dedoussis, George; Dimitriou, Maria; Kolovou, Genovefa; Kanoni, Stavroula; Stirrups, Kathleen; Bonnycastle, Lori L; Njølstad, Inger; Wilsgaard, Tom; Ganna, Andrea; Rehnberg, Emil; Hingorani, Aroon; Kivimaki, Mika; Kumari, Meena; Assimes, Themistocles L; Barroso, Inês; Boehnke, Michael; Borecki, Ingrid B; Deloukas, Panos; Fox, Caroline S; Frayling, Timothy; Groop, Leif C; Haritunians, Talin; Hunter, David; Ingelsson, Erik; Kaplan, Robert; Mohlke, Karen L; O'Connell, Jeffrey R; Schlessinger, David; Strachan, David P; Stefansson, Kari; van Duijn, Cornelia M; Abecasis, Gonçalo R; McCarthy, Mark I; Hirschhorn, Joel N; Qi, Lu; Loos, Ruth J F; Lindgren, Cecilia M; North, Kari E; Heid, Iris M
2013-06-01
Given the anthropometric differences between men and women and previous evidence of sex-difference in genetic effects, we conducted a genome-wide search for sexually dimorphic associations with height, weight, body mass index, waist circumference, hip circumference, and waist-to-hip-ratio (133,723 individuals) and took forward 348 SNPs into follow-up (additional 137,052 individuals) in a total of 94 studies. Seven loci displayed significant sex-difference (FDR<5%), including four previously established (near GRB14/COBLL1, LYPLAL1/SLC30A10, VEGFA, ADAMTS9) and three novel anthropometric trait loci (near MAP3K1, HSD17B4, PPARG), all of which were genome-wide significant in women (P<5×10(-8)), but not in men. Sex-differences were apparent only for waist phenotypes, not for height, weight, BMI, or hip circumference. Moreover, we found no evidence for genetic effects with opposite directions in men versus women. The PPARG locus is of specific interest due to its role in diabetes genetics and therapy. Our results demonstrate the value of sex-specific GWAS to unravel the sexually dimorphic genetic underpinning of complex traits.
Jackson, Anne U.; Monda, Keri L.; Kilpeläinen, Tuomas O.; Esko, Tõnu; Mägi, Reedik; Li, Shengxu; Workalemahu, Tsegaselassie; Feitosa, Mary F.; Croteau-Chonka, Damien C.; Day, Felix R.; Fall, Tove; Ferreira, Teresa; Gustafsson, Stefan; Locke, Adam E.; Mathieson, Iain; Scherag, Andre; Vedantam, Sailaja; Wood, Andrew R.; Liang, Liming; Steinthorsdottir, Valgerdur; Thorleifsson, Gudmar; Dermitzakis, Emmanouil T.; Dimas, Antigone S.; Karpe, Fredrik; Min, Josine L.; Nicholson, George; Clegg, Deborah J.; Person, Thomas; Krohn, Jon P.; Bauer, Sabrina; Buechler, Christa; Eisinger, Kristina; Bonnefond, Amélie; Froguel, Philippe; Hottenga, Jouke-Jan; Prokopenko, Inga; Waite, Lindsay L.; Harris, Tamara B.; Smith, Albert Vernon; Shuldiner, Alan R.; McArdle, Wendy L.; Caulfield, Mark J.; Munroe, Patricia B.; Grönberg, Henrik; Chen, Yii-Der Ida; Li, Guo; Beckmann, Jacques S.; Johnson, Toby; Thorsteinsdottir, Unnur; Teder-Laving, Maris; Khaw, Kay-Tee; Wareham, Nicholas J.; Zhao, Jing Hua; Amin, Najaf; Oostra, Ben A.; Kraja, Aldi T.; Province, Michael A.; Cupples, L. Adrienne; Heard-Costa, Nancy L.; Kaprio, Jaakko; Ripatti, Samuli; Surakka, Ida; Collins, Francis S.; Saramies, Jouko; Tuomilehto, Jaakko; Jula, Antti; Salomaa, Veikko; Erdmann, Jeanette; Hengstenberg, Christian; Loley, Christina; Schunkert, Heribert; Lamina, Claudia; Wichmann, H. Erich; Albrecht, Eva; Gieger, Christian; Hicks, Andrew A.; Johansson, Åsa; Pramstaller, Peter P.; Kathiresan, Sekar; Speliotes, Elizabeth K.; Penninx, Brenda; Hartikainen, Anna-Liisa; Jarvelin, Marjo-Riitta; Gyllensten, Ulf; Boomsma, Dorret I.; Campbell, Harry; Wilson, James F.; Chanock, Stephen J.; Farrall, Martin; Goel, Anuj; Medina-Gomez, Carolina; Rivadeneira, Fernando; Estrada, Karol; Uitterlinden, André G.; Hofman, Albert; Zillikens, M. Carola; den Heijer, Martin; Kiemeney, Lambertus A.; Maschio, Andrea; Hall, Per; Tyrer, Jonathan; Teumer, Alexander; Völzke, Henry; Kovacs, Peter; Tönjes, Anke; Mangino, Massimo; Spector, Tim D.; Hayward, Caroline; Rudan, Igor; Hall, Alistair S.; Samani, Nilesh J.; Attwood, Antony Paul; Sambrook, Jennifer G.; Hung, Joseph; Palmer, Lyle J.; Lokki, Marja-Liisa; Sinisalo, Juha; Boucher, Gabrielle; Huikuri, Heikki; Lorentzon, Mattias; Ohlsson, Claes; Eklund, Niina; Eriksson, Johan G.; Barlassina, Cristina; Rivolta, Carlo; Nolte, Ilja M.; Snieder, Harold; Van der Klauw, Melanie M.; Van Vliet-Ostaptchouk, Jana V.; Gejman, Pablo V.; Shi, Jianxin; Jacobs, Kevin B.; Wang, Zhaoming; Bakker, Stephan J. L.; Mateo Leach, Irene; Navis, Gerjan; van der Harst, Pim; Martin, Nicholas G.; Medland, Sarah E.; Montgomery, Grant W.; Yang, Jian; Chasman, Daniel I.; Ridker, Paul M.; Rose, Lynda M.; Lehtimäki, Terho; Raitakari, Olli; Absher, Devin; Iribarren, Carlos; Basart, Hanneke; Hovingh, Kees G.; Hyppönen, Elina; Power, Chris; Anderson, Denise; Beilby, John P.; Hui, Jennie; Jolley, Jennifer; Sager, Hendrik; Bornstein, Stefan R.; Schwarz, Peter E. H.; Kristiansson, Kati; Perola, Markus; Lindström, Jaana; Swift, Amy J.; Uusitupa, Matti; Atalay, Mustafa; Lakka, Timo A.; Rauramaa, Rainer; Bolton, Jennifer L.; Fowkes, Gerry; Fraser, Ross M.; Price, Jackie F.; Fischer, Krista; KrjutÅ¡kov, Kaarel; Metspalu, Andres; Mihailov, Evelin; Langenberg, Claudia; Luan, Jian'an; Ong, Ken K.; Chines, Peter S.; Keinanen-Kiukaanniemi, Sirkka M.; Saaristo, Timo E.; Edkins, Sarah; Franks, Paul W.; Hallmans, Göran; Shungin, Dmitry; Morris, Andrew David; Palmer, Colin N. A.; Erbel, Raimund; Moebus, Susanne; Nöthen, Markus M.; Pechlivanis, Sonali; Hveem, Kristian; Narisu, Narisu; Hamsten, Anders; Humphries, Steve E.; Strawbridge, Rona J.; Tremoli, Elena; Grallert, Harald; Thorand, Barbara; Illig, Thomas; Koenig, Wolfgang; Müller-Nurasyid, Martina; Peters, Annette; Boehm, Bernhard O.; Kleber, Marcus E.; März, Winfried; Winkelmann, Bernhard R.; Kuusisto, Johanna; Laakso, Markku; Arveiler, Dominique; Cesana, Giancarlo; Kuulasmaa, Kari; Virtamo, Jarmo; Yarnell, John W. G.; Kuh, Diana; Wong, Andrew; Lind, Lars; de Faire, Ulf; Gigante, Bruna; Magnusson, Patrik K. E.; Pedersen, Nancy L.; Dedoussis, George; Dimitriou, Maria; Kolovou, Genovefa; Kanoni, Stavroula; Stirrups, Kathleen; Bonnycastle, Lori L.; Njølstad, Inger; Wilsgaard, Tom; Ganna, Andrea; Rehnberg, Emil; Hingorani, Aroon; Kivimaki, Mika; Kumari, Meena; Assimes, Themistocles L.; Barroso, Inês; Boehnke, Michael; Borecki, Ingrid B.; Deloukas, Panos; Fox, Caroline S.; Frayling, Timothy; Groop, Leif C.; Haritunians, Talin; Hunter, David; Ingelsson, Erik; Kaplan, Robert; Mohlke, Karen L.; O'Connell, Jeffrey R.; Schlessinger, David; Strachan, David P.; Stefansson, Kari; van Duijn, Cornelia M.; Abecasis, Gonçalo R.; McCarthy, Mark I.; Hirschhorn, Joel N.; Qi, Lu; Loos, Ruth J. F.; Lindgren, Cecilia M.; North, Kari E.; Heid, Iris M.
2013-01-01
Given the anthropometric differences between men and women and previous evidence of sex-difference in genetic effects, we conducted a genome-wide search for sexually dimorphic associations with height, weight, body mass index, waist circumference, hip circumference, and waist-to-hip-ratio (133,723 individuals) and took forward 348 SNPs into follow-up (additional 137,052 individuals) in a total of 94 studies. Seven loci displayed significant sex-difference (FDR<5%), including four previously established (near GRB14/COBLL1, LYPLAL1/SLC30A10, VEGFA, ADAMTS9) and three novel anthropometric trait loci (near MAP3K1, HSD17B4, PPARG), all of which were genome-wide significant in women (P<5×10−8), but not in men. Sex-differences were apparent only for waist phenotypes, not for height, weight, BMI, or hip circumference. Moreover, we found no evidence for genetic effects with opposite directions in men versus women. The PPARG locus is of specific interest due to its role in diabetes genetics and therapy. Our results demonstrate the value of sex-specific GWAS to unravel the sexually dimorphic genetic underpinning of complex traits. PMID:23754948
Qi, Lu; Speliotes, Elizabeth K.; Thorleifsson, Gudmar; Willer, Cristen J.; Herrera, Blanca M.; Jackson, Anne U.; Lim, Noha; Scheet, Paul; Soranzo, Nicole; Amin, Najaf; Aulchenko, Yurii S.; Chambers, John C.; Drong, Alexander; Luan, Jian'an; Lyon, Helen N.; Rivadeneira, Fernando; Sanna, Serena; Timpson, Nicholas J.; Zillikens, M. Carola; Zhao, Jing Hua; Almgren, Peter; Bandinelli, Stefania; Bennett, Amanda J.; Bergman, Richard N.; Bonnycastle, Lori L.; Bumpstead, Suzannah J.; Chanock, Stephen J.; Cherkas, Lynn; Chines, Peter; Coin, Lachlan; Cooper, Cyrus; Crawford, Gabriel; Doering, Angela; Dominiczak, Anna; Doney, Alex S. F.; Ebrahim, Shah; Elliott, Paul; Erdos, Michael R.; Estrada, Karol; Ferrucci, Luigi; Fischer, Guido; Forouhi, Nita G.; Gieger, Christian; Grallert, Harald; Groves, Christopher J.; Grundy, Scott; Guiducci, Candace; Hadley, David; Hamsten, Anders; Havulinna, Aki S.; Hofman, Albert; Holle, Rolf; Holloway, John W.; Illig, Thomas; Isomaa, Bo; Jacobs, Leonie C.; Jameson, Karen; Jousilahti, Pekka; Karpe, Fredrik; Kuusisto, Johanna; Laitinen, Jaana; Lathrop, G. Mark; Lawlor, Debbie A.; Mangino, Massimo; McArdle, Wendy L.; Meitinger, Thomas; Morken, Mario A.; Morris, Andrew P.; Munroe, Patricia; Narisu, Narisu; Nordström, Anna; Nordström, Peter; Oostra, Ben A.; Palmer, Colin N. A.; Payne, Felicity; Peden, John F.; Prokopenko, Inga; Renström, Frida; Ruokonen, Aimo; Salomaa, Veikko; Sandhu, Manjinder S.; Scott, Laura J.; Scuteri, Angelo; Silander, Kaisa; Song, Kijoung; Yuan, Xin; Stringham, Heather M.; Swift, Amy J.; Tuomi, Tiinamaija; Uda, Manuela; Vollenweider, Peter; Waeber, Gerard; Wallace, Chris; Walters, G. Bragi; Weedon, Michael N.; Witteman, Jacqueline C. M.; Zhang, Cuilin; Zhang, Weihua; Caulfield, Mark J.; Collins, Francis S.; Davey Smith, George; Day, Ian N. M.; Franks, Paul W.; Hattersley, Andrew T.; Hu, Frank B.; Jarvelin, Marjo-Riitta; Kong, Augustine; Kooner, Jaspal S.; Laakso, Markku; Lakatta, Edward; Mooser, Vincent; Morris, Andrew D.; Peltonen, Leena; Samani, Nilesh J.; Spector, Timothy D.; Strachan, David P.; Tanaka, Toshiko; Tuomilehto, Jaakko; Uitterlinden, André G.; van Duijn, Cornelia M.; Wareham, Nicholas J.; Watkins for the PROCARDIS consortia, Hugh; Waterworth, Dawn M.; Boehnke, Michael; Deloukas, Panos; Groop, Leif; Hunter, David J.; Thorsteinsdottir, Unnur; Schlessinger, David; Wichmann, H.-Erich; Frayling, Timothy M.; Abecasis, Gonçalo R.; Hirschhorn, Joel N.; Loos, Ruth J. F.; Stefansson, Kari; Mohlke, Karen L.; Barroso, Inês; McCarthy for the GIANT consortium, Mark I.
2009-01-01
To identify genetic loci influencing central obesity and fat distribution, we performed a meta-analysis of 16 genome-wide association studies (GWAS, N = 38,580) informative for adult waist circumference (WC) and waist–hip ratio (WHR). We selected 26 SNPs for follow-up, for which the evidence of association with measures of central adiposity (WC and/or WHR) was strong and disproportionate to that for overall adiposity or height. Follow-up studies in a maximum of 70,689 individuals identified two loci strongly associated with measures of central adiposity; these map near TFAP2B (WC, P = 1.9×10−11) and MSRA (WC, P = 8.9×10−9). A third locus, near LYPLAL1, was associated with WHR in women only (P = 2.6×10−8). The variants near TFAP2B appear to influence central adiposity through an effect on overall obesity/fat-mass, whereas LYPLAL1 displays a strong female-only association with fat distribution. By focusing on anthropometric measures of central obesity and fat distribution, we have identified three loci implicated in the regulation of human adiposity. PMID:19557161
Hegarty, Mary; Canham, Matt S; Fabrikant, Sara I
2010-01-01
Three experiments examined how bottom-up and top-down processes interact when people view and make inferences from complex visual displays (weather maps). Bottom-up effects of display design were investigated by manipulating the relative visual salience of task-relevant and task-irrelevant information across different maps. Top-down effects of domain knowledge were investigated by examining performance and eye fixations before and after participants learned relevant meteorological principles. Map design and knowledge interacted such that salience had no effect on performance before participants learned the meteorological principles; however, after learning, participants were more accurate if they viewed maps that made task-relevant information more visually salient. Effects of display design on task performance were somewhat dissociated from effects of display design on eye fixations. The results support a model in which eye fixations are directed primarily by top-down factors (task and domain knowledge). They suggest that good display design facilitates performance not just by guiding where viewers look in a complex display but also by facilitating processing of the visual features that represent task-relevant information at a given display location. (PsycINFO Database Record (c) 2009 APA, all rights reserved).
Characterization of six human disease-associated inversion polymorphisms.
Antonacci, Francesca; Kidd, Jeffrey M; Marques-Bonet, Tomas; Ventura, Mario; Siswara, Priscillia; Jiang, Zhaoshi; Eichler, Evan E
2009-07-15
The human genome is a highly dynamic structure that shows a wide range of genetic polymorphic variation. Unlike other types of structural variation, little is known about inversion variants within normal individuals because such events are typically balanced and are difficult to detect and analyze by standard molecular approaches. Using sequence-based, cytogenetic and genotyping approaches, we characterized six large inversion polymorphisms that map to regions associated with genomic disorders with complex segmental duplications mapping at the breakpoints. We developed a metaphase FISH-based assay to genotype inversions and analyzed the chromosomes of 27 individuals from three HapMap populations. In this subset, we find that these inversions are less frequent or absent in Asians when compared with European and Yoruban populations. Analyzing multiple individuals from outgroup species of great apes, we show that most of these large inversion polymorphisms are specific to the human lineage with two exceptions, 17q21.31 and 8p23 inversions, which are found to be similarly polymorphic in other great ape species and where the inverted allele represents the ancestral state. Investigating linkage disequilibrium relationships with genotyped SNPs, we provide evidence that most of these inversions appear to have arisen on at least two different haplotype backgrounds. In these cases, discovery and genotyping methods based on SNPs may be confounded and molecular cytogenetics remains the only method to genotype these inversions.
Slattery, Martha L.; Lundgreen, Abbie; John, Esther M.; Torres-Mejia, Gabriela; Hines, Lisa; Giuliano, Anna R.; Baumgartner, Kathy B.; Stern, Mariana C.; Wolff, Roger K.
2015-01-01
Mitogen-activated protein kinases (MAPK) are integration points for multiple biochemical signals. We evaluated 13 MAPK genes with breast cancer risk and determined if diet and lifestyle factors mediated risk. Data from three population-based case-control studies conducted in Southwestern United States, California, and Mexico included 4183 controls and 3592 cases. Percent Indigenous American (IA) ancestry was determined from 104 Ancestry Informative Markers. The adaptive rank truncated product (ARTP) was used to determine the significance of each gene and the pathway with breast cancer risk, by menopausal status, genetic ancestry level, and ER/PR strata. MAP3K9 was associated with breast cancer overall (PARTP=0.02) with strongest association among women with the highest IA ancestry (PARTP=0.04). Several SNPs in MAP3K9 were associated with ER+/PR+ tumors and interacted with dietary oxidative balance score (DOBS), dietary folate, body mass index (BMI), alcohol consumption, cigarette smoking, and a history of diabetes. DUSP4 and MAPK8 interacted with calories to alter breast cancer risk; MAPK1 interacted with DOBS, dietary fiber, folate and BMI; MAP3K2 interacted with dietary fat; and MAPK14 interacted with dietary folate and BMI. The patterns of association across diet and lifestyle factors with similar biological properties for the same SNPs within genes provide support for associations. PMID:25629224
Identification of rs671, a common variant of ALDH2, as a gout susceptibility locus.
Sakiyama, Masayuki; Matsuo, Hirotaka; Nakaoka, Hirofumi; Yamamoto, Ken; Nakayama, Akiyoshi; Nakamura, Takahiro; Kawai, Sayo; Okada, Rieko; Ooyama, Hiroshi; Shimizu, Toru; Shinomiya, Nariyoshi
2016-05-16
Gout is a common disease resulting from hyperuricemia. Recently, a genome-wide association study identified an association between gout and a single nucleotide polymorphism (SNP) rs2188380, located on an intergenic region between MYL2 and CUX2 on chromosome 12. However, other genes around rs2188380 could possibly be gout susceptibility genes. Therefore, we performed a fine-mapping study of the MYL2-CUX2 region. From 8,595 SNPs in the MYL2-CUX2 region, 9 tag SNPs were selected, and genotyping of 1,048 male gout patients and 1,334 male controls was performed by TaqMan method. Eight SNPs showed significant associations with gout after Bonferroni correction. rs671 (Glu504Lys) of ALDH2 had the most significant association with gout (P = 1.7 × 10(-18), odds ratio = 0.53). After adjustment for rs671, the other 8 SNPs no longer showed a significant association with gout, while the significant association of rs671 remained. rs671 has been reportedly associated with alcohol drinking behavior, and it is well-known that alcohol drinking elevates serum uric acid levels. These data suggest that rs671, a common functional SNP of ALDH2, is a genuine gout-associated SNP in the MYL2-CUX2 locus and that "A" allele (Lys) of rs671 plays a protective role in the development of gout.
Common Variants in Mendelian Kidney Disease Genes and Their Association with Renal Function
Fuchsberger, Christian; Köttgen, Anna; O’Seaghdha, Conall M.; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I.; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J.; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V.; O’Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H.-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M.; Borecki, Ingrid; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M.; Bochud, Murielle; Heid, Iris M.; Siscovick, David S.; Fox, Caroline S.; Kao, W. Linda; Böger, Carsten A.
2013-01-01
Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research. PMID:24029420
Valluru, Ravi; Reynolds, Matthew P; Davies, William J; Sukumaran, Sivakumar
2017-04-01
The gaseous phytohormone ethylene plays an important role in spike development in wheat (Triticum aestivum). However, the genotypic variation and the genomic regions governing spike ethylene (SET) production in wheat under long-term heat stress remain unexplored. We investigated genotypic variation in the production of SET and its relationship with spike dry weight (SDW) in 130 diverse wheat elite lines and landraces under heat-stressed field conditions. We employed an Illumina iSelect 90K single nucleotide polymorphism (SNP) genotyping array to identify the genetic loci for SET and SDW through a genome-wide association study (GWAS) in a subset of the Wheat Association Mapping Initiative (WAMI) panel. The SET and SDW exhibited appreciable genotypic variation among wheat genotypes at the anthesis stage. There was a strong negative correlation between SET and SDW. The GWAS uncovered five and 32 significant SNPs for SET, and 22 and 142 significant SNPs for SDW, in glasshouse and field conditions, respectively. Some of these SNPs closely localized to the SNPs for plant height, suggesting close associations between plant height and spike-related traits. The phenotypic and genetic elucidation of SET and its relationship with SDW supports future efforts toward gene discovery and breeding wheat cultivars with reduced ethylene effects on yield under heat stress. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Effects of extended aging and modified atmospheric packaging on beef top loin steak color.
English, A R; Mafi, G G; VanOverbeke, D L; Ramanathan, R
2016-04-01
The objective of this study was to evaluate the effects of extended aging and modified atmospheric packaging on beef LM color. Using a randomized complete block design, each beef longissimus lumborum muscle ( = 10; USDA Choice, 3 d postmortem) was equally divided into 3 sections and randomly assigned to 1 of 3 aging periods (21, 42, or 62 d at 2°C). After respective aging, each loin section was cut into four 2.5-cm-thick steaks and randomly assigned to 1 of 3 packaging types (PVC, HiOx-MAP [80% oxygen and 20% carbon dioxide], or CO-MAP [0.4% carbon monoxide, 69.6% nitrogen, and 30% carbon dioxide]). The steaks were displayed under continuous fluorescent lighting for 6 d, and surface color was determined daily using a HunterLab Miniscan XE Plus spectrophotometer and a visual panel. The fourth steak was used to characterize oxygen consumption (OC), lipid oxidation, and metmyoglobin reducing activity (MRA) on 21, 42, and 62 d (before display). On d 6 display, MRA, OC, and lipid oxidation also were measured. An increase in aging time decreased ( < 0.0001) muscle pH. Loin sections aged for 42 and 62 d had a lower ( < 0.0002) pH compared with loin sections aged for 21 d. An aging period × packaging × display time interaction ( < 0.0001) resulted for a* values (redness), chroma, and visual color (muscle color and surface discoloration). As aging time increased, HiOx-MAP had the most discoloration ( < 0.0001) compared with other packaging types on d 6. At all aging periods, steaks packaged in CO-MAP had greater ( < 0.0001) MRA on d 6 than PVC and HiOx-MAP, whereas steaks packaged in HiOx-MAP had the least MRA ( < 0.0001). There were no differences ( = 0.34) in thiobarbituric acid reactive substances values between steaks aged for 21 and 42 d when steaks were packaged in CO-MAP and displayed for 6 d. However, steaks packaged in HiOx-MAP and displayed 6 d had greater ( < 0.0001) lipid oxidation than CO-MAP. Steaks packaged in HiOx-MAP had a lower ( < 0.0001) OC compared with PVC and CO-MAP when aged for 42 and 62 d. There were no differences ( = 0.49) in OC between steaks packaged in PVC and HiOx-MAP when aged for 21 d and displayed 6 d. The results indicate that extended aging is detrimental to color stability when packaged in PVC and HiOx-MAP. However, steaks in CO-MAP had stable red color during display. Decreased color stability in PVC and HiOx-MAP could be associated, in part, with decreased MRA and OC.
Luzardo, S; Woerner, D R; Geornaras, I; Hess, A M; Belk, K E
2016-06-01
Two studies were conducted to evaluate the influence of packaging during storage of strip loins (to simulate export shipment) from steers fattened on intensive grazing systems (Uruguay; UR) or on a high-concentrate diet (United States; US) on retail display life microbial growth. Four or 3 different packaging treatments were applied to UR and US strip loin roasts or steaks during 35 d of storage; treatments were applied 7 d following slaughter. After 35 d of storage, the samples were evaluated during simulated retail display for up to 6 d. In Exp. 1, the treatments were vacuum packaging (VP), low-oxygen modified atmosphere packaging (MAP) with N and CO (MAP/CO), low-oxygen MAP with N plus CO and CO, and VP plus an application of peroxyacetic acid (VP/PAA). In Exp. 2, block 1, the treatments were VP, MAP/CO, and VP with ethyl--lauroyl--arginate HCl incorporated into the film as an antimicrobial agent (VP/AM). In Exp. 2, block 2, the treatments were VP, MAP/CO, MAP/CO, and VP/AM. For retail display, VP treatments were sliced and repackaged in PVC overwrap, and MAP treatments were actually PVC overwrap trays that were removed from a master bag with the prescribed gas treatment. Regardless of production system and packaging treatment, mesophilic and psychrotrophic counts of 6.9 to 7.8 and 6.7 to 7.7 log10 CFU/cm, respectively, were obtained at the end of retail display, except for US samples in Exp. 2 (5.5 to 6.3 log CFU/cm). No differences ( > 0.05) were detected for spp. counts among packaging treatments in US steaks at the end of the display time in Exp.1, whereas, for UR steaks, both MAP treatments had lower ( < 0.05) spp. counts than VP treatments. spp. counts were lower ( < 0.05) in the MAP/CO treatment than in the other 3 treatments in US samples on d 6 of retail display for Exp. 2. At the end of display time and for Exp. 1, US steaks under MAP/CO had greater ( < 0.05) lactic acid bacteria (LAB) counts than samples in both VP treatments; no differences ( > 0.05) among packaging were detected for UR steaks. Both MAP and VP/AM treatments in the US samples for Exp. 2 had lower ( < 0.05) LAB counts on d 6 of display than the VP treatment, but no differences ( > 0.05) were found among packaging treatments for the UR samples. To maximize shelf life (storage and display life) of exported fresh beef, it is critical to minimize bacterial populations during processing and storage.
SNPchiMp: a database to disentangle the SNPchip jungle in bovine livestock.
Nicolazzi, Ezequiel Luis; Picciolini, Matteo; Strozzi, Francesco; Schnabel, Robert David; Lawley, Cindy; Pirani, Ali; Brew, Fiona; Stella, Alessandra
2014-02-11
Currently, six commercial whole-genome SNP chips are available for cattle genotyping, produced by two different genotyping platforms. Technical issues need to be addressed to combine data that originates from the different platforms, or different versions of the same array generated by the manufacturer. For example: i) genome coordinates for SNPs may refer to different genome assemblies; ii) reference genome sequences are updated over time changing the positions, or even removing sequences which contain SNPs; iii) not all commercial SNP ID's are searchable within public databases; iv) SNPs can be coded using different formats and referencing different strands (e.g. A/B or A/C/T/G alleles, referencing forward/reverse, top/bottom or plus/minus strand); v) Due to new information being discovered, higher density chips do not necessarily include all the SNPs present in the lower density chips; and, vi) SNP IDs may not be consistent across chips and platforms. Most researchers and breed associations manage SNP data in real-time and thus require tools to standardise data in a user-friendly manner. Here we present SNPchiMp, a MySQL database linked to an open access web-based interface. Features of this interface include, but are not limited to, the following functions: 1) referencing the SNP mapping information to the latest genome assembly, 2) extraction of information contained in dbSNP for SNPs present in all commercially available bovine chips, and 3) identification of SNPs in common between two or more bovine chips (e.g. for SNP imputation from lower to higher density). In addition, SNPchiMp can retrieve this information on subsets of SNPs, accessing such data either via physical position on a supported assembly, or by a list of SNP IDs, rs or ss identifiers. This tool combines many different sources of information, that otherwise are time consuming to obtain and difficult to integrate. The SNPchiMp not only provides the information in a user-friendly format, but also enables researchers to perform a large number of operations with a few clicks of the mouse. This significantly reduces the time needed to execute the large number of operations required to manage SNP data.
Zhang, Suhua; Bian, Yingnan; Chen, Anqi; Zheng, Hancheng; Gao, Yuzhen; Hou, Yiping; Li, Chengtao
2017-03-01
Utilizing massively parallel sequencing (MPS) technology for SNP testing in forensic genetics is becoming attractive because of the shortcomings of STR markers, such as their high mutation rates and disadvantages associated with the current PCR-CE method as well as its limitations regarding multiplex capabilities. MPS offers the potential to genotype hundreds to thousands of SNPs from multiple samples in a single experimental run. In this study, we designed a customized SNP panel that includes 273 forensically relevant identity SNPs chosen from SNPforID, IISNP, and the HapMap database as well as previously related studies and evaluated the levels of genotyping precision, sequence coverage, sensitivity and SNP performance using the Ion Torrent PGM. In a concordant study of the custom MPS-SNP panel, only four MPS callings were missing due to coverage reads that were too low (<20), whereas the others were fully concordant with Sanger's sequencing results across the two control samples, that is, 9947A and 9948. The analyses indicated a balanced coverage among the included loci, with the exception of the 16 SNPs that were used to detect an inconsistent allele balance and/or lower coverage reads among 50 tested individuals from the Chinese HAN population and the above controls. With the exception of the 16 poorly performing SNPs, the sequence coverage obtained was extensive for the bulk of the SNPs, and only three Y-SNPs (rs16980601, rs11096432, rs3900) showed a mean coverage below 1000. Analyses of the dilution series of control DNA 9948 yielded reproducible results down to 1ng of DNA input. In addition, we provide an analysis tool for automated data quality control and genotyping checks, and we conclude that the SNP targets are polymorphic and independent in the Chinese HAN population. In summary, the evaluation of the sensitivity, accuracy and genotyping performance provides strong support for the application of MPS technology in forensic SNP analysis, and the assay offers a straightforward sample-to-genotype workflow that could be beneficial in forensic casework with respect to both individual identification and complex kinship issues. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
RTEL1 tagging SNPs and haplotypes were associated with glioma development.
Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong
2013-05-17
As glioma ranks as the first most prevalent solid tumors in primary central nervous system, certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk, and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been proven by classical literatures and reliable databases to be tended to relate with gliomas, and with the minor allele frequency (MAF)>5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay calibrated. Two significant tSNPs in RTEL1 gene were observed to be associated with glioma risk (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. It was identified the genotype "GG" of rs6010620 acted as the protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), while the genotype "CC" of rs2297440 as the protective genotype in glioma (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in RTEL1 gene was found to be associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), and haplotype "ATT" was detected to be associated with risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). Two single variants, the genotypes of "GG" of rs6010620 and "CC" of rs2297440 (rs6010620 and rs2297440) in the RTEL1 gene, together with two haplotypes of GCT and ATT, were identified to be associated with glioma development. And it might be used to evaluate the glioma development risks to screen the above RTEL1 tagging SNPs and haplotypes. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.
O'Brien, Timothy D; Jia, Peilin; Caporaso, Neil E; Landi, Maria Teresa; Zhao, Zhongming
2018-02-27
There are two main types of lung cancer: small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). NSCLC has many subtypes, but the two most common are lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC). These subtypes are mainly classified by physiological and pathological characteristics, although there is increasing evidence of genetic and molecular differences as well. Although some work has been done at the somatic level to explore the genetic and biological differences among subtypes, little work has been done that interrogates these differences at the germline level to characterize the unique and shared susceptibility genes for each subtype. We used single-nucleotide polymorphisms (SNPs) from a genome-wide association study (GWAS) of European samples to interrogate the similarity of the subtypes at the SNP, gene, pathway, and regulatory levels. We expanded these genotyped SNPs to include all SNPs in linkage disequilibrium (LD) using data from the 1000 Genomes Project. We mapped these SNPs to several lung tissue expression quantitative trait loci (eQTL) and enhancer datasets to identify regulatory SNPs and their target genes. We used these genes to perform a biological pathway analysis for each subtype. We identified 8295, 8734, and 8361 SNPs with moderate association signals for LUAD, LUSC, and SCLC, respectively. Those SNPs had p < 1 × 10 - 3 in the original GWAS or were within LD (r 2 > 0.8, Europeans) to the genotyped SNPs. We identified 215, 320, and 172 disease-associated genes for LUAD, LUSC, and SCLC, respectively. Only five genes (CHRNA5, IDH3A, PSMA4, RP11-650 L12.2, and TBC1D2B) overlapped all subtypes. Furthermore, we observed only two pathways from the Kyoto Encyclopedia of Genes and Genomes shared by all subtypes. At the regulatory level, only three eQTL target genes and two enhancer target genes overlapped between all subtypes. Our results suggest that the three lung cancer subtypes do not share much genetic signal at the SNP, gene, pathway, or regulatory level, which differs from the common subtype classification based upon histology. However, three (CHRNA5, IDH3A, and PSMA4) of the five genes shared between the subtypes are well-known lung cancer genes that may act as general lung cancer genes regardless of subtype.
Image display device in digital TV
Choi, Seung Jong [Seoul, KR
2006-07-18
Disclosed is an image display device in a digital TV that is capable of carrying out the conversion into various kinds of resolution by using single bit map data in the digital TV. The image display device includes: a data processing part for executing bit map conversion, compression, restoration and format-conversion for text data; a memory for storing the bit map data obtained according to the bit map conversion and compression in the data processing part and image data inputted from an arbitrary receiving part, the receiving part receiving one of digital image data and analog image data; an image outputting part for reading the image data from the memory; and a display processing part for mixing the image data read from the image outputting part and the bit map data converted in format from the a data processing part. Therefore, the image display device according to the present invention can convert text data in such a manner as to correspond with various resolution, carry out the compression for bit map data, thereby reducing the memory space, and support text data of an HTML format, thereby providing the image with the text data of various shapes.
Ingle, James N.; Schaid, Daniel J.; Goss, Paul E.; Liu, Mohan; Mushiroda, Taisei; Chapman, Judy-Anne W.; Kubo, Michiaki; Jenkins, Gregory D.; Batzler, Anthony; Shepherd, Lois; Pater, Joseph; Wang, Liewei; Ellis, Matthew J.; Stearns, Vered; Rohrer, Daniel C.; Goetz, Matthew P.; Pritchard, Kathleen I.; Flockhart, David A.; Nakamura, Yusuke; Weinshilboum, Richard M.
2010-01-01
Purpose We performed a case-control genome-wide association study (GWAS) to identify single nucleotide polymorphisms (SNPs) associated with musculoskeletal adverse events (MS-AEs) in women treated with aromatase inhibitors (AIs) for early breast cancer. Patients and Methods A nested case-control design was used to select patients enrolled onto the MA.27 phase III trial comparing anastrozole with exemestane. Cases were matched to two controls and were defined as patients with grade 3 or 4 MS-AEs (according to the National Cancer Institute's Common Terminology Criteria for Adverse Events v3.0) or those who discontinued treatment for any grade of MS-AE within the first 2 years. Genotyping was performed with the Illumina Human610-Quad BeadChip. Results The GWAS included 293 cases and 585 controls. A total of 551,358 SNPs were analyzed, followed by imputation and fine mapping of a region of interest on chromosome 14. Four SNPs on chromosome 14 had the lowest P values (2.23E-06 to 6.67E-07). T-cell leukemia 1A (TCL1A) was the gene closest (926-7000 bp) to the four SNPs. Functional genomic studies revealed that one of these SNPs (rs11849538) created an estrogen response element and that TCL1A expression was estrogen dependent, was associated with the variant SNP genotypes in estradiol-treated lymphoblastoid cells transfected with estrogen receptor alpha and was directly related to interleukin 17 receptor A (IL17RA) expression. Conclusion This GWAS identified SNPs associated with MS-AEs in women treated with AIs and with a gene (TCL1A) which, in turn, was related to a cytokine (IL17). These findings provide a focus for further research to identify patients at risk for MS-AEs and to explore the mechanisms for these adverse events. PMID:20876420
Penetrance of Polygenic Obesity Susceptibility Loci across the Body Mass Index Distribution.
Abadi, Arkan; Alyass, Akram; Robiou du Pont, Sebastien; Bolker, Ben; Singh, Pardeep; Mohan, Viswanathan; Diaz, Rafael; Engert, James C; Yusuf, Salim; Gerstein, Hertzel C; Anand, Sonia S; Meyre, David
2017-12-07
A growing number of single-nucleotide polymorphisms (SNPs) have been associated with body mass index (BMI) and obesity, but whether the effects of these obesity-susceptibility loci are uniform across the BMI distribution remains unclear. We studied the effects of 37 BMI-associated SNPs in 75,230 adults of European ancestry across BMI percentiles by using conditional quantile regression (CQR) and meta-regression (MR) models. The effects of nine SNPs (24%)-rs1421085 (FTO; p = 8.69 × 10 -15 ), rs6235 (PCSK1; p = 7.11 × 10 -6 ), rs7903146 (TCF7L2; p = 9.60 × 10 -6 ), rs11873305 (MC4R; p = 5.08 × 10 -5 ), rs12617233 (FANCL; p = 5.30 × 10 -5 ), rs11672660 (GIPR; p = 1.64 × 10 -4 ), rs997295 (MAP2K5; p = 3.25 × 10 -4 ), rs6499653 (FTO; p = 6.23 × 10 -4 ), and rs3824755 (NT5C2; p = 7.90 × 10 -4 )-increased significantly across the sample BMI distribution. We showed that such increases stemmed from unadjusted gene interactions that enhanced the effects of SNPs in persons with a high BMI. When 125 height-associated SNPs were analyzed for comparison, only one (<1%), rs6219 (IGF1, p = 1.80 × 10 -4 ), showed effects that varied significantly across height percentiles. Cumulative gene scores of these SNPs (GS-BMI and GS-height) showed that only GS-BMI had effects that increased significantly across the sample distribution (BMI: p = 7.03 × 10 -37 ; height: p = 0.499). Overall, these findings underscore the importance of gene-gene and gene-environment interactions in shaping the genetic architecture of BMI and advance a method for detecting such interactions by using only the sample outcome distribution. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Controversial opinion: evaluation of EGR1 and LAMA2 loci for high myopia in Chinese populations.
Lin, Fang-yu; Huang, Zhu; Lu, Ning; Chen, Wei; Fang, Hui; Han, Wei
2016-03-01
Functional studies have suggested the important role of early growth response 1 (EGR1) and Laminin α2-chain (LAMA2) in human eye development. Genetic studies have reported a significant association of the single nucleotide polymorphism (SNP) in the LAMA2 gene with myopia. This study aimed to evaluate the association of the tagging SNPs (tSNPs) in the EGR1 and LAMA2 genes with high myopia in two independent Han Chinese populations. Four tSNPs (rs11743810 in the EGR1 gene; rs2571575, rs9321170, and rs1889891 in the LAMA2 gene) were selected, according to the HapMap database (http://hapmap.ncbi.nlm.nih.gov), and were genotyped using the ligase detection reaction (LDR) approach for 167 Han Chinese nuclear families with extremely highly myopic offspring (<-10.0 diopters) and an independent group with 485 extremely highly myopic cases (<-10.0 diopters) and 499 controls. Direct sequencing was used to confirm the LDR results in twenty randomly selected subjects. Family-based association analysis was performed using the family-based association test (FBAT) software package (Version 1.5.5). Population-based association analysis was performed using the Chi-square test. The association analysis power was estimated using online software (http://design.cs.ucla.edu). The FBAT demonstrated that all four tSNPs tested did not show association with high myopia (P>0.05). Haplotype analysis of tSNPs in the LAMA2 genes also did not show a significant association (P>0.05). Meanwhile, population-based association analysis also showed no significant association results with high myopia (P>0.05). On the basis of our family- and population-based analyses for the Han Chinese population, we did not find positive association signals of the four SNPs in the LAMA2 and EGR1 genes with high myopia.
Grant, Audrey V.; El Baghdadi, Jamila; Sabri, Ayoub; El Azbaoui, Safa; Alaoui-Tahiri, Kebir; Abderrahmani Rhorfi, Ismail; Gharbaoui, Yasser; Abid, Ahmed; Benkirane, Majid; Raharimanga, Vaomalala; Richard, Vincent; Orlova, Marianna; Boland, Anne; Migaud, Mélanie; Okada, Satoshi; Nolan, Daniel K.; Bustamante, Jacinta; Barreiro, Luis B.; Schurr, Erwin; Boisson-Dupuis, Stephanie; Rasolofo, Voahangy; Casanova, Jean-Laurent; Abel, Laurent
2013-01-01
Only a small fraction of individuals infected with Mycobacterium tuberculosis develop clinical tuberculosis (TB) in their lifetime. Genetic epidemiological evidence suggests a genetic determinism of pulmonary TB (PTB), but the molecular basis of genetic predisposition to PTB remains largely unknown. We used a positional-cloning approach to carry out ultrafine linkage-disequilibrium mapping of a previously identified susceptibility locus in chromosomal region 8q12–13 by genotyping 3,216 SNPs in a family-based Moroccan sample including 286 offspring with PTB. We observed 44 PTB-associated SNPs (p < 0.01), which were genotyped in an independent set of 317 cases and 650 controls from Morocco. A single signal, consisting of two correlated SNPs close to TOX, rs1568952 and rs2726600 (combined p = 1.1 × 10−5 and 9.2 × 10−5, respectively), was replicated. Stronger evidence of association was found in individuals who developed PTB before the age of 25 years (combined p for rs1568952 = 4.4 × 10−8; odds ratio of PTB for AA versus AG/GG = 3.09 [1.99–4.78]). The association with rs2726600 (p = 0.04) was subsequently replicated in PTB-affected subjects under 25 years in a study of 243 nuclear families from Madagascar. Stronger evidence of replication in Madagascar was obtained for additional SNPs in strong linkage disequilibrium with the two initial SNPs (p = 0.003 for rs2726597), further confirming the signal. We thus identified around rs1568952 and rs2726600 a cluster of SNPs strongly associated with early-onset PTB in Morocco and Madagascar. SNP rs2726600 is located in a transcription-factor binding site in the 3′ region of TOX, and further functional explorations will focus on CD4 T lymphocytes. PMID:23415668
Reid, Brett M; Permuth, Jennifer B; Chen, Y Ann; Teer, Jamie K; Monteiro, Alvaro N A; Chen, Zhihua; Tyrer, Jonathan; Berchuck, Andrew; Chenevix-Trench, Georgia; Doherty, Jennifer A; Goode, Ellen L; Iverson, Edwin S; Lawrenson, Kate; Pearce, Celeste L; Pharoah, Paul D; Phelan, Catherine M; Ramus, Susan J; Rossing, Mary Anne; Schildkraut, Joellen M; Cheng, Jin Q; Gayther, Simon A; Sellers, Thomas A
2017-01-01
Genome-wide association studies (GWAS) have identified multiple loci associated with epithelial ovarian cancer (EOC) susceptibility, but further progress requires integration of epidemiology and biology to illuminate true risk loci below genome-wide significance levels (P < 5 × 10 -8 ). Most risk SNPs lie within non-protein-encoding regions, and we hypothesize that long noncoding RNA (lncRNA) genes are enriched at EOC risk regions and represent biologically relevant functional targets. Using imputed GWAS data from about 18,000 invasive EOC cases and 34,000 controls of European ancestry, the GENCODE (v19) lncRNA database was used to annotate SNPs from 13,442 lncRNAs for permutation-based enrichment analysis. Tumor expression quantitative trait locus (eQTL) analysis was performed for sub-genome-wide regions (1 × 10 -5 > P > 5 × 10 -8 ) overlapping lncRNAs. Of 5,294 EOC-associated SNPs (P < 1.0 × 10 -5 ), 1,464 (28%) mapped within 53 unique lncRNAs and an additional 3,484 (66%) SNPs were correlated (r 2 > 0.2) with SNPs within 115 lncRNAs. EOC-associated SNPs comprised 130 independent regions, of which 72 (55%) overlapped with lncRNAs, representing a significant enrichment (P = 5.0 × 10 -4 ) that was more pronounced among a subset of 5,401 lncRNAs with active epigenetic regulation in normal ovarian tissue. EOC-associated lncRNAs and their putative promoters and transcription factors were enriched for biologically relevant pathways and eQTL analysis identified five novel putative risk regions with allele-specific effects on lncRNA gene expression. lncRNAs are significantly enriched at EOC risk regions, suggesting a mechanistic role for lncRNAs in driving predisposition to EOC. lncRNAs represent key candidates for integrative epidemiologic and functional studies. Further research on their biologic role in ovarian cancer is indicated. Cancer Epidemiol Biomarkers Prev; 26(1); 116-25. ©2016 AACR. ©2016 American Association for Cancer Research.
Genome-wide association study of smoking behaviors in COPD patients
Siedlinski, Mateusz; Cho, Michael H.; Bakke, Per; Gulsvik, Amund; Lomas, David A.; Anderson, Wayne; Kong, Xiangyang; Rennard, Stephen I.; Beaty, Terri H.; Hokanson, John E.; Crapo, James D.; Silverman, Edwin K.
2012-01-01
Background Cigarette smoking is a major risk factor for COPD and COPD severity. Previous genome-wide association studies (GWAS) have identified numerous single nucleotide polymorphisms (SNPs) associated with the number of cigarettes smoked per day (CPD) and a Dopamine Beta-Hydroxylase (DBH) locus associated with smoking cessation in multiple populations. Objective To identify SNPs associated with lifetime average and current CPD, age at smoking initiation, and smoking cessation in COPD subjects. Methods GWAS were conducted in 4 independent cohorts encompassing 3,441 ever-smoking COPD subjects (GOLD stage II or higher). Untyped SNPs were imputed using HapMap (phase II) panel. Results from all cohorts were meta-analyzed. Results Several SNPs near the HLA region on chromosome 6p21 and in an intergenic region on chromosome 2q21 showed associations with age at smoking initiation, both with the lowest p=2×10−7. No SNPs were associated with lifetime average CPD, current CPD or smoking cessation with p<10−6. Nominally significant associations with candidate SNPs within alpha-nicotinic acetylcholine receptors 3/5 (CHRNA3/CHRNA5; e.g. p=0.00011 for SNP rs1051730) and Cytochrome P450 2A6 (CYP2A6; e.g. p=2.78×10−5 for a nonsynonymous SNP rs1801272) regions were observed for lifetime average CPD, however only CYP2A6 showed evidence of significant association with current CPD. A candidate SNP (rs3025343) in the DBH was significantly (p=0.015) associated with smoking cessation. Conclusion We identified two candidate regions associated with age at smoking initiation in COPD subjects. Associations of CHRNA3/CHRNA5 and CYP2A6 loci with CPD and DBH with smoking cessation are also likely of importance in the smoking behaviors of COPD patients. PMID:21685187
Tang, Shaowen; Lv, Xiaozhen; Zhang, Yuan; Wu, Shanshan; Yang, Zhirong; Xia, Yinyin; Tu, Dehua; Deng, Peiyuan; Ma, Yu; Chen, Dafang; Zhan, Siyan
2013-01-01
The pathogenic mechanism of anti-tuberculosis (anti-TB) drug-induced hepatitis is associated with drug metabolizing enzymes. No tagging single-nucleotide polymorphisms (tSNPs) of cytochrome P450 2E1(CYP2E1) in the risk of anti-TB drug-induced hepatitis have been reported. The present study was aimed at exploring the role of tSNPs in CYP2E1 gene in a population-based anti-TB treatment cohort. A nested case-control study was designed. Each hepatitis case was 14 matched with controls by age, gender, treatment history, disease severity and drug dosage. The tSNPs were selected by using Haploview 4.2 based on the HapMap database of Han Chinese in Beijing, and detected by using TaqMan allelic discrimination technology. Eighty-nine anti-TB drug-induced hepatitis cases and 356 controls were included in this study. 6 tSNPs (rs2031920, rs2070672, rs915908, rs8192775, rs2515641, rs2515644) were genotyped and minor allele frequencies of these tSNPs were 21.9%, 23.0%, 19.1%, 23.6%, 20.8% and 44.4% in the cases and 20.9%, 22.7%, 18.9%, 23.2%, 18.2% and 43.2% in the controls, respectively. No significant difference was observed in genotypes or allele frequencies of the 6 tSNPs between case group and control group, and neither of haplotypes in block 1 nor in block 2 was significantly associated with the development of hepatitis. Based on the Chinese anti-TB treatment cohort, we did not find a statistically significant association between genetic polymorphisms of CYP2E1 and the risk of anti-TB drug-induced hepatitis. None of the haplotypes showed a significant association with the development of hepatitis in Chinese TB population.
Delaney, Jessica T.; Jeff, Janina M.; Brown, Nancy J.; Pretorius, Mias; Okafor, Henry E.; Darbar, Dawood; Roden, Dan M.; Crawford, Dana C.
2012-01-01
Background Despite a greater burden of risk factors, atrial fibrillation (AF) is less common among African Americans than European-descent populations. Genome-wide association studies (GWAS) for AF in European-descent populations have identified three predominant genomic regions associated with increased risk (1q21, 4q25, and 16q22). The contribution of these loci to AF risk in African American is unknown. Methodology/Principal Findings We studied 73 African Americans with AF from the Vanderbilt-Meharry AF registry and 71 African American controls, with no history of AF including after cardiac surgery. Tests of association were performed for 148 SNPs across the three regions associated with AF, and 22 SNPs were significantly associated with AF (P<0.05). The SNPs with the strongest associations in African Americans were both different from the index SNPs identified in European-descent populations and independent from the index European-descent population SNPs (r2<0.40 in HapMap CEU): 1q21 rs4845396 (odds ratio [OR] 0.30, 95% confidence interval [CI] 0.13–0.67, P = 0.003), 4q25 rs4631108 (OR 3.43, 95% CI 1.59–7.42, P = 0.002), and 16q22 rs16971547 (OR 8.1, 95% CI 1.46–45.4, P = 0.016). Estimates of European ancestry were similar among cases (23.6%) and controls (23.8%). Accordingly, the probability of having two copies of the European derived chromosomes at each region did not differ between cases and controls. Conclusions/Significance Variable European admixture at known AF loci does not explain decreased AF susceptibility in African Americans. These data support the role of 1q21, 4q25, and 16q22 variants in AF risk for African Americans, although the index SNPs differ from those identified in European-descent populations. PMID:22384221
DOT National Transportation Integrated Search
2004-09-01
The Federal Aviation Administration (FAA) has requested human factors guidance to support the new moving map Technical Standard Order (TSO)-C165, Electronic Map Display Equipment for Graphical Depiction of Aircraft Position. This document was develop...
Federal Register 2010, 2011, 2012, 2013, 2014
2013-11-15
... Devices, Navigation and Display Systems, Radar Systems, Navigational Aids, Mapping Systems and Related... navigation products, including GPS devices, navigation and display systems, radar systems, navigational aids..., radar systems, navigational aids, mapping systems and related software by reason of infringement of one...
PigGIS: Pig Genomic Informatics System
Ruan, Jue; Guo, Yiran; Li, Heng; Hu, Yafeng; Song, Fei; Huang, Xin; Kristiensen, Karsten; Bolund, Lars; Wang, Jun
2007-01-01
Pig Genomic Information System (PigGIS) is a web-based depository of pig (Sus scrofa) genomic learning mainly engineered for biomedical research to locate pig genes from their human homologs and position single nucleotide polymorphisms (SNPs) in different pig populations. It utilizes a variety of sequence data, including whole genome shotgun (WGS) reads and expressed sequence tags (ESTs), and achieves a successful mapping solution to the low-coverage genome problem. With the data presently available, we have identified a total of 15 700 pig consensus sequences covering 18.5 Mb of the homologous human exons. We have also recovered 18 700 SNPs and 20 800 unique 60mer oligonucleotide probes for future pig genome analyses. PigGIS can be freely accessed via the web at and . PMID:17090590
2012-01-01
Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety. PMID:22631220
Gao, J B; Li, Y K; Yang, N; Ma, X H; Adoligbe, C; Jiang, B J; Fu, C Z; Cheng, G; Zan, L S
2013-02-28
The aim of this study was to determine whether single nucleotide polymorphisms (SNPs) of bovine Dickkopf homolog 4 (DKK4) are associated with body measurement traits in Qinchuan cattle. By using PCR-SSCP technology and DNA sequencing, we discovered 5 DKK4 SNPs in Qingchuan cattle, including -65G>A and -77G>T in the 5'-untranslated region, 1532C>G and 1533T>C in exon 2, and 2088C>T in exon 3. The sequencing map showed that 1532C>G and 1533T>C were in close linkage disequilibrium and were treated as 1532C>G-1533T>C in this study. Allele frequencies were calculated and analyzed by the chi-square test, which showed that -65G>A and 1532C>G-1533T>C were in Hardy-Weinberg equilibrium (P > 0.05), whereas -77G>T and 2088C>T were not in all 633 tested Qinchuan cattle individuals (P < 0.01). Gene heterozygosity (HE), effective allele number (NE), and polymorphism information content (PIC) were 0.407, 1.686, and 0.324 at -65G>A; 0.472, 1.894, and 0.361 at -77G>T; 0.476, 1.908, and 0.363 at 1532C>G-1533T>C; and 0.218, 1.279, and 0.195 at 2088C>T. We also evaluated the potential association of these SNPs with body measurement traits in all 633 individuals; the results suggest that several SNPs in Qinchuan cattle DKK4 were significantly associated with body length, hip height, rump length, hip width, heart girth, and pin bone width (P < 0.05 and P < 0.01). These results suggest that bovine DKK4 could be used as candidate gene for Qinchuan cattle breeding.
Genetically decreased vitamin D and risk of Alzheimer disease.
Mokry, Lauren E; Ross, Stephanie; Morris, John A; Manousaki, Despoina; Forgetta, Vincenzo; Richards, J Brent
2016-12-13
To test whether genetically decreased vitamin D levels are associated with Alzheimer disease (AD) using mendelian randomization (MR), a method that minimizes bias due to confounding or reverse causation. We selected single nucleotide polymorphisms (SNPs) that are strongly associated with 25-hydroxyvitamin D (25OHD) levels (p < 5 × 10 -8 ) from the Study of Underlying Genetic Determinants of Vitamin D and Highly Related Traits (SUNLIGHT) Consortium (N = 33,996) to act as instrumental variables for the MR study. We measured the effect of each of these SNPs on 25OHD levels in the Canadian Multicentre Osteoporosis Study (CaMos; N = 2,347) and obtained the corresponding effect estimates for each SNP on AD risk from the International Genomics of Alzheimer's Project (N = 17,008 AD cases and 37,154 controls). To produce MR estimates, we weighted the effect of each SNP on AD by its effect on 25OHD and meta-analyzed these estimates using a fixed-effects model to provide a summary effect estimate. The SUNLIGHT Consortium identified 4 SNPs to be genome-wide significant for 25OHD, which described 2.44% of the variance in 25OHD in CaMos. All 4 SNPs map to genes within the vitamin D metabolic pathway. MR analyses demonstrated that a 1-SD decrease in natural log-transformed 25OHD increased AD risk by 25% (odds ratio 1.25, 95% confidence interval 1.03-1.51, p = 0.021). After sensitivity analysis in which we removed SNPs possibly influenced by pleiotropy and population stratification, the results were largely unchanged. Our results provide evidence supporting 25OHD as a causal risk factor for AD. These findings provide further rationale to understand the effect of vitamin D supplementation on cognition and AD risk in randomized controlled trials. © 2016 American Academy of Neurology.
Vargas, Jose D; Manichaikul, Ani; Wang, Xin-Qun; Rich, Stephen S; Rotter, Jerome I; Post, Wendy S; Polak, Joseph F; Budoff, Matthew J; Bluemke, David A
2016-02-01
Subclinical atherosclerosis (sCVD), measured by coronary artery calcium (CAC) and carotid intima media thickness (CIMT) is associated with cardiovascular disease (CVD). Genome-Wide Association Studies (GWAS) of sCVD and CVD have focused primarily on Caucasian populations. We hypothesized that these associations may differ in populations from distinct genetic backgrounds. The associations between sCVD and 66 single nucleotide polymorphisms (SNPs) from published GWAS of sCVD and CVD were tested in 8224 Multi-Ethnic Study of Atherosclerosis (MESA) and MESA Family participants [2329 Caucasians (EUA), 691 Chinese (CHN), 2482 African Americans (AFA), and 2012 Hispanic (HIS)] using an additive model adjusting for CVD risk factors, with SNP significance defined by a Bonferroni-corrected p < 7.6 × 10(-4) (0.05/66). In EUA there were significant associations for CAC with SNPs in 9p21 (rs1333049, P = 2 × 10(-9); rs4977574, P = 4 × 10(-9)), COL4A1 (rs9515203, P = 9 × 10(-6)), and PHACTR1 (rs9349379, P = 4 × 10(-4)). In HIS, CAC was associated with SNPs in 9p21 (rs1333049, P = 8 × 10(-5); rs4977574, P = 5 × 10(-5)), APOA5 (rs964184, P = 2 × 10(-4)), and ADAMTS7 (rs7173743, P = 4 × 10(-4)). There were no associations between CAC and 9p21 SNPs for AFA and CHN. Fine mapping of the 9p21 region revealed SNPs with robust associations with CAC in EUA and HIS but no significant associations in AFA and CHN. Our results suggest some shared genetic architecture for sCVD across ethnic groups, while also underscoring the possibility of novel variants and/or pathways in risk of CVD in ethnically diverse populations. Published by Elsevier Ireland Ltd.
Haplotype-based approach to known MS-associated regions increases the amount of explained risk
Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S
2015-01-01
Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
Windelinckx, An; De Mars, Gunther; Huygens, Wim; Peeters, Maarten W.; Vincent, Barbara; Wijmenga, Cisca; Lambrechts, Diether; Aerssens, Jeroen; Vlietinck, Robert; Beunen, Gaston
2011-01-01
Muscle strength is an important determinant in elite sports performance as well as in the activities of daily living. Muscle metabolism also plays a role in the genesis, and therefore prevention, of common pathological conditions and chronic diseases. Even though heritability estimates between 31 and 78% suggest a significant genetic component in muscle strength, only a limited number of genes influencing muscle strength have been identified. This study aimed to identify and prioritize positional candidate genes within a skeletal muscle strength quantitative trait locus on chromosome 12q22-23 for follow-up. A two-staged gene-centered fine-mapping approach using 122 single nucleotide polymorphisms (SNPs) in stage 1 identified a familybased association (n = 500) between several tagSNPs located in the ATPase, Ca2+ transporting, cardiac muscle, slow twitch 2 (ATP2A2; rs3026468), the NUAK family, SNF1-like kinase, 1 (NUAK1; rs10861553 and rs3741886), and the protein phosphatase 1, catalytic subunit, gamma isoform (PPP1CC; rs1050587 and rs7901769) genes and knee torque production (P values up to 0.00092). In stage 2, family-based association tests on additional putatively functional SNPs (e.g., exonic SNPs, SNPs in transcription factor binding sites or in conserved regions) in an enlarged sample (n = 536; 464 individuals overlap with stage 1) did not identify additional associations with muscle strength characteristics. Further in-depth analyses will be necessary to elucidate the exact role of ATP2A2, PPP1CC, and NUAK1 in muscle strength and to find out which functional polymorphisms are at the base of the interindividual strength differences. PMID:21750233
Achenbach, Ute; Paulo, Joao; Ilarionova, Evgenyia; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Gebhardt, Christiane
2009-02-01
The damage caused by the parasitic root cyst nematode Globodera pallida is a major yield-limiting factor in potato cultivation . Breeding for resistance is facilitated by the PCR-based marker 'HC', which is diagnostic for an allele conferring high resistance against G. pallida pathotype Pa2/3 that has been introgressed from the wild potato species Solanum vernei into the Solanum tuberosum tetraploid breeding pool. The major quantitative trait locus (QTL) controlling this nematode resistance maps on potato chromosome V in a hot spot for resistance to various pathogens including nematodes and the oomycete Phytophthora infestans. An unstructured sample of 79 tetraploid, highly heterozygous varieties and breeding clones was selected based on presence (41 genotypes) or absence (38 genotypes) of the HC marker. Testing the clones for resistance to G. pallida confirmed the diagnostic power of the HC marker. The 79 individuals were genotyped for 100 single nucleotide polymorphisms (SNPs) at 10 loci distributed over 38 cM on chromosome V. Forty-five SNPs at six loci spanning 2 cM in the interval between markers GP21-GP179 were associated with resistance to G. pallida. Based on linkage disequilibrium (LD) between SNP markers, six LD groups comprising between 2 and 18 SNPs were identified. The LD groups indicated the existence of multiple alleles at a single resistance locus or at several, physically linked resistance loci. LD group C comprising 18 SNPs corresponded to the 'HC' marker. LD group E included 16 SNPs and showed an association peak, which positioned one nematode resistance locus physically close to the R1 gene family.
The design of electronic map displays
NASA Technical Reports Server (NTRS)
Aretz, Anthony J.
1991-01-01
This paper presents a cognitive analysis of a pilot's navigation task and describes an experiment comparing a new map display that employs the principle of visual momentum with the two traditional approaches, track-up and north-up. The data show that the advantage of a track-up alignment is its congruence with the egocentered forward view; however, the inconsistency of the rotating display hinders development of a cognitive map. The stability of a north-up alignment aids the acquisition of a cognitive map, but there is a cost associated with the mental rotation of the display to a track-up alignment for tasks involving the ego-centered forward view. The data also show that the visual momentum design captures the benefits and reduces the costs associated with the two traditional approaches.
Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M
2007-01-01
Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
2010-10-01
reproducibility of genotype c alls among the four batches by comparing the HapMap samples across batches. We also calculated identity-by- descent (IBD...used to aid clinicians in per sonalizing dosage to improve the therapeutic index of radiotherapy treatment for prostate cancer. References None Appendices None
2011-10-01
the HapMap samples across batches. We also calculated identity-by- descent (IBD) and identity-by-state (IBS) measures to confirm the identity of the...development of adverse events following radiotherapy. Such a tool could be used to aid clinicians in personalizing dosage to improve the therapeutic index of
Nikolić, Miloš; Papantonis, Argyris
2017-01-01
Abstract Genome-wide association studies (GWAS) have emerged as a powerful tool to uncover the genetic basis of human common diseases, which often show a complex, polygenic and multi-factorial aetiology. These studies have revealed that 70–90% of all single nucleotide polymorphisms (SNPs) associated with common complex diseases do not occur within genes (i.e. they are non-coding), making the discovery of disease-causative genetic variants and the elucidation of the underlying pathological mechanisms far from straightforward. Based on emerging evidences suggesting that disease-associated SNPs are frequently found within cell type-specific regulatory sequences, here we present GARLIC (GWAS-based Prediction Toolkit for Connecting Diseases and Cell Types), a user-friendly, multi-purpose software with an associated database and online viewer that, using global maps of cis-regulatory elements, can aetiologically connect human diseases with relevant cell types. Additionally, GARLIC can be used to retrieve potential disease-causative genetic variants overlapping regulatory sequences of interest. Overall, GARLIC can satisfy several important needs within the field of medical genetics, thus potentially assisting in the ultimate goal of uncovering the elusive and complex genetic basis of common human disorders. PMID:28007912
Liu, Hanmei; Wang, Xuewen; Wei, Bin; Wang, Yongbin; Liu, Yinghong; Zhang, Junjie; Hu, Yufeng; Yu, Guowu; Li, Jian; Xu, Zhanbin; Huang, Yubi
2016-01-01
In southwest China, some maize landraces have long been isolated geographically, and have phenotypes that differ from those of widely grown cultivars. These landraces may harbor rich genetic variation responsible for those phenotypes. Four-row Wax is one such landrace, with four rows of kernels on the cob. We resequenced the genome of Four-row Wax, obtaining 50.46 Gb sequence at 21.87× coverage, then identified and characterized 3,252,194 SNPs, 213,181 short InDels (1–5 bp) and 39,631 structural variations (greater than 5 bp). Of those, 312,511 (9.6%) SNPs were novel compared to the most detailed haplotype map (HapMap) SNP database of maize. Characterization of variations in reported kernel row number (KRN) related genes and KRN QTL regions revealed potential causal mutations in fea2, td1, kn1, and te1. Genome-wide comparisons revealed abundant genetic variations in Four-row Wax, which may be associated with environmental adaptation. The sequence and SNP variations described here enrich genetic resources of maize, and provide guidance into study of seed numbers for crop yield improvement. PMID:27242868
Chung, Chia-Min; Wang, Ruey-Yun; Fann, Cathy S. J.; Chen, Jaw-Wen; Jong, Yuh-Shiun; Jou, Yuh-Shan; Yang, Hsin-Chou; Kang, Chih-Sen; Chen, Chien-Chung; Chang, Huan-Cheng; Pan, Wen-Harn
2013-01-01
Angiotensin-converting enzyme (ACE) has been implicated in multiple biological system, particularly cardiovascular diseases. However, findings associating ACE insertion/deletion polymorphism with hypertension or other related traits are inconsistent. Therefore, in a two-stage approach, we aimed to fine-map ACE in order to narrow-down the function-specific locations. We genotyped 31 single nucleotide polymorphisms (SNPs) of ACE from 1168 individuals from 305 young-onset (age ≤40) hypertension pedigrees, and found four linkage disequilibrium (LD) blocks. A tag-SNP, rs1800764 on LD block 2, upstream of and near the ACE promoter, was significantly associated with young-onset hypertension (p = 0.04). Tag-SNPs on all LD blocks were significantly associated with ACE activity (p-value: 10–16 to <10–33). The two regions most associated with ACE activity were found between exon13 and intron18 and between intron 20 and 3′UTR, as revealed by measured haplotype analysis. These two major QTLs of ACE activity and the moderate effect variant upstream of ACE promoter for young-onset hypertension were replicated by another independent association study with 842 subjects. PMID:23469169
Onuki, Ritsuko; Yamaguchi, Rui; Shibuya, Tetsuo; Kanehisa, Minoru; Goto, Susumu
2017-01-01
Genome-wide scans for positive selection have become important for genomic medicine, and many studies aim to find genomic regions affected by positive selection that are associated with risk allele variations among populations. Most such studies are designed to detect recent positive selection. However, we hypothesize that ancient positive selection is also important for adaptation to pathogens, and has affected current immune-mediated common diseases. Based on this hypothesis, we developed a novel linkage disequilibrium-based pipeline, which aims to detect regions associated with ancient positive selection across populations from single nucleotide polymorphism (SNP) data. By applying this pipeline to the genotypes in the International HapMap project database, we show that genes in the detected regions are enriched in pathways related to the immune system and infectious diseases. The detected regions also contain SNPs reported to be associated with cancers and metabolic diseases, obesity-related traits, type 2 diabetes, and allergic sensitization. These SNPs were further mapped to biological pathways to determine the associations between phenotypes and molecular functions. Assessments of candidate regions to identify functions associated with variations in incidence rates of these diseases are needed in the future. PMID:28445522
An integrated map of genetic variation from 1,092 human genomes
2012-01-01
Summary Through characterising the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help understand the genetic contribution to disease. We describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methodologies to integrate information across multiple algorithms and diverse data sources we provide a validated haplotype map of 38 million SNPs, 1.4 million indels and over 14 thousand larger deletions. We show that individuals from different populations carry different profiles of rare and common variants and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways and that each individual harbours hundreds of rare non-coding variants at conserved sites, such as transcription-factor-motif disrupting changes. This resource, which captures up to 98% of accessible SNPs at a frequency of 1% in populations of medical genetics focus, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations. PMID:23128226
Damage Proxy Map from Interferometric Synthetic Aperture Radar Coherence
NASA Technical Reports Server (NTRS)
Webb, Frank H. (Inventor); Yun, Sang-Ho (Inventor); Fielding, Eric Jameson (Inventor); Simons, Mark (Inventor)
2015-01-01
A method, apparatus, and article of manufacture provide the ability to generate a damage proxy map. A master coherence map and a slave coherence map, for an area prior and subsequent to (including) a damage event are obtained. The slave coherence map is registered to the master coherence map. Pixel values of the slave coherence map are modified using histogram matching to provide a first histogram of the master coherence map that exactly matches a second histogram of the slave coherence map. A coherence difference between the slave coherence map and the master coherence map is computed to produce a damage proxy map. The damage proxy map is displayed with the coherence difference displayed in a visually distinguishable manner.
Kayam, Galya; Brand, Yael; Faigenboim-Doron, Adi; Patil, Abhinandan; Hedvat, Ilan; Hovav, Ran
2017-01-01
The growth habit of lateral shoots (also termed “branching habit”) is an important descriptive and agronomic character of peanut. Yet, both the inheritance of branching habit and the genetic mechanism that controls it in this crop remain unclear. In addition, the low degree of polymorphism among cultivated peanut varieties hinders fine-mapping of this and other traits in non-homozygous genetic structures. Here, we combined high-throughput sequencing with a well-defined genetic system to study these issues in peanut. Initially, segregating F2 populations derived from a reciprocal cross between very closely related Virginia-type peanut cultivars with spreading and bunch growth habits were examined. The spreading/bunch trait was shown to be controlled by a single gene with no cytoplasmic effect. That gene was named Bunch1 and was significantly correlated with pod yield per plant, time to maturation and the ratio of “dead-end” pods. Subsequently, bulked segregant analysis was performed on 52 completely bunch, and 47 completely spreading F3 families. In order to facilitate the process of SNP detection and candidate-gene analysis, the transcriptome was used instead of genomic DNA. Young leaves were sampled and bulked. Reads from Illumina sequencing were aligned against the peanut reference transcriptome and the diploid genomes. Inter-varietal SNPs were detected, scored and quality-filtered. Thirty-four candidate SNPs were found to have a bulk frequency ratio value >10 and 6 of those SNPs were found to be located in the genomic region of linkage group B5. Three best hits from that over-represented region were further analyzed in the segregating population. The trait locus was found to be located in a ~1.1 Mbp segment between markers M875 (B5:145,553,897; 1.9 cM) and M255 (B5:146,649,943; 2.25 cM). The method was validated using a population of recombinant inbreed lines of the same cross and a new DNA SNP-array. This study demonstrates the relatively straight-forward utilization of bulk segregant analysis for trait fine-mapping in the low polymeric and heterozygous germplasm of cultivated peanut and provides a baseline for candidate gene discovery and map-based cloning of Bunch1. PMID:28421098
Fan, Sook-Ha; Say, Yee-How
2014-06-20
This study was to investigate the prevalence of single nucleotide polymorphisms (SNPs) in leptin gene LEP (A19G and G2548A) and leptin receptor gene LEPR (K109R and Q223R) and their association with fasting plasma leptin level (PLL) and obesity in a Malaysian suburban population in Kampar, Perak. Convenience sampling was performed with informed consents, and the study sample was drawn from patients who were patrons of the Kampar Health Clinic. A total of 408 subjects (mean age, 52.4 ± 13.7 years; 169 men, 239 women; 190 obese, 218 non-obese; 148 Malays, 177 ethnic Chinese, 83 ethnic Indians) participated. Socio-demographic data and anthropometric measurements were taken, and genotyping was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). The LEP A19G, G2548A and LEPR K109R, Q223R variant allele frequencies were 0.74, 0.67 and 0.61, 0.79, respectively. The genotype and allele distributions of these gene variants were significantly different among ethnic groups, but not among body mass index (BMI) classes. Subjects with LEPR K109 and Q223 allele had significantly higher systolic blood pressure and adiposity indices after adjustment for ethnicity (higher BMI, total body and subcutaneous fat; lower skeletal muscle percentage). Subjects with LEPR 109R allele had lower PLL than their wild-type allele counterparts. The influence of LEP A19G and G2548A SNPs on blood pressures, anthropometrics, and PLL was not evident. Interestingly, synergistic effect of the LEP and LEPR SNPs was observed as subjects homozygous for all four SNPs studied exhibited significantly higher subcutaneous fat and PLL than those with other genotype combinations. The LEP and LEPR SNPs in this study may not be an obesity marker among Malaysians in this population, but were associated with ethnicity. Our findings suggest that each of these SNPs contributes to minor but significant variation in obesity-related traits and in combination they display synergistic effects on subcutaneous fat and PLL.
2014-01-01
Background This study was to investigate the prevalence of single nucleotide polymorphisms (SNPs) in leptin gene LEP (A19G and G2548A) and leptin receptor gene LEPR (K109R and Q223R) and their association with fasting plasma leptin level (PLL) and obesity in a Malaysian suburban population in Kampar, Perak. Methods Convenience sampling was performed with informed consents, and the study sample was drawn from patients who were patrons of the Kampar Health Clinic. A total of 408 subjects (mean age, 52.4 ± 13.7 years; 169 men, 239 women; 190 obese, 218 non-obese; 148 Malays, 177 ethnic Chinese, 83 ethnic Indians) participated. Socio-demographic data and anthropometric measurements were taken, and genotyping was performed using polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). Results The LEP A19G, G2548A and LEPR K109R, Q223R variant allele frequencies were 0.74, 0.67 and 0.61, 0.79, respectively. The genotype and allele distributions of these gene variants were significantly different among ethnic groups, but not among body mass index (BMI) classes. Subjects with LEPR K109 and Q223 allele had significantly higher systolic blood pressure and adiposity indices after adjustment for ethnicity (higher BMI, total body and subcutaneous fat; lower skeletal muscle percentage). Subjects with LEPR 109R allele had lower PLL than their wild-type allele counterparts. The influence of LEP A19G and G2548A SNPs on blood pressures, anthropometrics, and PLL was not evident. Interestingly, synergistic effect of the LEP and LEPR SNPs was observed as subjects homozygous for all four SNPs studied exhibited significantly higher subcutaneous fat and PLL than those with other genotype combinations. Conclusions The LEP and LEPR SNPs in this study may not be an obesity marker among Malaysians in this population, but were associated with ethnicity. Our findings suggest that each of these SNPs contributes to minor but significant variation in obesity-related traits and in combination they display synergistic effects on subcutaneous fat and PLL. PMID:24947733
Wang, Xianshu; Pankratz, V Shane; Fredericksen, Zachary; Tarrell, Robert; Karaus, Mary; McGuffog, Lesley; Pharaoh, Paul D P; Ponder, Bruce A J; Dunning, Alison M; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Sinilnikova, Olga M; Stoppa-Lyonnet, Dominique; Mazoyer, Sylvie; Houdayer, Claude; Hogervorst, Frans B L; Hooning, Maartje J; Ligtenberg, Marjolijn J; Spurdle, Amanda; Chenevix-Trench, Georgia; Schmutzler, Rita K; Wappenschmidt, Barbara; Engel, Christoph; Meindl, Alfons; Domchek, Susan M; Nathanson, Katherine L; Rebbeck, Timothy R; Singer, Christian F; Gschwantler-Kaulich, Daphne; Dressler, Catherina; Fink, Anneliese; Szabo, Csilla I; Zikan, Michal; Foretova, Lenka; Claes, Kathleen; Thomas, Gilles; Hoover, Robert N; Hunter, David J; Chanock, Stephen J; Easton, Douglas F; Antoniou, Antonis C; Couch, Fergus J
2010-07-15
Recent studies have identified single nucleotide polymorphisms (SNPs) that significantly modify breast cancer risk in BRCA1 and BRCA2 mutation carriers. Since these risk modifiers were originally identified as genetic risk factors for breast cancer in genome-wide association studies (GWASs), additional risk modifiers for BRCA1 and BRCA2 may be identified from promising signals discovered in breast cancer GWAS. A total of 350 SNPs identified as candidate breast cancer risk factors (P < 1 x 10(-3)) in two breast cancer GWAS studies were genotyped in 3451 BRCA1 and 2006 BRCA2 mutation carriers from nine centers. Associations with breast cancer risk were assessed using Cox models weighted for penetrance. Eight SNPs in BRCA1 carriers and 12 SNPs in BRCA2 carriers, representing an enrichment over the number expected, were significantly associated with breast cancer risk (P(trend) < 0.01). The minor alleles of rs6138178 in SNRPB and rs6602595 in CAMK1D displayed the strongest associations in BRCA1 carriers (HR = 0.78, 95% CI: 0.69-0.90, P(trend) = 3.6 x 10(-4) and HR = 1.25, 95% CI: 1.10-1.41, P(trend) = 4.2 x 10(-4)), whereas rs9393597 in LOC134997 and rs12652447 in FBXL7 showed the strongest associations in BRCA2 carriers (HR = 1.55, 95% CI: 1.25-1.92, P(trend) = 6 x 10(-5) and HR = 1.37, 95% CI: 1.16-1.62, P(trend) = 1.7 x 10(-4)). The magnitude and direction of the associations were consistent with the original GWAS. In subsequent risk assessment studies, the loci appeared to interact multiplicatively for breast cancer risk in BRCA1 and BRCA2 carriers. Promising candidate SNPs from GWAS were identified as modifiers of breast cancer risk in BRCA1 and BRCA2 carriers. Upon further validation, these SNPs together with other genetic and environmental factors may improve breast cancer risk assessment in these populations.
2011-01-01
Background Big sagebrush (Artemisia tridentata) is one of the most widely distributed and ecologically important shrub species in western North America. This species serves as a critical habitat and food resource for many animals and invertebrates. Habitat loss due to a combination of disturbances followed by establishment of invasive plant species is a serious threat to big sagebrush ecosystem sustainability. Lack of genomic data has limited our understanding of the evolutionary history and ecological adaptation in this species. Here, we report on the sequencing of expressed sequence tags (ESTs) and detection of single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) markers in subspecies of big sagebrush. Results cDNA of A. tridentata sspp. tridentata and vaseyana were normalized and sequenced using the 454 GS FLX Titanium pyrosequencing technology. Assembly of the reads resulted in 20,357 contig consensus sequences in ssp. tridentata and 20,250 contigs in ssp. vaseyana. A BLASTx search against the non-redundant (NR) protein database using 29,541 consensus sequences obtained from a combined assembly resulted in 21,436 sequences with significant blast alignments (≤ 1e-15). A total of 20,952 SNPs and 119 polymorphic SSRs were detected between the two subspecies. SNPs were validated through various methods including sequence capture. Validation of SNPs in different individuals uncovered a high level of nucleotide variation in EST sequences. EST sequences of a third, tetraploid subspecies (ssp. wyomingensis) obtained by Illumina sequencing were mapped to the consensus sequences of the combined 454 EST assembly. Approximately one-third of the SNPs between sspp. tridentata and vaseyana identified in the combined assembly were also polymorphic within the two geographically distant ssp. wyomingensis samples. Conclusion We have produced a large EST dataset for Artemisia tridentata, which contains a large sample of the big sagebrush leaf transcriptome. SNP mapping among the three subspecies suggest the origin of ssp. wyomingensis via mixed ancestry. A large number of SNP and SSR markers provide the foundation for future research to address questions in big sagebrush evolution, ecological genetics, and conservation using genomic approaches. PMID:21767398
Evaluating the Effect of Display Realism on Natural Resource Decision Making
NASA Astrophysics Data System (ADS)
Chong, Steven S.
2018-05-01
Geographic information systems (GIS) facilitate location-based decision making. Despite the improved availability of GIS software to non-professionals, training in cartographic design has not followed suit. Prior research indicates that when presented with map choices, users are influenced by naïve realism, a preference for realistic displays cotaining irrelevant, extraneous details, leading to decreased task efficiency. This study investigated the role of naïve realism in decision making for natural resource management, a field that often employs geospatial tools. Data was collected through a GIS user ability test, a questionnaire and direct observation. Forty volunteer expert and non-expert resource managers evaluated the suitability of different sites for a land management scenario. Each participant was tested on two map display treatments containing different levels of realism - a simpler 2D display and a more complex 3D display - to compare task performance. Performance was measured by task accuracy and task completion time. User perceptions and preferences about the displays were also recorded. Display realism had an impact on performance and there were indications naïve realism was present. Users completed tasks significantly faster on the 2D display and many individuals misjudged which display they were most accurate or fastest with. The results are informative for designing information systems containing interactive maps, particularly for resource management applications. The results also suggest that the order displays were presented had a significant effect and may have implications for teaching users map-based tasks.
A lattice model for data display
NASA Technical Reports Server (NTRS)
Hibbard, William L.; Dyer, Charles R.; Paul, Brian E.
1994-01-01
In order to develop a foundation for visualization, we develop lattice models for data objects and displays that focus on the fact that data objects are approximations to mathematical objects and real displays are approximations to ideal displays. These lattice models give us a way to quantize the information content of data and displays and to define conditions on the visualization mappings from data to displays. Mappings satisfy these conditions if and only if they are lattice isomorphisms. We show how to apply this result to scientific data and display models, and discuss how it might be applied to recursively defined data types appropriate for complex information processing.
Choosing colors for map display icons using models of visual search.
Shive, Joshua; Francis, Gregory
2013-04-01
We show how to choose colors for icons on maps to minimize search time using predictions of a model of visual search. The model analyzes digital images of a search target (an icon on a map) and a search display (the map containing the icon) and predicts search time as a function of target-distractor color distinctiveness and target eccentricity. We parameterized the model using data from a visual search task and performed a series of optimization tasks to test the model's ability to choose colors for icons to minimize search time across icons. Map display designs made by this procedure were tested experimentally. In a follow-up experiment, we examined the model's flexibility to assign colors in novel search situations. The model fits human performance, performs well on the optimization tasks, and can choose colors for icons on maps with novel stimuli to minimize search time without requiring additional model parameter fitting. Models of visual search can suggest color choices that produce search time reductions for display icons. Designers should consider constructing visual search models as a low-cost method of evaluating color assignments.
Ma, Chun-Lei; Jin, Ji-Qiang; Li, Chun-Fang; Wang, Rong-Kai; Zheng, Hong-Kun; Yao, Ming-Zhe; Chen, Liang
2015-01-01
Genetic maps are important tools in plant genomics and breeding. The present study reports the large-scale discovery of single nucleotide polymorphisms (SNPs) for genetic map construction in tea plant. We developed a total of 6,042 valid SNP markers using specific-locus amplified fragment sequencing (SLAF-seq), and subsequently mapped them into the previous framework map. The final map contained 6,448 molecular markers, distributing on fifteen linkage groups corresponding to the number of tea plant chromosomes. The total map length was 3,965 cM, with an average inter-locus distance of 1.0 cM. This map is the first SNP-based reference map of tea plant, as well as the most saturated one developed to date. The SNP markers and map resources generated in this study provide a wealth of genetic information that can serve as a foundation for downstream genetic analyses, such as the fine mapping of quantitative trait loci (QTL), map-based cloning, marker-assisted selection, and anchoring of scaffolds to facilitate the process of whole genome sequencing projects for tea plant. PMID:26035838
A Comprehensive Linkage Map of the Dog Genome
Wong, Aaron K.; Ruhe, Alison L.; Dumont, Beth L.; Robertson, Kathryn R.; Guerrero, Giovanna; Shull, Sheila M.; Ziegle, Janet S.; Millon, Lee V.; Broman, Karl W.; Payseur, Bret A.; Neff, Mark W.
2010-01-01
We have leveraged the reference sequence of a boxer to construct the first complete linkage map for the domestic dog. The new map improves access to the dog's unique biology, from human disease counterparts to fascinating evolutionary adaptations. The map was constructed with ∼3000 microsatellite markers developed from the reference sequence. Familial resources afforded 450 mostly phase-known meioses for map assembly. The genotype data supported a framework map with ∼1500 loci. An additional ∼1500 markers served as map validators, contributing modestly to estimates of recombination rate but supporting the framework content. Data from ∼22,000 SNPs informing on a subset of meioses supported map integrity. The sex-averaged map extended 21 M and revealed marked region- and sex-specific differences in recombination rate. The map will enable empiric coverage estimates and multipoint linkage analysis. Knowledge of the variation in recombination rate will also inform on genomewide patterns of linkage disequilibrium (LD), and thus benefit association, selective sweep, and phylogenetic mapping approaches. The computational and wet-bench strategies can be applied to the reference genome of any nonmodel organism to assemble a de novo linkage map. PMID:19966068
How to Display Hazards and other Scientific Data Using Google Maps
NASA Astrophysics Data System (ADS)
Venezky, D. Y.; Fee, J. M.
2007-12-01
The U.S. Geological Survey's (USGS) Volcano Hazard Program (VHP) is launching a map-based interface to display hazards information using the Google® Map API (Application Program Interface). Map-based interfaces provide a synoptic view of data, making patterns easier to detect and allowing users to quickly ascertain where hazards are in relation to major population and infrastructure centers. Several map-based interfaces are now simple to run on a web server, providing ideal platforms for sharing information with colleagues, emergency managers, and the public. There are three main steps to making data accessible on a map-based interface; formatting the input data, plotting the data on the map, and customizing the user interface. The presentation, "Creating Geospatial RSS and ATOM feeds for Map-based Interfaces" (Fee and Venezky, this session), reviews key features for map input data. Join us for this presentation on how to plot data in a geographic context and then format the display with images, custom markers, and links to external data. Examples will show how the VHP Volcano Status Map was created and how to plot a field trip with driving directions.
Hormone-Related Pathways and Risk of Breast Cancer Subtypes in African American Women
Haddad, Stephen A.; Lunetta, Kathryn L.; Ruiz-Narváez, Edward A.; Bensen, Jeannette T.; Hong, Chi-Chen; Sucheston-Campbell, Lara E.; Yao, Song; Bandera, Elisa V.; Rosenberg, Lynn; Haiman, Christopher A.; Troester, Melissa A.; Ambrosone, Christine B.; Palmer, Julie R.
2016-01-01
Purpose We sought to investigate genetic variation in hormone pathways in relation to risk of overall and subtype-specific breast cancer in women of African ancestry (AA). Methods Genotyping and imputation yielded data on 143,934 SNPs in 308 hormone-related genes for 3663 breast cancer cases (1098 ER-, 1983 ER+, 582 ER unknown) and 4687 controls from the African American Breast Cancer Epidemiology and Risk (AMBER) Consortium. AMBER includes data from four large studies of AA women: the Carolina Breast Cancer Study, the Women's Circle of Health Study, the Black Women's Health Study, and the Multiethnic Cohort Study. Pathway- and gene-based analyses were conducted, and single SNP tests were run for the top genes. Results There were no strong associations at the pathway level. The most significantly associated genes were GHRH, CALM2, CETP, and AKR1C1 for overall breast cancer (gene-based nominal p ≤0.01); NR0B1, IGF2R, CALM2, CYP1B1, and GRB2 for ER+ breast cancer (p ≤0.02); and PGR, MAPK3, MAP3K1, and LHCGR for ER- disease (p ≤0.02). Single-SNP tests for SNPs with pairwise linkage disequilibrium r2 <0.8 in the top genes identified 12 common SNPs (in CALM2, CETP, NR0B1, IGF2R, CYP1B1, PGR, MAPK3, and MAP3K1) associated with overall or subtype-specific breast cancer after gene-level correction for multiple testing. Rs11571215 in PGR (progesterone receptor) was the SNP most strongly associated with ER- disease. Conclusion We identified eight genes in hormone pathways that contain common variants associated with breast cancer in AA women after gene-level correction for multiple testing. PMID:26458823
Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations
Truong, Hoa T.; Ramos, A. Marcos; Yalcin, Feyruz; de Ruiter, Marjo; van der Poel, Hein J. A.; Huvenaars, Koen H. J.; Hogers, René C. J.; van Enckevort, Leonora. J. G.; Janssen, Antoine; van Orsouw, Nathalie J.; van Eijk, Michiel J. T.
2012-01-01
Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs will allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. PMID:22662172
Genomic Locus Modulating IOP in the BXD RI Mouse Strains
King, Rebecca; Li, Ying; Wang, Jiaxing; Struebing, Felix L.; Geisert, Eldon E.
2018-01-01
Intraocular pressure (IOP) is the primary risk factor for developing glaucoma, yet little is known about the contribution of genomic background to IOP regulation. The present study leverages an array of systems genetics tools to study genomic factors modulating normal IOP in the mouse. The BXD recombinant inbred (RI) strain set was used to identify genomic loci modulating IOP. We measured the IOP in a total of 506 eyes from 38 different strains. Strain averages were subjected to conventional quantitative trait analysis by means of composite interval mapping. Candidate genes were defined, and immunohistochemistry and quantitative PCR (qPCR) were used for validation. Of the 38 BXD strains examined the mean IOP ranged from a low of 13.2mmHg to a high of 17.1mmHg. The means for each strain were used to calculate a genome wide interval map. One significant quantitative trait locus (QTL) was found on Chr.8 (96 to 103 Mb). Within this 7 Mb region only 4 annotated genes were found: Gm15679, Cdh8, Cdh11 and Gm8730. Only two genes (Cdh8 and Cdh11) were candidates for modulating IOP based on the presence of non-synonymous SNPs. Further examination using SIFT (Sorting Intolerant From Tolerant) analysis revealed that the SNPs in Cdh8 (Cadherin 8) were predicted to not change protein function; while the SNPs in Cdh11 (Cadherin 11) would not be tolerated, affecting protein function. Furthermore, immunohistochemistry demonstrated that CDH11 is expressed in the trabecular meshwork of the mouse. We have examined the genomic regulation of IOP in the BXD RI strain set and found one significant QTL on Chr. 8. Within this QTL, there is one good candidate gene, Cdh11. PMID:29496776
Evaluation of 19 susceptibility loci of breast cancer in women of African ancestry
Huo, Dezheng; Zheng, Yonglan; Ogundiran, Temidayo O.; Adebamowo, Clement; Nathanson, Katherine L.; Domchek, Susan M.; Rebbeck, Timothy R.; Simon, Michael S.; John, Esther M.; Hennis, Anselm; Nemesure, Barbara; Wu, Suh-Yuh; Leske, M.Cristina; Ambs, Stefan; Niu, Qun; Zhang, Jing; Cox, Nancy J.; Olopade, Olufunmilayo I.
2012-01-01
Multiple breast cancer susceptibility loci have been identified in genome-wide association studies (GWAS) in populations of European and Asian ancestry using array chips optimized for populations of European ancestry. It is important to examine whether these loci are associated with breast cancer risk in women of African ancestry. We evaluated 25 single nucleotide polymorphisms (SNPs) at 19 loci in a pooled case–control study of breast cancer, which included 1509 cases and 1383 controls. Cases and controls were enrolled in Nigeria, Barbados and the USA; all women were of African ancestry. We found significant associations for three SNPs, which were in the same direction and of similar magnitude as those reported in previous fine-mapping studies in women of African ancestry. The allelic odds ratios were 1.24 [95% confidence interval (CI): 1.04–1.47; P = 0.018] for the rs2981578-G allele (10q26/FGFR2), 1.34 (95% CI: 1.10–1.63; P = 0.0035) for the rs9397435-G allele (6q25) and 1.12 (95% CI: 1.00–1.25; P = 0.04) for the rs3104793-C allele (16q12). Although a significant association was observed for an additional index SNP (rs3817198), it was in the opposite direction to prior GWAS studies. In conclusion, this study highlights the complexity of applying current GWAS findings across racial/ethnic groups, as none of GWAS-identified index SNPs could be replicated in women of African ancestry. Further fine-mapping studies in women of African ancestry will be needed to reveal additional and causal variants for breast cancer. PMID:22357627
Jayewardene, Avindra F; Mavros, Yorgi; Gwinn, Tom; Hancock, Dale P; Rooney, Kieron B
2016-02-01
Recent studies have shown that CD36 gene variants are associated with an increased prevalence of chronic disease. Although a genetic component to trainability has been proven, no data are available specifically on the influence of CD36 on training response. Two single nucleotide polymorphisms (SNPs) (rs1527479 and rs1984112) were assessed for associations with whole-body substrate oxidation, response to a 75-g dextrose oral glucose tolerance test, fasting plasma lipids, and cardiovascular disease risk factors in a young healthy cohort, both using cross-sectional analysis and following a 4-week endurance-exercise training program. Genotyping was performed using real-time polymerase chain reaction. Cross-sectional data were collected in 34 individuals (age, 22.7 ± 3.5 years), with 17 completing the training program. At baseline, TT SNP carriers at rs1527479 and wild-type GG carriers at rs1984112 were associated with significantly greater whole-body rate of fat oxidation (Fatox) during submaximal exercise (P < 0.05), whilst AA carriers at the same position were associated with elevated triglyceride (TG) levels. A significant genotype × time interaction in Fatox at SNP rs1984112 was identified at rest. Significant genotype × time interactions were present at rs1527479, with TT carriers exhibiting a favourable response to training when compared with C-allele carriers for fasting TG, diastolic blood pressure (DBP), and mean arterial pressure (MAP). In conclusion, cross-sectional assessment identified associations with Fatox and TG. Training response at both SNPs identified "at-risk" genotypes responding favourably to the training stimulus in Fatox, TG, DBP, and MAP. Although these data show potential pleiotropic influence of CD36 SNPs, assessment in a larger cohort is warranted.
Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs.
Laucou, Valérie; Launay, Amandine; Bacilieri, Roberto; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Bérard, Aurélie; Chauveau, Aurélie; de Andrés, Maria Teresa; Hausmann, Ludger; Ibáñez, Javier; Le Paslier, Marie-Christine; Maghradze, David; Martinez-Zapater, José Miguel; Maul, Erika; Ponnaiah, Maharajah; Töpfer, Reinhard; Péros, Jean-Pierre; Boursiquot, Jean-Michel
2018-01-01
Grapevine is a very important crop species that is mainly cultivated worldwide for fruits, wine and juice. Identification of the genetic bases of performance traits through association mapping studies requires a precise knowledge of the available diversity and how this diversity is structured and varies across the whole genome. An 18k SNP genotyping array was evaluated on a panel of Vitis vinifera cultivars and we obtained a data set with no missing values for a total of 10207 SNPs and 783 different genotypes. The average inter-SNP spacing was ~47 kbp, the mean minor allele frequency (MAF) was 0.23 and the genetic diversity in the sample was high (He = 0.32). Fourteen SNPs, chosen from those with the highest MAF values, were sufficient to identify each genotype in the sample. Parentage analysis revealed 118 full parentages and 490 parent-offspring duos, thus confirming the close pedigree relationships within the cultivated grapevine. Structure analyses also confirmed the main divisions due to an eastern-western gradient and human usage (table vs. wine). Using a multivariate approach, we refined the structure and identified a total of eight clusters. Both the genetic diversity (He, 0.26-0.32) and linkage disequilibrium (LD, 28.8-58.2 kbp) varied between clusters. Despite the short span LD, we also identified some non-recombining haplotype blocks that may complicate association mapping. Finally, we performed a genome-wide association study that confirmed previous works and also identified new regions for important performance traits such as acidity. Taken together, all the results contribute to a better knowledge of the genetics of the cultivated grapevine.
Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs
Launay, Amandine; Bacilieri, Roberto; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Bérard, Aurélie; Chauveau, Aurélie; de Andrés, Maria Teresa; Maghradze, David; Maul, Erika; Ponnaiah, Maharajah; Töpfer, Reinhard; Péros, Jean-Pierre; Boursiquot, Jean-Michel
2018-01-01
Grapevine is a very important crop species that is mainly cultivated worldwide for fruits, wine and juice. Identification of the genetic bases of performance traits through association mapping studies requires a precise knowledge of the available diversity and how this diversity is structured and varies across the whole genome. An 18k SNP genotyping array was evaluated on a panel of Vitis vinifera cultivars and we obtained a data set with no missing values for a total of 10207 SNPs and 783 different genotypes. The average inter-SNP spacing was ~47 kbp, the mean minor allele frequency (MAF) was 0.23 and the genetic diversity in the sample was high (He = 0.32). Fourteen SNPs, chosen from those with the highest MAF values, were sufficient to identify each genotype in the sample. Parentage analysis revealed 118 full parentages and 490 parent-offspring duos, thus confirming the close pedigree relationships within the cultivated grapevine. Structure analyses also confirmed the main divisions due to an eastern-western gradient and human usage (table vs. wine). Using a multivariate approach, we refined the structure and identified a total of eight clusters. Both the genetic diversity (He, 0.26–0.32) and linkage disequilibrium (LD, 28.8–58.2 kbp) varied between clusters. Despite the short span LD, we also identified some non-recombining haplotype blocks that may complicate association mapping. Finally, we performed a genome-wide association study that confirmed previous works and also identified new regions for important performance traits such as acidity. Taken together, all the results contribute to a better knowledge of the genetics of the cultivated grapevine. PMID:29420602
Genetic ancestry is associated with colorectal adenomas and adenocarcinomas in Latino populations
Hernandez-Suarez, Gustavo; Sanabria, Maria Carolina; Serrano, Marta; Herran, Oscar F; Perez, Jesus; Plata, Jose L; Zabaleta, Jovanny; Tenesa, Albert
2014-01-01
Colorectal cancer rates in Latin American countries are less than half of those observed in the United States. Latin Americans are the resultant of generations of an admixture of Native American, European, and African individuals. The potential role of genetic admixture in colorectal carcinogenesis has not been examined. We evaluate the association of genetic ancestry with colorectal neoplasms in 190 adenocarcinomas, 113 sporadic adenomas and 243 age- and sex-matched controls enrolled in a multicentric case–control study in Colombia. Individual ancestral genetic fractions were estimated using the STRUCTURE software, based on allele frequencies and assuming three distinct population origins. We used the Illumina Cancer Panel to genotype 1,421 sparse single-nucleotide polymorphisms (SNPs), and Northern and Western European ancestry, LWJ and Han Chinese in Beijing, China populations from the HapMap project as references. A total of 678 autosomal SNPs overlapped with the HapMap data set SNPs and were used for ancestry estimations. African mean ancestry fraction was higher in adenomas (0.13, 95% confidence interval (95% CI)=0.11–0.15) and cancer cases (0.14, 95% CI=0.12–0.16) compared with controls (0.11, 95% CI=0.10–0.12). Conditional logistic regression analysis, controlling for known risk factors, showed a positive association of African ancestry per 10% increase with both colorectal adenoma (odds ratio (OR)=1.12, 95% CI=0.97–1.30) and adenocarcinoma (OR=1.19, 95% CI=1.05–1.35). In conclusion, increased African ancestry (or variants linked to it) contributes to the increased susceptibility of colorectal cancer in admixed Latin American population. PMID:24518838
Antoniou, Antonis C.; Spurdle, Amanda B.; Sinilnikova, Olga M.; Healey, Sue; Pooley, Karen A.; Schmutzler, Rita K.; Versmold, Beatrix; Engel, Christoph; Meindl, Alfons; Arnold, Norbert; Hofmann, Wera; Sutter, Christian; Niederacher, Dieter; Deissler, Helmut; Caldes, Trinidad; Kämpjärvi, Kati; Nevanlinna, Heli; Simard, Jacques; Beesley, Jonathan; Chen, Xiaoqing; Neuhausen, Susan L.; Rebbeck, Timothy R.; Wagner, Theresa; Lynch, Henry T.; Isaacs, Claudine; Weitzel, Jeffrey; Ganz, Patricia A.; Daly, Mary B.; Tomlinson, Gail; Olopade, Olufunmilayo I.; Blum, Joanne L.; Couch, Fergus J.; Peterlongo, Paolo; Manoukian, Siranoush; Barile, Monica; Radice, Paolo; Szabo, Csilla I.; Pereira, Lutecia H. Mateus; Greene, Mark H.; Rennert, Gad; Lejbkowicz, Flavio; Barnett-Griness, Ofra; Andrulis, Irene L.; Ozcelik, Hilmi; Gerdes, Anne-Marie; Caligo, Maria A.; Laitman, Yael; Kaufman, Bella; Milgrom, Roni; Friedman, Eitan; Domchek, Susan M.; Nathanson, Katherine L.; Osorio, Ana; Llort, Gemma; Milne, Roger L.; Benítez, Javier; Hamann, Ute; Hogervorst, Frans B.L.; Manders, Peggy; Ligtenberg, Marjolijn J.L.; van den Ouweland, Ans M.W.; Peock, Susan; Cook, Margaret; Platte, Radka; Evans, D. Gareth; Eeles, Rosalind; Pichert, Gabriella; Chu, Carol; Eccles, Diana; Davidson, Rosemarie; Douglas, Fiona; Godwin, Andrew K.; Barjhoux, Laure; Mazoyer, Sylvie; Sobol, Hagay; Bourdon, Violaine; Eisinger, François; Chompret, Agnès; Capoulade, Corinne; Bressac-de Paillerets, Brigitte; Lenoir, Gilbert M.; Gauthier-Villars, Marion; Houdayer, Claude; Stoppa-Lyonnet, Dominique; Chenevix-Trench, Georgia; Easton, Douglas F.
2008-01-01
Germline mutations in BRCA1 and BRCA2 confer high risks of breast cancer. However, evidence suggests that these risks are modified by other genetic or environmental factors that cluster in families. A recent genome-wide association study has shown that common alleles at single nucleotide polymorphisms (SNPs) in FGFR2 (rs2981582), TNRC9 (rs3803662), and MAP3K1 (rs889312) are associated with increased breast cancer risks in the general population. To investigate whether these loci are also associated with breast cancer risk in BRCA1 and BRCA2 mutation carriers, we genotyped these SNPs in a sample of 10,358 mutation carriers from 23 studies. The minor alleles of SNP rs2981582 and rs889312 were each associated with increased breast cancer risk in BRCA2 mutation carriers (per-allele hazard ratio [HR] = 1.32, 95% CI: 1.20–1.45, ptrend = 1.7 × 10−8 and HR = 1.12, 95% CI: 1.02–1.24, ptrend = 0.02) but not in BRCA1 carriers. rs3803662 was associated with increased breast cancer risk in both BRCA1 and BRCA2 mutation carriers (per-allele HR = 1.13, 95% CI: 1.06–1.20, ptrend = 5 × 10−5 in BRCA1 and BRCA2 combined). These loci appear to interact multiplicatively on breast cancer risk in BRCA2 mutation carriers. The differences in the effects of the FGFR2 and MAP3K1 SNPs between BRCA1 and BRCA2 carriers point to differences in the biology of BRCA1 and BRCA2 breast cancer tumors and confirm the distinct nature of breast cancer in BRCA1 mutation carriers. PMID:18355772
Rice, Treva K; Sarzynski, Mark A; Sung, Yun Ju; Argyropoulos, George; Stütz, Adrian M; Teran-Garcia, Margarita; Rao, D C; Bouchard, Claude; Rankinen, Tuomo
2012-08-01
Although regular exercise improves submaximal aerobic capacity, there is large variability in its response to exercise training. While this variation is thought to be partly due to genetic differences, relatively little is known about the causal genes. Submaximal aerobic capacity traits in the current report include the responses of oxygen consumption (ΔVO(2)60), power output (ΔWORK60), and cardiac output (ΔQ60) at 60% of VO2max to a standardized 20-week endurance exercise training program. Genome-wide linkage analysis in 475 HERITAGE Family Study Caucasians identified a locus on chromosome 13q for ΔVO(2)60 (LOD = 3.11). Follow-up fine mapping involved a dense marker panel of over 1,800 single-nucleotide polymorphisms (SNPs) in a 7.9-Mb region (21.1-29.1 Mb from p-terminus). Single-SNP analyses found 14 SNPs moderately associated with both ΔVO(2)60 at P ≤ 0.005 and the correlated traits of ΔWORK60 and ΔQ60 at P < 0.05. Haplotype analyses provided several strong signals (P < 1.0 × 10(-5)) for ΔVO(2)60. Overall, association analyses narrowed the target region and included potential biological candidate genes (MIPEP and SGCG). Consistent with maximal heritability estimates of 23%, up to 20% of the phenotypic variance in ΔVO(2)60 was accounted for by these SNPs. These results implicate candidate genes on chromosome 13q12 for the ability to improve submaximal exercise capacity in response to regular exercise. Submaximal exercise at 60% of maximal capacity is an exercise intensity that falls well within the range recommended in the Physical Activity Guidelines for Americans and thus has potential public health relevance.
Rice, Treva K.; Sarzynski, Mark A.; Sung, Yun Ju; Argyropoulos, George; Stütz, Adrian M.; Teran-Garcia, Margarita; Rao, D. C.; Bouchard, Claude
2014-01-01
Although regular exercise improves submaximal aerobic capacity, there is large variability in its response to exercise training. While this variation is thought to be partly due to genetic differences, relatively little is known about the causal genes. Submaximal aerobic capacity traits in the current report include the responses of oxygen consumption (ΔVO260), power output (ΔWORK60), and cardiac output (ΔQ60) at 60% of VO2max to a standardized 20-week endurance exercise training program. Genome-wide linkage analysis in 475 HERITAGE Family Study Caucasians identified a locus on chromosome 13q for ΔVO260 (LOD = 3.11). Follow-up fine mapping involved a dense marker panel of over 1,800 single-nucleotide polymorphisms (SNPs) in a 7.9-Mb region (21.1–29.1 Mb from p-terminus). Single-SNP analyses found 14 SNPs moderately associated with both ΔVO260 at P ≤ 0.005 and the correlated traits of ΔWORK60 and ΔQ60 at P < 0.05. Haplotype analyses provided several strong signals (P<1.0 × 10−5) for ΔVO260. Overall, association analyses narrowed the target region and included potential biological candidate genes (MIPEP and SGCG). Consistent with maximal heritability estimates of 23%, up to 20% of the phenotypic variance in ΔVO260 was accounted for by these SNPs. These results implicate candidate genes on chromosome 13q12 for the ability to improve submaximal exercise capacity in response to regular exercise. Submaximal exercise at 60% of maximal capacity is an exercise intensity that falls well within the range recommended in the Physical Activity Guidelines for Americans and thus has potential public health relevance. PMID:22170014
Displaying R spatial statistics on Google dynamic maps with web applications created by Rwui
2012-01-01
Background The R project includes a large variety of packages designed for spatial statistics. Google dynamic maps provide web based access to global maps and satellite imagery. We describe a method for displaying directly the spatial output from an R script on to a Google dynamic map. Methods This is achieved by creating a Java based web application which runs the R script and then displays the results on the dynamic map. In order to make this method easy to implement by those unfamiliar with programming Java based web applications, we have added the method to the options available in the R Web User Interface (Rwui) application. Rwui is an established web application for creating web applications for running R scripts. A feature of Rwui is that all the code for the web application being created is generated automatically so that someone with no knowledge of web programming can make a fully functional web application for running an R script in a matter of minutes. Results Rwui can now be used to create web applications that will display the results from an R script on a Google dynamic map. Results may be displayed as discrete markers and/or as continuous overlays. In addition, users of the web application may select regions of interest on the dynamic map with mouse clicks and the coordinates of the region of interest will automatically be made available for use by the R script. Conclusions This method of displaying R output on dynamic maps is designed to be of use in a number of areas. Firstly it allows statisticians, working in R and developing methods in spatial statistics, to easily visualise the results of applying their methods to real world data. Secondly, it allows researchers who are using R to study health geographics data, to display their results directly onto dynamic maps. Thirdly, by creating a web application for running an R script, a statistician can enable users entirely unfamiliar with R to run R coded statistical analyses of health geographics data. Fourthly, we envisage an educational role for such applications. PMID:22998945
Displaying R spatial statistics on Google dynamic maps with web applications created by Rwui.
Newton, Richard; Deonarine, Andrew; Wernisch, Lorenz
2012-09-24
The R project includes a large variety of packages designed for spatial statistics. Google dynamic maps provide web based access to global maps and satellite imagery. We describe a method for displaying directly the spatial output from an R script on to a Google dynamic map. This is achieved by creating a Java based web application which runs the R script and then displays the results on the dynamic map. In order to make this method easy to implement by those unfamiliar with programming Java based web applications, we have added the method to the options available in the R Web User Interface (Rwui) application. Rwui is an established web application for creating web applications for running R scripts. A feature of Rwui is that all the code for the web application being created is generated automatically so that someone with no knowledge of web programming can make a fully functional web application for running an R script in a matter of minutes. Rwui can now be used to create web applications that will display the results from an R script on a Google dynamic map. Results may be displayed as discrete markers and/or as continuous overlays. In addition, users of the web application may select regions of interest on the dynamic map with mouse clicks and the coordinates of the region of interest will automatically be made available for use by the R script. This method of displaying R output on dynamic maps is designed to be of use in a number of areas. Firstly it allows statisticians, working in R and developing methods in spatial statistics, to easily visualise the results of applying their methods to real world data. Secondly, it allows researchers who are using R to study health geographics data, to display their results directly onto dynamic maps. Thirdly, by creating a web application for running an R script, a statistician can enable users entirely unfamiliar with R to run R coded statistical analyses of health geographics data. Fourthly, we envisage an educational role for such applications.
Population genomics of Pacific lamprey: adaptive variation in a highly dispersive species.
Hess, Jon E; Campbell, Nathan R; Close, David A; Docker, Margaret F; Narum, Shawn R
2013-06-01
Unlike most anadromous fishes that have evolved strict homing behaviour, Pacific lamprey (Entosphenus tridentatus) seem to lack philopatry as evidenced by minimal population structure across the species range. Yet unexplained findings of within-region population genetic heterogeneity coupled with the morphological and behavioural diversity described for the species suggest that adaptive genetic variation underlying fitness traits may be responsible. We employed restriction site-associated DNA sequencing to genotype 4439 quality filtered single nucleotide polymorphism (SNP) loci for 518 individuals collected across a broad geographical area including British Columbia, Washington, Oregon and California. A subset of putatively neutral markers (N = 4068) identified a significant amount of variation among three broad populations: northern British Columbia, Columbia River/southern coast and 'dwarf' adults (F(CT) = 0.02, P ≪ 0.001). Additionally, 162 SNPs were identified as adaptive through outlier tests, and inclusion of these markers revealed a signal of adaptive variation related to geography and life history. The majority of the 162 adaptive SNPs were not independent and formed four groups of linked loci. Analyses with matsam software found that 42 of these outlier SNPs were significantly associated with geography, run timing and dwarf life history, and 27 of these 42 SNPs aligned with known genes or highly conserved genomic regions using the genome browser available for sea lamprey. This study provides both neutral and adaptive context for observed genetic divergence among collections and thus reconciles previous findings of population genetic heterogeneity within a species that displays extensive gene flow. © 2012 John Wiley & Sons Ltd.
ERIC Educational Resources Information Center
Hegarty, Mary; Canham, Matt S.; Fabrikant, Sara I.
2010-01-01
Three experiments examined how bottom-up and top-down processes interact when people view and make inferences from complex visual displays (weather maps). Bottom-up effects of display design were investigated by manipulating the relative visual salience of task-relevant and task-irrelevant information across different maps. Top-down effects of…
SPSmart: adapting population based SNP genotype databases for fast and comprehensive web access.
Amigo, Jorge; Salas, Antonio; Phillips, Christopher; Carracedo, Angel
2008-10-10
In the last five years large online resources of human variability have appeared, notably HapMap, Perlegen and the CEPH foundation. These databases of genotypes with population information act as catalogues of human diversity, and are widely used as reference sources for population genetics studies. Although many useful conclusions may be extracted by querying databases individually, the lack of flexibility for combining data from within and between each database does not allow the calculation of key population variability statistics. We have developed a novel tool for accessing and combining large-scale genomic databases of single nucleotide polymorphisms (SNPs) in widespread use in human population genetics: SPSmart (SNPs for Population Studies). A fast pipeline creates and maintains a data mart from the most commonly accessed databases of genotypes containing population information: data is mined, summarized into the standard statistical reference indices, and stored into a relational database that currently handles as many as 4 x 10(9) genotypes and that can be easily extended to new database initiatives. We have also built a web interface to the data mart that allows the browsing of underlying data indexed by population and the combining of populations, allowing intuitive and straightforward comparison of population groups. All the information served is optimized for web display, and most of the computations are already pre-processed in the data mart to speed up the data browsing and any computational treatment requested. In practice, SPSmart allows populations to be combined into user-defined groups, while multiple databases can be accessed and compared in a few simple steps from a single query. It performs the queries rapidly and gives straightforward graphical summaries of SNP population variability through visual inspection of allele frequencies outlined in standard pie-chart format. In addition, full numerical description of the data is output in statistical results panels that include common population genetics metrics such as heterozygosity, Fst and In.
Jin, Peng; Andiappan, Anand Kumar; Quek, Jia Min; Lee, Bernett; Au, Bijin; Sio, Yang Yie; Irwanto, Astrid; Schurmann, Claudia; Grabe, Hans Jörgen; Suri, Bani Kaur; Matta, Sri Anusha; Westra, Harm-Jan; Franke, Lude; Esko, Tonu; Sun, Liangdan; Zhang, Xuejun; Liu, Hong; Zhang, Furen; Larbi, Anis; Xu, Xin; Poidinger, Michael; Liu, Jianjun; Chew, Fook Tim; Rotzschke, Olaf; Shi, Li; Wang, De Yun
2015-06-01
Brain-derived neurotrophic factor (BDNF) is a secretory protein that has been implicated in the pathogenesis of allergic rhinitis (AR), atopic asthma, and eczema, but it is currently unknown whether BDNF polymorphisms influence susceptibility to moderate-to-severe AR. We sought to identify disease associations and the functional effect of BDNF genetic variants in patients with moderate-to-severe AR. Tagging single nucleotide polymorphisms (SNPs) spanning the BDNF gene were selected from the human HapMap Han Chinese from Beijing (CHB) data set, and associations with moderate-to-severe AR were assessed in 2 independent cohorts of Chinese patients (2216 from Shandong province and 1239 living in Singapore). The functional effects of the BDNF genetic variants were determined by using both in vitro and ex vivo assays. The tagging SNP rs10767664 was significantly associated with the risk of moderate-to-severe AR in both Singapore Chinese (P = .0017; odds ratio, 1.324) and Shandong Chinese populations (P = .039; odds ratio, 1.180). The coding nonsynonymous SNP rs6265 was in perfect linkage with rs10767664 and conferred increased BDNF protein secretion by a human cell line in vitro. Subjects bearing the AA genotype of rs10767664 exhibited increased risk of moderate-to-severe AR and displayed increased BDNF protein and total IgE levels in plasma. Using a large-scale expression quantitative trait locus study, we demonstrated that BDNF SNPs are significantly associated with altered BDNF concentrations in peripheral blood. A common genetic variant of the BDNF gene is associated with increased risk of moderate-to-severe AR, and the AA genotype is associated with increased BDNF mRNA levels in peripheral blood. Together, these data indicate that functional BDNF gene variants increase the risk of moderate-to-severe AR. Copyright © 2015 American Academy of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Vogler, Amy J.; Andrianaivoarimanana, Voahangy; Telfer, Sandra; Hall, Carina M.; Sahl, Jason W.; Hepp, Crystal M.; Centner, Heather; Andersen, Genevieve; Birdsell, Dawn N.; Rahalison, Lila; Nottingham, Roxanne; Keim, Paul; Rajerison, Minoarisoa
2017-01-01
Background Yersinia pestis appears to be maintained in multiple, geographically separate, and phylogenetically distinct subpopulations within the highlands of Madagascar. However, the dynamics of these locally differentiated subpopulations through time are mostly unknown. To address that gap and further inform our understanding of plague epidemiology, we investigated the phylogeography of Y. pestis in Madagascar over an 18 year period. Methodology/Principal findings We generated whole genome sequences for 31 strains and discovered new SNPs that we used in conjunction with previously identified SNPs and variable-number tandem repeats (VNTRs) to genotype 773 Malagasy Y. pestis samples from 1995 to 2012. We mapped the locations where samples were obtained on a fine geographic scale to examine phylogeographic patterns through time. We identified 18 geographically separate and phylogenetically distinct subpopulations that display spatial and temporal stability, persisting in the same locations over a period of almost two decades. We found that geographic areas with higher levels of topographical relief are associated with greater levels of phylogenetic diversity and that sampling frequency can vary considerably among subpopulations and from year to year. We also found evidence of various Y. pestis dispersal events, including over long distances, but no evidence that any dispersal events resulted in successful establishment of a transferred genotype in a new location during the examined time period. Conclusions/Significance Our analysis suggests that persistent endemic cycles of Y. pestis transmission within local areas are responsible for the long term maintenance of plague in Madagascar, rather than repeated episodes of wide scale epidemic spread. Landscape likely plays a role in maintaining Y. pestis subpopulations in Madagascar, with increased topographical relief associated with increased levels of localized differentiation. Local ecological factors likely affect the dynamics of individual subpopulations and the associated likelihood of observing human plague cases in a given year in a particular location. PMID:28873412
Vogler, Amy J; Andrianaivoarimanana, Voahangy; Telfer, Sandra; Hall, Carina M; Sahl, Jason W; Hepp, Crystal M; Centner, Heather; Andersen, Genevieve; Birdsell, Dawn N; Rahalison, Lila; Nottingham, Roxanne; Keim, Paul; Wagner, David M; Rajerison, Minoarisoa
2017-09-01
Yersinia pestis appears to be maintained in multiple, geographically separate, and phylogenetically distinct subpopulations within the highlands of Madagascar. However, the dynamics of these locally differentiated subpopulations through time are mostly unknown. To address that gap and further inform our understanding of plague epidemiology, we investigated the phylogeography of Y. pestis in Madagascar over an 18 year period. We generated whole genome sequences for 31 strains and discovered new SNPs that we used in conjunction with previously identified SNPs and variable-number tandem repeats (VNTRs) to genotype 773 Malagasy Y. pestis samples from 1995 to 2012. We mapped the locations where samples were obtained on a fine geographic scale to examine phylogeographic patterns through time. We identified 18 geographically separate and phylogenetically distinct subpopulations that display spatial and temporal stability, persisting in the same locations over a period of almost two decades. We found that geographic areas with higher levels of topographical relief are associated with greater levels of phylogenetic diversity and that sampling frequency can vary considerably among subpopulations and from year to year. We also found evidence of various Y. pestis dispersal events, including over long distances, but no evidence that any dispersal events resulted in successful establishment of a transferred genotype in a new location during the examined time period. Our analysis suggests that persistent endemic cycles of Y. pestis transmission within local areas are responsible for the long term maintenance of plague in Madagascar, rather than repeated episodes of wide scale epidemic spread. Landscape likely plays a role in maintaining Y. pestis subpopulations in Madagascar, with increased topographical relief associated with increased levels of localized differentiation. Local ecological factors likely affect the dynamics of individual subpopulations and the associated likelihood of observing human plague cases in a given year in a particular location.
Natarajan, Sathishkumar; Kim, Hoy-Taek; Thamilarasan, Senthil Kumar; Veerappan, Karpagam; Park, Jong-In; Nou, Ill-Sup
2016-01-01
Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.
Verma, Shefali S.; Hall, Molly A.; Goodloe, Robert J.; Berg, Richard L.; Carrell, Dave S.; Carlson, Christopher S.; Chen, Lin; Crosslin, David R.; Denny, Joshua C.; Jarvik, Gail; Li, Rongling; Linneman, James G.; Pathak, Jyoti; Peissig, Peggy; Rasmussen, Luke V.; Ramirez, Andrea H.; Wang, Xiaoming; Wilke, Russell A.; Wolf, Wendy A.; Torstenson, Eric S.; Turner, Stephen D.; McCarty, Catherine A.
2014-01-01
Purpose Cataract is the leading cause of blindness in the world, and in the United States accounts for approximately 60% of Medicare costs related to vision. The purpose of this study was to identify genetic markers for age-related cataract through a genome-wide association study (GWAS). Methods In the electronic medical records and genomics (eMERGE) network, we ran an electronic phenotyping algorithm on individuals in each of five sites with electronic medical records linked to DNA biobanks. We performed a GWAS using 530,101 SNPs from the Illumina 660W-Quad in a total of 7,397 individuals (5,503 cases and 1,894 controls). We also performed an age-at-diagnosis case-only analysis. Results We identified several statistically significant associations with age-related cataract (45 SNPs) as well as age at diagnosis (44 SNPs). The 45 SNPs associated with cataract at p<1×10−5 are in several interesting genes, including ALDOB, MAP3K1, and MEF2C. All have potential biologic relationships with cataracts. Conclusions This is the first genome-wide association study of age-related cataract, and several regions of interest have been identified. The eMERGE network has pioneered the exploration of genomic associations in biobanks linked to electronic health records, and this study is another example of the utility of such resources. Explorations of age-related cataract including validation and replication of the association results identified herein are needed in future studies. PMID:25352737
Yahya, Padillah; Sulong, Sarina; Harun, Azian; Wan Isa, Hatin; Ab Rajab, Nur-Shafawati; Wangkumhang, Pongsakorn; Wilantho, Alisa; Ngamphiw, Chumpol; Tongsima, Sissades; Zilfalil, Bin Alwi
2017-09-01
Malay, the main ethnic group in Peninsular Malaysia, is represented by various sub-ethnic groups such as Melayu Banjar, Melayu Bugis, Melayu Champa, Melayu Java, Melayu Kedah Melayu Kelantan, Melayu Minang and Melayu Patani. Using data retrieved from the MyHVP (Malaysian Human Variome Project) database, a total of 135 individuals from these sub-ethnic groups were profiled using the Affymetrix GeneChip Mapping Xba 50-K single nucleotide polymorphism (SNP) array to identify SNPs that were ancestry-informative markers (AIMs) for Malays of Peninsular Malaysia. Prior to selecting the AIMs, the genetic structure of Malays was explored with reference to 11 other populations obtained from the Pan-Asian SNP Consortium database using principal component analysis (PCA) and ADMIXTURE. Iterative pruning principal component analysis (ipPCA) was further used to identify sub-groups of Malays. Subsequently, we constructed an AIMs panel for Malays using the informativeness for assignment (I n ) of genetic markers, and the K-nearest neighbor classifier (KNN) was used to teach the classification models. A model of 250 SNPs ranked by I n , correctly classified Malay individuals with an accuracy of up to 90%. The identified panel of SNPs could be utilized as a panel of AIMs to ascertain the specific ancestry of Malays, which may be useful in disease association studies, biomedical research or forensic investigation purposes. Copyright © 2017 Elsevier B.V. All rights reserved.
Hodoğlugil, Uğur; Williamson, David W.; Yu, Yi; Farrer, Lindsay A.; Mahley, Robert W.
2011-01-01
Summary We narrowed chromosome 15q21-23 linkage to plasma high density lipoprotein cholesterol (HDL-C) levels in atherogenic dyslipidemic Turkish families by fine mapping, then focused on glucuronic acid epimerase (GLCE), a heparan sulfate proteoglycan (HSPG) biosynthesis enzyme. HSPGs participate in lipid metabolism along with apolipoprotein (apo) E. Of 31 SNPs in the GLCE locus, nine analyzed by haplotype were associated with plasma HDL-C and triglyceride levels (permuted p = 0.006 and 0.013, respectively) in families. Of five tagging GLCE SNPs in two cohorts of unrelated subjects, three (rs16952868, rs11631403, rs3865014) were associated with triglyceride and HDL-C levels in males (non-permuted p < 0.05). The association was stronger in APOE 2/3 subjects (apoE2 has reduced binding to HSPGs) and reached multiple-testing significance (p < 0.05) in both males and females (n = 2612). Similar results were obtained in the second cohort (n = 1164). Interestingly, at the GLCE locus, bounded by recombination hotspots, Turks had a minor allele frequency of SNPs resembling Chinese more than European ancestry; adjoining regions on chromosome 15 resembled the European pattern. Studies of glce+/–apoe–/– mice fed a chow or high-fat diet supported a role for GLCE in lipid metabolism. Thus, SNPs in GLCE are associated with triglyceride and HDL-C levels in Turks, and mouse studies support a role for glce in lipid metabolism. PMID:21488854
Patterns of population differentiation of candidate genes for cardiovascular disease.
Kullo, Iftikhar J; Ding, Keyue
2007-07-12
The basis for ethnic differences in cardiovascular disease (CVD) susceptibility is not fully understood. We investigated patterns of population differentiation (FST) of a set of genes in etiologic pathways of CVD among 3 ethnic groups: Yoruba in Nigeria (YRI), Utah residents with European ancestry (CEU), and Han Chinese (CHB) + Japanese (JPT). We identified 37 pathways implicated in CVD based on the PANTHER classification and 416 genes in these pathways were further studied; these genes belonged to 6 biological processes (apoptosis, blood circulation and gas exchange, blood clotting, homeostasis, immune response, and lipoprotein metabolism). Genotype data were obtained from the HapMap database. We calculated FST for 15,559 common SNPs (minor allele frequency > or = 0.10 in at least one population) in genes that co-segregated among the populations, as well as an average-weighted FST for each gene. SNPs were classified as putatively functional (non-synonymous and untranslated regions) or non-functional (intronic and synonymous sites). Mean FST values for common putatively functional variants were significantly higher than FST values for nonfunctional variants. A significant variation in FST was also seen based on biological processes; the processes of 'apoptosis' and 'lipoprotein metabolism' showed an excess of genes with high FST. Thus, putative functional SNPs in genes in etiologic pathways for CVD show greater population differentiation than non-functional SNPs and a significant variance of FST values was noted among pairwise population comparisons for different biological processes. These results suggest a possible basis for varying susceptibility to CVD among ethnic groups.
Genetic variation in cell death genes and risk of non-Hodgkin lymphoma.
Schuetz, Johanna M; Daley, Denise; Graham, Jinko; Berry, Brian R; Gallagher, Richard P; Connors, Joseph M; Gascoyne, Randy D; Spinelli, John J; Brooks-Wilson, Angela R
2012-01-01
Non-Hodgkin lymphomas are a heterogeneous group of solid tumours that constitute the 5(th) highest cause of cancer mortality in the United States and Canada. Poor control of cell death in lymphocytes can lead to autoimmune disease or cancer, making genes involved in programmed cell death of lymphocytes logical candidate genes for lymphoma susceptibility. We tested for genetic association with NHL and NHL subtypes, of SNPs in lymphocyte cell death genes using an established population-based study. 17 candidate genes were chosen based on biological function, with 123 SNPs tested. These included tagSNPs from HapMap and novel SNPs discovered by re-sequencing 47 cases in genes for which SNP representation was judged to be low. The main analysis, which estimated odds ratios by fitting data to an additive logistic regression model, used European ancestry samples that passed quality control measures (569 cases and 547 controls). A two-tiered approach for multiple testing correction was used: correction for number of tests within each gene by permutation-based methodology, followed by correction for the number of genes tested using the false discovery rate. Variant rs928883, near miR-155, showed an association (OR per A-allele: 2.80 [95% CI: 1.63-4.82]; p(F) = 0.027) with marginal zone lymphoma that is significant after correction for multiple testing. This is the first reported association between a germline polymorphism at a miRNA locus and lymphoma.
Lai, Fang-Nong; Zhai, Hong-Li; Cheng, Ming; Ma, Jun-Yu; Cheng, Shun-Feng; Ge, Wei; Zhang, Guo-Liang; Wang, Jun-Jie; Zhang, Rui-Qian; Wang, Xue; Min, Ling-Jiang; Song, Jiu-Zhou; Shen, Wei
2016-01-01
Dairy goats are one of the most utilized domesticated animals in China. Here, we selected extreme populations based on differential fecundity in two Laoshan dairy goat populations. Utilizing deep sequencing we have generated 68.7 and 57.8 giga base of sequencing data, and identified 12,458,711 and 12,423,128 SNPs in the low fecundity and high fecundity groups, respectively. Following selective sweep analyses, a number of loci and candidate genes in the two populations were scanned independently. The reproduction related genes CCNB2, AR, ADCY1, DNMT3B, SMAD2, AMHR2, ERBB2, FGFR1, MAP3K12 and THEM4 were specifically selected in the high fecundity group whereas KDM6A, TENM1, SWI5 and CYM were specifically selected in the low fecundity group. A sub-set of genes including SYCP2, SOX5 and POU3F4 were localized both in the high and low fecundity selection windows, suggesting that these particular genes experienced strong selection with lower genetic diversity. From the genome data, the rare nonsense mutations may not contribute to fecundity, whereas nonsynonymous SNPs likely play a predominant role. The nonsynonymous exonic SNPs in SETDB2 and CDH26 which were co-localized in the selected region may take part in fecundity traits. These observations bring us a new insights into the genetic variation influencing fecundity traits within dairy goats. PMID:27905513
Genome-wide scan in Hispanics highlights candidate loci for brain white matter hyperintensities
Beecham, Ashley; Dong, Chuanhui; Wright, Clinton B.; Dueker, Nicole; Brickman, Adam M.; Wang, Liyong; DeCarli, Charles; Blanton, Susan H.; Rundek, Tatjana; Mayeux, Richard
2017-01-01
Objective: To investigate genetic variants influencing white matter hyperintensities (WMHs) in the understudied Hispanic population. Methods: Using 6.8 million single nucleotide polymorphisms (SNPs), we conducted a genome-wide association study (GWAS) to identify SNPs associated with WMH volume (WMHV) in 922 Hispanics who underwent brain MRI as a cross-section of 2 community-based cohorts in the Northern Manhattan Study and the Washington Heights–Inwood Columbia Aging Project. Multiple linear modeling with PLINK was performed to examine the additive genetic effects on ln(WMHV) after controlling for age, sex, total intracranial volume, and principal components of ancestry. Gene-based tests of association were performed using VEGAS. Replication was performed in independent samples of Europeans, African Americans, and Asians. Results: From the SNP analysis, a total of 17 independent SNPs in 7 genes had suggestive evidence of association with WMHV in Hispanics (p < 1 × 10−5) and 5 genes from the gene-based analysis with p < 1 × 10−3. One SNP (rs9957475 in GATA6) and 1 gene (UBE2C) demonstrated evidence of association (p < 0.05) in the African American sample. Four SNPs with p < 1 × 10−5 were shown to affect binding of SPI1 using RegulomeDB. Conclusions: This GWAS of 2 community-based Hispanic cohorts revealed several novel WMH-associated genetic variants. Further replication is needed in independent Hispanic samples to validate these suggestive associations, and fine mapping is needed to pinpoint causal variants. PMID:28975155
Turcotte, Michelle; Abadi, Arkan; Peralta-Romero, Jesus; Suarez, Fernando; Reddon, Hudson; Gomez-Zamudio, Jaime; Burguete-Garcia, Ana I; Cruz, Miguel; Meyre, David
2018-03-13
The prevalence of abdominal obesity in Mexican children has risen dramatically in the past decade. Genome-wide association studies (GWAS) for waist-to-hip ratio (WHR) performed predominantly in European descent adult populations have identified multiple single-nucleotide polymorphisms (SNPs) with larger effects in women. The contribution of these SNPs to WHR in non-European children is unknown. Mexican children and adolescents (N = 1421, 5-17 years) were recruited in Mexico City. Twelve GWAS SNPs were genotyped using TaqMan Open Array and analyzed individually and as a gene score (GS). Mexican boys and girls displayed 2.81 ± 0.29 and 3.10 ± 0.31 WHR standard deviations higher than children and adolescents from the United States. WHR was positively associated with TG (β = 0.733 ± 0.190, P = 1.1 × 10 -4 ) and LDL-C (β = 0.491 ± 0.203, P = 1.6 × 10 -2 ), and negatively associated with HDL-C (β = -0.652 ± 0.195, P = 8.0 × 10 -4 ), independently of body mass index. The effect allele frequency (EAF) of 8 of 12 (67%) SNPs differed significantly (P < 4.17 × 10 -3 ) in Mexican children and European adults, with no evidence of effect allele enrichment in both populations (4 depleted and 4 enriched; binomial test, P = 1). Ten out of 12 SNPs (83.3%) had effects that were directionally consistent with those reported in GWAS (P = 0.04). HOXC13 rs1443512 displayed the best fit when modeled recessively, and was significantly associated with WHR under a recessive mode of inheritance (β = 0.140 ± 0.06, P = 2.3 × 10 -2 ). Significant interactions with sex were also observed for HOXC13 rs1443512 and the GS on WHR (P = 2.2 × 10 -2 and 1.2 × 10 -2 , respectively). HOXC13 rs1443512 (β = 0.022 ± 0.012, P = 4.7 × 10 -2 ) and the GS (β = 0.007 ± 0.003, P = 7.0 × 10 -3 ) were significantly associated with WHR in girls only. This study demonstrates that Mexican children are at high risk for abdominal obesity and detrimental lipid profiles. Our data support a partial transferability of sex-specific European GWAS WHR association signals in children and adolescents from the admixed Mexican population.
Interleukin-10 family cytokines pathway: genetic variants and psoriasis.
Galimova, E; Rätsep, R; Traks, T; Kingo, K; Escott-Price, V; Kõks, S
2017-06-01
Interleukin (IL)-10 family cytokines IL-10, IL-19, IL-20 and IL-24 have been implicated in autoimmune diseases and we have previously reported that genetic variants in the IL10 gene cluster were associated with psoriasis. To analyse the relationship between genetic polymorphisms in the IL10 gene cluster and psoriasis. This study also explores whether there are gene-gene interactions among these genetic polymorphisms. A total of 377 patients with psoriasis and 403 matched healthy controls were enrolled to carry out a case-control study for 48 single-nucleotide polymorphisms (SNPs) of the IL10 gene cluster. Genotyping for the SNPs was conducted on the Applied Biosystems 3730 DNA Analyzer using SNPlex ® technology. Generalized multifactor dimensionality reduction (GMDR) analysis was applied to discover a likely gene-gene interaction model among the SNPs. The results showed that the allele distributions of IL10 gene cluster SNPs are significantly different between the case and control groups. Carriers of the IL10 T allele (rs1554286) and the IL20 T allele (rs1400986) conferred protection from psoriasis [odds ratio (OR) = 0·63, corrected P-value (Pc) = 0·007; OR = 0·62, Pc = 0·038, respectively]. GMDR analysis displayed a significant gene-gene interaction between IL10 (rs1554286) and IL20 (rs1518108) variants. The strongest protective effect was found with the block 1 haplotype ACATA in the IL10 gene (Pc = 0·004). This study presents a novel finding that the combination of the two SNPs, IL10 (rs1554286) and IL20 (rs1518108), is associated with a reduced risk of psoriasis. Our results indicate that genetic variants of the immunomodulatory IL10 and IL20 genes may offer a protective effect in Europeans from Russia. Independent studies are required to verify the results and find a possible functional explanation. © 2017 British Association of Dermatologists.
Sahiner, U M; Semic-Jusufagic, A; Curtin, J A; Birben, E; Belgrave, D; Sackesen, C; Simpson, A; Yavuz, T S; Akdis, C A; Custovic, A; Kalayci, O
2014-12-01
Genetic variants in endotoxin signaling pathway are important in modulating the effect of environmental endotoxin on asthma and atopic phenotypes. Our objective was to determine the single nucleotide polymorphisms (SNPs) in the endotoxin signaling pathway that may influence in vitro IgE synthesis and to investigate the relationship between these variants and endotoxin exposure in relation to the development of asthma and atopy in a birth cohort. Peripheral blood mononuclear cells from 45 children with asthma were stimulated with 2 and 200 ng/ml lipopolysaccharide in vitro and IgE was measured in the culture supernatants. Children were genotyped for 121 SNPs from 30 genes in the endotoxin signaling pathway. Variants with a dose-response IgE production in relation to lipopolysaccharide (LPS) were selected for replication in a population-based birth cohort, in which we investigated the interaction between these SNPs and endotoxin exposure in relation to airway hyper-responsiveness, wheeze, and atopic sensitization. Twenty-one SNPs in nine genes (CD14, TLR4, IRF3, TRAF-6, TIRAP, TRIF, IKK-1, ST-2, SOCS1) were found to modulate the effect of endotoxin on in vitro IgE synthesis, with six displaying high linkage disequilibrium. Of the remaining 15 SNPs, for seven we found significant relationships between genotype and endotoxin exposure in the genetic association study in relation to symptomatic airway hyper-responsiveness (CD14-rs2915863 and rs2569191, TRIF-rs4807000), current wheeze (ST-2-rs17639215, IKK-1-rs2230804, and TRIF-rs4807000), and atopy (CD14-rs2915863 and rs2569192, TRAF-6-rs5030411, and IKK-1-rs2230804). Variants in the endotoxin signaling pathway are important determinants of asthma and atopy. The genotype effect is a function of the environmental endotoxin exposure. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.
Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos
2015-08-01
Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Sillén, Anna; Brohede, Jesper; Forsell, Charlotte; Lilius, Lena; Andrade, Jorge; Odeberg, Jacob; Kimura, Toru; Winblad, Bengt; Graff, Caroline
2011-01-01
We have previously reported the results of an extended genome-wide scan of Swedish Alzheimer disease (AD)-affected families; in this paper, we analyzed a subset of these families with autopsy-confirmed AD. We report the fine-mapping, using both microsatellite markers and single-nucleotide polymorphisms (SNPs), in the observed maximum logarithm of the odds (LOD)-2 unit (LOD(max)-2) region under the identified linkage peak, linkage analysis of the fine-mapping data with additionally analyzed pedigrees, and association analysis of SNPs selected from candidate genes in the linked interval. The subset was made on the criterion of at least one autopsy-confirmed AD case per family, resulting in 24 families. Linkage analysis of a family subset having at least one autopsy-confirmed AD case showed a significant nonparametric single-point LOD score of 4.4 in 8q24. Fine-mapping under the linkage peak with 10 microsatellite markers yielded an increase in the multipoint (mpt) LOD score from 2.1 to 3.0. SNP genotyping was performed on 21 selected candidate transcripts of the LOD(max)-2 region. Both family-based association and linkage analysis were performed on extended material from 30 families, resulting in a suggestive linkage at peak marker rs6577853 (mpt LOD score = 2.4). The 8q24 region has been implicated to be involved in AD etiology. Copyright © 2011 S. Karger AG, Basel.
Hierarchical tone mapping for high dynamic range image visualization
NASA Astrophysics Data System (ADS)
Qiu, Guoping; Duan, Jiang
2005-07-01
In this paper, we present a computationally efficient, practically easy to use tone mapping techniques for the visualization of high dynamic range (HDR) images in low dynamic range (LDR) reproduction devices. The new method, termed hierarchical nonlinear linear (HNL) tone-mapping operator maps the pixels in two hierarchical steps. The first step allocates appropriate numbers of LDR display levels to different HDR intensity intervals according to the pixel densities of the intervals. The second step linearly maps the HDR intensity intervals to theirs allocated LDR display levels. In the developed HNL scheme, the assignment of LDR display levels to HDR intensity intervals is controlled by a very simple and flexible formula with a single adjustable parameter. We also show that our new operators can be used for the effective enhancement of ordinary images.
Melo, Thaise P; Takada, Luciana; Baldi, Fernando; Oliveira, Henrique N; Dias, Marina M; Neves, Haroldo H R; Schenkel, Flavio S; Albuquerque, Lucia G; Carvalheiro, Roberto
2016-06-21
QTL mapping through genome-wide association studies (GWAS) is challenging, especially in the case of low heritability complex traits and when few animals possess genotypic and phenotypic information. When most of the phenotypic information is from non-genotyped animals, GWAS can be performed using the weighted single-step GBLUP (WssGBLUP) method, which permits to combine all available information, even that of non-genotyped animals. However, it is not clear to what extent phenotypic information from non-genotyped animals increases the power of QTL detection, and whether factors such as the extent of linkage disequilibrium (LD) in the population and weighting SNPs in WssGBLUP affect the importance of using information from non-genotyped animals in GWAS. These questions were investigated in this study using real and simulated data. Analysis of real data showed that the use of phenotypes of non-genotyped animals affected SNP effect estimates and, consequently, QTL mapping. Despite some coincidence, the most important genomic regions identified by the analyses, either using or ignoring phenotypes of non-genotyped animals, were not the same. The simulation results indicated that the inclusion of all available phenotypic information, even that of non-genotyped animals, tends to improve QTL detection for low heritability complex traits. For populations with low levels of LD, this trend of improvement was less pronounced. Stronger shrinkage on SNPs explaining lower variance was not necessarily associated with better QTL mapping. The use of phenotypic information from non-genotyped animals in GWAS may improve the ability to detect QTL for low heritability complex traits, especially in populations in which the level of LD is high.
Kumar, Sachin; Knox, Ron E.; Singh, Asheesh K.; DePauw, Ron M.; Campbell, Heather L.; Isidro-Sanchez, Julio; Clarke, Fran R.; Pozniak, Curtis J.; N’Daye, Amidou; Meyer, Brad; Sharpe, Andrew; Ruan, Yuefeng; Cuthbert, Richard D.; Somers, Daryl; Fedak, George
2018-01-01
Loose smut, caused by Ustilago tritici (Pers.) Rostr., is a systemic disease of tetraploid durum wheat (Triticum turgidum L.). Loose smut can be economically controlled by growing resistant varieties, making it important to find and deploy new sources of resistance. Blackbird, a variety of T. turgidum L. subsp. carthlicum (Nevski) A. Love & D. Love, carries a high level of resistance to loose smut. Blackbird was crossed with the loose smut susceptible durum cultivar Strongfield to produce a doubled haploid (DH) mapping population. The parents and progenies were inoculated with U. tritici races T26, T32 and T33 individually and as a mixture at Swift Current, Canada in 2011 and 2012 and loose smut incidence (LSI) was assessed. Genotyping of the DH population and parents using an Infinium iSelect 90K single nucleotide polymorphism (SNP) array identified 12,952 polymorphic SNPs. The SNPs and 426 SSRs (previously genotyped in the same population) were mapped to 16 linkage groups spanning 3008.4 cM at an average inter-marker space of 0.2 cM in a high-density genetic map. Composite interval mapping analysis revealed three significant quantitative trait loci (QTL) for loose smut resistance on chromosomes 3A, 6B and 7A. The loose smut resistance QTL on 6B (QUt.spa-6B.2) and 7A (QUt.spa-7A.2) were derived from Blackbird. Strongfield contributed the minor QTL on 3A (QUt.spa-3A.2). The resistance on 6B was a stable major QTL effective against all individual races and the mixture of the three races; it explained up to 74% of the phenotypic variation. This study is the first attempt in durum wheat to identify and map loose smut resistance QTL using a high-density genetic map. The QTL QUt.spa-6B.2 would be an effective source for breeding resistance to multiple races of the loose smut pathogen because it provides near-complete broad resistance to the predominant virulence on the Canadian prairies. PMID:29485999