Sample records for multiple SNP variations

  1. Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

    PubMed Central

    2011-01-01

    Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. PMID:22082336

  2. Dynamic variable selection in SNP genotype autocalling from APEX microarray data.

    PubMed

    Podder, Mohua; Welch, William J; Zamar, Ruben H; Tebbutt, Scott J

    2006-11-30

    Single nucleotide polymorphisms (SNPs) are DNA sequence variations, occurring when a single nucleotide--adenine (A), thymine (T), cytosine (C) or guanine (G)--is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX). This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart) is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA) using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU) of St. Paul's Hospital (plus one negative PCR control sample). Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. 
Our model-based genotype calling algorithm captures the redundancy in the system considering all the underlying probe features of a particular SNP, automatically down-weighting any 'bad data' corresponding to image artifacts on the microarray slide or failure of a specific chemistry. In this regard, our method is able to automatically select the probes which work well and reduce the effect of other so-called bad performing probes in a sample-specific manner, for any number of SNPs.
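The trade-off between call rate and concordance that the abstract above reports when the posterior-probability threshold is adjusted can be illustrated with a minimal Python sketch. The function names and the toy posteriors are hypothetical and are not taken from the authors' software; only the idea of withholding a call when the best posterior falls below a threshold comes from the abstract.

```python
from typing import Optional


def call_genotype(posteriors: dict, threshold: float) -> Optional[str]:
    """Return the most probable genotype, or None (no call) if its
    posterior probability is below the threshold."""
    best = max(posteriors, key=posteriors.get)
    return best if posteriors[best] >= threshold else None


def call_rate_and_concordance(samples, threshold):
    """samples: list of (posteriors, validated_genotype) pairs.
    Call rate = fraction of samples that receive a call.
    Concordance = fraction of made calls that match the validated genotype."""
    calls = [(call_genotype(p, threshold), truth) for p, truth in samples]
    made = [(c, t) for c, t in calls if c is not None]
    call_rate = len(made) / len(calls)
    concordance = sum(c == t for c, t in made) / len(made) if made else float("nan")
    return call_rate, concordance


# Made-up posteriors for one SNP in four samples (illustration only).
toy = [
    ({"AA": 0.98, "AB": 0.01, "BB": 0.01}, "AA"),
    ({"AA": 0.02, "AB": 0.95, "BB": 0.03}, "AB"),
    ({"AA": 0.55, "AB": 0.40, "BB": 0.05}, "AB"),  # ambiguous and miscalled
    ({"AA": 0.01, "AB": 0.04, "BB": 0.95}, "BB"),
]

low = call_rate_and_concordance(toy, threshold=0.5)
high = call_rate_and_concordance(toy, threshold=0.9)
```

In this toy data, raising the threshold withholds the ambiguous call, so the call rate falls while concordance rises, which is the same direction of change the abstract describes.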

  3. SNP-based association analysis for seedling traits in durum wheat (Triticum turgidum L. durum (Desf.)).

    PubMed

    Sabiel, Salih A I; Huang, Sisi; Hu, Xin; Ren, Xifeng; Fu, Chunjie; Peng, Junhua; Sun, Dongfa

    2017-03-01

    In the present study, 150 accessions of durum wheat germplasm of worldwide origin (Triticum turgidum spp. durum) were observed for major seedling traits and their growth. The accessions were evaluated for major seedling traits under controlled hydroponic conditions at the 13th, 20th, 27th and 34th day after germination. Biomass traits were measured at the 34th day after germination. Correlation analysis was conducted among the seedling traits and three field traits at maturity (plant height, grain weight and 1000-grain weight) observed in four consecutive years. Associations between the measured seedling traits and SNP markers were analyzed based on the mixed linear model (MLM). The results indicated highly significant genetic variation and robust heritability for the seedling and mature field traits. In total, 259 significant associations were detected across all the traits and four growth stages. The phenotypic variation explained (R²) by a single SNP marker was higher than 10% for most (84%) of the significant SNP markers. Forty-six SNP markers were associated with multiple traits, indicating non-negligible pleiotropy at the seedling stage. The associated SNP markers could be helpful for genetic analysis of seedling traits and for marker-assisted breeding of new wheat varieties with strong seedling vigor.

  4. Genome-wide association study of acute post-surgical pain in humans

    PubMed Central

    Kim, Hyungsuk; Ramsay, Edward; Lee, Hyewon; Wahl, Sharon; Dionne, Raymond A

    2009-01-01

    Aims Testing a relatively small genomic region with a few hundred SNPs provides limited information. Genome-wide association studies (GWAS) provide an opportunity to overcome the limitation of candidate gene association studies. Here, we report the results of a GWAS for the responses to an NSAID analgesic. Materials & methods European Americans (60 females and 52 males) undergoing oral surgery were genotyped with the Affymetrix 500K SNP assay. Additional SNP genotyping was performed from the gene in linkage disequilibrium with the candidate SNP revealed by the GWAS. Results GWAS revealed a candidate SNP (rs2562456) associated with analgesic onset, which is in linkage disequilibrium with a gene encoding a zinc finger protein. Additional SNP genotyping of ZNF429 confirmed the association with analgesic onset in humans (p = 1.8 × 10^-10, degrees of freedom = 103, F = 28.3). We also found candidate loci for the maximum post-operative pain rating (rs17122021, p = 6.9 × 10^-7) and post-operative pain onset time (rs6693882, p = 2.1 × 10^-6); however, correcting for multiple comparisons did not sustain these genetic associations. Conclusion GWAS for acute clinical pain followed by additional SNP genotyping of a neighboring gene suggests that genetic variations in or near the loci encoding DNA binding proteins play a role in the individual variations in responses to analgesic drugs. PMID:19207018
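To see why p-values of this size can fail a multiple-comparison correction, the following Python sketch applies a plain Bonferroni threshold. The abstract does not say which correction the authors used or how many SNPs passed quality control, so both the choice of Bonferroni and the test count of 500,000 (the nominal size of the array) are assumptions made for illustration.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold that keeps the family-wise
    error rate at or below alpha across n_tests tests."""
    return alpha / n_tests


ALPHA = 0.05
N_TESTS = 500_000  # assumption: nominal array size, not the post-QC SNP count

# p-values as reported in the abstract above
reported = {
    "rs17122021 (maximum post-operative pain rating)": 6.9e-7,
    "rs6693882 (post-operative pain onset time)": 2.1e-6,
}

threshold = bonferroni_threshold(ALPHA, N_TESTS)
survives = {name: p < threshold for name, p in reported.items()}

for name, ok in survives.items():
    print(f"{name}: {'passes' if ok else 'fails'} at threshold {threshold:.1e}")
```

Under these assumptions neither locus passes, which agrees with the abstract's statement that the associations were not sustained after correction.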

  5. Fine-mapping additive and dominant SNP effects using group-LASSO and Fractional Resample Model Averaging

    PubMed Central

    Sabourin, Jeremy; Nobel, Andrew B.; Valdar, William

    2014-01-01

    Genomewide association studies sometimes identify loci at which both the number and identities of the underlying causal variants are ambiguous. In such cases, statistical methods that model effects of multiple SNPs simultaneously can help disentangle the observed patterns of association and provide information about how those SNPs could be prioritized for follow-up studies. Current multi-SNP methods, however, tend to assume that SNP effects are well captured by additive genetics; yet when genetic dominance is present, this assumption translates to reduced power and faulty prioritizations. We describe a statistical procedure for prioritizing SNPs at GWAS loci that efficiently models both additive and dominance effects. Our method, LLARRMA-dawg, combines a group LASSO procedure for sparse modeling of multiple SNP effects with a resampling procedure based on fractional observation weights; it estimates for each SNP the robustness of association with the phenotype both to sampling variation and to competing explanations from other SNPs. In producing a SNP prioritization that best identifies underlying true signals, we show that: our method easily outperforms a single marker analysis; when additive-only signals are present, our joint model for additive and dominance is equivalent to or only slightly less powerful than modeling additive-only effects; and, when dominance signals are present, even in combination with substantial additive effects, our joint model is unequivocally more powerful than a model assuming additivity. We also describe how performance can be improved through calibrated randomized penalization, and discuss how dominance in ungenotyped SNPs can be incorporated through either heterozygote dosage or multiple imputation. PMID:25417853
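One common way to let a model capture both additive and dominance effects is to give each SNP two predictor columns, which a group penalty can then select or drop together. The Python sketch below shows such an encoding. It is an illustration under stated assumptions: the abstract does not spell out the parameterization used by LLARRMA-dawg, and the function names here are hypothetical.

```python
def encode_additive_dominance(minor_allele_count: int) -> tuple:
    """Map a genotype (0, 1 or 2 copies of the minor allele) to a pair
    (additive, dominance). The dominance column flags heterozygotes."""
    if minor_allele_count not in (0, 1, 2):
        raise ValueError("genotype must be 0, 1 or 2 minor-allele copies")
    additive = minor_allele_count
    dominance = 1 if minor_allele_count == 1 else 0
    return additive, dominance


def design_matrix(genotypes):
    """genotypes: one list of minor-allele counts per individual.
    Returns rows with two adjacent columns per SNP, so that columns
    (2*j, 2*j + 1) form the group belonging to SNP j."""
    return [
        [value for g in row for value in encode_additive_dominance(g)]
        for row in genotypes
    ]


# Two individuals typed at two SNPs (made-up data).
X = design_matrix([[0, 1], [2, 1]])
groups = [(2 * j, 2 * j + 1) for j in range(2)]
```

Keeping the two columns of a SNP adjacent makes the grouping explicit; a group LASSO solver would receive `groups` alongside `X` so that each SNP enters or leaves the model as a unit.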

  6. Software for optimization of SNP and PCR-RFLP genotyping to discriminate many genomes with the fewest assays

    PubMed Central

    Gardner, Shea N; Wagner, Mark C

    2005-01-01

    Background Microbial forensics is important in tracking the source of a pathogen, whether the disease is a naturally occurring outbreak or part of a criminal investigation. Results A method and SPR Opt (SNP and PCR-RFLP Optimization) software to perform a comprehensive, whole-genome analysis to forensically discriminate multiple sequences is presented. Tools for the optimization of forensic typing using Single Nucleotide Polymorphism (SNP) and PCR-Restriction Fragment Length Polymorphism (PCR-RFLP) analyses across multiple isolate sequences of a species are described. The PCR-RFLP analysis includes prediction and selection of optimal primers and restriction enzymes to enable maximum isolate discrimination based on sequence information. SPR Opt calculates all SNP or PCR-RFLP variations present in the sequences, groups them into haplotypes according to their co-segregation across those sequences, and performs combinatoric analyses to determine which sets of haplotypes provide maximal discrimination among all the input sequences. Those set combinations requiring that membership in the fewest haplotypes be queried (i.e. the fewest assays be performed) are found. These analyses highlight variable regions based on existing sequence data. These markers may be heterogeneous among unsequenced isolates as well, and thus may be useful for characterizing the relationships among unsequenced as well as sequenced isolates. The predictions are multi-locus. Analyses of mumps and SARS viruses are summarized. Phylogenetic trees created based on SNPs, PCR-RFLPs, and full genomes are compared for SARS virus, illustrating that purported phylogenies based only on SNP or PCR-RFLP variations do not match those based on multiple sequence alignment of the full genomes. Conclusion This is the first software to optimize the selection of forensic markers to maximize information gained from the fewest assays, accepting whole or partial genome sequence data as input. 
As more sequence data becomes available for multiple strains and isolates of a species, automated, computational approaches such as those described here will be essential to make sense of large amounts of information, and to guide and optimize efforts in the laboratory. The software and source code for SPR Opt are publicly available and free for non-profit use at . PMID:15904493
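The goal of discriminating all input sequences with the fewest assays can be sketched as a marker-selection problem. The Python example below uses a simple greedy heuristic, which is not the combinatoric analysis performed by SPR Opt and is not guaranteed to find the smallest set; the function name and the toy allele profiles are hypothetical.

```python
from itertools import combinations


def greedy_marker_selection(profiles: dict) -> list:
    """profiles: isolate name -> string with one allele character per marker
    (all strings the same length). Greedily picks marker indices until every
    pair of isolates with different profiles is separated by a chosen marker."""
    names = sorted(profiles)
    n_markers = len(profiles[names[0]])
    # Pairs with identical profiles can never be separated, so skip them.
    unresolved = {
        (a, b) for a, b in combinations(names, 2) if profiles[a] != profiles[b]
    }
    chosen = []
    while unresolved:
        def separated(marker):
            return {
                (a, b) for a, b in unresolved
                if profiles[a][marker] != profiles[b][marker]
            }
        # Pick the marker that separates the most still-unresolved pairs.
        best = max(range(n_markers), key=lambda m: len(separated(m)))
        chosen.append(best)
        unresolved -= separated(best)
    return chosen


# Made-up profiles: markers 0 and 2 co-segregate, so one of them is redundant.
toy = {"iso1": "AAC", "iso2": "AGC", "iso3": "GAT", "iso4": "GGT"}
selected = greedy_marker_selection(toy)
```

In the toy data, markers 0 and 2 split the isolates identically, which corresponds to the abstract's notion of variations grouped into one haplotype by co-segregation; only one of them needs to be assayed.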

  7. LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.

    PubMed

    Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej

    2005-06-15

    The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. AVAILABILITY: http://www.salilab.org/LS-SNP CONTACT: rachelk@salilab.org SUPPLEMENTARY INFORMATION: http://salilab.org/LS-SNP/supp-info.pdf.

  8. Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars.

    PubMed

    Cavanagh, Colin R; Chao, Shiaoman; Wang, Shichen; Huang, Bevan Emma; Stephen, Stuart; Kiani, Seifollah; Forrest, Kerrie; Saintenac, Cyrille; Brown-Guedira, Gina L; Akhunova, Alina; See, Deven; Bai, Guihua; Pumphrey, Michael; Tomar, Luxmi; Wong, Debbie; Kong, Stephan; Reynolds, Matthew; da Silva, Marta Lopez; Bockelman, Harold; Talbert, Luther; Anderson, James A; Dreisigacker, Susanne; Baenziger, Stephen; Carter, Arron; Korzun, Viktor; Morrell, Peter Laurent; Dubcovsky, Jorge; Morell, Matthew K; Sorrells, Mark E; Hayden, Matthew J; Akhunov, Eduard

    2013-05-14

    Domesticated crops experience strong human-mediated selection aimed at developing high-yielding varieties adapted to local conditions. To detect regions of the wheat genome subject to selection during improvement, we developed a high-throughput array to interrogate 9,000 gene-associated single-nucleotide polymorphisms (SNP) in a worldwide sample of 2,994 accessions of hexaploid wheat including landraces and modern cultivars. Using a SNP-based diversity map we characterized the impact of crop improvement on genomic and geographic patterns of genetic diversity. We found evidence of a small population bottleneck and extensive use of ancestral variation often traceable to founders of cultivars from diverse geographic regions. Analyzing genetic differentiation among populations and the extent of haplotype sharing, we identified allelic variants subjected to selection during improvement. Selective sweeps were found around genes involved in the regulation of flowering time and phenology. An introgression of a wild relative-derived gene conferring resistance to a fungal pathogen was detected by haplotype-based analysis. Comparing selective sweeps identified in different populations, we show that selection likely acts on distinct targets or multiple functionally equivalent alleles in different portions of the geographic range of wheat. The majority of the selected alleles were present at low frequency in local populations, suggesting either weak selection pressure or temporal variation in the targets of directional selection during breeding probably associated with changing agricultural practices or environmental conditions. The developed SNP chip and map of genetic variation provide a resource for advancing wheat breeding and supporting future population genomic and genome-wide association studies in wheat.

  9. Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars

    PubMed Central

    Cavanagh, Colin R.; Chao, Shiaoman; Wang, Shichen; Huang, Bevan Emma; Stephen, Stuart; Kiani, Seifollah; Forrest, Kerrie; Saintenac, Cyrille; Brown-Guedira, Gina L.; Akhunova, Alina; See, Deven; Bai, Guihua; Pumphrey, Michael; Tomar, Luxmi; Wong, Debbie; Kong, Stephan; Reynolds, Matthew; da Silva, Marta Lopez; Bockelman, Harold; Talbert, Luther; Anderson, James A.; Dreisigacker, Susanne; Baenziger, Stephen; Carter, Arron; Korzun, Viktor; Morrell, Peter Laurent; Dubcovsky, Jorge; Morell, Matthew K.; Sorrells, Mark E.; Hayden, Matthew J.; Akhunov, Eduard

    2013-01-01

    Domesticated crops experience strong human-mediated selection aimed at developing high-yielding varieties adapted to local conditions. To detect regions of the wheat genome subject to selection during improvement, we developed a high-throughput array to interrogate 9,000 gene-associated single-nucleotide polymorphisms (SNP) in a worldwide sample of 2,994 accessions of hexaploid wheat including landraces and modern cultivars. Using a SNP-based diversity map we characterized the impact of crop improvement on genomic and geographic patterns of genetic diversity. We found evidence of a small population bottleneck and extensive use of ancestral variation often traceable to founders of cultivars from diverse geographic regions. Analyzing genetic differentiation among populations and the extent of haplotype sharing, we identified allelic variants subjected to selection during improvement. Selective sweeps were found around genes involved in the regulation of flowering time and phenology. An introgression of a wild relative-derived gene conferring resistance to a fungal pathogen was detected by haplotype-based analysis. Comparing selective sweeps identified in different populations, we show that selection likely acts on distinct targets or multiple functionally equivalent alleles in different portions of the geographic range of wheat. The majority of the selected alleles were present at low frequency in local populations, suggesting either weak selection pressure or temporal variation in the targets of directional selection during breeding probably associated with changing agricultural practices or environmental conditions. The developed SNP chip and map of genetic variation provide a resource for advancing wheat breeding and supporting future population genomic and genome-wide association studies in wheat. PMID:23630259

  10. The G72/G30 gene complex and cognitive abnormalities in schizophrenia.

    PubMed

    Goldberg, Terry E; Straub, Richard E; Callicott, Joseph H; Hariri, Ahmad; Mattay, Venkata S; Bigelow, Llewellyn; Coppola, Richard; Egan, Michael F; Weinberger, Daniel R

    2006-09-01

    A recently discovered gene complex, G72/G30 (hereafter G72, but now termed DAOA), was found to be associated with schizophrenia and with bipolar disorder, possibly because of an indirect effect on NMDA neurotransmission. In principle, if G72 increases risk for psychosis by this mechanism, it might impact with greater penetrance those cortically based cognitive and neurophysiological functions associated with NMDA signaling. We performed two independent family-based association studies (one sample contained more than 200 families and the other more than 65) of multiple SNPs in the G72 region and of multiple SNPs in the gene for D-amino acid oxidase (DAAO), which may be modulated by G72. We examined the relationship between select cognitive measures in attention, working memory, and episodic memory and a restricted set of G72 SNPs in over 600 normal controls, schizophrenic patients, and their nonpsychotic siblings using mixed model ANOVAs. We also determined genotype effects on neurophysiology measures in normal controls using the fMRI BOLD response obtained during activation procedures involving either episodic memory or working memory. There were no significant associations between single G72 SNPs and clinical diagnosis in either sample, though one approached significance (p=0.06). Diagnosis by genotype interaction effects for G72 SNP 10 were significant for cognitive variables assessing working memory and attention (p=0.05), and at the trend level for episodic memory, such that in the schizophrenia group an exaggerated allele load effect in the predicted directions was observed. In the fMRI paradigms, a strong effect of G72 SNP 10 genotype was observed on BOLD activation in the hippocampus during the episodic memory paradigm. Tests of association with DAAO were consistently nonsignificant. We present evidence that SNP variations in the G72 gene region increase risk of cognitive impairment in schizophrenia. SNP variations were not strongly associated with clinical diagnosis in family-based analyses.

  11. Genetic variation in GABRB3 is associated with Asperger syndrome and multiple endophenotypes relevant to autism

    PubMed Central

    2013-01-01

    Background Autism spectrum conditions (ASC) are associated with deficits in social interaction and communication, alongside repetitive, restricted, and stereotyped behavior. ASC is highly heritable. The gamma-aminobutyric acid (GABA)-ergic system has been associated consistently with atypicalities in autism, in both genetic association and expression studies. A key component of the GABA-ergic system is encoded by the GABRB3 gene, which has been previously implicated both in ASC and in individual differences in empathy. Methods In this study, 45 genotyped single nucleotide polymorphisms (SNPs) within GABRB3 were tested for association with Asperger syndrome (AS), and related quantitative traits measured through the following tests: the Empathy Quotient (EQ), the Autism Spectrum Quotient (AQ), the Systemizing Quotient-Revised (SQ-R), the Embedded Figures Test (EFT), the Reading the Mind in the Eyes Test (RMET), and the Mental Rotation Test (MRT). Two-loci, three-loci, four-loci haplotype analyses, and one seven-loci haplotype analysis were also performed in the AS case–control sample. Results Three SNPs (rs7180158, rs7165604, rs12593579) were significantly associated with AS, and two SNPs (rs9806546, rs11636966) were significantly associated with EQ. Two SNP-SNP pairs, rs12438141-rs1035751 and rs12438141-rs7179514, showed significant association with variation in the EFT scores. One SNP-SNP pair, rs7174437-rs1863455, was significantly associated with variation in the MRT scores. Additionally, a few haplotypes, including a 19 kb genomic region that formed a linkage disequilibrium (LD) block in our sample and contained several nominally significant SNPs, were found to be significantly associated with AS. Conclusion The current study confirms the role of GABRB3 as an important candidate gene in both ASC and normative variation in related endophenotypes. PMID:24321478

  12. BioVLAB-mCpG-SNP-EXPRESS: A system for multi-level and multi-perspective analysis and exploration of DNA methylation, sequence variation (SNPs), and gene expression from multi-omics data.

    PubMed

    Chae, Heejoon; Lee, Sangseon; Seo, Seokjun; Jung, Daekyoung; Chang, Hyeonsook; Nephew, Kenneth P; Kim, Sun

    2016-12-01

    Measuring gene expression, DNA sequence variation, and DNA methylation status is routinely done using high throughput sequencing technologies. To analyze such multi-omics data and explore relationships, reliable bioinformatics systems are much needed. Existing systems are either for exploring curated data or for processing omics data in the form of a library such as R. Thus scientists have much difficulty in investigating relationships among gene expression, DNA sequence variation, and DNA methylation using multi-omics data. In this study, we report a system called BioVLAB-mCpG-SNP-EXPRESS for the integrated analysis of DNA methylation, sequence variation (SNPs), and gene expression for distinguishing cellular phenotypes at the pairwise and multiple phenotype levels. The system can be deployed on either the Amazon cloud or a publicly available high-performance computing node, and the data analysis and exploration of the analysis result can be conveniently done using a web-based interface. In order to alleviate analysis complexity, all the processes are fully automated, and a graphical workflow system is integrated to represent real-time analysis progression. The BioVLAB-mCpG-SNP-EXPRESS system works in three stages. First, it processes and analyzes multi-omics data as input in the form of the raw data, i.e., FastQ files. Second, various integrated analyses such as methylation vs. gene expression and mutation vs. methylation are performed. Finally, the analysis result can be explored in a number of ways through a web interface for the multi-level, multi-perspective exploration. Multi-level interpretation can be done at the gene, gene set, pathway or network level, and multi-perspective exploration can be done from the gene expression, DNA methylation, sequence variation, or relationship perspective. The utility of the system is demonstrated by performing analysis of a data set of 30 phenotypically distinct breast cancer cell lines. BioVLAB-mCpG-SNP-EXPRESS is available at http://biohealth.snu.ac.kr/software/biovlab_mcpg_snp_express/. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest.

    PubMed

    Pappas, D J; Lizee, A; Paunic, V; Beutner, K R; Motyer, A; Vukcevic, D; Leslie, S; Biesiada, J; Meller, J; Taylor, K D; Zheng, X; Zhao, L P; Gourraud, P-A; Hollenbach, J A; Mack, S J; Maiers, M

    2018-05-22

    Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and -DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes.

  14. Single nucleotide polymorphism (SNP) discovery in duplicated genomes: intron-primed exon-crossing (IPEC) as a strategy for avoiding amplification of duplicated loci in Atlantic salmon (Salmo salar) and other salmonid fishes

    PubMed Central

    Ryynänen, Heikki J; Primmer, Craig R

    2006-01-01

    Background Single nucleotide polymorphisms (SNPs) represent the most abundant type of DNA variation in the vertebrate genome, and their applications as genetic markers in numerous studies of molecular ecology and conservation of natural populations are emerging. Recent large-scale sequencing projects in several fish species have provided a vast amount of data in public databases, which can be utilized in novel SNP discovery in salmonids. However, the suggested duplicated nature of the salmonid genome may hamper SNP characterization if the primers designed in conserved gene regions amplify multiple loci. Results Here we introduce a new intron-primed exon-crossing (IPEC) method in an attempt to overcome this duplication problem, and also evaluate different priming methods for SNP discovery in Atlantic salmon (Salmo salar) and other salmonids. A total of 69 loci with differing priming strategies were screened in S. salar, and 27 of these produced ~13 kb of high-quality sequence data consisting of 19 SNPs or indels (one per 680 bp). The SNP frequency and the overall nucleotide diversity (3.99 × 10^-4) in S. salar were lower than reported in a majority of other organisms, which may suggest a relatively young population history for Atlantic salmon. A subset of primers used in cross-species analyses revealed considerable variation in the SNP frequencies and nucleotide diversities in other salmonids. Conclusion Sequencing success was significantly higher with the new IPEC primers; thus the total number of loci to screen in order to identify one potential polymorphic site was six times lower with this new strategy. Given that duplication may hamper SNP discovery in some species, the IPEC method reported here is an alternative way of identifying novel polymorphisms in such cases. PMID:16872523

  15. Genome-wide association study identifies SNPs in the MHC class II loci that are associated with self-reported history of whooping cough

    PubMed Central

    McMahon, George; Ring, Susan M.; Davey-Smith, George; Timpson, Nicholas J.

    2015-01-01

    Whooping cough is currently seeing a resurgence in countries despite high vaccine coverage. There is considerable variation in subject-specific response to infection and vaccine efficacy, but little is known about the role of human genetics. We carried out a case–control genome-wide association study of adult or parent-reported history of whooping cough in two cohorts from the UK: the ALSPAC cohort and the 1958 British Birth Cohort (815/758 cases and 6341/4308 controls, respectively). We also imputed HLA alleles using dense SNP data in the MHC region and carried out gene-based and gene-set tests of association and estimated the amount of additive genetic variation explained by common SNPs. We observed a novel association at SNPs in the MHC class II region in both cohorts [lead SNP rs9271768 after meta-analysis, odds ratio [95% confidence intervals (CIs)] 1.47 (1.35, 1.6), P-value 1.21E-18]. Multiple strong associations were also observed at alleles at the HLA class II loci. The majority of these associations were explained by the lead SNP rs9271768. Gene-based and gene-set tests and estimates of explainable common genetic variation could not establish the presence of additional associations in our sample. Genetic variation at the MHC class II region plays a role in susceptibility to whooping cough. These findings provide additional perspective on mechanisms of whooping cough infection and vaccine efficacy. PMID:26231221
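The reported odds ratio, confidence interval and P-value for the lead SNP can be related to one another with a standard Wald approximation on the log-odds scale. The Python sketch below back-calculates a standard error from the interval; this is a consistency check under the assumption of a symmetric 95% interval on the log scale, not the authors' meta-analysis procedure, and the function name is hypothetical.

```python
import math


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover the standard error of log(OR) from a confidence interval,
    then compute the Wald z statistic and two-sided normal p-value."""
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided tail of the standard normal
    return se, z, p


# Values as printed in the abstract above (already rounded).
se, z, p = wald_from_or_ci(1.47, 1.35, 1.6)
print(f"SE(log OR) = {se:.4f}, z = {z:.2f}, p = {p:.2e}")
```

Because the inputs are rounded to two or three significant figures, the recomputed p-value only approximates the reported one, but it lands within about an order of magnitude of it.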

  16. Genome-wide association analyses for carcass quality in crossbred beef cattle

    PubMed Central

    2013-01-01

    Background Genetic improvement of beef quality will benefit both producers and consumers, and can be achieved by selecting animals that carry desired quantitative trait nucleotides (QTN), which result from intensive searches using genetic markers. This paper presents a genome-wide association approach utilizing single nucleotide polymorphisms (SNP) in the Illumina BovineSNP50 BeadChip to seek genomic regions that potentially harbor genes or QTN underlying variation in carcass quality of beef cattle. This study used 747 genotyped animals, mainly crossbred, with phenotypes on twelve carcass quality traits, including hot carcass weight (HCW), back fat thickness (BF), Longissimus dorsi muscle area or ribeye area (REA), marbling scores (MRB), lean yield grade by Beef Improvement Federation formulae (BIFYLD), steak tenderness by Warner-Bratzler shear force 7-day post-mortem (LM7D) as well as body composition as determined by partial rib (IMPS 103) dissection presented as a percentage of total rib weight including body cavity fat (BDFR), lean (LNR), bone (BNR), intermuscular fat (INFR), subcutaneous fat (SQFR), and total fat (TLFR). Results At the genome wide level false discovery rate (FDR < 10%), eight SNP were found significantly associated with HCW. Seven of these SNP were located on Bos taurus autosome (BTA) 6. At a less stringent significance level (P < 0.001), 520 SNP were found significantly associated with mostly individual traits (473 SNP), and multiple traits (47 SNP). Of these significant SNP, 48 were located on BTA6, and 22 of them were in association with hot carcass weight. There were 53 SNP associated with percentage of rib bone, and 12 of them were on BTA20. The rest of the significant SNP were scattered over other chromosomes. They accounted for 1.90 - 5.89% of the phenotypic variance of the traits. A region of approximately 4 Mbp long on BTA6 was found to be a potential area to harbor candidate genes influencing growth. 
One marker on BTA25 accounting for 2.67% of the variation in LM7D may be worth further investigation for the improvement of beef tenderness. Conclusion This study provides useful information to further assist the identification of chromosome regions and subsequently genes affecting carcass quality traits in beef cattle. It also revealed many SNP that acted pleiotropically to affect carcass quality. This knowledge is important in selecting subsets of SNP to improve the performance of beef cattle. PMID:24024930
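
    The genome-wide FDR threshold (FDR < 10%) used in this record can be illustrated with the Benjamini-Hochberg step-up procedure. The abstract does not say which FDR procedure was applied, so this is only a minimal sketch under that assumption; the function name `benjamini_hochberg` and the toy p-values are hypothetical.

```python
def benjamini_hochberg(pvalues, fdr=0.10):
    """Return the original indices of tests declared significant at the given FDR."""
    m = len(pvalues)
    # Sort the tests by p-value while remembering their original positions.
    order = sorted(range(m), key=lambda i: pvalues[i])
    # Find the largest rank k such that p_(k) <= (k / m) * fdr.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= rank / m * fdr:
            cutoff_rank = rank
    # Every test ranked at or below the cutoff is rejected.
    return sorted(order[:cutoff_rank])


# Toy p-values for eight hypothetical SNP tests (not data from the study).
toy_pvalues = [0.0004, 0.20, 0.012, 0.65, 0.03, 0.9, 0.002, 0.45]
significant = benjamini_hochberg(toy_pvalues, fdr=0.10)
print(significant)
```

    Unlike a fixed per-test cutoff such as P < 0.001, the step-up rule adapts its threshold to the number of tests and to how many small p-values are observed.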

  17. Fast and Accurate Approximation to Significance Tests in Genome-Wide Association Studies

    PubMed Central

    Zhang, Yu; Liu, Jun S.

    2011-01-01

    Genome-wide association studies commonly involve simultaneous tests of millions of single nucleotide polymorphisms (SNP) for disease association. The SNPs in nearby genomic regions, however, are often highly correlated due to linkage disequilibrium (LD, a genetic term for correlation). Simple Bonferroni correction for multiple comparisons is therefore too conservative. Permutation tests, which are often employed in practice, are both computationally expensive for genome-wide studies and limited in scope. We present an accurate and computationally efficient method, based on Poisson de-clumping heuristics, for approximating genome-wide significance of SNP associations. Compared with permutation tests and other multiple comparison adjustment approaches, our method computes the most accurate and robust p-value adjustments for millions of correlated comparisons within seconds. We demonstrate analytically that the accuracy and the efficiency of our method are nearly independent of the sample size, the number of SNPs, and the scale of p-values to be adjusted. In addition, our method can be easily adapted to estimate false discovery rate. When applied to genome-wide SNP datasets, we observed highly variable p-value adjustment results evaluated from different genomic regions. The variation in adjustments along the genome, however, is well conserved between the European and the African populations. The p-value adjustments are significantly correlated with LD among SNPs, recombination rates, and SNP densities. Given the large variability of sequence features in the genome, we further discuss a novel approach of using SNP-specific (local) thresholds to detect genome-wide significant associations. This article has supplementary material online. PMID:22140288
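
    To make concrete why Bonferroni correction becomes conservative under LD, the sketch below compares per-test thresholds for one million nominal tests against a smaller number of effectively independent tests. It is not the Poisson de-clumping method of the paper; the value of `effective_tests` is a hypothetical placeholder chosen only for illustration.

```python
import math


def bonferroni_threshold(alpha, n_tests):
    """Per-test threshold that bounds the family-wise error rate by alpha."""
    return alpha / n_tests


def sidak_threshold(alpha, n_tests):
    """Exact per-test threshold for n_tests independent tests.

    Computes 1 - (1 - alpha) ** (1 / n_tests) in a numerically stable way.
    """
    return -math.expm1(math.log1p(-alpha) / n_tests)


alpha = 0.05
nominal_tests = 1_000_000    # SNPs actually tested
effective_tests = 400_000    # hypothetical count of independent tests under LD

strict = bonferroni_threshold(alpha, nominal_tests)
relaxed = bonferroni_threshold(alpha, effective_tests)
sidak = sidak_threshold(alpha, nominal_tests)
print(strict, sidak, relaxed)
```

    Because correlated SNPs behave like fewer independent tests, dividing by the nominal count yields a smaller threshold than is needed to control the family-wise error rate.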

  18. VCS: Tool for Visualizing Copy Number Variation and Single Nucleotide Polymorphism.

    PubMed

    Kim, HyoYoung; Sung, Samsun; Cho, Seoae; Kim, Tae-Hun; Seo, Kangseok; Kim, Heebal

    2014-12-01

    Copy number variations (CNVs) and single nucleotide polymorphisms (SNPs) are useful genetic resources for understanding complex phenotypes and disease susceptibility. Although thousands of CNVs and SNPs are currently available in public databases, they are difficult to use for analyses without visualization tools. We developed a web-based tool called VCS (visualization of CNV or SNP) to visualize detected CNVs and SNPs. The VCS tool helps users interpret the biological meaning of the numerical values of CNVs and SNPs. VCS provides six visualization tools: i) the enrichment of genome contents in CNV; ii) the physical distribution of CNV or SNP on chromosomes; iii) the distribution of log2 ratio of CNVs under criteria of interest; iv) the number of CNV or SNP per binning unit; v) the distribution of homozygosity of SNP genotype; and vi) cytomap of genes within CNV or SNP region.

  19. Complex nature of SNP genotype effects on gene expression in primary human leucocytes.

    PubMed

    Heap, Graham A; Trynka, Gosia; Jansen, Ritsert C; Bruinenberg, Marcel; Swertz, Morris A; Dinesen, Lotte C; Hunt, Karen A; Wijmenga, Cisca; Vanheel, David A; Franke, Lude

    2009-01-07

    Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown. We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease - a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects. In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (< 250 kb from SNP, at FDR = 0.05, cis expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected. In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.

  20. Genome-wide association study identifies SNPs in the MHC class II loci that are associated with self-reported history of whooping cough.

    PubMed

    McMahon, George; Ring, Susan M; Davey-Smith, George; Timpson, Nicholas J

    2015-10-15

    Whooping cough is currently seeing resurgence in countries despite high vaccine coverage. There is considerable variation in subject-specific response to infection and vaccine efficacy, but little is known about the role of human genetics. We carried out a case-control genome-wide association study of adult or parent-reported history of whooping cough in two cohorts from the UK: the ALSPAC cohort and the 1958 British Birth Cohort (815/758 cases and 6341/4308 controls, respectively). We also imputed HLA alleles using dense SNP data in the MHC region and carried out gene-based and gene-set tests of association and estimated the amount of additive genetic variation explained by common SNPs. We observed a novel association at SNPs in the MHC class II region in both cohorts [lead SNP rs9271768 after meta-analysis, odds ratio (95% confidence interval, CI) 1.47 (1.35, 1.6), P-value 1.21E-18]. Multiple strong associations were also observed at alleles at the HLA class II loci. The majority of these associations were explained by the lead SNP rs9271768. Gene-based and gene-set tests and estimates of explainable common genetic variation could not establish the presence of additional associations in our sample. Genetic variation at the MHC class II region plays a role in susceptibility to whooping cough. These findings provide additional perspective on mechanisms of whooping cough infection and vaccine efficacy. © The Author 2015. Published by Oxford University Press.
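
    The reported odds ratio, confidence interval, and P-value for rs9271768 can be checked against each other. The sketch below recovers the standard error of the log odds ratio from the width of the 95% CI and converts the resulting Wald z statistic into a two-sided normal p-value. It assumes a symmetric Wald interval on the log scale, which the abstract does not state, and because the published bounds are rounded the result can only be expected to agree with the reported P-value in rough magnitude.

```python
import math


def wald_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover SE, z statistic, and two-sided p-value from an OR and its 95% CI."""
    # On the log scale the CI is log(OR) +/- z_crit * SE, so its width gives SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2.0 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided p-value under a standard normal: erfc(|z| / sqrt(2)).
    p = math.erfc(abs(z) / math.sqrt(2.0))
    return se, z, p


# Values taken from the abstract: OR 1.47, 95% CI (1.35, 1.6).
se, z, p = wald_from_or_ci(1.47, 1.35, 1.6)
print(f"SE(log OR) = {se:.4f}, z = {z:.2f}, p = {p:.2e}")
```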

  1. Breeding and Genetics Symposium: networks and pathways to guide genomic selection.

    PubMed

    Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A

    2013-02-01

    Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. 
As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.

  2. Oxytocin receptor gene variations predict neural and behavioral response to oxytocin in autism

    PubMed Central

    Watanabe, Takamitsu; Otowa, Takeshi; Abe, Osamu; Kuwabara, Hitoshi; Aoki, Yuta; Natsubori, Tatsunobu; Takao, Hidemasa; Kakiuchi, Chihiro; Kondo, Kenji; Ikeda, Masashi; Iwata, Nakao; Kasai, Kiyoto; Sasaki, Tsukasa

    2017-01-01

    Abstract Oxytocin appears beneficial for autism spectrum disorder (ASD), and more than 20 single-nucleotide polymorphisms (SNPs) in oxytocin receptor (OXTR) are relevant to ASD. However, neither the biological functions of OXTR SNPs in ASD nor the critical OXTR SNPs that determine oxytocin’s effects on ASD are known. Here, using a machine-learning algorithm that was designed to evaluate collective effects of multiple SNPs and automatically identify the most informative SNPs, we examined relationships between 27 representative OXTR SNPs and six types of behavioral/neural response to oxytocin in ASD individuals. The oxytocin effects were extracted from our previous placebo-controlled within-participant clinical trial administering single-dose intranasal oxytocin to 38 high-functioning adult Japanese ASD males. Consequently, we identified six different SNP sets that could accurately predict the six different oxytocin efficacies, and confirmed the robustness of these SNP selections against variations of the datasets and analysis parameters. Moreover, major alleles of several prominent OXTR SNPs, including rs53576 and rs2254298, were found to have dissociable effects on the oxytocin efficacies. These findings suggest biological functions of the OXTR SNP variants on autistic oxytocin responses, and imply that clinical oxytocin efficacy may be genetically predicted before its actual administration, which would contribute to establishment of future precision medicine for ASD. PMID:27798253

  3. Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649).

    PubMed

    Knappskog, Stian; Gansmo, Liv B; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D; Lin, Dongxin; Van Camp, Guy; Manolopoulos, Vangelis G; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E

    2014-09-30

    The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast, ovarian and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10,000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle-East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 - 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations, suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk.

  4. Association of Scavenger Receptor Class B Type I Polymorphisms with Subclinical Atherosclerosis: The Multi-Ethnic Study of Atherosclerosis

    PubMed Central

    Naj, Adam C.; West, Michael; Rich, Stephen S.; Post, Wendy; Kao, W.H. Linda; Wasserman, Bruce A.; Herrington, David M.; Rodriguez, Annabelle

    2012-01-01

    Background Little is known regarding the association of scavenger receptor class B type I (SCARB1) single nucleotide polymorphisms (SNPs) and subclinical atherosclerosis (SCA), particularly in subjects of different racial/ethnic backgrounds. We examined this relationship in the Multi-Ethnic Study of Atherosclerosis (MESA). Methods and Results Forty-three SCARB1 tagging SNPs were genotyped. Baseline examinations included fasting lipids and SCA phenotypes (coronary artery calcium [CAC], and common and internal carotid artery thickness [CCIMT and ICIMT]). Examining SNP associations with different SCA phenotypes across multiple racial/ethnic groups with adjustment for multiple covariates, we found the C allele of SNP rs10846744 was associated with higher CCIMT in African American (P=0.03), Chinese (P=0.02), European American (P=0.05), and Hispanic participants (P=0.03), and was strongly associated in pooled analyses (P=0.0002). The results also showed that the association of this SNP with CCIMT was independent of lipids and other well-established cardiovascular risk factors. Stratifying by sex, there appeared to be a strong association of rs10846744 with CCIMT in females, but no genotype-sex interactions were observed. Conclusions Variation in SCARB1 at rs10846744 was significantly associated with CCIMT across racial/ethnic groups in MESA. PMID:20160195

  5. SNP discovery by high-throughput sequencing in soybean

    PubMed Central

    2010-01-01

    Background With the advance of new massively parallel genotyping technologies, quantitative trait loci (QTL) fine mapping and map-based cloning become more achievable in identifying genes for important and complex traits. Development of high-density genetic markers in the QTL regions of specific mapping populations is essential for fine-mapping and map-based cloning of economically important genes. Single nucleotide polymorphisms (SNPs) are the most abundant form of genetic variation existing between any diverse genotypes that are usually used for QTL mapping studies. The massively parallel sequencing technologies (Roche GS/454, Illumina GA/Solexa, and ABI/SOLiD) have been widely applied to identify genome-wide sequence variations. However, it still remains unclear whether sequence data at a low sequencing depth are enough to detect the variations existing in any QTL regions of interest in a crop genome, and how to prepare sequencing samples for a complex genome such as soybean. Therefore, with the aims of identifying SNP markers in a cost effective way for fine-mapping several QTL regions, and testing the validation rate of the putative SNPs predicted with Solexa short sequence reads at a low sequencing depth, we evaluated a pooled DNA fragment reduced representation library and SNP detection methods applied to short read sequences generated by Solexa high-throughput sequencing technology. Results A total of 39,022 putative SNPs were identified by the Illumina/Solexa sequencing system using a reduced representation DNA library of two parental lines of a mapping population. The validation rates of these putative SNPs predicted with low and high stringency were 72% and 85%, respectively. One hundred sixty four SNP markers resulted from the validation of putative SNPs and have been selectively chosen to target a known QTL, thereby increasing the marker density of the targeted region to one marker per 42 kbp. 
Conclusions We have demonstrated how to quickly identify large numbers of SNPs for fine mapping of QTL regions by applying massively parallel sequencing combined with genome complexity reduction techniques. This SNP discovery approach is more efficient for targeting multiple QTL regions in the same genetic population, and can be applied to other crops. PMID:20701770

  6. Changes in variance explained by top SNP windows over generations for three traits in broiler chicken.

    PubMed

    Fragomeni, Breno de Oliveira; Misztal, Ignacy; Lourenco, Daniela Lino; Aguilar, Ignacio; Okimoto, Ronald; Muir, William M

    2014-01-01

    The purpose of this study was to determine if the set of genomic regions inferred as accounting for the majority of genetic variation in quantitative traits remain stable over multiple generations of selection. The data set contained phenotypes for five generations of broiler chicken for body weight, breast meat, and leg score. The population consisted of 294,632 animals over five generations and also included genotypes of 41,036 single nucleotide polymorphism (SNP) for 4,866 animals, after quality control. The SNP effects were calculated by a GWAS type analysis using single step genomic BLUP approach for generations 1-3, 2-4, 3-5, and 1-5. Variances were calculated for windows of 20 SNP. The top ten windows for each trait that explained the largest fraction of the genetic variance across generations were examined. Across generations, the top 10 windows explained more than 0.5% but less than 1% of the total variance. Also, the pattern of the windows was not consistent across generations. The windows that explained the greatest variance changed greatly among the combinations of generations, with a few exceptions. In many cases, a window identified as top for one combination, explained less than 0.1% for the other combinations. We conclude that identification of top SNP windows for a population may have little predictive power for genetic selection in the following generations for the traits here evaluated.
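
    The window-based summary described in this record can be sketched as follows. The abstract does not give the formula, so the code assumes one common convention: the variance across animals of the summed genotype-times-effect values within a window, divided by the variance of the same sum over all SNPs. The genotypes and SNP effects are simulated toy data, and the function name is hypothetical.

```python
import random
import statistics


def window_variance_fractions(genotypes, effects, window_size=20):
    """Fraction of total genomic variance attributed to each consecutive SNP window."""
    n_snps = len(effects)
    # Genomic value of each animal summed over all SNPs.
    total_values = [sum(g * a for g, a in zip(row, effects)) for row in genotypes]
    total_var = statistics.pvariance(total_values)
    fractions = []
    for start in range(0, n_snps, window_size):
        stop = min(start + window_size, n_snps)
        # Genomic value contributed by this window alone.
        window_values = [
            sum(row[j] * effects[j] for j in range(start, stop)) for row in genotypes
        ]
        fractions.append(statistics.pvariance(window_values) / total_var)
    return fractions


random.seed(1)
n_animals, n_snps = 200, 100
# Genotypes coded as 0, 1, or 2 copies of an allele; effects drawn at random.
genotypes = [[random.choice((0, 1, 2)) for _ in range(n_snps)] for _ in range(n_animals)]
effects = [random.gauss(0.0, 0.1) for _ in range(n_snps)]

fractions = window_variance_fractions(genotypes, effects)
top_windows = sorted(range(len(fractions)), key=lambda w: fractions[w], reverse=True)[:3]
print(top_windows, [round(fractions[w], 3) for w in top_windows])
```

    Under this convention the fractions need not sum to one, because covariance between windows is ignored. Repeating the calculation on SNP effects estimated from different generation subsets would show whether the same windows stay at the top.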

  7. Single-Nucleotide Polymorphisms Associated with Skin Naphthyl–Keratin Adduct Levels in Workers Exposed to Naphthalene

    PubMed Central

    Jiang, Rong; French, John E.; Stober, Vandy P.; Kang-Sickel, Juei-Chuan C.; Zou, Fei

    2012-01-01

    Background: Individual genetic variation that results in differences in systemic response to xenobiotic exposure is not accounted for as a predictor of outcome in current exposure assessment models. Objective: We developed a strategy to investigate individual differences in single-nucleotide polymorphisms (SNPs) as genetic markers associated with naphthyl–keratin adduct (NKA) levels measured in the skin of workers exposed to naphthalene. Methods: The SNP-association analysis was conducted in PLINK using candidate-gene analysis and genome-wide analysis. We identified significant SNP–NKA associations and investigated the potential impact of these SNPs along with personal and workplace factors on NKA levels using a multiple linear regression model and the Pratt index. Results: In candidate-gene analysis, a SNP (rs4852279) located near the CYP26B1 gene contributed to the 2-naphthyl–keratin adduct (2NKA) level. In the multiple linear regression model, the SNP rs4852279, dermal exposure, exposure time, task replacing foam, age, and ethnicity all were significant predictors of 2NKA level. In genome-wide analysis, no single SNP reached genome-wide significance for NKA levels (all p ≥ 1.05 × 10⁻⁵). In pathway and network analyses, SNPs associated with NKA levels were predicted to be involved in the regulation of cellular processes and homeostasis. Conclusions: These results provide evidence that a quantitative biomarker can be used as an intermediate phenotype when investigating the association between genetic markers and exposure–dose relationship in a small, well-characterized exposed worker population. PMID:22391508

  8. Joint Identification of Genetic Variants for Physical Activity in Korean Population

    PubMed Central

    Kim, Jayoun; Kim, Jaehee; Min, Haesook; Oh, Sohee; Kim, Yeonjung; Lee, Andy H.; Park, Taesung

    2014-01-01

    There has been limited research on genome-wide association with physical activity (PA). This study ascertained genetic associations between PA and 344,893 single nucleotide polymorphism (SNP) markers in 8842 Korean samples. PA data were obtained from a validated questionnaire that included information on PA intensity and duration. Metabolic equivalent of tasks were calculated to estimate the total daily PA level for each individual. In addition to single- and multiple-SNP association tests, a pathway enrichment analysis was performed to identify the biological significance of SNP markers. Although no significant SNP was found at genome-wide significance level via single-SNP association tests, 59 genetic variants mapped to 76 genes were identified via a multiple SNP approach using a bootstrap selection stability measure. Pathway analysis for these 59 variants showed that maturity onset diabetes of the young (MODY) was enriched. Joint identification of SNPs could enable the identification of multiple SNPs with good predictive power for PA and a pathway enriched for PA. PMID:25026172

  9. Variation in the X-Linked EFHC2 Gene Is Associated with Social Cognitive Abilities in Males

    PubMed Central

    Startin, Carla M.; Fiorentini, Chiara; de Haan, Michelle; Skuse, David H.

    2015-01-01

    Females outperform males on many social cognitive tasks. X-linked genes may contribute to this sex difference. Males possess one X chromosome, while females possess two X chromosomes. Functional variations in X-linked genes are therefore likely to impact more on males than females. Previous studies of X-monosomic women with Turner syndrome suggest a genetic association with facial fear recognition abilities at Xp11.3, specifically at a single nucleotide polymorphism (SNP rs7055196) within the EFHC2 gene. Based on a strong hypothesis, we investigated an association between variation at SNP rs7055196 and facial fear recognition and theory of mind abilities in males. As predicted, males possessing the G allele had significantly poorer facial fear detection accuracy and theory of mind abilities than males possessing the A allele (with SNP variant accounting for up to 4.6% of variance). Variation in the X-linked EFHC2 gene at SNP rs7055196 is therefore associated with social cognitive abilities in males. PMID:26107779

  10. SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.

    PubMed

    Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong

    2010-04-08

    PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.
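
    The core idea of restriction enzyme mining for a SNP can be sketched in a few lines: an enzyme is useful for PCR-RFLP genotyping when its recognition site is present with one allele and absent with the other. The code below is not the SNP-RFLPing 2 implementation; the function names and the flanking sequences are hypothetical. The recognition site GAATTC is that of EcoRI.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def has_site(seq, site):
    """True if the recognition site occurs on either strand of seq."""
    return site in seq or reverse_complement(site) in seq


def rflp_distinguishes(flank5, allele_a, allele_b, flank3, site):
    """Check whether an enzyme site separates two alleles of a SNP.

    Returns (distinguishes, cuts_allele_a, cuts_allele_b).
    """
    cuts_a = has_site(flank5 + allele_a + flank3, site)
    cuts_b = has_site(flank5 + allele_b + flank3, site)
    return cuts_a != cuts_b, cuts_a, cuts_b


# Hypothetical A/G SNP whose A allele completes an EcoRI site (GAATTC).
result = rflp_distinguishes("ACGTGA", "A", "G", "TTCGGA", "GAATTC")
print(result)
```

    A real tool would scan a full enzyme catalogue, handle degenerate recognition sequences, and design mutagenic primers when no natural site exists. None of that is attempted here.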

  11. Proper joint analysis of summary association statistics requires the adjustment of heterogeneity in SNP coverage pattern.

    PubMed

    Zhang, Han; Wheeler, William; Song, Lei; Yu, Kai

    2017-07-07

    As meta-analysis results published by consortia of genome-wide association studies (GWASs) become increasingly available, many association summary statistics-based multi-locus tests have been developed to jointly evaluate multiple single-nucleotide polymorphisms (SNPs) to reveal novel genetic architectures of various complex traits. The validity of these approaches relies on the accurate estimate of z-score correlations at considered SNPs, which in turn requires knowledge on the set of SNPs assessed by each study participating in the meta-analysis. However, this exact SNP coverage information is usually unavailable from the meta-analysis results published by GWAS consortia. In the absence of the coverage information, researchers typically estimate the z-score correlations by making oversimplified coverage assumptions. We show through real studies that such a practice can generate highly inflated type I errors, and we demonstrate the proper way to incorporate correct coverage information into multi-locus analyses. We advocate that consortia should make SNP coverage information available when posting their meta-analysis results, and that investigators who develop analytic tools for joint analyses based on summary data should pay attention to the variation in SNP coverage and adjust for it appropriately. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.

  12. VarDetect: a nucleotide sequence variation exploratory tool

    PubMed Central

    Ngamphiw, Chumpol; Kulawonganunchai, Supasak; Assawamakin, Anunchai; Jenwitheesuk, Ekachai; Tongsima, Sissades

    2008-01-01

    Background Single nucleotide polymorphisms (SNPs) are the most commonly studied units of genetic variation. The discovery of such variation may help to identify causative gene mutations in monogenic diseases and SNPs associated with predisposing genes in complex diseases. Accurate detection of SNPs requires software that can correctly interpret chromatogram signals to nucleotides. Results We present VarDetect, a stand-alone nucleotide variation exploratory tool that automatically detects nucleotide variation from fluorescence based chromatogram traces. Accurate SNP base-calling is achieved using pre-calculated peak content ratios, and is enhanced by rules which account for common sequence reading artifacts. The proposed software tool is benchmarked against four other well-known SNP discovery software tools (PolyPhred, novoSNP, Genalys and Mutation Surveyor) using fluorescence based chromatograms from 15 human genes. These chromatograms were obtained from sequencing 16 two-pooled DNA samples; a total of 32 individual DNA samples. In this comparison of automatic SNP detection tools, VarDetect achieved the highest detection efficiency. Availability VarDetect is compatible with most major operating systems such as Microsoft Windows, Linux, and Mac OSX. The current version of VarDetect is freely available at . PMID:19091032

  13. Pharmacogenetics of steroid-responsive acute graft-versus-host disease.

    PubMed

    Arora, Mukta; Weisdorf, Daniel J; Shanley, Ryan M; Thyagarajan, Bharat

    2017-05-01

    Glucocorticoids are central to effective therapy of acute graft-versus-host disease (GVHD). However, only about half of the patients respond to steroids in initial therapy. Based on postulated mechanisms for anti-inflammatory effectiveness, we explored genetic variations in glucocorticoid receptor, co-chaperone proteins, membrane transporters, inflammatory mediators, and variants in the T-cell receptor complex in hematopoietic cell transplant recipients with acute GVHD requiring treatment with steroids and their donors toward response at day 28 after initiation of therapy. A total of 300 recipient and donor samples were analyzed. Twenty-three SNPs in 17 genes affecting glucocorticoid pathways were included in the analysis. In multiple regression analysis, donor SNP rs3192177 in the ZAP70 gene (O.R. 2.8, 95% CI: 1.3-6.0, P=.008) and donor SNP rs34471628 in the DUSP1 gene (O.R. 0.3, 95% CI: 0.1-1.0, P=.048) were significantly associated with complete or partial response. However, after adjustment for multiple testing, these SNPs did not remain statistically significant. Our results from this small, exploratory, hypothesis-generating analysis suggest that common genetic variation in glucocorticoid pathways may help identify subjects with differential response to glucocorticoids. This needs further assessment in larger datasets and, if validated, could help identify subjects for alternative treatments and design targeted treatments to overcome steroid resistance. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. SNP discovery and genotyping using Genotyping-by-Sequencing in Pekin ducks.

    PubMed

    Zhu, Feng; Cui, Qian-Qian; Hou, Zhuo-Cheng

    2016-11-15

    Genomic selection and genome-wide association studies need thousands to millions of SNPs. However, many non-model species do not have reference chips for detecting variation. Our goal was to develop and validate an inexpensive but effective method for detecting SNP variation. Genotyping by sequencing (GBS) can be a highly efficient strategy for genome-wide SNP detection, as an alternative to microarray chips. Here, we developed a GBS protocol for ducks and tested it to genotype 49 Pekin ducks. A total of 169,209 SNPs were identified from all animals, with a mean of 55,920 SNPs per individual. The average SNP density reached 1156 SNPs/Mb. In this study, the first application of GBS to ducks, we demonstrate the power and simplicity of this method. GBS can be used in genetic studies as an effective method for genome-wide SNP discovery.

  15. Genevar: a database and Java application for the analysis and visualization of SNP-gene associations in eQTL studies.

    PubMed

    Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B; Dimas, Antigone S; Gutierrez-Arcelus, Maria; Stranger, Barbara E; Deloukas, Panos; Dermitzakis, Emmanouil T

    2010-10-01

    Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. http://www.sanger.ac.uk/resources/software/genevar.

  16. Challenges in the association of human single nucleotide polymorphism mentions with unique database identifiers

    PubMed Central

    2011-01-01

    Background Most information on genomic variations and their associations with phenotypes is covered exclusively in scientific publications rather than in structured databases. These texts commonly describe variations using natural language; database identifiers are seldom mentioned. This complicates the retrieval of variations and associated articles, as well as information extraction, e.g. the search for biological implications. To overcome these challenges, procedures to map textual mentions of variations to database identifiers need to be developed. Results This article describes a workflow for normalization of variation mentions, i.e. their association with unique database identifiers. Common pitfalls in the interpretation of single nucleotide polymorphism (SNP) mentions are highlighted and discussed. The developed normalization procedure achieves a precision of 98.1% and a recall of 67.5% for unambiguous association of variation mentions with dbSNP identifiers on a text corpus based on 296 MEDLINE abstracts containing 527 mentions of SNPs. The annotated corpus is freely available at http://www.scai.fraunhofer.de/snp-normalization-corpus.html. Conclusions Comparable approaches usually focus on variations mentioned on the protein sequence and neglect problems for other SNP mentions. The results presented here indicate that normalizing SNPs described on DNA level is more difficult than the normalization of SNPs described on protein level. The challenges associated with normalization are exemplified with ambiguities and errors, which occur in this corpus. PMID:21992066
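
    The precision and recall quoted in this record follow the usual definitions over true positives, false positives and false negatives. The sketch below is illustrative only: the function names are hypothetical, and the F1 value is derived here from the two reported figures rather than taken from the abstract.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of emitted identifier assignments that are correct."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp: int, fn: int) -> float:
    """Fraction of gold-standard mentions that received a correct identifier."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision p and recall r."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


# Precision and recall as reported in the abstract; F1 is computed here
# and is not a figure stated by the authors.
derived_f1 = f1_score(0.981, 0.675)
print(f"derived F1 = {derived_f1:.3f}")
```

    With the reported precision (98.1%) and recall (67.5%), the harmonic mean comes out at about 0.80, which shows how strongly the lower recall dominates the combined measure.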

  17. Spiking neural P systems with multiple channels.

    PubMed

    Peng, Hong; Yang, Jinyu; Wang, Jun; Wang, Tao; Sun, Zhang; Song, Xiaoxiao; Luo, Xiaohui; Huang, Xiangnian

    2017-11-01

    Spiking neural P systems (SNP systems, in short) are a class of distributed parallel computing systems inspired from the neurophysiological behavior of biological spiking neurons. In this paper, we investigate a new variant of SNP systems in which each neuron has one or more synaptic channels, called spiking neural P systems with multiple channels (SNP-MC systems, in short). The spiking rules with channel label are introduced to handle the firing mechanism of neurons, where the channel labels indicate synaptic channels of transmitting the generated spikes. The computation power of SNP-MC systems is investigated. Specifically, we prove that SNP-MC systems are Turing universal as both number generating and number accepting devices. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs

    PubMed Central

    Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong

    2009-01-01

    Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
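
    Three of the LD statistics listed in this record (D, D' and r2) can be computed for a pair of biallelic SNPs from one haplotype frequency and two allele frequencies. The following is a minimal sketch using the standard textbook definitions; it is not LD2SNPing's code, and the function and argument names are hypothetical.

```python
def ld_stats(p_ab: float, p_a: float, p_b: float):
    """Return (D, |D'|, r^2) for two biallelic SNPs.

    p_ab: frequency of the haplotype carrying allele A at SNP 1 and allele B at SNP 2
    p_a:  frequency of allele A at SNP 1
    p_b:  frequency of allele B at SNP 2
    """
    d = p_ab - p_a * p_b
    # D' rescales D by its maximum attainable magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    # r^2 is the squared correlation between the two loci.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r2


# Illustrative frequencies only (not data from the paper).
d, d_prime, r2 = ld_stats(p_ab=0.3, p_a=0.6, p_b=0.3)
print(f"D={d:.3f}  D'={d_prime:.3f}  r2={r2:.3f}")
```

    In this made-up example D' is 1 while r2 is well below 1, which is the usual reason both statistics are reported side by side: D' reaches 1 whenever one of the four haplotypes is absent, whereas r2 also requires matching allele frequencies.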

  19. SNP ID-info: SNP ID searching and visualization platform.

    PubMed

    Yang, Cheng-Hong; Chuang, Li-Yeh; Cheng, Yu-Huei; Wen, Cheng-Hao; Chang, Phei-Lang; Chang, Hsueh-Wei

    2008-09-01

    Many association studies report relationships between single nucleotide polymorphisms (SNPs) and diseases or cancers without giving a SNP ID. Here, we developed the SNP ID-info freeware, which returns SNP IDs from input genetic and physical information on genomes. The program provides an "SNP-ePCR" function to generate the full sequence using primer and template inputs. In "SNPosition," sequence from SNP-ePCR or direct input is matched against SNP fasta sequences to find the SNP IDs. The "SNP search" and "SNP fasta" functions accept cytogenetic band, contig position, and keyword inputs. Finally, the SNP ID neighboring environment for the inputs is completely visualized in the order of contig position and marked with SNP and flanking hits. The SNP identification problems inherent in NCBI SNP BLAST are also avoided. In conclusion, SNP ID-info provides a visualized SNP ID environment for multiple inputs and assists systematic SNP association studies. The server and user manual are available at http://bio.kuas.edu.tw/snpid-info.

  20. SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate

    USGS Publications Warehouse

    Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon

    2016-01-01

    Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.

  1. A Novel Center Star Multiple Sequence Alignment Algorithm Based on Affine Gap Penalty and K-Band

    NASA Astrophysics Data System (ADS)

    Zou, Quan; Shan, Xiao; Jiang, Yi

    Multiple sequence alignment is one of the most important topics in computational biology, but existing methods cannot yet deal with large data sets. With the development of copy-number variant (CNV) and single nucleotide polymorphism (SNP) research, many researchers want to align large numbers of similar sequences to detect CNVs and SNPs. In this paper, we propose a novel multiple sequence alignment algorithm based on affine gap penalty and k-band. It aligns more quickly and accurately, which will be helpful for mining CNVs and SNPs. Experiments demonstrate the performance of our algorithm.
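
    The two ingredients named in this record, an affine gap penalty and a k-band, can be illustrated on a single pairwise alignment, which is the building block a center star method repeats against the center sequence. The sketch below is not the authors' algorithm: it is a generic three-state (Gotoh-style) global alignment score restricted to cells with |i - j| <= k, and the scoring parameters are arbitrary assumptions.

```python
NEG = float("-inf")


def banded_affine_score(a, b, k, match=1, mismatch=-1, gap_open=-3, gap_extend=-1):
    """Global alignment score of a and b, visiting only cells with |i - j| <= k.

    The first character of a gap costs gap_open; each further one costs gap_extend.
    """
    n, m = len(a), len(b)
    if abs(n - m) > k:
        raise ValueError("band too narrow: k must be >= |len(a) - len(b)|")
    # M: ends in a match/mismatch; X: ends with a[i-1] against a gap;
    # Y: ends with b[j-1] against a gap. Cells outside the band are simply absent.
    M, X, Y = {(0, 0): 0.0}, {}, {}
    for i in range(1, min(n, k) + 1):
        X[(i, 0)] = gap_open + (i - 1) * gap_extend
    for j in range(1, min(m, k) + 1):
        Y[(0, j)] = gap_open + (j - 1) * gap_extend
    for i in range(1, n + 1):
        for j in range(max(1, i - k), min(m, i + k) + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            diag, up, left = (i - 1, j - 1), (i - 1, j), (i, j - 1)
            M[(i, j)] = s + max(M.get(diag, NEG), X.get(diag, NEG), Y.get(diag, NEG))
            X[(i, j)] = max(M.get(up, NEG) + gap_open,
                            X.get(up, NEG) + gap_extend,
                            Y.get(up, NEG) + gap_open)
            Y[(i, j)] = max(M.get(left, NEG) + gap_open,
                            Y.get(left, NEG) + gap_extend,
                            X.get(left, NEG) + gap_open)
    end = (n, m)
    return max(M.get(end, NEG), X.get(end, NEG), Y.get(end, NEG))


print(banded_affine_score("ACGT", "AGT", k=1))
```

    Because only about (2k + 1) cells per row are filled, the work grows with k times the sequence length rather than with the product of the two lengths, which is why a narrow band suits highly similar sequences.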

  2. Investigation of genetic variation in scavenger receptor class B, member 1 (SCARB1) and association with serum carotenoids

    PubMed Central

    McKay, Gareth J; Loane, Edward; Nolan, John M; Patterson, Christopher C; Meyers, Kristin J; Mares, Julie A; Yonova-Doing, Ekaterina; Hammond, Christopher J; Beatty, Stephen; Silvestri, Giuliana

    2013-01-01

    Objective To investigate association of scavenger receptor class B, member 1 (SCARB1) genetic variants with serum carotenoid levels of lutein (L) and zeaxanthin (Z) and macular pigment optical density (MPOD). Design A cross-sectional study of healthy adults aged 20-70. Participants 302 participants recruited following local advertisement. Methods MPOD was measured by customized heterochromatic flicker photometry. Fasting blood samples were taken for serum L and Z measurement by HPLC and lipoprotein analysis by spectrophotometric assay. Forty-seven single nucleotide polymorphisms (SNPs) across SCARB1 were genotyped using Sequenom technology. Association analyses were performed using PLINK to compare allele and haplotype means, with adjustment for potential confounding and correction for multiple comparisons by permutation testing. Replication analysis was performed in the TwinsUK and CAREDS cohorts. Main outcome measures Odds ratios (ORs) for macular pigment optical density area, serum lutein and zeaxanthin concentrations associated with genetic variations in SCARB1 and interactions between SCARB1 and sex. Results Following multiple regression analysis with adjustment for age, body mass index, sex, high-density lipoprotein cholesterol (HDLc), low-density lipoprotein cholesterol (LDLc), triglycerides, smoking, dietary L and Z levels, 5 SNPs were significantly associated with serum L concentration and 1 SNP with MPOD (P<0.01). Only the association between rs11057841 and serum L withstood correction for multiple comparisons by permutation testing (P<0.01) and replicated in the TwinsUK cohort (P=0.014). Independent replication was also observed in the CAREDS cohort with rs10846744 (P=2×10−4), a SNP in high linkage disequilibrium with rs11057841 (r2=0.93). No significant interactions by sex were found. Haplotype analysis revealed no stronger association than obtained with single SNP analyses. Conclusions Our study has identified an association between rs11057841 and serum L concentration (24% increase per T allele) in healthy subjects, independent of potential confounding factors. Our data support further evaluation of the role of SCARB1 in the transport of macular pigment and the possible modulation of AMD risk through combating the effects of oxidative stress within the retina. PMID:23562302

  3. Genome-wide association reveals that common genetic variation in the kallikrein-kinin system is associated with serum L-arginine levels.

    PubMed

    Zhang, Weihua; Jernerén, Fredrik; Lehne, Benjamin C; Chen, Ming-Huei; Luben, Robert N; Johnston, Carole; Elshorbagy, Amany; Eppinga, Ruben N; Scott, William R; Adeyeye, Elizabeth; Scott, James; Böger, Rainer H; Khaw, Kay-Tee; van der Harst, Pim; Wareham, Nicholas J; Vasan, Ramachandran S; Chambers, John C; Refsum, Helga; Kooner, Jaspal S

    2016-11-30

    L-arginine is the essential precursor of nitric oxide, and is involved in multiple key physiological processes, including vascular and immune function. The genetic regulation of blood L-arginine levels is largely unknown. We performed a genome-wide association study (GWAS) to identify genetic factors determining serum L-arginine levels, amongst 901 Europeans and 1,394 Indian Asians. We show that common genetic variations at the KLKB1 and F12 loci are strongly associated with serum L-arginine levels. The G allele of single nucleotide polymorphism (SNP) rs71640036 (T/G) in KLKB1 is associated with lower serum L-arginine concentrations (10 µmol/l per allele copy, p=1×10^-24), while allele T of rs2545801 (T/C) near the F12 gene is associated with lower serum L-arginine levels (7 µmol/l per allele copy, p=7×10^-12). Together these two loci explain 7% of the total variance in serum L-arginine concentrations. The associations at both loci were replicated in independent cohorts with plasma L-arginine measurements (p<0.004). The two sentinel SNPs are in nearly complete LD with the nonsynonymous SNP rs3733402 at KLKB1 and the 5'-UTR SNP rs1801020 at F12, respectively. SNPs at both loci are associated with blood pressure. Our findings provide new insight into the genetic regulation of L-arginine and its potential relationship with cardiovascular risk.

  4. A global reference for human genetic variation

    PubMed Central

    2016-01-01

    The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies. PMID:26432245

  5. RExPrimer: an integrated primer designing tool increases PCR effectiveness by avoiding 3' SNP-in-primer and mis-priming from structural variation

    PubMed Central

    2009-01-01

    Background Polymerase chain reaction (PCR) is very useful in many areas of molecular biology research. It is commonly observed that PCR success is critically dependent on design of an effective primer pair. Current tools for primer design do not adequately address the problem of PCR failure due to mis-priming on target-related sequences and structural variations in the genome. Methods We have developed an integrated graphical web-based application for primer design, called RExPrimer, which was written in Python. The software uses Primer3 as the core primer design algorithm. Locally stored sequence information and genomic variant information were hosted on MySQL v5.0 and were incorporated into RExPrimer. Results RExPrimer provides many functionalities for improved PCR primer design. Several databases, namely annotated human SNP databases, an insertion/deletion (indel) polymorphism database, a pseudogene database, and structural genomic variation databases, were integrated into RExPrimer, enabling effective validation of the resulting primers without leaving the website. By incorporating these databases, the primers reported by RExPrimer avoid mis-priming to related sequences (e.g. pseudogene, segmental duplication) as well as possible PCR failure because of structural polymorphisms (SNP, indel, and copy number variation (CNV)). To prevent mismatching caused by unexpected SNPs in the designed primers, in particular at the 3' end (SNP-in-Primer), several SNP databases covering a broad range of population-specific SNP information are utilized to report SNPs present in the primer sequences. Population-specific SNP information also helps customize primer design for a specific population. Furthermore, RExPrimer offers a user-friendly graphical interface that uses scalable vector graphics images to intuitively present the resulting primers along with the corresponding gene structure. In this study, we demonstrated the program's effectiveness in successfully generating primers for strongly homologous sequences. Conclusion The improvements for primer design incorporated into RExPrimer were demonstrated to be effective in designing primers for challenging PCR experiments. Integration of SNP and structural variation databases allows for robust primer design for a variety of PCR applications, irrespective of the sequence complexity in the region of interest. This software is freely available at http://www4a.biotec.or.th/rexprimer. PMID:19958502
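
    The SNP-in-Primer check described in this record amounts to asking whether any known SNP coordinate falls inside the last few bases at the 3' end of a primer. The sketch below is a hypothetical illustration of that idea, not RExPrimer's code: it assumes 1-based inclusive genomic coordinates, and the 5-base window is an arbitrary choice.

```python
def snps_near_3prime(primer_start, primer_len, snp_positions, strand="+", window=5):
    """Return SNP positions lying within `window` bases of the primer's 3' end.

    primer_start: leftmost genomic coordinate covered by the primer (1-based)
    strand: "+" puts the 3' end at the highest coordinate, "-" at the lowest
    """
    primer_end = primer_start + primer_len - 1
    if strand == "+":
        lo, hi = primer_end - window + 1, primer_end
    elif strand == "-":
        lo, hi = primer_start, primer_start + window - 1
    else:
        raise ValueError("strand must be '+' or '-'")
    # Clamp the window so it never extends beyond the primer itself.
    lo, hi = max(lo, primer_start), min(hi, primer_end)
    return sorted(p for p in snp_positions if lo <= p <= hi)


# Illustrative coordinates only: a 20-base primer covering positions 100-119.
print(snps_near_3prime(100, 20, [101, 117, 119, 120], strand="+"))
print(snps_near_3prime(100, 20, [101, 117, 119, 120], strand="-"))
```

    A primer design tool could reject or down-rank any candidate for which this list is non-empty, since a mismatch at the 3' end is the one most likely to block extension.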

  6. Analyses of single nucleotide polymorphisms in selected nutrient-sensitive genes in weight-regain prevention: the DIOGENES study.

    PubMed

    Larsen, Lesli H; Angquist, Lars; Vimaleswaran, Karani S; Hager, Jörg; Viguerie, Nathalie; Loos, Ruth J F; Handjieva-Darlenska, Teodora; Jebb, Susan A; Kunesova, Marie; Larsen, Thomas M; Martinez, J Alfredo; Papadaki, Angeliki; Pfeiffer, Andreas F H; van Baak, Marleen A; Sørensen, Thorkild Ia; Holst, Claus; Langin, Dominique; Astrup, Arne; Saris, Wim H M

    2012-05-01

    Differences in the interindividual response to dietary intervention could be modified by genetic variation in nutrient-sensitive genes. This study examined single nucleotide polymorphisms (SNPs) in presumed nutrient-sensitive candidate genes for obesity and obesity-related diseases for main and dietary interaction effects on weight, waist circumference, and fat mass regain over 6 mo. In total, 742 participants who had lost ≥ 8% of their initial body weight were randomly assigned to follow 1 of 5 different ad libitum diets with different glycemic indexes and contents of dietary protein. The SNP main and SNP-diet interaction effects were analyzed by using linear regression models, corrected for multiple testing by using Bonferroni correction and evaluated by using quantile-quantile (Q-Q) plots. After correction for multiple testing, none of the SNPs were significantly associated with weight, waist circumference, or fat mass regain. Q-Q plots showed that ALOX5AP rs4769873 showed a higher observed than predicted P value for the association with less waist circumference regain over 6 mo (-3.1 cm/allele; 95% CI: -4.6, -1.6; P/Bonferroni-corrected P = 0.000039/0.076), independently of diet. Additional associations were identified by using Q-Q plots for SNPs in ALOX5AP, TNF, and KCNJ11 for main effects; in LPL and TUB for glycemic index interaction effects on waist circumference regain; in GHRL, CCK, MLXIPL, and LEPR on weight; in PPARC1A, PCK2, ALOX5AP, PYY, and ADRB3 on waist circumference; and in PPARD, FABP1, PLAUR, and LPIN1 on fat mass regain for dietary protein interaction. The observed effects of SNP-diet interactions on weight, waist, and fat mass regain suggest that genetic variation in nutrient-sensitive genes can modify the response to diet. This trial was registered at clinicaltrials.gov as NCT00390637.
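
    The Bonferroni correction used in this record multiplies each raw P value by the number of tests and caps the result at 1. The sketch below is illustrative: the abstract does not state the number of tests, so the figure of roughly 1,949 is only back-calculated here from the reported pair of P values (0.076 / 0.000039) and should be read as an assumption.

```python
def bonferroni(p_raw: float, n_tests: int) -> float:
    """Bonferroni-adjusted P value: raw P times the number of tests, capped at 1."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return min(1.0, p_raw * n_tests)


# Raw and corrected P values as reported for ALOX5AP rs4769873.
p_raw, p_corrected = 0.000039, 0.076

# Assumption: number of tests implied by the ratio, not stated in the abstract.
implied_tests = round(p_corrected / p_raw)
print(implied_tests, bonferroni(p_raw, implied_tests))
```

    Under that assumption the adjusted value lands just above the conventional 0.05 threshold, which is consistent with the statement that no SNP remained significant after correction.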

  7. Genevar: a database and Java application for the analysis and visualization of SNP-gene associations in eQTL studies

    PubMed Central

    Yang, Tsun-Po; Beazley, Claude; Montgomery, Stephen B.; Dimas, Antigone S.; Gutierrez-Arcelus, Maria; Stranger, Barbara E.; Deloukas, Panos; Dermitzakis, Emmanouil T.

    2010-01-01

    Summary: Genevar (GENe Expression VARiation) is a database and Java tool designed to integrate multiple datasets, and provides analysis and visualization of associations between sequence variation and gene expression. Genevar allows researchers to investigate expression quantitative trait loci (eQTL) associations within a gene locus of interest in real time. The database and application can be installed on a standard computer in database mode and, in addition, on a server to share discoveries among affiliations or the broader community over the Internet via web services protocols. Availability: http://www.sanger.ac.uk/resources/software/genevar Contact: emmanouil.dermitzakis@unige.ch PMID:20702402

  8. Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

    USDA-ARS's Scientific Manuscript database

    Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases...

  9. Genetic variation in the oxytocin receptor (OXTR) gene is associated with Asperger Syndrome.

    PubMed

    Di Napoli, Agnese; Warrier, Varun; Baron-Cohen, Simon; Chakrabarti, Bhismadev

    2014-01-01

    Autism Spectrum Conditions (ASC) are a group of neurodevelopmental conditions characterized by impairments in communication and social interaction, alongside unusually repetitive behaviors and narrow interests. ASC are highly heritable and have complex patterns of inheritance where multiple genes are involved, alongside environmental and epigenetic factors. Asperger Syndrome (AS) is a subgroup of these conditions, where there is no history of language or cognitive delay. Animal models suggest a role for oxytocin (OXT) and oxytocin receptor (OXTR) genes in social-emotional behaviors, and several studies indicate that the oxytocin/oxytocin receptor system is altered in individuals with ASC. Previous studies have reported associations between genetic variations in the OXTR gene and ASC. The present study tested for an association between nine single nucleotide polymorphisms (SNPs) in the OXTR gene and AS in 530 individuals of Caucasian origin, using SNP association test and haplotype analysis. There was a significant association between rs2268493 in OXTR and AS. Multiple haplotypes that include this SNP (rs2268493-rs2254298, rs2268490-rs2268493-rs2254298, rs2268493-rs2254298-rs53576, rs237885-rs2268490-rs2268493-rs2254298, rs2268490-rs2268493-rs2254298-rs53576) were also associated with AS. rs2268493 has been previously associated with ASC and putatively alters several transcription factor-binding sites and regulates chromatin states, either directly or through other variants in linkage disequilibrium (LD). This study reports a significant association of the sequence variant rs2268493 in the OXTR gene and associated haplotypes with AS.

  10. Evaluation of copy number variation detection for a SNP array platform

    PubMed Central

    2014-01-01

    Background Copy number variations (CNVs) are usually inferred from single nucleotide polymorphism (SNP) arrays using software packages based on particular algorithms. However, the performance of these packages is not clearly understood, which makes it difficult to select one or several of them for CNV detection on a SNP array platform. We selected four publicly available software packages designed for CNV calling from an Affymetrix SNP array: Birdsuite, dChip, Genotyping Console (GTC) and PennCNV. A publicly available dataset generated by array-based comparative genomic hybridization (CGH), with a resolution of 24 million probes per sample, was considered to be the “gold standard”. The success rate, average stability rate, sensitivity, consistency and reproducibility of the four packages were assessed against this “gold standard”. We also compared the efficiency of detecting CNVs simultaneously with two, three and all four packages against detection with a single package. Results In terms of the number of CNVs detected, Birdsuite detected the most and GTC the fewest. Birdsuite and dChip showed obvious detection bias, and GTC seemed to be inferior because it detected the fewest CNVs. We then investigated the consistency of detection between each package and the remaining three. The consistency of dChip was the lowest and that of GTC the highest. Compared with the CGH results, in the matching group GTC called the most matching CNVs and PennCNV-Affy ranked second. In the non-overlapping group, GTC called the fewest CNVs. With regard to the reproducibility of CNV calling, larger CNVs were usually replicated better; PennCNV-Affy showed the best consistency and Birdsuite the poorest. Conclusion We found that PennCNV outperformed the other three packages in the sensitivity and specificity of CNV calling. Each calling method had its own limitations and advantages for different data analyses. Therefore, optimized calling methods might be identified by using multiple algorithms to evaluate the concordance and discordance of SNP array-based CNV calling. PMID:24555668

  11. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus).

    PubMed

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-07-27

    Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.

  12. Molecular Characterization of Bovine SMO Gene and Effects of Its Genetic Variations on Body Size Traits in Qinchuan Cattle (Bos taurus)

    PubMed Central

    Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen

    2015-01-01

    Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1–8) were identified and genotyped via direct sequencing covering most of the coding region and 3ʹUTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3ʹUTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs. PMID:26225956

  13. Pedigree- and SNP-Associated Genetics and Recent Environment are the Major Contributors to Anthropometric and Cardiometabolic Trait Variation.

    PubMed

    Xia, Charley; Amador, Carmen; Huffman, Jennifer; Trochet, Holly; Campbell, Archie; Porteous, David; Hastie, Nicholas D; Hayward, Caroline; Vitart, Veronique; Navarro, Pau; Haley, Chris S

    2016-02-01

    Genome-wide association studies have successfully identified thousands of loci for a range of human complex traits and diseases. The proportion of phenotypic variance explained by significant associations is, however, limited. Given the same dense SNP panels, mixed model analyses capture a greater proportion of phenotypic variance than single SNP analyses but the total is generally still less than the genetic variance estimated from pedigree studies. Combining information from pedigree relationships and SNPs, we examined 16 complex anthropometric and cardiometabolic traits in a Scottish family-based cohort comprising up to 20,000 individuals genotyped for ~520,000 common autosomal SNPs. The inclusion of related individuals provides the opportunity to also estimate the genetic variance associated with pedigree as well as the effects of common family environment. Trait variation was partitioned into SNP-associated and pedigree-associated genetic variation, shared nuclear family environment, shared couple (partner) environment and shared full-sibling environment. Results demonstrate that trait heritabilities vary widely but, on average across traits, SNP-associated and pedigree-associated genetic effects each explain around half the genetic variance. For most traits the recently-shared environment of couples is also significant, accounting for ~11% of the phenotypic variance on average. On the other hand, the environment shared largely in the past by members of a nuclear family or by full-siblings, has a more limited impact. Our findings point to appropriate models to use in future studies as pedigree-associated genetic effects and couple environmental effects have seldom been taken into account in genotype-based analyses. Appropriate description of the trait variation could help understand causes of intra-individual variation and in the detection of contributing loci and environmental factors.

  14. Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?

    PubMed

    Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F

    2006-06-01

    Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs.
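
    The cross-species comparison above rests on a correlation between per-gene cSNP densities. A minimal sketch of that calculation follows, assuming a Pearson coefficient and densities expressed per kilobase of coding sequence; the abstract states neither the coefficient type nor the density unit. The gene counts and the helper names `csnp_density` and `pearson_r` are hypothetical and are not taken from the study.

```python
import math


def csnp_density(n_csnps, coding_length_bp):
    """Return cSNPs per kilobase of coding sequence (assumed unit)."""
    if coding_length_bp <= 0:
        raise ValueError("coding length must be positive")
    return 1000.0 * n_csnps / coding_length_bp


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical (made-up) data for five homologous genes: (cSNP count, coding bp).
pig = [(2, 1500), (5, 2100), (1, 900), (7, 3000), (3, 1200)]
human = [(3, 1600), (8, 2000), (1, 1000), (9, 3100), (4, 1100)]

pig_density = [csnp_density(n, bp) for n, bp in pig]
human_density = [csnp_density(n, bp) for n, bp in human]

r = pearson_r(pig_density, human_density)
print(f"r = {r:.2f}")
```

    Normalising by coding length before correlating keeps long genes from dominating the comparison simply because they offer more sites at which a SNP can occur.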

  15. Variation in Recombination Rate and Its Genetic Determinism in Sheep Populations

    PubMed Central

    Petit, Morgane; Astruc, Jean-Michel; Sarry, Julien; Drouilhet, Laurence; Fabre, Stéphane; Moreno, Carole R.; Servin, Bertrand

    2017-01-01

    Recombination is a complex biological process that results from a cascade of multiple events during meiosis. Understanding the genetic determinism of recombination can help to understand if and how these events are interacting. To tackle this question, we studied the patterns of recombination in sheep, using multiple approaches and data sets. We constructed male recombination maps in a dairy breed from the south of France (the Lacaune breed) at a fine scale by combining meiotic recombination rates from a large pedigree genotyped with a 50K SNP array and historical recombination rates from a sample of unrelated individuals genotyped with a 600K SNP array. This analysis revealed recombination patterns in sheep similar to other mammals but also genome regions that have likely been affected by directional and diversifying selection. We estimated the average recombination rate of Lacaune sheep at 1.5 cM/Mb, identified ∼50,000 crossover hotspots on the genome, and found a high correlation between historical and meiotic recombination rate estimates. A genome-wide association study revealed two major loci affecting interindividual variation in recombination rate in Lacaune, including the RNF212 and HEI10 genes and possibly two other loci of smaller effects including the KCNJ15 and FSHR genes. The comparison of these new results to those obtained previously in a distantly related population of domestic sheep (the Soay) revealed that Soay and Lacaune males have a very similar distribution of recombination along the genome. The two data sets were thus combined to create more precise male meiotic recombination maps in Sheep. However, despite their similar recombination maps, Soay and Lacaune males were found to exhibit different heritabilities and QTL effects for interindividual variation in genome-wide recombination rates. This highlights the robustness of recombination patterns to underlying variation in their genetic determinism. PMID:28978774

  16. Variation in Recombination Rate and Its Genetic Determinism in Sheep Populations.

    PubMed

    Petit, Morgane; Astruc, Jean-Michel; Sarry, Julien; Drouilhet, Laurence; Fabre, Stéphane; Moreno, Carole R; Servin, Bertrand

    2017-10-01

    Recombination is a complex biological process that results from a cascade of multiple events during meiosis. Understanding the genetic determinism of recombination can help to understand if and how these events are interacting. To tackle this question, we studied the patterns of recombination in sheep, using multiple approaches and data sets. We constructed male recombination maps in a dairy breed from the south of France (the Lacaune breed) at a fine scale by combining meiotic recombination rates from a large pedigree genotyped with a 50K SNP array and historical recombination rates from a sample of unrelated individuals genotyped with a 600K SNP array. This analysis revealed recombination patterns in sheep similar to other mammals but also genome regions that have likely been affected by directional and diversifying selection. We estimated the average recombination rate of Lacaune sheep at 1.5 cM/Mb, identified ∼50,000 crossover hotspots on the genome, and found a high correlation between historical and meiotic recombination rate estimates. A genome-wide association study revealed two major loci affecting interindividual variation in recombination rate in Lacaune, including the RNF212 and HEI10 genes and possibly two other loci of smaller effects including the KCNJ15 and FSHR genes. The comparison of these new results to those obtained previously in a distantly related population of domestic sheep (the Soay) revealed that Soay and Lacaune males have a very similar distribution of recombination along the genome. The two data sets were thus combined to create more precise male meiotic recombination maps in Sheep. However, despite their similar recombination maps, Soay and Lacaune males were found to exhibit different heritabilities and QTL effects for interindividual variation in genome-wide recombination rates. This highlights the robustness of recombination patterns to underlying variation in their genetic determinism. 
    Copyright © 2017 by the Genetics Society of America.

  17. Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

    PubMed Central

    Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; International Wheat Genome Sequencing Consortium; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown-Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming-Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard

    2014-01-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker–trait associations in mapping experiments. We developed a genotyping array including about 90 000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence–absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat. PMID:24646323

  18. A Candidate Gene Association Study of Bone Mineral Density in an Iranian Population.

    PubMed

    Dastgheib, Seyed Alireza; Gartland, Alison; Tabei, Seyed Mohammad Bagher; Omrani, Gholamhossein Ranjbar; Teare, Marion Dawn

    2016-01-01

    The genetic epidemiology of variation in bone mineral density (BMD) and osteoporosis is not well studied in Iranian populations and needs more research. We report a candidate gene association study of BMD variation in a healthy cross-sectional study of 501 males and females sampled from the Iranian Multi-Centre Osteoporosis Study, Shiraz, Iran. We selected 21 single nucleotide polymorphisms (SNPs) located in the 7 candidate genes LRP5, RANK, RANKL, OPG, P2RX7, VDR, and ESR1 for association testing. BMD was measured at the three sites L2-L4, neck of femur, and total hip. Association between BMD and each SNP was assessed using multiple linear regression assuming an allele dose (additive effect) on BMD (adjusted for age and sex). Statistically significant (at the unadjusted 5% level) associations were seen with seven SNPs in five of the candidate genes. Two SNPs showed statistically significant association with more than one BMD site. Significant association was seen between BMD at all three sites and the VDR SNP rs731246 (L2-L4 p = 0.038; neck of femur p = 0.001; and total hip p < 0.001). The T allele was consistently associated with lower BMD than the C allele. Significant association was also seen for the P2RX7 SNP rs3751143, where the G allele was consistently associated with lower BMD than the T allele (L2-L4 p = 0.069; neck of femur p = 0.024; and total hip p = 0.045).
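
    The additive (allele dose) model described above can be illustrated as follows. This is a minimal sketch under stated assumptions, not the authors' analysis: genotypes are coded as the number of copies of one allele (0, 1 or 2), sex is coded 0/1, and the coefficients are obtained by ordinary least squares through the normal equations. All data values and the function names `allele_dose`, `solve_linear_system` and `fit_additive_model` are hypothetical.

```python
def allele_dose(genotype, counted_allele):
    """Number of copies (0, 1 or 2) of counted_allele in a genotype such as 'CT'."""
    return genotype.count(counted_allele)


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("design matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_additive_model(doses, ages, sexes, bmd):
    """Least-squares fit of BMD = b0 + b1*dose + b2*age + b3*sex."""
    rows = [[1.0, float(d), float(a), float(s)]
            for d, a, s in zip(doses, ages, sexes)]
    k = 4
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, bmd)) for i in range(k)]
    return solve_linear_system(xtx, xty)


# Hypothetical, noise-free toy data generated from known coefficients.
genotypes = ["CC", "CT", "TT", "CT", "CC", "TT", "CT", "CC"]
ages = [45, 52, 60, 38, 70, 55, 48, 63]
sexes = [0, 1, 0, 1, 1, 0, 0, 1]
doses = [allele_dose(g, "T") for g in genotypes]
bmd = [1.2 - 0.05 * d - 0.004 * a + 0.08 * s
       for d, a, s in zip(doses, ages, sexes)]

beta = fit_additive_model(doses, ages, sexes, bmd)
print("per-allele effect on BMD:", round(beta[1], 4))
```

    The coefficient on the dose term is the estimated change in BMD per additional copy of the counted allele with age and sex held fixed, which is what "additive effect" means in this setting. A real analysis would also need standard errors and p-values, which this sketch omits.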

  19. Single nucleotide polymorphism (SNP) variation of wolves (Canis lupus) in Southeast Alaska and comparison with wolves, dogs, and coyotes in North America.

    PubMed

    Cronin, Matthew A; Cánovas, Angela; Bannasch, Danika L; Oberbauer, Anita M; Medrano, Juan F

    2015-01-01

    There is considerable interest in the genetics of wolves (Canis lupus) because of their close relationship to domestic dogs (C. familiaris) and the need for informed conservation and management. This includes wolf populations in Southeast Alaska for which we determined genotypes of 305 wolves at 173662 single nucleotide polymorphism (SNP) loci. After removal of invariant and linked SNP, 123801 SNP were used to quantify genetic differentiation of wolves in Southeast Alaska and wolves, coyotes (C. latrans), and dogs from other areas in North America. There is differentiation of SNP allele frequencies between the species (wolves, coyotes, and dogs), although differentiation is relatively low between some wolf and coyote populations. There are varying levels of differentiation among populations of wolves, including low differentiation of wolves in interior Alaska, British Columbia, and the northern US Rocky Mountains. There is considerable differentiation of SNP allele frequencies of wolves in Southeast Alaska from wolves in other areas. However, wolves in Southeast Alaska are not a genetically homogeneous group and there are comparable levels of genetic differentiation among areas within Southeast Alaska and between Southeast Alaska and other geographic areas. SNP variation and other genetic data are discussed regarding taxonomy and management. © The American Genetic Association 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  20. Distinct contributions of replication and transcription to mutation rate variation of human genomes.

    PubMed

    Cui, Peng; Ding, Feng; Lin, Qiang; Zhang, Lingfang; Li, Ang; Zhang, Zhang; Hu, Songnian; Yu, Jun

    2012-02-01

    Here, we evaluate the contribution of two major biological processes--DNA replication and transcription--to mutation rate variation in human genomes. Based on analysis of public human tissue transcriptomics data, a high-resolution replication map of HeLa cells and dbSNP data, we present significant correlations between expression breadth, replication time in local regions and SNP density. SNP density of tissue-specific (TS) genes is significantly higher than that of housekeeping (HK) genes. TS genes tend to be located in late-replicating genomic regions, and genes in such regions have a higher SNP density than those in early-replicating regions. In addition, SNP density is found to be positively correlated with expression level among HK genes. We conclude that the process of DNA replication generates stronger mutational pressure than transcription-associated biological processes do, resulting in an increase of mutation rate in TS genes while having weaker effects on HK genes. In contrast, transcription-associated processes are mainly responsible for the accumulation of mutations in highly-expressed HK genes. Copyright © 2012 Beijing Genomics Institute. Published by Elsevier Ltd. All rights reserved.

  1. Development of new SNP derived cleaved amplified polymorphic sequence marker set and its successful utilization in the genetic analysis of seed color variation in barley.

    PubMed

    Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad

    2016-03-01

    The aim of the present study was to develop a new cost-effective PCR-based CAPS marker set using the advantages of high-throughput SNP genotyping. Initially, a SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping, which resulted in 6334 polymorphic SNP markers. Principal component analysis using these marker data showed fine differentiation of the diverse barley gene pool. To this end, we developed 200 SNP-derived CAPS markers distributed across the genome, covering around 991 cM with an average marker density of 5.09 cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of a single nuclear gene on chromosome 1H. These findings provide proof of concept for the development and utility of a newer cost-effective genomic tool kit to analyze broader genetic resources of barley worldwide. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Allelic clustering and ancestry-dependent frequencies of rs6232, rs6234, and rs6235 PCSK1 SNPs in a Northern Ontario population sample.

    PubMed

    Sirois, Francine; Kaefer, Nadine; Currie, Krista A; Chrétien, Michel; Nkongolo, Kabwe K; Mbikay, Majambu

    2012-10-01

    The PCSK1 (proprotein convertase subtilisin/kexin type 1) locus encodes proprotein convertase 1/3, an endoprotease that converts prohormones and proneuropeptides to their active forms. Spontaneous loss-of-function mutations in the coding sequence of its gene have been linked to obesity in humans. Minor alleles of two common non-synonymous single-nucleotide polymorphisms (SNPs), rs6232 (T > C, N221D) and rs6235 (C > G, S690T), have been associated with increased risk of obesity in European populations. In this study, we compared the frequencies of the rs6232 and rs6234 (G > C, Q665E) SNPs in Aboriginal and Caucasian populations of Northern Ontario. Both SNPs were less frequent in Aboriginals: the minor allele frequency of the rs6232 SNP was 0.01 in Aboriginals and 0.08 in Caucasians (P < 4 × 10⁻⁶); for the rs6234 SNP, it was 0.20 and 0.32, respectively (P < 0.001). Resequencing revealed that the rs6234 SNP variation was tightly linked to that of the rs6235 SNP, as previously reported. Most interestingly, all carriers of the rs6232 SNP variation also carried the rs6234/rs6235 SNP clustered variations, but not the reverse, suggesting the former occurred later on an allele already carrying the latter. These data indicate that, in Northern Ontario Aboriginals, the triple-variant PCSK1 allele is relatively rare and might be of lesser significance for obesity risk in this population.
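
    The minor allele frequencies quoted above follow from genotype counts by a simple allele tally. The sketch below shows that arithmetic. The genotype counts are hypothetical, chosen only so that the results match the reported rs6232 frequencies of 0.01 and 0.08; the abstract does not give the actual counts or sample sizes, and the function name is an assumption.

```python
def minor_allele_frequency(n_hom_major, n_het, n_hom_minor):
    """Frequency of the designated minor allele from diploid genotype counts.

    Each heterozygote carries one copy and each minor homozygote two copies,
    out of two alleles per genotyped individual.
    """
    total_alleles = 2 * (n_hom_major + n_het + n_hom_minor)
    if total_alleles == 0:
        raise ValueError("no genotyped individuals")
    return (n_het + 2 * n_hom_minor) / total_alleles


# Hypothetical counts (not from the study): 100 individuals per group.
maf_group_a = minor_allele_frequency(n_hom_major=98, n_het=2, n_hom_minor=0)
maf_group_b = minor_allele_frequency(n_hom_major=85, n_het=14, n_hom_minor=1)

print(f"group A MAF = {maf_group_a:.2f}")
print(f"group B MAF = {maf_group_b:.2f}")
```

    Testing whether two such frequencies differ (the P values in the abstract) requires an additional step, for example a test on the 2 × 2 table of allele counts, which is not shown here.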

  3. Selection and Management of DNA Markers for Use in Genomic Evaluation

    USDA-ARS's Scientific Manuscript database

    A database was constructed to store genotypes for 50,972 single-nucleotide polymorphisms (SNP) from the Illumina BovineSNP50 BeadChip for over 30,000 animals. The database allows storage of multiple samples per animal and stores all SNP genotypes for a sample in a single row. An indicator specifies ...

  4. A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses

    PubMed Central

    2010-01-01

    Background Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. Results A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [P(unadj.) = 6.96 × 10⁻⁶]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (P(unadj.) = 1.61 × 10⁻⁹; P(Bonf.) = 6.58 × 10⁻⁵). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r² = 0.86). Conclusions Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10⁻¹⁰; BIEC2-417495, P(unadj.) = 1.61 × 10⁻⁹). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds. PMID:20932346

  5. A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses.

    PubMed

    Hill, Emmeline W; McGivney, Beatrice A; Gu, Jingjing; Whiston, Ronan; Machugh, David E

    2010-10-11

    Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [P(unadj.) = 6.96 x 10⁻⁶]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (P(unadj.) = 1.61 x 10⁻⁹; P(Bonf.) = 6.58 x 10⁻⁵). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r² = 0.86). Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 x 10⁻¹⁰; BIEC2-417495, P(unadj.) = 1.61 x 10⁻⁹). 
    Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds.
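
    The relationship between the unadjusted and Bonferroni-adjusted P values reported above can be sketched as a multiplication by the number of tests, capped at 1. The sketch assumes the number of tests equals the 40,977 SNPs evaluated, which the abstract does not state explicitly. Under that assumption the adjusted value comes out at about 6.6 × 10⁻⁵, close to but not identical with the reported 6.58 × 10⁻⁵; the difference may reflect rounding of the reported P value or a slightly different test count. The function name is hypothetical.

```python
def bonferroni(p_unadjusted, n_tests):
    """Bonferroni-adjusted P value: multiply by the number of tests, cap at 1."""
    if not 0.0 <= p_unadjusted <= 1.0:
        raise ValueError("P value must lie in [0, 1]")
    if n_tests < 1:
        raise ValueError("number of tests must be at least 1")
    return min(1.0, p_unadjusted * n_tests)


# Unadjusted P for BIEC2-417495 as reported; test count is an assumption.
p_adjusted = bonferroni(1.61e-9, 40977)
print(f"Bonferroni-adjusted P = {p_adjusted:.3g}")
```

    The cap at 1 matters only for weak signals: any unadjusted P value above 1/40,977 would otherwise be pushed past 1 by the multiplication.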

  6. Genome-Wide Analysis of Polymorphisms Associated with Cytokine Responses in Smallpox Vaccine Recipients

    PubMed Central

    Kennedy, Richard B.; Ovsyannikova, Inna G.; Pankratz, V. Shane; Haralambieva, Iana H.; Vierkant, Robert A.; Poland, Gregory A.

    2014-01-01

    The role that genetics plays in response to infection or disease is becoming increasingly clear as we learn more about immunogenetics and host-pathogen interactions. Here we report a genome-wide analysis of the effects of host genetic variation on cytokine responses to vaccinia virus stimulation in smallpox vaccine recipients. Our data show that vaccinia stimulation of immune individuals results in secretion of inflammatory and Th1 cytokines. We identified multiple SNPs significantly associated with variations in cytokine secretion. These SNPs are found in genes with known immune function, as well as in genes encoding for proteins involved in signal transduction, cytoskeleton, membrane channels and ion transport, as well as others with no previously identified connection to immune responses. The large number of significant SNP associations implies that cytokine secretion in response to vaccinia virus is a complex process controlled by multiple genes and gene families. Follow-up studies to replicate these findings and then pursue mechanistic studies will provide a greater understanding of how genetic variation influences vaccine responses. PMID:22610502

  7. Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS.

    PubMed

    Kwon, Ji-Sun; Kim, Jihye; Nam, Dougu; Kim, Sangsoo

    2012-06-01

    Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits for the unimputed and imputed datasets, respectively. Similar trends, though to a lesser degree, were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which are sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended.

  8. Genomic selection and complex trait prediction using a fast EM algorithm applied to genome-wide markers

    PubMed Central

    2010-01-01

    Background The information provided by dense genome-wide markers using high throughput technology is of considerable potential in human disease studies and livestock breeding programs. Genome-wide association studies relate individual single nucleotide polymorphisms (SNP) from dense SNP panels to individual measurements of complex traits, with the underlying assumption being that any association is caused by linkage disequilibrium (LD) between SNP and quantitative trait loci (QTL) affecting the trait. Often SNP are in genomic regions of no trait variation. Whole genome Bayesian models are an effective way of incorporating this and other important prior information into modelling. However a full Bayesian analysis is often not feasible due to the large computational time involved. Results This article proposes an expectation-maximization (EM) algorithm called emBayesB which allows only a proportion of SNP to be in LD with QTL and incorporates prior information about the distribution of SNP effects. The posterior probability of being in LD with at least one QTL is calculated for each SNP along with estimates of the hyperparameters for the mixture prior. A simulated example of genomic selection from an international workshop is used to demonstrate the features of the EM algorithm. The accuracy of prediction is comparable to a full Bayesian analysis but the EM algorithm is considerably faster. The EM algorithm was accurate in locating QTL which explained more than 1% of the total genetic variation. A computational algorithm for very large SNP panels is described. Conclusions emBayesB is a fast and accurate EM algorithm for implementing genomic selection and predicting complex traits by mapping QTL in genome-wide dense SNP marker data. Its accuracy is similar to Bayesian methods but it takes only a fraction of the time. PMID:20969788

  9. Association of HSP70 and its co-chaperones with Alzheimer’s Disease

    PubMed Central

    Broer, Linda; Ikram, Mohammad Arfan; Schuur, Maaike; DeStefano, Anita L.; Bis, Joshua C.; Liu, Fan; Rivadeneira, Fernando; Uitterlinden, Andre G.; Beiser, Alexa S.; Longstreth, William T.; Hofman, Albert; Aulchenko, Yurii; Seshadri, Sudha; Fitzpatrick, Annette L.; Oostra, Ben A.; Breteler, Monique M.B.; van Duijn, Cornelia M.

    2012-01-01

    The heat shock protein (HSP) 70 family has been implicated in the pathology of Alzheimer’s disease (AD). In this study, we examined common genetic variations in the 80 genes encoding HSP70 and its co-chaperones. We conducted a study in a series of 462 patients and 5238 unaffected participants derived from the Rotterdam Study, a population-based study including 7983 persons aged 55 years and older. We genotyped a total of 12,053 Single Nucleotide Polymorphisms (SNPs) using the HumanHap550K Genotyping BeadChip from Illumina. Replication was performed in two independent cohort studies, the Framingham Heart Study (FHS; N=806) and the Cardiovascular Health Study (CHS; N=2150). When adjusting for multiple testing, we found a small but consistent, though not significant, effect of rs12118313 located 32 kb from PFDN2, with an OR of 1.19 (p-value from meta-analysis = 0.003). However, this SNP was in the intron of another gene, suggesting it is unlikely this SNP reflects the effect of PFDN2. In a formal pathway analysis we found nominally significant evidence for an association of BAG, DNAJA and prefoldin with AD. These findings corroborate those of a study of 2032 AD patients and 5328 controls, in which several members of the prefoldin family showed evidence for association with AD. Our study did not reveal evidence for a genetic variant of the HSP70 family with a major effect on AD. However, our findings of the single SNP analysis and pathway analysis suggest that multiple genetic variants in prefoldin are associated with AD. PMID:21403392

  10. Tetra-primer ARMS-PCR identified four pivotal genetic variations in bovine PNPLA3 gene and its expression patterns.

    PubMed

    Wang, Zi-nian; Cai, Han-fang; Li, Ming-xun; Cao, Xiu-kai; Lan, Xian-yong; Lei, Chu-zhao; Chen, Hong

    2016-01-10

    Patatin-like phospholipase domain-containing protein 3 (PNPLA3), a member of the patatin-like phospholipase domain-containing (PNPLA) family, plays an important role in energy balance, fat metabolism regulation, glucose metabolism and fatty liver disease. Tetra-primer amplification refractory mutation system PCR (T-ARMS-PCR) is a new method offering fast detection and extreme simplicity at a negligible cost for SNP genotyping. In this paper, we investigated the genetic variations at different ages of 660 Chinese indigenous cattle belonging to three breeds (QC, NY, JX) and applied T-ARMS-PCR and PCR-RFLP methods to genotype four SNPs, SNP1: g.A2980G, SNP2: g.A2996T, SNP3: g.A36718G, SNP4: g.G36850A. The statistical analyses indicated that these 4 SNPs affected growth traits markedly (P<0.05) in the QC population, whereas combined haplotypes did not (P>0.05). The qPCR (quantitative PCR) indicated that the bovine PNPLA3 gene was exclusively expressed in fat tissues. In addition, the analysis between SNP and mRNA expression revealed that, in SNP1, the expression of AG was much higher than that of AA and GG (P<0.05), which was in accordance with the results of the growth traits association analysis, while the results for SNP4 were not. These results suggest that SNPs of the bovine PNPLA3 gene have high potential as genetic markers in marker-assisted selection (MAS) for Chinese cattle breeding programs. Copyright © 2015 Elsevier B.V. All rights reserved.

  11. A comprehensive profile of DNA copy number variations in a Korean population: identification of copy number invariant regions among Koreans.

    PubMed

    Jeon, Jae Pil; Shim, Sung Mi; Jung, Jong Sun; Nam, Hye Young; Lee, Hye Jin; Oh, Berm Seok; Kim, Kuchan; Kim, Hyung Lae; Han, Bok Ghee

    2009-09-30

    To examine copy number variations among the Korean population, we compared individual genomes with the Korean reference genome assembly using the publicly available Korean HapMap SNP 50 k chip data from 90 individuals. Korean individuals exhibited 123 copy number variation regions (CNVRs) covering 27.2 Mb, equivalent to 1.0% of the genome in the copy number variation (CNV) analysis using the combined criteria of P value (P<0.01) and standard deviation of copy numbers (SD ≥ 0.25) among study subjects. In contrast, when compared to the Affymetrix reference genome assembly from multiple ethnic groups, considerably more CNVRs (n=643) were detected in larger proportions (5.0%) of the genome covering 135.1 Mb even by more stringent criteria (P<0.001 and SD ≥ 0.25), reflecting ethnic diversity of structural variations between Korean and other populations. Some CNVRs were validated by the quantitative multiplex PCR of short fluorescent fragment (QMPSF) method, and then copy number invariant regions were detected among the study subjects. These copy number invariant regions would be used as good internal controls for further CNV studies. Lastly, we demonstrated that the CNV information could stratify even a single ethnic population with a proper reference genome assembly from multiple heterogeneous populations.

  12. Genetic contributions to variation in general cognitive function: a meta-analysis of genome-wide association studies in the CHARGE consortium (N=53 949)

    PubMed Central

    Davies, G; Armstrong, N; Bis, J C; Bressler, J; Chouraki, V; Giddaluru, S; Hofer, E; Ibrahim-Verbaas, C A; Kirin, M; Lahti, J; van der Lee, S J; Le Hellard, S; Liu, T; Marioni, R E; Oldmeadow, C; Postmus, I; Smith, A V; Smith, J A; Thalamuthu, A; Thomson, R; Vitart, V; Wang, J; Yu, L; Zgaga, L; Zhao, W; Boxall, R; Harris, S E; Hill, W D; Liewald, D C; Luciano, M; Adams, H; Ames, D; Amin, N; Amouyel, P; Assareh, A A; Au, R; Becker, J T; Beiser, A; Berr, C; Bertram, L; Boerwinkle, E; Buckley, B M; Campbell, H; Corley, J; De Jager, P L; Dufouil, C; Eriksson, J G; Espeseth, T; Faul, J D; Ford, I; Scotland, Generation; Gottesman, R F; Griswold, M E; Gudnason, V; Harris, T B; Heiss, G; Hofman, A; Holliday, E G; Huffman, J; Kardia, S L R; Kochan, N; Knopman, D S; Kwok, J B; Lambert, J-C; Lee, T; Li, G; Li, S-C; Loitfelder, M; Lopez, O L; Lundervold, A J; Lundqvist, A; Mather, K A; Mirza, S S; Nyberg, L; Oostra, B A; Palotie, A; Papenberg, G; Pattie, A; Petrovic, K; Polasek, O; Psaty, B M; Redmond, P; Reppermund, S; Rotter, J I; Schmidt, H; Schuur, M; Schofield, P W; Scott, R J; Steen, V M; Stott, D J; van Swieten, J C; Taylor, K D; Trollor, J; Trompet, S; Uitterlinden, A G; Weinstein, G; Widen, E; Windham, B G; Jukema, J W; Wright, A F; Wright, M J; Yang, Q; Amieva, H; Attia, J R; Bennett, D A; Brodaty, H; de Craen, A J M; Hayward, C; Ikram, M A; Lindenberger, U; Nilsson, L-G; Porteous, D J; Räikkönen, K; Reinvang, I; Rudan, I; Sachdev, P S; Schmidt, R; Schofield, P R; Srikanth, V; Starr, J M; Turner, S T; Weir, D R; Wilson, J F; van Duijn, C; Launer, L; Fitzpatrick, A L; Seshadri, S; Mosley, T H; Deary, I J

    2015-01-01

    General cognitive function is substantially heritable across the human life course from adolescence to old age. We investigated the genetic contribution to variation in this important, health- and well-being-related trait in middle-aged and older adults. We conducted a meta-analysis of genome-wide association studies of 31 cohorts (N=53 949) in which the participants had undertaken multiple, diverse cognitive tests. A general cognitive function phenotype was tested for, and created in each cohort by principal component analysis. We report 13 genome-wide significant single-nucleotide polymorphism (SNP) associations in three genomic regions, 6q16.1, 14q12 and 19q13.32 (best SNP and closest gene, respectively: rs10457441, P=3.93 × 10^−9, MIR2113; rs17522122, P=2.55 × 10^−8, AKAP6; rs10119, P=5.67 × 10^−9, APOE/TOMM40). We report one gene-based significant association with the HMGN1 gene located on chromosome 21 (P=1 × 10^−6). These genes have previously been associated with neuropsychiatric phenotypes. Meta-analysis results are consistent with a polygenic model of inheritance. To estimate SNP-based heritability, the genome-wide complex trait analysis procedure was applied to two large cohorts, the Atherosclerosis Risk in Communities Study (N=6617) and the Health and Retirement Study (N=5976). The proportion of phenotypic variation accounted for by all genotyped common SNPs was 29% (s.e.=5%) and 28% (s.e.=7%), respectively. Using polygenic prediction analysis, ~1.2% of the variance in general cognitive function was predicted in the Generation Scotland cohort (N=5487; P=1.5 × 10^−17). In hypothesis-driven tests, there was significant association between general cognitive function and four genes previously associated with Alzheimer's disease: TOMM40, APOE, ABCG1 and MEF2C. PMID:25644384

  13. Green way genesis of silver nanoparticles using multiple fruit peels waste and its antimicrobial, anti-oxidant and anti-tumor cell line studies

    NASA Astrophysics Data System (ADS)

    Naganathan, Kiruthika; Thirunavukkarasu, Somanathan

    2017-04-01

    Green synthesis of silver nanoparticles (SNP) opens a new path to treat and prevent various infectious diseases and tumors. In this study, we synthesized silver nanoparticles using multiple fruit peel wastes (pomegranate, orange, banana and apple (POBA)). Nanoparticle formation was first confirmed by a color change. The synthesized SNP were analyzed by various physicochemical techniques, including UV-visible spectroscopy, X-ray diffraction (XRD), Fourier transform infrared (FT-IR) spectroscopy and transmission electron microscopy (TEM). The formation of SNP was confirmed by an absorbance peak at 430 nm in the UV-visible spectrum. Further, the SNP were characterized by XRD and TEM to determine their crystalline nature and the size and shape of the particles, respectively. The activity of the SNP was tested against human pathogens (Salmonella, E. coli and Pseudomonas), a plant pathogen (Fusarium) and a marine pathogen (Aeromonas hydrophila), and their scavenging effect and anticancer properties against MCF-7 cell lines were also studied. This study shows that preparing SNP from fruit peel waste extract appears to be extremely fast, cost-efficient and eco-friendly, and offers an alternative to conventional methods of SNP synthesis that could promote the use of these nanoparticles in medicinal applications.

  14. Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array.

    PubMed

    Hinze, Lori L; Hulse-Kemp, Amanda M; Wilson, Iain W; Zhu, Qian-Hao; Llewellyn, Danny J; Taylor, Jen M; Spriggs, Andrew; Fang, David D; Ulloa, Mauricio; Burke, John J; Giband, Marc; Lacape, Jean-Marc; Van Deynze, Allen; Udall, Joshua A; Scheffler, Jodi A; Hague, Steve; Wendel, Jonathan F; Pepper, Alan E; Frelichowski, James; Lawley, Cindy T; Jones, Don C; Percy, Richard G; Stelly, David M

    2017-02-03

    Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapted to emerging environmental and climate conditions. Accessions and lines have traditionally been characterized based on phenotypes, but phenotypic profiles are limited by the cost, time, and space required to make visual observations and measurements. With advances in molecular genetic methods, genotypic profiles are increasingly able to identify differences among accessions due to the larger number of genetic markers that can be measured. A combination of both methods would greatly enhance our ability to characterize germplasm resources. Recent efforts have culminated in the identification of sufficient SNP markers to establish high-throughput genotyping systems, such as the CottonSNP63K array, which enables a researcher to efficiently analyze large numbers of SNP markers and obtain highly repeatable results. In the current investigation, we have utilized the SNP array for analyzing genetic diversity primarily among cotton cultivars, making comparisons to SSR-based phylogenetic analyses, and identifying loci associated with seed nutritional traits. The SNP markers distinctly separated G. hirsutum from other Gossypium species and distinguished the wild from cultivated types of G. hirsutum. The markers also efficiently discerned differences among cultivars, which was the primary goal when designing the CottonSNP63K array. Population structure within the genus compared favorably with previous results obtained using SSR markers, and an association study identified loci linked to factors that affect cottonseed protein content. Our results provide a large genome-wide variation data set for primarily cultivated cotton. Thousands of SNPs in representative cotton genotypes provide an opportunity to finely discriminate among cultivated cotton from around the world. The SNPs will be relevant as dense markers of genome variation for association mapping approaches aimed at correlating molecular polymorphisms with variation in phenotypic traits, as well as for molecular breeding approaches in cotton.

  15. Effect prediction of identified SNPs linked to fruit quality and chilling injury in peach [Prunus persica (L.) Batsch].

    PubMed

    Martínez-García, Pedro J; Fresnedo-Ramírez, Jonathan; Parfitt, Dan E; Gradziel, Thomas M; Crisosto, Carlos H

    2013-01-01

    Single nucleotide polymorphisms (SNPs) are a fundamental source of genomic variation. Large SNP panels have been developed for Prunus species. Fruit quality traits are essential peach breeding program objectives since they determine consumer acceptance, fruit consumption, industry trends and cultivar adoption. For many cultivars, these traits are negatively impacted by cold storage, used to extend fruit market life. The major symptoms of chilling injury are lack of flavor, off flavor, mealiness, flesh browning, and flesh bleeding. A set of 1,109 SNPs was mapped previously and 67 were linked with these complex traits. The prediction of the effects associated with these SNPs on downstream products from the 'peach v1.0' genome sequence was carried out. A total of 2,163 effects were detected, 282 effects (non-synonymous, synonymous or stop codon gained) were located in exonic regions (13.04 %) and 294 placed in intronic regions (13.59 %). An extended list of genes and proteins that could be related to these traits was developed. Two SNP markers that explain a high percentage of the observed phenotypic variance, UCD_SNP_1084 and UCD_SNP_46, are associated with zinc finger (C3HC4-type RING finger) family protein and AOX1A (alternative oxidase 1a) protein groups, respectively. In addition, phenotypic variation suggests that the observed polymorphism for SNP UCD_SNP_1084 [A/G] mutation could be a candidate quantitative trait nucleotide affecting quantitative trait loci for mealiness. The interaction and expression of affected proteins could explain the variation observed in each individual and facilitate understanding of gene regulatory networks for fruit quality traits in peach.
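
    The percentages quoted in this abstract can be checked directly from the reported counts. The short sketch below only reproduces that arithmetic; the helper name `percent` is illustrative and not part of the study.

```python
# Counts as reported in the abstract.
total_effects = 2163   # all predicted effects
exonic = 282           # effects located in exonic regions
intronic = 294         # effects located in intronic regions


def percent(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


exonic_pct = percent(exonic, total_effects)      # reported as 13.04 %
intronic_pct = percent(intronic, total_effects)  # reported as 13.59 %
```

    Both values agree with the figures given in the abstract, which confirms that the percentages are expressed relative to all 2,163 predicted effects.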

  16. HLA Type Inference via Haplotypes Identical by Descent

    NASA Astrophysics Data System (ADS)

    Setty, Manu N.; Gusev, Alexander; Pe'Er, Itsik

    The Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self antigens from non-self ones. HLA genes are hypervariable, with nearly every locus harboring over a dozen alleles. This variation plays an important role in susceptibility to multiple autoimmune diseases, and HLA types need to be matched for organ transplantation. Unfortunately, HLA typing by serological methods is time-consuming and expensive compared to high-throughput Single Nucleotide Polymorphism (SNP) data. We present a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from SNP genotype data. IBD information is modeled as a graph in which shared haplotypes are explored among clusters of individuals with known and unknown HLA types to identify the latter. We analyze the performance of the method in a previously typed subset of the HapMap population, achieving accuracy of 96% in HLA-A, 94% in HLA-B, 95% in HLA-C, 77% in HLA-DR1, 93% in HLA-DQA1 and 90% in HLA-DQB1 genes. We compare our method to a tag SNP based approach and demonstrate higher sensitivity and specificity. Our method demonstrates the power of using shared haplotype segments for large-scale imputation at the HLA locus.

  17. Genomic Heritability of Beef Cattle Growth

    USDA-ARS's Scientific Manuscript database

    Calf weights were examined to determine association between high-density SNP genotypes and growth, in order to estimate additive genetic variation explained by SNP. Data taken from Cycle VII of the U.S. Meat Animal Research Center Germplasm Evaluation Project included birth weight (BWT), 205-d adju...

  18. Identification of an interaction between VWF rs7965413 and platelet count as a novel risk marker for metabolic syndrome: an extensive search of candidate polymorphisms in a case-control study.

    PubMed

    Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki

    2015-01-01

    Although many single nucleotide polymorphisms (SNPs) have been identified as associated with metabolic syndrome (MetS), the simple addition of SNPs to clinical risk markers has only slightly improved the ability to predict future MetS. To improve this ability, combinational effects, such as SNP-SNP interaction, SNP-environment interaction, and SNP-clinical parameter (SNP × CP) interaction, should also be considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs in 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these SNPs, five were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS.
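
    To make the role of an interaction term concrete: in a logistic model with log-odds b0 + b1·SNP + b2·CP + b3·(SNP × CP), the per-allele odds ratio at a clinical-parameter value c is exp(b1 + b3·c), that is, OR_SNP × OR_interaction^c. The sketch below evaluates this with the two odds ratios reported in the abstract. The coding and scale of the platelet-count variable are not stated in the abstract, so the CP values used here (0 and ±1 in some arbitrary unit) are assumptions for illustration only.

```python
import math


def per_allele_odds_ratio(or_snp, or_interaction, cp_value):
    """Per-allele OR at a given CP value: exp(b1 + b3 * cp_value).

    b1 = ln(or_snp) and b3 = ln(or_interaction) are the logistic
    regression coefficients of the SNP and SNP x CP terms.
    """
    b1 = math.log(or_snp)
    b3 = math.log(or_interaction)
    return math.exp(b1 + b3 * cp_value)


OR_SNP = 0.78          # SNP term, as reported
OR_INTERACTION = 1.33  # SNP x CP interaction term, as reported

# Hypothetical CP values; the unit of CP is an assumption, not from the study.
at_reference = per_allele_odds_ratio(OR_SNP, OR_INTERACTION, 0.0)
one_unit_up = per_allele_odds_ratio(OR_SNP, OR_INTERACTION, 1.0)
one_unit_down = per_allele_odds_ratio(OR_SNP, OR_INTERACTION, -1.0)
```

    Under these assumptions the allele's apparent effect changes with CP: the odds ratio equals 0.78 at the reference CP value, rises towards 1 one unit above it, and falls further below 1 one unit below it.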

  19. Detection of genetic variation affecting milk coagulation properties in Danish Holstein dairy cattle by analyses of pooled whole-genome sequences from phenotypically extreme samples (pool-seq).

    PubMed

    Bertelsen, H P; Gregersen, V R; Poulsen, N; Nielsen, R O; Das, A; Madsen, L B; Buitenhuis, A J; Holm, L-E; Panitz, F; Larsen, L B; Bendixen, C

    2016-04-01

    Rennet-induced milk coagulation is an important trait for cheese production. Recent studies have reported an alarming frequency of cows producing poorly coagulating milk unsuitable for cheese production. Several genetic factors are known to affect milk coagulation, including variation in the major milk proteins; however, recent association studies indicate genetic effects from other genomic regions as well. The aim of this study was to detect genetic variation affecting milk coagulation properties, measured as curd-firming rate (CFR) and milk pH. This was achieved by examining allele frequency differences between pooled whole-genome sequences of phenotypically extreme samples (pool-seq). Curd-firming rate and raw milk pH were measured for 415 Danish Holstein cows, and each animal was sequenced at low coverage. Pools were created containing whole genome sequence reads from samples with "extreme" values (high or low) for both phenotypic traits. A total of 6,992,186 and 5,295,501 SNP were assessed in relation to CFR and milk pH, respectively. Allele frequency differences were calculated between pools and 32 significantly different SNP were detected, 1 for milk pH and 31 for CFR, of which 19 are located on chromosome 6. A total of 9 significant SNP, which were selected based on the possible function of proximal candidate genes, were genotyped in the entire sample set (n = 415) to test for an association. The most significant SNP was located proximal to CSN3, explaining 33% of the phenotypic variance. CSN3, coding for κ-casein, is the most studied gene in relation to milk coagulation due to the position of κ-casein on the surface of the casein micelles and its direct involvement in milk coagulation. Three additional SNP located on chromosome 6 showed significant associations explaining 7, 3.6, and 1.3% of the phenotypic variance of CFR. The significant SNP on chromosome 6 were shown to be in linkage disequilibrium with the SNP peaking proximal to CSN3; however, after accounting for the genotype of the peak SNP within this QTL, significant effects (P-value < 0.1) could still be detected for 2 of the SNP accounting for 2 and 1% of the phenotypic variance. These 2 SNP were located within introns of or proximal to the candidate genes solute carrier family 4 (sodium bicarbonate cotransporter), member 4 (SLC4A4) and LIM and calponin homology domains 1 (LIMCH1), respectively, making them interesting targets for further analysis.
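
    The core quantity in a pool-seq comparison of this kind is the difference in allele frequency between the two phenotypically extreme pools, estimated from read counts at each SNP. The sketch below shows that calculation only; the read counts are invented, the function names are hypothetical, and the significance test actually used in the study is not described in the abstract and is therefore not reproduced here.

```python
def alt_allele_frequency(ref_reads, alt_reads):
    """Estimate the alternative-allele frequency in a pool from read counts."""
    depth = ref_reads + alt_reads
    if depth == 0:
        raise ValueError("no reads cover this site")
    return alt_reads / depth


def frequency_difference(high_pool, low_pool):
    """Alt-allele frequency in the 'high' pool minus that in the 'low' pool.

    Each pool is a (ref_reads, alt_reads) tuple for one SNP.
    """
    return alt_allele_frequency(*high_pool) - alt_allele_frequency(*low_pool)


# Invented read counts at a single SNP, for illustration only.
high_pool = (30, 70)   # 70 of 100 reads carry the alternative allele
low_pool = (80, 20)    # 20 of 100 reads carry the alternative allele

delta = frequency_difference(high_pool, low_pool)  # 0.7 - 0.2
```

    A large absolute difference flags a SNP as a candidate; in practice the estimate is only as reliable as the read depth in each pool, which is why depth is checked before dividing.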

  20. DMET-analyzer: automatic analysis of Affymetrix DMET data.

    PubMed

    Guzzi, Pietro Hiram; Agapito, Giuseppe; Di Martino, Maria Teresa; Arbitrio, Mariamena; Tassone, Pierfrancesco; Tagliaferri, Pierosandro; Cannataro, Mario

    2012-10-05

    Clinical bioinformatics is a growing field based on the integration of clinical and omics data, with the aim of developing personalized medicine. The introduction of novel technologies able to investigate the relationship between clinical states and biological machinery may therefore help the development of this field. For instance, the Affymetrix DMET platform (drug metabolism enzymes and transporters) can study the relationship between variation in patients' genomes and drug metabolism by detecting SNPs (single nucleotide polymorphisms) in genes related to drug metabolism. This may make it possible, for instance, to find genetic variants in patients who present different drug responses, in pharmacogenomics and clinical studies. Despite this, there is currently a lack of open-source algorithms and tools for the analysis of DMET data. Existing software tools for DMET data generally allow only the preprocessing of binary data (e.g. the DMET-Console provided by Affymetrix) and simple data analysis operations, but do not allow testing the association between the presence of SNPs and the response to drugs. We developed DMET-Analyzer, a tool for the automatic association analysis between variation in patient genomes and the clinical conditions of patients, i.e. their different responses to drugs. The proposed system allows: (i) automation of the workflow for analysing DMET-SNP data, avoiding the use of multiple tools; (ii) automatic annotation of DMET-SNP data and searching of existing SNP databases (e.g. dbSNP); and (iii) association of SNPs with pathways through a search in PharmGKB, a major knowledge base for pharmacogenomic studies. DMET-Analyzer has a simple graphical user interface that allows users (doctors/biologists) to upload and analyse DMET files produced by Affymetrix DMET-Console in an interactive way. The effectiveness and ease of use of DMET-Analyzer are demonstrated through different case studies regarding the analysis of clinical datasets produced in the University Hospital of Catanzaro, Italy. DMET-Analyzer is a novel tool able to automatically analyse data produced by the DMET platform in case-control association studies. Using such a tool, users may avoid the manual execution of multiple statistical tests, avoiding possible errors and reducing the time needed for a whole experiment. Moreover, annotations and direct links to external databases may increase the biological knowledge extracted. The system is freely available for academic purposes at: https://sourceforge.net/projects/dmetanalyzer/files/

  1. Mining conifers’ mega-genome using rapid and efficient multiplexed high-throughput genotyping-by-sequencing (GBS) SNP discovery platform

    USDA-ARS's Scientific Manuscript database

    Next-generation sequencing (NGS) technologies are revolutionizing both medical and biological research through generation of massive SNP data sets for identifying heritable genome variation underlying key traits, from rare human diseases to important agronomic phenotypes in crop species. We evaluate...

  2. Partial-genome evaluation of postweaning feed intake and efficiency of crossbred beef cattle

    USDA-ARS's Scientific Manuscript database

    Effects of individual single nucleotide polymorphisms (SNP), and variation explained by sets of SNP associated with dry matter intake (DMI), metabolic mid-test weight (MBW), BW gain (GN) and feed efficiency expressed as phenotypic and genetic residual feed intake (RFIp; RFIg) were estimated from wei...

  3. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    PubMed Central

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718

  4. Comparison of SSR and SNP Markers in Estimation of Genetic Diversity and Population Structure of Indian Rice Varieties

    PubMed Central

    Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R. K.; Singh, N. K.; Singh, Rakesh

    2013-01-01

    Simple sequence repeat (SSR) and single nucleotide polymorphism (SNP) markers, the two most robust marker types for identifying rice varieties, were compared for assessment of genetic diversity and population structure. A total of 375 varieties of rice from various regions of India, archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using 36 markers each of hypervariable SSR (HvSSR) and SNP, distributed across the 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers, with an average of 2.22 alleles per locus, whereas 72 alleles were amplified with the SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; using an unrooted tree, all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individuals but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation, whereas with SNP markers the varieties fell into three broad groups across the two axes with 45.20% of cumulative variation. Population structure was tested using K values from 1 to 20, but there was no clear population structure; therefore, the Ln(PD)-derived Δk was plotted against K to determine the number of populations. For SSR the maximum Δk was at K=5, whereas for SNP the maximum Δk was at K=15, suggesting that resolution of population structure was higher with SNP markers, but SSR markers were more efficient for diversity analysis. PMID:24367635
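
    The abstract does not state how PIC was computed. A common definition (Botstein et al., 1980) is PIC = 1 − Σ p_i² − Σ_{i<j} 2 p_i² p_j², where p_i are the allele frequencies at a locus. The sketch below assumes that definition; the function name and example frequencies are illustrative. Under this formula a biallelic marker cannot exceed 0.375, which is consistent with the SNP range reported above (maximum 0.37), while multi-allelic SSR loci can reach higher values.

```python
from itertools import combinations


def pic(allele_freqs):
    """Polymorphic information content, assuming the Botstein et al. formula.

    allele_freqs: frequencies of all alleles at one locus (must sum to 1).
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    sum_sq = sum(p * p for p in allele_freqs)
    cross = sum(2 * (p * p) * (q * q)
                for p, q in combinations(allele_freqs, 2))
    return 1.0 - sum_sq - cross


# Biallelic SNP with equal allele frequencies: the maximum for two alleles.
snp_max = pic([0.5, 0.5])

# Hypothetical three-allele SSR locus with equal frequencies.
ssr_three_alleles = pic([1 / 3, 1 / 3, 1 / 3])

# Average alleles per SSR locus, from the counts reported in the abstract.
alleles_per_ssr_locus = round(80 / 36, 2)
```

    The last line reproduces the reported average of 2.22 alleles per SSR locus from 80 alleles over 36 markers; 72 alleles over 36 SNP markers corresponds to exactly two alleles per SNP.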

  5. Hormone-Related Pathways and Risk of Breast Cancer Subtypes in African American Women

    PubMed Central

    Haddad, Stephen A.; Lunetta, Kathryn L.; Ruiz-Narváez, Edward A.; Bensen, Jeannette T.; Hong, Chi-Chen; Sucheston-Campbell, Lara E.; Yao, Song; Bandera, Elisa V.; Rosenberg, Lynn; Haiman, Christopher A.; Troester, Melissa A.; Ambrosone, Christine B.; Palmer, Julie R.

    2016-01-01

    Purpose We sought to investigate genetic variation in hormone pathways in relation to risk of overall and subtype-specific breast cancer in women of African ancestry (AA). Methods Genotyping and imputation yielded data on 143,934 SNPs in 308 hormone-related genes for 3663 breast cancer cases (1098 ER-, 1983 ER+, 582 ER unknown) and 4687 controls from the African American Breast Cancer Epidemiology and Risk (AMBER) Consortium. AMBER includes data from four large studies of AA women: the Carolina Breast Cancer Study, the Women's Circle of Health Study, the Black Women's Health Study, and the Multiethnic Cohort Study. Pathway- and gene-based analyses were conducted, and single SNP tests were run for the top genes. Results There were no strong associations at the pathway level. The most significantly associated genes were GHRH, CALM2, CETP, and AKR1C1 for overall breast cancer (gene-based nominal p ≤ 0.01); NR0B1, IGF2R, CALM2, CYP1B1, and GRB2 for ER+ breast cancer (p ≤ 0.02); and PGR, MAPK3, MAP3K1, and LHCGR for ER- disease (p ≤ 0.02). Single-SNP tests for SNPs with pairwise linkage disequilibrium r² < 0.8 in the top genes identified 12 common SNPs (in CALM2, CETP, NR0B1, IGF2R, CYP1B1, PGR, MAPK3, and MAP3K1) associated with overall or subtype-specific breast cancer after gene-level correction for multiple testing. Rs11571215 in PGR (progesterone receptor) was the SNP most strongly associated with ER- disease. Conclusion We identified eight genes in hormone pathways that contain common variants associated with breast cancer in AA women after gene-level correction for multiple testing. PMID:26458823

  6. Genome-wide association mapping reveals a rich genetic architecture of stripe rust resistance loci in emmer wheat (Triticum turgidum ssp. dicoccum).

    PubMed

    Liu, Weizhen; Maccaferri, Marco; Chen, Xianming; Laghetti, Gaetano; Pignone, Domenico; Pumphrey, Michael; Tuberosa, Roberto

    2017-11-01

    SNP-based genome scanning in worldwide domesticated emmer germplasm showed high genetic diversity, rapid linkage disequilibrium decay and 51 loci for stripe rust resistance, a large proportion of which were novel. Cultivated emmer wheat (Triticum turgidum ssp. dicoccum), one of the oldest domesticated crops in the world, is a potentially rich reservoir of variation for improvement of resistance/tolerance to biotic and abiotic stresses in wheat. Resistance to stripe rust (Puccinia striiformis f. sp. tritici) in emmer wheat has been under-investigated. Here, we employed genome-wide association (GWAS) mapping with a mixed linear model to dissect effective stripe rust resistance loci in a worldwide collection of 176 cultivated emmer wheat accessions. Adult plants were tested in six environments and seedlings were evaluated with five races from the United States and one from Italy under greenhouse conditions. Five accessions were resistant across all experiments. The panel was genotyped with the wheat 90,000 Illumina iSelect single nucleotide polymorphism (SNP) array and 5106 polymorphic SNP markers with mapped positions were obtained. A high level of genetic diversity and fast linkage disequilibrium decay were observed. In total, we identified 14 loci associated with field resistance in multiple environments. Thirty-seven loci were significantly associated with all-stage (seedling) resistance and six of them were effective against multiple races. Of the 51 total loci, 29 were mapped distantly from previously reported stripe rust resistance genes or quantitative trait loci and represent newly discovered resistance loci. Our results suggest that GWAS is an effective method for characterizing genes in cultivated emmer wheat and confirm that emmer wheat is a rich source of stripe rust resistance loci that can be used for wheat improvement.

  7. Genes-environment interactions in obesity- and diabetes-associated pancreatic cancer: a GWAS data analysis.

    PubMed

    Tang, Hongwei; Wei, Peng; Duell, Eric J; Risch, Harvey A; Olson, Sara H; Bueno-de-Mesquita, H Bas; Gallinger, Steven; Holly, Elizabeth A; Petersen, Gloria M; Bracci, Paige M; McWilliams, Robert R; Jenab, Mazda; Riboli, Elio; Tjønneland, Anne; Boutron-Ruault, Marie Christine; Kaaks, Rudolf; Trichopoulos, Dimitrios; Panico, Salvatore; Sund, Malin; Peeters, Petra H M; Khaw, Kay-Tee; Amos, Christopher I; Li, Donghui

    2014-01-01

    Obesity and diabetes are potentially alterable risk factors for pancreatic cancer. Genetic factors that modify the associations of obesity and diabetes with pancreatic cancer have previously not been examined at the genome-wide level. Using genome-wide association studies (GWAS) genotype and risk factor data from the Pancreatic Cancer Case Control Consortium, we conducted a discovery study of 2,028 cases and 2,109 controls to examine gene-obesity and gene-diabetes interactions in relation to pancreatic cancer risk by using the likelihood-ratio test nested in logistic regression models and Ingenuity Pathway Analysis (IPA). After adjusting for multiple comparisons, a significant interaction of the chemokine signaling pathway with obesity (P = 3.29 × 10^-6) and a near significant interaction of calcium signaling pathway with diabetes (P = 1.57 × 10^-4) in modifying the risk of pancreatic cancer were observed. These findings were supported by results from IPA analysis of the top genes with nominal interactions. The major contributing genes to the two top pathways include GNGT2, RELA, TIAM1, and GNAS. None of the individual genes or single-nucleotide polymorphisms (SNPs) except one SNP remained significant after adjusting for multiple testing. Notably, SNP rs10818684 of the PTGS1 gene showed an interaction with diabetes (P = 7.91 × 10^-7) at a false discovery rate of 6%. Genetic variations in inflammatory response and insulin resistance may affect the risk of obesity- and diabetes-related pancreatic cancer. These observations should be replicated in additional large datasets. A gene-environment interaction analysis may provide new insights into the genetic susceptibility and molecular mechanisms of obesity- and diabetes-related pancreatic cancer.

  8. A novel variational Bayes multiple locus Z-statistic for genome-wide association studies with Bayesian model averaging

    PubMed Central

    Logsdon, Benjamin A.; Carty, Cara L.; Reiner, Alexander P.; Dai, James Y.; Kooperberg, Charles

    2012-01-01

    Motivation: For many complex traits, including height, the majority of variants identified by genome-wide association studies (GWAS) have small effects, leaving a significant proportion of the heritable variation unexplained. Although many penalized multiple regression methodologies have been proposed to increase the power to detect associations for complex genetic architectures, they generally lack mechanisms for false-positive control and diagnostics for model over-fitting. Our methodology is the first penalized multiple regression approach that explicitly controls Type I error rates and provides model over-fitting diagnostics through a novel normally distributed statistic defined for every marker within the GWAS, based on results from a variational Bayes spike regression algorithm. Results: We compare the performance of our method to the lasso and single marker analysis on simulated data and demonstrate that our approach has superior performance in terms of power and Type I error control. In addition, using the Women's Health Initiative (WHI) SNP Health Association Resource (SHARe) GWAS of African-Americans, we show that our method has power to detect additional novel associations with body height. These findings replicate by reaching a stringent cutoff of marginal association in a larger cohort. Availability: An R-package, including an implementation of our variational Bayes spike regression (vBsr) algorithm, is available at http://kooperberg.fhcrc.org/soft.html. Contact: blogsdon@fhcrc.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:22563072

  9. G-Protein Genomic Association With Normal Variation in Gray Matter Density

    PubMed Central

    Chen, Jiayu; Calhoun, Vince D.; Arias-Vasquez, Alejandro; Zwiers, Marcel P.; van Hulzen, Kimm; Fernández, Guillén; Fisher, Simon E.; Franke, Barbara; Turner, Jessica A.; Liu, Jingyu

    2017-01-01

    While detecting genetic variations underlying brain structures helps reveal mechanisms of neural disorders, high data dimensionality poses a major challenge for imaging genomic association studies. In this work, we present the application of a recently proposed approach, parallel independent component analysis with reference (pICA-R), to investigate genomic factors potentially regulating gray matter variation in a healthy population. This approach simultaneously assesses many variables for an aggregate effect and helps to elicit particular features in the data. We applied pICA-R to analyze gray matter density (GMD) images (274,131 voxels) in conjunction with single nucleotide polymorphism (SNP) data (666,019 markers) collected from 1,256 healthy individuals of the Brain Imaging Genetics (BIG) study. Guided by a genetic reference derived from the gene GNA14, pICA-R identified a significant SNP-GMD association (r = −0.16, P = 2.34 × 10^-8), implying that subjects with specific genotypes have lower localized GMD. The identified components were then projected to an independent dataset from the Mind Clinical Imaging Consortium (MCIC) including 89 healthy individuals, and the obtained loadings again yielded a significant SNP-GMD association (r = −0.25, P = 0.02). The imaging component reflected GMD variations in frontal, precuneus, and cingulate regions. The SNP component was enriched in genes with neuronal functions, including synaptic plasticity, axon guidance, molecular signal transduction via PKA and CREB, highlighting the GRM1, PRKCH, GNA12, and CAMK2B genes. Collectively, our findings suggest that GNA12 and GNA14 play a key role in the genetic architecture underlying normal GMD variation in frontal and parietal regions. PMID:26248772

  10. SNPConvert: SNP Array Standardization and Integration in Livestock Species.

    PubMed

    Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra

    2016-06-09

    One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, in contrast to whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present the SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github.com/nicolazzie/SNPConvert.git.

  11. Common Genetic Variation In Cellular Transport Genes and Epithelial Ovarian Cancer (EOC) Risk.

    PubMed

    Chornokur, Ganna; Lin, Hui-Yi; Tyrer, Jonathan P; Lawrenson, Kate; Dennis, Joe; Amankwah, Ernest K; Qu, Xiaotao; Tsai, Ya-Yu; Jim, Heather S L; Chen, Zhihua; Chen, Ann Y; Permuth-Wey, Jennifer; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Bruinsma, Fiona; Bandera, Elisa V; Bean, Yukie T; Beckmann, Matthias W; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A; Brooks-Wilson, Angela; Bunker, Clareann H; Butzow, Ralf; Campbell, Ian G; Carty, Karen; Chang-Claude, Jenny; Cook, Linda S; Cramer, Daniel W; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; du Bois, Andreas; Despierre, Evelyn; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; Dürst, Matthias; Easton, Douglas F; Eccles, Diana M; Edwards, Robert P; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goodman, Marc T; Gronwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Claus K; Hogdall, Estrid; Hosono, Satoyo; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kelemen, Linda E; Kellar, Mellissa; Kiemeney, Lambertus A; Krakstad, Camilla; Kjaer, Susanne K; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lim, Boon Kiong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F A G; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; McNeish, Iain; Menon, Usha; Milne, Roger L; Modugno, Francesmary; Moysich, Kirsten B; Ness, Roberta B; Nevanlinna, Heli; Eilber, Ursula; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste L; Pejovic, Tanja; Pelttari, Liisa M; Pike, Malcolm C; Poole, Elizabeth M; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; 
Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schernhammer, Eva; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Spiewankiewicz, Beata; Sucheston, Lara; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Thomsen, Lotte; Tangen, Ingvild L; Tworoger, Shelley S; van Altena, Anne M; Vierkant, Robert A; Vergote, Ignace; Walsh, Christine S; Wang-Gohrke, Shan; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Wu, Anna H; Wu, Xifeng; Woo, Yin-Ling; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Hasmad, Hanis N; Berchuck, Andrew; Iversen, Edwin S; Schildkraut, Joellen M; Ramus, Susan J; Goode, Ellen L; Monteiro, Alvaro N A; Gayther, Simon A; Narod, Steven A; Pharoah, Paul D P; Sellers, Thomas A; Phelan, Catherine M

    2015-01-01

    Defective cellular transport processes can lead to aberrant accumulation of trace elements, iron, small molecules and hormones in the cell, which in turn may promote the formation of reactive oxygen species, promoting DNA damage and aberrant expression of key regulatory cancer genes. As DNA damage and uncontrolled proliferation are hallmarks of cancer, including epithelial ovarian cancer (EOC), we hypothesized that inherited variation in the cellular transport genes contributes to EOC risk. In total, DNA samples were obtained from 14,525 case subjects with invasive EOC and from 23,447 controls from 43 sites in the Ovarian Cancer Association Consortium (OCAC). Two hundred seventy nine SNPs, representing 131 genes, were genotyped using an Illumina Infinium iSelect BeadChip as part of the Collaborative Oncological Gene-environment Study (COGS). SNP analyses were conducted using unconditional logistic regression under a log-additive model, and the FDR q<0.2 was applied to adjust for multiple comparisons. The most significant evidence of an association for all invasive cancers combined and for the serous subtype was observed for SNP rs17216603 in the iron transporter gene HEPH (invasive: OR = 0.85, P = 0.00026; serous: OR = 0.81, P = 0.00020); this SNP was also associated with the borderline/low malignant potential (LMP) tumors (P = 0.021). Other genes significantly associated with EOC histological subtypes (p<0.05) included the UGT1A (endometrioid), SLC25A45 (mucinous), SLC39A11 (low malignant potential), and SERPINA7 (clear cell carcinoma). In addition, 1785 SNPs in six genes (HEPH, MGST1, SERPINA, SLC25A45, SLC39A11 and UGT1A) were imputed from the 1000 Genomes Project and examined for association with INV EOC in white-European subjects. The most significant imputed SNP was rs117729793 in SLC39A11 (per allele, OR = 2.55, 95% CI = 1.5-4.35, p = 5.66 × 10^-4). These results, generated on a large cohort of women, revealed associations between inherited cellular transport gene variants and risk of EOC histologic subtypes.
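
    As a rough consistency check on the imputed-SNP result quoted above, the p-value can be approximately recovered from the odds ratio and its confidence interval. The sketch below assumes the 95% CI is a Wald interval that is symmetric on the log-odds scale; the abstract does not say how the interval was computed, so this is an illustration rather than the authors' procedure.

```python
import math

# Values reported in the abstract for rs117729793 (per allele).
odds_ratio = 2.55
ci_low, ci_high = 1.5, 4.35

# Assumption: the 95% CI is a Wald interval, symmetric on the log-odds scale,
# so its width equals 2 * 1.96 standard errors of log(OR).
se_log_or = (math.log(ci_high) - math.log(ci_low)) / (2 * 1.96)
z = math.log(odds_ratio) / se_log_or

# Two-sided p-value from the standard normal distribution.
p_value = math.erfc(abs(z) / math.sqrt(2))

print(f"SE(log OR) = {se_log_or:.3f}, z = {z:.2f}, p = {p_value:.2e}")
```

    Under that assumption the recovered p-value lands close to the reported 5.66 × 10^-4; small differences are expected because the published OR and CI are rounded.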

  12. Common Genetic Variation In Cellular Transport Genes and Epithelial Ovarian Cancer (EOC) Risk

    PubMed Central

    Chornokur, Ganna; Lin, Hui-Yi; Tyrer, Jonathan P.; Lawrenson, Kate; Dennis, Joe; Amankwah, Ernest K.; Qu, Xiaotao; Tsai, Ya-Yu; Jim, Heather S. L.; Chen, Zhihua; Chen, Ann Y.; Permuth-Wey, Jennifer; Aben, Katja KH.; Anton-Culver, Hoda; Antonenkova, Natalia; Bruinsma, Fiona; Bandera, Elisa V.; Bean, Yukie T.; Beckmann, Matthias W.; Bisogna, Maria; Bjorge, Line; Bogdanova, Natalia; Brinton, Louise A.; Brooks-Wilson, Angela; Bunker, Clareann H.; Butzow, Ralf; Campbell, Ian G.; Carty, Karen; Chang-Claude, Jenny; Cook, Linda S.; Cramer, Daniel W.; Cunningham, Julie M.; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; du Bois, Andreas; Despierre, Evelyn; Dicks, Ed; Doherty, Jennifer A.; Dörk, Thilo; Dürst, Matthias; Easton, Douglas F.; Eccles, Diana M.; Edwards, Robert P.; Ekici, Arif B.; Fasching, Peter A.; Fridley, Brooke L.; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G.; Glasspool, Rosalind; Goodman, Marc T.; Gronwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A. T.; Hillemanns, Peter; Hogdall, Claus K.; Hogdall, Estrid; Hosono, Satoyo; Jakubowska, Anna; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y.; Kelemen, Linda E.; Kellar, Mellissa; Kiemeney, Lambertus A.; Krakstad, Camilla; Kjaer, Susanne K.; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D.; Lee, Alice W.; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A.; Liang, Dong; Lim, Boon Kiong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon F. A. G.; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R.; McNeish, Iain; Menon, Usha; Milne, Roger L.; Modugno, Francesmary; Moysich, Kirsten B.; Ness, Roberta B.; Nevanlinna, Heli; Eilber, Ursula; Odunsi, Kunle; Olson, Sara H.; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Paul, James; Pearce, Celeste L.; Pejovic, Tanja; Pelttari, Liisa M.; Pike, Malcolm C.; Poole, Elizabeth M.; Risch, Harvey A.; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H.; Rudolph, Anja; Runnebaum, Ingo B.; Rzepecka, Iwona K.; Salvesen, Helga B.; Schernhammer, Eva; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B.; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C.; Spiewankiewicz, Beata; Sucheston, Lara; Teo, Soo-Hwang; Terry, Kathryn L.; Thompson, Pamela J.; Thomsen, Lotte; Tangen, Ingvild L.; Tworoger, Shelley S.; van Altena, Anne M.; Vierkant, Robert A.; Vergote, Ignace; Walsh, Christine S.; Wang-Gohrke, Shan; Wentzensen, Nicolas; Whittemore, Alice S.; Wicklund, Kristine G.; Wilkens, Lynne R.; Wu, Anna H.; Wu, Xifeng; Woo, Yin-Ling; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Hasmad, Hanis N.; Berchuck, Andrew; Iversen, Edwin S.; Schildkraut, Joellen M.; Ramus, Susan J.; Goode, Ellen L.; Monteiro, Alvaro N. A.; Gayther, Simon A.; Narod, Steven A.; Pharoah, Paul D. P.; Sellers, Thomas A.; Phelan, Catherine M.

    2015-01-01

    Background Defective cellular transport processes can lead to aberrant accumulation of trace elements, iron, small molecules and hormones in the cell, which in turn may promote the formation of reactive oxygen species, promoting DNA damage and aberrant expression of key regulatory cancer genes. As DNA damage and uncontrolled proliferation are hallmarks of cancer, including epithelial ovarian cancer (EOC), we hypothesized that inherited variation in the cellular transport genes contributes to EOC risk. Methods In total, DNA samples were obtained from 14,525 case subjects with invasive EOC and from 23,447 controls from 43 sites in the Ovarian Cancer Association Consortium (OCAC). Two hundred seventy nine SNPs, representing 131 genes, were genotyped using an Illumina Infinium iSelect BeadChip as part of the Collaborative Oncological Gene-environment Study (COGS). SNP analyses were conducted using unconditional logistic regression under a log-additive model, and the FDR q<0.2 was applied to adjust for multiple comparisons. Results The most significant evidence of an association for all invasive cancers combined and for the serous subtype was observed for SNP rs17216603 in the iron transporter gene HEPH (invasive: OR = 0.85, P = 0.00026; serous: OR = 0.81, P = 0.00020); this SNP was also associated with the borderline/low malignant potential (LMP) tumors (P = 0.021). Other genes significantly associated with EOC histological subtypes (p<0.05) included the UGT1A (endometrioid), SLC25A45 (mucinous), SLC39A11 (low malignant potential), and SERPINA7 (clear cell carcinoma). In addition, 1785 SNPs in six genes (HEPH, MGST1, SERPINA, SLC25A45, SLC39A11 and UGT1A) were imputed from the 1000 Genomes Project and examined for association with INV EOC in white-European subjects. The most significant imputed SNP was rs117729793 in SLC39A11 (per allele, OR = 2.55, 95% CI = 1.5-4.35, p = 5.66 × 10^-4). Conclusion These results, generated on a large cohort of women, revealed associations between inherited cellular transport gene variants and risk of EOC histologic subtypes. PMID:26091520
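
    The Methods above apply an FDR threshold of q<0.2 without naming the procedure. A common choice is the Benjamini-Hochberg step-up adjustment, which is sketched below purely as an assumption; the function name bh_qvalues and the p-values are hypothetical and are not study data.

```python
def bh_qvalues(pvalues):
    """Return Benjamini-Hochberg adjusted p-values (q-values) in input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that each q-value is the minimum
    # of p * m / rank over its own rank and all larger ranks.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        qvalues[idx] = running_min
    return qvalues


# Hypothetical p-values for five SNPs (illustration only).
pvals = [0.001, 0.008, 0.039, 0.041, 0.60]
qvals = bh_qvalues(pvals)

# SNPs retained at the q < 0.2 threshold quoted in the abstract.
significant = [i for i, q in enumerate(qvals) if q < 0.2]
print(qvals, significant)
```

    With a threshold as permissive as q<0.2, up to roughly one in five retained associations may be false positives, which is why such hits are usually treated as candidates for replication.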

  13. Population-genetic properties of differentiated copy number variations in cattle.

    PubMed

    Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Zhou, Yang; Hay, El Hamidi Abdel; Song, Jiuzhou; Sonstegard, Tad S; Van Tassell, Curtis P; Liu, George E

    2016-03-23

    While single nucleotide polymorphism (SNP) is typically the variant of choice for population genetics, copy number variation (CNV), which comprises insertion, deletion and duplication of genomic sequence, is also an informative type of genetic variation. CNVs have been shown to be both common in mammals and important for understanding the relationship between genotype and phenotype. However, CNV differentiation, selection and its population genetic properties are not well understood across diverse populations. We performed a population genetics survey based on CNVs derived from the BovineHD SNP array data of eight distinct cattle breeds. We generated high resolution results that show geographical patterns of variations and genome-wide admixture proportions within and among breeds. Similar to the previous SNP-based studies, our CNV-based results displayed a strong correlation of population structure and geographical location. By conducting three pairwise comparisons among European taurine, African taurine, and indicine groups, we further identified 78 unique CNV regions that were highly differentiated, some of which might be due to selection. These CNV regions overlapped with genes involved in traits related to parasite resistance, immunity response, body size, fertility, and milk production. Our results characterize CNV diversity among cattle populations and provide a list of lineage-differentiated CNVs.

  14. Biological relevance of CNV calling methods using familial relatedness including monozygotic twins.

    PubMed

    Castellani, Christina A; Melka, Melkaye G; Wishart, Andrea E; Locke, M Elizabeth O; Awamleh, Zain; O'Reilly, Richard L; Singh, Shiva M

    2014-04-21

    Studies involving the analysis of structural variation including Copy Number Variation (CNV) have recently exploded in the literature. Furthermore, CNVs have been associated with a number of complex diseases and neurodevelopmental disorders. Common methods for CNV detection use SNP, CNV, or CGH arrays, where the signal intensities of consecutive probes are used to define the number of copies associated with a given genomic region. These practices pose a number of challenges that interfere with the ability of available methods to accurately call CNVs. It has, therefore, become necessary to develop experimental protocols to test the reliability of CNV calling methods from microarray data so that researchers can properly discriminate biologically relevant data from noise. We have developed a workflow for the integration of data from multiple CNV calling algorithms using the same array results. It uses four CNV calling programs: PennCNV (PC), Affymetrix® Genotyping Console™ (AGC), Partek® Genomics Suite™ (PGS) and Golden Helix SVS™ (GH) to analyze CEL files from the Affymetrix® Human SNP 6.0 Array™. To assess the relative suitability of each program, we used individuals of known genetic relationships. We found significant differences in CNV calls obtained by different CNV calling programs. Although the programs showed variable patterns of CNVs in the same individuals, their distribution in individuals of different degrees of genetic relatedness has allowed us to offer two suggestions. The first involves the use of multiple algorithms for the detection of the largest possible number of CNVs, and the second suggests the use of PennCNV over all other methods when the use of only one software program is desirable.

  15. Lack of association of the TP53 Arg72Pro SNP and the MDM2 SNP309 with systemic lupus erythematosus in Caucasian, African American, and Asian children and adults.

    PubMed

    Onel, K B; Huo, D; Hastings, D; Fryer-Biggs, J; Crow, M K; Onel, K

    2009-01-01

    The p53 tumour suppressor is the central regulator of apoptosis. Previously, the functional TP53 Arg72Pro polymorphism was found to be associated with systemic lupus erythematosus (SLE) in Koreans but not Spaniards. MDM2 is the major negative regulator of p53. An intronic polymorphism in MDM2, the SNP309, attenuates p53 activity and is associated with accelerated tumour development in premenopausal women. Polymorphic variation in MDM2 has never been studied in SLE. The aim of this study is to further assess the contribution of p53-pathway genetic variation to SLE by testing the association of the TP53 Arg72Pro polymorphism and the MDM2 SNP309 with SLE in a well-characterised and ethnically diverse cohort of patients with both childhood- and adult-onset SLE (n = 314). No association was found between the TP53 Arg72Pro polymorphism and SLE in patients of European descent, Asian descent or in African Americans, nor was an association found between the MDM2 SNP309 and SLE in patients of European descent or in African Americans. In addition, there was no correlation between either variant and early-onset disease or nephritis, an index of severe disease. It is concluded that neither the TP53 Arg72Pro polymorphism nor the MDM2 SNP309 contributes significantly to either susceptibility or disease severity in SLE.

  16. Vitis Phylogenomics: Hybridization Intensities from a SNP Array Outperform Genotype Calls

    PubMed Central

    Miller, Allison J.; Matasci, Naim; Schwaninger, Heidi; Aradhya, Mallikarjuna K.; Prins, Bernard; Zhong, Gan-Yuan; Simon, Charles; Buckler, Edward S.; Myles, Sean

    2013-01-01

    Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera and has general implications for addressing ascertainment bias in array-enabled phylogeny reconstruction. PMID:24236035

  17. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.

    PubMed

    Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D

    2013-02-28

    Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array, more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.

  18. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation

    PubMed Central

    2013-01-01

    Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array, more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
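
    The "~200,000 true SNPs" estimate appears to follow from scaling the candidate SNP count by the array validation rate. The short calculation below reproduces that arithmetic from the figures in the abstract; applying the array-wide validation rate uniformly to the whole database is an assumption made here for illustration, not a statement of the authors' exact method.

```python
# Figures taken from the abstract above.
assayed_snps = 8067        # SNPs placed on the Infinium array
validated_snps = 5847      # called successfully and polymorphic
candidate_snps = 278_979   # unique putative SNPs in the database

# Fraction of assayed SNPs that validated.
validation_rate = validated_snps / assayed_snps

# Assumption: the same rate holds for every putative SNP in the database.
estimated_true_snps = candidate_snps * validation_rate

print(f"validation rate: {validation_rate:.1%}")
print(f"estimated true SNPs: {estimated_true_snps:,.0f}")
```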

  19. Association, effects and validation of polymorphisms within the NCAPG - LCORL locus located on BTA6 with feed intake, gain, meat and carcass traits in beef cattle

    PubMed Central

    2011-01-01

    Background In a previously reported genome-wide association study based on a high-density bovine SNP genotyping array, 8 SNP were nominally associated (P ≤ 0.003) with average daily gain (ADG) and 3 of these were also associated (P ≤ 0.002) with average daily feed intake (ADFI) in a population of crossbred beef cattle. The SNP were clustered in a 570 kb region around 38 Mb on the draft sequence of bovine chromosome 6 (BTA6), an interval containing several positional and functional candidate genes including the bovine LAP3, NCAPG, and LCORL genes. The goal of the present study was to develop and examine additional markers in this region to optimize the ability to distinguish favorable alleles, with potential to identify functional variation. Results Animals from the original study were genotyped for 47 SNP within or near the gene boundaries of the three candidate genes. Sixteen markers in the NCAPG-LCORL locus displayed significant association with both ADFI and ADG even after stringent correction for multiple testing (P ≤ 005). These markers were evaluated for their effects on meat and carcass traits. The alleles associated with higher ADFI and ADG were also associated with higher hot carcass weight (HCW) and ribeye area (REA), and lower adjusted fat thickness (AFT). A reduced set of markers was genotyped on a separate, crossbred population including genetic contributions from 14 beef cattle breeds. Two of the markers located within the LCORL gene locus remained significant for ADG (P ≤ 0.04). Conclusions Several markers within the NCAPG-LCORL locus were significantly associated with feed intake and body weight gain phenotypes. These markers were also associated with HCW, REA and AFT suggesting that they are involved with lean growth and reduced fat deposition. Additionally, the two markers significant for ADG in the validation population of animals may be more robust for the prediction of ADG and possibly the correlated trait ADFI, across multiple breeds and populations of cattle. PMID:22168586

  20. Associations and interactions between SNPs in the alcohol metabolizing genes and alcoholism phenotypes in European Americans.

    PubMed

    Sherva, Richard; Rice, John P; Neuman, Rosalind J; Rochberg, Nanette; Saccone, Nancy L; Bierut, Laura J

    2009-05-01

    Alcohol dependence is a major cause of morbidity and mortality worldwide and has a strong familial component. Several linkage and association studies have identified chromosomal regions and/or genes that affect alcohol consumption, notably in genes involved in the 2-stage pathway of alcohol metabolism. Here, we use multiple regression models to test for associations and interactions between 2 alcohol-related phenotypes and SNPs in 17 genes involved in alcohol metabolism in a sample of 1,588 European American subjects. The strongest evidence for association after correcting for multiple testing was between rs1229984, a nonsynonymous coding SNP in ADH1B, and DSM-IV symptom count (p = 0.0003). This SNP was also associated with maximum number of drinks in 24 hours (p = 0.0004). Each minor allele at this SNP predicts 45% fewer DSM-IV symptoms and 18% fewer max drinks. Another SNP in a splice site in ALDH1A1 (rs8187974) showed evidence for association with both phenotypes as well (p = 0.02 and 0.004, respectively), but neither association was significant after accounting for multiple testing. Minor alleles at this SNP predict greater alcohol consumption. In addition, pairwise interactions were observed between SNPs in several genes (p = 0.00002). We replicated the large effect of rs1229984 on alcohol behavior, and although not common (MAF = 4%), this polymorphism may be highly relevant from a public health perspective in European Americans. Another SNP, rs8187974, may also affect alcohol behavior but requires replication. Also, interactions between polymorphisms in genes involved in alcohol metabolism are likely determinants of the parameters that ultimately affect alcohol consumption.

  1. Polymorphisms of the tumor necrosis factor-alpha receptor 2 gene are associated with obesity phenotypes among 405 Caucasian nuclear families.

    PubMed

    Zhao, Lan-Juan; Xiong, Dong-Hai; Pan, Feng; Liu, Xiao-Gang; Recker, Robert R; Deng, Hong-Wen

    2008-09-01

    The plasma level of the tumor necrosis factor-alpha receptor 2 (TNFR2) is associated with obesity phenotypes. However, the genetic polymorphisms for such an association have rarely been explored and are generally unknown. In this study, by employing a large sample of 1,873 subjects from 405 Caucasian nuclear families, we explored the association of 12 SNPs of the TNFR2 gene and obesity-related phenotypes, including body mass index (BMI), fat mass, and percentage fat mass (PFM). The within-family quantitative transmission disequilibrium test, which is robust to sample stratification, was implemented to evaluate the association of TNFR2 gene with obesity phenotypes. Evidence of association was obtained at SNP9 (rs5746059) with fat mass (P = 0.0002), BMI (P = 0.002), and PFM (P = 0.0006). The contribution of this polymorphism to the variation of fat mass and PFM was 6.24 and 7.82%, respectively. Individuals carrying allele A at the SNP9 site had a 4.6% higher fat mass and a 2.5% increased PFM compared to noncarriers. The results remained significant even after correction for multiple testing. Evidence of association between the TNFR2 gene and obesity phenotypes was also found in 700 independent Chinese Han and 1,000 random Caucasian samples. The results suggest that the TNFR2 gene polymorphisms contribute to the variation of obesity phenotypes.

  2. Oxytocin and Vasopressin Receptor Gene Variation as a Proximate Base for Inter- and Intraspecific Behavioral Differences in Bonobos and Chimpanzees

    PubMed Central

    Staes, Nicky; Stevens, Jeroen M. G.; Helsen, Philippe; Hillyer, Mia; Korody, Marisa; Eens, Marcel

    2014-01-01

    Recent literature has revealed the importance of variation in neuropeptide receptor gene sequences in the regulation of behavioral phenotypic variation. Here we focus on polymorphisms in the oxytocin receptor gene (OXTR) and vasopressin receptor gene 1a (Avpr1a) in chimpanzees and bonobos. In humans, a single nucleotide polymorphism (SNP) in the third intron of OXTR (rs53576 SNP (A/G)) is linked with social behavior, with the risk allele (A) carriers showing reduced levels of empathy and prosociality. Bonobos and chimpanzees differ in these same traits, therefore we hypothesized that these differences might be reflected in variation at the rs53576 position. We sequenced a 320 bp region surrounding rs53576 but found no indications of this SNP in the genus Pan. However, we identified previously unreported SNP variation in the chimpanzee OXTR sequence that differs from both humans and bonobos. Humans and bonobos have previously been shown to have a more similar 5′ promoter region of Avpr1a when compared to chimpanzees, who are polymorphic for the deletion of ∼360 bp in this region (+/− DupB) which includes a microsatellite (RS3). RS3 has been linked with variation in levels of social bonding, potentially explaining part of the interspecies behavioral differences found in bonobos, chimpanzees and humans. To date, results for bonobos have been based on small sample sizes. Our results confirmed that there is no DupB deletion in bonobos with a sample size comprising approximately 90% of the captive founder population, whereas in chimpanzees the deletion of DupB had the highest frequency. Because of the higher frequency of DupB alleles in our bonobo population, we suggest that the presence of this microsatellite may partly reflect documented differences in levels of sociability found in bonobos and chimpanzees. PMID:25405348

  3. Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

    PubMed Central

    McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.

    2013-01-01

    To help cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification, we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistencies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982
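
    The "≤1 Mendelian inheritance conflicts" criterion in this abstract can be illustrated with a minimal sketch. It assumes each MS genotype is a pair of allele sizes and that a marker conflicts when the offspring's two alleles cannot be split between sire and dam. The function names and the allele sizes are hypothetical, not taken from the paper.

```python
def mendelian_conflict(child, sire, dam):
    """Return True if the child's genotype cannot be explained by one
    allele from the sire and the other allele from the dam."""
    c1, c2 = child
    consistent = (c1 in sire and c2 in dam) or (c2 in sire and c1 in dam)
    return not consistent


def count_conflicts(child_markers, sire_markers, dam_markers):
    """Count conflicting markers across a panel (e.g. 12 MS markers)."""
    return sum(
        mendelian_conflict(c, s, d)
        for c, s, d in zip(child_markers, sire_markers, dam_markers)
    )


# Hypothetical two-marker panel; allele sizes are invented for illustration.
child = [(120, 124), (118, 200)]
sire = [(120, 122), (196, 198)]
dam = [(124, 126), (200, 202)]
n_conflicts = count_conflicts(child, sire, dam)  # second marker conflicts
```

    Under the abstract's threshold, an animal with `n_conflicts` of 0 or 1 would pass; how the authors handled missing genotypes is not stated and is not modelled here.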

  4. Identification of an Interaction between VWF rs7965413 and Platelet Count as a Novel Risk Marker for Metabolic Syndrome: An Extensive Search of Candidate Polymorphisms in a Case-Control Study

    PubMed Central

    Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki

    2015-01-01

    Although many single nucleotide polymorphisms (SNPs) have been identified as associated with metabolic syndrome (MetS), the simple addition of SNPs to clinical risk markers has only slightly improved the ability to predict future MetS. To improve this predictive ability, combinational effects such as SNP-SNP interaction, SNP-environment interaction, and SNP-clinical parameter (SNP × CP) interaction should also be considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs in 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these SNPs, five were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS. PMID:25646961
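
    To make the interaction term concrete, here is a small sketch of how an SNP odds ratio changes with the clinical parameter in a logistic model of the form logit(p) = b0 + b1·SNP + b2·CP + b3·SNP·CP. The two odds ratios are the ones reported above. The genotype coding and the scale of platelet count are not given in the abstract, so treating `cp_value` as a standardized quantity is an assumption, as is the function name.

```python
import math


def snp_odds_ratio_at(cp_value, or_snp=0.78, or_interaction=1.33):
    """Odds ratio per unit of the SNP term at a given CP value.

    In a model with an SNP x CP interaction, the SNP effect on the log-odds
    scale is b1 + b3 * CP, so the odds ratio is exp(b1 + b3 * CP).
    `cp_value` is assumed to be on whatever scale the model used
    (standardized here for illustration only).
    """
    beta_snp = math.log(or_snp)
    beta_interaction = math.log(or_interaction)
    return math.exp(beta_snp + beta_interaction * cp_value)


or_at_reference = snp_odds_ratio_at(0.0)   # equals the SNP-term OR, 0.78
or_one_unit_up = snp_odds_ratio_at(1.0)    # 0.78 * 1.33
```

    The point of the sketch is only that the SNP term alone describes the effect at CP = 0; at other CP values the two reported odds ratios combine multiplicatively.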

  5. The analysis of correlation between IL-1B gene expression and genotyping in multiple sclerosis patients.

    PubMed

    Heidary, Masoumeh; Rakhshi, Nahid; Pahlevan Kakhki, Majid; Behmanesh, Mehrdad; Sanati, Mohammad Hossein; Sanadgol, Nima; Kamaladini, Hossein; Nikravesh, Abbas

    2014-08-15

    IL-1B is released by monocytes, astrocytes and brain endothelial cells and seems to be involved in inflammatory reactions of the central nervous system (CNS) in multiple sclerosis (MS). This study aims to evaluate the expression level of IL-1B mRNA in peripheral blood mononuclear cells (PBMCs), genotype the rs16944 SNP and determine the effect of this SNP on the expression level of IL-1B in MS patients. We found that the expression level of IL-1B in PBMCs of MS patients was 3.336 times higher than in controls, but the rs16944 SNP in the promoter region of IL-1B did not affect the expression level of this gene and there was no association of this SNP with MS in the examined population. Also, our data did not reveal any correlation between normalized expression of the IL-1B gene and age of participants, age of onset, or disease duration. Copyright © 2014 Elsevier B.V. All rights reserved.

  6. Analysis of the genome-wide variations among multiple strains of the plant pathogenic bacterium Xylella fastidiosa

    PubMed Central

    Doddapaneni, Harshavardhan; Yao, Jiqiang; Lin, Hong; Walker, M Andrew; Civerolo, Edwin L

    2006-01-01

    Background The Gram-negative, xylem-limited phytopathogenic bacterium Xylella fastidiosa is responsible for causing economically important diseases in grapevine, citrus and many other plant species. Despite its economic impact, relatively little is known about the genomic variations among strains isolated from different hosts and their influence on the population genetics of this pathogen. With the availability of genome sequence information for four strains, it is now possible to perform genome-wide analyses to identify and categorize such DNA variations and to understand their influence on strain functional divergence. Results There are 1,579 genes and 194 non-coding homologous sequences present in the genomes of all four strains, representing a 76.2% conservation of the sequenced genome. About 60% of the X. fastidiosa unique sequences exist as tandem gene clusters of 6 or more genes. Multiple alignments identified 12,754 SNPs and 14,449 INDELs in the 1528 common genes and 20,779 SNPs and 10,075 INDELs in the 194 non-coding sequences. The average SNP frequency was 1.08 × 10⁻² per base pair of DNA and the average INDEL frequency was 2.06 × 10⁻² per base pair of DNA. On an average, 60.33% of the SNPs were synonymous type while 39.67% were non-synonymous type. The mutation frequency, primarily in the form of external INDELs was the main type of sequence variation. The relative similarity between the strains was discussed according to the INDEL and SNP differences. The number of genes unique to each strain were 60 (9a5c), 54 (Dixon), 83 (Ann1) and 9 (Temecula-1). A sub-set of the strain specific genes showed significant differences in terms of their codon usage and GC composition from the native genes suggesting their xenologous origin. Tandem repeat analysis of the genomic sequences of the four strains identified associations of repeat sequences with hypothetical and phage related functions. Conclusion INDELs and strain specific genes have been identified as the main source of variations among strains, with individual strains showing different rates of genome evolution. Based on these genome comparisons, it appears that the Pierce's disease strain Temecula-1 genome represents the ancestral genome of the X. fastidiosa. Results of this analysis are publicly available in the form of a web database. PMID:16948851

  7. Translational genomics for abiotic stress in sorghum: transcriptional profiling and validation of SNP markers between germplasm with differential cold tolerance

    USDA-ARS's Scientific Manuscript database

    One focus of the Sorghum Translational Genomics Lab (part of sorghum CRIS, PSGD, CSRL, USDA-ARS, Lubbock TX) is to utilize nucleotide variation between sorghum germplasm such as those derived from RNA seq for translation and validation of Single Nucleotide Polymorphism (SNP) into easy access DNA m...

  8. Both a Nicotinic Single Nucleotide Polymorphism (SNP) and a Noradrenergic SNP Modulate Working Memory Performance when Attention Is Manipulated

    ERIC Educational Resources Information Center

    Greenwood, Pamela M.; Sundararajan, Ramya; Lin, Ming-Kuan; Kumar, Reshma; Fryxell, Karl J.; Parasuraman, Raja

    2009-01-01

    We investigated the relation between the two systems of visuospatial attention and working memory by examining the effect of normal variation in cholinergic and noradrenergic genes on working memory performance under attentional manipulation. We previously reported that working memory for location was impaired following large location precues,…

  9. Polymorphism at the TRIB1 gene modulates plasma lipid levels: insight from the Spanish familial hypercholesterolemia cohort study

    USDA-ARS's Scientific Manuscript database

    rs17321515 SNP has been associated with variation in LDL-C, high density lipoprotein cholesterol and triglycerides concentrations. This effect has never been studied in patients with severe hypercholesterolemia. Therefore, our aims were to assess the association of the rs17321515 (TRIB1) SNP with pl...

  10. A Discovery Resource of Rare Copy Number Variations in Individuals with Autism Spectrum Disorder

    PubMed Central

    Prasad, Aparna; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wei, John; Lionel, Anath C.; Sato, Daisuke; Rickaby, Jessica; Lu, Chao; Szatmari, Peter; Roberts, Wendy; Fernandez, Bridget A.; Marshall, Christian R.; Hatchwell, Eli; Eis, Peggy S.; Scherer, Stephen W.

    2012-01-01

    The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection. PMID:23275889

  11. The analysis of APOL1 genetic variation and haplotype diversity provided by 1000 Genomes project.

    PubMed

    Peng, Ting; Wang, Li; Li, Guisen

    2017-08-11

    APOL1 gene variants have been shown to be associated with an increased risk of multiple kinds of diseases, particularly in African Americans, but not in Caucasians and Asians. In this study, we explored the single nucleotide polymorphism (SNP) and haplotype diversity of the APOL1 gene in different races using data provided by the 1000 Genomes Project. Variants of the APOL1 gene in the 1000 Genomes Project were obtained, and SNPs located in the regulatory region or coding region were selected for genetic variation analysis. A total of 2504 individuals from 26 populations were classified into four groups: African, European, Asian and Admixed populations. Tag SNPs were selected to evaluate the haplotype diversity in the four populations with the HaploStats software. The APOL1 gene is surrounded by some of the most polymorphic genes in the human genome, and variation in the APOL1 gene was common, with up to 613 SNPs reported by the 1000 Genomes Project, 99 of them (16.2%) with MAF ≥ 1%. There were 79 SNPs in the URR and 92 SNPs in the 3'UTR. A total of 12 SNPs in the URR and 24 SNPs in the 3'UTR were considered common variants with MAF ≥ 1%. Notably, URR-1 was present at lower frequencies in European populations, while the other three haplotypes showed the opposite pattern. The 3'UTR contains several high-frequency variant sites in a short segment, and the differences in its haplotypes among populations were significant (P < 0.01): UTR-1 and UTR-5 were present at much higher frequencies in the African population, while UTR-2, UTR-3 and UTR-4 were much lower. In the APOL1 coding region, the two higher-frequency SNPs of G1 pull down the frequency of haplotype H-1 when all populations are pooled, and the two G1 mutations widen the diversity among the four populations (P1 = 3.33E-4 vs P2 = 3.61E-30). The distributions of APOL1 gene variants and haplotypes were significantly different among the populations, in both regulatory and coding regions. These findings could provide clues for future genetic studies of APOL1-related diseases.

  12. Computational intelligence in bioinformatics: SNP/haplotype data in genetic association study for common diseases.

    PubMed

    Kelemen, Arpad; Vasilakos, Athanasios V; Liang, Yulan

    2009-09-01

    Comprehensive evaluation of common genetic variations through association of single-nucleotide polymorphism (SNP) structure with common complex disease on the genome-wide scale is currently a hot area in human genome research due to the recent development of the Human Genome Project and HapMap Project. Computational science, which includes computational intelligence (CI), has recently become the third method of scientific enquiry besides theory and experimentation. Interest in developing and applying CI in disease mapping using SNP and haplotype data has grown rapidly. Some recent studies have demonstrated the promise and importance of CI for common complex diseases in genomic association studies using SNP/haplotype data, especially for tackling challenges such as gene-gene and gene-environment interactions and the notorious "curse of dimensionality" problem. This review covers recent developments of CI approaches for complex diseases in genetic association studies with SNP/haplotype data.

  13. Lack of Association of the TP53 Arg72Pro SNP and the MDM2 SNP309 with systemic lupus erythematosus in Caucasian, African American, and Asian children and adults

    PubMed Central

    Onel, KB; Huo, D; Hastings, D; Fryer-Biggs, J; Crow, MK; Onel, K

    2009-01-01

    The p53 tumour suppressor is the central regulator of apoptosis. Previously, the functional TP53 Arg72Pro polymorphism was found to be associated with systemic lupus erythematosus (SLE) in Koreans but not Spaniards. MDM2 is the major negative regulator of p53. An intronic polymorphism in MDM2, the SNP309, attenuates p53 activity and is associated with accelerated tumour development in premenopausal women. Polymorphic variation in MDM2 has never been studied in SLE. The aim of this study is to further assess the contribution of p53-pathway genetic variation to SLE by testing the association of the TP53 Arg72Pro polymorphism and the MDM2 SNP309 with SLE in a well-characterised and ethnically diverse cohort of patients with both childhood- and adult-onset SLE (n = 314). No association was found between the TP53 Arg72Pro polymorphism and SLE in patients of European descent, Asian descent or in African Americans, nor was an association found between the MDM2 SNP309 and SLE in patients of European descent or in African Americans. In addition, there was no correlation between either variant and early-onset disease or nephritis, an index of severe disease. It is concluded that neither the TP53 Arg72Pro polymorphism nor the MDM2 SNP309 contributes significantly to either susceptibility or disease severity in SLE. PMID:19074170

  14. An innovative SNP genotyping method adapting to multiple platforms and throughputs

    USDA-ARS's Scientific Manuscript database

    Single nucleotide polymorphisms (SNPs) are highly abundant, distributed throughout the genome in various species, and therefore they are widely used as genetic markers. However, the usefulness of this genetic tool relies heavily on the availability of user-friendly SNP genotyping methods. We have d...

  15. Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications

    USDA-ARS's Scientific Manuscript database

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...

  16. Microsatellite Imputation for parental verification from SNP across multiple Bos taurus and indicus breeds

    USDA-ARS's Scientific Manuscript database

    Microsatellite markers (MS) have traditionally been used for parental verification and are still the international standard in spite of their higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP)-based assays. Despite domestic and international demands fro...

  17. Genetic Variants in STAT3 Promoter Regions and Their Application in Molecular Breeding for Body Size Traits in Qinchuan Cattle

    PubMed Central

    Wang, Yaning; Ning, Yue; Guo, Hongfang; Wang, Xiaoyu; Zhang, Le; Cheng, Gong; Wang, Hongbao; Zan, Linsen

    2018-01-01

    Signal transducer and activator of transcription 3 (STAT3) plays a critical role in leptin-mediated regulation of energy metabolism. This study investigated genetic variation in STAT3 promoter regions and verified their contribution to bovine body size traits. We first estimated the degree of conservation in STAT3, followed by measurements of its mRNA expression during fetal and adult stages of Qinchuan cattle. We then sequenced the STAT3 promoter region to determine genetic variants and evaluate their association with body size traits. From fetus to adult, STAT3 expression increased significantly in muscle, fat, heart, liver, and spleen tissues (p < 0.01), but decreased in the intestine, lung, and rumen (p < 0.01). We identified and named five single nucleotide polymorphisms (SNPs): SNP1-304A>C, SNP2-285G>A, SNP3-209A>C, SNP4-203A>G, and SNP5-188T>C. These five mutations fell significantly outside the Hardy–Weinberg equilibrium (HWE) (Chi-squared test, p < 0.05) and significantly associated with body size traits (p < 0.05). Individuals with haplotype H3H3 (CC-GG-CC-GG-CC) were larger in body size than other haplotypes. Therefore, variations in the STAT3 gene promoter regions, most notably haplotype H3H3, may benefit marker-assisted breeding of Qinchuan cattle. PMID:29596388
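
    The Hardy-Weinberg check mentioned in this abstract can be sketched as a chi-squared goodness-of-fit test on genotype counts at one biallelic SNP. This is the textbook form of the test, not necessarily the exact procedure the authors ran, and the genotype counts in the example are hypothetical.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-squared statistic for deviation from Hardy-Weinberg proportions.

    Arguments are observed counts of the two homozygotes and the
    heterozygote. Expected counts are p^2*N, 2pq*N and q^2*N, where p is
    the allele frequency estimated from the same sample.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)
    q = 1.0 - p
    observed = (n_aa, n_ab, n_bb)
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


# Critical value of the chi-squared distribution with 1 degree of freedom
# at the 0.05 level.
CRITICAL_0_05 = 3.841

# Hypothetical counts: a strong heterozygote deficit.
statistic = hwe_chi_square(50, 0, 50)
deviates = statistic > CRITICAL_0_05
```

    With one degree of freedom, a statistic above 3.841 corresponds to p < 0.05, the threshold quoted in the abstract.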

  18. Genetic Variants in STAT3 Promoter Regions and Their Application in Molecular Breeding for Body Size Traits in Qinchuan Cattle.

    PubMed

    Wu, Sen; Wang, Yaning; Ning, Yue; Guo, Hongfang; Wang, Xiaoyu; Zhang, Le; Khan, Rajwali; Cheng, Gong; Wang, Hongbao; Zan, Linsen

    2018-03-29

    Signal transducer and activator of transcription 3 (STAT3) plays a critical role in leptin-mediated regulation of energy metabolism. This study investigated genetic variation in STAT3 promoter regions and verified their contribution to bovine body size traits. We first estimated the degree of conservation in STAT3, followed by measurements of its mRNA expression during fetal and adult stages of Qinchuan cattle. We then sequenced the STAT3 promoter region to determine genetic variants and evaluate their association with body size traits. From fetus to adult, STAT3 expression increased significantly in muscle, fat, heart, liver, and spleen tissues (p < 0.01), but decreased in the intestine, lung, and rumen (p < 0.01). We identified and named five single nucleotide polymorphisms (SNPs): SNP1-304A>C, SNP2-285G>A, SNP3-209A>C, SNP4-203A>G, and SNP5-188T>C. These five mutations fell significantly outside the Hardy-Weinberg equilibrium (HWE) (Chi-squared test, p < 0.05) and significantly associated with body size traits (p < 0.05). Individuals with haplotype H3H3 (CC-GG-CC-GG-CC) were larger in body size than other haplotypes. Therefore, variations in the STAT3 gene promoter regions, most notably haplotype H3H3, may benefit marker-assisted breeding of Qinchuan cattle.

  19. High-Resolution SNP/CGH Microarrays Reveal the Accumulation of Loss of Heterozygosity in Commonly Used Candida albicans Strains

    PubMed Central

    Abbey, Darren; Hickman, Meleah; Gresham, David; Berman, Judith

    2011-01-01

    Phenotypic diversity can arise rapidly through loss of heterozygosity (LOH) or by the acquisition of copy number variations (CNV) spanning whole chromosomes or shorter contiguous chromosome segments. In Candida albicans, a heterozygous diploid yeast pathogen with no known meiotic cycle, homozygosis and aneuploidy alter clinical characteristics, including drug resistance. Here, we developed a high-resolution microarray that simultaneously detects ∼39,000 single nucleotide polymorphism (SNP) alleles and ∼20,000 copy number variation loci across the C. albicans genome. An important feature of the array analysis is a computational pipeline that determines SNP allele ratios based upon chromosome copy number. Using the array and analysis tools, we constructed a haplotype map (hapmap) of strain SC5314 to assign SNP alleles to specific homologs, and we used it to follow the acquisition of loss of heterozygosity (LOH) and copy number changes in a series of derived laboratory strains. This high-resolution SNP/CGH microarray and the associated hapmap facilitated the phasing of alleles in lab strains and revealed detrimental genome changes that arose frequently during molecular manipulations of laboratory strains. Furthermore, it provided a useful tool for rapid, high-resolution, and cost-effective characterization of changes in allele diversity as well as changes in chromosome copy number in new C. albicans isolates. PMID:22384363

  20. Reproducibility and repeatability of peripheral microvascular assessment using iontophoresis in conjunction with laser Doppler imaging.

    PubMed

    Jadhav, Sachin; Sattar, Naveed; Petrie, John R; Cobbe, Stuart M; Ferrell, William R

    2007-09-01

    Interrogation of peripheral vascular function is increasingly recognized as a noninvasive surrogate marker for coronary vascular function and carries with it important prognostic information regarding future cardiovascular risk. Laser Doppler imaging (LDI) is a completely noninvasive method for looking at peripheral microvascular function. We sought to look at reproducibility and repeatability of LDI-derived assessment of peripheral microvascular function between arms and 8 weeks apart. We used LDI in conjunction with iontophoretic application of ACh and SNP to look at endothelium-dependent and -independent microvascular function, respectively, in a mixture of women with cardiac syndrome X and healthy volunteers. We looked at variation between arms (n = 40) and variation 8 weeks apart (n = 22). When measurements were corrected for skin resistance, there was nonsignificant variation between arms for ACh (2.7%) and SNP (3.8%) and nonsignificant temporal variation for ACh (3.5%) and SNP (4.7%). Construction of Bland-Altman plots reinforces that measurements have good repeatability. Elimination of the baseline perfusion response had deleterious effects on repeatability. LDI can be used to assess peripheral vascular response with good repeatability as long as measurements are corrected for skin resistance, which affects drug delivery. This has important implications for the future use of LDI.
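
    The quantities behind a Bland-Altman plot are simple to compute: the mean difference between paired measurements (the bias) and the 95% limits of agreement at bias ± 1.96 standard deviations of the differences. The sketch below uses invented perfusion readings, not data from the study, and the function name is an assumption.

```python
from statistics import mean, stdev


def bland_altman(first, second):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    `first` and `second` are equal-length sequences, e.g. the same
    subjects measured on each arm or on two visits.
    """
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical paired readings (arbitrary perfusion units).
visit_1 = [10.0, 12.0, 14.0, 16.0]
visit_2 = [9.0, 13.0, 13.0, 17.0]
bias, lower, upper = bland_altman(visit_1, visit_2)
```

    Good repeatability shows up as a bias near zero and narrow limits of agreement relative to the size of the measurements.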

  1. Association of single nucleotide polymorphisms in candidate genes previously related to genetic variation in fertility with phenotypic measurements of reproductive function in Holstein cows

    USDA-ARS's Scientific Manuscript database

    The objectives of this study were to evaluate the effect of 68 SNP previously associated with genetic merit for fertility and production on phenotype for reproductive and productive traits in a population of Holstein cows. In addition, we determined which SNP had repeated effects across three studie...

  2. Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.

    PubMed

    Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A

    2012-12-01

    The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30%), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed in baseline characteristics or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.

  3. Whole genome survey of coding SNPs reveals a reproducible pathway determinant of Parkinson disease

    PubMed Central

    Srinivasan, Balaji S; Doostzadeh, Jaleh; Absalan, Farnaz; Mohandessi, Sharareh; Jalili, Roxana; Bigdeli, Saharnaz; Wang, Justin; Mahadevan, Jaydev; Lee, Caroline LG; Davis, Ronald W; William Langston, J; Ronaghi, Mostafa

    2009-01-01

    It is quickly becoming apparent that situating human variation in a pathway context is crucial to understanding its phenotypic significance. Toward this end, we have developed a general method for finding pathways associated with traits that control for pathway size. We have applied this method to a new whole genome survey of coding SNP variation in 187 patients afflicted with Parkinson disease (PD) and 187 controls. We show that our dataset provides an independent replication of the axon guidance association recently reported by Lesnick et al. [PLoS Genet 2007;3:e98], and also indicates that variation in the ubiquitin-mediated proteolysis and T-cell receptor signaling pathways may predict PD susceptibility. Given this result, it is reasonable to hypothesize that pathway associations are more replicable than individual SNP associations in whole genome association studies. However, this hypothesis is complicated by a detailed comparison of our dataset to the second recent PD association study by Fung et al. [Lancet Neurol 2006;5:911–916]. Surprisingly, we find that the axon guidance pathway does not rank at the very top of the Fung dataset after controlling for pathway size. More generally, in comparing the studies, we find that SNP frequencies replicate well despite technologically different assays, but that both SNP and pathway associations are globally uncorrelated across studies. We thus have a situation in which an association between axon guidance pathway variation and PD has been found in 2 out of 3 studies. We conclude by relating this seeming inconsistency to the molecular heterogeneity of PD, and suggest future analyses that may resolve such discrepancies. PMID:18853455

  4. Genetic Variation in Selenoprotein Genes, Lifestyle, and Risk of Colon and Rectal Cancer

    PubMed Central

    Slattery, Martha L.; Lundgreen, Abbie; Welbourn, Bill; Corcoran, Christopher; Wolff, Roger K.

    2012-01-01

    Background Associations between selenium and cancer have directed attention to the role of selenoproteins in the carcinogenic process. Methods We used data from two population-based case-control studies of colon (n = 1555 cases, 1956 controls) and rectal (n = 754 cases, 959 controls) cancer. We evaluated the association between genetic variation in TXNRD1, TXNRD2, TXNRD3, C11orf31 (SelH), SelW, SelN1, SelS, SepX, and SeP15 with colorectal cancer risk. Results After adjustment for multiple comparisons, several associations were observed. Two SNPs in TXNRD3 were associated with rectal cancer (rs11718498 dominant OR 1.42 95% CI 1.16,1.74 pACT 0.0036 and rs9637365 recessive OR 0.70 95% CI 0.55,0.90 pACT 0.0208). Four SNPs in SepN1 were associated with rectal cancer (rs11247735 recessive OR 1.30 95% CI 1.04,1.63 pACT 0.0410; rs2072749 GGvsAA OR 0.53 95% CI 0.36,0.80 pACT 0.0159; rs4659382 recessive OR 0.58 95% CI 0.39,0.86 pACT 0.0247; rs718391 dominant OR 0.76 95% CI 0.62,0.94 pACT 0.0300). Interaction between these genes and exposures that could influence these genes showed numerous significant associations after adjustment for multiple comparisons. Two SNPs in TXNRD1 and four SNPs in TXNRD2 interacted with aspirin/NSAID to influence colon cancer; one SNP in TXNRD1, two SNPs in TXNRD2, and one SNP in TXNRD3 interacted with aspirin/NSAIDs to influence rectal cancer. Five SNPs in TXNRD2 and one in SelS, SeP15, and SelW1 interacted with estrogen to modify colon cancer risk; one SNP in SelW1 interacted with estrogen to alter rectal cancer risk. Several SNPs in this candidate pathway influenced survival after diagnosis with colon cancer (SeP15 and SepX1 increased HRR) and rectal cancer (SepX1 increased HRR). Conclusions Findings support an association between selenoprotein genes and colon and rectal cancer development and survival after diagnosis. Given the interactions observed, it is likely that the impact of cancer susceptibility from genotype is modified by lifestyle. PMID:22615972
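
    The odds ratios and 95% confidence intervals quoted in this record come from the study's own adjusted models, which are not reproduced here. As a rough illustration of how such a figure is read, the following minimal sketch computes an unadjusted odds ratio with a Woolf (log-scale) 95% CI from a 2x2 table. The function name and the counts in the usage note are hypothetical and are not taken from the study.

```python
import math


def odds_ratio_ci(exposed_cases, unexposed_cases, exposed_controls, unexposed_controls, z=1.96):
    """Unadjusted odds ratio with a Woolf (log-scale) confidence interval.

    Illustrative only: the record above reports adjusted estimates,
    which this simple 2x2 calculation does not reproduce.
    """
    cells = (exposed_cases, unexposed_cases, exposed_controls, unexposed_controls)
    if min(cells) <= 0:
        raise ValueError("all four cells must be positive")
    # Cross-product ratio of the 2x2 table.
    odds_ratio = (exposed_cases * unexposed_controls) / (unexposed_cases * exposed_controls)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log = math.sqrt(sum(1.0 / c for c in cells))
    lower = math.exp(math.log(odds_ratio) - z * se_log)
    upper = math.exp(math.log(odds_ratio) + z * se_log)
    return odds_ratio, lower, upper
```

    For example, odds_ratio_ci(20, 80, 10, 90) gives an odds ratio of 2.25 whose interval still includes 1, so in that hypothetical table the association would not be significant at the 5% level.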

  5. SNP2TFBS - a database of regulatory SNPs affecting predicted transcription factor binding site affinity.

    PubMed

    Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp

    2017-01-04

    SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
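
    The PWM-based estimate described above can be sketched as follows: score the reference and the alternate allele in every window that overlaps the SNP and compare the best scores. This is a minimal sketch under stated assumptions, not the SNP2TFBS pipeline. The pseudocount, the uniform background, the single-strand scan and all function names are assumptions made for illustration.

```python
import math

BASES = "ACGT"


def log_odds_matrix(counts, pseudocount=1.0, background=0.25):
    """Turn per-position base counts into a log2-odds position weight matrix."""
    matrix = []
    for column in counts:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        matrix.append(
            {b: math.log2((column[b] + pseudocount) / total / background) for b in BASES}
        )
    return matrix


def best_score(pwm, sequence, snp_index):
    """Best PWM score over all windows that cover position snp_index."""
    width = len(pwm)
    first = max(0, snp_index - width + 1)
    last = min(snp_index, len(sequence) - width)
    best = -math.inf
    for start in range(first, last + 1):
        window = sequence[start:start + width]
        best = max(best, sum(pwm[i][base] for i, base in enumerate(window)))
    return best


def snp_effect(pwm, flank5, ref, alt, flank3):
    """Return (ref_score, alt_score, alt_score - ref_score) for one SNP.

    Only the forward strand is scanned; a real scan would normally
    also score the reverse complement.
    """
    snp_index = len(flank5)
    ref_score = best_score(pwm, flank5 + ref + flank3, snp_index)
    alt_score = best_score(pwm, flank5 + alt + flank3, snp_index)
    return ref_score, alt_score, alt_score - ref_score
```

    A negative difference suggests the alternate allele weakens the predicted site and a positive one suggests it strengthens it. Deciding whether a site is abolished or created would additionally need a score threshold, which is left out of this sketch.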

  6. Linkage disequilibrium between STRPs and SNPs across the human genome.

    PubMed

    Payseur, Bret A; Place, Michael; Weber, James L

    2008-05-01

    Patterns of linkage disequilibrium (LD) reveal the action of evolutionary processes and provide crucial information for association mapping of disease genes. Although recent studies have described the landscape of LD among single nucleotide polymorphisms (SNPs) from across the human genome, associations involving other classes of molecular variation remain poorly understood. In addition to recombination and population history, mutation rate and process are expected to shape LD. To test this idea, we measured associations between short-tandem-repeat polymorphisms (STRPs), which can mutate rapidly and recurrently, and SNPs in 721 regions across the human genome. We directly compared STRP-SNP LD with SNP-SNP LD from the same genomic regions in the human HapMap populations. The intensity of STRP-SNP LD, measured by the average of D', was reduced, consistent with the action of recurrent mutation. Nevertheless, a higher fraction of STRP-SNP pairs than SNP-SNP pairs showed significant LD, on both short (up to 50 kb) and long (cM) scales. These results reveal the substantial effects of mutational processes on LD at STRPs and provide important measures of the potential of STRPs for association mapping of disease genes.
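
    For readers unfamiliar with the D' statistic mentioned above, the following minimal sketch computes D, D' and r^2 for a pair of biallelic loci from the four haplotype counts. It is a simplification: STRPs are usually multiallelic, and the multiallelic averaging used in the study is not reproduced here. The function name and argument order are illustrative assumptions.

```python
def pairwise_ld(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, D_prime, r_squared) for two biallelic loci.

    Arguments are counts of the four haplotypes: AB, Ab, aB, ab.
    """
    total = n_AB + n_Ab + n_aB + n_ab
    if total == 0:
        raise ValueError("no haplotypes supplied")
    p_A = (n_AB + n_Ab) / total
    p_B = (n_AB + n_aB) / total
    p_AB = n_AB / total
    if p_A in (0.0, 1.0) or p_B in (0.0, 1.0):
        raise ValueError("LD is undefined when a locus is monomorphic")
    # D is the excess of the AB haplotype over its expectation under independence.
    d = p_AB - p_A * p_B
    # D' rescales D by the largest magnitude it could take given the allele frequencies.
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max
    r_squared = d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d, d_prime, r_squared
```

    With counts of 40, 10, 10 and 40 the function returns D = 0.15, D' = 0.6 and r^2 = 0.36, up to floating-point rounding.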

  7. mrsFAST-Ultra: a compact, SNP-aware mapper for high performance sequencing applications.

    PubMed

    Hach, Faraz; Sarrafi, Iman; Hormozdiari, Farhad; Alkan, Can; Eichler, Evan E; Sahinalp, S Cenk

    2014-07-01

    High throughput sequencing (HTS) platforms generate unprecedented amounts of data that introduce challenges for processing and downstream analysis. While tools that report the 'best' mapping location of each read provide a fast way to process HTS data, they are not suitable for many types of downstream analysis such as structural variation detection, where it is important to report multiple mapping loci for each read. For this purpose we introduce mrsFAST-Ultra, a fast, cache oblivious, SNP-aware aligner that can handle the multi-mapping of HTS reads very efficiently. mrsFAST-Ultra improves mrsFAST, our first cache oblivious read aligner capable of handling multi-mapping reads, through new and compact index structures that reduce not only the overall memory usage but also the number of CPU operations per alignment. In fact the size of the index generated by mrsFAST-Ultra is 10 times smaller than that of mrsFAST. As importantly, mrsFAST-Ultra introduces new features such as being able to (i) obtain the best mapping loci for each read, and (ii) return all reads that have at most n mapping loci (within an error threshold), together with these loci, for any user specified n. Furthermore, mrsFAST-Ultra is SNP-aware, i.e. it can map reads to the reference genome while discounting the mismatches that occur at common SNP locations provided by dbSNP; this significantly increases the number of reads that can be mapped to the reference genome. Notice that all of the above features are implemented within the index structure and are not simple post-processing steps and thus are performed highly efficiently. Finally, mrsFAST-Ultra utilizes multiple available cores and processors and can be tuned for various memory settings. Our results show that mrsFAST-Ultra is roughly five times faster than its predecessor mrsFAST. In comparison to newly enhanced popular tools such as Bowtie2, it is more sensitive (it can report 10 times or more mappings per read) and much faster (six times or more) in the multi-mapping mode. Furthermore, mrsFAST-Ultra has an index size of 2GB for the entire human reference genome, which is roughly half of that of Bowtie2. mrsFAST-Ultra is open source and it can be accessed at http://mrsfast.sourceforge.net. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
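
    The idea of discounting mismatches at known SNP positions while reporting every mapping locus can be illustrated with a brute-force sketch. This is not the mrsFAST-Ultra algorithm, which does the work inside a compact index. The functions below are hypothetical, scan every offset, and make the simplifying assumption that any mismatch at a listed SNP position is ignored regardless of the observed base.

```python
def count_mismatches(read, reference, offset, known_snps=frozenset()):
    """Count mismatches of read against reference at offset, skipping known SNP positions."""
    mismatches = 0
    for i, base in enumerate(read):
        position = offset + i
        if base != reference[position] and position not in known_snps:
            mismatches += 1
    return mismatches


def map_all(read, reference, max_mismatches, known_snps=frozenset()):
    """Return every offset where the read aligns within the mismatch threshold."""
    hits = []
    for offset in range(len(reference) - len(read) + 1):
        if count_mismatches(read, reference, offset, known_snps) <= max_mismatches:
            hits.append(offset)
    return hits
```

    With the reference ACGTACGTAC and the read ACGA, a zero-mismatch search finds no locus, but declaring position 3 a known SNP makes offset 0 a valid mapping, which is the sense in which SNP awareness raises the number of mappable reads.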

  8. Association of GSK3beta polymorphisms with brain structural changes in major depressive disorder.

    PubMed

    Inkster, Becky; Nichols, Thomas E; Saemann, Philipp G; Auer, Dorothee P; Holsboer, Florian; Muglia, Pierandrea; Matthews, Paul M

    2009-07-01

    Indirect evidence suggests that the glycogen synthase kinase-3beta (GSK3beta) gene might be implicated in major depressive disorder (MDD). We evaluated 15 GSK3beta single-nucleotide polymorphisms (SNPs) to test for associations with regional gray matter (GM) volume differences in patients with recurrent MDD. We then used the defined regions of interest based on significant associations to test for MDD x genotype interactions by including a matched control group without any psychiatric disorder, including MDD. The analysis used a general linear model with nonstationary cluster-based inference. The study was conducted in Munich, Germany, in patients with recurrent MDD (n = 134) and age-, sex-, and ethnicity-matched healthy controls (n = 143). The outcome measures were associations between GSK3beta polymorphisms and regional GM volume differences. Variation in GM volume was associated with GSK3beta polymorphisms; the most significant associations were found for rs6438552, a putative functional intronic SNP that showed 3 significant GM clusters in the right and left superior temporal gyri and the right hippocampus (P < .001, P = .02, and P = .02, respectively, corrected for multiple comparisons across the whole brain). Similar results were obtained with rs12630592, an SNP in high linkage disequilibrium. A significant SNP x MDD status interaction was observed for the effect on GM volumes in the right hippocampus and superior temporal gyri (P < .001 and P = .01, corrected, respectively). The GSK3beta gene may have a role in determining regional GM volume differences of the right hippocampus and bilateral superior temporal gyri. The association between genotype and brain structure was specific to the patients with MDD, suggesting that GSK3beta genotypes might interact with MDD status. We speculate that this is a consequence of regional neocortical, glial, or neuronal growth or survival. In considering core cognitive features of MDD, the association of GSK3beta polymorphisms with structural variation in the temporal lobe and hippocampus is of particular interest in the context of other evidence for structural and functional abnormalities in the hippocampi of patients with MDD.

  9. CSF protein changes associated with hippocampal sclerosis risk gene variants highlight impact of GRN/PGRN.

    PubMed

    Fardo, David W; Katsumata, Yuriko; Kauwe, John S K; Deming, Yuetiva; Harari, Oscar; Cruchaga, Carlos; Nelson, Peter T

    2017-04-01

    Hippocampal sclerosis of aging (HS-Aging) is a common cause of dementia in older adults. We tested the variability in cerebrospinal fluid (CSF) proteins associated with previously identified HS-Aging risk single nucleotide polymorphisms (SNPs). Alzheimer's Disease Neuroimaging Initiative cohort (ADNI; n=237) data, combining both multiplexed proteomics CSF and genotype data, were used to assess the association between CSF analytes and risk SNPs in four genes (SNPs): GRN (rs5848), TMEM106B (rs1990622), ABCC9 (rs704180), and KCNMB2 (rs9637454). For controls, non-HS-Aging SNPs in APOE (rs429358/rs7412) and MAPT (rs8070723) were also analyzed against Aβ1-42 and total tau CSF analytes. The GRN risk SNP (rs5848) status correlated with variation in CSF proteins, with the risk allele (T) associated with increased levels of AXL Receptor Tyrosine Kinase (AXL), TNF-Related Apoptosis-Inducing Ligand Receptor 3 (TRAIL-R3), Vascular Cell Adhesion Molecule-1 (VCAM-1) and clusterin (CLU) (all p<0.05 after Bonferroni correction). The TRAIL-R3 correlation was significant in meta-analysis with an additional dataset (p=5.05×10^-5). Further, the rs5848 SNP status was associated with increased CSF tau protein - a marker of neurodegeneration (p=0.015). These data are remarkable since this GRN SNP has been found to be a risk factor for multiple types of dementia-related brain pathologies. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Clinical relevance of IL-6 gene polymorphism in severely injured patients

    PubMed Central

    Jeremić, Vasilije; Alempijević, Tamara; Mijatović, Srđan; Šijački, Ana; Dragašević, Sanja; Pavlović, Sonja; Miličić, Biljana; Krstić, Slobodan

    2014-01-01

    In polytrauma, injuries that could be treated surgically under regular circumstances become life-threatening because of a systemic inflammatory response. The inflammatory response involves a complex pattern of humoral and cellular responses, and the expression of related factors is thought to be governed by genetic variations. The aim of this paper is to examine the influence of the interleukin (IL) 6 single nucleotide polymorphisms (SNPs) -174G/C and -596G/A on the treatment outcome in severely injured patients. Forty-seven severely injured patients were included in this study. Patients were assigned an Injury Severity Score. Blood samples were drawn within 24 h after admission (designated day 1) and on subsequent days (24, 48, 72 hours and 7 days) of hospitalization. The IL-6 levels were determined by ELISA. Polymorphisms were analyzed by Polymerase Chain Reaction-Restriction Fragment Length Polymorphism (PCR-RFLP). Among subjects with different outcomes, no statistically relevant difference was found with regard to the IL-6 SNP -174G/C polymorphism. More than half of the subjects who died had the SNP -174G/C polymorphism, while this polymorphism was represented in a slightly lower number in survivors. The incidence of subjects without polymorphism and those with heterozygous and homozygous IL-6 SNP -596G/A polymorphism did not present statistically significant variations between survivors and those who died. The levels of IL-6 over the observation period did not present any statistically relevant difference among subjects without the IL-6 SNP -174 or IL-6 SNP -596 gene polymorphism and those who had either a heterozygous or a homozygous polymorphism. PMID:24856384

  11. Imputation of microsatellite alleles from dense SNP genotypes for parentage verification across multiple Bos taurus and Bos indicus breeds

    USDA-ARS's Scientific Manuscript database

    Microsatellite markers (MS) have traditionally been used for parental verification and are still the international standard in spite of their higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP) -based assays. Despite domestic and international demands fr...

  12. SEAN: SNP prediction and display program utilizing EST sequence clusters.

    PubMed

    Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

    2006-02-15

    SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.

  13. Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data.

    PubMed

    Chen, Guo-Bo; Lee, Sang Hong; Brion, Marie-Jo A; Montgomery, Grant W; Wray, Naomi R; Radford-Smith, Graham L; Visscher, Peter M

    2014-09-01

    As custom arrays are cheaper than generic GWAS arrays, larger sample size is achievable for gene discovery. Custom arrays can tag more variants through denser genotyping of SNPs at associated loci, but at the cost of losing genome-wide coverage. Balancing this trade-off is important for maximizing experimental designs. We quantified both the gain in captured SNP-heritability at known candidate regions and the loss due to imperfect genome-wide coverage for inflammatory bowel disease using immunochip (iChip) and imputed GWAS data on 61,251 and 38,550 samples, respectively. For Crohn's disease (CD), the iChip and GWAS data explained 19 and 26% of variation in liability, respectively, and SNPs in the densely genotyped iChip regions explained 13% of the SNP-heritability for both the iChip and GWAS data. For ulcerative colitis (UC), the iChip and GWAS data explained 15 and 19% of variation in liability, respectively, and the dense iChip regions explained 10 and 9% of the SNP-heritability in the iChip and the GWAS data. From bivariate analyses, estimates of the genetic correlation in risk between CD and UC were 0.75 (SE 0.017) and 0.62 (SE 0.042) for the iChip and GWAS data, respectively. We also quantified the SNP-heritability of genomic regions that did or did not contain the previous 163 GWAS hits for CD and UC, and SNP-heritability of the overlapping loci between the densely genotyped iChip regions and the 163 GWAS hits. For both diseases, over different genomic partitioning, the densely genotyped regions on the iChip tagged at least as much variation in liability as in the corresponding regions in the GWAS data; however, a certain amount of tagged SNP-heritability in the GWAS data was lost using the iChip due to the low coverage at unselected regions. These results imply that custom arrays with a GWAS backbone will facilitate more gene discovery, both at associated and novel loci. © The Author 2014. Published by Oxford University Press. All rights reserved.

  14. Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network

    PubMed Central

    Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.

    2017-01-01

    Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266

  15. Genetic analysis of QTL for eye cross and eye diameter in common carp (Cyprinus carpio L.) using microsatellites and SNPs.

    PubMed

    Jin, S B; Zhang, X F; Lu, J G; Fu, H T; Jia, Z Y; Sun, X W

    2015-04-17

    A group of 107 F1 hybrid common carp was used to construct a linkage map using JoinMap 4.0. A total of 4877 microsatellite and single nucleotide polymorphism (SNP) markers isolated from a genomic library (978 microsatellite and 3899 SNP markers) were assigned to construct the genetic map, which comprised 50 linkage groups. The total length of the linkage map for the common carp was 4775.90 cM with an average distance between markers of 0.98 cM. Ten quantitative trait loci (QTL) were associated with eye diameter, corresponding to 10.5-57.2% of the total phenotypic variation. Twenty QTL were related to eye cross, contributing to 10.8-36.9% of the total phenotypic variation. Two QTL for eye diameter and four QTL for eye cross each accounted for more than 20% of the total phenotypic variation and were considered to be major QTL. One growth factor related to eye diameter was observed on LG10 of the common carp genome, and three growth factors related to eye cross were observed on LG10, LG35, and LG44 of the common carp genome. The significant positive relationship of eye cross and eye diameter with other commercial traits suggests that eye diameter and eye cross can be used to assist in indirect selection for many commercial traits, particularly body weight. Thus, the growth factor for eye cross may also contribute to the growth of body weight, implying that aggregate breeding could have multiple effects. These findings provide information for future genetic studies and breeding of common carp.

  16. Overlap in genomic variation associated with milk fat composition in Holstein Friesian and Dutch native dual-purpose breeds.

    PubMed

    Maurice-Van Eijndhoven, M H T; Bovenhuis, H; Veerkamp, R F; Calus, M P L

    2015-09-01

    The aim of this study was to identify if genomic variations associated with fatty acid (FA) composition are similar between the Holstein-Friesian (HF) and native dual-purpose breeds used in the Dutch dairy industry. Phenotypic and genotypic information were available for the breeds Meuse-Rhine-Yssel (MRY), Dutch Friesian (DF), Groningen White Headed (GWH), and HF. First, the reliability of genomic breeding values of the native Dutch dual-purpose cattle breeds MRY, DF, and GWH was evaluated using single nucleotide polymorphism (SNP) effects estimated in HF, including all SNP or subsets with stronger associations in HF. Second, the genomic variation of the regions associated with FA composition in HF (regions on Bos taurus autosome 5, 14, and 26), were studied in the different breeds. Finally, similarities in genotype and allele frequencies between MRY, DF, GWH, and HF breeds were assessed for specific regions associated with FA composition. On average across the traits, the highest reliabilities of genomic prediction were estimated for GWH (0.158) and DF (0.116) when the 8 to 22 SNP with the strongest association in HF were included. With the same set of SNP, GEBV for MRY were the least reliable (0.022). This indicates that on average only 2 (MRY) to 16% (GWH) of the genomic variation in HF is shared with the native Dutch dual-purpose breeds. The comparison of predicted variances of different regions associated with milk and milk fat composition showed that breeds clearly differed in genomic variation within these regions. Finally, the correlations of allele frequencies between breeds across the 8 to 22 SNP with the strongest association in HF were around 0.8 between the Dutch native dual-purpose breeds, whereas the correlations between the native breeds and HF were clearly lower and around 0.5. 
There was no consistent relationship between the reliabilities of genomic prediction for a specific breed and the correlation between the allele frequencies of this breed and HF. In conclusion, most of the genomic variation associated with FA composition in the Dutch dual-purpose breeds appears to be breed-specific. Furthermore, the minor allele frequencies of genes having an effect on the milk FA composition in HF were shown to be much smaller in the breeds MRY, DF, and GWH, especially for the MRY breed. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  17. miRNA-Mediated Relationships between Cis-SNP Genotypes and Transcript Intensities in Lymphocyte Cell Lines

    PubMed Central

    Zhang, Wensheng; Edwards, Andrea; Zhu, Dongxiao; Flemington, Erik K.; Deininger, Prescott; Zhang, Kun

    2012-01-01

    In metazoans, miRNAs regulate gene expression primarily through binding to target sites in the 3′ UTRs (untranslated regions) of messenger RNAs (mRNAs). Cis-acting variants within, or close to, a gene are crucial in explaining the variability of gene expression measures. Single nucleotide polymorphisms (SNPs) in the 3′ UTRs of genes can affect the base-pairing between miRNAs and mRNAs, and hence disrupt existing target sites (in the reference sequence) or create novel target sites, suggesting a possible mechanism for cis regulation of gene expression. Moreover, because the alleles of different SNPs within a DNA sequence of limited length tend to be in strong linkage disequilibrium (LD), we hypothesize the variants of miRNA target sites caused by SNPs potentially function as bridges linking the documented cis-SNP markers to the expression of the associated genes. A large-scale analysis was herein performed to test this hypothesis. By systematically integrating multiple latest information sources, we found 21 significant gene-level SNP-involved miRNA-mediated post-transcriptional regulation modules (SNP-MPRMs) in the form of SNP-miRNA-mRNA triplets in lymphocyte cell lines for the CEU and YRI populations. Among the cognate genes, six including ALG8, DGKE, GNA12, KLF11, LRPAP1, and MMAB are related to multiple genetic diseases such as depressive disorder and Type-II diabetes. Furthermore, we found that ∼35% of the documented transcript intensity-related cis-SNPs (∼950) in a recent publication are identical to, or in significant linkage disequilibrium (LD) (p<0.01) with, one or multiple SNPs located in miRNA target sites. Based on these associations (or identities), 69 significant exon-level SNP-MPRMs and 12 disease genes were further determined for two populations. These results provide concrete in silico evidence for the proposed hypothesis. The discovered modules warrant additional follow-up in independent laboratory studies. PMID:22348086

  18. Haplotype-based approach to known MS-associated regions increases the amount of explained risk

    PubMed Central

    Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S

    2015-01-01

    Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
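
    Nagelkerke's R-squared, the measure quoted above, rescales the Cox-Snell pseudo-R-squared so that its maximum is 1. The following minimal sketch computes it from the log-likelihoods of a null (intercept-only) and a fitted logistic model. The function name and inputs are illustrative assumptions, and no values from the study are reproduced.

```python
import math


def nagelkerke_r2(ll_null, ll_model, n):
    """Nagelkerke's pseudo-R-squared from two log-likelihoods and the sample size n."""
    # Cox-Snell R^2: 1 - (L_null / L_model)^(2/n), written with log-likelihoods.
    cox_snell = 1.0 - math.exp(2.0 * (ll_null - ll_model) / n)
    # Upper bound of Cox-Snell R^2, reached when the fitted model is perfect (ll_model = 0).
    max_cox_snell = 1.0 - math.exp(2.0 * ll_null / n)
    return cox_snell / max_cox_snell
```

    The value is 0 when the fitted model is no better than the null model and 1 when the fitted model predicts every outcome perfectly.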

  19. An imputed genotype resource for the laboratory mouse

    PubMed Central

    Szatkiewicz, Jin P.; Beane, Glen L.; Ding, Yueming; Hutchins, Lucie; de Villena, Fernando Pardo-Manuel; Churchill, Gary A.

    2009-01-01

    We have created a high-density SNP resource encompassing 7.87 million polymorphic loci across 49 inbred mouse strains of the laboratory mouse by combining data available from public databases and training a hidden Markov model to impute missing genotypes in the combined data. The strong linkage disequilibrium found in dense sets of SNP markers in the laboratory mouse provides the basis for accurate imputation. Using genotypes from eight independent SNP resources, we empirically validated the quality of the imputed genotypes and demonstrate that they are highly reliable for most inbred strains. The imputed SNP resource will be useful for studies of natural variation and complex traits. It will facilitate association study designs by providing high density SNP genotypes for large numbers of mouse strains. We anticipate that this resource will continue to evolve as new genotype data become available for laboratory mouse strains. The data are available for bulk download or query at http://cgd.jax.org/. PMID:18301946

  20. Two-phase designs for joint quantitative-trait-dependent and genotype-dependent sampling in post-GWAS regional sequencing.

    PubMed

    Espin-Garcia, Osvaldo; Craiu, Radu V; Bull, Shelley B

    2018-02-01

    We evaluate two-phase designs to follow-up findings from genome-wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc.

  1. Electronic and spectroscopic characterizations of SNP isomers

    NASA Astrophysics Data System (ADS)

    Trabelsi, Tarek; Al Mogren, Muneerah Mogren; Hochlaf, Majdi; Francisco, Joseph S.

    2018-02-01

    High-level ab initio electronic structure calculations were performed to characterize SNP isomers. In addition to the known linear SNP, cyc-PSN, and linear SPN isomers, we identified a fourth isomer, linear PSN, which is located ~2.4 eV above the linear SNP isomer. The low-lying singlet and triplet electronic states of the linear SNP and SPN isomers were investigated using a multi-reference configuration interaction method and large basis set. Several bound electronic states were identified. However, their upper rovibrational levels were predicted to pre-dissociate, leading to S + PN, P + NS products, and multi-step pathways were discovered. For the ground states, a set of spectroscopic parameters were derived using standard and explicitly correlated coupled-cluster methods in conjunction with augmented correlation-consistent basis sets extrapolated to the complete basis set limit. We also considered scalar and core-valence effects. For linear isomers, the rovibrational spectra were deduced after generation of their 3D-potential energy surfaces along the stretching and bending coordinates and variational treatments of the nuclear motions.

  2. Two combinatorial optimization problems for SNP discovery using base-specific cleavage and mass spectrometry.

    PubMed

    Chen, Xin; Wu, Qiong; Sun, Ruimin; Zhang, Louxin

    2012-01-01

    The discovery of single-nucleotide polymorphisms (SNPs) has important implications in a variety of genetic studies on human diseases and biological functions. One valuable approach proposed for SNP discovery is based on base-specific cleavage and mass spectrometry. However, it is still very challenging to achieve the full potential of this SNP discovery approach. In this study, we formulate two new combinatorial optimization problems. While both problems are aimed at reconstructing the sample sequence that would attain the minimum number of SNPs, they search over different candidate sequence spaces. The first problem, denoted as SNP-MSP, limits its search to sequences whose in silico predicted mass spectra have all their signals contained in the measured mass spectra. In contrast, the second problem, denoted as SNP-MSQ, limits its search to sequences whose in silico predicted mass spectra instead contain all the signals of the measured mass spectra. We present an exact dynamic programming algorithm for solving the SNP-MSP problem and also show that the SNP-MSQ problem is NP-hard by a reduction from a restricted variation of the 3-partition problem. We believe that an efficient solution to either problem above could offer a seamless integration of information in four complementary base-specific cleavage reactions, thereby improving the capability of the underlying biotechnology for sensitive and accurate SNP discovery.

  3. Fine-scaled human genetic structure revealed by SNP microarrays.

    PubMed

    Xing, Jinchuan; Watkins, W Scott; Witherspoon, David J; Zhang, Yuhua; Guthery, Stephen L; Thara, Rangaswamy; Mowry, Bryan J; Bulayeva, Kazima; Weiss, Robert B; Jorde, Lynn B

    2009-05-01

    We report an analysis of more than 240,000 loci genotyped using the Affymetrix SNP microarray in 554 individuals from 27 worldwide populations in Africa, Asia, and Europe. To provide a more extensive and complete sampling of human genetic variation, we have included caste and tribal samples from two states in South India, Daghestanis from eastern Europe, and the Iban from Malaysia. Consistent with observations made by Charles Darwin, our results highlight shared variation among human populations and demonstrate that much genetic variation is geographically continuous. At the same time, principal components analyses reveal discernible genetic differentiation among almost all identified populations in our sample, and in most cases, individuals can be clearly assigned to defined populations on the basis of SNP genotypes. All individuals are accurately classified into continental groups using a model-based clustering algorithm, but between closely related populations, genetic and self-classifications conflict for some individuals. The 250K data permitted high-level resolution of genetic variation among Indian caste and tribal populations and between highland and lowland Daghestani populations. In particular, upper-caste individuals from Tamil Nadu and Andhra Pradesh form one defined group, lower-caste individuals from these two states form another, and the tribal Irula samples form a third. Our results emphasize the correlation of genetic and geographic distances and highlight other elements, including social factors that have contributed to population structure.

  4. GWAS of human bitter taste perception identifies new loci and reveals additional complexity of bitter taste genetics.

    PubMed

    Ledda, Mirko; Kutalik, Zoltán; Souza Destito, Maria C; Souza, Milena M; Cirillo, Cintia A; Zamboni, Amabilene; Martin, Nathalie; Morya, Edgard; Sameshima, Koichi; Beckmann, Jacques S; le Coutre, Johannes; Bergmann, Sven; Genick, Ulrich K

    2014-01-01

    Human perception of bitterness displays pronounced interindividual variation. This phenotypic variation is mirrored by equally pronounced genetic variation in the family of bitter taste receptor genes. To better understand the effects of common genetic variations on human bitter taste perception, we conducted a genome-wide association study on a discovery panel of 504 subjects and a validation panel of 104 subjects from the general population of São Paulo in Brazil. Correction for general taste-sensitivity allowed us to identify a SNP in the cluster of bitter taste receptors on chr12 (10.88-11.24 Mb, build 36.1) significantly associated (best SNP: rs2708377, P = 5.31 × 10^-13, r^2 = 8.9%, β = -0.12, s.e. = 0.016) with the perceived bitterness of caffeine. This association overlaps with, but is statistically distinct from, the previously identified SNP rs10772420 influencing the perception of quinine bitterness that falls in the same bitter taste cluster. We replicated this association to quinine perception (P = 4.97 × 10^-37, r^2 = 23.2%, β = 0.25, s.e. = 0.020) and additionally found the effect of this genetic locus to be concentration specific with a strong impact on the perception of low, but no impact on the perception of high concentrations of quinine. Our study, thus, furthers our understanding of the complex genetic architecture of bitter taste perception.

  5. PGen: large-scale genomic variations analysis workflow and browser in SoyKB.

    PubMed

    Liu, Yang; Khan, Saad M; Wang, Juexin; Rynge, Mats; Zhang, Yuanxun; Zeng, Shuai; Chen, Shiyuan; Maldonado Dos Santos, Joao V; Valliyodan, Babu; Calyam, Prasad P; Merchant, Nirav; Nguyen, Henry T; Xu, Dong; Joshi, Trupti

    2016-10-06

    With the advances in next-generation sequencing (NGS) technology and significant reductions in sequencing costs, it is now possible to sequence large collections of germplasm in crops for detecting genome-scale genetic variations and to apply the knowledge towards improvements in traits. To efficiently facilitate large-scale NGS resequencing data analysis of genomic variations, we have developed "PGen", an integrated and optimized workflow using the Extreme Science and Engineering Discovery Environment (XSEDE) high-performance computing (HPC) virtual system, iPlant cloud data storage resources and Pegasus workflow management system (Pegasus-WMS). The workflow allows users to identify single nucleotide polymorphisms (SNPs) and insertion-deletions (indels), perform SNP annotations and conduct copy number variation analyses on multiple resequencing datasets in a user-friendly and seamless way. We have developed both a Linux version in GitHub ( https://github.com/pegasus-isi/PGen-GenomicVariations-Workflow ) and a web-based implementation of the PGen workflow integrated within the Soybean Knowledge Base (SoyKB), ( http://soykb.org/Pegasus/index.php ). Using PGen, we identified 10,218,140 single-nucleotide polymorphisms (SNPs) and 1,398,982 indels from analysis of 106 soybean lines sequenced at 15X coverage. 297,245 non-synonymous SNPs and 3330 copy number variation (CNV) regions were identified from this analysis. SNPs identified using PGen from additional soybean resequencing projects adding to 500+ soybean germplasm lines in total have been integrated. These SNPs are being utilized for trait improvement using genotype to phenotype prediction approaches developed in-house. In order to browse and access NGS data easily, we have also developed an NGS resequencing data browser ( http://soykb.org/NGS_Resequence/NGS_index.php ) within SoyKB to provide easy access to SNP and downstream analysis results for soybean researchers. 
The PGen workflow has been optimized for efficient analysis of soybean data through thorough testing and validation. This research serves as an example of best practices for development of genomics data analysis workflows by integrating remote HPC resources and efficient data management with ease of use for biological users. The PGen workflow can also be easily customized for analysis of data in other species.

  6. Common genetic variation and novel loci associated with volumetric mammographic density.

    PubMed

    Brand, Judith S; Humphreys, Keith; Li, Jingmei; Karlsson, Robert; Hall, Per; Czene, Kamila

    2018-04-17

    Mammographic density (MD) is a strong and heritable intermediate phenotype of breast cancer, but much of its genetic variation remains unexplained. We conducted a genetic association study of volumetric MD in a Swedish mammography screening cohort (n = 9498) to identify novel MD loci. Associations with volumetric MD phenotypes (percent dense volume, absolute dense volume, and absolute nondense volume) were estimated using linear regression adjusting for age, body mass index, menopausal status, and six principal components. We also estimated the proportion of MD variance explained by additive contributions from single-nucleotide polymorphisms (SNP-based heritability [h^2_SNP]) in 4948 participants of the cohort. In total, three novel MD loci were identified (at P < 5 × 10^-8): one for percent dense volume (HABP2) and two for the absolute dense volume (INHBB, LINC01483). INHBB is an established locus for ER-negative breast cancer, and HABP2 and LINC01483 represent putative new breast cancer susceptibility loci, because both loci were associated with breast cancer in available meta-analysis data including 122,977 breast cancer cases and 105,974 control subjects (P < 0.05). h^2_SNP (SE) estimates for percent dense, absolute dense, and nondense volume were 0.29 (0.07), 0.31 (0.07), and 0.25 (0.07), respectively. Corresponding ratios of h^2_SNP to previously observed narrow-sense h^2 estimates in the same cohort were 0.46, 0.72, and 0.41, respectively. These findings provide new insights into the genetic basis of MD and biological mechanisms linking MD to breast cancer risk. Apart from identifying three novel loci, we demonstrate that at least 25% of the MD variance is explained by common genetic variation with h^2_SNP/h^2 ratios varying between dense and nondense MD components.
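
    The arithmetic behind the heritability figures in this abstract can be checked with a short sketch. It takes the three SNP-based heritability estimates and the three reported ratios, and back-calculates the narrow-sense heritability each ratio implies. The implied values are an illustration derived from rounded figures, not numbers reported in the abstract, and the variable names are hypothetical.

```python
# SNP-based heritability estimates and ratios as quoted in the abstract.
h2_snp = {
    "percent dense volume": 0.29,
    "absolute dense volume": 0.31,
    "absolute nondense volume": 0.25,
}
ratio_to_narrow_sense = {
    "percent dense volume": 0.46,
    "absolute dense volume": 0.72,
    "absolute nondense volume": 0.41,
}

# ratio = h2_snp / h2, so the implied narrow-sense h2 is h2_snp / ratio.
# These are back-calculations from rounded inputs (an assumption of this
# sketch), so they only approximate the previously observed estimates.
implied_h2 = {
    trait: h2_snp[trait] / ratio_to_narrow_sense[trait] for trait in h2_snp
}

# The "at least 25%" statement corresponds to the smallest h2_snp estimate.
minimum_explained = min(h2_snp.values())

for trait, value in implied_h2.items():
    print(f"{trait}: implied narrow-sense h2 ~ {value:.2f}")
print(f"minimum SNP-based heritability: {minimum_explained:.2f}")
```

    The smallest of the three SNP-based estimates (0.25, for nondense volume) is what supports the closing statement that at least 25% of MD variance is explained by common variation.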

  7. Common colorectal cancer risk alleles contribute to the multiple colorectal adenoma phenotype, but do not influence colonic polyposis in FAP.

    PubMed

    Cheng, Timothy H T; Gorman, Maggie; Martin, Lynn; Barclay, Ella; Casey, Graham; Saunders, Brian; Thomas, Huw; Clark, Sue; Tomlinson, Ian

    2015-02-01

    The presence of multiple (5-100) colorectal adenomas suggests an inherited predisposition, but the genetic aetiology of this phenotype is undetermined if patients test negative for Mendelian polyposis syndromes such as familial adenomatous polyposis (FAP) and MUTYH-associated polyposis (MAP). We investigated whether 18 common colorectal cancer (CRC) predisposition single-nucleotide polymorphisms (SNPs) could help to explain some cases with multiple adenomas who phenocopied FAP or MAP, but had no pathogenic APC or MUTYH variant. No multiple adenoma case had an outlying number of CRC SNP risk alleles, but multiple adenoma patients did have a significantly higher number of risk alleles than population controls (P = 5.7 × 10^-7). The association was stronger in those with ≥10 adenomas. The CRC SNPs accounted for 4.3% of the variation in multiple adenoma risk, with three SNPs (rs6983267, rs10795668, rs3802842) explaining 3.0% of the variation. In FAP patients, the CRC risk score did not differ significantly from the controls, as we expected given the overwhelming effect of pathogenic germline APC variants on the phenotype of these cases. More unexpectedly, we found no evidence that the CRC SNPs act as modifier genes for the number of colorectal adenomas in FAP patients. In conclusion, common colorectal tumour risk alleles contribute to the development of multiple adenomas in patients without pathogenic germline APC or MUTYH variants. This phenotype may have 'polygenic' or monogenic origins. The risk of CRC in relatives of multiple adenoma cases is probably much lower for cases with polygenic disease, and this should be taken into account when counselling such patients.

  8. SNP-Based QTL Mapping of 15 Complex Traits in Barley under Rain-Fed and Well-Watered Conditions by a Mixed Modeling Approach.

    PubMed

    Mora, Freddy; Quitral, Yerko A; Matus, Ivan; Russell, Joanne; Waugh, Robbie; Del Pozo, Alejandro

    2016-01-01

    This study identified single nucleotide polymorphism (SNP) markers associated with 15 complex traits in a breeding population of barley (Hordeum vulgare L.) consisting of 137 recombinant chromosome substitution lines (RCSL), evaluated under contrasting water availability conditions in the Mediterranean climatic region of central Chile. Given that markers showed a very strong segregation distortion, a quantitative trait locus/loci (QTL) mapping mixed model was used to account for the heterogeneity in genetic relatedness between genotypes. Fifty-seven QTL were detected under rain-fed conditions, which accounted for 5-22% of the phenotypic variation. In full irrigation conditions, 84 SNPs were significantly associated with the traits studied, explaining 5-35% of phenotypic variation. Most of the QTL were co-localized on chromosomes 2H and 3H. Environment-specific genomic regions were detected for 12 of the 15 traits scored. Although most QTL-trait associations were environment and trait specific, some important and stable associations were also detected. In full irrigation conditions, a relatively major genomic region was found underlying hectoliter weight (HW), on chromosome 1H, which explained between 27% (SNP 2711-234) and 35% (SNP 1923-265) of the phenotypic variation. Interestingly, the locus 1923-265 was also detected for grain yield at both environmental conditions, accounting for 9 and 18%, in the rain-fed and irrigation conditions, respectively. Analysis of QTL in this breeding population identified significant genomic regions that can be used for marker-assisted selection (MAS) of barley in areas where drought is a significant constraint.

  9. Population-genetic nature of copy number variations in the human genome.

    PubMed

    Kato, Mamoru; Kawaguchi, Takahisa; Ishikawa, Shumpei; Umeda, Takayoshi; Nakamichi, Reiichiro; Shapero, Michael H; Jones, Keith W; Nakamura, Yusuke; Aburatani, Hiroyuki; Tsunoda, Tatsuhiko

    2010-03-01

    Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.

  10. SNP-Based QTL Mapping of 15 Complex Traits in Barley under Rain-Fed and Well-Watered Conditions by a Mixed Modeling Approach

    PubMed Central

    Mora, Freddy; Quitral, Yerko A.; Matus, Ivan; Russell, Joanne; Waugh, Robbie; del Pozo, Alejandro

    2016-01-01

    This study identified single nucleotide polymorphism (SNP) markers associated with 15 complex traits in a breeding population of barley (Hordeum vulgare L.) consisting of 137 recombinant chromosome substitution lines (RCSL), evaluated under contrasting water availability conditions in the Mediterranean climatic region of central Chile. Given that markers showed a very strong segregation distortion, a quantitative trait locus/loci (QTL) mapping mixed model was used to account for the heterogeneity in genetic relatedness between genotypes. Fifty-seven QTL were detected under rain-fed conditions, which accounted for 5–22% of the phenotypic variation. In full irrigation conditions, 84 SNPs were significantly associated with the traits studied, explaining 5–35% of phenotypic variation. Most of the QTL were co-localized on chromosomes 2H and 3H. Environment-specific genomic regions were detected for 12 of the 15 traits scored. Although most QTL-trait associations were environment and trait specific, some important and stable associations were also detected. In full irrigation conditions, a relatively major genomic region was found underlying hectoliter weight (HW), on chromosome 1H, which explained between 27% (SNP 2711-234) and 35% (SNP 1923-265) of the phenotypic variation. Interestingly, the locus 1923-265 was also detected for grain yield at both environmental conditions, accounting for 9 and 18%, in the rain-fed and irrigation conditions, respectively. Analysis of QTL in this breeding population identified significant genomic regions that can be used for marker-assisted selection (MAS) of barley in areas where drought is a significant constraint. PMID:27446139

  11. Association of udder traits with single nucleotide polymorphisms in crossbred Bos indicus-Bos taurus cows.

    PubMed

    Tolleson, M W; Gill, C A; Herring, A D; Riggs, P K; Sawyer, J E; Sanders, J O; Riley, D G

    2017-06-01

    The size, support, and health of udders limit the productive life of beef cows, especially those with Bos indicus background, because, in general, such cows have a reputation for problems with udders. Genomic association studies of bovine udder traits have been conducted in dairy cattle and recently in Continental European beef breeds but not in cows with Bos indicus background. The objective of this study was to determine associations of SNP and udder support scores, teat length, and teat diameter in half Bos indicus (Nellore), half Bos taurus (Angus) cows. Udders of cows (n = 295) born from 2003 to 2007 were evaluated for udder support and teat length and diameter (n = 1,746 records) from 2005 through 2014. These included a subjective score representing udder support (values of 1 indicated poorly supported, pendulous udders and values of 9 indicated very well-supported udders) and lengths and diameters of individual teats in the 4 udder quarters as well as the average. Cows were in full-sibling or half-sibling families. Residuals for each trait were produced from repeated records models with cow age category nested within birth year of cows. Those residuals were averaged to become the dependent variables for genomewide association analyses. Regression analyses of those dependent variables included genotypic values as explanatory variables for 34,980 SNP from a commercially available array and included the genomic relationship matrix. Fifteen SNP loci on BTA 5 were associated (false discovery rate controlled at 0.05) with udder support score. One of those was also detected as associated with average teat diameter. Three of those 15 SNP were located within genes, including one each in (), (), and (). These are notable for their functional role in some aspect of mammary gland formation or health. Other candidate genes for these traits in the vicinity of the SNP loci include () and ().
Because these were detected in Nellore-Angus crossbred cows, which typically have very well-formed udders with excellent support across their productive lives, similar efforts in other breeds should be completed, because that may facilitate further refinement of genomic regions responsible for variation in udder traits important in multiple breeds.

  12. Human Variation in Short Regions Predisposed to Deep Evolutionary Conservation

    PubMed Central

    Loots, Gabriela G.; Ovcharenko, Ivan

    2010-01-01

    The landscape of the human genome consists of millions of short islands of conservation that are 100% conserved across multiple vertebrate genomes (termed “bricks”), the majority of which are located in noncoding regions. Several hundred thousand bricks are deeply conserved reaching the genomes of amphibians and fish. Deep phylogenetic conservation of noncoding DNA has been reported to be strongly associated with the presence of gene regulatory elements, introducing bricks as a proxy to the functional noncoding landscape of the human genome. Here, we report a significant overrepresentation of bricks in the promoters of transcription factors and developmental genes, where the high level of phylogenetic conservation correlates with an increase in brick overrepresentation. We also found that the presence of a brick dictates a predisposition to evolutionary constraint, with only 0.7% of the amniota brick central nucleotides being diverged within the primate lineage—an 11-fold reduction in the divergence rate compared with random expectation. Human single-nucleotide polymorphism (SNP) data explains only 3% of primate-specific variation in amniota bricks, thus arguing for a widespread fixation of brick mutations within the primate lineage and prior to human radiation. This variation, in turn, might have been utilized as a driving force for primate- and hominoid-specific adaptation. We also discovered a pronounced deviation from the evolutionary predisposition in the human lineage, with over 20-fold increase in the substitution rate at brick SNP sites over expected values. In addition, contrary to typical brick mutations, brick variation commonly encountered in the human population displays limited, if any, signatures of negative selection as measured by the minor allele frequency and population differentiation (F-statistical measure) measures. 
These observations argue for the plasticity of gene regulatory mechanisms in vertebrates—with evidence of strong purifying selection acting on the gene regulatory landscape of the human genome, where widespread advantageous mutations in putative regulatory elements are likely utilized in functional diversification and adaptation of species. PMID:20093432

  13. Real-Time PCR Typing of Escherichia coli Based on Multiple Single Nucleotide Polymorphisms--a Convenient and Rapid Method.

    PubMed

    Lager, Malin; Mernelius, Sara; Löfgren, Sture; Söderman, Jan

    2016-01-01

    Healthcare-associated infections caused by Escherichia coli and antibiotic resistance due to extended-spectrum beta-lactamase (ESBL) production constitute a threat against patient safety. To identify, track, and control outbreaks and to detect emerging virulent clones, typing tools of sufficient discriminatory power that generate reproducible and unambiguous data are needed. A probe-based real-time PCR method targeting multiple single nucleotide polymorphisms (SNP) was developed. The method was based on the multi-locus sequence typing scheme of Institut Pasteur and on adaptation of previously described typing assays. An 8-SNP panel that reached a Simpson's diversity index of 0.95 was established, based on analysis of sporadic E. coli cases (ESBL n = 27 and non-ESBL n = 53). This multi-SNP assay was used to identify the sequence type 131 (ST131) complex according to Achtman's multi-locus sequence typing scheme. However, it did not fully discriminate within the complex but provided a diagnostic signature that outperformed a previously described detection assay. Pulsed-field gel electrophoresis typing of isolates from a presumed outbreak (n = 22) identified two outbreaks (ST127 and ST131) and three different non-outbreak-related isolates. Multi-SNP typing generated congruent data except for one non-outbreak-related ST131 isolate. We consider multi-SNP real-time PCR typing an accessible primary generic E. coli typing tool for rapid and uniform type identification.
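
    As a rough illustration of the discriminatory-power figure quoted in this abstract, the sketch below computes Simpson's index of diversity for a set of typing results. It assumes the Hunter-Gaston form commonly used for typing methods, D = 1 - sum(n_j(n_j - 1)) / (N(N - 1)); the abstract does not say which form the authors applied, and the isolate labels are invented for the example.

```python
from collections import Counter


def simpsons_diversity(type_labels):
    """Simpson's index of diversity (Hunter-Gaston form, assumed here).

    type_labels: one typing result (e.g. a SNP profile string) per isolate.
    Returns the probability that two isolates drawn without replacement
    belong to different types.
    """
    n_isolates = len(type_labels)
    if n_isolates < 2:
        raise ValueError("at least two isolates are required")
    counts = Counter(type_labels)
    same_type_pairs = sum(c * (c - 1) for c in counts.values())
    return 1.0 - same_type_pairs / (n_isolates * (n_isolates - 1))


# Hypothetical SNP profiles for seven isolates (not data from the study).
profiles = ["A", "A", "B", "C", "C", "C", "D"]
diversity = simpsons_diversity(profiles)
print(f"Simpson's diversity index: {diversity:.3f}")
```

    An index close to 1 means that two randomly chosen unrelated isolates are very likely to receive different types, which is the sense in which a value of 0.95 indicates good discrimination.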

  14. Novel applications of array comparative genomic hybridization in molecular diagnostics.

    PubMed

    Cheung, Sau W; Bi, Weimin

    2018-05-31

    In 2004, the implementation of array comparative genomic hybridization (array CGH) into clinical practice marked a new milestone for genetic diagnosis. Array CGH and single-nucleotide polymorphism (SNP) arrays enable genome-wide detection of copy number changes at high resolution, and therefore microarray has been recognized as the first-tier test for patients with intellectual disability or multiple congenital anomalies, and has also been applied prenatally for detection of clinically relevant copy number variations in the fetus. Areas covered: In this review, the authors summarize the evolution of array CGH technology from their diagnostic laboratory, highlighting exonic SNP arrays developed in the past decade which detect small intragenic copy number changes as well as large DNA segments for the region of heterozygosity. The applications of array CGH to human diseases with different modes of inheritance with the emphasis on autosomal recessive disorders are discussed. Expert commentary: An exonic array is a powerful and highly efficient clinical tool for detecting genome-wide small copy number variants in both dominant and recessive disorders. However, whole-genome sequencing may become the single integrated platform for detection of copy number changes, single-nucleotide changes as well as balanced chromosomal rearrangements in the near future.

  15. Development and application of a novel genome-wide SNP array reveals domestication history in soybean

    PubMed Central

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-01-01

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884

  16. Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

    PubMed

    Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

    2016-02-09

    Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.

  17. Single-nucleotide polymorphism-gene intermixed networking reveals co-linkers connected to multiple gene expression phenotypes

    PubMed Central

    Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia

    2007-01-01

    Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544
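
    The final step described in this abstract, a gene-gene network whose edge weights are the number of shared SNP co-linkers, can be sketched as follows. This is a minimal illustration that assumes the gene-SNP links have already been established (the abstract uses Haseman-Elston sib-pair regression for that step, which is not reproduced here); the function name and the gene and SNP identifiers are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def gene_gene_network(gene_snp_links):
    """Collapse gene-SNP links into weighted gene-gene edges.

    gene_snp_links: iterable of (gene, snp) pairs, each meaning the gene's
    expression trait is linked to that SNP.
    Returns {(gene_a, gene_b): number of SNP co-linkers shared}.
    """
    snp_to_genes = defaultdict(set)
    for gene, snp in gene_snp_links:
        snp_to_genes[snp].add(gene)

    edge_weights = defaultdict(int)
    for genes in snp_to_genes.values():
        # Every pair of genes linked to the same SNP shares one co-linker.
        for gene_a, gene_b in combinations(sorted(genes), 2):
            edge_weights[(gene_a, gene_b)] += 1
    return dict(edge_weights)


# Hypothetical links: g1 and g2 share two SNPs, g3 shares one with each.
links = [("g1", "s1"), ("g2", "s1"), ("g1", "s2"), ("g2", "s2"), ("g3", "s2")]
network = gene_gene_network(links)
print(network)
```

    Sorting each gene set before pairing keeps every edge key in a single canonical order, so the same pair is never counted under two different keys.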

  18. Haplotype diversity in 11 candidate genes across four populations.

    PubMed

    Beaty, T H; Fallin, M D; Hetmanski, J B; McIntosh, I; Chong, S S; Ingersoll, R; Sheng, X; Chakraborty, R; Scott, A F

    2005-09-01

    Analysis of haplotypes based on multiple single-nucleotide polymorphisms (SNP) is becoming common for both candidate gene and fine-mapping studies. Before embarking on studies of haplotypes from genetically distinct populations, however, it is important to consider variation both in linkage disequilibrium (LD) and in haplotype frequencies within and across populations, as both vary. Such diversity will influence the choice of "tagging" SNPs for candidate gene or whole-genome association studies because some markers will not be polymorphic in all samples and some haplotypes will be poorly represented or completely absent. Here we analyze 11 genes, originally chosen as candidate genes for oral clefts, where multiple markers were genotyped on individuals from four populations. Estimated haplotype frequencies, measures of pairwise LD, and genetic diversity were computed for 135 European-Americans, 57 Chinese-Singaporeans, 45 Malay-Singaporeans, and 46 Indian-Singaporeans. Patterns of pairwise LD were compared across these four populations and haplotype frequencies were used to assess genetic variation. Although these populations are fairly similar in allele frequencies and overall patterns of LD, both haplotype frequencies and genetic diversity varied significantly across populations. Such haplotype diversity has implications for designing studies of association involving samples from genetically distinct populations.
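
    The pairwise LD measures mentioned in this abstract can be derived directly from estimated haplotype frequencies. The sketch below computes D, D' and r^2 for two biallelic SNPs using the standard textbook definitions; the abstract does not state which LD statistic the authors reported, and the haplotype frequencies are invented for illustration.

```python
def pairwise_ld(hap_freqs):
    """Return (D, D', r^2) for two biallelic SNPs.

    hap_freqs: dict with keys "AB", "Ab", "aB", "ab" holding the four
    haplotype frequencies, which must sum to 1.
    """
    if abs(sum(hap_freqs.values()) - 1.0) > 1e-6:
        raise ValueError("haplotype frequencies must sum to 1")

    p_a = hap_freqs["AB"] + hap_freqs["Ab"]  # frequency of allele A
    p_b = hap_freqs["AB"] + hap_freqs["aB"]  # frequency of allele B
    variance_term = p_a * (1 - p_a) * p_b * (1 - p_b)
    if variance_term <= 0:
        raise ValueError("both SNPs must be polymorphic")

    d = hap_freqs["AB"] - p_a * p_b
    # D' scales D by its maximum possible magnitude given allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    return d, d / d_max, d * d / variance_term


# Hypothetical haplotype frequencies for one SNP pair in one population.
d, d_prime, r_squared = pairwise_ld({"AB": 0.5, "Ab": 0.1, "aB": 0.1, "ab": 0.3})
print(f"D = {d:.3f}, D' = {d_prime:.3f}, r^2 = {r_squared:.3f}")
```

    The monomorphic-marker check mirrors the point made in the abstract: a SNP that is not polymorphic in a given sample carries no LD information there, so it cannot serve as a tagging SNP for that population.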

  19. Multiple novel prostate cancer susceptibility signals identified by fine-mapping of known risk loci among Europeans.

    PubMed

    Amin Al Olama, Ali; Dadaev, Tokhir; Hazelett, Dennis J; Li, Qiuyan; Leongamornlert, Daniel; Saunders, Edward J; Stephens, Sarah; Cieza-Borrella, Clara; Whitmore, Ian; Benlloch Garcia, Sara; Giles, Graham G; Southey, Melissa C; Fitzgerald, Liesel; Gronberg, Henrik; Wiklund, Fredrik; Aly, Markus; Henderson, Brian E; Schumacher, Fredrick; Haiman, Christopher A; Schleutker, Johanna; Wahlfors, Tiina; Tammela, Teuvo L; Nordestgaard, Børge G; Key, Tim J; Travis, Ruth C; Neal, David E; Donovan, Jenny L; Hamdy, Freddie C; Pharoah, Paul; Pashayan, Nora; Khaw, Kay-Tee; Stanford, Janet L; Thibodeau, Stephen N; Mcdonnell, Shannon K; Schaid, Daniel J; Maier, Christiane; Vogel, Walther; Luedeke, Manuel; Herkommer, Kathleen; Kibel, Adam S; Cybulski, Cezary; Wokołorczyk, Dominika; Kluzniak, Wojciech; Cannon-Albright, Lisa; Brenner, Hermann; Butterbach, Katja; Arndt, Volker; Park, Jong Y; Sellers, Thomas; Lin, Hui-Yi; Slavov, Chavdar; Kaneva, Radka; Mitev, Vanio; Batra, Jyotsna; Clements, Judith A; Spurdle, Amanda; Teixeira, Manuel R; Paulo, Paula; Maia, Sofia; Pandha, Hardev; Michael, Agnieszka; Kierzek, Andrzej; Govindasami, Koveela; Guy, Michelle; Lophatonanon, Artitaya; Muir, Kenneth; Viñuela, Ana; Brown, Andrew A; Freedman, Mathew; Conti, David V; Easton, Douglas; Coetzee, Gerhard A; Eeles, Rosalind A; Kote-Jarai, Zsofia

    2015-10-01

    Genome-wide association studies (GWAS) have identified numerous common prostate cancer (PrCa) susceptibility loci. We have fine-mapped 64 GWAS regions known at the conclusion of the iCOGS study using large-scale genotyping and imputation in 25 723 PrCa cases and 26 274 controls of European ancestry. We detected evidence for multiple independent signals at 16 regions, 12 of which contained additional newly identified significant associations. A single signal comprising a spectrum of correlated variation was observed at 39 regions; 35 of which are now described by a novel more significantly associated lead SNP, while the originally reported variant remained as the lead SNP only in 4 regions. We also confirmed two association signals in Europeans that had been previously reported only in East-Asian GWAS. Based on statistical evidence and linkage disequilibrium (LD) structure, we have curated and narrowed down the list of the most likely candidate causal variants for each region. Functional annotation using data from ENCODE filtered for PrCa cell lines and eQTL analysis demonstrated significant enrichment for overlap with bio-features within this set. By incorporating the novel risk variants identified here alongside the refined data for existing association signals, we estimate that these loci now explain ∼38.9% of the familial relative risk of PrCa, an 8.9% improvement over the previously reported GWAS tag SNPs. This suggests that a significant fraction of the heritability of PrCa may have been hidden during the discovery phase of GWAS, in particular due to the presence of multiple independent signals within the same region. © The Author 2015. Published by Oxford University Press.

  20. Detection of genetic association and functional polymorphisms of UGDH affecting milk production trait in Chinese Holstein cattle.

    PubMed

    Xu, Qing; Mei, Gui; Sun, Dongxiao; Zhang, Qin; Zhang, Yuan; Yin, Cengceng; Chen, Huiyong; Ding, Xiangdong; Liu, Jianfeng

    2012-11-02

    We previously localized a quantitative trait locus (QTL) on bovine chromosome 6 affecting milk production traits to a 1.5-Mb region between BMS483 and MNB-209 via genome scanning followed by fine mapping. In total, 15 genes were mapped within this linkage region through bioinformatic analysis of the cattle-human comparative map and bovine genome assembly. Of these, UDP-glucose dehydrogenase (UGDH) was suggested as a potential positional candidate gene for milk production traits based on its physiological and biochemical functions and genetic effects. By sequencing all the coding exons and the untranslated regions of UGDH in pooled DNA from 8 sires representing the separate families detected in our previous studies, a total of ten SNPs were identified and genotyped in 1417 Holstein cows of the 8 families. Individual SNP-based association analysis revealed 4 significant associations of SNP Ex1-1, SNP Int3-1, SNP Int5-1, and SNP Ex12-3 with milk yield (P < 0.05), and 2 significant associations of SNP Ex1-1 and SNP Ex12-3 with protein yield (P < 0.05). Furthermore, our haplotype-based association analyses indicated that haplotypes G-C-C, formed by SNP Ex12-2-SNP Int11-1-SNP Ex11-1, T-G, formed by SNP Int9-3-SNP Int9-2, and C-C, formed by SNP Int5-1-SNP Int3-1, are significantly associated with protein percentage (F=4.15; P=0.0418) and fat percentage (F=5.18~7.25; P=0.0072~0.0231). Finally, by using an in vitro expression assay, we demonstrated that the A allele of SNP Ex1-1 and the T allele of SNP Ex11-1 of UGDH significantly decrease the expression of UGDH by 68.0% at the RNA level and 50.1% at the protein level, suggesting that SNP Ex1-1 and Ex11-1 represent two functional polymorphisms affecting expression of UGDH and may have partly contributed to the observed association of the gene with milk production traits in our samples. Taken together, our findings strongly indicate that the UGDH gene could be involved in the genetic variation underlying the QTL for milk production traits.

  1. Polymorphic genetic variation in immune system genes: a study of two populations of Espirito Santo, Brazil.

    PubMed

    Dettogni, Raquel Spinassé; Sá, Ricardo Tristão; Tovar, Thaís Tristão; Louro, Iúri Drumond

    2013-08-01

    Mapping single nucleotide polymorphisms (SNPs) in genes potentially involved in immune responses may help understand the pathophysiology of infectious diseases in specific geographical regions. In this context, we aimed to analyze the frequency of immunogenetic markers, focusing on genes CD209 (SNP -336A/G), FcγRIIa (SNP -131H/R), TNF-α (SNP -308A/G) and VDR (SNP Taq I) in two populations of the Espirito Santo State (ES), Brazil: general and Pomeranian populations. Peripheral blood genomic DNA was extracted from one hundred healthy individuals of the general population and from 59 Pomeranians. Polymorphic variant identification was performed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). SNP genotype frequencies were in Hardy-Weinberg Equilibrium. There was no statistically significant difference in allelic and genotypic distributions between the two populations studied. Statistically significant differences were observed for SNP genotype distribution in genes CD209, TNF-α and VDR when comparing the ES populations with other Brazilian populations. This is the first report of CD209, FcγRIIa, TNF-α and VDR allelic frequencies for the general and Pomeranian populations of ES.
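
    The Hardy-Weinberg check mentioned in this abstract can be illustrated with a short sketch. It is a generic one-degree-of-freedom chi-square test for a bi-allelic SNP, not the authors' analysis; the function name `hwe_chi_square` and any genotype counts passed to it are hypothetical.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) of Hardy-Weinberg proportions for one bi-allelic SNP.

    Arguments are observed genotype counts (hypothetical input, not study data).
    Returns (chi_square, p_value).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p                       # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of a chi-square variable with 1 df: erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value
```

    A large p-value is consistent with Hardy-Weinberg proportions; with small samples an exact test is usually preferred over this approximation.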

  2. Phylogeographic and population genetic analyses reveal multiple species of Boa and independent origins of insular dwarfism.

    PubMed

    Card, Daren C; Schield, Drew R; Adams, Richard H; Corbin, Andrew B; Perry, Blair W; Andrew, Audra L; Pasquesi, Giulia I M; Smith, Eric N; Jezkova, Tereza; Boback, Scott M; Booth, Warren; Castoe, Todd A

    2016-09-01

    Boa is a Neotropical genus of snakes historically recognized as monotypic despite its expansive distribution. The distinct morphological traits and color patterns exhibited by these snakes, together with the wide diversity of ecosystems they inhabit, collectively suggest that the genus may represent multiple species. Morphological variation within Boa also includes instances of dwarfism observed in multiple offshore island populations. Despite this substantial diversity, the systematics of the genus Boa has received little attention until very recently. In this study we examined the genetic structure and phylogenetic relationships of Boa populations using mitochondrial sequences and genome-wide SNP data obtained from RADseq. We analyzed these data at multiple geographic scales using a combination of phylogenetic inference (including coalescent-based species delimitation) and population genetic analyses. We identified extensive population structure across the range of the genus Boa and multiple lines of evidence for three widely-distributed clades roughly corresponding with the three primary land masses of the Western Hemisphere. We also find both mitochondrial and nuclear support for independent origins and parallel evolution of dwarfism on offshore island clusters in Belize and Cayos Cochinos Menor, Honduras. Copyright © 2016 Elsevier Inc. All rights reserved.

  3. High-density single nucleotide polymorphism (SNP) array mapping in Brassica oleracea: identification of QTL associated with carotenoid variation in broccoli florets.

    PubMed

    Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W

    2014-09-01

    A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96% coverage of the diploid genome. Carotenoid analysis of 2 years of data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence as to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.

  4. Insights into the genetic architecture of morphological traits in two passerine bird species.

    PubMed

    Silva, C N S; McFarlane, S E; Hagen, I J; Rönnegård, L; Billing, A M; Kvalnes, T; Kemppainen, P; Rønning, B; Ringsby, T H; Sæther, B-E; Qvarnström, A; Ellegren, H; Jensen, H; Husby, A

    2017-09-01

    Knowledge about the underlying genetic architecture of phenotypic traits is needed to understand and predict evolutionary dynamics. The number of causal loci, magnitude of the effects and location in the genome are, however, still largely unknown. Here, we use genome-wide single-nucleotide polymorphism (SNP) data from two large-scale data sets on house sparrows and collared flycatchers to examine the genetic architecture of different morphological traits (tarsus length, wing length, body mass, bill depth, bill length, total and visible badge size and white wing patches). Genomic heritabilities were estimated using relatedness calculated from SNPs. The proportion of variance captured by the SNPs (SNP-based heritability) was lower in house sparrows compared with collared flycatchers, as expected given marker density (6348 SNPs in house sparrows versus 38 689 SNPs in collared flycatchers). Indeed, after downsampling to similar SNP density and sample size, this estimate was no longer markedly different between species. Chromosome-partitioning analyses demonstrated that the proportion of variance explained by each chromosome was significantly positively related to the chromosome size for some traits and, generally, that larger chromosomes tended to explain proportionally more variation than smaller chromosomes. Finally, we found two genome-wide significant associations with very small-effect sizes. One SNP on chromosome 20 was associated with bill length in house sparrows and explained 1.2% of phenotypic variation (V_P), and one SNP on chromosome 4 was associated with tarsus length in collared flycatchers (3% of V_P). Although we cannot exclude the possibility of undetected large-effect loci, our results indicate a polygenic basis for morphological traits.

  5. Analysis of copy number variations in Holstein cows identify potential mechanisms contributing to differences in residual feed intake

    USDA-ARS's Scientific Manuscript database

    Genomic structural variation is an important and abundant source of genetic and phenotypic variation. In this study, we performed an initial analysis of CNVs using BovineHD SNP genotyping data from 147 Holstein cows identified as having high or low feed efficiency as estimated by residual feed intak...

  6. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    PubMed

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. 
    TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
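
    The abstract describes the tunable criteria (SNP count, population frequency, target length, less variable flanks) but not the implementation. The following is a minimal sketch of one way such filters could be combined in a window scan. It is not the TIA algorithm itself; `find_snp_islands` and all parameter names and defaults are assumptions made for illustration.

```python
from bisect import bisect_left, bisect_right


def find_snp_islands(snps, island_len=100, min_snps=3, min_maf=0.2,
                     flank_len=50, max_flank_snps=0):
    """Return (start, end) windows that satisfy illustrative SNP-island filters.

    snps: iterable of (position, minor_allele_frequency) on one chromosome.
    A window qualifies when it holds at least `min_snps` SNPs with frequency
    >= `min_maf` and each flank of `flank_len` bases holds at most
    `max_flank_snps` SNPs of any frequency. All thresholds are hypothetical.
    """
    snps = sorted(snps)
    all_pos = [pos for pos, _ in snps]
    common_pos = [pos for pos, maf in snps if maf >= min_maf]

    def count(positions, lo, hi):
        # Number of sorted positions inside the closed interval [lo, hi].
        return bisect_right(positions, hi) - bisect_left(positions, lo)

    islands = []
    last_end = -1
    for start in common_pos:           # anchor each candidate on a common SNP
        if start <= last_end:          # skip windows overlapping an accepted island
            continue
        end = start + island_len - 1
        if count(common_pos, start, end) < min_snps:
            continue
        left = count(all_pos, start - flank_len, start - 1)
        right = count(all_pos, end + 1, end + flank_len)
        if left <= max_flank_snps and right <= max_flank_snps:
            islands.append((start, end))
            last_end = end
    return islands
```

    Requiring quiet flanks reflects the abstract's point that flanking regions should be less variable, which in practice helps primer design for amplicon sequencing.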

  7. Genetic sharing and heritability of paediatric age of onset autoimmune diseases

    PubMed Central

    Li, Yun R.; Zhao, Sihai D.; Li, Jin; Bradfield, Jonathan P.; Mohebnasab, Maede; Steel, Laura; Kobie, Julie; Abrams, Debra J.; Mentch, Frank D.; Glessner, Joseph T.; Guo, Yiran; Wei, Zhi; Connolly, John J.; Cardinale, Christopher J.; Bakay, Marina; Li, Dong; Maggadottir, S. Melkorka; Thomas, Kelly A.; Qui, Haijun; Chiavacci, Rosetta M.; Kim, Cecilia E.; Wang, Fengxiang; Snyder, James; Flatø, Berit; Førre, Øystein; Denson, Lee A.; Thompson, Susan D.; Becker, Mara L.; Guthery, Stephen L.; Latiano, Anna; Perez, Elena; Resnick, Elena; Strisciuglio, Caterina; Staiano, Annamaria; Miele, Erasmo; Silverberg, Mark S.; Lie, Benedicte A.; Punaro, Marilynn; Russell, Richard K.; Wilson, David C.; Dubinsky, Marla C.; Monos, Dimitri S.; Annese, Vito; Munro, Jane E.; Wise, Carol; Chapel, Helen; Cunningham-Rundles, Charlotte; Orange, Jordan S.; Behrens, Edward M.; Sullivan, Kathleen E.; Kugathasan, Subra; Griffiths, Anne M.; Satsangi, Jack; Grant, Struan F. A.; Sleiman, Patrick M. A.; Finkel, Terri H.; Polychronakos, Constantin; Baldassano, Robert N.; Luning Prak, Eline T.; Ellis, Justine A.; Li, Hongzhe; Keating, Brendan J.; Hakonarson, Hakon

    2015-01-01

    Autoimmune diseases (AIDs) are polygenic diseases affecting 7–10% of the population in the Western Hemisphere with few effective therapies. Here, we quantify the heritability of paediatric AIDs (pAIDs), including JIA, SLE, CEL, T1D, UC, CD, PS, SPA and CVID, attributable to common genomic variations (SNP-h2). SNP-h2 estimates are most significant for T1D (0.863±s.e. 0.07) and JIA (0.727±s.e. 0.037), more modest for UC (0.386±s.e. 0.04) and CD (0.454±0.025), largely consistent with population estimates and are generally greater than those previously reported by adult GWAS. On pairwise analysis, we observed that the diseases UC-CD (0.69±s.e. 0.07) and JIA-CVID (0.343±s.e. 0.13) are the most strongly correlated. Variations across the MHC strongly contribute to SNP-h2 in T1D and JIA, but do not significantly contribute to the pairwise rG. Together, our results partition contributions of shared versus disease-specific genomic variations to pAID heritability, identifying pAIDs with unexpected risk sharing, while recapitulating known associations between autoimmune diseases previously reported in adult cohorts. PMID:26450413

  8. Genomic characteristics of cattle copy number variations

    USDA-ARS's Scientific Manuscript database

    We performed a systematic analysis of cattle copy number variations (CNVs) using the Bovine HapMap SNP genotyping data, including 539 animals of 21 modern cattle breeds and 6 outgroups. After correcting genomic waves and considering the trio information, we identified 682 candidate CNV regions (CNVR...

  9. Predicting the disease of Alzheimer with SNP biomarkers and clinical data using data mining classification approach: decision tree.

    PubMed

    Erdoğan, Onur; Aydin Son, Yeşim

    2014-01-01

    Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations, in which only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers. But there is a need to determine the SNP subsets and patients' clinical data that are informative for the diagnosis. Data mining approaches have the highest potential for extracting knowledge from genomic datasets and selecting the representative SNPs as well as the most effective and informative clinical features for the clinical diagnosis of diseases. In this study, we have applied one of the widely used data mining classification methodologies, the "decision tree", for associating the SNP biomarkers and significant clinical data with Alzheimer's disease (AD), which is the most common form of "dementia". Different tree construction parameters have been compared for the optimization, and the most accurate tree for predicting AD is presented.

  10. Associations between single nucleotide polymorphisms in multiple candidate genes and body weight in rabbits

    PubMed Central

    El-Sabrout, Karim; Aggag, Sarah A.

    2017-01-01

    Aim: In this study, we examined parts of six growth genes (growth hormone [GH], melanocortin 4 receptor [MC4R], growth hormone receptor [GHR], phosphoglycerate mutase [PGAM], myostatin [MSTN], and fibroblast growth factor [FGF]) as specific primers for two rabbit lines (V-line, Alexandria) using nucleotide sequence analysis, to investigate the association between single nucleotide polymorphisms (SNPs) detected in these genes and body weight (BW) at market. Materials and Methods: The kits of each line were grouped into high- and low-weight rabbits to identify DNA markers useful for association studies with high BW. DNA from blood samples of each group was extracted to amplify the six growth genes. SNP analysis was used to study the association between polymorphisms in the six growth genes and marketing BW (at 63 days) in the two rabbit lines. The purified polymerase chain reaction products were sequenced for the rabbits with the highest and lowest BW in each line. Results: Alignment of sequence data from each group revealed the following SNPs: at nucleotide 23 (A-C) and nucleotide 35 (T-G) in the MC4R gene (sense mutation) of Alexandria and V-line high-BW rabbits. Furthermore, we detected the following SNP variation between the two lines: a SNP (T-C) at nucleotide 27 was identified in the MC4R gene (sense mutation) and another (A-C) at nucleotide 14 was identified in the GHR gene (nonsense mutation) of the Alexandria line. The results of individual BW at market (63 days) indicated that Alexandria rabbits had significantly higher BW compared with V-line rabbits. MC4R polymorphism showed significant association with high BW in rabbits. Conclusion: The results of polymorphism demonstrate the possibility to detect an association between BW in rabbits and the efficiency of the used primers to predict through the genetic specificity using the SNP of MC4R. PMID:28246458

  11. Environmental Response and Genomic Regions Correlated with Rice Root Growth and Yield under Drought in the OryzaSNP Panel across Multiple Study Systems

    PubMed Central

    Wade, Len J.; Bartolome, Violeta; Mauleon, Ramil; Vasant, Vivek Deshmuck; Prabakar, Sumeet Mankar; Chelliah, Muthukumar; Kameoka, Emi; Nagendra, K.; Reddy, K. R. Kamalnath; Varma, C. Mohan Kumar; Patil, Kalmeshwar Gouda; Shrestha, Roshi; Al-Shugeairy, Zaniab; Al-Ogaidi, Faez; Munasinghe, Mayuri; Gowda, Veeresh; Semon, Mande; Suralta, Roel R.; Shenoy, Vinay; Vadez, Vincent; Serraj, Rachid; Shashidhar, H. E.; Yamauchi, Akira; Babu, Ranganathan Chandra; Price, Adam; McNally, Kenneth L.; Henry, Amelia

    2015-01-01

    The rapid progress in rice genotyping must be matched by advances in phenotyping. A better understanding of genetic variation in rice for drought response, root traits, and practical methods for studying them are needed. In this study, the OryzaSNP set (20 diverse genotypes that have been genotyped for SNP markers) was phenotyped in a range of field and container studies to study the diversity of rice root growth and response to drought. Of the root traits measured across more than 20 root experiments, root dry weight showed the most stable genotypic performance across studies. The environment (E) component had the strongest effect on yield and root traits. We identified genomic regions correlated with root dry weight, percent deep roots, maximum root depth, and grain yield based on a correlation analysis with the phenotypes and aus, indica, or japonica introgression regions using the SNP data. Two genomic regions were identified as hot spots in which root traits and grain yield were co-located; on chromosome 1 (39.7–40.7 Mb) and on chromosome 8 (20.3–21.9 Mb). Across experiments, the soil type/ growth medium showed more correlations with plant growth than the container dimensions. Although the correlations among studies and genetic co-location of root traits from a range of study systems points to their potential utility to represent responses in field studies, the best correlations were observed when the two setups had some similar properties. Due to the co-location of the identified genomic regions (from introgression block analysis) with QTL for a number of previously reported root and drought traits, these regions are good candidates for detailed characterization to contribute to understanding rice improvement for response to drought. This study also highlights the utility of characterizing a small set of 20 genotypes for root growth, drought response, and related genomic regions. PMID:25909711

  12. Genetic Modifiers of Neurofibromatosis Type 1-Associated Café-au-Lait Macule Count Identified Using Multi-platform Analysis

    PubMed Central

    Pemov, Alexander; Sung, Heejong; Hyland, Paula L.; Sloan, Jennifer L.; Ruppert, Sarah L.; Baldwin, Andrea M.; Boland, Joseph F.; Bass, Sara E.; Lee, Hyo Jung; Jones, Kristine M.; Zhang, Xijun; Mullikin, James C.; Widemann, Brigitte C.; Wilson, Alexander F.; Stewart, Douglas R.

    2014-01-01

    Neurofibromatosis type 1 (NF1) is an autosomal dominant, monogenic disorder of dysregulated neurocutaneous tissue growth. Pleiotropy, variable expressivity and few NF1 genotype-phenotype correlates limit clinical prognostication in NF1. Phenotype complexity in NF1 is hypothesized to derive in part from genetic modifiers unlinked to the NF1 locus. In this study, we hypothesized that normal variation in germline gene expression confers risk for certain phenotypes in NF1. In a set of 79 individuals with NF1, we examined the association between gene expression in lymphoblastoid cell lines with NF1-associated phenotypes and sequenced select genes with significant phenotype/expression correlations. In a discovery cohort of 89 self-reported European-Americans with NF1 we examined the association between germline sequence variants of these genes with café-au-lait macule (CALM) count, a tractable, tumor-like phenotype in NF1. Two correlated, common SNPs (rs4660761 and rs7161) between DPH2 and ATP6V0B were significantly associated with the CALM count. Analysis with tiled regression also identified SNP rs4660761 as significantly associated with CALM count. SNP rs1800934 and 12 rare variants in the mismatch repair gene MSH6 were also associated with CALM count. Both SNPs rs7161 and rs4660761 (DPH2 and ATP6V0B) were highly significant in a mega-analysis in a combined cohort of 180 self-reported European-Americans; SNP rs1800934 (MSH6) was near-significant in a meta-analysis assuming dominant effect of the minor allele. SNP rs4660761 is predicted to regulate ATP6V0B, a gene associated with melanosome biology. Individuals with homozygous mutations in MSH6 can develop an NF1-like phenotype, including multiple CALMs. Through a multi-platform approach, we identified variants that influence NF1 CALM count. PMID:25329635

  13. Genome-Wide Association Mapping Combined with Reverse Genetics Identifies New Effectors of Low Water Potential-Induced Proline Accumulation in Arabidopsis

    PubMed Central

    Verslues, Paul E.; Lasky, Jesse R.; Juenger, Thomas E.; Liu, Tzu-Wen; Kumar, M. Nagaraj

    2014-01-01

    Arabidopsis (Arabidopsis thaliana) exhibits natural genetic variation in drought response, including varying levels of proline (Pro) accumulation under low water potential. As Pro accumulation is potentially important for stress tolerance and cellular redox control, we conducted a genome-wide association (GWAS) study of low water potential-induced Pro accumulation using a panel of natural accessions and publicly available single-nucleotide polymorphism (SNP) data sets. Candidate genomic regions were prioritized for subsequent study using metrics considering both the strength and spatial clustering of the association signal. These analyses found many candidate regions likely containing gene(s) influencing Pro accumulation. Reverse genetic analysis of several candidates identified new Pro effector genes, including thioredoxins and several genes encoding Universal Stress Protein A domain proteins. These new Pro effector genes further link Pro accumulation to cellular redox and energy status. Additional new Pro effector genes found include the mitochondrial protease LON1, ribosomal protein RPL24A, protein phosphatase 2A subunit A3, a MADS box protein, and a nucleoside triphosphate hydrolase. Several of these new Pro effector genes were from regions with multiple SNPs, each having moderate association with Pro accumulation. This pattern supports the use of summary approaches that incorporate clusters of SNP associations in addition to consideration of individual SNP probability values. Further GWAS-guided reverse genetics promises to find additional effectors of Pro accumulation. The combination of GWAS and reverse genetics to efficiently identify new effector genes may be especially applicable for traits difficult to analyze by other genetic screening methods. PMID:24218491

  14. GWAS of human bitter taste perception identifies new loci and reveals additional complexity of bitter taste genetics

    PubMed Central

    Ledda, Mirko; Kutalik, Zoltán; Souza Destito, Maria C.; Souza, Milena M.; Cirillo, Cintia A.; Zamboni, Amabilene; Martin, Nathalie; Morya, Edgard; Sameshima, Koichi; Beckmann, Jacques S.; le Coutre, Johannes; Bergmann, Sven; Genick, Ulrich K.

    2014-01-01

    Human perception of bitterness displays pronounced interindividual variation. This phenotypic variation is mirrored by equally pronounced genetic variation in the family of bitter taste receptor genes. To better understand the effects of common genetic variations on human bitter taste perception, we conducted a genome-wide association study on a discovery panel of 504 subjects and a validation panel of 104 subjects from the general population of São Paulo in Brazil. Correction for general taste-sensitivity allowed us to identify a SNP in the cluster of bitter taste receptors on chr12 (10.88–11.24 Mb, build 36.1) significantly associated (best SNP: rs2708377, P = 5.31 × 10^-13, r^2 = 8.9%, β = −0.12, s.e. = 0.016) with the perceived bitterness of caffeine. This association overlaps with, but is statistically distinct from, the previously identified SNP rs10772420 influencing the perception of quinine bitterness that falls in the same bitter taste cluster. We replicated this association to quinine perception (P = 4.97 × 10^-37, r^2 = 23.2%, β = 0.25, s.e. = 0.020) and additionally found the effect of this genetic locus to be concentration specific with a strong impact on the perception of low, but no impact on the perception of high concentrations of quinine. Our study, thus, furthers our understanding of the complex genetic architecture of bitter taste perception. PMID:23966204

  15. A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.

    PubMed

    Wang, Yi-Ting; Sung, Pei-Yuan; Lin, Peng-Lin; Yu, Ya-Wen; Chung, Ren-Hua

    2015-05-15

    Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of an individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value = 2.5 × 10^-6). Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease.
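
    The general idea of a variable p-value threshold combined with a permutation procedure can be sketched as follows. This is a generic illustration, not the OPTPDT: it ignores family structure, and it assumes that per-SNP statistics and p-values for the observed data and for each permuted replicate have already been computed elsewhere. The function names and input layout are hypothetical.

```python
def set_statistic(stats, pvals, threshold):
    """Sum the per-SNP statistics of the SNPs whose p-value passes the threshold."""
    return sum(s for s, p in zip(stats, pvals) if p <= threshold)


def variable_threshold_test(observed, permuted, thresholds):
    """Pick the best p-value threshold and correct for having searched over thresholds.

    observed:  (stats, pvals) for the real data (assumed precomputed).
    permuted:  list of (stats, pvals), one per permuted replicate.
    Returns (best_threshold, overall_p_value).
    """
    replicates = [observed] + list(permuted)
    n = len(replicates)
    # table[r][k] = set statistic of replicate r at threshold k
    table = [[set_statistic(s, p, t) for t in thresholds] for s, p in replicates]

    def best_for(r):
        # Smallest per-threshold permutation p-value for replicate r.
        best_p, best_t = 2.0, None
        for k, t in enumerate(thresholds):
            p_k = sum(1 for other in table if other[k] >= table[r][k]) / n
            if p_k < best_p:
                best_p, best_t = p_k, t
        return best_p, best_t

    obs_p, obs_t = best_for(0)
    # Overall significance: how often a replicate's best p-value is as small.
    overall_p = sum(1 for r in range(n) if best_for(r)[0] <= obs_p) / n
    return obs_t, overall_p
```

    Applying the same threshold search to every permuted replicate is what keeps the type I error rate under control despite the selection step.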

  16. A 48 SNP set for grapevine cultivar identification

    PubMed Central

    2011-01-01

    Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. 
    Furthermore, because SNP markers are bi-allelic, allele identification and genotype naming are extremely simple and genotypes obtained with different equipment and by different laboratories are always fully comparable. PMID:22060012
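
    Why balanced allele frequencies matter for discrimination power can be shown with the standard probability-of-identity formula for a bi-allelic locus, PI = p^4 + (2pq)^2 + q^4. The sketch below is illustrative only: it assumes unlinked loci, Hardy-Weinberg proportions and unrelated individuals, which may not hold for vegetatively propagated cultivars, and the frequencies used with it are hypothetical rather than taken from the study.

```python
def probability_of_identity(p):
    """Chance that two unrelated individuals share a genotype at a bi-allelic SNP.

    p is the frequency of one allele; Hardy-Weinberg proportions are assumed.
    """
    q = 1.0 - p
    return p ** 4 + (2 * p * q) ** 2 + q ** 4


def combined_probability_of_identity(freqs):
    """Multiply per-locus values, assuming the loci are independent."""
    result = 1.0
    for p in freqs:
        result *= probability_of_identity(p)
    return result
```

    The per-locus value is lowest at p = 0.5, so a panel of balanced markers needs fewer loci to reach a given combined value than a panel of rare-allele markers.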

  17. Genomic and evolutionary characteristics of cattle copy number variations

    USDA-ARS's Scientific Manuscript database

    We performed a systematic analysis of cattle copy number variations (CNVs) using the Bovine HapMap SNP genotyping data, including 539 animals of 21 modern cattle breeds and 6 outgroups. After correcting genomic waves and considering the trio information, we identified 682 candidate CNV regions (CNVR...

  18. Detection of selective sweeps in cattle using genome-wide SNP data

    PubMed Central

    2013-01-01

    Background The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps. Results Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour. Conclusions The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. 
The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps; however, we found sweeps in common between breeds which were confirmed using an ultra-high-density assay scored in a small number of animals from a subset of the breeds. No sweep regions were shared between indicine and taurine breeds, reflecting their divergent selection histories and the very different environmental habitats to which these sub-species have adapted. PMID:23758707
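
    The abstract above describes detecting sweeps "via the fixation of alleles at a series of contiguous SNP loci". A minimal sketch of that idea follows, assuming per-locus minor allele frequencies are already ordered by map position. The function name `fixed_runs`, the `min_len` threshold, and the `tol` parameter are illustrative assumptions, not the authors' method or parameters.

```python
from typing import List, Tuple


def fixed_runs(maf: List[float], min_len: int = 5, tol: float = 0.0) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) index pairs of runs of at least
    min_len contiguous loci whose minor allele frequency is <= tol.

    min_len and tol are hypothetical thresholds chosen for illustration.
    """
    runs = []
    start = None
    for i, freq in enumerate(maf):
        if freq <= tol:
            if start is None:
                start = i  # a new run of fixed loci begins here
        else:
            if start is not None and i - start >= min_len:
                runs.append((start, i - 1))
            start = None
    # Close a run that extends to the last locus.
    if start is not None and len(maf) - start >= min_len:
        runs.append((start, len(maf) - 1))
    return runs


# Toy minor allele frequencies for ten ordered loci in one breed.
maf = [0.2, 0.0, 0.0, 0.0, 0.1, 0.0, 0.0, 0.0, 0.0, 0.0]
print(fixed_runs(maf, min_len=4))  # -> [(5, 9)]
```

    A real analysis would also need to account for marker density and ascertainment bias, which the abstract identifies as the main sources of false discoveries.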

  19. 4P: fast computing of population genetics statistics from large DNA polymorphism panels

    PubMed Central

    Benazzo, Andrea; Panziera, Alex; Bertorelle, Giorgio

    2015-01-01

    Massive DNA sequencing has significantly increased the amount of data available for population genetics and molecular ecology studies. However, the parallel computation of simple statistics within and between populations from large panels of polymorphic sites is not yet available, making the exploratory analyses of a set or subset of data a very laborious task. Here, we present 4P (parallel processing of polymorphism panels), a stand-alone software program for the rapid computation of genetic variation statistics (including the joint frequency spectrum) from millions of DNA variants in multiple individuals and multiple populations. It handles a standard input file format commonly used to store DNA variation from empirical or simulation experiments. The computational performance of 4P was evaluated using large SNP (single nucleotide polymorphism) datasets from human genomes or obtained by simulations. 4P was faster or much faster than other comparable programs, and the impact of parallel computing using multicore computers or servers was evident. 4P is a useful tool for biologists who need a simple and rapid computer program to run exploratory population genetics analyses in large panels of genomic data. It is also particularly suitable to analyze multiple data sets produced in simulation studies. Unix, Windows, and MacOs versions are provided, as well as the source code for easier pipeline implementations. PMID:25628874
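
    As an illustration of one statistic named above, the joint frequency spectrum, the following sketch counts sites by their alternate-allele counts in two populations. It assumes diploid genotypes coded as 0, 1 or 2 alternate alleles with no missing data. The function `joint_sfs` is hypothetical and is not part of 4P, whose input format and implementation are not described in this record.

```python
from typing import List


def joint_sfs(pop1: List[List[int]], pop2: List[List[int]]) -> List[List[int]]:
    """Joint frequency spectrum for two populations of diploid individuals.

    pop[i][s] is the number of alternate alleles (0, 1 or 2) carried by
    individual i at site s. Entry [a][b] of the result is the number of
    sites with a alternate alleles in pop1 and b in pop2.
    """
    n_sites = len(pop1[0])
    n1, n2 = 2 * len(pop1), 2 * len(pop2)  # chromosomes sampled per population
    sfs = [[0] * (n2 + 1) for _ in range(n1 + 1)]
    for s in range(n_sites):
        c1 = sum(ind[s] for ind in pop1)
        c2 = sum(ind[s] for ind in pop2)
        sfs[c1][c2] += 1
    return sfs


# Two individuals in population 1, one in population 2, three sites.
pop1 = [[0, 1, 2], [0, 1, 0]]
pop2 = [[1, 0, 2]]
for row in joint_sfs(pop1, pop2):
    print(row)
```

    Each site is independent of the others, so the loop over sites is the natural unit to distribute across cores in a parallel implementation.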

  20. Genomic Variation by Whole-Genome SNP Mapping Arrays Predicts Time-to-Event Outcome in Patients with Chronic Lymphocytic Leukemia

    PubMed Central

    Schweighofer, Carmen D.; Coombes, Kevin R.; Majewski, Tadeusz; Barron, Lynn L.; Lerner, Susan; Sargent, Rachel L.; O'Brien, Susan; Ferrajoli, Alessandra; Wierda, William G.; Czerniak, Bogdan A.; Medeiros, L. Jeffrey; Keating, Michael J.; Abruzzo, Lynne V.

    2013-01-01

    Genomic abnormalities, such as deletions in 11q22 or 17p13, are associated with poorer prognosis in patients with chronic lymphocytic leukemia (CLL). We hypothesized that unknown regions of copy number variation (CNV) affect clinical outcome and can be detected by array-based single-nucleotide polymorphism (SNP) genotyping. We compared SNP genotypes from 168 untreated patients with CLL with genotypes from 73 white HapMap controls. We identified 322 regions of recurrent CNV, 82 of which occurred significantly more often in CLL than in HapMap (CLL-specific CNV), including regions typically aberrant in CLL: deletions in 6q21, 11q22, 13q14, and 17p13 and trisomy 12. In univariate analyses, 35 of total and 11 of CLL-specific CNVs were associated with unfavorable time-to-event outcomes, including gains or losses in chromosomes 2p, 4p, 4q, 6p, 6q, 7q, 11p, 11q, and 17p. In multivariate analyses, six CNVs (ie, CLL-specific variations in 11p15.1-15.4 or 6q27) predicted time-to-treatment or overall survival independently of established markers of prognosis. Moreover, genotypic complexity (ie, the number of independent CNVs per patient) significantly predicted prognosis, with a median time-to-treatment of 64 months versus 23 months in patients with zero to one versus two or more CNVs, respectively (P = 3.3 × 10−8). In summary, a comparison of SNP genotypes from patients with CLL with HapMap controls allowed us to identify known and unknown recurrent CNVs and to determine regions and rates of CNV that predict poorer prognosis in patients with CLL. PMID:23273604

  1. Exploiting sequence similarity to validate the sensitivity of SNP arrays in detecting fine-scaled copy number variations.

    PubMed

    Wong, Gerard; Leckie, Christopher; Gorringe, Kylie L; Haviv, Izhak; Campbell, Ian G; Kowalczyk, Adam

    2010-04-15

    High-density single nucleotide polymorphism (SNP) genotyping arrays are efficient and cost effective platforms for the detection of copy number variation (CNV). To ensure accuracy in probe synthesis and to minimize production costs, short oligonucleotide probe sequences are used. The use of short probe sequences limits the specificity of binding targets in the human genome. The specificity of these short probeset sequences has yet to be fully analysed against a normal reference human genome. Sequence similarity can artificially elevate or suppress copy number measurements, and hence reduce the reliability of affected probe readings. For the purpose of detecting narrow CNVs reliably down to the width of a single probeset, sequence similarity is an important issue that needs to be addressed. We surveyed the Affymetrix Human Mapping SNP arrays for probeset sequence similarity against the reference human genome. Utilizing sequence similarity results, we identified a collection of fine-scaled putative CNVs between gender from autosomal probesets whose sequence matches various loci on the sex chromosomes. To detect these variations, we utilized our statistical approach, Detecting REcurrent Copy number change using rank-order Statistics (DRECS), and showed that its performance was superior and more stable than the t-test in detecting CNVs. Through the application of DRECS on the HapMap population datasets with multi-matching probesets filtered, we identified biologically relevant SNPs in aberrant regions across populations with known association to physical traits, such as height, covered by the span of a single probe. This provided empirical confirmation of the existence of naturally occurring narrow CNVs as well as the sensitivity of the Affymetrix SNP array technology in detecting them. The MATLAB implementation of DRECS is available at http://ww2.cs.mu.oz.au/~gwong/DRECS/index.html.

  2. Design and characterization of a 52K SNP chip for goats.

    PubMed

    Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C M; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T; McEwan, John; Martin, Patrice; Moreno, Carole R; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.

  3. Design and Characterization of a 52K SNP Chip for Goats

    PubMed Central

    Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C. M.; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T.; McEwan, John; Martin, Patrice; Moreno, Carole R.; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L.; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang

    2014-01-01

    The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50–60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years. PMID:24465974

  4. ITALICS: an algorithm for normalization and DNA copy number calling for Affymetrix SNP arrays.

    PubMed

    Rigaill, Guillem; Hupé, Philippe; Almeida, Anna; La Rosa, Philippe; Meyniel, Jean-Philippe; Decraene, Charles; Barillot, Emmanuel

    2008-03-15

    Affymetrix SNP arrays can be used to determine the DNA copy number measurement of 11 000-500 000 SNPs along the genome. Their high density facilitates the precise localization of genomic alterations and makes them a powerful tool for studies of cancers and copy number polymorphism. Like other microarray technologies, they are influenced by non-relevant sources of variation, requiring correction. Moreover, the amplitude of variation induced by non-relevant effects is similar to or greater than the biologically relevant effect (i.e. true copy number), making it difficult to estimate non-relevant effects accurately without including the biologically relevant effect. We addressed this problem by developing ITALICS, a normalization method that estimates both biological and non-relevant effects in an alternate, iterative manner, accurately eliminating irrelevant effects. We compared our normalization method with other existing and available methods, and found that ITALICS outperformed these methods for several in-house datasets and one public dataset. These results were validated biologically by quantitative PCR. The R package ITALICS (ITerative and Alternative normaLIzation and Copy number calling for affymetrix Snp arrays) has been submitted to Bioconductor.

  5. GrigoraSNPs: Optimized Analysis of SNPs for DNA Forensics.

    PubMed

    Ricke, Darrell O; Shcherbina, Anna; Michaleas, Adam; Fremont-Smith, Philip

    2018-04-16

    High-throughput sequencing (HTS) of single nucleotide polymorphisms (SNPs) enables additional DNA forensic capabilities not attainable using traditional STR panels. However, the inclusion of sets of loci selected for mixture analysis, extended kinship, phenotype, biogeographic ancestry prediction, etc., can result in large panel sizes that are difficult to analyze in a rapid fashion. GrigoraSNPs was developed to address the allele-calling bottleneck that was encountered when analyzing SNP panels with more than 5000 loci using HTS. GrigoraSNPs uses MapReduce parallel data processing on multiple computational threads plus a novel locus-identification hashing strategy leveraging target sequence tags. This tool optimizes the SNP calling module of the DNA analysis pipeline with runtimes that scale linearly with the number of HTS reads. Results are compared with SNP analysis pipelines implemented with SAMtools and GATK. GrigoraSNPs removes a computational bottleneck for processing forensic samples with large HTS SNP panels. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.

  6. Candidate Gene Approach for Parasite Resistance in Sheep – Variation in Immune Pathway Genes and Association with Fecal Egg Count

    PubMed Central

    Periasamy, Kathiravan; Pichler, Rudolf; Poli, Mario; Cristel, Silvina; Cetrá, Bibiana; Medus, Daniel; Basar, Muladno; A. K., Thiruvenkadan; Ramasamy, Saravanan; Ellahi, Masroor Babbar; Mohammed, Faruque; Teneva, Atanaska; Shamsuddin, Mohammed; Podesta, Mario Garcia; Diallo, Adama

    2014-01-01

    Sheep chromosome 3 (Oar3) has the largest number of QTLs reported to be significantly associated with resistance to gastro-intestinal nematodes. This study aimed to identify single nucleotide polymorphisms (SNPs) within candidate genes located in sheep chromosome 3 as well as genes involved in major immune pathways. A total of 41 SNPs were identified across 38 candidate genes in a panel of unrelated sheep and genotyped in 713 animals belonging to 22 breeds across Asia, Europe and South America. The variations and evolution of immune pathway genes were assessed in sheep populations across these macro-environmental regions that significantly differ in the diversity and load of pathogens. The mean minor allele frequency (MAF) did not vary between Asian and European sheep reflecting the absence of ascertainment bias. Phylogenetic analysis revealed two major clusters with most of South Asian, South East Asian and South West Asian breeds clustering together while European and South American sheep breeds clustered together distinctly. Analysis of molecular variance revealed strong phylogeographic structure at loci located in immune pathway genes, unlike microsatellite and genome wide SNP markers. To understand the influence of natural selection processes, SNP loci located in chromosome 3 were utilized to reconstruct haplotypes, the diversity of which showed significant deviations from selective neutrality. Reduced Median network of reconstructed haplotypes showed balancing selection in force at these loci. Preliminary association of SNP genotypes with phenotypes recorded 42 days post challenge revealed significant differences (P<0.05) in fecal egg count, body weight change and packed cell volume at two, four and six SNP loci respectively. In conclusion, the present study reports strong phylogeographic structure and balancing selection operating at SNP loci located within immune pathway genes. 
Further, SNP loci identified in the study were found to have potential for future large scale association studies in naturally exposed sheep populations. PMID:24533078

  7. Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

    PubMed

    McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.

  8. Bovine Exome Sequence Analysis and Targeted SNP Genotyping of Recessive Fertility Defects BH1, HH2, and HH3 Reveal a Putative Causative Mutation in SMC2 for HH3

    PubMed Central

    McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.

    2014-01-01

    The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746

  9. Characterization of Genome-Wide Variation in Four-Row Wax, a Waxy Maize Landrace with a Reduced Kernel Row Phenotype

    PubMed Central

    Liu, Hanmei; Wang, Xuewen; Wei, Bin; Wang, Yongbin; Liu, Yinghong; Zhang, Junjie; Hu, Yufeng; Yu, Guowu; Li, Jian; Xu, Zhanbin; Huang, Yubi

    2016-01-01

    In southwest China, some maize landraces have long been isolated geographically, and have phenotypes that differ from those of widely grown cultivars. These landraces may harbor rich genetic variation responsible for those phenotypes. Four-row Wax is one such landrace, with four rows of kernels on the cob. We resequenced the genome of Four-row Wax, obtaining 50.46 Gb sequence at 21.87× coverage, then identified and characterized 3,252,194 SNPs, 213,181 short InDels (1–5 bp) and 39,631 structural variations (greater than 5 bp). Of those, 312,511 (9.6%) SNPs were novel compared to the most detailed haplotype map (HapMap) SNP database of maize. Characterization of variations in reported kernel row number (KRN) related genes and KRN QTL regions revealed potential causal mutations in fea2, td1, kn1, and te1. Genome-wide comparisons revealed abundant genetic variations in Four-row Wax, which may be associated with environmental adaptation. The sequence and SNP variations described here enrich genetic resources of maize, and provide guidance into study of seed numbers for crop yield improvement. PMID:27242868

  10. MAFsnp: A Multi-Sample Accurate and Flexible SNP Caller Using Next-Generation Sequencing Data

    PubMed Central

    Hu, Jiyuan; Li, Tengfei; Xiu, Zidi; Zhang, Hong

    2015-01-01

    Most existing statistical methods developed for calling single nucleotide polymorphisms (SNPs) using next-generation sequencing (NGS) data are based on Bayesian frameworks, and there does not exist any SNP caller that produces p-values for calling SNPs in a frequentist framework. To fill in this gap, we develop a new method MAFsnp, a Multiple-sample based Accurate and Flexible algorithm for calling SNPs with NGS data. MAFsnp is based on an estimated likelihood ratio test (eLRT) statistic. In practical situations, the involved parameter is very close to the boundary of the parametric space, so the standard large sample property is not suitable to evaluate the finite-sample distribution of the eLRT statistic. Observing that the distribution of the test statistic is a mixture of zero and a continuous part, we propose to model the test statistic with a novel two-parameter mixture distribution. Once the parameters in the mixture distribution are estimated, p-values can be easily calculated for detecting SNPs, and the multiple-testing corrected p-values can be used to control false discovery rate (FDR) at any pre-specified level. With simulated data, MAFsnp is shown to have much better control of FDR than the existing SNP callers. Through the application to two real datasets, MAFsnp is also shown to outperform the existing SNP callers in terms of calling accuracy. An R package “MAFsnp” implementing the new SNP caller is freely available at http://homepage.fudan.edu.cn/zhangh/softwares/. PMID:26309201
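
    The abstract states that multiple-testing corrected p-values can be used to control FDR but does not name the correction. The sketch below assumes the Benjamini-Hochberg procedure, a common choice, and is not taken from the MAFsnp package. The function name `bh_adjust` and the example p-values are illustrative.

```python
from typing import List


def bh_adjust(pvals: List[float]) -> List[float]:
    """Benjamini-Hochberg adjusted p-values, returned in the input order.

    Assumption: BH is used here only as a representative FDR correction.
    """
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i])  # indices by ascending p
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical per-site p-values from a SNP caller.
pvals = [0.01, 0.04, 0.03, 0.005]
qvals = bh_adjust(pvals)
print([round(q, 3) for q in qvals])  # -> [0.02, 0.04, 0.04, 0.02]

# Sites called as SNPs at a 3% FDR threshold.
print([i for i, q in enumerate(qvals) if q <= 0.03])  # -> [0, 3]
```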

  11. Complex Variation in Measures of General Intelligence and Cognitive Change

    PubMed Central

    Rowe, Suzanne J.; Rowlatt, Amy; Davies, Gail; Harris, Sarah E.; Porteous, David J.; Liewald, David C.; McNeill, Geraldine; Starr, John M.

    2013-01-01

    Combining information from multiple SNPs may capture a greater amount of genetic variation than from the sum of individual SNP effects and help identify missing heritability. Regions may capture variation from multiple common variants of small effect, multiple rare variants or a combination of both. We describe regional heritability mapping of human cognition. Measures of crystallised (gc) and fluid intelligence (gf) in late adulthood (64–79 years) were available for 1806 individuals genotyped for 549,692 autosomal single nucleotide polymorphisms (SNPs). The same individuals were tested at age 11, giving us the rare opportunity to measure cognitive change across most of their lifespan. 547,750 SNPs ranked by position are divided into 10,908 overlapping regions of 101 SNPs to estimate the genetic variance each region explains, an approach that resembles classical linkage methods. We also estimate the genetic variation explained by individual autosomes and by SNPs within genes. Empirical significance thresholds are estimated separately for each trait from whole genome scans of 500 permuted data sets. The 5% significance threshold for the likelihood ratio test of a single region ranged from 17–17.5 for the three traits. This is the equivalent to nominal significance under the expectation of a chi-squared distribution (between 1df and 0) of P<1.44×10−5. These thresholds indicate that the distribution of the likelihood ratio test from this type of variance component analysis should be estimated empirically. Furthermore, we show that estimates of variation explained by these regions can be grossly overestimated. After applying permutation thresholds, a region for gf on chromosome 5 spanning the PRRC1 gene is significant at a genome-wide 10% empirical threshold. 
Analysis of gene methylation on the temporal cortex provides support for the association of PRRC1 and fluid intelligence (P = 0.004), and provides a prime candidate gene for high throughput sequencing of these uniquely informative cohorts. PMID:24349040
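
    The division of position-ranked SNPs into overlapping regions of 101 SNPs can be sketched as below. The abstract gives the window size but not the overlap, so the step of 50 SNPs is an assumption, as is the handling of the final partial window. The function `overlapping_windows` is hypothetical, and it makes no attempt to reproduce the 10,908 regions reported, which presumably were formed within chromosomes.

```python
from typing import List, Tuple


def overlapping_windows(n_snps: int, size: int = 101, step: int = 50) -> List[Tuple[int, int]]:
    """Half-open (start, end) index ranges of overlapping SNP windows.

    size=101 follows the abstract; step=50 is an assumed overlap.
    A final window is anchored at the end so trailing SNPs are covered.
    """
    if n_snps < size:
        return []
    windows = [(s, s + size) for s in range(0, n_snps - size + 1, step)]
    if windows[-1][1] < n_snps:
        windows.append((n_snps - size, n_snps))
    return windows


# Toy chromosome with 250 position-ranked SNPs.
for start, end in overlapping_windows(250):
    print(start, end)
```

    Each window would then be fitted as a separate variance component to estimate the genetic variance that region explains.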

  12. Association of SNP3 polymorphism in the apolipoprotein A-V gene with plasma triglyceride level in Tunisian type 2 diabetes

    PubMed Central

    Chaaba, Raja; Attia, Nebil; Hammami, Sonia; Smaoui, Maha; Mahjoub, Sylvia; Hammami, Mohamed; Masmoudi, Ahmed Slaheddine

    2005-01-01

    Background Apolipoprotein A-V (Apo A-V) gene has recently been identified as a new apolipoprotein involved in triglyceride metabolism. A single nucleotide polymorphism (SNP3) located in the gene promoter (-1131) was associated with triglyceride variation in healthy subjects. In type 2 diabetes the triglyceride level increased compared to healthy subjects. Hypertriglyceridemia is a risk factor for coronary artery disease. We aimed to examine the interaction between SNP3 and lipid profile and coronary artery disease (CAD) in Tunisian type 2 diabetic patients. Results The genotype frequencies of T/T, T/C and C/C were 0.74, 0.23 and 0.03 respectively in non-diabetic subjects, 0.71, 0.25 and 0.04 respectively in type 2 diabetic patients. Triglyceride level was higher in heterozygous genotype (-1131 T/C) of apo A-V (p = 0.024). Heterozygous genotype is more frequent in high triglyceride group (40.9%) than in low triglyceride group (18.8%); p = 0.011. Despite the relation between CAD and hypertriglyceridemia, SNP3 was not associated with CAD. Conclusion In type 2 diabetic patients SNP3 is associated with triglyceride level; however, there was no association between SNP3 and coronary artery disease. PMID:15636639
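
    The genotype frequencies reported above imply allele frequencies by simple allele counting: each heterozygote contributes half of its frequency to each allele. A minimal worked sketch follows. The function name `allele_freqs` is illustrative, and the derived allele frequencies are computed here rather than quoted from the abstract.

```python
from typing import Tuple


def allele_freqs(f_tt: float, f_tc: float, f_cc: float) -> Tuple[float, float]:
    """Return (freq_T, freq_C) from genotype frequencies at a bi-allelic SNP."""
    total = f_tt + f_tc + f_cc  # normalise in case of rounding in the inputs
    freq_t = (f_tt + 0.5 * f_tc) / total
    return freq_t, 1.0 - freq_t


# Genotype frequencies (T/T, T/C, C/C) as reported in the abstract.
controls = allele_freqs(0.74, 0.23, 0.03)
diabetics = allele_freqs(0.71, 0.25, 0.04)

print(round(controls[1], 3))   # C allele frequency, non-diabetic subjects
print(round(diabetics[1], 3))  # C allele frequency, type 2 diabetic patients
```

    With these inputs the C allele frequency works out to 0.145 in non-diabetic subjects and 0.165 in type 2 diabetic patients.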

  13. Apolipoprotein H promoter polymorphisms in relation to lupus and lupus-related phenotypes.

    PubMed

    Suresh, Sangita; Demirci, F Yesim K; Jacobs, Erin; Kao, Amy H; Rhew, Elisa Y; Sanghera, Dharambir K; Selzer, Faith; Sutton-Tyrrell, Kim; McPherson, David; Bontempo, Franklin A; Kammerer, Candace M; Ramsey-Goldman, Rosalind; Manzi, Susan; Kamboh, M Ilyas

    2009-02-01

    Sequence variation in gene promoters is often associated with disease risk. We tested the hypothesis that common promoter variation in the APOH gene (encoding β2-glycoprotein I) is associated with systemic lupus erythematosus (SLE) risk and SLE-related clinical phenotypes in a Caucasian cohort. We used a case-control design and genotyped 345 women with SLE and 454 healthy control women for 8 APOH promoter single-nucleotide polymorphisms (SNP; -1284C>G, -1219G>A, -1190G>C, -759A>G, -700C>A, -643T>C, -38G>A, and -32C>A). Association analyses were performed on single SNP and haplotypes. Haplotype analyses were performed using EH (Estimate Haplotype-frequencies) and Haploview programs. In vitro reporter gene assay was performed in COS-1 cells. Electrophoretic mobility shift assay (EMSA) was performed using HepG2 nuclear cells. Overall haplotype distribution of the APOH promoter SNP was significantly different between cases and controls (p = 0.009). The -643C allele was found to be protective against carotid plaque formation (adjusted OR 0.37, p = 0.013) among patients with SLE. The -643C allele was associated with a ~2-fold decrease in promoter activity as compared to wild-type -643T allele (mean ± standard deviation: 3.94 ± 0.05 vs 6.99 ± 0.68, p = 0.016). EMSA showed that the -643T>C SNP harbors a binding site for a nuclear factor. The -1219G>A SNP showed a significant association with the risk of lupus nephritis (age-adjusted OR 0.36, p = 0.016). Our data indicate that APOH promoter variants may be involved in the etiology of SLE, especially the risk for autoimmune-mediated cardiovascular disease.

  14. Characterization of the acute heat stress response in gilts: III. Genome-wide association studies of thermotolerance traits in pigs.

    PubMed

    Kim, Kwan-Suk; Seibert, Jacob T; Edea, Zewde; Graves, Kody L; Kim, Eui-Soo; Keating, Aileen F; Baumgard, Lance H; Ross, Jason W; Rothschild, Max F

    2018-06-04

    Heat stress (HS) is one of the limiting factors negatively affecting pig production, health, and fertility. Characterizing genomic regions responsible for variation in HS tolerance would be useful in identifying important genetic factor(s) regulating physiological responses to HS. In the present study, we performed genome-wide association analyses for respiration rate (RR), rectal temperature (TR), and skin temperature (TS) during HS in 214 crossbred gilts genotyped for 68,549 single nucleotide polymorphisms (SNP) using the Porcine SNP 70K BeadChip. Considering the top 0.1% smoothed phenotypic variances explained by SNP windows, we detected 26, 26, 21, and 14 genes residing within the SNP windows that explained the largest proportion of variance (top 25 SNP windows) and were associated with change in RR (ΔRR) from thermoneutral (TN) conditions to HS environment, as well as the change in prepubertal TR (ΔTR), change in postpubertal ΔTR, and change in TS (ΔTS), respectively. The region between 28.85 Mb and 29.10 Mb on chromosome 16 explained about 0.05% of the observed variation for ΔRR. The growth hormone receptor (GHR) gene resides in this region and is associated with the HS response. The other important candidate genes associated with ΔRR (PAIP1, NNT, and TEAD4), ΔTR (LIMS2, TTR, and TEAD4), and ΔTS (ERBB4, FKBP1B, NFATC2, and ATP9A) have reported roles in the cellular stress response. The SNP explaining the largest proportion of variance were located within or in the vicinity of genes related to apoptosis or cellular stress and are potential candidates that underlie the physiological response to HS in pigs.

  15. Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

    PubMed Central

    Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

    2015-01-01

    Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880

  16. Genetic variation affecting exon skipping contributes to brain structural atrophy in Alzheimer's disease.

    PubMed

    Lee, Younghee; Han, Seonggyun; Kim, Dongwook; Kim, Dokyoon; Horgousluoglu, Emrin; Risacher, Shannon L; Saykin, Andrew J; Nho, Kwangsik

    2018-01-01

    Genetic variation in cis-regulatory elements related to splicing machinery and splicing regulatory elements (SREs) results in exon skipping and undesired protein products. We developed a splicing decision model to identify actionable loci among common SNPs for gene regulation. The splicing decision model identified SNPs affecting exon skipping by analyzing sequence-driven alternative splicing (AS) models and by scanning the genome for the regions with putative SRE motifs. We used non-Hispanic Caucasians with neuroimaging and fluid biomarkers for Alzheimer's disease (AD) and identified 17,088 common exonic SNPs affecting exon skipping. GWAS identified one SNP (rs1140317) in HLA-DQB1 as significantly associated with entorhinal cortical thickness, an AD neuroimaging biomarker, after controlling for multiple testing. Further analysis revealed that rs1140317 was significantly associated with brain amyloid-β deposition (PET and CSF). HLA-DQB1 is an essential immune gene and may regulate AS, thereby contributing to AD pathology. SRE may hold potential as novel therapeutic targets for AD.

  17. SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations

    PubMed Central

    van der Heide, Han G. J.; Heuvelman, Kees J.; Kallonen, Teemu; He, Qiushui; Mertsola, Jussi; Advani, Abdolreza; Hallander, Hans O.; Janssens, Koen; Hermans, Peter W.; Mooi, Frits R.

    2011-01-01

    To monitor changes in Bordetella pertussis populations, mainly two typing methods are used; Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in the Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis. PMID:21647370
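
    The discriminatory indices quoted above compare how finely each typing method splits a strain collection. The abstract does not say how they were computed; a common choice for typing methods is Simpson's index of diversity in the Hunter-Gaston form, and the sketch below assumes that formula. The function name and the six type labels are made up for illustration and are not data from the study.

```python
from collections import Counter


def discriminatory_index(types):
    """Simpson's index of diversity (Hunter-Gaston form) for a list of type labels.

    Assumption: this is the index meant in the abstract; the source does not say.
    D = 1 - sum(n_j * (n_j - 1)) / (N * (N - 1)), where n_j is the number of
    isolates assigned to type j and N is the total number of isolates.
    """
    n = len(types)
    if n < 2:
        raise ValueError("need at least two isolates")
    counts = Counter(types).values()
    return 1.0 - sum(c * (c - 1) for c in counts) / (n * (n - 1))


# Hypothetical example: six isolates assigned to three made-up types.
example = ["T1", "T1", "T1", "T2", "T2", "T3"]
print(round(discriminatory_index(example), 3))
```

    A value of 1 means every isolate received its own type, and a value of 0 means all isolates were assigned the same type.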

  18. Combined array CGH plus SNP genome analyses in a single assay for optimized clinical testing

    PubMed Central

    Wiszniewska, Joanna; Bi, Weimin; Shaw, Chad; Stankiewicz, Pawel; Kang, Sung-Hae L; Pursley, Amber N; Lalani, Seema; Hixson, Patricia; Gambin, Tomasz; Tsai, Chun-hui; Bock, Hans-Georg; Descartes, Maria; Probst, Frank J; Scaglia, Fernando; Beaudet, Arthur L; Lupski, James R; Eng, Christine; Wai Cheung, Sau; Bacino, Carlos; Patel, Ankita

    2014-01-01

    In clinical diagnostics, both array comparative genomic hybridization (array CGH) and single nucleotide polymorphism (SNP) genotyping have proven to be powerful genomic technologies utilized for the evaluation of developmental delay, multiple congenital anomalies, and neuropsychiatric disorders. Differences in the ability to resolve genomic changes between these arrays may constitute an implementation challenge for clinicians: which platform (SNP vs array CGH) might best detect the underlying genetic cause for the disease in the patient? While only SNP arrays enable the detection of copy number neutral regions of absence of heterozygosity (AOH), they have limited ability to detect single-exon copy number variants (CNVs) due to the distribution of SNPs across the genome. To provide comprehensive clinical testing for both CNVs and copy-neutral AOH, we enhanced our custom-designed high-resolution oligonucleotide array that has exon-targeted coverage of 1860 genes with 60 000 SNP probes, referred to as Chromosomal Microarray Analysis – Comprehensive (CMA-COMP). Of the 3240 cases evaluated by this array, clinically significant CNVs were detected in 445 cases including 21 cases with exonic events. In addition, 162 cases (5.0%) showed at least one AOH region >10 Mb. We demonstrate that even though this array has a lower density of SNP probes than other commercially available SNP arrays, it reliably detected AOH events >10 Mb as well as exonic CNVs beyond the detection limitations of SNP genotyping. Thus, combining SNP probes and exon-targeted array CGH into one platform provides clinically useful genetic screening in an efficient manner. PMID:23695279

  19. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie

    2014-06-18

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  20. Genome-wide Selective Sweeps in Natural Bacterial Populations Revealed by Time-series Metagenomics

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chan, Leong-Keat; Bendall, Matthew L.; Malfatti, Stephanie

    2014-05-12

    Multiple evolutionary models have been proposed to explain the formation of genetically and ecologically distinct bacterial groups. Time-series metagenomics enables direct observation of evolutionary processes in natural populations, and if applied over a sufficiently long time frame, this approach could capture events such as gene-specific or genome-wide selective sweeps. Direct observations of either process could help resolve how distinct groups form in natural microbial assemblages. Here, from a three-year metagenomic study of a freshwater lake, we explore changes in single nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in populations of Chlorobiaceae and Methylophilaceae. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied considerably among closely related, co-occurring Methylophilaceae populations. SNP allele frequencies, as well as the relative abundance of certain genes, changed dramatically over time in each population. Interestingly, SNP diversity was purged at nearly every genome position in one of the Chlorobiaceae populations over the course of three years, while at the same time multiple genes either swept through or were swept from this population. These patterns were consistent with a genome-wide selective sweep, a process predicted by the ‘ecotype model’ of diversification, but not previously observed in natural populations.

  1. Genome-Wide Associations between Genetic and Epigenetic Variation Influence mRNA Expression and Insulin Secretion in Human Pancreatic Islets

    PubMed Central

    Olsson, Anders H.; Volkov, Petr; Bacos, Karl; Dayeh, Tasnim; Hall, Elin; Nilsson, Emma A.; Ladenvall, Claes; Rönn, Tina; Ling, Charlotte

    2014-01-01

    Genetic and epigenetic mechanisms may interact and together affect biological processes and disease development. However, most previous studies have investigated genetic and epigenetic mechanisms independently, and studies examining their interactions throughout the human genome are lacking. To identify genetic loci that interact with the epigenome, we performed the first genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human pancreatic islets. We related 574,553 single nucleotide polymorphisms (SNPs) with genome-wide DNA methylation data of 468,787 CpG sites targeting 99% of RefSeq genes in islets from 89 donors. We identified 67,438 SNP-CpG pairs in cis, corresponding to 36,783 SNPs (6.4% of tested SNPs) and 11,735 CpG sites (2.5% of tested CpGs), and 2,562 significant SNP-CpG pairs in trans, corresponding to 1,465 SNPs (0.3% of tested SNPs) and 383 CpG sites (0.08% of tested CpGs), showing significant associations after correction for multiple testing. These include reported diabetes loci, e.g. ADCY5, KCNJ11, HLA-DQA1, INS, PDX1 and GRB10. CpGs of significant cis-mQTLs were overrepresented in the gene body and outside of CpG islands. Follow-up analyses further identified mQTLs associated with gene expression and insulin secretion in human islets. Causal inference test (CIT) identified SNP-CpG pairs where DNA methylation in human islets is the potential mediator of the genetic association with gene expression or insulin secretion. Functional analyses further demonstrated that identified candidate genes (GPX7, GSTT1 and SNX19) directly affect key biological processes such as proliferation and apoptosis in pancreatic β-cells. Finally, we found direct correlations between DNA methylation of 22,773 (4.9%) CpGs with mRNA expression of 4,876 genes, where 90% of the correlations were negative when CpGs were located in the region surrounding transcription start site. 
Our study demonstrates for the first time how genome-wide genetic and epigenetic variation interacts to influence gene expression, islet function and potential diabetes risk in humans. PMID:25375650

  2. Genetic Variations in Magnesium-Related Ion Channels May Affect Diabetes Risk among African American and Hispanic American Women

    PubMed Central

    Chan, Kei Hang K; Chacko, Sara A; Song, Yiqing; Cho, Michele; Eaton, Charles B; Wu, Wen-Chih H; Liu, Simin

    2015-01-01

    Background: Prospective studies consistently link low magnesium intake to higher type 2 diabetes (T2D) risk. Objective: We examined the association of common genetic variants [single nucleotide polymorphisms (SNPs)] in genes related to magnesium homeostasis with T2D risk and potential interactions with magnesium intake. Methods: Using the Women's Health Initiative-SNP Health Association Resource (WHI-SHARe) study, we identified 17 magnesium-related ion channel genes (583 SNPs) and examined their associations with T2D risk in 7287 African-American (AA; n = 1949 T2D cases) and 3285 Hispanic-American (HA; n = 611 T2D cases) postmenopausal women. We performed both single- and multiple-locus haplotype analyses. Results: Among AA women, carriers of each additional copy of SNP rs6584273 in cyclin mediator 1 (CNNM1) had 16% lower T2D risk [OR: 0.84; false discovery rate (FDR)-adjusted P = 0.02]. Among HA women, several variants were significantly associated with T2D risk, including rs10861279 in solute carrier family 41 (anion exchanger), member 2 (SLC41A2) (OR: 0.54; FDR-adjusted P = 0.04), rs7174119 in nonimprinted in Prader-Willi/Angelman syndrome 1 (NIPA1) (OR: 1.27; FDR-adjusted P = 0.04), and 2 SNPs in mitochondrial RNA splicing 2 (MRS2) (rs7738943: OR = 1.55, FDR-adjusted P = 0.01; rs1056285: OR = 1.48, FDR-adjusted P = 0.02). Even with the most conservative Bonferroni adjustment, two 2-SNP-haplotypes in SLC41A2 and MRS2 region were significantly associated with T2D risk (rs12582312-rs10861279: P = 0.0006; rs1056285-rs7738943: P = 0.002). 
Among women with magnesium intake in the lowest 30% (AA: ≤0.164 g/d; HA: ≤0.185 g/d), 4 SNP signals were strengthened [rs11590362 in claudin 19 (CLDN19), rs823154 in SLC41A1, rs5929706 and rs5930817 in membra; HA: ≥0.313 g/d), rs6584273 in CNNM1 (OR: 0.71; FDR-adjusted P = 0.04) and rs1800467 in potassium inwardly rectifying channel, subfamily J, member 11 (KCNJ11) (OR: 2.50; FDR-adjusted P = 0.01) were significantly associated with T2D risk. Conclusions: Our findings suggest important associations between genetic variations in magnesium-related ion channel genes and T2D risk in AA and HA women that vary by amount of magnesium intake. PMID:25733456

  3. Genome-wide single nucleotide polymorphisms reveal population history and adaptive divergence in wild guppies.

    PubMed

    Willing, Eva-Maria; Bentzen, Paul; van Oosterhout, Cock; Hoffmann, Margarete; Cable, Joanne; Breden, Felix; Weigel, Detlef; Dreyer, Christine

    2010-03-01

    Adaptation of guppies (Poecilia reticulata) to contrasting upland and lowland habitats has been extensively studied with respect to behaviour, morphology and life history traits. Yet population history has not been studied at the whole-genome level. Although single nucleotide polymorphisms (SNPs) are the most abundant form of variation in many genomes and consequently very informative for a genome-wide picture of standing natural variation in populations, genome-wide SNP data are rarely available for wild vertebrates. Here we use genetically mapped SNP markers to comprehensively survey genetic variation within and among naturally occurring guppy populations from a wide geographic range in Trinidad and Venezuela. Results from three different clustering methods, Neighbor-net, principal component analysis (PCA) and Bayesian analysis show that the population substructure agrees with geographic separation and largely with previously hypothesized patterns of historical colonization. Within major drainages (Caroni, Oropouche and Northern), populations are genetically similar, but those in different geographic regions are highly divergent from one another, with some indications of ancient shared polymorphisms. Clear genomic signatures of a previous introduction experiment were seen, and we detected additional potential admixture events. Headwater populations were significantly less heterozygous than downstream populations. Pairwise F(ST) values revealed marked differences in allele frequencies among populations from different regions, and also among populations within the same region. F(ST) outlier methods indicated some regions of the genome as being under directional selection. Overall, this study demonstrates the power of a genome-wide SNP data set to inform for studies on natural variation, adaptation and evolution of wild populations.

  4. Variation in conserved non-coding sequences on chromosome 5q and susceptibility to asthma and atopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Donfack, Joseph; Schneider, Daniel H.; Tan, Zheng

    2005-09-10

    Background: Evolutionarily conserved sequences likely have biological function. Methods: To determine whether variation in conserved sequences in non-coding DNA contributes to risk for human disease, we studied six conserved non-coding elements in the Th2 cytokine cluster on human chromosome 5q31 in a large Hutterite pedigree and in samples of outbred European American and African American asthma cases and controls. Results: Among six conserved non-coding elements (>100 bp, >70 percent identity; human-mouse comparison), we identified one single nucleotide polymorphism (SNP) in each of two conserved elements and six SNPs in the flanking regions of three conserved elements. We genotyped our samples for four of these SNPs and an additional three SNPs each in the IL13 and IL4 genes. While there was only modest evidence for association with single SNPs in the Hutterite and European American samples (P<0.05), there were highly significant associations in European Americans between asthma and haplotypes comprised of SNPs in the IL4 gene (P<0.001), including a SNP in a conserved non-coding element. Furthermore, variation in the IL13 gene was strongly associated with total IgE (P = 0.00022) and allergic sensitization to mold allergens (P = 0.00076) in the Hutterites, and more modestly associated with sensitization to molds in the European Americans and African Americans (P<0.01). Conclusion: These results indicate that there is overall little variation in the conserved non-coding elements on 5q31, but variation in IL4 and IL13, including possibly one SNP in a conserved element, influence asthma and atopic phenotypes in diverse populations.

  5. Genome-wide copy number variant analysis reveals variants associated with 10 diverse production traits in Holstein cattle

    USDA-ARS's Scientific Manuscript database

    Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...

  6. Genetic variation of apolipoproteins, diet and other environmental interactions; an updated review.

    PubMed

    Sotos-Prieto, Mercedes; Peñalvo, José Luis

    2013-01-01

    This paper summarizes the recent findings from studies investigating the potential environmental modulation of the genetic variation of apolipoprotein genes on metabolic traits. We reviewed nutrigenetic studies evaluating variations in apolipoprotein-related genes and the associated response to nutrients (mostly dietary fatty acids) or any other dietary or environmental component. Most of the reviewed research studied single nucleotide polymorphisms (SNPs) and specific nutrients through small intervention studies, and only a few interactions have been replicated in large and independent populations (as in the case of the -265T > C SNP in the APOA2 gene). Although current knowledge shows that variations in apolipoprotein genes may contribute to differences in the response of metabolic traits to dietary interventions, evidence is still scarce and results are inconsistent. Success in this area will require going beyond the limitations of current experimental designs and exploring the hypotheses within large populations. Some of these limitations are being addressed by rapid advances in high-throughput technologies and large-scale genome-wide association studies. Copyright © AULA MEDICA EDICIONES 2013. Published by AULA MEDICA. All rights reserved.

  7. Single Nucleotide Polymorphism (SNP)-Strings: An Alternative Method for Assessing Genetic Associations

    PubMed Central

    Goodin, Douglas S.; Khankhanian, Pouya

    2014-01-01

    Background Genome-wide association studies (GWAS) identify disease-associations for single-nucleotide-polymorphisms (SNPs) from scattered genomic-locations. However, SNPs frequently reside on several different SNP-haplotypes, only some of which may be disease-associated. This circumstance lowers the observed odds-ratio for disease-association. Methodology/Principal Findings Here we develop a method to identify the two SNP-haplotypes, which combine to produce each person’s SNP-genotype over specified chromosomal segments. Two multiple sclerosis (MS)-associated genetic regions were modeled; DRB1 (a Class II molecule of the major histocompatibility complex) and MMEL1 (an endopeptidase that degrades both neuropeptides and β-amyloid). For each locus, we considered sets of eleven adjacent SNPs, surrounding the putative disease-associated gene and spanning ∼200 kb of DNA. The SNP-information was converted into an ordered-set of eleven-numbers (subject-vectors) based on whether a person had zero, one, or two copies of particular SNP-variant at each sequential SNP-location. SNP-strings were defined as those ordered-combinations of eleven-numbers (0 or 1), representing a haplotype, two of which combined to form the observed subject-vector. Subject-vectors were resolved using probabilistic methods. In both regions, only a small number of SNP-strings were present. We compared our method to the SHAPEIT-2 phasing-algorithm. When the SNP-information spanning 200 kb was used, SHAPEIT-2 was inaccurate. When the SHAPEIT-2 window was increased to 2,000 kb, the concordance between the two methods, in both of these eleven-SNP regions, was over 99%, suggesting that, in these regions, both methods were quite accurate. Nevertheless, correspondence was not uniformly high over the entire DNA-span but, rather, was characterized by alternating peaks and valleys of concordance. 
Moreover, in the valleys of poor-correspondence, SHAPEIT-2 was also inconsistent with itself, suggesting that the SNP-string method is more accurate across the entire region. Conclusions/Significance Accurate haplotype identification will enhance the detection of genetic-associations. The SNP-string method provides a simple means to accomplish this and can be extended to cover larger genomic regions, thereby improving a GWAS’s power, even for those published previously. PMID:24727690
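
    The relationship between a subject-vector (0, 1, or 2 copies at each SNP) and the pair of 0/1 SNP-strings that sum to it can be illustrated with a small enumeration. This is only a sketch of the combinatorial step: the abstract states that subject-vectors were resolved with probabilistic methods, which are not reproduced here, and the function name and the four-SNP example are hypothetical.

```python
from itertools import product


def haplotype_pairs(subject_vector):
    """List every unordered pair of 0/1 strings whose element-wise sum
    equals the subject vector (entries must be 0, 1, or 2)."""
    per_site = []
    for g in subject_vector:
        if g == 0:
            per_site.append([(0, 0)])          # both haplotypes lack the variant
        elif g == 2:
            per_site.append([(1, 1)])          # both haplotypes carry the variant
        elif g == 1:
            per_site.append([(0, 1), (1, 0)])  # heterozygous: phase is ambiguous
        else:
            raise ValueError("genotype counts must be 0, 1, or 2")
    pairs = set()
    for combo in product(*per_site):
        h1 = tuple(a for a, _ in combo)
        h2 = tuple(b for _, b in combo)
        pairs.add(tuple(sorted((h1, h2))))     # ignore which haplotype is listed first
    return sorted(pairs)


# Hypothetical four-SNP subject vector with two heterozygous sites.
for h1, h2 in haplotype_pairs([0, 1, 2, 1]):
    print(h1, h2)
```

    With k heterozygous sites there are 2^(k-1) candidate pairs (one pair when k = 0), which is why a further step is needed to choose among them once several sites are heterozygous.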

  8. SNP Data Quality Control in a National Beef and Dairy Cattle System and Highly Accurate SNP Based Parentage Verification and Identification

    PubMed Central

    McClure, Matthew C.; McCarthy, John; Flynn, Paul; McClure, Jennifer C.; Dair, Emma; O'Connell, D. K.; Kearney, John F.

    2018-01-01

    A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types (beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size), the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non-matching genotypes per animal, SNP duplicates, sex and breed prediction mismatches, parentage and progeny validation results, and other situations. The Animal QC pipeline makes use of the ICBF800 SNP set where appropriate to identify errors in a computationally efficient yet still highly accurate method. PMID:29599798
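
    The ≤1% misconcordance threshold mentioned above can be illustrated with a small calculation. The abstract does not define how ICBF computes the rate; the sketch below assumes the usual parent-offspring exclusion rule, in which a conflict is a SNP where the two animals are opposite homozygotes. The function name, the 0/1/2 genotype coding, and the example genotypes are hypothetical.

```python
def misconcordance_rate(parent, offspring):
    """Fraction of jointly called SNPs at which parent and offspring are
    opposing homozygotes (0 vs 2), which excludes a true parent-offspring pair.

    Genotypes are coded as counts of one allele (0, 1, 2); None marks a no-call.
    Assumption: this definition of misconcordance is illustrative, not ICBF's.
    """
    compared = 0
    conflicts = 0
    for p, o in zip(parent, offspring):
        if p is None or o is None:
            continue                      # skip SNPs not called in both animals
        compared += 1
        if {p, o} == {0, 2}:
            conflicts += 1                # opposite homozygotes
    if compared == 0:
        raise ValueError("no SNPs called in both animals")
    return conflicts / compared


# Hypothetical genotypes for a candidate parent and a calf at six SNPs.
candidate_parent = [0, 1, 2, 2, None, 0]
calf = [1, 1, 0, 2, 2, 0]
rate = misconcordance_rate(candidate_parent, calf)
print(f"{rate:.2f}", "accepted" if rate <= 0.01 else "rejected")
```

    With only a handful of SNPs a single conflict already exceeds a 1% threshold, which is consistent with the abstract's point that panel size matters for reliable parentage calls.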

  9. Genetic Association Study of KCNQ5 Polymorphisms with High Myopia.

    PubMed

    Liao, Xuan; Yap, Maurice K H; Leung, Kim Hung; Kao, Patrick Y P; Liu, Long Qian; Yip, Shea Ping

    2017-01-01

    Identification of genetic variations related to high myopia may advance our knowledge of the etiopathogenesis of refractive error. This study investigated the role of potassium channel gene (KCNQ5) polymorphisms in high myopia. We performed a case-control study of 1563 unrelated Han Chinese subjects (809 cases of high myopia and 754 emmetropic controls). Five tag single-nucleotide polymorphisms (SNPs) of KCNQ5 were genotyped, and association testing with high myopia was conducted using logistic regression analysis adjusted for sex and age to give P_asym values, and multiple comparisons were corrected by permutation test to give P_emp values. All five noncoding SNPs were associated with high myopia. The SNP rs7744813, previously shown to be associated with refractive error and myopia in two GWAS, showed an odds ratio of 0.75 (95% CI 0.63-0.90; P_emp = 0.0058) for the minor allele. The top SNP rs9342979 showed an odds ratio of 0.75 (95% CI 0.64-0.89; P_emp = 0.0045) for the minor allele. Both SNPs are located within enhancer histone marks and DNase-hypersensitive sites. Our data support the involvement of KCNQ5 gene polymorphisms in the genetic susceptibility to high myopia and further exploration of KCNQ5 as a risk factor for high myopia.

  10. Single Locked Nucleic Acid-Enhanced Nanopore Genetic Discrimination of Pathogenic Serotypes and Cancer Driver Mutations.

    PubMed

    Tian, Kai; Chen, Xiaowei; Luan, Binquan; Singh, Prashant; Yang, Zhiyu; Gates, Kent S; Lin, Mengshi; Mustapha, Azlin; Gu, Li-Qun

    2018-05-22

    Accurate and rapid detection of single-nucleotide polymorphism (SNP) in pathogenic mutants is crucial for many fields such as food safety regulation and disease diagnostics. Current detection methods involve laborious sample preparations and expensive characterizations. Here, we investigated a single locked nucleic acid (LNA) approach, facilitated by a nanopore single-molecule sensor, to accurately determine SNPs for detection of Shiga toxin producing Escherichia coli (STEC) serotype O157:H7, and cancer-derived EGFR L858R and KRAS G12D driver mutations. Current LNA applications require incorporation and optimization of multiple LNA nucleotides. However, we found that in the nanopore system, a single LNA introduced in the probe is sufficient to enhance the SNP discrimination capability by over 10-fold, allowing accurate detection of the pathogenic mutant DNA mixed in a large amount of the wild-type DNA. Importantly, the molecular mechanistic study suggests that such a significant improvement is due to the effect of the single LNA, which both stabilizes the fully matched base-pair and destabilizes the mismatched base-pair. This sensitive method, with a simplified, low cost, easy-to-operate LNA design, could be generalized for various applications that need rapid and accurate identification of single-nucleotide variations.

  11. Discovery of gene-gene interactions across multiple independent data sets of late onset Alzheimer disease from the Alzheimer Disease Genetics Consortium.

    PubMed

    Hohman, Timothy J; Bush, William S; Jiang, Lan; Brown-Gentry, Kristin D; Torstenson, Eric S; Dudek, Scott M; Mukherjee, Shubhabrata; Naj, Adam; Kunkle, Brian W; Ritchie, Marylyn D; Martin, Eden R; Schellenberg, Gerard D; Mayeux, Richard; Farrer, Lindsay A; Pericak-Vance, Margaret A; Haines, Jonathan L; Thornton-Wells, Tricia A

    2016-02-01

    Late-onset Alzheimer disease (AD) has a complex genetic etiology, involving locus heterogeneity, polygenic inheritance, and gene-gene interactions; however, the investigation of interactions in recent genome-wide association studies has been limited. We used a biological knowledge-driven approach to evaluate gene-gene interactions for consistency across 13 data sets from the Alzheimer Disease Genetics Consortium. Fifteen single nucleotide polymorphism (SNP)-SNP pairs within 3 gene-gene combinations were identified: SIRT1 × ABCB1, PSAP × PEBP4, and GRIN2B × ADRA1A. In addition, we extend a previously identified interaction from an endophenotype analysis between RYR3 × CACNA1C. Finally, post hoc gene expression analyses of the implicated SNPs further implicate SIRT1 and ABCB1, and implicate CDH23 which was most recently identified as an AD risk locus in an epigenetic analysis of AD. The observed interactions in this article highlight ways in which genotypic variation related to disease may depend on the genetic context in which it occurs. Further, our results highlight the utility of evaluating genetic interactions to explain additional variance in AD risk and identify novel molecular mechanisms of AD pathogenesis. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Variations in apolipoprotein D and sigma non-opioid intracellular receptor 1 genes with relation to risk, severity and outcome of ischemic stroke.

    PubMed

    Lövkvist, Håkan; Jönsson, Ann-Cathrin; Luthman, Holger; Jood, Katarina; Jern, Christina; Wieloch, Tadeusz; Lindgren, Arne

    2014-09-28

    In experimental studies, the apolipoprotein D (APOD) and the sigma receptor type 1 (SIGMAR1) have been related to processes of brain damage, repair and plasticity. We examined blood samples from 3081 ischemic stroke (IS) patients and 1595 control subjects regarding 10 single nucleotide polymorphisms (SNPs) in the APOD (chromosomal location 3q29) and SIGMAR1 (chromosomal location 9p13) genes to find possible associations with IS risk, IS severity (NIHSS-score) and recovery after IS (modified Rankin Scale, mRS, at 90 days). Simple/multiple logistic regression and Spearman's rho were utilized for the analyses. Among the SNPs analyzed, rs7659 within the APOD gene showed a possible association with stroke risk (OR = 1.12; 95% CI: 1.01-1.25; P = 0.029) and stroke severity (NIHSS ≥ 16) (OR = 0.70; 95% CI: 0.54-0.92; P = 0.009) when controlling for age, sex and vascular risk factors for stroke. No SNP showed an association with stroke recovery (mRS). We conclude that the SNP rs7659 within the APOD gene might be related to risk and severity of ischemic stroke in patients.
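
    The odds ratios and confidence intervals quoted above come from the authors' adjusted logistic regression models, which cannot be reproduced from the abstract. As a minimal, unadjusted sketch of how an OR and its Wald 95% CI can be obtained from a 2×2 allele-by-status table, assuming hypothetical counts (the function name `odds_ratio_ci` and every number below are illustrative, not taken from the study):

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a: cases carrying the allele      b: cases without it
    c: controls carrying the allele   d: controls without it
    z=1.96 gives an approximate 95% interval.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for a 2x2 table
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, chosen only to make the arithmetic easy to follow
or_value, ci_low, ci_high = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {or_value:.2f}, 95% CI: {ci_low:.2f}-{ci_high:.2f}")
```

    An interval that excludes 1, as reported for rs7659, is what marks an association as nominally significant at the 5% level; adjusting for age, sex and vascular risk factors requires a regression model rather than this single table.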

  13. Relationship Between Some Single-nucleotide Polymorphism and Response to Hydroxyurea Therapy in Iranian Patients With β-Thalassemia Intermedia.

    PubMed

    Karimi, Mehran; Zarei, Tahereh; Haghpanah, Sezaneh; Moghadam, Mohamad; Ebrahimi, Ahmad; Rezaei, Narges; Heidari, Ghazaleh; Vazin, Afsaneh; Khavari, Maryam; Miri, Hamid R

    2017-05-01

    The aim was to evaluate the possible relationship between hydroxyurea (HU) response and selected single-nucleotide polymorphisms (SNPs) in patients affected by β-thalassemia intermedia. In this cross-sectional study, 100 β-thalassemia intermedia patients who were taking HU with a dose of 8 to 15 mg/kg body weight per day for a period of at least 6 months were randomly selected between February 2013 and October 2014 in southern Iran. HU response was defined based on decrease or cessation of the blood transfusion need and evaluation of Hb level. In univariate analysis, of all evaluated SNPs, only the rs10837814 SNP of the olfactory receptor (OR) gene OR51B2 showed a significant association with HU response (P=0.038), and of the laboratory characteristics, only nucleated red blood cells showed a significant association (116%±183% in good responders versus 264%±286% in poor responders; P=0.045). In multiple logistic regression, neither laboratory variables nor SNPs showed a significant association with HU response. Three novel nucleotide variations (-665 [A→C], -1301 [T→G], -1199 delA) in the OR51B2 gene were found in good responders. None of the evaluated SNPs in our study showed significant association with HU response. Further larger studies and evaluation of other genes are suggested.

  14. When Whole-Genome Alignments Just Won't Work: kSNP v2 Software for Alignment-Free SNP Discovery and Phylogenetics of Hundreds of Microbial Genomes

    PubMed Central

    Gardner, Shea N.; Hall, Barry G.

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four “raw read” genomes of O104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths. PMID:24349125
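
    The idea of alignment-free, k-mer-based SNP discovery described above can be illustrated with a toy sketch. This is not the kSNP implementation: it assumes that a candidate SNP is a k-mer context (odd k, central base masked) that occurs with different central bases in different genomes, and it ignores reverse complements, repeated contexts within one genome, and all of kSNP's filtering. The function names and the two short sequences are hypothetical.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every substring of length k in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def candidate_snps(genomes, k=5):
    """Return contexts whose central base differs between genomes.

    genomes maps a genome name to its sequence. Each k-mer is keyed by its
    flanking bases with the central base replaced by '.', so k-mers that
    differ only at the centre fall under the same key.
    """
    if k % 2 == 0:
        raise ValueError("k must be odd so each k-mer has one central base")
    mid = k // 2
    contexts = defaultdict(lambda: defaultdict(set))
    for name, seq in genomes.items():
        for kmer in kmers(seq.upper(), k):
            key = kmer[:mid] + "." + kmer[mid + 1:]
            contexts[key][kmer[mid]].add(name)
    # Keep only contexts observed with more than one central base
    return {
        key: {base: sorted(names) for base, names in alleles.items()}
        for key, alleles in contexts.items()
        if len(alleles) > 1
    }


# Two hypothetical sequences that differ at a single position (C vs T)
genomes = {
    "strainA": "ACGTACGGTCA",
    "strainB": "ACGTATGGTCA",
}
snps = candidate_snps(genomes, k=5)
print(snps)
```

    Because the comparison runs on k-mer sets rather than on aligned positions, no reference genome or multiple sequence alignment is needed, which is the property the abstract emphasizes.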

  15. When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

    PubMed

    Gardner, Shea N; Hall, Barry G

    2013-01-01

    Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of O104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.

  16. Association of a novel polymorphism in the bovine PPARGC1A gene with growth, slaughter and meat quality traits in Brangus steers.

    PubMed

    Soria, L A; Corva, P M; Branda Sica, A; Villarreal, E L; Melucci, L M; Mezzadra, C A; Papaleo Mazzucco, J; Fernández Macedo, G; Silvestro, C; Schor, A; Miquel, M C

    2009-12-01

    The PPARGC1A gene (peroxisome proliferator-activated receptor-gamma coactivator 1alpha gene) controls muscle fiber type and brown adipocyte differentiation; therefore, it is a candidate gene for beef quality traits (tenderness and fat content). Two SNPs (Single Nucleotide Polymorphisms) were identified within exon 8 by multiple alignment of DNA sequences obtained from 24 bulls: a transition G/A (SNP 1181) and a transversion A/T (SNP 1299). The SNP 1181 is a novel SNP, corresponding to a non-conservative substitution (AGT/AAT) that could be the cause of an amino acid substitution ((364)Serine/(364)Asparagine). A Mismatch PCR method was designed to determine genotypes of 73 bulls and 268 steers for SNP 1181. Growth, slaughter and meat quality information was available for the group of steers. Allele A of SNP 1181 was not found in Angus. In 243 steers, no significant differences (P > 0.05) were found for final live body weight, gain in backfat thickness in Spring, kidney fat weight, kidney fat percentage, Warner-Bratzler shear force at 7 days postmortem, intramuscular fat percentage or meat colour between genotypes GG and AG. This SNP could be included in breed composition and population admixture analyses because there are marked differences in allelic frequencies between Bos taurus and Bos indicus breeds.
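
    The classification of the two SNPs above (a G/A transition and an A/T transversion) and the codon change AGT/AAT can be checked with a short sketch. It assumes only the standard purine/pyrimidine definition and the standard genetic code for the two codons named in the abstract; the function names are hypothetical, and the codon table is deliberately limited to those two entries.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def substitution_type(ref, alt):
    """Classify a single-base change as a transition or a transversion."""
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or not {ref, alt} <= PURINES | PYRIMIDINES:
        raise ValueError("expected two different bases from A, C, G, T")
    # Transition: purine<->purine or pyrimidine<->pyrimidine
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


# Only the two codons mentioned in the abstract (standard genetic code)
CODON_TO_AA = {"AGT": "Ser", "AAT": "Asn"}


def codon_change(ref_codon, alt_codon):
    """Return (reference residue, alternative residue, effect label)."""
    ref_aa = CODON_TO_AA[ref_codon]
    alt_aa = CODON_TO_AA[alt_codon]
    effect = "synonymous" if ref_aa == alt_aa else "non-synonymous"
    return ref_aa, alt_aa, effect


snp_1181 = substitution_type("G", "A")
snp_1299 = substitution_type("A", "T")
effect_1181 = codon_change("AGT", "AAT")
print(snp_1181, snp_1299, effect_1181)
```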

  17. Neuregulin-1 genotype is associated with structural differences in the normal human brain.

    PubMed

    Barnes, Anna; Isohanni, Matti; Barnett, Jennifer H; Pietiläinen, Olli; Veijola, Juha; Miettunen, Jouko; Paunio, Tiina; Tanskanen, Päivikki; Ridler, Khanum; Suckling, John; Bullmore, Edward T; Jones, Peter B; Murray, Graham K

    2012-02-01

    The human neuregulin-1 (NRG-1) gene is highly expressed in the brain, is implicated in numerous functions associated with neuronal development, and is a leading candidate gene for schizophrenia. The T allele of SNP8NRG243177, part of a risk haplotype for schizophrenia, has been previously associated with decreases in white matter in the right anterior internal capsule and the left anterior thalamic radiation. To our knowledge no studies have described the effects of SNP8NRG243177 on grey matter volume at a voxelwise level. We assessed associations between this SNP and brain structure in 79 general population volunteers from the Northern Finland 1966 Birth Cohort (NFBC 1966). We show, for the first time, that genetic variation in SNP8NRG243177 is associated with variation in frontal brain structure in both grey and white matter. T allele carriers showed decreased grey matter volume in several frontal gyri, including inferior, middle and superior frontal gyri and the anterior cingulate gyrus, as well as decreased white matter volume in the regions of the genu and body of the corpus callosum, anterior and superior corona radiata, anterior limb of the internal capsule and external capsule regions traversed by major white matter tracts of the anterior thalamic radiation, and the inferior fronto-occipital fasciculus. These results suggest that this genetic variant may mediate risk for schizophrenia, in part, through its effect on brain structure in these regions. Copyright © 2011 Elsevier Inc. All rights reserved.

  18. DNA sequence polymorphisms within the bovine guanine nucleotide-binding protein Gs subunit alpha (Gsα)-encoding (GNAS) genomic imprinting domain are associated with performance traits.

    PubMed

    Sikora, Klaudia M; Magee, David A; Berkowicz, Erik W; Berry, Donagh P; Howard, Dawn J; Mullen, Michael P; Evans, Ross D; Machugh, David E; Spillane, Charles

    2011-01-07

    Genes which are epigenetically regulated via genomic imprinting can be potential targets for artificial selection during animal breeding. Indeed, imprinted loci have been shown to underlie some important quantitative traits in domestic mammals, most notably muscle mass and fat deposition. In this candidate gene study, we have identified novel associations between six validated single nucleotide polymorphisms (SNPs) spanning a 97.6 kb region within the bovine guanine nucleotide-binding protein Gs subunit alpha gene (GNAS) domain on bovine chromosome 13 and genetic merit for a range of performance traits in 848 progeny-tested Holstein-Friesian sires. The mammalian GNAS domain consists of a number of reciprocally-imprinted, alternatively-spliced genes which can play a major role in growth, development and disease in mice and humans. Based on the current annotation of the bovine GNAS domain, four of the SNPs analysed (rs43101491, rs43101493, rs43101485 and rs43101486) were located upstream of the GNAS gene, while one SNP (rs41694646) was located in the second intron of the GNAS gene. The final SNP (rs41694656) was located in the first exon of transcripts encoding the putative bovine neuroendocrine-specific protein NESP55, resulting in an aspartic acid-to-asparagine amino acid substitution at amino acid position 192. SNP genotype-phenotype association analyses indicate that the single intronic GNAS SNP (rs41694646) is associated (P ≤ 0.05) with a range of performance traits including milk yield, milk protein yield, the content of fat and protein in milk, culled cow carcass weight and progeny carcass conformation, measures of animal body size, direct calving difficulty (i.e. difficulty in calving due to the size of the calf) and gestation length. Association (P ≤ 0.01) with direct calving difficulty (i.e. due to calf size) and maternal calving difficulty (i.e. due to the maternal pelvic width size) was also observed at the rs43101491 SNP. 
Following adjustment for multiple-testing, significant association (q ≤ 0.05) remained between the rs41694646 SNP and four traits (animal stature, body depth, direct calving difficulty and milk yield) only. Notably, the single SNP in the bovine NESP55 gene (rs41694656) was associated (P ≤ 0.01) with somatic cell count--an often-cited indicator of resistance to mastitis and overall health status of the mammary system--and previous studies have demonstrated that the chromosomal region to where the GNAS domain maps underlies an important quantitative trait locus for this trait. This association, however, was not significant after adjustment for multiple testing. The three remaining SNPs assayed were not associated with any of the performance traits analysed in this study. Analysis of all pairwise linkage disequilibrium (r2) values suggests that most allele substitution effects for the assayed SNPs observed are independent. Finally, the polymorphic coding SNP in the putative bovine NESP55 gene was used to test the imprinting status of this gene across a range of foetal bovine tissues. Previous studies in other mammalian species have shown that DNA sequence variation within the imprinted GNAS gene cluster contributes to several physiological and metabolic disorders, including obesity in humans and mice. Similarly, the results presented here indicate an important role for the imprinted GNAS cluster in underlying complex performance traits in cattle such as animal growth, calving, fertility and health. These findings suggest that GNAS domain-associated polymorphisms may serve as important genetic markers for future livestock breeding programs and support previous studies that candidate imprinted loci may act as molecular targets for the genetic improvement of agricultural populations. 
In addition, we present new evidence that the bovine NESP55 gene is epigenetically regulated as a maternally expressed imprinted gene in placental and intestinal tissues from 8-10 week old bovine foetuses.
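
    The abstract reports q-values after adjustment for multiple testing but does not name the procedure used. As a hedged illustration of one common choice, the sketch below computes Benjamini-Hochberg adjusted p-values; treating these as the study's q-values is an assumption, and the function name and the four example p-values are hypothetical.

```python
def bh_qvalues(pvalues):
    """Benjamini-Hochberg adjusted p-values, returned in the input order."""
    m = len(pvalues)
    # Indices of the p-values from smallest to largest
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so the adjusted values stay monotone
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical nominal p-values for four SNP-trait tests
nominal = [0.01, 0.04, 0.03, 0.005]
qvals = bh_qvalues(nominal)
significant = [q <= 0.05 for q in qvals]
print(qvals, significant)
```

    With many SNP-trait pairs tested, an association that is nominally significant (for example P ≤ 0.01) can fail such a threshold, which is the pattern reported for rs41694656 and somatic cell count.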

  19. DNA sequence polymorphisms within the bovine guanine nucleotide-binding protein Gs subunit alpha (Gsα)-encoding (GNAS) genomic imprinting domain are associated with performance traits

    PubMed Central

    2011-01-01

    Background Genes which are epigenetically regulated via genomic imprinting can be potential targets for artificial selection during animal breeding. Indeed, imprinted loci have been shown to underlie some important quantitative traits in domestic mammals, most notably muscle mass and fat deposition. In this candidate gene study, we have identified novel associations between six validated single nucleotide polymorphisms (SNPs) spanning a 97.6 kb region within the bovine guanine nucleotide-binding protein Gs subunit alpha gene (GNAS) domain on bovine chromosome 13 and genetic merit for a range of performance traits in 848 progeny-tested Holstein-Friesian sires. The mammalian GNAS domain consists of a number of reciprocally-imprinted, alternatively-spliced genes which can play a major role in growth, development and disease in mice and humans. Based on the current annotation of the bovine GNAS domain, four of the SNPs analysed (rs43101491, rs43101493, rs43101485 and rs43101486) were located upstream of the GNAS gene, while one SNP (rs41694646) was located in the second intron of the GNAS gene. The final SNP (rs41694656) was located in the first exon of transcripts encoding the putative bovine neuroendocrine-specific protein NESP55, resulting in an aspartic acid-to-asparagine amino acid substitution at amino acid position 192. Results SNP genotype-phenotype association analyses indicate that the single intronic GNAS SNP (rs41694646) is associated (P ≤ 0.05) with a range of performance traits including milk yield, milk protein yield, the content of fat and protein in milk, culled cow carcass weight and progeny carcass conformation, measures of animal body size, direct calving difficulty (i.e. difficulty in calving due to the size of the calf) and gestation length. Association (P ≤ 0.01) with direct calving difficulty (i.e. due to calf size) and maternal calving difficulty (i.e. due to the maternal pelvic width size) was also observed at the rs43101491 SNP. 
Following adjustment for multiple-testing, significant association (q ≤ 0.05) remained between the rs41694646 SNP and four traits (animal stature, body depth, direct calving difficulty and milk yield) only. Notably, the single SNP in the bovine NESP55 gene (rs41694656) was associated (P ≤ 0.01) with somatic cell count--an often-cited indicator of resistance to mastitis and overall health status of the mammary system--and previous studies have demonstrated that the chromosomal region to where the GNAS domain maps underlies an important quantitative trait locus for this trait. This association, however, was not significant after adjustment for multiple testing. The three remaining SNPs assayed were not associated with any of the performance traits analysed in this study. Analysis of all pairwise linkage disequilibrium (r2) values suggests that most allele substitution effects for the assayed SNPs observed are independent. Finally, the polymorphic coding SNP in the putative bovine NESP55 gene was used to test the imprinting status of this gene across a range of foetal bovine tissues. Conclusions Previous studies in other mammalian species have shown that DNA sequence variation within the imprinted GNAS gene cluster contributes to several physiological and metabolic disorders, including obesity in humans and mice. Similarly, the results presented here indicate an important role for the imprinted GNAS cluster in underlying complex performance traits in cattle such as animal growth, calving, fertility and health. These findings suggest that GNAS domain-associated polymorphisms may serve as important genetic markers for future livestock breeding programs and support previous studies that candidate imprinted loci may act as molecular targets for the genetic improvement of agricultural populations. 
In addition, we present new evidence that the bovine NESP55 gene is epigenetically regulated as a maternally expressed imprinted gene in placental and intestinal tissues from 8-10 week old bovine foetuses. PMID:21214909

  20. The genomic architecture and association genetics of adaptive characters using a candidate SNP approach in boreal black spruce

    PubMed Central

    2013-01-01

    Background The genomic architecture of adaptive traits remains poorly understood in non-model plants. Various approaches can be used to bridge this gap, including the mapping of quantitative trait loci (QTL) in pedigrees, and genetic association studies in non-structured populations. Here we present results on the genomic architecture of adaptive traits in black spruce, which is a widely distributed conifer of the North American boreal forest. As an alternative to the usual candidate gene approach, a candidate SNP approach was developed for association testing. Results A genetic map containing 231 gene loci was used to identify QTL that were related to budset timing and to tree height assessed over multiple years and sites. Twenty-two unique genomic regions were identified, including 20 that were related to budset timing and 6 that were related to tree height. From results of outlier detection and bulk segregant analysis for adaptive traits using DNA pool sequencing of 434 genes, 52 candidate SNPs were identified and subsequently tested in genetic association studies for budset timing and tree height assessed over multiple years and sites. A total of 34 (65%) SNPs were significantly associated with budset timing, or tree height, or both. Although the percentages of explained variance (PVE) by individual SNPs were small, several significant SNPs were shared between sites and among years. Conclusions The sharing of genomic regions and significant SNPs between budset timing and tree height indicates pleiotropic effects. Significant QTLs and SNPs differed quite greatly among years, suggesting that different sets of genes for the same characters are involved at different stages in the tree’s life history. The functional diversity of genes carrying significant SNPs and low observed PVE further indicated that a large number of polymorphisms are involved in adaptive genetic variation. 
Accordingly, for undomesticated species such as black spruce with natural populations of large effective size and low linkage disequilibrium, efficient marker systems that are predictive of adaptation should require the survey of large numbers of SNPs. Candidate SNP approaches like the one developed in the present study could contribute to reducing these numbers. PMID:23724860

  1. RNA sequencing to study gene expression and single nucleotide polymorphism variation associated with citrate content in cow milk.

    PubMed

    Cánovas, A; Rincón, G; Islas-Trejo, A; Jimenez-Flores, R; Laubscher, A; Medrano, J F

    2013-04-01

    The technological properties of milk have significant importance for the dairy industry. Citrate, a normal constituent of milk, forms one of the main buffer systems that regulate the equilibrium between Ca(2+) and H(+) ions. Higher-than-normal citrate content is associated with poor coagulation properties of milk. To identify the genes responsible for the variation of citrate content in milk in dairy cattle, the metabolic steps involved in citrate and fatty acid synthesis pathways in ruminant mammary tissue using RNA sequencing were studied. Genetic markers that could influence milk citrate content in Holstein cows were used in a marker-trait association study to establish the relationship between 74 single nucleotide polymorphisms (SNP) in 20 candidate genes and citrate content in 250 Holstein cows. This analysis revealed 6 SNP in key metabolic pathway genes [isocitrate dehydrogenase 1 (NADP+), soluble (IDH1); pyruvate dehydrogenase (lipoamide) β (PDHB); pyruvate kinase (PKM2); and solute carrier family 25 (mitochondrial carrier; citrate transporter), member 1 (SLC25A1)] significantly associated with increased milk citrate content. The amount of the phenotypic variation explained by the 6 SNP ranged from 10.1 to 13.7%. Also, genotype-combination analysis revealed the highest phenotypic variation was explained combining IDH1_23211, PDHB_5562, and SLC25A1_4446 genotypes. This specific genotype combination explained 21.3% of the phenotypic variation. The largest citrate associated effect was in the 3' untranslated region of the SLC25A1 gene, which is responsible for the transport of citrate across the mitochondrial inner membrane. This study provides an approach using RNA sequencing, metabolic pathway analysis, and association studies to identify genetic variation in functional target genes determining complex trait phenotypes. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  2. COMPREHENSIVE ANALYSES OF DNA REPAIR PATHWAYS, SMOKING, AND BLADDER CANCER RISK IN LOS ANGELES AND SHANGHAI

    PubMed Central

    Corral, Roman; Lewinger, Juan Pablo; Berg, David Van Den; Joshi, Amit D.; Yuan, Jian-Min; Gago-Dominguez, Manuela; Cortessis, Victoria K.; Pike, Malcolm C.; Conti, David V.; Thomas, Duncan C.; Edlund, Christopher K.; Gao, Yu-Tang; Xiang, Yong-Bing; Zhang, Wei; Su, Yu-Chen; Stern, Mariana C.

    2014-01-01

    Tobacco smoking is a bladder cancer risk factor and a source of carcinogens that induce DNA damage to urothelial cells. Using data and samples from 988 cases and 1,004 controls enrolled in the Los Angeles County Bladder Cancer Study and the Shanghai Bladder Cancer Study we investigated associations between bladder cancer risk and 632 tagSNPs that comprehensively capture genetic variation in 28 DNA repair genes from four DNA repair pathways: base excision repair (BER), nucleotide excision repair (NER), non-homologous end-joining (NHEJ), and homologous recombination repair (HRR). Odds ratios (ORs) and 95% confidence intervals (CIs) for each tagSNP were corrected for multiple testing for all SNPs within each gene using pACT, and for genes within each pathway and across pathways with Bonferroni. Gene and pathway summary estimates were obtained using ARTP. We observed an association between bladder cancer and POLB rs7832529 (BER) (pACT = 0.003; ppathway = 0.021) among all, and SNPs in XPC (NER) and OGG1 (BER) among Chinese men and women, respectively. The NER pathway showed an overall association with risk among Chinese males (ARTP NER p = 0.034). The XRCC6 SNP rs2284082 (NHEJ), also in LD with SREBF2, showed an interaction with smoking (Smoking status interaction pgene = 0.001, ppathway = 0.008, poverall = 0.034). Our findings support a role in bladder carcinogenesis for regions that map close to or within BER (POLB, OGG1) and NER genes (XPC). A SNP that tags both the XRCC6 and SREBF2 genes strongly modifies the association between bladder cancer risk and smoking. PMID:24382701

  3. Are genetic variations in OXTR, AVPR1A, and CD38 genes important to social integration? Results from two large U.S. cohorts.

    PubMed

    Chang, Shun-Chiao; Glymour, M Maria; Rewak, Marissa; Cornelis, Marilyn C; Walter, Stefan; Koenen, Karestan C; Kawachi, Ichiro; Liang, Liming; Tchetgen Tchetgen, Eric J; Kubzansky, Laura D

    2014-01-01

    Some evidence suggests that genetic polymorphisms in oxytocin pathway genes influence various social behaviors, but findings thus far have been mixed. Many studies have been based in small samples and there is possibility of publication bias. Using data from 2 large U.S. prospective cohorts with over 11,000 individuals, we investigated 88 SNPs in OXTR, AVPR1A, and CD38, in relation to social integration (measured as social connectedness in both binary and continuous forms and being continuously married). After correction for multiple testing only one SNP in CD38 (rs12644506) was significantly associated with social integration and that SNP predicted when using a dichotomized indicator of social connectedness (adjusted p=0.02), but not a continuous measure of social connectedness or the continuously married outcome. A significant gender-heterogeneous effect was identified in one OXTR SNP on dichotomized social connectedness; specifically, rs4686302 T allele was nominally associated with social connectedness in men, whereas the association direction was opposite in women (adjusted gender heterogeneity p=0.02). Furthermore, the rs53576 A allele was significantly associated with social connectedness only in women, and the effect magnitude was stronger in a dominant genetic model (adjusted p=0.003). In summary, our findings suggested that common genetic variants of OXTR, CD38, and AVPR1A are not associated with social integration as measured in this study using the simplified Berkman-Syme Social Network Index, but these findings and other work hint that effects may be modified by gender or other social experiences. Further work considering genetic pathways in relation to social integration may be more fruitful if these additional factors can be more comprehensively evaluated. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Are genetic variations in OXTR, AVPR1A, and CD38 genes important to social integration? Results from two large U.S. cohorts

    PubMed Central

    Chang, Shun-Chiao; Glymour, M Maria; Rewak, Marissa; Cornelis, Marilyn; Walter, Stefan; Koenen, Karestan C; Kawachi, Ichiro; Liang, Liming; Tchetgen, Eric Tchetgen; Kubzansky, Laura D.

    2013-01-01

    Some evidence suggests that genetic polymorphisms in oxytocin pathway genes influence various social behaviors, but findings thus far have been mixed. Many studies have been based in small samples and there is possibility of publication bias. Using data from 2 large U.S. prospective cohorts with over 11,000 individuals, we investigated 88 SNPs in OXTR, AVPR1A, and CD38, in relation to social integration (measured as social connectedness in both binary and continuous forms and being continuously married). After correction for multiple testing only one SNP in CD38 (rs12644506) was significantly associated with social integration and that SNP predicted when using a dichotomized indicator of social connectedness (adjusted p=0.02), but not a continuous measure of social connectedness or the continuously married outcome. A significant gender-heterogeneous effect was identified in one OXTR SNP on dichotomized social connectedness; specifically, rs4686302 T allele was nominally associated with social connectedness in men, whereas the association direction was opposite in women (adjusted gender heterogeneity p=0.02). Furthermore, the rs53576 A allele was significantly associated with social connectedness only in women, and the effect magnitude was stronger in a dominant genetic model (adjusted p=0.003). In summary, our findings suggested that common genetic variants of OXTR, CD38, and AVPR1A are not associated with social integration as measured in this study using the simplified Berkman-Syme Social Network Index, but these findings and other work hint that effects may be modified by gender or other social experiences. Further work considering genetic pathways in relation to social integration may be more fruitful if these additional factors can be more comprehensively evaluated. PMID:24209975

  5. A cautionary tale: the non-causal association between type 2 diabetes risk SNP, rs7756992, and levels of non-coding RNA, CDKAL1-v1.

    PubMed

    Locke, Jonathan M; Wei, Fan-Yan; Tomizawa, Kazuhito; Weedon, Michael N; Harries, Lorna W

    2015-04-01

    Intronic single nucleotide polymorphisms (SNPs) in the CDKAL1 gene are associated with risk of developing type 2 diabetes. A strong correlation between risk alleles and lower levels of the non-coding RNA, CDKAL1-v1, has recently been reported in whole blood extracted from Japanese individuals. We sought to replicate this association in two independent cohorts: one using whole blood from white UK-resident individuals, and one using a collection of human pancreatic islets, a more relevant tissue type to study with respect to the aetiology of diabetes. Levels of CDKAL1-v1 were measured by real-time PCR using RNA extracted from human whole blood (n = 70) and human pancreatic islets (n = 48). Expression with respect to genotype was then determined. In a simple linear regression model, expression of CDKAL1-v1 was associated with the lead type 2 diabetes-associated SNP, rs7756992, in whole blood and islets. However, these associations were abolished or substantially reduced in multiple regression models taking into account rs9366357 genotype: a moderately linked SNP explaining a much larger amount of the variation in CDKAL1-v1 levels, but not strongly associated with risk of type 2 diabetes. Contrary to previous findings, we provide evidence against a role for dysregulated expression of CDKAL1-v1 in mediating the association between intronic SNPs in CDKAL1 and susceptibility to type 2 diabetes. The results of this study illustrate how caution should be exercised when inferring causality from an association between disease-risk genotype and non-coding RNA expression.

  6. Genetic Polymorphisms in the Hypothalamic Pathway in Relation to Subsequent Weight Change – The DiOGenes Study

    PubMed Central

    Ängquist, Lars; Hansen, Rikke D.; van der A, Daphne L.; Holst, Claus; Tjønneland, Anne; Overvad, Kim; Jakobsen, Marianne Uhre; Boeing, Heiner; Meidtner, Karina; Palli, Domenico; Masala, Giovanna; Bouatia-Naji, Nabila; Saris, Wim H. M.; Feskens, Edith J. M.; Wareham, Nicolas J.; Sørensen, Thorkild I. A.; Loos, Ruth J. F.

    2011-01-01

    Background Single nucleotide polymorphisms (SNPs) in genes encoding the components involved in the hypothalamic pathway may influence weight gain and dietary factors may modify their effects. Aim We conducted a case-cohort study to investigate the associations of SNPs in candidate genes with weight change during an average of 6.8 years of follow-up and to examine the potential effect modification by glycemic index (GI) and protein intake. Methods and Findings Participants, aged 20–60 years at baseline, came from five European countries. Cases (‘weight gainers’) were selected from the total eligible cohort (n = 50,293) as those with the greatest unexplained annual weight gain (n = 5,584). A random subcohort (n = 6,566) was drawn with the intention to obtain an equal number of cases and noncases (n = 5,507). We genotyped 134 SNPs that captured all common genetic variation across the 15 candidate genes; 123 met the quality control criteria. Each SNP was tested for association with the risk of being a ‘weight gainer’ (logistic regression models) in the case-noncase data and with weight gain (linear regression models) in the random subcohort data. After accounting for multiple testing, none of the SNPs was significantly associated with weight change. Furthermore, we observed no significant effect modification by dietary factors, except for SNP rs7180849 in the neuromedin β gene (NMB). Carriers of the minor allele had a more pronounced weight gain at a higher GI (P = 2 × 10^-7). Conclusions We found no evidence of association between SNPs in the studied hypothalamic genes with weight change. The interaction between GI and NMB SNP rs7180849 needs further confirmation. PMID:21390334

  7. High-Density SNP Genotyping to Define β-Globin Locus Haplotypes

    PubMed Central

    Liu, Li; Muralidhar, Shalini; Singh, Manisha; Sylvan, Caprice; Kalra, Inderdeep S.; Quinn, Charles T.; Onyekwere, Onyinye C.; Pace, Betty S.

    2014-01-01

    Five major β-globin locus haplotypes have been established in individuals with sickle cell disease (SCD) from the Benin, Bantu, Senegal, Cameroon, and Arab-Indian populations. Historically, β-haplotypes were established using restriction fragment length polymorphism (RFLP) analysis across the β-locus, which consists of five functional β-like globin genes located on chromosome 11. Previous attempts to correlate these haplotypes as robust predictors of clinical phenotypes observed in SCD have not been successful. We speculate that the coverage and distribution of the RFLP sites located proximal to or within the globin genes are not sufficiently dense to accurately reflect the complexity of this region. To test our hypothesis, we performed RFLP analysis and high-density single nucleotide polymorphism (SNP) genotyping across the β-locus using DNA samples from either healthy African Americans with normal hemoglobin A (HbAA) or individuals with homozygous SS (HbSS) disease. Using the genotyping data from 88 SNPs and Haploview analysis, we generated a greater number of haplotypes than that observed with RFLP analysis alone. Furthermore, a unique pattern of long-range linkage disequilibrium between the locus control region and the β-like globin genes was observed in the HbSS group. Interestingly, we observed multiple SNPs within the HindIII restriction site located in the Gγ-globin intervening sequence II which produced the same RFLP pattern. These findings illustrated the inability of RFLP analysis to decipher the complexity of sequence variations that impacts genomic structure in this region. Our data suggest that high density SNP mapping may be required to accurately define β-haplotypes that correlate with the different clinical phenotypes observed in SCD. PMID:18829352

  8. Sample-to-SNP kit: a reliable, easy and fast tool for the detection of HFE p.H63D and p.C282Y variations associated to hereditary hemochromatosis.

    PubMed

    Nielsen, Peter B; Petersen, Maja S; Ystaas, Viviana; Andersen, Rolf V; Hansen, Karin M; Blaabjerg, Vibeke; Refstrup, Mette

    2012-10-01

    Classical hereditary hemochromatosis involves the HFE-gene and diagnostic analysis of the DNA variants HFE p.C282Y (c.845G>A; rs1800562) and HFE p.H63D (c.187C>G; rs1799945). The affected protein alters the iron homeostasis resulting in iron overload in various tissues. The aim of this study was to validate the TaqMan-based Sample-to-SNP protocol for the analysis of the HFE-p.C282Y and p.H63D variants with regard to accuracy, usefulness and reproducibility compared to an existing SNP protocol. The Sample-to-SNP protocol uses an approach where the DNA template is made accessible from a cell lysate followed by TaqMan analysis. Besides the HFE-SNPs, eight other SNPs were used as well. These SNPs were: Coagulation factor II-gene F2 c.20210G>A, Coagulation factor V-gene F5 p.R506Q (c.1517G>A; rs121917732), Mitochondria SNP: mt7028 G>A, Mitochondria SNP: mt12308 A>G, Proprotein convertase subtilisin/kexin type 9-gene PCSK9 p.R46L (c.137G>T), Glutathione S-transferase pi 1-gene GSTP1 p.I105V (c.313A>G; rs1695), LXR g.-171 A>G, ZNF202 g.-118 G>T. In conclusion, the Sample-to-SNP kit proved to be an accurate, reliable, robust, easy to use and rapid TaqMan-based SNP detection protocol, which could be quickly implemented in a routine diagnostic or research facility. Copyright © 2012. Published by Elsevier B.V.

  9. ALOX12 polymorphisms are associated with fat mass but not peak bone mineral density in Chinese nuclear families.

    PubMed

    Xiao, W-J; He, J-W; Zhang, H; Hu, W-W; Gu, J-M; Yue, H; Gao, G; Yu, J-B; Wang, C; Ke, Y-H; Fu, W-Z; Zhang, Z-L

    2011-03-01

    Arachidonate 12-lipoxygenase (ALOX12) is a member of the lipoxygenase superfamily, which catalyzes the incorporation of molecular oxygen into polyunsaturated fatty acids. The products of ALOX12 reactions serve as endogenous ligands for peroxisome proliferator-activated receptor γ (PPARG). The activation of the PPARG pathway in marrow-derived mesenchymal progenitors stimulates adipogenesis and inhibits osteoblastogenesis. Our objective was to determine whether polymorphisms in the ALOX12 gene were associated with variations in peak bone mineral density (BMD) and obesity phenotypes in young Chinese men. All six tagging single-nucleotide polymorphisms (SNPs) in the ALOX12 gene were genotyped in a total of 1215 subjects from 400 Chinese nuclear families by allele-specific polymerase chain reaction. The BMD at the lumbar spine and hip, total fat mass (TFM) and total lean mass (TLM) were measured using dual-energy X-ray absorptiometry. The pairwise linkage disequilibrium among SNPs was measured, and the haplotype blocks were inferred. Both the individual SNP markers and the haplotypes were tested for an association with the peak BMD, body mass index, TFM, TLM and percentage fat mass (PFM) using the quantitative transmission disequilibrium test (QTDT). Using the QTDT, significant within-family association was found between the rs2073438 polymorphism in the ALOX12 gene and the TFM and PFM (P=0.007 and 0.012, respectively). Haplotype analyses were combined with our individual SNP results and remained significant even after correction for multiple testing. However, we failed to find significant within-family associations between ALOX12 SNPs and the BMD at any bone site in young Chinese men. Our present results suggest that the rs2073438 polymorphism of ALOX12 contributes to the variation of obesity phenotypes in young Chinese men, although we failed to replicate the association with the peak BMD variation in this sample. Further independent studies are needed to confirm our findings.

  10. Association analysis of the vitamin D receptor gene, the type I collagen gene COL1A1, and the estrogen receptor gene in idiopathic osteoarthritis.

    PubMed

    Loughlin, J; Sinsheimer, J S; Mustafa, Z; Carr, A J; Clipsham, K; Bloomfield, V A; Chitnavis, J; Bailey, A; Sykes, B; Chapman, K

    2000-03-01

    Evidence has accumulated supporting a role for genes in the etiology of osteoarthritis (OA). Several candidates have been targeted as potential susceptibility loci including genes that are involved in the regulation of bone density. Genetic association analysis has suggested a role for the vitamin D receptor gene (VDR) and the estrogen receptor gene (ER) in susceptibility. Such findings must be tested in additional independent cohorts. We tested for association of these 2 genes, plus a third gene implicated in bone density, COL1A1, with idiopathic OA. A case-control cohort of 371 affected probands and 369 unaffected spouses was used. Association was tested using 4 intragenic single nucleotide polymorphisms (SNP), one each for the VDR and COL1A1 genes, and 2 for the ER gene. The VDR and ER SNP are the same SNP that have been associated with OA. All 4 SNP affect restriction enzyme sites and were genotyped using polymerase chain reaction and enzyme digestion. Allele and genotype distributions for each SNP were compared between cases and controls and analyzed using Fisher's exact test. There was no evidence of association of the VDR or the ER gene SNP to OA. There was weak evidence of association of the COL1A1 SNP in female cases (p = 0.017), reflected by a difference in the distribution of genotypes at this SNP between female cases and controls (p = 0.027). However, when corrected for multiple testing, these results were not significant. If the VDR, ER, or COL1A1 genes do encode predisposition to OA then the 4 SNP tested are not associated with major susceptibility alleles at these 3 loci.
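
    The multiple-testing step in the abstract above can be illustrated with a minimal sketch. The abstract does not say which correction was applied or how many tests were counted, so the Bonferroni method and the count of 4 tests (one per genotyped SNP) are assumptions made purely for illustration; the function name is hypothetical.

```python
def bonferroni_adjust(p_values, n_tests=None):
    """Return Bonferroni-adjusted p-values, capped at 1.0."""
    m = len(p_values) if n_tests is None else n_tests
    return [min(1.0, p * m) for p in p_values]

# Nominal p-values reported for the COL1A1 SNP in female cases.
nominal = [0.017, 0.027]

# Assumption: 4 tests, one per genotyped SNP (not stated in the abstract).
adjusted = bonferroni_adjust(nominal, n_tests=4)

# Under this assumption, neither adjusted value stays below 0.05.
significant = [p < 0.05 for p in adjusted]
```

    Under these assumed settings both nominal results lose significance, which is consistent with the abstract's statement; a different test count or a less conservative method could shift the adjusted values.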

  11. Genetic dissection of the pre-eclampsia susceptibility locus on chromosome 2q22 reveals shared novel risk factors for cardiovascular disease

    PubMed Central

    Johnson, Matthew P.; Brennecke, Shaun P.; East, Christine E.; Dyer, Thomas D.; Roten, Linda T.; Proffitt, J. Michael; Melton, Phillip E.; Fenstad, Mona H.; Aalto-Viljakainen, Tia; Mäkikallio, Kaarin; Heinonen, Seppo; Kajantie, Eero; Kere, Juha; Laivuori, Hannele; Austgulen, Rigmor; Blangero, John; Moses, Eric K.; Pouta, Anneli; Kivinen, Katja; Ekholm, Eeva; Hietala, Reija; Sainio, Susanna; Saisto, Terhi; Uotila, Jukka; Klemetti, Miira; Inkeri Lokki, Anna; Georgiadis, Leena; Huovari, Elina; Kortelainen, Eija; Leminen, Satu; Lähdesmäki, Aija; Mehtälä, Susanna; Salmen, Christina

    2013-01-01

    Pre-eclampsia is an idiopathic pregnancy disorder promoting morbidity and mortality to both mother and child. Delivery of the fetus is the only means to resolve severe symptoms. Women with pre-eclamptic pregnancies demonstrate increased risk for later life cardiovascular disease (CVD) and good evidence suggests these two syndromes share several risk factors and pathophysiological mechanisms. To elucidate the genetic architecture of pre-eclampsia we have dissected our chromosome 2q22 susceptibility locus in an extended Australian and New Zealand familial cohort. Positional candidate genes were prioritized for exon-centric sequencing using bioinformatics, SNPing, transcriptional profiling and QTL-walking. In total, we interrogated 1598 variants from 52 genes. Four independent SNP associations satisfied our gene-centric multiple testing correction criteria: a missense LCT SNP (rs2322659, P = 0.0027), a synonymous LRP1B SNP (rs35821928, P = 0.0001), an UTR-3 RND3 SNP (rs115015150, P = 0.0024) and a missense GCA SNP (rs17783344, P = 0.0020). We replicated the LCT SNP association (P = 0.02) and observed a borderline association for the GCA SNP (P = 0.07) in an independent Australian case–control population. The LRP1B and RND3 SNP associations were not replicated in this same Australian singleton cohort. Moreover, these four SNP associations could not be replicated in two additional case–control populations from Norway and Finland. These four SNPs, however, exhibit pleiotropic effects with several quantitative CVD-related traits. Our results underscore the genetic complexity of pre-eclampsia and present novel empirical evidence of possible shared genetic mechanisms underlying both pre-eclampsia and other CVD-related risk factors. PMID:23420841

  12. A novel approach to exploring potential interactions among single-nucleotide polymorphisms of inflammation genes in gliomagenesis: an exploratory case-only study.

    PubMed

    Amirian, E Susan; Scheurer, Michael E; Liu, Yanhong; D'Amelio, Anthony M; Houlston, Richard S; Etzel, Carol J; Shete, Sanjay; Swerdlow, Anthony J; Schoemaker, Minouk J; McKinney, Patricia A; Fleming, Sarah J; Muir, Kenneth R; Lophatananon, Artitaya; Bondy, Melissa L

    2011-08-01

    Despite extensive research on the topic, glioma etiology remains largely unknown. Exploration of potential interactions between single-nucleotide polymorphisms (SNP) of immune genes is a promising new area of glioma research. The case-only study design is a powerful and efficient design for exploring possible multiplicative interactions between factors that are independent of one another. The purpose of our study was to use this exploratory design to identify potential pairwise SNP-SNP interactions from genes involved in several different immune-related pathways for investigation in future studies. The study population consisted of two case groups: 1,224 histologically confirmed, non-Hispanic white glioma cases from the United States and a validation population of 634 glioma cases from the United Kingdom. Polytomous logistic regression, in which one SNP was coded as the outcome and the other SNP was included as the exposure, was utilized to calculate the ORs of the likelihood of cases simultaneously having the variant alleles of two different SNPs. Potential interactions were examined only between SNPs located in different genes or chromosomes. Using this data mining strategy, we found 396 significant SNP-SNP interactions among polymorphisms of immune-related genes that were present in both the U.S. and U.K. study populations. This exploratory study was conducted for the purpose of hypothesis generation, and thus has provided several new hypotheses that can be tested using traditional case-control study designs to obtain estimates of risk. This is the first study, to our knowledge, to take this novel approach to identifying SNP-SNP interactions relevant to glioma etiology. ©2011 AACR.
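
    The case-only logic described above can be sketched in simplified form. The study used polytomous logistic regression on genotypes; the sketch below instead collapses each SNP to carrier versus non-carrier and computes the odds ratio from a 2x2 table of cases, with a Woolf (log-based) confidence interval. The counts and the function name are hypothetical and are not taken from the study.

```python
import math

def case_only_interaction_or(both, snp1_only, snp2_only, neither, z=1.96):
    """Odds ratio between two variant-carrier indicators among cases only.

    If the two SNPs are independent in the source population, this OR
    estimates the multiplicative interaction between them.
    Returns (odds_ratio, ci_low, ci_high) using Woolf's method.
    """
    odds_ratio = (both * neither) / (snp1_only * snp2_only)
    se_log = math.sqrt(1 / both + 1 / snp1_only + 1 / snp2_only + 1 / neither)
    log_or = math.log(odds_ratio)
    return (odds_ratio,
            math.exp(log_or - z * se_log),
            math.exp(log_or + z * se_log))

# Hypothetical counts of cases by carrier status at two SNPs.
or_est, ci_low, ci_high = case_only_interaction_or(
    both=40, snp1_only=60, snp2_only=60, neither=140
)
```

    The independence assumption is the reason the abstract restricts attention to SNPs in different genes or chromosomes; linked SNPs would produce a non-null odds ratio among cases even without any interaction.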

  13. Genetic and clinical risk factors of root resorption associated with orthodontic treatment.

    PubMed

    Guo, Yujiao; He, Shushu; Gu, Tian; Liu, Yi; Chen, Song

    2016-08-01

    External apical root resorption (EARR) is a common complication in orthodontic treatment. Despite many studies on EARR, great controversies remain with regard to its risk factors. The objective of this study was to explore the relationship among sex, root movement, IL-1RN single nucleotide polymorphism (SNP) rs419598, IL-6 SNP rs1800796, and EARR associated with orthodontic treatment. Altogether 174 patients (with 174 maxillary left central incisors) were selected for this study. Cone-beam computed tomography was performed before the start of the treatment and at the end of the treatment. Cone-beam computed tomography data were used to reconstruct a 3-dimensional image of each tooth; the volume and the root resorption volume of each tooth were calculated. Three-dimensional matching was used to measure the amount of movement of each root. Genomic DNA was extracted from buccal swabs, and genotypes of SNP rs419598 and SNP rs1800796 of each subject were determined using TaqMan polymerase chain reaction genotyping (Applied Biosystems, Foster City, Calif). The data were analyzed with multiple linear regression analysis. The statistical analysis indicated no relationship between sex, tooth movement amount, and IL-1RN SNP rs419598 with EARR. The IL-6 SNP rs1800796 GC was associated with EARR, and root resorption differed significantly between SNP rs1800796 GC and CC. IL-6 SNP rs1800796 GC is a risk factor for EARR. The amount of root movement, IL-1RN SNP rs419598, and sex as risk factors for EARR need further study. Copyright © 2016 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.

  14. Meiotic gene-conversion rate and tract length variation in the human genome.

    PubMed

    Padhukasahasram, Badri; Rannala, Bruce

    2013-02-27

    Meiotic recombination occurs in the form of two different mechanisms called crossing-over and gene-conversion and both processes have an important role in shaping genetic variation in populations. Although variation in crossing-over rates has been studied extensively using sperm-typing experiments, pedigree studies and population genetic approaches, our knowledge of variation in gene-conversion parameters (ie, rates and mean tract lengths) remains far from complete. To explore variability in population gene-conversion rates and its relationship to crossing-over rate variation patterns, we have developed and validated using coalescent simulations a comprehensive Bayesian full-likelihood method that can jointly infer crossing-over and gene-conversion rates as well as tract lengths from population genomic data under general variable rate models with recombination hotspots. Here, we apply this new method to SNP data from multiple human populations and attempt to characterize for the first time the fine-scale variation in gene-conversion parameters along the human genome. We find that the estimated ratio of gene-conversion to crossing-over rates varies considerably across genomic regions as well as between populations. However, there is a great degree of uncertainty associated with such estimates. We also find substantial evidence for variation in the mean conversion tract length. The estimated tract lengths did not show any negative relationship with the local heterozygosity levels in our analysis. European Journal of Human Genetics advance online publication, 27 February 2013; doi:10.1038/ejhg.2013.30.

  15. Assumption-free estimation of the genetic contribution to refractive error across childhood.

    PubMed

    Guggenheim, Jeremy A; St Pourcain, Beate; McMahon, George; Timpson, Nicholas J; Evans, David M; Williams, Cathy

    2015-01-01

    Studies in relatives have generally yielded high heritability estimates for refractive error: twins 75-90%, families 15-70%. However, because related individuals often share a common environment, these estimates are inflated (via misallocation of unique/common environment variance). We calculated a lower-bound heritability estimate for refractive error free from such bias. Between the ages 7 and 15 years, participants in the Avon Longitudinal Study of Parents and Children (ALSPAC) underwent non-cycloplegic autorefraction at regular research clinics. At each age, an estimate of the variance in refractive error explained by single nucleotide polymorphism (SNP) genetic variants was calculated using genome-wide complex trait analysis (GCTA) using high-density genome-wide SNP genotype information (minimum N at each age=3,404). The variance in refractive error explained by the SNPs ("SNP heritability") was stable over childhood: Across age 7-15 years, SNP heritability averaged 0.28 (SE=0.08, p<0.001). The genetic correlation for refractive error between visits varied from 0.77 to 1.00 (all p<0.001) demonstrating that a common set of SNPs was responsible for the genetic contribution to refractive error across this period of childhood. Simulations suggested lack of cycloplegia during autorefraction led to a small underestimation of SNP heritability (adjusted SNP heritability=0.35; SE=0.09). To put these results in context, the variance in refractive error explained (or predicted) by the time participants spent outdoors was <0.005 and by the time spent reading was <0.01, based on a parental questionnaire completed when the child was aged 8-9 years old. Genetic variation captured by common SNPs explained approximately 35% of the variation in refractive error between unrelated subjects. This value sets an upper limit for predicting refractive error using existing SNP genotyping arrays, although higher-density genotyping in larger samples and inclusion of interaction effects is expected to raise this figure toward twin- and family-based heritability estimates. The same SNPs influenced refractive error across much of childhood. Notwithstanding the strong evidence of association between time outdoors and myopia, and time reading and myopia, less than 1% of the variance in myopia at age 15 was explained by crude measures of these two risk factors, indicating that their effects may be limited, at least when averaged over the whole population.

  16. Genome-wide association studies for multiple diseases of the German Shepherd Dog

    PubMed Central

    Tsai, Kate L.; Noorai, Rooksana E.; Starr-Moss, Alison N.; Quignon, Pascale; Rinz, Caitlin J.; Ostrander, Elaine A.; Steiner, Jörg M.; Murphy, Keith E.

    2012-01-01

    The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877

  17. No association between SNP rs498055 on chromosome 10 and late-onset Alzheimer disease in multiple datasets.

    PubMed

    Liang, Xueying; Schnetz-Boutaud, Nathalie; Bartlett, Jackie; Allen, Melissa J; Gwirtsman, Harry; Schmechel, Don E; Carney, Regina M; Gilbert, John R; Pericak-Vance, Margaret A; Haines, Jonathan L

    2008-01-01

    SNP rs498055 in the predicted gene LOC439999 on chromosome 10 was recently identified as being strongly associated with late-onset Alzheimer disease (LOAD). This SNP falls within a chromosomal region that has engendered continued interest generated from both preliminary genetic linkage and candidate gene studies. To independently evaluate this interesting candidate SNP we examined four independent datasets, three family-based and one case-control. All the cases were late-onset AD Caucasian patients with minimum age at onset ≥ 60 years. None of the three family samples or the combined family-based dataset showed association in either allelic or genotypic family-based association tests at p < 0.05. Both original and OSA two-point LOD scores were calculated. However, there was no evidence indicating linkage no matter what covariates were applied (the highest LOD score was 0.82). The case-control dataset did not demonstrate any association between this SNP and AD (all p-values > 0.52). Our results do not confirm the previous association, but are consistent with a more recent negative association result that used family-based association tests to examine the effect of this SNP in two family datasets. Thus we conclude that rs498055 is not associated with an increased risk of LOAD.

  18. Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan

    PubMed Central

    Ting, Jason C; Ye, Ying; Thomas, George H; Ruczinski, Ingo; Pevsner, Jonathan

    2006-01-01

    Background A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. Results We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Étude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by fluorescence in situ hybridization, and a region of homozygosity in a UPD case was confirmed by sequencing of genomic DNA. Conclusion SNPscan is useful to identify chromosomal abnormalities based on SNP intensity (such as chromosomal copy number changes) and heterozygosity data (including regions of LOH and some cases of UPD). The program and source code are available at the SNPscan website. PMID:16420694

  19. Development and evaluation of a high density genotyping 'Axiom_Arachis' array with 58K SNPs for accelerating genetics and breeding in groundnut

    USDA-ARS's Scientific Manuscript database

    Single nucleotide polymorphisms (SNPs) are the most abundant DNA sequence variation in the genomes which can be used to associate genotypic variation to the phenotype. Therefore, availability of a high-density SNP array with uniform genome coverage can advance genetic studies and breeding applicatio...

  20. Nonspecific microvascular vasodilation during iontophoresis is attenuated by application of hyperosmolar saline.

    PubMed

    Asberg, A; Holm, T; Vassbotn, T; Andreassen, A K; Hartmann, A

    1999-07-01

    Iontophoretic administration of acetylcholine chloride (ACh) and sodium nitroprusside (SNP) combined with laser Doppler skin blood perfusion measurements is used for determination of endothelial-dependent and -independent vasodilation. However, the method is biased by nonspecific vasodilation. The primary aim of this study was to investigate if iontophoresis-induced nonspecific vasodilation may be attenuated by addition of high molar concentrations of NaCl to the iontophoresis solutions. Secondarily, we investigated the applicability of 5 mol/liter NaCl solution as vehicle for ACh and SNP in this method. Skin perfusion changes were determined for iontophoresis of pure vehicles, deionized water and 5 mol/liter NaCl solution, in 12 healthy volunteers. Responses in skin perfusion to iontophoresis of ACh and SNP dissolved in both vehicles were also investigated. Addition of 5 mol/liter NaCl to deionized water significantly attenuated the nonspecific vasodilation and lowered the potential applied over the skin. The inter- and intraindividual coefficients of variation to ACh and SNP responses became, however, higher using hyperosmolar vehicle. During iontophoresis of SNP (in deionized water) we were unable to distinguish between SNP and vehicle effects. This study shows that the nonspecific vasodilation induced by iontophoresis can be attenuated by addition of 5 mol/liter NaCl, possibly due to lower electrical potential over the skin. However, the variability of the method was not improved. When deionized water was used as vehicle the effect of SNP could not be differentiated from that of the vehicle. This was not the case for ACh. Copyright 1999 Academic Press.

  1. A 2-Stage Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms Associated With Development of Erectile Dysfunction Following Radiation Therapy for Prostate Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kerns, Sarah L.; Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York; Stock, Richard

    2013-01-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in the replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 × 10^-5 to 6.2 × 10^-4). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Somers' D P value = 1.7 × 10^-29). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 × 10^-19). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.
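
    The cumulative SNP score described above can be illustrated with a short sketch. It assumes a log-additive model in which each additional risk allele multiplies the odds by the reported factor of 2.2, so a difference of k alleles corresponds to an odds ratio of 2.2 to the power k. The genotype vectors and function names are hypothetical and serve only to show the arithmetic.

```python
def cumulative_risk_score(genotypes):
    """Sum risk-allele counts (0, 1 or 2 per SNP) across the SNP panel."""
    if any(g not in (0, 1, 2) for g in genotypes):
        raise ValueError("each genotype must be 0, 1 or 2 risk alleles")
    return sum(genotypes)

def relative_odds(score_a, score_b, per_allele_or=2.2):
    """Odds for score_a relative to score_b under a log-additive model."""
    return per_allele_or ** (score_a - score_b)

# Hypothetical patients genotyped at 12 SNPs (risk-allele counts per SNP).
patient_a = [1, 0, 2, 1, 0, 0, 1, 1, 0, 2, 0, 1]
patient_b = [1, 0, 1, 1, 0, 0, 1, 0, 0, 2, 0, 0]

score_a = cumulative_risk_score(patient_a)
score_b = cumulative_risk_score(patient_b)

# Odds of ED for patient A relative to patient B under the assumed model.
ratio = relative_odds(score_a, score_b)
```

    The sketch weights every risk allele equally, which matches a simple allele count; a weighted score using per-SNP effect sizes would be an alternative design that the abstract does not describe.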

  2. A reduced number of mtSNPs saturates mitochondrial DNA haplotype diversity of worldwide population groups.

    PubMed

    Salas, Antonio; Amigo, Jorge

    2010-05-03

    The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations ("African-Americans" and "Hispanics") were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of "universal" mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of mtSNPs is more efficient than previous empirical approaches. 
    In contrast to previous findings, the results seem to indicate that only a few mtSNPs are needed to reach high levels of discrimination power in a population, independently of its ancestral background.

  3. A whole genome SNP genotyping by DNA microarray and candidate gene association study for kidney stone disease

    PubMed Central

    2014-01-01

    Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in the majority of patients. Genetic and environmental factors may cause the disease. In the present study, we used a DNA microarray to genotype single nucleotide polymorphisms (SNPs) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods Whole genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, was evaluated for variations associated with KSD. Results Altogether, 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes, BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44 molecule, and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried a high proportion of SNPs with statistically significant differences in allele frequencies between the patient and control groups within the gene. In total, 26 SNPs showed significant differences in allele frequencies between the patient and control groups, and haplotypes associated with disease risk were identified. The SNP rs759330, located 144 bp downstream of BGLAP in a predicted microRNA binding site in the 3′UTR of PAQR6 (a gene encoding progestin and adipoQ receptor family member VI), was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD, and PAQR6 emerges as the strongest candidate because the associated SNP rs759330 is located in the miRNA binding site and may affect mRNA expression level. PMID:24886237

  4. A Reduced Number of mtSNPs Saturates Mitochondrial DNA Haplotype Diversity of Worldwide Population Groups

    PubMed Central

    Salas, Antonio; Amigo, Jorge

    2010-01-01

    Background The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. Methodology/Principal Findings This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations (“African-Americans” and “Hispanics”) were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of “universal” mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. 
    Conclusions/Significance The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of mtSNPs is more efficient than previous empirical approaches. In contrast to previous findings, the results seem to indicate that only a few mtSNPs are needed to reach high levels of discrimination power in a population, independently of its ancestral background. PMID:20454657
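
    The subset-sampling idea described in this abstract can be illustrated with a minimal sketch. It assumes haplotypes are encoded as equal-length allele strings and uses Nei's unbiased haplotype diversity, H = n/(n-1) * (1 - sum of squared haplotype frequencies); the abstract does not say which estimator or sampling scheme the authors used, and the toy data and function names below are hypothetical.

```python
import random
from collections import Counter

def haplotype_diversity(haplotypes):
    """Nei's unbiased haplotype diversity for a list of hashable haplotypes."""
    n = len(haplotypes)
    if n < 2:
        return 0.0
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)

def project(sequences, sites):
    """Reduce each sequence to the alleles at the chosen SNP positions."""
    return [tuple(seq[i] for i in sites) for seq in sequences]

def best_random_panel(sequences, candidate_sites, panel_size, trials, seed=0):
    """Sample random panels of a fixed size and keep the most diverse one."""
    rng = random.Random(seed)
    best_sites, best_h = None, -1.0
    for _ in range(trials):
        sites = tuple(sorted(rng.sample(candidate_sites, panel_size)))
        h = haplotype_diversity(project(sequences, sites))
        if h > best_h:
            best_sites, best_h = sites, h
    return best_sites, best_h

# Hypothetical toy data: six samples typed at four candidate mtSNPs.
toy = ["ACGT", "ACGA", "ATGT", "ATGA", "ACGT", "GCGT"]
full = haplotype_diversity(project(toy, [0, 1, 2, 3]))
sites, h = best_random_panel(toy, [0, 1, 2, 3], panel_size=2, trials=50)
print(f"full panel H = {full:.3f}; best 2-SNP panel {sites} H = {h:.3f}")
```

    Comparing the best reduced-panel value against the full-panel value gives the kind of "fraction of maximum diversity" figure the abstract reports, here only for made-up data.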

  5. A genetic stochastic process model for genome-wide joint analysis of biomarker dynamics and disease susceptibility with longitudinal data.

    PubMed

    He, Liang; Zhbannikov, Ilya; Arbeev, Konstantin G; Yashin, Anatoliy I; Kulminski, Alexander M

    2017-11-01

    Unraveling the underlying biological mechanisms or pathways behind the effects of genetic variations on complex diseases remains one of the major challenges in the post-GWAS (where GWAS is genome-wide association study) era. To further explore the relationship between genetic variations, biomarkers, and diseases for elucidating underlying pathological mechanism, a huge effort has been placed on examining pleiotropic and gene-environmental interaction effects. We propose a novel genetic stochastic process model (GSPM) that can be applied to GWAS and jointly investigate the genetic effects on longitudinally measured biomarkers and risks of diseases. This model is characterized by more profound biological interpretation and takes into account the dynamics of biomarkers during follow-up when investigating the hazards of a disease. We illustrate the rationale and evaluate the performance of the proposed model through two GWAS. One is to detect single nucleotide polymorphisms (SNPs) having interaction effects on type 2 diabetes (T2D) with body mass index (BMI) and the other is to detect SNPs affecting the optimal BMI level for protecting from T2D. We identified multiple SNPs that showed interaction effects with BMI on T2D, including a novel SNP rs11757677 in the CDKAL1 gene (P = 5.77 × 10^-7). We also found a SNP rs1551133 located on 2q14.2 that reversed the effect of BMI on T2D (P = 6.70 × 10^-7). In conclusion, the proposed GSPM provides a promising and useful tool in GWAS of longitudinal data for interrogating pleiotropic and interaction effects to gain more insights into the relationship between genes, quantitative biomarkers, and risks of complex diseases. © 2017 WILEY PERIODICALS, INC.

  6. Genetic variation in Pythium myriotylum based on SNP typing and development of a PCR-RFLP detection of isolates recovered from Pythium soft rot ginger.

    PubMed

    Le, D P; Smith, M K; Aitken, E A B

    2017-10-01

    Pythium myriotylum is responsible for severe losses in both capsicum and ginger crops in Australia under different regimes. Intraspecific genomic variation within the pathogen might explain the differences in aggressiveness and pathogenicity on diverse hosts. In this study, whole genome data of four P. myriotylum isolates recovered from three hosts and one Pythium zingiberis isolate were derived and analysed for sequence diversity based on single nucleotide polymorphisms (SNPs). A higher number of true and unique SNPs occurred in P. myriotylum isolates obtained from ginger with symptoms of Pythium soft rot (PSR) in Australia compared to other P. myriotylum isolates. Overall, more SNPs were discovered in the mitochondrial genome than in the nuclear genome. Among the SNPs, a single substitution from cytosine (C) to thymine (T) in the partially sequenced CoxII gene of 14 representatives of PSR P. myriotylum isolates was within a restriction site of the HinP1I enzyme, which was used in the PCR-RFLP for detection and identification of the isolates without sequencing. The PCR-RFLP was also sensitive enough to detect PSR P. myriotylum strains from artificially infected ginger without the need for isolation of pure cultures. This is the first study of intraspecific variants of Pythium myriotylum isolates recovered from different hosts and origins based on single nucleotide polymorphism (SNP) genotyping of multiple genes. The SNPs discovered provide valuable markers for detection and identification of P. myriotylum strains initially isolated from Pythium soft rot (PSR) ginger by using PCR-RFLP of the CoxII locus. The PCR-RFLP was also sensitive enough to detect P. myriotylum directly from PSR ginger sampled from pot trials without the need for isolation of pure cultures. © 2017 The Society for Applied Microbiology.
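
    The logic behind a PCR-RFLP test of this kind can be sketched in a few lines: a SNP inside a restriction site changes whether the enzyme cuts the amplicon, so the two alleles give different fragment patterns. The sketch assumes the HinP1I recognition sequence GCGC with the cut after the first G; the sequences are hypothetical, not the CoxII amplicon, and whether the C-to-T change creates or abolishes the site in the real sequence is not stated in the abstract.

```python
HINP1I_SITE = "GCGC"   # recognition sequence assumed for this sketch
CUT_OFFSET = 1         # cut between G and CGC

def fragment_lengths(amplicon, site=HINP1I_SITE, cut_offset=CUT_OFFSET):
    """Return fragment lengths after a complete digest of a linear amplicon."""
    seq = amplicon.upper()
    cuts = [i + cut_offset
            for i in range(len(seq) - len(site) + 1)
            if seq[i:i + len(site)] == site]
    bounds = [0] + cuts + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]

# Hypothetical 10-bp amplicons differing by one C/T substitution.
allele_c = "ATTGCGCTTA"   # contains GCGC, so it is cut
allele_t = "ATTGTGCTTA"   # site disrupted, so it stays intact

print(fragment_lengths(allele_c))
print(fragment_lengths(allele_t))
```

    On a gel, the cut allele would show two shorter bands and the uncut allele a single full-length band, which is what allows typing without sequencing.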

  7. Association of Interleukin 23 Receptor Polymorphisms with Anti-Topoisomerase-I Positivity and Pulmonary Hypertension in Systemic Sclerosis

    PubMed Central

    AGARWAL, SANDEEP K.; GOURH, PRAVITT; SHETE, SANJAY; PAZ, GENE; DIVECHA, DIPAL; REVEILLE, JOHN D.; ASSASSI, SHERVIN; TAN, FILEMON K.; MAYES, MAUREEN D.; ARNETT, FRANK C.

    2010-01-01

    Objective IL23R has been identified as a susceptibility gene for development of multiple autoimmune diseases. We investigated the possible association of IL23R with systemic sclerosis (SSc), an autoimmune disease that leads to the development of cutaneous and visceral fibrosis. Methods We tested 9 single-nucleotide polymorphisms (SNP) in IL23R for association with SSc in a cohort of 1402 SSc cases and 1038 controls. IL23R SNP tested were previously identified as SNP showing associations with inflammatory bowel disease. Results Case-control comparisons revealed no statistically significant differences between patients and healthy controls with any of the IL23R polymorphisms. Analyses of subsets of SSc patients showed that rs11209026 (Arg381Gln variant) was associated with anti-topoisomerase I antibody (ATA)-positive SSc (p = 0.001) and rs11465804 SNP was associated with diffuse and ATA-positive SSc (p = 0.0001, p = 0.0026, respectively). These associations remained significant after accounting for multiple comparisons using the false discovery rate method. Wild-type genotype at both rs11209026 and rs11465804 showed significant protection against the presence of pulmonary hypertension (PHT) (p = 3 × 10^-5, p = 1 × 10^-5, respectively). Conclusion Polymorphisms in IL23R are associated with susceptibility to ATA-positive SSc and protective against development of PHT in patients with SSc. PMID:19918037

  8. Influence of SNPs in nutrient-sensitive candidate genes and gene-diet interactions on blood lipids: the DiOGenes study.

    PubMed

    Brahe, Lena K; Ängquist, Lars; Larsen, Lesli H; Vimaleswaran, Karani S; Hager, Jörg; Viguerie, Nathalie; Loos, Ruth J F; Handjieva-Darlenska, Teodora; Jebb, Susan A; Hlavaty, Petr; Larsen, Thomas M; Martinez, J Alfredo; Papadaki, Angeliki; Pfeiffer, Andreas F H; van Baak, Marleen A; Sørensen, Thorkild I A; Holst, Claus; Langin, Dominique; Astrup, Arne; Saris, Wim H M

    2013-09-14

    Blood lipid response to a given dietary intervention could be determined by the effect of diet, gene variants or gene-diet interactions. The objective of the present study was to investigate whether variants in presumed nutrient-sensitive genes involved in lipid metabolism modified lipid profile after weight loss and in response to a given diet, among overweight European adults participating in the Diet Obesity and Genes study. By multiple linear regressions, 240 SNPs in twenty-four candidate genes were investigated for SNP main and SNP-diet interaction effects on total cholesterol, LDL-cholesterol, HDL-cholesterol and TAG after an 8-week low-energy diet (only main effect), and a 6-month ad libitum weight maintenance diet, with different contents of dietary protein or glycaemic index. After adjusting for multiple testing, a SNP-dietary protein interaction effect on TAG was identified for lipin 1 (LPIN1) rs4315495, with a decrease in TAG of −0.26 mmol/l per A-allele/protein unit (95% CI −0.38, −0.14; P=0.000043). In conclusion, we investigated SNP-diet interactions for blood lipid profiles for 240 SNPs in twenty-four candidate genes, selected for their involvement in lipid metabolism pathways, and identified one significant interaction between LPIN1 rs4315495 and dietary protein for TAG concentration.

  9. Development of a rapid SNP-typing assay to differentiate Bifidobacterium animalis ssp. lactis strains used in probiotic-supplemented dairy products.

    PubMed

    Lomonaco, Sara; Furumoto, Emily J; Loquasto, Joseph R; Morra, Patrizia; Grassi, Ausilia; Roberts, Robert F

    2015-02-01

    Identification at the genus, species, and strain levels is desirable when a probiotic microorganism is added to foods. Strains of Bifidobacterium animalis ssp. lactis (BAL) are commonly used worldwide in dairy products supplemented with probiotic strains. However, strain discrimination is difficult because of the high degree of genome identity (99.975%) between different genomes of this subspecies. Typing of monomorphic species can be carried out efficiently by targeting informative single nucleotide polymorphisms (SNP). Findings from a previous study analyzing both reference and commercial strains of BAL identified SNP that could be used to discriminate common strains into 8 groups. This paper describes development of a minisequencing assay based on the primer extension reaction (PER) targeting multiple SNP that can allow strain differentiation of BAL. Based on previous data, 6 informative SNP were selected for further testing, and a multiplex preliminary PCR was optimized to amplify the DNA regions containing the selected SNP. Extension primers (EP) annealing immediately adjacent to the selected SNP were developed and tested in simplex and multiplex PER to evaluate their performance. Twenty-five strains belonging to 9 distinct genomic clusters of B. animalis ssp. lactis were selected and analyzed using the developed minisequencing assay, simultaneously targeting the 6 selected SNP. Fragment analysis was subsequently carried out in duplicate and demonstrated that the assay yielded 8 specific profiles separating the most commonly used commercial strains. This novel multiplex PER approach provides a simple, rapid, flexible SNP-based subtyping method for proper characterization and identification of commercial probiotic strains of BAL from fermented dairy products. To assess the usefulness of this method, DNA was extracted from yogurt manufactured with and without the addition of B. animalis ssp. lactis BB-12. 
    Extracted DNA was then subjected to the minisequencing protocol, resulting in a SNP profile matching the profile for the strain BB-12. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  10. Genetic Variant of Kalirin Gene Is Associated with Ischemic Stroke in a Chinese Han Population.

    PubMed

    Li, Hong; Yu, Shasha; Wang, Rui; Sun, Zhaoqing; Zhou, Xinghu; Zheng, Liqiang; Yin, Zhihua; Zhang, Xingang; Sun, Yingxian

    2017-01-01

    Ischemic stroke is a complex disorder resulting from the interplay of genetic and environmental factors. Previous studies showed that kalirin gene variations were associated with cardiovascular disease. However, the association between this gene and ischemic stroke was unknown. We performed this study to determine whether kalirin gene variation was associated with ischemic stroke. We enrolled 385 ischemic stroke patients and 362 controls from China. Three SNPs of the kalirin gene were genotyped by the ligase detection reaction-PCR method. Data were processed with SPSS and the SHEsis platform. SNP rs7620580 (dominant model: OR = 1.590, p = 0.002 and adjusted OR = 1.662, p = 0.014; additive model: OR = 1.490, p = 0.002 and adjusted OR = 1.636, p = 0.005; recessive model: OR = 2.686, p = 0.039) and SNP rs1708303 (dominant model: OR = 1.523, p = 0.007 and adjusted OR = 1.604, p = 0.028; additive model: OR = 1.438, p = 0.01 and adjusted OR = 1.476, p = 0.039) were associated with ischemic stroke. The GG genotype and G allele of SNP rs7620580 were associated with a risk for ischemic stroke with an adjusted OR of 3.195 and an OR of 1.446, respectively. Haplotype analysis revealed that A-T-G, G-T-A, and A-T-A haplotypes were associated with ischemic stroke. Our results provide evidence that kalirin gene variations were associated with ischemic stroke in the Chinese Han population.
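
    The unadjusted odds ratios reported under different genetic models come from collapsing the three genotype counts into a 2x2 table in different ways. The sketch below shows the dominant, recessive, and allelic collapses with a Woolf-type 95% confidence interval. The genotype counts are hypothetical (the abstract gives none), G is assumed to be the risk allele, and the additive model and covariate adjustment, which would normally need logistic regression, are not shown.

```python
import math

def collapse(genotypes, model):
    """Map (AA, AG, GG) counts to (exposed, unexposed) for a genetic model."""
    aa, ag, gg = genotypes
    if model == "dominant":    # GG + AG versus AA
        return ag + gg, aa
    if model == "recessive":   # GG versus AG + AA
        return gg, aa + ag
    if model == "allelic":     # G alleles versus A alleles
        return 2 * gg + ag, 2 * aa + ag
    raise ValueError(f"unknown model: {model}")

def odds_ratio(case_exp, case_unexp, ctrl_exp, ctrl_unexp):
    """Odds ratio with a Woolf (log-normal) 95% confidence interval."""
    or_ = (case_exp * ctrl_unexp) / (case_unexp * ctrl_exp)
    se = math.sqrt(1 / case_exp + 1 / case_unexp + 1 / ctrl_exp + 1 / ctrl_unexp)
    lower = math.exp(math.log(or_) - 1.96 * se)
    upper = math.exp(math.log(or_) + 1.96 * se)
    return or_, lower, upper

# Hypothetical genotype counts (AA, AG, GG); not taken from the study.
cases = (100, 80, 20)
controls = (120, 70, 10)

for model in ("dominant", "recessive", "allelic"):
    estimate, lower, upper = odds_ratio(*collapse(cases, model), *collapse(controls, model))
    print(f"{model}: OR = {estimate:.3f} (95% CI {lower:.3f} to {upper:.3f})")
```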

  11. Methylphenidate side effect profile is influenced by genetic variation in the attention-deficit/hyperactivity disorder-associated CES1 gene.

    PubMed

    Johnson, Katherine A; Barry, Edwina; Lambert, David; Fitzgerald, Michael; McNicholas, Fiona; Kirley, Aiveen; Gill, Michael; Bellgrove, Mark A; Hawi, Ziarih

    2013-12-01

    A naturalistic, prospective study of the influence of genetic variation on dose prescribed, clinical response, and side effects related to stimulant medication in 77 children with attention-deficit/hyperactivity disorder (ADHD) was undertaken. The influence of genetic variation of the CES1 gene coding for carboxylesterase 1A1 (CES1A1), the major enzyme responsible for the first-pass, stereoselective metabolism of methylphenidate, was investigated. Parent- and teacher-rated behavioral questionnaires were collected at baseline when the children were medication naïve, and again at 6 weeks while they were on medication. Medication dose, prescribed at the discretion of the treating clinician, and side effects, were recorded at week 6. Blood and saliva samples were collected for genotyping. Single nucleotide polymorphisms (SNPs) were selected in the coding, non-coding and the 3' flanking region of the CES1 gene. Genetic association between CES1 variants and ADHD was investigated in an expanded sample of 265 Irish ADHD families. Analyses were conducted using analysis of covariance (ANCOVA) and logistic regression models. None of the CES1 gene variants were associated with the dose of methylphenidate provided or the clinical response recorded at the 6 week time point. An association between two CES1 SNP markers and the occurrence of sadness as a side effect of short-acting methylphenidate was found. The two associated CES1 markers were in linkage disequilibrium and were significantly associated with ADHD in a larger sample of ADHD trios. The associated CES1 markers were also in linkage disequilibrium with two SNP markers of the noradrenaline transporter gene (SLC6A2). This study found an association between two CES1 SNP markers and the occurrence of sadness as a side effect of short-acting methylphenidate. These markers were in linkage disequilibrium together and with two SNP markers of the noradrenaline transporter gene.

  12. Single nucleotide polymorphism-specific regulation of matrix metalloproteinase-9 by multiple miRNAs targeting the coding exon

    PubMed Central

    Duellman, Tyler; Warren, Christopher; Yang, Jay

    2014-01-01

    Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather to SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNAs targeting MMP-9 SNPs, mostly in the coding exon, and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggested that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221

  13. META-ANALYSIS OF GENOME-WIDE STUDIES IDENTIFIES WNT16 AND ESR1 SNPS ASSOCIATED WITH BONE MINERAL DENSITY IN PREMENOPAUSAL WOMEN

    PubMed Central

    Koller, Daniel L.; Zheng, Hou-Feng; Karasik, David; Yerges-Armstrong, Laura; Liu, Ching-Ti; McGuigan, Fiona; Kemp, John P.; Giroux, Sylvie; Lai, Dongbing; Edenberg, Howard J.; Peacock, Munro; Czerwinski, Stefan A.; Choh, Audrey C.; McMahon, George; St Pourcain, Beate; Timpson, Nicholas J.; Lawlor, Debbie A; Evans, David M; Towne, Bradford; Blangero, John; Carless, Melanie A.; Kammerer, Candace; Goltzman, David; Kovacs, Christopher S.; Prior, Jerilynn C.; Spector, Tim D.; Rousseau, Francois; Tobias, Jon H.; Akesson, Kristina; Econs, Michael J.; Mitchell, Braxton D.; Richards, J. Brent; Kiel, Douglas P.; Foroud, Tatiana

    2013-01-01

    Previous genome-wide association studies (GWAS) have identified common variants in genes associated with variation in bone mineral density (BMD), although most have been carried out in combined samples of older women and men. Meta-analyses of these results have identified numerous SNPs of modest effect at genome-wide significance levels in genes involved in both bone formation and resorption, as well as other pathways. We performed a meta-analysis restricted to premenopausal white women from four cohorts (n = 4,061 women, ages 20 to 45) to identify genes influencing peak bone mass at the lumbar spine and femoral neck. Following imputation, age- and weight-adjusted BMD values were tested for association with each SNP. Association of a SNP in the WNT16 gene (rs3801387; p = 1.7 × 10^-9) and multiple SNPs in the ESR1/C6orf97 (rs4870044; p = 1.3 × 10^-8) achieved genome-wide significance levels for lumbar spine BMD. These SNPs, along with others demonstrating suggestive evidence of association, were then tested for association in seven Replication cohorts that included premenopausal women of European, Hispanic-American, and African-American descent (combined n = 5,597 for femoral neck; 4,744 for lumbar spine). When the data from the Discovery and Replication cohorts were analyzed jointly, the evidence was more significant (WNT16 joint p = 1.3 × 10^-11; ESR1/C6orf97 joint p = 1.4 × 10^-10). Multiple independent association signals were observed with spine BMD at the ESR1 region after conditioning on the primary signal. Analyses of femoral neck BMD also supported association with SNPs in WNT16 and ESR1/C6orf97 (p < 1 × 10^-5). Our results confirm that several of the genes contributing to BMD variation across a broad age range in both sexes have effects of similar magnitude on BMD of the spine in premenopausal women. These data support the hypothesis that variants in these genes of known skeletal function also affect BMD during the premenopausal period. PMID:23074152

  14. Genome-wide selective sweeps and gene-specific sweeps in natural bacterial populations

    DOE PAGES

    Bendall, Matthew L.; Stevens, Sarah L.R.; Chan, Leong-Keat; ...

    2016-01-08

    Multiple models describe the formation and evolution of distinct microbial phylogenetic groups. These evolutionary models make different predictions regarding how adaptive alleles spread through populations and how genetic diversity is maintained. Processes predicted by competing evolutionary models, for example, genome-wide selective sweeps vs gene-specific sweeps, could be captured in natural populations using time-series metagenomics if the approach were applied over a sufficiently long time frame. Direct observations of either process would help resolve how distinct microbial groups evolve. Using a 9-year metagenomic study of a freshwater lake (2005–2013), we explore changes in single-nucleotide polymorphism (SNP) frequencies and patterns of gene gain and loss in 30 bacterial populations. SNP analyses revealed substantial genetic heterogeneity within these populations, although the degree of heterogeneity varied by >1000-fold among populations. SNP allele frequencies also changed dramatically over time within some populations. Interestingly, nearly all SNP variants were slowly purged over several years from one population of green sulfur bacteria, while at the same time multiple genes either swept through or were lost from this population. Furthermore, these patterns were consistent with a genome-wide selective sweep in progress, a process predicted by the ‘ecotype model’ of speciation but not previously observed in nature. In contrast, other populations contained large, SNP-free genomic regions that appear to have swept independently through the populations prior to the study without purging diversity elsewhere in the genome. Finally, evidence for both genome-wide and gene-specific sweeps suggests that different models of bacterial speciation may apply to different populations coexisting in the same environment.

  15. Analysis of Multiple Association Studies Provides Evidence of an Expression QTL Hub in Gene-Gene Interaction Network Affecting HDL Cholesterol Levels

    PubMed Central

    Ma, Li; Ballantyne, Christie; Brautbar, Ariel; Keinan, Alon

    2014-01-01

    Epistasis has been suggested to underlie part of the missing heritability in genome-wide association studies. In this study, we first report an analysis of gene-gene interactions affecting HDL cholesterol (HDL-C) levels in a candidate gene study of 2,091 individuals with mixed dyslipidemia from a clinical trial. Two additional studies, the Atherosclerosis Risk in Communities study (ARIC; n = 9,713) and the Multi-Ethnic Study of Atherosclerosis (MESA; n = 2,685), were considered for replication. We identified a gene-gene interaction between rs1532085 and rs12980554 (P = 7.1 × 10^-7) in their effect on HDL-C levels, which is significant after Bonferroni correction (Pc = 0.017) for the number of SNP pairs tested. The interaction successfully replicated in the ARIC study (P = 7.0 × 10^-4; Pc = 0.02). Rs1532085, an expression QTL (eQTL) of LIPC, is one of the two SNPs involved in another, well-replicated gene-gene interaction underlying HDL-C levels. To further investigate the role of this eQTL SNP in gene-gene interactions affecting HDL-C, we tested in the ARIC study for interaction between this SNP and any other SNP genome-wide. We found the eQTL to be involved in a few suggestive interactions, one of which significantly replicated in MESA. Importantly, these gene-gene interactions, involving only rs1532085, explain an additional 1.4% variation of HDL-C, on top of the 0.65% explained by rs1532085 alone. LIPC plays a key role in the lipid metabolism pathway and it, and rs1532085 in particular, has been associated with HDL-C and other lipid levels. Collectively, we discovered several novel gene-gene interactions, all involving an eQTL of LIPC, thus suggesting a hub role of LIPC in the gene-gene interaction network that regulates HDL-C levels, which in turn raises the hypothesis that LIPC's contribution is largely via interactions with other lipid metabolism related genes. PMID:24651390
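
    The Bonferroni correction mentioned in this abstract multiplies a raw P value by the number of tests and caps the result at 1. As a rough illustration, the sketch below also back-calculates the number of SNP pairs implied by the two reported values. That count is not stated in the abstract and the reported values are rounded, so the result is only an order-of-magnitude estimate; the function names are hypothetical.

```python
def bonferroni(p_raw, n_tests):
    """Bonferroni-corrected P value: raw P times number of tests, capped at 1."""
    return min(1.0, p_raw * n_tests)

def implied_tests(p_raw, p_corrected):
    """Number of tests implied by a raw and a corrected P value (valid when p_corrected < 1)."""
    return p_corrected / p_raw

# Values reported in the abstract for the discovery interaction.
p_raw, p_corrected = 7.1e-7, 0.017

n_pairs = implied_tests(p_raw, p_corrected)
print(f"implied number of SNP pairs tested: about {round(n_pairs):,}")
print(f"check: corrected P = {bonferroni(p_raw, round(n_pairs)):.3f}")
```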

  16. Genome-wide Association Mapping of Qualitatively Inherited Traits in a Germplasm Collection.

    PubMed

    Bandillo, Nonoy B; Lorenz, Aaron J; Graef, George L; Jarquin, Diego; Hyten, David L; Nelson, Randall L; Specht, James E

    2017-07-01

    Genome-wide association (GWA) has been used as a tool for dissecting the genetic architecture of quantitatively inherited traits. We demonstrate here that GWA can also be highly useful for detecting many major genes governing categorically defined phenotype variants that exist for qualitatively inherited traits in a germplasm collection. Genome-wide association mapping was applied to categorical phenotypic data available for 10 descriptive traits in a collection of ∼13,000 soybean [Glycine max (L.) Merr.] accessions that had been genotyped with a 50,000 single nucleotide polymorphism (SNP) chip. A GWA on a panel of accessions of this magnitude can offer substantial statistical power and mapping resolution, and we found that GWA mapping resulted in the identification of strong SNP signals for 24 classical genes as well as several heretofore unknown genes controlling the phenotypic variants in those traits. Because some of these genes had been cloned, we were able to show that the narrow GWA mapping SNP signal regions that we detected for the phenotypic variants had chromosomal bp spans that, with just one exception, overlapped the bp region of the cloned genes, despite local variation in SNP number and nonuniform SNP distribution in the chip set. Copyright © 2017 Crop Science Society of America.

  17. MMP9 polymorphisms and breast cancer risk: a report from the Shanghai Breast Cancer Genetics Study.

    PubMed

    Beeghly-Fadiel, Alicia; Lu, Wei; Shu, Xiao-Ou; Long, Jirong; Cai, Qiuyin; Xiang, Yongbin; Gao, Yu-Tang; Zheng, Wei

    2011-04-01

    In addition to tumor invasion and angiogenesis, matrix metalloproteinase (MMP)9 also contributes to carcinogenesis and tumor growth. Genetic variation that may influence MMP9 expression was evaluated among participants of the Shanghai Breast Cancer Genetics Study (SBCGS) for associations with breast cancer susceptibility. In stage 1, 11 MMP9 single nucleotide polymorphisms (SNPs) were genotyped by the Affymetrix Targeted Genotyping System and/or the Affymetrix Genome-Wide Human SNP Array 6.0 among 4,227 SBCGS participants. One SNP was further genotyped using the Sequenom iPLEX MassARRAY platform among an additional 6,270 SBCGS participants. Associations with breast cancer risk were evaluated by odds ratios (OR) and 95% confidence intervals (CI) from logistic regression models that included adjustment for age, education, and genotyping stage when appropriate. In Stage 1, rare allele homozygotes for a promoter SNP (rs3918241) or a non-synonymous SNP (rs2274756, R668Q) tended to occur more frequently among breast cancer cases (P value = 0.116 and 0.056, respectively). Given their high linkage disequilibrium (D' = 1.0, r^2 = 0.97), one (rs3918241) was selected for additional analysis. An association with breast cancer risk was not supported by additional Stage 2 genotyping. In combined analysis, no elevated risk of breast cancer among homozygotes was found (OR: 1.2, 95% CI: 0.8-1.8). Common genetic variation in MMP9 was not found to be significantly associated with breast cancer susceptibility among participants of the Shanghai Breast Cancer Genetics Study.
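
    The two linkage disequilibrium measures quoted in this abstract, D' and r^2, can both be computed from one haplotype frequency and the two allele frequencies. The sketch below uses the standard definitions D = p_AB - p_A * p_B, D' = |D| / D_max, and r^2 = D^2 / (p_A (1 - p_A) p_B (1 - p_B)). The input frequencies are hypothetical values chosen only to land near the reported figures; they are not the study's data.

```python
def ld_stats(p_ab, p_a, p_b):
    """Return (D', r^2) from haplotype frequency p_ab and allele frequencies p_a, p_b."""
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b
    # D_max depends on the sign of D.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d_prime, r2

# Hypothetical frequencies: the rare allele at locus A always occurs on the
# rare-allele background at locus B, which is slightly more common.
d_prime, r2 = ld_stats(p_ab=0.10, p_a=0.10, p_b=0.103)
print(f"D' = {d_prime:.2f}, r^2 = {r2:.2f}")
```

    When D' is 1 but the allele frequencies differ slightly, r^2 falls just below 1, which is the pattern the abstract reports for the two MMP9 SNPs.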

  18. Effects of ambient and preceding temperatures and metabolic genes on flight metabolism in the Glanville fritillary butterfly.

    PubMed

    Wong, Swee Chong; Oksanen, Alma; Mattila, Anniina L K; Lehtonen, Rainer; Niitepõld, Kristjan; Hanski, Ilkka

    2016-02-01

    Flight is essential for foraging, mate searching and dispersal in many insects, but flight metabolism in ectotherms is strongly constrained by temperature. Thermal conditions vary greatly in natural populations and may hence restrict fitness-related activities. Working on the Glanville fritillary butterfly (Melitaea cinxia), we studied the effects of temperature experienced during the first 2 days of adult life on flight metabolism, genetic associations between flight metabolic rate and variation in candidate metabolic genes, and genotype-temperature interactions. The maximal flight performance was reduced by 17% by 2 days of low ambient temperature (15 °C) prior to the flight trial, mimicking conditions that butterflies commonly encounter in nature. A SNP in phosphoglucose isomerase (Pgi) had a significant association with flight metabolic rate in males and a SNP in triosephosphate isomerase (Tpi) was significantly associated with flight metabolic rate in females. In the Pgi SNP, AC heterozygotes had higher flight metabolic rate than AA homozygotes following low preceding temperature, but the trend was reversed following high preceding temperature, consistent with previous results on genotype-temperature interaction for this SNP. We suggest that these results on 2-day old butterflies reflect thermal effect on the maturation of flight muscles. These results highlight the consequences of variation in thermal conditions on the time scale of days, and they contribute to a better understanding of the complex dynamics of flight metabolism and flight-related activities under conditions that are relevant for natural populations living under variable thermal conditions. Copyright © 2015 The Authors. Published by Elsevier Ltd. All rights reserved.

  19. An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases.

    PubMed

    Yang, Jin Ok; Hwang, Sohyun; Oh, Jeongsu; Bhak, Jong; Sohn, Tae-Kwon

    2008-12-12

    Studies on the relationship between disease and genetic variations such as single nucleotide polymorphisms (SNPs) are important. Genetic variations can cause disease by influencing important biological regulation processes. Despite the need to analyze SNP-disease correlations, most existing databases provide information only on functional variants at specific locations on the genome, or deal with only a few genes associated with disease. There is no combined resource to widely support gene-, SNP-, and disease-related information, and to capture relationships among such data. Therefore, we developed an integrated database-pipeline system for studying SNPs and diseases. To implement the pipeline system for the integrated database, we first unified complicated and redundant disease terms and gene names using the Unified Medical Language System (UMLS) for classification and noun modification, and the HUGO Gene Nomenclature Committee (HGNC) and NCBI gene databases. Next, we collected and integrated representative databases for three categories of information. For genes and proteins, we examined the NCBI mRNA, UniProt, UCSC Table Track and MitoDat databases. For genetic variants we used the dbSNP, JSNP, ALFRED, and HGVbase databases. For disease, we employed OMIM, GAD, and HGMD databases. The database-pipeline system provides a disease thesaurus, including genes and SNPs associated with disease. The search results for these categories are available on the web page http://diseasome.kobic.re.kr/, and a genome browser is also available to highlight findings, as well as to permit the convenient review of potentially deleterious SNPs among genes strongly associated with specific diseases and clinical phenotypes. Our system is designed to capture the relationships between SNPs associated with disease and disease-causing genes. The integrated database-pipeline provides a list of candidate genes and SNP markers for evaluation in both epidemiological and molecular biological approaches to disease-gene association studies. Furthermore, researchers can then semi-automatically select the data set for association studies while considering the relationships between genetic variation and diseases. The database can also make disease-association studies more economical and facilitate an understanding of the processes which cause disease. Currently, the database contains 14,674 SNP records and 109,715 gene records associated with human diseases and it is updated at regular intervals.

  20. A high-density association screen of 155 ion transport genes for involvement with common migraine

    PubMed Central

    Nyholt, Dale R.; LaForge, K. Steven; Kallela, Mikko; Alakurtti, Kirsi; Anttila, Verneri; Färkkilä, Markus; Hämaläinen, Eija; Kaprio, Jaakko; Kaunisto, Mari A.; Heath, Andrew C.; Montgomery, Grant W.; Göbel, Hartmut; Todt, Unda; Ferrari, Michel D.; Launer, Lenore J.; Frants, Rune R.; Terwindt, Gisela M.; de Vries, Boukje; Verschuren, W.M. Monique; Brand, Jan; Freilinger, Tobias; Pfaffenrath, Volker; Straube, Andreas; Ballinger, Dennis G.; Zhan, Yiping; Daly, Mark J.; Cox, David R.; Dichgans, Martin; van den Maagdenberg, Arn M.J.M.; Kubisch, Christian; Martin, Nicholas G.; Wessman, Maija; Peltonen, Leena; Palotie, Aarno

    2008-01-01

    The clinical overlap between monogenic Familial Hemiplegic Migraine (FHM) and common migraine subtypes, and the fact that all three FHM genes are involved in the transport of ions, suggest that ion transport genes may underlie susceptibility to common forms of migraine. To test this leading hypothesis, we examined common variation in 155 ion transport genes using 5257 single nucleotide polymorphisms (SNPs) in a Finnish sample of 841 unrelated migraine with aura cases and 884 unrelated non-migraine controls. The top signals were then tested for replication in four independent migraine case–control samples from the Netherlands, Germany and Australia, totalling 2835 unrelated migraine cases and 2740 unrelated controls. SNPs within 12 genes (KCNB2, KCNQ3, CLIC5, ATP2C2, CACNA1E, CACNB2, KCNE2, KCNK12, KCNK2, KCNS3, SCN5A and SCN9A) with promising nominal association (0.00041 < P < 0.005) in the Finnish sample were selected for replication. Although no variant remained significant after adjusting for multiple testing nor produced consistent evidence for association across all cohorts, a significant epistatic interaction between KCNB2 SNP rs1431656 (chromosome 8q13.3) and CACNB2 SNP rs7076100 (chromosome 10p12.33) (pointwise P = 0.00002; global P = 0.02) was observed in the Finnish case–control sample. We conclude that common variants of moderate effect size in ion transport genes do not play a major role in susceptibility to common migraine within these European populations, although there is some evidence for epistatic interaction between potassium and calcium channel genes, KCNB2 and CACNB2. Multiple rare variants or trans-regulatory elements of these genes are not ruled out. PMID:18676988
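
    The multiple-testing adjustment mentioned in this abstract can be illustrated with a minimal sketch. The abstract does not state which correction was applied, so the Bonferroni rule and the family-wise level of 0.05 used here are assumptions for illustration only; the SNP count and the nominal P-value bounds are taken from the text.

```python
# Sketch of a Bonferroni check (assumed method; the abstract does not name one).

def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold that keeps the family-wise error at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests

N_SNPS = 5257   # number of SNPs tested, from the abstract
ALPHA = 0.05    # assumed family-wise error rate (not stated in the abstract)

threshold = bonferroni_threshold(ALPHA, N_SNPS)

# Bounds of the nominal association range reported for the 12 selected genes.
nominal_p_bounds = (0.00041, 0.005)
survivors = [p for p in nominal_p_bounds if p < threshold]

print(f"Bonferroni threshold: {threshold:.2e}")
print(f"Nominal P bounds passing the threshold: {survivors}")
```

    Under these assumptions neither bound of the nominal range passes the threshold, which is in line with the statement that no single variant remained significant after adjustment. Bonferroni treats the tests as independent, so it is conservative when SNPs are in linkage disequilibrium.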

  1. An innovative SNP genotyping method adapting to multiple platforms and throughputs.

    PubMed

    Long, Y M; Chao, W S; Ma, G J; Xu, S S; Qi, L L

    2017-03-01

    An innovative genotyping method designated as semi-thermal asymmetric reverse PCR (STARP) was developed for genotyping individual SNPs with improved accuracy, flexible throughputs, low operational costs, and high platform compatibility. Multiplex chip-based technology for genome-scale genotyping of single nucleotide polymorphisms (SNPs) has made great progress in the past two decades. However, PCR-based genotyping of individual SNPs still remains problematic in accuracy, throughput, simplicity, and/or operational costs as well as the compatibility with multiple platforms. Here, we report a novel SNP genotyping method designated semi-thermal asymmetric reverse PCR (STARP). In this method, genotyping assay was performed under unique PCR conditions using two universal priming element-adjustable primers (PEA-primers) and one group of three locus-specific primers: two asymmetrically modified allele-specific primers (AMAS-primers) and their common reverse primer. The two AMAS-primers each were substituted one base in different positions at their 3' regions to significantly increase the amplification specificity of the two alleles and tailed at 5' ends to provide priming sites for PEA-primers. The two PEA-primers were developed for common use in all genotyping assays to stringently target the PCR fragments generated by the two AMAS-primers with similar PCR efficiencies and for flexible detection using either gel-free fluorescence signals or gel-based size separation. The state-of-the-art primer design and unique PCR conditions endowed STARP with all the major advantages of high accuracy, flexible throughputs, simple assay design, low operational costs, and platform compatibility. In addition to SNPs, STARP can also be employed in genotyping of indels (insertion-deletion polymorphisms). As vast variations in DNA sequences are being unearthed by many genome sequencing projects and genotyping by sequencing, STARP will have wide applications across all biological organisms in agriculture, medicine, and forensics.

  2. Genetic and molecular risk factors within the newly identified primate-specific exon of the SAP97/DLG1 gene in the 3q29 schizophrenia-associated locus.

    PubMed

    Uezato, Akihito; Yamamoto, Naoki; Jitoku, Daisuke; Haramo, Emiko; Hiraaki, Eri; Iwayama, Yoshimi; Toyota, Tomoko; Umino, Masakazu; Umino, Asami; Iwata, Yasuhide; Suzuki, Katsuaki; Kikuchi, Mitsuru; Hashimoto, Tasuku; Kanahara, Nobuhisa; Kurumaji, Akeo; Yoshikawa, Takeo; Nishikawa, Toru

    2017-12-01

    The synapse-associated protein 97/discs, large homolog 1 of Drosophila (DLG1) gene encodes synaptic scaffold PDZ proteins interacting with ionotropic glutamate receptors including the N-methyl-D-aspartate type glutamate receptor (NMDAR) that is presumed to be hypoactive in brains of patients with schizophrenia. The DLG1 gene resides in the chromosomal position 3q29, the microdeletion of which confers a 40-fold increase in the risk for schizophrenia. In the present study, we performed genetic association analyses for DLG1 gene using a Japanese cohort with 1808 schizophrenia patients and 2170 controls. We detected an association which remained significant after multiple comparison testing between schizophrenia and the single nucleotide polymorphism (SNP) rs3915512 that is located within the newly identified primate-specific exon (exon 3b) of the DLG1 gene and constitutes the exonic splicing enhancer sequence. When stratified by onset age, although it did not survive multiple comparisons, the association was observed in non-early onset schizophrenia, whose onset-age selectivity is consistent with our recent postmortem study demonstrating a decrease in the expression of the DLG1 variant in early-onset schizophrenia. Although the present study did not demonstrate the previously reported association of the SNP rs9843659 by itself, a meta-analysis revealed a significant association between DLG1 gene and schizophrenia. These findings provide a valuable clue for molecular mechanisms on how genetic variations in the primate-specific exon of the gene in the schizophrenia-associated 3q29 locus affect its regulation in the glutamate system and lead to the disease onset around a specific stage of brain development. © 2017 Wiley Periodicals, Inc.

  3. Genome-wide SNP discovery and population structure analysis in pepper (Capsicum annuum) using genotyping by sequencing.

    PubMed

    Taranto, F; D'Agostino, N; Greco, B; Cardi, T; Tripodi, P

    2016-11-21

    Knowledge of population structure and genetic diversity in vegetable crops is essential for association mapping studies and genomic selection. Genotyping by sequencing (GBS) represents an innovative method for large scale SNP detection and genotyping of genetic resources. Herein we used the GBS approach for the genome-wide identification of SNPs in a collection of Capsicum spp. accessions and for the assessment of the level of genetic diversity in a subset of 222 cultivated pepper (Capsicum annuum) genotypes. GBS analysis generated a total of 7,568,894 master tags, of which 43.4% uniquely aligned to the reference genome CM334. A total of 108,591 SNP markers were identified, of which 105,184 were in C. annuum accessions. In order to explore the genetic diversity of C. annuum and to select a minimal core set representing most of the total genetic variation with minimum redundancy, a subset of 222 C. annuum accessions were analysed using 32,950 high quality SNPs. Based on Bayesian and Hierarchical clustering it was possible to divide the collection into three clusters. Cluster I had the majority of varieties and landraces mainly from Southern and Northern Italy, and from Eastern Europe, whereas clusters II and III comprised accessions of different geographical origins. Considering the genome-wide genetic variation among the accessions included in cluster I, a second round of Bayesian (K = 3) and Hierarchical (K = 2) clustering was performed. These analyses showed that genotypes were grouped not only based on geographical origin, but also on fruit-related features. GBS data has proven useful to assess the genetic diversity in a collection of C. annuum accessions. The high number of SNP markers, uniformly distributed on the 12 chromosomes, allowed the accessions to be distinguished according to geographical origin and fruit-related features. SNP markers and information on population structure developed in this study will undoubtedly support genome-wide association mapping studies and marker-assisted selection programs.

  4. A mass spectrometry-based multiplex SNP genotyping by utilizing allele-specific ligation and strand displacement amplification.

    PubMed

    Park, Jung Hun; Jang, Hyowon; Jung, Yun Kyung; Jung, Ye Lim; Shin, Inkyung; Cho, Dae-Yeon; Park, Hyun Gyu

    2017-05-15

    We herein describe a new mass spectrometry-based method for multiplex SNP genotyping by utilizing allele-specific ligation and strand displacement amplification (SDA) reaction. In this method, allele-specific ligation is first performed to discriminate base sequence variations at the SNP site within the PCR-amplified target DNA. The primary ligation probe is extended by a universal primer annealing site while the secondary ligation probe has base sequences as an overhang with a nicking enzyme recognition site and complementary mass marker sequence. The ligation probe pairs are ligated by DNA ligase only at specific allele in the target DNA and the resulting ligated product serves as a template to promote the SDA reaction using a universal primer. This process isothermally amplifies short DNA fragments, called mass markers, to be analyzed by mass spectrometry. By varying the sizes of the mass markers, we successfully demonstrated the multiplex SNP genotyping capability of this method by reliably identifying several BRCA mutations in a multiplex manner with mass spectrometry. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Genome-Wide QTL Mapping for Wheat Processing Quality Parameters in a Gaocheng 8901/Zhoumai 16 Recombinant Inbred Line Population.

    PubMed

    Jin, Hui; Wen, Weie; Liu, Jindong; Zhai, Shengnan; Zhang, Yan; Yan, Jun; Liu, Zhiyong; Xia, Xianchun; He, Zhonghu

    2016-01-01

    Dough rheological and starch pasting properties play an important role in determining processing quality in bread wheat (Triticum aestivum L.). In the present study, a recombinant inbred line (RIL) population derived from a Gaocheng 8901/Zhoumai 16 cross grown in three environments was used to identify quantitative trait loci (QTLs) for dough rheological and starch pasting properties evaluated by Mixograph, Rapid Visco-Analyzer (RVA), and Mixolab parameters using the wheat 90 and 660 K single nucleotide polymorphism (SNP) chip assays. A high-density linkage map constructed with 46,961 polymorphic SNP markers from the wheat 90 and 660 K SNP assays spanned a total length of 4121 cM, with an average chromosome length of 196.2 cM and marker density of 0.09 cM/marker; 6596 new SNP markers were anchored to the bread wheat linkage map, with 1046 and 5550 markers from the 90 and 660 K SNP assays, respectively. Composite interval mapping identified 119 additive QTLs on 20 of the 21 chromosomes (all except 4D); among them, 15 accounted for more than 10% of the phenotypic variation across two or three environments. Twelve QTLs for Mixograph parameters, 17 for RVA parameters and 55 for Mixolab parameters were new. Eleven QTL clusters were identified. The closely linked SNP markers can be used in marker-assisted wheat breeding in combination with the Kompetitive Allele Specific PCR (KASP) technique for improvement of processing quality in bread wheat.
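
    The map summary statistics quoted in this abstract can be checked with a few lines of arithmetic. The sketch below assumes the averages are simple ratios over the 21 chromosomes of hexaploid bread wheat; the abstract does not spell out how they were computed, so that is an assumption, and the variable names are illustrative.

```python
# Arithmetic check of the linkage-map figures (ratios are an assumed definition).

TOTAL_MAP_LENGTH_CM = 4121      # total map length, from the abstract
N_MARKERS = 46_961              # polymorphic SNP markers, from the abstract
N_CHROMOSOMES = 21              # bread wheat (A, B and D genomes, 7 chromosomes each)

avg_chromosome_length = TOTAL_MAP_LENGTH_CM / N_CHROMOSOMES
marker_density = TOTAL_MAP_LENGTH_CM / N_MARKERS   # cM per marker

# Newly anchored markers reported per assay (90 K and 660 K).
new_markers = 1046 + 5550

print(f"Average chromosome length: {avg_chromosome_length:.1f} cM")
print(f"Marker density: {marker_density:.2f} cM/marker")
print(f"New markers anchored: {new_markers}")
```

    With these inputs the ratios round to the 196.2 cM and 0.09 cM/marker reported, and the two per-assay counts sum to the 6596 new markers.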

  6. Genome-Wide QTL Mapping for Wheat Processing Quality Parameters in a Gaocheng 8901/Zhoumai 16 Recombinant Inbred Line Population

    PubMed Central

    Jin, Hui; Wen, Weie; Liu, Jindong; Zhai, Shengnan; Zhang, Yan; Yan, Jun; Liu, Zhiyong; Xia, Xianchun; He, Zhonghu

    2016-01-01

    Dough rheological and starch pasting properties play an important role in determining processing quality in bread wheat (Triticum aestivum L.). In the present study, a recombinant inbred line (RIL) population derived from a Gaocheng 8901/Zhoumai 16 cross grown in three environments was used to identify quantitative trait loci (QTLs) for dough rheological and starch pasting properties evaluated by Mixograph, Rapid Visco-Analyzer (RVA), and Mixolab parameters using the wheat 90 and 660 K single nucleotide polymorphism (SNP) chip assays. A high-density linkage map constructed with 46,961 polymorphic SNP markers from the wheat 90 and 660 K SNP assays spanned a total length of 4121 cM, with an average chromosome length of 196.2 cM and marker density of 0.09 cM/marker; 6596 new SNP markers were anchored to the bread wheat linkage map, with 1046 and 5550 markers from the 90 and 660 K SNP assays, respectively. Composite interval mapping identified 119 additive QTLs on 20 of the 21 chromosomes (all except 4D); among them, 15 accounted for more than 10% of the phenotypic variation across two or three environments. Twelve QTLs for Mixograph parameters, 17 for RVA parameters and 55 for Mixolab parameters were new. Eleven QTL clusters were identified. The closely linked SNP markers can be used in marker-assisted wheat breeding in combination with the Kompetitive Allele Specific PCR (KASP) technique for improvement of processing quality in bread wheat. PMID:27486464

  7. Genome-wide copy number variant analysis in Holstein cattle reveals variants associated with 10 production traits including residual feed intake and dry matter intake

    USDA-ARS's Scientific Manuscript database

    Copy number variation (CNV) is an important type of genetic variation contributing to phenotypic differences among mammals and may serve as an alternative molecular marker to single nucleotide polymorphism (SNP) for genome-wide association study (GWAS). Recently, GWAS analysis using CNV has been app...

  8. Molecular genetic contributions to socioeconomic status and intelligence

    PubMed Central

    Marioni, Riccardo E.; Davies, Gail; Hayward, Caroline; Liewald, Dave; Kerr, Shona M.; Campbell, Archie; Luciano, Michelle; Smith, Blair H.; Padmanabhan, Sandosh; Hocking, Lynne J.; Hastie, Nicholas D.; Wright, Alan F.; Porteous, David J.; Visscher, Peter M.; Deary, Ian J.

    2014-01-01

    Education, socioeconomic status, and intelligence are commonly used as predictors of health outcomes, social environment, and mortality. Education and socioeconomic status are typically viewed as environmental variables although both correlate with intelligence, which has a substantial genetic basis. Using data from 6815 unrelated subjects from the Generation Scotland study, we examined the genetic contributions to these variables and their genetic correlations. Subjects underwent genome-wide testing for common single nucleotide polymorphisms (SNPs). DNA-derived heritability estimates and genetic correlations were calculated using the ‘Genome-wide Complex Trait Analyses’ (GCTA) procedures. 21% of the variation in education, 18% of the variation in socioeconomic status, and 29% of the variation in general cognitive ability was explained by variation in common SNPs (SEs ~ 5%). The SNP-based genetic correlations of education and socioeconomic status with general intelligence were 0.95 (SE 0.13) and 0.26 (0.16), respectively. There are genetic contributions to intelligence and education with near-complete overlap between common additive SNP effects on these traits (genetic correlation ~ 1). Genetic influences on socioeconomic status are also associated with the genetic foundations of intelligence. The results are also compatible with substantial environmental contributions to socioeconomic status. PMID:24944428

  9. Common polymorphic variation in the genetically diverse African insulin gene and its association with size at birth.

    PubMed

    Petry, Clive J; Rayco-Solon, Pura; Fulford, Anthony J C; Stead, John D H; Wingate, Dianne L; Ong, Ken K; Sirugo, Giorgio; Prentice, Andrew M; Dunger, David B

    2009-09-01

    The insulin variable number of tandem repeats (INS VNTR) has been variably associated with size at birth in non-African populations. Small size at birth is a major determinant of neonatal mortality, so the INS VNTR may influence survival. We tested the hypothesis, therefore, that genetic variation around the INS VNTR in a rural Gambian population, who experience seasonal variation in nutrition and subsequently birth weight, may be associated with foetal and early growth. Six polymorphisms flanking the INS VNTR were genotyped in over 2,500 people. Significant associations were detected between the maternally inherited SNP 27 (rs689) allele and birth length [effect size 17.5 (5.2-29.8) mm; P = 0.004; n = 361]. Significant associations were also found between the maternally inherited African-specific SNP 28 (rs5506) allele and post-natal weight gain [effect size 0.19 (0.05-0.32) z score points/year; P = 0.005; n = 728]. These results suggest that in the Gambian population studied there are associations between polymorphic variation in the genetically diverse INS gene and foetal and early growth characteristics, which contribute to overall polygenic associations with these traits.

  10. Molecular genetic contributions to socioeconomic status and intelligence.

    PubMed

    Marioni, Riccardo E; Davies, Gail; Hayward, Caroline; Liewald, Dave; Kerr, Shona M; Campbell, Archie; Luciano, Michelle; Smith, Blair H; Padmanabhan, Sandosh; Hocking, Lynne J; Hastie, Nicholas D; Wright, Alan F; Porteous, David J; Visscher, Peter M; Deary, Ian J

    2014-05-01

    Education, socioeconomic status, and intelligence are commonly used as predictors of health outcomes, social environment, and mortality. Education and socioeconomic status are typically viewed as environmental variables although both correlate with intelligence, which has a substantial genetic basis. Using data from 6815 unrelated subjects from the Generation Scotland study, we examined the genetic contributions to these variables and their genetic correlations. Subjects underwent genome-wide testing for common single nucleotide polymorphisms (SNPs). DNA-derived heritability estimates and genetic correlations were calculated using the 'Genome-wide Complex Trait Analyses' (GCTA) procedures. 21% of the variation in education, 18% of the variation in socioeconomic status, and 29% of the variation in general cognitive ability was explained by variation in common SNPs (SEs ~ 5%). The SNP-based genetic correlations of education and socioeconomic status with general intelligence were 0.95 (SE 0.13) and 0.26 (0.16), respectively. There are genetic contributions to intelligence and education with near-complete overlap between common additive SNP effects on these traits (genetic correlation ~ 1). Genetic influences on socioeconomic status are also associated with the genetic foundations of intelligence. The results are also compatible with substantial environmental contributions to socioeconomic status.

  11. Role of calcium in nitric oxide-induced cytotoxicity: EGTA protects mouse oligodendrocytes.

    PubMed

    Boullerne, A I; Nedelkoska, L; Benjamins, J A

    2001-01-15

    Active nitrogen species are overproduced in inflammatory brain lesions in multiple sclerosis (MS) and experimental allergic encephalomyelitis (EAE). NO has been shown to mediate the death of oligodendrocytes (OLs), a primary target of damage in MS. To develop strategies to protect OLs, we examined the mechanisms of cytotoxicity of two NO donors, S-nitroso-N-acetyl-penicillamine (SNAP) and sodium nitroprusside (SNP) on mature mouse OLs. Nitrosonium ion (NO+) rather than the NO radical (NO•) mediates damage with both SNAP and SNP, as shown by significant protection with hemoglobin (HbO2), but not with the NO• scavenger PTIO. SNAP and SNP differ in time course and mechanisms of killing OLs. With SNAP, OL death is delayed for at least 6 hr, but with SNP, OL death is continuous over 18 hr with no delay. Relative to NO release, SNP is more toxic than SNAP, due to synergism of NO with cyanide released by SNP. SNAP elicits a Ca2+ influx in over half of the OLs within minutes. Further, OL death due to NO release from SNAP is Ca2+-dependent, because the Ca2+ chelator EGTA protects OLs from killing by SNAP, and also from killing by the NONOates NOC-9 and NOC-18, which spontaneously release NO. SNP does not elicit a Ca2+ influx, and EGTA is not protective. In comparison to the N20.1 OL cell line (Boullerne et al., [1999] J. Neurochem. 72:1050-1060), mature OLs are (1) more sensitive to SNAP, (2) much more resistant to SNP, (3) sensitive to cyanide, but not iron, and (4) exhibit a Ca2+ influx and EGTA protection in response to NO generated by SNAP. Copyright 2001 Wiley-Liss, Inc.

  12. Genetic variation in vitamin B-12 content of bovine milk and its association with SNP along the bovine genome.

    PubMed

    Rutten, Marc J M; Bouwman, Aniek C; Sprong, R Corinne; van Arendonk, Johan A M; Visker, Marleen H P W

    2013-01-01

    Vitamin B-12 (also called cobalamin) is essential for human health and current intake levels of vitamin B-12 are considered to be too low. Natural enrichment of the vitamin B-12 content in milk, an important dietary source of vitamin B-12, may help to increase vitamin B-12 intake. Natural enrichment of the milk vitamin B-12 content could be achieved through genetic selection, provided there is genetic variation between cows with respect to the vitamin B-12 content in their milk. A substantial amount of genetic variation in vitamin B-12 content was detected among raw milk samples of 544 first-lactation Dutch Holstein Friesian cows. The presence of genetic variation between animals in vitamin B-12 content in milk indicates that the genotype of the cow affects the amount of vitamin B-12 that ends up in her milk and, consequently, that the average milk vitamin B-12 content of the cow population can be increased by genetic selection. A genome-wide association study revealed significant association between 68 SNP and vitamin B-12 content in raw milk of 487 first-lactation Dutch Holstein Friesian cows. This knowledge facilitates genetic selection for milk vitamin B-12 content. It also contributes to the understanding of the biological mechanism responsible for the observed genetic variation in vitamin B-12 content in milk. None of the 68 significantly associated SNP were in or near known candidate genes involved in transport of vitamin B-12 through the gastrointestinal tract, uptake by ileum epithelial cells, export from ileal cells, transport through the blood, uptake from the blood, intracellular processing, or reabsorption by the kidneys. Probably, associations relate to genes involved in alternative pathways of well-studied processes or to genes involved in less well-studied processes such as ruminal production of vitamin B-12 or secretion of vitamin B-12 by the mammary gland.

  13. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.

    Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.

  14. Developing single nucleotide polymorphism (SNP) markers from transcriptome sequences for identification of longan (Dimocarpus longan) germplasm

    PubMed Central

    Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng

    2015-01-01

    Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559

  15. Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes

    PubMed Central

    Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Ángel

    2009-01-01

    Background Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. Results To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), splitting them into single genotypes, grouping them into populations, and then merging them with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. Conclusion The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward and permits easy incorporation of new external data and the computation of supplementary statistical indices of interest. PMID:19344481
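
    The step of splitting records into single genotypes, grouping them by population, and estimating allele frequencies can be sketched as follows. This is not the authors' script: the record layout, the population labels and the function name are hypothetical, chosen only to make the idea concrete for one biallelic SNP.

```python
from collections import Counter, defaultdict

def allele_frequencies(records):
    """Estimate per-population allele frequencies for a single SNP.

    records: iterable of (population, genotype) pairs, where genotype is a
    two-character string such as "AG" (hypothetical layout for illustration).
    """
    counts = defaultdict(Counter)
    for population, genotype in records:
        # Each character of the genotype string is one observed allele.
        counts[population].update(genotype)
    return {
        population: {
            allele: n / sum(allele_counts.values())
            for allele, n in allele_counts.items()
        }
        for population, allele_counts in counts.items()
    }

# Hypothetical genotype records for one SNP in two made-up populations.
records = [
    ("POP1", "AA"), ("POP1", "AG"), ("POP1", "GG"), ("POP1", "AG"),
    ("POP2", "AA"), ("POP2", "AA"), ("POP2", "AG"),
]

freqs = allele_frequencies(records)
for population, by_allele in sorted(freqs.items()):
    print(population, {a: round(f, 3) for a, f in sorted(by_allele.items())})
```

    Keeping raw allele counts per population, rather than only the frequencies, would make it simpler to pool samples from different repositories afterwards, which is one of the uses the abstract describes.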

  16. Case-control study of eczema associated with IL13 genetic polymorphisms in Japanese children.

    PubMed

    Miyake, Yoshihiro; Kiyohara, Chikako; Koyanagi, Midori; Fujimoto, Takahiro; Shirasawa, Senji; Tanaka, Keiko; Sasaki, Satoshi; Hirota, Yoshio

    2011-01-01

    Several association studies have investigated the relationships between single nucleotide polymorphisms (SNPs) in the IL13 gene and eczema, with inconsistent results. We conducted a case-control study of the relationship between the polymorphisms of rs1800925 and rs20541 and the risk of eczema in Japanese children aged 3 years. Included were the 209 cases identified based on criteria of the International Study of Asthma and Allergies in Childhood (ISAAC). Controls were 451 children without eczema based on ISAAC questions who had not been diagnosed by a physician as having asthma or atopic eczema. The minor TT genotype of the rs1800925 SNP and the minor AA genotype of the rs20541 SNP were significantly related to an increased risk of eczema: adjusted odds ratio for the TT genotype was 2.78 (95% confidence interval 1.22-6.30) and that for the AA genotype was 2.38 (95% confidence interval 1.35-4.18). Haplotype analyses showed a protective association between the CG haplotype and eczema, whereas the TA haplotype was positively related to the risk of eczema. Perinatal smoking exposure did not interact with genotypes of the IL13 gene in the etiology of eczema. The significant association of the rs20541 SNP with eczema essentially disappeared after additional adjustment for the rs1800925 SNP, whereas a relationship with the rs1800925 SNP remained significant. A common genetic variation in the IL13 gene at the levels of both single SNPs and haplotypes was associated with eczema. However, the significant association with the rs20541 SNP might be ascribed to the rs1800925 SNP. Copyright © 2010 S. Karger AG, Basel.
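
    For readers unfamiliar with the statistic, an odds ratio and its 95% confidence interval can be computed from a 2x2 table as in the sketch below. The counts are invented toy values, not data from this study, and the result is a crude (unadjusted) odds ratio with a Wald interval; the abstract reports adjusted odds ratios, which would come from a regression model rather than from this formula.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Crude odds ratio with a Wald confidence interval.

    a: cases with the genotype,    b: cases without it,
    c: controls with the genotype, d: controls without it.
    z=1.96 gives an approximate 95% interval.
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Toy counts for illustration only (hypothetical, not from the abstract).
or_, lo, hi = odds_ratio_ci(a=20, b=80, c=10, d=90)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

    An interval that excludes 1, as both intervals reported in the abstract do, is what corresponds to a significant association at the 5% level.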

  17. Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes.

    PubMed

    Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Angel

    2009-03-19

    Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest.
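
    The simplest index mentioned in the abstract above, a per-population allele frequency estimate, can be sketched as follows. This is a minimal illustration and not the authors' scripts: the record layout (population, SNP identifier, two-letter genotype), the function name `allele_frequencies`, and the sample data are all assumptions made for the example.

```python
from collections import Counter, defaultdict


def allele_frequencies(records):
    """Estimate allele frequencies per (population, SNP).

    `records` is an iterable of (population, snp_id, genotype) tuples,
    where genotype is a two-letter string such as "AG" (hypothetical layout).
    """
    counts = defaultdict(Counter)
    for population, snp_id, genotype in records:
        # Each genotype contributes its two alleles to the tally.
        counts[(population, snp_id)].update(genotype)

    freqs = {}
    for key, counter in counts.items():
        total = sum(counter.values())
        freqs[key] = {allele: n / total for allele, n in counter.items()}
    return freqs


# Hypothetical genotypes for one SNP in two populations.
records = [
    ("POP_A", "snp1", "AA"),
    ("POP_A", "snp1", "AG"),
    ("POP_B", "snp1", "GG"),
    ("POP_B", "snp1", "AG"),
]
freqs = allele_frequencies(records)
```

    Grouping by (population, SNP) before counting mirrors the "single genotypes, then populations" order described in the abstract; real repository exports would also need handling for missing calls and strand differences, which this sketch omits.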

  18. Joint effect of unlinked genotypes: application to type 2 diabetes in the EPIC-Potsdam case-cohort study.

    PubMed

    Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner

    2015-07-01

    Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R², suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects, since it does not consider linkage, SNP-SNP interactions, or non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
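
    The abstract above notes that the GRS models each genotype by its risk allele dose. A minimal sketch of an unweighted score, assuming two-letter genotype strings and one designated risk allele per SNP, could look like this. The SNP names, risk alleles, and function names are hypothetical, and the study itself may have used a different (for example weighted) definition.

```python
def risk_allele_dose(genotype, risk_allele):
    """Number of copies (0, 1 or 2) of the risk allele in a genotype such as "AG"."""
    return genotype.count(risk_allele)


def genetic_risk_score(genotypes, risk_alleles):
    """Unweighted GRS: the sum of risk allele doses over all scored SNPs."""
    return sum(
        risk_allele_dose(genotypes[snp], allele)
        for snp, allele in risk_alleles.items()
    )


# Hypothetical risk alleles and one individual's genotypes.
risk_alleles = {"snp1": "A", "snp2": "T", "snp3": "G"}
person = {"snp1": "AA", "snp2": "CT", "snp3": "CC"}

score = genetic_risk_score(person, risk_alleles)  # doses 2 + 1 + 0
```

    Because every SNP contributes additively and independently, a score of this kind cannot represent SNP-SNP interactions or non-linear effects, which is the simplification the abstract points out.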

  19. Methods for identifying SNP interactions: a review on variations of Logic Regression, Random Forest and Bayesian logistic regression.

    PubMed

    Chen, Carla Chia-Ming; Schwender, Holger; Keith, Jonathan; Nunkesser, Robin; Mengersen, Kerrie; Macrossan, Paula

    2011-01-01

    Due to advancements in computational ability, enhanced technology and a reduction in the price of genotyping, more data are being generated for understanding genetic associations with diseases and disorders. However, with the availability of large data sets come the inherent challenges of new methods of statistical analysis and modeling. Considering that a complex phenotype may be the effect of a combination of multiple loci, various statistical methods have been developed for identifying genetic epistasis effects. Among these methods, logic regression (LR) is an intriguing approach incorporating tree-like structures. Various methods have built on the original LR to improve different aspects of the model. In this study, we review four variations of LR, namely Logic Feature Selection, Monte Carlo Logic Regression, Genetic Programming for Association Studies, and Modified Logic Regression-Gene Expression Programming, and investigate the performance of each method using simulated and real genotype data. We contrast these with another tree-like approach, namely Random Forests, and a Bayesian logistic regression with stochastic search variable selection.

  20. Variation at the NFATC2 Locus Increases the Risk of Thiazolidinedione-Induced Edema in the Diabetes REduction Assessment with ramipril and rosiglitazone Medication (DREAM) Study

    PubMed Central

    Bailey, Swneke D.; Xie, Changchun; Do, Ron; Montpetit, Alexandre; Diaz, Rafael; Mohan, Viswanathan; Keavney, Bernard; Yusuf, Salim; Gerstein, Hertzel C.; Engert, James C.; Anand, Sonia

    2010-01-01

    OBJECTIVE Thiazolidinediones are used to treat type 2 diabetes. Their use has been associated with peripheral edema and congestive heart failure—outcomes that may have a genetic etiology. RESEARCH DESIGN AND METHODS We genotyped 4,197 participants of the multiethnic DREAM (Diabetes REduction Assessment with ramipril and rosiglitazone Medication) trial with a 50k single nucleotide polymorphism (SNP) array, which captures ∼2000 cardiovascular, inflammatory, and metabolic genes. We tested 32,088 SNPs for an association with edema among Europeans who received rosiglitazone (n = 965). RESULTS One SNP, rs6123045, in NFATC2 was significantly associated with edema (odds ratio 1.89 [95% CI 1.47–2.42]; P = 5.32 × 10⁻⁷, corrected P = 0.017). Homozygous individuals had the highest edema rate (hazard ratio 2.89, P = 4.22 × 10⁻⁴) when compared with individuals homozygous for the protective allele, with heterozygous individuals having an intermediate risk. The interaction between the SNP and rosiglitazone for edema was significant (P = 7.68 × 10⁻³). Six SNPs in NFATC2 were significant in both Europeans and Latin Americans (P < 0.05). CONCLUSIONS Genetic variation at the NFATC2 locus contributes to edema among individuals who receive rosiglitazone. PMID:20628086

  1. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    PubMed Central

    Kijas, James W.; Townley, David; Dalrymple, Brian P.; Heaton, Michael P.; Maddox, Jillian F.; McGrath, Annette; Wilson, Peter; Ingersoll, Roxann G.; McCulloch, Russell; McWilliam, Sean; Tang, Dave; McEwan, John; Cockett, Noelle; Oddy, V. Hutton; Nicholas, Frank W.; Raadsma, Herman

    2009-01-01

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identifying the first genome-wide set of SNP for sheep, we report on levels of genetic variability both within and between a diverse sample of ovine populations. Then, using cluster analysis and the partitioning of genetic variation, we demonstrate sheep are characterised by weak phylogeographic structure, overlapping genetic similarity and generally low differentiation which is consistent with their short evolutionary history. The degree of population substructure was, however, sufficient to cluster individuals based on geographic origin and known breed history. Specifically, African and Asian populations clustered separately from breeds of European origin sampled from Australia, New Zealand, Europe and North America. Furthermore, we demonstrate the presence of stratification within some, but not all, ovine breeds. The results emphasize that careful documentation of genetic structure will be an essential prerequisite when mapping the genetic basis of complex traits. Furthermore, the identification of a subset of SNP able to assign individuals into broad groupings demonstrates even a small panel of markers may be suitable for applications such as traceability. PMID:19270757

  2. Alternative SNP detection platforms, HRM and biosensors, for varietal identification in Vitis vinifera L. using F3H and LDOX genes.

    PubMed

    Gomes, Sónia; Castro, Cláudia; Barrias, Sara; Pereira, Leonor; Jorge, Pedro; Fernandes, José R; Martins-Lopes, Paula

    2018-04-11

    The wine sector requires quick and reliable methods for Vitis vinifera L. varietal identification. The number of V. vinifera varieties is estimated at about 5,000 worldwide. Single Nucleotide Polymorphisms (SNPs) represent the most basic and abundant form of genetic sequence variation, being adequate for varietal discrimination. The aim of this work was to develop DNA-based assays suitable to detect SNP variation in V. vinifera, allowing varietal discrimination. Genotyping by sequencing allowed the detection of eleven SNPs on two genes of the anthocyanin pathway, the flavanone 3-hydroxylase (F3H, EC: 1.14.11.9), and the leucoanthocyanidin dioxygenase (LDOX, EC 1.14.11.19; synonym anthocyanidin synthase, ANS) in twenty V. vinifera varieties. Three High Resolution Melting (HRM) assays were designed based on the sequencing information, discriminating five of the 20 varieties: Alicante Bouschet, Donzelinho Tinto, Merlot, Moscatel Galego and Tinta Roriz. Sanger sequencing of the HRM assay products confirmed the HRM profiles. Three probes, with different lengths and sequences, were used as bio-recognition elements in an optical biosensor platform based on a long period grating (LPG) fiber optic sensor. The label free platform detected a difference of a single SNP using genomic DNA samples. The two different platforms were successfully applied for grapevine varietal identification.

  3. The -1535 promoter variant of the visfatin gene is associated with serum triglyceride and HDL-cholesterol levels in Japanese subjects.

    PubMed

    Tokunaga, Ayumi; Miura, Atsuko; Okauchi, Yukiyoshi; Segawa, Katsumori; Fukuhara, Atsunori; Okita, Kohei; Takahashi, Masahiko; Funahashi, Tohru; Miyagawa, Jun-Ichiro; Shimomura, Iichiro; Yamagata, Kazuya

    2008-03-01

    Visfatin is a novel adipocytokine that is expressed by the visceral fat cells. We investigated the role of genetic variation in the visfatin gene in the pathophysiology of type 2 diabetes and clinical variables in Japanese subjects. The 11 exons and the promoter region of the visfatin gene were screened for single nucleotide polymorphisms (SNPs) by PCR-direct sequencing. We found SNPs in the promoter region (SNP -1535T>C), exon 2 (SNP +131C>G, Thr44Arg), and exon 7 (SNP +903G>A). The allele and genotype frequencies of these SNPs showed no significant differences between 200-448 diabetic and 200-333 control subjects. However, the -1535T/T genotype was associated with lower serum triglyceride levels (T/T vs. T/C + C/C (p = 0.015) and T/T vs. C/C (p = 0.043)) and higher HDL-cholesterol levels (T/T vs. C/C, p = 0.0496) in the nondiabetic subjects. Reporter gene assay of 3T3-L1 adipocytes revealed that the promoter activity of -1535T and -1535C was similar, suggesting that the observed association may reflect linkage disequilibrium between -1535T>C and causative variations of the visfatin gene.

  4. Neuregulin 1 transcripts are differentially expressed in schizophrenia and regulated by 5′ SNPs associated with the disease

    PubMed Central

    Law, Amanda J.; Lipska, Barbara K.; Weickert, Cynthia Shannon; Hyde, Thomas M.; Straub, Richard E.; Hashimoto, Ryota; Harrison, Paul J.; Kleinman, Joel E.; Weinberger, Daniel R.

    2006-01-01

    Genetic variation in neuregulin 1 (NRG1) is associated with schizophrenia. The disease-associated SNPs are noncoding, and their functional implications remain unknown. We hypothesized that differential expression of the NRG1 gene explains its association to the disease. We examined four of the disease-associated SNPs that make up the original risk haplotype in the 5′ upstream region of the gene for their effects on mRNA abundance of NRG1 types I–IV in human postmortem hippocampus. Diagnostic comparisons revealed a 34% increase in type I mRNA in schizophrenia and an interaction of diagnosis and genotype (SNP8NRG221132) on this transcript. Of potentially greater interest, a single SNP within the risk haplotype (SNP8NRG243177) and a 22-kb block of this core haplotype are associated with mRNA expression for the novel type IV isoform in patients and controls. Bioinformatic promoter analyses indicate that both SNPs lead to a gain/loss of putative binding sites for three transcription factors, serum response factor, myelin transcription factor-1, and High Mobility Group Box Protein-1. These data implicate variation in isoform expression as a molecular mechanism for the genetic association of NRG1 with schizophrenia. PMID:16618933

  5. Genome-environment associations in sorghum landraces predict adaptive traits

    PubMed Central

    Lasky, Jesse R.; Upadhyaya, Hari D.; Ramu, Punna; Deshpande, Santosh; Hash, C. Tom; Bonnette, Jason; Juenger, Thomas E.; Hyma, Katie; Acharya, Charlotte; Mitchell, Sharon E.; Buckler, Edward S.; Brenton, Zachary; Kresovich, Stephen; Morris, Geoffrey P.

    2015-01-01

    Improving environmental adaptation in crops is essential for food security under global change, but phenotyping adaptive traits remains a major bottleneck. If associations between single-nucleotide polymorphism (SNP) alleles and environment of origin in crop landraces reflect adaptation, then these could be used to predict phenotypic variation for adaptive traits. We tested this proposition in the global food crop Sorghum bicolor, characterizing 1943 georeferenced landraces at 404,627 SNPs and quantifying allelic associations with bioclimatic and soil gradients. Environment explained a substantial portion of SNP variation, independent of geographical distance, and genic SNPs were enriched for environmental associations. Further, environment-associated SNPs predicted genotype-by-environment interactions under experimental drought stress and aluminum toxicity. Our results suggest that genomic signatures of environmental adaptation may be useful for crop improvement, enhancing germplasm identification and marker-assisted selection. Together, genome-environment associations and phenotypic analyses may reveal the basis of environmental adaptation. PMID:26601206

  6. SNP in starch biosynthesis genes associated with nutritional and functional properties of rice

    PubMed Central

    Kharabian-Masouleh, Ardashir; Waters, Daniel L. E.; Reinke, Russell F.; Ward, Rachelle; Henry, Robert J.

    2012-01-01

    Starch is a major component of human diets. The relative contribution of variation in the genes of starch biosynthesis to the nutritional and functional properties of the rice was evaluated in a rice breeding population. Sequencing 18 genes involved in starch synthesis in a population of 233 rice breeding lines discovered 66 functional SNPs in exonic regions. Five genes, AGPS2b, Isoamylase1, SPHOL, SSIIb and SSIVb showed no polymorphism. Association analysis found 31 of the SNP were associated with differences in pasting and cooking quality properties of the rice lines. Two genes appear to be the major loci controlling traits under human selection in rice, GBSSI (waxy gene) and SSIIa. GBSSI influenced amylose content and retrogradation. Other genes contributing to retrogradation were GPT1, SSI, BEI and SSIIIa. SSIIa explained much of the variation in cooking characteristics. Other genes had relatively small effects. PMID:22870386

  7. Association of apolipoprotein E gene polymorphisms with blood lipids and their interaction with dietary factors.

    PubMed

    Shatwan, Israa M; Winther, Kristian Hillert; Ellahi, Basma; Elwood, Peter; Ben-Shlomo, Yoav; Givens, Ian; Rayman, Margaret P; Lovegrove, Julie A; Vimaleswaran, Karani S

    2018-04-30

    Several candidate genes have been identified in relation to lipid metabolism, and among these, lipoprotein lipase (LPL) and apolipoprotein E (APOE) gene polymorphisms are major sources of genetically determined variation in lipid concentrations. This study investigated the association of two single nucleotide polymorphisms (SNPs) at LPL, seven tagging SNPs at the APOE gene, and a common APOE haplotype (two SNPs) with blood lipids, and examined the interaction of these SNPs with dietary factors. The population studied for this investigation included 660 individuals from the Prevention of Cancer by Intervention with Selenium (PRECISE) study who supplied baseline data. The findings of the PRECISE study were further replicated using 1238 individuals from the Caerphilly Prospective cohort (CaPS). Dietary intake was assessed using a validated food-frequency questionnaire (FFQ) in PRECISE and a validated semi-quantitative FFQ in the CaPS. Interaction analyses were performed by including the interaction term in the linear regression model adjusted for age, body mass index, sex and country. There was no association between dietary factors and blood lipids after Bonferroni correction and adjustment for confounding factors in either cohort. In the PRECISE study, after correction for multiple testing, there was a statistically significant association of the APOE haplotype (rs7412 and rs429358; E2, E3, and E4) and APOE tagSNP rs445925 with total cholesterol (P = 4 × 10⁻⁴ and P = 0.003, respectively). Carriers of the E2 allele had lower total cholesterol concentration (5.54 ± 0.97 mmol/L) than those with the E3 (5.98 ± 1.05 mmol/L) (P = 0.001) and E4 (6.09 ± 1.06 mmol/L) (P = 2 × 10⁻⁴) alleles. The association of APOE haplotype (E2, E3, and E4) and APOE SNP rs445925 with total cholesterol (P = 2 × 10⁻⁶ and P = 3 × 10⁻⁴, respectively) was further replicated in the CaPS. Additionally, significant association was found between APOE haplotype and APOE SNP rs445925 with low density lipoprotein cholesterol in CaPS (P = 4 × 10⁻⁴ and P = 0.001, respectively). After Bonferroni correction, neither cohort showed a statistically significant SNP-diet interaction on lipid outcomes. In summary, our findings from the two cohorts confirm that genetic variations at the APOE locus influence plasma total cholesterol concentrations; however, the gene-diet interactions on lipids require further investigation in larger cohorts.

  8. Identification of bovine NPC1 gene cSNPs and their effects on body size traits of Qinchuan cattle.

    PubMed

    Dang, Yonglong; Li, Mingxun; Yang, Mingjuan; Cao, Xiukai; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Lin, Qing; Chen, Hong

    2014-05-01

    The NPC1 gene is an important gene closely related to Niemann-Pick type C (NPC) disease. Mutations in the NPC1 gene tend to cause Niemann-Pick type C, a lysosomal storage disorder. Previous studies have shown that NPC1 protein plays an important role in subcellular lipid transport, homeostasis, platelet function and formation, which are basic metabolic activities in the process of development. In this study, to explore the association between the NPC1 gene variation and body size traits in Qinchuan cattle, we detected four novel coding single nucleotide polymorphisms (cSNPs) in the bovine NPC1 gene, including one missense mutation (SNP1) and three synonymous mutations (SNP2, SNP3 and SNP4). Population genetic analyses of 518 individuals and association analyses between cSNPs and bovine body size traits were conducted in this research. A missense mutation at SNP1 locus was found to be significantly related to the heart girth, hip width and body weight (P<0.01 or P<0.05, 3.5-year-old). Two synonymous mutations at SNP2 and SNP3 loci also showed significant effects on hip width (P<0.05, 3.5-year-old). One synonymous mutation at SNP4 locus showed significant effect on body weight (P<0.05, 2.0-year-old). Combined haplotypes H2H6 and H6H6 showed significant effects on body size traits such as heart girth, hip width, and body weight (3.5-year-old, P<0.01 or P<0.05). This study provides evidence that the NPC1 gene might be involved in the regulation of bovine growth and body development, and may be considered as a candidate gene for marker assisted selection (MAS) in beef cattle breeding industry. Copyright © 2014. Published by Elsevier B.V.

  9. GEE-based SNP set association test for continuous and discrete traits in family-based association studies.

    PubMed

    Wang, Xuefeng; Lee, Seunggeun; Zhu, Xiaofeng; Redline, Susan; Lin, Xihong

    2013-12-01

    Family-based genetic association studies of related individuals provide opportunities to detect genetic variants that complement studies of unrelated individuals. Most statistical methods for family association studies for common variants are single marker based, which test one SNP at a time. In this paper, we consider testing the effect of an SNP set, e.g., SNPs in a gene, in family studies, for both continuous and discrete traits. Specifically, we propose a generalized estimating equations (GEEs) based kernel association test, a variance component based testing method, to test for the association between a phenotype and multiple variants in an SNP set jointly using family samples. The proposed approach allows for both continuous and discrete traits, where the correlation among family members is taken into account through the use of an empirical covariance estimator. We derive the theoretical distribution of the proposed statistic under the null and develop analytical methods to calculate the P-values. We also propose an efficient resampling method for correcting for small sample size bias in family studies. The proposed method allows for easily incorporating covariates and SNP-SNP interactions. Simulation studies show that the proposed method properly controls for type I error rates under both random and ascertained sampling schemes in family studies. We demonstrate through simulation studies that our approach has superior performance for association mapping compared to the single marker based minimum P-value GEE test for an SNP-set effect over a range of scenarios. We illustrate the application of the proposed method using data from the Cleveland Family GWAS Study. © 2013 WILEY PERIODICALS, INC.

  10. Evaluation of Genetic Susceptibility to Childhood Allergy and ...

    EPA Pesticide Factsheets

    Background: Asthma and allergy represent complex phenotypes, which disproportionately burden ethnic minorities in the United States. Strong evidence for genomic factors predisposing subjects to asthma/allergy is available. However, methods to utilize this information to identify high risk groups are variable and replication of genetic associations in African Americans is warranted. Methods: We evaluated 41 single nucleotide polymorphisms (SNP) and a deletion corresponding to 11 genes demonstrating association with asthma in the literature, for association with asthma, atopy, testing positive for food allergens, eosinophilia, and total serum IgE among 141 African American children living in Detroit, Michigan. Independent SNP and haplotype associations were investigated for association with each trait, and subsequently assessed in concert using a genetic risk score (GRS). Results: Statistically significant associations with asthma were observed for SNPs in GSTM1, MS4A2, and GSTP1 genes, after correction for multiple testing. Chromosome 11 haplotype CTACGAGGCC (corresponding to MS4A2 rs574700, rs1441586, rs556917, rs502581, rs502419 and GSTP1 rs6591256, rs17593068, rs1695, rs1871042, rs947895) was associated with a nearly five-fold increase in the odds of asthma (Odds Ratio (OR) = 4.8, p = 0.007). The GRS was significantly associated with a higher odds of asthma (OR = 1.61, 95% Confidence Interval = 1.21, 2.13; p = 0.001). Conclusions: Variation in genes a

  11. A comprehensive screen for SNP associations on chromosome region 5q31-33 in Swedish/Norwegian celiac disease families.

    PubMed

    Amundsen, Silja Svanstrøm; Adamovic, Svetlana; Hellqvist, Asa; Nilsson, Staffan; Gudjónsdóttir, Audur H; Ascher, Henry; Ek, Johan; Larsson, Kristina; Wahlström, Jan; Lie, Benedicte A; Sollid, Ludvig M; Naluai, Asa Torinsson

    2007-09-01

    Celiac disease (CD) is a gluten-induced enteropathy, which results from the interplay between environmental and genetic factors. There is a strong human leukocyte antigen (HLA) association with the disease, and HLA-DQ alleles represent a major genetic risk factor. In addition to HLA-DQ, non-HLA genes appear to be crucial for CD development. Chromosomal region 5q31-33 has demonstrated linkage with CD in several genome-wide studies, including in our Swedish/Norwegian cohort. In a European meta-analysis 5q31-33 was the only region that reached a genome-wide level of significance except for the HLA region. To identify the genetic variant(s) responsible for this linkage signal, we performed a comprehensive single nucleotide polymorphism (SNP) association screen in 97 Swedish/Norwegian multiplex families who demonstrate linkage to the region. We selected tag SNPs from a 16 Mb region representing the 95% confidence interval of the linkage peak. A total of 1,404 SNPs were used for the association analysis. We identified several regions with SNPs demonstrating moderate single- or multipoint associations. However, the isolated association signals appeared insufficient to account for the linkage signal seen in our cohort. Collective effects of multiple risk genes within the region, incomplete genetic coverage or effects related to copy number variation are possible explanations for our findings.

  12. A Common Variant at the 14q32 Endometrial Cancer Risk Locus Activates AKT1 through YY1 Binding.

    PubMed

    Painter, Jodie N; Kaufmann, Susanne; O'Mara, Tracy A; Hillman, Kristine M; Sivakumaran, Haran; Darabi, Hatef; Cheng, Timothy H T; Pearson, John; Kazakoff, Stephen; Waddell, Nicola; Hoivik, Erling A; Goode, Ellen L; Scott, Rodney J; Tomlinson, Ian; Dunning, Alison M; Easton, Douglas F; French, Juliet D; Salvesen, Helga B; Pollock, Pamela M; Thompson, Deborah J; Spurdle, Amanda B; Edwards, Stacey L

    2016-06-02

    A recent meta-analysis of multiple genome-wide association and follow-up endometrial cancer case-control datasets identified a novel genetic risk locus for this disease at chromosome 14q32.33. To prioritize the functional SNP(s) and target gene(s) at this locus, we employed an in silico fine-mapping approach using genotyped and imputed SNP data for 6,608 endometrial cancer cases and 37,925 controls of European ancestry. Association and functional analyses provide evidence that the best candidate causal SNP is rs2494737. Multiple experimental analyses show that SNP rs2494737 maps to a silencer element located within AKT1, a member of the PI3K/AKT/MTOR intracellular signaling pathway activated in endometrial tumors. The rs2494737 risk A allele creates a YY1 transcription factor-binding site and abrogates the silencer activity in luciferase assays, an effect mimicked by transfection of YY1 siRNA. Our findings suggest YY1 is a positive regulator of AKT1, mediating the stimulatory effects of rs2494737 increasing endometrial cancer risk. Identification of an endometrial cancer risk allele within a member of the PI3K/AKT signaling pathway, more commonly activated in tumors by somatic alterations, raises the possibility that well tolerated inhibitors targeting this pathway could be candidates for evaluation as chemopreventive agents in individuals at high risk of developing endometrial cancer. Copyright © 2016 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  13. Molecular differentiation of Russian wild ginseng using mitochondrial nad7 intron 3 region.

    PubMed

    Li, Guisheng; Cui, Yan; Wang, Hongtao; Kwon, Woo-Saeng; Yang, Deok-Chun

    2017-07-01

    Cultivated ginseng is often introduced as a substitute and adulterant of Russian wild ginseng due to its lower cost or misidentification caused by similarity in appearance with wild ginseng. The aim of this study is to develop a simple and reliable method to differentiate Russian wild ginseng from cultivated ginseng. The mitochondrial NADH dehydrogenase subunit 7 (nad7) intron 3 regions of Russian wild ginseng and Chinese cultivated ginseng were analyzed. Based on the multiple sequence alignment result, a specific primer for Russian wild ginseng was designed by introducing additional mismatch and allele-specific polymerase chain reaction (PCR) was performed for identification of wild ginseng. Real-time allele-specific PCR with endpoint analysis was used for validation of the developed Russian wild ginseng single nucleotide polymorphism (SNP) marker. An SNP site specific to Russian wild ginseng was exploited by multiple alignments of mitochondrial nad7 intron 3 regions of different ginseng samples. With the SNP-based specific primer, Russian wild ginseng was successfully discriminated from Chinese and Korean cultivated ginseng samples by allele-specific PCR. The reliability and specificity of the SNP marker was validated by checking 20 individuals of Russian wild ginseng samples with real-time allele-specific PCR assay. An effective DNA method for molecular discrimination of Russian wild ginseng from Chinese and Korean cultivated ginseng was developed. The established real-time allele-specific PCR was simple and reliable, and the present method should be a crucial complement of chemical analysis for authentication of Russian wild ginseng.

  14. Replication of an association of variation in the FOXO3A gene with human longevity using both case–control and longitudinal data

    PubMed Central

    Soerensen, Mette; Dato, Serena; Christensen, Kaare; McGue, Matt; Stevnsner, Tinna; Bohr, Vilhelm A.; Christiansen, Lene

    2010-01-01

    Summary Genetic variation in FOXO3A has previously been associated with human longevity. Studies published so far have been case–control studies and hence vulnerable to bias introduced by cohort effects. In this study we extended the previous findings in the cohorts of oldest old Danes (the Danish 1905 cohort, N = 1089) and middle-aged Danes (N = 736), applying a longitudinal study design as well as the case–control study design. Fifteen SNPs were chosen in order to cover the known common variation in FOXO3A. Comparing SNP frequencies in the oldest old with middle-aged individuals, we found association (after correction for multiple testing) of eight SNPs: four (rs13217795, rs2764264, rs479744, and rs9400239) previously reported to be associated with longevity and four novel SNPs (rs12206094, rs13220810, rs7762395, and rs9486902) (corrected P-values 0.001–0.044). Moreover, we found association of the haplotypes TAC and CAC of rs9486902, rs10499051, and rs12206094 (corrected P-values: 0.01–0.03) with longevity. Finally, we here present data applying a longitudinal study design; when using follow-up survival data on the oldest old in a longitudinal analysis, we found no SNPs to remain significant after the correction for multiple testing (Bonferroni correction). Hence, our results support and extend the proposed role of FOXO3A as a candidate longevity gene for survival from younger ages to old age, yet not during old age. PMID:20849522
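
    The Bonferroni correction named in the abstract above multiplies each raw P-value by the number of tests and caps the result at 1. A minimal sketch follows; the raw P-values are purely illustrative and are not taken from the study, and the function name `bonferroni` is an assumption of this example.

```python
def bonferroni(p_values):
    """Return Bonferroni-adjusted P-values: p * m, capped at 1.0."""
    m = len(p_values)  # number of tests performed
    return [min(1.0, p * m) for p in p_values]


# Illustrative raw P-values for three hypothetical SNP tests.
raw = [0.001, 0.02, 0.5]
adjusted = bonferroni(raw)

# A SNP stays significant only if its adjusted P-value is below the threshold.
alpha = 0.05
significant = [p < alpha for p in adjusted]
```

    With 15 SNPs tested, as in the abstract, the same function would multiply each raw P-value by 15, which explains why associations of modest strength may not survive the correction.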

  15. Genomewide association study of liver abscess in beef cattle.

    PubMed

    Keele, J W; Kuehn, L A; McDaneld, T G; Tait, R G; Jones, S A; Keel, B N; Snelling, W M

    2016-02-01

    Fourteen percent of U.S. cattle slaughtered in 2011 had liver abscesses, resulting in reduced carcass weight, quality, and value. Liver abscesses can result from a common bacterial cause, , which inhabits rumen lesions caused by acidosis and subsequently escapes into the blood stream, is filtered by the liver, and causes abscesses in the liver. Our aim was to identify SNP associated with liver abscesses in beef cattle. We used lung samples as a DNA source because they have low economic value, they have abundant DNA, and we had unrestricted access to sample them. We collected 2,304 lung samples from a beef processing plant: 1,152 from animals with liver abscess and 1,152 from animals without liver abscess. Lung tissue samples from pairs of animals, one with abscesses and one without, were collected from near one another on the viscera table to ensure that pairs of phenotypically extreme animals came from the same lot. Within each phenotype (abscess or no abscess), cattle were pooled by slaughter sequence into 12 pools of 96 cattle for each phenotype for a total of 24 pools. The pools were constructed by equal volume of frozen lung tissue from each animal. The DNA needed to allelotype each pool was then extracted from pooled lung tissue and the BovineHD Bead Array (777,962 SNP) was run on all 24 pools. Total intensity (TI), an indicator of copy number variants, was the sum of intensities from red and green dyes. Pooling allele frequency (PAF) was red dye intensity divided by TI. Total intensity and PAF were weighted by the inverse of their respective genomic covariance matrices computed over all SNP across the genome. A false discovery rate ≤ 5% was achieved for 15 SNP for PAF and 20 SNP for TI. Genes within 50 kbp from significant SNP were in diverse pathways including maintenance of pH homeostasis in the gastrointestinal tract, maintenance of immune defenses in the liver, migration of leukocytes from the blood into infected tissues, transport of glutamine into the kidney in response to acidosis to facilitate production of bicarbonate to increase pH, aggregation of platelets at sites of liver injury to facilitate liver repair, and facilitation of axon guidance. Evidence from the 35 detected SNP associations combined with evidence of polygenic variation indicates that there is adequate genetic variation in incidence rate of liver abscesses, which could be exploited to select sires for reduced susceptibility to subacute acidosis and associated liver abscess.
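
    The two per-SNP quantities defined in this abstract, total intensity (TI) and pooling allele frequency (PAF), can be written out as a short sketch. The function names and the intensity values are hypothetical; the weighting by inverse genomic covariance matrices described in the abstract is not reproduced here.

```python
def total_intensity(red, green):
    """TI: sum of the red and green dye intensities for one SNP in one pool."""
    return red + green


def pooling_allele_frequency(red, green):
    """PAF: red dye intensity divided by total intensity."""
    ti = total_intensity(red, green)
    if ti == 0:
        raise ValueError("total intensity is zero; PAF is undefined")
    return red / ti


# Hypothetical dye intensities for a single SNP in a single pool.
print(total_intensity(3.0, 1.0))           # 4.0
print(pooling_allele_frequency(3.0, 1.0))  # 0.75
```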

  16. Allelic-based gene-gene interaction associated with quantitative traits.

    PubMed

    Jung, Jeesun; Sun, Bin; Kwon, Deukwoo; Koller, Daniel L; Foroud, Tatiana M

    2009-05-01

    Recent studies have shown that quantitative phenotypes may be influenced not only by multiple single nucleotide polymorphisms (SNPs) within a gene but also by the interaction between SNPs at unlinked genes. We propose a new statistical approach that can detect gene-gene interactions at the allelic level which contribute to the phenotypic variation in a quantitative trait. By testing for the association of allelic combinations at multiple unlinked loci with a quantitative trait, we can detect the SNP allelic interaction whether or not it can be detected as a main effect. Our proposed method assigns a score to unrelated subjects according to their allelic combination inferred from observed genotypes at two or more unlinked SNPs, and then tests for the association of the allelic score with a quantitative trait. To investigate the statistical properties of the proposed method, we performed a simulation study to estimate type I error rates and power and demonstrated that this allelic approach achieves greater power than the more commonly used genotypic approach to test for gene-gene interaction. As an example, the proposed method was applied to data obtained as part of a candidate gene study of sodium retention by the kidney. We found that this method detects an interaction between the calcium-sensing receptor gene (CaSR), the chloride channel gene (CLCNKB) and the Na, K, 2Cl cotransporter gene (CLC12A1) that contributes to variation in diastolic blood pressure.

  17. Polymorphisms in genes encoding leptin, ghrelin and their receptors in German multiple sclerosis patients.

    PubMed

    Rey, Linda K; Wieczorek, Stefan; Akkad, Denis A; Linker, Ralf A; Chan, Andrew; Hoffjan, Sabine

    2011-01-01

    Multiple sclerosis (MS) is a neuro-inflammatory, autoimmune disease influenced by environmental and polygenic components. There is growing evidence that the peptide hormone leptin, known to regulate energy homeostasis, as well as its antagonist ghrelin play an important role in inflammatory processes in autoimmune diseases, including MS. Recently, single nucleotide polymorphisms (SNPs) in the genes encoding leptin, ghrelin and their receptors were evaluated, amongst others, in Wegener's granulomatosis and Churg-Strauss syndrome. The Lys656Asn SNP in the LEPR gene showed a significant but contrasting association with these vasculitides. We therefore aimed at investigating these polymorphisms in a German MS case-control cohort. Twelve SNPs in the LEP, LEPR, GHRL and GHSR genes were genotyped in 776 MS patients and 878 control subjects. We found an association of a haplotype in the GHSR gene with MS that could not be replicated in a second cohort. Otherwise, no significant differences in allele or genotype frequencies were observed between patients and controls in this particular cohort. Thus, the present results do not support the hypothesis that genetic variation in the leptin/ghrelin system contributes substantially to the pathogenesis of MS. However, a modest effect of GHSR variation cannot be ruled out and needs to be further evaluated in future studies. Copyright © 2011 Elsevier Ltd. All rights reserved.

  18. Association study of Toll-like receptor 5 (TLR5) and Toll-like receptor 9 (TLR9) polymorphisms in systemic lupus erythematosus.

    PubMed

    Demirci, F Yesim K; Manzi, Susan; Ramsey-Goldman, Rosalind; Kenney, Margaret; Shaw, Penny S; Dunlop-Thomas, Charmayne M; Kao, Amy H; Rhew, Elisa Y; Bontempo, Franklin; Kammerer, Candace; Kamboh, M Ilyas

    2007-08-01

    Toll-like receptors (TLR) play an important role in both adaptive and innate immunity. Variations in TLR genes have been shown to be associated with various infectious and inflammatory diseases. We investigated the association of TLR5 (Arg392Stop, rs5744168) and TLR9 (-1237T→C, rs5743836) single nucleotide polymorphisms (SNP) with systemic lupus erythematosus (SLE) in Caucasian American subjects. We performed a case-control association study and genotyped 409 Caucasian women with SLE and 509 Caucasian healthy female controls using TaqMan allelic discrimination (rs5744168) or polymerase chain reaction-restriction fragment length polymorphism analysis (rs5743836). Neither of the 2 TLR SNP showed a statistically significant association with SLE risk in our cohort. Our results do not indicate a major influence of these putative functional TLR SNP on the susceptibility to (or protection from) SLE.

  19. Two‐phase designs for joint quantitative‐trait‐dependent and genotype‐dependent sampling in post‐GWAS regional sequencing

    PubMed Central

    Espin‐Garcia, Osvaldo; Craiu, Radu V.

    2017-01-01

    ABSTRACT We evaluate two‐phase designs to follow‐up findings from genome‐wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation‐maximization‐based inference under a semiparametric maximum likelihood formulation tailored for post‐GWAS inference. A GWAS‐SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT‐SNP‐dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme‐QT strata yields significant power improvements compared to marginal QT‐ or SNP‐based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. PMID:29239496

  20. Short communication: relationship of call rate and accuracy of single nucleotide polymorphism genotypes in dairy cattle.

    PubMed

    Cooper, T A; Wiggans, G R; VanRaden, P M

    2013-05-01

    Call rates on both a single nucleotide polymorphism (SNP) basis and an animal basis are used as measures of data quality and as screening tools for genomic studies and evaluations of dairy cattle. To investigate the relationship of SNP call rate and genotype accuracy for individual SNP, the correlation between percentages of missing genotypes and parent-progeny conflicts for each SNP was calculated for 103,313 Holsteins. Correlations ranged from 0.14 to 0.38 for the BovineSNP50 and BovineLD (Illumina Inc., San Diego, CA) and GeneSeek Genomic Profiler (Neogen Corp., Lincoln, NE) chips, with lower correlations for newer chips. For US genomic evaluations, genotypes are excluded for animals with a call rate of <90% across autosomal SNP or <80% across X-specific SNP. Mean call rate for 220,175 Holstein, Jersey, and Brown Swiss genotypes was 99.6%. Animal genotypes with a call rate of ≤99% were examined from the US Department of Agriculture genotype database to determine how genotype call rate is related to accuracy of calls on an animal basis. Animal call rate was determined from SNP used in genomic evaluation and is the number of called autosomal and X-specific SNP genotypes divided by the number of SNP from that type of chip. To investigate the relationship of animal call rate and parentage validation, conflicts between a genotyped animal and its sire or dam were determined through a duo test (opposite homozygous SNP genotypes between sire and progeny; 1,374 animal genotypes) and a trio test (also including conflicts with dam and heterozygous SNP genotype for the animal when both parents are the same homozygote; 482 animal genotypes). When animal call rate was ≤ 80%, parentage validation was no longer reliable with the duo test. With the trio test, parentage validation was no longer reliable when animal call rate was ≤ 90%. 
To investigate how animal call rate was related to genotyping accuracy for animals with multiple genotypes, concordance between genotypes for 1,216 animals that had a genotype with a call rate of ≤ 99% (low call rate) as well as a genotype with a call rate of >99% (high call rate) was calculated by dividing the number of identical SNP genotype calls by the number of SNP that were called for both genotypes. Mean concordance between low- and high-call genotypes was >99% for a low call rate of >90% but decreased to 97% for a call rate of 86 to 90% and to 58% for a call rate of <60%. Edits on call rate reduce the use of incorrect SNP genotypes to calculate genomic evaluations. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
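
    The two ratios defined in this abstract, animal call rate and concordance between two genotypes of the same animal, can be sketched as follows. The encoding is an assumption made for illustration: genotypes are lists of allele counts (0, 1, 2) with None marking an uncalled SNP, and the example data are invented.

```python
def call_rate(genotypes):
    """Called SNP genotypes divided by the number of SNP on the chip."""
    called = sum(g is not None for g in genotypes)
    return called / len(genotypes)


def concordance(first, second):
    """Identical calls divided by the number of SNP called in both genotypes."""
    both_called = [(a, b) for a, b in zip(first, second)
                   if a is not None and b is not None]
    if not both_called:
        raise ValueError("no SNP was called in both genotypes")
    identical = sum(a == b for a, b in both_called)
    return identical / len(both_called)


# Hypothetical low- and high-call-rate genotypes for one animal.
low = [0, 1, None, 2, 1]
high = [0, 1, 2, 2, 0]
print(call_rate(low))          # 0.8
print(concordance(low, high))  # 0.75
```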

  1. Genetic variation in the base excision repair pathway and bladder cancer risk.

    PubMed

    Figueroa, Jonine D; Malats, Núria; Real, Francisco X; Silverman, Debra; Kogevinas, Manolis; Chanock, Stephen; Welch, Robert; Dosemeci, Mustafa; Tardón, Adonina; Serra, Consol; Carrato, Alfredo; García-Closas, Reina; Castaño-Vinyals, Gemma; Rothman, Nathaniel; García-Closas, Montserrat

    2007-04-01

    Genetic polymorphisms in DNA repair genes may impact individual variation in DNA repair capacity and alter cancer risk. In order to examine the association of common genetic variation in the base-excision repair (BER) pathway with bladder cancer risk, we analyzed 43 single nucleotide polymorphisms (SNPs) in 12 BER genes (OGG1, MUTYH, APEX1, PARP1, PARP3, PARP4, XRCC1, POLB, POLD1, PCNA, LIG1, and LIG3). Using genotype data from 1,150 cases of urinary bladder transitional cell carcinomas and 1,149 controls from the Spanish Bladder Cancer Study we estimated odds ratios (ORs) and 95% confidence intervals (CIs) adjusting for age, gender, region and smoking status. SNPs in three genes showed significant associations with bladder cancer risk: the 8-oxoG DNA glycosylase gene (OGG1), the Poly (ADP-ribose) polymerase family member 1 (PARP1) and the major gap filling polymerase-beta (POLB). Subjects who were heterozygous or homozygous variant for an OGG1 SNP in the promoter region (rs125701) had significantly decreased bladder cancer risk compared to common homozygous: OR (95%CI) 0.78 (0.63-0.96). Heterozygous or homozygous individuals for the functional SNP PARP1 rs1136410 (V762A) or for the intronic SNP POLB rs3136717 were at increased risk compared to those homozygous for the common alleles: 1.24 (1.02-1.51) and 1.30 (1.04-1.62), respectively. In summary, data from this large case-control study suggested bladder cancer risk associations with selected BER SNPs, which need to be confirmed in other study populations.
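
    For readers unfamiliar with the OR (95% CI) notation used in this abstract, the following is a minimal sketch of an unadjusted odds ratio with a Wald confidence interval from a 2×2 table. It is not the study's method: the abstract reports estimates adjusted for age, gender, region and smoking status, which would require a regression model. The function name and the counts are hypothetical.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval (z=1.96 gives 95%)."""
    a, b, c, d = (case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers)
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not taken from the Spanish Bladder Cancer Study.
estimate, lower, upper = odds_ratio_ci(20, 80, 10, 90)
print(f"OR {estimate:.2f} (95% CI {lower:.2f}-{upper:.2f})")
```

    An interval that excludes 1, as in the three associations reported above, corresponds to a nominally significant result at the 5% level.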

  2. Is a gene important for bone resorption a candidate for obesity? An association and linkage study on the RANK (receptor activator of nuclear factor-kappaB) gene in a large Caucasian sample.

    PubMed

    Zhao, Lan-Juan; Guo, Yan-Fang; Xiong, Dong-Hai; Xiao, Peng; Recker, Robert R; Deng, Hong-Wen

    2006-11-01

    In light of findings that osteoporosis and obesity may share some common genetic determination and previous reports that RANK (receptor activator of nuclear factor-kappaB) is expressed in skeletal muscles which are important for energy metabolism, we hypothesize that RANK, a gene essential for osteoclastogenesis, is also important for obesity. In order to test the hypothesis with solid data we first performed a linkage analysis around the RANK gene in 4,102 Caucasian subjects from 434 pedigrees, then we genotyped 19 SNPs in or around the RANK gene. A family-based association test (FBAT) was performed with both a quantitative measure of obesity [fat mass, lean mass, body mass index (BMI), and percentage fat mass (PFM)] and a dichotomously defined obesity phenotype, OB (OB if BMI ≥ 30 kg/m²). In the linkage analysis, an empirical P = 0.004 was achieved at the location of the RANK gene for BMI. Family-based association analysis revealed significant associations of eight SNPs with at least one obesity-related phenotype (P < 0.05). Evidence of association was obtained at SNP10 (P = 0.002) and SNP16 (P = 0.001) with OB; SNP1 with fat mass (P = 0.003); SNP1 (P = 0.003) and SNP7 (P = 0.003) with lean mass; SNP1 (P = 0.002) and SNP7 (P = 0.002) with BMI; SNP1 (P = 0.003), SNP4 (P = 0.007), and SNP7 (P = 0.002) with PFM. In order to deal with the complex multiple testing issues, we performed FBAT multi-marker test (FBAT-MM) to evaluate the association between all the 18 SNPs and each obesity phenotype. The P value is 0.126 for OB, 0.033 for fat mass, 0.021 for lean mass, 0.016 for BMI, and 0.006 for PFM. The haplotype data analyses provide further association evidence. In conclusion, for the first time, our results suggest that RANK is a novel candidate for determination of obesity.

  3. High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

    PubMed Central

    2011-01-01

    Background High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity ≥2%. 
The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus. PMID:21492434

  4. GenomeGems: evaluation of genetic variability from deep sequencing data

    PubMed Central

    2012-01-01

    Background Detection of disease-causing mutations using Deep Sequencing technologies poses great challenges. In particular, organizing the great amount of sequences generated so that mutations, which might possibly be biologically relevant, are easily identified is a difficult task. Yet, for this assignment only a limited number of automatic, accessible tools exist. Findings We developed GenomeGems to fill this gap by enabling the user to view and compare Single Nucleotide Polymorphisms (SNPs) from multiple datasets and to load the data onto the UCSC Genome Browser for an expanded and familiar visualization. As such, via automatic, clear and accessible presentation of processed Deep Sequencing data, our tool aims to facilitate ranking of genomic SNP calling. GenomeGems runs on a local Personal Computer (PC) and is freely available at http://www.tau.ac.il/~nshomron/GenomeGems. Conclusions GenomeGems enables researchers to identify potential disease-causing SNPs in an efficient manner. This enables rapid turnover of information and leads to further experimental SNP validation. The tool allows the user to compare and visualize SNPs from multiple experiments and to easily load SNP data onto the UCSC Genome browser for further detailed information. PMID:22748151

  5. BayesPI-BAR: a new biophysical model for characterization of regulatory sequence variations

    PubMed Central

    Wang, Junbai; Batmanov, Kirill

    2015-01-01

    Sequence variations in regulatory DNA regions are known to cause functionally important consequences for gene expression. DNA sequence variations may have an essential role in determining phenotypes and may be linked to disease; however, their identification through analysis of massive genome-wide sequencing data is a great challenge. In this work, a new computational pipeline, a Bayesian method for protein–DNA interaction with binding affinity ranking (BayesPI-BAR), is proposed for quantifying the effect of sequence variations on protein binding. BayesPI-BAR uses biophysical modeling of protein–DNA interactions to predict single nucleotide polymorphisms (SNPs) that cause significant changes in the binding affinity of a regulatory region for transcription factors (TFs). The method includes two new parameters (TF chemical potentials or protein concentrations and direct TF binding targets) that are neglected by previous methods. The new method is verified on 67 known human regulatory SNPs, of which 47 (70%) have predicted true TFs ranked in the top 10. Importantly, the performance of BayesPI-BAR, which uses principal component analysis to integrate multiple predictions from various TF chemical potentials, is found to be better than that of existing programs, such as sTRAP and is-rSNP, when evaluated on the same SNPs. BayesPI-BAR is a publicly available tool and is able to carry out parallelized computation, which helps to investigate a large number of TFs or SNPs and to detect disease-associated regulatory sequence variations in the sea of genome-wide noncoding regions. PMID:26202972

  6. SNP Discovery and Linkage Map Construction in Cultivated Tomato

    PubMed Central

    Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi

    2010-01-01

    Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984

  7. Genome-wide interaction study identifies RCBTB1 as a modifier for smoking effect on carotid intima-media thickness.

    PubMed

    Wang, Liyong; Rundek, Tatjana; Beecham, Ashley; Hudson, Barry; Blanton, Susan H; Zhao, Hongyu; Sacco, Ralph L; Dong, Chuanhui

    2014-01-01

    Carotid intima-media thickness (cIMT), a marker for atherosclerosis, is affected by smoking and has substantial interindividual variation. We sought to identify the genetic moderators influencing the effect of smoking on cIMT. With a multistage design using 722 379 single nucleotide polymorphisms (SNP), a genome-wide interaction study was performed in a discovery sample of 669 Hispanics, followed by replication in 589 subjects (264 Hispanics, 172 non-Hispanic blacks, 153 non-Hispanic whites). Assuming an additive genetic model, regression analysis was performed to test for smoking-SNP interaction on cIMT while controlling for age, sex, and the top 3 principal components of ancestry. The strongest interaction in Hispanics was found with a synonymous splicing SNP (rs3751383) in exon 9 of RCBTB1 (P=2.5e-6 in discovery sample; P=0.01 in the Hispanic replication sample; P<8.8e-9 in the combined Hispanic sample). Stratification analysis in the combined Hispanic sample showed that smoking had no effect on cIMT among rs3751383 G homozygotes (P=0.15), a moderate effect among rs3751383 heterozygotes (P=0.01), and a strong effect among rs3751383 A homozygotes (P=2.1e-7). A consistent trend was observed in the non-Hispanic white and black data sets, leading to an interaction effect of P<2.9e-9 in the meta-analysis of all 1258 subjects. Our study represents the first genome-wide smoking-SNP interaction study of cIMT and identifies RCBTB1 as a modifier of the smoking effect on cIMT. Testing for gene-environment interactions can help uncover genetic factors that contribute to the interindividual variation in response to the same environmental exposure.
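
    The smoking-SNP interaction test under an additive genetic model can be illustrated with a minimal least-squares sketch. Everything below is an assumption made for illustration: the model is trait = b0 + b1·G + b2·S + b3·G·S with G coded as the number of copies of one allele (0, 1, 2) and S as smoking status (0 or 1), the covariates named in the abstract (age, sex, principal components) are omitted, no P-value is computed, and the data are invented and noise-free.

```python
def solve_linear(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("design matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= factor * aug[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def fit_interaction(genotype, smoker, trait):
    """Ordinary least squares for [intercept, G, S, G*S]; returns coefficients."""
    rows = [[1.0, g, s, g * s] for g, s in zip(genotype, smoker)]
    p = 4
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * y for r, y in zip(rows, trait)) for i in range(p)]
    return solve_linear(xtx, xty)


# Invented data generated from known coefficients (interaction = 0.3).
genotype = [0, 1, 2, 0, 1, 2]
smoker = [0, 0, 0, 1, 1, 1]
trait = [1.0 + 0.5 * g + 0.2 * s + 0.3 * g * s for g, s in zip(genotype, smoker)]
b0, b_snp, b_smoke, b_interaction = fit_interaction(genotype, smoker, trait)
print(round(b_interaction, 6))
```

    A nonzero interaction coefficient means the smoking effect changes with each additional copy of the allele, which matches the graded pattern across genotype groups described in the stratification analysis.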

  8. GWAS meta-analysis reveals novel loci and genetic correlates for general cognitive function: a report from the COGENT consortium.

    PubMed

    Trampush, J W; Yang, M L Z; Yu, J; Knowles, E; Davies, G; Liewald, D C; Starr, J M; Djurovic, S; Melle, I; Sundet, K; Christoforou, A; Reinvang, I; DeRosse, P; Lundervold, A J; Steen, V M; Espeseth, T; Räikkönen, K; Widen, E; Palotie, A; Eriksson, J G; Giegling, I; Konte, B; Roussos, P; Giakoumaki, S; Burdick, K E; Payton, A; Ollier, W; Horan, M; Chiba-Falek, O; Attix, D K; Need, A C; Cirulli, E T; Voineskos, A N; Stefanis, N C; Avramopoulos, D; Hatzimanolis, A; Arking, D E; Smyrnis, N; Bilder, R M; Freimer, N A; Cannon, T D; London, E; Poldrack, R A; Sabb, F W; Congdon, E; Conley, E D; Scult, M A; Dickinson, D; Straub, R E; Donohoe, G; Morris, D; Corvin, A; Gill, M; Hariri, A R; Weinberger, D R; Pendleton, N; Bitsios, P; Rujescu, D; Lahti, J; Le Hellard, S; Keller, M C; Andreassen, O A; Deary, I J; Glahn, D C; Malhotra, A K; Lencz, T

    2017-03-01

    The complex nature of human cognition has resulted in cognitive genomics lagging behind many other fields in terms of gene discovery using genome-wide association study (GWAS) methods. In an attempt to overcome these barriers, the current study utilized GWAS meta-analysis to examine the association of common genetic variation (~8M single-nucleotide polymorphisms (SNP) with minor allele frequency ⩾1%) to general cognitive function in a sample of 35 298 healthy individuals of European ancestry across 24 cohorts in the Cognitive Genomics Consortium (COGENT). In addition, we utilized individual SNP lookups and polygenic score analyses to identify genetic overlap with other relevant neurobehavioral phenotypes. Our primary GWAS meta-analysis identified two novel SNP loci (top SNPs: rs76114856 in the CENPO gene on chromosome 2 and rs6669072 near LOC105378853 on chromosome 1) associated with cognitive performance at the genome-wide significance level (P<5 × 10^-8). Gene-based analysis identified an additional three Bonferroni-corrected significant loci at chromosomes 17q21.31, 17p13.1 and 1p13.3. Altogether, common variation across the genome resulted in a conservatively estimated SNP heritability of 21.5% (s.e.=0.01%) for general cognitive function. Integration with prior GWAS of cognitive performance and educational attainment yielded several additional significant loci. Finally, we found robust polygenic correlations between cognitive performance and educational attainment, several psychiatric disorders, birth length/weight and smoking behavior, as well as a novel genetic association to the personality trait of openness. These data provide new insight into the genetics of neurocognitive function with relevance to understanding the pathophysiology of neuropsychiatric illness.

  9. GWAS meta-analysis reveals novel loci and genetic correlates for general cognitive function: a report from the COGENT consortium

    PubMed Central

    Trampush, J W; Yang, M L Z; Yu, J; Knowles, E; Davies, G; Liewald, D C; Starr, J M; Djurovic, S; Melle, I; Sundet, K; Christoforou, A; Reinvang, I; DeRosse, P; Lundervold, A J; Steen, V M; Espeseth, T; Räikkönen, K; Widen, E; Palotie, A; Eriksson, J G; Giegling, I; Konte, B; Roussos, P; Giakoumaki, S; Burdick, K E; Payton, A; Ollier, W; Horan, M; Chiba-Falek, O; Attix, D K; Need, A C; Cirulli, E T; Voineskos, A N; Stefanis, N C; Avramopoulos, D; Hatzimanolis, A; Arking, D E; Smyrnis, N; Bilder, R M; Freimer, N A; Cannon, T D; London, E; Poldrack, R A; Sabb, F W; Congdon, E; Conley, E D; Scult, M A; Dickinson, D; Straub, R E; Donohoe, G; Morris, D; Corvin, A; Gill, M; Hariri, A R; Weinberger, D R; Pendleton, N; Bitsios, P; Rujescu, D; Lahti, J; Le Hellard, S; Keller, M C; Andreassen, O A; Deary, I J; Glahn, D C; Malhotra, A K; Lencz, T

    2017-01-01

    The complex nature of human cognition has resulted in cognitive genomics lagging behind many other fields in terms of gene discovery using genome-wide association study (GWAS) methods. In an attempt to overcome these barriers, the current study utilized GWAS meta-analysis to examine the association of common genetic variation (~8M single-nucleotide polymorphisms (SNP) with minor allele frequency ⩾1%) to general cognitive function in a sample of 35 298 healthy individuals of European ancestry across 24 cohorts in the Cognitive Genomics Consortium (COGENT). In addition, we utilized individual SNP lookups and polygenic score analyses to identify genetic overlap with other relevant neurobehavioral phenotypes. Our primary GWAS meta-analysis identified two novel SNP loci (top SNPs: rs76114856 in the CENPO gene on chromosome 2 and rs6669072 near LOC105378853 on chromosome 1) associated with cognitive performance at the genome-wide significance level (P<5 × 10^-8). Gene-based analysis identified an additional three Bonferroni-corrected significant loci at chromosomes 17q21.31, 17p13.1 and 1p13.3. Altogether, common variation across the genome resulted in a conservatively estimated SNP heritability of 21.5% (s.e.=0.01%) for general cognitive function. Integration with prior GWAS of cognitive performance and educational attainment yielded several additional significant loci. Finally, we found robust polygenic correlations between cognitive performance and educational attainment, several psychiatric disorders, birth length/weight and smoking behavior, as well as a novel genetic association to the personality trait of openness. These data provide new insight into the genetics of neurocognitive function with relevance to understanding the pathophysiology of neuropsychiatric illness. PMID:28093568

  10. Variation in the oxytocin receptor gene (OXTR) is associated with pair-bonding and social behavior

    PubMed Central

    Walum, Hasse; Lichtenstein, Paul; Neiderhiser, Jenae M.; Reiss, David; Ganiban, Jody M.; Spotts, Erica L.; Pedersen, Nancy L.; Anckarsäter, Henrik; Larsson, Henrik; Westberg, Lars

    2011-01-01

    Background In specific vole and primate species the neuropeptide Oxytocin (OT) plays a central role in the regulation of pair-bonding behavior. Here we investigate to what extent genetic variants in the oxytocin receptor gene (OXTR) are associated with pair-bonding and related social behaviors in humans. Methods We first genotyped twelve Single Nucleotide Polymorphisms (SNPs) in the Twin and Offspring Study in Sweden (TOSS, N=2309) and the Swedish Twin Study of CHild and Adolescent Development (TCHAD, N=1240) comprising measures of self-reported pair-bonding behavior. In the TOSS sample we further investigated one of the SNPs for measures of marital status and quality. Moreover, in the TCHAD sample we explored the longitudinal relationship between precursors of pair-bonding during childhood and subsequent behavior in romantic relationships. Finally, in TCHAD and in the Child and Adolescent Twin Study of Sweden (CATSS, N=1771) the association between the same SNP and childhood behaviors was investigated. Results One SNP (rs7632287) in OXTR was associated with traits reflecting pair-bonding in women in the TOSS and TCHAD samples. In girls the rs7632287 SNP was further associated with childhood social problems, which longitudinally predicted pair-bonding behavior in the TCHAD sample. This association was replicated in the CATSS sample in which an association between the same SNP and social interaction deficit symptoms from the autism spectrum was detected. Conclusion These results suggest an association between variation in OXTR and human pair-bonding and other social behaviors, possibly indicating that the well described influence of OT on affiliative behavior in voles could also be of importance for humans. PMID:22015110

  11. An Intronic Polymorphism in couch potato Is Not Distributed Clinally in European Drosophila melanogaster Populations nor Does It Affect Diapause Inducibility.

    PubMed

    Zonato, Valeria; Fedele, Giorgio; Kyriacou, Charalambos P

    2016-01-01

    couch potato (cpo) encodes an RNA binding protein that has been reported to be expressed in the peripheral and central nervous system of embryos, larvae and adults, including the major endocrine organ, the ring gland. A polymorphism in the D. melanogaster cpo gene coding region displays a latitudinal cline in frequency in North American populations, but as cpo lies within the inversion In(3R)Payne, which is at high frequencies and itself shows a strong cline on this continent, interpretation of the cpo cline is not straightforward. A second downstream SNP in strong linkage disequilibrium with the first has been claimed to be primarily responsible for the latitudinal cline in diapause incidence in USA populations. Here, we investigate the frequencies of these two cpo SNPs in populations of Drosophila throughout continental Europe. The advantage of studying cpo variation in Europe is the very low frequency of In(3R)Payne, which, as we reveal here, does not appear to be clinally distributed. We observe a very different geographical scenario for cpo variation from the one in North America, suggesting that the downstream SNP does not play a role in diapause. In an attempt to verify whether the SNPs influence diapause we subsequently generated lines with different combinations of the two cpo SNPs on known timeless (tim) genetic backgrounds, because polymorphism in the clock gene tim plays a significant role in diapause inducibility. Our results reveal that the downstream cpo SNP does not seem to play any role in diapause induction in European populations, in contrast to the upstream coding cpo SNP. Consequently, all future diapause studies on strains of D. melanogaster should initially determine their tim and cpo status.

  12. Recent evolution of glacial lakes in the Eastern Himalayas: the case-study of Mt. Everest (Nepal)

    NASA Astrophysics Data System (ADS)

    Salerno, Franco; D'Agata, Carlo; Diolaiuti, Guglielmina; Smiraglia, Claudio; Viviano, Gaetano; Tartari, Gianni

    2010-05-01

    In this contribution we analyze glacier and lake surface variations from the end of the 1950s to 2008 (around 50 years) using historical maps and remote sensing images. The Sagarmatha National Park (SNP), in the Eastern Himalayan range (Nepal), covers an area of 1141 km2, ranging in elevation from 2845 m to 8848 m (Mt. Everest). Nearly all of its glaciers (28 out of a total of 29 in SNP) are ‘black glaciers’, also known as D-type or debris-covered glaciers. Overall, SNP experienced a small net reduction in glacier cover of 19.6 km2 (4.9%) from 403.9 km2 at the end of the 1950s to 384.6 km2 at the start of the 1990s. As regards lake surface variations, SNP experienced a very large net increase in lake surface cover of 1.6 km2 (26%), from 6.0 km2 at the end of the 1950s to 7.6 km2 in 2008. Moreover, the number of lakes increased markedly (by 36%, from 124 to 169). The new lakes have appeared at higher elevations (42 m higher than the lakes of the 1950s), probably following glacier retreat. As previously documented in the literature, proglacial lakes (moraine-dammed and in contact with the glacier front) are the type of glacial lake most affected by climate change. These lakes are susceptible to Glacial Lake Outburst Floods (GLOFs), with the potential to release millions of cubic meters of water in a few hours and cause catastrophic flooding. We conclude this contribution by pointing out the scientific questions that have emerged, to guide future research activities.

  13. KCNK3 VARIANTS ARE ASSOCIATED WITH HYPERALDOSTERONISM AND HYPERTENSION

    PubMed Central

    Manichaikul, Ani; Rich, Stephen S.; Allison, Matthew A.; Guagliardo, Nick A.; Bayliss, Douglas A.; Carey, Robert M.; Barrett, Paula Q.

    2016-01-01

    Blood pressure (BP) is a complex trait that is the consequence of an interaction between genetic and environmental determinants. Previous studies have demonstrated increased blood pressure in mice with global deletion of TASK-1 channels contemporaneous with diverse dysregulation of aldosterone production. In humans, genome-wide association studies (GWAS) in ~100,000 individuals of European, East Asian and South Asian ancestry identified a single nucleotide polymorphism (SNP) in KCNK3 (the gene encoding TASK-1) associated with mean arterial pressure (MAP). The current study was motivated by the hypotheses that (1) association of KCNK3 SNPs with BP and related traits extends to African Americans and Hispanics, and (2) KCNK3 SNPs exhibit associations with plasma renin activity (PRA) and aldosterone levels. We examined baseline BP measurements for 7,840 participants from the Multi-Ethnic Study of Atherosclerosis (MESA), and aldosterone levels and PRA in a subset of 1,653 MESA participants. We identified statistically significant association of the previously reported KCNK3 SNP (rs1275988) with MAP in MESA African Americans (P=0.024) and a nearby SNP (rs13394970) in MESA Hispanics (P=0.031). We discovered additional KCNK3 SNP associations with systolic BP (SBP), MAP and hypertension. We also identified statistically significant association of KCNK3 rs2586886 with plasma aldosterone level in MESA and demonstrated that global deletion of TASK-1 channels in mice produces a mild-hyperaldosteronism, not associated with a decrease in renin. Our results suggest genetic variation in the KCNK3 gene may contribute to blood pressure variation and less severe hypertensive disorders in which aldosterone may be one of several causative factors. PMID:27296998

  14. Aromatase Inhibitor-Associated Bone Fractures: A Case-Cohort GWAS and Functional Genomics

    PubMed Central

    Liu, Mohan; Goss, Paul E.; Ingle, James N.; Kubo, Michiaki; Furukawa, Yoichi; Batzler, Anthony; Jenkins, Gregory D.; Carlson, Erin E.; Nakamura, Yusuke; Schaid, Daniel J.; Chapman, Judy-Anne W.; Shepherd, Lois E.; Ellis, Matthew J.; Khosla, Sundeep; Wang, Liewei

    2014-01-01

    Bone fractures are a major consequence of osteoporosis. There is a direct relationship between serum estrogen concentrations and osteoporosis risk. Aromatase inhibitors (AIs) greatly decrease serum estrogen levels in postmenopausal women, and increased incidence of fractures is a side effect of AI therapy. We performed a discovery case-cohort genome-wide association study (GWAS) using samples from 1071 patients, 231 cases and 840 controls, enrolled in the MA.27 breast cancer AI trial to identify genetic factors involved in AI-related fractures, followed by functional genomic validation. Association analyses identified 20 GWAS single nucleotide polymorphism (SNP) signals with P < 5E-06. After removal of signals in gene deserts and those composed entirely of imputed SNPs, we applied a functional validation “decision cascade” that resulted in validation of the CTSZ-SLMO2-ATP5E, TRAM2-TMEM14A, and MAP4K4 genes. These genes all displayed estradiol (E2)-dependent induction in human fetal osteoblasts transfected with estrogen receptor-α, and their knockdown altered the expression of known osteoporosis-related genes. These same genes also displayed SNP-dependent variation in E2 induction that paralleled the SNP-dependent induction of known osteoporosis genes, such as osteoprotegerin. In summary, our case-cohort GWAS identified SNPs in or near CTSZ-SLMO2-ATP5E, TRAM2-TMEM14A, and MAP4K4 that were associated with risk for bone fracture in estrogen receptor-positive breast cancer patients treated with AIs. These genes displayed E2-dependent induction, their knockdown altered the expression of genes related to osteoporosis, and they displayed SNP genotype-dependent variation in E2 induction. These observations may lead to the identification of novel mechanisms associated with fracture risk in postmenopausal women treated with AIs. PMID:25148458

  15. MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells

    PubMed Central

    Schulz, Vincent; Chen, Min; Tuck, David

    2010-01-01

    Background Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. Methods We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. Conclusions We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM. Availability The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM. PMID:20532221
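
    To illustrate why stromal contamination complicates the interpretation of allele-specific signals, the sketch below computes the expected B-allele frequency at a germline-heterozygous SNP under a simple two-population mixture. This is a generic mixture formula and not the MixHMM model itself; the function name and parameters are hypothetical, and stromal cells are assumed to be diploid with one copy of each allele.

```python
def expected_baf(tumor_b_copies, tumor_total_copies, stromal_fraction):
    """Expected B-allele frequency at a germline-heterozygous SNP.

    Assumption (not taken from the MixHMM paper): the sample is a mixture of
    tumor cells and normal diploid stromal cells carrying one A and one B copy.
    """
    if not 0.0 <= stromal_fraction <= 1.0:
        raise ValueError("stromal_fraction must lie in [0, 1]")
    tumor_fraction = 1.0 - stromal_fraction
    # Average number of B copies and of total copies per cell in the mixture.
    b_copies = tumor_fraction * tumor_b_copies + stromal_fraction * 1.0
    total_copies = tumor_fraction * tumor_total_copies + stromal_fraction * 2.0
    if total_copies == 0:
        raise ValueError("no DNA at this locus (homozygous deletion, pure tumor)")
    return b_copies / total_copies


# Hemizygous deletion that removed the B allele (tumor genotype "A").
print(expected_baf(0, 1, stromal_fraction=0.0))  # pure tumor
print(expected_baf(0, 1, stromal_fraction=0.8))  # heavily contaminated sample
```

    In the pure tumor the B allele disappears entirely, whereas with 80% stromal cells the expected value moves back toward 0.5, which is why a mixing proportion has to be modeled explicitly.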

  16. Nucleotide diversity, natural variation, and evolution of Flexible culm-1 and Strong culm-2 lodging resistance genes in rice.

    PubMed

    Rashid, Muhammad Abdul Rehman; Zhao, Yan; Zhang, Hongliang; Li, Jinjie; Li, Zichao

    2016-07-01

    Lodging resistance is one of the vital traits in yield improvement and sustainability. Culm wall thickness, diameter, and strength are different traits that can govern the lodging resistance in rice. The genes SCM2 and FC1 have been isolated for culm thickness, strength, and flexibility, but their functional nucleotide variations were still unknown. We used a 13× deep sequence of 795 diverse genotypes to present the functional variation and SNP diversity in SCM2 and FC1. The major functional variant for the SCM2 gene was at position 27480181 and for the FC1 gene at position 31072992. Haplotype analysis of both genes provided their various allelic differences among haplotypes. SCM2 alleles further presented the evolution of Oryza sativa L. subsp. indica and subsp. japonica genomes from a common parent in different geographical zones, while the haplotypes of FC1 suggested their evolution from different strains of the common parent Oryza rufipogon. SCM2 showed purifying selection and functional associations with rare alleles, while FC1 displayed balanced selection favored by multiple heterozygous alleles. Genotypes with an allelic combination of SCM2-3 and FC1-2 in japonica background exhibited striking resistance against lodging, which can be used in further breeding programs.

  17. Patterns of Ancestry, Signatures of Natural Selection, and Genetic Association with Stature in Western African Pygmies

    PubMed Central

    Jarvis, Joseph P.; Ferwerda, Bart; Froment, Alain; Bodo, Jean-Marie; Beggs, William; Hoffman, Gabriel; Mezey, Jason; Tishkoff, Sarah A.

    2012-01-01

    African Pygmy groups show a distinctive pattern of phenotypic variation, including short stature, which is thought to reflect past adaptation to a tropical environment. Here, we analyze Illumina 1M SNP array data in three Western Pygmy populations from Cameroon and three neighboring Bantu-speaking agricultural populations with whom they have admixed. We infer genome-wide ancestry, scan for signals of positive selection, and perform targeted genetic association with measured height variation. We identify multiple regions throughout the genome that may have played a role in adaptive evolution, many of which contain loci with roles in growth hormone, insulin, and insulin-like growth factor signaling pathways, as well as immunity and neuroendocrine signaling involved in reproduction and metabolism. The most striking results are found on chromosome 3, which harbors a cluster of selection and association signals between approximately 45 and 60 Mb. This region also includes the positional candidate genes DOCK3, which is known to be associated with height variation in Europeans, and CISH, a negative regulator of cytokine signaling known to inhibit growth hormone-stimulated STAT5 signaling. Finally, pathway analysis for genes near the strongest signals of association with height indicates enrichment for loci involved in insulin and insulin-like growth factor signaling. PMID:22570615

  18. Multiple Loci are associated with dilated cardiomyopathy in Irish wolfhounds.

    PubMed

    Philipp, Ute; Vollmar, Andrea; Häggström, Jens; Thomas, Anne; Distl, Ottmar

    2012-01-01

    Dilated cardiomyopathy (DCM) is a highly prevalent and often lethal disease in Irish wolfhounds. Complex segregation analysis indicated different loci involved in pathogenesis. Linear fixed and mixed models were used for the genome-wide association study. Using 106 DCM cases and 84 controls we identified one SNP significantly associated with DCM on CFA37 and five SNPs suggestively associated with DCM on CFA1, 10, 15, 21 and 17. On CFA37, MOGAT1 and ACSL3, two enzymes of lipid metabolism, were located near the identified SNP.

  19. Multiple Loci Are Associated with Dilated Cardiomyopathy in Irish Wolfhounds

    PubMed Central

    Philipp, Ute; Vollmar, Andrea; Häggström, Jens; Thomas, Anne; Distl, Ottmar

    2012-01-01

    Dilated cardiomyopathy (DCM) is a highly prevalent and often lethal disease in Irish wolfhounds. Complex segregation analysis indicated different loci involved in pathogenesis. Linear fixed and mixed models were used for the genome-wide association study. Using 106 DCM cases and 84 controls we identified one SNP significantly associated with DCM on CFA37 and five SNPs suggestively associated with DCM on CFA1, 10, 15, 21 and 17. On CFA37, MOGAT1 and ACSL3, two enzymes of lipid metabolism, were located near the identified SNP. PMID:22761652

  20. A novel approach to analyzing fMRI and SNP data via parallel independent component analysis

    NASA Astrophysics Data System (ADS)

    Liu, Jingyu; Pearlson, Godfrey; Calhoun, Vince; Windemuth, Andreas

    2007-03-01

    There is current interest in understanding genetic influences on brain function in both the healthy and the disordered brain. Parallel independent component analysis, a new method for analyzing multimodal data, is proposed in this paper and applied to functional magnetic resonance imaging (fMRI) and a single nucleotide polymorphism (SNP) array. The method aims to identify the independent components of each modality and the relationship between the two modalities. We analyzed 92 participants, including 29 schizophrenia (SZ) patients, 13 unaffected SZ relatives, and 50 healthy controls. We found a correlation of 0.79 between one fMRI component and one SNP component. The fMRI component consists of activations in cingulate gyrus, multiple frontal gyri, and superior temporal gyrus. The related SNP component is contributed to significantly by 9 SNPs located in sets of genes, including those coding for apolipoproteins A-I and C-III, malate dehydrogenase 1 and the gamma-aminobutyric acid alpha-2 receptor. A significant difference in the presence of this SNP component is found between the SZ group (SZ patients and their relatives) and the control group. In summary, we constructed a framework to identify the interactions between brain functional and genetic information; our findings provide new insight into understanding genetic influences on brain function in a common mental disorder.

  1. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    PubMed

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. 
    The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNPs selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal.
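
    The "system information" term in the objective function above can be illustrated with a small sketch of locus-averaged Shannon entropy. The abstract does not give the exact formula, so this assumes biallelic SNPs summarized by one allele frequency each, entropy measured in bits, and no uniformity adjustment; the function names are hypothetical.

```python
import math


def locus_entropy(allele_freq):
    """Shannon entropy (in bits) of a biallelic SNP with the given allele frequency."""
    if not 0.0 <= allele_freq <= 1.0:
        raise ValueError("allele frequency must lie in [0, 1]")
    entropy = 0.0
    for q in (allele_freq, 1.0 - allele_freq):
        if q > 0.0:  # 0 * log(0) is taken as 0
            entropy -= q * math.log2(q)
    return entropy


def locus_averaged_entropy(allele_freqs):
    """Mean per-locus entropy over a candidate SNP set (assumed form of LASE)."""
    if not allele_freqs:
        raise ValueError("need at least one SNP")
    return sum(locus_entropy(p) for p in allele_freqs) / len(allele_freqs)


# A panel of intermediate-frequency SNPs is more informative than a panel of rare ones.
print(locus_averaged_entropy([0.5, 0.4, 0.45]))
print(locus_averaged_entropy([0.02, 0.01, 0.05]))
```

    Under this assumed form, a selection algorithm that maximizes the average entropy favors SNPs with allele frequencies near 0.5.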

  2. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications

    PubMed Central

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. 
    The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNPs selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal. PMID:27583971

  3. The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies

    PubMed Central

    Barnett, Ian; Mukherjee, Rajarshi; Lin, Xihong

    2017-01-01

    It is of substantial interest to study the effects of genes, genetic pathways, and networks on the risk of complex diseases. These genetic constructs each contain multiple SNPs, which are often correlated and function jointly, and might be large in number. However, only a sparse subset of SNPs in a genetic construct is generally associated with the disease of interest. In this article, we propose the generalized higher criticism (GHC) to test for the association between an SNP set and a disease outcome. The higher criticism is a test traditionally used in high-dimensional signal detection settings when marginal test statistics are independent and the number of parameters is very large. However, these assumptions do not always hold in genetic association studies, due to linkage disequilibrium among SNPs and the finite number of SNPs in an SNP set in each genetic construct. The proposed GHC overcomes the limitations of the higher criticism by allowing for arbitrary correlation structures among the SNPs in an SNP-set, while performing accurate analytic p-value calculations for any finite number of SNPs in the SNP-set. We obtain the detection boundary of the GHC test. We compared empirically using simulations the power of the GHC method with existing SNP-set tests over a range of genetic regions with varied correlation structures and signal sparsity. We apply the proposed methods to analyze the CGEM breast cancer genome-wide association study. Supplementary materials for this article are available online. PMID:28736464
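
    For orientation, the classical higher criticism statistic that the GHC generalizes can be sketched as follows. This is the standard form for independent p-values, HC = max over i of sqrt(n) (i/n - p_(i)) / sqrt(p_(i) (1 - p_(i))), and not the GHC itself, which additionally accounts for correlation among SNPs; the function name is hypothetical.

```python
import math


def higher_criticism(p_values):
    """Classical higher criticism statistic for independent p-values.

    Returns the maximum standardized excess of small p-values over the
    uniform expectation. P-values equal to 0 or 1 are skipped because the
    standardization is undefined there; if every p-value is skipped the
    result is -inf.
    """
    n = len(p_values)
    if n == 0:
        raise ValueError("need at least one p-value")
    best = -math.inf
    for i, p in enumerate(sorted(p_values), start=1):
        if p <= 0.0 or p >= 1.0:
            continue
        z = math.sqrt(n) * (i / n - p) / math.sqrt(p * (1.0 - p))
        best = max(best, z)
    return best


# One strong signal among otherwise unremarkable p-values inflates the statistic.
print(higher_criticism([0.25, 0.5, 0.75]))
print(higher_criticism([0.0001, 0.5, 0.75]))
```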

  4. Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata.

    PubMed

    Fracassetti, Marco; Griffin, Philippa C; Willi, Yvonne

    2015-01-01

    Sequencing pooled DNA of multiple individuals from a population instead of sequencing individuals separately has become popular due to its cost-effectiveness and simple wet-lab protocol, although some criticism of this approach remains. Here we validated a protocol for pooled whole-genome re-sequencing (Pool-seq) of Arabidopsis lyrata libraries prepared with low amounts of DNA (1.6 ng per individual). The validation was based on comparing single nucleotide polymorphism (SNP) frequencies obtained by pooling with those obtained by individual-based Genotyping By Sequencing (GBS). Furthermore, we investigated the effect of sample number, sequencing depth per individual and variant caller on population SNP frequency estimates. For Pool-seq data, we compared frequency estimates from two SNP callers, VarScan and Snape; the former employs a frequentist SNP calling approach while the latter uses a Bayesian approach. Results revealed concordance correlation coefficients well above 0.8, confirming that Pool-seq is a valid method for acquiring population-level SNP frequency data. Higher accuracy was achieved by pooling more samples (25 compared to 14) and working with higher sequencing depth (4.1× per individual compared to 1.4× per individual), which increased the concordance correlation coefficient to 0.955. The Bayesian-based SNP caller produced somewhat higher concordance correlation coefficients, particularly at low sequencing depth. We recommend pooling at least 25 individuals combined with sequencing at a depth of 100× to produce satisfactory frequency estimates for common SNPs (minor allele frequency above 0.05).
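
    The concordance correlation coefficient used above to compare Pool-seq and GBS frequency estimates can be computed as in the following sketch. It assumes Lin's definition with population (1/n) moments; the abstract does not state which estimator the authors used, and the example frequencies are made up for illustration.

```python
def concordance_correlation(x, y):
    """Lin's concordance correlation coefficient between two paired samples."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must be paired and contain at least two values")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    denom = var_x + var_y + (mean_x - mean_y) ** 2
    if denom == 0:
        raise ValueError("both samples are constant and identical")
    return 2.0 * cov_xy / denom


# Hypothetical allele frequencies for the same SNPs from two methods.
pooled = [0.10, 0.50, 0.90]
individual = [0.20, 0.60, 1.00]  # perfectly correlated but shifted by 0.1
print(concordance_correlation(pooled, individual))
```

    Unlike Pearson's correlation, which would be 1 for the shifted data above, the concordance coefficient penalizes the systematic offset, which is why it suits method-agreement studies.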

  5. Complete mitochondrial genome sequences of Brassica rapa (Chinese cabbage and mizuna), and intraspecific differentiation of cytoplasm in B. rapa and Brassica juncea.

    PubMed

    Hatono, Saki; Nishimura, Kaori; Murakami, Yoko; Tsujimura, Mai; Yamagishi, Hiroshi

    2017-09-01

    The complete sequence of the mitochondrial genome was determined for two cultivars of Brassica rapa. After determining the sequence of a Chinese cabbage variety, 'Oushou hakusai', the sequence of a mizuna variety, 'Chusei shiroguki sensuji kyomizuna', was mapped against the sequence of Chinese cabbage. The precise sequences where the two varieties demonstrated variation were ascertained by direct sequencing. It was found that the mitochondrial genomes of the two varieties are identical over 219,775 bp, with a single nucleotide polymorphism (SNP) between the genomes. Because B. rapa is the maternal species of an amphidiploid crop species, Brassica juncea, the distribution of the SNP was observed both in B. rapa and B. juncea. While the mizuna type SNP was restricted mainly to cultivars of mizuna (japonica group) in B. rapa, the mizuna type was widely distributed in B. juncea. The finding that the two Brassica species have these SNP types in common suggests that the nucleotide substitution occurred in wild B. rapa before both mitotypes were domesticated. It was further inferred that the interspecific hybridization between B. rapa and B. nigra took place twice and resulted in the two mitotypes of cultivated B. juncea.

  6. Molecular phylogeny and SNP variation of polar bears (Ursus maritimus), brown bears (U. arctos), and black bears (U. americanus) derived from genome sequences.

    PubMed

    Cronin, Matthew A; Rincon, Gonzalo; Meredith, Robert W; MacNeil, Michael D; Islas-Trejo, Alma; Cánovas, Angela; Medrano, Juan F

    2014-01-01

    We assessed the relationships of polar bears (Ursus maritimus), brown bears (U. arctos), and black bears (U. americanus) with high throughput genomic sequencing data with an average coverage of 25× for each species. A total of 1.4 billion 100-bp paired-end reads were assembled using the polar bear and annotated giant panda (Ailuropoda melanoleuca) genome sequences as references. We identified 13.8 million single nucleotide polymorphisms (SNP) in the 3 species aligned to the polar bear genome. These data indicate that polar bears and brown bears share more SNP with each other than either does with black bears. Concatenation and coalescence-based analysis of consensus sequences of approximately 1 million base pairs of ultraconserved elements in the nuclear genome resulted in a phylogeny with black bears as the sister group to brown and polar bears, and all brown bears are in a separate clade from polar bears. Genotypes for 162 SNP loci of 336 bears from Alaska and Montana showed that the species are genetically differentiated and there is geographic population structure of brown and black bears but not polar bears.

  7. DNA polymorphisms and transcript abundance of PRKAG2 and phosphorylated AMP-activated protein kinase in the rumen are associated with gain and feed intake in beef steers

    USDA-ARS's Scientific Manuscript database

    Beef steers with variation in feed efficiency phenotypes were evaluated previously on a high density SNP panel. Ten markers from rs110125325-rs41652818 on bovine chromosome 4 were associated with average daily gain (ADG). To identify the gene(s) in this 1.2Mb region responsible for variation in AD...

  8. -174 G>C IL-6 polymorphism and primary iron overload in male patients.

    PubMed

    Tetzlaff, Walter F; Meroño, Tomás; Botta, Eliana E; Martín, Maximiliano E; Sorroche, Patricia B; Boero, Laura E; Castro, Marcelo; Frechtel, Gustavo D; Rey, Jorge; Daruich, Jorge; Cerrone, Gloria E; Brites, Fernando

    2018-04-14

    Primary iron overload (IO) is commonly associated with mutations in the hereditary hemochromatosis gene (HFE). Nonetheless, other genetic variants may influence the development of IO beyond HFE mutations. There is a single nucleotide polymorphism (SNP) at -174 G>C of the interleukin (IL)-6 gene which might be associated with primary IO. Our aim was to study the association between the -174 G>C SNP in the IL-6 gene promoter and primary IO in middle-aged male patients. We studied 37 men with primary IO diagnosed by liver histology. Controls were age-matched male volunteers (n = 37). HFE mutations and the -174 G>C SNP in the IL-6 gene promoter were evaluated by PCR-RFLP. Logistic regression was used to evaluate the association between primary IO and the -174 G>C SNP. Patients and control subjects were in Hardy-Weinberg equilibrium for the -174 G>C SNP (p = 0.17). Significantly different genotype frequencies were observed between patients (43% CC, 43% CG, and 14% GG) and control subjects (10% CC, 41% CG, and 49% GG) (OR = 4.09, 95% CI = 2.06-8.13; p < 0.0001). The multiple logistic regression analysis showed that IO was significantly associated with CC homozygosity at the -174 G>C SNP (OR = 6.3, 95% CI = 1.9-21.4; p < 0.005) in a model adjusted by age and body mass index. In conclusion, CC homozygosity at the -174 G>C SNP in the IL-6 gene promoter can be proposed as one of the gene variants influencing iron accumulation in male adults with HFE mutations. Studies in larger cohorts are warranted.
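
    The Hardy-Weinberg check mentioned above is commonly done with a one-degree-of-freedom chi-square test on genotype counts, sketched below. The abstract does not say which test the authors used, and the counts in the example are hypothetical rather than taken from the study.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) for Hardy-Weinberg equilibrium at a biallelic SNP.

    Returns (chi-square statistic, p-value). Genotype counts are for the two
    homozygotes (n_aa, n_bb) and the heterozygote (n_ab).
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    if p == 0.0 or p == 1.0:
        raise ValueError("monomorphic locus: the test is undefined")
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts: the first set matches HWE exactly, the second lacks heterozygotes.
print(hwe_chi_square(25, 50, 25))
print(hwe_chi_square(50, 0, 50))
```

    With samples as small as the 37 patients and 37 controls described here, an exact test is often preferred over the chi-square approximation.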

  9. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography

    PubMed Central

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-01-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO-defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools was applied to a group of 201 multibacillary patients, including six multi-case families, from eleven departments. The locations (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci, 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected by combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54], predominantly distributed in the Atlantic departments, and SNP7614(T)/27-5(4)/12-5(5) [T45], associated with the Andean departments. A novel genotype, SNP7614(C)/27-5(6)/12-5(4) [C64], was detected in cities along the Magdalena river, which separates the Andean from the Atlantic departments; a subset was further characterized, showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large-scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420

  10. A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data.

    PubMed

    Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G; Gu, C Charles

    2014-11-01

    Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of custom correlation coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (six genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. © 2014 WILEY PERIODICALS, INC.
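
    The three-step process described above can be sketched in Python. The abstract does not give the CCC formula, so this sketch substitutes the ordinary Pearson correlation as a stand-in, groups SNPs by connected components above a threshold, and defines a cluster carrier as an individual with every allele in the cluster; all three choices, the threshold of 0.8, and the function names are assumptions made for illustration only.

```python
from itertools import combinations
from statistics import mean


def pearson(x, y):
    """Pearson correlation; a stand-in, NOT the CCC metric from the paper."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def cluster_snps(genotypes, threshold=0.8):
    """Steps 1 and 2: pairwise correlations, then connected components.

    genotypes maps a SNP id to a list of 0/1 allele-carrier flags,
    one entry per individual. Singleton clusters are dropped.
    """
    snps = list(genotypes)
    parent = {s: s for s in snps}

    def find(s):
        # Union-find lookup with path halving.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for a, b in combinations(snps, 2):
        if pearson(genotypes[a], genotypes[b]) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for s in snps:
        clusters.setdefault(find(s), []).append(s)
    return [sorted(c) for c in clusters.values() if len(c) > 1]


def cluster_frequency(genotypes, cluster, individuals):
    """Step 3: fraction of individuals carrying every allele in the cluster."""
    carriers = sum(all(genotypes[s][i] for s in cluster) for i in individuals)
    return carriers / len(individuals)
```

    Comparing `cluster_frequency` between case indices and control indices gives the kind of contrast the abstract reports (for example, a cluster present in controls but absent in cases).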

  11. A custom correlation coefficient (CCC) approach for fast identification of multi-SNP association patterns in genome-wide SNPs data

    PubMed Central

    Climer, Sharlee; Yang, Wei; de las Fuentes, Lisa; Dávila-Román, Victor G.; Gu, C. Charles

    2014-01-01

    Complex diseases are often associated with sets of multiple interacting genetic factors and possibly with unique sets of the genetic factors in different groups of individuals (genetic heterogeneity). We introduce a novel concept of Custom Correlation Coefficient (CCC) between single nucleotide polymorphisms (SNPs) that address genetic heterogeneity by measuring subset correlations autonomously. It is used to develop a 3-step process to identify candidate multi-SNP patterns: (1) pairwise (SNP-SNP) correlations are computed using CCC; (2) clusters of so-correlated SNPs identified; and (3) frequencies of these clusters in disease cases and controls compared to identify disease-associated multi-SNP patterns. This method identified 42 candidate multi-SNP associations with hypertensive heart disease (HHD), among which one cluster of 22 SNPs (6 genes) included 13 in SLC8A1 (aka NCX1, an essential component of cardiac excitation-contraction coupling) and another of 32 SNPs had 29 from a different segment of SLC8A1. While allele frequencies show little difference between cases and controls, the cluster of 22 associated alleles were found in 20% of controls but no cases and the other in 3% of controls but 20% of cases. These suggest that both protective and risk effects on HHD could be exerted by combinations of variants in different regions of SLC8A1, modified by variants from other genes. The results demonstrate that this new correlation metric identifies disease-associated multi-SNP patterns overlooked by commonly used correlation measures. Furthermore, computation time using CCC is a small fraction of that required by other methods, thereby enabling the analyses of large GWAS datasets. PMID:25168954

  12. Characterization of genetic variability of Venezuelan equine encephalitis viruses

    DOE PAGES

    Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...

    2016-04-07

    Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.

  13. A functional polymorphism of the TNF-α gene that is associated with type 2 DM

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Susa, Shinji; Daimon, Makoto; Sakabe, Jun-Ichi

    2008-05-09

    To examine the association of the tumor necrosis factor-α (TNF-α) gene region with type 2 diabetes (DM), 11 single-nucleotide polymorphisms (SNPs) of the region were analyzed. The initial study using a sample set (148 cases vs. 227 controls) showed a significant association of the SNP IVS1G + 123A of the TNF-α gene with DM (p = 0.0056). Multiple logistic regression analysis using an enlarged sample set (225 vs. 716) revealed a significant association of the SNP with DM independently of any clinical traits examined (OR: 1.49, p = 0.014). The functional relevance of the SNP was examined by electrophoretic mobility shift assays using nuclear extracts from U937 and NIH3T3 cells and by luciferase assays in these cells with Simian virus 40 promoter- and TNF-α promoter-reporter gene constructs. The functional analyses showed that the YY1 transcription factor bound allele-specifically to the SNP region, and the IVS1 + 123A allele had an increase in luciferase expression compared with the G allele.

  14. Phenotype variations affect genetic association studies of degenerative disc disease: conclusions of analysis of genetic association of 58 single nucleotide polymorphisms with highly specific phenotypes for disc degeneration in 332 subjects.

    PubMed

    Rajasekaran, S; Kanna, Rishi Mugesh; Senthil, Natesan; Raveendran, Muthuraja; Cheung, Kenneth M C; Chan, Danny; Subramaniam, Sakthikanal; Shetty, Ajoy Prasad

    2013-10-01

    Although the influence of genetics on the process of disc degeneration is well recognized, in recently published studies, there is a wide variation in the race and selection criteria for such study populations. More importantly, the radiographic features of disc degeneration that are selected to represent the disc degeneration phenotype are variable in these studies. The study presented here evaluates the association between single nucleotide polymorphisms (SNPs) of candidate genes and three distinct radiographic features that can be defined as the degenerative disc disease (DDD) phenotype. The study objectives were to examine the allelic diversity of 58 SNPs related to 35 candidate genes related to lumbar DDD, to evaluate the association in a hitherto unevaluated ethnic Indian population that represents more than one-sixth of the world population, and to analyze how genetic associations can vary in the same study subjects with the choice of phenotype. A cross-sectional, case-control study of an ethnic Indian population was carried out. Fifty-eight SNPs in 35 potential candidate genes were evaluated in 342 subjects and the associations were analyzed against three highly specific markers for DDD, namely disc degeneration by Pfirrmann grading, end-plate damage evaluated by total end-plate damage score, and annular tears evaluated by disc herniations and hyperintense zones. Genotyping of cases and controls was performed on a genome-wide SNP array to identify potential associated disease loci. The results from the genome-wide SNP array were then used to facilitate SNP selection and genotype validation was conducted using Sequenom-based genotyping. Eleven of the 58 SNPs provided evidence of association with one of the phenotypes. For annular tears, rs1042631 SNP of AGC1 and rs467691 SNP of ADAMTS5 were highly significantly associated (p<.01) and SNPs in NGFB, IL1B, IL18RAP, and MMP10 were also significantly associated (p<.05). 
The rs4076018 SNP of NGFB was highly significant (p<.01) and rs2292657 SNP of GLI1 was significantly (p<.05) correlated to disc degeneration. For end-plate damage, the rs2252070 SNP of MMP 13 showed a significant association (p<.05). Previously associated genes such as COL 9, SKT, CHST 3, CILP, IGFR, SOXp, BMP, MMP 2-12, ADH2, IL1RN, and COX2 were not significantly associated and new associations (NGFB and GLI1) were identified. The validity of all the associations was found to be phenotype dependent. For the first time, genetic associations with DDD have been performed in an Indian population. Apart from identifying new associations, the highlight of the study was that in the same study population with DDD, SNP associations completely changed when different radiographic features were used to define the DDD phenotype. Our study results therefore indicate that standardization of the phenotypes chosen to study the genetics of disc degeneration is essential and should be strongly considered before planning genetic association studies. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. Pacifiplex: an ancestry-informative SNP panel centred on Australia and the Pacific region.

    PubMed

    Santos, Carla; Phillips, Christopher; Fondevila, Manuel; Daniel, Runa; van Oorschot, Roland A H; Burchard, Esteban G; Schanfield, Moses S; Souto, Luis; Uacyisrael, Jolame; Via, Marc; Carracedo, Ángel; Lareu, Maria V

    2016-01-01

    The analysis of human population variation is an area of considerable interest in the forensic, medical genetics and anthropological fields. Several forensic single nucleotide polymorphism (SNP) assays provide ancestry-informative genotypes in sensitive tests designed to work with limited DNA samples, including a 34-SNP multiplex differentiating African, European and East Asian ancestries. Although assays capable of differentiating Oceanian ancestry at a global scale have become available, this study describes markers compiled specifically for differentiation of Oceanian populations. A sensitive multiplex assay, termed Pacifiplex, was developed and optimized in a small-scale test applicable to forensic analyses. The Pacifiplex assay comprises 29 ancestry-informative marker SNPs (AIM-SNPs) selected to complement the 34-plex test, that in a combined set distinguish Africans, Europeans, East Asians and Oceanians. Nine Pacific region study populations were genotyped with both SNP assays, then compared to four reference population groups from the HGDP-CEPH human diversity panel. STRUCTURE analyses estimated population cluster membership proportions that aligned with the patterns of variation suggested for each study population's currently inferred demographic histories. Aboriginal Taiwanese and Philippine samples indicated high East Asian ancestry components, Papua New Guinean and Aboriginal Australians samples were predominantly Oceanian, while other populations displayed cluster patterns explained by the distribution of divergence amongst Melanesians, Polynesians and Micronesians. Genotype data from Pacifiplex and 34-plex tests is particularly well suited to analysis of Australian Aboriginal populations and when combined with Y and mitochondrial DNA variation will provide a powerful set of markers for ancestry inference applied to modern Australian demographic profiles. 
On a broader geographic scale, Pacifiplex adds highly informative data for inferring the ancestry of individuals from Oceanian populations. The sensitivity of Pacifiplex enabled successful genotyping of population samples from 50-year-old serum samples obtained from several Oceanian regions that would otherwise be unlikely to produce useful population data. This indicates tests primarily developed for forensic ancestry analysis also provide an important contribution to studies of populations where useful samples are in limited supply. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  16. [Single nucleotide polymorphism and its application in allogeneic hematopoietic stem cell transplantation--review].

    PubMed

    Li, Su-Xia

    2004-12-01

    Single nucleotide polymorphism (SNP) is the third-generation genetic marker, after restriction fragment length polymorphism (RFLP) and short tandem repeat. It represents the densest form of genetic variability in the human genome and has been widely used in gene mapping, cloning, and research on hereditary variation, as well as in parentage identification in forensic medicine. As a stably inherited polymorphism, SNP is becoming a focus of attention for monitoring chimerism and minimal residual disease in patients after allogeneic hematopoietic stem cell transplantation. This article reviews the hereditary characteristics of SNPs, analysis techniques, and their applications in allogeneic stem cell transplantation and other fields.

  17. Selection Signature Analysis Implicates the PC1/PCSK1 Region for Chicken Abdominal Fat Content

    PubMed Central

    Wang, Zhipeng; Zhang, Yuandan; Wang, Shouzhi; Wang, Ning; Ma, Li; Leng, Li; Wang, Shengwen; Wang, Qigui; Wang, Yuxiang; Tang, Zhiquan; Li, Ning; Da, Yang; Li, Hui

    2012-01-01

    We conducted a selection signature analysis using the chicken 60k SNP chip in two chicken lines that had been divergently selected for abdominal fat content (AFC) for 11 generations. The selection signature analysis used multiple signals of selection, including long-range allele frequency differences between the lean and fat lines, long-range heterozygosity changes, linkage disequilibrium, haplotype frequencies, and extended haplotype homozygosity. Multiple signals of selection identified ten signatures on chromosomes 1, 2, 4, 5, 11, 15, 20, 26 and Z. The 0.73 Mb PC1/PCSK1 region of the Z chromosome at 55.43-56.16 Mb was the most heavily selected region. This region had 26 SNP markers and seven genes, Mar-03, SLC12A2, FBN2, ERAP1, CAST, PC1/PCSK1 and ELL2, where PC1/PCSK1 are the chicken/human names for the same gene. The lean and fat lines had two main haplotypes with completely opposite SNP alleles for the 26 SNP markers and were virtually line-specific, and had a recombinant haplotype with nearly equal frequency (0.193 and 0.196) in both lines. Other haplotypes in this region had negligible frequencies. Nine other regions with selection signatures were PAH-IGF1, TRPC4, GJD4-CCNY, NDST4, NOVA1, GALNT9, the ESRP2-GALR1 region with five genes, the SYCP2-CADH4 with six genes, and the TULP1-KIF21B with 14 genes. Genome-wide association analysis showed that nearly all regions with evidence of selection signature had SNP effects with genome-wide significance (P<10⁻⁶) on abdominal fat weight and percentage. The results of this study provide specific gene targets for the control of chicken AFC and a potential model of AFC in human obesity. PMID:22792402

  18. Selection signature analysis implicates the PC1/PCSK1 region for chicken abdominal fat content.

    PubMed

    Zhang, Hui; Hu, Xiaoxiang; Wang, Zhipeng; Zhang, Yuandan; Wang, Shouzhi; Wang, Ning; Ma, Li; Leng, Li; Wang, Shengwen; Wang, Qigui; Wang, Yuxiang; Tang, Zhiquan; Li, Ning; Da, Yang; Li, Hui

    2012-01-01

    We conducted a selection signature analysis using the chicken 60k SNP chip in two chicken lines that had been divergently selected for abdominal fat content (AFC) for 11 generations. The selection signature analysis used multiple signals of selection, including long-range allele frequency differences between the lean and fat lines, long-range heterozygosity changes, linkage disequilibrium, haplotype frequencies, and extended haplotype homozygosity. Multiple signals of selection identified ten signatures on chromosomes 1, 2, 4, 5, 11, 15, 20, 26 and Z. The 0.73 Mb PC1/PCSK1 region of the Z chromosome at 55.43-56.16 Mb was the most heavily selected region. This region had 26 SNP markers and seven genes, Mar-03, SLC12A2, FBN2, ERAP1, CAST, PC1/PCSK1 and ELL2, where PC1/PCSK1 are the chicken/human names for the same gene. The lean and fat lines had two main haplotypes with completely opposite SNP alleles for the 26 SNP markers and were virtually line-specific, and had a recombinant haplotype with nearly equal frequency (0.193 and 0.196) in both lines. Other haplotypes in this region had negligible frequencies. Nine other regions with selection signatures were PAH-IGF1, TRPC4, GJD4-CCNY, NDST4, NOVA1, GALNT9, the ESRP2-GALR1 region with five genes, the SYCP2-CADH4 with six genes, and the TULP1-KIF21B with 14 genes. Genome-wide association analysis showed that nearly all regions with evidence of selection signature had SNP effects with genome-wide significance (P<10(-6)) on abdominal fat weight and percentage. The results of this study provide specific gene targets for the control of chicken AFC and a potential model of AFC in human obesity.

  19. Polymorphisms in TRPV1 and TAS2Rs associate with sensations from sampled ethanol.

    PubMed

    Allen, Alissa L; McGeary, John E; Hayes, John E

    2014-10-01

    Genetic variation in chemosensory genes can explain variability in individual's perception of and preference for many foods and beverages. To gain insight into variable preference and intake of alcoholic beverages, we explored individual variability in the responses to sampled ethanol (EtOH). In humans, EtOH elicits sweet, bitter, and burning sensations. Here, we explore the relationship between variation in EtOH sensations and polymorphisms in genes encoding bitter taste receptors (TAS2Rs) and a polymodal nociceptor (TRPV1). Caucasian participants (n = 93) were genotyped for 16 single nucleotide polymorphisms (SNPs) in TRPV1, 3 SNPs in TAS2R38, and 1 SNP in TAS2R13. Participants rated sampled EtOH on a generalized Labeled Magnitude Scale. Two stimuli were presented: a 16% EtOH whole-mouth sip-and-spit solution with a single time-point rating of overall intensity and a cotton swab saturated with 50% EtOH on the circumvallate papillae (CV) with ratings of multiple qualities over 3 minutes. Area-under-the-curve (AUC) was calculated for the time-intensity data. The EtOH whole-mouth solution had overall intensity ratings near "very strong." Burning/stinging had the highest mean AUC values, followed by bitterness and sweetness. Whole-mouth intensity ratings were significantly associated with burning/stinging and bitterness AUC values on the CV. Three TRPV1 SNPs (rs224547, rs4780521, rs161364) were associated with EtOH sensations on the CV, with 2 (rs224547 and rs4780521) exhibiting strong linkage disequilibrium. Additionally, the TAS2R38 SNPs rs713598, rs1726866, and rs10246939 formed a haplotype, and were associated with bitterness on the CV. Last, overall intensity for whole-mouth EtOH associated with the TAS2R13 SNP rs1015443. These data suggest genetic variation in TRPV1 and TAS2Rs influence sensations from sampled EtOH and may potentially influence how individuals initially respond to alcoholic beverages. Copyright © 2014 by the Research Society on Alcoholism.

  20. [Analysis of genomic copy number variations in two unrelated neonates with 8p deletion and duplication associated with congenital heart disease].

    PubMed

    Mei, Mei; Yang, Lin; Zhan, Guodong; Wang, Huijun; Ma, Duan; Zhou, Wenhao; Huang, Guoying

    2014-06-01

    To screen for genomic copy number variations (CNVs) in two unrelated neonates with multiple congenital abnormalities using an Affymetrix SNP chip and to identify the critical region associated with congenital heart disease. Two neonates were tested for genomic copy number variations using a Cytogenetic SNP chip. Rare CNVs with potential clinical significance were selected based on analysis with the ChAS software: deletion segments larger than 50 kb and duplication segments larger than 150 kb, excluding false-positive CNVs and segments found in the normal population. The identified CNVs were compared with those of the cases in the DECIPHER and ISCA databases. Eleven rare CNVs ranging in size from 546.6 to 27 892 kb were identified in the 2 neonates. In case 1, the deletion spanned 8p23.3-p23.1 (387 912-11 506 771 bp; 11.1 Mb) and the duplication spanned 8p23.1-p11.1 (11 508 387-43 321 279 bp; 31.8 Mb). In case 2, the deletion spanned 8p23.3-p23.1 (46 385-7 809 878 bp; 7.8 Mb) and the duplication spanned 8p23.1-p11.21 (12 260 914-40 917 092 bp; 28.7 Mb). The comparison with the DECIPHER and ISCA databases supported the previous view that 8p23.1 is associated with congenital heart disease, and the region between 7 809 878 and 11 506 771 bp may play a role in the severe cardiac defects associated with 8p23.1 deletions. Case 1 had serious cardiac abnormalities; GATA4 was located in the duplication segment, with increased copy number, while SOX7 was located in the deletion segment, with decreased copy number. The region between 7 809 878 and 11 506 771 bp in 8p23.1 is associated with heart defects, and copy number variants of SOX7 and GATA4 may result in congenital heart disease.
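
    The segment sizes quoted above follow directly from the reported coordinates, and the size filter is a simple threshold on segment length. The Python sketch below recomputes both; the function and constant names are illustrative assumptions, and the coordinates and thresholds are the ones stated in the abstract.

```python
# Size thresholds stated in the abstract (kb).
MIN_DELETION_KB = 50
MIN_DUPLICATION_KB = 150

# (case, kind, start_bp, end_bp) as reported in the abstract.
SEGMENTS = [
    ("case 1", "deletion", 387_912, 11_506_771),
    ("case 1", "duplication", 11_508_387, 43_321_279),
    ("case 2", "deletion", 46_385, 7_809_878),
    ("case 2", "duplication", 12_260_914, 40_917_092),
]


def segment_size_mb(start_bp: int, end_bp: int) -> float:
    """Segment length in megabases."""
    return (end_bp - start_bp) / 1_000_000


def passes_size_filter(kind: str, start_bp: int, end_bp: int) -> bool:
    """Apply the deletion/duplication size thresholds from the abstract."""
    if kind not in ("deletion", "duplication"):
        raise ValueError(f"unknown CNV kind: {kind}")
    size_kb = (end_bp - start_bp) / 1000
    threshold = MIN_DELETION_KB if kind == "deletion" else MIN_DUPLICATION_KB
    return size_kb > threshold


if __name__ == "__main__":
    for case, kind, start, end in SEGMENTS:
        size = round(segment_size_mb(start, end), 1)
        print(case, kind, f"{size} Mb", passes_size_filter(kind, start, end))
```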

  1. Resequencing and Analysis of Variation in the TCF7L2 Gene in African Americans Suggests That SNP rs7903146 Is the Causal Diabetes Susceptibility Variant

    PubMed Central

    Palmer, Nicholette D.; Hester, Jessica M.; An, S. Sandy; Adeyemo, Adebowale; Rotimi, Charles; Langefeld, Carl D.; Freedman, Barry I.; Ng, Maggie C.Y.; Bowden, Donald W.

    2011-01-01

    OBJECTIVE Variation in the transcription factor 7-like 2 (TCF7L2) locus is associated with type 2 diabetes across multiple ethnicities. The aim of this study was to elucidate which variant in TCF7L2 confers diabetes susceptibility in African Americans. RESEARCH DESIGN AND METHODS Through the evaluation of tagging single nucleotide polymorphisms (SNPs), type 2 diabetes susceptibility was limited to a 4.3-kb interval, which contains the YRI (African) linkage disequilibrium (LD) block containing rs7903146. To better define the relationship between type 2 diabetes risk and genetic variation we resequenced this 4.3-kb region in 96 African American DNAs. Thirty-three novel and 13 known SNPs were identified: 20 with minor allele frequencies (MAF) >0.05 and 12 with MAF >0.10. These polymorphisms and the previously identified DG10S478 microsatellite were evaluated in African American type 2 diabetic cases (n = 1,033) and controls (n = 1,106). RESULTS Variants identified from direct sequencing and databases were genotyped or imputed. Fifteen SNPs showed association with type 2 diabetes (P < 0.05) with rs7903146 being the most significant (P = 6.32 × 10⁻⁶). Results of imputation, haplotype, and conditional analysis of SNPs were consistent with rs7903146 being the trait-defining SNP. Analysis of the DG10S478 microsatellite, which is outside the 4.3-kb LD block, revealed consistent association of risk allele 8 with type 2 diabetes (odds ratio [OR] = 1.33; P = 0.022) as reported in European populations; however, allele 16 (MAF = 0.016 cases and 0.032 controls) was strongly associated with reduced risk (OR = 0.39; P = 5.02 × 10⁻⁵) in contrast with previous studies. CONCLUSIONS In African Americans, these observations suggest that rs7903146 is the trait-defining polymorphism associated with type 2 diabetes risk. Collectively, these results support ethnic differences in type 2 diabetes associations. PMID:20980453

  2. Associations between novel single nucleotide polymorphisms in the Bos taurus growth hormone gene and performance traits in Holstein-Friesian dairy cattle.

    PubMed

    Mullen, M P; Berry, D P; Howard, D J; Diskin, M G; Lynch, C O; Berkowicz, E W; Magee, D A; MacHugh, D E; Waters, S M

    2010-12-01

    Growth hormone, produced in the anterior pituitary gland, stimulates the release of insulin-like growth factor-I from the liver and is of critical importance in the control of nutrient utilization and partitioning for lactogenesis, fertility, growth, and development in cattle. The aim of this study was to discover novel polymorphisms in the bovine growth hormone gene (GH1) and to quantify their association with performance using estimates of genetic merit on 848 Holstein-Friesian AI (artificial insemination) dairy sires. Associations with previously reported polymorphisms in the bovine GH1 gene were also undertaken. A total of 38 novel single nucleotide polymorphisms (SNP) were identified across a panel of 22 beef and dairy cattle by sequence analysis of the 5' promoter, intronic, exonic, and 3' regulatory regions, encompassing approximately 7 kb of the GH1 gene. Following multiple regression analysis on all SNP, associations were identified between 11 SNP (2 novel and 9 previously identified) and milk fat and protein yield, milk composition, somatic cell score, survival, body condition score, and body size. The G allele of a previously identified SNP in exon 5 at position 2141 of the GH1 sequence, resulting in a nonsynonymous substitution, was associated with decreased milk protein yield. The C allele of a novel SNP, GH32, was associated with inferior carcass conformation. In addition, the T allele of a previously characterized SNP, GH35, was associated with decreased survival. Both GH24 (novel) and GH35 were independently associated with somatic cell count, and 3 SNP, GH21, 2291, and GH35, were independently associated with body depth. Furthermore, 2 SNP, GH24 and GH63, were independently associated with carcass fat. Results of this study further demonstrate the multifaceted influences of GH1 on milk production, fertility, and growth-related traits in cattle. Copyright © 2010 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  3. Makeup of the genetic correlation between milk production traits using genome-wide single nucleotide polymorphism information.

    PubMed

    van Binsbergen, R; Veerkamp, R F; Calus, M P L

    2012-04-01

    The correlated responses between traits may differ depending on the makeup of genetic covariances, and may differ from the predictions of polygenic covariances. Therefore, the objective of the present study was to investigate the makeup of the genetic covariances between the well-studied traits: milk yield, fat yield, protein yield, and their percentages in more detail. Phenotypic records of 1,737 heifers of research farms in 4 different countries were used after homogenizing and adjusting for management effects. All cows had a genotype for 37,590 single nucleotide polymorphisms (SNP). A bayesian stochastic search variable selection model was used to estimate the SNP effects for each trait. About 0.5 to 1.0% of the SNP had a significant effect on 1 or more traits; however, the SNP without a significant effect explained most of the genetic variances and covariances of the traits. Single nucleotide polymorphism correlations differed from the polygenic correlations, but only 10 regions were found with an effect on multiple traits; in 1 of these regions the DGAT1 gene was previously reported with an effect on multiple traits. This region explained up to 41% of the variances of 4 traits and explained a major part of the correlation between fat yield and fat percentage and contributes to asymmetry in correlated response between fat yield and fat percentage. Overall, for the traits in this study, the infinitesimal model is expected to be sufficient for the estimation of the variances and covariances. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  4. Spectrum of sequence variations in the FANCA gene: an International Fanconi Anemia Registry (IFAR) study.

    PubMed

    Levran, Orna; Diotti, Raffaella; Pujara, Kanan; Batish, Sat D; Hanenberg, Helmut; Auerbach, Arleen D

    2005-02-01

    Fanconi anemia (FA) is an autosomal recessive disorder that is defined by cellular hypersensitivity to DNA cross-linking agents, and is characterized clinically by developmental abnormalities, progressive bone-marrow failure, and predisposition to leukemia and solid tumors. There is extensive genetic heterogeneity, with at least 11 different FA complementation groups. FA-A is the most common group, accounting for approximately 65% of all affected individuals. The mutation spectrum of the FANCA gene, located on chromosome 16q24.3, is highly heterogeneous. Here we summarize all sequence variations (mutations and polymorphisms) in FANCA described in the literature and listed in the Fanconi Anemia Mutation Database as of March 2004, and report 61 novel FANCA mutations identified in FA patients registered in the International Fanconi Anemia Registry (IFAR). Thirty-eight novel SNPs, previously unreported in the literature or in dbSNP, were also identified. We studied the segregation of common FANCA SNPs in FA families to generate haplotypes. We found that FANCA SNP data are highly useful for carrier testing, prenatal diagnosis, and preimplantation genetic diagnosis, particularly when the disease-causing mutations are unknown. Twenty-two large genomic deletions were identified by detection of apparent homozygosity for rare SNPs. In addition, a conserved SNP haplotype block spanning at least 60 kb of the FANCA gene was identified in individuals from various ethnic groups. (c) 2005 Wiley-Liss, Inc.

  5. Masking as an effective quality control method for next-generation sequencing data analysis.

    PubMed

    Yun, Sajung; Yun, Sijung

    2014-12-13

    Next generation sequencing produces base calls with low quality scores that can affect the accuracy of identifying simple nucleotide variation calls, including single nucleotide polymorphisms and small insertions and deletions. Here we compare the effectiveness of two data preprocessing methods, masking and trimming, and the accuracy of simple nucleotide variation calls on whole-genome sequence data from Caenorhabditis elegans. Masking substitutes low quality base calls with 'N's (undetermined bases), whereas trimming removes low quality bases, resulting in shorter read lengths. We demonstrate that masking is more effective than trimming in reducing the false-positive rate in single nucleotide polymorphism (SNP) calling. However, neither preprocessing method affected the false-negative rate in SNP calling with statistical significance compared to the data analysis without preprocessing. The false-positive and false-negative rates for small insertions and deletions did not differ between masking and trimming. We recommend masking over trimming as a more effective preprocessing method for next generation sequencing data analysis, since masking reduces the false-positive rate in SNP calling without sacrificing the false-negative rate, although trimming is currently more commonly used in the field. The perl script for masking is available at http://code.google.com/p/subn/. The sequencing data used in the study were deposited in the Sequence Read Archive (SRX450968 and SRX451773).
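
    The difference between the two preprocessing methods can be illustrated with a short Python sketch. This is not the authors' perl script: the quality threshold of 20, the function names, and the choice to trim only from the 3' end are assumptions made for illustration.

```python
def mask_read(seq: str, quals: list, min_q: int = 20) -> str:
    """Replace every base whose quality is below min_q with 'N'.

    The read keeps its original length.
    """
    return "".join(base if q >= min_q else "N" for base, q in zip(seq, quals))


def trim_read(seq: str, quals: list, min_q: int = 20) -> str:
    """Remove low-quality bases from the 3' end until a good base is found.

    Assumed trimming scheme; the read becomes shorter, and low-quality
    bases in the interior of the read are left untouched.
    """
    end = len(seq)
    while end > 0 and quals[end - 1] < min_q:
        end -= 1
    return seq[:end]


if __name__ == "__main__":
    seq = "ACGTAC"
    quals = [30, 10, 30, 30, 5, 8]  # made-up Phred scores
    print(mask_read(seq, quals))  # same length, low-quality calls become N
    print(trim_read(seq, quals))  # shorter read, interior low-quality C kept
```

    In this toy read the interior low-quality base survives trimming but not masking, which is one plausible reason masking could reduce false-positive SNP calls as the abstract reports.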

  6. Genome-Wide Mapping of Copy Number Variation in Humans: Comparative Analysis of High Resolution Array Platforms

    PubMed Central

    Haraksingh, Rajini R.; Abyzov, Alexej; Gerstein, Mark; Urban, Alexander E.; Snyder, Michael

    2011-01-01

    Accurate and efficient genome-wide detection of copy number variants (CNVs) is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH), Single Nucleotide Polymorphism (SNP) genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications. PMID:22140474

  7. A Variant in the BACH2 Gene Is Associated With Susceptibility to Autoimmune Addison's Disease in Humans.

    PubMed

    Pazderska, Agnieszka; Oftedal, Bergithe E; Napier, Catherine M; Ainsworth, Holly F; Husebye, Eystein S; Cordell, Heather J; Pearce, Simon H S; Mitchell, Anna L

    2016-11-01

    Autoimmune Addison's disease (AAD) is a rare but highly heritable condition. The BACH2 protein plays a crucial role in T lymphocyte maturation, and allelic variation in its gene has been associated with a number of autoimmune conditions. We aimed to determine whether alleles of the rs3757247 single nucleotide polymorphism (SNP) in the BACH2 gene are associated with AAD. This case-control association study was performed in two phases using Taqman chemistry. In the first phase, the rs3757247 SNP was genotyped in 358 UK AAD subjects and 166 local control subjects. Genotype data were also available from 5154 healthy UK controls from the Wellcome Trust (WTCCC2) for comparison. In the second phase, the SNP was genotyped in a validation cohort comprising 317 Norwegian AAD subjects and 365 controls. The frequency of the minor T allele was significantly higher in subjects with AAD from the United Kingdom compared to both the local and WTCCC2 control cohorts (58% vs 45 and 48%, respectively) (local controls, P = 1.1 × 10^-4; odds ratio [OR], 1.68; 95% confidence interval [CI], 1.29-2.18; WTCCC2 controls, P = 1.4 × 10^-6; OR, 1.44; 95% CI, 1.23-1.69). This finding was replicated in the Norwegian validation cohort (P = .0015; OR, 1.41; 95% CI, 1.14-1.75). Subgroup analysis showed that this association is present in subjects with both isolated AAD (OR, 1.53; 95% CI, 1.22-1.92) and autoimmune polyglandular syndrome type 2 (OR, 1.37; 95% CI, 1.12-1.69) in the UK cohort, and with autoimmune polyglandular syndrome type 2 in the Norwegian cohort (OR, 1.58; 95% CI, 1.22-2.06). We have demonstrated, for the first time, that allelic variability at the BACH2 locus is associated with susceptibility to AAD. Given its association with multiple autoimmune conditions, BACH2 can be considered a "universal" autoimmune susceptibility locus.
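
    As a rough check on how an allelic odds ratio relates to the allele frequencies quoted above, the following sketch recomputes the OR for UK cases versus local controls from the rounded percentages (58% vs 45%). The function name is hypothetical, and the calculation is a plain ratio of odds, not the model the authors fitted.

```python
def allele_odds_ratio(freq_cases, freq_controls):
    """Odds ratio for carrying an allele, given its frequency in cases and controls."""
    odds_cases = freq_cases / (1.0 - freq_cases)
    odds_controls = freq_controls / (1.0 - freq_controls)
    return odds_cases / odds_controls


# Rounded T-allele frequencies from the abstract: UK AAD cases vs local controls.
or_local = allele_odds_ratio(0.58, 0.45)
```

    The result is about 1.69, close to the reported OR of 1.68. Exact agreement is not expected, because the percentages in the abstract are rounded.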

  8. Genetic predisposition scores associate with muscular strength, size, and trainability.

    PubMed

    Thomaes, Tom; Thomis, Martine; Onkelinx, Steven; Goetschalckx, Kaatje; Fagard, Robert; Lambrechts, Diether; Vanhees, Luc

    2013-08-01

    The number of studies trying to identify genetic sequence variation related to muscular phenotypes has increased enormously. The aim of this study was to identify the role of a genetic predisposition score (GPS) based on earlier identified gene variants for different muscular endophenotypes to explain the individual differences in muscular fitness characteristics and the response to training in patients with coronary artery disease. Two hundred and sixty coronary artery disease patients followed a standard ambulatory, 3-month supervised training program for cardiac patients. Maximal knee extension strength (KES) and rectus femoris diameter were measured at baseline and after rehabilitation. Sixty-five single nucleotide polymorphisms (SNP) in 30 genes were selected based on genotype-phenotype association literature. Backward regression analysis revealed subsets of SNP associated with the different phenotypes. GPS were constructed for all sets of SNP by adding up the strength-increasing alleles. General linear models and multiple stepwise regression analysis were used to test the explained variance of the GPS in baseline and strength responses. Receiver operating characteristic curve analyses were performed to discriminate between high- and low-responder status. GPS were significantly associated with the rectus femoris diameter (P < 0.01) and its response (P < 0.0001), the isometric KES (P < 0.05) and its response (P < 0.01), the isokinetic KES at 60°/s (P < 0.05) and 180°/s (P < 0.001) and their responses to training (P < 0.0001), and the isokinetic KES endurance (P < 0.001) and its change after training (P < 0.0001). The GPS was shown to be an independent determinant in baseline and response phenotypes with partial explained variance up to 23%. Receiver operating characteristic analysis showed a significant discriminating accuracy of the models, including the GPS for responses to training, with areas under the curve ranging from 0.62 to 0.85. 
GPS for muscular phenotypes were shown to be associated with baseline KES, muscle diameter, and the response to training in cardiac rehabilitation patients.
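
    The score construction described in this abstract ("adding up the strength-increasing alleles") amounts to an unweighted allele count over a chosen SNP subset. A minimal sketch follows; the SNP identifiers and the genotype coding (0, 1 or 2 copies of the strength-increasing allele) are hypothetical placeholders, not the study's data.

```python
def genetic_predisposition_score(genotypes, snp_set):
    """Sum the number of strength-increasing alleles (0, 1 or 2) over the SNPs in snp_set."""
    return sum(genotypes[snp] for snp in snp_set)


# Hypothetical patient: copies of the strength-increasing allele at each SNP.
patient = {"snp_a": 2, "snp_b": 0, "snp_c": 1, "snp_d": 1}

# Hypothetical subset of SNPs retained for one phenotype (e.g. after backward regression).
strength_set = ["snp_a", "snp_b", "snp_c"]

gps = genetic_predisposition_score(patient, strength_set)
```

    With n SNPs in a set, the score ranges from 0 to 2n; SNPs outside the selected subset (here snp_d) do not contribute.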

  9. Whole genome sequencing of Brucella melitensis isolated from 57 patients in Germany reveals high diversity in strains from Middle East.

    PubMed

    Georgi, Enrico; Walter, Mathias C; Pfalzgraf, Marie-Theres; Northoff, Bernd H; Holdt, Lesca M; Scholz, Holger C; Zoeller, Lothar; Zange, Sabine; Antwerpen, Markus H

    2017-01-01

    Brucellosis, a worldwide common bacterial zoonotic disease, has become quite rare in Northern and Western Europe. However, since 2014 a significant increase of imported infections caused by Brucella (B.) melitensis has been noticed in Germany. Patients predominantly originated from the Middle East including Turkey and Syria. These circumstances afforded an opportunity to gain insights into the population structure of Brucella strains. Brucella isolates from 57 patients with culture-confirmed brucellosis were recovered between January 2014 and June 2016 by the National Consultant Laboratory for Brucella. Their whole genome sequences were generated using the Illumina MiSeq platform. A whole genome-based SNP typing assay was developed in order to resolve geographically attributed genetic clusters. Results were compared to MLVA typing results, the current gold-standard of Brucella typing. In addition, sequences were examined for possible genetic variation within target regions of molecular diagnostic assays. Phylogenetic analyses revealed spatial clustering and distinguished strains from different patients in either case, whereas multiple isolates from a single patient or technical replicates showed identical SNP and MLVA profiles. By including WGS data from the NCBI database, five major genotypes were identified. Notably, strains originating from Turkey showed a high diversity and grouped into seven subclusters of genotype II. MLVA analysis congruently clustered all isolates and predominantly matched the East Mediterranean genetic clade. This study confirms whole-genome based SNP-analysis as a powerful tool for accurate typing of B. melitensis. Furthermore, it allows spatial allocation and therefore provides useful information on the geographic origin for trace-back analysis. However, the lack of reliable metadata in public databases often prevents a resolution below geographic regions or country levels and corresponding precise trace-back analysis. 
Once this obstacle is resolved, WGS-derived bacterial typing adds an important method to complement epidemiological surveys during outbreak investigations. This is the first report of a detailed genetic investigation of an extensive collection of B. melitensis strains isolated from human cases in Germany.

  10. Multiple nitrogen isotope recorders for surface ocean nitrate utilization in the Subarctic North Pacific and the Bering Sea

    NASA Astrophysics Data System (ADS)

    Ren, H. A.; Anderson, R.; Sigman, D. M.; Studer, A.; Winckler, G.; Haugh, G.; Serno, S.; Gersonde, R.

    2017-12-01

    Sedimentary nitrogen isotopes have been developed as a proxy to reconstruct the degree of nitrate utilization in the polar surface oceans. But its application could be compromised by 1) uncertainties on the biological production, transport, and preservation of the organic material in the sediments, and 2) potential changes in the isotopic composition of the nitrate source, which is remotely controlled by processes in other regions. In this study, we map and compare spatial patterns of three d15N recorders (bulk sedimentary nitrogen, the organic nitrogen within cleaned diatom frustules or diatom-bound N, and within planktonic foraminifera tests or foraminifera-bound N) from multicore surface sediments across the Subarctic North Pacific (SNP) and the Bering Sea between 60°N and 35°N. Diatom-bound d15N varies between 3.5 and 8.5‰. Its spatial variation is inversely correlated with changes in the surface nitrate concentration, and is consistent with the expected d15N change of the export production in a simple nitrate assimilation model. Similar to previous findings, diatom-bound d15N is generally 2-4‰ higher than the modeled d15N value of the export production, likely reflecting a biomass to frustule-bound N difference. However, the greater d15N elevation observed in the eastern open SNP may be best explained by lateral transport of residual surface nitrate enriched in 15N from the western SNP. The d15N of Neogloboquadrina pachyderma (sinistral) is similar to the diatom-bound d15N within 1‰. Bulk sedimentary d15N generally agrees with diatom-bound d15N, but is more variable. It is higher than diatom-bound d15N in the eastern and western transect close to the shelf area, likely reflecting a terrigenous source, while exceptionally low d15N values were found on the Bering Sea shelf, possibly due to contamination by mineral-associated inorganic N.

  11. Accumulating evidence for a role of TCF7L2 variants in bipolar disorder with elevated body mass index.

    PubMed

    Cuellar-Barboza, Alfredo B; Winham, Stacey J; McElroy, Susan L; Geske, Jennifer R; Jenkins, Gregory D; Colby, Colin L; Prieto, Miguel L; Ryu, Euijung; Cunningham, Julie M; Frye, Mark A; Biernacka, Joanna M

    2016-03-01

    Bipolar disorder (BD) is a complex disease associated with various hereditary traits, including a higher body mass index (BMI). In a prior genome-wide association study, we found that BMI modified the association of rs12772424 - a common variant in the gene encoding transcription factor 7-like 2 (TCF7L2) - with risk for BD. TCF7L2 is a transcription factor in the canonical Wnt pathway, involved in multiple disorders, including diabetes, cancer and psychiatric conditions. Here, using an independent sample, we evaluated 26 TCF7L2 single nucleotide polymorphisms (SNPs) to explore further the association of BD with the TCF7L2-BMI interaction. Using a sample of 662 BD cases and 616 controls, we conducted SNP-level and gene-level tests to assess the evidence for an association between BD and the interaction of BMI and genetic variation in TCF7L2. We also explored the potential mechanism behind the detected associations using human brain expression quantitative trait loci (eQTL) analysis. The analysis provided independent evidence of an rs12772424-BMI interaction (p = 0.011). Furthermore, while overall there was no evidence for SNP marginal effects on BD, the TCF7L2-BMI interaction was significant at the gene level (p = 0.042), with seven of the 26 SNPs showing SNP-BMI interaction effects with p < 0.05. The strongest evidence of interaction was observed for rs7895307 (p = 0.006). TCF7L2 expression showed a significant enrichment of association with the expression of other genes in the Wnt canonical pathway. The current study provides further evidence suggesting that TCF7L2 involvement in BD risk may be regulated by BMI. Detailed, prospective assessment of BMI, comorbidity, and other possible contributing factors is necessary to explain fully the mechanisms underlying this association. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  12. Xenobiotic metabolizing gene variants, pesticide use, and risk of prostate cancer

    PubMed Central

    Koutros, Stella; Andreotti, Gabriella; Berndt, Sonja I.; Barry, Kathryn Hughes; Lubin, Jay H.; Hoppin, Jane A.; Kamel, Freya; Sandler, Dale P.; Burdette, Laurie A.; Yuenger, Jeffrey; Yeager, Meredith; Alavanja, Michael C.R.; Beane Freeman, Laura E.

    2011-01-01

    Background To explore associations with prostate cancer and farming, it is important to investigate the relationship between pesticide use and single nucleotide polymorphisms (SNPs) in xenobiotic metabolic enzyme (XME) genes. Objectives We evaluated pesticide-SNP interactions between 45 pesticides and 1,913 XME SNPs with respect to prostate cancer among 776 cases and 1,444 controls in the Agricultural Health Study. Methods We used unconditional logistic regression to estimate odds ratios (ORs) and 95% confidence intervals (CIs). Multiplicative SNP-pesticide interactions were calculated using a likelihood ratio test. Results A positive monotonic interaction was observed between petroleum oil/petroleum distillate use and rs1883633 in the oxidative stress gene glutamate-cysteine ligase (GCLC) (p-interaction=1.0×10−4); men carrying at least one variant allele (minor allele) experienced an increased prostate cancer risk (OR=3.7, 95% CI: 1.9–7.3). Among men carrying the variant allele for thioredoxin reductase 2 (TXNRD2) rs4485648, microsomal epoxide hydrolase 1 (EPHX1) rs17309872, or myeloperoxidase (MPO) rs11079344, increased prostate cancer risk was observed with high compared to no petroleum oil/petroleum distillate (OR=1.9, 95% CI: 1.1–3.2, p-interaction=0.01), (OR=2.1, 95% CI: 1.1–4.0, p-interaction=0.01), or terbufos (OR=3.0, 95% CI: 1.5–6.0, p-interaction=2.0×10−3) use, respectively. No interactions were deemed noteworthy at the false discovery rate = 0.20 level; the number of observed interactions in XMEs was comparable to the number expected by chance alone. Conclusions We observed several pesticide-SNP interactions in oxidative stress and phase I/phase II enzyme genes and risk of prostate cancer. Additional work is needed to explain the joint contribution of genetic variation in XMEs, pesticide use, and prostate cancer risk. PMID:21716162

  13. Common genetic variants in the 9p21 region and their associations with multiple tumours.

    PubMed

    Gu, F; Pfeiffer, R M; Bhattacharjee, S; Han, S S; Taylor, P R; Berndt, S; Yang, H; Sigurdson, A J; Toro, J; Mirabello, L; Greene, M H; Freedman, N D; Abnet, C C; Dawsey, S M; Hu, N; Qiao, Y-L; Ding, T; Brenner, A V; Garcia-Closas, M; Hayes, R; Brinton, L A; Lissowska, J; Wentzensen, N; Kratz, C; Moore, L E; Ziegler, R G; Chow, W-H; Savage, S A; Burdette, L; Yeager, M; Chanock, S J; Chatterjee, N; Tucker, M A; Goldstein, A M; Yang, X R

    2013-04-02

    The chromosome 9p21.3 region has been implicated in the pathogenesis of multiple cancers. We systematically examined up to 203 tagging SNPs of 22 genes on 9p21.3 (19.9-32.8 Mb) in eight case-control studies: thyroid cancer, endometrial cancer (EC), renal cell carcinoma, colorectal cancer (CRC), colorectal adenoma (CA), oesophageal squamous cell carcinoma (ESCC), gastric cardia adenocarcinoma and osteosarcoma (OS). We used logistic regression to perform single SNP analyses for each study separately, adjusting for study-specific covariates. We combined SNP results across studies by fixed-effect meta-analyses and a newly developed subset-based statistical approach (ASSET). Gene-based P-values were obtained by the minP method using the Adaptive Rank Truncated Product program. We adjusted for multiple comparisons by Bonferroni correction. Rs3731239 in cyclin-dependent kinase inhibitors 2A (CDKN2A) was significantly associated with ESCC (P=7 × 10(-6)). The CDKN2A-ESCC association was further supported by gene-based analyses (Pgene=0.0001). In the meta-analyses by ASSET, four SNPs (rs3731239 in CDKN2A, rs615552 and rs573687 in CDKN2B and rs564398 in CDKN2BAS) showed significant associations with ESCC and EC (P<2.46 × 10(-4)). One SNP in MTAP (methylthioadenosine phosphorylase) (rs7023329) that was previously associated with melanoma and nevi in multiple genome-wide association studies was associated with CRC, CA and OS by ASSET (P=0.007). Our data indicate that genetic variants in CDKN2A, and possibly nearby genes, may be associated with ESCC and several other tumours, further highlighting the importance of 9p21.3 genetic variants in carcinogenesis.
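
    The Bonferroni adjustment mentioned in this abstract can be illustrated with a short calculation. The sketch assumes a family-wise alpha of 0.05 divided by the 203 tagging SNPs; the abstract does not state how its threshold was derived, but 0.05/203 is approximately 2.46 × 10^-4, which agrees with the reported cut-off. Function names are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under Bonferroni correction."""
    return alpha / n_tests


# Assumed inputs: family-wise alpha of 0.05 and 203 tagging SNPs.
threshold = bonferroni_threshold(0.05, 203)


def is_significant(p_value):
    """True if a single-SNP P-value passes the Bonferroni-adjusted threshold."""
    return p_value < threshold
```

    Under this threshold, the reported P = 7 × 10^-6 for rs3731239 and ESCC would pass, since it is well below 2.46 × 10^-4.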

  14. Adaptive testing for multiple traits in a proportional odds model with applications to detect SNP-brain network associations.

    PubMed

    Kim, Junghi; Pan, Wei

    2017-04-01

    There has been increasing interest in developing more powerful and flexible statistical tests to detect genetic associations with multiple traits, as arising from neuroimaging genetic studies. Most existing methods treat a single trait or multiple traits as response while treating an SNP as a predictor coded under an additive inheritance mode. In this paper, we follow an earlier approach in treating an SNP as an ordinal response while treating traits as predictors in a proportional odds model (POM). In this way, it is not only easier to handle mixed types of traits, e.g., some quantitative and some binary, but it is also potentially more robust to the commonly adopted additive inheritance mode. More importantly, we develop an adaptive test in a POM so that it can maintain high power across many possible situations. Compared to the existing methods treating multiple traits as responses, e.g., in a generalized estimating equation (GEE) approach, the proposed method can be applied to a high dimensional setting where the number of phenotypes (p) can be larger than the sample size (n), in addition to a usual small p setting. The promising performance of the proposed method was demonstrated with applications to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data, in which either structural MRI driven phenotypes or resting-state functional MRI (rs-fMRI) derived brain functional connectivity measures were used as phenotypes. The applications led to the identification of several top SNPs of biological interest. Furthermore, simulation studies showed competitive performance of the new method, especially for p>n. © 2017 WILEY PERIODICALS, INC.

  15. The utility of low-density genotyping for imputation in the Thoroughbred horse

    PubMed Central

    2014-01-01

    Background Despite the dramatic reduction in the cost of high-density genotyping that has occurred over the last decade, it remains one of the limiting factors for obtaining the large datasets required for genomic studies of disease in the horse. In this study, we investigated the potential for low-density genotyping and subsequent imputation to address this problem. Results Using the haplotype phasing and imputation program, BEAGLE, it is possible to impute genotypes from low- to high-density (50K) in the Thoroughbred horse with reasonable to high accuracy. Analysis of the sources of variation in imputation accuracy revealed dependence both on the minor allele frequency of the single nucleotide polymorphisms (SNPs) being imputed and on the underlying linkage disequilibrium structure. Whereas equidistant spacing of the SNPs on the low-density panel worked well, optimising SNP selection to increase their minor allele frequency was advantageous, even when the panel was subsequently used in a population of different geographical origin. Replacing base pair position with linkage disequilibrium map distance reduced the variation in imputation accuracy across SNPs. Whereas a 1K SNP panel was generally sufficient to ensure that more than 80% of genotypes were correctly imputed, other studies suggest that a 2K to 3K panel is more efficient to minimize the subsequent loss of accuracy in genomic prediction analyses. The relationship between accuracy and genotyping costs for the different low-density panels, suggests that a 2K SNP panel would represent good value for money. Conclusions Low-density genotyping with a 2K SNP panel followed by imputation provides a compromise between cost and accuracy that could promote more widespread genotyping, and hence the use of genomic information in horses. 
In addition to offering a low cost alternative to high-density genotyping, imputation provides a means to combine datasets from different genotyping platforms, which is becoming necessary since researchers are starting to use the recently developed equine 70K SNP chip. However, more work is needed to evaluate the impact of between-breed differences on imputation accuracy. PMID:24495673

  16. The effect of rare alleles on estimated genomic relationships from whole genome sequence data.

    PubMed

    Eynard, Sonia E; Windig, Jack J; Leroy, Grégoire; van Binsbergen, Rianne; Calus, Mario P L

    2015-03-12

    Relationships between individuals and inbreeding coefficients are commonly used for breeding decisions, but may be affected by the type of data used for their estimation. The proportion of variants with low Minor Allele Frequency (MAF) is larger in whole genome sequence (WGS) data compared to Single Nucleotide Polymorphism (SNP) chips. Therefore, WGS data provide true relationships between individuals and may influence breeding decisions and prioritisation for conservation of genetic diversity in livestock. This study identifies differences between relationships and inbreeding coefficients estimated using pedigree, SNP or WGS data for 118 Holstein bulls from the 1000 Bull genomes project. To determine the impact of rare alleles on the estimates we compared three scenarios of MAF restrictions: variants with a MAF higher than 5%, variants with a MAF higher than 1% and variants with a MAF between 1% and 5%. We observed significant differences between estimated relationships and, although less significantly, inbreeding coefficients from pedigree, SNP or WGS data, and between MAF restriction scenarios. Computed correlations between pedigree and genomic relationships, within groups with similar relationships, ranged from negative to moderate for both estimated relationships and inbreeding coefficients, but were high between estimates from SNP and WGS (0.49 to 0.99). Estimated relationships from genomic information exhibited higher variation than from pedigree. Inbreeding coefficients analysis showed that more complete pedigree records lead to higher correlation between inbreeding coefficients from pedigree and genomic data. Finally, estimates and correlations between additive genetic (A) and genomic (G) relationship matrices were lower, and variances of the relationships were larger when accounting for allele frequencies than without accounting for allele frequencies. 
Using pedigree data or genomic information, and including or excluding variants with a MAF below 5% showed significant differences in relationship and inbreeding coefficient estimates. Estimated relationships and inbreeding coefficients are the basis for selection decisions. Therefore, it can be expected that using WGS instead of SNP can affect selection decision. Inclusion of rare variants will give access to the variation they carry, which is of interest for conservation of genetic diversity.
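
    The three MAF restriction scenarios compared in this abstract can be expressed as simple filters over variant allele frequencies. The sketch below is illustrative only: the variant names and frequencies are invented, and treating the 5% boundary as inclusive in the "between 1% and 5%" scenario is an assumption the abstract does not specify.

```python
def minor_allele_frequency(alt_allele_freq):
    """MAF is the frequency of the less common of the two alleles."""
    return min(alt_allele_freq, 1.0 - alt_allele_freq)


# The three scenarios from the abstract; boundary handling is an assumption.
SCENARIOS = {
    "maf_gt_5pct": lambda maf: maf > 0.05,
    "maf_gt_1pct": lambda maf: maf > 0.01,
    "maf_1_to_5pct": lambda maf: 0.01 < maf <= 0.05,
}


def filter_variants(alt_freqs, scenario):
    """Return the variant IDs whose MAF satisfies the chosen scenario."""
    keep = SCENARIOS[scenario]
    return [v for v, f in alt_freqs.items() if keep(minor_allele_frequency(f))]


# Hypothetical alternate-allele frequencies for four variants.
variants = {"v1": 0.30, "v2": 0.97, "v3": 0.004, "v4": 0.08}
```

    Note that v2 has an alternate-allele frequency of 0.97 but a MAF of about 0.03, so it falls in the rare (1% to 5%) class, and v3 is excluded by every scenario.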

  17. Identification of mitochondrial DNA sequence variation and development of single nucleotide polymorphic markers for CMS-D8 in cotton.

    PubMed

    Suzuki, Hideaki; Yu, Jiwen; Wang, Fei; Zhang, Jinfa

    2013-06-01

    Cytoplasmic male sterility (CMS), which is a maternally inherited trait and controlled by novel chimeric genes in the mitochondrial genome, plays a pivotal role in the production of hybrid seed. In cotton, no PCR-based marker has been developed to discriminate CMS-D8 (from Gossypium trilobum) from its normal Upland cotton (AD1, Gossypium hirsutum) cytoplasm. The objective of the current study was to develop PCR-based single nucleotide polymorphic (SNP) markers from mitochondrial genes for the CMS-D8 cytoplasm. DNA sequence variation in mitochondrial genes involved in the oxidative phosphorylation chain including ATP synthase subunit 1, 4, 6, 8 and 9, and cytochrome c oxidase 1, 2 and 3 subunits were identified by comparing CMS-D8, its isogenic maintainer and restorer lines on the same nuclear genetic background. An allelic specific PCR (AS-PCR) was utilized for SNP typing by incorporating artificial mismatched nucleotides into the third or fourth base from the 3' terminus in both the specific and nonspecific primers. The result indicated that the method modifying allele-specific primers was successful in obtaining eight SNP markers out of eight SNPs using eight primer pairs to discriminate two alleles between AD1 and CMS-D8 cytoplasms. Two of the SNPs for atp1 and cox1 could also be used in combination to discriminate between CMS-D8 and CMS-D2 cytoplasms. Additionally, a PCR-based marker from a nine nucleotide insertion-deletion (InDel) sequence (AATTGTTTT) at the 59-67 bp positions from the start codon of atp6, which is present in the CMS and restorer lines with the D8 cytoplasm but absent in the maintainer line with the AD1 cytoplasm, was also developed. A SNP marker for two nucleotide substitutions (AA in AD1 cytoplasm to CT in CMS-D8 cytoplasm) in the intron (1,506 bp) of cox2 gene was also developed. 
These PCR-based SNP markers should be useful in discriminating CMS-D8 and AD1 cytoplasms, or those with CMS-D2 cytoplasm as a rapid, simple, inexpensive, and reliable genotyping tool to assist hybrid cotton breeding.

  18. Validated context-dependent associations of coronary heart disease risk with genotype variation in the chromosome 9p21 region: the Atherosclerosis Risk in Communities study

    PubMed Central

    Lusk, Christine M.; Dyson, Greg; Clark, Andrew G.; Ballantyne, Christie M.; Frikke-Schmidt, Ruth; Tybjærg-Hansen, Anne; Boerwinkle, Eric

    2014-01-01

    Markers of the chromosome 9p21 region are regarded as the strongest and most reliably significant genome-wide association study (GWAS) signals for Coronary heart disease (CHD) risk; this was recently confirmed by the CARDIoGRAMplusC4D Consortium meta-analysis. However, while these associations are significant at the population level, they may not be clinically relevant predictors of risk for all individuals. We describe here the results of a study designed to address the question: What is the contribution of context defined by traditional risk factors in determining the utility of DNA sequence variations marking the 9p21 region for explaining variation in CHD risk? We analyzed a sample of 7,589 (3,869 females and 3,720 males) European American participants of the Atherosclerosis Risk in Communities study. We confirmed CHD-SNP genotype associations for two 9p21 region marker SNPs previously identified by the CARDIoGRAMplusC4D Consortium study, of which ARIC was a part. We then tested each marker SNP genotype effect on prediction of CHD within sub-groups of the ARIC sample defined by traditional CHD risk factors by applying a novel multi-model strategy, PRIM. We observed that the effects of SNP genotypes in the 9p21 region were strongest in a subgroup of hypertensives. We subsequently validated the effect of the region in an independent sample from the Copenhagen City Heart Study. Our study suggests that marker SNPs identified as predictors of CHD risk in large population based GWAS may have their greatest utility in explaining risk of disease in particular sub-groups characterized by biological and environmental effects measured by the traditional CHD risk factors. PMID:24889828

  19. Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data

    PubMed Central

    Degner, Jacob F.; Marioni, John C.; Pai, Athma A.; Pickrell, Joseph K.; Nkadori, Everlyne; Gilad, Yoav; Pritchard, Jonathan K.

    2009-01-01

    Motivation: Next-generation sequencing has become an important tool for genome-wide quantification of DNA and RNA. However, a major technical hurdle lies in the need to map short sequence reads back to their correct locations in a reference genome. Here, we investigate the impact of SNP variation on the reliability of read-mapping in the context of detecting allele-specific expression (ASE). Results: We generated 16 million 35 bp reads from mRNA of each of two HapMap Yoruba individuals. When we mapped these reads to the human genome we found that, at heterozygous SNPs, there was a significant bias toward higher mapping rates of the allele in the reference sequence, compared with the alternative allele. Masking known SNP positions in the genome sequence eliminated the reference bias but, surprisingly, did not lead to more reliable results overall. We find that even after masking, ∼5–10% of SNPs still have an inherent bias toward more effective mapping of one allele. Filtering out inherently biased SNPs removes 40% of the top signals of ASE. The remaining SNPs showing ASE are enriched in genes previously known to harbor cis-regulatory variation or known to show uniparental imprinting. Our results have implications for a variety of applications involving detection of alternate alleles from short-read sequence data. Availability: Scripts, written in Perl and R, for simulating short reads, masking SNP variation in a reference genome and analyzing the simulation output are available upon request from JFD. Raw short read data were deposited in GEO (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE18156. Contact: jdegner@uchicago.edu; marioni@uchicago.edu; gilad@uchicago.edu; pritch@uchicago.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:19808877

  20. A possible genetic association with chronic fatigue in primary Sjögren's syndrome: a candidate gene study.

    PubMed

    Norheim, Katrine Brække; Le Hellard, Stephanie; Nordmark, Gunnel; Harboe, Erna; Gøransson, Lasse; Brun, Johan G; Wahren-Herlenius, Marie; Jonsson, Roland; Omdal, Roald

    2014-02-01

    Fatigue is prevalent and disabling in primary Sjögren's syndrome (pSS). Results from studies in chronic fatigue syndrome (CFS) indicate that genetic variation may influence fatigue. The aim of this study was to investigate single nucleotide polymorphism (SNP) variations in pSS patients with high and low fatigue. A panel of 85 SNPs in 12 genes was selected based on previous studies in CFS. A total of 207 pSS patients and 376 healthy controls were genotyped. One-hundred and ninety-three patients and 70 SNPs in 11 genes were available for analysis after quality control. Patients were dichotomized based on fatigue visual analogue scale (VAS) scores, with VAS <50 denominated "low fatigue" (n = 53) and VAS ≥50 denominated "high fatigue" (n = 140). We detected signals of association with pSS for one SNP in SLC25A40 (unadjusted p = 0.007) and two SNPs in PKN1 (both p = 0.03) in our pSS case versus control analysis. The association with SLC25A40 was stronger when only pSS high fatigue patients were analysed versus controls (p = 0.002). One SNP in PKN1 displayed an association in the case-only analysis of pSS high fatigue versus pSS low fatigue (p = 0.005). This candidate gene study in pSS did reveal a trend for associations between genetic variation in candidate genes and fatigue. The results will need to be replicated. More research on genetic associations with fatigue is warranted, and future trials should include larger cohorts and multicentre collaborations with sharing of genetic material to increase the statistical power.

  1. Genetic variation in C-reactive protein (CRP) gene may be associated with risk of systemic lupus erythematosus and CRP concentrations.

    PubMed

    Shih, P Betty; Manzi, Susan; Shaw, Penny; Kenney, Margaret; Kao, Amy H; Bontempo, Franklin; Barmada, M Michael; Kammerer, Candace; Kamboh, M Ilyas

    2008-11-01

    The gene coding for C-reactive protein (CRP) is located on chromosome 1q23.2, which falls within a linkage region thought to harbor a systemic lupus erythematosus (SLE) susceptibility gene. Recently, 2 single-nucleotide polymorphisms (SNP) in the CRP gene (+838, +2043) have been shown to be associated with CRP concentrations and/or SLE risk in a British family-based cohort. Our study was done to confirm the reported association in an independent population-based case-control cohort, and also to investigate the influence of 3 additional CRP tagSNP (-861, -390, +90) on SLE risk and serum CRP concentrations. DNA from 337 Caucasian women who met the American College of Rheumatology criteria for definite (n = 324) or probable (n = 13) SLE and 448 Caucasian healthy female controls was genotyped for 5 CRP tagSNP (-861, -390, +90, +838, +2043). Genotyping was performed using restriction fragment length polymorphism-polymerase chain reaction, pyrosequencing, or TaqMan assays. Serum CRP levels were measured using ELISA. Association studies were performed using the chi-squared distribution, Z-test, Fisher's exact test, and analysis of variance. Haplotype analysis was performed using EH software and the haplo.stats package in R 2.1.2. While none of the SNP were found to be associated with SLE risk individually, there was an association with the 5 SNP haplotypes (p < 0.001). Three SNP (-861, -390, +90) were found to significantly influence serum CRP level in SLE cases, both independently and as haplotypes. Our data suggest that unique haplotype combinations in the CRP gene may modify the risk of developing SLE and influence circulating CRP levels.

  2. Loss-of-function DNA sequence variant in the CLCNKA chloride channel implicates the cardio-renal axis in interindividual heart failure risk variation.

    PubMed

    Cappola, Thomas P; Matkovich, Scot J; Wang, Wei; van Booven, Derek; Li, Mingyao; Wang, Xuexia; Qu, Liming; Sweitzer, Nancy K; Fang, James C; Reilly, Muredach P; Hakonarson, Hakon; Nerbonne, Jeanne M; Dorn, Gerald W

    2011-02-08

    Common heart failure has a strong undefined heritable component. Two recent independent cardiovascular SNP array studies identified a common SNP at 1p36 in intron 2 of the HSPB7 gene as being associated with heart failure. HSPB7 resequencing identified other risk alleles but no functional gene variants. Here, we further show no effect of the HSPB7 SNP on cardiac HSPB7 mRNA levels or splicing, suggesting that the SNP marks the position of a functional variant in another gene. Accordingly, we used massively parallel platforms to resequence all coding exons of the adjacent CLCNKA gene, which encodes the Ka renal chloride channel (ClC-Ka). Of 51 exonic CLCNKA variants identified, one SNP (rs10927887, encoding Arg83Gly) was common, in linkage disequilibrium with the heart failure risk SNP in HSPB7, and associated with heart failure in two independent Caucasian referral populations (n = 2,606 and 1,168; combined P = 2.25 × 10⁻⁶). Individual genotyping of rs10927887 in the two study populations and a third independent heart failure cohort (combined n = 5,489) revealed an additive allele effect on heart failure risk that is independent of age, sex, and prior hypertension (odds ratio = 1.27 per allele copy; P = 8.3 × 10⁻⁷). Functional characterization of recombinant wild-type Arg83 and variant Gly83 ClC-Ka chloride channel currents revealed ≈ 50% loss-of-function of the variant channel. These findings identify a common, functionally significant genetic risk factor for Caucasian heart failure. The variant CLCNKA risk allele, telegraphed by linked variants in the adjacent HSPB7 gene, uncovers a previously overlooked genetic mechanism affecting the cardio-renal axis.

  3. Development and Validation of a High-Density SNP Genotyping Array for African Oil Palm.

    PubMed

    Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross

    2016-08-01

    High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 860 successful probes. Phenetic clustering analysis revealed that the array could distinguish between palms of different origins in a way consistent with pedigree records. Genome-wide linkage disequilibrium declined more slowly for the commercial populations (ranging from 120 kb at r² = 0.43 to 146 kb at r² = 0.50) when compared with the semi-wild populations (19.5 kb at r² = 0.22). Genetic fixation mapping comparing the semi-wild and commercial population identified 321 selective sweeps. A genome-wide association study (GWAS) detected a significant peak on chromosome 2 associated with the polygenic component of the shell thickness trait (based on the trait shell-to-fruit; S/F %) in tenera palms. Testing of a genomic selection model on the same trait resulted in good prediction accuracy (r = 0.65) with 42% of the S/F % variation explained. The first high-density SNP genotyping array for oil palm has been developed and shown to be robust for use in genetic studies and with potential for developing early trait prediction to shorten the oil palm breeding cycle. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.

  4. IL6R Variation Asp358Ala Is a Potential Modifier of Lung Function in Asthma

    PubMed Central

    Hawkins, Gregory A; Robinson, Mac B; Hastie, Annette T; Li, Xingnan; Li, Huashi; Moore, Wendy C; Howard, Timothy D; Busse, William W.; Erzurum, Serpil C.; Wenzel, Sally E.; Peters, Stephen P; Meyers, Deborah A; Bleecker, Eugene R

    2012-01-01

    Background The IL6R SNP rs4129267 has recently been identified as an asthma susceptibility locus in subjects of European ancestry but has not been characterized with respect to asthma severity. The SNP rs4129267 is in linkage disequilibrium (r² = 1) with the IL6R coding SNP rs2228145 (Asp358Ala). This IL6R coding change increases IL6 receptor shedding and promotes IL6 transsignaling. Objectives To evaluate the IL6R SNP rs2228145 with respect to asthma severity phenotypes. Methods The IL6R SNP rs2228145 was evaluated in subjects of European ancestry with asthma from the Severe Asthma Research Program (SARP). Lung function associations were replicated in the Collaborative Study on the Genetics of Asthma (CSGA) cohort. Serum soluble IL6 receptor (sIL6R) levels were measured in subjects from SARP. Immunohistochemistry was used to qualitatively evaluate IL6R protein expression in BAL cells and endobronchial biopsies. Results The minor C allele of IL6R SNP rs2228145 was associated with lower ppFEV1 in the SARP cohort (p=0.005), the CSGA cohort (p=0.008), and in combined cohort analysis (p=0.003). Additional associations with ppFVC, FEV1/FVC, and PC20 were observed. The rs2228145 C allele (Ala358) was more frequent in severe asthma phenotypic clusters. Elevated serum sIL6R was associated with lower ppFEV1 (p=0.02) and lower ppFVC (p=0.008) (N=146). IL6R protein expression was observed in BAL macrophages, airway epithelium, vascular endothelium, and airway smooth muscle. Conclusions The IL6R coding SNP rs2228145 (Asp358Ala) is a potential modifier of lung function in asthma and may identify subjects at risk for more severe asthma. IL6 transsignaling may have a pathogenic role in the lung. PMID:22554704

  5. Identification of a member of the catalase multigene family on wheat chromosome 7A associated with flour b* colour and biological significance of allelic variation.

    PubMed

    Li, Dora A; Walker, Esther; Francki, Michael G

    2015-12-01

    Carotenoids (especially lutein) are known to be the pigment source for flour b* colour in bread wheat. Flour b* colour variation is controlled by a quantitative trait locus (QTL) on wheat chromosome 7AL and one gene from the carotenoid pathway, phytoene synthase, was functionally associated with the QTL on 7AL in some, but not all, wheat genotypes. A SNP marker within a sequence similar to catalase (Cat3-A1snp) derived from full-length (FL) cDNA (AK332460), however, was consistently associated with the QTL on 7AL and implicated in regulating hydrogen peroxide (H2O2) to control carotenoid accumulation affecting flour b* colour. The number of catalase genes on chromosome 7AL was investigated in this study to identify which gene may be implicated in flour b* variation and two were identified through interrogation of the draft wheat genome survey sequence consisting of five exons and a further two members having eight exons identified through comparative analysis with the single catalase gene on rice chromosome 6, PCR amplification and sequencing. It was evident that the catalase genes on chromosome 7A had duplicated and diverged during evolution relative to its counterpart on rice chromosome 6. The detection of transcripts in seeds, the co-location with Cat3-A1snp marker and maximised alignment of FL-cDNA (AK332460) with cognate genomic sequence indicated that TaCat3-A1 was the member of the catalase gene family associated with flour b* colour variation. Re-sequencing identified three alleles from three wheat varieties, TaCat3-A1a, TaCat3-A1b and TaCat3-A1c, and their predicted protein identified differences in peroxisomal targeting signal tri-peptide domain in the carboxyl terminal end providing new insights into their potential role in regulating cellular H2O2 that contribute to flour b* colour variation.

  6. Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology.

    PubMed

    Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N; Kumar, Dibyendu

    2017-01-01

    RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. The RNA-seq experiment on bovine liver produced 1,071,144,072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indels per young bull. Following the removal of the indel mutations, a total of 1,953,804, 1,527,120 and 2,053,184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASP™) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASP™ SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments.
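
    The stringent filter mentioned in this abstract (at least 10 reads supporting the polymorphic base and a 100% SNP ratio) could be sketched roughly as follows. The abstract does not define "SNP ratio"; this sketch assumes it means the share of reads at a site that support the variant base. The record type and function name are hypothetical.

```python
from typing import NamedTuple


class SnpCall(NamedTuple):
    locus: str
    depth: int      # total reads covering the site
    alt_reads: int  # reads supporting the polymorphic base


def passes_stringent_filter(call: SnpCall,
                            min_alt_reads: int = 10,
                            min_ratio: float = 1.0) -> bool:
    """Keep a call only if enough reads support the variant base and the
    assumed SNP ratio (alt_reads / depth) reaches `min_ratio`."""
    if call.depth == 0:
        return False
    return (call.alt_reads >= min_alt_reads
            and call.alt_reads / call.depth >= min_ratio)


calls = [
    SnpCall("locus_a", depth=12, alt_reads=12),  # passes both criteria
    SnpCall("locus_b", depth=30, alt_reads=25),  # ratio below 100%
    SnpCall("locus_c", depth=8, alt_reads=8),    # too few supporting reads
]
kept = [c.locus for c in calls if passes_stringent_filter(c)]
```

    Under these assumptions only the first call survives, which mirrors how sharply the reported record counts drop once the stringent parameters are applied.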

  7. Physiological Study on Association between Nicotinamide N-Methyltransferase Gene Polymorphisms and Hyperlipidemia

    PubMed Central

    Zhu, Xiao-Juan; Lin, Ya-Jun; Chen, Wei; Wang, Ya-Hui; Qiu, Li-Qiang; Cai, Can-Xin; Xiong, Qun; Chen, Fei; Chen, Li-Hui; Zhou, Qiong

    2016-01-01

    Nicotinamide N-methyltransferase (NNMT) catalyzes the methylation of nicotinamide. Our previous works indicate that NNMT is involved in the body mass index and energy metabolism, and recently the association between a SNP (rs694539) of NNMT and a variety of cardiovascular diseases was reported. At present, more than 200 NNMT single nucleotide polymorphisms (SNPs) have been identified in the databases of the human genome projects; however, the association between rs694539 variation and hyperlipidemia has not been reported yet, and whether there are any SNPs in NNMT significantly associated with hyperlipidemia is still unclear. In this paper, we selected 19 SNPs in NNMT as the tagSNPs using Haploview software (Haploview 4.2) first and then performed a case-control study to observe the association between these tagSNPs and hyperlipidemia and finally applied physiological approaches to explore the possible mechanisms through which the NNMT polymorphism induces hyperlipidemia. The results show that a SNP (rs1941404) in NNMT is significantly associated with hyperlipidemia, and the influence of rs1941404 variation on the resting energy expenditure may be the possible mechanism for rs1941404 variation to induce hyperlipidemia. PMID:27999813

  8. A genome-wide detection of copy number variation using SNP genotyping arrays in Beijing-You chickens.

    PubMed

    Zhou, Wei; Liu, Ranran; Zhang, Jingjing; Zheng, Maiqing; Li, Peng; Chang, Guobin; Wen, Jie; Zhao, Guiping

    2014-10-01

    Copy number variation (CNV) has been recently examined in many species and is recognized as being a source of genetic variability, especially for disease-related phenotypes. In this study, the PennCNV software, a genome-wide CNV detection system based on the 60 K SNP BeadChip, was used on a total sample size of 1,310 Beijing-You chickens (a Chinese local breed). After quality control, 137 high-confidence CNV regions (CNVRs) were identified, covering 27.31 Mb and corresponding to 2.61% of the whole chicken genome. Within these regions, 131 known genes or coding sequences were involved. Q-PCR was applied to verify some of the genes related to disease development. Results showed that the copy number of genes such as phosphatidylinositol-5-phosphate 4-kinase II alpha, PHD finger protein 14, RHACD8 (a CD8α-like messenger RNA), MHC B-G, zinc finger protein, sarcosine dehydrogenase and ficolin 2 varied between individual chickens, which also supports the reliability of chip-detection of the CNVs. As one source of genomic variation, CNVs may provide new insight into the relationship between the genome and phenotypic characteristics.
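
    The coverage figure above can be checked with simple arithmetic: 27.31 Mb at 2.61% implies a genome length of roughly 27.31 / 0.0261 ≈ 1,046 Mb. The sketch below shows one common way such a figure is obtained, by merging overlapping CNV calls into regions and summing their lengths. It is not PennCNV's own procedure; the half-open coordinate convention and function names are assumptions.

```python
def merge_into_regions(cnvs):
    """Merge overlapping or touching CNV calls into CNV regions.

    `cnvs` is an iterable of (chromosome, start, end) tuples using
    half-open coordinates (an assumption of this sketch).
    """
    merged = []
    for chrom, start, end in sorted(cnvs):
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            prev = merged[-1]
            merged[-1] = (chrom, prev[1], max(prev[2], end))
        else:
            merged.append((chrom, start, end))
    return merged


def percent_covered(regions, genome_length):
    """Percentage of the genome spanned by non-overlapping regions."""
    covered = sum(end - start for _, start, end in regions)
    return 100.0 * covered / genome_length


cnv_calls = [("chr1", 100, 200), ("chr1", 150, 300), ("chr2", 0, 50)]
regions = merge_into_regions(cnv_calls)
coverage = percent_covered(regions, genome_length=1000)
```

    Merging before summing matters because the same stretch of genome called in many individuals would otherwise be counted more than once.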

  9. SNP Discovery by Illumina-Based Transcriptome Sequencing of the Olive and the Genetic Characterization of Turkish Olive Genotypes Revealed by AFLP, SSR and SNP Markers

    PubMed Central

    Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin

    2013-01-01

    Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of genetic variation among Turkish olive genotypes revealed by SNPs, AFLPs and SSRs allowed us to characterize the Turkish olive genotype. PMID:24058483

  10. RNA sequencing to study gene expression and SNP variations associated with growth in zebrafish fed a plant protein-based diet.

    PubMed

    Ulloa, Pilar E; Rincón, Gonzalo; Islas-Trejo, Alma; Araneda, Cristian; Iturra, Patricia; Neira, Roberto; Medrano, Juan F

    2015-06-01

    The objectives of this study were to measure gene expression in zebrafish and then identify SNP to be used as potential markers in a growth association study. We developed an approach where muscle samples collected from low- and high-growth fish were analyzed using RNA-Sequencing (RNA-seq), and SNP were chosen from the genes that were differentially expressed between the low and high groups. A population of 24 families was fed a plant protein-based diet from the larval to adult stages. From a total of 440 males, 5 % of the fish from both tails of the weight gain distribution were selected. Total RNA was extracted from individual muscle of 8 low-growth and 8 high-growth fish. Two pooled RNA-Seq libraries were prepared for each phenotype using 4 fish per library. Libraries were sequenced using the Illumina GAII Sequencer and analyzed using the CLCBio genomic workbench software. One hundred and twenty-four genes were differentially expressed between phenotypes (p value < 0.05 and FDR < 0.2). From these genes, 164 SNP were selected and genotyped in 240 fish samples. Marker-trait analysis revealed 5 SNP associated with growth in key genes (Nars, Lmod2b, Cuzd1, Acta1b, and Plac8l1). These genes are good candidates for further growth studies in fish and to consider for identification of potential SNPs associated with different growth rates in response to a plant protein-based diet.

  11. A Genome Wide Survey of SNP Variation Reveals the Genetic Structure of Sheep Breeds

    USDA-ARS's Scientific Manuscript database

    The genetic structure of sheep reflects their domestication and subsequent formation into discrete breeds. Understanding genetic structure is essential for achieving genetic improvement through genome-wide association studies, genomic selection and the dissection of quantitative traits. After identi...

  12. Developing 100K Affymetrix Axiom SNP Array for Polyploid Sugarcane

    USDA-ARS's Scientific Manuscript database

    Sugarcane genotyping or fingerprinting has long been a daunting task due to its high polyploidy level with large number of chromosomes. Single nucleotide polymorphisms (SNPs) are very abundant DNA sequence variations in the genomes. With the advance of next generation sequencing (NGS) technologies, ...

  13. Population sequencing reveals breed and sub-species specific CNVs in cattle

    USDA-ARS's Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...

  14. 11,670 whole-genome sequences representative of the Han Chinese population from the CONVERGE project.

    PubMed

    Cai, Na; Bigdeli, Tim B; Kretzschmar, Warren W; Li, Yihan; Liang, Jieqin; Hu, Jingchu; Peterson, Roseann E; Bacanu, Silviu; Webb, Bradley Todd; Riley, Brien; Li, Qibin; Marchini, Jonathan; Mott, Richard; Kendler, Kenneth S; Flint, Jonathan

    2017-02-14

    The China, Oxford and Virginia Commonwealth University Experimental Research on Genetic Epidemiology (CONVERGE) project on Major Depressive Disorder (MDD) sequenced 11,670 female Han Chinese at low-coverage (1.7X), providing the first large-scale whole genome sequencing resource representative of the largest ethnic group in the world. Samples are collected from 58 hospitals from 23 provinces around China. We are able to call 22 million high quality single nucleotide polymorphisms (SNP) from the nuclear genome, representing the largest SNP call set from an East Asian population to date. We use these variants for imputation of genotypes across all samples, and this has allowed us to perform a successful genome wide association study (GWAS) on MDD. The utility of these data can be extended to studies of genetic ancestry in the Han Chinese and evolutionary genetics when integrated with data from other populations. Molecular phenotypes, such as copy number variations and structural variations can be detected, quantified and analysed in similar ways.

  15. Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort.

    PubMed

    Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Siler Masters, Bettie Sue; Martásek, Pavel

    2015-01-01

    Estimating polymorphic allele frequencies of the NADPH-CYP450 oxidoreductase (POR) gene in a Czech Slavic population. The POR gene was analyzed in 322 individuals from a control cohort by sequencing and high resolution melting analysis. We identified seven unreported SNP genetic variations, including two SNPs in the 5' flanking region (g.4965C>T and g.4994G>T), one intronic variant (c.1899-20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared with wild-type. New POR variant identification indicates the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYP450s in the endoplasmic reticulum. Original submitted 15 September 2014; Revision submitted 17 November 2014.

  16. The diploid genome sequence of an Asian individual

    PubMed Central

    Wang, Jun; Wang, Wei; Li, Ruiqiang; Li, Yingrui; Tian, Geng; Goodman, Laurie; Fan, Wei; Zhang, Junqing; Li, Jun; Zhang, Juanbin; Guo, Yiran; Feng, Binxiao; Li, Heng; Lu, Yao; Fang, Xiaodong; Liang, Huiqing; Du, Zhenglin; Li, Dong; Zhao, Yiqing; Hu, Yujie; Yang, Zhenzhen; Zheng, Hancheng; Hellmann, Ines; Inouye, Michael; Pool, John; Yi, Xin; Zhao, Jing; Duan, Jinjie; Zhou, Yan; Qin, Junjie; Ma, Lijia; Li, Guoqing; Yang, Zhentao; Zhang, Guojie; Yang, Bin; Yu, Chang; Liang, Fang; Li, Wenjie; Li, Shaochuan; Li, Dawei; Ni, Peixiang; Ruan, Jue; Li, Qibin; Zhu, Hongmei; Liu, Dongyuan; Lu, Zhike; Li, Ning; Guo, Guangwu; Zhang, Jianguo; Ye, Jia; Fang, Lin; Hao, Qin; Chen, Quan; Liang, Yu; Su, Yeyang; san, A.; Ping, Cuo; Yang, Shuang; Chen, Fang; Li, Li; Zhou, Ke; Zheng, Hongkun; Ren, Yuanyuan; Yang, Ling; Gao, Yang; Yang, Guohua; Li, Zhuo; Feng, Xiaoli; Kristiansen, Karsten; Wong, Gane Ka-Shu; Nielsen, Rasmus; Durbin, Richard; Bolund, Lars; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian

    2009-01-01

    Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J. D. Watson and J. C. Venter), and structural variation identification. These variations were considered for their potential biological impact. Our sequence data and analyses demonstrate the potential usefulness of next-generation sequencing technologies for personal genomics. PMID:18987735

  17. Early developmental gene enhancers affect subcortical volumes in the adult human brain.

    PubMed

    Becker, Martin; Guadalupe, Tulio; Franke, Barbara; Hibar, Derrek P; Renteria, Miguel E; Stein, Jason L; Thompson, Paul M; Francks, Clyde; Vernes, Sonja C; Fisher, Simon E

    2016-05-01

    Genome-wide association screens aim to identify common genetic variants contributing to the phenotypic variability of complex traits, such as human height or brain morphology. The identified genetic variants are mostly within noncoding genomic regions and the biology of the genotype-phenotype association typically remains unclear. In this article, we propose a complementary targeted strategy to reveal the genetic underpinnings of variability in subcortical brain volumes, by specifically selecting genomic loci that are experimentally validated forebrain enhancers, active in early embryonic development. We hypothesized that genetic variation within these enhancers may affect the development and ultimately the structure of subcortical brain regions in adults. We tested whether variants in forebrain enhancer regions showed an overall enrichment of association with volumetric variation in subcortical structures of >13,000 healthy adults. We observed significant enrichment of genomic loci that affect the volume of the hippocampus within forebrain enhancers (empirical P = 0.0015), a finding which robustly passed the adjusted threshold for testing of multiple brain phenotypes (cutoff of P < 0.0083 at an alpha of 0.05). In analyses of individual single nucleotide polymorphisms (SNPs), we identified an association upstream of the ID2 gene with rs7588305 and variation in hippocampal volume. This SNP-based association survived multiple-testing correction for the number of SNPs analyzed but not for the number of subcortical structures. Targeting known regulatory regions offers a way to understand the underlying biology that connects genotypes to phenotypes, particularly in the context of neuroimaging genetics. This biology-driven approach generates testable hypotheses regarding the functional biology of identified associations. Hum Brain Mapp 37:1788-1800, 2016. © 2016 Wiley Periodicals, Inc.
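
    The adjusted cutoff quoted in this abstract is consistent with a Bonferroni-style correction, since 0.05 / 6 ≈ 0.0083. The abstract does not state the number of phenotypes tested or name the correction method, so both the count of six and the choice of Bonferroni in this sketch are inferences, not reported facts.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance cutoff under a Bonferroni correction."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return alpha / n_tests


# Assumed for illustration: six brain phenotypes tested at alpha = 0.05.
cutoff = bonferroni_threshold(0.05, 6)

# The empirical P value reported for hippocampal volume in the abstract.
hippocampus_p = 0.0015
survives_correction = hippocampus_p < cutoff
```

    Under these assumptions the reported empirical P of 0.0015 falls below the cutoff, matching the abstract's statement that the enrichment passed the adjusted threshold.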

  18. Genetic variation in peroxisome proliferator-activated receptor gamma, soy, and mammographic density in Singapore Chinese women.

    PubMed

    Lee, Eunjung; Hsu, Chris; Van den Berg, David; Ursin, Giske; Koh, Woon-Puay; Yuan, Jian-Min; Stram, Daniel O; Yu, Mimi C; Wu, Anna H

    2012-04-01

    PPARγ is a transcription factor important for adipogenesis and adipocyte differentiation. Data from animal studies suggest that PPARγ may be involved in breast tumorigenesis, but results from epidemiologic studies on the association between PPARγ variation and breast cancer risk have been mixed. Recent data suggest that soy isoflavones can activate PPARγ. We investigated the interrelations of soy, PPARγ, and mammographic density, a biomarker of breast cancer risk in a cross-sectional study of 2,038 women who were members of the population-based Singapore Chinese Health Study Cohort. We assessed mammographic density using a computer-assisted method. We used linear regression to examine the association between 26 tagging single-nucleotide polymorphisms (SNP) of PPARγ and their interaction with soy intake and mammographic density. To correct for multiple testing, we calculated P values adjusted for multiple correlated tests (P(ACT)). Out of the 26 tested SNPs in the PPARγ, seven SNPs were individually shown to be statistically significantly associated with mammographic density (P(ACT) = 0.008-0.049). A stepwise regression procedure identified that only rs880663 was independently associated with mammographic density which decreased by 1.89% per-minor allele (P(ACT) = 0.008). This association was significantly stronger in high-soy consumers as mammographic density decreased by 3.97% per-minor allele of rs880663 in high-soy consumers (P(ACT) = 0.006; P for interaction with lower soy intake = 0.017). Our data support that PPARγ genetic variation may be important in determining mammographic density, particularly in high-soy consumers. Our findings may help to identify molecular targets and lifestyle intervention for future prevention research. ©2012 AACR.

  19. An SNP resource for rice genetics and breeding based on subspecies indica and japonica genome alignments.

    PubMed

    Feltus, F Alex; Wan, Jun; Schulze, Stefan R; Estill, James C; Jiang, Ning; Paterson, Andrew H

    2004-09-01

    Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp.

  20. An SNP Resource for Rice Genetics and Breeding Based on Subspecies Indica and Japonica Genome Alignments

    PubMed Central

    Feltus, F. Alex; Wan, Jun; Schulze, Stefan R.; Estill, James C.; Jiang, Ning; Paterson, Andrew H.

    2004-01-01

    Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp. PMID:15342564

  1. Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.

    PubMed

    Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A

    2006-08-01

    Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
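
    The greedy binning step that the abstract attributes to LD-select can be illustrated with a minimal Python sketch. This is not the authors' implementation, and it does not cover the multi-population step of MultiPop-TagSelect. The function names (`r_squared`, `greedy_ld_bins`), the 0.8 threshold, the use of the squared Pearson correlation of 0/1/2 genotype dosages as the LD measure, and the toy genotypes are all assumptions made for illustration.

```python
from itertools import combinations


def r_squared(x, y):
    """Squared Pearson correlation between two genotype dosage vectors (assumed LD proxy)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # monomorphic SNP: correlation undefined, treat as unlinked
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return sxy * sxy / (sxx * syy)


def greedy_ld_bins(genotypes, threshold=0.8):
    """Repeatedly pick the SNP linked (r^2 >= threshold) to the most unbinned SNPs.

    Returns a list of (tag_snp, sorted_bin_members) in the order bins were formed.
    """
    names = list(genotypes)
    linked = {s: {s} for s in names}
    for a, b in combinations(names, 2):
        if r_squared(genotypes[a], genotypes[b]) >= threshold:
            linked[a].add(b)
            linked[b].add(a)

    unbinned = set(names)
    bins = []
    while unbinned:
        # max() keeps the first maximum, so ties are broken by input order.
        tag = max((s for s in names if s in unbinned),
                  key=lambda s: len(linked[s] & unbinned))
        members = linked[tag] & unbinned
        bins.append((tag, sorted(members)))
        unbinned -= members
    return bins


# Hypothetical toy data: six individuals, four SNPs coded as minor-allele counts.
example = {
    "snp1": [0, 1, 2, 0, 1, 2],
    "snp2": [0, 1, 2, 0, 1, 2],  # identical to snp1
    "snp3": [2, 1, 0, 2, 1, 0],  # perfectly anti-correlated with snp1
    "snp4": [0, 0, 1, 2, 2, 1],  # uncorrelated with snp1
}
print(greedy_ld_bins(example))
```

    In this toy case one tag SNP covers the first three markers and the fourth forms its own bin, so two genotyped SNPs would stand in for four.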

  2. Incorporation of Personal Single Nucleotide Polymorphism (SNP) Data into a National Level Electronic Health Record for Disease Risk Assessment, Part 2: The Incorporation of SNP into the National Health Information System of Turkey

    PubMed Central

    Beyan, Timur

    2014-01-01

    Background A personalized medicine approach provides opportunities for predictive and preventive medicine. Using genomic, clinical, environmental, and behavioral data, the tracking and management of individual wellness is possible. A productive way to carry this personalized approach into routine practice is to integrate clinical interpretations of genomic variations into electronic medical record (EMR)/electronic health record (EHR) systems. Today, various central EHR infrastructures have been established in many countries of the world, including Turkey. Objective As an initial attempt to develop a sophisticated infrastructure, we have concentrated on incorporating personal single nucleotide polymorphism (SNP) data into the National Health Information System of Turkey (NHIS-T) for disease risk assessment, and evaluated the performance of various predictive models for prostate cancer cases. We present our work as a miniseries containing three parts: (1) an overview of requirements, (2) the incorporation of SNP into the NHIS-T, and (3) an evaluation of SNP data incorporated into the NHIS-T for prostate cancer. Methods For the second article of this miniseries, we have analyzed the existing NHIS-T and proposed possible extensional architectures. In light of the literature survey and the characteristics of NHIS-T, we have proposed and discussed opportunities and obstacles for an SNP-incorporated NHIS-T. A prototype with complementary capabilities (knowledge base and end-user applications) for these architectures has been designed and developed. Results In the proposed architectures, the clinically relevant personal SNP (CR-SNP) and clinicogenomic associations are shared between central repositories and end users via the NHIS-T infrastructure. To produce these files, we need to develop a national level clinicogenomic knowledge base. Regarding clinicogenomic decision support, we planned to complete interpretation of these associations on the end-user applications. This approach gives us the flexibility to add/update envirobehavioral parameters and family health history that will be monitored or collected by end users. Conclusions Our results emphasized that even though the existing NHIS-T messaging infrastructure supports the integration of SNP data and clinicogenomic association, it is critical to develop a national level, accredited knowledge base and better end-user systems for the interpretation of genomic, clinical, and envirobehavioral parameters. PMID:25599817

  3. Incorporation of personal single nucleotide polymorphism (SNP) data into a national level electronic health record for disease risk assessment, part 2: the incorporation of SNP into the national health information system of Turkey.

    PubMed

    Beyan, Timur; Aydın Son, Yeşim

    2014-08-11

    A personalized medicine approach provides opportunities for predictive and preventive medicine. Using genomic, clinical, environmental, and behavioral data, the tracking and management of individual wellness is possible. A productive way to carry this personalized approach into routine practice is to integrate clinical interpretations of genomic variations into electronic medical record (EMR)/electronic health record (EHR) systems. Today, various central EHR infrastructures have been established in many countries of the world, including Turkey. As an initial attempt to develop a sophisticated infrastructure, we have concentrated on incorporating personal single nucleotide polymorphism (SNP) data into the National Health Information System of Turkey (NHIS-T) for disease risk assessment, and evaluated the performance of various predictive models for prostate cancer cases. We present our work as a miniseries containing three parts: (1) an overview of requirements, (2) the incorporation of SNP into the NHIS-T, and (3) an evaluation of SNP data incorporated into the NHIS-T for prostate cancer. For the second article of this miniseries, we have analyzed the existing NHIS-T and proposed possible extensional architectures. In light of the literature survey and the characteristics of NHIS-T, we have proposed and discussed opportunities and obstacles for an SNP-incorporated NHIS-T. A prototype with complementary capabilities (knowledge base and end-user applications) for these architectures has been designed and developed. In the proposed architectures, the clinically relevant personal SNP (CR-SNP) and clinicogenomic associations are shared between central repositories and end users via the NHIS-T infrastructure. To produce these files, we need to develop a national level clinicogenomic knowledge base. Regarding clinicogenomic decision support, we planned to complete interpretation of these associations on the end-user applications. This approach gives us the flexibility to add/update envirobehavioral parameters and family health history that will be monitored or collected by end users. Our results emphasized that even though the existing NHIS-T messaging infrastructure supports the integration of SNP data and clinicogenomic association, it is critical to develop a national level, accredited knowledge base and better end-user systems for the interpretation of genomic, clinical, and envirobehavioral parameters.

  4. A Polymorphic p53 Response Element in KIT Ligand Influences Cancer Risk and Has Undergone Natural Selection

    PubMed Central

    Zeron-Medina, Jorge; Wang, Xuting; Repapi, Emmanouela; Campbell, Michelle R.; Su, Dan; Castro-Giner, Francesc; Davies, Benjamin; Peterse, Elisabeth F.P.; Sacilotto, Natalia; Walker, Graeme J.; Terzian, Tamara; Tomlinson, Ian P.; Box, Neil F.; Meinshausen, Nicolai; De Val, Sarah; Bell, Douglas A.; Bond, Gareth L.

    2014-01-01

    SUMMARY The ability of p53 to regulate transcription is crucial for tumor suppression and implies that inherited polymorphisms in functional p53-binding sites could influence cancer. Here, we identify a polymorphic p53 responsive element and demonstrate its influence on cancer risk using genome-wide data sets of cancer susceptibility loci, genetic variation, p53 occupancy, and p53-binding sites. We uncover a single-nucleotide polymorphism (SNP) in a functional p53-binding site and establish its influence on the ability of p53 to bind to and regulate transcription of the KITLG gene. The SNP resides in KITLG and associates with one of the largest risks identified among cancer genome-wide association studies. We establish that the SNP has undergone positive selection throughout evolution, signifying a selective benefit, but go on to show that similar SNPs are rare in the genome due to negative selection, indicating that polymorphisms in p53-binding sites are primarily detrimental to humans. PMID:24120139

  5. MicroRNAs-1614-3p gene seed region polymorphisms and association analysis with chicken production traits.

    PubMed

    Li, Hong; Sun, Gui-Rong; Tian, Ya-Dong; Han, Rui-Li; Li, Guo-Xi; Kang, Xiang-Tao

    2013-05-01

    In the present study, a total of 860 chickens from a Gushi-Anka F2 resource population were used to evaluate the genetic effect of the gga-miR-1614-3p gene. A novel, silent single nucleotide polymorphism (SNP, +5 C>T) was detected in the gga-miR-1614-3p gene seed region by AvaII polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) and sequencing of PCR products. Associations between the SNP and chicken growth, meat quality and carcass traits were tested by association analysis. The results showed that the SNP was significantly associated with breast muscle shear force, leg muscle water loss rate, wing weight, liver weight and heart weight (p<0.05), and highly significantly associated with abdominal fat weight (p<0.01). According to predictions from the M-fold program, the variation altered the secondary structure and free energy of gga-miR-1614.

  6. Novel and efficient tag SNPs selection algorithms.

    PubMed

    Chen, Wen-Pei; Hung, Che-Lun; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

    2014-01-01

    SNPs are the most abundant form of genetic variation among species, and association studies between complex diseases and SNPs or haplotypes have received great attention. However, these studies are restricted by the cost of genotyping all SNPs; thus, it is necessary to find smaller subsets, or tag SNPs, that represent the rest of the SNPs. The existing tag SNP selection algorithms are notoriously time-consuming. An efficient algorithm for tag SNP selection is presented and applied to the HapMap YRI data. The experimental results show that the proposed algorithm achieves better performance than the existing tag SNP selection algorithms; in most cases, it is at least ten times faster than the existing methods. In many cases, when the redundant ratio of the block is high, the proposed algorithm can even be thousands of times faster than the previously known methods. Tools and web services for haplotype block analysis, integrated with the Hadoop MapReduce framework, have also been developed using the proposed algorithm as the computation kernel.

  7. Comparative Analysis of CNV Calling Algorithms: Literature Survey and a Case Study Using Bovine High-Density SNP Data.

    PubMed

    Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Song, Jiuzhou; Liu, George E

    2013-06-25

    Copy number variations (CNVs) are gains and losses of genomic sequence between two individuals of a species when compared to a reference genome. The data from single nucleotide polymorphism (SNP) microarrays are now routinely used for genotyping, but they also can be utilized for copy number detection. Substantial progress has been made in array design and CNV calling algorithms and at least 10 comparison studies in humans have been published to assess them. In this review, we first survey the literature on existing microarray platforms and CNV calling algorithms. We then examine a number of CNV calling tools to evaluate their impacts using bovine high-density SNP data. Large incongruities in the results from different CNV calling tools highlight the need for standardizing array data collection, quality assessment and experimental validation. Only after careful experimental design and rigorous data filtering can the impacts of CNVs on both normal phenotypic variability and disease susceptibility be fully revealed.

  8. Systematic assessment of the performance of whole-genome amplification for SNP/CNV detection and β-thalassemia genotyping.

    PubMed

    He, Fei; Zhou, Wanjun; Cai, Ren; Yan, Tizhen; Xu, Xiangmin

    2018-04-01

    In this study, we aimed to assess the performance of two whole-genome amplification methods, multiple displacement amplification (MDA) and multiple annealing and looping-based amplification cycle (MALBAC), for β-thalassemia genotyping and single-nucleotide polymorphism (SNP)/copy-number variant (CNV) detection using two DNA sequencing assays. We collected peripheral blood, cell lines, and discarded embryos, and carried out MALBAC and MDA on single-cell and five-cell samples. We detected and statistically analyzed differences in the amplification efficiency, positive predictive value, sensitivity, allele dropout (ADO) rate, SNPs, and CV values between the two methods. Through Sanger sequencing at the single-cell and five-cell levels, we showed that both the amplification rate and the ADO rate of MDA were better than those of MALBAC, and that the sensitivity and positive predictive value obtained from MDA were higher than those from MALBAC for β-thalassemia genotyping. Using next-generation sequencing (NGS) at the single-cell level, we confirmed that MDA has better properties than MALBAC for SNP detection. However, MALBAC was more stable and homogeneous than MDA using low-depth NGS at the single-cell level for CNV detection. We conclude that MALBAC is the better option for CNV detection, while MDA is better suited for SNV detection.

  9. Meta-analysis of genome-wide studies identifies WNT16 and ESR1 SNPs associated with bone mineral density in premenopausal women.

    PubMed

    Koller, Daniel L; Zheng, Hou-Feng; Karasik, David; Yerges-Armstrong, Laura; Liu, Ching-Ti; McGuigan, Fiona; Kemp, John P; Giroux, Sylvie; Lai, Dongbing; Edenberg, Howard J; Peacock, Munro; Czerwinski, Stefan A; Choh, Audrey C; McMahon, George; St Pourcain, Beate; Timpson, Nicholas J; Lawlor, Debbie A; Evans, David M; Towne, Bradford; Blangero, John; Carless, Melanie A; Kammerer, Candace; Goltzman, David; Kovacs, Christopher S; Prior, Jerilynn C; Spector, Tim D; Rousseau, Francois; Tobias, Jon H; Akesson, Kristina; Econs, Michael J; Mitchell, Braxton D; Richards, J Brent; Kiel, Douglas P; Foroud, Tatiana

    2013-03-01

    Previous genome-wide association studies (GWAS) have identified common variants in genes associated with variation in bone mineral density (BMD), although most have been carried out in combined samples of older women and men. Meta-analyses of these results have identified numerous single-nucleotide polymorphisms (SNPs) of modest effect at genome-wide significance levels in genes involved in both bone formation and resorption, as well as other pathways. We performed a meta-analysis restricted to premenopausal white women from four cohorts (n = 4061 women, aged 20 to 45 years) to identify genes influencing peak bone mass at the lumbar spine and femoral neck. After imputation, age- and weight-adjusted bone-mineral density (BMD) values were tested for association with each SNP. Association of an SNP in the WNT16 gene (rs3801387; p = 1.7 × 10^-9) and multiple SNPs in the ESR1/C6orf97 region (rs4870044; p = 1.3 × 10^-8) achieved genome-wide significance levels for lumbar spine BMD. These SNPs, along with others demonstrating suggestive evidence of association, were then tested for association in seven replication cohorts that included premenopausal women of European, Hispanic-American, and African-American descent (combined n = 5597 for femoral neck; n = 4744 for lumbar spine). When the data from the discovery and replication cohorts were analyzed jointly, the evidence was more significant (WNT16 joint p = 1.3 × 10^-11; ESR1/C6orf97 joint p = 1.4 × 10^-10). Multiple independent association signals were observed with spine BMD at the ESR1 region after conditioning on the primary signal. Analyses of femoral neck BMD also supported association with SNPs in WNT16 and ESR1/C6orf97 (p < 1 × 10^-5). Our results confirm that several of the genes contributing to BMD variation across a broad age range in both sexes have effects of similar magnitude on BMD of the spine in premenopausal women. These data support the hypothesis that variants in these genes of known skeletal function also affect BMD during the premenopausal period. Copyright © 2013 American Society for Bone and Mineral Research.

  10. Capturing chloroplast variation for molecular ecology studies: a simple next generation sequencing approach applied to a rainforest tree

    PubMed Central

    2013-01-01

    Background With high quantity and quality data production and low cost, next generation sequencing has the potential to provide new opportunities for plant phylogeographic studies on single and multiple species. Here we present an approach for in silico chloroplast DNA assembly and single nucleotide polymorphism detection from short-read shotgun sequencing. The approach is simple and effective and can be implemented using standard bioinformatic tools. Results The chloroplast genome of Toona ciliata (Meliaceae), 159,514 base pairs long, was assembled from shotgun sequencing on the Illumina platform using de novo assembly of contigs. To evaluate its practicality, value and quality, we compared the short read assembly with an assembly completed using 454 data obtained after chloroplast DNA isolation. Sanger sequence verifications indicated that the Illumina dataset outperformed the longer read 454 data. Pooling of several individuals during preparation of the shotgun library enabled detection of informative chloroplast SNP markers. Following validation, we used the identified SNPs for a preliminary phylogeographic study of T. ciliata in Australia and to confirm low diversity across the distribution. Conclusions Our approach provides a simple method for construction of whole chloroplast genomes from shotgun sequencing of whole genomic DNA using short-read data and no available closely related reference genome (e.g. from the same species or genus). The high coverage of Illumina sequence data also renders this method appropriate for multiplexing and SNP discovery and therefore a useful approach for landscape level studies of evolutionary ecology. PMID:23497206

  11. Recommendations for Accurate Resolution of Gene and Isoform Allele-Specific Expression in RNA-Seq Data

    PubMed Central

    Wood, David L. A.; Nones, Katia; Steptoe, Anita; Christ, Angelika; Harliwong, Ivon; Newell, Felicity; Bruxner, Timothy J. C.; Miller, David; Cloonan, Nicole; Grimmond, Sean M.

    2015-01-01

    Genetic variation modulates gene expression transcriptionally or post-transcriptionally, and can profoundly alter an individual’s phenotype. Measuring allelic differential expression at heterozygous loci within an individual, a phenomenon called allele-specific expression (ASE), can assist in identifying such factors. Massively parallel DNA and RNA sequencing and advances in bioinformatic methodologies provide an outstanding opportunity to measure ASE genome-wide. In this study, matched DNA and RNA sequencing, genotyping arrays and computationally phased haplotypes were integrated to comprehensively and conservatively quantify ASE in a single human brain and liver tissue sample. We describe a methodological evaluation and assessment of common bioinformatic steps for ASE quantification, and recommend a robust approach to accurately measure SNP, gene and isoform ASE through the use of personalized haplotype genome alignment, strict alignment quality control and intragenic SNP aggregation. Our results indicate that accurate ASE quantification requires careful bioinformatic analyses and is adversely affected by sample specific alignment confounders and random sampling even at moderate sequence depths. We identified multiple known and several novel ASE genes in liver, including WDR72, DSP and UBD, as well as genes that contained ASE SNPs with imbalance direction discordant with haplotype phase, explainable by annotated transcript structure, suggesting isoform derived ASE. The methods evaluated in this study will be of use to researchers performing highly conservative quantification of ASE, and the genes and isoforms identified as ASE of interest to researchers studying those loci. PMID:25965996
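
    The idea of testing allelic imbalance at a heterozygous SNP, and of aggregating intragenic SNPs, can be sketched as follows. This is a minimal illustration and not the pipeline evaluated in the study: the exact two-sided binomial test against a 50:50 expectation, the function names (`ase_binomial_p`, `aggregate_gene`), and the read counts are assumptions, and real analyses would also need the alignment quality control the abstract stresses.

```python
from math import comb


def ase_binomial_p(hap_a_reads, hap_b_reads):
    """Exact two-sided binomial p-value for a 50:50 allelic ratio (assumed null)."""
    n = hap_a_reads + hap_b_reads
    if n == 0:
        return 1.0  # no coverage, no evidence of imbalance
    k = min(hap_a_reads, hap_b_reads)
    # The null distribution is symmetric, so double the smaller tail and cap at 1.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


def aggregate_gene(snp_counts):
    """Sum phased read counts over the heterozygous SNPs of one gene.

    snp_counts is a list of (haplotype_A_reads, haplotype_B_reads) tuples; the
    counts must already be assigned to haplotypes, which is an assumption here.
    """
    total_a = sum(a for a, _ in snp_counts)
    total_b = sum(b for _, b in snp_counts)
    return total_a, total_b


# Hypothetical counts for three phased SNPs in one gene.
gene_snps = [(14, 6), (9, 3), (11, 5)]
per_snp = [ase_binomial_p(a, b) for a, b in gene_snps]
gene_level = ase_binomial_p(*aggregate_gene(gene_snps))
print(per_snp, gene_level)
```

    Pooling reads across SNPs on the same haplotype increases the effective depth, which is one way to reduce the random sampling noise the abstract mentions at moderate sequence depths.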

  12. Characterization of phenylpropanoid pathway genes within European maize (Zea mays L.) inbreds

    PubMed Central

    Andersen, Jeppe Reitan; Zein, Imad; Wenzel, Gerhard; Darnhofer, Birte; Eder, Joachim; Ouzunova, Milena; Lübberstedt, Thomas

    2008-01-01

    Background Forage quality of maize is influenced by both the content and structure of lignins in the cell wall. Biosynthesis of monolignols, constituting the complex structure of lignins, is catalyzed by enzymes in the phenylpropanoid pathway. Results In the present study we have amplified partial genomic fragments of six putative phenylpropanoid pathway genes in a panel of elite European inbred lines of maize (Zea mays L.) contrasting in forage quality traits. Six loci, encoding C4H, 4CL1, 4CL2, C3H, F5H, and CAD, displayed different levels of nucleotide diversity and linkage disequilibrium (LD), possibly reflecting different levels of selection. Associations with forage quality traits were identified for several individual polymorphisms within the 4CL1, C3H, and F5H genomic fragments when controlling for both overall population structure and relative kinship. A 1-bp indel in 4CL1 was associated with in vitro digestibility of organic matter (IVDOM), a non-synonymous SNP in C3H was associated with IVDOM, and an intron SNP in F5H was associated with neutral detergent fiber. However, the C3H and F5H associations did not remain significant when controlling for multiple testing. Conclusion While the number of lines included in this study limits the power of the association analysis, our results imply that genetic variation for forage quality traits can be mined in phenylpropanoid pathway genes of elite breeding lines of maize. PMID:18173847

  13. A Multi-Trait, Meta-analysis for Detecting Pleiotropic Polymorphisms for Stature, Fatness and Reproduction in Beef Cattle

    PubMed Central

    Bolormaa, Sunduimijid; Pryce, Jennie E.; Reverter, Antonio; Zhang, Yuandan; Barendse, William; Kemper, Kathryn; Tier, Bruce; Savin, Keith; Hayes, Ben J.; Goddard, Michael E.

    2014-01-01

    Polymorphisms that affect complex traits or quantitative trait loci (QTL) often affect multiple traits. We describe two novel methods (1) for finding single nucleotide polymorphisms (SNPs) significantly associated with one or more traits using a multi-trait meta-analysis, and (2) for distinguishing between a single pleiotropic QTL and multiple linked QTL. The meta-analysis uses the effect of each SNP on each of n traits, estimated in single-trait genome-wide association studies (GWAS). These effects are expressed as a vector of signed t-values (t), and the error covariance matrix of these t-values is approximated by the correlation matrix of t-values among the traits calculated across the SNPs (V). Consequently, t'V^-1 t is approximately distributed as a chi-squared with n degrees of freedom. An attractive feature of the meta-analysis is that it uses estimated effects of SNPs from single-trait GWAS, so it can be applied to published data where individual records are not available. We demonstrate that the multi-trait method can be used to increase the power (numbers of SNPs validated in an independent population) of GWAS in a beef cattle data set including 10,191 animals genotyped for 729,068 SNPs with 32 traits recorded, including growth and reproduction traits. We can distinguish between a single pleiotropic QTL and multiple linked QTL because multiple SNPs tagging the same QTL show the same pattern of effects across traits. We confirm this finding by demonstrating that when one SNP is included in the statistical model the other SNPs have a non-significant effect. In the beef cattle data set, cluster analysis yielded four groups of QTL with similar patterns of effects across traits within a group. A linear index was used to validate SNPs having effects on multiple traits and to identify additional SNPs belonging to these four groups. PMID:24675618
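
    The multi-trait statistic described in this abstract, the quadratic form of the signed t-values t with the inverse of the trait correlation matrix V, can be computed for a single SNP as in the sketch below. This is not the authors' code: the helper names (`solve`, `multi_trait_chi2`, `chi2_sf_even_df`), the example values, and the restriction of the p-value helper to an even number of traits (where the chi-squared tail has a simple closed form) are assumptions made to keep the example dependency-free.

```python
import math


def solve(matrix, rhs):
    """Solve matrix @ x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("correlation matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        known = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - known) / aug[i][i]
    return x


def multi_trait_chi2(t, V):
    """Quadratic form t' V^-1 t, computed as t . (solution of V x = t)."""
    return sum(ti * xi for ti, xi in zip(t, solve(V, t)))


def chi2_sf_even_df(x, df):
    """Upper tail of a chi-squared distribution; closed form valid for even df only."""
    if df <= 0 or df % 2:
        raise ValueError("this helper only handles positive even df")
    half = x / 2
    term = total = 1.0
    for k in range(1, df // 2):
        term *= half / k
        total += term
    return math.exp(-half) * total


# Hypothetical SNP with signed t-values for two traits correlated at 0.5.
t_values = [3.0, -2.0]
V = [[1.0, 0.5], [0.5, 1.0]]
stat = multi_trait_chi2(t_values, V)
print(stat, chi2_sf_even_df(stat, df=len(t_values)))
```

    With uncorrelated traits the statistic would reduce to the sum of squared t-values (13 here); because the two effects have opposite signs while the traits are positively correlated, the statistic is larger (about 25.3).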

  14. Diet and Colorectal Cancer: Analysis of a Candidate Pathway Using SNPS, Haplotypes, and Multi-Gene Assessment

    PubMed Central

    Slattery, Martha L.; Lundgreen, Abbie; Herrick, Jennifer S.; Caan, Bette J.; Potter, John D.; Wolff, Roger K.

    2012-01-01

    There is considerable biologic plausibility to the hypothesis that genetic variability in pathways involved in insulin signaling and energy homeostasis may modulate dietary risk associated with colorectal cancer. We utilized data from 2 population-based case-control studies of colon (n = 1,574 cases, 1,970 controls) and rectal (n = 791 cases, 999 controls) cancer to evaluate genetic variation in candidate SNPs identified from 9 genes in a candidate pathway: PDK1, RPS6KA1, RPS6KA2, RPS6KB1, RPS6KB2, PTEN, FRAP1 (mTOR), TSC1, TSC2, Akt1, PIK3CA, and PRKAG2 with dietary intake of total energy, carbohydrates, fat, and fiber. We employed SNP, haplotype, and multiple-gene analysis to evaluate associations. PDK1 interacted with dietary fat for both colon and rectal cancer and with dietary carbohydrates for colon cancer. Statistically significant interaction with dietary carbohydrates and rectal cancer was detected by haplotype analysis of PDK1. Evaluation of dietary interactions with multiple genes in this candidate pathway showed several interactions with pairs of genes: Akt1 and PDK1, PDK1 and PTEN, PDK1 and TSC1, and PRKAG2 and PTEN. Analyses show that genetic variation influences risk of colorectal cancer associated with diet and illustrate the importance of evaluating dietary interactions beyond the level of single SNPs or haplotypes when a biologically relevant candidate pathway is examined. PMID:21999454

  15. Genetic variation in food choice behaviour of amino acid-deprived Drosophila.

    PubMed

    Toshima, Naoko; Hara, Chieko; Scholz, Claus-Jürgen; Tanimura, Teiichi

    2014-10-01

    To understand homeostatic regulation in insects, we need to understand the mechanisms by which they respond to external stimuli to maintain the internal milieu. Our previous study showed that Drosophila melanogaster exhibit specific amino acid preferences. Here, we used the D. melanogaster Genetic Reference Panel (DGRP), which comprises multiple inbred lines derived from a natural population, to examine how amino acid preference changes depending on the internal nutritional state in different lines. We performed a two-choice preference test and observed genetic variations in the response to amino acid deprivation. For example, a high-responding line showed an enhanced preference for amino acids even after only 1 day of deprivation and responded to a fairly low concentration of amino acids. Conversely, a low-responding line showed no increased preference for amino acids after deprivation. We compared the gene expression profiles between selected high- and low-responding lines and performed SNP analyses. We found several groups of genes putatively involved in altering amino acid preference. These results will contribute to future studies designed to explore how the genetic architecture of an organism evolves to adapt to different nutritional environments. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. cn.FARMS: a latent variable model to detect copy number variations in microarray data with a low false discovery rate.

    PubMed

    Clevert, Djork-Arné; Mitterecker, Andreas; Mayr, Andreas; Klambauer, Günter; Tuefferd, Marianne; De Bondt, An; Talloen, Willem; Göhlmann, Hinrich; Hochreiter, Sepp

    2011-07-01

    Cost-effective oligonucleotide genotyping arrays like the Affymetrix SNP 6.0 are still the predominant technique to measure DNA copy number variations (CNVs). However, CNV detection methods for microarrays overestimate both the number and the size of CNV regions and, consequently, suffer from a high false discovery rate (FDR). A high FDR means that many CNVs are wrongly detected and therefore not associated with a disease in a clinical study, though correction for multiple testing takes them into account and thereby decreases the study's discovery power. For controlling the FDR, we propose a probabilistic latent variable model, 'cn.FARMS', which is optimized by a Bayesian maximum a posteriori approach. cn.FARMS controls the FDR through the information gain of the posterior over the prior. The prior represents the null hypothesis of copy number 2 for all samples, from which the posterior can only deviate by strong and consistent signals in the data. On HapMap data, cn.FARMS clearly outperformed the two most prevalent methods with respect to sensitivity and FDR. The software cn.FARMS is publicly available as an R package at http://www.bioinf.jku.at/software/cnfarms/cnfarms.html.

  17. Inter-individual variability and genetic influences on cytokine responses against bacterial and fungal pathogens

    PubMed Central

    Li, Yang; Oosting, Marije; Deelen, Patrick; Ricaño-Ponce, Isis; Smeekens, Sanne; Jaeger, Martin; Matzaraki, Vasiliki; Swertz, Morris A.; Xavier, Ramnik J.; Franke, Lude; Wijmenga, Cisca; Joosten, Leo A.B.; Kumar, Vinod; Netea, Mihai G.

    2016-01-01

    Little is known about the inter-individual variation of cytokine responses to different pathogens in healthy individuals. To systematically describe cytokine responses elicited by distinct pathogens, and to determine the impact of genetic variation on cytokine production, we profiled cytokines produced by peripheral blood mononuclear cells from 197 individuals of European origin from the 200 Functional Genomics (200FG) cohort within the Human Functional Genomics Study (www.humanfunctionalgenomics.org), obtained over three different years. By comparing bacteria- and fungi-induced cytokine profiles, we show that most cytokine responses are organized around a physiological response to specific pathogens, rather than around a particular immune pathway or cytokine. We then correlated genome-wide SNP genotypes with cytokine abundance and identified six cytokine QTLs. Among them, a cytokine QTL at the NAA35-GOLM1 locus markedly modulates IL-6 production in response to multiple pathogens and is associated with susceptibility to candidemia. Furthermore, the cytokine QTLs we identified are enriched among SNPs previously associated with infectious diseases and heart diseases. These data reveal and begin to explain the variability in cytokine production by human immune cells in response to pathogens. PMID:27376574

  18. Interactions between genetic variation and cellular environment in skeletal muscle gene expression.

    PubMed

    Taylor, D Leland; Knowles, David A; Scott, Laura J; Ramirez, Andrea H; Casale, Francesco Paolo; Wolford, Brooke N; Guan, Li; Varshney, Arushi; Albanus, Ricardo D'Oliveira; Parker, Stephen C J; Narisu, Narisu; Chines, Peter S; Erdos, Michael R; Welch, Ryan P; Kinnunen, Leena; Saramies, Jouko; Sundvall, Jouko; Lakka, Timo A; Laakso, Markku; Tuomilehto, Jaakko; Koistinen, Heikki A; Stegle, Oliver; Boehnke, Michael; Birney, Ewan; Collins, Francis S

    2018-01-01

    From whole organisms to individual cells, responses to environmental conditions are influenced by genetic makeup, where the effect of genetic variation on a trait depends on the environmental context. RNA-sequencing quantifies gene expression as a molecular trait, and is capable of capturing both genetic and environmental effects. In this study, we explore opportunities of using allele-specific expression (ASE) to discover cis-acting genotype-environment interactions (GxE), that is, genetic effects on gene expression that depend on an environmental condition. Treating 17 common, clinical traits as approximations of the cellular environment of 267 skeletal muscle biopsies, we identify 10 candidate environmental response expression quantitative trait loci (reQTLs) across 6 traits (12 unique gene-environment trait pairs; 10% FDR per trait) including sex, systolic blood pressure, and low-density lipoprotein cholesterol. Although using ASE is in principle a promising approach to detect GxE effects, replication of such signals can be challenging as validation requires harmonization of environmental traits across cohorts and a sufficient sampling of heterozygotes for a transcribed SNP. Comprehensive discovery and replication will require large human transcriptome datasets, or the integration of multiple transcribed SNPs, coupled with standardized clinical phenotyping.

  19. Variation in the ICAM1-ICAM4-ICAM5 locus is associated with systemic lupus erythematosus susceptibility in multiple ancestries.

    PubMed

    Kim, Kwangwoo; Brown, Elizabeth E; Choi, Chan-Bum; Alarcón-Riquelme, Marta E; Kelly, Jennifer A; Glenn, Stuart B; Ojwang, Joshua O; Adler, Adam; Lee, Hye-Soon; Boackle, Susan A; Criswell, Lindsey A; Alarcón, Graciela S; Edberg, Jeffrey C; Stevens, Anne M; Jacob, Chaim O; Gilkeson, Gary S; Kamen, Diane L; Tsao, Betty P; Anaya, Juan-Manuel; Guthridge, Joel M; Nath, Swapan K; Richardson, Bruce; Sawalha, Amr H; Kang, Young Mo; Shim, Seung Cheol; Suh, Chang-Hee; Lee, Soo-Kon; Kim, Chang-sik; Merrill, Joan T; Petri, Michelle; Ramsey-Goldman, Rosalind; Vilá, Luis M; Niewold, Timothy B; Martin, Javier; Pons-Estel, Bernardo A; Vyse, Timothy J; Freedman, Barry I; Moser, Kathy L; Gaffney, Patrick M; Williams, Adrienne; Comeau, Mary; Reveille, John D; James, Judith A; Scofield, R Hal; Langefeld, Carl D; Kaufman, Kenneth M; Harley, John B; Kang, Changwon; Kimberly, Robert P; Bae, Sang-Cheol

    2012-11-01

    Systemic lupus erythematosus (SLE; OMIM 152700) is a chronic autoimmune disease for which the aetiology includes genetic and environmental factors. ITGAM, integrin α(M) (complement component 3 receptor 3 subunit) encoding a ligand for intercellular adhesion molecule (ICAM) proteins, is an established SLE susceptibility locus. This study aimed to evaluate the independent and joint effects of genetic variations in the genes that encode ITGAM and ICAM. The authors examined several markers in the ICAM1-ICAM4-ICAM5 locus on chromosome 19p13 and the single ITGAM polymorphism (rs1143679) using a large-scale case-control study of 17 481 unrelated participants from four ancestry populations. The single-marker association and gene-gene interaction were analysed for each ancestry, and a meta-analysis across the four ancestries was performed. The A-allele of ICAM1-ICAM4-ICAM5 rs3093030, associated with elevated plasma levels of soluble ICAM1, and the A-allele of ITGAM rs1143679 showed the strongest association with increased SLE susceptibility in each of the ancestry populations and the trans-ancestry meta-analysis (OR(meta)=1.16, 95% CI 1.11 to 1.22; p=4.88×10^-10 and OR(meta)=1.67, 95% CI 1.55 to 1.79; p=3.32×10^-46, respectively). The effect of the ICAM single-nucleotide polymorphisms (SNPs) was independent of the effect of the ITGAM SNP rs1143679, and carriers of both ICAM rs3093030-AA and ITGAM rs1143679-AA had an OR of 4.08 compared with those with no risk allele in either SNP (95% CI 2.09 to 7.98; p=3.91×10^-5). These findings are the first to suggest that an ICAM-integrin-mediated pathway contributes to susceptibility to SLE.

  20. Variation in the ICAM1–ICAM4–ICAM5 locus is associated with systemic lupus erythematosus susceptibility in multiple ancestries

    PubMed Central

    Kim, Kwangwoo; Brown, Elizabeth E; Choi, Chan-Bum; Alarcón-Riquelme, Marta E; Kelly, Jennifer A; Glenn, Stuart B; Ojwang, Joshua O; Adler, Adam; Lee, Hye-Soon; Boackle, Susan A; Criswell, Lindsey A; Alarcón, Graciela S; Edberg, Jeffrey C; Stevens, Anne M; Jacob, Chaim O; Gilkeson, Gary S; Kamen, Diane L; Tsao, Betty P; Anaya, Juan-Manuel; Guthridge, Joel M; Nath, Swapan K; Richardson, Bruce; Sawalha, Amr H; Kang, Young Mo; Shim, Seung Cheol; Suh, Chang-Hee; Lee, Soo-Kon; Kim, Chang-sik; Merrill, Joan T; Petri, Michelle; Ramsey-Goldman, Rosalind; Vilá, Luis M; Niewold, Timothy B; Martin, Javier; Pons-Estel, Bernardo A; Vyse, Timothy J; Freedman, Barry I; Moser, Kathy L; Gaffney, Patrick M; Williams, Adrienne; Comeau, Mary; Reveille, John D; James, Judith A; Scofield, R Hal; Langefeld, Carl D; Kaufman, Kenneth M; Harley, John B; Kang, Changwon; Kimberly, Robert P; Bae, Sang-Cheol

    2012-01-01

    Objective Systemic lupus erythematosus (SLE; OMIM 152700) is a chronic autoimmune disease for which the aetiology includes genetic and environmental factors. ITGAM, integrin αM (complement component 3 receptor 3 subunit) encoding a ligand for intercellular adhesion molecule (ICAM) proteins, is an established SLE susceptibility locus. This study aimed to evaluate the independent and joint effects of genetic variations in the genes that encode ITGAM and ICAM. Methods The authors examined several markers in the ICAM1–ICAM4–ICAM5 locus on chromosome 19p13 and the single ITGAM polymorphism (rs1143679) using a large-scale case–control study of 17 481 unrelated participants from four ancestry populations. The single marker association and gene–gene interaction were analysed for each ancestry, and a meta-analysis across the four ancestries was performed. Results The A-allele of ICAM1–ICAM4–ICAM5 rs3093030, associated with elevated plasma levels of soluble ICAM1, and the A-allele of ITGAM rs1143679 showed the strongest association with increased SLE susceptibility in each of the ancestry populations and the trans-ancestry meta-analysis (OR(meta)=1.16, 95% CI 1.11 to 1.22; p=4.88×10^-10 and OR(meta)=1.67, 95% CI 1.55 to 1.79; p=3.32×10^-46, respectively). The effect of the ICAM single-nucleotide polymorphisms (SNPs) was independent of the effect of the ITGAM SNP rs1143679, and carriers of both ICAM rs3093030-AA and ITGAM rs1143679-AA had an OR of 4.08 compared with those with no risk allele in either SNP (95% CI 2.09 to 7.98; p=3.91×10^-5). Conclusion These findings are the first to suggest that an ICAM–integrin-mediated pathway contributes to susceptibility to SLE. PMID:22523428

  1. A new variation in the promoter region, the -604 C>T, and the Leu72Met polymorphism of the ghrelin gene are associated with protection to insulin resistance.

    PubMed

    Zavarella, S; Petrone, A; Zampetti, S; Gueorguiev, M; Spoletini, M; Mein, C A; Leto, G; Korbonits, M; Buzzetti, R

    2008-04-01

    Previous studies suggested that polymorphisms in the coding region of the preproghrelin were involved in the etiology of obesity and might modulate glucose-induced insulin secretion. We evaluated the association of a new variation, -604C>T, in the promoter region of the ghrelin gene, of Leu72Met (247C>A) and of Gln90Leu (265A>T), all haplotype-tagging single nucleotide polymorphisms (SNPs), with measures of insulin sensitivity in 1420 adult individuals. The three SNPs were genotyped using the ABI PRISM 7900 HT Sequence Detection System. We used multiple linear regression analysis for quantitative traits and THESIAS software for haplotype analysis. We observed a protective effect exerted by the Met72 variant of the Leu72Met SNP on insulin resistance parameters; a significant decreasing trend from Leu/Leu to Leu/Met and to Met/Met homozygous subjects in triglycerides, fasting insulin levels and HOMA-IR index (P=0.02, 0.01 and 0.003, respectively), and, consistently, an increase in ghrelin levels (P=0.003) was found. A significant decrease from CC to TC and to TT genotypes in insulin levels and HOMA-IR index was also detected (P=0.001 for both), but only in subjects homozygous for Leu72, where the protective effect of Met72 was not present. The haplotype analysis results supported the data obtained by the evaluation of each single SNP, showing the highest value of insulin levels and HOMA-IR index in the -604(C)247(C) haplotype, an intermediate value in -604(T)247(C) and the lowest value in -604(C)247(A). Our observations suggest a protective role of the Met72 variant and of the -604 T allele in modulating insulin resistance. These SNPs or an unknown functional variant in linkage disequilibrium could increase ghrelin levels and probably insulin sensitivity.

  2. Assessing variation across 8 established East Asian loci for type 2 diabetes mellitus in American Indians: Suggestive evidence for new sex-specific diabetes signals in GLIS3 and ZFAND3.

    PubMed

    Muller, Yunhua L; Piaggi, Paolo; Chen, Peng; Wiessner, Gregory; Okani, Chidinma; Kobes, Sayuko; Knowler, William C; Bogardus, Clifton; Hanson, Robert L; Baier, Leslie J

    2017-05-01

    Eight new loci for type 2 diabetes mellitus (T2DM) were identified in an East Asian genome-wide association study meta-analysis. We assess tag SNPs across these loci for associations with T2DM in American Indians. A total of 435 SNPs that tag (R^2 ≥ .85) common variation across the 8 loci were analyzed for association with T2DM (n = 7710), early onset T2DM (n = 1060), body mass index (n = 6839), insulin sensitivity (n = 555), and insulin secretion (n = 298). Tag SNPs within FITM2-R3HDML-HNF4A, GLIS3, KCNK16, and ZFAND3 associated with T2DM after accounting for locus-wide multiple testing. The T2DM association in FITM2-R3HDML-HNF4A (rs3212183; P = .0002; OR = 1.19 [1.09-1.30]) was independent from the East Asian lead SNP (rs6017317), which did not associate with T2DM in American Indians. The top signals in GLIS3 (rs7875253; P = .0004; OR = 1.23 [1.10-1.38]) and KCNK16 (rs1544050; P = .002; OR = 1.16 [1.06-1.27]) were attenuated after adjustment for the East Asian lead SNPs (rs7041847 in GLIS3; rs1535500 in KCNK16), both of which also associated with T2DM in American Indians (P = .02; OR = 1.11 [1.01-1.21]; P = .007; OR = 1.19 [1.05-1.36] respectively). The top SNP in ZFAND3 (rs9470794; P = .002; OR = 1.43 [1.14-1.80]) was the identical East Asian lead SNP. Additional SNPs in GLIS3 (rs180867004) and ZFAND3 (rs4714120 and rs9470701) had significant genotype × sex interactions (P ≤ .008). The GLIS3 SNP (rs180867004) associated with T2DM only in men (P = .00006, OR = 1.94 [1.40-2.68]). The ZFAND3 SNPs (rs4714120 and rs9470701) associated with T2DM only in women (P = .0002, OR = 1.35 [1.16-1.59]; P = .0003, OR = 1.37 [1.16-1.63] respectively). Replication of lead T2DM SNPs in GLIS3, KCNK16, and ZFAND3 was observed in American Indians. Sex-specific T2DM signals in GLIS3 and ZFAND3, which are distinct from the East Asian GWAS signals, were also identified. Published 2016. This article is a U.S. Government work and is in the public domain in the USA.

  3. Pathway-based analyses.

    PubMed

    Kent, Jack W

    2016-02-03

    New technologies for acquisition of genomic data, while offering unprecedented opportunities for genetic discovery, also impose severe burdens of interpretation and penalties for multiple testing. The Pathway-based Analyses Group of the Genetic Analysis Workshop 19 (GAW19) sought reduction of multiple-testing burden through various approaches to aggregation of high-dimensional data in pathways informed by prior biological knowledge. Experimental methods tested included the use of "synthetic pathways" (random sets of genes) to estimate power and false-positive error rate of methods applied to simulated data; data reduction via independent components analysis, single-nucleotide polymorphism (SNP)-SNP interaction, and use of gene sets to estimate genetic similarity; and general assessment of the efficacy of prior biological knowledge to reduce the dimensionality of complex genomic data. The work of this group explored several promising approaches to managing high-dimensional data, with the caveat that these methods are necessarily constrained by the quality of external bioinformatic annotation.

  4. Effects of assortative mate choice on the genomic and morphological structure of a hybrid zone between two bird subspecies.

    PubMed

    Semenov, Georgy A; Scordato, Elizabeth S C; Khaydarov, David R; Smith, Chris C R; Kane, Nolan C; Safran, Rebecca J

    2017-11-01

    Phenotypic differentiation plays an important role in the formation and maintenance of reproductive barriers. In some cases, variation in a few key aspects of phenotype can promote and maintain divergence; hence, the identification of these traits and their associations with patterns of genomic divergence is crucial for understanding the patterns and processes of population differentiation. We studied hybridization between the alba and personata subspecies of the white wagtail (Motacilla alba), and quantified divergence and introgression of multiple morphological traits and 19,437 SNP loci on a 3,000 km transect. Our goal was to identify traits that may contribute to reproductive barriers and to assess how variation in these traits corresponds to patterns of genome-wide divergence. Variation in only one trait, head plumage patterning, was consistent with reproductive isolation. Transitions in head plumage were steep and occurred over otherwise morphologically and genetically homogeneous populations, whereas cline centres for other traits and genomic ancestry were displaced over 100 km from the head cline. Field observational data show that social pairs mated assortatively by head plumage, suggesting that these phenotypes are maintained by divergent mating preferences. In contrast, variation in all other traits and genetic markers could be explained by neutral diffusion, although weak ecological selection cannot be ruled out. Our results emphasize that assortative mating may maintain phenotypic differences independent of other processes shaping genome-wide variation, consistent with other recent findings that raise questions about the relative importance of mate choice, ecological selection and selectively neutral processes for divergent evolution. © 2017 John Wiley & Sons Ltd.

  5. Translational genomics for analysis of complex traits in peanut and sorghum

    USDA-ARS's Scientific Manuscript database

    The integration of sequencing and genotype data from natural variation studies (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) facilitated the development of DNA markers in the form of single nucleotide polymorphic (SNP)...

  6. Population sequencing reveals breed and sub-species specific CNVs in cattle

    USDA-ARS's Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect the rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an incre...

  7. Development and utilization of 100K SNP array in Saccharum Spp.

    USDA-ARS's Scientific Manuscript database

    Sugarcane genotyping or fingerprinting has long been a daunting task due to its high polyploidy level with large number of chromosomes. Single nucleotide polymorphisms (SNPs) are very abundant DNA sequence variations in the genome. With the advance of next generation sequencing (NGS) technologies, m...

  8. Species trees from consensus single nucleotide polymorphism (SNP) data: Testing phylogenetic approaches with simulated and empirical data.

    PubMed

    Schmidt-Lebuhn, Alexander N; Aitken, Nicola C; Chuah, Aaron

    2017-11-01

    Datasets of hundreds or thousands of SNPs (Single Nucleotide Polymorphisms) from multiple individuals per species are increasingly used to study population structure, species delimitation and shallow phylogenetics. The principal software tool to infer species or population trees from SNP data is currently the BEAST template SNAPP which uses a Bayesian coalescent analysis. However, it is computationally extremely demanding and tolerates only small amounts of missing data. We used simulated and empirical SNPs from plants (Australian Craspedia, Asteraceae, and Pelargonium, Geraniaceae) to compare species trees produced (1) by SNAPP, (2) using SVD quartets, and (3) using Bayesian and parsimony analysis with several different approaches to summarising data from multiple samples into one set of traits per species. Our aims were to explore the impact of tree topology and missing data on the results, and to test which data summarising and analyses approaches would best approximate the results obtained from SNAPP for empirical data. SVD quartets retrieved the correct topology from simulated data, as did SNAPP except in the case of a very unbalanced phylogeny. Both methods failed to retrieve the correct topology when large amounts of data were missing. Bayesian analysis of species level summary data scoring the two alleles of each SNP as independent characters and parsimony analysis of data scoring each SNP as one character produced trees with branch length distributions closest to the true trees on which SNPs were simulated. For empirical data, Bayesian inference and Dollo parsimony analysis of data scored allele-wise produced phylogenies most congruent with the results of SNAPP. 
In the case of study groups divergent enough for missing data to be phylogenetically informative (because of additional mutations preventing amplification of genomic fragments or bioinformatic establishment of homology), scoring of SNP data as a presence/absence matrix irrespective of allele content might be an additional option. As this depends on sampling across species being reasonably even and a random distribution of non-informative instances of missing data, however, further exploration of this approach is needed. Properly chosen data summary approaches to inferring species trees from SNP data may represent a potential alternative to currently available individual-level coalescent analyses especially for quick data exploration and when dealing with computationally demanding or patchy datasets. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.
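
    The allele-wise species-level summary described above can be illustrated with a minimal sketch. The abstract does not give the exact coding scheme, so the input layout (alternate-allele counts 0/1/2 with None for missing calls), the function name, and the "?" symbol for unscored characters are all assumptions made for illustration only.

```python
def score_allele_wise(genotypes_by_species, n_snps):
    """Summarise individual genotypes into two binary characters per SNP.

    genotypes_by_species maps a species name to a list of individuals;
    each individual is a list of per-SNP alternate-allele counts
    (0, 1 or 2) or None when the call is missing (assumed layout).
    For every SNP the species gets two characters: reference allele
    present (1/0) and alternate allele present (1/0). A SNP with no
    calls at all in a species is scored "??".
    """
    matrix = {}
    for species, individuals in genotypes_by_species.items():
        chars = []
        for snp in range(n_snps):
            calls = [ind[snp] for ind in individuals if ind[snp] is not None]
            if not calls:
                chars.extend("??")
                continue
            ref_present = any(c < 2 for c in calls)
            alt_present = any(c > 0 for c in calls)
            chars.append("1" if ref_present else "0")
            chars.append("1" if alt_present else "0")
        matrix[species] = "".join(chars)
    return matrix


# Hypothetical toy data: two species, two individuals each, three SNPs.
toy = {
    "sp_A": [[0, 2, None], [1, 2, None]],
    "sp_B": [[2, 0, 1], [2, 0, None]],
}
summary = score_allele_wise(toy, n_snps=3)
for name, row in summary.items():
    print(name, row)
```

    Scoring each SNP as a single character instead would collapse the two columns into one state per SNP; which coding a given phylogenetic program expects is not specified here.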

  9. Ubiquitin-conjugating enzyme E2-like gene associated to pathogen response in Concholepas concholepas: SNP identification and transcription expression.

    PubMed

    Núñez-Acuña, Gustavo; Aguilar-Espinoza, Andrea; Chávez-Mardones, Jacqueline; Gallardo-Escárate, Cristian

    2012-10-01

    Ubiquitin-conjugated E2 enzyme (UBE2) is one of the main components of the proteasome degradation cascade. Previous studies have shown an increase of expression levels in individuals challenged with pathogens such as viruses and bacteria. This study aimed to characterize the immune response of the UBE2 gene in the gastropod Concholepas concholepas through expression analysis and single nucleotide polymorphism (SNP) discovery. UBE2 was identified from a cDNA library by 454 pyrosequencing, while SNP identification and validation were performed using de novo assembly and high resolution melting analysis. Challenge trials with Vibrio anguillarum were carried out to evaluate the relative transcript abundance of the UBE2 gene from two to thirty-three hours post-treatment. The results showed a partial UBE2 sequence of 889 base pairs (bp) with a partial coding region of 291 bp. SNP variation (A/C) was observed at the 546th position. Individuals challenged by V. anguillarum showed an overexpression of the UBE2 gene, the expression being significantly higher in homozygous (AA) individuals than in homozygous (CC) or heterozygous (A/C) individuals. This study contributes useful information relating to the UBE2 gene and its association with innate immune response in marine invertebrates. Copyright © 2012 Elsevier Ltd. All rights reserved.

  10. Association of MEOX2 polymorphism with nonsyndromic cleft palate only in a Vietnamese population.

    PubMed

    Tran, Duy L; Imura, Hideto; Mori, Akihiro; Suzuki, Satoshi; Niimi, Teruyuki; Ono, Maya; Sakuma, Chisato; Nakahara, Shinichi; Nguyen, Tham T H; Pham, Phuong T; Hoang, Viet; Tran, Van T T; Nguyen, Minh D; Natsume, Nagato

    2017-10-14

    To evaluate the association between the single nucleotide polymorphism (SNP) rs2237493 in the MEOX2 gene and nonsyndromic cleft palate only, this research was conducted as a case-control study by comparing a nonsyndromic cleft palate only group with an independent, healthy, and unaffected control group who were both examined by specialists. Based on clinical examination and medical records, we analyzed a total of 570 DNA samples, including 277 cases and 293 controls, which were extracted from dry blood spot samples collected from both the Odonto and Maxillofacial Hospital in Ho Chi Minh City and Nguyen Dinh Chieu Hospital in Ben Tre province, respectively. The standard procedures of genotyping the specific SNP (rs2237493) for MEOX2 were performed on a StepOne Realtime PCR system with TaqMan SNP Genotyping Assays. Statistically significant differences were observed in allelic frequencies (allele T and allele G) between the nonsyndromic cleft palate only and control groups in female subjects, with an allelic odds ratio of 1.455 (95% confidence interval: 1.026-2.064) and P < 0.05. These study findings suggest that nonsyndromic isolated cleft palate might be influenced by variation of MEOX2, especially SNP rs2237493 in Vietnamese females. © 2017 Japanese Teratology Society.
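
    An allelic odds ratio with a 95% confidence interval, as reported above, is commonly derived from a 2x2 table of allele counts. The sketch below uses the Woolf (log-normal) approximation, which is an assumption here since the abstract does not state which interval method was used. The allele counts are hypothetical and are not taken from the study.

```python
import math


def allelic_odds_ratio(case_risk, case_other, ctrl_risk, ctrl_other, z=1.96):
    """Return (OR, lower, upper) for a 2x2 table of allele counts.

    The interval is the Woolf approximation exp(ln(OR) +/- z * SE),
    where SE = sqrt(1/a + 1/b + 1/c + 1/d) over the four cell counts.
    """
    counts = (case_risk, case_other, ctrl_risk, ctrl_other)
    if min(counts) <= 0:
        raise ValueError("all four allele counts must be positive")
    odds_ratio = (case_risk * ctrl_other) / (case_other * ctrl_risk)
    se = math.sqrt(sum(1.0 / n for n in counts))
    log_or = math.log(odds_ratio)
    return odds_ratio, math.exp(log_or - z * se), math.exp(log_or + z * se)


# Hypothetical allele counts (risk allele vs. other allele).
or_value, ci_low, ci_high = allelic_odds_ratio(120, 80, 100, 100)
print(f"OR = {or_value:.3f}, 95% CI {ci_low:.3f}-{ci_high:.3f}")
```

    An interval whose lower bound stays above 1, as in the reported 1.026-2.064, corresponds to significance at roughly the 5% level under this approximation.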

  11. Revision of the SNPforID 34-plex forensic ancestry test: Assay enhancements, standard reference sample genotypes and extended population studies.

    PubMed

    Fondevila, M; Phillips, C; Santos, C; Freire Aradas, A; Vallone, P M; Butler, J M; Lareu, M V; Carracedo, A

    2013-01-01

    A revision of an established 34 SNP forensic ancestry test has been made by swapping the under-performing rs727811 component SNP with the highly informative rs3827760 that shows a near-fixed East Asian specific allele. We collated SNP variability data for the revised SNP set in 66 reference populations from 1000 Genomes and HGDP-CEPH panels and used this as reference data to analyse four U.S. populations showing a range of admixture patterns. The U.S. Hispanics sample in particular displayed heterogeneous values of co-ancestry between European, Native American and African contributors, likely to reflect in part, the way this disparate group is defined using cultural as well as population genetic parameters. The genotyping of over 700 U.S. population samples also provided the opportunity to thoroughly gauge peak mobility variation and peak height ratios observed from routine use of the single base extension chemistry of the 34-plex test. Finally, the genotyping of the widely used DNA profiling Standard Reference Material samples plus other control DNAs completes the audit of the 34-plex assay to allow forensic practitioners to apply this test more readily in their own laboratories. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.

  12. Extent of linkage disequilibrium, consistency of gametic phase, and imputation accuracy within and across Canadian dairy breeds.

    PubMed

    Larmer, S G; Sargolzaei, M; Schenkel, F S

    2014-05-01

    Genomic selection requires a large reference population to accurately estimate single nucleotide polymorphism (SNP) effects. In some Canadian dairy breeds, the available reference populations are not large enough for accurate estimation of SNP effects for traits of interest. If marker phase is highly consistent across multiple breeds, it is theoretically possible to increase the accuracy of genomic prediction for one or all breeds by pooling several breeds into a common reference population. This study investigated the extent of linkage disequilibrium (LD) in 5 major dairy breeds using a 50,000 (50K) SNP panel and 3 of the same breeds using the 777,000 (777K) SNP panel. Correlation of pair-wise SNP phase was also investigated on both panels. The level of LD was measured using the squared correlation of alleles at 2 loci (r(2)), and the consistency of SNP gametic phases was correlated using the signed square root of these values. Because of the high cost of the 777K panel, the accuracy of imputation from lower density marker panels [6,000 (6K) or 50K] was examined both within breed and using a multi-breed reference population in Holstein, Ayrshire, and Guernsey. Imputation was carried out using FImpute V2.2 and Beagle 3.3.2 software. Imputation accuracies were then calculated as both the proportion of correct SNP filled in (concordance rate) and allelic R(2). Computation time was also explored to determine the efficiency of the different algorithms for imputation. Analysis showed that LD values >0.2 were found in all breeds at distances at or shorter than the average adjacent pair-wise distance between SNP on the 50K panel. Correlations of r-values, however, did not reach high levels (<0.9) at these distances. High correlation values of SNP phase between breeds were observed (>0.94) when the average pair-wise distances using the 777K SNP panel were examined. 
High concordance rate (0.968-0.995) and allelic R(2) (0.946-0.991) were found for all breeds when imputation was carried out with FImpute from 50K to 777K. Imputation accuracy for Guernsey and Ayrshire was slightly lower when using the imputation method in Beagle. Computing time was significantly greater when using Beagle software, with all comparable procedures being 9 to 13 times less efficient, in terms of time, compared with FImpute. These findings suggest that use of a multi-breed reference population might increase prediction accuracy using the 777K SNP panel and that 777K genotypes can be efficiently and effectively imputed using the lower density 50K SNP panel. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
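
    Two of the quantities used above, the LD measure r(2) with its signed square root and the imputation concordance rate, can be sketched as follows. This is a minimal illustration assuming phased haplotypes coded 0/1 and genotypes coded 0/1/2. It is not the implementation used by FImpute or Beagle, and the toy data are hypothetical.

```python
import math


def ld_r(haplotypes):
    """Signed LD correlation r between two biallelic loci.

    haplotypes is a list of (a, b) pairs with alleles coded 0/1.
    r = D / sqrt(pA * (1 - pA) * pB * (1 - pB)), with D = pAB - pA * pB.
    Squaring r gives the usual r^2 measure of LD.
    """
    n = len(haplotypes)
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    denom = math.sqrt(p_a * (1 - p_a) * p_b * (1 - p_b))
    if denom == 0:
        raise ValueError("both loci must be polymorphic")
    return (p_ab - p_a * p_b) / denom


def concordance_rate(true_genotypes, imputed_genotypes):
    """Proportion of imputed genotypes that equal the true genotypes."""
    if len(true_genotypes) != len(imputed_genotypes) or not true_genotypes:
        raise ValueError("genotype lists must be non-empty and equal length")
    matches = sum(t == i for t, i in zip(true_genotypes, imputed_genotypes))
    return matches / len(true_genotypes)


# Hypothetical phased haplotypes at two SNPs.
haps = [(1, 1), (1, 1), (1, 0), (0, 0), (0, 0), (0, 1), (1, 1), (0, 0)]
r = ld_r(haps)
print("r =", r, "r^2 =", r * r)

# Hypothetical true vs. imputed genotype calls for one animal.
rate = concordance_rate([0, 1, 2, 1, 0], [0, 1, 2, 2, 0])
print("concordance =", rate)
```

    Consistency of gametic phase between two breeds would then be the correlation, across SNP pairs, of the signed r values computed separately in each breed.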

  13. Genome-wide association implicates numerous genes and pleiotropy underlying ecological trait variation in natural populations of Populus trichocarpa

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McKown, Athena; Klapste, Jaroslav; Guy, Robert

    2014-01-01

    To uncover the genetic basis of phenotypic trait variation, we used 448 unrelated wild accessions of black cottonwood (Populus trichocarpa Torr. & Gray) from natural populations throughout western North America. Extensive information from large-scale trait phenotyping (with spatial and temporal replications within a common garden) and genotyping (with a 34K Populus SNP array) of all accessions were used for gene discovery in a genome-wide association study (GWAS).

  14. Association of interleukin-1 gene variations with moderate to severe chronic periodontitis in multiple ethnicities

    PubMed Central

    Wu, X; Offenbacher, S; López, N J; Chen, D; Wang, H-Y; Rogus, J; Zhou, J; Beck, J; Jiang, S; Bao, X; Wilkins, L; Doucette-Stamm, L; Kornman, K

    2015-01-01

    Background and Objective Genetic markers associated with disease are often non-functional and generally tag one or more functional “causative” variants in linkage disequilibrium. Markers may not show tight linkage to the causative variants across multiple ethnicities due to evolutionary divergence, and therefore may not be informative across different population groups. Validated markers of disease suggest causative variants exist in the gene and, if the causative variants can be identified, it is reasonable to hypothesize that such variants will be informative across diverse populations. The aim of this study was to test that hypothesis using functional Interleukin-1 (IL-1) gene variations across multiple ethnic populations to replace the non-functional markers originally associated with chronic adult periodontitis in Caucasians. Material and Methods Adult chronic periodontitis cases and controls from four ethnic groups (Caucasians, African Americans, Hispanics and Asians) were recruited in the USA, Chile and China. Genotypes of IL1B gene single nucleotide polymorphisms (SNPs), including three functional SNPs (rs16944, rs1143623, rs4848306) in the promoter and one intronic SNP (rs1143633), were determined using a single base extension method or TaqMan 5′ nuclease assay. Logistic regression and other statistical analyses were used to examine the association between moderate to severe periodontitis and IL1B gene variations, including SNPs, haplotypes and composite genotypes. Genotype patterns associated with disease in the discovery study were then evaluated in independent validation studies. Results Significant associations were identified in the discovery study, consisting of Caucasians and African Americans, between moderate to severe adult chronic periodontitis and functional variations in the IL1B gene, including a pattern of four IL1B SNPs (OR = 1.87, p < 0.0001). 
The association between the disease and this IL1B composite genotype pattern was validated in two additional studies consisting of Hispanics (OR = 1.95, p = 0.04) or Asians (OR = 3.27, p = 0.01). A meta-analysis of the three populations supported the association between the IL-1 genotype pattern and moderate to severe periodontitis (OR 1.95; p < 0.001). Our analysis also demonstrated that IL1B gene variations had added value to conventional risk factors in predicting chronic periodontitis. Conclusion This study validated the influence of IL-1 genetic factors on the severity of chronic periodontitis in four different ethnicities. PMID:24690098

  15. Newborn serum retinoic acid level is associated with variants of genes in the retinol metabolism pathway.

    PubMed

    Manolescu, Daniel C; El-Kares, Reyhan; Lakhal-Chaieb, Lajmi; Montpetit, Alexandre; Bhat, Pangala V; Goodyer, Paul

    2010-06-01

    Retinoic acid (RA) is a critical regulator of gene expression during embryonic development. In rodents, moderate maternal vitamin A deficiency leads to subtle morphogenetic defects and inactivation of RA pathway genes causes major disturbances of embryogenesis. In this study, we quantified RA in umbilical cord blood of 145 healthy full-term Caucasian infants from Montreal. Sixty seven percent of values were <10 nmol/L (84 were <0.07 nmol/L) and 33% had moderate or high levels. Variation in RA could not be explained by parallel variation in its precursor, retinol (ROL). However, we found that the (A) allele of the rs12591551 single nucleotide polymorphism (SNP) in the ALDH1A2 gene (ALDH1A2rs12591551(A)), occurring in 19% of newborns, was associated with 2.5-fold higher serum RA levels. ALDH1A2 encodes retinaldehyde dehydrogenase (RALDH) 2, which synthesizes RA in fetal tissues. We also found that homozygosity for the (A) allele of the rs12724719 SNP in the CRABP2 gene (CRABP2rs12724719(A/A)) was associated with 4.4-fold increase in umbilical cord serum RA. CRABP2 facilitates RA binding to its cognate receptor complex and transfer to the nucleus. We hypothesize that individual variation in RA pathway genes may account for subtle variations in RA-dependent human embryogenesis.

  16. Effects of vertebral number variations on carcass traits and genotyping of Vertnin candidate gene in Kazakh sheep.

    PubMed

    Zhang, Zhifeng; Sun, Yawei; Du, Wei; He, Sangang; Liu, Mingjun; Tian, Changyan

    2017-09-01

    The vertebral number is associated with body length and carcass traits, which represents an economically important trait in farm animals. The variation of vertebral number has been observed in a few mammalian species. However, the variation of vertebral number and quantitative trait loci in sheep breeds have not been well addressed. In our investigation, information including gender, age, carcass weight, carcass length and the number of thoracic and lumbar vertebrae was collected from 624 China Kazakh sheep. The effect of vertebral number variation on carcass weight and carcass length was estimated by a general linear model. Further, the polymorphic sites of the Vertnin (VRTN) gene were identified by sequencing, and the association between genotype and vertebral number variation was analyzed by a one-way analysis of variance model. The variation of thoracolumbar vertebrae number in Kazakh sheep (18 to 20) was smaller than that in Texel sheep (17 to 21). Individuals with 19 thoracolumbar vertebrae (T13L6) were dominant in Kazakh sheep (79.2%). The association study showed that the number of thoracolumbar vertebrae was positively correlated with carcass length and carcass weight, and the correlation was statistically significant for carcass length. To investigate the association of thoracolumbar vertebrae number with the VRTN gene, we genotyped the VRTN gene. A total of 9 polymorphic sites were detected, and only one single nucleotide polymorphism (SNP), rs426367238, showed a statistically significant association with thoracic vertebral number. The variation of thoracolumbar vertebrae number was positively associated with carcass length and carcass weight, especially with carcass length. The VRTN SNP rs426367238, which had a significant effect on thoracic vertebral number, could serve as a candidate marker for further evaluating its influence on thoracolumbar vertebral number.

  17. A common haplotype of the glucokinase gene alters fasting glucose and birth weight: association in six studies and population-genetics analyses.

    PubMed

    Weedon, Michael N; Clark, Vanessa J; Qian, Yudong; Ben-Shlomo, Yoav; Timpson, Nicholas; Ebrahim, Shah; Lawlor, Debbie A; Pembrey, Marcus E; Ring, Susan; Wilkin, Terry J; Voss, Linda D; Jeffery, Alison N; Metcalf, Brad; Ferrucci, Luigi; Corsi, Anna Maria; Murray, Anna; Melzer, David; Knight, Bridget; Shields, Bev; Smith, George Davey; Hattersley, Andrew T; Di Rienzo, Anna; Frayling, Tim M

    2006-12-01

    Fasting glucose is associated with future risk of type 2 diabetes and ischemic heart disease and is tightly regulated despite considerable variation in quantity, type, and timing of food intake. In pregnancy, maternal fasting glucose concentration is an important determinant of offspring birth weight. The key determinant of fasting glucose is the enzyme glucokinase (GCK). Rare mutations of GCK cause fasting hyperglycemia and alter birth weight. The extent to which common variation of GCK explains normal variation of fasting glucose and birth weight is not known. We aimed to comprehensively define the role of variation of GCK in determination of fasting glucose and birth weight, using a tagging SNP (tSNP) approach and studying 19,806 subjects from six population-based studies. Using 22 tSNPs, we showed that the variant rs1799884 is associated with fasting glucose at all ages in the normal population and exceeded genomewide levels of significance (P=10⁻⁹). rs3757840 was also highly significantly associated with fasting glucose (P=8×10⁻⁷), but haplotype analysis revealed that this is explained by linkage disequilibrium (r²=0.2) with rs1799884. A maternal A allele at rs1799884 was associated with a 32-g (95% confidence interval 11-53 g) increase in offspring birth weight (P=.002). Genetic variation influencing birth weight may have conferred a selective advantage in human populations. We performed extensive population-genetics analyses to look for evidence of recent positive natural selection on patterns of GCK variation. However, we found no strong signature of positive selection. In conclusion, a comprehensive analysis of common variation of the glucokinase gene shows that this is the first gene to be reproducibly associated with fasting glucose and fetal growth.
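
    As a rough consistency check on the birth-weight result above, the sketch below back-calculates a standard error and a two-sided p-value from the reported estimate (32 g) and its 95% confidence interval (11-53 g). It assumes a normal approximation and a symmetric interval; the function name p_from_ci is hypothetical and this is not the authors' analysis.

```python
import math


def p_from_ci(estimate, lower, upper, z_crit=1.959964):
    """Return (se, z, p) implied by a symmetric 95% CI under a normal approximation."""
    # Half the CI width equals z_crit standard errors.
    se = (upper - lower) / (2 * z_crit)
    z = estimate / se
    # Two-sided tail probability of a standard normal variable.
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p


# Values taken from the abstract: 32 g increase, 95% CI 11-53 g.
se, z, p = p_from_ci(32.0, 11.0, 53.0)
print(f"SE = {se:.2f} g, z = {z:.2f}, p = {p:.4f}")
```

    Under these assumptions the implied p-value is of the same order as the reported P=.002; small differences are expected because the published bounds are rounded.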

  18. Effect of P450 Oxidoreductase Polymorphisms on the Metabolic Activities of Ten Cytochrome P450s Varied by Polymorphic CYP Genotypes in Human Liver Microsomes.

    PubMed

    Fang, Yan; Gao, Na; Tian, Xin; Zhou, Jun; Zhang, Hai-Feng; Gao, Jie; He, Xiao-Pei; Wen, Qiang; Jia, Lin-Jing; Jin, Han; Qiao, Hai-Ling

    2018-06-27

    Background/Aims: Little is known about the effect of P450 oxidoreductase (POR) gene polymorphisms on the activities of CYPs with multiple genotypes. We genotyped 102 human livers for 18 known POR single nucleotide polymorphisms (SNPs) with allelic frequencies greater than 1% as well as for 27 known SNPs in 10 CYPs. CYP enzyme activities in microsomes prepared from these livers were determined by measuring probe substrate metabolism by high-performance liquid chromatography. We found that the effects of the 18 POR SNPs on 10 CYP activities were CYP genotype-dependent. The POR mutations were significantly associated with decreased overall Km for CYP2B6 and 2E1, and specific genotypes within CYP1A2, 2A6, 2B6, 2C8, 2D6 and 2E1 were identified as being affected by these POR SNPs. Notably, the effect of a specific POR mutation on the activity of a CYP genotype could not be predicted from other CYP genotypes of even the same CYP. When combining one POR SNP with other POR SNPs, a hitherto unrecognized effect of multiple-site POR gene polymorphisms (MSGP) on CYP activity was uncovered, which was not necessarily consistent with the effect of either single POR SNP. The effects of POR SNPs on CYP activities were not only CYP-dependent, but more importantly, CYP genotype-dependent. Moreover, the effect of a POR SNP alone and in combination with other POR SNPs (MSGP) was not always consistent, nor predictable. Understanding the impact of POR gene polymorphisms on drug metabolism necessitates knowing the complete SNP complement of POR and the genotype of the relevant CYPs. © 2018 The Author(s). Published by S. Karger AG, Basel.

  19. Genetic signatures in choline and 1-carbon metabolism are associated with the severity of hepatic steatosis

    PubMed Central

    Corbin, Karen D.; Abdelmalek, Manal F.; Spencer, Melanie D.; da Costa, Kerry-Ann; Galanko, Joseph A.; Sha, Wei; Suzuki, Ayako; Guy, Cynthia D.; Cardona, Diana M.; Torquati, Alfonso; Diehl, Anna Mae; Zeisel, Steven H.

    2013-01-01

    Choline metabolism is important for very low-density lipoprotein secretion, making this nutritional pathway an important contributor to hepatic lipid balance. The purpose of this study was to assess whether the cumulative effects of multiple single nucleotide polymorphisms (SNPs) across genes of choline/1-carbon metabolism and functionally related pathways increase susceptibility to developing hepatic steatosis. In biopsy-characterized cases of nonalcoholic fatty liver disease and controls, we assessed 260 SNPs across 21 genes in choline/1-carbon metabolism. When SNPs were examined individually, using logistic regression, we only identified a single SNP (PNPLA3 rs738409) that was significantly associated with severity of hepatic steatosis after adjusting for confounders and multiple comparisons (P=0.02). However, when groupings of SNPs in similar metabolic pathways were defined using unsupervised hierarchical clustering, we identified groups of subjects with shared SNP signatures that were significantly correlated with steatosis burden (P=0.0002). The lowest and highest steatosis clusters could also be differentiated by ethnicity. However, unique SNP patterns defined steatosis burden irrespective of ethnicity. Our results suggest that analysis of SNP patterns in genes of choline/1-carbon metabolism may be useful for prediction of severity of steatosis in specific subsets of people, and the metabolic inefficiencies caused by these SNPs should be examined further.—Corbin, K. D., Abdelmalek, M. F., Spencer, M. D., da Costa, K.-A., Galanko, J. A., Sha, W., Suzuki, A., Guy, C. D., Cardona, D. M., Torquati, A., Diehl, A. M., Zeisel, S. H. Genetic signatures in choline and 1-carbon metabolism are associated with the severity of hepatic steatosis. PMID:23292069

  20. Whole Genome Sequence Typing to Investigate the Apophysomyces Outbreak following a Tornado in Joplin, Missouri, 2011

    PubMed Central

    Etienne, Kizee A.; Gillece, John; Hilsabeck, Remy; Schupp, Jim M.; Colman, Rebecca; Lockhart, Shawn R.; Gade, Lalitha; Thompson, Elizabeth H.; Sutton, Deanna A.; Neblett-Fanfair, Robyn; Park, Benjamin J.; Turabelidze, George; Keim, Paul; Brandt, Mary E.; Deak, Eszter; Engelthaler, David M.

    2012-01-01

    Case reports of Apophysomyces spp. in immunocompetent hosts have been a result of traumatic deep implantation of Apophysomyces spp. spore-contaminated soil or debris. On May 22, 2011 a tornado occurred in Joplin, MO, leaving 13 tornado victims with Apophysomyces trapeziformis infections as a result of lacerations from airborne material. We used whole genome sequence typing (WGST) for high-resolution phylogenetic SNP analysis of 17 outbreak Apophysomyces isolates and five additional temporally and spatially diverse Apophysomyces control isolates (three A. trapeziformis and two A. variabilis isolates). Whole genome SNP phylogenetic analysis revealed three clusters of genotypically related or identical A. trapeziformis isolates and multiple distinct isolates among the Joplin group; this indicated multiple genotypes from a single or multiple sources. Though no linkage between genotype and location of exposure was observed, WGST analysis determined that the Joplin isolates were more closely related to each other than to the control isolates, suggesting local population structure. Additionally, species delineation based on WGST demonstrated the need to reassess currently accepted taxonomic classifications of phylogenetic species within the genus Apophysomyces. PMID:23209631

  1. Whole genome sequence typing to investigate the Apophysomyces outbreak following a tornado in Joplin, Missouri, 2011.

    PubMed

    Etienne, Kizee A; Gillece, John; Hilsabeck, Remy; Schupp, Jim M; Colman, Rebecca; Lockhart, Shawn R; Gade, Lalitha; Thompson, Elizabeth H; Sutton, Deanna A; Neblett-Fanfair, Robyn; Park, Benjamin J; Turabelidze, George; Keim, Paul; Brandt, Mary E; Deak, Eszter; Engelthaler, David M

    2012-01-01

    Case reports of Apophysomyces spp. in immunocompetent hosts have been a result of traumatic deep implantation of Apophysomyces spp. spore-contaminated soil or debris. On May 22, 2011 a tornado occurred in Joplin, MO, leaving 13 tornado victims with Apophysomyces trapeziformis infections as a result of lacerations from airborne material. We used whole genome sequence typing (WGST) for high-resolution phylogenetic SNP analysis of 17 outbreak Apophysomyces isolates and five additional temporally and spatially diverse Apophysomyces control isolates (three A. trapeziformis and two A. variabilis isolates). Whole genome SNP phylogenetic analysis revealed three clusters of genotypically related or identical A. trapeziformis isolates and multiple distinct isolates among the Joplin group; this indicated multiple genotypes from a single or multiple sources. Though no linkage between genotype and location of exposure was observed, WGST analysis determined that the Joplin isolates were more closely related to each other than to the control isolates, suggesting local population structure. Additionally, species delineation based on WGST demonstrated the need to reassess currently accepted taxonomic classifications of phylogenetic species within the genus Apophysomyces.

  2. Deep Resequencing Unveils Genetic Architecture of ADIPOQ and Identifies a Novel Low-Frequency Variant Strongly Associated With Adiponectin Variation

    PubMed Central

    Warren, Liling L.; Li, Li; Nelson, Matthew R.; Ehm, Margaret G.; Shen, Judong; Fraser, Dana J.; Aponte, Jennifer L.; Nangle, Keith L.; Slater, Andrew J.; Woollard, Peter M.; Hall, Matt D.; Topp, Simon D.; Yuan, Xin; Cardon, Lon R.; Chissoe, Stephanie L.; Mooser, Vincent; Morris, Andrew D.; Palmer, Colin N.A.; Perry, John R.; Frayling, Timothy M.; Whittaker, John C.; Waterworth, Dawn M.

    2012-01-01

    Increased adiponectin levels have been shown to be associated with a lower risk of type 2 diabetes. To understand the relations between genetic variation at the adiponectin-encoding gene, ADIPOQ, and adiponectin levels, and subsequently its role in disease, we conducted a deep resequencing experiment of ADIPOQ in 14,002 subjects, including 12,514 Europeans, 594 African Americans, and 567 Indian Asians. We identified 296 single nucleotide polymorphisms (SNPs), including 30 amino acid changes, and carried out association analyses in a subset of 3,665 subjects from two independent studies. We confirmed multiple genome-wide association study findings and identified a novel association between a low-frequency SNP (rs17366653) and adiponectin levels (P = 2.2E–17). We show that seven SNPs exert independent effects on adiponectin levels. Together, they explained 6% of adiponectin variation in our samples. We subsequently assessed association between these SNPs and type 2 diabetes in the Genetics of Diabetes Audit and Research in Tayside Scotland (GO-DARTS) study, comprised of 5,145 case and 6,374 control subjects. No evidence of association with type 2 diabetes was found, but we were also unable to exclude the possibility of substantial effects (e.g., odds ratio 95% CI for rs17366653 [0.91–1.58]). Further investigation by large-scale and well-powered Mendelian randomization studies is warranted. PMID:22403302

  3. Incorporation of Personal Single Nucleotide Polymorphism (SNP) Data into a National Level Electronic Health Record for Disease Risk Assessment, Part 3: An Evaluation of SNP Incorporated National Health Information System of Turkey for Prostate Cancer

    PubMed Central

    Beyan, Timur

    2014-01-01

    Background A personalized medicine approach provides opportunities for predictive and preventive medicine. Using genomic, clinical, environmental, and behavioral data, the tracking and management of individual wellness is possible. A prolific way to carry this personalized approach into routine practices can be accomplished by integrating clinical interpretations of genomic variations into electronic medical records (EMRs)/electronic health records (EHRs). Today, various central EHR infrastructures have been constituted in many countries of the world, including Turkey. Objective As an initial attempt to develop a sophisticated infrastructure, we have concentrated on incorporating the personal single nucleotide polymorphism (SNP) data into the National Health Information System of Turkey (NHIS-T) for disease risk assessment, and evaluated the performance of various predictive models for prostate cancer cases. We present our work as a three part miniseries: (1) an overview of requirements, (2) the incorporation of SNP data into the NHIS-T, and (3) an evaluation of SNP data incorporated into the NHIS-T for prostate cancer. Methods In the third article of this miniseries, we have evaluated the proposed complementary capabilities (ie, knowledge base and end-user application) with real data. Before the evaluation phase, clinicogenomic associations about increased prostate cancer risk were extracted from knowledge sources, and published predictive genomic models assessing individual prostate cancer risk were collected. To evaluate complementary capabilities, we also gathered personal SNP data of four prostate cancer cases and fifteen controls. Using these data files, we compared various independent and model-based, prostate cancer risk assessment approaches. Results Through the extraction and selection processes of SNP-prostate cancer risk associations, we collected 209 independent associations for increased risk of prostate cancer from the studied knowledge sources. 
Also, we gathered six cumulative models and two probabilistic models. Cumulative models and assessment of independent associations did not have impressive results. One of the probabilistic, model-based interpretations was successful compared with the others. In envirobehavioral and clinical evaluations, we found that some comorbidities in particular would be useful for evaluating disease risk. Even though we had a very limited dataset, a comparison of performances of different disease models and their implementation with real data as use case scenarios helped us to gain deeper insight into the proposed architecture. Conclusions In order to benefit from genomic variation data, existing EHR/EMR systems must be constructed with the capability of tracking and monitoring all aspects of personal health status (genomic, clinical, environmental, etc) in 24/7 situations, and also with the capability of suggesting evidence-based recommendations. A national-level, accredited knowledge base is a top requirement for improved end-user systems interpreting these parameters. Finally, categorization using similar, individual characteristics (SNP patterns, exposure history, etc) may be an effective way to predict disease risks, but this approach needs to be concretized and supported with new studies. PMID:25600087

  4. The use of population-scale sequencing to identify CNVs impacting productive traits in different cattle breeds

    USDA-ARS's Scientific Manuscript database

    Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...

  5. Linkage Disequilibrium And Genome-Wide Association Studies In O. sativa

    USDA-ARS's Scientific Manuscript database

    There is increasing evidence that genome-wide association studies provide a powerful approach to find the genetic basis of complex phenotypic variation in all kinds of species. For this purpose, we developed the first generation 44K Affymetrix SNP array in rice (see Tung et al. poster). We genotyped...

  6. Population genetics related to adaptation in elite oat germplasm

    USDA-ARS's Scientific Manuscript database

    Six hundred thirty five oat lines and 2,635 SNP loci were used to evaluate population structure, linkage disequilibrium (LD) and genotype-phenotype association with heading date. The first five principal components (PC) accounted for 25.3% of genetic variation. Neither the eigenvalues of the first 2...

  7. Population-Specific Patterns of Linkage Disequilibrium and SNP Variation in Spring and Winter Polyploid Wheat

    USDA-ARS's Scientific Manuscript database

    Single nucleotide polymorphisms (SNPs) are ideally suited for the construction of high-resolution genetic maps, studying population evolutionary history and performing genome-wide association mapping experiments. Here we used a genome-wide set of 1536 SNPs to study linkage disequilibrium (LD) and po...

  8. International Cancer of the Head and Neck, Genetics and Environment (InterCHANGE) Study

    ClinicalTrials.gov

    2013-10-29

    Evaluate the Association Between Certain Environmental Exposures (e.g. Cigarette Smoking, Alcohol Drinking, Betel Nut Chewing…) and Head and Neck Cancers; Assess the Effect of Genetic Factors, Including Both SNP and Copy Number Variation (CNV) Through Analysis of Both Main Effect and Gene-gene Interaction

  9. Circulating insulin-like growth factors and Alzheimer disease: A mendelian randomization study.

    PubMed

    Williams, Dylan M; Karlsson, Ida K; Pedersen, Nancy L; Hägg, Sara

    2018-01-23

    To examine whether genetically predicted variation in circulating insulin-like growth factor 1 (IGF1) or its binding protein, IGFBP3, are associated with risk of Alzheimer disease (AD), using a mendelian randomization study design. We first examined disease risk by genotypes of 9 insulin-like growth factor (IGF)-related single nucleotide polymorphisms (SNPs) using published summary genome-wide association statistics from the International Genomics of Alzheimer's Project (IGAP; n = 17,008 cases; 37,154 controls). We then assessed whether any SNP-disease results replicated in an independent sample derived from the Swedish Twin Registry (n = 984 cases; 10,304 controls). Meta-analyses of SNP-AD results did not suggest that variation in IGF1, IGFBP3, or the molar ratio of these affect AD risk. Only one SNP appeared to affect AD risk in IGAP data. This variant is located in the gene FOXO3, implicated in human longevity. In a meta-analysis of both IGAP and secondary data, the odds ratio of AD per FOXO3 risk allele was 1.04 (95% confidence interval 1.01-1.08; p = 0.008). These findings suggest that circulating IGF1 and IGFBP3 are not important determinants of AD risk. FOXO3 function may influence AD development via pathways that are independent of IGF signaling (i.e., pleiotropic actions). Copyright © 2017 The Author(s). Published by Wolters Kluwer Health, Inc. on behalf of the American Academy of Neurology.

  10. Targeted capture and resequencing of 1040 genes reveal environmentally driven functional variation in grey wolves.

    PubMed

    Schweizer, Rena M; Robinson, Jacqueline; Harrigan, Ryan; Silva, Pedro; Galverni, Marco; Musiani, Marco; Green, Richard E; Novembre, John; Wayne, Robert K

    2016-01-01

    In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphism (SNP) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlights their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations. © 2015 John Wiley & Sons Ltd.

  11. Single Marker and Haplotype-Based Association Analysis of Semolina and Pasta Colour in Elite Durum Wheat Breeding Lines Using a High-Density Consensus Map.

    PubMed

    N'Diaye, Amidou; Haile, Jemanesh K; Cory, Aron T; Clarke, Fran R; Clarke, John M; Knox, Ron E; Pozniak, Curtis J

    2017-01-01

    Association mapping is usually performed by testing the correlation between a single marker and phenotypes. However, because patterns of variation within genomes are inherited as blocks, clustering markers into haplotypes for genome-wide scans could be a worthwhile approach to improve statistical power to detect associations. The availability of high-density molecular data allows the possibility to assess the potential of both approaches to identify marker-trait associations in durum wheat. In the present study, we used single marker- and haplotype-based approaches to identify loci associated with semolina and pasta colour in durum wheat, the main objective being to evaluate the potential benefits of haplotype-based analysis for identifying quantitative trait loci. One hundred sixty-nine durum lines were genotyped using the Illumina 90K Infinium iSelect assay, and 12,234 polymorphic single nucleotide polymorphism (SNP) markers were generated and used to assess the population structure and the linkage disequilibrium (LD) patterns. A total of 8,581 SNPs previously localized to a high-density consensus map were clustered into 406 haplotype blocks based on the average LD distance of 5.3 cM. Combining multiple SNPs into haplotype blocks increased the average polymorphism information content (PIC) from 0.27 per SNP to 0.50 per haplotype. The haplotype-based analysis identified 12 loci associated with grain pigment colour traits, including the five loci identified by the single marker-based analysis. Furthermore, the haplotype-based analysis resulted in an increase of the phenotypic variance explained (50.4% on average) and the allelic effect (33.7% on average) when compared to single marker analysis. The presence of multiple allelic combinations within each haplotype locus offers potential for screening the most favorable haplotype series and may facilitate marker-assisted selection of grain pigment colour in durum wheat. 
These results suggest a benefit of haplotype-based analysis over single marker analysis to detect loci associated with colour traits in durum wheat.
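
    To illustrate why grouping SNPs into haplotype blocks can raise the polymorphism information content (PIC), the sketch below computes PIC from allele frequencies. It assumes the commonly used Botstein-style definition, PIC = 1 - sum(p_i^2) - sum over i<j of 2 * p_i^2 * p_j^2; the abstract does not state which formula was applied, and the frequencies used here are invented for illustration.

```python
from itertools import combinations


def pic(freqs):
    """Polymorphism information content for one locus (assumed Botstein-style formula)."""
    if abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    # Probability that two randomly drawn alleles are identical.
    homozygosity = sum(p * p for p in freqs)
    # Correction term over all unordered pairs of distinct alleles.
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross


# A biallelic SNP cannot exceed PIC = 0.375 (reached at equal allele frequencies).
print(pic([0.5, 0.5]))
# A hypothetical haplotype block with four equally frequent haplotypes scores higher.
print(pic([0.25, 0.25, 0.25, 0.25]))
```

    Because a biallelic marker is capped at 0.375 under this definition, any average above that value, such as the 0.50 per haplotype reported above, requires loci with more than two alleles.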

  12. Candidate Gene Study of TRAIL and TRAIL Receptors: Association with Response to Interferon Beta Therapy in Multiple Sclerosis Patients

    PubMed Central

    Órpez-Zafra, Teresa; Pinto-Medel, María Jesús; Oliver-Martos, Begoña; Ortega-Pinazo, Jesús; Arnáiz, Carlos; Guijarro-Castro, Cristina; Varadé, Jezabel; Álvarez-Lafuente, Roberto; Urcelay, Elena; Sánchez-Jiménez, Francisca

    2013-01-01

    TRAIL and TRAIL Receptor genes have been implicated in Multiple Sclerosis pathology as well as in the response to IFN beta therapy. The objective of our study was to evaluate the association of these genes in relation to the age at disease onset (AAO) and to the clinical response upon IFN beta treatment in Spanish MS patients. We carried out a candidate gene study of TRAIL, TRAILR-1, TRAILR-2, TRAILR-3 and TRAILR-4 genes. A total of 54 SNPs were analysed in 509 MS patients under IFN beta treatment, and an additional cohort of 226 MS patients was used to validate the results. Associations of rs1047275 in TRAILR-2 and rs7011559 in TRAILR-4 genes with AAO under an additive model did not withstand Bonferroni correction. In contrast, patients with the TRAILR-1 rs20576-CC genotype showed a better clinical response to IFN beta therapy compared with patients carrying the A-allele (recessive model: p = 8.88×10⁻⁴, pc = 0.048, OR = 0.30). This SNP resulted in a nonsynonymous substitution of Glutamic acid to Alanine in position 228 (E228A), a change previously associated with susceptibility to different cancer types and risk of metastases, suggesting a lack of functionality of TRAILR-1. In order to unravel how this amino acid change in TRAILR-1 would affect the death signal, we performed molecular modelling with both alleles. Neither TRAIL binding sites in the receptor nor the expression levels of TRAILR-1 in peripheral blood mononuclear cell subsets (monocytes, CD4+ and CD8+ T cells) were modified, suggesting that this SNP may be altering the death signal by some other mechanism. These findings show a role for TRAILR-1 gene variations in the clinical outcome of IFN beta therapy that might have relevance as a biomarker to predict the response to IFN beta in MS. PMID:23658636
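
    The corrected p-value reported above can be reproduced with a plain Bonferroni adjustment, as sketched below. The sketch assumes the correction factor was the 54 SNPs analysed; the abstract does not state the factor explicitly, and the function name bonferroni is hypothetical.

```python
def bonferroni(p_value, n_tests):
    """Bonferroni-adjusted p-value, capped at 1."""
    if n_tests < 1:
        raise ValueError("n_tests must be at least 1")
    return min(1.0, p_value * n_tests)


# Uncorrected p-value from the abstract; 54 tests is an assumption.
pc = bonferroni(8.88e-4, 54)
print(round(pc, 3))  # 0.048
```

    Under that assumption the adjusted value matches the reported pc = 0.048 and stays just below the conventional 0.05 threshold.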

  13. Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology

    PubMed Central

    Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N.; Kumar, Dibyendu

    2017-01-01

    Background RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. Results The RNA-seq experiment on bovine liver produced 1,071,144,072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indels per young bull. Following the removal of the indel mutations, a total of 1,953,804, 1,527,120 and 2,053,184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. 
The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASP™) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. Conclusions This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASP™ SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments. PMID:28234981
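
    The stringent filter described above (at least 10 reads supporting the polymorphic base and a 100% SNP ratio) can be sketched as a simple predicate over candidate calls. The Call record, its field names and the example values are hypothetical, and reading "SNP ratio" as variant-supporting reads divided by total reads at the site is an assumption, not a detail given in the abstract.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Call:
    """A hypothetical candidate SNP call with read-support counts."""
    chrom: str
    pos: int
    alt_reads: int    # reads supporting the polymorphic base
    total_reads: int  # all reads covering the site


def passes(call, min_alt_depth=10, min_ratio=1.0):
    """Keep calls with enough supporting reads and a sufficient SNP ratio."""
    if call.total_reads == 0:
        return False
    ratio = call.alt_reads / call.total_reads
    return call.alt_reads >= min_alt_depth and ratio >= min_ratio


calls = [
    Call("chr1", 100, 12, 12),  # deep enough, all reads support the variant
    Call("chr1", 250, 9, 9),    # 100% ratio but too shallow
    Call("chr2", 40, 15, 20),   # deep enough but only 75% ratio
]
kept = [c for c in calls if passes(c)]
print([(c.chrom, c.pos) for c in kept])
```

    With these invented inputs only the first call survives, which mirrors how a strict depth and ratio filter shrinks a breed-level SNP database to a few thousand records.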

  14. Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.

    PubMed

    Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi

    2013-03-01

    New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools was applied to a group of 201 multibacillary patients including six multi-case families from eleven departments. The locations (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614(T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614(C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. Copyright © 2012 Elsevier B.V. All rights reserved.
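
    The compact genotype codes used above (C54, T45, C64) combine the SNP7614 allele with the copy numbers at minisatellite loci 27-5 and 12-5. The sketch below shows one way such a label could be composed; the function name genotype_label is hypothetical and the code is only an illustration of the naming scheme, not the authors' software.

```python
def genotype_label(snp7614_allele, copies_27_5, copies_12_5):
    """Build a label such as 'C54' from the SNP allele and two VNTR copy numbers."""
    allele = snp7614_allele.upper()
    if allele not in {"C", "T"}:
        raise ValueError("SNP7614 allele must be C or T")
    # Order follows the abstract: SNP7614 allele, then 27-5 copies, then 12-5 copies.
    return f"{allele}{copies_27_5}{copies_12_5}"


print(genotype_label("C", 5, 4))  # C54
print(genotype_label("T", 4, 5))  # T45
print(genotype_label("C", 6, 4))  # C64
```

    A label built this way is only unambiguous while both copy numbers are single digits, which holds for the genotypes named in the abstract.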

  15. Using Drosophila melanogaster as a Model for Genotoxic Chemical Mutational Studies with a New Program, SnpSift

    PubMed Central

    Cingolani, Pablo; Patel, Viral M.; Coon, Melissa; Nguyen, Tung; Land, Susan J.; Ruden, Douglas M.; Lu, Xiangyi

    2012-01-01

    This paper describes a new program SnpSift for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms, multiple nucleotide polymorphisms, insertions, and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of polymerase chain reaction-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate premature stop codon mutation in each of the two allelic mutants whereas the other four candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic chemicals. PMID:22435069
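
    The filtering step described here (keep genes that are potentially disrupted in both allelic mutants) can be illustrated in plain Python. This is only a sketch of the idea and does not use SnpSift's own expression syntax; the record layout, strain names, positions and the extra gene names are invented for illustration, and "HIGH" is assumed to be the impact label marking disruptive variants.

```python
from collections import defaultdict

# Hypothetical, simplified variant records: (strain, gene, position, impact).
# Real annotated output is VCF; these tuples are an assumption of this sketch.
DISRUPTIVE = {"HIGH"}


def genes_disrupted_in_all(variants, strains):
    """Return genes carrying at least one disruptive variant in every strain."""
    strains_hit = defaultdict(set)
    for strain, gene, _position, impact in variants:
        if impact in DISRUPTIVE:
            strains_hit[gene].add(strain)
    required = set(strains)
    return sorted(gene for gene, hit in strains_hit.items() if hit >= required)


variants = [
    ("mutA", "CG12069", 1001, "HIGH"),  # positions are made up
    ("mutB", "CG12069", 1450, "HIGH"),
    ("mutA", "geneX", 2200, "HIGH"),    # disrupted in one strain only
    ("mutB", "geneY", 3300, "LOW"),     # not disruptive
]
candidates = genes_disrupted_in_all(variants, ["mutA", "mutB"])
```

    In this toy input only CG12069 carries a disruptive variant in both strains, mirroring the kind of result the abstract reports.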

  16. Genetic Variation at 9p22.2 and Ovarian Cancer Risk for BRCA1 and BRCA2 Mutation Carriers

    PubMed Central

    Kartsonaki, Christiana; Gayther, Simon A.; Pharoah, Paul D. P.; Sinilnikova, Olga M.; Beesley, Jonathan; Chen, Xiaoqing; McGuffog, Lesley; Healey, Sue; Couch, Fergus J.; Wang, Xianshu; Fredericksen, Zachary; Peterlongo, Paolo; Manoukian, Siranoush; Peissel, Bernard; Zaffaroni, Daniela; Roversi, Gaia; Barile, Monica; Viel, Alessandra; Allavena, Anna; Ottini, Laura; Papi, Laura; Gismondi, Viviana; Capra, Fabio; Radice, Paolo; Greene, Mark H.; Mai, Phuong L.; Andrulis, Irene L.; Glendon, Gord; Ozcelik, Hilmi; Thomassen, Mads; Gerdes, Anne-Marie; Kruse, Torben A.; Cruger, Dorthe; Jensen, Uffe Birk; Caligo, Maria Adelaide; Olsson, Håkan; Kristoffersson, Ulf; Lindblom, Annika; Arver, Brita; Karlsson, Per; Stenmark Askmalm, Marie; Borg, Ake; Neuhausen, Susan L.; Ding, Yuan Chun; Nathanson, Katherine L.; Domchek, Susan M.; Jakubowska, Anna; Lubiński, Jan; Huzarski, Tomasz; Byrski, Tomasz; Gronwald, Jacek; Górski, Bohdan; Cybulski, Cezary; Dębniak, Tadeusz; Osorio, Ana; Durán, Mercedes; Tejada, Maria-Isabel; Benítez, Javier; Hamann, Ute; Rookus, Matti A.; Verhoef, Senno; Tilanus-Linthorst, Madeleine A.; Vreeswijk, Maaike P.; Bodmer, Danielle; Ausems, Margreet G. E. M.; van Os, Theo A.; Asperen, Christi J.; Blok, Marinus J.; Meijers-Heijboer, Hanne E. J.; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Dunning, Alison M.; Evans, D. Gareth; Eeles, Ros; Pichert, Gabriella; Cole, Trevor; Hodgson, Shirley; Brewer, Carole; Morrison, Patrick J.; Porteous, Mary; Kennedy, M. John; Rogers, Mark T.; Side, Lucy E.; Donaldson, Alan; Gregory, Helen; Godwin, Andrew; Stoppa-Lyonnet, Dominique; Moncoutier, Virginie; Castera, Laurent; Mazoyer, Sylvie; Barjhoux, Laure; Bonadona, Valérie; Leroux, Dominique; Faivre, Laurence; Lidereau, Rosette; Nogues, Catherine; Bignon, Yves-Jean; Prieur, Fabienne; Collonge-Rame, Marie-Agnès; Venat-Bouvet, Laurence; Fert-Ferrer, Sandra; Miron, Alex; Buys, Saundra S.; Hopper, John L.; Daly, Mary B.; John, Esther M.; Terry, Mary Beth; Goldgar, David; Hansen, Thomas v. O.; Jønson, Lars; Ejlertsen, Bent; Agnarsson, Bjarni A.; Offit, Kenneth; Kirchhoff, Tomas; Vijai, Joseph; Dutra-Clarke, Ana V. C.; Przybylo, Jennifer A.; Montagna, Marco; Casella, Cinzia; Imyanitov, Evgeny N.; Janavicius, Ramunas; Blanco, Ignacio; Lázaro, Conxi; Moysich, Kirsten B.; Karlan, Beth Y.; Gross, Jenny; Beattie, Mary S.; Schmutzler, Rita; Wappenschmidt, Barbara; Meindl, Alfons; Ruehl, Ina; Fiebig, Britta; Sutter, Christian; Arnold, Norbert; Deissler, Helmut; Varon-Mateeva, Raymonda; Kast, Karin; Niederacher, Dieter; Gadzicki, Dorothea; Caldes, Trinidad; de la Hoya, Miguel; Nevanlinna, Heli; Aittomäki, Kristiina; Simard, Jacques; Soucy, Penny; Spurdle, Amanda B.; Holland, Helene; Chenevix-Trench, Georgia; Easton, Douglas F.; Antoniou, Antonis C.

    2011-01-01

    Background Germline mutations in the BRCA1 and BRCA2 genes are associated with increased risks of breast and ovarian cancers. Although several common variants have been associated with breast cancer susceptibility in mutation carriers, none have been associated with ovarian cancer susceptibility. A genome-wide association study recently identified an association between the rare allele of the single-nucleotide polymorphism (SNP) rs3814113 (ie, the C allele) at 9p22.2 and decreased risk of ovarian cancer for women in the general population. We evaluated the association of this SNP with ovarian cancer risk among BRCA1 or BRCA2 mutation carriers by use of data from the Consortium of Investigators of Modifiers of BRCA1/2. Methods We genotyped rs3814113 in 10 029 BRCA1 mutation carriers and 5837 BRCA2 mutation carriers. Associations with ovarian and breast cancer were assessed with a retrospective likelihood approach. All statistical tests were two-sided. Results The minor allele of rs3814113 was associated with a reduced risk of ovarian cancer among BRCA1 mutation carriers (per-allele hazard ratio of ovarian cancer = 0.78, 95% confidence interval = 0.72 to 0.85; P = 4.8 × 10⁻⁹) and BRCA2 mutation carriers (hazard ratio of ovarian cancer = 0.78, 95% confidence interval = 0.67 to 0.90; P = 5.5 × 10⁻⁴). This SNP was not associated with breast cancer risk among either BRCA1 or BRCA2 mutation carriers. BRCA1 mutation carriers with the TT genotype at SNP rs3814113 were predicted to have an ovarian cancer risk to age 80 years of 48%, and those with the CC genotype were predicted to have a risk of 33%. Conclusion Common genetic variation at the 9p22.2 locus was associated with decreased risk of ovarian cancer for carriers of a BRCA1 or BRCA2 mutation. PMID:21169536

  17. Local Variability of Parameters for Characterization of the Corneal Subbasal Nerve Plexus.

    PubMed

    Winter, Karsten; Scheibe, Patrick; Köhler, Bernd; Allgeier, Stephan; Guthoff, Rudolf F; Stachs, Oliver

    2016-01-01

    The corneal subbasal nerve plexus (SNP) offers high potential for early diagnosis of diabetic peripheral neuropathy. Changes in subbasal nerve fibers can be assessed in vivo by confocal laser scanning microscopy (CLSM) and quantified using specific parameters. While current study results agree regarding parameter tendency, there are considerable differences in terms of absolute values. The present study set out to identify factors that might account for this high parameter variability. In three healthy subjects, we used a novel method of software-based large-scale reconstruction that provided SNP images of the central cornea, decomposed the image areas into all possible image sections corresponding to the size of a single conventional CLSM image (0.16 mm²), and calculated a set of parameters for each image section. In order to carry out a large number of virtual examinations within the reconstructed image areas, an extensive simulation procedure (10,000 runs per image) was implemented. The three analyzed images ranged in size from 3.75 mm² to 4.27 mm². The spatial configuration of the subbasal nerve fiber networks varied greatly across the cornea and thus caused heavily location-dependent results as well as wide value ranges for the parameters assessed. Distributions of SNP parameter values varied greatly between the three images and showed significant differences between all images for every parameter calculated (p < 0.001 in each case). The relatively small size of the conventionally evaluated SNP area is a contributory factor in high SNP parameter variability. Averaging of parameter values based on multiple CLSM frames does not necessarily result in good approximations of the respective reference values of the whole image area. This illustrates the potential for examiner bias when selecting SNP images in the central corneal area.
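
    The two computational steps in this abstract (evaluating every possible frame-sized section of a large image, then simulating many "virtual examinations" that average a few frames) can be sketched as follows. The grid, the parameter (a simple mean) and the frame counts are invented for illustration and are not the study's actual parameters or software.

```python
import random
from statistics import mean


def window_means(grid, h, w):
    """Mean value of every h-by-w section that fits inside the grid."""
    rows, cols = len(grid), len(grid[0])
    out = []
    for r in range(rows - h + 1):
        for c in range(cols - w + 1):
            cells = [grid[i][j] for i in range(r, r + h) for j in range(c, c + w)]
            out.append(mean(cells))
    return out


def simulate_exams(section_values, frames_per_exam, runs, seed=0):
    """Average of randomly chosen sections, repeated `runs` times."""
    rng = random.Random(seed)
    return [mean(rng.sample(section_values, frames_per_exam)) for _ in range(runs)]


# Toy 4x4 "nerve density" map; values are invented for illustration.
grid = [
    [1, 1, 5, 5],
    [1, 1, 5, 5],
    [2, 2, 9, 9],
    [2, 2, 9, 9],
]
sections = window_means(grid, 2, 2)      # all 2x2 sections of the map
whole_image = mean(v for row in grid for v in row)
exams = simulate_exams(sections, frames_per_exam=3, runs=1000)
spread = (min(exams), max(exams))        # how far exam averages can stray
```

    Comparing `spread` with `whole_image` shows, on toy data, how averages over a few small frames can differ from the whole-area reference value.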

  18. GESPA: classifying nsSNPs to predict disease association.

    PubMed

    Khurana, Jay K; Reeder, Jay E; Shrimpton, Antony E; Thakar, Juilee

    2015-07-25

    Non-synonymous single nucleotide polymorphisms (nsSNPs) are the most common DNA sequence variation associated with disease in humans. Thus determining the clinical significance of each nsSNP is of great importance. Potential detrimental nsSNPs may be identified by genetic association studies or by functional analysis in the laboratory, both of which are expensive and time consuming. Existing computational methods lack accuracy and features to facilitate nsSNP classification for clinical use. We developed the GESPA (GEnomic Single nucleotide Polymorphism Analyzer) program to predict the pathogenicity and disease phenotype of nsSNPs. GESPA is a user-friendly software package for classifying disease association of nsSNPs. It allows flexibility in acceptable input formats and predicts the pathogenicity of a given nsSNP by assessing the conservation of amino acids in orthologs and paralogs and supplementing this information with data from medical literature. The development and testing of GESPA was performed using the humsavar, ClinVar and humvar datasets. Additionally, GESPA also predicts the disease phenotype associated with a nsSNP with high accuracy, a feature unavailable in existing software. GESPA's overall accuracy exceeds existing computational methods for predicting nsSNP pathogenicity. The usability of GESPA is enhanced by fast SQL-based cloud storage and retrieval of data. GESPA is a novel bioinformatics tool to determine the pathogenicity and phenotypes of nsSNPs. We anticipate that GESPA will become a useful clinical framework for predicting the disease association of nsSNPs. The program, executable jar file, source code, GPL 3.0 license, user guide, and test data with instructions are available at http://sourceforge.net/projects/gespa.

  19. Association analysis for feet and legs disorders with whole-genome sequence variants in 3 dairy cattle breeds.

    PubMed

    Wu, Xiaoping; Guldbrandtsen, Bernt; Lund, Mogens Sandø; Sahana, Goutam

    2016-09-01

    Identification of genetic variants associated with feet and legs disorders (FLD) will aid in the genetic improvement of these traits by providing knowledge on genes that influence trait variations. In Denmark, FLD in cattle has been recorded since the 1990s. In this report, we used deregressed breeding values as response variables for a genome-wide association study. Bulls (5,334 Danish Holstein, 4,237 Nordic Red Dairy Cattle, and 1,180 Danish Jersey) with deregressed estimated breeding values were genotyped with the Illumina Bovine 54k single nucleotide polymorphism (SNP) genotyping array. Genotypes were imputed to whole-genome sequence variants, and then 22,751,039 SNP on 29 autosomes were used for an association analysis. A modified linear mixed-model approach (efficient mixed-model association eXpedited, EMMAX) and a linear mixed model were used for association analysis. We identified 5 (3,854 SNP), 3 (13,642 SNP), and 0 quantitative trait locus (QTL) regions associated with the FLD index in Danish Holstein, Nordic Red Dairy Cattle, and Danish Jersey populations, respectively. We did not identify any QTL that were common among the 3 breeds. In a meta-analysis of the 3 breeds, 4 QTL regions were significant, but no additional QTL region was identified compared with within-breed analyses. Comparison between top SNP locations within these QTL regions and known genes suggested that RASGRP1, LCORL, MOS, and MITF may be candidate genes for FLD in dairy cattle. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  20. Temperature gradient affects differentiation of gene expression and SNP allele frequencies in the dominant Lake Baikal zooplankton species.

    PubMed

    Bowman, Larry L; Kondrateva, Elizaveta S; Timofeyev, Maxim A; Yampolsky, Lev Y

    2018-06-01

    Local adaptation and phenotypic plasticity are main mechanisms of organisms' resilience in changing environments. Both are affected by gene flow and are expected to be weak in zooplankton populations inhabiting large continuous water bodies and strongly affected by currents. Lake Baikal, the deepest and one of the coldest lakes on Earth, experienced epilimnion temperature increase during the last 100 years, exposing Baikal's zooplankton to novel selective pressures. We obtained a partial transcriptome of Epischura baikalensis (Copepoda: Calanoida), the dominant component of Baikal's zooplankton, and estimated SNP allele frequencies and transcript abundances in samples from regions of Baikal that differ in multiyear average surface temperatures. The strongest signal in both SNP and transcript abundance differentiation is the SW-NE gradient along the 600+ km long axis of the lake, suggesting isolation by distance. SNP differentiation is stronger for nonsynonymous than synonymous SNPs and is paralleled by differential survival during a laboratory exposure to increased temperature, indicating directional selection operating on the temperature gradient. Transcript abundance, generally collinear with the SNP differentiation, shows samples from the warmest, less deep location clustering together with the southernmost samples. Differential expression is more frequent among transcripts orthologous to candidate thermal response genes previously identified in model arthropods, including genes encoding cytoskeleton proteins, heat-shock proteins, proteases, enzymes of central energy metabolism, lipid and antioxidant pathways. We conclude that the pivotal endemic zooplankton species in Lake Baikal exists under temperature-mediated selection and possesses both genetic variation and plasticity to respond to novel temperature-related environmental pressures. © 2018 John Wiley & Sons Ltd.

  1. A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus).

    PubMed

    Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M

    2014-10-06

    The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplasin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost-efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplasin (EDA).

  2. Genetic variations in CYP17A1, CACNB2 and PLEKHA7 are associated with blood pressure and/or hypertension in She ethnic minority of China.

    PubMed

    Lin, Yinghua; Lai, Xiaolan; Chen, Bin; Xu, Yuan; Huang, Baoying; Chen, Zichun; Zhu, Shaoheng; Yao, Jin; Jiang, Qiqin; Huang, Huibin; Wen, Junping; Chen, Gang

    2011-12-01

    Two large-scale genome-wide association studies (GWAS) have identified multiple variants associated with blood pressure (BP) or hypertension. The present study investigated whether some of these variations were associated with BP traits and hypertension, or even prehypertension, in adults of the She ethnic minority of China. The study population comprised 4460 unrelated individuals of the She ethnic minority (1979 males and 2481 females) from a cross-sectional study in Ningde City, Fujian province, China. There were 1692 hypertensives, 1600 prehypertensives and 1168 normotensive controls. We genotyped 7 variants in CYP17A1, PLEKHA7, CACNB2, ATP2B1, TBX3-TBX5, CSK-ULK3 and SH2B3 reported by the previous GWAS on Europeans. All analyses were performed in an additive genetic model. As the minor allele of rs653178 in/near SH2B3 was very rare, with a frequency of 0.018, we excluded this single nucleotide polymorphism (SNP) from further analyses. Of the other 6 loci, linear regression analyses revealed that rs11191548 in CYP17A1 and rs11014166 in CACNB2 were significantly associated with systolic BP (β = -1.17, P = 0.002 and β = -0.50, P = 0.006, respectively), while only SNP rs11191548 was significantly associated with diastolic BP (β = -0.56, P = 0.002) after adjustment for age, sex and BMI. Two variants in CACNB2 and PLEKHA7 were found to be significantly related to hypertension (odds ratios [OR] and (95% confidence interval [CI]): 0.79 (0.65-0.97) and 1.19 (1.01-1.41), respectively) in logistic regression analyses after adjustment for age, sex and BMI. In addition, we found that combined risk alleles of the 6 SNPs increased risk of hypertension in a stepwise fashion (P for trend < 0.001). However, none of the 6 SNPs was individually significantly associated with BMI or prehypertension status, whereas logistic analysis showed that subjects with more than 9 cumulative risk alleles had a significantly higher risk for prehypertension (adjusted OR: 3.10, P < 0.001) compared with those with fewer than 4 risk alleles. We replicated that variations in CYP17A1, CACNB2 and PLEKHA7 were related to BP traits and/or hypertension in the She population. In addition, although we failed to observe any single gene associated with prehypertension, we found for the first time that the conjoint effect of multiple risk alleles on BP might increase the risk of progressing to prehypertension. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
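
    The cumulative risk-allele comparison in this abstract (more than 9 versus fewer than 4 risk alleles across 6 SNPs) rests on a simple additive count: each SNP contributes 0, 1 or 2 copies of its risk allele. A minimal sketch, assuming genotypes are stored as two-letter strings; the SNP names and risk alleles below are placeholders, not the alleles reported by the study.

```python
def risk_allele_count(genotypes, risk_alleles):
    """Sum copies (0, 1 or 2 per SNP) of each SNP's risk allele."""
    return sum(genotypes[snp].count(allele) for snp, allele in risk_alleles.items())


def burden_group(count, low_below=4, high_above=9):
    """Group a subject by cumulative count, using the abstract's cut-offs."""
    if count > high_above:
        return "high"
    if count < low_below:
        return "low"
    return "intermediate"


# Placeholder SNP identifiers and risk alleles (assumptions of this sketch).
risk_alleles = {"snp1": "A", "snp2": "G", "snp3": "T", "snp4": "C", "snp5": "A", "snp6": "G"}
subject = {"snp1": "AA", "snp2": "GG", "snp3": "TT", "snp4": "CC", "snp5": "AG", "snp6": "GG"}

count = risk_allele_count(subject, risk_alleles)  # 2+2+2+2+1+2
group = burden_group(count)
```

    With 6 biallelic SNPs the count ranges from 0 to 12, so the "more than 9" group holds subjects carrying 10 to 12 risk alleles.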

  3. The regulatory BCL2 promoter polymorphism (-938C>A) is associated with relapse and survival of patients with oropharyngeal squamous cell carcinoma.

    PubMed

    Lehnerdt, G F; Franz, P; Bankfalvi, A; Grehl, S; Kelava, A; Nückel, H; Lang, S; Schmid, K W; Siffert, W; Bachmann, H S

    2009-06-01

    Expression of the antiapoptotic and antiproliferative protein B-cell lymphoma 2 (Bcl-2) has been repeatedly shown to be associated with better locoregional control and patients' survival in oropharyngeal squamous cell carcinoma (OSCC). A regulatory (-938C>A) single-nucleotide polymorphism (SNP) in the inhibitory P2 BCL2 gene promoter generates significantly different BCL2 promoter activities and has been associated with outcome in different malignancies. The aim of the present study was to analyze the possible influence of the (-938C>A) SNP on survival of patients suffering from OSCC. One hundred and thirty-three patients with primary OSCC were retrospectively investigated. Bcl-2 expression of tumor cells was demonstrated by means of immunohistochemistry. Both the Bcl-2 expression and the (-938C>A) genotypes were correlated with the patients' survival. The (-938C>A) SNP was significantly related to Bcl-2 expression (P = 0.008). Kaplan-Meier curves revealed a significant association of the -938 SNP with relapse-free (P = 0.0283) and overall survival (P = 0.0247). Multiple Cox regression identified the BCL2 (-938CC) genotype as an independent prognostic factor for relapse [hazard ratio (HR) 1.898, P = 0.021] as well as for death in OSCC patients (HR 1.897, P = 0.013). The (-938C>A) SNP represents a potential novel prognostic marker in patients with OSCC that could help to identify a group of patients at high risk for relapse and death.

  4. A new risk locus in CHCHD5 for hypertension and obesity in a Chinese child population: a cohort study.

    PubMed

    Wu, Lijun; Gao, Liwang; Zhao, Xiaoyuan; Zhang, Meixian; Wu, Jianxin; Mi, Jie

    2017-09-11

    Coiled-coil-helix-coiled-coil-helix domain containing 5 (CHCHD5), a mitochondrial protein, is involved in the oxidative folding process in the mitochondrial intermembrane space. A previous study identified a hypertension-related single nucleotide polymorphism (SNP), rs3748024, in CHCHD5 in adults, but there are no reports regarding the association between CHCHD5 and obesity, which is a known risk factor for hypertension. The aim of the present study is to investigate the associations of the SNP rs3748024 with hypertension and obesity. Cohort study. Institute of Pediatrics in China. We genotyped the SNP rs3748024 in the Beijing Child and Adolescent Metabolic Syndrome study. A total of 3503 children participated in the study. Genotyping of rs3748024 was conducted using the TaqMan Allelic Discrimination Assay. Lipids and glucose were analysed by an automatic biochemical analyser using a kit assay. The levels of adipocytokines (leptin, adiponectin and resistin) were measured by ELISA techniques. There was a statistically significant association between rs3748024 and systolic blood pressure (SBP) (β=-0.853, 95% CI -1.482 to -0.024, p=0.044) under an additive model adjusted for age, gender and body mass index (BMI) after correction for multiple testing. The SNP was also significantly associated with BMI (β=-0.286, 95% CI -0.551 to -0.021, p=0.043), obesity (OR=0.828, 95% CI 0.723 to 0.949, p=0.018) and triglycerides (β=-0.039, 95% CI -0.070 to -0.007, p=0.044) after correction for multiple testing. We demonstrate for the first time that the SNP rs3748024 in CHCHD5 is associated with SBP, BMI, obesity and triglycerides in Chinese children. Our study identifies a new risk locus for hypertension and obesity in a child population. The function of CHCHD5 remains to be further studied to help elucidate the pathogenic role of CHCHD5 in hypertension and obesity. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  5. Accurate and fast multiple-testing correction in eQTL studies.

    PubMed

    Sul, Jae Hoon; Raj, Towfique; de Jong, Simone; de Bakker, Paul I W; Raychaudhuri, Soumya; Ophoff, Roel A; Stranger, Barbara E; Eskin, Eleazar; Han, Buhm

    2015-06-04

    In studies of expression quantitative trait loci (eQTLs), it is of increasing interest to identify eGenes, the genes whose expression levels are associated with variation at a particular genetic variant. Detecting eGenes is important for follow-up analyses and prioritization because genes are the main entities in biological processes. To detect eGenes, one typically focuses on the genetic variant with the minimum p value among all variants in cis with a gene and corrects for multiple testing to obtain a gene-level p value. For performing multiple-testing correction, a permutation test is widely used. Because of growing sample sizes of eQTL studies, however, the permutation test has become a computational bottleneck in eQTL studies. In this paper, we propose an efficient approach for correcting for multiple testing and assess eGene p values by utilizing a multivariate normal distribution. Our approach properly takes into account the linkage-disequilibrium structure among variants, and its time complexity is independent of sample size. By applying our small-sample correction techniques, our method achieves high accuracy in both small and large studies. We have shown that our method consistently produces extremely accurate p values (accuracy > 98%) for three human eQTL datasets with different sample sizes and SNP densities: the Genotype-Tissue Expression pilot dataset, the multi-region brain dataset, and the HapMap 3 dataset. Copyright © 2015 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
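
    For context, the permutation baseline that this abstract describes as the computational bottleneck can be sketched as below. This is not the authors' multivariate-normal method; it is the widely used permutation test they aim to replace, written with the maximum absolute correlation across cis variants as the statistic (for a fixed sample size this orders variants the same way as the minimum p value). All data are invented.

```python
import random
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation; returns 0.0 if either vector is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / sqrt(sxx * syy)


def egene_permutation_p(expression, genotypes, n_perm=1000, seed=0):
    """Gene-level p value: how often permuted data beat the best cis variant."""
    rng = random.Random(seed)
    observed = max(abs(pearson_r(g, expression)) for g in genotypes)
    shuffled = list(expression)
    hits = 0
    for _ in range(n_perm):
        # Shuffling expression keeps the LD structure among variants intact.
        rng.shuffle(shuffled)
        stat = max(abs(pearson_r(g, shuffled)) for g in genotypes)
        if stat >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


# Toy data: 8 individuals, 2 cis variants coded as 0/1/2 allele counts.
expression = [1.0, 1.2, 2.1, 2.0, 3.1, 2.9, 1.1, 3.0]
genotypes = [
    [0, 0, 1, 1, 2, 2, 0, 2],  # tracks expression closely
    [0, 1, 0, 1, 0, 1, 0, 1],  # unrelated to expression
]
p_gene = egene_permutation_p(expression, genotypes)
```

    The cost grows with both the number of permutations and the sample size, which is the scaling problem the abstract's approach is meant to avoid.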

  6. Development of a forensic skin colour predictive test.

    PubMed

    Maroñas, Olalla; Phillips, Chris; Söchtig, Jens; Gomez-Tato, Antonio; Cruz, Raquel; Alvarez-Dios, José; de Cal, María Casares; Ruiz, Yarimar; Fondevila, Manuel; Carracedo, Ángel; Lareu, María V

    2014-11-01

    There is growing interest in skin colour prediction in the forensic field. However, a lack of consensus approaches for recording skin colour phenotype plus the complicating factors of epistatic effects, environmental influences such as exposure to the sun and unidentified genetic variants, present difficulties for the development of a forensic skin colour predictive test centred on the most strongly associated SNPs. Previous studies have analysed skin colour variation in single unadmixed population groups, including South Asians (Stokowski et al., 2007, Am. J. Hum. Genet, 81: 1119-32) and Europeans (Jacobs et al., 2013, Hum Genet. 132: 147-58). Nevertheless, a major challenge lies in the analysis of skin colour in admixed individuals, where co-ancestry proportions do not necessarily dictate any one person's skin colour. Our study sought to analyse genetic differences between African, European and admixed African-European subjects where direct spectrometric measurements and photographs of skin colour were made in parallel. We identified strong associations to skin colour variation in the subjects studied from a pigmentation SNP discovery panel of 59 markers and developed a forensic online classifier based on naïve Bayes analysis of the SNP profiles made. A skin colour predictive test is described using the ten most strongly associated SNPs in 8 genes linked to skin pigmentation variation. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
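
    The abstract names naïve Bayes analysis of SNP profiles as the basis of its classifier. A generic categorical naïve Bayes with Laplace smoothing is sketched below to show the principle; it is not the published classifier, and the genotype coding (0/1/2), class names and training profiles are invented.

```python
import math
from collections import Counter, defaultdict


def train(profiles, labels, n_genotypes=3, alpha=1.0):
    """Count genotype occurrences per class and SNP position."""
    counts = defaultdict(lambda: defaultdict(Counter))
    for profile, label in zip(profiles, labels):
        for i, genotype in enumerate(profile):
            counts[label][i][genotype] += 1
    return {
        "class_counts": Counter(labels),
        "counts": counts,
        "n": len(labels),
        "n_genotypes": n_genotypes,
        "alpha": alpha,
    }


def predict(model, profile):
    """Return the class with the highest log posterior for a SNP profile."""
    best_label, best_score = None, -math.inf
    for label, n_class in model["class_counts"].items():
        score = math.log(n_class / model["n"])  # class prior
        for i, genotype in enumerate(profile):
            # Laplace-smoothed likelihood, SNPs treated as independent.
            num = model["counts"][label][i][genotype] + model["alpha"]
            den = n_class + model["alpha"] * model["n_genotypes"]
            score += math.log(num / den)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented two-SNP training profiles for two hypothetical classes.
profiles = [(0, 0), (0, 1), (2, 2), (2, 1)]
labels = ["class_A", "class_A", "class_B", "class_B"]
model = train(profiles, labels)
prediction_a = predict(model, (0, 0))
prediction_b = predict(model, (2, 2))
```

    The independence assumption between SNPs is what makes the method "naïve"; with linked markers it is only an approximation.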

  7. The genetic architecture of economic and political preferences

    PubMed Central

    Benjamin, Daniel J.; Cesarini, David; van der Loos, Matthijs J. H. M.; Dawes, Christopher T.; Koellinger, Philipp D.; Magnusson, Patrik K. E.; Chabris, Christopher F.; Conley, Dalton; Laibson, David; Johannesson, Magnus; Visscher, Peter M.

    2012-01-01

    Preferences are fundamental building blocks in all models of economic and political behavior. We study a new sample of comprehensively genotyped subjects with data on economic and political preferences and educational attainment. We use dense single nucleotide polymorphism (SNP) data to estimate the proportion of variation in these traits explained by common SNPs and to conduct genome-wide association study (GWAS) and prediction analyses. The pattern of results is consistent with findings for other complex traits. First, the estimated fraction of phenotypic variation that could, in principle, be explained by dense SNP arrays is around one-half of the narrow heritability estimated using twin and family samples. The molecular-genetic–based heritability estimates, therefore, partially corroborate evidence of significant heritability from behavior genetic studies. Second, our analyses suggest that these traits have a polygenic architecture, with the heritable variation explained by many genes with small effects. Our results suggest that most published genetic association studies with economic and political traits are dramatically underpowered, which implies a high false discovery rate. These results convey a cautionary message for whether, how, and how soon molecular genetic data can contribute to, and potentially transform, research in social science. We propose some constructive responses to the inferential challenges posed by the small explanatory power of individual SNPs. PMID:22566634

  8. The genetic architecture of economic and political preferences.

    PubMed

    Benjamin, Daniel J; Cesarini, David; van der Loos, Matthijs J H M; Dawes, Christopher T; Koellinger, Philipp D; Magnusson, Patrik K E; Chabris, Christopher F; Conley, Dalton; Laibson, David; Johannesson, Magnus; Visscher, Peter M

    2012-05-22

    Preferences are fundamental building blocks in all models of economic and political behavior. We study a new sample of comprehensively genotyped subjects with data on economic and political preferences and educational attainment. We use dense single nucleotide polymorphism (SNP) data to estimate the proportion of variation in these traits explained by common SNPs and to conduct genome-wide association study (GWAS) and prediction analyses. The pattern of results is consistent with findings for other complex traits. First, the estimated fraction of phenotypic variation that could, in principle, be explained by dense SNP arrays is around one-half of the narrow heritability estimated using twin and family samples. The molecular-genetic-based heritability estimates, therefore, partially corroborate evidence of significant heritability from behavior genetic studies. Second, our analyses suggest that these traits have a polygenic architecture, with the heritable variation explained by many genes with small effects. Our results suggest that most published genetic association studies with economic and political traits are dramatically underpowered, which implies a high false discovery rate. These results convey a cautionary message for whether, how, and how soon molecular genetic data can contribute to, and potentially transform, research in social science. We propose some constructive responses to the inferential challenges posed by the small explanatory power of individual SNPs.

  9. Accurate HLA type inference using a weighted similarity graph.

    PubMed

    Xie, Minzhu; Li, Jing; Jiang, Tao

    2010-12-14

    The human leukocyte antigen system (HLA) contains many highly variable genes. HLA genes play an important role in the human immune system, and HLA gene matching is crucial for the success of human organ transplantations. Numerous studies have demonstrated that variation in HLA genes is associated with many autoimmune, inflammatory and infectious diseases. However, typing HLA genes by serology or PCR is time consuming and expensive, which limits large-scale studies involving HLA genes. Since it is much easier and cheaper to obtain single nucleotide polymorphism (SNP) genotype data, accurate computational algorithms to infer HLA gene types from SNP genotype data are in need. To infer HLA types from SNP genotypes, the first step is to infer SNP haplotypes from genotypes. However, for the same SNP genotype data set, the haplotype configurations inferred by different methods are usually inconsistent, and it is often difficult to decide which one is true. In this paper, we design an accurate HLA gene type inference algorithm by utilizing SNP genotype data from pedigrees, known HLA gene types of some individuals and the relationship between inferred SNP haplotypes and HLA gene types. Given a set of haplotypes inferred from the genotypes of a population consisting of many pedigrees, the algorithm first constructs a weighted similarity graph based on a new haplotype similarity measure and derives constraint edges from known HLA gene types. Based on the principle that different HLA gene alleles should have different background haplotypes, the algorithm searches for an optimal labeling of all the haplotypes with unknown HLA gene types such that the total weight among the same HLA gene types is maximized. To deal with ambiguous haplotype solutions, we use a genetic algorithm to select haplotype configurations that tend to maximize the same optimization criterion. 
Our experiments on a previously typed subset of the HapMap data show that the algorithm is highly accurate, achieving an accuracy of 96% for gene HLA-A, 95% for HLA-B, 97% for HLA-C, 84% for HLA-DRB1, 98% for HLA-DQA1 and 97% for HLA-DQB1 in a leave-one-out test. Our algorithm can infer HLA gene types from neighboring SNP genotype data accurately. Compared with a recent approach on the same input data, our algorithm achieved a higher accuracy. The code of our algorithm is available to the public for free upon request to the corresponding authors.
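
The labeling idea in this abstract (assign each untyped haplotype the HLA type whose typed haplotypes it resembles most) can be illustrated with a deliberately simplified sketch. This is not the authors' algorithm: the abstract does not specify their similarity measure, constraint edges, or optimizer. The `similarity` function (fraction of matching SNP positions), the greedy per-haplotype assignment, and the toy haplotypes and allele labels below are all assumptions made for illustration.

```python
from collections import defaultdict


def similarity(h1: str, h2: str) -> float:
    """Hypothetical similarity: fraction of SNP positions where two haplotypes agree."""
    if len(h1) != len(h2) or not h1:
        raise ValueError("haplotypes must be non-empty and of equal length")
    return sum(a == b for a, b in zip(h1, h2)) / len(h1)


def label_unknown(known: dict, unknown: list) -> dict:
    """Greedy stand-in for graph labeling (an assumption, not the published method).

    Each untyped haplotype receives the HLA type with the largest summed
    similarity (edge weight) to the haplotypes already carrying that type.
    """
    labels = {}
    for hap in unknown:
        weight_by_type = defaultdict(float)
        for typed_hap, hla_type in known.items():
            weight_by_type[hla_type] += similarity(hap, typed_hap)
        labels[hap] = max(weight_by_type, key=weight_by_type.get)
    return labels


# Toy data (hypothetical haplotypes and allele labels).
known = {"AACGT": "A*01", "AACGA": "A*01", "TTGCA": "A*02"}
unknown = ["AACGC", "TTGCT"]
labels = label_unknown(known, unknown)
print(labels)
```

Summing weights favors HLA types with many typed haplotypes; a global optimizer over all untyped haplotypes, as the abstract describes, would not necessarily share that bias.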

  10. Single Nucleotide Polymorphism Discovery in Bovine Pituitary Gland Using RNA-Seq Technology

    PubMed Central

    Pareek, Chandra Shekhar; Smoczyński, Rafał; Kadarmideen, Haja N.; Dziuba, Piotr; Błaszczyk, Paweł; Sikora, Marcin; Walendzik, Paulina; Grzybowski, Tomasz; Pierzchała, Mariusz; Horbańczuk, Jarosław; Szostak, Agnieszka; Ogluszka, Magdalena; Zwierzchowski, Lech; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Wąsowicz, Krzysztof; Gelfand, Brian; Feng, Yaping; Kumar, Dibyendu

    2016-01-01

    Examination of the bovine pituitary gland transcriptome by strand-specific RNA-seq allows detection of putative single nucleotide polymorphisms (SNPs) within potential candidate genes (CGs) or QTL regions and helps in understanding the genomic variations that contribute to economic traits. Here we report a breed-specific model to successfully perform the detection of SNPs in the pituitary gland of young growing bulls representing Polish Holstein-Friesian (HF), Polish Red, and Hereford breeds at three developmental ages viz., six months, nine months, and twelve months. A total of 18 bovine pituitary gland polyA transcriptome libraries were prepared and sequenced using the Illumina NextSeq 500 platform. Sequenced FastQ databases of all 18 young bulls were submitted to the NCBI-SRA database with NCBI-SRA accession number SRS1296732. For the investigated young bulls, a total of 1,138,823,098 raw paired-end reads with a length of 156 bases were obtained, resulting in approximately 63 million paired-end reads per library. Breed-wise, a total of 515.38, 215.39, and 408.04 million paired-end reads were obtained for Polish HF, Polish Red, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed 93.04%, 94.39%, and 83.46% of the mapped sequencing reads were properly paired to the Polish HF, Polish Red, and Hereford breeds, respectively. The constructed breed-specific SNP-db of the three cattle breeds yielded 13,775,885 SNPs. On average, 765,326 breed-specific SNPs per young bull were identified. Using two stringent filtering parameters, i.e., a minimum 10 SNP reads per base with an accuracy ≥ 90% and a minimum 10 SNP reads per base with an accuracy = 100%, SNP-db records were trimmed to construct a highly reliable SNP-db. This reduced the constructed raw SNP-db by 95.7% and 96.4%, respectively. Finally, SNP discoveries using RNA-Seq data were validated by the KASP™ SNP genotyping assay. 
The comprehensive analysis of 76 QTLs/CGs with RNA-seq data identified the KCNIP4, CCSER1, DPP6, MAP3K5 and GHR CGs as having the highest numbers of SNP hit loci in all three breeds and developmental ages. However, the CAST CG with more than 100 SNP hits was observed only in the Polish HF and Hereford breeds. These findings are important for the identification and construction of novel tissue-specific and breed-specific SNP-db datasets. By screening putative SNPs against the QTL db and candidate genes for bovine growth and reproduction traits, one can develop genomic selection strategies for these traits. PMID:27606429
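
As a quick consistency check on the read counts in this abstract, the breed-wise totals can be summed and divided by the 18 libraries. The sketch below uses only figures quoted in the abstract; the variable names are illustrative.

```python
# Breed-wise paired-end read totals, in millions, as quoted in the abstract.
breed_reads_million = {
    "Polish HF": 515.38,
    "Polish Red": 215.39,
    "Hereford": 408.04,
}
n_libraries = 18  # number of polyA transcriptome libraries reported

total_million = sum(breed_reads_million.values())
per_library_million = total_million / n_libraries

print(f"total reads: {total_million:.2f} million")
print(f"reads per library: {per_library_million:.1f} million")
```

The sum is about 1,138.81 million reads, i.e. roughly 1.14 billion in total and about 63 million per library, which agrees with the per-library figure given in the abstract.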

  11. Single Nucleotide Polymorphism Discovery in Bovine Pituitary Gland Using RNA-Seq Technology.

    PubMed

    Pareek, Chandra Shekhar; Smoczyński, Rafał; Kadarmideen, Haja N; Dziuba, Piotr; Błaszczyk, Paweł; Sikora, Marcin; Walendzik, Paulina; Grzybowski, Tomasz; Pierzchała, Mariusz; Horbańczuk, Jarosław; Szostak, Agnieszka; Ogluszka, Magdalena; Zwierzchowski, Lech; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Wąsowicz, Krzysztof; Gelfand, Brian; Feng, Yaping; Kumar, Dibyendu

    2016-01-01

    Examination of the bovine pituitary gland transcriptome by strand-specific RNA-seq allows detection of putative single nucleotide polymorphisms (SNPs) within potential candidate genes (CGs) or QTL regions and helps in understanding the genomic variations that contribute to economic traits. Here we report a breed-specific model to successfully perform the detection of SNPs in the pituitary gland of young growing bulls representing Polish Holstein-Friesian (HF), Polish Red, and Hereford breeds at three developmental ages viz., six months, nine months, and twelve months. A total of 18 bovine pituitary gland polyA transcriptome libraries were prepared and sequenced using the Illumina NextSeq 500 platform. Sequenced FastQ databases of all 18 young bulls were submitted to the NCBI-SRA database with NCBI-SRA accession number SRS1296732. For the investigated young bulls, a total of 1,138,823,098 raw paired-end reads with a length of 156 bases were obtained, resulting in approximately 63 million paired-end reads per library. Breed-wise, a total of 515.38, 215.39, and 408.04 million paired-end reads were obtained for Polish HF, Polish Red, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed 93.04%, 94.39%, and 83.46% of the mapped sequencing reads were properly paired to the Polish HF, Polish Red, and Hereford breeds, respectively. The constructed breed-specific SNP-db of the three cattle breeds yielded 13,775,885 SNPs. On average, 765,326 breed-specific SNPs per young bull were identified. Using two stringent filtering parameters, i.e., a minimum 10 SNP reads per base with an accuracy ≥ 90% and a minimum 10 SNP reads per base with an accuracy = 100%, SNP-db records were trimmed to construct a highly reliable SNP-db. This reduced the constructed raw SNP-db by 95.7% and 96.4%, respectively. Finally, SNP discoveries using RNA-Seq data were validated by the KASP™ SNP genotyping assay. 
The comprehensive analysis of 76 QTLs/CGs with RNA-seq data identified the KCNIP4, CCSER1, DPP6, MAP3K5 and GHR CGs as having the highest numbers of SNP hit loci in all three breeds and developmental ages. However, the CAST CG with more than 100 SNP hits was observed only in the Polish HF and Hereford breeds. These findings are important for the identification and construction of novel tissue-specific and breed-specific SNP-db datasets. By screening putative SNPs against the QTL db and candidate genes for bovine growth and reproduction traits, one can develop genomic selection strategies for these traits.

  12. A non-sense mutation in the putative anti-mutator gene ada/alkA of Mycobacterium tuberculosis and M. bovis isolates suggests convergent evolution

    PubMed Central

    Nouvel, Laurent X; Vultos, Tiago Dos; Kassa-Kelembho, Eric; Rauzier, Jean; Gicquel, Brigitte

    2007-01-01

    Background Previous studies have suggested that variations in DNA repair genes of W-Beijing strains may have led to transient mutator phenotypes which in turn may have contributed to host adaptation of this strain family. A single nucleotide polymorphism (SNP) in the DNA repair gene mutT1 was identified in MDR-prone strains from the Central African Republic. A Mycobacterium tuberculosis H37Rv mutant inactivated in two DNA repair genes, namely ada/alkA and ogt, was shown to display a hypermutator phenotype. We then looked for polymorphisms in these genes in Central African Republic (CAR) strains. Results In this study, 55 MDR and 194 non-MDR strains were analyzed. Variations in DNA repair genes ada/alkA and ogt were identified. Among them, by comparison to M. tuberculosis published sequences, we found a non-sense variation in the ada/alkA gene which was also observed in the M. bovis AF2122 strain. SNPs that are present in the regions adjacent to the amber variation are different in the M. bovis and M. tuberculosis strains. Conclusion An amber codon was found in the ada/alkA locus of clustered M. tuberculosis isolates and in M. bovis strain AF2122. This is likely due to convergent evolution because SNP differences between strains are incompatible with horizontal transfer of an entire gene. This suggests that such a variation may confer a selective advantage and be implicated in hypermutator phenotype expression, which in turn contributes to adaptation to environmental changes. PMID:17506895

  13. Association analysis of the SLC22A11 (organic anion transporter 4) and SLC22A12 (urate transporter 1) urate transporter locus with gout in New Zealand case-control sample sets reveals multiple ancestral-specific effects

    PubMed Central

    2013-01-01

    Introduction There is inconsistent association between urate transporters SLC22A11 (organic anion transporter 4 (OAT4)) and SLC22A12 (urate transporter 1 (URAT1)) and risk of gout. New Zealand (NZ) Māori and Pacific Island people have higher serum urate and more severe gout than European people. The aim of this study was to test genetic variation across the SLC22A11/SLC22A12 locus for association with risk of gout in NZ sample sets. Methods A total of 12 single nucleotide polymorphism (SNP) variants in four haplotype blocks were genotyped using TaqMan® and Sequenom MassArray in 1003 gout cases and 1156 controls. All cases had gout according to the 1977 American Rheumatism Association criteria. Association analysis of single markers and haplotypes was performed using PLINK and Stata. Results A haplotype block 1 SNP (rs17299124) (upstream of SLC22A11) was associated with gout in less admixed Polynesian sample sets, but not European Caucasian (odds ratio; OR = 3.38, P = 6.1 × 10⁻⁴; OR = 0.91, P = 0.40, respectively) sample sets. A protective block 1 haplotype caused the rs17299124 association (OR = 0.28, P = 6.0 × 10⁻⁴). Within haplotype block 2 (SLC22A11) we could not replicate previous reports of association of rs2078267 with gout in European Caucasian (OR = 0.98, P = 0.82) sample sets; however, this SNP was associated with gout in Polynesian (OR = 1.51, P = 0.022) sample sets. Within haplotype block 3 (including SLC22A12) analysis of haplotypes revealed a haplotype with trans-ancestral protective effects (OR = 0.80, P = 0.004), and a second haplotype conferring protection in less admixed Polynesian sample sets (OR = 0.63, P = 0.028) but risk in European Caucasian samples (OR = 1.33, P = 0.039). Conclusions Our analysis provides evidence for multiple ancestral-specific effects across the SLC22A11/SLC22A12 locus that presumably influence the activity of OAT4 and URAT1 and risk of gout. 
Further fine mapping of the association signal is needed using trans-ancestral re-sequence data. PMID:24360580

  14. Integrative Analysis of Prognosis Data on Multiple Cancer Subtypes

    PubMed Central

    Liu, Jin; Huang, Jian; Zhang, Yawei; Lan, Qing; Rothman, Nathaniel; Zheng, Tongzhang; Ma, Shuangge

    2014-01-01

    Summary In cancer research, profiling studies have been extensively conducted, searching for genes/SNPs associated with prognosis. Cancer is diverse. Examining the similarity and difference in the genetic basis of multiple subtypes of the same cancer can lead to a better understanding of their connections and distinctions. Classic meta-analysis methods analyze each subtype separately and then compare analysis results across subtypes. Integrative analysis methods, in contrast, analyze the raw data on multiple subtypes simultaneously and can outperform meta-analysis methods. In this study, prognosis data on multiple subtypes of the same cancer are analyzed. An AFT (accelerated failure time) model is adopted to describe survival. The genetic basis of multiple subtypes is described using the heterogeneity model, which allows a gene/SNP to be associated with prognosis of some subtypes but not others. A compound penalization method is developed to identify genes that contain important SNPs associated with prognosis. The proposed method has an intuitive formulation and is realized using an iterative algorithm. Asymptotic properties are rigorously established. Simulation shows that the proposed method has satisfactory performance and outperforms a penalization-based meta-analysis method and a regularized thresholding method. An NHL (non-Hodgkin lymphoma) prognosis study with SNP measurements is analyzed. Genes associated with the three major subtypes, namely DLBCL, FL, and CLL/SLL, are identified. The proposed method identifies genes that are different from alternatives and have important implications and satisfactory prediction performance. PMID:24766212

  15. Global population-specific variation in miRNA associated with cancer risk and clinical biomarkers.

    PubMed

    Rawlings-Goss, Renata A; Campbell, Michael C; Tishkoff, Sarah A

    2014-08-28

    MiRNA expression profiling is being actively investigated as a clinical biomarker and diagnostic tool to detect multiple cancer types and stages as well as other complex diseases. Initial investigations, however, have not comprehensively taken into account genetic variability affecting miRNA expression and/or function in populations of different ethnic backgrounds. Therefore, more complete surveys of miRNA genetic variability are needed to assess global patterns of miRNA variation within and between diverse human populations and their effect on clinically relevant miRNA genes. Genetic variation in 1524 miRNA genes was examined using whole genome sequencing (60x coverage) in a panel of 69 unrelated individuals from 14 global populations, including European, Asian and African populations. We identified 33 previously undescribed miRNA variants, and 31 miRNA containing variants that are globally population-differentiated in frequency between African and non-African populations (PD-miRNA). The top 1% of PD-miRNA were significantly enriched for regulation of genes involved in glucose/insulin metabolism and cell division (p < 10⁻⁷), most significantly the mitosis pathway, which is strongly linked to cancer onset. Overall, we identify 7 PD-miRNAs that are currently implicated as cancer biomarkers or diagnostics: hsa-mir-202, hsa-mir-423, hsa-mir-196a-2, hsa-mir-520h, hsa-mir-647, hsa-mir-943, and hsa-mir-1908. Notably, hsa-mir-202, a potential breast cancer biomarker, was found to show significantly high allele frequency differentiation at SNP rs12355840, which is known to affect miRNA expression levels in vivo and subsequently breast cancer mortality. MiRNA expression profiles represent a promising new category of disease biomarkers. However, population specific genetic variation can affect the prevalence and baseline expression of these miRNAs in diverse populations. 
Consequently, miRNA genetic and expression level variation among ethnic groups may be contributing in part to health disparities observed in multiple forms of cancer, specifically breast cancer, and will be an essential consideration when assessing the utility of miRNA biomarkers for the clinic.

  16. Heme Oxygenase-1 and 2 Common Genetic Variants and Risk for Multiple Sclerosis

    PubMed Central

    Agúndez, José A. G.; García-Martín, Elena; Martínez, Carmen; Benito-León, Julián; Millán-Pascual, Jorge; Díaz-Sánchez, María; Calleja, Patricia; Pisa, Diana; Turpín-Fenoll, Laura; Alonso-Navarro, Hortensia; Pastor, Pau; Ortega-Cubero, Sara; Ayuso-Peralta, Lucía; Torrecillas, Dolores; García-Albea, Esteban; Plaza-Nieto, José Francisco; Jiménez-Jiménez, Félix Javier

    2016-01-01

    Several neurochemical, neuropathological, and experimental data suggest a possible role of oxidative stress in the etiopathogenesis of multiple sclerosis (MS). Heme-oxygenases (HMOX) are an important defensive mechanism against oxidative stress, and HMOX1 is overexpressed in the brain and spinal cord of MS patients and in experimental autoimmune encephalomyelitis (EAE). We analyzed whether common polymorphisms affecting the HMOX1 and HMOX2 genes are related to the risk of developing MS. We analyzed the distribution of genotypes and allelic frequencies of the HMOX1 rs2071746, HMOX1 rs2071747, HMOX2 rs2270363, and HMOX2 rs1051308 SNPs, as well as the presence of copy number variations (CNVs) of these genes in 292 subjects with MS and 533 healthy controls, using TaqMan assays. The frequencies of the HMOX2 rs1051308AA genotype and the HMOX2 rs1051308A and HMOX1 rs2071746A alleles were higher in MS patients than in controls, although only that of the SNP HMOX2 rs1051308 in men remained significant after correction for multiple comparisons. None of the studied polymorphisms was related to the age at disease onset or to the MS phenotype. The present study suggests a weak association between the HMOX2 rs1051308 polymorphism and the risk of developing MS in Spanish Caucasian men and a trend towards association between HMOX1 rs2071746A and MS risk. PMID:26868429

  17. Heme Oxygenase-1 and 2 Common Genetic Variants and Risk for Multiple Sclerosis.

    PubMed

    Agúndez, José A G; García-Martín, Elena; Martínez, Carmen; Benito-León, Julián; Millán-Pascual, Jorge; Díaz-Sánchez, María; Calleja, Patricia; Pisa, Diana; Turpín-Fenoll, Laura; Alonso-Navarro, Hortensia; Pastor, Pau; Ortega-Cubero, Sara; Ayuso-Peralta, Lucía; Torrecillas, Dolores; García-Albea, Esteban; Plaza-Nieto, José Francisco; Jiménez-Jiménez, Félix Javier

    2016-02-12

    Several neurochemical, neuropathological, and experimental data suggest a possible role of oxidative stress in the etiopathogenesis of multiple sclerosis (MS). Heme-oxygenases (HMOX) are an important defensive mechanism against oxidative stress, and HMOX1 is overexpressed in the brain and spinal cord of MS patients and in experimental autoimmune encephalomyelitis (EAE). We analyzed whether common polymorphisms affecting the HMOX1 and HMOX2 genes are related to the risk of developing MS. We analyzed the distribution of genotypes and allelic frequencies of the HMOX1 rs2071746, HMOX1 rs2071747, HMOX2 rs2270363, and HMOX2 rs1051308 SNPs, as well as the presence of copy number variations (CNVs) of these genes in 292 subjects with MS and 533 healthy controls, using TaqMan assays. The frequencies of the HMOX2 rs1051308AA genotype and the HMOX2 rs1051308A and HMOX1 rs2071746A alleles were higher in MS patients than in controls, although only that of the SNP HMOX2 rs1051308 in men remained significant after correction for multiple comparisons. None of the studied polymorphisms was related to the age at disease onset or to the MS phenotype. The present study suggests a weak association between the HMOX2 rs1051308 polymorphism and the risk of developing MS in Spanish Caucasian men and a trend towards association between HMOX1 rs2071746A and MS risk.

  18. Design and validation of a 90K SNP genotyping assay for the Water Buffalo (Bubalus bubalis)

    USDA-ARS's Scientific Manuscript database

    The completion of the human genome sequence in 2001 was a major step forward in knowledge necessary to understand the variations between individuals. For farmed species, genomic information will facilitate the selection of animals optimised to live, and be productive in particular environments. The ...

  19. TILLING for plant breeding.

    PubMed

    Sharp, Peter; Dong, Chongmei

    2014-01-01

    TILLING is widely used in plant functional genomics. Mutagenesis and SNP detection are combined to allow for the isolation of mutations in genes of interest. It can also be used as a plant breeding tool, whereby variation in known or candidate genes of interest to breeding programs is generated. Here we describe a simple low-cost TILLING procedure.

  20. Detecting genotypic variation among the single spore isolates of Pasteuria penetrans population occurring in Florida using SNP-based markers

    USDA-ARS's Scientific Manuscript database

    Pasteuria penetrans is a naturally occurring soil-borne endospore-forming bacterium, which functions as a castrating parasite of plant-parasitic nematodes belonging to the genus Meloidogyne. Pasteuria penetrans is established as an effective biological control agent for control and management o...

  1. Genetic variations in NADPH-CYP450 oxidoreductase in a Czech Slavic cohort

    PubMed Central

    Tomková, Mária; Panda, Satya Prakash; Šeda, Ondřej; Baxová, Alice; Hůlková, Martina; Masters, Bettie Sue Siler; Martásek, Pavel

    2015-01-01

    Background Polymorphisms in the gene encoding the enzyme NADPH–cytochrome P450 oxidoreductase (POR) contribute to inter-individual differences in drug response. Aim To estimate polymorphic allele frequencies of the POR gene in a Czech Slavic population. Materials & Methods The POR gene was analyzed in 322 Czech Slavic individuals from a control cohort by sequencing and HRM analysis. Results Twenty-five SNP genetic variations were identified. Of these variants, 7 were new, unreported SNPs, including two SNPs in the 5′-flanking region (g.4965 C>T and g.4994 G>T), one intronic variant (c.1899 −20C>T), one synonymous SNP (p.20Ala=) and three nonsynonymous SNPs (p.Thr29Ser, p.Pro384Leu and p.Thr529Met). The p.Pro384Leu variant exhibited reduced enzymatic activities compared to wild type. Conclusion New POR variant identification indicates that the number of uncommon variants might be specific for each subpopulation being investigated, particularly germane to the singular role that POR plays in providing reducing equivalents to all CYPs in the endoplasmic reticulum. PMID:25712184

  2. Comparison of Constitutional and Replication Stress-Induced Genome Structural Variation by SNP Array and Mate-Pair Sequencing

    PubMed Central

    Arlt, Martin F.; Ozdemir, Alev Cagla; Birkeland, Shanda R.; Lyons, Robert H.; Glover, Thomas W.; Wilson, Thomas E.

    2011-01-01

    Copy-number variants (CNVs) are a major source of genetic variation in human health and disease. Previous studies have implicated replication stress as a causative factor in CNV formation. However, existing data are technically limited in the quality of comparisons that can be made between human CNVs and experimentally induced variants. Here, we used two high-resolution strategies—single nucleotide polymorphism (SNP) arrays and mate-pair sequencing—to compare CNVs that occur constitutionally to those that arise following aphidicolin-induced DNA replication stress in the same human cells. Although the optimized methods provided complementary information, sequencing was more sensitive to small variants and provided superior structural descriptions. The majority of constitutional and all aphidicolin-induced CNVs appear to be formed via homology-independent mechanisms, while aphidicolin-induced CNVs were of a larger median size than constitutional events even when mate-pair data were considered. Aphidicolin thus appears to stimulate formation of CNVs that closely resemble human pathogenic CNVs and the subset of larger nonhomologous constitutional CNVs. PMID:21212237

  3. Genetic diversity and structure of elite cotton germplasm (Gossypium hirsutum L.) using genome-wide SNP data.

    PubMed

    Ai, XianTao; Liang, YaJun; Wang, JunDuo; Zheng, JuYun; Gong, ZhaoLong; Guo, JiangPing; Li, XueYuan; Qu, YanYing

    2017-10-01

    Cotton (Gossypium spp.) is the most important natural textile fiber crop, and Gossypium hirsutum L. is responsible for 90% of the annual cotton crop in the world. Information on cotton genetic diversity and population structure is essential for new breeding lines. In this study, we analyzed population structure and genetic diversity of 288 elite Gossypium hirsutum cultivar accessions collected from around the world, and especially from China, using genome-wide single nucleotide polymorphism (SNP) markers. The average polymorphism information content (PIC) was 0.25, indicating a relatively low degree of genetic diversity. Population structure analysis revealed extensive admixture and identified three subgroups. Phylogenetic analysis supported the subgroups identified by STRUCTURE. The results from both population structure and phylogenetic analysis were, for the most part, in agreement with pedigree information. Analysis of molecular variance revealed a larger amount of variation was due to diversity within the groups. Establishment of genetic diversity and population structure from this study could be useful for genetic and genomic analysis and systematic utilization of the standing genetic variation in upland cotton.
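
To put the reported average PIC of 0.25 in context, the sketch below computes PIC from allele frequencies using the widely used definition PIC = 1 − Σ pᵢ² − Σ_{i<j} 2 pᵢ² pⱼ². The abstract does not state which formula or software the authors used, so this is an assumption. For a biallelic SNP the maximum under this definition is 0.375, reached at equal allele frequencies.

```python
def pic(freqs):
    """Polymorphism information content from allele frequencies.

    Uses PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2.
    Assumed formula; the abstract does not specify the one used.
    """
    if any(p < 0 for p in freqs) or abs(sum(freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must be non-negative and sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(
        2 * freqs[i] ** 2 * freqs[j] ** 2
        for i in range(len(freqs))
        for j in range(i + 1, len(freqs))
    )
    return 1.0 - homozygosity - cross_term


# Biallelic SNP examples with hypothetical allele frequencies.
print(pic([0.5, 0.5]))  # maximum for a biallelic marker
print(pic([0.8, 0.2]))  # a less informative marker
```

With frequencies of 0.8 and 0.2 the value is about 0.27, so an average of 0.25 across markers sits well below the biallelic ceiling.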

  4. A valveless rotary microfluidic device for multiplex point mutation identification based on ligation-rolling circle amplification.

    PubMed

    Heo, Hyun Young; Chung, Soyi; Kim, Yong Tae; Kim, Do Hyun; Seo, Tae Seok

    2016-04-15

    Genetic variations such as single nucleotide polymorphism (SNP) and point mutations are important biomarkers to monitor disease prognosis and diagnosis. In this study, we developed a novel rotary microfluidic device which can perform multiplex SNP typing on the mutation sites of TP53 genes. The microdevice consists of three glass layers: a channel wafer, a Ti/Pt electrode-patterned resistance temperature detector (RTD) wafer, and a rotary plate in which twelve reaction chambers were fabricated. A series of sample injection, ligation-rolling circle amplification (L-RCA) reaction, and fluorescence detection of the resultant amplicons could be executed by rotating the top rotary plate, identifying five mutation points related to cancer prognosis. The use of the rotary plate eliminates the necessity of microvalves and micropumps to control the microfluidic flow in the channel, simplifying the chip design and chip operation for multiplex SNP detection. The proposed microdevice provides an advanced genetic analysis platform in terms of multiplexity, simplicity, and portability in the fields of biomedical diagnostics.

  5. Analysis of consequences of non-synonymous SNP in feed conversion ratio associated TGF-β receptor type 3 gene in chicken.

    PubMed

    Rasal, Kiran D; Shah, Tejas M; Vaidya, Megha; Jakhesara, Subhash J; Joshi, Chaitanya G

    2015-06-01

    The recent advances in high throughput sequencing technology accelerate possible ways for the study of genome wide variation in several organisms and associated consequences. In the present study, mutations in TGFBR3 showing significant association with FCR trait in chicken during exome sequencing were further analyzed. Out of four SNPs, one nsSNP p.Val451Leu was found in the coding region of TGFBR3. In silico tools such as SnpSift and PANTHER predicted it as deleterious (0.04) and to be tolerated, respectively, while I-Mutant revealed that protein stability decreased. The TGFBR3 I-TASSER model has a C-score of 0.85, which was validated using PROCHECK. Based on MD simulation, mutant protein structure deviated from native with RMSD 0.08 Å due to change in the H-bonding distances of mutant residue. The docking of TGFBR3 with interacting TGFBR2 inferred that mutant required more global energy. Therefore, the present study will provide useful information about functional SNPs that have an impact on FCR traits.

  6. A method for determining haploid and triploid genotypes and their association with vascular phenotypes in Williams syndrome and 7q11.23 duplication syndrome.

    PubMed

    Gregory, Michael D; Kolachana, Bhaskar; Yao, Yin; Nash, Tiffany; Dickinson, Dwight; Eisenberg, Daniel P; Mervis, Carolyn B; Berman, Karen F

    2018-04-04

    Williams syndrome ([WS], 7q11.23 hemideletion) and 7q11.23 duplication syndrome (Dup7) show contrasting syndromic symptoms. However, within each group there is considerable interindividual variability in the degree to which these phenotypes are expressed. Though software exists to identify areas of copy number variation (CNV) from commonly-available SNP-chip data, this software does not provide non-diploid genotypes in CNV regions. Here, we describe a method for identifying haploid and triploid genotypes in CNV regions, and then, as a proof-of-concept for applying this information to explain clinical variability, we test for genotype-phenotype associations. Blood samples for 25 individuals with WS and 13 individuals with Dup7 were genotyped with Illumina-HumanOmni5M SNP-chips. PennCNV and in-house code were used to make genotype calls for each SNP in the 7q11.23 locus. We tested for association between the presence of aortic arteriopathy and genotypes of the remaining (haploid in WS) or duplicated (triploid in Dup7) alleles. Haploid calls in the 7q11.23 region were made for 99.0% of SNPs in the WS group, and triploid calls for 98.8% of SNPs in those with Dup7. The G allele of SNP rs2528795 in the ELN gene was associated with aortic stenosis in WS participants (p < 0.0049) while the A allele of the same SNP was associated with aortic dilation in Dup7. Commonly available SNP-chip information can be used to make haploid and triploid calls in individuals with CNVs and then to relate variability in specific genes to variability in syndromic phenotypes, as demonstrated here using aortic arteriopathy. This work sets the stage for similar genotype-phenotype analyses in CNVs where phenotypes may be more complex and/or where there is less information about genetic mechanisms.
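
The idea of calling haploid and triploid genotypes from SNP-chip data can be illustrated with B-allele frequency (BAF), a standard per-SNP output of Illumina arrays. With a known copy number n, the expected BAF for k copies of the B allele is k/n, so a call can be made by picking the nearest expected cluster. The sketch below is an assumption-laden illustration only: it is not PennCNV's interface and not the authors' in-house code, and `call_genotype` is a hypothetical helper.

```python
def call_genotype(baf: float, copies: int) -> str:
    """Nearest-cluster genotype call from B-allele frequency (hypothetical helper).

    For `copies` chromosomes, the expected BAF with k B alleles is k / copies;
    the call is the k whose expected BAF is closest to the observed value.
    """
    if copies < 1:
        raise ValueError("copies must be at least 1")
    if not 0.0 <= baf <= 1.0:
        raise ValueError("BAF must lie in [0, 1]")
    k = min(range(copies + 1), key=lambda n: abs(baf - n / copies))
    return "A" * (copies - k) + "B" * k


# Hemideletion region (one copy): calls are "A" or "B".
print(call_genotype(0.02, copies=1))
# Duplication region (three copies): calls are AAA, AAB, ABB or BBB.
print(call_genotype(0.35, copies=3))
print(call_genotype(0.70, copies=3))
```

A production method would also need to handle noisy intensities and no-calls, for example by rejecting SNPs whose BAF falls too far from every expected cluster.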

  7. Contrasting association of a non-synonymous leptin receptor gene polymorphism with Wegener's granulomatosis and Churg-Strauss syndrome.

    PubMed

    Wieczorek, Stefan; Holle, Julia U; Bremer, Jan P; Wibisono, David; Moosig, Frank; Fricke, Harald; Assmann, Gunter; Harper, Lorraine; Arning, Larissa; Gross, Wolfgang L; Epplen, Joerg T

    2010-05-01

    There is evidence that the leptin/ghrelin system is involved in T-cell regulation and plays a role in (auto)immune disorders such as SLE, RA and ANCA-associated vasculitides (AAVs). Here, we evaluate the genetic background of this system in WG. We screened variations in the genes encoding leptin, ghrelin and their receptors, the leptin receptor (LEPR) and the growth hormone secretagogue receptor (GHSR). Three single nucleotide polymorphisms (SNPs) in each gene region were analysed in 460 German WG cases and 878 ethnically matched healthy controls. A three-SNP haplotype of GHSR was significantly associated with WG [P = 0.0067; corrected P-value (P(c)) = 0.026; odds ratio (OR) = 1.30; 95% CI 1.08, 1.57], as was one non-synonymous SNP in LEPR (Lys656Asn, P = 0.0034; P(c) = 0.013; OR = 0.72; 95% CI 0.58, 0.90). These four SNPs were re-analysed in independent cohorts of 226 German WG cases and 519 controls. While the GHSR association was not confirmed, allele frequencies of the LEPR SNP were virtually identical to those from the initial cohorts. Analysis of this SNP in the combined WG and control panels revealed a significant association of the LEPR 656Lys allele with WG (P = 0.00032; P(c) = 0.0013; OR = 0.72; 95% CI 0.60, 0.86). Remarkably, the Lys656Asn SNP showed contrasting allele distribution in two cohorts of 108 and 88 German cases diagnosed with Churg-Strauss syndrome (CSS, combined P = 0.0067; OR = 1.41; 95% CI 1.10, 1.81), whereas identical allele frequencies were revealed when comparing British WG and microscopic polyangiitis cases. While GHSR has to be further evaluated, these data provide profound evidence for an association of the LEPR Lys656Asn SNP with AAV, resulting in opposing effects in WG and CSS.
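
The corrected P-values (P(c)) in this abstract are each roughly four times the corresponding raw P-value, which would be consistent with a Bonferroni-style correction over four tests. The abstract does not state the correction method, so the factor of 4 in the sketch below is an inference from the reported numbers, not a documented parameter.

```python
def bonferroni(p: float, n_tests: int) -> float:
    """Bonferroni-corrected P-value, capped at 1."""
    if not 0.0 <= p <= 1.0 or n_tests < 1:
        raise ValueError("need 0 <= p <= 1 and n_tests >= 1")
    return min(1.0, p * n_tests)


ASSUMED_N_TESTS = 4  # inferred from the ratio P(c) / P; not stated in the abstract

# (raw P, reported corrected P) pairs quoted in the abstract.
reported = {
    "GHSR three-SNP haplotype": (0.0067, 0.026),
    "LEPR Lys656Asn, initial cohort": (0.0034, 0.013),
    "LEPR 656Lys, combined panels": (0.00032, 0.0013),
}

for name, (p_raw, p_corrected) in reported.items():
    estimate = bonferroni(p_raw, ASSUMED_N_TESTS)
    print(f"{name}: 4 x P = {estimate:.4g}, reported P(c) = {p_corrected}")
```

Small differences between the product and the reported value (for example for the GHSR haplotype) presumably reflect rounding of the published raw P-values.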

  8. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).

    PubMed

    Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair

    2014-02-06

    Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. 
    This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.

  9. Genome-wide association study identified three major QTL for carcass weight including the PLAG1-CHCHD7 QTN for stature in Japanese Black cattle

    PubMed Central

    2012-01-01

    Background Significant quantitative trait loci (QTL) for carcass weight were previously mapped on several chromosomes in Japanese Black half-sib families. Two QTL, CW-1 and CW-2, were narrowed down to 1.1-Mb and 591-kb regions, respectively. Recent advances in genomic tools allowed us to perform a genome-wide association study (GWAS) in cattle to detect associations in a general population and estimate their effect size. Here, we performed a GWAS for carcass weight using 1156 Japanese Black steers. Results Bonferroni-corrected genome-wide significant associations were detected in three chromosomal regions on bovine chromosomes (BTA) 6, 8, and 14. The associated single nucleotide polymorphisms (SNP) on BTA 6 were in linkage disequilibrium with the SNP encoding NCAPG Ile442Met, which was previously identified as a candidate quantitative trait nucleotide for CW-2. In contrast, the most highly associated SNP on BTA 14 was located 2.3-Mb centromeric from the previously identified CW-1 region. Linkage disequilibrium mapping led to a revision of the CW-1 region within a 0.9-Mb interval around the associated SNP, and targeted resequencing followed by association analysis highlighted the quantitative trait nucleotides for bovine stature in the PLAG1-CHCHD7 intergenic region. The association on BTA 8 was accounted for by two SNP on the BovineSNP50 BeadChip and corresponded to CW-3, which was simultaneously detected by linkage analyses using half-sib families. The allele substitution effects of CW-1, CW-2, and CW-3 were 28.4, 35.3, and 35.0 kg per allele, respectively. Conclusion The GWAS revealed the genetic architecture underlying carcass weight variation in Japanese Black cattle in which three major QTL accounted for approximately one-third of the genetic variance. PMID:22607022
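
    The Bonferroni correction mentioned in the abstract above can be illustrated with a minimal sketch. The SNP names and p-values below are hypothetical and are not taken from the study; the function names are illustrative only. The idea is simply to divide the family-wise error rate by the number of tests and keep the markers whose p-value falls at or below that threshold.

```python
from typing import Dict, List


def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test threshold that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def significant_snps(p_values: Dict[str, float], alpha: float = 0.05) -> List[str]:
    """Return the names of SNPs whose p-value passes the Bonferroni threshold."""
    threshold = bonferroni_threshold(alpha, len(p_values))
    return sorted(name for name, p in p_values.items() if p <= threshold)


# Hypothetical association p-values (not data from the study).
example_p_values = {"snp_a": 1e-9, "snp_b": 0.004, "snp_c": 0.02, "snp_d": 0.6}

if __name__ == "__main__":
    print(bonferroni_threshold(0.05, len(example_p_values)))
    print(significant_snps(example_p_values))
```

    With tens of thousands of markers on a genotyping array, the same division gives a very small per-SNP threshold, which is why only strong associations survive the correction.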

  10. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)

    PubMed Central

    2014-01-01

    Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. 
    This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230
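
    The identity-by-state (IBS) comparison underlying the clustering analysis mentioned above can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: genotypes are assumed to be coded as 0, 1 or 2 copies of a reference allele, missing calls are assumed to be None, and the function name is hypothetical.

```python
from typing import List, Optional


def ibs_similarity(g1: List[Optional[int]], g2: List[Optional[int]]) -> float:
    """Mean proportion of alleles shared identical-by-state between two individuals.

    Genotypes are coded 0/1/2 (copies of one allele); None marks a missing call.
    At each SNP the pair shares 2 - |a - b| alleles out of a possible 2.
    """
    if len(g1) != len(g2):
        raise ValueError("genotype vectors must have the same length")
    shared = [2 - abs(a - b) for a, b in zip(g1, g2) if a is not None and b is not None]
    if not shared:
        raise ValueError("no SNPs with calls in both individuals")
    return sum(shared) / (2 * len(shared))


if __name__ == "__main__":
    # Hypothetical genotypes for two fish at four SNPs.
    fish_a = [0, 1, 2, 2]
    fish_b = [0, 1, 0, 1]
    print(ibs_similarity(fish_a, fish_b))
```

    A matrix of such pairwise values (or one minus them, as distances) is the usual input to a clustering or multidimensional scaling step.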

  11. Y chromosomal haplotype characteristics of domestic sheep (Ovis aries) in China.

    PubMed

    Wang, Yutao; Xu, Lei; Yan, Wei; Li, Shaobin; Wang, Jiqing; Liu, Xiu; Hu, Jiang; Luo, Yuzhu

    2015-07-10

    Investigations of the variation present in the male-specific region of the Y chromosome provide strong information for understanding the origin and evolution of domestic sheep. One SNP, OY1 (g.88A>G), in the upstream region of the SRY gene and the microsatellite locus SRYM18 on the ovine Y chromosome were analyzed in 145 samples collected from eleven breeds in China. SNP OY1 was analyzed using the PCR-SSCP method and sequencing. Two different PCR-SSCP patterns were observed, representing two specific sequences, and sequence analysis confirmed SNP OY1 (g.88A>G); the A-OY1 variant was the most frequent (82.8%). Sequencing of the SRYM18 region revealed one novel size fragment (A2) with different repetitive units. Seven haplotypes (H4, H5, H6, H7, H8, H9 and H12) and two novel haplotypes (Ha and Hb) were established using combined genotype analysis. H6 showed the highest frequency (43.4%) across all breeds, and H8 the second highest (24.1%). Ha was found in only one breed (Tan), while Hb was present in three breeds (Gansu alpine, White Suffolk and Duolang). Our findings reveal one novel allele in the SRYM18 region and two novel male haplotypes of domestic sheep in China. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Prenatal Diagnosis of DNA Copy Number Variations by Genomic Single-Nucleotide Polymorphism Array in Fetuses with Congenital Heart Defects.

    PubMed

    Tang, Shaohua; Lv, Jiaojiao; Chen, Xiangnan; Bai, Lili; Li, Huanzheng; Chen, Chong; Wang, Ping; Xu, Xueqin; Lu, Jianxin

    2016-01-01

    To evaluate the usefulness of single-nucleotide polymorphism (SNP) array for prenatal genetic diagnosis of congenital heart defect (CHD), we used this approach to detect clinically significant copy number variants (CNVs) in fetuses with CHDs. A HumanCytoSNP-12 array was used to analyze genomic samples obtained from 39 fetuses that exhibited cardiovascular abnormalities on ultrasound and had a normal karyotype. The relationship between CNVs and CHDs was identified by using genotype-phenotype comparisons and searching of chromosomal databases. All clinically significant CNVs were confirmed by real-time PCR. CNVs were detected in 38/39 (97.4%) fetuses: variants of unknown significance were detected in 2/39 (5.1%), and clinically significant CNVs were identified in 7/39 (17.9%). In 3 of the 7 fetuses with clinically significant CNVs, 3 rare and previously undescribed CNVs were detected, and these CNVs encompassed the CHD candidate genes FLNA (Xq28 dup), BCOR (Xp11.4 dup), and RBL2 (16q12.2 del). Compared with conventional cytogenetic genomics, SNP array analysis provides significantly improved detection of submicroscopic genomic aberrations in pregnancies with CHDs. Based on these results, we propose that the genomic SNP array is an effective method that could be used in prenatal diagnostic testing to assist genetic counseling for pregnancies with CHDs. © 2015 S. Karger AG, Basel.

  13. Accuracy of Assignment of Atlantic Salmon (Salmo salar L.) to Rivers and Regions in Scotland and Northeast England Based on Single Nucleotide Polymorphism (SNP) Markers

    PubMed Central

    Gilbey, John; Cauwelier, Eef; Coulson, Mark W.; Stradmeyer, Lee; Sampayo, James N.; Armstrong, Anja; Verspoor, Eric; Corrigan, Laura; Shelley, Jonathan; Middlemas, Stuart

    2016-01-01

    Understanding the habitat use patterns of migratory fish, such as Atlantic salmon (Salmo salar L.), and the natural and anthropogenic impacts on them, is aided by the ability to identify individuals to their stock of origin. Presented here are the results of an analysis of informative single nucleotide polymorphic (SNP) markers for detecting genetic structuring in Atlantic salmon in Scotland and NE England and their ability to allow accurate genetic stock identification. 3,787 fish from 147 sites covering 27 rivers were screened at 5,568 SNP markers. In order to identify a cost-effective subset of SNPs, they were ranked according to their ability to differentiate between fish from different rivers. A panel of 288 SNPs was used to examine both individual assignments and mixed stock fisheries and eighteen assignment units were defined. The results improved greatly on previously available methods and, for the first time, fish caught in the marine environment can be confidently assigned to geographically coherent units within Scotland and NE England, including individual rivers. As such, this SNP panel has the potential to aid understanding of the various influences acting upon Atlantic salmon on their marine migrations, be they natural environmental variations and/or anthropogenic impacts, such as mixed stock fisheries and interactions with marine power generation installations. PMID:27723810

  14. Genetic alterations within TLR genes in development of Toxoplasma gondii infection among Polish pregnant women.

    PubMed

    Wujcicka, Wioletta; Wilczyński, Jan; Nowakowska, Dorota

    2017-09-01

    The research was conducted to evaluate the role of genotypes, haplotypes and multiple-SNP variants in the range of TLR2, TLR4 and TLR9 single nucleotide polymorphisms (SNPs) in the development of Toxoplasma gondii infection among Polish pregnant women. The study was performed for 116 Polish pregnant women, including 51 patients infected with T. gondii, and 65 age-matched control pregnant individuals. Genotypes in TLR2 2258 G>A, TLR4 896 A>G, TLR4 1196 C>T and TLR9 2848 G>A SNPs were determined by self-designed, nested PCR-RFLP assays. Randomly selected PCR products, representative of distinct genotypes in the studied polymorphisms, were confirmed by sequencing. All the genotypes were tested for Hardy-Weinberg (H-W) equilibrium and TLR4 variants were tested for linkage disequilibrium. Relationships were assessed between alleles, genotypes, haplotypes or multiple-SNP variants in TLR polymorphisms and the occurrence of T. gondii infection in pregnant women, using a logistic regression model. All the analyzed genotypes preserved the H-W equilibrium among the studied groups of patients (P>0.050). Similar distributions of distinct alleles and individual genotypes in TLR SNPs, as well as of haplotypes in TLR4 polymorphisms, were observed in T. gondii infected and control uninfected pregnant women. However, the GACG multiple-SNP variant, within the range of all the four studied polymorphisms, was correlated with a decreased risk of the parasitic infection (OR 0.52, 95% CI 0.28-0.97; P≤0.050). The polymorphisms, located within TLR2, TLR4 and TLR9 genes, may be involved together in the occurrence of T. gondii infection among Polish pregnant women. Copyright © 2017 Medical University of Bialystok. Published by Elsevier B.V. All rights reserved.
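
    A Hardy-Weinberg equilibrium check of the kind referred to above can be sketched with a chi-square goodness-of-fit test. This is a generic textbook version, assuming a biallelic SNP and one degree of freedom; the abstract does not say which test the authors used, and the genotype counts below are hypothetical.

```python
import math
from typing import Tuple


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> Tuple[float, float]:
    """Chi-square test of Hardy-Weinberg proportions for a biallelic locus.

    Returns (chi2, p_value), with the p-value taken from a chi-square
    distribution with 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype is required")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele B
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of chi-square with 1 df: erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


if __name__ == "__main__":
    # Hypothetical genotype counts (AA, AB, BB), not data from the study.
    print(hwe_chi_square(25, 50, 25))
    print(hwe_chi_square(50, 0, 50))
```

    A p-value above the chosen cut-off (0.050 in the abstract) is read as no evidence of departure from equilibrium. For small samples or rare alleles an exact test is often preferred over the chi-square approximation.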

  15. Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat

    PubMed Central

    Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua

    2015-01-01

    With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat (Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that are linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271

  16. Allelic variation in PtoPsbW associated with photosynthesis, growth, and wood properties in Populus tomentosa.

    PubMed

    Wang, Longxin; Wang, Bowen; Du, Qingzhang; Chen, Jinhui; Tian, Jiaxing; Yang, Xiaohui; Zhang, Deqiang

    2017-02-01

    Photosynthesis is one of the most important reactions on earth. PsbW, a nuclear-encoded subunit of photosystem II (PSII), stabilizes PSII structure and plays an important role in photosynthesis. Here, we used candidate gene-based linkage disequilibrium (LD) mapping to detect significant associations between allelic variations of PtoPsbW and traits related to photosynthesis, growth, and wood properties in Populus tomentosa. PtoPsbW showed the highest expression in leaves and it increased during the development of these leaves, suggesting that PtoPsbW may play an important role in plant growth and development. Analysis of nucleotide diversity and LD revealed that PtoPsbW has low single-nucleotide polymorphism (SNP) diversity (π_tot = 0.0048 and θ_w = 0.0050) and relatively low average value of LD (0.1500), indicating that PtoPsbW is conserved due to its indispensable function. Using single-SNP associations in an association population of 435 individuals, we identified five significant associations at the threshold of P ≤ 0.05, explaining 3.28-15.98% of the phenotypic variation. Haplotype-based association analyses indicated that 13 haplotypes (P ≤ 0.05) from six blocks were associated with photosynthesis, growth, and wood properties. Our work shows that identifying allelic variation and LD can help to decipher the genetic basis of photosynthesis and could potentially be applied for molecular marker-assisted selection in Populus.
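
    The two diversity statistics quoted above, nucleotide diversity (π) and Watterson's θ, can be computed from aligned sequences as in the sketch below. It uses the standard per-site definitions and assumes equal-length, gap-free sequences; the toy alignment is hypothetical and the function names are illustrative, so this is not a reconstruction of the authors' analysis.

```python
from itertools import combinations
from typing import Sequence


def _check_alignment(seqs: Sequence[str]) -> int:
    """Validate the alignment and return its length in sites."""
    if len(seqs) < 2:
        raise ValueError("at least two sequences are required")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and of equal length")
    return length


def nucleotide_diversity(seqs: Sequence[str]) -> float:
    """pi: average number of pairwise differences per site."""
    length = _check_alignment(seqs)
    n = len(seqs)
    total_diffs = sum(
        sum(a != b for a, b in zip(s1, s2)) for s1, s2 in combinations(seqs, 2)
    )
    n_pairs = n * (n - 1) // 2
    return total_diffs / (n_pairs * length)


def watterson_theta(seqs: Sequence[str]) -> float:
    """theta_w per site: segregating sites divided by a_n = sum(1/i), i = 1..n-1."""
    length = _check_alignment(seqs)
    n = len(seqs)
    segregating = sum(1 for column in zip(*seqs) if len(set(column)) > 1)
    a_n = sum(1.0 / i for i in range(1, n))
    return segregating / (a_n * length)


if __name__ == "__main__":
    # Hypothetical toy alignment of four sequences, ten sites each.
    toy = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC", "ACGTACGTAC"]
    print(nucleotide_diversity(toy))
    print(watterson_theta(toy))
```

    Similar values of π and θ_w, as reported in the abstract, are what a neutral model at equilibrium would predict; large gaps between the two are what tests such as Tajima's D examine.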

  17. Relationships among and variation within rare breeds of swine.

    PubMed

    Roberts, K S; Lamberson, W R

    2015-08-01

    Extinction of rare breeds of livestock threatens to reduce the total genetic variation available for selection in the face of the changing environment and new diseases. Swine breeds facing extinction typically share characteristics such as small size, slow growth rate, and high fat percentage, which limit them from contributing to commercial production. Compounding the risk of loss of variation is the lack of pedigree information for many rare breeds due to inadequate herd books, which increases the chance that producers are breeding closely related individuals. By making genetic data available, producers can make more educated breeding decisions to preserve genetic diversity in future generations, and conservation organizations can prioritize investments in breed preservation. The objective of this study was to characterize genetic variation within and among breeds of swine and prioritize heritage breeds for preservation. Genotypes from the Illumina PorcineSNP60 BeadChip (GeneSeek, Lincoln, NE) were obtained for Guinea, Ossabaw Island, Red Wattle, American Saddleback, Mulefoot, British Saddleback, Duroc, Landrace, Large White, Pietrain, and Tamworth pigs. A whole-genome analysis toolset was used to construct a genomic relationship matrix and to calculate inbreeding coefficients for the animals within each breed. Relatedness and average inbreeding coefficient differed among breeds, and pigs from rare breeds were generally more closely related and more inbred (P < 0.05). A multidimensional scaling diagram was constructed based on the SNP genotypes. Animals within breeds clustered tightly together except for 2 Guinea pigs. Tamworth, Duroc, and Mulefoot tended to not cluster with the other 7 breeds.

  18. Allelic Variation in TAS2R Bitter Receptor Genes Associates with Variation in Sensations from and Ingestive Behaviors toward Common Bitter Beverages in Adults

    PubMed Central

    Hayes, John E.; Wallace, Margaret R.; Knopik, Valerie S.; Herbstman, Deborah M.; Bartoshuk, Linda M.

    2011-01-01

    The 25 human bitter receptors and their respective genes (TAS2Rs) contain unusually high levels of allelic variation, which may influence response to bitter compounds in the food supply. Phenotypes based on the perceived bitterness of single bitter compounds were first linked to food preference over 50 years ago. The most studied phenotype is propylthiouracil bitterness, which is mediated primarily by the TAS2R38 gene and possibly others. In a laboratory-based study, we tested for associations between TAS2R variants and sensations, liking, or intake of bitter beverages among healthy adults who were primarily of European ancestry. A haploblock across TAS2R3, TAS2R4, and TAS2R5 explained some variability in the bitterness of espresso coffee. For grapefruit juice, variation at a TAS2R19 single nucleotide polymorphism (SNP) was associated with increased bitterness and decreased liking. An association between a TAS2R16 SNP and alcohol intake was identified, and the putative TAS2R38–alcohol relationship was confirmed, although these polymorphisms did not explain sensory or hedonic responses to sampled scotch whisky. In summary, TAS2R polymorphisms appear to influence the sensations, liking, or intake of common and nutritionally significant beverages. Studying perceptual and behavioral differences in vivo using real foods and beverages may potentially identify polymorphisms related to dietary behavior even in the absence of known ligands. PMID:21163912

  19. Allelic variation in TAS2R bitter receptor genes associates with variation in sensations from and ingestive behaviors toward common bitter beverages in adults.

    PubMed

    Hayes, John E; Wallace, Margaret R; Knopik, Valerie S; Herbstman, Deborah M; Bartoshuk, Linda M; Duffy, Valerie B

    2011-03-01

    The 25 human bitter receptors and their respective genes (TAS2Rs) contain unusually high levels of allelic variation, which may influence response to bitter compounds in the food supply. Phenotypes based on the perceived bitterness of single bitter compounds were first linked to food preference over 50 years ago. The most studied phenotype is propylthiouracil bitterness, which is mediated primarily by the TAS2R38 gene and possibly others. In a laboratory-based study, we tested for associations between TAS2R variants and sensations, liking, or intake of bitter beverages among healthy adults who were primarily of European ancestry. A haploblock across TAS2R3, TAS2R4, and TAS2R5 explained some variability in the bitterness of espresso coffee. For grapefruit juice, variation at a TAS2R19 single nucleotide polymorphism (SNP) was associated with increased bitterness and decreased liking. An association between a TAS2R16 SNP and alcohol intake was identified, and the putative TAS2R38-alcohol relationship was confirmed, although these polymorphisms did not explain sensory or hedonic responses to sampled scotch whisky. In summary, TAS2R polymorphisms appear to influence the sensations, liking, or intake of common and nutritionally significant beverages. Studying perceptual and behavioral differences in vivo using real foods and beverages may potentially identify polymorphisms related to dietary behavior even in the absence of known ligands.

  20. Southeast Asian origins of five Hill Tribe populations and correlation of genetic to linguistic relationships inferred with genome-wide SNP data

    PubMed Central

    Listman, JB; Malison, RT; Sanichwankul, K; Ittiwut, C; Mutirangura, A; Gelernter, J

    2010-01-01

    In Thailand, the term Hill Tribe is used to describe populations whose members traditionally practice slash and burn agriculture and reside in the mountains. These tribes are thought to have migrated throughout Asia for up to 5,000 years, including migrations through Southern China and/or Southeast Asia. There have been continuous migrations southward from China into Thailand for approximately the past thousand years and the present geographic range of any given tribe straddles multiple political borders. As none of these populations have autochthonous scripts, written histories have, until recently, been externally produced. Northern Asian, Tibetan, and Siberian origins of Hill Tribes have been proposed. All purport endogamy and have non-mutually intelligible languages. In order to test hypotheses regarding the geographic origins of these populations, relatedness and migrations among them and neighboring populations, and whether their genetic relationships correspond with their linguistic relationships, we analyzed 2445 genome-wide SNP markers in 118 individuals from five Thai Hill Tribe populations (Akha, Hmong, Karen, Lahu, and Lisu), 90 individuals from majority Thai populations, and 826 individuals from Asian and Oceanean HGDP and HapMap populations using a Bayesian clustering method. Considering these results within the context of results of recent large-scale studies of Asian geographic genetic variation allows us to infer a shared Southeast Asian origin of these five Hill Tribe populations as well as ancestry components that distinguish among them seen in successive levels of clustering. In addition, the inferred level of shared ancestry among the Hill Tribes corresponds well to relationships among their languages. PMID:20979205

  1. Southeast Asian origins of five Hill Tribe populations and correlation of genetic to linguistic relationships inferred with genome-wide SNP data.

    PubMed

    Listman, J B; Malison, R T; Sanichwankul, K; Ittiwut, C; Mutirangura, A; Gelernter, J

    2011-02-01

    In Thailand, the term Hill Tribe is used to describe populations whose members traditionally practice slash and burn agriculture and reside in the mountains. These tribes are thought to have migrated throughout Asia for up to 5,000 years, including migrations through Southern China and/or Southeast Asia. There have been continuous migrations southward from China into Thailand for approximately the past thousand years and the present geographic range of any given tribe straddles multiple political borders. As none of these populations have autochthonous scripts, written histories have, until recently, been externally produced. Northern Asian, Tibetan, and Siberian origins of Hill Tribes have been proposed. All purport endogamy and have nonmutually intelligible languages. To test hypotheses regarding the geographic origins of these populations, relatedness and migrations among them and neighboring populations, and whether their genetic relationships correspond with their linguistic relationships, we analyzed 2,445 genome-wide SNP markers in 118 individuals from five Thai Hill Tribe populations (Akha, Hmong, Karen, Lahu, and Lisu), 90 individuals from majority Thai populations, and 826 individuals from Asian and Oceanean HGDP and HapMap populations using a Bayesian clustering method. Considering these results within the context of results of recent large-scale studies of Asian geographic genetic variation allows us to infer a shared Southeast Asian origin of these five Hill Tribe populations as well as ancestry components that distinguish among them seen in successive levels of clustering. In addition, the inferred level of shared ancestry among the Hill Tribes corresponds well to relationships among their languages. 2010 Wiley-Liss, Inc.

  2. Multilocus nuclear DNA markers reveal population structure and demography of Anopheles minimus.

    PubMed

    Dixit, Jyotsana; Arunyawat, Uraiwan; Huong, Ngo Thi; Das, Aparup

    2014-11-01

    Utilization of multiple putatively neutral DNA markers for inferring the evolutionary history of species populations is considered to be the most robust approach. Molecular population genetic studies have been conducted in many species of the genus Anopheles, but studies based on single nucleotide polymorphism (SNP) data are still very scarce. Anopheles minimus is one of the principal malaria vectors of Southeast (SE) Asia, including Northeastern (NE) India. Although mitochondrial genetic variation data have been used in population genetic studies to infer the phylogeography of the SE Asian populations of this species, limited information on the population structure and demography of Indian An. minimus is available. We have developed a multilocus nuclear genetic approach with SNP markers located on the X chromosome of An. minimus in eight Indian and two SE Asian population samples (121 individual mosquitoes in total) to infer population history and test several hypotheses on the phylogeography of this species. While the Thai population sample of An. minimus presented the highest nucleotide diversity, the majority of the Indian samples were also fairly diverse. In general, An. minimus populations were moderately substructured in the distribution range covering SE Asia and NE India, largely falling under three distinct genetic clusters. Moreover, demographic expansion events could be detected in the majority of the presently studied populations of An. minimus. Additional DNA sequencing of the mitochondrial COII region in a subset of the samples (40 individual mosquitoes) corroborated the existing hypothesis of Indian An. minimus falling under the earlier reported mitochondrial lineage B. © 2014 John Wiley & Sons Ltd.

  3. A Conserved Role for Syndecan Family Members in the Regulation of Whole-Body Energy Metabolism

    PubMed Central

    De Luca, Maria; Klimentidis, Yann C.; Casazza, Krista; Moses Chambers, Michelle; Cho, Ruth; Harbison, Susan T.; Jumbo-Lucioni, Patricia; Zhang, Shaoyan; Leips, Jeff; Fernandez, Jose R.

    2010-01-01

    Syndecans are a family of type-I transmembrane proteins that are involved in cell-matrix adhesion, migration, neuronal development, and inflammation. Previous quantitative genetic studies pinpointed Drosophila Syndecan (dSdc) as a positional candidate gene affecting variation in fat storage between two Drosophila melanogaster strains. Here, we first used quantitative complementation tests with dSdc mutants to confirm that natural variation in this gene affects variability in Drosophila fat storage. Next, we examined the effects of a viable dSdc mutant on Drosophila whole-body energy metabolism and associated traits. We observed that young flies homozygous for the dSdc mutation had reduced fat storage and slept longer than homozygous wild-type flies. They also displayed significantly reduced metabolic rate, lower expression of spargel (the Drosophila homologue of PGC-1), and reduced mitochondrial respiration. Compared to control flies, dSdc mutants had lower expression of brain insulin-like peptides, were less fecund, more sensitive to starvation, and had reduced life span. Finally, we tested for association between single nucleotide polymorphisms (SNPs) in the human SDC4 gene and variation in body composition, metabolism, glucose homeostasis, and sleep traits in a cohort of healthy early pubertal children. We found that SNP rs4599 was significantly associated with resting energy expenditure (P = 0.001 after Bonferroni correction) and nominally associated with fasting glucose levels (P = 0.01) and sleep duration (P = 0.044). On average, children homozygous for the minor allele had lower levels of glucose, higher resting energy expenditure, and slept shorter than children homozygous for the common allele. We also observed that SNP rs1981429 was nominally associated with lean tissue mass (P = 0.035) and intra-abdominal fat (P = 0.049), and SNP rs2267871 with insulin sensitivity (P = 0.037). 
    Collectively, our results in Drosophila and humans argue that syndecan family members play a key role in the regulation of body metabolism. PMID:20585652

  4. A conserved role for syndecan family members in the regulation of whole-body energy metabolism.

    PubMed

    De Luca, Maria; Klimentidis, Yann C; Casazza, Krista; Chambers, Michelle Moses; Cho, Ruth; Harbison, Susan T; Jumbo-Lucioni, Patricia; Zhang, Shaoyan; Leips, Jeff; Fernandez, Jose R

    2010-06-23

    Syndecans are a family of type-I transmembrane proteins that are involved in cell-matrix adhesion, migration, neuronal development, and inflammation. Previous quantitative genetic studies pinpointed Drosophila Syndecan (dSdc) as a positional candidate gene affecting variation in fat storage between two Drosophila melanogaster strains. Here, we first used quantitative complementation tests with dSdc mutants to confirm that natural variation in this gene affects variability in Drosophila fat storage. Next, we examined the effects of a viable dSdc mutant on Drosophila whole-body energy metabolism and associated traits. We observed that young flies homozygous for the dSdc mutation had reduced fat storage and slept longer than homozygous wild-type flies. They also displayed significantly reduced metabolic rate, lower expression of spargel (the Drosophila homologue of PGC-1), and reduced mitochondrial respiration. Compared to control flies, dSdc mutants had lower expression of brain insulin-like peptides, were less fecund, more sensitive to starvation, and had reduced life span. Finally, we tested for association between single nucleotide polymorphisms (SNPs) in the human SDC4 gene and variation in body composition, metabolism, glucose homeostasis, and sleep traits in a cohort of healthy early pubertal children. We found that SNP rs4599 was significantly associated with resting energy expenditure (P = 0.001 after Bonferroni correction) and nominally associated with fasting glucose levels (P = 0.01) and sleep duration (P = 0.044). On average, children homozygous for the minor allele had lower levels of glucose, higher resting energy expenditure, and slept shorter than children homozygous for the common allele. We also observed that SNP rs1981429 was nominally associated with lean tissue mass (P = 0.035) and intra-abdominal fat (P = 0.049), and SNP rs2267871 with insulin sensitivity (P = 0.037). 
    Collectively, our results in Drosophila and humans argue that syndecan family members play a key role in the regulation of body metabolism.

  5. WDR36 and P53 Gene Variants and Susceptibility to Primary Open-Angle Glaucoma: Analysis of Gene-Gene Interactions

    PubMed Central

    Blanco-Marchite, Cristina; Sánchez-Sánchez, Francisco; López-Garrido, María-Pilar; Iñigez-de-Onzoño, Mercedes; López-Martínez, Francisco; López-Sánchez, Enrique; Alvarez, Lydia; Rodríguez-Calvo, Pedro-Pablo; Méndez-Hernández, Carmen; Fernández-Vega, Luis; García-Sánchez, Julián; Coca-Prados, Miguel; García-Feijoo, Julián

    2011-01-01

    Purpose. To investigate the role of WDR36 and P53 sequence variations in POAG susceptibility. Methods. The authors performed a case-control genetic association study in 268 unrelated Spanish patients (POAG1) and 380 control subjects matched for sex, age, and ethnicity. WDR36 sequence variations were screened by either direct DNA sequencing or denaturing high-performance liquid chromatography. P53 polymorphisms p.R72P and c.97–147ins16bp were analyzed by single-nucleotide polymorphism (SNP) genotyping and PCR, respectively. Positive SNP and haplotype associations were reanalyzed in a second sample of 211 patients and in combined cases (n = 479). Results. The authors identified almost 50 WDR36 sequence variations, of which approximately two-thirds were rare and one-third were polymorphisms. Approximately half the variants were novel. Eight patients (2.9%) carried rare mutations that were not identified in the control group (P = 0.001). Six Tag SNPs were expected to be structured in three common haplotypes. Haplotype H2 was consistently associated with the disease (P = 0.0024 in combined cases). According to a dominant model, genotypes containing allele P of the P53 p.R72P SNP slightly increased glaucoma risk. Glaucoma susceptibility associated with different WDR36 genotypes also increased significantly in combination with the P53 RP risk genotype, indicating the existence of a genetic interaction. For instance, the OR of the H2 diplotype estimated for POAG1 and combined cases rose approximately 1.6 times in the two-locus genotype H2/RP. Conclusions. Rare WDR36 variants and the P53 p.R72P polymorphism behaved as moderate glaucoma risk factors in Spanish patients. The authors provide evidence for a genetic interaction between WDR36 and P53 variants in POAG susceptibility, although this finding must be confirmed in other populations. PMID:21931130

  6. [Phenotypic and genetic analysis of a patient presented with Tietz/Waardenburg type II a syndrome].

    PubMed

    Wang, Huanhuan; Tang, Lifang; Zhang, Jingmin; Hu, Qin; Chen, Yingwei; Xiao, Bing

    2015-08-01

    To determine the genetic cause for a patient featuring decreased pigmentation of the skin and iris, hearing loss and multiple congenital anomalies. Routine chromosomal banding was performed to analyze the karyotype of the patient and his parents. Single nucleotide polymorphism array (SNP array) was employed to identify cryptic chromosome aberrations, and quantitative real-time PCR was used to confirm the results. Karyotype analysis has revealed no obvious anomaly for the patient and his parents. SNP array analysis of the patient has demonstrated a 3.9 Mb deletion encompassing 3p13p14.1, which caused loss of entire MITF gene. The deletion was confirmed by quantitative real-time PCR. Clinical features of the patient have included severe bilateral hearing loss, decreased pigmentation of the skin and iris and multiple congenital anomalies. The patient, carrying a 3p13p14.1 deletion, has features of Tietz syndrome/Waardenburg syndrome type IIa. This case may provide additional data for the study of genotype-phenotype correlation of this disease.

  7. The Genetic Architecture of the Human Immune System: A Bioresource for Autoimmunity and Disease Pathogenesis

    PubMed Central

    Roederer, Mario; Quaye, Lydia; Mangino, Massimo; Beddall, Margaret H.; Mahnke, Yolanda; Chattopadhyay, Pratip; Tosi, Isabella; Napolitano, Luca; Barberio, Manuela Terranova; Menni, Cristina; Villanova, Federica; Di Meglio, Paola; Spector, Tim D.; Nestle, Frank O.

    2015-01-01

    Summary Despite recent discoveries of genetic variants associated with autoimmunity and infection, genetic control of the human immune system during homeostasis is poorly understood. We undertook a comprehensive immunophenotyping approach, analysing 78,000 immune traits in 669 female twins. From the top 151 heritable traits (up to 96% heritable), we used replicated GWAS to obtain 297 SNP associations at 11 genetic loci explaining up to 36% of the variation of 19 traits. We found multiple associations with canonical traits of all major immune cell subsets, and uncovered insights into genetic control for regulatory T cells. This dataset also revealed traits associated with loci known to confer autoimmune susceptibility, providing mechanistic hypotheses linking immune traits with the etiology of disease. Our data establish a bioresource that links genetic control elements associated with normal immune traits to common autoimmune and infectious diseases, providing a shortcut to identifying potential mechanisms of immune-related diseases. PMID:25772697

  8. A genome-wide association study identifies multiple loci for variation in human ear morphology.

    PubMed

    Adhikari, Kaustubh; Reales, Guillermo; Smith, Andrew J P; Konka, Esra; Palmen, Jutta; Quinto-Sanchez, Mirsha; Acuña-Alonzo, Victor; Jaramillo, Claudia; Arias, William; Fuentes, Macarena; Pizarro, María; Barquera Lozano, Rodrigo; Macín Pérez, Gastón; Gómez-Valdés, Jorge; Villamil-Ramírez, Hugo; Hunemeier, Tábita; Ramallo, Virginia; Silva de Cerqueira, Caio C; Hurtado, Malena; Villegas, Valeria; Granja, Vanessa; Gallo, Carla; Poletti, Giovanni; Schuler-Faccini, Lavinia; Salzano, Francisco M; Bortolini, Maria-Cátira; Canizales-Quinteros, Samuel; Rothhammer, Francisco; Bedoya, Gabriel; Calderón, Rosario; Rosique, Javier; Cheeseman, Michael; Bhutta, Mahmood F; Humphries, Steve E; Gonzalez-José, Rolando; Headon, Denis; Balding, David; Ruiz-Linares, Andrés

    2015-06-24

    Here we report a genome-wide association study for non-pathological pinna morphology in over 5,000 Latin Americans. We find genome-wide significant association at seven genomic regions affecting: lobe size and attachment, folding of antihelix, helix rolling, ear protrusion and antitragus size (linear regression P values 2 × 10(-8) to 3 × 10(-14)). Four traits are associated with a functional variant in the Ectodysplasin A receptor (EDAR) gene, a key regulator of embryonic skin appendage development. We confirm expression of Edar in the developing mouse ear and that Edar-deficient mice have an abnormally shaped pinna. Two traits are associated with SNPs in a region overlapping the T-Box Protein 15 (TBX15) gene, a major determinant of mouse skeletal development. Strongest association in this region is observed for SNP rs17023457 located in an evolutionarily conserved binding site for the transcription factor Cartilage paired-class homeoprotein 1 (CART1), and we confirm that rs17023457 alters in vitro binding of CART1.

  9. The genetic architecture of the human immune system: a bioresource for autoimmunity and disease pathogenesis.

    PubMed

    Roederer, Mario; Quaye, Lydia; Mangino, Massimo; Beddall, Margaret H; Mahnke, Yolanda; Chattopadhyay, Pratip; Tosi, Isabella; Napolitano, Luca; Terranova Barberio, Manuela; Menni, Cristina; Villanova, Federica; Di Meglio, Paola; Spector, Tim D; Nestle, Frank O

    2015-04-09

    Despite recent discoveries of genetic variants associated with autoimmunity and infection, genetic control of the human immune system during homeostasis is poorly understood. We undertook a comprehensive immunophenotyping approach, analyzing 78,000 immune traits in 669 female twins. From the top 151 heritable traits (up to 96% heritable), we used replicated GWAS to obtain 297 SNP associations at 11 genetic loci, explaining up to 36% of the variation of 19 traits. We found multiple associations with canonical traits of all major immune cell subsets and uncovered insights into genetic control for regulatory T cells. This data set also revealed traits associated with loci known to confer autoimmune susceptibility, providing mechanistic hypotheses linking immune traits with the etiology of disease. Our data establish a bioresource that links genetic control elements associated with normal immune traits to common autoimmune and infectious diseases, providing a shortcut to identifying potential mechanisms of immune-related diseases. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Functional Characterization of Schizophrenia-Associated Variation in CACNA1C

    PubMed Central

    Eckart, Nicole; Song, Qifeng; Yang, Rebecca; Wang, Ruihua; Zhu, Heng; McCallion, Andrew S.; Avramopoulos, Dimitrios

    2016-01-01

    Calcium channel subunits, including CACNA1C, have been associated with multiple psychiatric disorders. Specifically, genome-wide association studies (GWAS) have repeatedly identified the single nucleotide polymorphism (SNP) rs1006737 in intron 3 of CACNA1C to be strongly associated with schizophrenia and bipolar disorder. Here, we show that rs1006737 marks a quantitative trait locus for CACNA1C transcript levels. We test 16 SNPs in high linkage disequilibrium with rs1006737 and find one, rs4765905, consistently showing allele-dependent regulatory function in reporter assays. We find allele-specific protein binding for 13 SNPs including rs4765905. Using protein microarrays, we identify several proteins binding ≥3 SNPs, but not control sequences, suggesting possible functional interactions and combinatorial haplotype effects. Finally, using circular chromatin conformation capture, we show interaction of the disease-associated region including the 16 SNPs with the CACNA1C promoter and other potential regulatory regions. Our results elucidate the pathogenic relevance of one of the best-supported risk loci for schizophrenia and bipolar disorder. PMID:27276213

  11. Novel Thrombotic Function of a Human SNP in STXBP5 Revealed by CRISPR/Cas9 Gene Editing in Mice.

    PubMed

    Zhu, Qiuyu Martin; Ko, Kyung Ae; Ture, Sara; Mastrangelo, Michael A; Chen, Ming-Huei; Johnson, Andrew D; O'Donnell, Christopher J; Morrell, Craig N; Miano, Joseph M; Lowenstein, Charles J

    2017-02-01

    To identify and characterize the effect of a SNP (single-nucleotide polymorphism) in the STXBP5 locus that is associated with altered thrombosis in humans. GWAS (genome-wide association studies) have identified numerous SNPs associated with human thrombotic phenotypes, but determining the functional significance of an individual candidate SNP can be challenging, particularly when in vivo modeling is required. Recent GWAS led to the discovery of STXBP5 as a regulator of platelet secretion in humans. Further clinical studies have identified genetic variants of STXBP5 that are linked to altered plasma von Willebrand factor levels and thrombosis in humans, but the functional significance of these variants in STXBP5 is not understood. We used CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats/CRISPR-associated 9) techniques to produce a precise mouse model carrying a human coding SNP rs1039084 (encoding human p. N436S) in the STXBP5 locus associated with decreased thrombosis. Mice carrying the orthologous human mutation (encoding p. N437S in mouse STXBP5) have lower plasma von Willebrand factor levels, decreased thrombosis, and decreased platelet secretion compared with wild-type mice. This thrombosis phenotype recapitulates the phenotype of humans carrying the minor allele of rs1039084. Decreased plasma von Willebrand factor and platelet activation may partially explain the decreased thrombotic phenotype in mutant mice. Using precise mammalian genome editing, we have identified a human nonsynonymous SNP rs1039084 in the STXBP5 locus as a causal variant for a decreased thrombotic phenotype. CRISPR/Cas9 genetic editing facilitates the rapid and efficient generation of animals to study the function of human genetic variation in vascular diseases. © 2016 American Heart Association, Inc.

  12. Single nucleotide polymorphisms in CETP, SLC46A1, SLC19A1, CD36, BCMO1, APOA5, and ABCA1 are significant predictors of plasma HDL in healthy adults

    PubMed Central

    2013-01-01

    Background In a marker-trait association study we estimated the statistical significance of 65 single nucleotide polymorphisms (SNPs) in 23 candidate genes on HDL levels of two independent Caucasian populations. Each population consisted of men and women and their HDL levels were adjusted for gender and body weight. We used a linear regression model. Selected genes corresponded to folate metabolism, vitamins B-12, A, and E, and cholesterol pathways or lipid metabolism. Methods Extracted DNA from both the Sacramento and Beltsville populations was analyzed using an allele discrimination assay with a MALDI-TOF mass spectrometry platform. The adjusted phenotype, y, was HDL levels adjusted for gender and body weight only. Statistical analyses were performed using the genotype association and regression modules from the SNP Variation Suite v7. Results Statistically significant SNPs (where P values were adjusted for false discovery rate) included: CETP (rs7499892 and rs5882); SLC46A1 (rs37514694 and rs739439); SLC19A1 (rs3788199); CD36 (rs3211956); BCMO1 (rs6564851); APOA5 (rs662799); and ABCA1 (rs4149267). Many prior association trends of the SNPs with HDL were replicated in our cross-validation study. Significantly, the association of SNPs in folate transporters (SLC46A1 rs37514694 and rs739439; SLC19A1 rs3788199) with HDL was identified in our study. Conclusions Given recent literature on the role of niacin in the biogenesis of HDL, focus on status and metabolism of B-vitamins and metabolites of eccentric cleavage of β-carotene with lipid metabolism is exciting for future study. PMID:23656756

  13. Pervasive sharing of genetic effects in autoimmune disease.

    PubMed

    Cotsapas, Chris; Voight, Benjamin F; Rossin, Elizabeth; Lage, Kasper; Neale, Benjamin M; Wallace, Chris; Abecasis, Gonçalo R; Barrett, Jeffrey C; Behrens, Timothy; Cho, Judy; De Jager, Philip L; Elder, James T; Graham, Robert R; Gregersen, Peter; Klareskog, Lars; Siminovitch, Katherine A; van Heel, David A; Wijmenga, Cisca; Worthington, Jane; Todd, John A; Hafler, David A; Rich, Stephen S; Daly, Mark J

    2011-08-01

    Genome-wide association (GWA) studies have identified numerous, replicable, genetic associations between common single nucleotide polymorphisms (SNPs) and risk of common autoimmune and inflammatory (immune-mediated) diseases, some of which are shared between two diseases. Along with epidemiological and clinical evidence, this suggests that some genetic risk factors may be shared across diseases, as is the case with alleles in the Major Histocompatibility Locus. In this work we evaluate the extent of this sharing for 107 immune disease-risk SNPs in seven diseases: celiac disease, Crohn's disease, multiple sclerosis, psoriasis, rheumatoid arthritis, systemic lupus erythematosus, and type 1 diabetes. We have developed a novel statistic for Cross Phenotype Meta-Analysis (CPMA) which detects association of a SNP to multiple, but not necessarily all, phenotypes. With it, we find evidence that 47/107 (44%) immune-mediated disease risk SNPs are associated to multiple, but not all, immune-mediated diseases (SNP-wise P(CPMA)<0.01). We also show that distinct groups of interacting proteins are encoded near SNPs which predispose to the same subsets of diseases; we propose these as the mechanistic basis of shared disease risk. We are thus able to leverage genetic data across diseases to construct biological hypotheses about the underlying mechanism of pathogenesis.

  14. DNA methylation levels at chromosome 8q24 in peripheral blood are associated with 8q24 cancer susceptibility loci.

    PubMed

    Barry, Kathryn Hughes; Moore, Lee E; Sampson, Joshua; Yan, Liying; Meyer, Ann; Oler, Andrew J; Chung, Charles C; Wang, Zhaoming; Yeager, Meredith; Amundadottir, Laufey; Berndt, Sonja I

    2014-12-01

    Chromosome 8q24 has emerged as an important region for genetic susceptibility to various cancers, but little is known about the contribution of DNA methylation at 8q24. To evaluate variability in DNA methylation levels at 8q24 and the relationship with cancer susceptibility single nucleotide polymorphisms (SNPs) in this region, we quantified DNA methylation levels in peripheral blood at 145 CpG sites nearby 8q24 cancer susceptibility SNPs or MYC using pyrosequencing among 80 Caucasian men in the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial. For the 60 CpG sites meeting quality control, which also demonstrated temporal stability over a 5-year period, we calculated pairwise Spearman correlations for DNA methylation levels at each CpG site with 42 8q24 cancer susceptibility SNPs. To account for multiple testing, we adjusted P values into q values reflecting the false discovery rate (FDR). In contrast to the MYC CpG sites, most sites nearby the SNPs demonstrated good reproducibility, high methylation levels, and moderate-high between-individual variation. We observed 10 statistically significant (FDR < 0.05) CpG site-SNP correlations. These included correlations between an intergenic CpG site at Chr8:128393157 and the prostate cancer SNP rs16902094 (ρ = -0.54; P = 9.7 × 10(-7); q = 0.002), a PRNCR1 CpG site at Chr8:128167809 and the prostate cancer SNP rs1456315 (ρ = 0.52; P = 1.4 × 10(-6); q = 0.002), and two POU5F1B CpG sites and several prostate/colorectal cancer SNPs (for Chr8:128498051 and rs6983267, ρ = 0.46; P = 2.0 × 10(-5); q = 0.01). This is the first report of correlations between blood DNA methylation levels and cancer susceptibility SNPs at 8q24, suggesting that DNA methylation at this important susceptibility locus may contribute to cancer risk. ©2014 American Association for Cancer Research.
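
    The abstract above converts P values into q values that reflect the false discovery rate but does not say which procedure was used. As a minimal sketch, assuming the widely used Benjamini-Hochberg adjustment (an assumption, not the authors' stated method), the conversion could look like this; the function name and the example P values are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values (q values) in input order.

    Assumption: BH is used here for illustration only; the abstract does not
    name the FDR procedure that was actually applied.
    """
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so that q values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values


if __name__ == "__main__":
    # Hypothetical P values, not taken from the study.
    print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

    Each correlation would then be called significant when its q value falls below the chosen FDR threshold (0.05 in the abstract).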

  15. Whole genome sequencing of Brucella melitensis isolated from 57 patients in Germany reveals high diversity in strains from Middle East

    PubMed Central

    Georgi, Enrico; Walter, Mathias C.; Pfalzgraf, Marie-Theres; Northoff, Bernd H.; Holdt, Lesca M.; Scholz, Holger C.; Zoeller, Lothar

    2017-01-01

    Brucellosis, a worldwide common bacterial zoonotic disease, has become quite rare in Northern and Western Europe. However, since 2014 a significant increase of imported infections caused by Brucella (B.) melitensis has been noticed in Germany. Patients predominantly originated from Middle East including Turkey and Syria. These circumstances afforded an opportunity to gain insights into the population structure of Brucella strains. Brucella-isolates from 57 patients were recovered between January 2014 and June 2016 with culture confirmed brucellosis by the National Consultant Laboratory for Brucella. Their whole genome sequences were generated using the Illumina MiSeq platform. A whole genome-based SNP typing assay was developed in order to resolve geographically attributed genetic clusters. Results were compared to MLVA typing results, the current gold-standard of Brucella typing. In addition, sequences were examined for possible genetic variation within target regions of molecular diagnostic assays. Phylogenetic analyses revealed spatial clustering and distinguished strains from different patients in either case, whereas multiple isolates from a single patient or technical replicates showed identical SNP and MLVA profiles. By including WGS data from the NCBI database, five major genotypes were identified. Notably, strains originating from Turkey showed a high diversity and grouped into seven subclusters of genotype II. MLVA analysis congruently clustered all isolates and predominantly matched the East Mediterranean genetic clade. This study confirms whole-genome based SNP-analysis as a powerful tool for accurate typing of B. melitensis. Furthermore it allows special allocation and therefore provides useful information on the geographic origin for trace-back analysis. However, the lack of reliable metadata in public databases often prevents a resolution below geographic regions or country levels and corresponding precise trace-back analysis. 
    Once this obstacle is resolved, WGS-derived bacterial typing adds an important method to complement epidemiological surveys during outbreak investigations. This is the first report of a detailed genetic investigation of an extensive collection of B. melitensis strains isolated from human cases in Germany. PMID:28388689

  16. A Variant in the BACH2 Gene Is Associated With Susceptibility to Autoimmune Addison's Disease in Humans

    PubMed Central

    Oftedal, Bergithe E.; Napier, Catherine M.; Ainsworth, Holly F.; Husebye, Eystein S.; Cordell, Heather J.; Pearce, Simon H. S.; Mitchell, Anna L.

    2016-01-01

    Context: Autoimmune Addison's disease (AAD) is a rare but highly heritable condition. The BACH2 protein plays a crucial role in T lymphocyte maturation, and allelic variation in its gene has been associated with a number of autoimmune conditions. Objective: We aimed to determine whether alleles of the rs3757247 single nucleotide polymorphism (SNP) in the BACH2 gene are associated with AAD. Design, Setting, and Patients: This case-control association study was performed in two phases using Taqman chemistry. In the first phase, the rs3757247 SNP was genotyped in 358 UK AAD subjects and 166 local control subjects. Genotype data were also available from 5154 healthy UK controls from the Wellcome Trust (WTCCC2) for comparison. In the second phase, the SNP was genotyped in a validation cohort comprising 317 Norwegian AAD subjects and 365 controls. Results: The frequency of the minor T allele was significantly higher in subjects with AAD from the United Kingdom compared to both the local and WTCCC2 control cohorts (58% vs 45% and 48%, respectively) (local controls, P = 1.1 × 10(-4); odds ratio [OR], 1.68; 95% confidence interval [CI], 1.29–2.18; WTCCC2 controls, P = 1.4 × 10(-6); OR, 1.44; 95% CI, 1.23–1.69). This finding was replicated in the Norwegian validation cohort (P = .0015; OR, 1.41; 95% CI, 1.14–1.75). Subgroup analysis showed that this association is present in subjects with both isolated AAD (OR, 1.53; 95% CI, 1.22–1.92) and autoimmune polyglandular syndrome type 2 (OR, 1.37; 95% CI, 1.12–1.69) in the UK cohort, and with autoimmune polyglandular syndrome type 2 in the Norwegian cohort (OR, 1.58; 95% CI, 1.22–2.06). Conclusion: We have demonstrated, for the first time, that allelic variability at the BACH2 locus is associated with susceptibility to AAD. Given its association with multiple autoimmune conditions, BACH2 can be considered a “universal” autoimmune susceptibility locus. PMID:27680876
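
    The odds ratios and 95% confidence intervals reported above are the standard summary of an allelic case-control comparison. The sketch below shows one common way to compute them from a 2x2 table of allele counts, using the log-based (Woolf) interval. The abstract does not state how its intervals were derived, so the method is an assumption, and the counts are illustrative values roughly back-calculated from the rounded percentages and sample sizes in the abstract, not the study's genotype table.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Woolf (log-based) confidence interval for a 2x2 table.

    a, b: risk-allele and other-allele counts in cases
    c, d: risk-allele and other-allele counts in controls
    z:    normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Illustrative counts only (assumption): 358 cases -> 716 alleles at ~58% T,
    # 166 local controls -> 332 alleles at ~45% T. Rounding means these will
    # not reproduce the published figures exactly.
    or_, lo, hi = odds_ratio_ci(415, 301, 149, 183)
    print(f"OR = {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

    An interval that excludes 1 corresponds to a nominally significant association at the matching alpha level.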

  17. Transcriptome-enabled marker discovery and mapping of plastochron-related genes in Petunia spp.

    PubMed

    Guo, Yufang; Wiegert-Rininger, Krystle E; Vallejo, Veronica A; Barry, Cornelius S; Warner, Ryan M

    2015-09-24

    Petunia (Petunia × hybrida), derived from a hybrid between P. axillaris and P. integrifolia, is one of the most economically important bedding plant crops and Petunia spp. serve as model systems for investigating the mechanisms underlying diverse mating systems and pollination syndromes. In addition, we have previously described genetic variation and quantitative trait loci (QTL) related to petunia development rate and morphology, which represent important breeding targets for the floriculture industry to improve crop production and performance. Despite the importance of petunia as a crop, the floriculture industry has been slow to adopt marker assisted selection to facilitate breeding strategies and there remains a limited availability of sequences and molecular markers from the genus compared to other economically important members of the Solanaceae family such as tomato, potato and pepper. Here we report the de novo assembly, annotation and characterization of transcriptomes from P. axillaris, P. exserta and P. integrifolia. Each transcriptome assembly was derived from five tissue libraries (callus, 3-week old seedlings, shoot apices, flowers of mixed developmental stages, and trichomes). A total of 74,573, 54,913, and 104,739 assembled transcripts were recovered from P. axillaris, P. exserta and P. integrifolia, respectively and following removal of multiple isoforms, 32,994 P. axillaris, 30,225 P. exserta, and 33,540 P. integrifolia high quality representative transcripts were extracted for annotation and expression analysis. The transcriptome data was mined for single nucleotide polymorphisms (SNP) and simple sequence repeat (SSR) markers, yielding 89,007 high quality SNPs and 2949 SSRs, respectively. 15,701 SNPs were computationally converted into user-friendly cleaved amplified polymorphic sequence (CAPS) markers and a subset of SNP and CAPS markers were experimentally verified. 
    CAPS markers developed from plastochron-related homologous transcripts from P. axillaris were mapped in an interspecific Petunia population and evaluated for co-localization with QTL for development rate. The high quality of the three Petunia spp. transcriptomes coupled with the utility of the SNP data will serve as a resource for further exploration of genetic diversity within the genus and will facilitate efforts to develop genetic and physical maps to aid the identification of QTL associated with traits of interest.
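
    A SNP can serve as a CAPS marker when one allele creates or destroys a restriction enzyme recognition site, so the two alleles give different digestion patterns. The abstract does not describe the conversion pipeline, so the following is only a minimal sketch of that idea. The function name, the three-enzyme panel, and the example sequences are hypothetical; the recognition sites listed are the well-known ones for these enzymes and are palindromic, so only one strand is scanned.

```python
# Small illustrative enzyme panel (assumption: a real pipeline would use a
# much larger catalogue and handle non-palindromic or degenerate sites).
ENZYMES = {
    "EcoRI": "GAATTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
}


def caps_candidates(flank5, ref, alt, flank3, enzymes=ENZYMES):
    """Return enzymes whose site count differs between the two SNP alleles."""
    ref_seq = (flank5 + ref + flank3).upper()
    alt_seq = (flank5 + alt + flank3).upper()
    hits = []
    for name, site in enzymes.items():
        # A differing number of sites means the alleles digest differently.
        if ref_seq.count(site) != alt_seq.count(site):
            hits.append(name)
    return hits


if __name__ == "__main__":
    # Hypothetical SNP: the A allele completes an EcoRI site, the G allele does not.
    print(caps_candidates("ACGTGA", "A", "G", "TTCGGA"))
```

    The flanking sequence needs to be at least one recognition-site length minus one base on each side so that every site overlapping the SNP is seen.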

  18. Integrating genome-wide genetic variations and monocyte expression data reveals trans-regulated gene modules in humans.

    PubMed

    Rotival, Maxime; Zeller, Tanja; Wild, Philipp S; Maouche, Seraya; Szymczak, Silke; Schillert, Arne; Castagné, Raphaele; Deiseroth, Arne; Proust, Carole; Brocheton, Jessy; Godefroy, Tiphaine; Perret, Claire; Germain, Marine; Eleftheriadis, Medea; Sinning, Christoph R; Schnabel, Renate B; Lubos, Edith; Lackner, Karl J; Rossmann, Heidi; Münzel, Thomas; Rendon, Augusto; Erdmann, Jeanette; Deloukas, Panos; Hengstenberg, Christian; Diemert, Patrick; Montalescot, Gilles; Ouwehand, Willem H; Samani, Nilesh J; Schunkert, Heribert; Tregouet, David-Alexandre; Ziegler, Andreas; Goodall, Alison H; Cambien, François; Tiret, Laurence; Blankenberg, Stefan

    2011-12-01

    One major expectation from the transcriptome in humans is to characterize the biological basis of associations identified by genome-wide association studies. So far, few cis expression quantitative trait loci (eQTLs) have been reliably related to disease susceptibility. Trans-regulating mechanisms may play a more prominent role in disease susceptibility. We analyzed 12,808 genes detected in at least 5% of circulating monocyte samples from a population-based sample of 1,490 European unrelated subjects. We applied a method of extraction of expression patterns, independent component analysis, to identify sets of co-regulated genes. These patterns were then related to 675,350 SNPs to identify major trans-acting regulators. We detected three genomic regions significantly associated with co-regulated gene modules. Association of these loci with multiple expression traits was replicated in Cardiogenics, an independent study in which expression profiles of monocytes were available in 758 subjects. The locus 12q13 (lead SNP rs11171739), previously identified as a type 1 diabetes locus, was associated with a pattern including two cis eQTLs, RPS26 and SUOX, and 5 trans eQTLs, one of which (MADCAM1) is a potential candidate for mediating T1D susceptibility. The locus 12q24 (lead SNP rs653178), which has demonstrated extensive disease pleiotropy, including type 1 diabetes, hypertension, and celiac disease, was associated to a pattern strongly correlating to blood pressure level. The strongest trans eQTL in this pattern was CRIP1, a known marker of cellular proliferation in cancer. The locus 12q15 (lead SNP rs11177644) was associated with a pattern driven by two cis eQTLs, LYZ and YEATS4, and including 34 trans eQTLs, several of them tumor-related genes.
    This study shows that a method exploiting the structure of co-expressions among genes can help identify genomic regions involved in trans regulation of sets of genes and can provide clues for understanding the mechanisms linking genome-wide association loci to disease.

  19. A Genome-Wide mQTL Analysis in Human Adipose Tissue Identifies Genetic Variants Associated with DNA Methylation, Gene Expression and Metabolic Traits

    PubMed Central

    Volkov, Petr; Olsson, Anders H.; Gillberg, Linn; Jørgensen, Sine W.; Brøns, Charlotte; Eriksson, Karl-Fredrik; Groop, Leif; Jansson, Per-Anders; Nilsson, Emma; Rönn, Tina; Vaag, Allan; Ling, Charlotte

    2016-01-01

    Little is known about the extent to which interactions between genetics and epigenetics may affect the risk of complex metabolic diseases and/or their intermediary phenotypes. We performed a genome-wide DNA methylation quantitative trait locus (mQTL) analysis in human adipose tissue of 119 men, where 592,794 single nucleotide polymorphisms (SNPs) were related to DNA methylation of 477,891 CpG sites, covering 99% of RefSeq genes. SNPs in significant mQTLs were further related to gene expression in adipose tissue and obesity related traits. We found 101,911 SNP-CpG pairs (mQTLs) in cis and 5,342 SNP-CpG pairs in trans showing significant associations between genotype and DNA methylation in adipose tissue after correction for multiple testing, where cis is defined as distance less than 500 kb between a SNP and CpG site. These mQTLs include reported obesity, lipid and type 2 diabetes loci, e.g. ADCY3/POMC, APOA5, CETP, FADS2, GCKR, SORT1 and LEPR. Significant mQTLs were overrepresented in intergenic regions and underrepresented in promoter regions and CpG islands. We further identified 635 SNPs in significant cis-mQTLs associated with expression of 86 genes in adipose tissue including CHRNA5, G6PC2, GPX7, RPL27A, THNSL2 and ZFP57. SNPs in significant mQTLs were also associated with body mass index (BMI), lipid traits and glucose and insulin levels in our study cohort and in publicly available consortia data. Importantly, the Causal Inference Test (CIT) demonstrates how genetic variants mediate their effects on metabolic traits (e.g. BMI, cholesterol, high-density lipoprotein (HDL), hemoglobin A1c (HbA1c) and homeostatic model assessment of insulin resistance (HOMA-IR)) via altered DNA methylation in human adipose tissue.
    This study identifies genome-wide interactions between genetic and epigenetic variation in both cis and trans positions influencing gene expression in adipose tissue and in vivo (dys)metabolic traits associated with the development of obesity and diabetes. PMID:27322064
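
    The abstract defines cis as a distance of less than 500 kb between a SNP and a CpG site. A minimal sketch of that rule is shown below. The function name and coordinates are hypothetical, and treating every pair on different chromosomes, or at 500 kb or more on the same chromosome, as trans is an assumption, since the abstract does not spell out the trans definition.

```python
CIS_WINDOW_BP = 500_000  # cis threshold from the abstract: less than 500 kb


def classify_pair(snp_chrom, snp_pos, cpg_chrom, cpg_pos, window=CIS_WINDOW_BP):
    """Label a SNP-CpG pair as 'cis' or 'trans' by genomic distance.

    Assumption: anything that is not within the cis window on the same
    chromosome is reported as 'trans'.
    """
    same_chromosome = snp_chrom == cpg_chrom
    if same_chromosome and abs(snp_pos - cpg_pos) < window:
        return "cis"
    return "trans"


if __name__ == "__main__":
    # Hypothetical coordinates, not taken from the study.
    print(classify_pair("chr2", 25_100_000, "chr2", 25_350_000))
    print(classify_pair("chr2", 25_100_000, "chr11", 25_350_000))
```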

  20. Development of a Genetic Map for Onion (Allium cepa L.) Using Reference-Free Genotyping-by-Sequencing and SNP Assays

    PubMed Central

    Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl

    2017-01-01

    Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
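
    The abstract says that map construction used SNPs satisfying segregation ratio criteria, without giving the test or threshold. For a codominant marker in an F2 population the expected genotype ratio is 1:2:1, and a chi-square goodness-of-fit test is one common way to screen for distorted markers. The sketch below assumes that approach and a 0.05 significance level; both are assumptions, and the genotype counts are hypothetical.

```python
# Chi-square critical value for 2 degrees of freedom at alpha = 0.05.
CHI2_CRIT_DF2_P05 = 5.991


def f2_segregation_chi2(n_aa, n_ab, n_bb):
    """Chi-square statistic for the fit of F2 genotype counts to 1:2:1."""
    total = n_aa + n_ab + n_bb
    expected = (total * 0.25, total * 0.5, total * 0.25)
    observed = (n_aa, n_ab, n_bb)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


def fits_1_2_1(n_aa, n_ab, n_bb, critical=CHI2_CRIT_DF2_P05):
    """True when the counts do not deviate significantly from 1:2:1."""
    return f2_segregation_chi2(n_aa, n_ab, n_bb) < critical


if __name__ == "__main__":
    # Hypothetical genotype counts for two markers.
    print(fits_1_2_1(24, 52, 24))   # close to 1:2:1
    print(fits_1_2_1(50, 40, 10))   # strongly distorted
```

    Markers that fail such a screen are typically set aside before linkage grouping, because distorted segregation can bias estimated map distances.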
