Soler, Stephan; Rittore, Cécile; Touitou, Isabelle; Philibert, Laurent
2011-02-20
From the wide range of methods currently available for genotyping, we wished to identify a quick, reliable and affordable approach for routine use in our laboratory for LTA+252 C>T SNP screening. We set up and compared three genotyping methods for SNP detection: restriction fragment length polymorphism (RFLP), tetra primer amplification refractory mutation system PCR (TPAP) and unlabeled probe melting analysis (UPMA). The SNP model used was LTA+252 C>T, a cytokine gene polymorphism that has been associated with response to treatment in rheumatoid arthritis. The study was performed using 46 samples from healthy Caucasian volunteers. Allele and genotype distribution was similar to that previously described in the same population. All three genotyping methods showed good reproducibility and are suitable for a medium scale throughput molecular platform. UPMA was the most cost effective, reliable and safe method since it required the shortest technician time, could be performed in a single closed tube and involved automatic data analysis. This work is the first to compare these three genotyping techniques and provides evidence for UPMA being the method of choice for LTA+252 C>T SNP genotyping. Copyright © 2010 Elsevier B.V. All rights reserved.
SNP discovery and genotyping using Genotyping-by-Sequencing in Pekin ducks.
Zhu, Feng; Cui, Qian-Qian; Hou, Zhuo-Cheng
2016-11-15
Genomic selection and genome-wide association studies need thousands to millions of SNPs. However, many non-model species do not have reference chips for detecting variation. Our goal was to develop and validate an inexpensive but effective method for detecting SNP variation. Genotyping by sequencing (GBS) can be a highly efficient strategy for genome-wide SNP detection, as an alternative to microarray chips. Here, we developed a GBS protocol for ducks and tested it to genotype 49 Pekin ducks. A total of 169,209 SNPs were identified from all animals, with a mean of 55,920 SNPs per individual. The average SNP density reached 1156 SNPs/MB. In this study, the first application of GBS to ducks, we demonstrate the power and simplicity of this method. GBS can be used for genetic studies in to provide an effective method for genome-wide SNP discovery.
Pappas, D J; Lizee, A; Paunic, V; Beutner, K R; Motyer, A; Vukcevic, D; Leslie, S; Biesiada, J; Meller, J; Taylor, K D; Zheng, X; Zhao, L P; Gourraud, P-A; Hollenbach, J A; Mack, S J; Maiers, M
2018-05-22
Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and -DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes.
McClure, Matthew C.; Sonstegard, Tad S.; Wiggans, George R.; Van Eenennaam, Alison L.; Weber, Kristina L.; Penedo, Cecilia T.; Berry, Donagh P.; Flynn, John; Garcia, Jose F.; Carmo, Adriana S.; Regitano, Luciana C. A.; Albuquerque, Milla; Silva, Marcos V. G. B.; Machado, Marco A.; Coffey, Mike; Moore, Kirsty; Boscher, Marie-Yvonne; Genestout, Lucie; Mazza, Raffaele; Taylor, Jeremy F.; Schnabel, Robert D.; Simpson, Barry; Marques, Elisa; McEwan, John C.; Cromie, Andrew; Coutinho, Luiz L.; Kuehn, Larry A.; Keele, John W.; Piper, Emily K.; Cook, Jim; Williams, Robert; Van Tassell, Curtis P.
2013-01-01
To assist cattle producers transition from microsatellite (MS) to single nucleotide polymorphism (SNP) genotyping for parental verification we previously devised an effective and inexpensive method to impute MS alleles from SNP haplotypes. While the reported method was verified with only a limited data set (N = 479) from Brown Swiss, Guernsey, Holstein, and Jersey cattle, some of the MS-SNP haplotype associations were concordant across these phylogenetically diverse breeds. This implied that some haplotypes predate modern breed formation and remain in strong linkage disequilibrium. To expand the utility of MS allele imputation across breeds, MS and SNP data from more than 8000 animals representing 39 breeds (Bos taurus and B. indicus) were used to predict 9410 SNP haplotypes, incorporating an average of 73 SNPs per haplotype, for which alleles from 12 MS markers could be accurately be imputed. Approximately 25% of the MS-SNP haplotypes were present in multiple breeds (N = 2 to 36 breeds). These shared haplotypes allowed for MS imputation in breeds that were not represented in the reference population with only a small increase in Mendelian inheritance inconsistancies. Our reported reference haplotypes can be used for any cattle breed and the reported methods can be applied to any species to aid the transition from MS to SNP genetic markers. While ~91% of the animals with imputed alleles for 12 MS markers had ≤1 Mendelian inheritance conflicts with their parents' reported MS genotypes, this figure was 96% for our reference animals, indicating potential errors in the reported MS genotypes. The workflow we suggest autocorrects for genotyping errors and rare haplotypes, by MS genotyping animals whose imputed MS alleles fail parentage verification, and then incorporating those animals into the reference dataset. PMID:24065982
An innovative SNP genotyping method adapting to multiple platforms and throughputs
USDA-ARS?s Scientific Manuscript database
Single nucleotide polymorphisms (SNPs) are highly abundant, distributed throughout the genome in various species, and therefore they are widely used as genetic markers. However, the usefulness of this genetic tool relies heavily on the availability of user-friendly SNP genotyping methods. We have d...
McClure, Matthew C.; McCarthy, John; Flynn, Paul; McClure, Jennifer C.; Dair, Emma; O'Connell, D. K.; Kearney, John F.
2018-01-01
A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types: beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and a genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non-matching genotypes per animal, SNP duplicates, sex and breed prediction mismatches, parentage and progeny validation results, and other situations. The Animal QC pipeline make use of ICBF800 SNP set where appropriate to identify errors in a computationally efficient yet still highly accurate method. PMID:29599798
He, Hongbin; Argiro, Laurent; Dessein, Helia; Chevillard, Christophe
2007-01-01
FTA technology is a novel method designed to simplify the collection, shipment, archiving and purification of nucleic acids from a wide variety of biological sources. The number of punches that can normally be obtained from a single specimen card are often however, insufficient for the testing of the large numbers of loci required to identify genetic factors that control human susceptibility or resistance to multifactorial diseases. In this study, we propose an improved technique to perform large-scale SNP genotyping. We applied a whole genome amplification method to amplify DNA from buccal cell samples stabilized using FTA technology. The results show that using the improved technique it is possible to perform up to 15,000 genotypes from one buccal cell sample. Furthermore, the procedure is simple. We consider this improved technique to be a promising methods for performing large-scale SNP genotyping because the FTA technology simplifies the collection, shipment, archiving and purification of DNA, while whole genome amplification of FTA card bound DNA produces sufficient material for the determination of thousands of SNP genotypes.
snpAD: An ancient DNA genotype caller.
Prüfer, Kay
2018-06-21
The study of ancient genomes can elucidate the evolutionary past. However, analyses are complicated by base-modifications in ancient DNA molecules that result in errors in DNA sequences. These errors are particularly common near the ends of sequences and pose a challenge for genotype calling. I describe an iterative method that estimates genotype frequencies and errors along sequences to allow for accurate genotype calling from ancient sequences. The implementation of this method, called snpAD, performs well on high-coverage ancient data, as shown by simulations and by subsampling the data of a high-coverage Neandertal genome. Although estimates for low-coverage genomes are less accurate, I am able to derive approximate estimates of heterozygosity from several low-coverage Neandertals. These estimates show that low heterozygosity, compared to modern humans, was common among Neandertals. The C ++ code of snpAD is freely available at http://bioinf.eva.mpg.de/snpAD/. Supplementary data are available at Bioinformatics online.
Use of partial least squares regression to impute SNP genotypes in Italian cattle breeds.
Dimauro, Corrado; Cellesi, Massimo; Gaspa, Giustino; Ajmone-Marsan, Paolo; Steri, Roberto; Marras, Gabriele; Macciotta, Nicolò P P
2013-06-05
The objective of the present study was to test the ability of the partial least squares regression technique to impute genotypes from low density single nucleotide polymorphisms (SNP) panels i.e. 3K or 7K to a high density panel with 50K SNP. No pedigree information was used. Data consisted of 2093 Holstein, 749 Brown Swiss and 479 Simmental bulls genotyped with the Illumina 50K Beadchip. First, a single-breed approach was applied by using only data from Holstein animals. Then, to enlarge the training population, data from the three breeds were combined and a multi-breed analysis was performed. Accuracies of genotypes imputed using the partial least squares regression method were compared with those obtained by using the Beagle software. The impact of genotype imputation on breeding value prediction was evaluated for milk yield, fat content and protein content. In the single-breed approach, the accuracy of imputation using partial least squares regression was around 90 and 94% for the 3K and 7K platforms, respectively; corresponding accuracies obtained with Beagle were around 85% and 90%. Moreover, computing time required by the partial least squares regression method was on average around 10 times lower than computing time required by Beagle. Using the partial least squares regression method in the multi-breed resulted in lower imputation accuracies than using single-breed data. The impact of the SNP-genotype imputation on the accuracy of direct genomic breeding values was small. The correlation between estimates of genetic merit obtained by using imputed versus actual genotypes was around 0.96 for the 7K chip. Results of the present work suggested that the partial least squares regression imputation method could be useful to impute SNP genotypes when pedigree information is not available.
Hein, David W; Doll, Mark A
2012-01-01
Aim Humans exhibit genetic polymorphism in NAT2 resulting in rapid, intermediate and slow acetylator phenotypes. Over 65 NAT2 variants possessing one or more SNPs in the 870-bp NAT2 coding region have been reported. The seven most frequent SNPs are rs1801279 (191G>A), rs1041983 (282C>T), rs1801280 (341T>C), rs1799929 (481C>T), rs1799930 (590G>A), rs1208 (803A>G) and rs1799931 (857G>A). The majority of studies investigate the NAT2 genotype assay for three SNPs: 481C>T, 590G>A and 857G>A. A tag-SNP (rs1495741) recently identified in a genome-wide association study has also been proposed as a biomarker for the NAT2 phenotype. Materials & methods Sulfamethazine N-acetyltransferase catalytic activities were measured in cryopreserved human hepatocytes from a convenience sample of individuals in the USA with an ethnic frequency similar to the 2010 US population census. These activities were segregated by the tag-SNP rs1495741 and each of the seven SNPs described above. We assessed the accuracy of the tag-SNP and various two-, three-, four- and seven-SNP genotyping panels for their ability to accurately infer NAT2 phenotype. Results The accuracy of the various NAT2 SNP genotype panels to infer NAT2 phenotype were as follows: seven-SNP: 98.4%; tag-SNP: 77.7%; two-SNP: 96.1%; three-SNP: 92.2%; and four-SNP: 98.4%. Conclusion A NAT2 four-SNP genotype panel of rs1801279 (191G>A), rs1801280 (341T>C), rs1799930 (590G>A) and rs1799931 (857G>A) infers NAT2 acetylator phenotype with high accuracy, and is recommended over the tag-, two-, three- and (for economy of scale) the seven-SNP genotyping panels, particularly in populations of non-European ancestry. PMID:22092036
Lopes, F B; Wu, X-L; Li, H; Xu, J; Perkins, T; Genho, J; Ferretti, R; Tait, R G; Bauck, S; Rosa, G J M
2018-02-01
Reliable genomic prediction of breeding values for quantitative traits requires the availability of sufficient number of animals with genotypes and phenotypes in the training set. As of 31 October 2016, there were 3,797 Brangus animals with genotypes and phenotypes. These Brangus animals were genotyped using different commercial SNP chips. Of them, the largest group consisted of 1,535 animals genotyped by the GGP-LDV4 SNP chip. The remaining 2,262 genotypes were imputed to the SNP content of the GGP-LDV4 chip, so that the number of animals available for training the genomic prediction models was more than doubled. The present study showed that the pooling of animals with both original or imputed 40K SNP genotypes substantially increased genomic prediction accuracies on the ten traits. By supplementing imputed genotypes, the relative gains in genomic prediction accuracies on estimated breeding values (EBV) were from 12.60% to 31.27%, and the relative gain in genomic prediction accuracies on de-regressed EBV was slightly small (i.e. 0.87%-18.75%). The present study also compared the performance of five genomic prediction models and two cross-validation methods. The five genomic models predicted EBV and de-regressed EBV of the ten traits similarly well. Of the two cross-validation methods, leave-one-out cross-validation maximized the number of animals at the stage of training for genomic prediction. Genomic prediction accuracy (GPA) on the ten quantitative traits was validated in 1,106 newly genotyped Brangus animals based on the SNP effects estimated in the previous set of 3,797 Brangus animals, and they were slightly lower than GPA in the original data. The present study was the first to leverage currently available genotype and phenotype resources in order to harness genomic prediction in Brangus beef cattle. © 2018 Blackwell Verlag GmbH.
Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.
2015-01-01
ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
HRM and SNaPshot as alternative forensic SNP genotyping methods.
Mehta, Bhavik; Daniel, Runa; McNevin, Dennis
2017-09-01
Single nucleotide polymorphisms (SNPs) have been widely used in forensics for prediction of identity, biogeographical ancestry (BGA) and externally visible characteristics (EVCs). Single base extension (SBE) assays, most notably SNaPshot® (Thermo Fisher Scientific), are commonly used for forensic SNP genotyping as they can be employed on standard instrumentation in forensic laboratories (e.g. capillary electrophoresis). High resolution melt (HRM) analysis is an alternative method and is a simple, fast, single tube assay for low throughput SNP typing. This study compares HRM and SNaPshot®. HRM produced reproducible and concordant genotypes at 500 pg, however, difficulties were encountered when genotyping SNPs with high GC content in flanking regions and differentiating variants of symmetrical SNPs. SNaPshot® was reproducible at 100 pg and is less dependent on SNP choice. HRM has a shorter processing time in comparison to SNaPshot®, avoids post PCR contamination risk and has potential as a screening tool for many forensic applications.
Typing SNP based on the near-infrared spectroscopy and artificial neural network
NASA Astrophysics Data System (ADS)
Ren, Li; Wang, Wei-Peng; Gao, Yu-Zhen; Yu, Xiao-Wei; Xie, Hong-Ping
2009-07-01
Based on the near-infrared spectra (NIRS) of the measured samples as the discriminant variables of their genotypes, the genotype discriminant model of SNP has been established by using back-propagation artificial neural network (BP-ANN). Taking a SNP (857G > A) of N-acetyltransferase 2 (NAT2) as an example, DNA fragments containing the SNP site were amplified by the PCR method based on a pair of primers to obtain the three-genotype (GG, AA, and GA) modeling samples. The NIRS-s of the amplified samples were directly measured in transmission by using quartz cell. Based on the sample spectra measured, the two BP-ANN-s were combined to obtain the stronger ability of the three-genotype classification. One of them was established to compress the measured NIRS variables by using the resilient back-propagation algorithm, and another network established by Levenberg-Marquardt algorithm according to the compressed NIRS-s was used as the discriminant model of the three-genotype classification. For the established model, the root mean square error for the training and the prediction sample sets were 0.0135 and 0.0132, respectively. Certainly, this model could rightly predict the three genotypes (i.e. the accuracy of prediction samples was up to100%) and had a good robust for the prediction of unknown samples. Since the three genotypes of SNP could be directly determined by using the NIRS-s without any preprocessing for the analyzed samples after PCR, this method is simple, rapid and low-cost.
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.
Podder, Mohua; Welch, William J; Zamar, Ruben H; Tebbutt, Scott J
2006-11-30
Single nucleotide polymorphisms (SNPs) are DNA sequence variations, occurring when a single nucleotide--adenine (A), thymine (T), cytosine (C) or guanine (G)--is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX). This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart) is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA) using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU) of St. Paul's Hospital (plus one negative PCR control sample). Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. Our model-based genotype calling algorithm captures the redundancy in the system considering all the underlying probe features of a particular SNP, automatically down-weighting any 'bad data' corresponding to image artifacts on the microarray slide or failure of a specific chemistry. In this regard, our method is able to automatically select the probes which work well and reduce the effect of other so-called bad performing probes in a sample-specific manner, for any number of SNPs.
Cooper, T A; Wiggans, G R; VanRaden, P M
2013-05-01
Call rates on both a single nucleotide polymorphism (SNP) basis and an animal basis are used as measures of data quality and as screening tools for genomic studies and evaluations of dairy cattle. To investigate the relationship of SNP call rate and genotype accuracy for individual SNP, the correlation between percentages of missing genotypes and parent-progeny conflicts for each SNP was calculated for 103,313 Holsteins. Correlations ranged from 0.14 to 0.38 for the BovineSNP50 and BovineLD (Illumina Inc., San Diego, CA) and GeneSeek Genomic Profiler (Neogen Corp., Lincoln, NE) chips, with lower correlations for newer chips. For US genomic evaluations, genotypes are excluded for animals with a call rate of <90% across autosomal SNP or <80% across X-specific SNP. Mean call rate for 220,175 Holstein, Jersey, and Brown Swiss genotypes was 99.6%. Animal genotypes with a call rate of ≤99% were examined from the US Department of Agriculture genotype database to determine how genotype call rate is related to accuracy of calls on an animal basis. Animal call rate was determined from SNP used in genomic evaluation and is the number of called autosomal and X-specific SNP genotypes divided by the number of SNP from that type of chip. To investigate the relationship of animal call rate and parentage validation, conflicts between a genotyped animal and its sire or dam were determined through a duo test (opposite homozygous SNP genotypes between sire and progeny; 1,374 animal genotypes) and a trio test (also including conflicts with dam and heterozygous SNP genotype for the animal when both parents are the same homozygote; 482 animal genotypes). When animal call rate was ≤ 80%, parentage validation was no longer reliable with the duo test. With the trio test, parentage validation was no longer reliable when animal call rate was ≤ 90%. To investigate how animal call rate was related to genotyping accuracy for animals with multiple genotypes, concordance between genotypes for 1,216 animals that had a genotype with a call rate of ≤ 99% (low call rate) as well as a genotype with a call rate of >99% (high call rate) were calculated by dividing the number of identical SNP genotype calls by the number of SNP that were called for both genotypes. Mean concordance between low- and high-call genotypes was >99% for a low call rate of >90% but decreased to 97% for a call rate of 86 to 90% and to 58% for a call rate of <60%. Edits on call rate reduce the use of incorrect SNP genotypes to calculate genomic evaluations. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Gregory, Michael D; Kolachana, Bhaskar; Yao, Yin; Nash, Tiffany; Dickinson, Dwight; Eisenberg, Daniel P; Mervis, Carolyn B; Berman, Karen F
2018-04-04
Williams syndrome ([WS], 7q11.23 hemideletion) and 7q11.23 duplication syndrome (Dup7) show contrasting syndromic symptoms. However, within each group there is considerable interindividual variability in the degree to which these phenotypes are expressed. Though software exists to identify areas of copy number variation (CNV) from commonly-available SNP-chip data, this software does not provide non-diploid genotypes in CNV regions. Here, we describe a method for identifying haploid and triploid genotypes in CNV regions, and then, as a proof-of-concept for applying this information to explain clinical variability, we test for genotype-phenotype associations. Blood samples for 25 individuals with WS and 13 individuals with Dup7 were genotyped with Illumina-HumanOmni5M SNP-chips. PennCNV and in-house code were used to make genotype calls for each SNP in the 7q11.23 locus. We tested for association between the presence of aortic arteriopathy and genotypes of the remaining (haploid in WS) or duplicated (triploid in Dup7) alleles. Haploid calls in the 7q11.23 region were made for 99.0% of SNPs in the WS group, and triploid calls for 98.8% of SNPs in those with Dup7. The G allele of SNP rs2528795 in the ELN gene was associated with aortic stenosis in WS participants (p < 0.0049) while the A allele of the same SNP was associated with aortic dilation in Dup7. Commonly available SNP-chip information can be used to make haploid and triploid calls in individuals with CNVs and then to relate variability in specific genes to variability in syndromic phenotypes, as demonstrated here using aortic arteriopathy. This work sets the stage for similar genotype-phenotype analyses in CNVs where phenotypes may be more complex and/or where there is less information about genetic mechanisms.
USDA-ARS?s Scientific Manuscript database
The objective of this study was to investigate alternative methods for designing and utilizing reduced single nucleotide polymorphism (SNP) panels for imputing SNP genotypes. Two purebred Hereford populations, an experimental population known as Line 1 Hereford (L1, N=240) and registered Hereford wi...
Delaneau, Olivier; Marchini, Jonathan
2014-06-13
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.
Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong
2010-04-08
PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.
Sequence-Based Genotyping for Marker Discovery and Co-Dominant Scoring in Germplasm and Populations
Truong, Hoa T.; Ramos, A. Marcos; Yalcin, Feyruz; de Ruiter, Marjo; van der Poel, Hein J. A.; Huvenaars, Koen H. J.; Hogers, René C. J.; van Enckevort, Leonora. J. G.; Janssen, Antoine; van Orsouw, Nathalie J.; van Eijk, Michiel J. T.
2012-01-01
Conventional marker-based genotyping platforms are widely available, but not without their limitations. In this context, we developed Sequence-Based Genotyping (SBG), a technology for simultaneous marker discovery and co-dominant scoring, using next-generation sequencing. SBG offers users several advantages including a generic sample preparation method, a highly robust genome complexity reduction strategy to facilitate de novo marker discovery across entire genomes, and a uniform bioinformatics workflow strategy to achieve genotyping goals tailored to individual species, regardless of the availability of a reference sequence. The most distinguishing features of this technology are the ability to genotype any population structure, regardless whether parental data is included, and the ability to co-dominantly score SNP markers segregating in populations. To demonstrate the capabilities of SBG, we performed marker discovery and genotyping in Arabidopsis thaliana and lettuce, two plant species of diverse genetic complexity and backgrounds. Initially we obtained 1,409 SNPs for arabidopsis, and 5,583 SNPs for lettuce. Further filtering of the SNP dataset produced over 1,000 high quality SNP markers for each species. We obtained a genotyping rate of 201.2 genotypes/SNP and 58.3 genotypes/SNP for arabidopsis (n = 222 samples) and lettuce (n = 87 samples), respectively. Linkage mapping using these SNPs resulted in stable map configurations. We have therefore shown that the SBG approach presented provides users with the utmost flexibility in garnering high quality markers that can be directly used for genotyping and downstream applications. Until advances and costs will allow for routine whole-genome sequencing of populations, we expect that sequence-based genotyping technologies such as SBG will be essential for genotyping of model and non-model genomes alike. PMID:22662172
Park, Jung Hun; Jang, Hyowon; Jung, Yun Kyung; Jung, Ye Lim; Shin, Inkyung; Cho, Dae-Yeon; Park, Hyun Gyu
2017-05-15
We herein describe a new mass spectrometry-based method for multiplex SNP genotyping by utilizing allele-specific ligation and strand displacement amplification (SDA) reaction. In this method, allele-specific ligation is first performed to discriminate base sequence variations at the SNP site within the PCR-amplified target DNA. The primary ligation probe is extended by a universal primer annealing site while the secondary ligation probe has base sequences as an overhang with a nicking enzyme recognition site and complementary mass marker sequence. The ligation probe pairs are ligated by DNA ligase only at specific allele in the target DNA and the resulting ligated product serves as a template to promote the SDA reaction using a universal primer. This process isothermally amplifies short DNA fragments, called mass markers, to be analyzed by mass spectrometry. By varying the sizes of the mass markers, we successfully demonstrated the multiplex SNP genotyping capability of this method by reliably identifying several BRCA mutations in a multiplex manner with mass spectrometry. Copyright © 2016 Elsevier B.V. All rights reserved.
Knoppers, Bartha M; Isasi, Rosario; Benvenisty, Nissim; Kim, Ock-Joo; Lomax, Geoffrey; Morris, Clive; Murray, Thomas H; Lee, Eng Hin; Perry, Margery; Richardson, Genevra; Sipp, Douglas; Tanner, Klaus; Wahlström, Jan; de Wert, Guido; Zeng, Fanyi
2011-09-01
Novel methods and associated tools permitting individual identification in publicly accessible SNP databases have become a debatable issue. There is growing concern that current technical and ethical safeguards to protect the identities of donors could be insufficient. In the context of human embryonic stem cell research, there are no studies focusing on the probability that an hESC line donor could be identified by analyzing published SNP profiles and associated genotypic and phenotypic information. We present the International Stem Cell Forum (ISCF) Ethics Working Party's Policy Statement on "Publishing SNP Genotypes of Human Embryonic Stem Cell Lines (hESC)". The Statement prospectively addresses issues surrounding the publication of genotypic data and associated annotations of hESC lines in open access databases. It proposes a balanced approach between the goals of open science and data sharing with the respect for fundamental bioethical principles (autonomy, privacy, beneficence, justice and research merit and integrity).
Multiplexed SNP genotyping using the Qbead™ system: a quantum dot-encoded microsphere-based assay
Xu, Hongxia; Sha, Michael Y.; Wong, Edith Y.; Uphoff, Janet; Xu, Yanzhang; Treadway, Joseph A.; Truong, Anh; O’Brien, Eamonn; Asquith, Steven; Stubbins, Michael; Spurr, Nigel K.; Lai, Eric H.; Mahoney, Walt
2003-01-01
We have developed a new method using the Qbead™ system for high-throughput genotyping of single nucleotide polymorphisms (SNPs). The Qbead system employs fluorescent Qdot™ semiconductor nanocrystals, also known as quantum dots, to encode microspheres that subsequently can be used as a platform for multiplexed assays. By combining mixtures of quantum dots with distinct emission wavelengths and intensities, unique spectral ‘barcodes’ are created that enable the high levels of multiplexing required for complex genetic analyses. Here, we applied the Qbead system to SNP genotyping by encoding microspheres conjugated to allele-specific oligonucleotides. After hybridization of oligonucleotides to amplicons produced by multiplexed PCR of genomic DNA, individual microspheres are analyzed by flow cytometry and each SNP is distinguished by its unique spectral barcode. Using 10 model SNPs, we validated the Qbead system as an accurate and reliable technique for multiplexed SNP genotyping. By modifying the types of probes conjugated to microspheres, the Qbead system can easily be adapted to other assay chemistries for SNP genotyping as well as to other applications such as analysis of gene expression and protein–protein interactions. With its capability for high-throughput automation, the Qbead system has the potential to be a robust and cost-effective platform for a number of applications. PMID:12682378
Ultra-low-density genotype panels for breed assignment of Angus and Hereford cattle.
Judge, M M; Kelleher, M M; Kearney, J F; Sleator, R D; Berry, D P
2017-06-01
Angus and Hereford beef is marketed internationally for apparent superior meat quality attributes; DNA-based breed authenticity could be a useful instrument to ensure consumer confidence on premium meat products. The objective of this study was to develop an ultra-low-density genotype panel to accurately quantify the Angus and Hereford breed proportion in biological samples. Medium-density genotypes (13 306 single nucleotide polymorphisms (SNPs)) were available on 54 703 commercial and 4042 purebred animals. The breed proportion of the commercial animals was generated from the medium-density genotypes and this estimate was regarded as the gold-standard breed composition. Ten genotype panels (100 to 1000 SNPs) were developed from the medium-density genotypes; five methods were used to identify the most informative SNPs and these included the Delta statistic, the fixation (F st) statistic and an index of both. Breed assignment analyses were undertaken for each breed, panel density and SNP selection method separately with a programme to infer population structure using the entire 13 306 SNP panel (representing the gold-standard measure). Breed assignment was undertaken for all commercial animals (n=54 703), animals deemed to contain some proportion of Angus based on pedigree (n=5740) and animals deemed to contain some proportion of Hereford based on pedigree (n=5187). The predicted breed proportion of all animals from the lower density panels was then compared with the gold-standard breed prediction. Panel density, SNP selection method and breed all had a significant effect on the correlation of predicted and actual breed proportion. Regardless of breed, the Index method of SNP selection numerically (but not significantly) outperformed all other selection methods in accuracy (i.e. correlation and root mean square of prediction) when panel density was ⩾300 SNPs. The correlation between actual and predicted breed proportion increased as panel density increased. Using 300 SNPs (selected using the global index method), the correlation between predicted and actual breed proportion was 0.993 and 0.995 in the Angus and Hereford validation populations, respectively. When SNP panels optimised for breed prediction in one population were used to predict the breed proportion of a separate population, the correlation between predicted and actual breed proportion was 0.034 and 0.044 weaker in the Hereford and Angus populations, respectively (using the 300 SNP panel). It is necessary to include at least 300 to 400 SNPs (per breed) on genotype panels to accurately predict breed proportion from biological samples.
Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.
Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A
2012-12-01
The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30 %), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed for baseline characteristic or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.
High-throughput SNP genotyping for breeding applications in rice using the BeadXpress platform
USDA-ARS?s Scientific Manuscript database
Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Ahlawat, Sonika; Sharma, Rekha; Maitra, A.; Roy, Manoranjan; Tantia, M.S.
2014-01-01
New, quick, and inexpensive methods for genotyping novel caprine Fec gene polymorphisms through tetra-primer ARMS PCR were developed in the present investigation. Single nucleotide polymorphism (SNP) genotyping needs to be attempted to establish association between the identified mutations and traits of economic importance. In the current study, we have successfully genotyped three new SNPs identified in caprine fecundity genes viz. T(-242)C (BMPR1B), G1189A (GDF9) and G735A (BMP15). Tetra-primer ARMS PCR protocol was optimized and validated for these SNPs with short turn-around time and costs. The optimized techniques were tested on 158 random samples of Black Bengal goat breed. Samples with known genotypes for the described genes, previously tested in duplicate using the sequencing methods, were employed for validation of the assay. Upon validation, complete concordance was observed between the tetra-primer ARMS PCR assays and the sequencing results. These results highlight the ability of tetra-primer ARMS PCR in genotyping of mutations in Fec genes. Any associated SNP could be used to accelerate the improvement of goat reproductive traits by identifying high prolific animals at an early stage of life. Our results provide direct evidence that tetra-primer ARMS-PCR is a rapid, reliable, and cost-effective method for SNP genotyping of mutations in caprine Fec genes. PMID:25606428
Henshall, John M; Dierens, Leanne; Sellars, Melony J
2014-09-02
While much attention has focused on the development of high-density single nucleotide polymorphism (SNP) assays, the costs of developing and running low-density assays have fallen dramatically. This makes it feasible to develop and apply SNP assays for agricultural species beyond the major livestock species. Although low-cost low-density assays may not have the accuracy of the high-density assays widely used in human and livestock species, we show that when combined with statistical analysis approaches that use quantitative instead of discrete genotypes, their utility may be improved. The data used in this study are from a 63-SNP marker Sequenom® iPLEX Platinum panel for the Black Tiger shrimp, for which high-density SNP assays are not currently available. For quantitative genotypes that could be estimated, in 5% of cases the most likely genotype for an individual at a SNP had a probability of less than 0.99. Matrix formulations of maximum likelihood equations for parentage assignment were developed for the quantitative genotypes and also for discrete genotypes perturbed by an assumed error term. Assignment rates that were based on maximum likelihood with quantitative genotypes were similar to those based on maximum likelihood with perturbed genotypes but, for more than 50% of cases, the two methods resulted in individuals being assigned to different families. Treating genotypes as quantitative values allows the same analysis framework to be used for pooled samples of DNA from multiple individuals. Resulting correlations between allele frequency estimates from pooled DNA and individual samples were consistently greater than 0.90, and as high as 0.97 for some pools. Estimates of family contributions to the pools based on quantitative genotypes in pooled DNA had a correlation of 0.85 with estimates of contributions from DNA-derived pedigree. Even with low numbers of SNPs of variable quality, parentage testing and family assignment from pooled samples are sufficiently accurate to provide useful information for a breeding program. Treating genotypes as quantitative values is an alternative to perturbing genotypes using an assumed error distribution, but can produce very different results. An understanding of the distribution of the error is required for SNP genotyping platforms.
Jin, Jia-Li; Sun, Jing; Ge, Hui-Juan; Cao, Yun-Xia; Wu, Xiao-Ke; Liang, Feng-Jing; Sun, Hai-Xiang; Ke, Lu; Yi, Long; Wu, Zhi-Wei; Wang, Yong
2009-12-16
Several studies have reported the association of the SNP rs2414096 in the CYP19 gene with hyperandrogenism, which is one of the clinical manifestations of polycystic ovary syndrome (PCOS). These studies suggest that SNP rs2414096 may be involved in the etiopathogenisis of PCOS. To investigate whetherthe CYP19 gene SNP rs2414096 polymorphism is associated with the susceptibility to PCOS, we designed a case-controlled association study including 684 individuals. A case-controlled association study including 684 individuals (386 PCOS patients and 298 controls) was performed to assess the association of SNP rs2414096 with PCOS. Genotyping of SNP rs2414096 was conducted by the polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP) method that was performed on genomic DNA isolated from blood leucocytes. Results were analyzed in respect to clinical test results. The genotypic distributions of rs2414096 (GG, AG, AA) in the CYP19 gene (GG, AG, AA) in women with PCOS (0.363, 0.474, 0.163, respectively) were significantly different from that in controls (0.242, 0.500, 0.258, respectively) (P = 0.001). E2/T was different between the AA and GG genotypes. Age at menarche (AAM) and FSH were also significantly different among the GG, AG, and AA genotypes in women with PCOS (P = 0.0391 and 0.0118, respectively). No differences were observed in body mass index (BMI) and other serum hormone concentrations among the three genotypes, either in the PCOS patients or controls. Our data suggest that SNP rs2414096 in the CYP19 gene is associated with susceptibility to PCOS.
Germline Mutation of the CCK Receptor: A Novel Biomarker for Pancreas Cancer.
Alsubai, Jelal; Matters, Gail L; McGovern, Christopher O; Liao, Jiangang; Gilius, Evan L; Smith, Jill P
2016-01-07
Today, genetic biomarkers have been demonstrated to play an important role in identifying at-risk subjects for familial or inherited cancers. We have identified a single-nucleotide polymorphism (SNP) that results in missplicing of the cholecystokinin (CCK) receptor gene and expressing a larger mutated receptor in pancreatic cancer. The purpose of this study was to evaluate the significance and specificity of this SNP as a potential biomarker in patients with pancreatic cancer compared with other gastrointestinal (GI) cancers that also have CCK receptors. DNA was isolated and genotyped for the CCK receptor SNP from frozen tumor tissue from banked specimens of patients with pancreas, gastric, or colon cancer and from human cancer cell lines. Genotype and allelic frequencies were compared between the cancer cohort and two normal control databases using Fisher's exact test and odds ratio (OR). The Kaplan-Meier method was used to estimate the survival for patients with the CCK-B receptor SNP compared with those with the wild-type genotype. Immunohistochemical staining of cancer cells was done to detect the mutated receptor. Colon and gastric cancer patients had similar genotype frequencies for the CCK receptor SNP as that reported in the normal population. In contrast, the prevalence of the SNP in subjects with pancreatic cancer was twice that of controls and other GI cancers. Survival was adversely affected by the presence of the SNP only in those with pancreatic cancer. Immunoreactivity for the mutated receptor was positive in pancreatic cancer tissues with the SNP but absent in other GI cancers. A SNP of the CCK receptor is significantly increased in patients with pancreatic cancer but not in those with other GI malignancies. Therefore, this SNP may be a potential biomarker for pancreatic cancer.
Sivaprasad, Siddapuram; Rao, Padaki Nagaraja; Gupta, Rajesh; Ashwini, Kaitha; Reddy, Duvvuru Nageshwar
2012-01-01
Background The single nucleotide polymorphism (SNP) of IL28B gene on chromosome 19, encoding for the interferon (IFN)-λ-3 is strongly associated with treatment response to pegylated-IFN and ribavirin in patients infected with different genotypes of hepatitis C virus (HCV). Difference between ethnicity and treatment response rates suggesting a key role of host genetics. The IL28B polymorphism (rs12979860C/T) shows a marked differential distribution between racial groups. Aim The present study is aimed to evaluate genotype and allelic frequency of IL28B gene polymorphism (rs12979860C/T) in Andhra Pradesh, India. Methods A total of 220 healthy controls were recruited for the study. The genotyping of SNP rs12979860C/T on IL28B gene was performed by polymerase chain reaction-direct sequencing method. Result The frequency of CC genotype was found to be significantly (59.09%) higher compared to CT (34.09%) and TT (6.81%) genotypes, respectively. The frequency of major allele C is 0.762 whereas minor allele T is 0.238. Conclusion The higher distribution of genotype ‘CC’ of SNP, rs12979860C/T of IL28B gene in study subjects is suggestive of better response of HCV patients to standard anti-HCV therapy. PMID:25755419
Application of genomic selection in farm animal breeding.
Tan, Cheng; Bian, Cheng; Yang, Da; Li, Ning; Wu, Zhen-Fang; Hu, Xiao-Xiang
2017-11-20
Genomic selection (GS) has become a widely accepted method in animal breeding to genetically improve economic traits. With the declining costs of high-density SNP chips and next-generation sequencing, GS has been applied in dairy cattle, swine, poultry and other animals and gained varying degrees of success. Currently, major challenges in GS studies include further reducing the cost of genome-wide SNP genotyping and improving the predictive accuracy of genomic estimated breeding value (GEBV). In this review, we summarize various methods for genome-wide SNP genotyping and GEBV prediction, and give a brief introduction of GS in livestock and poultry breeding. This review will provide a reference for further implementation of GS in farm animal breeding.
Wolc, Anna; Stricker, Chris; Arango, Jesus; Settar, Petek; Fulton, Janet E; O'Sullivan, Neil P; Preisinger, Rudolf; Habier, David; Fernando, Rohan; Garrick, Dorian J; Lamont, Susan J; Dekkers, Jack C M
2011-01-21
Genomic selection involves breeding value estimation of selection candidates based on high-density SNP genotypes. To quantify the potential benefit of genomic selection, accuracies of estimated breeding values (EBV) obtained with different methods using pedigree or high-density SNP genotypes were evaluated and compared in a commercial layer chicken breeding line. The following traits were analyzed: egg production, egg weight, egg color, shell strength, age at sexual maturity, body weight, albumen height, and yolk weight. Predictions appropriate for early or late selection were compared. A total of 2,708 birds were genotyped for 23,356 segregating SNP, including 1,563 females with records. Phenotypes on relatives without genotypes were incorporated in the analysis (in total 13,049 production records).The data were analyzed with a Reduced Animal Model using a relationship matrix based on pedigree data or on marker genotypes and with a Bayesian method using model averaging. Using a validation set that consisted of individuals from the generation following training, these methods were compared by correlating EBV with phenotypes corrected for fixed effects, selecting the top 30 individuals based on EBV and evaluating their mean phenotype, and by regressing phenotypes on EBV. Using high-density SNP genotypes increased accuracies of EBV up to two-fold for selection at an early age and by up to 88% for selection at a later age. Accuracy increases at an early age can be mostly attributed to improved estimates of parental EBV for shell quality and egg production, while for other egg quality traits it is mostly due to improved estimates of Mendelian sampling effects. A relatively small number of markers was sufficient to explain most of the genetic variation for egg weight and body weight.
Vitis Phylogenomics: Hybridization Intensities from a SNP Array Outperform Genotype Calls
Miller, Allison J.; Matasci, Naim; Schwaninger, Heidi; Aradhya, Mallikarjuna K.; Prins, Bernard; Zhong, Gan-Yuan; Simon, Charles; Buckler, Edward S.; Myles, Sean
2013-01-01
Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera and has general implications for addressing ascertainment bias in array-enabled phylogeny reconstruction. PMID:24236035
An integrated SNP mining and utilization (ISMU) pipeline for next generation sequencing data.
Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A V S K; Varshney, Rajeev K
2014-01-01
Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone free software.
Genome-wide association study of acute post-surgical pain in humans
Kim, Hyungsuk; Ramsay, Edward; Lee, Hyewon; Wahl, Sharon; Dionne, Raymond A
2009-01-01
Aims Testing a relatively small genomic region with a few hundred SNPs provides limited information. Genome-wide association studies (GWAS) provide an opportunity to overcome the limitation of candidate gene association studies. Here, we report the results of a GWAS for the responses to an NSAID analgesic. Materials & methods European Americans (60 females and 52 males) undergoing oral surgery were genotyped with Affymetrix 500K SNP assay. Additional SNP genotyping was performed from the gene in linkage disequilibrium with the candidate SNP revealed by the GWAS. Results GWAS revealed a candidate SNP (rs2562456) associated with analgesic onset, which is in linkage disequilibrium with a gene encoding a zinc finger protein. Additional SNP genotyping of ZNF429 confirmed the association with analgesic onset in humans (p = 1.8 × 10−10, degrees of freedom = 103, F = 28.3). We also found candidate loci for the maximum post-operative pain rating (rs17122021, p = 6.9 × 10−7) and post-operative pain onset time (rs6693882, p = 2.1 × 10−6), however, correcting for multiple comparisons did not sustain these genetic associations. Conclusion GWAS for acute clinical pain followed by additional SNP genotyping of a neighboring gene suggests that genetic variations in or near the loci encoding DNA binding proteins play a role in the individual variations in responses to analgesic drugs. PMID:19207018
GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies
Alonso, Arnald; Marsal, Sara; Tortosa, Raül; Canela-Xandri, Oriol; Julià, Antonio
2013-01-01
We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. PMID:23844243
Accurate HLA type inference using a weighted similarity graph.
Xie, Minzhu; Li, Jing; Jiang, Tao
2010-12-14
The human leukocyte antigen system (HLA) contains many highly variable genes. HLA genes play an important role in the human immune system, and HLA gene matching is crucial for the success of human organ transplantations. Numerous studies have demonstrated that variation in HLA genes is associated with many autoimmune, inflammatory and infectious diseases. However, typing HLA genes by serology or PCR is time consuming and expensive, which limits large-scale studies involving HLA genes. Since it is much easier and cheaper to obtain single nucleotide polymorphism (SNP) genotype data, accurate computational algorithms to infer HLA gene types from SNP genotype data are in need. To infer HLA types from SNP genotypes, the first step is to infer SNP haplotypes from genotypes. However, for the same SNP genotype data set, the haplotype configurations inferred by different methods are usually inconsistent, and it is often difficult to decide which one is true. In this paper, we design an accurate HLA gene type inference algorithm by utilizing SNP genotype data from pedigrees, known HLA gene types of some individuals and the relationship between inferred SNP haplotypes and HLA gene types. Given a set of haplotypes inferred from the genotypes of a population consisting of many pedigrees, the algorithm first constructs a weighted similarity graph based on a new haplotype similarity measure and derives constraint edges from known HLA gene types. Based on the principle that different HLA gene alleles should have different background haplotypes, the algorithm searches for an optimal labeling of all the haplotypes with unknown HLA gene types such that the total weight among the same HLA gene types is maximized. To deal with ambiguous haplotype solutions, we use a genetic algorithm to select haplotype configurations that tend to maximize the same optimization criterion. Our experiments on a previously typed subset of the HapMap data show that the algorithm is highly accurate, achieving an accuracy of 96% for gene HLA-A, 95% for HLA-B, 97% for HLA-C, 84% for HLA-DRB1, 98% for HLA-DQA1 and 97% for HLA-DQB1 in a leave-one-out test. Our algorithm can infer HLA gene types from neighboring SNP genotype data accurately. Compared with a recent approach on the same input data, our algorithm achieved a higher accuracy. The code of our algorithm is available to the public for free upon request to the corresponding authors.
Bhat, Somanath; Polanowski, Andrea M; Double, Mike C; Jarman, Simon N; Emslie, Kerry R
2012-01-01
Recent advances in nanofluidic technologies have enabled the use of Integrated Fluidic Circuits (IFCs) for high-throughput Single Nucleotide Polymorphism (SNP) genotyping (GT). In this study, we implemented and validated a relatively low cost nanofluidic system for SNP-GT with and without Specific Target Amplification (STA). As proof of principle, we first validated the effect of input DNA copy number on genotype call rate using well characterised, digital PCR (dPCR) quantified human genomic DNA samples and then implemented the validated method to genotype 45 SNPs in the humpback whale, Megaptera novaeangliae, nuclear genome. When STA was not incorporated, for a homozygous human DNA sample, reaction chambers containing, on average 9 to 97 copies, showed 100% call rate and accuracy. Below 9 copies, the call rate decreased, and at one copy it was 40%. For a heterozygous human DNA sample, the call rate decreased from 100% to 21% when predicted copies per reaction chamber decreased from 38 copies to one copy. The tightness of genotype clusters on a scatter plot also decreased. In contrast, when the same samples were subjected to STA prior to genotyping a call rate and a call accuracy of 100% were achieved. Our results demonstrate that low input DNA copy number affects the quality of data generated, in particular for a heterozygous sample. Similar to human genomic DNA, a call rate and a call accuracy of 100% was achieved with whale genomic DNA samples following multiplex STA using either 15 or 45 SNP-GT assays. These calls were 100% concordant with their true genotypes determined by an independent method, suggesting that the nanofluidic system is a reliable platform for executing call rates with high accuracy and concordance in genomic sequences derived from biological tissue.
Henning, John A; Coggins, Jamie; Peterson, Matthew
2015-10-06
Hop is an economically important crop for the Pacific Northwest USA as well as other regions of the world. It is a perennial crop with rhizomatous or clonal propagation system for varietal distribution. A big concern for growers as well as brewers is variety purity and questions are regularly posed to public agencies concerning the availability of genotype testing. Current means for genotyping are based upon 25 microsatellites that provides relatively accurate genotyping but cannot always differentiate sister-lines. In addition, numerous PCR runs (25) are required to complete this process and only a few laboratories exist that perform this service. A genotyping protocol based upon SNPs would enable rapid accurate genotyping that can be assayed at any laboratory facility set up for SNP-based genotyping. The results of this study arose from a larger project designed for whole genome association studies upon the USDA-ARS hop germplasm collection consisting of approximately 116 distinct hop varieties and germplasm (female lines) from around the world. The original dataset that arose from partial sequencing of 121 genotypes resulted in the identification of 374,829 SNPs using TASSEL-UNEAK pipeline. After filtering out genotypes with more than 50% missing data (5 genotypes) and SNP markers with more than 20% missing data, 32,206 highly filtered SNP markers across 116 genotypes were identified and considered for this study. Minor allele frequency (MAF) was calculated for each SNP and ranked according to the most informative to least informative. Only those markers without missing data across genotypes as well as 60% or less heterozygous gamete calls were considered for further analysis. Genetic distances among individuals in the study were calculated using the marker with the highest MAF value, then by using a combination of the two markers with highest MAF values and so on. This process was reiterated until a set of markers was identified that allowed for all genotypes in the study to be genetically differentiated from each other. Next, we compared genetic matrices calculated from the minimal marker sets [(Table 2; 6-, 7-, 8-, 10- and 12-marker set matrices] and that of a matrix calculated from a set of markers with no missing data across all 116 samples (1006 SNP markers). The minimum number of markers required to meet both specifications was a set of 7-markers (Table 3). These seven SNPs were then aligned with a genome assembly, and DNA sequence both upstream and downstream were used to identify primer sequences that can be used to develop seven amplicons for high resolution melting curve PCR detection or other SNP-based PCR detection methods. This study identifies a set of 7 SNP markers that may prove useful for the identification and validation of hop varieties and accessions. Variety validation of unknown samples assumes that the variety under question has been included a priori in a discovery panel. These results are based upon in silica studies and markers need to be validated using different SNP marker technology upon a differential set of hop genotypes. The marker sequence data and suggested primer sets provide potential means to fingerprint hop varieties in most genetic laboratories utilizing SNP-marker technology.
Analysis of high-order SNP barcodes in mitochondrial D-loop for chronic dialysis susceptibility.
Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh; Chang, Hsueh-Wei
2016-10-01
Positively identifying disease-associated single nucleotide polymorphism (SNP) markers in genome-wide studies entails the complex association analysis of a huge number of SNPs. Such large numbers of SNP barcode (SNP/genotype combinations) continue to pose serious computational challenges, especially for high-dimensional data. We propose a novel exploiting SNP barcode method based on differential evolution, termed IDE (improved differential evolution). IDE uses a "top combination strategy" to improve the ability of differential evolution to explore high-order SNP barcodes in high-dimensional data. We simulate disease data and use real chronic dialysis data to test four global optimization algorithms. In 48 simulated disease models, we show that IDE outperforms existing global optimization algorithms in terms of exploring ability and power to detect the specific SNP/genotype combinations with a maximum difference between cases and controls. In real data, we show that IDE can be used to evaluate the relative effects of each individual SNP on disease susceptibility. IDE generated significant SNP barcode with less computational complexity than the other algorithms, making IDE ideally suited for analysis of high-order SNP barcodes. Copyright © 2016 Elsevier Inc. All rights reserved.
Motawi, Tarek; Salman, Tarek; Shaker, Olfat
2015-01-01
Introduction Adiponectin is an adipose tissue-specific protein with insulin-sensitizing properties. Many investigators have explored the association between adiponectin single nucleotide polymorphisms (SNPs) and type 2 diabetes mellitus (T2DM) in different ethnic populations from different regions. Leptin is a protein hormone constituting an important signal in the regulation of adipose tissue mass and body weight. The aim of this study was to explore potential associations between SNP +45 T>G of the adiponectin gene and SNP 2548G/A of leptin with T2DM and the effect of SNPs on serum adiponectin and leptin levels. Material and methods From the Egyptian population, we enrolled 110 T2DM patients and 90 non-diabetic controls. Serum lipid profile, blood glucose, serum adiponectin, and leptin were measured. Genotyping for two common SNPs of the adiponectin and leptin genes was performed by polymerase chain reaction–restriction fragment length polymorphism. Results The G allele and TG/GG genotype of SNP 45 occurred more frequently than the T allele and TT genotype in T2DM patients compares to the controls. Subjects with the GG + TG genotype of SNP 45 were at increased risk for T2DM (OR = 6.476; 95% CI: 3.401–12.33) and associated with a low serum adiponectin level compared with the TT genotype. The serum leptin concentration of GA + AA genotype carriers was not significantly different from that of the GG genotype in the diabetic group. Conclusions The G allele carriers who have reduced plasma concentrations of adiponectin may have an association with T2DM, while leptin SNP 2548 G/A is not associated with the risk of development of T2DM in the Egyptian population. PMID:26528333
USDA-ARS?s Scientific Manuscript database
Multiplexed single nucleotide polymorphism (SNP) markers have the potential to increase the speed and cost-effectiveness of genotyping, provided that an optimal SNP density is used for each application. To test the efficiency of multiplexed SNP genotyping for diversity, mapping and breeding applicat...
Liu, X; Guo, X Y; Xu, X Z; Wu, M; Zhang, X; Li, Q; Ma, P P; Zhang, Y; Wang, C Y; Geng, F J; Qin, C H; Liu, L; Shi, W H; Wang, Y C; Yu, Y
2012-08-16
DNA methylation is essential for adipose deposition in mammals. We screened SNPs of the bovine DNA methyltransferase 3b (DNMT3b) gene in Snow Dragon beef, a commercial beef cattle population in China. Nine SNPs were found in the population and three of six novel SNPs were chosen for genotyping and analyzing a possible association with 16 meat quality traits. The frequencies of the alleles and genotypes of the three SNPs in Snow Dragon beef were similar to those in their terminal-paternal breed, Wagyu. Association analysis disclosed that SNP1 was not associated with any of the traits; SNP2 was significantly associated with lean meat color score and chuck short rib score, and SNP3 had a significant effect on dressing percentage and back-fat thickness in the beef population. The individuals with genotype GG for SNP2 had a 25.7% increase in lean meat color score and a 146% increase in chuck short rib score, compared with genotype AA. The cattle with genotype AG for SNP3 had 35.7 and 24% increases in dressing percentage and 28.8 and 29.2% increases in back-fat thickness, compared with genotypes GG and AA, respectively. Genotypic combination analysis revealed significant interactions between SNP1 and SNP2 and between SNP2 and SNP3 for the traits rib-eye area and live weight. We conclude that there is considerable evidence that DNMT3b is a determiner of beef quality traits.
Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner
2015-07-01
Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R² suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, and no non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
Polymorphism in ovine ANXA9 gene and physic-chemical properties and the fraction of protein in milk.
Pecka-Kiełb, Ewa; Czerniawska-Piątkowska, Ewa; Kowalewska-Łuczak, Inga; Vasil, Milan
2018-04-16
Annexin A9 (ANXA9) is a specific fatty acid transport protein. ANXA9 gene is expressed in various tissues, including secretory tissue and mammary glands. The association between three SNPs of the ANXA9 gene and sheep's milk compositions was assessed. Genotype analysis was performed with the use of PCR-RFLP method. The studied ANXA9 polymorphisms had the following MAF (Major Allele Frequency): SNP1: allele G 0,66; SNP2: allele G 0,54; SNP3: allele C 0,57. The study found the most desired profile of protein fractions, namely an increased kappa-casein fractions and a decreased level of whey protein in sheep's milk for SNP1 and SNP3 polymorphisms. Sheep with the SNP1 GA genotype had the highest (P <0.05) content of fat and dry matter in milk. AXNA9 gene polymorphism did not influence the levels of protein, lactose or urea in sheep's milk. The information contained in this study may be useful for determining the impact of the ANXA9 gene on sheep's milk. The ANXA9 SNP1 and SNP3 polymorphisms results could be included in the breeding programs to select the sheep with the genotypes ensuring the highest kappa-casein levels in milk. However, it is worth conducting further research on ANXA9 and milk composition in larger herds of animals and various breeds of sheep. This article is protected by copyright. All rights reserved.
Leite, Neiva; Furtado-Alle, Lupe; Teixeira, Mayza Dalcin; de Souza, Ricardo Lehtonen Rodrigues; da Silva, Larissa Rosa; Pizzi, Juliana; Lopes, Maria de Fátima Aguiar; Titski, Ana Cláudia Kapp
2018-01-01
Purpose The rs9939609 SNP (T > A) in FTO gene is associated with obesity and type 2 diabetes. The present study aimed at verifying whether this SNP influenced biochemical outcomes of children and adolescents who are overweight/obese submitted to a program of physical exercise and also if there was influence on basal levels of these biochemical variables. Methods The sample was composed by 432 children and adolescents grouped in three ways (obese, overweight, and normal weight); of these, 135 children and adoloescents who are obese and overweight were submitted to a physical exercise program for 12 weeks. All were genotyped by TaqMan SNP genotyping assay. Results The children and adolescents who are overweight/obese and carriers of AA genotype had higher levels of insulin (p=0.03) and HOMA (p=0.007) and lower levels of glucose (p=0.003), but the SNP did not modulate the response to physical exercise. Conclusions In our study, the rs9939609 AA genotype was associated with parameters related to insulin metabolism but did not interact with physical exercise. PMID:29854435
Use of Sequenom Sample ID Plus® SNP Genotyping in Identification of FFPE Tumor Samples
Miller, Jessica K.; Buchner, Nicholas; Timms, Lee; Tam, Shirley; Luo, Xuemei; Brown, Andrew M. K.; Pasternack, Danielle; Bristow, Robert G.; Fraser, Michael; Boutros, Paul C.; McPherson, John D.
2014-01-01
Short tandem repeat (STR) analysis, such as the AmpFlSTR® Identifiler® Plus kit, is a standard, PCR-based human genotyping method used in the field of forensics. Misidentification of cell line and tissue DNA can be costly if not detected early; therefore it is necessary to have quality control measures such as STR profiling in place. A major issue in large-scale research studies involving archival formalin-fixed paraffin embedded (FFPE) tissues is that varying levels of DNA degradation can result in failure to correctly identify samples using STR genotyping. PCR amplification of STRs of several hundred base pairs is not always possible when DNA is degraded. The Sample ID Plus® panel from Sequenom allows for human DNA identification and authentication using SNP genotyping. In comparison to lengthy STR amplicons, this multiplexing PCR assay requires amplification of only 76–139 base pairs, and utilizes 47 SNPs to discriminate between individual samples. In this study, we evaluated both STR and SNP genotyping methods of sample identification, with a focus on paired FFPE tumor/normal DNA samples intended for next-generation sequencing (NGS). The ability to successfully validate the identity of FFPE samples can enable cost savings by reducing rework. PMID:24551080
Use of Sequenom sample ID Plus® SNP genotyping in identification of FFPE tumor samples.
Miller, Jessica K; Buchner, Nicholas; Timms, Lee; Tam, Shirley; Luo, Xuemei; Brown, Andrew M K; Pasternack, Danielle; Bristow, Robert G; Fraser, Michael; Boutros, Paul C; McPherson, John D
2014-01-01
Short tandem repeat (STR) analysis, such as the AmpFlSTR® Identifiler® Plus kit, is a standard, PCR-based human genotyping method used in the field of forensics. Misidentification of cell line and tissue DNA can be costly if not detected early; therefore it is necessary to have quality control measures such as STR profiling in place. A major issue in large-scale research studies involving archival formalin-fixed paraffin embedded (FFPE) tissues is that varying levels of DNA degradation can result in failure to correctly identify samples using STR genotyping. PCR amplification of STRs of several hundred base pairs is not always possible when DNA is degraded. The Sample ID Plus® panel from Sequenom allows for human DNA identification and authentication using SNP genotyping. In comparison to lengthy STR amplicons, this multiplexing PCR assay requires amplification of only 76-139 base pairs, and utilizes 47 SNPs to discriminate between individual samples. In this study, we evaluated both STR and SNP genotyping methods of sample identification, with a focus on paired FFPE tumor/normal DNA samples intended for next-generation sequencing (NGS). The ability to successfully validate the identity of FFPE samples can enable cost savings by reducing rework.
Trembizki, Ella; Smith, Helen; Lahra, Monica M; Chen, Marcus; Donovan, Basil; Fairley, Christopher K; Guy, Rebecca; Kaldor, John; Regan, David; Ward, James; Nissen, Michael D; Sloots, Theo P; Whiley, David M
2014-06-01
Neisseria gonorrhoeae antimicrobial resistance (AMR) is a global problem heightened by emerging resistance to ceftriaxone. Appropriate molecular typing methods are important for understanding the emergence and spread of N. gonorrhoeae AMR. We report on the development, validation and testing of a Sequenom MassARRAY iPLEX method for multilocus sequence typing (MLST)-style genotyping of N. gonorrhoeae isolates. An iPLEX MassARRAY method (iPLEX14SNP) was developed targeting 14 informative gonococcal single nucleotide polymorphisms (SNPs) previously shown to predict MLST types. The method was initially validated using 24 N. gonorrhoeae control isolates and was then applied to 397 test isolates collected throughout Queensland, Australia in the first half of 2012. The iPLEX14SNP method provided 100% accuracy for the control isolates, correctly identifying all 14 SNPs for all 24 isolates (336/336). For the 397 test isolates, the iPLEX14SNP assigned results for 5461 of the possible 5558 SNPs (SNP call rate 98.25%), with complete 14 SNP profiles obtained for 364 isolates. Based on the complete SNP profile data, there were 49 different sequence types identified in Queensland, with 11 of the 49 SNP profiles accounting for the majority (n = 280; 77%) of isolates. AMR was dominated by several geographically clustered sequence types. Using the iPLEX14SNP method, up to 384 isolates could be tested within 1 working day for less than Aus$10 per isolate. The iPLEX14SNP offers an accurate and high-throughput method for the MLST-style genotyping of N. gonorrhoeae and may prove particularly useful for large-scale studies investigating the emergence and spread of gonococcal AMR. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I
2016-08-26
Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. They also highlight possible differences between alternative genotyping technologies that could be explored in future studies of non-model organisms.
2012-01-01
Background Efficient, robust, and accurate genotype imputation algorithms make large-scale application of genomic selection cost effective. An algorithm that imputes alleles or allele probabilities for all animals in the pedigree and for all genotyped single nucleotide polymorphisms (SNP) provides a framework to combine all pedigree, genomic, and phenotypic information into a single-stage genomic evaluation. Methods An algorithm was developed for imputation of genotypes in pedigreed populations that allows imputation for completely ungenotyped animals and for low-density genotyped animals, accommodates a wide variety of pedigree structures for genotyped animals, imputes unmapped SNP, and works for large datasets. The method involves simple phasing rules, long-range phasing and haplotype library imputation and segregation analysis. Results Imputation accuracy was high and computational cost was feasible for datasets with pedigrees of up to 25 000 animals. The resulting single-stage genomic evaluation increased the accuracy of estimated genomic breeding values compared to a scenario in which phenotypes on relatives that were not genotyped were ignored. Conclusions The developed imputation algorithm and software and the resulting single-stage genomic evaluation method provide powerful new ways to exploit imputation and to obtain more accurate genetic evaluations. PMID:22462519
Haplotype-Based Genotyping in Polyploids.
Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott
2018-01-01
Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.
Maternal grandsire confirmation and discovery in dairy cattle
USDA-ARS?s Scientific Manuscript database
Accurate pedigree information is essential for selecting dairy animals to improve economically important traits. Two methods of maternal grandsire (MGS) discovery were compared. The first compared one single nucleotide polymorphism (SNP) at a time using a genotype from one or both parents (SNP metho...
Yu, M Y; Zhao, P Q; Yan, X H; Liu, B; Zhang, Q Q; Wang, R; Ma, C H; Liang, X H; Zhu, F L; Gao, L F
2013-09-10
Tumor necrosis factor (TNF)-related apoptosis-inducing ligand (TRAIL) is expressed in different tissues and cells, including the pancreas and lymphocytes, and it can selectively induce apoptosis in tumor cells but not in most normal cells. TRAIL plays critical roles in type 1 diabetes mellitus, and is involved in type 2 diabetes mellitus (T2DM). We recently discovered the association of nonalcoholic fatty liver disease, a risk factor for T2DM, with a single nucleotide polymorphism (SNP) in the TRAIL (TNFSF10) gene at site 1595C/T (rs1131580), indicating the possible association of T2DM with this TRAIL polymorphism. The aim of this study was to investigate the relationship of the TRAIL SNP at site 1595C/T (rs1131580) with T2DM susceptibility and the biometabolic parameters of T2DM in a Han Chinese population. The polymerase chain reaction-restriction fragment length polymorphism method was used to genotype SNP rs1131580 in 292 patients with T2DM and 266 healthy controls. We found that the frequency of the CC genotype and that of the C allele of rs1131580 were significantly higher in T2DM patients than in the control group. Additionally, the triglyceride and serum creatinine levels of T2DM patients with the CC genotype were significantly higher than those of patients with the TT genotype. Thus, the CC genotype of the TRAIL SNP at 1595C/T (rs1131580) confers increased susceptible to T2DM in a Han Chinese population from Shandong Province. These data suggest that the CC genotype at this SNP is related to diabetic severity and it might be a candidate for the prognostic assessment of T2DM.
UCHL1 S18Y variant is a risk factor for Parkinson’s disease in Japan
2012-01-01
Background A recent meta-analysis on the UCHL1 S18Y variant and Parkinson’s disease (PD) showed a significant inverse association between the Y allele and PD; the individual studies included in that meta-analysis, however, have produced conflicting results. We examined the relationship between UCHL1 S18Y single nucleotide polymorphism (SNP) and sporadic PD in Japan. Methods Included were 229 cases within 6 years of onset of PD, defined according to the UK PD Society Brain Bank clinical diagnostic criteria. Controls were 357 inpatients and outpatients without neurodegenerative disease. Adjustment was made for sex, age, region of residence, smoking, and caffeine intake. Results Compared with subjects with the CC or CA genotype of UCHL1 S18Y SNP, those with the AA genotype had a significantly increased risk of sporadic PD: the adjusted OR was 1.57 (95 % CI: 1.06 − 2.31). Compared with subjects with the CC or CA genotype of UCHL1 S18Y and the CC or CT genotype of SNCA SNP rs356220, those with the AA genotype of UCHL1 S18Y and the TT genotype of SNP rs356220 had a significantly increased risk of sporadic PD; the interaction, however, was not significant. Our previous investigation found significant inverse relationships between smoking and caffeine intake and PD in this population. There were no significant interactions between UCHL1 S18Y and smoking or caffeine intake affecting sporadic PD. Conclusions This study reveals that the UCHL1 S18Y variant is a risk factor for sporadic PD. We could not find evidence for interactions affecting sporadic PD between UCHL1 S18Y and SNCA SNP rs356220, smoking, or caffeine intake. PMID:22839974
An Integrated SNP Mining and Utilization (ISMU) Pipeline for Next Generation Sequencing Data
Azam, Sarwar; Rathore, Abhishek; Shah, Trushar M.; Telluri, Mohan; Amindala, BhanuPrakash; Ruperao, Pradeep; Katta, Mohan A. V. S. K.; Varshney, Rajeev K.
2014-01-01
Open source single nucleotide polymorphism (SNP) discovery pipelines for next generation sequencing data commonly requires working knowledge of command line interface, massive computational resources and expertise which is a daunting task for biologists. Further, the SNP information generated may not be readily used for downstream processes such as genotyping. Hence, a comprehensive pipeline has been developed by integrating several open source next generation sequencing (NGS) tools along with a graphical user interface called Integrated SNP Mining and Utilization (ISMU) for SNP discovery and their utilization by developing genotyping assays. The pipeline features functionalities such as pre-processing of raw data, integration of open source alignment tools (Bowtie2, BWA, Maq, NovoAlign and SOAP2), SNP prediction (SAMtools/SOAPsnp/CNS2snp and CbCC) methods and interfaces for developing genotyping assays. The pipeline outputs a list of high quality SNPs between all pairwise combinations of genotypes analyzed, in addition to the reference genome/sequence. Visualization tools (Tablet and Flapjack) integrated into the pipeline enable inspection of the alignment and errors, if any. The pipeline also provides a confidence score or polymorphism information content value with flanking sequences for identified SNPs in standard format required for developing marker genotyping (KASP and Golden Gate) assays. The pipeline enables users to process a range of NGS datasets such as whole genome re-sequencing, restriction site associated DNA sequencing and transcriptome sequencing data at a fast speed. The pipeline is very useful for plant genetics and breeding community with no computational expertise in order to discover SNPs and utilize in genomics, genetics and breeding studies. The pipeline has been parallelized to process huge datasets of next generation sequencing. It has been developed in Java language and is available at http://hpc.icrisat.cgiar.org/ISMU as a standalone free software. PMID:25003610
An imputed genotype resource for the laboratory mouse
Szatkiewicz, Jin P.; Beane, Glen L.; Ding, Yueming; Hutchins, Lucie; de Villena, Fernando Pardo-Manuel; Churchill, Gary A.
2009-01-01
We have created a high-density SNP resource encompassing 7.87 million polymorphic loci across 49 inbred mouse strains of the laboratory mouse by combining data available from public databases and training a hidden Markov model to impute missing genotypes in the combined data. The strong linkage disequilibrium found in dense sets of SNP markers in the laboratory mouse provides the basis for accurate imputation. Using genotypes from eight independent SNP resources, we empirically validated the quality of the imputed genotypes and demonstrate that they are highly reliable for most inbred strains. The imputed SNP resource will be useful for studies of natural variation and complex traits. It will facilitate association study designs by providing high density SNP genotypes for large numbers of mouse strains. We anticipate that this resource will continue to evolve as new genotype data become available for laboratory mouse strains. The data are available for bulk download or query at http://cgd.jax.org/. PMID:18301946
Honsa, Erin; Fricke, Thomas; Stephens, Alex J; Ko, Danny; Kong, Fanrong; Gilbert, Gwendolyn L; Huygens, Flavia; Giffard, Philip M
2008-08-19
Streptococcus agalactiae (Group B Streptococcus (GBS)) is an important human pathogen, particularly of newborns. Emerging evidence for a relationship between genotype and virulence has accentuated the need for efficient and well-defined typing methods. The objective of this study was to develop a single nucleotide polymorphism (SNP) based method for assigning GBS isolates to multilocus sequence typing (MLST)-defined clonal complexes. It was found that a SNP set derived from the MLST database on the basis of maximization of Simpsons Index of Diversity provided poor resolution and did not define groups concordant with the population structure as defined by eBURST analysis of the MLST database. This was interpreted as being a consequence of low diversity and high frequency horizontal gene transfer. Accordingly, a different approach to SNP identification was developed. This entailed use of the "Not-N" bioinformatic algorithm that identifies SNPs diagnostic for groups of known sequence variants, together with an empirical process of SNP testing. This yielded a four member SNP set that divides GBS into 10 groups that are concordant with the population structure. A fifth SNP was identified that increased the sensitivity for the clinically significant clonal complex 17 to 100%. Kinetic PCR methods for the interrogation of these SNPs were developed, and used to genotype 116 well characterized isolates. A five SNP method for dividing GBS into biologically valid groups has been developed. These SNPs are ideal for high throughput surveillance activities, and combining with more rapidly evolving loci when additional resolution is required.
Honsa, Erin; Fricke, Thomas; Stephens, Alex J; Ko, Danny; Kong, Fanrong; Gilbert, Gwendolyn L; Huygens, Flavia; Giffard, Philip M
2008-01-01
Background Streptococcus agalactiae (Group B Streptococcus (GBS)) is an important human pathogen, particularly of newborns. Emerging evidence for a relationship between genotype and virulence has accentuated the need for efficient and well-defined typing methods. The objective of this study was to develop a single nucleotide polymorphism (SNP) based method for assigning GBS isolates to multilocus sequence typing (MLST)-defined clonal complexes. Results It was found that a SNP set derived from the MLST database on the basis of maximisation of Simpsons Index of Diversity provided poor resolution and did not define groups concordant with the population structure as defined by eBURST analysis of the MLST database. This was interpreted as being a consequence of low diversity and high frequency horizontal gene transfer. Accordingly, a different approach to SNP identification was developed. This entailed use of the "Not-N" bioinformatic algorithm that identifies SNPs diagnostic for groups of known sequence variants, together with an empirical process of SNP testing. This yielded a four member SNP set that divides GBS into 10 groups that are concordant with the population structure. A fifth SNP was identified that increased the sensitivity for the clinically significant clonal complex 17 to 100%. Kinetic PCR methods for the interrogation of these SNPs were developed, and used to genotype 116 well characterized isolates. Conclusion A five SNP method for dividing GBS into biologically valid groups has been developed. These SNPs are ideal for high throughput surveillance activities, and combining with more rapidly evolving loci when additional resolution is required. PMID:18710585
Muneta, Yoshihiro; Minagawa, Yu; Kusumoto, Masahiro; Shinkai, Hiroki; Uenishi, Hirohide; Splichal, Igor
2012-05-01
In the present study, we have developed an allele-specific primer-polymerase chain reaction (ASP-PCR) for genotyping a single nucleotide polymorphism (SNP) of swine Toll-like receptor 2 (TLR2) (C406G), which is related to the prevalence of pneumonia caused by Mycoplasma hyopneumoniae. We also compared the allele frequency among several pig breeds of Japan and the Czech Republic. Allele-specific primers were constructed by introducing 1-base mismatch sequence before the SNP site. The swine TLR2 C406G mutation was successfully determined by the ASP-PCR using genomic DNA samples in Japan as previously genotyped by a sequencing method. Using the PCR condition determined, genomic DNA samples from pig blood obtained from 110 pigs from 7 different breeds in the Czech Republic were genotyped by the ASP-PCR. The genotyping results from the ASP-PCR were completely matched with the results from the sequencing method. The allele frequency of the swine TLR2 C406G mutation was 27.5% in the Czech Republic and 3.6% in Japan. The C406G mutation was only found in the Landrace breed in Japan, and was almost exclusively found in the Landrace breed in the Czech Republic as well. These results indicated the usefulness of ASP-PCR for detecting a specific SNP for swine TLR2.
Espin-Garcia, Osvaldo; Craiu, Radu V; Bull, Shelley B
2018-02-01
We evaluate two-phase designs to follow-up findings from genome-wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation-maximization-based inference under a semiparametric maximum likelihood formulation tailored for post-GWAS inference. A GWAS-SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT-SNP-dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme-QT strata yields significant power improvements compared to marginal QT- or SNP-based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc.
Comparison of three PCR-based assays for SNP genotyping in sugar beet
USDA-ARS?s Scientific Manuscript database
Background: PCR allelic discrimination technologies have broad applications in the detection of single nucleotide polymorphisms (SNPs) in genetics and genomics. The use of fluorescence-tagged probes is the leading method for targeted SNP detection, but assay costs and error rates could be improved t...
Tong, Steven Y C; Xie, Shirley; Richardson, Leisha J; Ballard, Susan A; Dakh, Farshid; Grabsch, Elizabeth A; Grayson, M Lindsay; Howden, Benjamin P; Johnson, Paul D R; Giffard, Philip M
2011-01-01
We have developed a single nucleotide polymorphism (SNP) nucleated high-resolution melting (HRM) technique to genotype Enterococcus faecium. Eight SNPs were derived from the E. faecium multilocus sequence typing (MLST) database and amplified fragments containing these SNPs were interrogated by HRM. We tested the HRM genotyping scheme on 85 E. faecium bloodstream isolates and compared the results with MLST, pulsed-field gel electrophoresis (PFGE) and an allele specific real-time PCR (AS kinetic PCR) SNP typing method. In silico analysis based on predicted HRM curves according to the G+C content of each fragment for all 567 sequence types (STs) in the MLST database together with empiric data from the 85 isolates demonstrated that HRM analysis resolves E. faecium into 231 "melting types" (MelTs) and provides a Simpson's Index of Diversity (D) of 0.991 with respect to MLST. This is a significant improvement on the AS kinetic PCR SNP typing scheme that resolves 61 SNP types with D of 0.95. The MelTs were concordant with the known ST of the isolates. For the 85 isolates, there were 13 PFGE patterns, 17 STs, 14 MelTs and eight SNP types. There was excellent concordance between PFGE, MLST and MelTs with Adjusted Rand Indices of PFGE to MelT 0.936 and ST to MelT 0.973. In conclusion, this HRM based method appears rapid and reproducible. The results are concordant with MLST and the MLST based population structure.
High-throughput RAD-SNP genotyping for characterization of sugar beet genotypes
USDA-ARS?s Scientific Manuscript database
High-throughput SNP genotyping provides a rapid way of developing resourceful set of markers for delineating the genetic architecture and for effective species discrimination. In the presented research, we demonstrate a set of 192 SNPs for effective genotyping in sugar beet using high-throughput mar...
Xu, C; Yang, X; Wang, Y; Ding, N; Han, R; Sun, Y; Wang, Y
2017-07-01
Frequencies of two glucose transporter 1 (GLUT1) single-nucleotide polymorphisms (SNPs) (XbaI G>T and HaeIII T>C) were studied with urothelial cell carcinomas of the bladder (UCC) and 204 normal persons. And the expression of the p53, Ki67 and GLUT1 was assayed by immunohistochemistry. The frequency of the TT genotype and T allele of the XbaI G>T SNP was decreased in the patients with UCC. The frequency of the CC genotype and C allele of the HaeIII T>C SNP was decreased in the patients with UCC. The GLUT1 XbaI genotype GG was more frequent in higher tumor stage and higher tumor grade patients. In the XbaI G>T SNP, the GG genotype was significantly related to higher Remmele immunoreactive score (IRS) of Ki67 and higher IRS of GLUT1. In conclusion, the TT genotype in XbaI G>T SNP and CC genotype of HaeIII T>C SNP may have protective effect in the carcinogenesis process of UCC. In the XbaI G>T SNP, the GG genotype of was positively related to tumor proliferation, glucose metabolism, tumor grade and stage. Therefore, the variant might become a possible proliferation-related prognostic factor for UCC.
Ruan, Li; Zhu, Jian-guo; Pan, Cong; Hua, Xing; Yuan, Dong-bo; Li, Zheng-ming; Zhong, Wei-de
2015-01-01
Background. The aim of the study was to investigate the association between single nucleotide polymorphism (SNP) of vitamin D receptor (VDR) gene and clinical progress of benign prostatic hyperplasia (BPH) in Chinese men. Methods. The DNA was extracted from blood of 200 BPH patients with operation (progression group) and 200 patients without operation (control group), respectively. The genotypes of VDR gene FokI SNP represented by “F/f” were identified by PCR-restriction fragment length polymorphism. The odds ratio (OR) of having progression of BPH for having the genotype were calculated. Results. Our date indicated that the f alleles of the VDR gene FokI SNP associated with the progression of BPH (P = 0.009). Conclusion. For the first time, our study demonstrated that VDR gene FokI SNP may be associated with the risk of BPH progress. PMID:25685834
Genotyping three SNPs affecting warfarin drug response by isothermal real-time HDA assays.
Li, Ying; Jortani, Saeed A; Ramey-Hartung, Bronwyn; Hudson, Elizabeth; Lemieux, Bertrand; Kong, Huimin
2011-01-14
The response to the anticoagulant drug warfarin is greatly affected by genetic polymorphisms in the VKORC1 and CYP2C9 genes. Genotyping these polymorphisms has been shown to be important in reducing the time of the trial and error process for finding the maintenance dose of warfarin thus reducing the risk of adverse effects of the drug. We developed a real-time isothermal DNA amplification system for genotyping three single nucleotide polymorphisms (SNPs) that influence warfarin response. For each SNP, real-time isothermal Helicase Dependent Amplification (HDA) reactions were performed to amplify a DNA fragment containing the SNP. Amplicons were detected by fluorescently labeled allele specific probes during real-time HDA amplification. Fifty clinical samples were analyzed by the HDA-based method, generating a total of 150 results. Of these, 148 were consistent between the HDA-based assays and a reference method. The two samples with unresolved HDA-based test results were repeated and found to be consistent with the reference method. The HDA-based assays demonstrated a clinically acceptable performance for genotyping the VKORC1 -1639G>A SNP and two SNPs (430C>T and 1075A>C) for the CYP2C9 enzyme (CYP2C9*2 and CYP2C9*3), all of which are relevant in warfarin pharmacogenentics. Copyright © 2010 Elsevier B.V. All rights reserved.
The Development of Quality Control Genotyping Approaches: A Case Study Using Elite Maize Lines.
Chen, Jiafa; Zavala, Cristian; Ortega, Noemi; Petroli, Cesar; Franco, Jorge; Burgueño, Juan; Costich, Denise E; Hearne, Sarah J
2016-01-01
Quality control (QC) of germplasm identity and purity is a critical component of breeding and conservation activities. SNP genotyping technologies and increased availability of markers provide the opportunity to employ genotyping as a low-cost and robust component of this QC. In the public sector available low-cost SNP QC genotyping methods have been developed from a very limited panel of markers of 1,000 to 1,500 markers without broad selection of the most informative SNPs. Selection of optimal SNPs and definition of appropriate germplasm sampling in addition to platform section impact on logistical and resource-use considerations for breeding and conservation applications when mainstreaming QC. In order to address these issues, we evaluated the selection and use of SNPs for QC applications from large DArTSeq data sets generated from CIMMYT maize inbred lines (CMLs). Two QC genotyping strategies were developed, the first is a "rapid QC", employing a small number of SNPs to identify potential mislabeling of seed packages or plots, the second is a "broad QC", employing a larger number of SNP, used to identify each germplasm entry and to measure heterogeneity. The optimal marker selection strategies combined the selection of markers with high minor allele frequency, sampling of clustered SNP in proportion to marker cluster distance and selecting markers that maintain a uniform genomic distribution. The rapid and broad QC SNP panels selected using this approach were further validated using blind test assessments of related re-generation samples. The influence of sampling within each line was evaluated. Sampling 192 individuals would result in close to 100% possibility of detecting a 5% contamination in the entry, and approximately a 98% probability to detect a 2% contamination of the line. These results provide a framework for the establishment of QC genotyping. A comparison of financial and time costs for use of these approaches across different platforms is discussed providing a framework for institutions involved in maize conservation and breeding to assess the resource use effectiveness of QC genotyping. Application of these research findings, in combination with existing QC approaches, will ensure the regeneration, distribution and use in breeding of true to type inbred germplasm. These findings also provide an effective approach to optimize SNP selection for QC genotyping in other species.
Association of Interleukin-1 Gene cluster polymorphisms with coronary slow flow phenomenon
Mutluer, Ferit Onur; Ural, Dilek; Güngör, Barış; Bolca, Osman; Aksu, Tolga
2018-01-01
Objective: Coronary slow flow phenomenon (CSFP) is characterized by the decreased rate of contrast progression in epicardial coronary arte-ries in the absence of significant coronary stenosis. Mounting evidence has showed a significant association between inflammation and CSFP severity. This study aimed to evaluate possible associations between interleukin-1 receptor antagonist (IL-1ra) gene variable number tandem repeat (VNTR), IL-1β -511 single nucleotide (SNP), and IL-1β+3954 SNP mutations with CSFP. Methods: Forty-eight patients with CSFP and 62 controls with angiographically normal coronary arteries were prospectively enrolled in the study. Genotypes were assessed using the polymerase chain reaction (PCR)-based restriction fragment length polymorphism (PCR-RFLP) technique. Results: Homozygote genotype for allele 2 of+3954 C>T 2/2 genotype was significantly more frequent in patients with CSFP than in the control group, whereas 1/2 genotype was more frequent in the control group (35.4% versus 14.5% for 2/2 genotype and 25% versus 35.5% for 1/2 genotype in CSFP and control groups, respectively, X2=6.6; p=0.04). The allelic frequency of allele 2 of this polymorphism was significantly higher in the CSFP group than in the control group (47.9% versus 28.6% in the control group, X2=5.6; p=0.02). However, there was no significant difference with regard to genotype or allelic frequencies of IL-1ra VNTR or IL-1β -511 SNP polymorphisms between patients with CSFP and controls. Conclusion: IL-1β+3954 SNP mutations are significantly more common in patients with CSFP. It may suggest that the tendency for inflammation may contribute to the presence of this phenomenon. PMID:29339698
Genomic selection in dairy cattle: the USDA experience
USDA-ARS?s Scientific Manuscript database
Genomic selection has revolutionized dairy cattle breeding. Since 2000, assays have been developed to genotype large numbers of single nucleotide polymorphisms (SNP) at relatively low cost. The first commercial SNP genotyping chip was released with a set of 54,001 SNP in December 2007. Over 15,000 ...
Wu, Jianhui; Huang, Shuo; Zeng, Qingdong; Liu, Shengjie; Wang, Qilin; Mu, Jingmei; Yu, Shizhou; Han, Dejun; Kang, Zhensheng
2018-06-16
A major stripe rust resistance QTL on chromosome 4BL was localized to a 4.5-Mb interval using comparative QTL mapping methods and validated in 276 wheat genotypes by haplotype analysis. CYMMIT-derived wheat line P10103 was previously identified to have adult plant resistance (APR) to stripe rust in the greenhouse and field. The conventional approach for QTL mapping in common wheat is laborious. Here, we performed QTL detection of APR using a combination of genome-wide scanning and extreme pool-genotyping. SNP-based genetic maps were constructed using the Wheat55 K SNP array to genotype a recombinant inbred line (RIL) population derived from the cross Mingxian 169 × P10103. Five stable QTL were detected across multiple environments. A fter comparing SNP profiles from contrasting, extreme DNA pools of RILs six putative QTL were located to approximate chromosome positions. A major QTL on chromosome 4B was identified in F 2:4 contrasting pools from cross Zhengmai 9023 × P10103. A consensus QTL (LOD = 26-40, PVE = 42-55%), named QYr.nwafu-4BL, was defined and localized to a 4.5-Mb interval flanked by SNP markers AX-110963704 and AX-110519862 in chromosome arm 4BL. Based on stripe rust response, marker genotypes, pedigree analysis and mapping data, QYr.nwafu-4BL is likely to be a new APR QTL. The applicability of the SNP-based markers flanking QYr.nwafu-4BL was validated on a diversity panel of 276 wheat lines. The additional minor QTL on chromosomes 4A, 5A, 5B and 6A enhanced the level of resistance conferred by QYr.nwafu-4BL. Marker-assisted pyramiding of QYr.nwafu-4BL and other favorable minor QTL in new wheat cultivars should improve the level of APR to stripe rust.
Comparison between genotyping by sequencing and SNP-chip genotyping in QTL mapping in wheat
USDA-ARS?s Scientific Manuscript database
Array- or chip-based single nucleotide polymorphism (SNP) markers are widely used in genomic studies because of their abundance in a genome and cost less per data point compared to older marker technologies. Genotyping by sequencing (GBS), a relatively newer approach of genotyping, suggests equal or...
USDA-ARS?s Scientific Manuscript database
Call rate has been used as a measure of quality on both a single nucleotide polymorphism (SNP) and animal basis since SNP genotypes were first used in genomic evaluation of dairy cattle. The genotyping laboratories perform initial quality control screening and genotypes that fail are usually exclude...
Genetic source tracking of an anthrax outbreak in Shaanxi province, China.
Liu, Dong-Li; Wei, Jian-Chun; Chen, Qiu-Lan; Guo, Xue-Jun; Zhang, En-Min; He, Li; Liang, Xu-Dong; Ma, Guo-Zhu; Zhou, Ti-Cao; Yin, Wen-Wu; Liu, Wei; Liu, Kai; Shi, Yi; Ji, Jian-Jun; Zhang, Hui-Juan; Ma, Lin; Zhang, Fa-Xin; Zhang, Zhi-Kai; Zhou, Hang; Yu, Hong-Jie; Kan, Biao; Xu, Jian-Guo; Liu, Feng; Li, Wei
2017-01-17
Anthrax is an acute zoonotic infectious disease caused by the bacterium known as Bacillus anthracis. From 26 July to 8 August 2015, an outbreak with 20 suspected cutaneous anthrax cases was reported in Ganquan County, Shaanxi province in China. The genetic source tracking analysis of the anthrax outbreak was performed by molecular epidemiological methods in this study. Three molecular typing methods, namely canonical single nucleotide polymorphisms (canSNP), multiple-locus variable-number tandem repeat analysis (MLVA), and single nucleotide repeat (SNR) analysis, were used to investigate the possible source of transmission and identify the genetic relationship among the strains isolated from human cases and diseased animals during the outbreak. Five strains isolated from diseased mules were clustered together with patients' isolates using canSNP typing and MLVA. The causative B. anthracis lineages in this outbreak belonged to the A.Br.001/002 canSNP subgroup and the MLVA15-31 genotype (the 31 genotype in MLVA15 scheme). Because nine isolates from another four provinces in China were clustered together with outbreak-related strains by the canSNP (A.Br.001/002 subgroup) and MLVA15 method (MLVA15-31 genotype), still another SNR analysis (CL10, CL12, CL33, and CL35) was used to source track the outbreak, and the results suggesting that these patients in the anthrax outbreak were probably infected by the same pathogen clone. It was deduced that the anthrax outbreak occurred in Shaanxi province, China in 2015 was a local occurrence.
Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers
2010-01-01
Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls. Conclusions Accurate genomic evaluation of the broader bull and cow population can be achieved with a single genotyping assays containing ~ 3,000 to 5,000 evenly spaced SNP. PMID:20950478
Wang, Zi-nian; Cai, Han-fang; Li, Ming-xun; Cao, Xiu-kai; Lan, Xian-yong; Lei, Chu-zhao; Chen, Hong
2016-01-10
Patatin-like phospholipase domain-containing protein 3 (PNPLA3), a member of the patatin like phospholipase domain-containing (PNPLA) family, plays an important role in energy balance, fat metabolism regulation, glucose metabolism and fatty liver disease. Tetra-primer amplification refractory mutation system PCR (T-ARMS-PCR) is a new method offering fast detection and extreme simplicity at a negligible cost for SNP genotyping. In this paper, we investigated the genetic variations at different ages of 660 Chinese indigenous cattle belonging to three breeds (QC, NY, JX) and applied T-ARMS-PCR and PCR-RFLP methods to genotype four SNPs, SNP1: g.A2980G, SNP2: g.A2996T, SNP3: g.A36718G, SNP4: g.G36850A. The statistical analyses indicated that these 4 SNPs affected growth traits markedly (P<0.05) in QC population, whereas combined haplotypes were not (P>0.05). The qPCR (quantitative PCR) indicated that bovine PNPLA3 gene was exclusively expressed in fat tissues. Besides, the analysis between SNP and mRNA expression revealed that, in SNP1, the expression of AG was much higher than AA and GG (P<0.05), which was in accordance with the results of growth traits association analysis, while the results of SNP4 was not. These results supported high potential that SNPs of bovine PNPLA3 gene might be utilized as genetic markers in marker-assisted selection (MAS) for Chinese cattle breeding programs. Copyright © 2015 Elsevier B.V. All rights reserved.
Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array.
Hinze, Lori L; Hulse-Kemp, Amanda M; Wilson, Iain W; Zhu, Qian-Hao; Llewellyn, Danny J; Taylor, Jen M; Spriggs, Andrew; Fang, David D; Ulloa, Mauricio; Burke, John J; Giband, Marc; Lacape, Jean-Marc; Van Deynze, Allen; Udall, Joshua A; Scheffler, Jodi A; Hague, Steve; Wendel, Jonathan F; Pepper, Alan E; Frelichowski, James; Lawley, Cindy T; Jones, Don C; Percy, Richard G; Stelly, David M
2017-02-03
Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapted to emerging environmental and climate conditions. Accessions and lines have traditionally been characterized based on phenotypes, but phenotypic profiles are limited by the cost, time, and space required to make visual observations and measurements. With advances in molecular genetic methods, genotypic profiles are increasingly able to identify differences among accessions due to the larger number of genetic markers that can be measured. A combination of both methods would greatly enhance our ability to characterize germplasm resources. Recent efforts have culminated in the identification of sufficient SNP markers to establish high-throughput genotyping systems, such as the CottonSNP63K array, which enables a researcher to efficiently analyze large numbers of SNP markers and obtain highly repeatable results. In the current investigation, we have utilized the SNP array for analyzing genetic diversity primarily among cotton cultivars, making comparisons to SSR-based phylogenetic analyses, and identifying loci associated with seed nutritional traits. The SNP markers distinctly separated G. hirsutum from other Gossypium species and distinguished the wild from cultivated types of G. hirsutum. The markers also efficiently discerned differences among cultivars, which was the primary goal when designing the CottonSNP63K array. Population structure within the genus compared favorably with previous results obtained using SSR markers, and an association study identified loci linked to factors that affect cottonseed protein content. Our results provide a large genome-wide variation data set for primarily cultivated cotton. Thousands of SNPs in representative cotton genotypes provide an opportunity to finely discriminate among cultivated cotton from around the world. The SNPs will be relevant as dense markers of genome variation for association mapping approaches aimed at correlating molecular polymorphisms with variation in phenotypic traits, as well as for molecular breeding approaches in cotton.
2013-01-01
Background Insulin-like growth factor 1 (IGF-1) gene is considered as a promising candidate for the identification of polymorphisms affecting cattle performance. The objectives of the current study were to determine the association of the single nucleotide polymorphism (SNP) IGF-1/SnaBI with fertility, milk production and body condition traits in Holstein-Friesian dairy cows under grazing conditions. Methods Seventy multiparous cows from a commercial herd were genotyped for the SNP IGF-1/SnaBI. Fertility measures evaluated were: interval to commencement of luteal activity (CLA), calving to first service (CFS) and calving to conception (CC) intervals. Milk production and body condition score were also evaluated. The study period extended from 3 wk before calving to the fourth month of lactation. Results and discussion Frequencies of the SNP IGF-1/SnaBI alleles A and B were 0.59 and 0.41, respectively. Genotype frequencies were 0.31, 0.54 and 0.14 for AA, AB and BB, respectively. Cows with the AA genotype presented an early CLA and were more likely to resume ovarian cyclicity in the early postpartum than AB and BB ones. No effect of the SNP IGF-1/SnaBI genotype was evidenced on body condition change over the experimental period, suggesting that energy balance is not responsible for the outcome of postpartum ovarian resumption in this study. Traditional fertility measures were not affected by the SNP IGF-1/SnaBI. Conclusion To our knowledge this is the first report describing an association of the SNP IGF-1/SnaBI with an endocrine fertility measure like CLA in cattle. Results herein remark the important role of the IGF-1gene in the fertility of dairy cows on early lactation and make the SNP IGF-1/SnaBI an interesting candidate marker for genetic improvement of fertility in dairy cattle. PMID:23409757
Sun, Mingjun; Jing, Zhigang; Di, Dongdong; Yan, Hao; Zhang, Zhicheng; Xu, Quangang; Zhang, Xiyue; Wang, Xun; Ni, Bo; Sun, Xiangxiang; Yan, Chengxu; Yang, Zhen; Tian, Lili; Li, Jinping; Fan, Weixing
2017-01-01
Brucellosis is a worldwide zoonotic disease caused by Brucella spp. In China, brucellosis is recognized as a reemerging disease mainly caused by Brucella melitensis specie. To better understand the currently endemic B. melitensis strains in China, three Brucella genotyping methods were applied to 110 B. melitensis strains obtained in past several years. By MLVA genotyping, five MLVA-8 genotypes were identified, among which genotypes 42 (1-5-3-13-2-2-3-2) was recognized as the predominant genotype, while genotype 63 (1-5-3-13-2-3-3-2) and a novel genotype of 1-5-3-13-2-4-3-2 were second frequently observed. MLVA-16 discerned a total of 57 MLVA-16 genotypes among these Brucella strains, with 41 genotypes being firstly detected and the other 16 genotypes being previously reported. By BruMLSA21 typing, six sequence types (STs) were identified, among them ST8 is the most frequently seen in China while the other five STs were firstly detected and designated as ST137, ST138, ST139, ST140, and ST141 by international multilocus sequence typing database. Whole-genome sequence (WGS)-single-nucleotide polymorphism (SNP)-based typing and phylogenetic analysis resolved Chinese B. melitensis strains into five clusters, reflecting the existence of multiple lineages among these Chinese B. melitensis strains. In phylogeny, Chinese lineages are more closely related to strains collected from East Mediterranean and Middle East countries, such as Turkey, Kuwait, and Iraq. In the next few years, MLVA typing will certainly remain an important epidemiological tool for Brucella infection analysis, as it displays a high discriminatory ability and achieves result largely in agreement with WGS-SNP-based typing. However, WGS-SNP-based typing is found to be the most powerful and reliable method in discerning Brucella strains and will be popular used in the future.
Hardy, M Y; Ontiveros, N; Varney, M D; Tye-Din, J A
2018-04-01
A hallmark of coeliac disease (CD) is the exceptionally strong genetic association with HLA-DQ2.5, DQ8, and DQ2.2. HLA typing provides information on CD risk important to both clinicians and researchers. A method that enables simple and fast detection of all CD risk genotypes is particularly desirable for the study of large populations. Single nucleotide polymorphism (SNP)-based HLA typing can detect the CD risk genotypes by detecting a combination of six SNPs but this approach can struggle to resolve HLA-DQ2.2, seen in 4% of European CD patients, because of the low resolution of one negatively predicting SNP. We sought to optimise SNP-based HLA typing by harnessing the additional resolution of digital droplet PCR to resolve HLA-DQ2.2. Here we test this two-step approach in an unselected sample of Mexican DNA and compare its accuracy to DNA typed using traditional exon detection. The addition of digital droplet PCR for samples requiring negative prediction of HLA-DQ2.2 enabled HLA-DQ2.2 to be accurately typed. This technique is a simple addition to a SNP-based typing strategy and enables comprehensive definition of all at-risk HLA genotypes in CD in a timely and cost-effective manner. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Selection and Management of DNA Markers for Use in Genomic Evaluation
USDA-ARS?s Scientific Manuscript database
A database was constructed to store genotypes for 50,972 single-nucleotide polymorphisms (SNP) from the Illumina BovineSNP50 BeadChip for over 30,000 animals. The database allows storage of multiple samples per animal and stores all SNP genotypes for a sample in a single row. An indicator specifies ...
Hoffmann, Thomas J; Zhan, Yiping; Kvale, Mark N; Hesselson, Stephanie E; Gollub, Jeremy; Iribarren, Carlos; Lu, Yontao; Mei, Gangwu; Purdy, Matthew M; Quesenberry, Charles; Rowell, Sarah; Shapero, Michael H; Smethurst, David; Somkin, Carol P; Van den Eeden, Stephen K; Walter, Larry; Webster, Teresa; Whitmer, Rachel A; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil
2011-12-01
Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies. Copyright © 2011 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jackson, Paul J.; Hill, Karen K.
2009-11-09
The results outlined in this report provide the information for needed to apply a SNP-based forensic analysis to diverse ricin preparations. The same methods could be useful in castor breeding programs that seek to reduce or eliminate ricin in oil-producing R. communis cultivars.
Forensic SNP Genotyping with SNaPshot: Development of a Novel In-house SBE Multiplex SNP Assay.
Zar, Mian Sahib; Shahid, Ahmad Ali; Shahzad, Muhammad Saqib; Shin, Kyoung-Jin; Lee, Hwan Young; Lee, Sang-Seob; Israr, Muhammad; Wiegand, Peter; Kulstein, Galina
2018-04-10
This study introduces a newly developed in-house SNaPshot single-base extension (SBE) multiplex assay for forensic single nucleotide polymorphism (SNP) genotyping of fresh and degraded samples. The assay was validated with fresh blood samples from four different populations. In addition, altogether 24 samples from skeletal remains were analyzed with the multiplex. Full SNP profiles could be obtained from 14 specimens, while ten remains showed partial SNP profiles. Minor allele frequencies (MAF) of bone samples and different populations were compared and used for association of skeletal remains with a certain population. The results reveal that the SNPs of the bone samples are genetically close to the Pathan population. The findings show that the new multiplex system can be utilized for SNP genotyping of degraded and forensic relevant skeletal material, enabling to provide additional investigative leads in criminal cases. © 2018 American Academy of Forensic Sciences.
Genotype imputation in the domestic dog
Meurs, K. M.
2016-01-01
Application of imputation methods to accurately predict a dense array of SNP genotypes in the dog could provide an important supplement to current analyses of array-based genotyping data. Here, we developed a reference panel of 4,885,283 SNPs in 83 dogs across 15 breeds using whole genome sequencing. We used this panel to predict the genotypes of 268 dogs across three breeds with 84,193 SNP array-derived genotypes as inputs. We then (1) performed breed clustering of the actual and imputed data; (2) evaluated several reference panel breed combinations to determine an optimal reference panel composition; and (3) compared the accuracy of two commonly used software algorithms (Beagle and IMPUTE2). Breed clustering was well preserved in the imputation process across eigenvalues representing 75 % of the variation in the imputed data. Using Beagle with a target panel from a single breed, genotype concordance was highest using a multi-breed reference panel (92.4 %) compared to a breed-specific reference panel (87.0 %) or a reference panel containing no breeds overlapping with the target panel (74.9 %). This finding was confirmed using target panels derived from two other breeds. Additionally, using the multi-breed reference panel, genotype concordance was slightly higher with IMPUTE2 (94.1 %) compared to Beagle; Pearson correlation coefficients were slightly higher for both software packages (0.946 for Beagle, 0.961 for IMPUTE2). Our findings demonstrate that genotype imputation from SNP array-derived data to whole genome-level genotypes is both feasible and accurate in the dog with appropriate breed overlap between the target and reference panels. PMID:27129452
Utsumi, Yu; Sasaki, Nobuhito; Nagashima, Hiromi; Suzuki, Naomi; Nakamura, Yutaka; Yamashita, Masahiro; Kobayashi, Hitoshi; Yamauchi, Kohei
2013-09-01
A single nucleotide polymorphism (SNP; rs20541) in the IL-13 gene has been recognized as a risk factor for asthma. This SNP causes Arg to Gln (Q) substitution at position 110 in the mature IL-13 protein. We have recently showed that FEV1 in asthmatics with the Q110 variant of IL-13 declined faster, and progressive airway remodeling was observed in these subjects (Wynn, 2003 [1]). However, the effects of the IL-13 variant on airway hyperresponsiveness (AHR) remain to be elucidated. We analyzed the relationship between SNP rs20541 in IL-13 and AHR in asthmatics. We recruited 182 asthmatics who visited the asthma outpatient clinic at Iwate Medical University Hospital from 2006 to 2011. Subjects were genotyped for rs20541. Asthma severity, atopic status, age of asthma onset, serum IgE concentration, AHR, and pulmonary function were studied in these subjects. AHR was measured using the continuous methacholine inhalation method (Astograph; Chest; Tokyo, Japan). Genotyping of rs20541 revealed 26 A/A, 77 A/G, and 79 G/G patient genotypes. The D min (U) of the 3 genotypes was 1.17±0.300 in A/A, 1.99±0.35 in A/G, and 2.85±0.39 in G/G. The D min in the 3 genotypes was significantly different. Spirometric data revealed that % FEV1 and % FEF75 were significantly different among the 3 groups of IL-13 genotypes, whereas no significant differences were observed in therapeutic steps, atopic status, house dust mite sensitization, or serum IgE concentration. The SNP rs20541 in IL-13 was associated with AHR in Japanese adult asthmatics. Copyright © 2013 The Japanese Respiratory Society. Published by Elsevier B.V. All rights reserved.
Kaya, Hilal Betul; Cetin, Oznur; Kaya, Hulya; Sahin, Mustafa; Sefer, Filiz; Kahraman, Abdullah; Tanyolac, Bahattin
2013-01-01
Background The olive tree (Olea europaea L.) is a diploid (2n = 2x = 46) outcrossing species mainly grown in the Mediterranean area, where it is the most important oil-producing crop. Because of its economic, cultural and ecological importance, various DNA markers have been used in the olive to characterize and elucidate homonyms, synonyms and unknown accessions. However, a comprehensive characterization and a full sequence of its transcriptome are unavailable, leading to the importance of an efficient large-scale single nucleotide polymorphism (SNP) discovery in olive. The objectives of this study were (1) to discover olive SNPs using next-generation sequencing and to identify SNP primers for cultivar identification and (2) to characterize 96 olive genotypes originating from different regions of Turkey. Methodology/Principal Findings Next-generation sequencing technology was used with five distinct olive genotypes and generated cDNA, producing 126,542,413 reads using an Illumina Genome Analyzer IIx. Following quality and size trimming, the high-quality reads were assembled into 22,052 contigs with an average length of 1,321 bases and 45 singletons. The SNPs were filtered and 2,987 high-quality putative SNP primers were identified. The assembled sequences and singletons were subjected to BLAST similarity searches and annotated with a Gene Ontology identifier. To identify the 96 olive genotypes, these SNP primers were applied to the genotypes in combination with amplified fragment length polymorphism (AFLP) and simple sequence repeats (SSR) markers. Conclusions/Significance This study marks the highest number of SNP markers discovered to date from olive genotypes using transcriptome sequencing. The developed SNP markers will provide a useful source for molecular genetic studies, such as genetic diversity and characterization, high density quantitative trait locus (QTL) analysis, association mapping and map-based gene cloning in the olive. High levels of genetic variation among Turkish olive genotypes revealed by SNPs, AFLPs and SSRs allowed us to characterize the Turkish olive genotype. PMID:24058483
Genetic analysis of interleukin 18 gene polymorphisms in alopecia areata.
Celik, Sumeyya Deniz; Ates, Omer
2018-06-01
Alopecia areata (AA), which appears as nonscarring hair shedding on any hair-bearing area, is a common organ-specific autoimmune condition. Cytokines have important roles in the development of AA. Interleukin (IL) 18 is a significant proinflammatory cytokine that was found higher in the patients with AA. We aimed to investigate whether the IL-18 (rs187238 and rs1946518) single nucleotide polymorphisms (SNPs) may be associated with AA and/or clinical outcome of patients with AA in Turkish population. Genotyping of rs187238 and rs1946518 SNPs were detected using sequence-specific primer-polymerase chain reaction (SSP-PCR) method in 200 patients with AA and 200 control subjects. The genotype distribution of rs1946518 (-607C>A) SNP was found to be statistically significantly different among patients with AA and controls (P = .0008). Distribution of CC+CA genotypes and frequency of -607/allele C of rs1946518 SNP were higher in patients with AA (P = .001, P = .001, respectively). The genotype distribution of rs187238 (-137G>C) SNP was found to be statistically significantly different among patients with AA and control subjects (P = .0014). Distribution of GG genotype and frequency of -137/allele G of rs187238 SNP were higher in patients with AA (P = .0003, P = .001, respectively). The rs1946518 (-607C>A) and rs187238 (-137G>C) polymorphisms were found associated with alopecia areata disease. The study suggests that IL-18 rs187238 and rs1946518 SNPs may be the cause of the AA susceptibility. © 2018 Wiley Periodicals, Inc.
2011-01-01
Background High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity ≥2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus. PMID:21492434
KinSNP software for homozygosity mapping of disease genes using SNP microarrays
2010-01-01
Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from http://bioinfo.bgu.ac.il/bsu/software/kinSNP. PMID:20846928
An innovative SNP genotyping method adapting to multiple platforms and throughputs.
Long, Y M; Chao, W S; Ma, G J; Xu, S S; Qi, L L
2017-03-01
An innovative genotyping method designated as semi-thermal asymmetric reverse PCR (STARP) was developed for genotyping individual SNPs with improved accuracy, flexible throughputs, low operational costs, and high platform compatibility. Multiplex chip-based technology for genome-scale genotyping of single nucleotide polymorphisms (SNPs) has made great progress in the past two decades. However, PCR-based genotyping of individual SNPs still remains problematic in accuracy, throughput, simplicity, and/or operational costs as well as the compatibility with multiple platforms. Here, we report a novel SNP genotyping method designated semi-thermal asymmetric reverse PCR (STARP). In this method, genotyping assay was performed under unique PCR conditions using two universal priming element-adjustable primers (PEA-primers) and one group of three locus-specific primers: two asymmetrically modified allele-specific primers (AMAS-primers) and their common reverse primer. The two AMAS-primers each were substituted one base in different positions at their 3' regions to significantly increase the amplification specificity of the two alleles and tailed at 5' ends to provide priming sites for PEA-primers. The two PEA-primers were developed for common use in all genotyping assays to stringently target the PCR fragments generated by the two AMAS-primers with similar PCR efficiencies and for flexible detection using either gel-free fluorescence signals or gel-based size separation. The state-of-the-art primer design and unique PCR conditions endowed STARP with all the major advantages of high accuracy, flexible throughputs, simple assay design, low operational costs, and platform compatibility. In addition to SNPs, STARP can also be employed in genotyping of indels (insertion-deletion polymorphisms). As vast variations in DNA sequences are being unearthed by many genome sequencing projects and genotyping by sequencing, STARP will have wide applications across all biological organisms in agriculture, medicine, and forensics.
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs
Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong
2009-01-01
Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
He, Jun; Xu, Jiaqi; Wu, Xiao-Lin; Bauck, Stewart; Lee, Jungjae; Morota, Gota; Kachman, Stephen D; Spangler, Matthew L
2018-04-01
SNP chips are commonly used for genotyping animals in genomic selection but strategies for selecting low-density (LD) SNPs for imputation-mediated genomic selection have not been addressed adequately. The main purpose of the present study was to compare the performance of eight LD (6K) SNP panels, each selected by a different strategy exploiting a combination of three major factors: evenly-spaced SNPs, increased minor allele frequencies, and SNP-trait associations either for single traits independently or for all the three traits jointly. The imputation accuracies from 6K to 80K SNP genotypes were between 96.2 and 98.2%. Genomic prediction accuracies obtained using imputed 80K genotypes were between 0.817 and 0.821 for daughter pregnancy rate, between 0.838 and 0.844 for fat yield, and between 0.850 and 0.863 for milk yield. The two SNP panels optimized on the three major factors had the highest genomic prediction accuracy (0.821-0.863), and these accuracies were very close to those obtained using observed 80K genotypes (0.825-0.868). Further exploration of the underlying relationships showed that genomic prediction accuracies did not respond linearly to imputation accuracies, but were significantly affected by genotype (imputation) errors of SNPs in association with the traits to be predicted. SNPs optimal for map coverage and MAF were favorable for obtaining accurate imputation of genotypes whereas trait-associated SNPs improved genomic prediction accuracies. Thus, optimal LD SNP panels were the ones that combined both strengths. The present results have practical implications on the design of LD SNP chips for imputation-enabled genomic prediction.
Verbeke, Joren; Van Poucke, Mario; Peelman, Luc; Piepers, Sofie; De Vliegher, Sarne
2014-12-01
The CXCR1 gene plays an important role in the innate immunity of the bovine mammary gland. Associations between single nucleotide polymorphisms (SNP) CXCR1c.735C>G and c.980A>G and udder health have been identified before in small populations. A fluorescent multiprobe PCR assay was designed specifically and validated to genotype both SNP simultaneously in a reliable and cost-effective manner. In total, 3,106 cows from 50 commercial Flemish dairy herds were genotyped using this assay. Associations between genotype and detailed phenotypic data, including pathogen-specific incidence rate of clinical mastitis (IRCM), test-day somatic cell count, and test-day milk yield (MY) were analyzed. Staphylococcus aureus IRCM tended to associate with SNP c.735C>G. Cows with genotype c.735GG had lower Staph. aureus IRCM compared with cows with genotype c.735CC (rate ratio = 0.35, 95% confidence interval = 0.14–0.90). Additionally, a parity-specific association between Staph. aureus IRCM and SNP c.980A>G was detected. Heifers with genotype c.980GG had a lower Staph. aureus IRCM compared with heifers with genotype c.980AG (rate ratio = 0.15, 95% confidence interval = 0.04–0.56). Differences were less pronounced in multiparous cows. Associations between CXCR1 genotype and somatic cell count were not detected. However, MY was associated with SNP c.735C>G. Cows with genotype c.735GG out-produced cows with genotype c.735CC by 0.8 kg of milk/d. Results provide a basis for further research on the relation between CXCR1 polymorphism and pathogen-specific mastitis resistance and MY.
2014-01-01
Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in majority of the patients. Genetic and environmental factors may cause the disease. In the present study, we used DNA microarray to genotype single nucleotide polymorphisms (SNP) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods A whole genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, were evaluated for their variations associated with KSD. Results Altogether 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes including BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44-molecule and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried high proportion of SNPs with statistical differences of allele frequencies between the patient and control groups within the gene. The total of 26 SNPs showed significant differences of allele frequencies between the patient and control groups and haplotypes associated with disease risk were identified. The SNP rs759330 located 144 bp downstream of BGLAP where it is a predicted microRNA binding site at 3′UTR of PAQR6 – a gene encoding progestin and adipoQ receptor family member VI, was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD and PAQR6 comes into our view as the most potent candidate since associated SNP rs759330 is located in the miRNA binding site and may affect mRNA expression level. PMID:24886237
Zhang, Suhua; Bian, Yingnan; Chen, Anqi; Zheng, Hancheng; Gao, Yuzhen; Hou, Yiping; Li, Chengtao
2017-03-01
Utilizing massively parallel sequencing (MPS) technology for SNP testing in forensic genetics is becoming attractive because of the shortcomings of STR markers, such as their high mutation rates and disadvantages associated with the current PCR-CE method as well as its limitations regarding multiplex capabilities. MPS offers the potential to genotype hundreds to thousands of SNPs from multiple samples in a single experimental run. In this study, we designed a customized SNP panel that includes 273 forensically relevant identity SNPs chosen from SNPforID, IISNP, and the HapMap database as well as previously related studies and evaluated the levels of genotyping precision, sequence coverage, sensitivity and SNP performance using the Ion Torrent PGM. In a concordant study of the custom MPS-SNP panel, only four MPS callings were missing due to coverage reads that were too low (<20), whereas the others were fully concordant with Sanger's sequencing results across the two control samples, that is, 9947A and 9948. The analyses indicated a balanced coverage among the included loci, with the exception of the 16 SNPs that were used to detect an inconsistent allele balance and/or lower coverage reads among 50 tested individuals from the Chinese HAN population and the above controls. With the exception of the 16 poorly performing SNPs, the sequence coverage obtained was extensive for the bulk of the SNPs, and only three Y-SNPs (rs16980601, rs11096432, rs3900) showed a mean coverage below 1000. Analyses of the dilution series of control DNA 9948 yielded reproducible results down to 1ng of DNA input. In addition, we provide an analysis tool for automated data quality control and genotyping checks, and we conclude that the SNP targets are polymorphic and independent in the Chinese HAN population. In summary, the evaluation of the sensitivity, accuracy and genotyping performance provides strong support for the application of MPS technology in forensic SNP analysis, and the assay offers a straightforward sample-to-genotype workflow that could be beneficial in forensic casework with respect to both individual identification and complex kinship issues. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Huang, Ke-Ke; Yin, Rui-Xing; Zeng, Xiao-Na; Huang, Ping; Lin, Quan-Zhen; Wu, Jian; Guo, Tao; Wang, Wei; Yang, De-Zhai; Lin, Wei-Xiong
2013-01-01
Background: The rs7395662 single nucleotide polymorphism (SNP) in the MADD-FOLH1 has been associated with serum lipid traits, but the results are inconsistent in different populations. The present study was undertaken to investigate the association of rs7395662 SNP and several environmental factors with serum lipid levels in the Guangxi Mulao and Han populations. Method: A total of 721 subjects of Mulao and 727 subjects of Han Chinese were randomly selected from our previous stratified randomized samples. Genotyping of the SNP was performed by polymerase chain reaction and restriction fragment length polymorphism combined with gel electrophoresis, and confirmed by direct sequencing. Results: Serum apolipoprotein (Apo) B levels were higher in Mulao than in Han (P < 0.01). The allelic and genotypic frequencies in Han were different between males and females (P < 0.05 for each), but there was no difference between Mulao and Han or between Mulao males and females. The levels of low-density lipoprotein cholesterol (LDL-C) and ApoB in Mulao females were different among the genotypes (P < 0.05), the G allele carriers had higher LDL-C and ApoB levels than the G allele non-carriers. The levels of total cholesterol (TC), triglyceride (TG), LDL-C and ApoB in Han males and TC, TG and high-density lipoprotein cholesterol (HDL-C) in Han females were different among the genotypes (P < 0.05-0.01), the subjects with GG genotype in Han males had higher TC, TG, and ApoB and lower LDL-C levels than the subjects with AA or AG genotype, and the G allele carriers in Han females had lower TC and HDL-C levels than the G allele non-carriers. The levels of LDL-C and ApoB in Mulao females were correlated with the genotypes (P < 0.05 for each). The levels of HDL-C and ApoAI in Han males and HDL-C in Han females were correlated with genotypes (P < 0.05-0.001). Serum lipid parameters were also correlated with several environmental factors in both ethnic groups (P < 0.05-0.01). Conclusion: The association of rs7395662 SNP and serum lipid levels is different between the Mulao and Han populations, and between males and females in both ethnic groups. PMID:24046529
Li, Ao; Liu, Zongzhi; Lezon-Geyda, Kimberly; Sarkar, Sudipa; Lannin, Donald; Schulz, Vincent; Krop, Ian; Winer, Eric; Harris, Lyndsay; Tuck, David
2011-01-01
There is an increasing interest in using single nucleotide polymorphism (SNP) genotyping arrays for profiling chromosomal rearrangements in tumors, as they allow simultaneous detection of copy number and loss of heterozygosity with high resolution. Critical issues such as signal baseline shift due to aneuploidy, normal cell contamination, and the presence of GC content bias have been reported to dramatically alter SNP array signals and complicate accurate identification of aberrations in cancer genomes. To address these issues, we propose a novel Global Parameter Hidden Markov Model (GPHMM) to unravel tangled genotyping data generated from tumor samples. In contrast to other HMM methods, a distinct feature of GPHMM is that the issues mentioned above are quantitatively modeled by global parameters and integrated within the statistical framework. We developed an efficient EM algorithm for parameter estimation. We evaluated performance on three data sets and show that GPHMM can correctly identify chromosomal aberrations in tumor samples containing as few as 10% cancer cells. Furthermore, we demonstrated that the estimation of global parameters in GPHMM provides information about the biological characteristics of tumor samples and the quality of genotyping signal from SNP array experiments, which is helpful for data quality control and outlier detection in cohort studies. PMID:21398628
Huang, Hu; Tada Iida, Kaoruko; Murakami, Haruka; Saito, Yoko; Otsuki, Takeshi; Iemitsu, Motoyuki; Maeda, Seiji; Sone, Hirohito; Kuno, Shinya; Ajisaka, Ryuichi
2007-12-01
Adiponectin is an adipocytokine that is involved in insulin sensitivity. The adiponectin gene contains a single nucleotide polymorphism (SNP) at position 276 (G/T). The GG genotype of SNP276 (G/T) is associated with lower plasma adiponectin levels and a higher insulin resistance index. Therefore, we examined the influence of SNP276 (G/T) on the plasma level of adiponectin in response to exercise training. Thirty healthy Japanese (M12/F18; 56 to 79 years old) performed both resistance and endurance training, 5 times a week for 6 months. The work rate per kg of weight at double-product break-point (DPBP) was measured. Blood samples were obtained before and after the experiment. Plasma concentrations of adiponectin, HbA1c, insulin, glucose, total, high-density lipoprotein (HDL), and low-density lipoprotein (LDL) cholesterol, and triglyceride were measured. Genotypes of SNP276 were specified. Student's t-test for paired values and unpaired values was used. After the 6-month training period, the work rate per kg of weight at DPBP and the plasma HDL-cholesterol level were significantly improved (P<0.05), while no change was observed in the total plasma adiponectin level. However, the plasma adiponectin level in those with the GT + TT genotype had significantly increased (P<0.05). Additionally, the degree of the decrease in the HOMA-R level was significantly greater in the subjects with the GT + TT genotype than those with the GG genotype (p<0.05). Our results suggest that subjects with the genotype GT + TT at SNP276 (G/T) have a greater adiponectin-related response to exercise training than those with the GG genotype.
Melzer, Nina; Wittenburg, Dörte; Repsilber, Dirk
2013-01-01
In this study the benefit of metabolome level analysis for the prediction of genetic value of three traditional milk traits was investigated. Our proposed approach consists of three steps: First, milk metabolite profiles are used to predict three traditional milk traits of 1,305 Holstein cows. Two regression methods, both enabling variable selection, are applied to identify important milk metabolites in this step. Second, the prediction of these important milk metabolite from single nucleotide polymorphisms (SNPs) enables the detection of SNPs with significant genetic effects. Finally, these SNPs are used to predict milk traits. The observed precision of predicted genetic values was compared to the results observed for the classical genotype-phenotype prediction using all SNPs or a reduced SNP subset (reduced classical approach). To enable a comparison between SNP subsets, a special invariable evaluation design was implemented. SNPs close to or within known quantitative trait loci (QTL) were determined. This enabled us to determine if detected important SNP subsets were enriched in these regions. The results show that our approach can lead to genetic value prediction, but requires less than 1% of the total amount of (40,317) SNPs., significantly more important SNPs in known QTL regions were detected using our approach compared to the reduced classical approach. Concluding, our approach allows a deeper insight into the associations between the different levels of the genotype-phenotype map (genotype-metabolome, metabolome-phenotype, genotype-phenotype). PMID:23990900
van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul
2017-08-07
Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.
Taranto, F; D'Agostino, N; Greco, B; Cardi, T; Tripodi, P
2016-11-21
Knowledge on population structure and genetic diversity in vegetable crops is essential for association mapping studies and genomic selection. Genotyping by sequencing (GBS) represents an innovative method for large scale SNP detection and genotyping of genetic resources. Herein we used the GBS approach for the genome-wide identification of SNPs in a collection of Capsicum spp. accessions and for the assessment of the level of genetic diversity in a subset of 222 cultivated pepper (Capsicum annum) genotypes. GBS analysis generated a total of 7,568,894 master tags, of which 43.4% uniquely aligned to the reference genome CM334. A total of 108,591 SNP markers were identified, of which 105,184 were in C. annuum accessions. In order to explore the genetic diversity of C. annuum and to select a minimal core set representing most of the total genetic variation with minimum redundancy, a subset of 222 C. annuum accessions were analysed using 32,950 high quality SNPs. Based on Bayesian and Hierarchical clustering it was possible to divide the collection into three clusters. Cluster I had the majority of varieties and landraces mainly from Southern and Northern Italy, and from Eastern Europe, whereas clusters II and III comprised accessions of different geographical origins. Considering the genome-wide genetic variation among the accessions included in cluster I, a second round of Bayesian (K = 3) and Hierarchical (K = 2) clustering was performed. These analysis showed that genotypes were grouped not only based on geographical origin, but also on fruit-related features. GBS data has proven useful to assess the genetic diversity in a collection of C. annuum accessions. The high number of SNP markers, uniformly distributed on the 12 chromosomes, allowed the accessions to be distinguished according to geographical origin and fruit-related features. SNP markers and information on population structure developed in this study will undoubtedly support genome-wide association mapping studies and marker-assisted selection programs.
Construction of a versatile SNP array for pyramiding useful genes of rice.
Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki
2016-01-01
DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNP) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful allele are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of the haplotype-based selection by using the constructed SNP array was proofed. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Qin, Feng-qin; Yu, Li-hua; Hu, Wen-ting; Guo, Jian; Chen, Ning; Guo, Jiang; Fang, Jing-huan; He, Li
2015-07-01
To investigate the relationship between single nucleotide polymorphism (SNP) rs6007897 of CELSR1 and acute ischemic stroke in Western China Han population. All subjects (759 acute ischemic stroke patients and 786 controls) were genotyped using ligation detection reaction (LDR). We analyzed the differences between SNP rs6007897 genotypes and allele frequencies between two groups. Two genotypes (AA, AG) of rs6007897 were found in both stroke and control group. There was no statistically significance between two groups about genotype and allele frequency. After adjusting for risk factors, we found there was no significant association between rs6007897 and ischemic stroke CP = 0.797, odds ratio (OR) = 0.886, 95% confidence interval (CI) = 0.352-2.227). SNP rs6007897 of CELSR1 was not significantly associated with ischemic stroke in Western China Han population.
Rice SNP-seek database update: new SNPs, indels, and queries.
Mansueto, Locedie; Fuentes, Roven Rommel; Borja, Frances Nikki; Detras, Jeffery; Abriol-Santos, Juan Miguel; Chebotarov, Dmytro; Sanciangco, Millicent; Palis, Kevin; Copetti, Dario; Poliakov, Alexandre; Dubchak, Inna; Solovyev, Victor; Wing, Rod A; Hamilton, Ruaraidh Sackville; Mauleon, Ramil; McNally, Kenneth L; Alexandrov, Nickolai
2017-01-04
We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
High-resolution melting PCR analysis for rapid genotyping of Burkholderia mallei.
Girault, G; Wattiau, P; Saqib, M; Martin, B; Vorimore, F; Singha, H; Engelsma, M; Roest, H J; Spicic, S; Grunow, R; Vicari, N; De Keersmaecker, S C J; Roosens, N H C; Fabbi, M; Tripathi, B N; Zientara, S; Madani, N; Laroucau, K
2018-05-08
Burkholderia (B.) mallei is the causative agent of glanders. A previous work conducted on single-nucleotide polymorphisms (SNP) extracted from the whole genome sequences of 45 B. mallei isolates identified 3 lineages for this species. In this study, we designed a high-resolution melting (HRM) method for the screening of 15 phylogenetically informative SNPs within the genome of B. mallei that subtype the species into 3 lineages and 12 branches/sub-branches/groups. The present results demonstrate that SNP-based genotyping represent an interesting approach for the molecular epidemiology analysis of B. mallei. Copyright © 2018 Elsevier B.V. All rights reserved.
Soria, L A; Corva, P M; Branda Sica, A; Villarreal, E L; Melucci, L M; Mezzadra, C A; Papaleo Mazzucco, J; Fernández Macedo, G; Silvestro, C; Schor, A; Miquel, M C
2009-12-01
The PPARGC1A gene (peroxysome proliferator-activated receptor-gamma coactivator 1alpha gene) controls muscle fiber type and brown adipocyte differentiation; therefore, it is a candidate gene for beef quality traits (tenderness and fat content). Two SNPs (Single Nucleotide Polymorphisms) were identified within exon 8 by multiple alignment of DNA sequences obtained from 24 bulls: a transition G/A (SNP 1181) and a transversion A/T (SNP 1299). The SNP 1181 is a novel SNP, corresponding to a non-conservative substitution (AGT/AAT) that could be the cause of amino acid substitution ((364)Serine/(364)Asparagine). A Mismatch PCR method was designed to determine genotypes of 73 bulls and 268 steers for SNP 1181. Growth, slaughter and meat quality information were available for the group of steers. Allele A of SNP 1181 was not found in Angus. In 243 steers, no significant differences (P > 0.05) were found for either final live body weight, gain in backfat thickness in Spring, kidney fat weight, kidney fat percentage, Warner-Bratzler shear force at 7 days postmortem, intramuscular fat percentage or meat colour between genotype GG and AG. This SNP could be included in breed composition and population admixture analyses because there are marked differences in allelic frequencies between Bos taurus and Bos indicus breeds.
A combined long-range phasing and long haplotype imputation method to impute phase for SNP genotypes
2011-01-01
Background Knowing the phase of marker genotype data can be useful in genome-wide association studies, because it makes it possible to use analysis frameworks that account for identity by descent or parent of origin of alleles and it can lead to a large increase in data quantities via genotype or sequence imputation. Long-range phasing and haplotype library imputation constitute a fast and accurate method to impute phase for SNP data. Methods A long-range phasing and haplotype library imputation algorithm was developed. It combines information from surrogate parents and long haplotypes to resolve phase in a manner that is not dependent on the family structure of a dataset or on the presence of pedigree information. Results The algorithm performed well in both simulated and real livestock and human datasets in terms of both phasing accuracy and computation efficiency. The percentage of alleles that could be phased in both simulated and real datasets of varying size generally exceeded 98% while the percentage of alleles incorrectly phased in simulated data was generally less than 0.5%. The accuracy of phasing was affected by dataset size, with lower accuracy for dataset sizes less than 1000, but was not affected by effective population size, family data structure, presence or absence of pedigree information, and SNP density. The method was computationally fast. In comparison to a commonly used statistical method (fastPHASE), the current method made about 8% less phasing mistakes and ran about 26 times faster for a small dataset. For larger datasets, the differences in computational time are expected to be even greater. A computer program implementing these methods has been made available. Conclusions The algorithm and software developed in this study make feasible the routine phasing of high-density SNP chips in large datasets. PMID:21388557
Varanasi, Satya S.; Tuck, Stephen P.; Mastana, Sarabjit S.; Dennison, Elaine; Cooper, Cyrus; Vila, Josephine; Francis, Roger M.; Datta, Harish K.
2011-01-01
Introduction. The association of bone morphogenetic protein 2 (BMP2) with BMD and risk of fracture was suggested by a recent linkage study, but subsequent studies have been contradictory. We report the results of a study of the relationship between BMP2 genotypes and BMD, annual change in BMD, and risk of fracture in male subjects. Materials and Methods. We tested three single-nucleotide polymorphisms (SNPs) across the BMP2 gene, including Ser37Ala SNP, in 342 Caucasian Englishmen, comprising 224 control and 118 osteoporotic subjects. Results. BMP2 SNP1 (Ser37Ala) genotypes were found to have similar low frequency in control subjects and men with osteoporosis. The major informative polymorphism, BMP2 SNP3 (Arg190Ser), showed no statistically significant association with weight, height, BMD, change in BMD at hip or lumbar spine, and risk of fracture. Conclusion. There were no genotypic or haplotypic effects of the BMP2 candidate gene on BMD, change in BMD, or fracture risk identified in this cohort. PMID:22013543
Arnedo, Javier; Svrakic, Dragan M; Del Val, Coral; Romero-Zaliz, Rocío; Hernández-Cuervo, Helena; Fanous, Ayman H; Pato, Michele T; Pato, Carlos N; de Erausquin, Gabriel A; Cloninger, C Robert; Zwir, Igor
2015-02-01
The authors sought to demonstrate that schizophrenia is a heterogeneous group of heritable disorders caused by different genotypic networks that cause distinct clinical syndromes. In a large genome-wide association study of cases with schizophrenia and controls, the authors first identified sets of interacting single-nucleotide polymorphisms (SNPs) that cluster within particular individuals (SNP sets) regardless of clinical status. Second, they examined the risk of schizophrenia for each SNP set and tested replicability in two independent samples. Third, they identified genotypic networks composed of SNP sets sharing SNPs or subjects. Fourth, they identified sets of distinct clinical features that cluster in particular cases (phenotypic sets or clinical syndromes) without regard for their genetic background. Fifth, they tested whether SNP sets were associated with distinct phenotypic sets in a replicable manner across the three studies. The authors identified 42 SNP sets associated with a 70% or greater risk of schizophrenia, and confirmed 34 (81%) or more with similar high risk of schizophrenia in two independent samples. Seventeen networks of SNP sets did not share any SNP or subject. These disjoint genotypic networks were associated with distinct gene products and clinical syndromes (i.e., the schizophrenias) varying in symptoms and severity. Associations between genotypic networks and clinical syndromes were complex, showing multifinality and equifinality. The interactive networks explained the risk of schizophrenia more than the average effects of all SNPs (24%). Schizophrenia is a group of heritable disorders caused by a moderate number of separate genotypic networks associated with several distinct clinical syndromes.
Hayes, C. Nelson; Abe, Hiromi; Miki, Daiki; Ochi, Hidenori; Karino, Yoshiyasu; Toyota, Joji; Nakamura, Yusuke; Kamatani, Naoyuki; Sezaki, Hitomi; Kobayashi, Mariko; Akuta, Norio; Suzuki, Fumitaka; Kumada, Hiromitsu
2011-01-01
Background. Pegylated interferon, ribavirin, and telaprevir triple therapy is a new strategy expected to eradicate the hepatitis C virus (HCV) even in patients infected with difficult-to-treat genotype 1 strains, although adverse effects, such as anemia and rash, are frequent. Methods. We assessed efficacy and predictive factors for sustained virological response (SVR) for triple therapy in 94 Japanese patients with HCV genotype 1. We included recently identified predictive factors, such as IL28B and ITPA polymorphism, and substitutions in the HCV core and NS5A proteins. Results. Patients treated with triple therapy achieved comparatively high SVR rates (73%), especially among treatment-naive patients (80%). Of note, however, patients who experienced relapse during prior pegylated interferon plus ribavirin combination therapy were highly likely to achieve SVR while receiving triple therapy (93%); conversely, prior nonresponders were much less likely to respond to triple therapy (32%). In addition to prior treatment response, IL28B SNP genotype and rapid viral response were significant independent predictors for SVR. Patients with the anemia-susceptible ITPA SNP rs1127354 genotype typically required ribavirin dose reduction earlier than did patients with other genotypes. Conclusions. Analysis of predictive factors identified IL28B SNP, rapid viral response, and transient response to previous therapy as significant independent predictors of SVR after triple therapy. PMID:21628662
Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad
2016-03-01
The aim of the present study was to develop a new cost effective PCR based CAPS marker set using advantages of high-throughput SNP genotyping. Initially, SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping that resulted in 6334 polymorphic SNP markers. Principle component analysis using this marker data showed fine differentiation of barley diverse gene pool. Till this end, we developed 200 SNP derived CAPS markers distributed across the genome covering around 991cM with an average marker density of 5.09cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of single nuclear gene on chromosome 1H. These findings showed the proof of concept for the development and utility of a newer cost effective genomic tool kit to analyze broader genetic resources of barley worldwide. Copyright © 2016 Elsevier Inc. All rights reserved.
No association of IL-10 promoter SNP -592 and -1082 and SIDS.
Courts, Cornelius; Madea, Burkhard
2011-01-30
Sudden infant death syndrome (SIDS) constitutes a considerable percentage of infant death of unknown etiology. The genetically controlled pathway of cytokine mediated response to inflammation is presumed to play a role in SIDS. The A allele of SNP -592 of the promoter region of the anti-inflammatory cytokine IL-10 has been suggested to be associated with SIDS. Herein we investigated whether we could confirm this finding by SNP genotyping a series of 123 cases of SIDS and 406 control cases. We did not find a correlation between the A allele or an A allele containing genotype of IL-10 promoter SNP -592 and SIDS which is in contrast to previous studies. Also, in concordance with previous work, no association of the A allele or A allele containing genotypes of IL-10 promoter SNP -1082 and SIDS was found. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Huang, Chao-Wei; Lin, Yu-Tsung; Ding, Shih-Torng; Lo, Ling-Ling; Wang, Pei-Hwa; Lin, En-Chung; Liu, Fang-Wei; Lu, Yen-Wen
2015-01-01
The genetic markers associated with economic traits have been widely explored for animal breeding. Among these markers, single-nucleotide polymorphism (SNPs) are gradually becoming a prevalent and effective evaluation tool. Since SNPs only focus on the genetic sequences of interest, it thereby reduces the evaluation time and cost. Compared to traditional approaches, SNP genotyping techniques incorporate informative genetic background, improve the breeding prediction accuracy and acquiesce breeding quality on the farm. This article therefore reviews the typical procedures of animal breeding using SNPs and the current status of related techniques. The associated SNP information and genotyping techniques, including microarray and Lab-on-a-Chip based platforms, along with their potential are highlighted. Examples in pig and poultry with different SNP loci linked to high economic trait values are given. The recommendations for utilizing SNP genotyping in nimal breeding are summarized. PMID:27600241
He, Fei; Zhou, Wanjun; Cai, Ren; Yan, Tizhen; Xu, Xiangmin
2018-04-01
In this study, we aimed to assess the performance of two whole-genome amplification methods, multiple displacement amplification (MDA), and multiple annealing and looping-based amplification cycle (MALBAC), for β-thalassemia genotyping and single-nucleotide polymorphism (SNP)/copy-number variant (CNV) detection using two DNA sequencing assays. We collected peripheral blood, cell lines, and discarded embryos, and carried out MALBAC and MDA on single-cell and five-cell samples. We detected and statistically analyzed differences in the amplification efficiency, positive predictive value, sensitivity, allele dropout (ADO) rate, SNPs, and CV values between the two methods. Through Sanger sequencing at the single-cell and five-cell levels, we showed that both the amplification rate and ADO rate of MDA were better than those using MALBAC, and the sensitivity and positive predictive value obtained from MDA were higher than those from MALBAC for β-thalassemia genotyping. Using next-generation sequencing (NGS) at the single-cell level, we confirmed that MDA has better properties than MALBAC for SNP detection. However, MALBAC was more stable and homogeneous than MDA using low-depth NGS at the single-cell level for CNV detection. We conclude that MALBAC is the better option for CNV detection, while MDA is better suited for SNV detection.
Koskinen, Lotta; Romanos, Jihane; Kaukinen, Katri; Mustalahti, Kirsi; Korponay-Szabo, Ilma; Barisani, Donatella; Bardella, Maria Teresa; Ziberna, Fabiana; Vatta, Serena; Széles, György; Pocsai, Zsuzsa; Karell, Kati; Haimila, Katri; Adány, Róza; Not, Tarcisio; Ventura, Alessandro; Mäki, Markku; Partanen, Jukka; Wijmenga, Cisca; Saavalainen, Päivi
2009-04-01
Human leukocyte antigen (HLA) genes, located on chromosome 6p21.3, have a crucial role in susceptibility to various autoimmune and inflammatory diseases, such as celiac disease and type 1 diabetes. Certain HLA heterodimers, namely DQ2 (encoded by the DQA1*05 and DQB1*02 alleles) and DQ8 (DQA1*03 and DQB1*0302), are necessary for the development of celiac disease. Traditional genotyping of HLA genes is laborious, time-consuming, and expensive. A novel HLA-genotyping method, using six HLA-tagging single-nucleotide polymorphisms (SNPs) and suitable for high-throughput approaches, was described recently. Our aim was to validate this method in the Finnish, Hungarian, and Italian populations. The six previously reported HLA-tagging SNPs were genotyped in patients with celiac disease and in healthy individuals from Finland, Hungary, and two distinct regions of Italy. The potential of this method was evaluated in analyzing how well the tag SNP results correlate with the HLA genotypes previously determined using traditional HLA-typing methods. Using the tagging SNP method, it is possible to determine the celiac disease risk haplotypes accurately in Finnish, Hungarian, and Italian populations, with specificity and sensitivity ranging from 95% to 100%. In addition, it predicts homozygosity and heterozygosity for a risk haplotype, allowing studies on genotypic risk effects. The method is transferable between populations and therefore suited for large-scale research studies and screening of celiac disease among high-risk individuals or at the population level.
Jiang, Y; Palizhati, Abudoureyimu; Gao, X Y; Guan, S Z; Liu, J W
2016-10-20
Objective: To investigate the association between 5-hydroxytryptamine 2A (5-HT2A) receptor gene polymorphisms and occupational stress in oilfield workers. Methods: Cluster sampling was used to select 826 oilfield workers from January to August, 2013. The SNaPshot single nucleotide polymorphism (SNP) genotyping method was used to determine the genotypes of rs6313, rs1923884, and rs2070040 in 5-HT2A receptor gene, and the Occupational Stress Inventory-Revised Edition was used to analyze occupational stress in these workers. Results: There were no significant differences in occupational stress between groups with different individual characteristics ( P >0.05 ) . As for the comparison of occupational stress scores between workers with different genotypes of each SNP of 5-HT2A receptor gene, the workers with CC and CT genotypes of rs6313 had significantly higher role boundary scores than those with TT genotype ( P <0.05) , and the workers with CC genotype had a significantly higher vocational stress score than those with CT genotype ( P <0.05) ; the workers with CT genotype of rs1923884 had a significantly higher occupational role score than those with CC genotype ( P <0.05) and a significantly higher coping resources score than those with CC and TT genotypes ( P <0.05) ; the workers with AG genotype of rs2070040 had a significantly higher vocational stress score than those with AA genotype ( P <0.05) . The ordinal multinomial logistic regression analysis showed that workers with CT genotype of rs1923884 were susceptible to occupational stress ( OR =1.56, 95% CI 1.10~2.20) . Conclusion: CT genotype of rs1923884 in 5-HT2A receptor gene may be associated with the susceptibility to occupational stress in oilfield workers.
USDA-ARS?s Scientific Manuscript database
In this study, we aimed to (1) predict genomic estimated breeding value (GEBV) for bacterial cold water disease (BCWD) resistance by genotyping training (n=583) and validation samples (n=53) with two genotyping platforms (24K RAD-SNP and 49K SNP) and using different genomic selection (GS) models (Ba...
KinSNP software for homozygosity mapping of disease genes using SNP microarrays.
Amir, El-Ad David; Bartal, Ofer; Morad, Efrat; Nagar, Tal; Sheynin, Jony; Parvari, Ruti; Chalifa-Caspi, Vered
2010-08-01
Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from.
Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong
2016-01-01
Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms. PMID:28028455
Mei, C G; Gui, L S; Fu, C Z; Wang, H C; Wang, J L; Cheng, G; Zan, L S
2015-08-07
Previous studies have shown that the cell death-inducing DFF45-like effector-C (CIDEC) gene is involved in lipid storage and energy metabolism, suggesting that it is a potential candidate gene that affects body measurement traits (BMTs) and meat quality traits (MQTs). The aim of this study was to identify polymorphisms of the bovine CIDEC gene and analyze their possible associations with BMTs and MQTs in 531 randomly selected Qinchuan cattle aged between 18 and 24 months. DNA sequencing and polymerase chain reaction-restriction fragment length polymorphism were employed to detect CIDEC single nucleotide polymorphisms (SNPs). We found five SNPs: two in exon 5 (SNP1, g.9815G>A and SNP2, g.9924C>T) and three in the 3'-untranslated region (SNP3, g.13281C>T; SNP4, g.13297A>G; and SNP5, g.13307G>A). SNP1 was a missense mutation that resulted in an arginine to glutamine amino acid change, and exhibited two genotypes (GG and AG). SNP2 was a synonymous mutation that exhibited three genotypes (CC, CT, and TT). SNP3, 4, and 5 were completely linked, and only exhibited two genotypes (CC-AA-GG and CT-AG-GA). We found significant associations between these polymorphisms and BMTs and MQTs (P < 0.05); GG, CT, and CT-AG-GA appeared to be the most beneficial genotypes. Therefore, CIDEC may affect BMTs and MQTs in Qinchuan cattle, and could be used in marker-assisted selection.
Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes
Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Ángel
2009-01-01
Background Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. Results To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. Conclusion The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest. PMID:19344481
Case-control study of eczema associated with IL13 genetic polymorphisms in Japanese children.
Miyake, Yoshihiro; Kiyohara, Chikako; Koyanagi, Midori; Fujimoto, Takahiro; Shirasawa, Senji; Tanaka, Keiko; Sasaki, Satoshi; Hirota, Yoshio
2011-01-01
Several association studies have investigated the relationships between single nucleotide polymorphisms (SNPs) in the IL13 gene and eczema, with inconsistent results. We conducted a case-control study of the relationship between the polymorphisms of rs1800925 and rs20541 and the risk of eczema in Japanese children aged 3 years. Included were the 209 cases identified based on criteria of the International Study of Asthma and Allergies in Childhood (ISAAC). Controls were 451 children without eczema based on ISAAC questions who had not been diagnosed by a physician as having asthma or atopic eczema. The minor TT genotype of the rs1800925 SNP and the minor AA genotype of the rs20541 SNP were significantly related to an increased risk of eczema: adjusted odds ratio for the TT genotype was 2.78 (95% confidence interval 1.22-6.30) and that for the AA genotype was 2.38 (95% confidence interval 1.35-4.18). Haplotype analyses showed a protective association between the CG haplotype and eczema, whereas the TA haplotype was positively related to the risk of eczema. Perinatal smoking exposure did not interact with genotypes of the IL13 gene in the etiology of eczema. The significant association of the rs20541 SNP with eczema essentially disappeared after additional adjustment for the rs1800925 SNP, whereas a relationship with the rs1800925 SNP remained significant. A common genetic variation in the IL13 gene at the levels of both single SNPs and haplotypes was associated with eczema. However, the significant association with the rs20541 SNP might be ascribed to the rs1800925 SNP. Copyright © 2010 S. Karger AG, Basel.
Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes.
Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Angel
2009-03-19
Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest.
Weigel, K A; de los Campos, G; González-Recio, O; Naya, H; Wu, X L; Long, N; Rosa, G J M; Gianola, D
2009-10-01
The objective of the present study was to assess the predictive ability of subsets of single nucleotide polymorphism (SNP) markers for development of low-cost, low-density genotyping assays in dairy cattle. Dense SNP genotypes of 4,703 Holstein bulls were provided by the USDA Agricultural Research Service. A subset of 3,305 bulls born from 1952 to 1998 was used to fit various models (training set), and a subset of 1,398 bulls born from 1999 to 2002 was used to evaluate their predictive ability (testing set). After editing, data included genotypes for 32,518 SNP and August 2003 and April 2008 predicted transmitting abilities (PTA) for lifetime net merit (LNM$), the latter resulting from progeny testing. The Bayesian least absolute shrinkage and selection operator method was used to regress August 2003 PTA on marker covariates in the training set to arrive at estimates of marker effects and direct genomic PTA. The coefficient of determination (R(2)) from regressing the April 2008 progeny test PTA of bulls in the testing set on their August 2003 direct genomic PTA was 0.375. Subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP were created by choosing equally spaced and highly ranked SNP, with the latter based on the absolute value of their estimated effects obtained from the training set. The SNP effects were re-estimated from the training set for each subset of SNP, and the 2008 progeny test PTA of bulls in the testing set were regressed on corresponding direct genomic PTA. The R(2) values for subsets of 300, 500, 750, 1,000, 1,250, 1,500, and 2,000 SNP with largest effects (evenly spaced SNP) were 0.184 (0.064), 0.236 (0.111), 0.269 (0.190), 0.289 (0.179), 0.307 (0.228), 0.313 (0.268), and 0.322 (0.291), respectively. These results indicate that a low-density assay comprising selected SNP could be a cost-effective alternative for selection decisions and that significant gains in predictive ability may be achieved by increasing the number of SNP allocated to such an assay from 300 or fewer to 1,000 or more.
A 48 SNP set for grapevine cultivar identification
2011-01-01
Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP markers are bi-allelic, allele identification and genotype naming are extremely simple and genotypes obtained with different equipments and by different laboratories are always fully comparable. PMID:22060012
Reverter, A; Porto-Neto, L R; Fortes, M R S; McCulloch, R; Lyons, R E; Moore, S; Nicol, D; Henshall, J; Lehnert, S A
2016-10-01
We introduce an innovative approach to lowering the overall cost of obtaining genomic EBV (GEBV) and encourage their use in commercial extensive herds of Brahman beef cattle. In our approach, the DNA genotyping of cow herds from 2 independent properties was performed using a high-density bovine SNP chip on DNA from pooled blood samples, grouped according to the result of a pregnancy test following their first and second joining opportunities. For the DNA pooling strategy, 15 to 28 blood samples from the same phenotype and contemporary group were allocated to pools. Across the 2 properties, a total of 183 pools were created representing 4,164 cows. In addition, blood samples from 309 bulls from the same properties were also taken. After genotyping and quality control, 74,584 remaining SNP were used for analyses. Pools and individual DNA samples were related by means of a "hybrid" genomic relationship matrix. The pooled genotyping analysis of 2 large and independent commercial populations of tropical beef cattle was able to recover significant and plausible associations between SNP and pregnancy test outcome. We discuss 24 SNP with significant association ( < 1.0 × 10) and mapped within 40 kb of an annotated gene. We have established a method to estimate the GEBV in young herd bulls for a trait that is currently unable to be predicted at all. In summary, our novel approach allowed us to conduct genomic analyses of fertility in 2 large commercial Brahman herds managed under extensive pastoral conditions.
Linkage disequilibrium among commonly genotyped SNP and variants detected from bull sequence
USDA-ARS?s Scientific Manuscript database
Genomic prediction utilizing causal variants could increase selection accuracy above that achieved with SNP genotyped by commercial assays. A number of variants detected from sequencing influential sires are likely to be causal, but noticable improvements in prediction accuracy using imputed sequen...
Phetsuksiri, Benjawan; Srisungngam, Sopa; Rudeeaneksin, Janisara; Bunchoo, Supranee; Lukebua, Atchariya; Wongtrungkapun, Ruch; Paitoon, Soontara; Sakamuri, Rama Murthy; Brennan, Patrick J; Vissa, Varalakshmi
2012-01-01
Based on the discovery of three single nucleotide polymorphisms (SNPs) in Mycobacterium leprae, it has been previously reported that there are four major SNP types associated with different geographic regions around the world. Another typing system for global differentiation of M. leprae is the analysis of the variable number of short tandem repeats within the rpoT gene. To expand the analysis of geographic distribution of M. leprae, classified by SNP and rpoT gene polymorphisms, we studied 85 clinical isolates from Thai patients and compared the findings with those reported from Asian isolates. SNP genotyping by PCR amplification and sequencing revealed that all strains like those in Myanmar were SNP type 1 and 3, with the former being predominant, while in Japan, Korea, and Indonesia, the SNP type 3 was found to be more frequent. The pattern of M. leprae distribution in Thailand and Myanmar is quite similar, except that SNP type 2 was not found in Thailand. In addition, the 3-copy hexamer genotype in the rpoT gene is shared among the isolates from these two neighboring countries. On the basis of these two markers, we postulate that M. leprae in leprosy patients from Myanmar and Thailand has a common historical origin. Further differentiation among Thai isolates was possible by assessing copy numbers of the TTC sequence, a more polymorphic microsatellite locus.
Comparing CNV detection methods for SNP arrays.
Winchester, Laura; Yau, Christopher; Ragoussis, Jiannis
2009-09-01
Data from whole genome association studies can now be used for dual purposes, genotyping and copy number detection. In this review we discuss some of the methods for using SNP data to detect copy number events. We examine a number of algorithms designed to detect copy number changes through the use of signal-intensity data and consider methods to evaluate the changes found. We describe the use of several statistical models in copy number detection in germline samples. We also present a comparison of data using these methods to assess accuracy of prediction and detection of changes in copy number.
High-throughput SNP-genotyping analysis of the relationships among Ponto-Caspian sturgeon species
Rastorguev, Sergey M; Nedoluzhko, Artem V; Mazur, Alexander M; Gruzdeva, Natalia M; Volkov, Alexander A; Barmintseva, Anna E; Mugue, Nikolai S; Prokhortchouk, Egor B
2013-01-01
Abstract Legally certified sturgeon fisheries require population protection and conservation methods, including DNA tests to identify the source of valuable sturgeon roe. However, the available genetic data are insufficient to distinguish between different sturgeon populations, and are even unable to distinguish between some species. We performed high-throughput single-nucleotide polymorphism (SNP)-genotyping analysis on different populations of Russian (Acipenser gueldenstaedtii), Persian (A. persicus), and Siberian (A. baerii) sturgeon species from the Caspian Sea region (Volga and Ural Rivers), the Azov Sea, and two Siberian rivers. We found that Russian sturgeons from the Volga and Ural Rivers were essentially indistinguishable, but they differed from Russian sturgeons in the Azov Sea, and from Persian and Siberian sturgeons. We identified eight SNPs that were sufficient to distinguish these sturgeon populations with 80% confidence, and allowed the development of markers to distinguish sturgeon species. Finally, on the basis of our SNP data, we propose that the A. baerii-like mitochondrial DNA found in some Russian sturgeons from the Caspian Sea arose via an introgression event during the Pleistocene glaciation. In the present study, the high-throughput genotyping analysis of several sturgeon populations was performed. SNP markers for species identification were defined. The possible explanation of the baerii-like mitotype presence in some Russian sturgeons in the Caspian Sea was suggested. PMID:24567827
Klaften, Matthias; Hrabé de Angelis, Martin
2005-07-01
Genome-wide mapping in the identification of novel candidate genes has always been the standard method in genetics and genomics to correlate a clinically interesting phenotypic trait with a genotype. However, the performance of a mapping experiment using classical microsatellite approaches can be very time consuming. The high-throughput analysis of single-nucleotide polymorphisms (SNPs) has the potential of being the successor of microsatellite analysis routinely used for these mapping approaches, where one of the major obstacles is the design of the appropriate SNP marker set itself. Here we report on ARTS, an advanced retrieval tool for SNPs, which allows researchers to comb freely the public mouse dbSNP database for multiple reference and test strains. Several filters can be applied in order to improve the sensitivity and the specificity of the search results. By employing the panel generator function of this program, it is possible to abbreviate the extraction of reliable sequence data for a large marker panel including several different mouse strains from days to minutes. The concept of ARTS is easily adaptable to other species for which SNP databases are available, making it a versatile tool for the use of SNPs as markers for genotyping. The web interface is accessible at http://andromeda.gsf.de/arts.
Epistasis between polymorphisms in PCSK1 and DBH is associated with premature ovarian failure.
Pyun, Jung-A; Kim, Sunshin; Cha, Dong Hyun; Kwack, KyuBum
2014-11-01
This study examined whether epistasis between single nucleotide polymorphisms (SNPs) within proprotein convertase subtilisin/kexin type 1 (PCSK1) and dopamine β-hydroxylase (DBH) genes is associated with premature ovarian failure (POF). One hundred twenty women with POF and 222 female controls were recruited for this study. To genotype SNPs within PCSK1 and DBH, we used a GoldenGate assay with VeraCode technology, which uses an allele-specific primer extension method. Two SNPs (rs155979 and rs3762986) within PCSK1 and one SNP (rs1611114) within DBH, which were located in the 5' flanking region, were involved in synergistic interactions. The C allele in the rs155979 SNP showed an increased risk of POF in a dominant model when AA genotype in the rs1611114 SNP was present (odds ratio, 3.60; 95% CI, 1.82-7.14; P = 0.00024), whereas the G allele in the rs1611114 SNP showed a reduced risk of POF in a dominant model when at least one C allele at the rs155979 SNP was present (odds ratio, 0.24; 95% CI, 0.11-0.51; P = 0.00018) or one G allele at the rs3762986 SNP was present (odds ratio, 0.33; 95% CI, 0.19-0.60; P = 0.00023). Epistases between SNPs within PCSK1 and DBH genes are significantly associated with susceptibility or resistance to POF.
Zhang, Yang; Zhu, Zhen; Xu, Qi; Chen, Guohong
2014-01-07
Primers based on the cDNA sequence of the goose growth hormone (GH) gene in GenBank were designed to amplify exon 2 of the GH gene in Huoyan goose. A total of 552 individuals were brooded in one batch and raised in Liaoning and Jiangsu Provinces, China. Single nucleotide polymorphisms (SNPs) of exon 2 in the GH gene were detected by the polymerase chain reaction (single strand conformation polymorphism method). Homozygotes were subsequently cloned, sequenced and analyzed. Two SNP mutations were detected, and 10 genotypes (referred to as AA, BB, CC, DD, AB, AC, AD, BC, BD and CD) were obtained. Allele D was predominant, and the frequencies of the 10 genotypes fit the Hardy-Weinberg equilibrium in the male, female and whole populations according to the chi-square test. Based on SNP types, the 10 genotypes were combined into three main genotypes. Multiple comparisons were carried out between different genotypes and production traits when the geese were 10 weeks old. Some indices of production performance were significantly (p < 0.05) associated with the genotype. Particularly, geese with genotype AB or BB were highly productive. Thus, these genotypes may serve as selection markers for production traits in Huoyan geese.
Bangera, Rama; Correa, Katharina; Lhorente, Jean P; Figueroa, René; Yáñez, José M
2017-01-31
Salmon Rickettsial Syndrome (SRS) caused by Piscirickettsia salmonis is a major disease affecting the Chilean salmon industry. Genomic selection (GS) is a method wherein genome-wide markers and phenotype information of full-sibs are used to predict genomic EBV (GEBV) of selection candidates and is expected to have increased accuracy and response to selection over traditional pedigree based Best Linear Unbiased Prediction (PBLUP). Widely used GS methods such as genomic BLUP (GBLUP), SNPBLUP, Bayes C and Bayesian Lasso may perform differently with respect to accuracy of GEBV prediction. Our aim was to compare the accuracy, in terms of reliability of genome-enabled prediction, from different GS methods with PBLUP for resistance to SRS in an Atlantic salmon breeding program. Number of days to death (DAYS), binary survival status (STATUS) phenotypes, and 50 K SNP array genotypes were obtained from 2601 smolts challenged with P. salmonis. The reliability of different GS methods at different SNP densities with and without pedigree were compared to PBLUP using a five-fold cross validation scheme. Heritability estimated from GS methods was significantly higher than PBLUP. Pearson's correlation between predicted GEBV from PBLUP and GS models ranged from 0.79 to 0.91 and 0.79-0.95 for DAYS and STATUS, respectively. The relative increase in reliability from different GS methods for DAYS and STATUS with 50 K SNP ranged from 8 to 25% and 27-30%, respectively. All GS methods outperformed PBLUP at all marker densities. DAYS and STATUS showed superior reliability over PBLUP even at the lowest marker density of 3 K and 500 SNP, respectively. 20 K SNP showed close to maximal reliability for both traits with little improvement using higher densities. These results indicate that genomic predictions can accelerate genetic progress for SRS resistance in Atlantic salmon and implementation of this approach will contribute to the control of SRS in Chile. We recommend GBLUP for routine GS evaluation because this method is computationally faster and the results are very similar with other GS methods. The use of lower density SNP or the combination of low density SNP and an imputation strategy may help to reduce genotyping costs without compromising gain in reliability.
Safa, Ahmad Hosseini; Harandi, Majid Fasihi; Tajaddini, Mohammadhasan; Rostami-Nejad, Mohammad; Mohtashami-Pour, Mehdi; Pestehchian, Nader
2016-07-22
High-resolution melting (HRM) is a reliable and sensitive scanning method to detect variation in DNA sequences. We used this method to better understand the epidemiology and transmission of Echinococcus granulosus. We tested the use of HRM to discriminate the genotypes of E. granulosus and E. canadensis. One hundred forty-one hydatid cysts were collected from slaughtered animals in different parts of Isfahan-Iran in 2013. After DNA extraction, the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene was amplified using PCR coupled with the HRM curve. The result of HRM analysis using partial the sequences of cox1 gene revealed that 93, 35, and 2 isolates were identified as G1, G3, and G6 genotypes, respectively. A single nucleotide polymorphism (SNP) was found in locus 9867 of the cox1 gene. This is a critical locus for the differentiation between the G6 and G7 genotypes. In the phylogenic tree, the sample with a SNP was located between the G6 and G7 genotypes, which suggest that this isolate has a G6/G7 genotype. The HRM analysis developed in the present study provides a powerful technique for molecular and epidemiological studies on echinococcosis in humans and animals.
Ghaffari, Mohammad Ali; Askari Sede, Saeed; Rashtchizadeh, Nadereh; Mohammadzadeh, Ghorban; Majidi, Shahla
2014-01-01
Introduction: We evaluated the association between four polymorphisms in the CRP gene with serum C-reactive protein (CRP) levels, prevalence and severity of coronary artery disease (CAD) in type 2 diabetes mellitus (T2DM) patients. Methods: We performed coronary angiography for 308 T2DM patients and classified them into two groups: T2DM with CAD and T2DM without CAD. All patients were from Ahvaz, Iran. serum levels of CRP, glucose and lipid profile were measured. Genotyping was performed by PCR/RFLP, and the severity of coronary artery disease was determined by Gensini score. Results: The GG genotype of SNP rs279421 was associated with the increased risk of CAD (OR= 2.38; 95% CI: 1.12- 5.8; p= 0.02) and CA, TT, TA genotypes and A allele of SNP rs3091244 and GA genotypes and A allele of SNP rs3093062 were significantly associated with increased CRP levels. None of genotypes or alleles was associated with Gensini score. We found that the haplotype 7 (AGCG) was associated with decreased risk of CAD (OR= 0.11; 95% CI: 0.02, 0.66; p= 0.017) and the Gensini score was correlated with increased levels of CRP, only in CAD group. Conclusion: Although genetic polymorphisms were influenced on serum RP levels, none of the alleles and genotypes raising or falling C-reactive protein levels was consistently associated with an increased prevalence of CAD or protected from that. PMID:25337466
2013-01-01
Background Efficient screening of bacterial artificial chromosome (BAC) libraries with polymerase chain reaction (PCR)-based markers is feasible provided that a multidimensional pooling strategy is implemented. Single nucleotide polymorphisms (SNPs) can be screened in multiplexed format, therefore this marker type lends itself particularly well for medium- to high-throughput applications. Combining the power of multiplex-PCR assays with a multidimensional pooling system may prove to be especially challenging in a polyploid genome. In polyploid genomes two classes of SNPs need to be distinguished, polymorphisms between accessions (intragenomic SNPs) and those differentiating between homoeologous genomes (intergenomic SNPs). We have assessed whether the highly parallel Illumina GoldenGate® Genotyping Assay is suitable for the screening of a BAC library of the polyploid Brassica napus genome. Results A multidimensional screening platform was developed for a Brassica napus BAC library which is composed of almost 83,000 clones. Intragenomic and intergenomic SNPs were included in Illumina’s GoldenGate® Genotyping Assay and both SNP classes were used successfully for screening of the multidimensional BAC pools of the Brassica napus library. An optimized scoring method is proposed which is especially valuable for SNP calling of intergenomic SNPs. Validation of the genotyping results by independent methods revealed a success of approximately 80% for the multiplex PCR-based screening regardless of whether intra- or intergenomic SNPs were evaluated. Conclusions Illumina’s GoldenGate® Genotyping Assay can be efficiently used for screening of multidimensional Brassica napus BAC pools. SNP calling was specifically tailored for the evaluation of BAC pool screening data. The developed scoring method can be implemented independently of plant reference samples. It is demonstrated that intergenomic SNPs represent a powerful tool for BAC library screening of a polyploid genome. PMID:24010766
Kato, Hideaki; Ohata, Aya; Samukawa, Sei; Ueda, Atsuhisa; Ishigatsubo, Yoshiaki
2016-04-01
To investigate the association between single nucleotide polymorphisms (SNPs) in the adiponectin-encoding gene ADIPOQ and changes in serum lipid levels in HIV-1-infected patients after antiretroviral therapy (ART). ART-naïve HIV-1-infected patients were recruited to this prospective analysis. SNP +45 and SNP +276 genotype was determined by direct sequencing. Multivariate linear regression analysis was performed to analyse the effects of genotype, and predisposing conditions on serum total cholesterol and triglyceride in the 4 months before and after ART initiation. The study enrolled 78 patients with HIV-1-infection (73 male, five female; age range 22-67 years). HIV-1 viral load ≥5 log10 copies/ml, baseline total cholesterol ≥160 mg/dl, and CD4(+) lymphocyte count <200/µl were associated with increased serum total cholesterol levels after ART initiation. Protease inhibitor treatment and body mass index ≥25 kg/m(2) were associated with increased triglyceride levels after ART initiation. There were no significant associations between SNP +45 or SNP +276 genotype and serum total cholesterol or triglyceride levels. SNP +45 and SNP +276 genotype is not associated with changes in serum total cholesterol or triglyceride levels after ART initiation. © The Author(s) 2016.
Castro-Martínez, Anna Gabriela; Sánchez-Corona, José; Vázquez-Vargas, Adriana Patricia; García-Zapién, Alejandra Guadalupe; López-Quintero, Andres; Villalpando-Velazco, Héctor Javier; Flores-Martínez, Silvia Esperanza
2018-02-28
Gestational diabetes mellitus (GDM) is a metabolically complex disease with major genetic determinants. GDM has been associated with insulin resistance and dysfunction of pancreatic beta cells, so the GDM candidate genes are those that encode proteins modulating the function and secretion of insulin, such as that for calpain 10 (CAPN10). This study aimed to assess whether single nucleotide polymorphism (SNP)-43, SNP-44, SNP-63, and the indel-19 variant, and specific haplotypes of the CAPN10 gene were associated with gestational diabetes mellitus. We studied 116 patients with gestational diabetes mellitus and 83 women with normal glucose tolerance. Measurements of anthropometric and biochemical parameters were performed. SNP-43, SNP-44, and SNP-63 were identified by polymerase chain reaction (PCR)-restriction fragment length polymorphisms, while the indel-19 variant was detected by TaqMan qPCR assays. The allele, genotype, and haplotype frequencies of the four variants did not differ significantly between women with gestational diabetes mellitus and controls. However, in women with gestational diabetes mellitus, glucose levels were significantly higher bearing the 3R/3R genotype than in carriers of the 3R/2R genotype of the indel-19 variant (p = 0.006). In conclusion, the 3R/3R genotype of the indel-19 variant of the CAPN-10 gene influenced increased glucose levels in these Mexican women with gestational diabetes mellitus.
[Phenotype-genotype correlation analysis of 12 cases with Angelman/Prader-Willi syndrome].
Chen, Chen; Peng, Ying; Xia, Yan; Li, Haoxian; Zhu, Huimin; Pan, Qian; Yin, Fei; Wu, Lingqian
2014-12-01
To investigate the genotype-phenotype correlation in patients with Angelman syndrome/Prader-Willi syndrome (AS/PWS) and assess the application value of high-resolution single nucleotide polymorphism microarrays (SNP array) for such diseases. Twelve AS/PWS patients were diagnosed through SNP array, fluorescence in situ hybridization (FISH) and karyotype analysis. Clinical characteristics were analyzed. Deletions ranging from 4.8 Mb to 7.0 Mb on chromosome 15q11.2-13 were detected in 11 patients. Uniparental disomy (UPD) was detected in only 1 patient. Patients with deletions could be divided into 2 groups, including 7 cases with class I and 4 with class II. The two groups however had no significant phenotypic difference. The UPD patient had relatively better development and language ability. Deletions of 6 patients were confirmed by FISH to be of de novo in origin. The risk to their sibs was determined to be less than 1%. The phenotypic differences between AS/PWS patients with class I and class II deletion need to be further studied. SNP array is useful in detecting and distinguishing of patients with deletion or UPD. This method may be applied for studying the genotype-phenotype association and the mechanism underlying AS/PWS.
Brown, C M; Rea, T J; Hamon, S C; Hixson, J E; Boerwinkle, E; Clark, A G; Sing, C F
2006-07-01
Apolipoproteins (apo) A-I and C-III are components of high-density lipoprotein-cholesterol (HDL-C), a quantitative trait negatively correlated with risk of cardiovascular disease (CVD). We analyzed the contribution of individual and pairwise combinations of single nucleotide polymorphisms (SNPs) in the APOA1/APOC3 genes to HDL-C variability to evaluate (1) consistency of published single-SNP studies with our single-SNP analyses; (2) consistency of single-SNP and two-SNP phenotype-genotype relationships across race-, gender-, and geographical location-dependent contexts; and (3) the contribution of single SNPs and pairs of SNPs to variability beyond that explained by plasma apo A-I concentration. We analyzed 45 SNPs in 3,831 young African-American (N=1,858) and European-American (N=1,973) females and males ascertained by the Coronary Artery Risk Development in Young Adults (CARDIA) study. We found three SNPs that significantly impact HDL-C variability in both the literature and the CARDIA sample. Single-SNP analyses identified only one of five significant HDL-C SNP genotype relationships in the CARDIA study that was consistent across all race-, gender-, and geographical location-dependent contexts. The other four were consistent across geographical locations for a particular race-gender context. The portion of total phenotypic variance explained by single-SNP genotypes and genotypes defined by pairs of SNPs was less than 3%, an amount that is miniscule compared to the contribution explained by variability in plasma apo A-I concentration. Our findings illustrate the impact of context-dependence on SNP selection for prediction of CVD risk factor variability.
Håkansson, Anna; Westberg, Lars; Nilsson, Staffan; Buervenich, Silvia; Carmine, Andrea; Holmberg, Björn; Sydow, Olof; Olson, Lars; Johnels, Bo; Eriksson, Elias; Nissbrandt, Hans
2005-02-05
The multifunctional cytokine interleukin-6 (IL-6) is involved in inflammatory processes in the central nervous system and increased levels of IL-6 have been found in patients with Parkinson's disease (PD). It is known that estrogen inhibits the production of IL-6, via action on estrogen receptors, thereby pointing to an important influence of estrogen on IL-6. In a previous study, we reported an association between a G/A single nucleotide polymorphism (SNP) at position 1730 in the gene coding for estrogen receptor beta (ERbeta) and age of onset of PD. To investigate the influence of a G/C SNP at position 174 in the promoter of the IL-6 gene, and the possible interaction of this SNP and the ERbeta G-1730A SNP on the risk for PD, the G-174C SNP was genotyped, by pyrosequencing, in 258 patients with PD and 308 controls. A significantly elevated frequency of the GG genotype of the IL-6 SNP was found in the patient group and this was most obvious among patients with an early age of onset (=50 years) of PD. When the GG genotypes of the IL-6 and ERbeta SNPs were combined, the combination was much more robustly associated with PD, and especially with PD with an early age of onset, than respective GG genotype when analyzed separately. Our results indicate that the G-174C SNP in the IL-6 promoter may influence the risk for developing PD, particularly regarding early age of onset PD, and that the effect is modified by interaction of the G-1730A SNP in the ERbeta gene. (c) 2004 Wiley-Liss, Inc.
Su, Pen-Hua; Yang, Shun-Fa; Yu, Ju-Shan; Chen, Suh-Jen; Chen, Jia-Yuh
2012-12-01
We hypothesized that responses to growth hormone (GH) therapy by idiopathic short stature (ISS) and growth hormone deficiency (GHD) patients were associated with single nucleotide polymorphisms (SNPs) in the leptin (LEP) and leptin receptor (LEPR) genes. We retrospectively enrolled ISS (n = 32) and GHD (n = 38) patients and forty healthy age-and gender-matched children. They were genotyped for the LEP promoter at nt.-2548, and LEPR K109R and LEPR Q223R polymorphisms. Clinical and laboratory variables were determined before and after 2 years of GH treatment. ISS patients with G/A or A/A genotypes of the LEPR Q223R SNP had a significantly higher height velocity (cm/y) than ISS patients with the G/G genotype at 2 years after GH treatment. For GHD patients, G/A or A/A genotype of the LEPR K109R SNP was associated with higher body weight, higher BMI, and higher weight velocity than patients with the G/G genotype before GH treatment, but not after GH treatment. G/A or A/A genotype of the LEPR Q223R SNP was associated with a significantly higher body weight, higher height velocity before treatment, but not after GH treatment. G/A or A/A genotype of the LEPR Q223R SNP was associated with a significantly higher weight velocity before treatment, but a significantly lower weight velocity was found at 2 years after GH treatment. These results suggest LEPR Q223R SNP (rs1137101) is associated with outcomes of GH replacement therapy in ISS and GHD patients. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
An improved consensus linkage map of barley based on flow-sorted chromosomes and SNP markers
USDA-ARS?s Scientific Manuscript database
Recent advances in high-throughput genotyping have made it easier to combine information from different mapping populations into consensus genetic maps, which provide increased marker density and genome coverage compared to individual maps. Previously, a SNP-based genotyping platform was developed a...
Teh, L K; Lee, W L; Amir, J; Salleh, M Z; Ismail, R
2007-06-01
P-glycoprotein (PgP) is the most extensively studied ATP-binding cassette (ABC) coded by MDR1 gene. To date, 29 single nucleotide polymorphisms (SNPs) have been identified; but only SNP C3435T has been correlated with intestinal PgP expression levels and shown to influence the absorption of orally taken drugs that are PgP substrates. Individuals homozygous for the T allele have more than fourfold lower PgP expression compared with C/C individuals. We developed a one step primer based allele specific PCR method to detect SNP at C3435T to investigate the distribution of this genotype in the local population. DNA was extracted from 5 mL of whole blood using standard salting-out method. Primers were designed specific to 3' end which amplify the variants of C3435T. The method was validated by direct DNA sequencing. Seven hundred and sixty-three healthy blood donors comprising of three major ethnic groups in Malaysia were recruited and DNA subjected to genotyping of C3435T using this method. The method was found to be robust and reproducible in detecting SNP of C3435T. Interethnic variations in genotype and allele frequency were observed in PgP among the ethnic groups. In comparison to both the Caucasians and the other Asian countries, the Malay and Chinese showed a higher frequency of allele C (50-60%); while the Indian exhibits a lower frequency (40%), similar to other Indian populations. Using a new simple method to investigate the distribution of C3435T, we found that the allele frequency of MDR1 showed variablity between the different ethnic groups within the Malaysian population.
Brorsson, C.; Hansen, N. T.; Lage, K.; Bergholdt, R.; Brunak, S.; Pociot, F.
2009-01-01
Aim To develop novel methods for identifying new genes that contribute to the risk of developing type 1 diabetes within the Major Histocompatibility Complex (MHC) region on chromosome 6, independently of the known linkage disequilibrium (LD) between human leucocyte antigen (HLA)-DRB1, -DQA1, -DQB1 genes. Methods We have developed a novel method that combines single nucleotide polymorphism (SNP) genotyping data with protein–protein interaction (ppi) networks to identify disease-associated network modules enriched for proteins encoded from the MHC region. Approximately 2500 SNPs located in the 4 Mb MHC region were analysed in 1000 affected offspring trios generated by the Type 1 Diabetes Genetics Consortium (T1DGC). The most associated SNP in each gene was chosen and genes were mapped to ppi networks for identification of interaction partners. The association testing and resulting interacting protein modules were statistically evaluated using permutation. Results A total of 151 genes could be mapped to nodes within the protein interaction network and their interaction partners were identified. Five protein interaction modules reached statistical significance using this approach. The identified proteins are well known in the pathogenesis of T1D, but the modules also contain additional candidates that have been implicated in β-cell development and diabetic complications. Conclusions The extensive LD within the MHC region makes it important to develop new methods for analysing genotyping data for identification of additional risk genes for T1D. Combining genetic data with knowledge about functional pathways provides new insight into mechanisms underlying T1D. PMID:19143816
Clendenen, Tess V; Rendleman, Justin; Ge, Wenzhen; Koenig, Karen L; Wirgin, Isaac; Currie, Diane; Shore, Roy E; Kirchhoff, Tomas; Zeleniuch-Jacquotte, Anne
2015-01-01
Large epidemiologic studies have the potential to make valuable contributions to the assessment of gene-environment interactions because they prospectively collected detailed exposure data. Some of these studies, however, have only serum or plasma samples as a low quantity source of DNA. We examined whether DNA isolated from serum can be used to reliably and accurately genotype single nucleotide polymorphisms (SNPs) using Sequenom multiplex SNP genotyping technology. We genotyped 81 SNPs using samples from 158 participants in the NYU Women's Health Study. Each participant had DNA from serum and at least one paired DNA sample isolated from a high quality source of DNA, i.e. clots and/or cell precipitates, for comparison. We observed that 60 of the 81 SNPs (74%) had high call frequencies (≥95%) using DNA from serum, only slightly lower than the 85% of SNPs with high call frequencies in DNA from clots or cell precipitates. Of the 57 SNPs with high call frequencies for serum, clot, and cell precipitate DNA, 54 (95%) had highly concordant (>98%) genotype calls across all three sample types. High purity was not a critical factor to successful genotyping. Our results suggest that this multiplex SNP genotyping method can be used reliably on DNA from serum in large-scale epidemiologic studies.
SNPMeta: SNP annotation and SNP metadata collection without a reference genome
USDA-ARS?s Scientific Manuscript database
The increase in availability of resequencing data is greatly accelerating SNP discovery and has facilitated the development of SNP genotyping assays. This, in turn, is increasing interest in annotation of individual SNPs. Currently, these data are only available through curation, or comparison to a ...
Dettogni, Raquel Spinassé; Sá, Ricardo Tristão; Tovar, Thaís Tristão; Louro, Iúri Drumond
2013-08-01
Mapping single nucleotide polymorphisms (SNPs) in genes potentially involved in immune responses may help understand the pathophysiology of infectious diseases in specific geographical regions. In this context, we have aimed to analyze the frequency of immunogenetic markers, focusing on genes CD209 (SNP -336A/G), FCγRIIa (SNP -131H/R), TNF-α (SNP -308A/G) and VDR (SNP Taq I) in two populations of the Espirito Santo State (ES), Brazil: general and Pomeranian populations. Peripheral blood genomic DNA was extracted from one hundred healthy individuals of the general population and from 59 Pomeranians. Polymorphic variant identification was performed by polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). SNP genotype frequencies were in Hardy-Weinberg Equilibrium. There was no statistically significant difference in allelic and genotypic distributions between the two populations studied. Statistically significant differences were observed for SNP genotype distribution in genes CD209, TNF-α and VDR when comparing the ES populations with other Brazilian populations. This is the first report of CD209, FcγRIIa, TNF-α and VDR allelic frequencies for the general and Pomeranian populations of ES.
Bai, Xianan; Xie, Jingjing; Sun, Shanshan; Zhang, Xianyu; Jiang, Yongdong; Pang, Da
2017-01-01
Background Cytochrome P450 (CYP) 1A2 and CYP3A4 may play a role in the differentiation of clinical outcomes among breast cancer women. This study aimed to analyze the association of genetic polymorphisms in the CYP1A2 and CYP3A4 genes with clinicopathological features, protein expression and prognosis of breast cancer in the northern Chinese population. Results Firstly, SNP rs11636419, rs17861162 and rs2470890 in the CYP1A2 were significantly associated with age and menstruation status. And SNP rs11636419 and rs17861162 were associated with the P53 status. Secondly, SNP rs2470890 was correlated with CYP1A2 protein expression under the co-dominant and dominant model (P = 0.017, P = 0.006, respectively). Thirdly, for SNP rs2470890, the Kaplan–Meier 5 year survival curves showed that patients carrying genotypes CT or TT had a worse OS compared with the genotype CC carriers under both codominant and dominant model (P < 0.001, P < 0.001, respectively). Materials and Methods Four single nucleotide polymorphisms (SNPs) were successfully genotyped in 459 breast cancer patients using the SNaPshot method. The associations of four polymorphisms with protein expression and clinicopathological characteristics were evaluated by Pearson's chi-square test. The Cox hazard regression analysis and Kaplan–Meier survival analysis were performed to evaluate the relationship between the SNPs and overall survival (OS) of breast cancer. Conclusions CYP1A2 rs2470890 was significantly associated with the prognosis of patients with breast cancer and could serve as an independent impact factor of prognosis of breast carcinoma. PMID:28418906
Goodin, Douglas S.; Khankhanian, Pouya
2014-01-01
Background Genome-wide association studies (GWAS) identify disease-associations for single-nucleotide-polymorphisms (SNPs) from scattered genomic-locations. However, SNPs frequently reside on several different SNP-haplotypes, only some of which may be disease-associated. This circumstance lowers the observed odds-ratio for disease-association. Methodology/Principal Findings Here we develop a method to identify the two SNP-haplotypes, which combine to produce each person’s SNP-genotype over specified chromosomal segments. Two multiple sclerosis (MS)-associated genetic regions were modeled; DRB1 (a Class II molecule of the major histocompatibility complex) and MMEL1 (an endopeptidase that degrades both neuropeptides and β-amyloid). For each locus, we considered sets of eleven adjacent SNPs, surrounding the putative disease-associated gene and spanning ∼200 kb of DNA. The SNP-information was converted into an ordered-set of eleven-numbers (subject-vectors) based on whether a person had zero, one, or two copies of particular SNP-variant at each sequential SNP-location. SNP-strings were defined as those ordered-combinations of eleven-numbers (0 or 1), representing a haplotype, two of which combined to form the observed subject-vector. Subject-vectors were resolved using probabilistic methods. In both regions, only a small number of SNP-strings were present. We compared our method to the SHAPEIT-2 phasing-algorithm. When the SNP-information spanning 200 kb was used, SHAPEIT-2 was inaccurate. When the SHAPEIT-2 window was increased to 2,000 kb, the concordance between the two methods, in both of these eleven-SNP regions, was over 99%, suggesting that, in these regions, both methods were quite accurate. Nevertheless, correspondence was not uniformly high over the entire DNA-span but, rather, was characterized by alternating peaks and valleys of concordance. Moreover, in the valleys of poor-correspondence, SHAPEIT-2 was also inconsistent with itself, suggesting that the SNP-string method is more accurate across the entire region. Conclusions/Significance Accurate haplotype identification will enhance the detection of genetic-associations. The SNP-string method provides a simple means to accomplish this and can be extended to cover larger genomic regions, thereby improving a GWAS’s power, even for those published previously. PMID:24727690
2012-01-01
Background High-density genotyping arrays that measure hybridization of genomic DNA fragments to allele-specific oligonucleotide probes are widely used to genotype single nucleotide polymorphisms (SNPs) in genetic studies, including human genome-wide association studies. Hybridization intensities are converted to genotype calls by clustering algorithms that assign each sample to a genotype class at each SNP. Data for SNP probes that do not conform to the expected pattern of clustering are often discarded, contributing to ascertainment bias and resulting in lost information - as much as 50% in a recent genome-wide association study in dogs. Results We identified atypical patterns of hybridization intensities that were highly reproducible and demonstrated that these patterns represent genetic variants that were not accounted for in the design of the array platform. We characterized variable intensity oligonucleotide (VINO) probes that display such patterns and are found in all hybridization-based genotyping platforms, including those developed for human, dog, cattle, and mouse. When recognized and properly interpreted, VINOs recovered a substantial fraction of discarded probes and counteracted SNP ascertainment bias. We developed software (MouseDivGeno) that identifies VINOs and improves the accuracy of genotype calling. MouseDivGeno produced highly concordant genotype calls when compared with other methods but it uniquely identified more than 786000 VINOs in 351 mouse samples. We used whole-genome sequence from 14 mouse strains to confirm the presence of novel variants explaining 28000 VINOs in those strains. We also identified VINOs in human HapMap 3 samples, many of which were specific to an African population. Incorporating VINOs in phylogenetic analyses substantially improved the accuracy of a Mus species tree and local haplotype assignment in laboratory mouse strains. Conclusion The problems of ascertainment bias and missing information due to genotyping errors are widely recognized as limiting factors in genetic studies. We have conducted the first formal analysis of the effect of novel variants on genotyping arrays, and we have shown that these variants account for a large portion of miscalled and uncalled genotypes. Genetic studies will benefit from substantial improvements in the accuracy of their results by incorporating VINOs in their analyses. PMID:22260749
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaing, C; Gardner, S
The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genomemore » wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.« less
Measuring diversity in Gossypium hirsutum using the CottonSNP63K Array
USDA-ARS?s Scientific Manuscript database
A CottonSNP63K array and accompanying cluster file has been developed and includes 45,104 intra-specific SNPs and 17,954 inter-specific SNPs for automated genotyping of cotton (Gossypium spp.) samples. Development of the cluster file included genotyping of 1,156 samples, a subset of which were iden...
2013-01-02
intensity data from the SNP array were normalized using the Affymetrix GeneChip Targeted Genotyping Analysis Software ( GTGS ). To assess robustness of SNP...calls, genotypes were called using three algorithms: (i) GTGS , (ii) illuminus (27), and (iii) a heuristic algorithm based on discrete cutoffs of
Combinations of SNP genotypes from the Wellcome Trust Case Control Study of bipolar patients.
Mellerup, Erling; Jørgensen, Martin Balslev; Dam, Henrik; Møller, Gert Lykke
2018-04-01
Combinations of genetic variants are the basis for polygenic disorders. We examined combinations of SNP genotypes taken from the 446 729 SNPs in The Wellcome Trust Case Control Study of bipolar patients. Parallel computing by graphics processing units, cloud computing, and data mining tools were used to scan The Wellcome Trust data set for combinations. Two clusters of combinations were significantly associated with bipolar disorder. One cluster contained 68 combinations, each of which included five SNP genotypes. Of the 1998 patients, 305 had combinations from this cluster in their genome, but none of the 1500 controls had any of these combinations in their genome. The other cluster contained six combinations, each of which included five SNP genotypes. Of the 1998 patients, 515 had combinations from the cluster in their genome, but none of the 1500 controls had any of these combinations in their genome. Clusters of combinations of genetic variants can be considered general risk factors for polygenic disorders, whereas accumulation of combinations from the clusters in the genome of a patient can be considered a personal risk factor.
Calpain-10 gene polymorphism in type 2 diabetes mellitus patients in the Gaza Strip.
Zaharna, Mazen M; Abed, Abdalla A; Sharif, Fadel A
2010-01-01
To examine the role of calpain-10 SNP-44, -43, -63 and del/ins-19 in genetic susceptibility to type 2 diabetes mellitus (T2DM) and associations with triglycerides and total cholesterol in a group of subjects residing in the Gaza Strip. Ninety-six individuals were examined: 48 T2DM patients and 48 controls. The groups were genotyped for calpain-10 SNP-44, -43, -63, and del/ins-19. Mutagenically separated polymerase chain reaction was used to examine SNP-44; del/ins-19 was examined by electrophoresis of the PCR product on agarose gel, while the restriction fragment length polymorphism method was used for SNP-43 and -63. There was evidence that the C allele at SNP-44 played a possible role in susceptibility to T2DM (p = 0.01). T2DM patients with G/A genotype were found to have higher levels of total cholesterol in comparison to those homozygous for allele 1 (G/G) in SNP-43. Total cholesterol levels increased in T2DM patients who are homozygous for del/ins-19 allele 2, in T2DM patients with the 121/221 haplotype combination, and in control subjects with the haplotype combination 111/121. SNP-44 polymorphism of the calpain-10 gene has a significant association with T2DM patients in the Gaza strip. Certain polymorphisms of calpain-10 also have associations with the levels of total cholesterol in both T2DM patients and controls. Copyright © 2010 S. Karger AG, Basel.
Loya Méndez, Yolanda; Reyes Leal, Gilberto; Sánchez González, Adriana; Portillo Reyes, Verónica; Reyes Ruvalcaba, David; Bojórquez Rangel, Guillermo
2014-09-28
Diabetes Mellitus (DM) type 2 is a common pathology with multifactorial etiology, which exact genetic bases remain unknown. Some studies suggest that single nucleotides polymorphisms (SNPs) in the CAPN10 gene (Locus 2q37.3) could be associated with the development of this disease, including the insertion/deletion polymorphism SNP-19 (2R→3R). The present study determined the association between the SNP-19 and the risk of developing DM type 2 in Ciudad Juarez population. For this study 107 participants were selected: 43 diabetics type 2 (cases) and 64 non diabetics with no family history of DM type 2 in first grade (control). Anthropometric studies were realized as well as lipids, lipoproteins and serum glucose biochemical profiles. The genotypification of SNP-19 was performed using peripheral blood lymphocytes DNA, polymerase chain reactions (PCR), and electrophoretic analysis in agarose gels. Once obtained the genotypic and allelic frequencies, the Hardy-Weinberg equilibrium test (GenAlEx 6.4) was also performed. Using the X² analysis it was identified the genotypic differences between cases and control with higher frequency of the homozygous genotype 3R of SNP- 19 in the cases group (0.418) compared to control group (0.265). Also, it was observed an association between genotype 2R/3R with elevated weight, body mass index, and waist and hip circumferences, but only in the diabetic group (P=< 0.05). The findings in this study suggest that SNP-19 in CAPN10 may participate in the development of DM type 2 in the studied population. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Calpain-10 gene polymorphisms and risk of type 2 diabetes mellitus in Mexican mestizos.
Picos-Cárdenas, V J; Sáinz-González, E; Miliar-García, A; Romero-Zazueta, A; Quintero-Osuna, R; Leal-Ugarte, E; Peralta-Leal, V; Meza-Espinoza, J P
2015-03-27
The calpain-10 gene is expressed primarily in tissues important in glucose metabolism; thus, some of its polymorphisms have been associated with type 2 diabetes. In this study, we examined the association between the calpain-10 single-nucleotide polymorphism (SNP)-43, SNP-19, and SNP-63 and type 2 diabetes in Mexican mestizos. We included 211 patients and 152 non-diabetic subjects. Polymerase chain reaction was used to identify alleles. We compared allele, genotype, haplotype, and diplotype frequencies between both groups and used the chi-square test to calculate the risk. The allele frequency of SNP-43 allele 1 was 70% in controls and 72% in patients; the GG, GA, and AA genotype frequencies were 48.7, 42.8, and 8.5% in controls and 51.2, 41.7, and 7.1% in patients, respectively. For SNP- 19, the prevalence of allele 1 (2R) was 32% in controls and 39% in patients. In controls, homozygosity (2R/2R) was 10.5%, heterozygosity was 42.8%, and 3R/3R was 46.7%; in cases, these values were 13.3, 50.7, and 36.0%, respectively. For SNP-63, the frequency of allele 1 was 87% in controls and 83% in patients; genotype frequencies in controls were 75.7% (CC), 23% (CT), and 1.3% (TT), and were 69.7, 27.5, and 2.8%, respectively for the cases. Genotype distributions were consistent with Hardy-Weinberg equilibrium. No significant intergroup differences for allele, genotype, haplotype, or diplotype frequencies were observed. We found no association between these polymorphisms and diabetes. However, our sample size was small, so the role of calpain-10 risk alleles should be further examined.
COLE-TOBIAN, JENNIFER L.; ZIMMERMAN, PETER A.; KING, CHRISTOPHER L.
2013-01-01
Individuals living in malaria endemic areas are often infected with multiple parasite clones. Currently used single nucleotide polymorphism (SNP) genotyping methods for malaria parasites are cumbersome; furthermore, few methods currently exist that can rapidly determine the most abundant clone in these complex infections. Here we describe an oligonucleotide ligation assay (OLA) to distinguish SNPs in the Plasmodium vivax Duffy binding protein gene (Pvdbp) at 14 polymorphic residues simultaneously. Allele abundance is determined by the highest mean fluorescent intensity of each allele. Using mixtures of plasmids encoding known haplotypes of the Pvdbp, single clones of P. vivax parasites from infected Aotus monkeys, and well-defined mixed infections from field samples, we were able to identify the predominant Pvdbp genotype with > 93% accuracy when the dominant clone is twice as abundant as a lesser genotype and > 97% of the time if the ratio was 5:1 or greater. Thus, the OLA can accurately, reproducibly, and rapidly determine the predominant parasite haplotype in complex blood stage infections. PMID:17255222
Hu, Jian; Zhou, Yi-ren; Ding, Jia-lin; Wang, Zhi-yuan; Liu, Ling; Wang, Ye-kai; Lou, Hui-ling; Qiao, Shou-yi; Wu, Yan-hua
2017-05-20
The ABO blood type is one of the most common and widely used genetic traits in humans. Three glycosyltransferase-encoding gene alleles, I A , I B and i, produce three red blood cell surface antigens, by which the ABO blood type is classified. By using the ABO blood type experiment as an ideal case for genetics teaching, we can easily introduce to the students several genetic concepts, including multiple alleles, gene interaction, single nucleotide polymorphism (SNP) and gene evolution. Herein we have innovated and integrated our ABO blood type genetics experiments. First, in the section of Molecular Genetics, a new method of ABO blood genotyping was established: specific primers based on SNP sites were designed to distinguish three alleles through quantitative real-time PCR. Next, the experimental teaching method of Gene Evolution was innovated in the Population Genetics section: a gene-evolution software was developed to simulate the evolutionary tendency of the ABO genotype encoding alleles under diverse conditions. Our reform aims to extend the contents of genetics experiments, to provide additional teaching approaches, and to improve the learning efficiency of our students eventually.
Fondevila, M; Børsting, C; Phillips, C; de la Puente, M; Consortium, Euroforen-NoE; Carracedo, A; Morling, N; Lareu, M V
2017-01-01
This review explores the key factors that influence the optimization, routine use, and profile interpretation of the SNaPshot single-base extension (SBE) system applied to forensic single-nucleotide polymorphism (SNP) genotyping. Despite being a mainly complimentary DNA genotyping technique to routine STR profiling, use of SNaPshot is an important part of the development of SNP sets for a wide range of forensic applications with these markers, from genotyping highly degraded DNA with very short amplicons to the introduction of SNPs to ascertain the ancestry and physical characteristics of an unidentified contact trace donor. However, this technology, as resourceful as it is, displays several features that depart from the usual STR genotyping far enough to demand a certain degree of expertise from the forensic analyst before tackling the complex casework on which SNaPshot application provides an advantage. In order to provide the basis for developing such expertise, we cover in this paper the most challenging aspects of the SNaPshot technology, focusing on the steps taken to design primer sets, optimize the PCR and single-base extension chemistries, and the important features of the peak patterns observed in typical forensic SNP profiles using SNaPshot. With that purpose in mind, we provide guidelines and troubleshooting for multiplex-SNaPshot-oriented primer design and the resulting capillary electrophoresis (CE) profile interpretation (covering the most commonly observed artifacts and expected departures from the ideal conditions). Copyright © 2017 Central Police University.
Eduardoff, M; Gross, T E; Santos, C; de la Puente, M; Ballard, D; Strobl, C; Børsting, C; Morling, N; Fusco, L; Hussing, C; Egyed, B; Souto, L; Uacyisrael, J; Syndercombe Court, D; Carracedo, Á; Lareu, M V; Schneider, P M; Parson, W; Phillips, C; Parson, W; Phillips, C
2016-07-01
The EUROFORGEN Global ancestry-informative SNP (AIM-SNPs) panel is a forensic multiplex of 128 markers designed to differentiate an individual's ancestry from amongst the five continental population groups of Africa, Europe, East Asia, Native America, and Oceania. A custom multiplex of AmpliSeq™ PCR primers was designed for the Global AIM-SNPs to perform massively parallel sequencing using the Ion PGM™ system. This study assessed individual SNP genotyping precision using the Ion PGM™, the forensic sensitivity of the multiplex using dilution series, degraded DNA plus simple mixtures, and the ancestry differentiation power of the final panel design, which required substitution of three original ancestry-informative SNPs with alternatives. Fourteen populations that had not been previously analyzed were genotyped using the custom multiplex and these studies allowed assessment of genotyping performance by comparison of data across five laboratories. Results indicate a low level of genotyping error can still occur from sequence misalignment caused by homopolymeric tracts close to the target SNP, despite careful scrutiny of candidate SNPs at the design stage. Such sequence misalignment required the exclusion of component SNP rs2080161 from the Global AIM-SNPs panel. However, the overall genotyping precision and sensitivity of this custom multiplex indicates the Ion PGM™ assay for the Global AIM-SNPs is highly suitable for forensic ancestry analysis with massively parallel sequencing. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
The utility of low-density genotyping for imputation in the Thoroughbred horse
2014-01-01
Background Despite the dramatic reduction in the cost of high-density genotyping that has occurred over the last decade, it remains one of the limiting factors for obtaining the large datasets required for genomic studies of disease in the horse. In this study, we investigated the potential for low-density genotyping and subsequent imputation to address this problem. Results Using the haplotype phasing and imputation program, BEAGLE, it is possible to impute genotypes from low- to high-density (50K) in the Thoroughbred horse with reasonable to high accuracy. Analysis of the sources of variation in imputation accuracy revealed dependence both on the minor allele frequency of the single nucleotide polymorphisms (SNPs) being imputed and on the underlying linkage disequilibrium structure. Whereas equidistant spacing of the SNPs on the low-density panel worked well, optimising SNP selection to increase their minor allele frequency was advantageous, even when the panel was subsequently used in a population of different geographical origin. Replacing base pair position with linkage disequilibrium map distance reduced the variation in imputation accuracy across SNPs. Whereas a 1K SNP panel was generally sufficient to ensure that more than 80% of genotypes were correctly imputed, other studies suggest that a 2K to 3K panel is more efficient to minimize the subsequent loss of accuracy in genomic prediction analyses. The relationship between accuracy and genotyping costs for the different low-density panels, suggests that a 2K SNP panel would represent good value for money. Conclusions Low-density genotyping with a 2K SNP panel followed by imputation provides a compromise between cost and accuracy that could promote more widespread genotyping, and hence the use of genomic information in horses. In addition to offering a low cost alternative to high-density genotyping, imputation provides a means to combine datasets from different genotyping platforms, which is becoming necessary since researchers are starting to use the recently developed equine 70K SNP chip. However, more work is needed to evaluate the impact of between-breed differences on imputation accuracy. PMID:24495673
Yuan, Youhua; Wen, Yan; You, Yuangang; Xing, Yan; Li, Huanying; Weng, Xiaoman; Wu, Nan; Liu, Shuang; Zhang, Shanshan; Zhang, Wenhong; Zhang, Ying
2015-01-01
Leprosy continues to be prevalent in some mountainous regions of China, and genotypes of leprosy strains endemic to the country are not known. Mycobacterium lepromatosis is a new species that was discovered in Mexico in 2008, and it remains unclear whether this species exists in China. Here, we conducted PCR- restriction fragment length polymorphism (RFLP) analysis to classify genotypes of 85 DNA samples collected from patients from 18 different provinces. All 171 DNA samples from skin biopsies of leprosy patients were tested for the presence of Mycobacterium leprae and Mycobacterium lepromatosis by amplifying the 16S rRNA gene using nested PCR, followed by DNA sequencing. The new species M. lepromatosis was not found among the 171 specimens from leprosy patients in 22 provinces in China. However, we found three SNP genotypes among 85 leprosy patients. A mutation at C251T in the 16S rRNA gene was found in 76% of the strains. We also found that the strains that showed the 16S rRNA C251T mutation belonged to SNP type 3, whereas strains without the point mutation belonged to SNP type 1. The SNP type 3 leprosy strains were observed in patients from both the inner and coastal regions of China, but the SNP type 1 strains were focused only in the coastal region. This indicated that the SNP type 3 leprosy strains were more prevalent than the SNP type 1 strains in China. In addition, the 16S rRNA gene sequence mutation at C251T also indicated a difference in the geographical distribution of the strains. To our knowledge, this is the first report of a new polymorphism in 16S rRNA gene in M. leprae in China. Our findings shed light on the prevalent genotypes and provide insight about leprosy transmission that are important for leprosy control in China.
IL13 genetic polymorphisms, smoking, and eczema in women: a case-control study in Japan
2011-01-01
Background Several genetic association studies have examined the relationships between single nucleotide polymorphisms (SNPs) in the IL13 gene and eczema, and have provided contradictory results. We investigated the relationship between the IL13 SNPs rs1800925 and rs20541 and the risk of eczema in Japanese young adult women. Methods Included were 188 cases who met the criteria of the International Study of Asthma and Allergies in Childhood (ISAAC) for eczema. Control subjects were 1,082 women without eczema according to the ISAAC criteria, who had not been diagnosed with atopic eczema by a doctor and who had no current asthma as defined by the European Community Respiratory Health Survey criteria. Adjustment was made for age, region of residence, number of children, smoking, and education. Results The minor TT genotype of SNP rs1800925 was significantly associated with an increased risk of eczema in the co-dominant model: the adjusted odds ratio was 2.19 (95% confidence interval: 1.03-4.67). SNP rs20541 was not related to eczema. None of the haplotypes were significantly associated with eczema. Compared with women with the CC or CT genotype of SNP rs1800925 who had never smoked, those with the TT genotype who had ever smoked had a 2.85-fold increased risk of eczema, though the adjusted odds ratio was not statistically significant, and neither multiplicative nor additive interaction was statistically significant. Conclusions Our findings suggest that the IL13 SNP rs1800925 is significantly associated with eczema in Japanese young adult women. We could not find evidence for an interaction between SNP rs1800925 and smoking with regard to eczema. PMID:22013915
Fedko, Iryna O; Hottenga, Jouke-Jan; Medina-Gomez, Carolina; Pappa, Irene; van Beijsterveldt, Catharina E M; Ehli, Erik A; Davies, Gareth E; Rivadeneira, Fernando; Tiemeier, Henning; Swertz, Morris A; Middeldorp, Christel M; Bartels, Meike; Boomsma, Dorret I
2015-09-01
Combining genotype data across cohorts increases power to estimate the heritability due to common single nucleotide polymorphisms (SNPs), based on analyzing a Genetic Relationship Matrix (GRM). However, the combination of SNP data across multiple cohorts may lead to stratification, when for example, different genotyping platforms are used. In the current study, we address issues of combining SNP data from different cohorts, the Netherlands Twin Register (NTR) and the Generation R (GENR) study. Both cohorts include children of Northern European Dutch background (N = 3102 + 2826, respectively) who were genotyped on different platforms. We explore imputation and phasing as a tool and compare three GRM-building strategies, when data from two cohorts are (1) just combined, (2) pre-combined and cross-platform imputed and (3) cross-platform imputed and post-combined. We test these three strategies with data on childhood height for unrelated individuals (N = 3124, average age 6.7 years) to explore their effect on SNP-heritability estimates and compare results to those obtained from the independent studies. All combination strategies result in SNP-heritability estimates with a standard error smaller than those of the independent studies. We did not observe significant difference in estimates of SNP-heritability based on various cross-platform imputed GRMs. SNP-heritability of childhood height was on average estimated as 0.50 (SE = 0.10). Introducing cohort as a covariate resulted in ≈2 % drop. Principal components (PCs) adjustment resulted in SNP-heritability estimates of about 0.39 (SE = 0.11). Strikingly, we did not find significant difference between cross-platform imputed and combined GRMs. All estimates were significant regardless the use of PCs adjustment. Based on these analyses we conclude that imputation with a reference set helps to increase power to estimate SNP-heritability by combining cohorts of the same ethnicity genotyped on different platforms. However, important factors should be taken into account such as remaining cohort stratification after imputation and/or phenotypic heterogeneity between and within cohorts. Whether one should use imputation, or just combine the genotype data, depends on the number of overlapping SNPs in relation to the total number of genotyped SNPs for both cohorts, and their ability to tag all the genetic variance related to the specific trait of interest.
Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J
2012-05-25
A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
Dötsch, Annika; Eisele, Lewin; Rabeling, Miriam; Rump, Katharina; Walstein, Kai; Bick, Alexandra; Cox, Linda; Engler, Andrea; Bachmann, Hagen S; Jöckel, Karl-Heinz; Adamzik, Michael; Peters, Jürgen; Schäfer, Simon T
2017-06-14
Hypoxia-inducible-factor-2α (HIF-2α) and HIF-2 degrading prolyl-hydroxylases (PHD) are key regulators of adaptive hypoxic responses i.e., in acute respiratory distress syndrome (ARDS). Specifically, functionally active genetic variants of HIF-2α (single nucleotide polymorphism (SNP) [ch2:46441523(hg18)]) and PHD2 (C/T; SNP rs516651 and T/C; SNP rs480902) are associated with improved adaptation to hypoxia i.e., in high-altitude residents. However, little is known about these SNPs' prevalence in Caucasians and impact on ARDS-outcome. Thus, we tested the hypotheses that in Caucasian ARDS patients SNPs in HIF-2α or PHD2 genes are (1) common, and (2) independent risk factors for 30-day mortality. After ethics-committee approval, 272 ARDS patients were prospectively included, genotyped for PHD2 (Taqman SNP Genotyping Assay) and HIF-2α -polymorphism (restriction digest + agarose-gel visualization), and genotype dependent 30-day mortality was analyzed using Kaplan-Meier-plots and multivariate Cox-regression analyses. Frequencies were 99.62% for homozygous HIF-2α CC-carriers (CG: 0.38%; GG: 0%), 2.3% for homozygous PHD2 SNP rs516651 TT-carriers (CT: 18.9%; CC: 78.8%), and 3.7% for homozygous PHD2 SNP rs480902 TT-carriers (CT: 43.9%; CC: 52.4%). PHD2 rs516651 TT-genotype in ARDS was independently associated with a 3.34 times greater mortality risk (OR 3.34, CI 1.09-10.22; p = 0.034) within 30-days, whereas the other SNPs had no significant impact ( p = ns). The homozygous HIF-2α GG-genotype was not present in our Caucasian ARDS cohort; however PHD2 SNPs exist in Caucasians, and PHD2 rs516651 TT-genotype was associated with an increased 30-day mortality suggesting a relevance for adaptive responses in ARDS.
Wade, Len J.; Bartolome, Violeta; Mauleon, Ramil; Vasant, Vivek Deshmuck; Prabakar, Sumeet Mankar; Chelliah, Muthukumar; Kameoka, Emi; Nagendra, K.; Reddy, K. R. Kamalnath; Varma, C. Mohan Kumar; Patil, Kalmeshwar Gouda; Shrestha, Roshi; Al-Shugeairy, Zaniab; Al-Ogaidi, Faez; Munasinghe, Mayuri; Gowda, Veeresh; Semon, Mande; Suralta, Roel R.; Shenoy, Vinay; Vadez, Vincent; Serraj, Rachid; Shashidhar, H. E.; Yamauchi, Akira; Babu, Ranganathan Chandra; Price, Adam; McNally, Kenneth L.; Henry, Amelia
2015-01-01
The rapid progress in rice genotyping must be matched by advances in phenotyping. A better understanding of genetic variation in rice for drought response, root traits, and practical methods for studying them are needed. In this study, the OryzaSNP set (20 diverse genotypes that have been genotyped for SNP markers) was phenotyped in a range of field and container studies to study the diversity of rice root growth and response to drought. Of the root traits measured across more than 20 root experiments, root dry weight showed the most stable genotypic performance across studies. The environment (E) component had the strongest effect on yield and root traits. We identified genomic regions correlated with root dry weight, percent deep roots, maximum root depth, and grain yield based on a correlation analysis with the phenotypes and aus, indica, or japonica introgression regions using the SNP data. Two genomic regions were identified as hot spots in which root traits and grain yield were co-located; on chromosome 1 (39.7–40.7 Mb) and on chromosome 8 (20.3–21.9 Mb). Across experiments, the soil type/ growth medium showed more correlations with plant growth than the container dimensions. Although the correlations among studies and genetic co-location of root traits from a range of study systems points to their potential utility to represent responses in field studies, the best correlations were observed when the two setups had some similar properties. Due to the co-location of the identified genomic regions (from introgression block analysis) with QTL for a number of previously reported root and drought traits, these regions are good candidates for detailed characterization to contribute to understanding rice improvement for response to drought. This study also highlights the utility of characterizing a small set of 20 genotypes for root growth, drought response, and related genomic regions. PMID:25909711
Fang, Yan; Gao, Na; Tian, Xin; Zhou, Jun; Zhang, Hai-Feng; Gao, Jie; He, Xiao-Pei; Wen, Qiang; Jia, Lin-Jing; Jin, Han; Qiao, Hai-Ling
2018-06-27
Background/ Aims: Little is known about the effect of P450 oxidoreductase (POR) gene polymorphisms on the activities of CYPs with multiple genotypes. We genotyped 102 human livers for 18 known POR single nucleotide polymorphisms (SNPs) with allelic frequencies greater than 1% as well as for 27 known SNPs in 10 CYPs. CYP enzyme activities in microsomes prepared from these livers were determined by measuring probe substrate metabolism by high performance liquid chromatograph. We found that the effects of the 18 POR SNPs on 10 CYP activities were CYP genotype-dependent. The POR mutations were significantly associated with decreased overall Km for CYP2B6 and 2E1, and specific genotypes within CYP1A2, 2A6, 2B6, 2C8, 2D6 and 2E1 were identified as being affected by these POR SNPs. Notably, the effect of a specific POR mutation on the activity of a CYP genotype could not be predicted from other CYP genotypes of even the same CYP. When combining one POR SNP with other POR SNPs, a hitherto unrecognized effect of multiple-site POR gene polymorphisms (MSGP) on CYP activity was uncovered, which was not necessarily consistent with the effect of either single POR SNP. The effects of POR SNPs on CYP activities were not only CYP-dependent, but more importantly, CYP genotype-dependent. Moreover, the effect of a POR SNP alone and in combination with other POR SNPs (MSGP) was not always consistent, nor predictable. Understanding the impact of POR gene polymorphisms on drug metabolism necessitates knowing the complete SNP complement of POR and the genotype of the relevant CYPs. © 2018 The Author(s). Published by S. Karger AG, Basel.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Geraldes, Armando; Hannemann, Jan; Grassa, Chris
2013-01-01
Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. Despite the declining costs of genotyping by sequencing, for most studies, the use of large SNP genotyping arrays still offers the most cost-effective solution for large-scale targeted genotyping. Here we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species range. Due to the rapid decay of linkage disequilibrium in P. trichocarpa we adopted a candidate gene approach to the arraymore » design that resulted in the selection of 34,131 SNPs, the majority of which are located in, or within 2 kb, of 3,543 candidate genes. A subset of the SNPs (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%, indicating that high-quality data are generated with this array. We demonstrate that even among small numbers of samples (n=10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that due to ascertainment bias the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca (P. balsamifera and P. angustifolia). Finally, we provide evidence for the utility of the array for intraspecific studies of genetic differentiation and for species assignment and the detection of natural hybrids.« less
Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification
Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei
2013-01-01
Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724
2011-01-01
Background The association of rs17321515 single nucleotide polymorphism (SNP) near TRIB1 gene and serum lipid profiles has never been studied in the Chinese population. Therefore, the present study was undertaken to detect the association of rs17321515 SNP and several environmental factors on serum lipid levels in the Mulao and Han populations. Methods A total of 639 unrelated subjects of Mulao nationality and 644 participants of Han nationality were randomly selected from our previous stratified randomized cluster samples. Genotypes of the TRIB1 rs17321515 A>G SNP were determined via polymerase chain reaction and restriction fragment length polymorphism, and then confirmed by direct sequencing. Results Serum apolipoprotein (Apo) B levels were higher in Mulao than in Han (P < 0.05). There were no differences in the genotypic and allelic frequencies between the two ethnic groups (P > 0.05). High- and low-density lipoprotein cholesterol (HDL-C and LDL-C) levels in Han were different among the genotypes (P < 0.05 for each), the subjects with AG/GG genotypes had higher HDL-C and LDL-C levels than the subjects with AA genotype. Total cholesterol (TC), HDL-C, LDL-C, ApoA1 and ApoB levels in Han males were different among the genotypes (P < 0.05-0.001), the G carriers had higher TC, HDL-C, LDL-C, ApoA1 and ApoB levels than the G noncarriers. HDL-C levels in Mulao males were different among the genotypes (P < 0.05), the G carriers had lower HDL-C levels than the G noncarriers. Serum HDL-C and LDL-C levels in both ethnic groups and TG levels in Han were correlated with the genotypes or alleles (P < 0.05-0.01). TG and HDL-C levels in Mulao males and TG, HDL-C, LDL-C and ApoA1 levels in Han males were correlated with genotypes or alleles (P < 0.05-0.001). TG and ApoA1 levels in Han females were associated with genotypes (P < 0.05 for each). Serum lipid parameters were also associated with several environmental factors in both ethnic groups. Conclusions The associations of TRIB1 rs17321515 SNP and serum lipid levels are different between the Mulao and Han populations. These discrepancies might partly result from different TRIB1 gene-environmental interactions in both ethnic groups. PMID:22145581
Investigation of inversion polymorphisms in the human genome using principal components analysis.
Ma, Jianzhong; Amos, Christopher I
2012-01-01
Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct "populations" of inversion homozygotes of different orientations and their 1:1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases.
Laing, Chad R; Buchanan, Cody; Taboada, Eduardo N; Zhang, Yongxiang; Karmali, Mohamed A; Thomas, James E; Gannon, Victor Pj
2009-06-29
Many approaches have been used to study the evolution, population structure and genetic diversity of Escherichia coli O157:H7; however, observations made with different genotyping systems are not easily relatable to each other. Three genetic lineages of E. coli O157:H7 designated I, II and I/II have been identified using octamer-based genome scanning and microarray comparative genomic hybridization (mCGH). Each lineage contains significant phenotypic differences, with lineage I strains being the most commonly associated with human infections. Similarly, a clade of hyper-virulent O157:H7 strains implicated in the 2006 spinach and lettuce outbreaks has been defined using single-nucleotide polymorphism (SNP) typing. In this study an in silico comparison of six different genotyping approaches was performed on 19 E. coli genome sequences from 17 O157:H7 strains and single O145:NM and K12 MG1655 strains to provide an overall picture of diversity of the E. coli O157:H7 population, and to compare genotyping methods for O157:H7 strains. In silico determination of lineage, Shiga-toxin bacteriophage integration site, comparative genomic fingerprint, mCGH profile, novel region distribution profile, SNP type and multi-locus variable number tandem repeat analysis type was performed and a supernetwork based on the combination of these methods was produced. This supernetwork showed three distinct clusters of strains that were O157:H7 lineage-specific, with the SNP-based hyper-virulent clade 8 synonymous with O157:H7 lineage I/II. Lineage I/II/clade 8 strains clustered closest on the supernetwork to E. coli K12 and E. coli O55:H7, O145:NM and sorbitol-fermenting O157 strains. The results of this study highlight the similarities in relationships derived from multi-locus genome sampling methods and suggest a "common genotyping language" may be devised for population genetics and epidemiological studies. Future genotyping methods should provide data that can be stored centrally and accessed locally in an easily transferable, informative and extensible format based on comparative genomic analyses.
McQuaid, Robyn J.; McInnis, Opal A.; Matheson, Kimberly; Anisman, Hymie
2016-01-01
Although the neuropeptide oxytocin has been associated with enhanced prosocial behaviors, it has also been linked to aggression and mental health disorders. Thus, it was suggested that oxytocin might act by increasing the salience of social stimuli, irrespective of whether these are positive or negative, thus increasing vulnerability to negative mental health outcomes. The current study (N = 243), conducted among white university students, examined the relation of trauma, depressive symptoms including suicidal ideation in relation to a single nucleotide polymorphism (SNP) within the oxytocin receptor gene (OXTR), rs53576, and a SNP on the CD38 gene that controls oxytocin release, rs3796863. Individuals with the polymorphism on both alleles (AA genotype) of the CD38 SNP had previously been linked to elevated plasma oxytocin levels. Consistent with the social sensitivity perspective, however, in the current study, individuals carrying the AA genotype displayed elevated feelings of alienation from parents and peers as well as increased levels of suicidal ideation. Moreover, they tended to report elevated depressive symptoms compared to CC homozygotes. It was also observed that the CD38 genotype moderated the relation between trauma and suicidal ideation scores, such that high levels of trauma were associated with elevated suicidal ideation among all CD38 genotypes, but this relationship was stronger among individuals with the AA genotype. In contrast, there was no relationship between the OXTR SNP, rs53576, depression or suicidal ideation. These findings support a social sensitivity hypothesis of oxytocin, wherein the AA genotype of the CD38 SNP, which has been considered the “protective allele” was associated with increased sensitivity and susceptibility to disturbed social relations and suicidal ideation. PMID:27486392
Tahir, Imtiaz Mahmood; Iqbal, Tahira; Saleem, Sadaf; Perveen, Sofia; Farooqi, Aboubakker
2017-01-01
Interindividual variability in polymorphic uridine diphosphate-glucuronosyltransferase 1A1 (UGT1A1) ascribed to genetic diversity is associated with relative glucuronidation level among individuals. The present research was aimed to study the effect of 2 important single nucleotide polymorphisms (SNPs; rs8330 and rs10929303) of UGT1A1 gene on glucuronidation status of acetaminophen in healthy volunteers (n = 109). Among enrolled volunteers, 54.13% were male (n = 59) and 45.87% were female (n = 50). The in vivo activity of UGT1A1 was investigated by high-performance liquid chromatography-based analysis of glucuronidation status (ie, acetaminophen and acetaminophen glucuronide) in human volunteers after oral intake of a single dose (1000 mg) of acetaminophen. The TaqMan SNP genotyping assay was used for UGT1A1 genotyping. The wild-type genotype (C/C) was observed the most frequent one for both SNPs (rs8330 and rs10929303) and associated with fast glucuronidator phenotypes. The distribution of variant genotype (G/G) for SNP rs8330 was observed in 5% of male and 8% of the female population; however, for SNP rs10929303, the G/G genotype was found in 8% of both genders. A trimodal distribution (fast, intermediate, and slow) based on phenotypes was observed. Among the male participants, the glucuronidation phenotypes were observed as 7% slow, 37% intermediate, and 56% fast glucuronidators; however, these findings for the females were slightly different as 8%, 32%, and 60% respectively. The k-statistics revealed a compelling evidence for good concordance between phenotype and genotype with a k value of 1.00 for SNP rs8330 and 0.966 for SNP rs10929303 in our population. PMID:28932176
DISTMIX: direct imputation of summary statistics for unmeasured SNPs from mixed ethnicity cohorts.
Lee, Donghyung; Bigdeli, T Bernard; Williamson, Vernell S; Vladimirov, Vladimir I; Riley, Brien P; Fanous, Ayman H; Bacanu, Silviu-Alin
2015-10-01
To increase the signal resolution for large-scale meta-analyses of genome-wide association studies, genotypes at unmeasured single nucleotide polymorphisms (SNPs) are commonly imputed using large multi-ethnic reference panels. However, the ever increasing size and ethnic diversity of both reference panels and cohorts makes genotype imputation computationally challenging for moderately sized computer clusters. Moreover, genotype imputation requires subject-level genetic data, which unlike summary statistics provided by virtually all studies, is not publicly available. While there are much less demanding methods which avoid the genotype imputation step by directly imputing SNP statistics, e.g. Directly Imputing summary STatistics (DIST) proposed by our group, their implicit assumptions make them applicable only to ethnically homogeneous cohorts. To decrease computational and access requirements for the analysis of cosmopolitan cohorts, we propose DISTMIX, which extends DIST capabilities to the analysis of mixed ethnicity cohorts. The method uses a relevant reference panel to directly impute unmeasured SNP statistics based only on statistics at measured SNPs and estimated/user-specified ethnic proportions. Simulations show that the proposed method adequately controls the Type I error rates. The 1000 Genomes panel imputation of summary statistics from the ethnically diverse Psychiatric Genetic Consortium Schizophrenia Phase 2 suggests that, when compared to genotype imputation methods, DISTMIX offers comparable imputation accuracy for only a fraction of computational resources. DISTMIX software, its reference population data, and usage examples are publicly available at http://code.google.com/p/distmix. dlee4@vcu.edu Supplementary Data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Báez, Sergio; Tsuchiya, Yasuo; Calvo, Alfonso; Pruyas, Martha; Nakamura, Kazutoshi; Kiyohara, Chikako; Oyama, Mari; Yamamoto, Masaharu
2010-01-01
AIM: To determine the effects of genetic variants associated with gallstone formation and capsaicin (a pungent component of chili pepper) metabolism on the risk of gallbladder cancer (GBC). METHODS: A total of 57 patients with GBC, 119 patients with gallstones, and 70 controls were enrolled in this study. DNA was extracted from their blood or paraffin block sample using standard commercial kits. The statuses of the genetic variants were assayed using Taqman® SNP Genotyping Assays or Custom Taqman® SNP Genotyping Assays. RESULTS: The non-ancestral T/T genotype of apolipoprotein B rs693 polymorphism was associated with a decreased risk of GBC (OR: 0.14, 95% CI: 0.03-0.63). The T/T genotype of cholesteryl ester transfer protein (CETP) rs708272 polymorphism was associated with an increased risk of GBC (OR: 5.04, 95% CI: 1.43-17.8). CONCLUSION: Genetic variants involved in gallstone formation such as the apolipoprotein B rs693 and CETP rs708272 polymorphisms may be related to the risk of developing GBC in Chilean women. PMID:20082485
Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography
Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi
2013-01-01
New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420
Pintus, M A; Gaspa, G; Nicolazzi, E L; Vicario, D; Rossoni, A; Ajmone-Marsan, P; Nardone, A; Dimauro, C; Macciotta, N P P
2012-06-01
The large number of markers available compared with phenotypes represents one of the main issues in genomic selection. In this work, principal component analysis was used to reduce the number of predictors for calculating genomic breeding values (GEBV). Bulls of 2 cattle breeds farmed in Italy (634 Brown and 469 Simmental) were genotyped with the 54K Illumina beadchip (Illumina Inc., San Diego, CA). After data editing, 37,254 and 40,179 single nucleotide polymorphisms (SNP) were retained for Brown and Simmental, respectively. Principal component analysis carried out on the SNP genotype matrix extracted 2,257 and 3,596 new variables in the 2 breeds, respectively. Bulls were sorted by birth year to create reference and prediction populations. The effect of principal components on deregressed proofs in reference animals was estimated with a BLUP model. Results were compared with those obtained by using SNP genotypes as predictors with either the BLUP or Bayes_A method. Traits considered were milk, fat, and protein yields, fat and protein percentages, and somatic cell score. The GEBV were obtained for prediction population by blending direct genomic prediction and pedigree indexes. No substantial differences were observed in squared correlations between GEBV and EBV in prediction animals between the 3 methods in the 2 breeds. The principal component analysis method allowed for a reduction of about 90% in the number of independent variables when predicting direct genomic values, with a substantial decrease in calculation time and without loss of accuracy. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Analysis of population structure and genetic history of cattle breeds based on high-density SNP data
USDA-ARS?s Scientific Manuscript database
Advances in single nucleotide polymorphism (SNP) genotyping microarrays have facilitated a new understanding of population structure and evolutionary history for several species. Most existing studies in livestock were based on low density SNP arrays. The first wave of low density SNP studies on cat...
A meta-analysis of interleukin-10-1082 promoter polymorphism associated with gastric cancer risk.
Ni, Peihua; Xu, Hong; Xue, Huiping; Lin, Bing; Lu, Yang
2012-04-01
We aimed to explore the role of allele A/G single nucleotide polymorphism (SNP) of gene Interleukin 10 (IL-10) promoter-1082 in the susceptibility to gastric cancer through a systematic review and meta-analysis. Each initially included article was scored for quality appraisal. Desirable data were extracted and registered into databases. Twenty studies were ultimately eligible for the meta-analysis of IL-10-1082 A/G SNP. We adopted the most probably appropriate genetic model (dominant model), with the combined group of GG-plus-GA genotypes compared with the AA genotype. Potential sources of heterogeneity were sought out via subgroup analyses and sensitivity analyses, and publication biases were estimated. Between IL-10-1082 GG-plus-GA genotypes with the risk of developing gastric cancer, statistically significant association could be noted with overall gastric cancer, being mainly in Asian subgroup, large sample subgroup, high quality subgroup, intestinal-type subgroup, cardia-type subgroup, and some genotyping method subgroups. Our meta-analysis indicates that IL-10-1082 GG-plus-GA genotypes are associated with the overall risk of developing gastric cancer and seem to be more susceptible to overall gastric cancer in Asian populations. IL-10-1082 GG-plus-GA genotypes are more associated with the pathologically intestinal-type gastric cancer or anatomically cardia-type gastric cancer.
A Meta-Analysis of Interleukin-10-1082 Promoter Polymorphism Associated with Gastric Cancer Risk
Ni, Peihua; Xu, Hong; Xue, Huiping; Lin, Bing
2012-01-01
We aimed to explore the role of allele A/G single nucleotide polymorphism (SNP) of gene Interleukin 10 (IL-10) promoter-1082 in the susceptibility to gastric cancer through a systematic review and meta-analysis. Each initially included article was scored for quality appraisal. Desirable data were extracted and registered into databases. Twenty studies were ultimately eligible for the meta-analysis of IL-10-1082 A/G SNP. We adopted the most probably appropriate genetic model (dominant model), with the combined group of GG-plus-GA genotypes compared with the AA genotype. Potential sources of heterogeneity were sought out via subgroup analyses and sensitivity analyses, and publication biases were estimated. Between IL-10-1082 GG-plus-GA genotypes with the risk of developing gastric cancer, statistically significant association could be noted with overall gastric cancer, being mainly in Asian subgroup, large sample subgroup, high quality subgroup, intestinal-type subgroup, cardia-type subgroup, and some genotyping method subgroups. Our meta-analysis indicates that IL-10-1082 GG-plus-GA genotypes are associated with the overall risk of developing gastric cancer and seem to be more susceptible to overall gastric cancer in Asian populations. IL-10-1082 GG-plus-GA genotypes are more associated with the pathologically intestinal-type gastric cancer or anatomically cardia-type gastric cancer. PMID:22335769
Muneta, Yoshihiro; Minagawa, Yu; Kusumoto, Masahiro; Shinkai, Hiroki; Uenishi, Hirohide; Splichal, Igor
2012-06-01
In the present study, an allele-specific primer-polymerase chain reaction (ASP-PCR) for genotyping a single nucleotide polymorphism (SNP) of swine Toll-like receptor 5 (TLR5) (C1205T; P402L) that is related to the impaired recognition of Salmonella enterica serovar Choleraesuis (SC) was developed. The allele frequencies in several pig breeds in Japan and the Czech Republic were also compared. The swine TLR5 C1205T mutation was successfully determined by ASP-PCR using genomic DNA samples in Japan that had previously been genotyped by a sequencing method. Using the PCR condition determined, genomic DNA samples from blood obtained from 110 pigs from seven different breeds in the Czech Republic were genotyped by the ASP-PCR. The genotyping results from the ASP-PCR completely matched the results from the sequencing method. The allele frequency of the swine TLR5 C1205T mutation was 27.5% in the Landrace breed of the Czech Republic compared with 50.0% in Japanese Landrace. In Japan, the C1205T mutation was found only in the Landrace breed, whereas in the Czech Republic it was found in both the Landrace and Piétrain breeds. These results indicate the usefulness of ASP-PCR for detecting a specific SNP for swine TLR5 affecting ligand recognition. They also suggest the possibility of genetically improving pigs to enhance their resistance against SC infection by eliminating or selecting this specific SNP of swine TLR5. © 2012 The Societies and Blackwell Publishing Asia Pty Ltd.
Chaaba, Raja; Attia, Nebil; Hammami, Sonia; Smaoui, Maha; Mahjoub, Sylvia; Hammami, Mohamed; Masmoudi, Ahmed Slaheddine
2005-01-01
Background Apolipoprotein A-V (Apo A-V) gene has recently been identified as a new apolipoprotein involved in triglyceride metabolism. A single nucleotide polymorphism (SNP3) located in the gene promoter (-1131) was associated with triglyceride variation in healthy subjects. In type 2 diabetes the triglyceride level increased compared to healthy subjects. Hypertriglyceridemia is a risk factor for coronary artery disease. We aimed to examine the interaction between SNP3 and lipid profile and coronary artery disease (CAD) in Tunisian type 2 diabetic patients. Results The genotype frequencies of T/T, T/C and C/C were 0.74, 0.23 and 0.03 respectively in non diabetic subjects, 0.71, 0.25 and 0.04 respectively in type 2 diabetic patients. Triglyceride level was higher in heterozygous genotype (-1131 T/C) of apo A-V (p = 0.024). Heterozygous genotype is more frequent in high triglyceride group (40.9%) than in low triglyceride group (18.8%) ; p = 0.011. Despite the relation between CAD and hypertriglyceridemia the SNP 3 was not associated with CAD. Conclusion In type 2 diabetic patients SNP3 is associated with triglyceride level, however there was no association between SNP3 and coronary artery disease. PMID:15636639
Laios, Eleftheria; Drogari, Euridiki
2006-12-01
Three mutations in the low density lipoprotein receptor (LDLR) gene account for 49% of familial hypercholesterolemia (FH) cases in Greece. We used the microelectronic array technology of the NanoChip Molecular Biology Workstation to develop a multiplex method to analyze these single-nucleotide polymorphisms (SNPs). Primer pairs amplified the region encompassing each SNP. The biotinylated PCR amplicon was electronically addressed to streptavidin-coated microarray sites. Allele-specific fluorescently labeled oligonucleotide reporters were designed and used for detection of wild-type and SNP sequences. Genotypes were compared to PCR-restriction fragment length polymorphism (PCR-RFLP). We developed three monoplex assays (1 SNP/site) and an optimized multiplex assay (3SNPs/site). We performed 92 Greece II, 100 Genoa, and 98 Afrikaner-2 NanoChip monoplex assays (addressed to duplicate sites and analyzed separately). Of the 580 monoplex genotypings (290 samples), 579 agreed with RFLP. Duplicate sites of one sample were not in agreement with each other. Of the 580 multiplex genotypings, 576 agreed with the monoplex results. Duplicate sites of three samples were not in agreement with each other, indicating requirement for repetition upon which discrepancies were resolved. The multiplex assay detects common LDLR mutations in Greek FH patients and can be extended to accommodate additional mutations.
2011-01-01
Background Six previous studies have examined the relationships between single nucleotide polymorphisms (SNPs) in the IL13 gene and allergic rhinitis, but the results have been inconsistent. However, a recent meta-analysis using data from these 6 studies has shown that the A allele of IL13 SNP rs20541 was associated with an increased risk of allergic rhinitis, whereas no such relationship existed between IL13 SNP rs1800925 and allergic rhinitis. We investigated the associations between IL13 SNPs rs1800925 and rs20541 and the risk of rhinoconjunctivitis in Japanese women. Methods Included were 393 cases who met the criteria of the International Study of Asthma and Allergies in Childhood (ISAAC) for rhinoconjunctivitis. Control subjects were 767 women without rhinoconjunctivitis according to the ISAAC criteria, who had also not been diagnosed with allergic rhinitis by a doctor. Adjustment was made for age, region of residence, presence of older siblings, smoking, family history of allergic rhinitis, and education. Results Compared with the GG genotype of IL13 SNP rs20541, the AA genotype, occurring in 7.1% of control subjects, was significantly positively related to the risk of rhinoconjunctivitis: the adjusted odds ratio was 1.65 (95% confidence interval: 1.05 - 2.60). SNP rs1800925 was not associated with rhinoconjunctivitis. The haplotype comprising the rs1800925 C allele and the rs20541 A allele was significantly positively related to rhinoconjunctivitis. The multiplicative interactions between the two SNPs under study and smoking on the risk of rhinoconjunctivitis were not statistically significant. Based on the recessive model, however, the additive interaction between SNP rs1800925, but not rs20541, and smoking was significant. Conclusions This study suggests that the minor genotype of IL13 SNP rs20541 and the CA haplotype are significantly positively associated with the risk of rhinoconjunctivitis. In addition, a new pattern of biological interaction that affects the risk of rhinoconjunctivitis is described between SNP rs1800925 and smoking. PMID:22023794
Wong, Michelle; Öhrmalm, Lars; Broliden, Kristina; Aust, Carl; Hibberd, Martin; Tolfvenstam, Thomas
2012-01-01
Background Mannose-binding Lectin protein (MBL) has been suggested to be relevant in the defence against infections in immunosuppressed individuals. In a Swedish adult cohort immunosuppressed from both the underlying disease and from iatrogenic treatments for their underlying disease we investigated the role of MBL in susceptibility to infection. Methods In this cross sectional, prospective study, blood samples obtained from 96 neutropaenic febrile episodes, representing 82 individuals were analysed for single nucleotide polymorphism (SNP) in the MBL2 gene. Concurrent measurement of plasma MBL protein concentrations was also performed for observation of acute response during febrile episodes. Findings No association was observed between MBL2 genotype or plasma MBL concentrations, and the type or frequency of infection. Adding to the literature, we found no evidence that viral infections or co-infections with virus and bacteria would be predisposed by MBL deficiency. We further saw no correlation between MBL2 genotype and the risk of fever. However, fever duration in febrile neutropaenic episodes was negatively associated with MBL2 SNP mutations (p<0.05). Patients with MBL2 SNP mutations presented a median febrile duration of 1.8 days compared with 3 days amongst patients with wildtype MBL2 genotype. Interpretation We found no clear association between infection, or infection type to MBL2 genotypes or plasma MBL concentration, and add to the reports casting doubts on the benefit of recombinant MBL replacement therapy use during iatrogenic neutropaenia. PMID:22363494
Geraldes, A; Difazio, S P; Slavov, G T; Ranjan, P; Muchero, W; Hannemann, J; Gunter, L E; Wymore, A M; Grassa, C J; Farzaneh, N; Porth, I; McKown, A D; Skyba, O; Li, E; Fujita, M; Klápště, J; Martin, J; Schackwitz, W; Pennacchio, C; Rokhsar, D; Friedmann, M C; Wasteneys, G O; Guy, R D; El-Kassaby, Y A; Mansfield, S D; Cronk, Q C B; Ehlting, J; Douglas, C J; Tuskan, G A
2013-03-01
Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. For such studies, the use of large single nucleotide polymorphism (SNP) genotyping arrays still offers the most cost-effective solution. Herein we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species latitudinal range. We adopted a candidate gene approach to the array design that resulted in the selection of 34 131 SNPs, the majority of which are located in, or within 2 kb of, 3543 candidate genes. A subset of the SNPs on the array (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%. We demonstrate that even among small numbers of samples (n = 10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca. Finally, we provide evidence for the utility of the array to address evolutionary questions such as intraspecific studies of genetic differentiation, species assignment and the detection of natural hybrids. © 2013 Blackwell Publishing Ltd.
Yin, Z Z; Dong, X Y; Dong, D J; Ma, Y Z
2016-10-01
Single nucleotide polymorphisms (SNPs) in the exons of the myogenic factor 5 (MYF5) and Kruppel-like factor 15 (KLF15) genes were identified and analysed by using DNA sequencing methods in 60 female domestic pigeons (Columba livia). Five SNPs (T5067A, C5084T, C5101T, T5127A and C5154G) were detected in exon 3 of MYF5 and 6 SNPs (C1398T, C1464T, G1542A, C1929T, G1965A and A2355G) were found in exon 2 of KLF15, respectively. The analysis revealed three genotypes, in which the AA genotype was dominant and the A allele showed a dominant advantage. For the MYF5 gene, the C5084T and T5127A SNP genotypes were significantly associated with carcass traits of pigeons. Within those two SNPs, the BB genotype showed relatively higher trait association values than those of AA or AB genotypes. No significant association was observed between the KLF15 SNP genotypes and carcass traits. These results indicated that the MYF5 gene is a potential major gene affecting carcass traits in domestic pigeons. The BB genotype of the C5084T and T5127A SNPs could be a potential candidate genetic marker for marker-assisted selection in pigeon.
Kurt, Ozlem; Yilmaz-Aydogan, Hulya; Uyar, Mehmet; Isbir, Turgay; Seyhan, Mehmet Fatih; Can, Ayse
2012-06-01
It has been suggested that the estrogen receptor alpha (ERα) and vitamin D receptor (VDR) genes as possibly implicated in reduced bone mineral density (BMD) in osteoporosis. The present study investigated the relation of ERα PvuII/XbaI polymorphisms and VDR FokI/TaqI polymorphisms with BMD in Turkish postmenopausal women. Eighty-one osteoporotic and 122 osteopenic postmenopausal women were recruited. For detection of the polymorphisms, polymerase chain reaction-restriction fragment lenght polymorphism techniques have been used. BMD was measured at the lumbar spine and hip by dual-energy X-ray absorptiometry. Distributions of ERα (PvuII dbSNP: rs2234693, XbaI dbSNP: rs9340799) and VDR genotypes (FokI dbSNP rs10735810, TaqI dbSNP: rs731236) were similar in study population. Although overall prevalence of osteoporosis had no association with these genotypes, the prevalence of decreased femoral neck BMD values were higher in the subjects with ERα PvuII "PP" and ERα XbaI "XX" genotypes than in those with "Pp/pp" genotypes and "xx" genotype, respectively (P < 0.05). Furthermore, subjects with VDR FokI "FF" genotype had lower BMD values of femoral neck and total hip compared to those with "Ff" genotype (P < 0.05). In the logistic regression analysis, we confirmed the presence of relationships between the VDR FokI "FF" genotypes, BMI ≤ 27.5, age ≥ 55 and the increased risk of femoral neck BMD below 0.8 value in postmenopausal women. The present data suggests that the ERα PvuII/XbaI and VDR FokI polymorphisms may contribute to the determination of bone mineral density in Turkish postmenopausal women.
NASA Astrophysics Data System (ADS)
Ma, Ruiqin; He, Feng; Wen, Haishen; Li, Jifang; Shi, Bao; Shi, Dan; Liu, Miao; Mu, Weijie; Zhang, Yuanqing; Hu, Jian; Han, Weiguo; Zhang, Jianan; Wang, Qingqing; Yuan, Yuren; Liu, Qun
2012-03-01
As a specific gene of fish, cytochrome P450c17-II ( CYP17-II) gene plays a key role in the growth, development an reproduction level of fish. In this study, the single-stranded conformational polymorphism (SSCP) technique was used to characterize polymorphisms within the coding region of CYP17-II gene in a population of 75 male Japanese flounder ( Paralichthys olivaceus). Three single nucleotide polymorphisms (SNPs) were identified in CYP17-II gene of Japanese flounder. They were c.G594A (p.G188R), c.G939A and c.G1502A (p.G490D). SNP1 (c.G594A), located in exon 4 of CYP17-II gene, was significantly associated with gonadosomatic index (GSI). Individuals with genotype GG of SNP1 had significantly lower GSI ( P < 0.05) than those with genotype AA or AG. SNP2 (c.G939A) located at the CpG island of CYP17-II gene. The mutation changed the methylation of exon 6. Individuals with genotype AA of SNP2 had significantly lower serum testosterone (T) level and hepatosomatic index (HSI) compared to those with genotype GG. The results suggested that SNP2 could influence the reproductive endocrine of male Japanese flounder. However, the SNP3 (c.G1502A) located in exon 9 did not affect the four measured reproductive traits. This study showed that CYP17-II gene could be a potentially useful candidate gene for the research of genetic breeding and physiological aspects of Japanese flounder.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Ben Ayed, Rayda; Ben Hassen, Hanen; Ennouri, Karim; Rebai, Ahmed
2016-12-01
The genetic diversity of 22 olive tree cultivars (Olea europaea L.) sampled from different Mediterranean countries was assessed using 5 SNP markers (FAD2.1; FAD2.3; CALC; SOD and ANTHO3) located in four different genes. The genotyping analysis of the 22 cultivars with 5 SNP loci revealed 11 alleles (average 2.2 per allele). The dendrogram based on cultivar genotypes revealed three clusters consistent with the cultivars classification. Besides, the results obtained with the five SNPs were compared to those obtained with the SSR markers using bioinformatic analyses and by computing a cophenetic correlation coefficient, indicating the usefulness of the UPGMA method for clustering plant genotypes. Based on principal coordinate analysis using a similarity matrix, the first two coordinates, revealed 54.94 % of the total variance. This work provides a more comprehensive explanation of the diversity available in Tunisia olive cultivars, and an important contribution for olive breeding and olive oil authenticity.
Automated tetraploid genotype calling by hierarchical clustering
USDA-ARS?s Scientific Manuscript database
SNP arrays are transforming breeding and genetics research for autotetraploids. To fully utilize these arrays, however, the relationship between signal intensity and allele dosage must be inferred independently for each marker. We developed an improved computational method to automate this process, ...
Loughlin, J; Sinsheimer, J S; Mustafa, Z; Carr, A J; Clipsham, K; Bloomfield, V A; Chitnavis, J; Bailey, A; Sykes, B; Chapman, K
2000-03-01
Evidence has accumulated supporting a role for genes in the etiology of osteoarthritis (OA). Several candidates have been targeted as potential susceptibility loci including genes that are involved in the regulation of bone density. Genetic association analysis has suggested a role for the vitamin D receptor gene (VDR) and the estrogen receptor gene (ER) in susceptibility. Such findings must be tested in additional independent cohorts. We tested for association of these 2 genes, plus a third gene implicated in bone density, COL1A1, with idiopathic OA. A case-control cohort of 371 affected probands and 369 unaffected spouses was used. Association was tested using 4 intragenic single nucleotide polymorphisms (SNP), one each for the VDR and COL1A1 genes, and 2 for the ER gene. The VDR and ER SNP are the same SNP that have been associated with OA. All 4 SNP affect restriction enzyme sites and were genotyped using polymerase chain reaction and enzyme digestion. Allele and genotype distributions for each SNP were compared between cases and controls and analyzed using Fisher's exact test. There was no evidence of association of the VDR or the ER gene SNP to OA. There was weak evidence of association of the COL1A1 SNP in female cases (p = 0.017), reflected by a difference in the distribution of genotypes at this SNP between female cases and controls (p = 0.027). However, when corrected for multiple testing, these results were not significant. If the VDR, ER, or COL1A1 genes do encode predisposition to OA then the 4 SNP tested are not associated with major susceptibility alleles at these 3 loci.
Liu, Kaihua; Zhang, Bin; Teng, Zhaochun; Wang, Youtao; Dong, Guodong; Xu, Cong; Qin, Bo; Song, Chunlian; Chai, Jun; Li, Yang; Shi, Xianwei; Shu, Xianghua; Zhang, Yifang
2017-03-01
We investigated the associations between SLC11A1 polymorphisms and susceptibility to tuberculosis (TB) in Chinese Holstein cattle, using a case-control study of 136 animals that had positive reactions to TB tests and showed symptoms and 96 animals that had negative reactions to tests and showed no symptoms. Polymerase chain reaction (PCR) sequencing and the restriction fragment length polymorphism (RFLP) technique were used to detect and determine SLC11A1 polymorphisms. Association analysis identified significant correlations between SLC11A1 polymorphisms and susceptibility/resistance to TB, and two genetic markers for SLC11A1 were established using PCR-RFLP. Sequence alignment of SLC11A1 revealed seven single-nucleotide polymorphisms (SNPs). This is the first report of MaeII PCR-RFLP markers for the SLC11A1-SNP3 site and PstI PCR-RFLP markers for the SLC11A1-SNP5 and SLC11A1-SNP6 sites in Chinese Holstein cattle. Logistic regression analysis indicated that SLC11A1-SNP1, SLC11A1-SNP3, and SLC11A1-SNP5 were significantly associated with susceptibility/resistance to TB. Two genotypes of SLC11A1-SNP3 were susceptible to TB, whereas one genotype of SLC11A1-SNP1 and two genotypes of SLC11A1-SNP5 were resistant. Haplotype analysis showed that nine haplotypes were potentially resistant to TB. After Bonferroni correction, three of the haplotypes remained significantly associated with TB resistance. SLC11A1 is a useful candidate gene related to TB in Chinese Holstein cattle. Copyright © 2016 Elsevier Ltd. All rights reserved.
Qin, Sisi; Ingle, James N; Liu, Mohan; Yu, Jia; Wickerham, D Lawrence; Kubo, Michiaki; Weinshilboum, Richard M; Wang, Liewei
2017-08-18
We previously performed a case-control genome-wide association study in women treated with selective estrogen receptor modulators (SERMs) for breast cancer prevention and identified single nucleotide polymorphisms (SNPs) in ZNF423 as potential biomarkers for response to SERM therapy. The ZNF423rs9940645 SNP, which is approximately 200 bp away from the estrogen response elements, resulted in the SNP, estrogen, and SERM-dependent regulation of ZNF423 expression and, "downstream", that of BRCA1. Electrophoretic mobility shift assay-mass spectrometry was performed to identify proteins binding to the ZNF423 SNP and coordinating with estrogen receptor alpha (ERα). Clustered, regularly interspaced short palindromic repeats (CRISPR)/Cas9 genome editing was applied to generate ZR75-1 breast cancer cells with different ZNF423 SNP genotypes. Both cultured cells and mouse xenograft models with different ZNF423 SNP genotypes were used to study the cellular responses to SERMs and poly(ADP-ribose) polymerase (PARP) inhibitors. We identified calmodulin-like protein 3 (CALML3) as a key sensor of this SNP and a coregulator of ERα, which contributes to differential gene transcription regulation in an estrogen and SERM-dependent fashion. Furthermore, using CRISPR/Cas9-engineered ZR75-1 breast cancer cells with different ZNF423 SNP genotypes, striking differences in cellular responses to SERMs and PARP inhibitors, alone or in combination, were observed not only in cells but also in a mouse xenograft model. Our results have demonstrated the mechanism by which the ZNF423 rs9940645 SNP might regulate gene expression and drug response as well as its potential role in achieving more highly individualized breast cancer therapy.
Shi, Chao; Ge, Yujie; Gu, Hongxi; Ma, Cuiping
2011-08-15
Single nucleotide polymorphism (SNP) genotyping is attracting extensive attentions owing to its direct connections with human diseases including cancers. Here, we have developed a highly sensitive chemiluminescence biosensor based on circular strand-displacement amplification and the separation by magnetic beads reducing the background signal for point mutation detection at room temperature. This method took advantage of both the T4 DNA ligase recognizing single-base mismatch with high selectivity and the strand-displacement reaction of polymerase to perform signal amplification. The detection limit of this method was 1.3 × 10(-16)M, which showed better sensitivity than that of most of those reported detection methods of SNP. Additionally, the magnetic beads as carrier of immobility was not only to reduce the background signal, but also may have potential apply in high through-put screening of SNP detection in human genome. Copyright © 2011 Elsevier B.V. All rights reserved.
2013-01-01
Background The apparent effect of a single nucleotide polymorphism (SNP) on phenotype depends on the linkage disequilibrium (LD) between the SNP and a quantitative trait locus (QTL). However, the phase of LD between a SNP and a QTL may differ between Bos indicus and Bos taurus because they diverged at least one hundred thousand years ago. Here, we test the hypothesis that the apparent effect of a SNP on a quantitative trait depends on whether the SNP allele is inherited from a Bos taurus or Bos indicus ancestor. Methods Phenotype data on one or more traits and SNP genotype data for 10 181 cattle from Bos taurus, Bos indicus and composite breeds were used. All animals had genotypes for 729 068 SNPs (real or imputed). Chromosome segments were classified as originating from B. indicus or B. taurus on the basis of the haplotype of SNP alleles they contained. Consequently, SNP alleles were classified according to their sub-species origin. Three models were used for the association study: (1) conventional GWAS (genome-wide association study), fitting a single SNP effect regardless of subspecies origin, (2) interaction GWAS, fitting an interaction between SNP and subspecies-origin, and (3) best variable GWAS, fitting the most significant combination of SNP and sub-species origin. Results Fitting an interaction between SNP and subspecies origin resulted in more significant SNPs (i.e. more power) than a conventional GWAS. Thus, the effect of a SNP depends on the subspecies that the allele originates from. Also, most QTL segregated in only one subspecies, suggesting that many mutations that affect the traits studied occurred after divergence of the subspecies or the mutation became fixed or was lost in one of the subspecies. Conclusions The results imply that GWAS and genomic selection could gain power by distinguishing SNP alleles based on their subspecies origin, and that only few QTL segregate in both B. indicus and B. taurus cattle. Thus, the QTL that segregate in current populations likely resulted from mutations that occurred in one of the subspecies and can have both positive and negative effects on the traits. There was no evidence that selection has increased the frequency of alleles that increase body weight. PMID:24168700
IL-10 -1082 SNP and IL-10 in primary CNS and vitreoretinal lymphomas
Ramkumar, Hema L.; Shen, De Fen; Tuo, Jingsheng; Braziel, Rita M.; Coupland, Sarah E.; Smith, Justine R.
2012-01-01
Objectives Most primary central nervous system lymphomas (PCNSLs) and primary vitreoretinal lymphomas (PVRLs) are B-cell lymphomas that produce high levels of interleukin (IL)-10, which is linked to rapid disease progression. The IL-10-1082G→A polymorphism (IL-10 SNP) is associated with improved survival in certain non-CNS lymphoma patients. PDCD4 is a tumor suppressor gene and upstream regulator of IL-10. This study examined the correlation between the IL-10 SNP, PDCD4 mRNA expression, and IL-10 expression (at transcript and protein levels) in these lymphoma cells. Materials and methods Single-nucleotide polymorphism (SNP)-typing at IL-10-1082 was performed after micro-dissecting cytospun PVRL cells from 26 specimens. Vitreal IL-10 and IL-6 levels were measured by ELISA. PCNSL cells from 52 paraffin-embedded sections were microdissected and SNP typed on genomic DNA. RT-PCR was performed to analyze expression of IL-10 and PDCD4 mRNA. IL-10-1082 SNP typing was performed on blood samples of 96 healthy controls. We measured IL-10-1082 SNP expression in 26 PVRLs and 52 PCNSLs and examined its relationship with IL-10 protein and gene expression, respectively. Results More PVRL patients expressed one copy of the IL-10-1082G→A SNP with the GA genotype compared to controls. The frequencies of the three genotypes (AA, AG, GG) significantly differed in PVRL versus controls and in PCNSL versus controls. In PVRLs, the vitreal IL-10/IL-6 ratio was higher in IL-10-1082 AG and IL-10-1082 AA patients, compared to IL-10-1082 GG patients. IL-10 mRNA expression was higher in IL-10-1082 AG and IL-10-1082 AA PCNSLs, compared to IL-10-1082 GG PCNSLs. No correlation was found between IL-10 and PDCD4 expression levels in 37 PCNSL samples. Conclusions PVRL and PCNSL patients had similar IL-10-1082 A allele frequencies, but genotype distributions differed from healthy controls. The findings suggest that the IL-10-1082 A allele is a risk factor for higher IL-10 levels in PVRLs and PCNSLs. Higher IL-10 levels have been correlated with more aggressive disease in both PVRLs and PCNSLs, making this finding an important and potentially clinically significant observation. PMID:22628023
Toward optimal set of single nucleotide polymorphism investigation before IVF.
Ivanov, A V; Dedul, A G; Fedotov, Y N; Komlichenko, E V
2016-10-01
At present, the patient preparation for IVF needs to undergo a series of planned tests, including the genotyping of single nucleotide polymorphism (SNP) alleles of some genes. In former USSR countries, such investigation was not included in overwhelming majority of health insurance programs and paid by patient. In common, there are prerequisites to the study of more than 50 polymorphisms. An important faced task is to determine the optimal panel for SNP genotyping in terms of price/number of SNP. During 2009-2015 in the University Hospital of St. Petersburg State University, blood samples were analyzed from 550 women with different reproductive system disorders preparing for IVF and 46 healthy women in control group. In total, 28 SNP were analyzed in the genes of thrombophilia factors, folic acid cycle, detoxification system, and the renin-angiotensin system. The method used was real-time PCR. A significant increase in the frequency of pathological alleles of some polymorphisms in patients with habitual failure of IVF was shown, compared with the control group. As a result, two options defined panels for optimal typing SNP before IVF were composed. Standard panel includes 8 SNP, 5 in thromborhilic factors, and 3 in folic acid cycle genes. They are 20210 G > A of FII gene, R506Q G > A of FV gene (mutation Leiden), -675 5G > 4G of PAI-I gene, L33P T > C of ITGB3 gene, -455 G > A of FGB gene, 667 C > T of MTHFR gene, 2756 A > G of MTR gene, and 66 A > G of MTRR gene. Extended panel of 15 SNP also includes 807 C > T of ITGA2 gene, T154M C > T of GP1BA gene, second polymorphism 1298 A > C in MTHFR gene, polymorphisms of the renin-angiotensin gene AGT M235T T > C and -1166 A > C of AGTR1 gene, polymorphisms I105V A > G and A114V C > T of detoxification system gene GSTP. The results of SNP genotyping can be adjusted for treatment tactics and IVF, and also medical support getting pregnant. The success rate of IVF is increased as the result, especially in the group with the usual failure of IVF.
Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.
Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A
2006-08-01
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Identifying and mitigating batch effects in whole genome sequencing data.
Tom, Jennifer A; Reeder, Jens; Forrest, William F; Graham, Robert R; Hunkapiller, Julie; Behrens, Timothy W; Bhangale, Tushar R
2017-07-24
Large sample sets of whole genome sequencing with deep coverage are being generated, however assembling datasets from different sources inevitably introduces batch effects. These batch effects are not well understood and can be due to changes in the sequencing protocol or bioinformatics tools used to process the data. No systematic algorithms or heuristics exist to detect and filter batch effects or remove associations impacted by batch effects in whole genome sequencing data. We describe key quality metrics, provide a freely available software package to compute them, and demonstrate that identification of batch effects is aided by principal components analysis of these metrics. To mitigate batch effects, we developed new site-specific filters that identified and removed variants that falsely associated with the phenotype due to batch effect. These include filtering based on: a haplotype based genotype correction, a differential genotype quality test, and removing sites with missing genotype rate greater than 30% after setting genotypes with quality scores less than 20 to missing. This method removed 96.1% of unconfirmed genome-wide significant SNP associations and 97.6% of unconfirmed genome-wide significant indel associations. We performed analyses to demonstrate that: 1) These filters impacted variants known to be disease associated as 2 out of 16 confirmed associations in an AMD candidate SNP analysis were filtered, representing a reduction in power of 12.5%, 2) In the absence of batch effects, these filters removed only a small proportion of variants across the genome (type I error rate of 3%), and 3) in an independent dataset, the method removed 90.2% of unconfirmed genome-wide SNP associations and 89.8% of unconfirmed genome-wide indel associations. Researchers currently do not have effective tools to identify and mitigate batch effects in whole genome sequencing data. We developed and validated methods and filters to address this deficiency.
2012-01-01
Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety. PMID:22631220
CD44 Gene Polymorphisms in Breast Cancer Risk and Prognosis: A Study in North Indian Population
Tulsyan, Sonam; Agarwal, Gaurav; Lal, Punita; Agrawal, Sushma; Mittal, Rama Devi; Mittal, Balraj
2013-01-01
Background Cell surface biomarker CD44 plays an important role in breast cancer cell growth, differentiation, invasion, angiogenesis and tumour metastasis. Therefore, we aimed to investigate the role of CD44 gene polymorphisms in breast cancer risk and prognosis in North Indian population. Materials & Methods A total of 258 breast cancer patients and 241 healthy controls were included in the case-control study for risk prediction. According to RECIST, 114 patients who received neo-adjuvant chemotherapy were recruited for the evaluation of breast cancer prognosis. We examined the association of tagging SNP (rs353639) of Hapmap Gujrati Indians in Houston (GIH population) in CD44 gene along with a significant reported SNP (rs13347) in Chinese population by genotyping using Taqman allelic discrimination assays. Statistical analysis was done using SPSS software, version 17. In-silico analysis for prediction of functional effects was done using F-SNP and FAST-SNP. Results No significant association of both the genetic variants of the CD44 gene polymorphisms was found with breast cancer risk. On performing univariate analysis with clinicopathological characteristics and treatment response, we found significant association of genotype (CT+TT) of rs13347 polymorphism with earlier age of onset (P = 0.029, OR = 0.037). However, significance was lost in multivariate analysis. For rs353639 polymorphism, significant association was seen with clinical tumour size, both at the genotypic (AC+CC) (P = 0.039, OR = 3.02) as well as the allelic (C) (P = 0.042, OR = 2.87) levels. On performing multivariate analysis, increased significance of variant genotype (P = 0.017, OR = 4.29) and allele (P = 0.025, OR = 3.34) of rs353639 was found with clinical tumour size. In-silico analysis using F-SNP, showed altered transcriptional regulation for rs353639 polymorphism. Conclusions These findings suggest that CD44 rs353639 genetic variants may have significant effect in breast cancer prognosis. However, both the polymorphisms- rs13347 and rs353639 had no effect on breast cancer susceptibility. PMID:23940692
Increasing the number of single nucleotide polymorphisms used in genomic evaluations of dairy cattle
USDA-ARS?s Scientific Manuscript database
A small increase in the accuracy of genomic evaluations of dairy cattle was achieved by increasing the number of SNP used to 61,013. All the 45,195 SNP used previously were retained, and 15,818 SNP were selected from higher density genotyping chips if the magnitude of the SNP effect was among the to...
Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.
Baniecki, Mary Lynn; Faust, Aubrey L; Schaffner, Stephen F; Park, Daniel J; Galinsky, Kevin; Daniels, Rachel F; Hamilton, Elizabeth; Ferreira, Marcelo U; Karunaweera, Nadira D; Serre, David; Zimmerman, Peter A; Sá, Juliana M; Wellems, Thomas E; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E; Volkman, Sarah K; Wirth, Dyann F; Sabeti, Pardis C
2015-03-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.
Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections
Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.
2015-01-01
Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890
Hohenlohe, Paul A.; Day, Mitch D.; Amish, Stephen J.; Miller, Michael R.; Kamps-Hughes, Nick; Boyer, Matthew C.; Muhlfeld, Clint C.; Allendorf, Fred W.; Johnson, Eric A.; Luikart, Gordon
2013-01-01
Rapid and inexpensive methods for genomewide single nucleotide polymorphism (SNP) discovery and genotyping are urgently needed for population management and conservation. In hybridized populations, genomic techniques that can identify and genotype thousands of species-diagnostic markers would allow precise estimates of population- and individual-level admixture as well as identification of 'super invasive' alleles, which show elevated rates of introgression above the genomewide background (likely due to natural selection). Techniques like restriction-site-associated DNA (RAD) sequencing can discover and genotype large numbers of SNPs, but they have been limited by the length of continuous sequence data they produce with Illumina short-read sequencing. We present a novel approach, overlapping paired-end RAD sequencing, to generate RAD contigs of >300–400 bp. These contigs provide sufficient flanking sequence for design of high-throughput SNP genotyping arrays and strict filtering to identify duplicate paralogous loci. We applied this approach in five populations of native westslope cutthroat trout that previously showed varying (low) levels of admixture from introduced rainbow trout (RBT). We produced 77 141 RAD contigs and used these data to filter and genotype 3180 previously identified species-diagnostic SNP loci. Our population-level and individual-level estimates of admixture were generally consistent with previous microsatellite-based estimates from the same individuals. However, we observed slightly lower admixture estimates from genomewide markers, which might result from natural selection against certain genome regions, different genomic locations for microsatellites vs. RAD-derived SNPs and/or sampling error from the small number of microsatellite loci (n = 7). We also identified candidate adaptive super invasive alleles from RBT that had excessively high admixture proportions in hybridized cutthroat trout populations.
Rapid calculation of genomic evaluations for new animals
USDA-ARS?s Scientific Manuscript database
A method was developed to calculate preliminary genomic evaluations daily or weekly before the release of official monthly evaluations by processing only newly genotyped animals using estimates of SNP effects from the previous official evaluation. To minimize computing time, reliabilities and genomi...
Impact of pre-imputation SNP-filtering on genotype imputation results
2014-01-01
Background Imputation of partially missing or unobserved genotypes is an indispensable tool for SNP data analyses. However, research and understanding of the impact of initial SNP-data quality control on imputation results is still limited. In this paper, we aim to evaluate the effect of different strategies of pre-imputation quality filtering on the performance of the widely used imputation algorithms MaCH and IMPUTE. Results We considered three scenarios: imputation of partially missing genotypes with usage of an external reference panel, without usage of an external reference panel, as well as imputation of completely un-typed SNPs using an external reference panel. We first created various datasets applying different SNP quality filters and masking certain percentages of randomly selected high-quality SNPs. We imputed these SNPs and compared the results between the different filtering scenarios by using established and newly proposed measures of imputation quality. While the established measures assess certainty of imputation results, our newly proposed measures focus on the agreement with true genotypes. These measures showed that pre-imputation SNP-filtering might be detrimental regarding imputation quality. Moreover, the strongest drivers of imputation quality were in general the burden of missingness and the number of SNPs used for imputation. We also found that using a reference panel always improves imputation quality of partially missing genotypes. MaCH performed slightly better than IMPUTE2 in most of our scenarios. Again, these results were more pronounced when using our newly defined measures of imputation quality. Conclusion Even a moderate filtering has a detrimental effect on the imputation quality. Therefore little or no SNP filtering prior to imputation appears to be the best strategy for imputing small to moderately sized datasets. Our results also showed that for these datasets, MaCH performs slightly better than IMPUTE2 in most scenarios at the cost of increased computing time. PMID:25112433
Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe
2010-01-01
Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Cleveland, M A; Hickey, J M
2013-08-01
Genomic selection can be implemented in pig breeding at a reduced cost using genotype imputation. Accuracy of imputation and the impact on resulting genomic breeding values (gEBV) was investigated. High-density genotype data was available for 4,763 animals from a single pig line. Three low-density genotype panels were constructed with SNP densities of 450 (L450), 3,071 (L3k) and 5,963 (L6k). Accuracy of imputation was determined using 184 test individuals with no genotyped descendants in the data but with parents and grandparents genotyped using the Illumina PorcineSNP60 Beadchip. Alternative genotyping scenarios were created in which parents, grandparents, and individuals that were not direct ancestors of test animals (Other) were genotyped at high density (S1), grandparents were not genotyped (S2), dams and granddams were not genotyped (S3), and dams and granddams were genotyped at low density (S4). Four additional scenarios were created by excluding Other animal genotypes. Test individuals were always genotyped at low density. Imputation was performed with AlphaImpute. Genomic breeding values were calculated using the single-step genomic evaluation. Test animals were evaluated for the information retained in the gEBV, calculated as the correlation between gEBV using imputed genotypes and gEBV using true genotypes. Accuracy of imputation was high for all scenarios but decreased with fewer SNP on the low-density panel (0.995 to 0.965 for S1) and with reduced genotyping of ancestors, where the largest changes were for L450 (0.965 in S1 to 0.914 in S3). Exclusion of genotypes for Other animals resulted in only small accuracy decreases. Imputation accuracy was not consistent across the genome. Information retained in the gEBV was related to genotyping scenario and thus to imputation accuracy. Reducing the number of SNP on the low-density panel reduced the information retained in the gEBV, with the largest decrease observed from L3k to L450. Excluding Other animal genotypes had little impact on imputation accuracy but caused large decreases in the information retained in the gEBV. These results indicate that accuracy of gEBV from imputed genotypes depends on the level of genotyping in close relatives and the size of the genotyped dataset. Fewer high-density genotyped individuals are needed to obtain accurate imputation than are needed to obtain accurate gEBV. Strategies to optimize development of low-density panels can improve both imputation and gEBV accuracy.
Spyrou, Elena M; Kalogianni, Despina P; Tragoulias, Sotirios S; Ioannou, Penelope C; Christopoulos, Theodore K
2016-10-01
Chemi(bio)luminometric assays have contributed greatly to various areas of nucleic acid analysis due to their simplicity and detectability. In this work, we present the development of chemiluminometric genotyping methods in which (a) detection is performed by using either a conventional digital camera (at ambient temperature) or a smartphone and (b) a lateral flow assay configuration is employed for even higher simplicity and suitability for point of care or field testing. The genotyping of the C677T single nucleotide polymorphism (SNP) of methylenetetrahydropholate reductase (MTHFR) gene is chosen as a model. The interrogated DNA sequence is amplified by polymerase chain reaction (PCR) followed by a primer extension reaction. The reaction products are captured through hybridization on the sensing areas (spots) of the strip. Streptavidin-horseradish peroxidase conjugate is used as a reporter along with a chemiluminogenic substrate. Detection of the emerging chemiluminescence from the sensing areas of the strip is achieved by digital camera or smartphone. For this purpose, we constructed a 3D-printed smartphone attachment that houses inexpensive lenses and converts the smartphone into a portable chemiluminescence imager. The device enables spatial discrimination of the two alleles of a SNP in a single shot by imaging of the strip, thus avoiding the need of dual labeling. The method was applied successfully to genotyping of real clinical samples. Graphical abstract Paper-based genotyping assays using digital camera and smartphone as detectors.
Quality control and quality assurance in genotypic data for genome-wide association studies
Laurie, Cathy C.; Doheny, Kimberly F.; Mirel, Daniel B.; Pugh, Elizabeth W.; Bierut, Laura J.; Bhangale, Tushar; Boehm, Frederick; Caporaso, Neil E.; Cornelis, Marilyn C.; Edenberg, Howard J.; Gabriel, Stacy B.; Harris, Emily L.; Hu, Frank B.; Jacobs, Kevin; Kraft, Peter; Landi, Maria Teresa; Lumley, Thomas; Manolio, Teri A.; McHugh, Caitlin; Painter, Ian; Paschall, Justin; Rice, John P.; Rice, Kenneth M.; Zheng, Xiuwen; Weir, Bruce S.
2011-01-01
Genome-wide scans of nucleotide variation in human subjects are providing an increasing number of replicated associations with complex disease traits. Most of the variants detected have small effects and, collectively, they account for a small fraction of the total genetic variance. Very large sample sizes are required to identify and validate findings. In this situation, even small sources of systematic or random error can cause spurious results or obscure real effects. The need for careful attention to data quality has been appreciated for some time in this field, and a number of strategies for quality control and quality assurance (QC/QA) have been developed. Here we extend these methods and describe a system of QC/QA for genotypic data in genome-wide association studies. This system includes some new approaches that (1) combine analysis of allelic probe intensities and called genotypes to distinguish gender misidentification from sex chromosome aberrations, (2) detect autosomal chromosome aberrations that may affect genotype calling accuracy, (3) infer DNA sample quality from relatedness and allelic intensities, (4) use duplicate concordance to infer SNP quality, (5) detect genotyping artifacts from dependence of Hardy-Weinberg equilibrium (HWE) test p-values on allelic frequency, and (6) demonstrate sensitivity of principal components analysis (PCA) to SNP selection. The methods are illustrated with examples from the ‘Gene Environment Association Studies’ (GENEVA) program. The results suggest several recommendations for QC/QA in the design and execution of genome-wide association studies. PMID:20718045
Chen, L; Schenkel, F; Vinsky, M; Crews, D H; Li, C
2013-10-01
In beef cattle, phenotypic data that are difficult and/or costly to measure, such as feed efficiency, and DNA marker genotypes are usually available on a small number of animals of different breeds or populations. To achieve a maximal accuracy of genomic prediction using the phenotype and genotype data, strategies for forming a training population to predict genomic breeding values (GEBV) of the selection candidates need to be evaluated. In this study, we examined the accuracy of predicting GEBV for residual feed intake (RFI) based on 522 Angus and 395 Charolais steers genotyped on SNP with the Illumina Bovine SNP50 Beadchip for 3 training population forming strategies: within breed, across breed, and by pooling data from the 2 breeds (i.e., combined). Two other scenarios with the training and validation data split by birth year and by sire family within a breed were also investigated to assess the impact of genetic relationships on the accuracy of genomic prediction. Three statistical methods including the best linear unbiased prediction with the relationship matrix defined based on the pedigree (PBLUP), based on the SNP genotypes (GBLUP), and a Bayesian method (BayesB) were used to predict the GEBV. The results showed that the accuracy of the GEBV prediction was the highest when the prediction was within breed and when the validation population had greater genetic relationships with the training population, with a maximum of 0.58 for Angus and 0.64 for Charolais. The within-breed prediction accuracies dropped to 0.29 and 0.38, respectively, when the validation populations had a minimal pedigree link with the training population. When the training population of a different breed was used to predict the GEBV of the validation population, that is, across-breed genomic prediction, the accuracies were further reduced to 0.10 to 0.22, depending on the prediction method used. Pooling data from the 2 breeds to form the training population resulted in accuracies increased to 0.31 and 0.43, respectively, for the Angus and Charolais validation populations. The results suggested that the genetic relationship of selection candidates with the training population has a greater impact on the accuracy of GEBV using the Illumina Bovine SNP50 Beadchip. Pooling data from different breeds to form the training population will improve the accuracy of across breed genomic prediction for RFI in beef cattle.
Leblanc, N; Cortey, M; Fernandez Pinero, J; Gallardo, C; Masembe, C; Okurut, A R; Heath, L; van Heerden, J; Sánchez-Vizcaino, J M; Ståhl, K; Belák, S
2013-08-01
African swine fever virus (ASFV) causes one of the most dreaded transboundary animal diseases (TADs) in Suidae. African swine fever (ASF) often causes high rates of morbidity and mortality, which can reach 100% in domestic swine. To date, serological diagnosis has the drawback of not being able to differentiate variants of this virus. Previous studies have identified the 22 genotypes based on sequence variation in the C-terminal region of the p72 gene, which has become the standard for categorizing ASFVs. This article describes a genotyping assay developed using a segment of PCR-amplified genomic DNA of approximately 450 bp, which encompasses the C-terminal end of the p72 gene. Complementary paired DNA probes of 15 or 17 bp in length, which are identical except for a single nucleotide polymorphism (SNP) in the central position, were designed to either individually or in combination differentiate between the 22 genotypes. The assay was developed using xMAP technology; probes were covalently linked to microspheres, hybridized to PCR product, labelled with a reporter and read in the Luminex 200 analyzer. Characterization of the sample was performed by comparing fluorescence of the paired SNP probes, that is, the probe with higher fluorescence in a complementary pair identified the SNP that a particular sample possessed. In the final assay, a total of 52 probes were employed, 24 SNP pairs and 4 for general detection. One or more samples from each of the 22 genotypes were tested. The assay was able to detect and distinguish all 22 genotypes. This novel assay provides a powerful novel tool for the simultaneous rapid diagnosis and genotypic differentiation of ASF. © 2012 Blackwell Verlag GmbH.
Xu, Jin; Lu, Zhigang; Xu, Mingming; Pan, Ling; Deng, Yi; Xie, Xiaohu; Liu, Huifen; Ding, Shixiong; Hurd, Yasmin L.; Pasternak, Gavril W.; Klein, Robert J.; Cartegni, Luca
2014-01-01
Single nucleotide polymorphisms (SNPs) in the OPRM1 gene have been associated with vulnerability to opioid dependence. The current study identifies an association of an intronic SNP (rs9479757) with the severity of heroin addiction among Han-Chinese male heroin addicts. Individual SNP analysis and haplotype-based analysis with additional SNPs in the OPRM1 locus showed that mild heroin addiction was associated with the AG genotype, whereas severe heroin addiction was associated with the GG genotype. In vitro studies such as electrophoretic mobility shift assay, minigene, siRNA, and antisense morpholino oligonucleotide studies have identified heterogeneous nuclear ribonucleoprotein H (hnRNPH) as the major binding partner for the G-containing SNP site. The G-to-A transition weakens hnRNPH binding and facilitates exon 2 skipping, leading to altered expressions of OPRM1 splice-variant mRNAs and hMOR-1 proteins. Similar changes in splicing and hMOR-1 proteins were observed in human postmortem prefrontal cortex with the AG genotype of this SNP when compared with the GG genotype. Interestingly, the altered splicing led to an increase in hMOR-1 protein levels despite decreased hMOR-1 mRNA levels, which is likely contributed by a concurrent increase in single transmembrane domain variants that have a chaperone-like function on MOR-1 protein stability. Our studies delineate the role of this SNP as a modifier of OPRM1 alternative splicing via hnRNPH interactions, and suggest a functional link between an SNP-containing splicing modifier and the severity of heroin addiction. PMID:25122903
Technical note: Equivalent genomic models with a residual polygenic effect.
Liu, Z; Goddard, M E; Hayes, B J; Reinhardt, F; Reents, R
2016-03-01
Routine genomic evaluations in animal breeding are usually based on either a BLUP with genomic relationship matrix (GBLUP) or single nucleotide polymorphism (SNP) BLUP model. For a multi-step genomic evaluation, these 2 alternative genomic models were proven to give equivalent predictions for genomic reference animals. The model equivalence was verified also for young genotyped animals without phenotypes. Due to incomplete linkage disequilibrium of SNP markers to genes or causal mutations responsible for genetic inheritance of quantitative traits, SNP markers cannot explain all the genetic variance. A residual polygenic effect is normally fitted in the genomic model to account for the incomplete linkage disequilibrium. In this study, we start by showing the proof that the multi-step GBLUP and SNP BLUP models are equivalent for the reference animals, when they have a residual polygenic effect included. Second, the equivalence of both multi-step genomic models with a residual polygenic effect was also verified for young genotyped animals without phenotypes. Additionally, we derived formulas to convert genomic estimated breeding values of the GBLUP model to its components, direct genomic values and residual polygenic effect. Third, we made a proof that the equivalence of these 2 genomic models with a residual polygenic effect holds also for single-step genomic evaluation. Both the single-step GBLUP and SNP BLUP models lead to equal prediction for genotyped animals with phenotypes (e.g., reference animals), as well as for (young) genotyped animals without phenotypes. Finally, these 2 single-step genomic models with a residual polygenic effect were proven to be equivalent for estimation of SNP effects, too. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
The role of the folate pathway in pancreatic cancer risk
Chittiboyina, Shirisha; Chen, Zhongxue; Chiorean, E. Gabriela; Kamendulis, Lisa M.
2018-01-01
Background Pancreatic cancer is the third leading cause of cancer related deaths in the United States. Several dietary factors have been identified that modify pancreatic cancer risk, including low folate levels. In addition to nutrition and lifestyle determinants, folate status may be influenced by genetic factors such as single nucleotide polymorphisms (SNPs). In the present study, we investigated the association between folate levels, genetic polymorphisms in genes of the folate pathway, and pancreatic cancer. Methods Serum and red blood cell (RBC) folate levels were measured in pancreatic cancer and control subjects. Genotypes were determined utilizing Taqman probes and SNP frequencies between cases and controls were assessed using Fisher’s exact test. Logistic regression was used to estimate the odds ratio (OR) and corresponding 95% confidence intervals (CIs) to measure the association between genotypes and pancreatic cancer risk. The association between folate levels and SNP expression was calculated using one-way ANOVA. Results Mean RBC folate levels were significantly lower in pancreatic cancer cases compared to unrelated controls (508.4 ± 215.9 ng/mL vs 588.3 ± 229.2 ng/mL, respectively) whereas serum folate levels were similar. Irrespective of cancer status, several SNPs were found to be associated with altered serum folate concentrations, including the D919G SNP in methionine synthase (MTR), the L474F SNP in serine hydroxymethyl transferase 1 (SHMT1) and the V175M SNP in phosphatidyl ethanolamine methyltransferase (PEMT). Further, the V allele of the A222V SNP and the E allele of the E429A SNP in methylene tetrahydrofolate reductase (MTHFR) were associated with low RBC folate levels. Pancreatic cancer risk was found to be significantly lower for the LL allele of the L78R SNP in choline dehydrogenase (CHDH; OR = 0.29; 95% CI 0.12–0.76); however, it was not associated with altered serum or RBC folate levels. PMID:29474406
Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S
2018-01-01
Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.), is a major disease in wheat ( Triticum aestivium L.). However, in recent years it occurs rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied genome-wide association study (GWAS) on a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers, which were significantly (Bonferroni corrected P < 0.05) associated with the resistance on chromosome 2D. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of Sr6 gene which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r 2 ) was found between the significant SNPs and the specific SSR marker for the Sr6 gene ( Xcfd43 ). This suggests the significant SNP markers are tagging Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All the 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection, genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) marker for the Sr6 gene. Novel SNPs for Sr6 gene, an important stem rust resistant gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
Sulovari, Arvis; Li, Dawei
2014-07-19
Genome-wide association studies (GWAS) have successfully identified genes associated with complex human diseases. Although much of the heritability remains unexplained, combining single nucleotide polymorphism (SNP) genotypes from multiple studies for meta-analysis will increase the statistical power to identify new disease-associated variants. Meta-analysis requires same allele definition (nomenclature) and genome build among individual studies. Similarly, imputation, commonly-used prior to meta-analysis, requires the same consistency. However, the genotypes from various GWAS are generated using different genotyping platforms, arrays or SNP-calling approaches, resulting in use of different genome builds and allele definitions. Incorrect assumptions of identical allele definition among combined GWAS lead to a large portion of discarded genotypes or incorrect association findings. There is no published tool that predicts and converts among all major allele definitions. In this study, we have developed a tool, GACT, which stands for Genome build and Allele definition Conversion Tool, that predicts and inter-converts between any of the common SNP allele definitions and between the major genome builds. In addition, we assessed several factors that may affect imputation quality, and our results indicated that inclusion of singletons in the reference had detrimental effects while ambiguous SNPs had no measurable effect. Unexpectedly, exclusion of genotypes with missing rate > 0.001 (40% of study SNPs) showed no significant decrease of imputation quality (even significantly higher when compared to the imputation with singletons in the reference), especially for rare SNPs. GACT is a new, powerful, and user-friendly tool with both command-line and interactive online versions that can accurately predict, and convert between any of the common allele definitions and between genome builds for genome-wide meta-analysis and imputation of genotypes from SNP-arrays or deep-sequencing, particularly for data from the dbGaP and other public databases. http://www.uvm.edu/genomics/software/gact.
Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.
2009-01-01
Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876
Leyva-Corona, Jose C; Reyna-Granados, Javier R; Zamorano-Algandar, Ricardo; Sanchez-Castro, Miguel A; Thomas, Milton G; Enns, R Mark; Speidel, Scott E; Medrano, Juan F; Rincon, Gonzalo; Luna-Nevarez, Pablo
2018-06-20
Prolactin (PRL), growth hormone (GH), and insulin-like growth factor-1 (IGF-1) are in hormone-response pathways involved in energy metabolism during thermoregulation processes in cattle. Objective herein was to study the association between single nucleotide polymorphisms (SNP) within genes of the PRL and GH/IGF-1 pathways with fertility traits such as services per conception (SPC) and days open (DO) in Holstein cattle lactating under a hot-humid climate. Ambient temperature and relative humidity were used to calculate the temperature-humidity index (THI) which revealed that the cows were exposed to heat stress conditions from June to November of 2012 in southern Sonora, Mexico. Individual blood samples from all cows were collected, spotted on FTA cards, and used to genotype a 179 tag SNP panel within 44 genes from the PRL and GH/IGF-1 pathways. The associative analyses among SNP genotypes and fertility traits were performed using mixed-effect models. Allele substitution effects were calculated using a regression model that included the genotype term as covariate. Single-SNP association analyses indicated that eight SNP within the genes IGF-1, IGF-1R, IGFBP5, PAPPA1, PMCH, PRLR, SOCS5, and SSTR2 were associated with SPC (P < 0.05), whereas four SNP in the genes GHR, PAPPA2, PRLR, and SOCS4 were associated with DO (P < 0.05). In conclusion, SNP within genes of the PRL and GH/IGF-1 pathways resulted as predictors of reproductive phenotypes in heat-stressed Holstein cows, and these SNP are proposed as candidates for a marker-assisted selection program intended to improve fertility of dairy cattle raised in warm climates.
Estimation of genomic breeding values for milk yield in UK dairy goats.
Mucha, S; Mrode, R; MacLaren-Lee, I; Coffey, M; Conington, J
2015-11-01
The objective of this study was to estimate genomic breeding values for milk yield in crossbred dairy goats. The research was based on data provided by 2 commercial goat farms in the UK comprising 590,409 milk yield records on 14,453 dairy goats kidding between 1987 and 2013. The population was created by crossing 3 breeds: Alpine, Saanen, and Toggenburg. In each generation the best performing animals were selected for breeding, and as a result, a synthetic breed was created. The pedigree file contained 30,139 individuals, of which 2,799 were founders. The data set contained test-day records of milk yield, lactation number, farm, age at kidding, and year and season of kidding. Data on milk composition was unavailable. In total 1,960 animals were genotyped with the Illumina 50K caprine chip. Two methods for estimation of genomic breeding value were compared-BLUP at the single nucleotide polymorphism level (BLUP-SNP) and single-step BLUP. The highest accuracy of 0.61 was obtained with single-step BLUP, and the lowest (0.36) with BLUP-SNP. Linkage disequilibrium (r(2), the squared correlation of the alleles at 2 loci) at 50 kb (distance between 2 SNP) was 0.18. This is the first attempt to implement genomic selection in UK dairy goats. Results indicate that the single-step method provides the highest accuracy for populations with a small number of genotyped individuals, where the number of genotyped males is low and females are predominant in the reference population. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.
Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi
2013-03-01
New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. Copyright © 2012 Elsevier B.V. All rights reserved.
Luo, Huajie; Wu, Hao; Shen, Hailian; Chen, Haifeng; Yang, Tao; Huang, Zhiwu; Jin, Xiaojie; Pang, Xiuhong; Li, Lei; Hu, Xianting; Jiang, Xuemei; Fan, Zhuping; Li, Jiping
2016-07-01
This study aimed to test the association between the European GWAS-identified risk IQGAP2 SNP rs457717 (A>G) and age-related hearing impairment (ARHI) in a Han male Chinese (HMC) population. A total of 2420 HMC subjects were divided into two groups [group 70+: >70 years (n = 1306), and group 70-: ≤70 years (n = 1114)]. The participants were categorised into case and control groups according to Z high scores for group 70- and the severity of hearing loss and different audiogram shapes identified by K-means cluster analysis for group 70+. The IQGAP2 tagSNP rs457717 was genotyped in accordance with the different ARHI phenotypes. The genotype distributions of IQGAP2 (AA/AG/GG) were not significantly different between the case and control groups (P = 0.613 for group 70-; P = 0.602 for group 70+). Compared with genotype AA, the ORs of genotypes AG and GG for ARHI were not significantly different following adjustment for other environmental risk factors. We demonstrated that the IQGAP2 TagSNP rs457717 (A/G) was not associated with ARHI in HMC individuals.
van Manen, Daniëlle; Delaneau, Olivier; Kootstra, Neeltje A.; Boeser-Nunnink, Brigitte D.; Limou, Sophie; Bol, Sebastiaan M.; Burger, Judith A.; Zwinderman, Aeilko H.; Moerland, Perry D.; van 't Slot, Ruben; Zagury, Jean-François; van 't Wout, Angélique B.; Schuitemaker, Hanneke
2011-01-01
Background AIDS develops typically after 7–11 years of untreated HIV-1 infection, with extremes of very rapid disease progression (<2 years) and long-term non-progression (>15 years). To reveal additional host genetic factors that may impact on the clinical course of HIV-1 infection, we designed a genome-wide association study (GWAS) in 404 participants of the Amsterdam Cohort Studies on HIV-1 infection and AIDS. Methods The association of SNP genotypes with the clinical course of HIV-1 infection was tested in Cox regression survival analyses using AIDS-diagnosis and AIDS-related death as endpoints. Results Multiple, not previously identified SNPs, were identified to be strongly associated with disease progression after HIV-1 infection, albeit not genome-wide significant. However, three independent SNPs in the top ten associations between SNP genotypes and time between seroconversion and AIDS-diagnosis, and one from the top ten associations between SNP genotypes and time between seroconversion and AIDS-related death, had P-values smaller than 0.05 in the French Genomics of Resistance to Immunodeficiency Virus cohort on disease progression. Conclusions Our study emphasizes that the use of different phenotypes in GWAS may be useful to unravel the full spectrum of host genetic factors that may be associated with the clinical course of HIV-1 infection. PMID:21811574
Adiponectin and resistin gene polymorphisms in association with their respective adipokine levels.
Lau, Cia-Hin; Muniandy, Sekaran
2011-05-01
Single nucleotide polymorphisms (SNPs) at the adiponectin and resistin loci are strongly associated with hypoadiponectinemia and hyperresistinemia, which may eventually increase risk of insulin resistance, type 2 diabetes (T2DM), metabolic syndrome (MS), and cardiovascular disease. Real-time PCR was used to genotype SNPs of the adiponectin (SNP+45T>G, SNP+276G>T, SNP+639T>C, and SNP+1212A>G) and resistin (SNP-420C>G and SNP+299G>A) genes in 809 Malaysian men (208 controls, 174 MS without T2DM, 171 T2DM without MS, 256 T2DM with MS) whose ages ranged between 40 and 70 years old. The genotyping results for each SNP marker was verified by sequencing. The anthropometric clinical and metabolic parameters of subjects were recorded. None of these SNPs at the adiponectin and resistin loci were associated with T2DM and MS susceptibility in Malaysian men. SNP+45T>G, SNP+276G>T, and SNP+639T>C of the adiponectin gene did not influence circulating levels of adiponectin. However, the G-allele of SNP+1212A>G at the adiponectin locus was marginally associated (P= 0.0227) with reduced circulating adiponectin levels. SNP-420C>G (df = 2; F= 16.026; P= 1.50×10(-7) ) and SNP+299G>A (df = 2; F= 22.944; P= 2.04×10(-10) ) of the resistin gene were strongly associated with serum resistin levels. Thus, SNP-420C>G and SNP+299G>A of the resistin gene are strongly associated with the risk of hyperresistinemia in Malaysian men. © 2011 The Authors Annals of Human Genetics © 2011 Blackwell Publishing Ltd/University College London.
A flexible bayesian model for testing for transmission ratio distortion.
Casellas, Joaquim; Manunza, Arianna; Mercader, Anna; Quintanilla, Raquel; Amills, Marcel
2014-12-01
Current statistical approaches to investigate the nature and magnitude of transmission ratio distortion (TRD) are scarce and restricted to the most common experimental designs such as F2 populations and backcrosses. In this article, we describe a new Bayesian approach to check TRD within a given biallelic genetic marker in a diploid species, providing a highly flexible framework that can accommodate any kind of population structure. This model relies on the genotype of each offspring and thus integrates all available information from either the parents' genotypes or population-specific allele frequencies and yields TRD estimates that can be corroborated by the calculation of a Bayes factor (BF). This approach has been evaluated on simulated data sets with appealing statistical performance. As a proof of concept, we have also tested TRD in a porcine population with five half-sib families and 352 offspring. All boars and piglets were genotyped with the Porcine SNP60 BeadChip, whereas genotypes from the sows were not available. The SNP-by-SNP screening of the pig genome revealed 84 SNPs with decisive evidences of TRD (BF > 100) after accounting for multiple testing. Many of these regions contained genes related to biological processes (e.g., nucleosome assembly and co-organization, DNA conformation and packaging, and DNA complex assembly) that are critically associated with embryonic viability. The implementation of this method, which overcomes many of the limitations of previous approaches, should contribute to fostering research on TRD in both model and nonmodel organisms. Copyright © 2014 by the Genetics Society of America.
Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng
2014-01-15
Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principle coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.
Mao, H G; Dong, X Y; Cao, H Y; Xu, N Y; Yin, Z Z
2018-04-01
1. Diacylglycerol acyltransferase (DGAT) plays an important role in the synthesis of triacylglycerol, but its effects on meat quality and carcass composition in pigeons are unclear. In this study, single-nucleotide polymorphisms (SNPs) in the exons of the DGAT2 gene were identified and analysed by using DNA sequencing methods in 200 domestic pigeons (Columba livia). The associations between DGAT2 polymorphisms and carcass and meat quality traits were also analysed. 2. Sequencing results showed that 5 nucleotide mutations were detected in exons 3, 4, 5 and 6 of the DGAT2 gene. The analysis revealed three genotypes (AA, AB and BB) in G18398T and G22484C, in which the AA genotype and A allele had the highest frequency. 3. In the SNP of G18398T located in exon 5, individuals with genotype BB had significantly higher meat quality and lower abdominal fat content than those with AA or AB genotype. In the SNP of G22484C located in exon 6, the genotype AA showed highest carcass trait values, while the genotype BB represented better meat quality, compared to AA and AB genotypes. 4. The results imply that DGAT2 gene has a close relationship with carcass and meat quality traits in pigeons, and the SNPs of G18398T and G22484C can be used as genetic markers for marker-assisted breeding in pigeon.
Zhang, Qin; Bai, Bao-Ling; Liu, Xiao-Zhen; Miao, Chun-Yue; Li, Hui-Li
2014-08-01
To explore the association of polymorphisms in folate metabolism genes, methionine synthase reductase (MTRR) gene and 5,10-methylenetetrahydrofolate reductase (MTHFR) gene, with complex congenital abnormalities and to further investigate its association with complex congenital abnormalities derived from three germ layers. A total of 250 cases of birth defects (with complex congenital abnormalities including congenital heart disease, neural tube defects, and craniofacial anomalies) in Shanxi Province, China were included in the study. MTRR single nucleotide polymorphism (SNP) (rs1801394) and MTHFR SNP (rs1801133) were genotyped by the SNaPshot method, and the genotyping results were compared with those of controls (n=420). SNPs rs1801394 and rs1801133 were associated with multiple birth defects. For the recessive model, individuals with GG genotype at rs1801394 and CC genotype at rs1801133 had a relatively low risk of developing birth defects, so the two genotypes were protective factors against birth defects. The homozygous recessive genotype at rs1801133, which served as a protective factor, was associated with ectoderm- or endoderm-derived complex congenital abnormalities, while the homozygous recessive genotype at rs1801394, which served as a protective factor, was associated with ectoderm-, mesoderm- or endoderm-derived complex congenital abnormalities. Among the Chinese population in Shanxi Province, the SNPs in folate metabolism genes (MTRR and MTHFR) are associated with complex congenital abnormalities and related to ectoderm, mesoderm or endoderm development.
Characterization of genetic variability of Venezuelan equine encephalitis viruses
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...
2016-04-07
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
HUMAN HtrA1 IN THE ARCHIVED EYES WITH AGE-RELATED MACULAR DEGENERATION
Chan, Chi-Chao; Shen, Defen; Zhou, Min; Ross, Robert J.; Ding, Xiaoyan; Zhang, Kang; Green, W. Richard; Tuo, Jingsheng
2007-01-01
Purpose HtrA1 belongs to the high temperature requirement factor A family of serine proteases, which are involved in protein quality control and cell fate. A single-nucleotide polymorphism (SNP), rs11200638, in the promoter of HtrA1 at chromosome 10q26 is reported as a likely causal variant for age-related macular degeneration (AMD). The SNP is located in the regulatory region and increases production of HtrA1 protein. This study investigates HtrA1 expression and SNP genotypes in archived ocular slides with AMD. Methods Macular, nonretinal, and peripheral retinal cells were microdissected from archived slides from 57 eyes with AMD and 16 age-matched, non-AMD controls. HtrA1 rs11200638 SNP genotyping was performed using polymerase chain reaction (PCR) and restriction fragment length polymorphism analysis. HtrA1 transcripts were measured using real-time reverse transcriptase–PCR. HtrA1 protein expression was evaluated using avidin-biotin complex immunohistochemistry. Results HtrA1 (G/A) SNP was successfully genotyped in 52 AMD cases and 13 non-AMD subjects. The frequencies of the risk allele (A) were 55 of 104 (52.9%) and 8 of 26 (30.8%) in AMD and control groups, respectively. HtrA1 mRNA was detected in normal peripheral and macular retinas, higher in the periphery than maculae. HtrA1 mRNA was much higher in the macula and a lot lower in the periphery of the AMD eyes as compared to control eyes. HtrA1 protein was expressed in normal retinal vascular endothelia and retinal pigment epithelia. Intense immunoreaction against HtrA1 was found in AMD lesions, slightly more in wet than dry AMD lesions. Conclusion This study successfully analyzes HtrA1 SNP and transcript expression in microdissected cells from archived paraffin fixed slides. Up-regulation of HtrA1 is detected in the macular lesions of AMD eyes. The data further suggest that rs11200638 in HtrA1 promoter is associated with AMD development. PMID:18427598
Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng
2009-02-01
High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.
On marker-based parentage verification via non-linear optimization.
Boerner, Vinzent
2017-06-15
Parentage verification by molecular markers is mainly based on short tandem repeat markers. Single nucleotide polymorphisms (SNPs) as bi-allelic markers have become the markers of choice for genotyping projects. Thus, the subsequent step is to use SNP genotypes for parentage verification as well. Recent developments of algorithms such as evaluating opposing homozygous SNP genotypes have drawbacks, for example the inability of rejecting all animals of a sample of potential parents. This paper describes an algorithm for parentage verification by constrained regression which overcomes the latter limitation and proves to be very fast and accurate even when the number of SNPs is as low as 50. The algorithm was tested on a sample of 14,816 animals with 50, 100 and 500 SNP genotypes randomly selected from 40k genotypes. The samples of putative parents of these animals contained either five random animals, or four random animals and the true sire. Parentage assignment was performed by ranking of regression coefficients, or by setting a minimum threshold for regression coefficients. The assignment quality was evaluated by the power of assignment (P[Formula: see text]) and the power of exclusion (P[Formula: see text]). If the sample of putative parents contained the true sire and parentage was assigned by coefficient ranking, P[Formula: see text] and P[Formula: see text] were both higher than 0.99 for the 500 and 100 SNP genotypes, and higher than 0.98 for the 50 SNP genotypes. When parentage was assigned by a coefficient threshold, P[Formula: see text] was higher than 0.99 regardless of the number of SNPs, but P[Formula: see text] decreased from 0.99 (500 SNPs) to 0.97 (100 SNPs) and 0.92 (50 SNPs). If the sample of putative parents did not contain the true sire and parentage was rejected using a coefficient threshold, the algorithm achieved a P[Formula: see text] of 1 (500 SNPs), 0.99 (100 SNPs) and 0.97 (50 SNPs). The algorithm described here is easy to implement, fast and accurate, and is able to assign parentage using genomic marker data with a size as low as 50 SNPs.
Genomic prediction using imputed whole-genome sequence data in Holstein Friesian cattle.
van Binsbergen, Rianne; Calus, Mario P L; Bink, Marco C A M; van Eeuwijk, Fred A; Schrooten, Chris; Veerkamp, Roel F
2015-09-17
In contrast to currently used single nucleotide polymorphism (SNP) panels, the use of whole-genome sequence data is expected to enable the direct estimation of the effects of causal mutations on a given trait. This could lead to higher reliabilities of genomic predictions compared to those based on SNP genotypes. Also, at each generation of selection, recombination events between a SNP and a mutation can cause decay in reliability of genomic predictions based on markers rather than on the causal variants. Our objective was to investigate the use of imputed whole-genome sequence genotypes versus high-density SNP genotypes on (the persistency of) the reliability of genomic predictions using real cattle data. Highly accurate phenotypes based on daughter performance and Illumina BovineHD Beadchip genotypes were available for 5503 Holstein Friesian bulls. The BovineHD genotypes (631,428 SNPs) of each bull were used to impute whole-genome sequence genotypes (12,590,056 SNPs) using the Beagle software. Imputation was done using a multi-breed reference panel of 429 sequenced individuals. Genomic estimated breeding values for three traits were predicted using a Bayesian stochastic search variable selection (BSSVS) model and a genome-enabled best linear unbiased prediction model (GBLUP). Reliabilities of predictions were based on 2087 validation bulls, while the other 3416 bulls were used for training. Prediction reliabilities ranged from 0.37 to 0.52. BSSVS performed better than GBLUP in all cases. Reliabilities of genomic predictions were slightly lower with imputed sequence data than with BovineHD chip data. Also, the reliabilities tended to be lower for both sequence data and BovineHD chip data when relationships between training animals were low. No increase in persistency of prediction reliability using imputed sequence data was observed. Compared to BovineHD genotype data, using imputed sequence data for genomic prediction produced no advantage. To investigate the putative advantage of genomic prediction using (imputed) sequence data, a training set with a larger number of individuals that are distantly related to each other and genomic prediction models that incorporate biological information on the SNPs or that apply stricter SNP pre-selection should be considered.
Unterseer, Sandra; Bauer, Eva; Haberer, Georg; Seidel, Michael; Knaak, Carsten; Ouzunova, Milena; Meitinger, Thomas; Strom, Tim M; Fries, Ruedi; Pausch, Hubert; Bertani, Christofer; Davassi, Alessandro; Mayer, Klaus Fx; Schön, Chris-Carolin
2014-09-29
High density genotyping data are indispensable for genomic analyses of complex traits in animal and crop species. Maize is one of the most important crop plants worldwide, however a high density SNP genotyping array for analysis of its large and highly dynamic genome was not available so far. We developed a high density maize SNP array composed of 616,201 variants (SNPs and small indels). Initially, 57 M variants were discovered by sequencing 30 representative temperate maize lines and then stringently filtered for sequence quality scores and predicted conversion performance on the array resulting in the selection of 1.2 M polymorphic variants assayed on two screening arrays. To identify high-confidence variants, 285 DNA samples from a broad genetic diversity panel of worldwide maize lines including the samples used for sequencing, important founder lines for European maize breeding, hybrids, and proprietary samples with European, US, semi-tropical, and tropical origin were used for experimental validation. We selected 616 k variants according to their performance during validation, support of genotype calls through sequencing data, and physical distribution for further analysis and for the design of the commercially available Affymetrix® Axiom® Maize Genotyping Array. This array is composed of 609,442 SNPs and 6,759 indels. Among these are 116,224 variants in coding regions and 45,655 SNPs of the Illumina® MaizeSNP50 BeadChip for study comparison. In a subset of 45,974 variants, apart from the target SNP additional off-target variants are detected, which show only a minor bias towards intermediate allele frequencies. We performed principal coordinate and admixture analyses to determine the ability of the array to detect and resolve population structure and investigated the extent of LD within a worldwide validation panel. The high density Affymetrix® Axiom® Maize Genotyping Array is optimized for European and American temperate maize and was developed based on a diverse sample panel by applying stringent quality filter criteria to ensure its suitability for a broad range of applications. With 600 k variants it is the largest currently publically available genotyping array in crop species.
SNP discovery in candidate adaptive genes using exon capture in a free-ranging alpine ungulate
Roffler, Gretchen H.; Amish, Stephen J.; Smith, Seth; Cosart, Ted F.; Kardos, Marty; Schwartz, Michael K.; Luikart, Gordon
2016-01-01
Identification of genes underlying genomic signatures of natural selection is key to understanding adaptation to local conditions. We used targeted resequencing to identify SNP markers in 5321 candidate adaptive genes associated with known immunological, metabolic and growth functions in ovids and other ungulates. We selectively targeted 8161 exons in protein-coding and nearby 5′ and 3′ untranslated regions of chosen candidate genes. Targeted sequences were taken from bighorn sheep (Ovis canadensis) exon capture data and directly from the domestic sheep genome (Ovis aries v. 3; oviAri3). The bighorn sheep sequences used in the Dall's sheep (Ovis dalli dalli) exon capture aligned to 2350 genes on the oviAri3 genome with an average of 2 exons each. We developed a microfluidic qPCR-based SNP chip to genotype 476 Dall's sheep from locations across their range and test for patterns of selection. Using multiple corroborating approaches (lositan and bayescan), we detected 28 SNP loci potentially under selection. We additionally identified candidate loci significantly associated with latitude, longitude, precipitation and temperature, suggesting local environmental adaptation. The three methods demonstrated consistent support for natural selection on nine genes with immune and disease-regulating functions (e.g. Ovar-DRA, APC, BATF2, MAGEB18), cell regulation signalling pathways (e.g. KRIT1, PI3K, ORRC3), and respiratory health (CYSLTR1). Characterizing adaptive allele distributions from novel genetic techniques will facilitate investigation of the influence of environmental variation on local adaptation of a northern alpine ungulate throughout its range. This research demonstrated the utility of exon capture for gene-targeted SNP discovery and subsequent SNP chip genotyping using low-quality samples in a nonmodel species.
SNP discovery in the bovine milk transcriptome using RNA-Seq technology.
Cánovas, Angela; Rincon, Gonzalo; Islas-Trejo, Alma; Wickramasinghe, Saumya; Medrano, Juan F
2010-12-01
High-throughput sequencing of RNA (RNA-Seq) was developed primarily to analyze global gene expression in different tissues. However, it also is an efficient way to discover coding SNPs. The objective of this study was to perform a SNP discovery analysis in the milk transcriptome using RNA-Seq. Seven milk samples from Holstein cows were analyzed by sequencing cDNAs using the Illumina Genome Analyzer system. We detected 19,175 genes expressed in milk samples corresponding to approximately 70% of the total number of genes analyzed. The SNP detection analysis revealed 100,734 SNPs in Holstein samples, and a large number of those corresponded to differences between the Holstein breed and the Hereford bovine genome assembly Btau4.0. The number of polymorphic SNPs within Holstein cows was 33,045. The accuracy of RNA-Seq SNP discovery was tested by comparing SNPs detected in a set of 42 candidate genes expressed in milk that had been resequenced earlier using Sanger sequencing technology. Seventy of 86 SNPs were detected using both RNA-Seq and Sanger sequencing technologies. The KASPar Genotyping System was used to validate unique SNPs found by RNA-Seq but not observed by Sanger technology. Our results confirm that analyzing the transcriptome using RNA-Seq technology is an efficient and cost-effective method to identify SNPs in transcribed regions. This study creates guidelines to maximize the accuracy of SNP discovery and prevention of false-positive SNP detection, and provides more than 33,000 SNPs located in coding regions of genes expressed during lactation that can be used to develop genotyping platforms to perform marker-trait association studies in Holstein cattle.
Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
Binder, Hans; Fasold, Mario; Glomb, Torsten
2009-01-01
Background Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates. Methodology/Principal Findings The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence. Conclusions The applied method of “triple-averaging” represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested. PMID:19924253
Investigation of Inversion Polymorphisms in the Human Genome Using Principal Components Analysis
Ma, Jianzhong; Amos, Christopher I.
2012-01-01
Despite the significant advances made over the last few years in mapping inversions with the advent of paired-end sequencing approaches, our understanding of the prevalence and spectrum of inversions in the human genome has lagged behind other types of structural variants, mainly due to the lack of a cost-efficient method applicable to large-scale samples. We propose a novel method based on principal components analysis (PCA) to characterize inversion polymorphisms using high-density SNP genotype data. Our method applies to non-recurrent inversions for which recombination between the inverted and non-inverted segments in inversion heterozygotes is suppressed due to the loss of unbalanced gametes. Inside such an inversion region, an effect similar to population substructure is thus created: two distinct “populations” of inversion homozygotes of different orientations and their 1∶1 admixture, namely the inversion heterozygotes. This kind of substructure can be readily detected by performing PCA locally in the inversion regions. Using simulations, we demonstrated that the proposed method can be used to detect and genotype inversion polymorphisms using unphased genotype data. We applied our method to the phase III HapMap data and inferred the inversion genotypes of known inversion polymorphisms at 8p23.1 and 17q21.31. These inversion genotypes were validated by comparing with literature results and by checking Mendelian consistency using the family data whenever available. Based on the PCA-approach, we also performed a preliminary genome-wide scan for inversions using the HapMap data, which resulted in 2040 candidate inversions, 169 of which overlapped with previously reported inversions. Our method can be readily applied to the abundant SNP data, and is expected to play an important role in developing human genome maps of inversions and exploring associations between inversions and susceptibility of diseases. PMID:22808122
Lijavetzky, Diego; Cabezas, José Antonio; Ibáñez, Ana; Rodríguez, Virginia; Martínez-Zapater, José M
2007-01-01
Background Single-nucleotide polymorphisms (SNPs) are the most abundant type of DNA sequence polymorphisms. Their higher availability and stability when compared to simple sequence repeats (SSRs) provide enhanced possibilities for genetic and breeding applications such as cultivar identification, construction of genetic maps, the assessment of genetic diversity, the detection of genotype/phenotype associations, or marker-assisted breeding. In addition, the efficiency of these activities can be improved thanks to the ease with which SNP genotyping can be automated. Expressed sequence tags (EST) sequencing projects in grapevine are allowing for the in silico detection of multiple putative sequence polymorphisms within and among a reduced number of cultivars. In parallel, the sequence of the grapevine cultivar Pinot Noir is also providing thousands of polymorphisms present in this highly heterozygous genome. Still the general application of those SNPs requires further validation since their use could be restricted to those specific genotypes. Results In order to develop a large SNP set of wide application in grapevine we followed a systematic re-sequencing approach in a group of 11 grape genotypes corresponding to ancient unrelated cultivars as well as wild plants. Using this approach, we have sequenced 230 gene fragments, what represents the analysis of over 1 Mb of grape DNA sequence. This analysis has allowed the discovery of 1573 SNPs with an average of one SNP every 64 bp (one SNP every 47 bp in non-coding regions and every 69 bp in coding regions). Nucleotide diversity in grape (π = 0.0051) was found to be similar to values observed in highly polymorphic plant species such as maize. The average number of haplotypes per gene sequence was estimated as six, with three haplotypes representing over 83% of the analyzed sequences. Short-range linkage disequilibrium (LD) studies within the analyzed sequences indicate the existence of a rapid decay of LD within the selected grapevine genotypes. To validate the use of the detected polymorphisms in genetic mapping, cultivar identification and genetic diversity studies we have used the SNPlex™ genotyping technology in a sample of grapevine genotypes and segregating progenies. Conclusion These results provide accurate values for nucleotide diversity in coding sequences and a first estimate of short-range LD in grapevine. Using SNPlex™ genotyping we have shown the application of a set of discovered SNPs as molecular markers for cultivar identification, linkage mapping and genetic diversity studies. Thus, the combination a highly efficient re-sequencing approach and the SNPlex™ high throughput genotyping technology provide a powerful tool for grapevine genetic analysis. PMID:18021442
Genomic analysis of cow mortality and milk production using a threshold-linear model.
Tsuruta, S; Lourenco, D A L; Misztal, I; Lawlor, T J
2017-09-01
The objective of this study was to investigate the feasibility of genomic evaluation for cow mortality and milk production using a single-step methodology. Genomic relationships between cow mortality and milk production were also analyzed. Data included 883,887 (866,700) first-parity, 733,904 (711,211) second-parity, and 516,256 (492,026) third-parity records on cow mortality (305-d milk yields) of Holsteins from Northeast states in the United States. The pedigree consisted of up to 1,690,481 animals including 34,481 bulls genotyped with 36,951 SNP markers. Analyses were conducted with a bivariate threshold-linear model for each parity separately. Genomic information was incorporated as a genomic relationship matrix in the single-step BLUP. Traditional and genomic estimated breeding values (GEBV) were obtained with Gibbs sampling using fixed variances, whereas reliabilities were calculated from variances of GEBV samples. Genomic EBV were then converted into single nucleotide polymorphism (SNP) marker effects. Those SNP effects were categorized according to values corresponding to 1 to 4 standard deviations. Moving averages and variances of SNP effects were calculated for windows of 30 adjacent SNP, and Manhattan plots were created for SNP variances with the same window size. Using Gibbs sampling, the reliability for genotyped bulls for cow mortality was 28 to 30% in EBV and 70 to 72% in GEBV. The reliability for genotyped bulls for 305-d milk yields was 53 to 65% to 81 to 85% in GEBV. Correlations of SNP effects between mortality and 305-d milk yields within categories were the highest with the largest SNP effects and reached >0.7 at 4 standard deviations. All SNP regions explained less than 0.6% of the genetic variance for both traits, except regions close to the DGAT1 gene, which explained up to 2.5% for cow mortality and 4% for 305-d milk yields. Reliability for GEBV with a moderate number of genotyped animals can be calculated by Gibbs samples. Genomic information can greatly increase the reliability of predictions not only for milk but also for mortality. The existence of a common region on Bos taurus autosome 14 affecting both traits may indicate a major gene with a pleiotropic effect on milk and mortality. Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Yang, Zhe; Zhou, Lin; Wu, Li-Ming; Xie, Hai-Yang; Zhang, Feng; Zheng, Shu-Sen
2010-12-01
Histone deacetylases (HDACs) have been reported to be poor prognostic indicators in patients with cancer. However, no data are available for the role of single nucleotide polymorphism (SNP) of class I HDAC in hepato-cellular carcinoma (HCC). Therefore, we investigated the association of class I HDAC isoforms genomic polymorphisms with risk of HCC and tumor recurrence following liver transplantation (LT). One hundred and ninety-six Chinese subjects consisting of 97 HCC patients and 99 controls were enrolled in this study. Nine polymorphisms of the HDAC1, HDAC2, and HDAC3 gene (rs2530223, rs1741981, rs2547547, rs13204445, rs6568819, rs10499080, rs11741808, rs2475631, rs11391) were examined using Applied Biosystems SNaP-Shot and TaqMan technology. We found no significant difference in genotype frequencies between the HCC cases and controls. In terms of tumor recurrence following LT, patients carrying the T allele of HDAC1 SNP rs1741981 showed a favorable outcome for recurrence free survival when compared with patients homozygous for CC. In addition, the same significant trend was observed in HDAC3 SNP rs2547547. Kaplan-Meier analysis showed that the combination of the T variant allele (CT+TT) of HDAC1 SNP rs1741981 and the homozygous TT variant allele of HDAC3 SNP rs2547547 was the most favorable prognostic factor. The risk for postoperative tumor recurrence was about 2.2-fold lower for patients with this genotype combination compared with carriers of the HDAC1 SNP rs1741981 CC and HDAC3 SNP rs2547547 CT genotype combination (hazard ratio: 2.235, p=0.003). Our data suggest that combined analysis of HDAC1 SNP rs1741981 and HDAC3 SNP rs2547547 may be a potential genetic marker for HCC recurrence in LT patients.
Chen, Sirui; An, Jianyong; Lian, Ling; Qu, Lujiang; Zheng, Jiangxia; Xu, Guiyun; Yang, Ning
2013-02-01
Muscle characteristics such as myofiber diameter, density, and total number are important traits in broiler breeding and production. In the present study, 19 SNP of 13 major genes, which are located in the vicinity of quantitative trait loci affecting breast muscle weight, including INS, IGF2, PIK3C2A, AKT3, PRKAB2, PRKAG3, VEGFA, RPS6KA2/3, FIGF, and TGF-β1/2/3, were chosen to be genotyped by high-throughput matrix-assisted laser desorption/ionization time-of-flight mass spectrometry in a broiler population. One hundred twenty birds were slaughtered at 6 wk of age. Body weight, breast muscle weight, myofiber diameter, density, and total number were determined for each bird. Six SNP with a very low minor allele frequency (<1%) were excluded for further analysis. The remaining 13 SNP were used for the association study with muscle characteristics. The results showed that SNP in TGF-β1/2/3 had significant effects on myofiber diameter. A SNP in PRKAG3 had a significant effect on myofiber density (P < 0.05). A C > G mutation in FIGF was strongly associated with total fiber number (P < 0.05). Additionally, birds with the GG genotype of the C > G mutation in AKT3 had significantly larger myofiber numbers (P < 0.05) than birds with the CC or GC genotype. The SNP identified in the present study might be used as potential markers in broiler breeding.
Goldstone, Robert J.; McLuckie, Joyce; Smith, David G. E.
2015-01-01
Typing of Mycobacterium avium subspecies paratuberculosis strains presents a challenge, since they are genetically monomorphic and traditional molecular techniques have limited discriminatory power. The recent advances and availability of whole-genome sequencing have extended possibilities for the characterization of Mycobacterium avium subspecies paratuberculosis, and whole-genome sequencing can provide a phylogenetic context to facilitate global epidemiology studies. In this study, we developed a single nucleotide polymorphism (SNP) assay based on PCR and restriction enzyme digestion or sequencing of the amplified product. The SNP analysis was performed using genome sequence data from 133 Mycobacterium avium subspecies paratuberculosis isolates with different genotypes from 8 different host species and 17 distinct geographic regions around the world. A total of 28,402 SNPs were identified among all of the isolates. The minimum number of SNPs required to distinguish between all of the 133 genomes was 93 and between only the type C isolates was 41. To reduce the number of SNPs and PCRs required, we adopted an approach based on sequential detection of SNPs and a decision tree. By the analysis of 14 SNPs Mycobacterium avium subspecies paratuberculosis isolates can be characterized within 14 phylogenetic groups with a higher discriminatory power than mycobacterial interspersed repetitive unit–variable number tandem repeat assay and other typing methods. Continuous updating of genome sequences is needed in order to better characterize new phylogenetic groups and SNP profiles. The novel SNP assay is a discriminative, simple, reproducible method and requires only basic laboratory equipment for the large-scale global typing of Mycobacterium avium subspecies paratuberculosis isolates. PMID:26677250
Rocher, Solen; Jean, Martine; Castonguay, Yves; Belzile, François
2015-01-01
Genotyping-by-sequencing (GBS) is a relatively low-cost high throughput genotyping technology based on next generation sequencing and is applicable to orphan species with no reference genome. A combination of genome complexity reduction and multiplexing with DNA barcoding provides a simple and affordable way to resolve allelic variation between plant samples or populations. GBS was performed on ApeKI libraries using DNA from 48 genotypes each of two heterogeneous populations of tetraploid alfalfa (Medicago sativa spp. sativa): the synthetic cultivar Apica (ATF0) and a derived population (ATF5) obtained after five cycles of recurrent selection for superior tolerance to freezing (TF). Nearly 400 million reads were obtained from two lanes of an Illumina HiSeq 2000 sequencer and analyzed with the Universal Network-Enabled Analysis Kit (UNEAK) pipeline designed for species with no reference genome. Following the application of whole dataset-level filters, 11,694 single nucleotide polymorphism (SNP) loci were obtained. About 60% had a significant match on the Medicago truncatula syntenic genome. The accuracy of allelic ratios and genotype calls based on GBS data was directly assessed using 454 sequencing on a subset of SNP loci scored in eight plant samples. Sequencing depth in this study was not sufficient for accurate tetraploid allelic dosage, but reliable genotype calls based on diploid allelic dosage were obtained when using additional quality filtering. Principal Component Analysis of SNP loci in plant samples revealed that a small proportion (<5%) of the genetic variability assessed by GBS is able to differentiate ATF0 and ATF5. Our results confirm that analysis of GBS data using UNEAK is a reliable approach for genome-wide discovery of SNP loci in outcrossed polyploids. PMID:26115486
Mendes, Adélia; Costa, Natália Rios; Chora, Inês; Ferreira, Sara; Araújo, Emanuel; Lopes, Pedro; Rosa, Gilberto; Marques, Pedro; Bettencourt, Paulo; Oliveira, Inês; Costa, Francisco; Ramos, Isabel; Teles, Maria José; Guimarães, João Tiago; Sobrinho-Simões, Manuel; Soares, Paula
2016-01-01
Head and neck cancers, and cardiovascular disease have been described as late effects of low dose radiation (LDR) exposure, namely in tinea capitis cohorts. In addition to radiation dose, gender and younger age at exposure, the genetic background might be involved in the susceptibility to LDR late effects. The -174 G>C (rs1800795) SNP in IL6 has been associated with cancer and cardiovascular disease, nevertheless this association is still controversial. We assessed the association of the IL6-174 G>C SNP with LDR effects such as thyroid carcinoma, basal cell carcinoma and carotid atherosclerosis in the Portuguese tinea capitis cohort. The IL6-174 G>C SNP was genotyped in 1269 individuals formerly irradiated for tinea capitis. This sampling group included thyroid cancer (n = 36), basal cell carcinoma (n = 113) and cases without thyroid or basal cell carcinoma (1120). A subgroup was assessed for atherosclerosis by ultrasonography (n = 379) and included matched controls (n = 222). Genotypes were discriminated by real-time PCR using a TaqMan SNP genotyping assay. In the irradiated group, we observed that the CC genotype was significantly associated with carotid plaque risk, both in the genotypic (OR = 3.57, CI = 1.60–7.95, p-value = 0.002) and in the recessive (OR = 3.02, CI = 1.42–6.42, p-value = 0.004) models. Irradiation alone was not a risk factor for carotid atherosclerosis. We did not find a significant association of the IL6-174 C allele with thyroid carcinoma or basal cell carcinoma risk. The IL6-174 CC genotype confers a three-fold risk for carotid atherosclerotic disease suggesting it may represent a genetic susceptibility factor in the LDR context. PMID:27662210
Lee, Kyoung-Young; Kang, Hyun-Sik; Shin, Yun-A
2013-03-10
The effects of exercise on adiponectin levels have been reported to be variable and may be attributable to an interaction between environmental and genetic factors. The single nucleotide polymorphisms (SNP) 45 (T>G) and SNP276 (G>T) of the adiponectin gene are associated with metabolic risk factors including adiponectin levels. We examined whether SNP45 and SNP276 would differentially influence the effect of exercise training in middle-aged women with uncomplicated obesity. We conducted a prospective study in the general community that included 90 Korean women (age 47.0±5.1 years) with uncomplicated obesity. The intervention was aerobic exercise training for 3 months. Body composition, adiponectin levels, and other metabolic risk factors were measured. Prior to exercise training, only body weight differed among the SNP276 genotypes. Exercise training improved body composition, systolic blood pressure, maximal oxygen consumption, high-density lipoprotein cholesterol, and leptin levels. In addition, exercise improved adiponectin levels irrespective of weight gain or loss. However, after adjustments for age, BMI, body fat (%), and waist circumference, no differences were found in obesity-related characteristics (e.g., adiponectin) following exercise training among the SNP45 and the 276 genotypes. Our findings suggest that aerobic exercise affects adiponectin levels regardless of weight loss and this effect would not be influenced by SNP45 and SNP276 in the adiponectin gene. Crown Copyright © 2012. Published by Elsevier B.V. All rights reserved.
Kumar, Rakesh; Gupta, I. D.; Verma, Archana; Verma, Nishant; Vineeth, M. R.
2015-01-01
Aim: The present study was undertaken to identify novel single nucleotide polymorphism (SNP) in Exon 3 of HSP90AA1 gene and to analyze their association with respiration rate (RR) and rectal temperature (RT) in Sahiwal cows. Materials and Methods: The present study was carried out in Sahiwal cows (n=100) with the objectives to identify novel SNP in exon 3 of HSP90AA1 gene and to explore the association with heat tolerance traits. CLUSTAL-W multiple sequence analysis was used to identify novel SNPs in exon 3 of HSP90AA1 gene in Sahiwal cows. Gene and genotype frequencies of different genotypes were estimated by standard procedure POPGENE version 1.32 (University of Alberta, Canada). The significant effect of SNP variants on physiological parameters, e.g. RR and RT were analyzed using the General Linear model procedure of SAS Version 9.2. Results: The polymerase chain reaction product with the amplicon size of 450 bp was successfully amplified, covering exon 3 region of HSP90AA1 gene in Sahiwal cows. On the basis of comparative sequence analysis of Sahiwal samples (n=100), transitional mutations were detected at locus A1209G as compared to Bos taurus (NCBI GenBank AC_000178.1). After chromatogram analysis, three genotypes AA, AG, and GG with respective frequencies of 0.23, 0.50, and 0.27 ascertained. RR and RT were recorded once during probable extreme hours in winter, spring, and summer seasons. It was revealed that significant difference (p<0.01) among genetic variants of HSP90AA1 gene with heat tolerance trait was found in Sahiwal cattle. The homozygotic animals with AA genotype had lower heat tolerance coefficient (HTC) (1.78±0.04a), as compared to both AG and GG genotypes (1.85±0.03b and 1.91±0.02c), respectively. The gene and genotype frequencies for the locus A1209G were ascertained. Conclusions: Novel SNP was found at the A1209G position showed all possible three genotypes (homozygous and heterozygous). Temperature humidity index has a highly significant association with RR, RT, and HTC in all the seasons. Perusal of results across different seasons showed the significant (p<0.01) difference in RR, RT, and HTC among winter, spring, and summer seasons. Genetic association with heat tolerance traits reveals their importance as a potential genetic marker for heat tolerance traits in Sahiwal cows. PMID:27047179
Schnitzler, Fabian; Friedrich, Matthias; Wolf, Christiane; Angelberger, Marianne; Diegelmann, Julia; Olszak, Torsten; Beigel, Florian; Tillack, Cornelia; Stallhofer, Johannes; Göke, Burkhard; Glas, Jürgen; Lohse, Peter; Brand, Stephan
2014-01-01
Very recently, a sub-analysis of genome-wide association scans revealed that the non-coding single nucleotide polymorphism (SNP) rs12212067 in the FOXO3A gene is associated with a milder course of Crohn's disease (CD) (Cell 2013;155:57-69). The aim of our study was to evaluate the clinical value of the SNP rs12212067 in predicting the severity of CD by correlating CD patient genotype status with the most relevant complications of CD such as stenoses, fistulas, and CD-related surgery. We genotyped 550 CD patients for rs12212067 (FOXO3A) and the three common CD-associated NOD2 mutations rs2066844, rs2066847, and rs2066847 and performed genotype-phenotype analyses. No significant phenotypic differences were found between the wild-type genotype TT of the FOXO3A SNP rs12212067 and the minor genotypes TG and GG independently from NOD2 variants. The allele frequency of the minor G allele was 12.7%. Age at diagnosis, disease duration, body mass index, surgery rate, stenoses, fistula, need for immunosuppressive therapy, and disease course were not significantly different. In contrast, the NOD2 mutant p.Leu1007fsX1008 (rs2066847) was highly associated with penetrating CD (p = 0.01), the development of fistulas (p = 0.01) and stenoses (p = 0.01), and ileal disease localization (p = 0.03). Importantly, the NOD2 SNP rs2066847 was a strong separator between an aggressive and a mild course of CD (p = 2.99×10(-5)), while the FOXO3A SNP rs12212067 did not separate between mild and aggressive CD behavior in our cohort (p = 0.35). 96.2% of the homozygous NOD2 p.Leu1007fsX1008 carriers had an aggressive disease behavior compared to 69.3% of the patients with the NOD2 wild-type genotype (p = 0.007). In clinical practice, the NOD2 variant p.Leu1007fsX1008 (rs2066847), in particular in homozygous form, is a much stronger marker for a severe clinical phenotype than the FOXO3A rs12212067 SNP for a mild disease course on an individual patient level despite its important impact on the inflammatory response of monocytes.
Ajayi, Oyeyemi O; Adefenwa, Mufliat A; Agaviezor, Brilliant O; Ikeobi, Christian O N; Wheto, Matthew; Okpeku, Moses; Amusan, Samuel A; Yakubu, Abdulmojeed; De Donato, Marcos; Peters, Sunday O; Imumorin, Ikhide G
2014-02-01
The tenascin-XB (TNXB) gene has antiadhesive effects, functions in matrix maturation in connective tissues, and localizes to the major histocompatibility complex class III region. We hypothesized that it may influence adaptive physiological response through an effect on blood vessel function. We identified a novel g.1324 A→G polymorphism at a TaqI recognition site in a 454 bp fragment of ovine TNXB and genotyped it in 150 Nigerian sheep using PCR-RFLP. The missense mutation changes glutamic acid (GAA) to glycine (GGA). Among SNP genotypes, significant differences (P < 0.05) were observed in body weight and fore cannon bone length. Interaction effects of breed, SNP genotype, and geographic location had a significant effect (P < 0.05) on chest girth. The SNP genotype was significantly (P < 0.05) associated with physiological traits of pulse rate and skin temperature. The observed effect of this novel polymorphism may be mediated through its role in connective tissue biology, requiring further association and functional studies.
USDA-ARS?s Scientific Manuscript database
This study was conducted as an initial assessment of a newly available genotyping assay containing about 34,000 common SNP included on previous SNP chips, and 199,000 sequence variants predicted to affect gene function. Objectives were to identify functional variants associated with birth weight in...
2011-01-01
Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. PMID:22082336
Optimization of the genotyping-by-sequencing strategy for population genomic analysis in conifers.
Pan, Jin; Wang, Baosheng; Pei, Zhi-Yong; Zhao, Wei; Gao, Jie; Mao, Jian-Feng; Wang, Xiao-Ru
2015-07-01
Flexibility and low cost make genotyping-by-sequencing (GBS) an ideal tool for population genomic studies of nonmodel species. However, to utilize the potential of the method fully, many parameters affecting library quality and single nucleotide polymorphism (SNP) discovery require optimization, especially for conifer genomes with a high repetitive DNA content. In this study, we explored strategies for effective GBS analysis in pine species. We constructed GBS libraries using HpaII, PstI and EcoRI-MseI digestions with different multiplexing levels and examined the effect of restriction enzymes on library complexity and the impact of sequencing depth and size selection of restriction fragments on sequence coverage bias. We tested and compared UNEAK, Stacks and GATK pipelines for the GBS data, and then developed a reference-free SNP calling strategy for haploid pine genomes. Our GBS procedure proved to be effective in SNP discovery, producing 7000-11 000 and 14 751 SNPs within and among three pine species, respectively, from a PstI library. This investigation provides guidance for the design and analysis of GBS experiments, particularly for organisms for which genomic information is lacking. © 2014 John Wiley & Sons Ltd.
Du, Tao; Duan, Yu; Li, Kaiwen; Zhao, Xiaomiao; Ni, Renmin; Li, Yu; Yang, Dongzi
2015-01-01
Background. Single-nucleotide polymorphisms (SNPs) in the follicle stimulating hormone receptor (FSHR) gene are associated with PCOS. However, their relationship to the polycystic ovary (PCO) morphology remains unknown. This study aimed to investigate whether PCOS related SNPs in the FSHR gene are associated with PCO in women with PCOS. Methods. Patients were grouped into PCO (n = 384) and non-PCO (n = 63) groups. Genomic genotypes were profiled using Affymetrix human genome SNP chip 6. Two polymorphisms (rs2268361 and rs2349415) of FSHR were analyzed using a statistical approach. Results. Significant differences were found in the allele distributions of the GG genotype of rs2268361 between the PCO and non-PCO groups (27.6% GG, 53.4% GA, and 19.0% AA versus 33.3% GG, 36.5% GA, and 30.2% AA), while no significant differences were found in the allele distributions of the GG genotype of rs2349415. When rs2268361 was considered, there were statistically significant differences of serum follicle stimulating hormone, estradiol, and sex hormone binding globulin between genotypes in the PCO group. In case of the rs2349415 SNP, only serum sex hormone binding globulin was statistically different between genotypes in the PCO group. Conclusions. Functional variants in FSHR gene may contribute to PCO susceptibility in women with PCOS. PMID:26273622
Effects of ghrelin gene genotypes on the growth traits in Chinese cattle.
Zhang, Ai-ling; Zhang, Li; Zhang, Liang-zhi; Zhang, Cun-fang; Lan, Xian-yong; Zhang, Chun-lei; Chen, Hong
2012-06-01
Ghrelin is an important peptide that stimulates food intake and regulates energy balance of animals. Single nucleotide polymorphisms of ghrelin gene in three Chinese cattle populations were investigated through PCR-SSCP and DNA sequencing. Five over-lapped DNA fragments were analyzed and a total of three ones exhibited different genotypes. Three genotypes and four SNPs (-415 A > G, -414 T > C, -321 C > A, and -172 A > G) were found on the -544 to +35 bp region (G-1) of ghrelin gene. On the locus of -1037 to -509 bp (G-2), two genotypes and one SNP (-726 A > T) were discovered. And in the exon1, exon2, and intron1 (G-4 locus, (+4 to +427)), two genotypes and one SNP were detected (+205 C > T, located in intron1). Positions of the five SNPs in the 5′ regulatory region might be the transcription factor binding sites. The SNPs at -415 and -414 in the core binding sequence were found to cause the change of the site. Though the SNP at -172 did not change the binding site, it generated one new site at the same time. The frequencies of the genotypes varied differently in the three breeds. Results of ANOVA showed that G-1 was correlative to the ischium width (IW) of Nanyang cattle aged 18 months (p = 0.043). The least square analysis between genotypes at G-1 locus and growth traits in Nanyang cattle showed that the individuals (aged 18 months) with C genotype had greater IW than that of the other two genotypes. The C genotype might serve as one potential candidate genetic marker for cattle growth and development.
efficient association study design via power-optimized tag SNP selection
HAN, BUHM; KANG, HYUN MIN; SEO, MYEONG SEONG; ZAITLEN, NOAH; ESKIN, ELEAZAR
2008-01-01
Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SNPs) by selecting a subset of SNPs to be genotyped (tag SNPs). While many current association studies are performed using commercially available high-throughput genotyping products that define a set of tag SNPs, choosing tag SNPs remains an important problem for both custom follow-up studies as well as designing the high-throughput genotyping products themselves. The most widely used tag SNP selection method optimizes over the correlation between SNPs (r2). However, tag SNPs chosen based on an r2 criterion do not necessarily maximize the statistical power of an association study. We propose a study design framework that chooses SNPs to maximize power and efficiently measures the power through empirical simulation. Empirical results based on the HapMap data show that our method gains considerable power over a widely used r2-based method, or equivalently reduces the number of tag SNPs required to attain the desired power of a study. Our power-optimized 100k whole genome tag set provides equivalent power to the Affymetrix 500k chip for the CEU population. For the design of custom follow-up studies, our method provides up to twice the power increase using the same number of tag SNPs as r2-based methods. Our method is publicly available via web server at http://design.cs.ucla.edu. PMID:18702637
MMP9 polymorphisms and breast cancer risk: a report from the Shanghai Breast Cancer Genetics Study.
Beeghly-Fadiel, Alicia; Lu, Wei; Shu, Xiao-Ou; Long, Jirong; Cai, Qiuyin; Xiang, Yongbin; Gao, Yu-Tang; Zheng, Wei
2011-04-01
In addition to tumor invasion and angiogenesis, matrix metalloproteinase (MMP)9 also contributes to carcinogenesis and tumor growth. Genetic variation that may influence MMP9 expression was evaluated among participants of the Shanghai Breast Cancer Genetics Study (SBCGS) for associations with breast cancer susceptibility. In stage 1, 11 MMP9 single nucleotide polymorphisms (SNPs) were genotyped by the Affymetrix Targeted Genotyping System and/or the Affymetrix Genome-Wide Human SNP Array 6.0 among 4,227 SBCGS participants. One SNP was further genotyped using the Sequenom iPLEX MassARRAY platform among an additional 6,270 SBCGS participants. Associations with breast cancer risk were evaluated by odds ratios (OR) and 95% confidence intervals (CI) from logistic regression models that included adjustment for age, education, and genotyping stage when appropriate. In Stage 1, rare allele homozygotes for a promoter SNP (rs3918241) or a non-synonymous SNP (rs2274756, R668Q) tended to occur more frequently among breast cancer cases (P value = 0.116 and 0.056, respectively). Given their high linkage disequilibrium (D' = 1.0, r (2) = 0.97), one (rs3918241) was selected for additional analysis. An association with breast cancer risk was not supported by additional Stage 2 genotyping. In combined analysis, no elevated risk of breast cancer among homozygotes was found (OR: 1.2, 95% CI: 0.8-1.8). Common genetic variation in MMP9 was not found to be significantly associated with breast cancer susceptibility among participants of the Shanghai Breast Cancer Genetics Study.
Valdisser, Paula A M R; Pereira, Wendell J; Almeida Filho, Jâneo E; Müller, Bárbara S F; Coelho, Gesimária R C; de Menezes, Ivandilson P P; Vianna, João P G; Zucchi, Maria I; Lanna, Anna C; Coelho, Alexandre S G; de Oliveira, Jaison P; Moraes, Alessandra da Cunha; Brondani, Claudio; Vianello, Rosana P
2017-05-30
Common bean is a legume of social and nutritional importance as a food crop, cultivated worldwide especially in developing countries, accounting for an important source of income for small farmers. The availability of the complete sequences of the two common bean genomes has dramatically accelerated and has enabled new experimental strategies to be applied for genetic research. DArTseq has been widely used as a method of SNP genotyping allowing comprehensive genome coverage with genetic applications in common bean breeding programs. Using this technology, 6286 SNPs (1 SNP/86.5 Kbp) were genotyped in genic (43.3%) and non-genic regions (56.7%). Genetic subdivision associated to the common bean gene pools (K = 2) and related to grain types (K = 3 and K = 5) were reported. A total of 83% and 91% of all SNPs were polymorphic within the Andean and Mesoamerican gene pools, respectively, and 26% were able to differentiate the gene pools. Genetic diversity analysis revealed an average H E of 0.442 for the whole collection, 0.102 for Andean and 0.168 for Mesoamerican gene pools (F ST = 0.747 between gene pools), 0.440 for the group of cultivars and lines, and 0.448 for the group of landrace accessions (F ST = 0.002 between cultivar/line and landrace groups). The SNP effects were predicted with predominance of impact on non-coding regions (77.8%). SNPs under selection were identified within gene pools comparing landrace and cultivar/line germplasm groups (Andean: 18; Mesoamerican: 69) and between the gene pools (59 SNPs), predominantly on chromosomes 1 and 9. The LD extension estimate corrected for population structure and relatedness (r 2 SV ) was ~ 88 kbp, while for the Andean gene pool was ~ 395 kbp, and for the Mesoamerican was ~ 130 kbp. For common bean, DArTseq provides an efficient and cost-effective strategy of generating SNPs for large-scale genome-wide studies. The DArTseq resulted in an operational panel of 560 polymorphic SNPs in linkage equilibrium, providing high genome coverage. This SNP set could be used in genotyping platforms with many applications, such as population genetics, phylogeny relation between common bean varieties and support to molecular breeding approaches.
Wu, Wilfred; Clark, Erin A S; Stoddard, Gregory J; Watkins, W Scott; Esplin, M Sean; Manuck, Tracy A; Xing, Jinchuan; Varner, Michael W; Jorde, Lynn B
2013-04-25
Because of the role of inflammation in preterm birth (PTB), polymorphisms in and near the interleukin-6 gene (IL6) have been association study targets. Several previous studies have assessed the association between PTB and a single nucleotide polymorphism (SNP), rs1800795, located in the IL6 gene promoter region. Their results have been inconsistent and SNP frequencies have varied strikingly among different populations. We therefore conducted a meta-analysis with subgroup analysis by population strata to: (1) reduce the confounding effect of population structure, (2) increase sample size and statistical power, and (3) elucidate the association between rs1800975 and PTB. We reviewed all published papers for PTB phenotype and SNP rs1800795 genotype. Maternal genotype and fetal genotype were analyzed separately and the analyses were stratified by population. The PTB phenotype was defined as gestational age (GA) < 37 weeks, but results from earlier GA were selected when available. All studies were compared by genotype (CC versus CG+GG), based on functional studies.For the maternal genotype analysis, 1,165 PTBs and 3,830 term controls were evaluated. Populations were stratified into women of European descent (for whom the most data were available) and women of heterogeneous origin or admixed populations. All ancestry was self-reported. Women of European descent had a summary odds ratio (OR) of 0.68, (95% confidence interval (CI) 0.51 - 0.91), indicating that the CC genotype is protective against PTB. The result for non-European women was not statistically significant (OR 1.01, 95% CI 0.59 - 1.75). For the fetal genotype analysis, four studies were included; there was no significant association with PTB (OR 0.98, 95% CI 0.72 - 1.33). Sensitivity analysis showed that preterm premature rupture of membrane (PPROM) may be a confounding factor contributing to phenotype heterogeneity. IL6 SNP rs1800795 genotype CC is protective against PTB in women of European descent. It is not significant in other heterogeneous or admixed populations, or in fetal genotype analysis.Population structure is an important confounding factor that should be controlled for in studies of PTB.
Screening for susceptibility genes in hereditary non-polyposis colorectal cancer.
Yu, Li; Yin, Bo; Qu, Kaiying; Li, Jingjing; Jin, Qiao; Liu, Ling; Liu, Chunlan; Zhu, Yuxing; Wang, Qi; Peng, Xiaowei; Zhou, Jianda; Cao, Peiguo; Cao, Ke
2018-06-01
In the present study, hereditary non-polyposis colorectal cancer (HNPCC) susceptibility genes were screened for using whole exome sequencing in 3 HNPCC patients from 1 family and using single nucleotide polymorphism (SNP) genotyping assays in 96 other colorectal cancer and control samples. Peripheral blood was obtained from 3 HNPCC patients from 1 family; the proband and the proband's brother and cousin. High-throughput sequencing was performed using whole exome capture technology. Sequences were aligned against the HAPMAP, dbSNP130 and 1,000 Genome Project databases. Reported common variations and synonymous mutations were filtered out. Non-synonymous single nucleotide variants in the 3 HNPCC patients were integrated and the candidate genes were identified. Finally, SNP genotyping was performed for the genes in 96 peripheral blood samples. In total, 60.4 Gb of data was retrieved from the 3 HNPCC patients using whole exome capture technology. Subsequently, according to certain screening criteria, 15 candidate genes were identified. Among the 96 samples that had been SNP genotyped, 92 were successfully genotyped for 15 gene loci, while genotyping for HTRA1 failed in 4 sporadic colorectal cancer patient samples. In 12 control subjects and 81 sporadic colorectal cancer patients, genotypes at 13 loci were wild-type, namely DDX20, ZFYVE26, PIK3R3, SLC26A8, ZEB2, TP53INP1, SLC11A1, LRBA, CEBPZ, ETAA1, SEMA3G, IFRD2 and FAT1 . The CEP290 genotype was mutant in 1 sporadic colorectal cancer patient and was wild-type in all other subjects. A total of 5 of the 12 control subjects and 30 of the 81 sporadic colorectal cancer patients had a mutant HTRA1 genotype. In all 3 HNPCC patients, the same mutant genotypes were identified at all 15 gene loci. Overall, 13 potential susceptibility genes for HNPCC were identified, namely DDX20, ZFYVE26, PIK3R3, SLC26A8, ZEB2, TP53INP1, SLC11A1, LRBA, CEBPZ, ETAA1, SEMA3G, IFRD2 and FAT1 .
SNP Discovery and Linkage Map Construction in Cultivated Tomato
Shirasawa, Kenta; Isobe, Sachiko; Hirakawa, Hideki; Asamizu, Erika; Fukuoka, Hiroyuki; Just, Daniel; Rothan, Christophe; Sasamoto, Shigemi; Fujishiro, Tsunakazu; Kishida, Yoshie; Kohara, Mitsuyo; Tsuruoka, Hisano; Wada, Tsuyuko; Nakamura, Yasukazu; Sato, Shusei; Tabata, Satoshi
2010-01-01
Few intraspecific genetic linkage maps have been reported for cultivated tomato, mainly because genetic diversity within Solanum lycopersicum is much less than that between tomato species. Single nucleotide polymorphisms (SNPs), the most abundant source of genomic variation, are the most promising source of polymorphisms for the construction of linkage maps for closely related intraspecific lines. In this study, we developed SNP markers based on expressed sequence tags for the construction of intraspecific linkage maps in tomato. Out of the 5607 SNP positions detected through in silico analysis, 1536 were selected for high-throughput genotyping of two mapping populations derived from crosses between ‘Micro-Tom’ and either ‘Ailsa Craig’ or ‘M82’. A total of 1137 markers, including 793 out of the 1338 successfully genotyped SNPs, along with 344 simple sequence repeat and intronic polymorphism markers, were mapped onto two linkage maps, which covered 1467.8 and 1422.7 cM, respectively. The SNP markers developed were then screened against cultivated tomato lines in order to estimate the transferability of these SNPs to other breeding materials. The molecular markers and linkage maps represent a milestone in the genomics and genetics, and are the first step toward molecular breeding of cultivated tomato. Information on the DNA markers, linkage maps, and SNP genotypes for these tomato lines is available at http://www.kazusa.or.jp/tomato/. PMID:21044984
Delta-amino-levulinic acid dehydratase gene and essential tremor.
Agúndez, José A G; García-Martín, Elena; Alonso-Navarro, Hortensia; Ayuso, Pedro; Esguevillas, Gara; Benito-León, Julián; Ortega-Cubero, Sara; Pastor, Pau; López-Alburquerque, Tomás; Jiménez-Jiménez, Félix Javier
2017-05-01
Several reports found a relationship between increased serum lead levels and the risk for essential tremor (ET), especially in carriers of the minor allele of the single nucleotide polymorphism (SNP) rs1800435 in the aminolevulinate dehydratase (ALAD) gene, which is involved in the synthesis of haem groups. Our group reported decreased risk for ET in carriers of the minor alleles of the rs2071746 and rs1051308 SNPs in the haem-oxygenases 1 and 2 (HMOX1 and HMOX2), respectively, involved in haem metabolism. We analysed whether ALAD rs1800435 alone and their interactions with the four common SNPs in the HMOX1 and HMOX2 genes are associated with the risk for ET. We analysed the genotype and allele variants frequencies of ALAD rs1800435 in 202 patients with familial ET and 218 healthy controls using a TaqMan method. We also analysed the role of the interaction between ALAD rs1800435 and the HMOX1 rs2071746, HMOX1 rs2071747, HMOX2 rs2270363 and HMOX2 rs1051308 with the risk of developing ET. The frequencies of genotype and allelic variants of ALAD rs1800435 did not differ significantly between patients with ET and controls, and were not influenced by gender. Subjects carrying the ALAD rs1800435CC genotype (wild-type) and the HMOX2 rs1051308GG genotype or the HMOX2 rs1051308G allele had significantly decreased risk for ET. These results suggest that the ALAD rs1800435 SNP is not related with the risk for ET, but its interaction with the HMOX2 rs1051308 SNP could be weakly associated with the risk for this disease. © 2017 Stichting European Society for Clinical Investigation Journal Foundation.
Matsumura, Takayoshi; Amiya, Eisuke; Tamura, Natsuko; Maejima, Yasuhiro; Komuro, Issei; Isobe, Mitsuaki
2016-06-01
Takayasu arteritis (TAK) is an acute and chronic vasculitis of unknown etiology. Recently, our group reported that SNP rs6871626 in the IL12B region had significant association with disease susceptibility to TAK. However, association of the SNP with clinical characteristics of TAK has yet to be determined. Therefore, we assessed whether this SNP was associated with TAK disease severity as represented by early onset and/or refractoriness to medical therapy. A total of 90 patients were genotyped for rs6871626 and their clinical charts were reviewed retrospectively. By examining the relationship between genotype and clinical profiles of patients, we found a strong association between the number of risk alleles and the frequency of severe cases as defined by (1) age at onset <20 years old, (2) steroid resistance, and/or (3) a relapse of disease [p = 0.03; odds ratio 3.75 (95 % confidence interval 1.13-13.5)]. Thus, our study points to potential diagnostic use of SNP rs6871626 for predicting disease severity of TAK, with the goal of genotyping-oriented therapy in the near future.
Tollenaere, Charlotte; Susi, Hanna; Nokso-Koivisto, Jussi; Koskinen, Patrik; Tack, Ayco; Auvinen, Petri; Paulin, Lars; Frilander, Mikko J.; Lehtonen, Rainer; Laine, Anna-Liisa
2012-01-01
Background Molecular tools may greatly improve our understanding of pathogen evolution and epidemiology but technical constraints have hindered the development of genetic resources for parasites compared to free-living organisms. This study aims at developing molecular tools for Podosphaera plantaginis, an obligate fungal pathogen of Plantago lanceolata. This interaction has been intensively studied in the Åland archipelago of Finland with epidemiological data collected from over 4,000 host populations annually since year 2001. Principal Findings A cDNA library of a pooled sample of fungal conidia was sequenced on the 454 GS-FLX platform. Over 549,411 reads were obtained and annotated into 45,245 contigs. Annotation data was acquired for 65.2% of the assembled sequences. The transcriptome assembly was screened for SNP loci, as well as for functionally important genes (mating-type genes and potential effector proteins). A genotyping assay of 27 SNP loci was designed and tested on 380 infected leaf samples from 80 populations within the Åland archipelago. With this panel we identified 85 multilocus genotypes (MLG) with uneven frequencies across the pathogen metapopulation. Approximately half of the sampled populations contain polymorphism. Our genotyping protocol revealed mixed-genotype infection within a single host leaf to be common. Mixed infection has been proposed as one of the main drivers of pathogen evolution, and hence may be an important process in this pathosystem. Significance The developed SNP panel offers exciting research perspectives for future studies in this well-characterized pathosystem. Also, the transcriptome provides an invaluable novel genomic resource for powdery mildews, which cause significant yield losses on commercially important crops annually. Furthermore, the features that render genetic studies in this system a challenge are shared with the majority of obligate parasitic species, and hence our results provide methodological insights from SNP calling to field sampling protocols for a wide range of biological systems. PMID:23300684
Taira, Chiaki; Matsuda, Kazuyuki; Yamaguchi, Akemi; Sueki, Akane; Koeda, Hiroshi; Takagi, Fumio; Kobayashi, Yukihiro; Sugano, Mitsutoshi; Honda, Takayuki
2013-09-23
Single nucleotide alterations such as single nucleotide polymorphisms (SNP) and single nucleotide mutations are associated with responses to drugs and predisposition to several diseases, and they contribute to the pathogenesis of malignancies. We developed a rapid genotyping assay based on the allele-specific polymerase chain reaction (AS-PCR) with our droplet-PCR machine (droplet-AS-PCR). Using 8 SNP loci, we evaluated the specificity and sensitivity of droplet-AS-PCR. Buccal cells were pretreated with proteinase K and subjected directly to the droplet-AS-PCR without DNA extraction. The genotypes determined using the droplet-AS-PCR were then compared with those obtained by direct sequencing. Specific PCR amplifications for the 8 SNP loci were detected, and the detection limit of the droplet-AS-PCR was found to be 0.1-5.0% by dilution experiments. Droplet-AS-PCR provided specific amplification when using buccal cells, and all the genotypes determined within 9 min were consistent with those obtained by direct sequencing. Our novel droplet-AS-PCR assay enabled high-speed amplification retaining specificity and sensitivity and provided ultra-rapid genotyping. Crude samples such as buccal cells were available for the droplet-AS-PCR assay, resulting in the reduction of the total analysis time. Droplet-AS-PCR may therefore be useful for genotyping or the detection of single nucleotide alterations. Copyright © 2013 Elsevier B.V. All rights reserved.
Wang, G; Liao, J; Tang, M; Yu, S
2018-02-01
1. Microphthalmia-associated transcription factor (MITF) plays a pivotal role in melanocyte development by regulating the transcription of major pigmentation enzymes (e.g. TYR, TYRP1 and DCT). A single-nucleotide polymorphism (SNP), c.-638T>C, was identified in the MITF promoter, and genotyping of a population (n = 426) revealed that SNP c.-638T>C was associated with skin colour in black-boned chickens. 2. Individuals with genotypes CC and TC exhibited greater MTIF expression than those with genotype TT. Luciferase assays also revealed that genotype CC and TC promoters had higher activity levels than genotype TT. Expression of melanogenesis-related gene (TYR) was higher in the skin of chickens with the CC and CT genotype compared to TT chickens (P < 0.05). 3. Transcription factor-binding site analyses showed that the c.-638C allele contains a putative binding site for transcription factor sterol regulatory element-binding transcription factor 2, aryl hydrocarbon receptor nuclear translocator, transcription factor binding to IGHM enhancer 3 and upstream transcription factor 2. In contrast, the c.-638T allele contains binding sites for Sp3 transcription factor and Krüppel-like factor 1. 4. It was concluded that MITF promoter polymorphisms affected chicken skin colour. SNP c.-638T>C could be used for the marker-assisted selection of skin colour in black-boned chicken breeding.
Whole genome amplification and real-time PCR in forensic casework
Giardina, Emiliano; Pietrangeli, Ilenia; Martone, Claudia; Zampatti, Stefania; Marsala, Patrizio; Gabriele, Luciano; Ricci, Omero; Solla, Gianluca; Asili, Paola; Arcudi, Giovanni; Spinella, Aldo; Novelli, Giuseppe
2009-01-01
Background WGA (Whole Genome Amplification) in forensic genetics can eliminate the technical limitations arising from low amounts of genomic DNA (gDNA). However, it has not been used to date because any amplification bias generated may complicate the interpretation of results. Our aim in this paper was to assess the applicability of MDA to forensic SNP genotyping by performing a comparative analysis of genomic and amplified DNA samples. A 26-SNPs TaqMan panel specifically designed for low copy number (LCN) and/or severely degraded genomic DNA was typed on 100 genomic as well as amplified DNA samples. Results Aliquots containing 1, 0.1 and 0.01 ng each of 100 DNA samples were typed for a 26-SNPs panel. Similar aliquots of the same DNA samples underwent multiple displacement amplification (MDA) before being typed for the same panel. Genomic DNA samples showed 0% PCR failure rate for all three dilutions, whilst the PCR failure rate of the amplified DNA samples was 0% for the 1 ng and 0.1 ng dilutions and 0.077% for the 0.01 ng dilution. The genotyping results of both the amplified and genomic DNA samples were also compared with reference genotypes of the same samples obtained by direct sequencing. The genomic DNA samples showed genotype concordance rates of 100% for all three dilutions while the concordance rates of the amplified DNA samples were 100% for the 1 ng and 0.1 ng dilutions and 99.923% for the 0.01 ng dilution. Moreover, ten artificially-degraded DNA samples, which gave no results when analyzed by current forensic methods, were also amplified by MDA and genotyped with 100% concordance. Conclusion We investigated the suitability of MDA material for forensic SNP typing. Comparative analysis of amplified and genomic DNA samples showed that a large number of SNPs could be accurately typed starting from just 0.01 ng of template. We found that the MDA genotyping call and accuracy rates were only slightly lower than those for genomic DNA. Indeed, when 10 pg of input DNA was used in MDA, we obtained 99.923% concordance, indicating a genotyping error rate of 1/1299 (7.7 × 10-4). This is quite similar to the genotyping error rate of STRs used in current forensic analysis. Such efficiency and accuracy of SNP typing of amplified DNA suggest that MDA can also generate large amounts of genome-equivalent DNA from a minimal amount of input DNA. These results show for the first time that MDA material is suitable for SNP-based forensic protocols and in general when samples fail to give interpretable STR results. PMID:19366436
Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo
2015-03-01
In the wheat (Triticum aestivum L.) cultivar 'Zenkoujikomugi', a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the 'Zenkoujikomugi'-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, 'Iwainodaichi' (Kyushu), 'Junreikomugi' (Kinki-Chugoku-Shikoku), 'Kinuhime' (Kanto-Tokai), 'Nebarigoshi' (Tohoku-Hokuriku), and 'Kitamoe' (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for 'Kitamoe', were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).
Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair
2014-02-06
Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.
Breeding and Genetics Symposium: networks and pathways to guide genomic selection.
Snelling, W M; Cushman, R A; Keele, J W; Maltecca, C; Thomas, M G; Fortes, M R S; Reverter, A
2013-02-01
Many traits affecting profitability and sustainability of meat, milk, and fiber production are polygenic, with no single gene having an overwhelming influence on observed variation. No knowledge of the specific genes controlling these traits has been needed to make substantial improvement through selection. Significant gains have been made through phenotypic selection enhanced by pedigree relationships and continually improving statistical methodology. Genomic selection, recently enabled by assays for dense SNP located throughout the genome, promises to increase selection accuracy and accelerate genetic improvement by emphasizing the SNP most strongly correlated to phenotype although the genes and sequence variants affecting phenotype remain largely unknown. These genomic predictions theoretically rely on linkage disequilibrium (LD) between genotyped SNP and unknown functional variants, but familial linkage may increase effectiveness when predicting individuals related to those in the training data. Genomic selection with functional SNP genotypes should be less reliant on LD patterns shared by training and target populations, possibly allowing robust prediction across unrelated populations. Although the specific variants causing polygenic variation may never be known with certainty, a number of tools and resources can be used to identify those most likely to affect phenotype. Associations of dense SNP genotypes with phenotype provide a 1-dimensional approach for identifying genes affecting specific traits; in contrast, associations with multiple traits allow defining networks of genes interacting to affect correlated traits. Such networks are especially compelling when corroborated by existing functional annotation and established molecular pathways. The SNP occurring within network genes, obtained from public databases or derived from genome and transcriptome sequences, may be classified according to expected effects on gene products. As illustrated by functionally informed genomic predictions being more accurate than naive whole-genome predictions of beef tenderness, coupling evidence from livestock genotypes, phenotypes, gene expression, and genomic variants with existing knowledge of gene functions and interactions may provide greater insight into the genes and genomic mechanisms affecting polygenic traits and facilitate functional genomic selection for economically important traits.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)
2014-01-01
Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230
Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice
2011-05-05
High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.
Use of direct and iterative solvers for estimation of SNP effects in genome-wide selection
2010-01-01
The aim of this study was to compare iterative and direct solvers for estimation of marker effects in genomic selection. One iterative and two direct methods were used: Gauss-Seidel with Residual Update, Cholesky Decomposition and Gentleman-Givens rotations. For resembling different scenarios with respect to number of markers and of genotyped animals, a simulated data set divided into 25 subsets was used. Number of markers ranged from 1,200 to 5,925 and number of animals ranged from 1,200 to 5,865. Methods were also applied to real data comprising 3081 individuals genotyped for 45181 SNPs. Results from simulated data showed that the iterative solver was substantially faster than direct methods for larger numbers of markers. Use of a direct solver may allow for computing (co)variances of SNP effects. When applied to real data, performance of the iterative method varied substantially, depending on the level of ill-conditioning of the coefficient matrix. From results with real data, Gentleman-Givens rotations would be the method of choice in this particular application as it provided an exact solution within a fairly reasonable time frame (less than two hours). It would indeed be the preferred method whenever computer resources allow its use. PMID:21637627
Common rs5918 (PlA1/A2) polymorphism in the ITGB3 gene and risk of coronary artery disease
Heidari, Mohammad Mehdi; Soheilyfar, Sorour
2016-01-01
Introduction The T to C transition at nucleotide 1565 of the human glycoprotein IIIa (ITGB3) gene represents a genetic polymorphism (PlA1/A2) that can influence both platelet activation and aggregation and that has been associated with many types of disease. Here, we present a newly designed multiplex tetra-primer amplification refractory mutation system – polymerase chain reaction (T-ARMS-PCR) for genotyping a single nucleotide polymorphism (SNP) (dbSNP ID: rs5918) in the human ITGB3 gene. Material and methods We set up T-ARMS-PCR for the rs5918 SNP in a single-step PCR and the results were validated by the PCR-RFLP method in 132 coronary artery disease (CAD) patients and 122 unrelated healthy individuals. Results Full accordance was found for genotype determination by the PCR-RFLP method. The multiple logistic regression analysis showed a significant association of the rs5918 polymorphism and CAD according to dominant and recessive models (dominant model OR: 2.40, 95% CI: 1.33–4.35; p = 0.003, recessive model OR: 4.71, 95% CI: 1.32–16.80; p = 0.0067). Conclusions Our T-ARMS-PCR in comparison with RFLP and allele-specific PCR is more advantageous because this PCR method allows the evaluation of both the wild type and the mutant allele in the same tube. Our results suggest that the rs5918 (PlA1/A2) polymorphism in the ITGB3 gene may contribute to the susceptibility of sporadic Iranian coronary artery disease (CAD) patients. PMID:28905013
Stram, Daniel O; Leigh Pearce, Celeste; Bretsky, Phillip; Freedman, Matthew; Hirschhorn, Joel N; Altshuler, David; Kolonel, Laurence N; Henderson, Brian E; Thomas, Duncan C
2003-01-01
The US National Cancer Institute has recently sponsored the formation of a Cohort Consortium (http://2002.cancer.gov/scpgenes.htm) to facilitate the pooling of data on very large numbers of people, concerning the effects of genes and environment on cancer incidence. One likely goal of these efforts will be generate a large population-based case-control series for which a number of candidate genes will be investigated using SNP haplotype as well as genotype analysis. The goal of this paper is to outline the issues involved in choosing a method of estimating haplotype-specific risk estimates for such data that is technically appropriate and yet attractive to epidemiologists who are already comfortable with odds ratios and logistic regression. Our interest is to develop and evaluate extensions of methods, based on haplotype imputation, that have been recently described (Schaid et al., Am J Hum Genet, 2002, and Zaykin et al., Hum Hered, 2002) as providing score tests of the null hypothesis of no effect of SNP haplotypes upon risk, which may be used for more complex tasks, such as providing confidence intervals, and tests of equivalence of haplotype-specific risks in two or more separate populations. In order to do so we (1) develop a cohort approach towards odds ratio analysis by expanding the E-M algorithm to provide maximum likelihood estimates of haplotype-specific odds ratios as well as genotype frequencies; (2) show how to correct the cohort approach, to give essentially unbiased estimates for population-based or nested case-control studies by incorporating the probability of selection as a case or control into the likelihood, based on a simplified model of case and control selection, and (3) finally, in an example data set (CYP17 and breast cancer, from the Multiethnic Cohort Study) we compare likelihood-based confidence interval estimates from the two methods with each other, and with the use of the single-imputation approach of Zaykin et al. applied under both null and alternative hypotheses. We conclude that so long as haplotypes are well predicted by SNP genotypes (we use the Rh2 criteria of Stram et al. [1]) the differences between the three methods are very small and in particular that the single imputation method may be expected to work extremely well. Copyright 2003 S. Karger AG, Basel
Harker, Mark; Carvell, Ann-Marie; Marti, Vernon P J; Riazanskaia, Svetlana; Kelso, Hailey; Taylor, David; Grimshaw, Sally; Arnold, David S; Zillmer, Ruediger; Shaw, Jane; Kirk, Jayne M; Alcasid, Zee M; Gonzales-Tanon, Sheila; Chan, Gertrude P; Rosing, Egge A E; Smith, Adrian M
2014-01-01
A single nucleotide polymorphism (SNP), 538G→A, leading to a G180R substitution in the ABCC11 gene results in reduced concentrations of apocrine derived axillary odour precursors. Determine the axillary odour levels in the SNP ABCC11 genotype variants and to investigate if other parameters associated with odour production are affected. Axillary odour was assessed by subjective quantification and gas chromatography headspace analysis. Metabolite profiles, microbiome diversity and personal hygiene habits were also assessed. Axillary odour in the A/A homozygotes was significantly lower compared to the G/A and G/G genotypes. However, the perception-based measures still detected appreciable levels of axillary odour in the A/A subjects. Metabolomic analysis highlighted significant differences in axillary skin metabolites between A/A subjects compared to those carrying the G allele. These differences resulted in A/A subjects lacking specific volatile odourants in the axillary headspace, but all genotypes produced odoriferous short chain fatty acids. Microbiomic analysis revealed differences in the relative abundance of key bacterial genera associated with odour generation between the different genotypes. Deodorant usage indicated a high level of self awareness of axillary odour levels with A/A individuals less likely to adopt personal hygiene habits designed to eradicate/mask its presence. The SNP in the ABCC11 gene results in lower levels of axillary odour in the A/A homozygotes compared to those carrying the G allele, but A/A subjects still produce noticeable amounts of axillary odour. Differences in axillary skin metabolites, bacterial genera and personal hygiene behaviours also appear to be influenced by this SNP. Copyright © 2013. Published by Elsevier Ireland Ltd.
Lutkowska, Anna; Roszak, Andrzej; Lianeri, Margarita; Sowińska, Anna; Sotiri, Emianka; Jagodziński, Pawel P
2017-04-01
We studied the role of the NC_000017.10:g.38051348A>G (rs8067378) single nucleotide polymorphism (SNP) located 9.5 kb downstream of gasdermin B (GSDMB), in the development and progression of cervical squamous cell carcinomas (SCC). Using high-resolution melting curve analysis, we genotyped this SNP in patients with cervical SCC (n = 486) and controls (n = 511) from the Polish Caucasian population. Logistic regression analysis was used to adjust for the effect of confounders such as age, parity, oral contraceptive use, tobacco smoking, and menopausal status. The effect of this SNP on the expression of GSDMB was studied by reverse transcription and quantitative real-time polymerase chain reaction analysis of GSDMB transcript levels in SCC tissues. For all patients with SCC, the p trend value calculated for rs8067378 was statistically significant (p trend = 0.0019). The adjusted odds ratio for the G/G vs. A/A genotype was 1.304 (95% confidence interval 1.080-1.574, p = 0.0057) and the adjusted odds ratio for the G/A + G/G vs. A/A genotype was 1.444 (95% confidence interval 1.064-1.959, p = 0.0181). We also found a significant association of the rs8067378 SNP with tumor stages III, IV, and grade of differentiation G3, and with parity, oral contraceptive use, smoking, and women of postmenopausal age. We found increased GSDMB1 isoform transcripts in the cancerous and non-cancerous tissues from carriers of the G allele vs. carriers of the A/A genotype. The rs8067378 SNP variants may increase the expression of GSDMB and the risk of the development and progression of cervical SCC.
Shu, Jing-Ting; Bao, Wen-Bin; Zhang, Hong-Xia; Zhang, Xue-Yu; Ji, Cong-Liang; Chen, Guo-Hong
2007-03-01
This study investigates single nucleotide polymorphism (SNP) of the adenylosuccinate lyase(ADSL) gene in variety chicken breeds, including Recessive White chickens, Silkies chickens, Baier chickens, Tibetan chickens and two red jungle fowls. Primers for exon 2 in ADSL gene were designed based on the chicken genomic sequence and a SNP(C/T at 3484) was detected by PCR-SSCP and DNA sequencing. Three genotypes within all breeds were found and least square analysis showed that TT genotype birds had a significant higher inosine monophosphate acid (IMP) content than TC (P < 0.01) and CC (P < 0.05) genotype birds, TC genotype birds had a little higher IMP content than CC genotype birds, but the difference was not significant. We proposed this SNP site correlated with IMP content in chickens. A neighbour-joining dendrogram was constructed based on the Nei's genentic distance. The genetic relationship between Chinese red jungle fowl and Tibetan chickens was the nearest, whereas Baier chickens were more closer to Silkies chickens. The Chinese red jungle fowls were relatively closer to the domestic fowls, whereas Thailand red jungle fowls were relatively diverging to the Chinese native breeds. These results supported the theory concerning the independent origins of Chinese native fowl breeds.
2013-01-01
Background Vitis vinifera L. is one of society’s most important agricultural crops with a broad genetic variability. The difficulty in recognizing grapevine genotypes based on ampelographic traits and secondary metabolites prompted the development of molecular markers suitable for achieving variety genetic identification. Findings Here, we propose a comparison between a multi-locus barcoding approach based on six chloroplast markers and a single-copy nuclear gene sequencing method using five coding regions combined with a character-based system with the aim of reconstructing cultivar-specific haplotypes and genotypes to be exploited for the molecular characterization of 157 V. vinifera accessions. The analysis of the chloroplast target regions proved the inadequacy of the DNA barcoding approach at the subspecies level, and hence further DNA genotyping analyses were targeted on the sequences of five nuclear single-copy genes amplified across all of the accessions. The sequencing of the coding region of the UFGT nuclear gene (UDP-glucose: flavonoid 3-0-glucosyltransferase, the key enzyme for the accumulation of anthocyanins in berry skins) enabled the discovery of discriminant SNPs (1/34 bp) and the reconstruction of 130 V. vinifera distinct genotypes. Most of the genotypes proved to be cultivar-specific, and only few genotypes were shared by more, although strictly related, cultivars. Conclusion On the whole, this technique was successful for inferring SNP-based genotypes of grapevine accessions suitable for assessing the genetic identity and ancestry of international cultivars and also useful for corroborating some hypotheses regarding the origin of local varieties, suggesting several issues of misidentification (synonymy/homonymy). PMID:24298902
Meuwissen, Theo H E; Indahl, Ulf G; Ødegård, Jørgen
2017-12-27
Non-linear Bayesian genomic prediction models such as BayesA/B/C/R involve iteration and mostly Markov chain Monte Carlo (MCMC) algorithms, which are computationally expensive, especially when whole-genome sequence (WGS) data are analyzed. Singular value decomposition (SVD) of the genotype matrix can facilitate genomic prediction in large datasets, and can be used to estimate marker effects and their prediction error variances (PEV) in a computationally efficient manner. Here, we developed, implemented, and evaluated a direct, non-iterative method for the estimation of marker effects for the BayesC genomic prediction model. The BayesC model assumes a priori that markers have normally distributed effects with probability [Formula: see text] and no effect with probability (1 - [Formula: see text]). Marker effects and their PEV are estimated by using SVD and the posterior probability of the marker having a non-zero effect is calculated. These posterior probabilities are used to obtain marker-specific effect variances, which are subsequently used to approximate BayesC estimates of marker effects in a linear model. A computer simulation study was conducted to compare alternative genomic prediction methods, where a single reference generation was used to estimate marker effects, which were subsequently used for 10 generations of forward prediction, for which accuracies were evaluated. SVD-based posterior probabilities of markers having non-zero effects were generally lower than MCMC-based posterior probabilities, but for some regions the opposite occurred, resulting in clear signals for QTL-rich regions. The accuracies of breeding values estimated using SVD- and MCMC-based BayesC analyses were similar across the 10 generations of forward prediction. For an intermediate number of generations (2 to 5) of forward prediction, accuracies obtained with the BayesC model tended to be slightly higher than accuracies obtained using the best linear unbiased prediction of SNP effects (SNP-BLUP model). When reducing marker density from WGS data to 30 K, SNP-BLUP tended to yield the highest accuracies, at least in the short term. Based on SVD of the genotype matrix, we developed a direct method for the calculation of BayesC estimates of marker effects. Although SVD- and MCMC-based marker effects differed slightly, their prediction accuracies were similar. Assuming that the SVD of the marker genotype matrix is already performed for other reasons (e.g. for SNP-BLUP), computation times for the BayesC predictions were comparable to those of SNP-BLUP.
Kucharczyk, Tomasz; Krawczyk, Paweł; Powrózek, Tomasz; Kowalski, Dariusz M; Ramlau, Rodryg; Kalinka-Warzocha, Ewa; Knetki-Wróblewska, Magdalena; Winiarczyk, Kinga; Krzakowski, Maciej; Milanowski, Janusz
2016-01-01
In NSCLC, second-line chemotherapy using pemetrexed or docetaxel has limited efficacy and should be dedicated to selected groups of patients. Pemetrexed is an antifolate compound with the ability to inhibit enzymes (TS, DHFR and GARFT) involved in pyrimidine and purine synthesis. The objective of this study was to evaluate the association between polymorphisms of TS and MHFR genes and clinical outcomes in NSCLC patients treated with pemetrexed monotherapy. DNA was isolated from peripheral blood of 72 non-squamous NSCLC patients treated with pemetrexed. Using PCR and RFLP methods, the variable number of tandem repeats (VNTR), the G > C SNP in these repeats and insertion/deletion polymorphism of TS gene as well as 677C > T SNP in MTHFR gene were analyzed and correlated with disease control rate, progression-free survival and overall survival (OS) of NSCLC patients. Carriers of 2R/3R(G), 3R(C)/3R(G), 3R(G)/3R(G) genotypes showed significantly more frequent early progression than carriers of 2R/2R, 2R/3R(C), 3R(C)/3R(C) genotypes of TS gene (p < 0.05). Among carriers of triple 28 bp tandem repeats (3R) in TS gene and C/C genotype of MTHFR gene a significantly shorter OS was observed (HR = 3.07; p = 0.003). In multivariate analysis, significantly higher risk of death was observed in carriers of both 3R/3R genotype in TS and C/C genotype in 677C > T SNP in MTHFR (HR = 3.85; p < 0.005) as well as in patients with short duration of response to first-line chemotherapy (HR = 2.09; p < 0.005). Results of our study suggested that genetic factors may have a high predictive and prognostic value (even greater than clinical factors) for patients treated with pemetrexed monotherapy.
Blanco-Marchite, Cristina; Sánchez-Sánchez, Francisco; López-Garrido, María-Pilar; Iñigez-de-Onzoño, Mercedes; López-Martínez, Francisco; López-Sánchez, Enrique; Alvarez, Lydia; Rodríguez-Calvo, Pedro-Pablo; Méndez-Hernández, Carmen; Fernández-Vega, Luis; García-Sánchez, Julián; Coca-Prados, Miguel; García-Feijoo, Julián
2011-01-01
Purpose. To investigate the role of WDR36 and P53 sequence variations in POAG susceptibility. Methods. The authors performed a case-control genetic association study in 268 unrelated Spanish patients (POAG1) and 380 control subjects matched for sex, age, and ethnicity. WDR36 sequence variations were screened by either direct DNA sequencing or denaturing high-performance liquid chromatography. P53 polymorphisms p.R72P and c.97–147ins16bp were analyzed by single-nucleotide polymorphism (SNP) genotyping and PCR, respectively. Positive SNP and haplotype associations were reanalyzed in a second sample of 211 patients and in combined cases (n = 479). Results. The authors identified almost 50 WDR36 sequence variations, of which approximately two-thirds were rare and one-third were polymorphisms. Approximately half the variants were novel. Eight patients (2.9%) carried rare mutations that were not identified in the control group (P = 0.001). Six Tag SNPs were expected to be structured in three common haplotypes. Haplotype H2 was consistently associated with the disease (P = 0.0024 in combined cases). According to a dominant model, genotypes containing allele P of the P53 p.R72P SNP slightly increased glaucoma risk. Glaucoma susceptibility associated with different WDR36 genotypes also increased significantly in combination with the P53 RP risk genotype, indicating the existence of a genetic interaction. For instance, the OR of the H2 diplotype estimated for POAG1 and combined cases rose approximately 1.6 times in the two-locus genotype H2/RP. Conclusions. Rare WDR36 variants and the P53 p.R72P polymorphism behaved as moderate glaucoma risk factors in Spanish patients. The authors provide evidence for a genetic interaction between WDR36 and P53 variants in POAG susceptibility, although this finding must be confirmed in other populations. PMID:21931130
USDA-ARS?s Scientific Manuscript database
Breeding and selection for the traits with polygenic inheritance is a challenging task that can be done by phenotypic selection, by marker-assisted selection or by genome wide selection. We tested predictive ability of four selection models in a biparental population genotyped with 95 SNP markers an...
Nandi, Shyam Sundar; Sharma, Deepa Kailash; Deshpande, Jagadish M
2016-07-01
It is important to understand the role of cell surface receptors in susceptibility to infectious diseases. CD155 a member of the immunoglobulin super family, serves as the poliovirus receptor (PVR). Heterozygous (Ala67Thr) polymorphism in CD155 has been suggested as a risk factor for paralytic outcome of poliovirus infection. The present study pertains to the development of a screening test to detect the single nucleotide (SNP) polymorphism in the CD155 gene. New primers were designed for PCR, sequencing and SNP analysis of Exon2 of CD155 gene. DNAs extracted from either whole blood (n=75) or cells from oral cavity (n=75) were used for standardization and validation of the SNP assay. DNA sequencing was used as the gold standard method. A new SNP assay for detection of heterozygous Ala67Thr genotype was developed and validated by testing 150 DNA samples. Heterozygous CD155 was detected in 27.33 per cent (41/150) of DNA samples tested by both SNP detection assay and sequencing. The SNP detection assay was successfully developed for identification of Ala67Thr polymorphism in human PVR/CD155 gene. The SNP assay will be useful for large scale screening of DNA samples.
Grossi, D A; Brito, L F; Jafarikia, M; Schenkel, F S; Feng, Z
2018-04-30
The uptake of genomic selection (GS) by the swine industry is still limited by the costs of genotyping. A feasible alternative to overcome this challenge is to genotype animals using an affordable low-density (LD) single nucleotide polymorphism (SNP) chip panel followed by accurate imputation to a high-density panel. Therefore, the main objective of this study was to screen incremental densities of LD panels in order to systematically identify one that balances the tradeoffs among imputation accuracy, prediction accuracy of genomic estimated breeding values (GEBVs), and genotype density (directly associated with genotyping costs). Genotypes using the Illumina Porcine60K BeadChip were available for 1378 Duroc (DU), 2361 Landrace (LA) and 3192 Yorkshire (YO) pigs. In addition, pseudo-phenotypes (de-regressed estimated breeding values) for five economically important traits were provided for the analysis. The reference population for genotyping imputation consisted of 931 DU, 1631 LA and 2103 YO animals and the remainder individuals were included in the validation population of each breed. A LD panel of 3000 evenly spaced SNPs (LD3K) yielded high imputation accuracy rates: 93.78% (DU), 97.07% (LA) and 97.00% (YO) and high correlations (>0.97) between the predicted GEBVs using the actual 60 K SNP genotypes and the imputed 60 K SNP genotypes for all traits and breeds. The imputation accuracy was influenced by the reference population size as well as the amount of parental genotype information available in the reference population. However, parental genotype information became less important when the LD panel had at least 3000 SNPs. The correlation of the GEBVs directly increased with an increase in imputation accuracy. When genotype information for both parents was available, a panel of 300 SNPs (imputed to 60 K) yielded GEBV predictions highly correlated (⩾0.90) with genomic predictions obtained based on the true 60 K panel, for all traits and breeds. For a small reference population size with no parents on reference population, it is recommended the use of a panel at least as dense as the LD3K and, when there are two parents in the reference population, a panel as small as the LD300 might be a feasible option. These findings are of great importance for the development of LD panels for swine in order to reduce genotyping costs, increase the uptake of GS and, therefore, optimize the profitability of the swine industry.
Miyake, Yoshihiro; Hitsumoto, Shinichi; Tanaka, Keiko; Arakawa, Masashi
2015-08-01
We examined the association between thymic stromal lymphopoietin (TSLP) single nucleotide polymorphisms (SNPs) and eczema in young adult Japanese women. Cases were 188 women who met the criteria of the International Study of Asthma and Allergies in Childhood (ISAAC) for eczema. Controls were 565 women without eczema according to the ISAAC criteria, who had not been diagnosed with asthma, atopic eczema, and/or allergic rhinitis by a doctor and who had no asthma as defined by the European Community Respiratory Health Survey criteria and no rhinoconjunctivitis according to the ISAAC criteria. Compared with women with the TT genotype of SNP rs1837253, those with the TC or CC genotype had a significantly increased risk of eczema after adjustment for age and smoking, although this association was not significant in crude analysis. There were no relationships between SNP rs3806933 or rs2289276 and eczema. The TC and CC genotypes combined of SNP rs1837253 may be significantly positively associated with eczema.
Jiménez-Jiménez, Félix Javier; García-Martín, Elena; Alonso-Navarro, Hortensia; Martínez, Carmen; Zurdo, Martín; Turpín-Fenoll, Laura; Millán-Pascual, Jorge; Adeva-Bartolomé, Teresa; Cubo, Esther; Navacerrada, Francisco; Rojo-Sebastián, Ana; Rubio, Lluisa; Ortega-Cubero, Sara; Pastor, Pau; Calleja, Marisol; Plaza-Nieto, José Francisco; Pilo-de-la-Fuente, Belén; Arroyo-Solera, Margarita; García-Albea, Esteban; Agúndez, José A G
2017-03-01
A recent meta-analysis suggests an association between the rs11558538 single nucleotide polymorphism in the histamine-N-methyl-transferase (HNMT) gene and the risk for Parkinson's disease. Based on the possible relationship between PD and restless legs syndrome (RLS), we tried to establish whether rs11558538 SNP is associated with the risk for RLS. We studied the genotype and allelic variant frequencies of HNMT rs11558538 SNP 205 RLS patients and 410 healthy controls using a TaqMan assay. The frequencies of the HNMT rs11558538 genotypes allelic variants were similar between RLS patients and controls, and were not influenced by gender, family history of RLS, or RLS severity. RLS patients carrying the genotype rs11558538TT had an earlier age at onset, but this finding was based on three subjects only. These results suggest a lack of major association between HNMT rs11558538 SNP and the risk for RLS.
Yoon, Ho-Kyoung; Kim, Yong-Ku
2009-04-30
Serotonergic system-related genes can be good candidate genes for both major depressive disorder (MDD) and suicidal behavior. In this study, we aimed to investigate the association of serotonin 2A receptor gene -1438A/G SNP (HTR2A -1438A/G), tryptophan hydroxylase 2 gene -703G/T SNP (TPH2 -703G/T) and serotonin 1A receptor C-1019G (HTR1A C-1019G) with suicidal behavior. One hundred and eighty one suicidal depressed patients and 143 non-suicidal depressed patients who met DSM-IV criteria for major depressive disorder were recruited from patients who were admitted to Korea University Ansan Hospital. One hundred seventy six normal controls were healthy volunteers who were recruited by local advertisement. Patients and normal controls were genotyped for HTR2A -1438A/G, TPH2 -703G/T and 5-HT1A C-1019G. The suicidal depressed patients were evaluated by the lethality of individual suicide attempts using Weisman and Worden's risk-rescue rating (RRR) and the Lethality Suicide Attempt Rating Scale-updated (LSARS-II). In order to assess the severity of depressive symptoms of patients, Hamilton's Depression Rating Scale (HDRS) was administered. Genotype and allele frequencies were compared between groups by chi(2) statistics. Association of genotype of the candidate genes with the lethality of suicidal behavior was examined with ANOVA by comparing the mean scores of LSARS and RRR according to the genotype. There were statistically significant differences in the genotype distributions and allele frequencies of TPH2 -703G/T between the suicidal depressive group and the normal control group. The homozygous allele G (G/G genotype) frequency was significantly higher in suicidal depressed patients than in controls. However, no differences in either genotype distribution or in allele frequencies of HTR2A -1438A/G and HTR1A C-1019G were observed between the suicidal depressed patients, the non-suicidal depressed patients, and the normal controls. There were no differences in the lethality of suicidal behavior in suicidal depressed patients according to the genotypes of three polymorphisms. Our results suggest that TPH2 -703G/T SNP may have an important effect on susceptibility to suicidal behavior. Furthermore, an increased frequency of G allele of TPH2 SNP may be associated with elevated suicidal behavior itself rather than with the diagnosis of major depression and may increase risk of suicidality, independent of diagnosis.
2011-01-01
Background Enterococcus faecalis and Enterococcus faecium are associated with faecal pollution of water, linked to swimmer-associated gastroenteritis and demonstrate a wide range of antibiotic resistance. The Coomera River is a main water source for the Pimpama-Coomera watershed and is located in South East Queensland, Australia, which is used intensively for agriculture and recreational purposes. This study investigated the diversity of E. faecalis and E. faecium using Single Nucleotide Polymorphisms (SNPs) and associated antibiotic resistance profiles. Results Total enterococcal counts (cfu/ml) for three/six sampling sites were above the United States Environmental Protection Agency (USEPA) recommended level during rainfall periods and fall into categories B and C of the Australian National Health and Medical Research Council (NHMRC) guidelines (with a 1-10% gastrointestinal illness risk). E. faecalis and E. faecium isolates were grouped into 29 and 23 SNP profiles (validated by MLST analysis) respectively. This study showed the high diversity of E. faecalis and E. faecium over a period of two years and both human-related and human-specific SNP profiles were identified. 81.8% of E. faecalis and 70.21% of E. faecium SNP profiles were associated with genotypic and phenotypic antibiotic resistance. Gentamicin resistance was higher in E. faecalis (47% resistant) and harboured the aac(6')-aph(2') gene. Ciprofloxacin resistance was more common in E. faecium (12.7% resistant) and gyrA gene mutations were detected in these isolates. Tetracycline resistance was less common in both species while tet(L) and tet(M) genes were more prevalent. Ampicillin resistance was only found in E. faecium isolates with mutations in the pbp5 gene. Vancomycin resistance was not detected in any of the isolates. We found that antibiotic resistance profiles further sub-divided the SNP profiles of both E. faecalis and E. faecium. Conclusions The distribution of E. faecalis and E. faecium genotypes is highly diverse in the Coomera River. The SNP genotyping method is rapid and robust and can be applied to study the diversity of E. faecalis and E. faecium in waterways. It can also be used to test for human-related and human-specific enterococci in water. The resolving power can be increased by including antibiotic-resistant profiles which can be used as a possible source tracking tool. This warrants further investigation. PMID:21910889
Stephen J. Amish,; Paul A. Hohenlohe,; Sally Painter,; Robb F. Leary,; Muhlfeld, Clint C.; Fred W. Allendorf,; Luikart, Gordon
2012-01-01
Hybridization with introduced rainbow trout threatens most native westslope cutthroat trout populations. Understanding the genetic effects of hybridization and introgression requires a large set of high-throughput, diagnostic genetic markers to inform conservation and management. Recently, we identified several thousand candidate single-nucleotide polymorphism (SNP) markers based on RAD sequencing of 11 westslope cutthroat trout and 13 rainbow trout individuals. Here, we used flanking sequence for 56 of these candidate SNP markers to design high-throughput genotyping assays. We validated the assays on a total of 92 individuals from 22 populations and seven hatchery strains. Forty-six assays (82%) amplified consistently and allowed easy identification of westslope cutthroat and rainbow trout alleles as well as heterozygote controls. The 46 SNPs will provide high power for early detection of population admixture and improved identification of hybrid and nonhybridized individuals. This technique shows promise as a very low-cost, reliable and relatively rapid method for developing and testing SNP markers for nonmodel organisms with limited genomic resources.
Chen, Ying; Zhang, Zhijun; Xu, Zhi; Pu, Mengjia; Geng, Leiyu
2015-12-01
To explore the influence of interleukin-1 beta (IL1B) gene polymorphism and childhood maltreatment on antidepressant treatment. Two hundred and four patients with major depressive disorder (MDD) have received treatment with single antidepressant drugs and were followed up for 8 weeks. Hamilton depression scale-17 (HAMD-17) was used to evaluate the severity of depressive symptoms and therapeutic effect. Childhood maltreatment was assessed using Childhood Trauma Questionnaire, a 28-item Short Form (CTQ-SF). Single nucleotide polymorphism (SNP) of the IL1B gene was determined using a SNaPshot method. Correlation of rs16944 gene polymorphism with response to treatment was analyzed using Unphased 3.0.13 software. The main and interactive effects of SNP and childhood maltreatment on the antidepressant treatment were analyzed using Logistic regression analysis. No significant difference of gender, age, year of education, family history, episode time, and antidepressant agents was detected between the remitters and non-remitters. Association analysis has found that the SNP rs16944 in the IL1B AA genotype carriers antidepressant response was poorer (χ2=3.931, P=0.047). No significant difference was detected in the CTQ scores between the two groups. Genetic and environmental interaction analysis has demonstrated a significant correlation between rs16944 AA genotype and childhood maltreatment and poorer response to antidepressant treatment. The SNP rs16944 in the IL1B gene and its interaction with childhood maltreatment may influence the effect of antidepressant treatment for patients with MDD.
Schweighofer, Carmen D.; Coombes, Kevin R.; Majewski, Tadeusz; Barron, Lynn L.; Lerner, Susan; Sargent, Rachel L.; O'Brien, Susan; Ferrajoli, Alessandra; Wierda, William G.; Czerniak, Bogdan A.; Medeiros, L. Jeffrey; Keating, Michael J.; Abruzzo, Lynne V.
2013-01-01
Genomic abnormalities, such as deletions in 11q22 or 17p13, are associated with poorer prognosis in patients with chronic lymphocytic leukemia (CLL). We hypothesized that unknown regions of copy number variation (CNV) affect clinical outcome and can be detected by array-based single-nucleotide polymorphism (SNP) genotyping. We compared SNP genotypes from 168 untreated patients with CLL with genotypes from 73 white HapMap controls. We identified 322 regions of recurrent CNV, 82 of which occurred significantly more often in CLL than in HapMap (CLL-specific CNV), including regions typically aberrant in CLL: deletions in 6q21, 11q22, 13q14, and 17p13 and trisomy 12. In univariate analyses, 35 of total and 11 of CLL-specific CNVs were associated with unfavorable time-to-event outcomes, including gains or losses in chromosomes 2p, 4p, 4q, 6p, 6q, 7q, 11p, 11q, and 17p. In multivariate analyses, six CNVs (ie, CLL-specific variations in 11p15.1-15.4 or 6q27) predicted time-to-treatment or overall survival independently of established markers of prognosis. Moreover, genotypic complexity (ie, the number of independent CNVs per patient) significantly predicted prognosis, with a median time-to-treatment of 64 months versus 23 months in patients with zero to one versus two or more CNVs, respectively (P = 3.3 × 10−8). In summary, a comparison of SNP genotypes from patients with CLL with HapMap controls allowed us to identify known and unknown recurrent CNVs and to determine regions and rates of CNV that predict poorer prognosis in patients with CLL. PMID:23273604
Jun, Gyungah; Naj, Adam C.; Beecham, Gary W.; Wang, Li-San; Buros, Jacqueline; Gallins, Paul J.; Buxbaum, Joseph D.; Ertekin-Taner, Nilufer; Fallin, M. Daniele; Friedland, Robert; Inzelberg, Rivka; Kramer, Patricia; Rogaeva, Ekaterina; St George-Hyslop, Peter; Cantwell, Laura B.; Dombroski, Beth A.; Saykin, Andrew J.; Reiman, Eric M.; Bennett, David A.; Morris, John C.; Lunetta, Kathryn L.; Martin, Eden R.; Montine, Thomas J.; Goate, Alison M.; Blacker, Deborah; Tsuang, Debby W.; Beekly, Duane; Cupples, L. Adrienne; Hakonarson, Hakon; Kukull, Walter; Foroud, Tatiana M.; Haines, Jonathan; Mayeux, Richard; Farrer, Lindsay A.; Pericak-Vance, Margaret A.; Schellenberg, Gerard D.
2011-01-01
Objectives To determine whether genotypes at CLU, PICALM, and CR1 confer risk for Alzheimer’s disease (AD) and whether risk for AD associated with these genes is influenced by APOE genotypes. Design Association study of AD and CLU, PICALM, CR1 and APOE genotypes. Setting Academic research institutions in the United States, Canada, and Israel. Participants 7,070 AD cases, 3,055 with autopsies, and 8,169 elderly cognitively normal controls, 1,092 with autopsies from 12 different studies, including Caucasians, African Americans, Israeli-Arabs, and Caribbean Hispanics. Results Unadjusted, CLU [odds ratio (OR) = 0.91, 95% confidence interval (CI) = 0.85 – 0.96 for single nucleotide polymorphism (SNP) rs11136000], CR1 (OR = 1.14, CI = 1.07 – 1.22, SNP rs3818361), and PICALM (OR = 0.89, CI = 0.84 – 0.94, SNP rs3851179) were associated with AD in Caucasians. None were significantly associated with AD in the other ethnic groups. APOE ε4 was significantly associated with AD (ORs from 1.80 to 9.05) in all but one small Caucasian cohort and in the Arab cohort. Adjusting for age, sex, and the presence of at least one APOE ε4 allele greatly reduced evidence for association with PICALM but not CR1 or CLU. Models with the main SNP effect, APOE ε4 (+/−), and an interaction term showed significant interaction between APOE ε4 (+/−) and PICALM. Conclusions We confirm in a completely independent dataset that CR1, CLU, and PICALM are AD susceptibility loci in European ancestry populations. Genotypes at PICALM confer risk predominantly in APOE ε4-positive subject. Thus, APOE and PICALM synergistically interact. PMID:20697030
Wu, Dong-Feng; Yin, Rui-Xing; Cao, Xiao-Li; Huang, Feng; Wu, Jin-Zhen; Chen, Wu-Xian
2016-04-08
This study aimed to detect the association of the MADD-FOLH1 single nucleotide polymorphisms (SNPs) and their haplotypes with the risk of coronary heart disease (CHD) and ischemic stroke (IS) in a Chinese Han population. Six SNPs of rs7395662, rs326214, rs326217, rs1051006, rs3736101, and rs7120118 were genotyped in 584 CHD and 555 IS patients, and 596 healthy controls. The genotypic and allelic frequencies of the rs7395662 SNP were different between controls and patients, and the genotypes of the rs7395662 SNP were associated with the risk of CHD and IS in different genetic models. Six main haplotypes among the rs1051006, rs326214, rs326217, rs3736101, and rs7120118 SNPs were detected in our study population, the haplotypes of G-G-T-G-C and G-A-T-G-T were associated with an increased risk of CHD and IS, respectively. The subjects with rs7395662GG genotype in controls had higher triglyceride (TG) and lower high-density lipoprotein cholesterol (HDL-C) levels than the subjects with AA/AG genotypes. Several SNPs interacted with alcohol consumption to influence serum TG (rs326214, rs326217, and rs7120118) and HDL-C (rs7395662) levels. The SNP of rs3736101 interacted with cigarette smoking to modify serum HDL-C levels. The SNP of rs1051006 interacted with body mass index ≥24 kg/m² to modulate serum low-density lipoprotein cholesterol levels. The interactions of several haplotypes and alcohol consumption on the risk of CHD and IS were also observed.
Marvalim, Charlie; Wong, Jing Xiang Gimson; Sutiman, Natalia; Lim, Wan Teck; Tan, Shao Weng; Kanesvaran, Ravindran; Ng, Quan Sing; Jain, Amit; Ang, Mei Kim; Tan, Wan Ling; Toh, Chee Keong; Tan, Eng Huat; Chowbay, Balram
2017-03-01
The critical role of lysine demethylase 4A (KDM4A), in regulating chromatin structure and consequently in driving cellular proliferation and oncogenesis has been the focus of recent studies. Non-small-cell lung cancer (NSCLC) patients with adenocarcinoma histology who were homozygous for KDM4A single nucleotide polymorphism (SNP)-A482 (rs586339) were recently shown to have significantly worse overall survival (OS) compared with patients with the wild-type or the heterozygous genotype at this locus (hazard ratio=1.68, P=0.042). In the current study, we investigated the association between the same polymorphism with OS in our Asian NSCLC-adenocarcinoma patients comprising Chinese (N=572), Malays (N=50), and Indians (N=22). KDM4A SNP-A482 genotype status was determined by Sanger sequencing. OS was calculated from the date of diagnosis to date of death or censored at the date of last follow-up. Kaplan-Meier analysis, log-rank test, and Cox regression methods were utilized to evaluate OS outcomes. KDM4A SNP-A482 had a minor allele (C) frequency of 18.8% and a major allele (A) frequency of 81.2% in our Asian NSCLC (adenocarcinoma) patients. However, the OS in our Asian NSCLC patients homozygous for KDM4A SNP-A482 was not significantly different from those who were wild type or heterozygous at this locus [CC vs. AA/AC: median OS (95% confidence interval): 40.2 (18.7-61.6) vs. 29.6 (26.9-32.3) months; P=0.858]. The results remained statistically nonsignificant even after adjustment for epidermal growth factor receptor mutational status, suggesting that KDM4A SNP-A482 does not significantly influence OS in Asian NSCLC patients.
Diverse Genotypes of Yersinia pestis Caused Plague in Madagascar in 2007.
Riehm, Julia M; Projahn, Michaela; Vogler, Amy J; Rajerison, Minoaerisoa; Andersen, Genevieve; Hall, Carina M; Zimmermann, Thomas; Soanandrasana, Rahelinirina; Andrianaivoarimanana, Voahangy; Straubinger, Reinhard K; Nottingham, Roxanne; Keim, Paul; Wagner, David M; Scholz, Holger C
2015-06-01
Yersinia pestis is the causative agent of human plague and is endemic in various African, Asian and American countries. In Madagascar, the disease represents a significant public health problem with hundreds of human cases a year. Unfortunately, poor infrastructure makes outbreak investigations challenging. DNA was extracted directly from 93 clinical samples from patients with a clinical diagnosis of plague in Madagascar in 2007. The extracted DNAs were then genotyped using three molecular genotyping methods, including, single nucleotide polymorphism (SNP) typing, multi-locus variable-number tandem repeat analysis (MLVA), and Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) analysis. These methods provided increasing resolution, respectively. The results of these analyses revealed that, in 2007, ten molecular groups, two newly described here and eight previously identified, were responsible for causing human plague in geographically distinct areas of Madagascar. Plague in Madagascar is caused by numerous distinct types of Y. pestis. Genotyping method choice should be based upon the discriminatory power needed, expense, and available data for any desired comparisons. We conclude that genotyping should be a standard tool used in epidemiological investigations of plague outbreaks.
Wolf, Christiane; Angelberger, Marianne; Diegelmann, Julia; Olszak, Torsten; Beigel, Florian; Tillack, Cornelia; Stallhofer, Johannes; Göke, Burkhard; Glas, Jürgen; Lohse, Peter; Brand, Stephan
2014-01-01
Background Very recently, a sub-analysis of genome-wide association scans revealed that the non-coding single nucleotide polymorphism (SNP) rs12212067 in the FOXO3A gene is associated with a milder course of Crohn's disease (CD) (Cell 2013;155:57–69). The aim of our study was to evaluate the clinical value of the SNP rs12212067 in predicting the severity of CD by correlating CD patient genotype status with the most relevant complications of CD such as stenoses, fistulas, and CD-related surgery. Methodology/Principal Findings We genotyped 550 CD patients for rs12212067 (FOXO3A) and the three common CD-associated NOD2 mutations rs2066844, rs2066847, and rs2066847 and performed genotype-phenotype analyses. Results No significant phenotypic differences were found between the wild-type genotype TT of the FOXO3A SNP rs12212067 and the minor genotypes TG and GG independently from NOD2 variants. The allele frequency of the minor G allele was 12.7%. Age at diagnosis, disease duration, body mass index, surgery rate, stenoses, fistula, need for immunosuppressive therapy, and disease course were not significantly different. In contrast, the NOD2 mutant p.Leu1007fsX1008 (rs2066847) was highly associated with penetrating CD (p = 0.01), the development of fistulas (p = 0.01) and stenoses (p = 0.01), and ileal disease localization (p = 0.03). Importantly, the NOD2 SNP rs2066847 was a strong separator between an aggressive and a mild course of CD (p = 2.99×10−5), while the FOXO3A SNP rs12212067 did not separate between mild and aggressive CD behavior in our cohort (p = 0.35). 96.2% of the homozygous NOD2 p.Leu1007fsX1008 carriers had an aggressive disease behavior compared to 69.3% of the patients with the NOD2 wild-type genotype (p = 0.007). Conclusion/Significance In clinical practice, the NOD2 variant p.Leu1007fsX1008 (rs2066847), in particular in homozygous form, is a much stronger marker for a severe clinical phenotype than the FOXO3A rs12212067 SNP for a mild disease course on an individual patient level despite its important impact on the inflammatory response of monocytes. PMID:25365249
Liu, Yi-Chang; Hsiao, Hui-Hua; Yang, Wen-Chi; Liu, Ta-Chih; Chang, Chao-Sung; Yang, Ming-Yu; Lin, Pai-Mei; Hsu, Jui-Feng; Lee, Ching-Ping; Lin, Sheng-Fung
2014-12-01
The genetic or functional inactivation of the p53 pathway plays an important role with regards to disease progression from the chronic phase (CP) to blast phase (BP) and imatinib treatment response in chronic myeloid leukemia (CML). Two functional single nucleotide polymorphisms (SNPs), p53 R72P and MDM2 SNP309, are associated with alternation of p53 activity, however the association regarding CML susceptibility and BP transformation under imatinib treatment is unclear. The MDM2 SNP309 genotype was determined by polymerase chain reaction-restriction fragment length polymorphism and confirmed by direct sequencing from 116 CML patients, including 104 in the CP at diagnosis, and 162 healthy Taiwanese controls. The p53 R72P polymorphism was examined in all CML patients. The SNP309 G/G genotype was associated with an increased risk of CML susceptibility (OR: 1.82, 95% CI: 1.03-3.22, P = 0.037), and an earlier age of disease onset (log-rank P = 0.005) compared with the T/T + T/G genotypes. Higher MDM2 mRNA expression was found in G/G genotype compared with T/T (P = 0.034) and T/T + T/G (P = 0.056) genotypes. No associations were found between the p53 R72P genotypes and clinical parameters and survival outcomes. Among 62 CP patients receiving imatinib as first-line therapy, the G/G genotype was associated with a shorter blast-free survival (log-rank P = 0.048) and more clonal evolution compared with the T/T + T/G genotypes. In patients with advanced diseases at diagnosis, the G/G genotype was associated with a poor overall survival (log-rank P = 0.006). Closely monitoring CML patients harboring the G/G genotype and further large-scale studies are warranted. © 2013 Wiley Periodicals, Inc.
Badke, Yvonne M; Bates, Ronald O; Ernst, Catherine W; Fix, Justin; Steibel, Juan P
2014-04-16
Genomic selection has the potential to increase genetic progress. Genotype imputation of high-density single-nucleotide polymorphism (SNP) genotypes can improve the cost efficiency of genomic breeding value (GEBV) prediction for pig breeding. Consequently, the objectives of this work were to: (1) estimate accuracy of genomic evaluation and GEBV for three traits in a Yorkshire population and (2) quantify the loss of accuracy of genomic evaluation and GEBV when genotypes were imputed under two scenarios: a high-cost, high-accuracy scenario in which only selection candidates were imputed from a low-density platform and a low-cost, low-accuracy scenario in which all animals were imputed using a small reference panel of haplotypes. Phenotypes and genotypes obtained with the PorcineSNP60 BeadChip were available for 983 Yorkshire boars. Genotypes of selection candidates were masked and imputed using tagSNP in the GeneSeek Genomic Profiler (10K). Imputation was performed with BEAGLE using 128 or 1800 haplotypes as reference panels. GEBV were obtained through an animal-centric ridge regression model using de-regressed breeding values as response variables. Accuracy of genomic evaluation was estimated as the correlation between estimated breeding values and GEBV in a 10-fold cross validation design. Accuracy of genomic evaluation using observed genotypes was high for all traits (0.65-0.68). Using genotypes imputed from a large reference panel (accuracy: R(2) = 0.95) for genomic evaluation did not significantly decrease accuracy, whereas a scenario with genotypes imputed from a small reference panel (R(2) = 0.88) did show a significant decrease in accuracy. Genomic evaluation based on imputed genotypes in selection candidates can be implemented at a fraction of the cost of a genomic evaluation using observed genotypes and still yield virtually the same accuracy. On the other side, using a very small reference panel of haplotypes to impute training animals and candidates for selection results in lower accuracy of genomic evaluation.
Novel and efficient tag SNPs selection algorithms.
Chen, Wen-Pei; Hung, Che-Lun; Tsai, Suh-Jen Jane; Lin, Yaw-Ling
2014-01-01
SNPs are the most abundant forms of genetic variations amongst species; the association studies between complex diseases and SNPs or haplotypes have received great attention. However, these studies are restricted by the cost of genotyping all SNPs; thus, it is necessary to find smaller subsets, or tag SNPs, representing the rest of the SNPs. In fact, the existing tag SNP selection algorithms are notoriously time-consuming. An efficient algorithm for tag SNP selection was presented, which was applied to analyze the HapMap YRI data. The experimental results show that the proposed algorithm can achieve better performance than the existing tag SNP selection algorithms; in most cases, this proposed algorithm is at least ten times faster than the existing methods. In many cases, when the redundant ratio of the block is high, the proposed algorithm can even be thousands times faster than the previously known methods. Tools and web services for haplotype block analysis integrated by hadoop MapReduce framework are also developed using the proposed algorithm as computation kernels.
Discovery of 100K SNP array and its utilization in sugarcane
USDA-ARS?s Scientific Manuscript database
Next generation sequencing (NGS) enable us to identify thousands of single nucleotide polymorphisms (SNPs) marker for genotyping and fingerprinting. However, the process requires very precise bioinformatics analysis and filtering process. High throughput SNP array with predefined genomic location co...
Guo, Xi; Geng, Peng; Wang, Quan; Cao, Boyang; Liu, Bin
2014-10-01
Severe acute respiratory syndrome (SARS), a disease that spread widely in the world during late 2002 to 2004, severely threatened public health. Although there have been no reported infections since 2004, the extremely pathogenic SARS coronavirus (SARS-CoV), as the causative agent of SARS, has recently been identified in animals, showing the potential for the re-emergence of this disease. Previous studies showed that 27 single nucleotide polymorphism (SNP) mutations among the spike (S) gene of this virus are correlated closely with the SARS pathogenicity and epidemicity. We have developed a SNP DNA microarray in order to detect and genotype these SNPs, and to obtain related information on the pathogenicity and epidemicity of a given strain. The microarray was hybridized with PCR products amplified from cDNAs obtained from different SARS-CoV strains. We were able to detect 24 SNPs and determine the type of a given strain. The hybridization profile showed that 19 samples were detected and genotyped correctly by using our microarray, with 100% accuracy. Our microarray provides a novel method for the detection and epidemiological surveillance of SARS-CoV.
Sawayama, Eitaro; Noguchi, Daiki; Nakayama, Kei; Takagi, Motohiro
2018-03-23
We previously reported a body color deformity in juvenile red sea bream, which shows transparency in the juvenile stage because of delayed chromatophore development compared with normal individuals, and this finding suggested a genetic cause based on parentage assessments. To conduct marker-assisted selection to eliminate broodstock inheriting the causative gene, developing DNA markers associated with the phenotype was needed. We first conducted SNP mining based on AFLP analysis using bulked-DNA from normal and transparent individuals. One SNP was identified from a transparent-specific AFLP fragment, which significantly associated with transparent individuals. Two alleles (A/G) were observed in this locus, and the genotype G/G was dominantly observed in the transparent groups (97.1%) collected from several production lots produced from different broodstock populations. A few normal individuals inherited the G/G genotype (5.0%), but the A/A and A/G genotypes were dominantly observed in the normal groups. The homologs region of the SNP was searched using a medaka genome database, and intron 12 of the Nell2a gene (located on chromosome 6 of the medaka genome) was highly matched. We also mapped the red sea bream Nell2a gene on the previously developed linkage maps, and this gene was mapped on a male linkage group, LG4-M. The newly found SNP was useful in eliminating broodstock possessing the causative gene of the body color transparency observed in juvenile stage of red sea bream.
Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo
2015-01-01
In the wheat (Triticum aestivum L.) cultivar ‘Zenkoujikomugi’, a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the ‘Zenkoujikomugi’-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, ‘Iwainodaichi’ (Kyushu), ‘Junreikomugi’ (Kinki-Chugoku-Shikoku), ‘Kinuhime’ (Kanto-Tokai), ‘Nebarigoshi’ (Tohoku-Hokuriku), and ‘Kitamoe’ (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for ‘Kitamoe’, were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region. PMID:25931984
Combined array CGH plus SNP genome analyses in a single assay for optimized clinical testing
Wiszniewska, Joanna; Bi, Weimin; Shaw, Chad; Stankiewicz, Pawel; Kang, Sung-Hae L; Pursley, Amber N; Lalani, Seema; Hixson, Patricia; Gambin, Tomasz; Tsai, Chun-hui; Bock, Hans-Georg; Descartes, Maria; Probst, Frank J; Scaglia, Fernando; Beaudet, Arthur L; Lupski, James R; Eng, Christine; Wai Cheung, Sau; Bacino, Carlos; Patel, Ankita
2014-01-01
In clinical diagnostics, both array comparative genomic hybridization (array CGH) and single nucleotide polymorphism (SNP) genotyping have proven to be powerful genomic technologies utilized for the evaluation of developmental delay, multiple congenital anomalies, and neuropsychiatric disorders. Differences in the ability to resolve genomic changes between these arrays may constitute an implementation challenge for clinicians: which platform (SNP vs array CGH) might best detect the underlying genetic cause for the disease in the patient? While only SNP arrays enable the detection of copy number neutral regions of absence of heterozygosity (AOH), they have limited ability to detect single-exon copy number variants (CNVs) due to the distribution of SNPs across the genome. To provide comprehensive clinical testing for both CNVs and copy-neutral AOH, we enhanced our custom-designed high-resolution oligonucleotide array that has exon-targeted coverage of 1860 genes with 60 000 SNP probes, referred to as Chromosomal Microarray Analysis – Comprehensive (CMA-COMP). Of the 3240 cases evaluated by this array, clinically significant CNVs were detected in 445 cases including 21 cases with exonic events. In addition, 162 cases (5.0%) showed at least one AOH region >10 Mb. We demonstrate that even though this array has a lower density of SNP probes than other commercially available SNP arrays, it reliably detected AOH events >10 Mb as well as exonic CNVs beyond the detection limitations of SNP genotyping. Thus, combining SNP probes and exon-targeted array CGH into one platform provides clinically useful genetic screening in an efficient manner. PMID:23695279
Lattka, Eva; Koletzko, Berthold; Zeilinger, Sonja; Hibbeln, Joseph R.; Klopp, Norman; Ring, Susan M.; Steer, Colin D.
2012-01-01
Fetal supply with long-chain PUFA (LC-PUFA) during pregnancy is important for brain growth and visual and cognitive development and is provided by materno–fetal placental transfer. We recently showed that maternal fatty acid desaturase (FADS) genotypes modulate the amounts of LC-PUFA in maternal blood. Whether FADS genotypes influence the amounts of umbilical cord fatty acids has not been investigated until now. The aim of the present study was to investigate the influence of maternal and child FADS genotypes on the amounts of LC-PUFA in umbilical cord venous plasma as an indicator of fetal fatty acid supply during pregnancy. A total of eleven cord plasma n-6 and n-3 fatty acids were analysed for association with seventeen FADS gene cluster SNP in over 2000 mothers and children from the Avon Longitudinal Study of Parents and Children. In a multivariable analysis, the maternal genotype effect was adjusted for the child genotype and vice versa to estimate which of the two has the stronger influence on cord plasma fatty acids. Both maternal and child FADS genotypes and haplotypes influenced amounts of cord plasma LC-PUFA and fatty acid ratios. Specifically, most analysed maternal SNP were associated with cord plasma levels of the precursor n-6 PUFA, whereas the child genotypes were mainly associated with more highly desaturated n-6 LC-PUFA. This first study on FADS genotypes and cord fatty acids suggests that fetal LC-PUFA status is determined to some extent by fetal fatty acid conversion. Associations of particular haplotypes suggest specific effects of SNP rs498793 and rs968567 on fatty acid metabolism. PMID:22877655
Lattka, Eva; Koletzko, Berthold; Zeilinger, Sonja; Hibbeln, Joseph R; Klopp, Norman; Ring, Susan M; Steer, Colin D
2013-04-14
Fetal supply with long-chain PUFA (LC-PUFA) during pregnancy is important for brain growth and visual and cognitive development and is provided by materno-fetal placental transfer. We recently showed that maternal fatty acid desaturase (FADS) genotypes modulate the amounts of LC-PUFA in maternal blood. Whether FADS genotypes influence the amounts of umbilical cord fatty acids has not been investigated until now. The aim of the present study was to investigate the influence of maternal and child FADS genotypes on the amounts of LC-PUFA in umbilical cord venous plasma as an indicator of fetal fatty acid supply during pregnancy. A total of eleven cord plasma n-6 and n-3 fatty acids were analysed for association with seventeen FADS gene cluster SNP in over 2000 mothers and children from the Avon Longitudinal Study of Parents and Children. In a multivariable analysis, the maternal genotype effect was adjusted for the child genotype and vice versa to estimate which of the two has the stronger influence on cord plasma fatty acids. Both maternal and child FADS genotypes and haplotypes influenced amounts of cord plasma LC-PUFA and fatty acid ratios. Specifically, most analysed maternal SNP were associated with cord plasma levels of the precursor n-6 PUFA, whereas the child genotypes were mainly associated with more highly desaturated n-6 LC-PUFA. This first study on FADS genotypes and cord fatty acids suggests that fetal LC-PUFA status is determined to some extent by fetal fatty acid conversion. Associations of particular haplotypes suggest specific effects of SNP rs498793 and rs968567 on fatty acid metabolism.
Polymorphisms of EpCAM gene and prognosis for non-small-cell lung cancer in Han Chinese
Yang, Yuefan; Fei, Fei; Song, Yang; Li, Xiaofei; Zhang, Zhipei; Fei, Zhou; Su, Haichuan; Wan, Shaogui
2014-01-01
The epithelial cell adhesion molecule (EpCAM) is overexpressed in a wide variety of human cancers and is associated with patient prognosis, including those with lung cancer. However, the association of single nucleotide polymorphisms (SNPs) in the EpCAM gene with the prognosis for non-small-cell lung cancer (NSCLC) patients has never been investigated. We evaluated the association between two SNPs, rs1126497 and rs1421, in the EpCAM gene and clinical outcomes in a Chinese cohort of 506 NSCLC patients. The SNPs were genotyped using the Sequenom iPLEX genotyping system. Multivariate Cox proportional hazards model and Kaplan–Meier curves were used to assess the association of EpCAM gene genotypes with the prognosis of NSCLC. We found that the non-synonymous SNP rs1126497 was significantly associated with survival. Compared with the CC genotype, the CT+TT genotype was a risk factor for both death (hazard ratio, 1.40; 95% confidence interval [CI], 1.02–1.94; P = 0.040) and recurrence (hazard ratio, 1.34; 95% CI, 1.02–1.77; P = 0.039). However, the SNP rs1421 did not show any significant effect on patient prognosis. Instead, the AG+GG genotype in rs1421 was significantly associated with early T stages (T1/T2) when compared with the AA genotype (odds ratio for late stage = 0.65; 95% CI, 0.44–0.96, P = 0.029). Further stratified analysis showed notable modulating effects of clinical characteristics on the associations between variant genotypes of rs1126497 and NSCLC outcomes. In conclusion, our study indicated that the non-synonymous SNP rs1126497 may be a potential prognostic marker for NSCLC patients. PMID:24304228
Lopez, G H; Morrison, J; Condon, J A; Wilson, B; Martin, J R; Liew, Y-W; Flower, R L; Hyland, C A
2015-10-01
Duffy blood group phenotypes can be predicted by genotyping for single nucleotide polymorphisms (SNPs) responsible for the Fy(a) /Fy(b) polymorphism, for weak Fy(b) antigen, and for the red cell null Fy(a-b-) phenotype. This study correlates Duffy phenotype predictions with serotyping to assess the most reliable procedure for typing. Samples, n = 155 (135 donors and 20 patients), were genotyped by high-resolution melt PCR and by microarray. Samples were in three serology groups: 1) Duffy patterns expected n = 79, 2) weak and equivocal Fy(b) patterns n = 29 and 3) Fy(a-b-) n = 47 (one with anti-Fy3 antibody). Discrepancies were observed for five samples. For two, SNP genotyping predicted weak Fy(b) expression discrepant with Fy(b-) (Group 1 and 3). For three, SNP genotyping predicted Fy(a) , discrepant with Fy(a-b-) (Group 3). DNA sequencing identified silencing mutations in these FY*A alleles. One was a novel FY*A 719delG. One, the sample with the anti-Fy3, was homozygous for a 14-bp deletion (FY*01N.02); a true null. Both the high-resolution melting analysis and SNP microarray assays were concordant and showed genotyping, as well as phenotyping, is essential to ensure 100% accuracy for Duffy blood group assignments. Sequencing is important to resolve phenotype/genotype conflicts which here identified alleles, one novel, that carry silencing mutations. The risk of alloimmunisation may be dependent on this zygosity status. © 2015 International Society of Blood Transfusion.
Ye, M H; Chen, J L; Zhao, G P; Zheng, M Q; Wen, J
2010-01-01
This study has assessed the association of single nucleotide polymorphisms (SNP) identified in the adipocyte fatty acid binding protein (A-FABP) and heart-type fatty acid binding protein (H-FABP) genes with the content of intramuscular fat (IMF) in a population of male Beijing-You chickens. A previously described SNP in the chicken A-FABP gene had a significant (P < 0.05) effect on IMF content. Chickens inheriting the homozygous BB genotype at A-FABP had a significantly higher content of IMF in thigh muscles and breast muscles than did those inheriting the AA and AB genotypes. A novel SNP, identified here, in the H-FABP gene was also significantly (P < 0.05) associated with IMF content in thigh and breast muscle. Chickens inheriting the genotypes of DD and CD had much higher content of IMF than those inheriting the homozygous genotype of CC. Markers at the A-FABP and H-FABP genes were associated with IMF content in the studied population. Chickens inheriting the BB genotype at A-FABP, along with the CD genotype at H-FABP, produced muscles with a much higher content of IMF when compared with all other genotypes. A weak interaction between A-FABP and H-FABP was detected (P < 0.09) for IMF content in the tested population. The statistical significance of interaction is tentative because of the limited number of observations for some genotypic combinations. Markers identified within the A-FABP and H-FABP genes are suitable for future use in identifying chickens with the genetic potential to produce more desirable muscle with higher IMF content, at least in the population of Beijing-You male chickens.
Development and Validation of a High-Density SNP Genotyping Array for African Oil Palm.
Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross
2016-08-01
High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 860 successful probes. Phenetic clustering analysis revealed that the array could distinguish between palms of different origins in a way consistent with pedigree records. Genome-wide linkage disequilibrium declined more slowly for the commercial populations (ranging from 120 kb at r(2) = 0.43 to 146 kb at r(2) = 0.50) when compared with the semi-wild populations (19.5 kb at r(2) = 0.22). Genetic fixation mapping comparing the semi-wild and commercial population identified 321 selective sweeps. A genome-wide association study (GWAS) detected a significant peak on chromosome 2 associated with the polygenic component of the shell thickness trait (based on the trait shell-to-fruit; S/F %) in tenera palms. Testing of a genomic selection model on the same trait resulted in good prediction accuracy (r = 0.65) with 42% of the S/F % variation explained. The first high-density SNP genotyping array for oil palm has been developed and shown to be robust for use in genetic studies and with potential for developing early trait prediction to shorten the oil palm breeding cycle. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.
Genomic prediction of the polled and horned phenotypes in Merino sheep.
Duijvesteijn, Naomi; Bolormaa, Sunduimijid; Daetwyler, Hans D; van der Werf, Julius H J
2018-05-22
In horned sheep breeds, breeding for polledness has been of interest for decades. The objective of this study was to improve prediction of the horned and polled phenotypes using horn scores classified as polled, scurs, knobs or horns. Derived phenotypes polled/non-polled (P/NP) and horned/non-horned (H/NH) were used to test four different strategies for prediction in 4001 purebred Merino sheep. These strategies include the use of single 'single nucleotide polymorphism' (SNP) genotypes, multiple-SNP haplotypes, genome-wide and chromosome-wide genomic best linear unbiased prediction and information from imputed sequence variants from the region including the RXFP2 gene. Low-density genotypes of these animals were imputed to the Illumina Ovine high-density (600k) chip and the 1.78-kb insertion polymorphism in RXFP2 was included in the imputation process to whole-genome sequence. We evaluated the mode of inheritance and validated models by a fivefold cross-validation and across- and between-family prediction. The most significant SNPs for prediction of P/NP and H/NH were OAR10_29546872.1 and OAR10_29458450, respectively, located on chromosome 10 close to the 1.78-kb insertion at 29.5 Mb. The mode of inheritance included an additive effect and a sex-dependent effect for dominance for P/NP and a sex-dependent additive and dominance effect for H/NH. Models with the highest prediction accuracies for H/NH used either single SNPs or 3-SNP haplotypes and included a polygenic effect estimated based on traditional pedigree relationships. Prediction accuracies for H/NH were 0.323 for females and 0.725 for males. For predicting P/NP, the best models were the same as for H/NH but included a genomic relationship matrix with accuracies of 0.713 for females and 0.620 for males. Our results show that prediction accuracy is high using a single SNP, but does not reach 1 since the causative mutation is not genotyped. Incomplete penetrance or allelic heterogeneity, which can influence expression of the phenotype, may explain why prediction accuracy did not approach 1 with any of the genetic models tested here. Nevertheless, a breeding program to eradicate horns from Merino sheep can be effective by selecting genotypes GG of SNP OAR10_29458450 or TT of SNP OAR10_29546872.1 since all sheep with these genotypes will be non-horned.
Bagheri, Masoumeh; Moradi-Sharhrbabak, M; Miraie-Ashtiani, R; Safdari-Shahroudi, M; Abdollahi-Arpanahi, R
2016-02-01
Mastitis is a major source of economic loss in dairy herds. The objective of this research was to evaluate the association between genotypes within SLC11A1 and CXCR1 candidate genes and clinical mastitis in Holstein dairy cattle using the selective genotyping method. The data set contained clinical mastitis records of 3,823 Holstein cows from two Holstein dairy herds located in two different regions in Iran. Data included the number of cases of clinical mastitis per lactation. Selective genotyping was based on extreme values for clinical mastitis residuals (CMR) from mixed model analyses. Two extreme groups consisting of 135 cows were formed (as cases and controls), and genotyped for the two candidate genes, namely, SLC11A1 and CXCR1, using polymerase chain reaction-single strand conformation polymorphism (PCR-SSCP) and polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP), respectively. Associations between single nucleotide polymorphism (SNP) genotypes with CMR and breeding values for milk and protein yield were carried out by applying logistic regression analyses, i.e. estimating the probability of the heterogeneous genotype in the dependency of values for CMR and breeding values (BVs). The sequencing results revealed a novel mutation in 1139 bp of exon 11 of the SLC11A1 gene and this SNP had a significant association with CMR (P < 0.05). PCR-RFLP analysis leads to three banding patterns for CXCR1c.735C>G and these genotypes had significant relationships with CMR. Overall, the results showed that SLC11A1 and CXCR1 are valuable candidate genes for the improvement of mastitis resistance as well as production traits in dairy cattle populations.
Genetic association of polymorphism rs1333049 with gout.
Wang, Binbin; Meng, Dongmei; Wang, Jing; Liu, Shiguo; Zhou, Sirui; Miao, Zhimin; Han, Lin; Chu, Nan; Zhang, Kun; Ma, Xu; Li, Changgui
2011-09-01
We suspect that genes or loci that contribute to coronary artery disease (CAD) may also play a role in the pathogenesis of gout, since hyperuricaemia leads to gout, and serum uric acid (SUA) levels are potential risk factors for CAD. The single nucleotide polymorphism (SNP) rs1333049 (C/G) on chromosome 9p21 has been implicated in previous studies to be associated with CAD. The aim of this study was to evaluate the relationship between this SNP and gout pathogenesis. Nine hundred Chinese Han were recruited for this study (461 gout patients and 439 gout-free individuals). The rs1333049 SNP and surrounding sequences were PCR sequenced. There was a clear link between the rs1333049 genotypic and allelic frequencies between gout cases and controls (χ(2) = 6.81, df = 2, P = 0.033 by genotype; χ(2) = 6.63, df = 1, P = 0.01 by allele). There was a significantly increased risk of gout in carriers of the CC genotype (odds ratio = 1.43, 95% CI 1.07, 1.91). To the best of our knowledge, our findings are the first to establish an association of rs1333049 with gout in a Chinese Han population. Meanwhile, this SNP is homologous to miR-519 and miR-520.
Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat
Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua
2015-01-01
With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat ( Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271
Fondevila, M; Phillips, C; Santos, C; Freire Aradas, A; Vallone, P M; Butler, J M; Lareu, M V; Carracedo, A
2013-01-01
A revision of an established 34 SNP forensic ancestry test has been made by swapping the under-performing rs727811 component SNP with the highly informative rs3827760 that shows a near-fixed East Asian specific allele. We collated SNP variability data for the revised SNP set in 66 reference populations from 1000 Genomes and HGDP-CEPH panels and used this as reference data to analyse four U.S. populations showing a range of admixture patterns. The U.S. Hispanics sample in particular displayed heterogeneous values of co-ancestry between European, Native American and African contributors, likely to reflect in part, the way this disparate group is defined using cultural as well as population genetic parameters. The genotyping of over 700 U.S. population samples also provided the opportunity to thoroughly gauge peak mobility variation and peak height ratios observed from routine use of the single base extension chemistry of the 34-plex test. Finally, the genotyping of the widely used DNA profiling Standard Reference Material samples plus other control DNAs completes the audit of the 34-plex assay to allow forensic practitioners to apply this test more readily in their own laboratories. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Tillmann, Hans L; Thompson, Alex J; Patel, Keyur; Wiese, Manfred; Tenckhoff, Hannelore; Nischalke, Hans D; Lokhnygina, Yuliya; Kullig, Ulrike; Göbel, Uwe; Capka, Emanuela; Wiegand, Johannes; Schiefke, Ingolf; Güthoff, Wolfgang; Grüngreiff, Kurt; König, Ingrid; Spengler, Ulrich; McCarthy, Jeanette; Shianna, Kevin V; Goldstein, David B; McHutchison, John G; Timm, Jörg; Nattermann, Jacob
2010-11-01
A single nucleotide polymorphism (SNP) upstream of the IL28B gene has been associated with response of patients with chronic hepatitis C to therapy with pegylated interferon and ribavirin and also with spontaneous clearance of acute hepatitis C in a heterogeneous population. We analyzed the association between IL28B and the clinical presentation of acute hepatitis C virus (HCV) infection in a homogeneous population. We analyzed the SNP rs12979860 in 190 women from the German anti-D cohort (infected with HCV genotype 1b via contaminated rhesus prophylaxis) and its association with spontaneous clearance. Clinical data were available in 136 women with acute infection who were also evaluated for IL28B genotype. Based on results of a TaqMan polymerase chain reaction assay, the rs12979860 SNP genotypes studied were C/C, C/T, or T/T. Spontaneous clearance was more common in patients with the C/C genotype (43/67; 64%) compared with C/T (22/90; 24%) or T/T (2/33; 6%) (P < .001). Jaundice during acute infection was more common among patients with C/C genotype (32.7%) than non-C/C patients (with C/T or T/T) (16.1%; P = .032). In C/C patients, jaundice during acute infection was not associated with an increased chance of spontaneous clearance (56.3%) compared with those without jaundice (60.6%). In contrast, in non-C/C patients, jaundice was associated with a higher likelihood of spontaneous clearance (42.9%) compared with those without jaundice (13.7%). The SNP rs12979860 upstream of IL28B is associated with spontaneous clearance of HCV. Women with the C/T or T/T genotype who did not develop jaundice had a lower chance of spontaneous clearance of HCV infection. Copyright © 2010 AGA Institute. Published by Elsevier Inc. All rights reserved.
Morales, Eugenia; Azocar, Lorena; Maul, Ximena; Perez, Claudio; Chianale, José
2011-01-01
Background The lactase persistent (LP) or lactase non-persistent (LNP) state in European adults is genetically determined by a single nucleotide polymorphism (SNP) located 13.9 kb upstream of the lactase (LCT) gene, known as LCT C>T−13910 (rs4988235). The LNP condition leads to an inability to digest the milk sugar lactose leading to gastrointestinal symptoms and can affect nutrient and calcium intake in certain populations. Objectives The authors studied a group of 51 Chilean patients to assess whether this SNP influences the LP/LNP state in this population, and determined the prevalence of LCT C>T−13910 genotypes in a representative sample of 216 Hispanics and 43 Amerindians with correlation to digestive symptoms. Design Case–control study done in Chilean patients with clinical suspicion of LNP that were assessed using clinical survey, hydrogen breath test (HBT) and SNP genotyping. The population sample of Hispanics and Amerindians was assessed by clinical survey and SNP genotyping. Results Of the 51 patients with clinical suspicion of LNP, 29 were HBT-positive. The CC genotype (LNP) was present in 89.7% of the patients with positive HBT and in only 4.7% of those with negative HBT. The prevalence of the CC genotype was 56.9% in the Hispanic population and 88.3% in Amerindians, and was associated with a higher self-reported clinical intolerance to ingestion of dairy products. Conclusion The LP/LNP state is determined by the LCT C>T−13910 variant in Chileans. This variant predicts digestive symptoms associated with the ingestion of lactose and is a good tool for the diagnosis of primary adult hypolactasia. The LCT T−13910 allele is rare in the Amerindian population and is suggestive of European ancestry in this contemporary population. PMID:22021768
Wang, Sihua; Ding, Mingcui; Duan, Xiaoran; Wang, Tuanwei; Feng, Xiaolei; Wang, Pengpeng; Yao, Wu; Wu, Yongjun; Yan, Zhen; Feng, Feifei; Yu, Songcheng; Wang, Wei
2017-09-01
It has been shown that the single nucleotide polymorphism (SNP) of the rs2735940 site in the human telomerase reverse transcriptase ( hTERT ) gene is associated with increased cancer risk. The traditional method to detect SNP genotypes is polymerase chain reaction-restriction fragment length polymorphism (PCR-RFLP). However, there is a limitation to utilizing PCR-RFLP due to a lack of proper restriction enzyme sites at many polymorphic loci. This study used an improved PCR-RFLP method with a mismatched base for detection of the SNP rs2735940. A new restriction enzyme cutting site was created by created restriction site PCR (CRS-PCR), and in addition, the restriction enzyme Msp I for CRS-PCR was cheaper than other enzymes. We used this novel assay to determine the allele frequencies in 552 healthy Chinese Han individuals, and found the allele frequencies to be 63% for allele C and 37% for allele T In summary, the modified PCR-RFLP can be used to detect the SNP of rs2735940 with low cost and high efficiency. © 2017 by the Association of Clinical Scientists, Inc.
Covariance Between Genotypic Effects and its Use for Genomic Inference in Half-Sib Families
Wittenburg, Dörte; Teuscher, Friedrich; Klosa, Jan; Reinsch, Norbert
2016-01-01
In livestock, current statistical approaches utilize extensive molecular data, e.g., single nucleotide polymorphisms (SNPs), to improve the genetic evaluation of individuals. The number of model parameters increases with the number of SNPs, so the multicollinearity between covariates can affect the results obtained using whole genome regression methods. In this study, dependencies between SNPs due to linkage and linkage disequilibrium among the chromosome segments were explicitly considered in methods used to estimate the effects of SNPs. The population structure affects the extent of such dependencies, so the covariance among SNP genotypes was derived for half-sib families, which are typical in livestock populations. Conditional on the SNP haplotypes of the common parent (sire), the theoretical covariance was determined using the haplotype frequencies of the population from which the individual parent (dam) was derived. The resulting covariance matrix was included in a statistical model for a trait of interest, and this covariance matrix was then used to specify prior assumptions for SNP effects in a Bayesian framework. The approach was applied to one family in simulated scenarios (few and many quantitative trait loci) and using semireal data obtained from dairy cattle to identify genome segments that affect performance traits, as well as to investigate the impact on predictive ability. Compared with a method that does not explicitly consider any of the relationship among predictor variables, the accuracy of genetic value prediction was improved by 10–22%. The results show that the inclusion of dependence is particularly important for genomic inference based on small sample sizes. PMID:27402363
USDA-ARS?s Scientific Manuscript database
Hop is a perennial crop with clonal propagation system for varietal distribution. Brewers and growers are highly concerned about variety purity and regularly seek genotype testing. Current means for genotyping are based upon SSRs OR AFLPs that are relatively accurate but cannot differentiate close...
Pooled Genome-Wide Analysis to Identify Novel Risk Loci for Pediatric Allergic Asthma
Ricci, Giampaolo; Astolfi, Annalisa; Remondini, Daniel; Cipriani, Francesca; Formica, Serena; Dondi, Arianna; Pession, Andrea
2011-01-01
Background Genome-wide association studies of pooled DNA samples were shown to be a valuable tool to identify candidate SNPs associated to a phenotype. No such study was up to now applied to childhood allergic asthma, even if the very high complexity of asthma genetics is an appropriate field to explore the potential of pooled GWAS approach. Methodology/Principal Findings We performed a pooled GWAS and individual genotyping in 269 children with allergic respiratory diseases comparing allergic children with and without asthma. We used a modular approach to identify the most significant loci associated with asthma by combining silhouette statistics and physical distance method with cluster-adapted thresholding. We found 97% concordance between pooled GWAS and individual genotyping, with 36 out of 37 top-scoring SNPs significant at individual genotyping level. The most significant SNP is located inside the coding sequence of C5, an already identified asthma susceptibility gene, while the other loci regulate functions that are relevant to bronchial physiopathology, as immune- or inflammation-mediated mechanisms and airway smooth muscle contraction. Integration with gene expression data showed that almost half of the putative susceptibility genes are differentially expressed in experimental asthma mouse models. Conclusion/Significance Combined silhouette statistics and cluster-adapted physical distance threshold analysis of pooled GWAS data is an efficient method to identify candidate SNP associated to asthma development in an allergic pediatric population. PMID:21359210
2012-01-01
Background Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in the coding regions of genes involved in different physiological processes. The platform will also be useful for future mapping and diversity studies, and will be essential in order to accelerate the process of breeding new and better-adapted squash varieties. PMID:22356647
Molecular Diagnostics in Transfusion Medicine: In Capillary, on a Chip, in Silico, or in Flight?
Garritsen, Henk S.P.; Xiu-Cheng Fan, Alex; Lenz, Daniela; Hannig, Horst; Yan Zhong, Xiao; Geffers, Robert; Lindenmaier, Werner; Dittmar, Kurt E.J.; Wörmann, Bernhard
2009-01-01
Summary Serology, defined as antibody-based diagnostics, has been regarded as the diagnostic gold standard in transfusion medicine. Nowadays however the impact of molecular diagnostics in transfusion medicine is rapidly growing. Molecular diagnostics can improve tissue typing (HLA typing), increase safety of blood products (NAT testing of infectious diseases), and enable blood group typing in difficult situations (after transfusion of blood products or prenatal non-invasive RhD typing). Most of the molecular testing involves the determination of the presence of single nucleotide polymorphisms (SNPs). Antigens (e.g. blood group antigens) mostly result from single nucleotide differences in critical positions. However, most blood group systems cannot be determined by looking at a single SNP. To identify members of a blood group system a number of critical SNPs have to be taken into account. The platforms which are currently used to perform molecular diagnostics are mostly gel-based, requiring time-consuming multiple manual steps. To implement molecular methods in transfusion medicine in the future the development of higher-throughput SNP genotyping non-gel-based platforms which allow a rapid, cost-effective screening are essential. Because of its potential for automation, high throughput and cost effectiveness the special focus of this paper is a relative new technique: SNP genotyping by MALDI-TOF MS analysis. PMID:21113259
Naj, Adam C.; West, Michael; Rich, Stephen S.; Post, Wendy; Kao, W.H. Linda; Wasserman, Bruce A.; Herrington, David M.; Rodriguez, Annabelle
2012-01-01
Background Little is known regarding the association of scavenger receptor class B type I (SCARB1) single nucleotide polymorphisms (SNPs) and subclinical atherosclerosis (SCA), particularly in subjects of different racial/ethnic backgrounds. We examined this relationship in the Multi-Ethnic Study of Atherosclerosis (MESA). Methods and Results Forty-three SCARB1 tagging SNPs were genotyped. Baseline examinations included fasting lipids and SCA phenotypes (coronary artery calcium [CAC], and common and internal carotid artery thickness [CCIMT and ICIMT]). Examining SNP associations with different SCA phenotypes across multiple racial/ethnic groups with adjustment for multiple covariates, we found the C allele of SNP rs10846744 was associated with higher CCIMT in African American (P=0.03), Chinese (P=0.02), European American (P=0.05), and Hispanic participants (P=0.03), and was strongly associated in pooled analyses (P=0.0002). The results also showed that the association of this SNP with CCIMT was independent of lipids and other well-established cardiovascular risk factors. Stratifying by sex, there appeared to be a strong association of rs10846744 with CCIMT in females, but no genotype-sex interactions were observed. Conclusions Variation in SCARB1 at rs10846744 was significantly associated with CCIMT across racial/ethnic groups in MESA. PMID:20160195
Rapid multiplexed genotyping for hereditary thrombophilia by SELDI-TOF mass spectrometry.
Yang, Shangbin; Xu, Lihui; Wu, Haifeng M
2010-03-01
Approximately 50% of patients with venous thromboembolism also present with hereditary predisposition. The most common genetic factors are single nucleotide polymorphisms (SNPs) of factor V Leiden, prothrombin G20210A, and methylenetetrahydrofolate reductase C677T. Genotyping these SNPs helps clinicians to correctly diagnose the disease and properly manage patients. In this study, we report a novel method using surface-enhanced laser desorption and ionization time of flight mass spectrometry to rapidly genotype, in a multiplex fashion, 3 SNPs that predispose patients to thrombosis. First, patient DNA samples were subjected to polymerase chain reaction to amplify and extend the DNA products with masses corresponding to specific genotypes. Polymerase chain reaction products were then applied to Q10 anionic protein chips, undergoing on-chip sample enrichment and clean-up. Finally, the genotypes of the SNPs were determined by surface-enhanced laser desorption and ionization time of flight mass spectrometry. This method offers a rapid turnaround time of less than 5 hours from sample collection to result reporting. The analytical accuracy of each SNP genotyping result has been confirmed by DNA sequencing. In addition, the genotype results produced by this method were validated by comparing them with results obtained by the approved method in the clinical reference laboratory. This novel method is fast, accurate, and reproducible, and thus provides an excellent platform to promote personalized medicine in the management of clotting disorders.
Genomic Heritability of Beef Cattle Growth
USDA-ARS?s Scientific Manuscript database
Calf weights were examined to determine association between high-density SNP genotypes and growth, in order to estimate additive genetic variation explained by SNP. Data taken from Cycle VII of the U.S. Meat Animal Research Center Germplasm Evaluation Project included birth weight (BWT), 205-d adju...
Is there any relation between IL-6 gene -174 G>C polymorphism and postmenopausal osteoporosis?
Deveci, Derya; Ozkan, Zehra Sema; Yuce, Huseyin
2012-09-01
IL-6 gene single nucleotide polymorphisms (SNPs) have been reported to have a protective effect against bone resorption. We aimed to investigate the association between bone mineral density and IL-6 promoter region -174 G>C SNP. This study included 356 postmenopausal Turkish women, of whom 201 were osteoporotic (lumbar spine T score<-2.5 SD) and 155 non-osteoporotic (lumbar spine T score>-1.5 SD). Bone mineral density (BMD) measures were obtained using dual-energy X-ray absorptiometry. SNP of the IL-6 gene (-174 G>C) was examined by polymerase chain reaction-restriction fragment length polymorphism. The frequencies of the variant C allele (24% vs. 30%, p=0.074) and mutant CC genotype (12% vs. 20%, p=0.094) were higher in non-osteoporotic women. Lumbar spine and total hip BMD values were lowest among women with the G/G genotype, intermediate in the heterozygotes, and highest in women with the C/C genotype. The GG (p=0.022) and GC (p=0.037) genotypes were covariates which approached statistical significance in the regression model fitting of BMD. IL-6 promoter region SNP showed an association with BMD in this postmenopausal Turkish population and these data suggest that the wild GG genotype influences the phenotype. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Tanaka, Keiko; Arakawa, Masashi
2014-01-01
Epidemiological evidence on the relationship between single-nucleotide polymorphisms (SNPs) rs7216389 and rs11650680 on chromosome 17q12-21 and asthma is inconsistent. We examined this issue in young adult Japanese women. Case subjects were 202 women who had been diagnosed with asthma by a doctor, while 1290 women without doctor-diagnosed asthma served as control subjects. Adjustments were made for age and the presence of older siblings. There were no significant associations between SNP rs7216389 and asthma. Compared with the CC genotype of SNP rs11650680, the CT genotype, but not the TT genotype, was significantly inversely associated with asthma: the adjusted odds ratio for the CT genotype was 0.67 (95% confidence interval: 0.46–0.96). This inverse relationship was significant in women with late-onset asthma, but not in those with early-onset asthma. Under the dominant model, a significant inverse association was found between rs11650680 and asthma in women without older siblings, but not in those with older siblings; the interaction, however, was not significant. This is the first study to show that the CT genotype of SNP rs11650680 was significantly inversely associated with asthma, especially adult-onset asthma. We could not find evidence for interactions between rs11650680 and older siblings affecting asthma. PMID:24735179
NASA Astrophysics Data System (ADS)
He, Feng; Wen, Haishen; Yu, Dahui; Li, Jifang; Shi, Bao; Chen, Caifang; Zhang, Jiaren; Jin, Guoxiong; Chen, Xiaoyan; Shi, Dan; Yang, Yanping
2010-12-01
Follicle stimulating hormone β (FSHβ) of Japanese flounder ( Paralichthys olivaceus) plays a key role in the regulation of gonadal development. This study aimed to investigate molecular genetic characteristics of the FSHβ gene and elucidate the effects of single nucleotide polymorphisms (SNPs) of FSHβ on reproductive traits in Japanese flounder. We used polymerase chain reaction single-strand conformation polymorphism (PCR-SSCP) and sequencing of the FSHβ gene in 60 individuals. We identified only an SNP (T/C) in the coding region of exon3 of FSHβ. The SNP (T/C) did not lead to amino acid changes at the position 340 bp of FSHβ gene. Statistical analysis showed that the SNP was significantly associated with testosterone (T) level and gonadosomatic index (GSI) ( P < 0.05). Individuals with genotype TC of the SNP had significantly higher serum T levels and GSI ( P < 0.05) than that of genotype CC. Therefore, FSHβ gene could be a useful molecular marker in selection for prominent reproductive trait in Japanese Flounder.
SNP_tools: A compact tool package for analysis and conversion of genotype data for MS-Excel
Chen, Bowang; Wilkening, Stefan; Drechsel, Marion; Hemminki, Kari
2009-01-01
Background Single nucleotide polymorphism (SNP) genotyping is a major activity in biomedical research. Scientists prefer to have a facile access to the results which may require conversions between data formats. First hand SNP data is often entered in or saved in the MS-Excel format, but this software lacks genetic and epidemiological related functions. A general tool to do basic genetic and epidemiological analysis and data conversion for MS-Excel is needed. Findings The SNP_tools package is prepared as an add-in for MS-Excel. The code is written in Visual Basic for Application, embedded in the Microsoft Office package. This add-in is an easy to use tool for users with basic computer knowledge (and requirements for basic statistical analysis). Conclusion Our implementation for Microsoft Excel 2000-2007 in Microsoft Windows 2000, XP, Vista and Windows 7 beta can handle files in different formats and converts them into other formats. It is a free software. PMID:19852806
SNP_tools: A compact tool package for analysis and conversion of genotype data for MS-Excel.
Chen, Bowang; Wilkening, Stefan; Drechsel, Marion; Hemminki, Kari
2009-10-23
Single nucleotide polymorphism (SNP) genotyping is a major activity in biomedical research. Scientists prefer to have a facile access to the results which may require conversions between data formats. First hand SNP data is often entered in or saved in the MS-Excel format, but this software lacks genetic and epidemiological related functions. A general tool to do basic genetic and epidemiological analysis and data conversion for MS-Excel is needed. The SNP_tools package is prepared as an add-in for MS-Excel. The code is written in Visual Basic for Application, embedded in the Microsoft Office package. This add-in is an easy to use tool for users with basic computer knowledge (and requirements for basic statistical analysis). Our implementation for Microsoft Excel 2000-2007 in Microsoft Windows 2000, XP, Vista and Windows 7 beta can handle files in different formats and converts them into other formats. It is a free software.
Heidary, Masoumeh; Rakhshi, Nahid; Pahlevan Kakhki, Majid; Behmanesh, Mehrdad; Sanati, Mohammad Hossein; Sanadgol, Nima; Kamaladini, Hossein; Nikravesh, Abbas
2014-08-15
IL-1B is released by monocytes, astrocytes and brain endothelial cells and seems to be involved in inflammatory reactions of the central nervous system (CNS) in multiple sclerosis (MS). This study aims to evaluate the expression level of IL-1B mRNA in peripheral blood mononuclear cells (PBMCs), genotype the rs16944 SNP and find out the role of this SNP on the expression level of IL-1B in MS patients. We found that the expression level of IL-1B in MS patients increased 3.336 times more than controls in PBMCs but the rs16944 SNP in the promoter region of IL-1B did not affect the expression level of this gene and there was not association of this SNP with MS in the examined population. Also, our data did not reveal any correlation between normalized expressions of IL-1B gene with age of participants, age of onset, and disease duration. Copyright © 2014 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Liu, Hongna; Li, Song; Wang, Zhifei; Li, Zhiyang; Deng, Yan; Wang, Hua; Shi, Zhiyang; He, Nongyue
2008-11-01
Single nucleotide polymorphisms (SNPs) comprise the most abundant source of genetic variation in the human genome wide codominant SNPs identification. Therefore, large-scale codominant SNPs identification, especially for those associated with complex diseases, has induced the need for completely high-throughput and automated SNP genotyping method. Herein, we present an automated detection system of SNPs based on two kinds of functional magnetic nanoparticles (MNPs) and dual-color hybridization. The amido-modified MNPs (NH 2-MNPs) modified with APTES were used for DNA extraction from whole blood directly by electrostatic reaction, and followed by PCR, was successfully performed. Furthermore, biotinylated PCR products were captured on the streptavidin-coated MNPs (SA-MNPs) and interrogated by hybridization with a pair of dual-color probes to determine SNP, then the genotype of each sample can be simultaneously identified by scanning the microarray printed with the denatured fluorescent probes. This system provided a rapid, sensitive and highly versatile automated procedure that will greatly facilitate the analysis of different known SNPs in human genome.
Zhou, Hongbin; Wu, Yinfang; Jin, Yan; Zhou, Jiesen; Zhang, Chao; Che, Luanqing; Jing, Jiyong; Chen, Zhihua; Li, Wen; Shen, Huahao
2013-10-02
Matrix metalloproteinase (MMP) family is considered to be associated with chronic obstructive pulmonary disease (COPD) pathogenesis, however, no consistent results have been provided by previous studies. In this report, we performed Meta analysis to investigate the association between four kinds of MMP single nucleotide polymorphisms (SNP, MMP1 -1607 1G/2G, MMP3 -1171 5A/6A, MMP9 -1562 C/T, MMP12 -82 A/G) and COPD risk from 21 studies including 4184 cases and 5716 controls. Both overall and subgroup association between SNP and COPD susceptibility were tested. There was no evident association between MMP polymorphisms and COPD susceptibility in general population. On the other hand, subgroup analysis suggested that MMP9 -1562 C/T polymorphism was related to COPD, as we found that C allele carriers were at lower risk in some subgroups stratified by lung function, age and genotype identification method, compared with TT homozygotes. Our results indicated the genotype TT might be one genetic risk factor of severe COPD.
Streit, M; Reinhardt, F; Thaller, G; Bennewitz, J
2013-01-01
Genotype by environment interaction (G × E) has been widely reported in dairy cattle. If the environment can be measured on a continuous scale, reaction norms can be applied to study G × E. The average herd milk production level has frequently been used as an environmental descriptor because it is influenced by the level of feeding or the feeding regimen. Another important environmental factor is the level of udder health and hygiene, for which the average herd somatic cell count might be a descriptor. In the present study, we conducted a genome-wide association analysis to identify single nucleotide polymorphisms (SNP) that affect intercept and slope of milk protein yield reaction norms when using the average herd test-day solution for somatic cell score as an environmental descriptor. Sire estimates for intercept and slope of the reaction norms were calculated from around 12 million daughter records, using linear reaction norm models. Sires were genotyped for ~54,000 SNP. The sire estimates were used as observations in the association analysis, using 1,797 sires. Significant SNP were confirmed in an independent validation set consisting of 500 sires. A known major gene affecting protein yield was included as a covariable in the statistical model. Sixty (21) SNP were confirmed for intercept with P ≤ 0.01 (P ≤ 0.001) in the validation set, and 28 and 11 SNP, respectively, were confirmed for slope. Most but not all SNP affecting slope also affected intercept. Comparison with an earlier study revealed that SNP affecting slope were, in general, also significant for slope when the environment was modeled by the average herd milk production level, although the two environmental descriptors were poorly correlated. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns.
De La Vega, Francisco M; Isaac, Hadar I; Scafe, Charles R
2006-01-01
The design of genetic association studies using single-nucleotide polymorphisms (SNPs) requires the selection of subsets of the variants providing high statistical power at a reasonable cost. SNPs must be selected to maximize the probability that a causative mutation is in linkage disequilibrium (LD) with at least one marker genotyped in the study. The HapMap project performed a genome-wide survey of genetic variation with about a million SNPs typed in four populations, providing a rich resource to inform the design of association studies. A number of strategies have been proposed for the selection of SNPs based on observed LD, including construction of metric LD maps and the selection of haplotype tagging SNPs. Power calculations are important at the study design stage to ensure successful results. Integrating these methods and annotations can be challenging: the algorithms required to implement these methods are complex to deploy, and all the necessary data and annotations are deposited in disparate databases. Here, we present the SNPbrowser Software, a freely available tool to assist in the LD-based selection of markers for association studies. This stand-alone application provides fast query capabilities and swift visualization of SNPs, gene annotations, power, haplotype blocks, and LD map coordinates. Wizards implement several common SNP selection workflows including the selection of optimal subsets of SNPs (e.g. tagging SNPs). Selected SNPs are screened for their conversion potential to either TaqMan SNP Genotyping Assays or the SNPlex Genotyping System, two commercially available genotyping platforms, expediting the set-up of genetic studies with an increased probability of success.
COUSTET, BAPTISTE; AGARWAL, SANDEEP K.; GOURH, PRAVITT; GUEDJ, MICKAEL; MAYES, MAUREEN D.; DIEUDE, PHILIPPE; WIPFF, JULIEN; AVOUAC, JEROME; HACHULLA, ERIC; DIOT, ELISABETH; CRACOWSKI, JEAN LUC; TIEV, KIET; SIBILIA, JEAN; MOUTHON, LUC; FRANCES, CAMILLE; AMOURA, ZAHIR; CARPENTIER, PATRICK; MEYER, OLIVIER; KAHAN, ANDRE; BOILEAU, CATHERINE; ARNETT, FRANK C.; ALLANORE, YANNICK
2012-01-01
Objective Accumulating evidence shows that shared autoimmunity is critical for the pathogenesis of many autoimmune diseases. Systemic sclerosis (SSc) belongs to the connective tissue disorders, and recent data have highlighted strong associations with autoimmunity genes shared with other autoimmune diseases. To determine whether novel risk loci associated with systemic lupus erythematosus or multiple sclerosis may confer susceptibility to SSc, we tested single-nucleotide polymorphisms (SNP) from ITGAM, ITGAX, and CD58 for associations. Methods SNP harboring associations with autoimmune diseases, ITGAM rs9937837, ITGAX rs11574637, and CD58 rs12044852, were genotyped in 2 independent cohorts of European Caucasian ancestry: 1031 SSc patients and 1014 controls from France and 1038 SSc patients and 691 controls from the USA, providing a combined study population of 3774 individuals. ITGAM rs1143679 was additionally genotyped in the French cohort. Results The 4 polymorphisms were in Hardy-Weinberg equilibrium in the 2 control populations, and allelic frequencies were similar to those expected in European Caucasian populations. Allelic and genotypic frequencies for these 3 SNP were found to be statistically similar in SSc patients and controls. Subphenotype analyses for subgroups having diffuse cutaneous subtype disease, specific autoantibodies, or fibrosing alveolitis did not reveal any difference between SSc patients and controls. Conclusion These results obtained through 2 large cohorts of SSc patients of European Caucasian ancestry do not support the implication of ITGAM, ITGAX, and CD58 genes in the genetic susceptibility of SSc, although they were recently identified as autoimmune disease risk genes. PMID:21362770
Mao, Yongjiang; Zhu, Xiaorui; Xing, Shiyu; Zhang, Meirong; Zhang, Huimin; Wang, Xiaolong; Karrow, Niel; Yang, Liguo; Yang, Zhangping
2015-12-01
Lactoferrin is an iron-binding protein found in cow's milk that plays an important role in preventing mastitis caused by intramammary infection. In this study, 20 Chinese Holstein cows were selected randomly for PCR amplification and sequencing of the bovine lactoferrin gene promoter region and used for SNP discovery in the region between nucleotide positions -461 to -132. Three SNPs (-270T>C, -190G>A and -156A>G) were identified in bovine lactoferrin, then Chinese Holstein cows (n=866) were genotyped using Sequenom MassARRAY (Sequenom Inc., San Diego, CA) based on the previous SNP information in this study, and the associations between SNPs or haplotype and milk somatic cell score (SCS) and production traits were analyzed by the least squares method in the GLM procedure of SAS. SNPs -270T>C and -156A>G showed close linkage disequilibrium (r(2)=0.76). The SNP -190G>A showed a significant association with SCS, and individuals with genotype GG had higher SCS than genotypes AG and AA. Associations were found between the SNPs -270T>C and -190G>A with SCS and the milk composition. The software MatInspector revealed that these SNPs were located within several potential transcription factor binding sites, including NF-κB p50, KLF7 and SP1, and may alter gene expression, but further investigation will be required to elucidate the biological and practical relevance of these SNPs. Copyright © 2015 Elsevier Ltd. All rights reserved.
Genetic Variation at 9p22.2 and Ovarian Cancer Risk for BRCA1 and BRCA2 Mutation Carriers
Kartsonaki, Christiana; Gayther, Simon A.; Pharoah, Paul D. P.; Sinilnikova, Olga M.; Beesley, Jonathan; Chen, Xiaoqing; McGuffog, Lesley; Healey, Sue; Couch, Fergus J.; Wang, Xianshu; Fredericksen, Zachary; Peterlongo, Paolo; Manoukian, Siranoush; Peissel, Bernard; Zaffaroni, Daniela; Roversi, Gaia; Barile, Monica; Viel, Alessandra; Allavena, Anna; Ottini, Laura; Papi, Laura; Gismondi, Viviana; Capra, Fabio; Radice, Paolo; Greene, Mark H.; Mai, Phuong L.; Andrulis, Irene L.; Glendon, Gord; Ozcelik, Hilmi; Thomassen, Mads; Gerdes, Anne-Marie; Kruse, Torben A.; Cruger, Dorthe; Jensen, Uffe Birk; Caligo, Maria Adelaide; Olsson, Håkan; Kristoffersson, Ulf; Lindblom, Annika; Arver, Brita; Karlsson, Per; Stenmark Askmalm, Marie; Borg, Ake; Neuhausen, Susan L.; Ding, Yuan Chun; Nathanson, Katherine L.; Domchek, Susan M.; Jakubowska, Anna; Lubiński, Jan; Huzarski, Tomasz; Byrski, Tomasz; Gronwald, Jacek; Górski, Bohdan; Cybulski, Cezary; Dębniak, Tadeusz; Osorio, Ana; Durán, Mercedes; Tejada, Maria-Isabel; Benítez, Javier; Hamann, Ute; Rookus, Matti A.; Verhoef, Senno; Tilanus-Linthorst, Madeleine A.; Vreeswijk, Maaike P.; Bodmer, Danielle; Ausems, Margreet G. E. M.; van Os, Theo A.; Asperen, Christi J.; Blok, Marinus J.; Meijers-Heijboer, Hanne E. J.; Peock, Susan; Cook, Margaret; Oliver, Clare; Frost, Debra; Dunning, Alison M.; Evans, D. Gareth; Eeles, Ros; Pichert, Gabriella; Cole, Trevor; Hodgson, Shirley; Brewer, Carole; Morrison, Patrick J.; Porteous, Mary; Kennedy, M. John; Rogers, Mark T.; Side, Lucy E.; Donaldson, Alan; Gregory, Helen; Godwin, Andrew; Stoppa-Lyonnet, Dominique; Moncoutier, Virginie; Castera, Laurent; Mazoyer, Sylvie; Barjhoux, Laure; Bonadona, Valérie; Leroux, Dominique; Faivre, Laurence; Lidereau, Rosette; Nogues, Catherine; Bignon, Yves-Jean; Prieur, Fabienne; Collonge-Rame, Marie-Agnès; Venat-Bouvet, Laurence; Fert-Ferrer, Sandra; Miron, Alex; Buys, Saundra S.; Hopper, John L.; Daly, Mary B.; John, Esther M.; Terry, Mary Beth; Goldgar, David; Hansen, Thomas v. O.; Jønson, Lars; Ejlertsen, Bent; Agnarsson, Bjarni A.; Offit, Kenneth; Kirchhoff, Tomas; Vijai, Joseph; Dutra-Clarke, Ana V. C.; Przybylo, Jennifer A.; Montagna, Marco; Casella, Cinzia; Imyanitov, Evgeny N.; Janavicius, Ramunas; Blanco, Ignacio; Lázaro, Conxi; Moysich, Kirsten B.; Karlan, Beth Y.; Gross, Jenny; Beattie, Mary S.; Schmutzler, Rita; Wappenschmidt, Barbara; Meindl, Alfons; Ruehl, Ina; Fiebig, Britta; Sutter, Christian; Arnold, Norbert; Deissler, Helmut; Varon-Mateeva, Raymonda; Kast, Karin; Niederacher, Dieter; Gadzicki, Dorothea; Caldes, Trinidad; de la Hoya, Miguel; Nevanlinna, Heli; Aittomäki, Kristiina; Simard, Jacques; Soucy, Penny; Spurdle, Amanda B.; Holland, Helene; Chenevix-Trench, Georgia; Easton, Douglas F.; Antoniou, Antonis C.
2011-01-01
Background Germline mutations in the BRCA1 and BRCA2 genes are associated with increased risks of breast and ovarian cancers. Although several common variants have been associated with breast cancer susceptibility in mutation carriers, none have been associated with ovarian cancer susceptibility. A genome-wide association study recently identified an association between the rare allele of the single-nucleotide polymorphism (SNP) rs3814113 (ie, the C allele) at 9p22.2 and decreased risk of ovarian cancer for women in the general population. We evaluated the association of this SNP with ovarian cancer risk among BRCA1 or BRCA2 mutation carriers by use of data from the Consortium of Investigators of Modifiers of BRCA1/2. Methods We genotyped rs3814113 in 10 029 BRCA1 mutation carriers and 5837 BRCA2 mutation carriers. Associations with ovarian and breast cancer were assessed with a retrospective likelihood approach. All statistical tests were two-sided. Results The minor allele of rs3814113 was associated with a reduced risk of ovarian cancer among BRCA1 mutation carriers (per-allele hazard ratio of ovarian cancer = 0.78, 95% confidence interval = 0.72 to 0.85; P = 4.8 × 10-9) and BRCA2 mutation carriers (hazard ratio of ovarian cancer = 0.78, 95% confidence interval = 0.67 to 0.90; P = 5.5 × 10-4). This SNP was not associated with breast cancer risk among either BRCA1 or BRCA2 mutation carriers. BRCA1 mutation carriers with the TT genotype at SNP rs3814113 were predicted to have an ovarian cancer risk to age 80 years of 48%, and those with the CC genotype were predicted to have a risk of 33%. Conclusion Common genetic variation at the 9p22.2 locus was associated with decreased risk of ovarian cancer for carriers of a BRCA1 or BRCA2 mutation. PMID:21169536
Heat Shock70 Protein Genes and Genetic Susceptibility to Apical Periodontitis
Maheshwari, Kanwal; Silva, Renato M.; Guajardo-Morales, Leticia; Garlet, Gustavo P.; Vieira, Alexandre R.; Letra, Ariadne
2016-01-01
Introduction Heat shock proteins (HSP) protect cells under adverse conditions such as infection, inflammation, and disease. The differential expression of HSPs in human periapical granulomas suggests a potential role for these proteins in periapical lesion development, which may contribute to different clinical outcomes. Therefore, we hypothesize that polymorphisms in HSP genes leading to perturbed gene expression and protein function may contribute to an individual’s susceptibility to periapical lesion development. Methods Subjects with deep carious lesions, with or without periapical lesions (≥ 3 mm) were recruited at the University of Texas School of Dentistry at Houston and at the University of Pittsburgh. Genomic DNA samples of 400 patients were sorted into 2 groups: 183 cases with deep carious lesions and periapical lesions (cases), and 217 cases with deep carious lesions but without periapical lesions (controls). Eight single nucleotide polymorphisms in HSPA4, HSPA6, HSPA1L, HSPA4L and HSPA9 genes were selected for genotyping. Genotypes were generated by endpoint analysis using Taqman chemistry in a real-time polymerase chain reaction assay. Allele and genotype frequencies were compared among cases and controls using chi-square and Fisher Exact tests as implemented in PLINK v.1.07. In silico analysis of SNP function was performed using Polymorphism Phenotyping V2 and MirSNP softwares. Results Overall, SNPs in HSPA1L and HSPA6 showed significant allelic association with cases of deep caries and periapical lesions (P<0.05). We also observed altered transmission of HSPA1L SNP haplotypes (P=0.03). In silico analysis of HSPA1L rs2075800 function showed that this SNP results in a glutamine to lysine substitution at position 602 of the protein and might affect the stability and function of the final protein. Conclusions Variations in HSPA1L and HSPA6 may be associated with periapical lesion formation in individuals with untreated deep carious lesions. Future studies could help predict host susceptibility to developing apical periodontitis. PMID:27567034
Fariña-Sarasqueta, A.; Gosens, M. J. E. M.; Moerland, E.; van Lijnschoten, I.; Lemmens, V. E. P. P.; Slooter, G. D.; Rutten, H. J. T.; van den Brule, A. J. C.
2010-01-01
Aim: Although the predictive and prognostic value of thymidylate synthase (TS) expression and gene polymorphism in colon cancer has been widely studied, the results are inconclusive probably because of methodological differences. With this study, we aimed to elucidate the role of TS gene polymorphisms genotyping in therapy response in stage III colon carcinoma patients treated with 5-FU adjuvant chemotherapy. Patients and Methods: 251 patients diagnosed with stage III colon carcinoma treated with surgery followed by 5-FU based adjuvant therapy were selected. The variable number of tandem repeats (VNTR) and the single nucleotide polymorphism (SNP) in the 5′-untranslated region of the TS gene were genotyped. Results: There was a positive association between tumor T stage and the VNTR genotypes (p=0.05). In both univariate and multivariate survival analysis no effects of the studied polymorphisms on survival were found. However, there was an association between both polymorphisms and age. Among patients younger than 60 years, the patients homozygous for 2R seemed to have a better overall survival, whereas among the patients older than 67 this longer survival was seen by the carriers of other genotypes. Conclusion: We conclude that the TS VNTR and SNP do not predict response to 5-FU therapy in patients with stage III colon carcinoma. However, age appears to modify the effects of TS polymorphisms on survival. PMID:20966539
2014-01-01
Background Previous studies have shown that single nucleotide polymorphisms (SNP) in IL28B and IL10R are associated with sustained virological response (SVR) in chronic hepatitis C patients treated with pegilated interferon plus ribavirin (P/R). The present study extends our earlier investigations on a large East-Central European cohort. The allele frequencies of IL28B and IL10R in genotype 1 HCV infection were compared with that of healthy controls for the purpose of examining the relationship between the polymorphisms and the SVR to P/R treatment. Methods A total of 748 chronic HCV1 infected patients (365 male, 383 female; 18–82 years) and 105 voluntary blood donors as controls were enrolled. Four hundred and twenty HCV patients were treated with P/R for 24–72 weeks, out of them 195 (46.4%) achieved SVR. The IL28 rs12979860 SNP was determined using Custom Taqman SNP Genotyping Assays. The IL10R −1087 (also known as IL10R −1082 (rs1800896) promoter region SNP was determined by RT-PCR and restriction fragment length polymorphism analysis. Results The IL28B CC genotype occurred with lower frequency in HCV patients than in controls (26.1% vs 51.4%, p<0.001). P/R treated patients with the IL28B CC genotype achieved higher SVR rate, as compared to patients with CT (58.6% vs 40.8%, p=0.002). The prevalence of IL10R −1087 GG genotype was lower in patients than in controls (31.8 % vs 52.2%, p<0.001). Among patients achieving SVR, the IL10R −1087 GG genotype occurred with higher frequency than the AA (32.0% vs 17.4%, p=0.013). The IL28B T allele plus IL10R A allele combination was found with higher prevalence in patients than in controls (52% vs 20.7%, p<0.001). The IL28B CC plus IL10R A allele combination occurred with higher frequency among patients with SVR than in non-responders (21.3% vs 12.8%, p=0.026). Both the IL28B CC plus IL10R GG and the IL28B CC plus IL10R A allele combinations occurred with lower frequency in patients than in controls. Conclusions In our HCV1 patients, both the IL28B CC and IL10R GG genotypes are associated with clearance of HCV. Moreover, distinct IL28B and IL10R allele combinations appear to be protective against chronic HCV1 infection and predictors of response to P/R therapy. PMID:24398031
Chen, Guo-Bo; Lee, Sang Hong; Brion, Marie-Jo A; Montgomery, Grant W; Wray, Naomi R; Radford-Smith, Graham L; Visscher, Peter M
2014-09-01
As custom arrays are cheaper than generic GWAS arrays, larger sample size is achievable for gene discovery. Custom arrays can tag more variants through denser genotyping of SNPs at associated loci, but at the cost of losing genome-wide coverage. Balancing this trade-off is important for maximizing experimental designs. We quantified both the gain in captured SNP-heritability at known candidate regions and the loss due to imperfect genome-wide coverage for inflammatory bowel disease using immunochip (iChip) and imputed GWAS data on 61,251 and 38.550 samples, respectively. For Crohn's disease (CD), the iChip and GWAS data explained 19 and 26% of variation in liability, respectively, and SNPs in the densely genotyped iChip regions explained 13% of the SNP-heritability for both the iChip and GWAS data. For ulcerative colitis (UC), the iChip and GWAS data explained 15 and 19% of variation in liability, respectively, and the dense iChip regions explained 10 and 9% of the SNP-heritability in the iChip and the GWAS data. From bivariate analyses, estimates of the genetic correlation in risk between CD and UC were 0.75 (SE 0.017) and 0.62 (SE 0.042) for the iChip and GWAS data, respectively. We also quantified the SNP-heritability of genomic regions that did or did not contain the previous 163 GWAS hits for CD and UC, and SNP-heritability of the overlapping loci between the densely genotyped iChip regions and the 163 GWAS hits. For both diseases, over different genomic partitioning, the densely genotyped regions on the iChip tagged at least as much variation in liability as in the corresponding regions in the GWAS data, however a certain amount of tagged SNP-heritability in the GWAS data was lost using the iChip due to the low coverage at unselected regions. These results imply that custom arrays with a GWAS backbone will facilitate more gene discovery, both at associated and novel loci. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
[Association of XRCC1 genetic polymorphism with susceptibility to non-Hodgkin's lymphoma].
Li, Su-Xia; Zhu, Hong-Li; Guo, Bo; Yang, Yang; Wang, Hong-Yan; Sun, Jing-Fen; Cao, Yong-Bin
2014-08-01
The purpose of this study was to explore the association between X-ray repair cross-complementing group 1 (XRCC1)gene polymorphism and non-Hodgkin's lymphoma risk. A total of 282 non-Hodgkin's lymphoma (NHL) patients and 231 normal controls were used to investigate the effect of three XRCC1 gene polymorphisms (rs25487, rs25489, rs1799782) on susceptibility to non-Hodgkin's lymphoma. Genotyping was performed by using SNaPshot method. All statistical analyses were done with R software. Genotype and allele frequencies of XRCC1 were compared between the patients and controls by using the chi-square test. Crude and adjusted odd ratios and 95% confidence intervals were calculated by using logistic regression on the basis of genetic different models. For four kinds of NHL, subgroup analyses were also conducted. Combined genotype analyses of the three XRCC1 polymorphisms were also done by using logistic regression. The results showed that the variant genotype frequency was not significantly different between the controls and NHL or NHL subtype cases. Combined genotype analyses of XRCC1 399-280-194 results showed that the combined genotype was not associated with risk of NHL overall, but the VT-WT-WT combined genotype was associated with the decreased risk of T-NHL (OR: 0.21; 95%CI (0.06-0.8); P = 0.022), and the WT-VT-WT combined genotype was associated with the increased risk of FL(OR:15.23; 95%CI (1.69-137.39); P = 0.015). It is concluded that any studied polymorphism (rs25487, rs25489, rs1799782) alone was not shown to be rela-ted with the risk of NHL or each histologic subtype of NHL. The combined genotype with mutation of three SNP of XRCC1 was not related to the risk of NHL. However, further large-scale studies would be needed to confirm the association of decreased or increased risk for T-NHL and FL with the risk 3 combined SNP mutants of XRCC1 polymorphism.
Presence of Mycobacterium leprae genotype 4 in environmental waters in Northeast Brazil.
Holanda, Maísa Viana de; Marques, Livia Erika Carlos; Macedo, Maria Luisa Bezerra de; Pontes, Maria Araci de Andrade; Sabadia, José Antonio Beltrão; Kerr, Ligia Regina Franco Sansigolo; Almeida, Rosa Lívia Freitas; Frota, Cristiane Cunha
2017-01-01
This study quantified Mycobacterium leprae bacilli in environmental water samples from five municipalities in the State of Ceará by quantitative polymerase chain reaction (qPCR) and compared the identified genotypes with those obtained from leprosy patient biopsies. We collected five replicas from each of the 30 selected reservoirs and skin lesion biopsies from 25 new leprosy cases treated at a reference center in Fortaleza, Ceará from 2010 to 2013. The 16S rRNA gene region of M. leprae was amplified by qPCR and a standard curve was created with the pIDTBlue 16SrRNAMlep plasmid. The Juazeiro do Norte water samples and the biopsies were genotyped (single nucleotide polymorphism [SNP] 1 to 4) and the SNP 4 genotypes were subtyped. Of the 149 water samples analyzed, 54.4% were positive for the M. leprae DNA. The M. leprae bacilli copy number ranged from 1.42 × 10 -1 to 1.44 × 10 + 2 . Most biopsies showed SNP type 4 (64%), while all samples from Juazeiro do Norte were SNP type 4, with subtype 4-N appearing at the highest frequency. We suggest that environmental waters containing M. leprae bacilli play an important role in disease transmission, justifying PGL-1 seropositivity in individuals living in areas where there is no reported case, and in leprosy cases individuals who report no previous contact with other case. Therefore, further investigation is needed to clarify disease transmission in this region and to explore the role of the environment. We also suggest that in this area surveillance for leprosy cases should be intensified.
Kim, H; Lee, S K; Hong, M W; Park, S R; Lee, Y S; Kim, J W; Lee, H K; Jeong, D K; Song, Y H; Lee, S J
2013-12-01
The akirin 2 gene, located on chromosome 9 in cattle, was previously reported to be associated with nuclear factor-kappa B (NF-κB), involved in immune reactions and marbling of meat. To determine whether a single nucleotide polymorphism (SNP) in akirin 2 is associated with economically important traits of Korean native cattle, the c.*188G>A SNP DNA marker in the 3'-UTR region of akirin 2 was analyzed for its association with carcass weight, longissimus muscle area and marbling. The c.*188G>A SNP was genotyped by polymerase chain reaction restriction fragment length polymorphism, and the frequency of the AA, AG, and GG genotypes were 6.82%, 71.29% and 21.88% respectively. This SNP was significantly associated with longissimus muscle area (Bonferroni corrected P < 0.05), and marbling score (Bonferroni corrected P < 0.01). These results suggest that the c.*188G>A SNP of akirin 2 might be useful as a DNA marker for longissimus muscle area and marbling scores in Korean native cattle. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
Laddha, Naresh C.; Dwivedi, Mitesh; Mansuri, Mohmmad Shoab; Singh, Mala; Patel, Hetanshi H.; Agarwal, Nishtha; Shah, Anish M.; Begum, Rasheedunnisa
2014-01-01
Background Vitiligo is a depigmenting disorder resulting from loss of functional melanocytes in the skin. NPY plays an important role in induction of immune response by acting on a variety of immune cells. NPY synthesis and release is governed by IL1B. Moreover, genetic variability in IL1B is reported to be associated with elevated NPY levels. Objectives Aim of the present study was to explore NPY promoter −399T/C (rs16147) and exon2 +1128T/C (rs16139) polymorphisms as well as IL1B promoter −511C/T (rs16944) polymorphism and to correlate IL1B transcript levels with vitiligo. Methods PCR-RFLP method was used to genotype NPY -399T/C SNP in 454 patients and 1226 controls; +1128T/C SNP in 575 patients and 1279 controls and IL1B −511C/T SNP in 448 patients and 785 controls from Gujarat. IL1B transcript levels in blood were also assessed in 105 controls and 95 patients using real-time PCR. Results Genotype and allele frequencies for NPY −399T/C, +1128T/C and IL1B −511C/T SNPs differed significantly (p<0.0001, p<0.0001; p = 0.0161, p = 0.0035 and p<0.0001, p<0.0001) between patients and controls. ‘TC’ haplotype containing minor alleles of NPY polymorphisms was significantly higher in patients and increased the risk of vitiligo by 2.3 fold (p<0.0001). Transcript levels of IL1B were significantly higher, in patients compared to controls (p = 0.0029), in patients with active than stable vitiligo (p = 0.015), also in female patients than male patients (p = 0.026). Genotype-phenotype correlation showed moderate association of IL1B -511C/T polymorphism with higher IL1B transcript levels. Trend analysis revealed significant difference between patients and controls for IL1B transcript levels with respect to different genotypes. Conclusion Our results suggest that NPY −399T/C, +1128T/C and IL1B −511C/T polymorphisms are associated with vitiligo and IL1B −511C/T SNP influences its transcript levels leading to increased risk for vitiligo in Gujarat population. Up-regulation of IL1B transcript in patients advocates its possible role in autoimmune pathogenesis of vitiligo. PMID:25221996
Alaylıoğlu, Merve; Gezen-Ak, Duygu; Dursun, Erdinç; Bilgiç, Başar; Hanağası, Haşmet; Ertan, Turan; Gürvit, Hakan; Emre, Murat; Eker, Engin; Uysal, Ömer; Yılmazer, Selma
2016-07-01
Previous studies have demonstrated that clusterin (CLU), which is also known as apolipoprotein J, is involved in the pathogenesis of Alzheimer disease (AD). In this study, we investigated the association between rs2279590, rs11136000, and rs9331888 single-nucleotide polymorphisms (SNPs) in CLU and apolipoprotein E (APOE) genotypes in a cohort of Turkish patients with late-onset AD (LOAD). There were 183 patients with LOAD and 154 healthy controls included in the study. The CLU and APOE polymorphisms were genotyped using the LightSNiP assay. The "GG" genotype of rs9331888 was significantly more frequent in patients with LOAD. The "CC" genotype of the SNP was significantly more frequent in controls. The rs9331888 "GG" genotype in patients and the "CC" genotype in controls were significantly higher in non-∊4 allele carriers of APOE The haplotype analysis showed the CLU "GCG" haplotype was a risk haplotype. Our findings indicate the rs9331888 SNP of CLU is associated with LOAD independent of APOE. © The Author(s) 2016.
Genetic and clinical risk factors of root resorption associated with orthodontic treatment.
Guo, Yujiao; He, Shushu; Gu, Tian; Liu, Yi; Chen, Song
2016-08-01
External apical root resorption (EARR) is a common complication in orthodontic treatment. Despite many studies on EARR, great controversies remain with regard to its risk factors. The objective of this study was to explore the relationship among sex, root movement, IL-1RN single nucleotide polymorphism (SNP) rs419598, IL-6 SNP rs1800796, and EARR associated with orthodontic treatment. Altogether 174 patients (with 174 maxillary left central incisors) were selected for this study. Cone-beam computed tomography was performed before the start of the treatment and at the end of the treatment. Cone-beam computed tomography data were used to reconstruct a 3-dimensional image of each tooth; the volume and the root resorption volume of each tooth were calculated. Three-dimensional matching was used to measure the amount of movement of each root. Genomic DNA was extracted from buccal swabs, and genotypes of SNP rs419598 and SNP rs1800796 of each subject were determined using TaqMan polymerase chain reaction genotyping (Applied Biosystems, Foster City, Calif). The data were analyzed with multiple linear regression analysis. The statistical analysis indicated no relationship between sex, tooth movement amount, and IL-1RN SNP rs419598 with EARR. The IL-6 SNP rs1800796 GC was associated with EARR, and root resorption differed significantly between SNP rs1800796 GC and CC. IL-6 SNP rs1800796 GC is a risk factor for EARR. The amount of root movement, IL-1RN SNP rs419598, and sex as risk factors for EARR need further study. Copyright © 2016 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.
Espin‐Garcia, Osvaldo; Craiu, Radu V.
2017-01-01
ABSTRACT We evaluate two‐phase designs to follow‐up findings from genome‐wide association study (GWAS) when the cost of regional sequencing in the entire cohort is prohibitive. We develop novel expectation‐maximization‐based inference under a semiparametric maximum likelihood formulation tailored for post‐GWAS inference. A GWAS‐SNP (where SNP is single nucleotide polymorphism) serves as a surrogate covariate in inferring association between a sequence variant and a normally distributed quantitative trait (QT). We assess test validity and quantify efficiency and power of joint QT‐SNP‐dependent sampling and analysis under alternative sample allocations by simulations. Joint allocation balanced on SNP genotype and extreme‐QT strata yields significant power improvements compared to marginal QT‐ or SNP‐based allocations. We illustrate the proposed method and evaluate the sensitivity of sample allocation to sampling variation using data from a sequencing study of systolic blood pressure. PMID:29239496
Wujcicka, Wioletta; Wilczyński, Jan; Nowakowska, Dorota
2017-09-01
The research was conducted to evaluate the role of genotypes, haplotypes and multiple-SNP variants in the range of TLR2, TLR4 and TLR9 single nucleotide polymorphisms (SNPs) in the development of Toxoplasma gondii infection among Polish pregnant women. The study was performed for 116 Polish pregnant women, including 51 patients infected with T. gondii, and 65 age-matched control pregnant individuals. Genotypes in TLR2 2258 G>A, TLR4 896 A>G, TLR4 1196 C>T and TLR9 2848 G>A SNPs were estimated by self-designed, nested PCR-RFLP assays. Randomly selected PCR products, representative for distinct genotypes in the studied polymorphisms, were confirmed by sequencing. All the genotypes were calculated for Hardy-Weinberg (H-W) equilibrium and TLR4 variants were tested for linkage disequilibrium. Relationships were assessed between alleles, genotypes, haplotypes or multiple-SNP variants in TLR polymorphisms and the occurrence of T. gondii infection in pregnant women, using a logistic regression model. All the analyzed genotypes preserved the H-W equilibrium among the studied groups of patients (P>0.050). Similar distribution of distinct alleles and individual genotypes in TLR SNPs, as well as of haplotypes in TLR4 polymorphisms, were observed in T. gondii infected and control uninfected pregnant women. However, the GACG multiple-SNP variant, within the range of all the four studied polymorphisms, was correlated with a decreased risk of the parasitic infection (OR 0.52, 95% CI 0.28-0.97; P≤0.050). The polymorphisms, located within TLR2, TLR4 and TLR9 genes, may be involved together in occurrence of T. gondii infection among Polish pregnant women. Copyright © 2017 Medical University of Bialystok. Published by Elsevier B.V. All rights reserved.
Shah, Kushani; Thomas, Shelby; Stein, Arnold
2013-01-01
In this report, we describe a 5-week laboratory exercise for undergraduate biology and biochemistry students in which students learn to sequence DNA and to genotype their DNA for selected single nucleotide polymorphisms (SNPs). Students use miniaturized DNA sequencing gels that require approximately 8 min to run. The students perform G, A, T, C Sanger sequencing reactions. They prepare and run the gels, perform Southern blots (which require only 10 min), and detect sequencing ladders using a colorimetric detection system. Students enlarge their sequencing ladders from digital images of their small nylon membranes, and read the sequence manually. They compare their reads with the actual DNA sequence using BLAST2. After mastering the DNA sequencing system, students prepare their own DNA from a cheek swab, polymerase chain reaction-amplify a region of their DNA that encompasses a SNP of interest, and perform sequencing to determine their genotype at the SNP position. A family pedigree can also be constructed. The SNP chosen by the instructor was rs17822931, which is in the ABCC11 gene and is the determinant of human earwax type. Genotypes at the rs178229931 site vary in different ethnic populations. © 2013 by The International Union of Biochemistry and Molecular Biology.
Associations between variants of bone morphogenetic protein 7 gene and growth traits in chickens.
Wang, Yan; Guo, Fuyou; Qu, Hao; Luo, Chenglong; Wang, Jie; Shu, Dingming
2018-04-18
1. Enhancing bone strength to solve leg disorders in poultry has become an important goal in broiler production. 2. Bone morphogenetic protein 7 (BMP7), a member of the BMP family, represents an attractive therapeutic target for bone regeneration in humans and plays critical roles in skeletal development. 3. The objective of this study was to investigate the relationship between BMP7 gene expression, single nucleotide polymorphisms (SNPs) and growth traits in chickens. Here, a SNP (c.1995T>C) in the chicken (Gallus gallus) BMP7 gene was identified, that was associated with growth and carcass traits. 4. Genotyping revealed that the T allele occurred more frequently in breeds with high growth rates, whereas the C allele was predominant in those with low growth rates. The expression level of BMP7 in the thigh bone of birds with the TT genotype was significantly higher than in those with the CC genotype at 21, 42 and 91 days of age. 5. These findings suggest that selecting the birds with the TT genotype of SNP c.1995T>C could improve bone growth, could reduce leg disorders in fast-growing birds. The SNP c.1995T>C may serve as a selective marker for improving bone growth and increasing the consistency of body weights in poultry breeding.
Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl
2017-01-01
Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273
Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Alonso, M. Rosario; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Benitez, Javier; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M.; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F.; Fasching, Peter A.; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G.; Goldberg, Mark S.; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A.; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L.; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L.; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C.; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H.; Tessier, Daniel C.; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M.; Vincent, Daniel; Winqvist, Robert; Wu, Anna H.; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D. P.; Hall, Per; Edwards, Stacey L.; Simard, Jacques; French, Juliet D.; Chenevix-Trench, Georgia; Dunning, Alison M.
2016-01-01
Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90–0.94; P = 8.96 × 10−15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10−09, r2 = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10−11, r2 = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus. PMID:27600471
Wu, Xiaoping; Guldbrandtsen, Bernt; Lund, Mogens Sandø; Sahana, Goutam
2016-09-01
Identification of genetic variants associated with feet and legs disorders (FLD) will aid in the genetic improvement of these traits by providing knowledge on genes that influence trait variations. In Denmark, FLD in cattle has been recorded since the 1990s. In this report, we used deregressed breeding values as response variables for a genome-wide association study. Bulls (5,334 Danish Holstein, 4,237 Nordic Red Dairy Cattle, and 1,180 Danish Jersey) with deregressed estimated breeding values were genotyped with the Illumina Bovine 54k single nucleotide polymorphism (SNP) genotyping array. Genotypes were imputed to whole-genome sequence variants, and then 22,751,039 SNP on 29 autosomes were used for an association analysis. A modified linear mixed-model approach (efficient mixed-model association eXpedited, EMMAX) and a linear mixed model were used for association analysis. We identified 5 (3,854 SNP), 3 (13,642 SNP), and 0 quantitative trait locus (QTL) regions associated with the FLD index in Danish Holstein, Nordic Red Dairy Cattle, and Danish Jersey populations, respectively. We did not identify any QTL that were common among the 3 breeds. In a meta-analysis of the 3 breeds, 4 QTL regions were significant, but no additional QTL region was identified compared with within-breed analyses. Comparison between top SNP locations within these QTL regions and known genes suggested that RASGRP1, LCORL, MOS, and MITF may be candidate genes for FLD in dairy cattle. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Alonso, M Rosario; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Benitez, Javier; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F; Fasching, Peter A; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G; Goldberg, Mark S; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H; Tessier, Daniel C; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M; Vincent, Daniel; Winqvist, Robert; Wu, Anna H; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D P; Hall, Per; Edwards, Stacey L; Simard, Jacques; French, Juliet D; Chenevix-Trench, Georgia; Dunning, Alison M
2016-09-07
Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10(-15))) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10(-09), r(2) = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10(-11), r(2) = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.
Generation of a Saturated Genetic Recombination Map for Avocado (Persea americana)
USDA-ARS?s Scientific Manuscript database
Two large mapping populations of avocado consisting of 1582 trees were genotyped with 5050 SNP markers from transcribed genes using an Illumina Infinium SNP chip. A Florida mapping population consisted of 527 progeny from 'Tonnage' x 'Simmonds' and 249 from 'Simmonds' x 'Tonnage'. A California map...
SNP-based genotyping in lentil: linking sequence information with phenotypes
USDA-ARS?s Scientific Manuscript database
Lentil (Lens culinaris) has been late to enter the world of high throughput molecular analysis due to a general lack of genomic resources. Using a 454 sequencing-based approach, SNPs have been identified in genes across the lentil genome. Several hundred have been turned into single SNP KASP assay...
USDA-ARS?s Scientific Manuscript database
Next-generation sequencing (NGS) technologies are revolutionizing both medical and biological research through generation of massive SNP data sets for identifying heritable genome variation underlying key traits, from rare human diseases to important agronomic phenotypes in crop species. We evaluate...
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.
Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D
2013-02-28
Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation
2013-01-01
Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
Daca-Roszak, P; Pfeifer, A; Żebracka-Gala, J; Jarząb, B; Witt, M; Ziętkiewicz, E
2016-01-01
Assays that allow analysis of the biogeographic origin of biological samples in a standard forensic laboratory have to target a small number of highly differentiating markers. Such markers should be easy to multiplex and the assay must perform well in the degraded and scarce biological material. SNPs localized in the genome regions, which in the past were subjected to differential selective pressure in various populations, are the most widely used markers in the studies of biogeographic affiliation. SNPs reflecting biogeographic differences not related to any phenotypic traits are not sufficiently explored. The goal of our study was to identify a small set of SNPs not related to any known pigmentation/phenotype-specific genes, which would allow efficient discrimination between populations of Europe and East Asia. The selection of SNPs was based on the comparative analysis of representative European and Chinese/Japanese samples (B-lymphocyte cell lines), genotyped using the Infinium HumanOmniExpressExome microarray (Illumina). The classifier, consisting of 24 unlinked SNPs (24-SNP classifier), was selected. The performance of a 14-SNP subset of this classifier (14-SNP subclassifier) was tested using genotype data from several populations. The 14-SNP subclassifier differentiated East Asians, Europeans and Africans with ∼100% accuracy; Palestinians, representative of the Middle East, clustered with Europeans, while Amerindians and Pakistani were placed between East Asian and European populations. Based on these results, we have developed a SNaPshot assay (EurEAs_Gplex) for genotyping SNPs from the 14-SNP subclassifier, combined with an additional marker for gender identification. Forensic utility of the EurEAs_Gplex was verified using degraded and low quantity DNA samples. The performance of the EurEAs_Gplex was satisfactory when using degraded DNA; tests using low quantity DNA samples revealed a previously not described source of genotyping errors, potentially important for any SNaPshot-based assays. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Strategies for genotype imputation in composite beef cattle.
Chud, Tatiane C S; Ventura, Ricardo V; Schenkel, Flavio S; Carvalheiro, Roberto; Buzanskas, Marcos E; Rosa, Jaqueline O; Mudadu, Maurício de Alvarenga; da Silva, Marcos Vinicius G B; Mokry, Fabiana B; Marcondes, Cintia R; Regitano, Luciana C A; Munari, Danísio P
2015-08-07
Genotype imputation has been used to increase genomic information, allow more animals in genome-wide analyses, and reduce genotyping costs. In Brazilian beef cattle production, many animals are resulting from crossbreeding and such an event may alter linkage disequilibrium patterns. Thus, the challenge is to obtain accurately imputed genotypes in crossbred animals. The objective of this study was to evaluate the best fitting and most accurate imputation strategy on the MA genetic group (the progeny of a Charolais sire mated with crossbred Canchim X Zebu cows) and Canchim cattle. The data set contained 400 animals (born between 1999 and 2005) genotyped with the Illumina BovineHD panel. Imputation accuracy of genotypes from the Illumina-Bovine3K (3K), Illumina-BovineLD (6K), GeneSeek-Genomic-Profiler (GGP) BeefLD (GGP9K), GGP-IndicusLD (GGP20Ki), Illumina-BovineSNP50 (50K), GGP-IndicusHD (GGP75Ki), and GGP-BeefHD (GGP80K) to Illumina-BovineHD (HD) SNP panels were investigated. Seven scenarios for reference and target populations were tested; the animals were grouped according with birth year (S1), genetic groups (S2 and S3), genetic groups and birth year (S4 and S5), gender (S6), and gender and birth year (S7). Analyses were performed using FImpute and BEAGLE software and computation run-time was recorded. Genotype imputation accuracy was measured by concordance rate (CR) and allelic R square (R(2)). The highest imputation accuracy scenario consisted of a reference population with males and females and a target population with young females. Among the SNP panels in the tested scenarios, from the 50K, GGP75Ki and GGP80K were the most adequate to impute to HD in Canchim cattle. FImpute reduced computation run-time to impute genotypes from 20 to 100 times when compared to BEAGLE. The genotyping panels possessing at least 50 thousands markers are suitable for genotype imputation to HD with acceptable accuracy. The FImpute algorithm demonstrated a higher efficiency of imputed markers, especially in lower density panels. These considerations may assist to increase genotypic information, reduce genotyping costs, and aid in genomic selection evaluations in crossbred animals.
Wang, Xiuge; Cui, Xiaohui; Zhang, Yan; Hao, Haisheng; Ju, Zhihua; Liu, Deyu; Jiang, Qiang; Yang, Chunhong; Sun, Yan; Wang, Changfa; Huang, Jinming; Zhu, Huabin
2017-11-01
RAB, member of RAS oncogene family like 2B (RABL2B) is a member of a poorly characterised clade of the RAS GTPase superfamily, which plays an essential role in male fertility, sperm intraflagellar transport and tail assembly. In the present study, we identified a novel RABL2B splice variant in bovine testis and spermatozoa. This splice variant, designated RABL2B-TV, is characterised by exon 2 skipping. Moreover, a single nucleotide polymorphism (SNP), namely c.125G>A, was found within the exonic splicing enhancer (ESE) motif, indicating that the SNP caused the production of the RABL2B-TV aberrant splice variant. This was demonstrated by constructing a pSPL3 exon capturing vector with different genotypes and transfecting these vectors into murine Leydig tumour cell line (MLTC-1) cells. Expression of the RABL2B-TV transcript was lower in semen from high- versus low-performance bulls. Association analysis showed that sperm deformity rate was significantly lower in Chinese Holstein bulls with the GG or GA genotype than in bulls with the AA genotype (P<0.05). In addition, initial sperm motility was significantly higher in individuals with the GG or GA genotype than in individuals with the AA genotype (P<0.05). The findings of the present study suggest that the difference in semen quality in bulls with different RABL2B genotypes is generated via an alternative splicing mechanism caused by a functional SNP within the ESE motif.
Multiplex-Ready Technology for mid-throughput genotyping of molecular markers.
Bonneau, Julien; Hayden, Matthew
2014-01-01
Screening molecular markers across large populations in breeding programs is generally time consuming and expensive. The Multiplex-Ready Technology (MRT) (Hayden et al., BMC genomics 9:80, 2008) was created to optimize polymorphism screening and genotyping using standardized PCR reaction conditions. The flexibility of this method maximizes the number of markers (up to 24 markers SSR or SNP, ideally small PCR product <500 bp and highly polymorphic) by using fluorescent dye (VIC, FAM, NED, and PET) and a semiautomated DNA fragment analyzer (ABI3730) capillary electrophoresis for large numbers of DNA samples (96 or 384 samples).
Maximum likelihood estimation of linkage disequilibrium in half-sib families.
Gomez-Raya, L
2012-05-01
Maximum likelihood methods for the estimation of linkage disequilibrium between biallelic DNA-markers in half-sib families (half-sib method) are developed for single and multifamily situations. Monte Carlo computer simulations were carried out for a variety of scenarios regarding sire genotypes, linkage disequilibrium, recombination fraction, family size, and number of families. A double heterozygote sire was simulated with recombination fraction of 0.00, linkage disequilibrium among dams of δ=0.10, and alleles at both markers segregating at intermediate frequencies for a family size of 500. The average estimates of δ were 0.17, 0.25, and 0.10 for Excoffier and Slatkin (1995), maternal informative haplotypes, and the half-sib method, respectively. A multifamily EM algorithm was tested at intermediate frequencies by computer simulation. The range of the absolute difference between estimated and simulated δ was between 0.000 and 0.008. A cattle half-sib family was genotyped with the Illumina 50K BeadChip. There were 314,730 SNP pairs for which the sire was a homo-heterozygote with average estimates of r2 of 0.115, 0.067, and 0.111 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. There were 208,872 SNP pairs for which the sire was double heterozygote with average estimates of r2 across the genome of 0.100, 0.267, and 0.925 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. Genome analyses for all possible sire genotypes with 829,042 tests showed that ignoring half-sib family structure leads to upward biased estimates of linkage disequilibrium. Published inferences on population structure and evolution of cattle should be revisited after accommodating existing half-sib family structure in the estimation of linkage disequilibrium.
Maximum Likelihood Estimation of Linkage Disequilibrium in Half-Sib Families
Gomez-Raya, L.
2012-01-01
Maximum likelihood methods for the estimation of linkage disequilibrium between biallelic DNA-markers in half-sib families (half-sib method) are developed for single and multifamily situations. Monte Carlo computer simulations were carried out for a variety of scenarios regarding sire genotypes, linkage disequilibrium, recombination fraction, family size, and number of families. A double heterozygote sire was simulated with recombination fraction of 0.00, linkage disequilibrium among dams of δ = 0.10, and alleles at both markers segregating at intermediate frequencies for a family size of 500. The average estimates of δ were 0.17, 0.25, and 0.10 for Excoffier and Slatkin (1995), maternal informative haplotypes, and the half-sib method, respectively. A multifamily EM algorithm was tested at intermediate frequencies by computer simulation. The range of the absolute difference between estimated and simulated δ was between 0.000 and 0.008. A cattle half-sib family was genotyped with the Illumina 50K BeadChip. There were 314,730 SNP pairs for which the sire was a homo-heterozygote with average estimates of r2 of 0.115, 0.067, and 0.111 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. There were 208,872 SNP pairs for which the sire was double heterozygote with average estimates of r2 across the genome of 0.100, 0.267, and 0.925 for half-sib, Excoffier and Slatkin (1995), and maternal informative haplotypes methods, respectively. Genome analyses for all possible sire genotypes with 829,042 tests showed that ignoring half-sib family structure leads to upward biased estimates of linkage disequilibrium. Published inferences on population structure and evolution of cattle should be revisited after accommodating existing half-sib family structure in the estimation of linkage disequilibrium. PMID:22377635
Design and characterization of a 52K SNP chip for goats.
Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C M; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T; McEwan, John; Martin, Patrice; Moreno, Carole R; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang
2014-01-01
The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50-60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years.
Design and Characterization of a 52K SNP Chip for Goats
Tosser-Klopp, Gwenola; Bardou, Philippe; Bouchez, Olivier; Cabau, Cédric; Crooijmans, Richard; Dong, Yang; Donnadieu-Tonon, Cécile; Eggen, André; Heuven, Henri C. M.; Jamli, Saadiah; Jiken, Abdullah Johari; Klopp, Christophe; Lawley, Cynthia T.; McEwan, John; Martin, Patrice; Moreno, Carole R.; Mulsant, Philippe; Nabihoudine, Ibouniyamine; Pailhoux, Eric; Palhière, Isabelle; Rupp, Rachel; Sarry, Julien; Sayre, Brian L.; Tircazes, Aurélie; Jun Wang; Wang, Wen; Zhang, Wenguang
2014-01-01
The success of Genome Wide Association Studies in the discovery of sequence variation linked to complex traits in humans has increased interest in high throughput SNP genotyping assays in livestock species. Primary goals are QTL detection and genomic selection. The purpose here was design of a 50–60,000 SNP chip for goats. The success of a moderate density SNP assay depends on reliable bioinformatic SNP detection procedures, the technological success rate of the SNP design, even spacing of SNPs on the genome and selection of Minor Allele Frequencies (MAF) suitable to use in diverse breeds. Through the federation of three SNP discovery projects consolidated as the International Goat Genome Consortium, we have identified approximately twelve million high quality SNP variants in the goat genome stored in a database together with their biological and technical characteristics. These SNPs were identified within and between six breeds (meat, milk and mixed): Alpine, Boer, Creole, Katjang, Saanen and Savanna, comprising a total of 97 animals. Whole genome and Reduced Representation Library sequences were aligned on >10 kb scaffolds of the de novo goat genome assembly. The 60,000 selected SNPs, evenly spaced on the goat genome, were submitted for oligo manufacturing (Illumina, Inc) and published in dbSNP along with flanking sequences and map position on goat assemblies (i.e. scaffolds and pseudo-chromosomes), sheep genome V2 and cattle UMD3.1 assembly. Ten breeds were then used to validate the SNP content and 52,295 loci could be successfully genotyped and used to generate a final cluster file. The combined strategy of using mainly whole genome Next Generation Sequencing and mapping on a contig genome assembly, complemented with Illumina design tools proved to be efficient in producing this GoatSNP50 chip. Advances in use of molecular markers are expected to accelerate goat genomic studies in coming years. PMID:24465974
Analysis of single nucleotide polymorphisms in case-control studies.
Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer
2011-01-01
Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.
VCS: Tool for Visualizing Copy Number Variation and Single Nucleotide Polymorphism.
Kim, HyoYoung; Sung, Samsun; Cho, Seoae; Kim, Tae-Hun; Seo, Kangseok; Kim, Heebal
2014-12-01
Copy number variation (CNV) or single nucleotide phlyorphism (SNP) is useful genetic resource to aid in understanding complex phenotypes or deseases susceptibility. Although thousands of CNVs and SNPs are currently avaliable in the public databases, they are somewhat difficult to use for analyses without visualization tools. We developed a web-based tool called the VCS (visualization of CNV or SNP) to visualize the CNV or SNP detected. The VCS tool can assist to easily interpret a biological meaning from the numerical value of CNV and SNP. The VCS provides six visualization tools: i) the enrichment of genome contents in CNV; ii) the physical distribution of CNV or SNP on chromosomes; iii) the distribution of log2 ratio of CNVs with criteria of interested; iv) the number of CNV or SNP per binning unit; v) the distribution of homozygosity of SNP genotype; and vi) cytomap of genes within CNV or SNP region.
Nemoto, Kiyotaka; Takahashi, Tsutomu; Aleksic, Branko; Furuichi, Atsushi; Nakamura, Yumiko; Ikeda, Masashi; Noguchi, Kyo; Kaibuchi, Kozo; Iwata, Nakao; Ozaki, Norio; Suzuki, Michio
2014-01-01
Background YWHAE is a possible susceptibility gene for schizophrenia that encodes 14-3-3epsilon, a Disrupted-in-Schizophrenia 1 (DISC1)-interacting molecule, but the effect of variation in its genotype on brain morphology remains largely unknown. Methods In this voxel-based morphometric magnetic resonance imaging study, we conducted whole-brain analyses regarding the effects of YWHAE single-nucleotide polymorphisms (SNPs) (rs28365859, rs11655548, and rs9393) and DISC1 SNP (rs821616) on gray matter volume in a Japanese sample of 72 schizophrenia patients and 86 healthy controls. On the basis of a previous animal study, we also examined the effect of rs28365859 genotype specifically on hippocampal volume. Results Whole-brain analyses showed no significant genotype effect of these SNPs on gray matter volume in all subjects, but we found significant genotype-by-diagnosis interaction for rs28365859 in the left insula and right putamen. The protective C allele carriers of rs28365859 had a significantly larger left insula than the G homozygotes only for schizophrenia patients, while the controls with G allele homozygosity had a significantly larger right putamen than the C allele carriers. The C allele carriers had a larger right hippocampus than the G allele homozygotes in schizophrenia patients, but not in healthy controls. No significant interaction was found between rs28365859 and DISC1 SNP on gray matter volume. Conclusions These different effects of the YWHAE (rs28365859) genotype on brain morphology in schizophrenia and healthy controls suggest that variation in its genotype might be, at least partly, related to the abnormal neurodevelopment, including in the limbic regions, reported in schizophrenia. Our results also suggest its specific role among YWHAE SNPs in the pathophysiology of schizophrenia. PMID:25105667
Birdsell, Dawn N.; Pearson, Talima; Price, Erin P.; Hornstra, Heidie M.; Nera, Roxanne D.; Stone, Nathan; Gruendike, Jeffrey; Kaufman, Emily L.; Pettus, Amanda H.; Hurbon, Audriana N.; Buchhagen, Jordan L.; Harms, N. Jane; Chanturia, Gvantsa; Gyuranecz, Miklos; Wagner, David M.; Keim, Paul S.
2012-01-01
Single nucleotide polymorphisms (SNPs) are abundant in genomes of all species and biologically informative markers extensively used across broad scientific disciplines. Newly identified SNP markers are publicly available at an ever-increasing rate due to advancements in sequencing technologies. Efficient, cost-effective SNP genotyping methods to screen sample populations are in great demand in well-equipped laboratories, but also in developing world situations. Dual Probe TaqMan assays are robust but can be cost-prohibitive and require specialized equipment. The Mismatch Amplification Mutation Assay, coupled with melt analysis (Melt-MAMA), is flexible, efficient and cost-effective. However, Melt-MAMA traditionally suffers from high rates of assay design failures and knowledge gaps on assay robustness and sensitivity. In this study, we identified strategies that improved the success of Melt-MAMA. We examined the performance of 185 Melt-MAMAs across eight different pathogens using various optimization parameters. We evaluated the effects of genome size and %GC content on assay development. When used collectively, specific strategies markedly improved the rate of successful assays at the first design attempt from ∼50% to ∼80%. We observed that Melt-MAMA accurately genotypes across a broad DNA range (∼100 ng to ∼0.1 pg). Genomic size and %GC content influence the rate of successful assay design in an independent manner. Finally, we demonstrated the versatility of these assays by the creation of a duplex Melt-MAMA real-time PCR (two SNPs) and conversion to a size-based genotyping system, which uses agarose gel electrophoresis. Melt-MAMA is comparable to Dual Probe TaqMan assays in terms of design success rate and accuracy. Although sensitivity is less robust than Dual Probe TaqMan assays, Melt-MAMA is superior in terms of cost-effectiveness, speed of development and versatility. We detail the parameters most important for the successful application of Melt-MAMA, which should prove useful to the wider scientific community. PMID:22438886
Lipphardt, Mark F; Deryal, Mustafa; Ong, Mei Fang; Schmidt, Werner; Mahlknecht, Ulrich
2013-01-01
Estrogen and progesterone hormones are key regulators of a wide variety of biological processes. In addition to their influence on reproduction, cell differentiation and apoptosis, they affect inflammatory response, cell metabolism and most importantly, they regulate physiological breast tissue proliferation and differentiation as well as the development and progression of breast cancer. In order to assess whether genetic variants in the steroid hormone receptor gene ESR1 (estrogen receptor alpha) had an effect on sporadic breast cancer susceptibility, we assessed 7 ESR1 single nucleotide polymorphisms (SNPs) for associations with breast cancer susceptibility and clinical parameters in 221 breast cancer patients and 221 controls, respectively. We identified ESR1 intron SNP +2464 C/T (rs3020314) and ESR1 intron SNP -4576 A/C (rs1514348) to correlate with breast cancer susceptibility and progesterone receptor expression status. Patients genotyped CT for ESR1 intron SNP +2464 (rs3020314) (p ≤ 0.045) or genotyped AC for ESR1 intron SNP -4576 (rs1514348) (p ≤ 0.000026) were identified to carry a significant risk as to the development of breast cancer in the Central European Caucasian population (both together: p ≤ 0.000488). Our study could confirm previous associations and revealed new associations of SNP rs1514348 with susceptibility to breast cancer and clinical outcome, which might be used as new additional SNP markers.
Rodriguez-Jimenez, R; Hoenicka, J; Jimenez-Arriero, M A; Ponce, G; Bagney, A; Aragues, M; Palomo, T
2006-01-01
Previous studies have associated a decreased striatal D2 dopamine receptor (DRD2) binding with impaired performance in cognitive tasks. In vivo studies have found a lower DRD2 binding associated with the CC genotype of the C957T single nucleotide polymorphism (SNP) of the DRD2 gene. The aim of this study was to investigate the relationship between executive functions and the C957T DRD2 SNP. We hypothesized that the CC genotype would be associated with a poorer executive functioning. Our sample consisted of 83 healthy volunteers (28 males and 55 females; mean age 25.2, SD 1.7 years). To assess executive functions, the Wisconsin Card Sorting Test was used, considering the variables perseverative errors, perseverative responses, and number of categories achieved. The genotype distribution was 13 CC, 41 CT, and 29 TT, satisfying Hardy-Weinberg equilibrium. Carriers of the CC genotype, compared with carriers of the CT/TT genotypes, achieved significantly fewer categories (5.00 vs. 5.81; p = 0.004), made a greater number of perseverative errors (13.46 vs. 8.39; p = 0.018), and had a greater number of perseverative responses (14.92 vs. 8.94; p = 0.014). Our results support the hypothesis that the C957T DRD2 SNP may influence cognitive performance through its repercussions on central dopaminergic function. 2006 S. Karger AG, Basel
Li, Jiali; Jiao, Xiaodong; Zhang, Qingjiong; Hejtmancik, J Fielding
2017-01-01
Previously, a genome-wide association study (GWAS) identified rs13382811 (near ZFHX1B) and rs6469937 (near SNTB1) to be associated with high myopia. The present study evaluates the association of these two single nucleotide polymorphisms (SNPs) with moderate to high myopia in two Chinese cohorts and two cohorts of European populations. Two Chinese university student cohorts, including one with 300 unrelated subjects with high myopia and 308 emmetropic controls from Guangzhou and a second with 96 unrelated individuals with moderate to high myopia and 96 emmetropic controls of Chaoshanese origin in Guangzhou, were enrolled in this study. Two SNPs, rs6469937 and rs13382811, were selected for genotyping based on their reported associations with severe myopia. The SNPs were genotyped via DNA sequencing. In addition, association analysis of both SNPs was performed using genotype data from the database of Genotypes and Phenotypes (dbGaP) involving a total of 2,423 samples in two independent cohorts of European-derived populations, as follows: Kooperative Gesundheitsforschung in der Region Augsburg (KORA) and TwinsUK. The allelic and genotypic distribution among cases and controls were analyzed using the Chi-square test. Logistic regression was used to evaluate the SNP-SNP interaction. Fisher's exact test was used for two-SNP comparisons. In the Guangzhou cohort, SNP rs13382811 near ZFHX1B showed significant association with high myopia ( p allelic = 0.0001, p genotypic = 4.07 × 10 -5 ), with the minor T allele showing an increased risk of high myopia (odds ratio [OR] = 1.68, 95% confidence interval [CI] = 1.28-2.20). SNP rs6469937 near SNTB1 showed nominal evidence of association ( p allelic = 0.0085, p genotypic = 0.0166), which did not withstand correction for multiple testing. No significant association was detected in the smaller Chaoshan cohort alone. The association of SNPs rs13382811 and rs6469937 remained significant when both Han Chinese cohorts were combined ( p allelic = 0.0033 and 0.0016, respectively), and it was also significant under the genotypic test ( p genotypic = 0.0036 and 0.0053, respectively). When both SNPs were considered together under a recessive model, their significance increased ( p = 8.37 × 10 -4 ), as did their effect (OR = 4.09, 95%CI = 1.7-9.8). The association between either of these two SNPs alone and myopia did not replicate significantly in the combined cohorts of European descent, providing only suggestive results ( p allelic = 0.0088 for rs13382811 and p allelic = 0.0319 for rs6469937). However, the effects of the combined SNPs showed significant association ( p = 8.2 × 10 -4 ; OR = 1.56, 95%CI = 1.2-2.0). While the risk for myopia increased with risk alleles from both SNPs, the increase was additive rather representing a multiplicative interaction in both populations. Our study confirms that the two susceptibility loci ZFHX1B and SNTB1 are associated with moderate to high myopia in a Han Chinese population, as well as in a European population, when both SNPs are combined. These results confirm previous reports of their associations, extend these observations to a European population, and suggest that additional interactive and possibly population-specific genetic or environmental factors may affect their contribution to myopia.
Da, Yang; Wang, Chunkao; Wang, Shengwen; Hu, Guo
2014-01-01
We established a genomic model of quantitative trait with genomic additive and dominance relationships that parallels the traditional quantitative genetics model, which partitions a genotypic value as breeding value plus dominance deviation and calculates additive and dominance relationships using pedigree information. Based on this genomic model, two sets of computationally complementary but mathematically identical mixed model methods were developed for genomic best linear unbiased prediction (GBLUP) and genomic restricted maximum likelihood estimation (GREML) of additive and dominance effects using SNP markers. These two sets are referred to as the CE and QM sets, where the CE set was designed for large numbers of markers and the QM set was designed for large numbers of individuals. GBLUP and associated accuracy formulations for individuals in training and validation data sets were derived for breeding values, dominance deviations and genotypic values. Simulation study showed that GREML and GBLUP generally were able to capture small additive and dominance effects that each accounted for 0.00005–0.0003 of the phenotypic variance and GREML was able to differentiate true additive and dominance heritability levels. GBLUP of the total genetic value as the summation of additive and dominance effects had higher prediction accuracy than either additive or dominance GBLUP, causal variants had the highest accuracy of GREML and GBLUP, and predicted accuracies were in agreement with observed accuracies. Genomic additive and dominance relationship matrices using SNP markers were consistent with theoretical expectations. The GREML and GBLUP methods can be an effective tool for assessing the type and magnitude of genetic effects affecting a phenotype and for predicting the total genetic value at the whole genome level. PMID:24498162
Da, Yang; Wang, Chunkao; Wang, Shengwen; Hu, Guo
2014-01-01
We established a genomic model of quantitative trait with genomic additive and dominance relationships that parallels the traditional quantitative genetics model, which partitions a genotypic value as breeding value plus dominance deviation and calculates additive and dominance relationships using pedigree information. Based on this genomic model, two sets of computationally complementary but mathematically identical mixed model methods were developed for genomic best linear unbiased prediction (GBLUP) and genomic restricted maximum likelihood estimation (GREML) of additive and dominance effects using SNP markers. These two sets are referred to as the CE and QM sets, where the CE set was designed for large numbers of markers and the QM set was designed for large numbers of individuals. GBLUP and associated accuracy formulations for individuals in training and validation data sets were derived for breeding values, dominance deviations and genotypic values. Simulation study showed that GREML and GBLUP generally were able to capture small additive and dominance effects that each accounted for 0.00005-0.0003 of the phenotypic variance and GREML was able to differentiate true additive and dominance heritability levels. GBLUP of the total genetic value as the summation of additive and dominance effects had higher prediction accuracy than either additive or dominance GBLUP, causal variants had the highest accuracy of GREML and GBLUP, and predicted accuracies were in agreement with observed accuracies. Genomic additive and dominance relationship matrices using SNP markers were consistent with theoretical expectations. The GREML and GBLUP methods can be an effective tool for assessing the type and magnitude of genetic effects affecting a phenotype and for predicting the total genetic value at the whole genome level.
Genomic Variants Revealed by Invariably Missing Genotypes in Nelore Cattle
da Silva, Joaquim Manoel; Giachetto, Poliana Fernanda; da Silva, Luiz Otávio Campos; Cintra, Leandro Carrijo; Paiva, Samuel Rezende; Caetano, Alexandre Rodrigues; Yamagishi, Michel Eduardo Beleza
2015-01-01
High density genotyping panels have been used in a wide range of applications. From population genetics to genome-wide association studies, this technology still offers the lowest cost and the most consistent solution for generating SNP data. However, in spite of the application, part of the generated data is always discarded from final datasets based on quality control criteria used to remove unreliable markers. Some discarded data consists of markers that failed to generate genotypes, labeled as missing genotypes. A subset of missing genotypes that occur in the whole population under study may be caused by technical issues but can also be explained by the presence of genomic variations that are in the vicinity of the assayed SNP and that prevent genotyping probes from annealing. The latter case may contain relevant information because these missing genotypes might be used to identify population-specific genomic variants. In order to assess which case is more prevalent, we used Illumina HD Bovine chip genotypes from 1,709 Nelore (Bos indicus) samples. We found 3,200 missing genotypes among the whole population. NGS re-sequencing data from 8 sires were used to verify the presence of genomic variations within their flanking regions in 81.56% of these missing genotypes. Furthermore, we discovered 3,300 novel SNPs/Indels, 31% of which are located in genes that may affect traits of importance for the genetic improvement of cattle production. PMID:26305794
Qi, Xiaoquan; Bakht, Saleha; Devos, Katrien M.; Gale, Mike D.; Osbourn, Anne
2001-01-01
A flexible, non-gel-based single nucleotide polymorphism (SNP) detection method is described. The method adopts thermostable ligation for allele discrimination and rolling circle amplification (RCA) for signal enhancement. Clear allelic discrimination was achieved after staining of the final reaction mixtures with Cybr-Gold and visualisation by UV illumination. The use of a compatible buffer system for all enzymes allows the reaction to be initiated and detected in the same tube or microplate well, so that the experiment can be scaled up easily for high-throughput detection. Only a small amount of DNA (i.e. 50 ng) is required per assay, and use of carefully designed short padlock probes coupled with generic primers and probes make the SNP detection cost effective. Biallelic assay by hybridisation of the RCA products with fluorescence dye-labelled probes is demonstrated, indicating that ligation-RCA (L-RCA) has potential for multiplexed assays. PMID:11713336
Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology.
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N; Kumar, Dibyendu
2017-01-01
RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. The RNA-seq experiment on bovine liver produced 107,114,4072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indel per young bull. Following the removal of the indel mutations, a total of 195,3804, 152,7120 and 205,3184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASPTM) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASPTM SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments.
Willems, Petra; Claes, Kathleen; Baeyens, Ans; Vandersickel, Veerle; Werbrouck, Joke; De Ruyck, Kim; Poppe, Bruce; Van den Broecke, Rudy; Makar, Amin; Marras, Emanuela; Perletti, Gianpaolo; Thierens, Hubert; Vral, Anne
2008-02-01
As enhanced chromosomal radiosensitivity (CRS) results from non- or misrepaired double strand breaks (DSBs) and is a hallmark for breast cancer and single nucleotide polymorphisms (SNPs) in DSB repair genes, such as non homologous end-joining (NHEJ) genes, could be involved in CRS and genetic predisposition to breast cancer. In this study, we investigated the association of five SNPs in three different NHEJ genes with breast cancer in a population-based case-control setting. The total patient population composed of a selected group of patients with a family history of the disease and an unselected group, consisting mainly of sporadic cases. SNP analysis showed that the c.2099-2408G>A SNP (XRCC5Ku80) [corrected] has a significant, positive odds ratio (OR) of 2.81 (95% confidence interval (CI): 1.30-6.05) for the heterozygous (He) and homozygous variant (HV) genotypes in the selected patient group. For the c.-1310 C>G SNP (XRCC6Ku70)[corrected] a significant OR of 1.85 (95%CI: 1.01-3.41) was found for the He genotype in the unselected patient group. On the contrary, the HV genotype of c.1781G>T (XRCC6Ku70) [corrected] displays a significant, negative OR of 0.43 (95%CI: 0.18-0.99) in the total patient population. The He+HV genotypes of the c.2099-2408G>A SNP (XRCC5Ku80) [corrected] also showed high and significant ORs in the group of "radiosensitive," familial breast cancer patients. In conclusion, our results provide preliminary evidence that the variant allele of c.-1310C>G (XRCC6Ku70) [corrected]and c.2099-2408G>A (XRCC5Ku80) [corrected] are risk alleles for breast cancer as well as CRS. The HV genotype of c.1781G>T (XRCC6Ku70) [corrected] on the contrary, seems to protect against breast cancer and ionizing radiation induced micronuclei. (c) 2007 Wiley-Liss, Inc.
Validation of a Cost-Efficient Multi-Purpose SNP Panel for Disease Based Research
Hou, Liping; Phillips, Christopher; Azaro, Marco; Brzustowicz, Linda M.; Bartlett, Christopher W.
2011-01-01
Background Here we present convergent methodologies using theoretical calculations, empirical assessment on in-house and publicly available datasets as well as in silico simulations, that validate a panel of SNPs for a variety of necessary tasks in human genetics disease research before resources are committed to larger-scale genotyping studies on those samples. While large-scale well-funded human genetic studies routinely have up to a million SNP genotypes, samples in a human genetics laboratory that are not yet part of such studies may be productively utilized in pilot projects or as part of targeted follow-up work though such smaller scale applications require at least some genome-wide genotype data for quality control purposes such as DNA “barcoding” to detect swaps or contamination issues, determining familial relationships between samples and correcting biases due to population effects such as population stratification in pilot studies. Principal Findings Empirical performance in classification of relative types for any two given DNA samples (e.g., full siblings, parental, etc) indicated that for outbred populations the panel performs sufficiently to classify relationship in extended families and therefore also for smaller structures such as trios and for twin zygosity testing. Additionally, familial relationships do not significantly diminish the (mean match) probability of sharing SNP genotypes in pedigrees, further indicating the uniqueness of the “barcode.” Simulation using these SNPs for an African American case-control disease association study demonstrated that population stratification, even in complex admixed samples, can be adequately corrected under a range of disease models using the SNP panel. Conclusion The panel has been validated for use in a variety of human disease genetics research tasks including sample barcoding, relationship verification, population substructure detection and statistical correction. Given the ease of genotyping our specific assay contained herein, this panel represents a useful and economical panel for human geneticists. PMID:21611176
Siqueira, Erika Rabelo Forte de; Pereira, Luciano Beltrao; Stefano, Jose Tadeu; Patente, Thiago; Cavaleiro, Ana Mercedes; Silva Vasconcelos, Luydson Richardson; Carmo, Rodrigo Feliciano; Moreira Beltrao Pereira, Leila Maria; Carrilho, Flair Jose; Corrêa-Giannella, Maria Lucia; Oliveira, Claudia P
2015-03-28
Given the important contribution of the nicotinamide adenine dinucleotide phosphate (NADPH) oxidase system to the generation of reactive oxygen species induced by hepatitis C virus (HCV), we investigated two single nucleotide polymorphisms (SNPs) in the putative regulatory region of the genes encoding NADPH oxidase 4 catalytic subunit (NOX4) and its regulatory subunit p22phox (CYBA) and their relation with metabolic and histological variables in patients with HCV. One hundred seventy eight naïve HCV patients (49.3% male; 65% HCV genotype 1) with positive HCV RNA were genotyped using specific primers and fluorescent-labeled probes for SNPs rs3017887 in NOX4 and -675 T → A in CYBA. No association was found between the genotype frequencies of NOX4 and CYBA SNPs and inflammation scores or fibrosis stages in the overall population. The presence of the CA + AA genotypes of the NOX4 SNP was nominally associated with a lower alanine aminotransferase (ALT) concentration in the male population (CA + AA = 72.23 ± 6.34 U/L versus CC = 100.22 ± 9.85; mean ± SEM; P = 0.05). The TT genotype of the CYBA SNP was also nominally associated with a lower ALT concentration in the male population (TT = 84.01 ± 6.77 U/L versus TA + AA = 109.67 ± 18.37 U/L; mean ± SEM; P = 0.047). The minor A-allele of the NOX4 SNP was inversely associated with the frequency of metabolic syndrome (MS) in the male population (odds ratio (OR): 0.15; 95% confidence interval (CI): 0.03 to 0.79; P = 0.025). The results suggest that the evaluated NOX4 and CYBA SNPs are not direct genetic determinants of fibrosis in HCV patients, but nevertheless NOX4 rs3017887 SNP could indirectly influence fibrosis susceptibility due to its inverse association with MS in male patients.
A genetic map and germplasm diversity estimation of Mangifera indica (mango) with SNPs
USDA-ARS?s Scientific Manuscript database
Mango (Mangifera indica) is often referred to as the “King of Fruits”. As the first steps in developing a mango genomics project, we genotyped 582 individuals comprising six mapping populations with 1054 SNP markers. The resulting consensus map had 20 linkage groups defined by 726 SNP markers with...
USDA-ARS?s Scientific Manuscript database
In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...
USDA-ARS?s Scientific Manuscript database
The rapid advancement in high-throughput SNP genotyping technologies along with next generation sequencing (NGS) platforms has decreased the cost, improved the quality of large-scale genome surveys, and allowed specialty crops with limited genomic resources such as carrot (Daucus carota) to access t...
USDA-ARS?s Scientific Manuscript database
As an initial step to explore the transcriptome genetic diversity and to discover single nucleotide polymorphic (SNP)-biomarkers for marker assisted breeding within Pima (Gossypium barbadense L.) cotton, leaves from 25 day plants of three diverse genotypes were used to develop cDNA libraries. Using ...
USDA-ARS?s Scientific Manuscript database
Microsatellite markers (MS) have traditionally been used for parental verification and are still the international standard in spite of their higher cost, error rate, and turnaround time compared with Single Nucleotide Polymorphisms (SNP) -based assays. Despite domestic and international demands fr...
A graphene-based platform for single nucleotide polymorphism (SNP) genotyping.
Liu, Meng; Zhao, Huimin; Chen, Shuo; Yu, Hongtao; Zhang, Yaobin; Quan, Xie
2011-06-15
A facile, rapid, stable and sensitive approach for fluorescent detection of single nucleotide polymorphism (SNP) is designed based on DNA ligase reaction and π-stacking between the graphene and the nucleotide bases. In the presence of perfectly matched DNA, DNA ligase can catalyze the linkage of fluorescein amidite-labeled single-stranded DNA (ssDNA) and a phosphorylated ssDNA, and thus the formation of a stable duplex in high yield. However, the catalytic reaction cannot effectively carry out with one-base mismatched DNA target. In this case, we add graphene to the system in order to produce different quenching signals due to its different adsorption affinity for ssDNA and double-stranded DNA. Taking advantage of the unique surface property of graphene and the high discriminability of DNA ligase, the proposed protocol exhibits good performance in SNP genotyping. The results indicate that it is possible to accurately determine SNP with frequency as low as 2.6% within 40 min. Furthermore, the presented flexible strategy facilitates the development of other biosensing applications in the future. Copyright © 2011 Elsevier B.V. All rights reserved.
Gerhard, Glenn S; Still, Christopher D; Wood, G Craig; Chu, Xin; Erdman, Robert; Susek, Meghan; Gerst, Heather; Derr, Kim; AlAgha, Mouna; Hartman, Christina; Carey, David; Benotti, Peter
2010-01-01
Background/Aims: Obesity has a strong genetic component. Recent genome-wide association studies have identified single nucleotide polymorphisms (SNPs) in or near over a dozen genes that are related to body mass index (BMI). Despite the association of these SNPs with BMI, the mechanism by which they influence the determination of body weight is not yet known. Recently, the fat- mass and obesity-associated (FTO) obesity SNP was related to energy intake and preference for foods of high caloric density in children. FTO genotype was not associated with resting energy expenditure. We have extended this type of analysis to eating behaviors in the morbidly obese. Methods: DNA was obtained from approximately 900 morbidly obese (BMI>40 kg/m2) patients and used to genotype obesity SNPs in or near the FTO, INSIG2, MC4R, and PCSK1 genes. Binge eating status (normal, episodic overeating, or any binge eating) was determined using the validated Questionnaire on Eating and Weight Patterns (QEWP). Binge eating status was correlated with each individual genotype, the combined obesity allele burden, and the combined homozygous obesity gene burden. Results: Binge eating data was obtained from 640 patients who had completed the QEWP. Of these 640, 116 (18%) were classified as manifesting binge eating behavior. No association was present between heterozygous or homozygous FTO (P=0.59), MC4R (P=0.30), or PSK1 (P=0.77) obesity SNPs. However, 29% of those who were homozygous for the INSIG2 obesity SNP were classified as binge eaters, versus 17% of heterozygous or homozygous normal patients (P=0.006). Association was also found with binge eating status and the presence of 2 or more homozygous obesity genotypes (28% versus 17%, P=0.041), likely due to the INSIG2 gene. Cumulative obesity allele burden (0–8 alleles for the 4 genes) was not associated with binge eating status (P=0.42). Conclusions: The INSIG2 obesity SNP appears to influence binge eating behavior in morbidly obese adults. The FTO obesity SNP appears to influence eating behavior in children suggesting that different genes may influence obesity at different ages. For both genes, excess caloric intake appears to be the major mechanism influencing BMI. How other obesity genes influence body weight regulation has not yet been determined.
Dux, Marta; Muranowicz, Magdalena; Siadkowska, Eulalia; Robakowska-Hyżorek, Dagmara; Flisikowski, Krzysztof; Bagnicka, Emilia; Zwierzchowski, Lech
2018-05-01
The objective of the study reported in this Research Communication was to investigate the association of polymorphisms in the insulin-like growth factor receptor 2 (IGF2R) gene with milk traits in 283 Polish Holstein-Friesian (PHF) cows from the IGAB PAS farm in Jastrzębiec. IGF2R regulates the availability of biologically active IGF2 which is considered as a genetic marker for milk or meat production in farm animals. Two novel genetic polymorphisms were identified in the bovine IGF2R gene: a polymorphic TG-repeat in intron 23 (g.72389 (TG)15-67), and a g.72479 G > A SNP RFLP-StyI in exon 24. The following milk traits were investigated: milk yield, protein and fat yield, SCC and lactose content. To determine the influence of the IGF2R STR and SNP genotypes on the milk traits, we used the AI-REML (average information restricted maximum likelihood) method with repeatability, multi-trait animal model based on test-day information using DMU package. Statistical analysis revealed that the G/A genotype (P ≤ 0·01) was associated with milk and protein yield, lactose content and somatic cell count (SCC) in Polish HF cows. TGn (29/22, 28/29, 28/22, 28/28) genotypes were associated with high values for milk, (28/22, 28/23) with protein and fat yield, (25/20) with lactose content, and (29/33, 28/28) with low SCC. We suggest that the IGF2R gene polymorphisms could be useful genetic markers for dairy production traits in cattle.
Fernández-Ruiz, M; Corrales, I; Arias, M; Campistol, J M; Giménez, E; Crespo, J; López-Oliva, M O; Beneyto, I; Martín-Moreno, P L; Llamas-Fuente, F; Gutiérrez, A; García-Álvarez, T; Guerra-Rodríguez, R; Calvo, N; Fernández-Rodríguez, A; Tabernero-Romo, J M; Navarro, M D; Ramos-Verde, A; Aguado, J M; Navarro, D
2015-05-01
In this study, we assessed the association between single-nucleotide polymorphisms (SNPs) in seven candidate genes involved in orchestrating the immune response against cytomegalovirus (CMV) and the 12-month incidence of CMV infection in 315 CMV-seropositive kidney transplant (KT) recipients. Patients were managed either by antiviral prophylaxis or preemptive therapy. CMV infection occurred in 140 patients (44.4%), including 13 episodes of disease. After adjusting for various clinical covariates, patients harboring T-allele genotypes of interleukin-28B (IL28B) (rs12979860) SNP had lower incidence of CMV infection (adjusted hazard ratio [aHR]: 0.66; 95% confidence interval [CI]: 0.46-0.96; p-value = 0.029). In the analysis restricted to patients not receiving prophylaxis, carriers of the TT genotype of toll-like receptor 9 (TLR9) (rs5743836) SNP had lower incidence of infection (aHR: 0.61; 95% CI: 0.38-0.96; p-value = 0.035), whereas the GG genotype of dendritic cell-specific ICAM 3-grabbing nonintegrin (DC-SIGN) (rs735240) SNP exerted the opposite effect (aHR: 1.86; 95% CI: 1.18-2.94; p-value = 0.008). An independent association was found between the number of unfavorable SNP genotypes carried by the patient and the incidence of CMV infection. In conclusion, specific SNPs in IL28B, TLR9 and DC-SIGN genes may play a role in modulating the susceptibility to CMV infection in CMV-seropositive KT recipients. © Copyright 2015 The American Society of Transplantation and the American Society of Transplant Surgeons.
Ma, G J; Song, Q J; Markell, S G; Qi, L L
2018-07-01
A novel rust resistance gene, R 15 , derived from the cultivated sunflower HA-R8 was assigned to linkage group 8 of the sunflower genome using a genotyping-by-sequencing approach. SNP markers closely linked to R 15 were identified, facilitating marker-assisted selection of resistance genes. The rust virulence gene is co-evolving with the resistance gene in sunflower, leading to the emergence of new physiologic pathotypes. This presents a continuous threat to the sunflower crop necessitating the development of resistant sunflower hybrids providing a more efficient, durable, and environmentally friendly host plant resistance. The inbred line HA-R8 carries a gene conferring resistance to all known races of the rust pathogen in North America and can be used as a broad-spectrum resistance resource. Based on phenotypic assessments of 140 F 2 individuals derived from a cross of HA 89 with HA-R8, rust resistance in the population was found to be conferred by a single dominant gene (R 15 ) originating from HA-R8. Genotypic analysis with the currently available SSR markers failed to find any association between rust resistance and any markers. Therefore, we used genotyping-by-sequencing (GBS) analysis to achieve better genomic coverage. The GBS data showed that R 15 was located at the top end of linkage group (LG) 8. Saturation with 71 previously mapped SNP markers selected within this region further showed that it was located in a resistance gene cluster on LG8, and mapped to a 1.0-cM region between three co-segregating SNP makers SFW01920, SFW00128, and SFW05824 as well as the NSA_008457 SNP marker. These closely linked markers will facilitate marker-assisted selection and breeding in sunflower.
Signatures of selection in five Italian cattle breeds detected by a 54K SNP panel.
Mancini, Giordano; Gargani, Maria; Chillemi, Giovanni; Nicolazzi, Ezequiel Luis; Marsan, Paolo Ajmone; Valentini, Alessio; Pariset, Lorraine
2014-02-01
In this study we used a medium density panel of SNP markers to perform population genetic analysis in five Italian cattle breeds. The BovineSNP50 BeadChip was used to genotype a total of 2,935 bulls of Piedmontese, Marchigiana, Italian Holstein, Italian Brown and Italian Pezzata Rossa breeds. To determine a genome-wide pattern of positive selection we mapped the F st values against genome location. The highest F st peaks were obtained on BTA6 and BTA13 where some candidate genes are located. We identified selection signatures peculiar of each breed which suggest selection for genes involved in milk or meat traits. The genetic structure was investigated by using a multidimensional scaling of the genetic distance matrix and a Bayesian approach implemented in the STRUCTURE software. The genotyping data showed a clear partitioning of the cattle genetic diversity into distinct breeds if a number of clusters equal to the number of populations were given. Assuming a lower number of clusters beef breeds group together. Both methods showed all five breeds separated in well defined clusters and the Bayesian approach assigned individuals to the breed of origin. The work is of interest not only because it enriches the knowledge on the process of evolution but also because the results generated could have implications for selective breeding programs.
Service, Susan; Molina, Julio; Deyoung, Joseph; Jawaheer, Damini; Aldana, Ileana; Vu, Thuy; Araya, Carmen; Araya, Xinia; Bejarano, Julio; Fournier, Eduardo; Ramirez, Magui; Mathews, Carol A; Davanzo, Pablo; Macaya, Gabriel; Sandkuijl, Lodewijk; Sabatti, Chiara; Reus, Victor; Freimer, Nelson
2006-06-05
We have ascertained in the Central Valley of Costa Rica a new kindred (CR201) segregating for severe bipolar disorder (BP-I). The family was identified by tracing genealogical connections among eight persons initially independently ascertained for a genome wide association study of BP-I. For the genome screen in CR201, we trimmed the family down to 168 persons (82 of whom are genotyped), containing 25 individuals with a best-estimate diagnosis of BP-I. A total of 4,690 SNP markers were genotyped. Analysis of the data was hampered by the size and complexity of the pedigree, which prohibited using exact multipoint methods on the entire kindred. Two-point parametric linkage analysis, using a conservative model of transmission, produced a maximum LOD score of 2.78 on chromosome 6, and a total of 39 loci with LOD scores >1.0. Multipoint parametric and non-parametric linkage analysis was performed separately on four sections of CR201, and interesting (nominal P-value from either analysis <0.01), although not statistically significant, regions were highlighted on chromosomes 1, 2, 3, 12, 16, 19, and 22, in at least one section of the pedigree, or when considering all sections together. The difficulties of analyzing genome wide SNP data for complex disorders in large, potentially informative, kindreds are discussed.
Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann
2017-01-01
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
Shalia, Kavita; Saranath, Dhananjaya; Rayar, Jaipreet; Shah, Vinod K.; Mashru, Manoj R.; Soneji, Surendra L.
2017-01-01
Background & objectives: Acute myocardial infarction (AMI) is a major health concern in India. The aim of the study was to identify single nucleotide polymorphisms (SNPs) associated with AMI in patients using dedicated chip and validating the identified SNPs on custom-designed chips using high-throughput microarray analysis. Methods: In pilot phase, 48 AMI patients and 48 healthy controls were screened for SNPs using human CVD55K BeadChip with 48,472 SNP probes on Illumina high-throughput microarray platform. The identified SNPs were validated by genotyping additional 160 patients and 179 controls using custom-made Illumina VeraCode GoldenGate Genotyping Assay. Analysis was carried out using PLINK software. Results: From the pilot phase, 98 SNPs present on 94 genes were identified with increased risk of AMI (odds ratio of 1.84-8.85, P=0.04861-0.003337). Five of these SNPs demonstrated association with AMI in the validation phase (P<0.05). Among these, one SNP rs9978223 on interferon gamma receptor 2 [IFNGR2, interferon (IFN)-gamma transducer 1] gene showed a significant association (P=0.00021) with AMI below Bonferroni corrected P value (P=0.00061). IFNGR2 is the second subunit of the receptor for IFN-gamma, an important cytokine in inflammatory reactions. Interpretation & conclusions: The study identified an SNP rs9978223 on IFNGR2 gene, associated with increased risk in AMI patient from India. PMID:29434065
Joseph, S; Schmidt, L M; Danquah, W B; Timper, P; Mekete, T
2017-02-01
To generate single spore lines of a population of bacterial parasite of root-knot nematode (RKN), Pasteuria penetrans, isolated from Florida and examine genotypic variation and virulence characteristics exist within the population. Six single spore lines (SSP), 16SSP, 17SSP, 18SSP, 25SSP, 26SSP and 30SSP were generated. Genetic variability was evaluated by comparing single-nucleotide polymorphisms (SNPs) in six protein-coding genes and the 16S rRNA gene. An average of one SNP was observed for every 69 bp in the 16S rRNA, whereas no SNPs were observed in the protein-coding sequences. Hierarchical cluster analysis of 16S rRNA sequences placed the clones into three distinct clades. Bio-efficacy analysis revealed significant heterogeneity in the level virulence and host specificity between the individual clones. The SNP markers developed to the 5' hypervariable region of the 16S rRNA gene may be useful in biotype differentiation within a population of P. penetrans. This study demonstrates an efficient method for generating single spore lines of P. penetrans and gives a deep insight into genetic heterogeneity and varying level of virulence exists within a population parasitizing a specific Meloidogyne sp. host. The results also suggest that the application of generalist spore lines in nematode management may achieve broad RKN control. © 2016 The Society for Applied Microbiology.
Genetic polymorphisms predict response to anti-tumor necrosis factor treatment in Crohn's disease.
Netz, Uri; Carter, Jane Victoria; Eichenberger, Maurice Robert; Dryden, Gerald Wayne; Pan, Jianmin; Rai, Shesh Nath; Galandiuk, Susan
2017-07-21
To investigate genetic factors that might help define which Crohn's disease (CD) patients are likely to benefit from anti-tumor necrosis factor (TNF) therapy. This was a prospective cohort study. Patients were recruited from a university digestive disease practice database. We included CD patients who received anti-TNF therapy, had available medical records (with information on treatment duration and efficacy) and who consented to participation. Patients with allergic reactions were excluded. Patients were grouped as ever-responders or non-responders. Genomic DNA was extracted from peripheral blood, and 7 single nucleotide polymorphisms (SNPs) were assessed. The main outcome measure (following exposure to the drug) was response to therapy. The patient genotypes were assessed as the predictors of outcome. Possible confounders and effect modifiers included age, gender, race, and socioeconomic status disease, as well as disease characteristics (such as Montreal criteria). 121 patients were included. Twenty-one were non-responders, and 100 were ever-responders. Fas ligand SNP (rs763110) genotype frequencies, TNF gene -308 SNP (rs1800629) genotype frequencies, and their combination, were significantly different between groups on multivariable analysis controlling for Montreal disease behavior and perianal disease. The odds of a patient with a Fas ligand CC genotype being a non-responder were four-fold higher as compared to a TC or TT genotype ( P = 0.009, OR = 4.30, 95%CI: 1.45-12.80). The presence of the A (minor) TNF gene -308 allele correlated with three-fold higher odds of being a non-responder ( P = 0.049, OR = 2.88, 95%CI: 1.01-8.22). Patients with the combination of the Fas ligand CC genotype and the TNF -308 A allele had nearly five-fold higher odds of being a non-responder ( P = 0.015, OR = 4.76, 95%CI: 1.35-16.77). No difference was seen for the remaining SNPs. The Fas-ligand SNP and TNF gene -308 SNP are associated with anti-TNF treatment response in CD and may help select patients likely to benefit from therapy.
HLA Type Inference via Haplotypes Identical by Descent
NASA Astrophysics Data System (ADS)
Setty, Manu N.; Gusev, Alexander; Pe'Er, Itsik
The Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self antigens from non self ones. HLA genes are hyper variable with nearly every locus harboring over a dozen alleles. This variation plays an important role in susceptibility to multiple autoimmune diseases and needs to be matched on for organ transplantation. Unfortunately, HLA typing by serological methods is time consuming and expensive compared to high throughput Single Nucleotide Polymorphism (SNP) data. We present a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from SNP genotype data. IBD information is modeled as graph where shared haplotypes are explored among clusters of individuals with known and unknown HLA types to identify the latter. We analyze performance of the method in a previously typed subset of the HapMap population, achieving accuracy of 96% in HLA-A, 94% in HLA-B, 95% in HLA-C, 77% in HLA-DR1, 93% in HLA-DQA1 and 90% in HLA-DQB1 genes. We compare our method to a tag SNP based approach and demonstrate higher sensitivity and specificity. Our method demonstrates the power of using shared haplotype segments for large-scale imputation at the HLA locus.
Bouakaze, Caroline; Keyser, Christine; Crubézy, Eric; Montagnon, Daniel; Ludes, Bertrand
2009-07-01
In the present study, a multiplexed genotyping assay for ten single nucleotide polymorphisms (SNPs) located within six pigmentation candidate genes was developed on modern biological samples and applied to DNA retrieved from 25 archeological human remains from southern central Siberia dating from the Bronze and Iron Ages. SNP genotyping was successful for the majority of ancient samples and revealed that most probably had typical European pigment features, i.e., blue or green eye color, light hair color and skin type, and were likely of European individual ancestry. To our knowledge, this study reports for the first time the multiplexed typing of autosomal SNPs on aged and degraded DNA. By providing valuable information on pigment traits of an individual and allowing individual biogeographical ancestry estimation, autosomal SNP typing can improve ancient DNA studies and aid human identification in some forensic casework situations when used to complement conventional molecular markers.
McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S
2014-01-01
The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.
McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.
2014-01-01
The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746
Dissimilarity based Partial Least Squares (DPLS) for genomic prediction from SNPs.
Singh, Priyanka; Engel, Jasper; Jansen, Jeroen; de Haan, Jorn; Buydens, Lutgarde Maria Celina
2016-05-04
Genomic prediction (GP) allows breeders to select plants and animals based on their breeding potential for desirable traits, without lengthy and expensive field trials or progeny testing. We have proposed to use Dissimilarity-based Partial Least Squares (DPLS) for GP. As a case study, we use the DPLS approach to predict Bacterial wilt (BW) in tomatoes using SNPs as predictors. The DPLS approach was compared with the Genomic Best-Linear Unbiased Prediction (GBLUP) and single-SNP regression with SNP as a fixed effect to assess the performance of DPLS. Eight genomic distance measures were used to quantify relationships between the tomato accessions from the SNPs. Subsequently, each of these distance measures was used to predict the BW using the DPLS prediction model. The DPLS model was found to be robust to the choice of distance measures; similar prediction performances were obtained for each distance measure. DPLS greatly outperformed the single-SNP regression approach, showing that BW is a comprehensive trait dependent on several loci. Next, the performance of the DPLS model was compared to that of GBLUP. Although GBLUP and DPLS are conceptually very different, the prediction quality (PQ) measured by DPLS models were similar to the prediction statistics obtained from GBLUP. A considerable advantage of DPLS is that the genotype-phenotype relationship can easily be visualized in a 2-D scatter plot. This so-called score-plot provides breeders an insight to select candidates for their future breeding program. DPLS is a highly appropriate method for GP. The model prediction performance was similar to the GBLUP and far better than the single-SNP approach. The proposed method can be used in combination with a wide range of genomic dissimilarity measures and genotype representations such as allele-count, haplotypes or allele-intensity values. Additionally, the data can be insightfully visualized by the DPLS model, allowing for selection of desirable candidates from the breeding experiments. In this study, we have assessed the DPLS performance on a single trait.
Zain, Maryam; Awan, Fazli Rabbi; Cooper, Jackie A; Li, Ka Wah; Palmen, Jutta; Acharya, Jay; Howard, Philip; Baig, Shahid M; Elkeles, Robert S; Stephens, Jeffrey W; Ireland, Helen; Humphries, Steve E
2014-09-01
To determine the sequence variant of TLL1 gene (rs1503298, T > C) in three British cohorts (PREDICT, UDACS and ED) of patients with type-2 Diabetes mellitus (T2DM) in order to assess its association with coronary heart disease (CHD). Analytical study. UCL, London, UK. Participants were genotyped in 2011-2012 for TLL1 SNP. Samples and related information were previously collected in 2001-2003 for PREDICT, and in 2001-2002 for UDACS and ED groups. Patients included in PREDICT (n=600), UDACS (n=1020) and ED (n=1240) had Diabetes. TLL1 SNP (rs1503298, T > C) was genotyped using TaqMan technology. Allele frequencies were compared using c2 test, and tested for Hardy-Weinberg equilibrium. The risk of disease was assessed from Odds ratios (OR) with 95% Confidence Intervals (95% CI). Moreover, for the PREDICT cohort, the SNP association was tested with Coronary Artery Calcification (CAC) scores. No significant association was found for this SNP with CHD or CAC scores in these cohorts. This SNP could not be confirmed as a risk factor for CHD in T2DM patients. However, the low power of thesmall sample size available is a limitation to the modest effect on risk. Further studies in larger samples would be useful.
Piedra, María; Berja, Ana; García-Unzueta, María Teresa; Ramos, Laura; Valero, Carmen; Amado, José Antonio
2015-01-01
The CLDN14 gene encodes a protein involved in the regulation of paracellular permeability or ion transport at epithelial tight junctions as in the nephron. The C allele of the rs219780 SNP (single nucleotide polymorphism) of CLDN14 has been associated with renal lithiasis, high levels of parathormone (PTH), and with low bone mineral density (BMD) in healthy women. Our aim is to study the relationship between rs219780 SNP of CLDN14 and renal lithiasis, fractures, and BMD in patients with primary hyperparathyroidism (PHPT). We enrolled 298 Caucasian patients with PHPT and 328 healthy volunteers in a cross-sectional study. We analysed anthropometric data, history of fractures or kidney stones, biochemical parameters including markers for bone remodelling, abdominal ultrasound, and BMD and genotyping for the rs219780 SNP of CLDN14. We did not find any difference in the frequency of fractures or renal lithiasis between the genotype groups in PHPT patients. Moreover, we did not find any relationship between the T or C alleles and BMD or biochemical parameters. rs219780 SNP of CLDN14 does not appear to be a risk factor for the development of PHPT nor does it seem to influence the clinical expression of PHPT.
IL13 genetic polymorphisms, smoking, and eczema in women: a case-control study in Japan.
Miyake, Yoshihiro; Tanaka, Keiko; Arakawa, Masashi
2011-10-21
Several genetic association studies have examined the relationships between single nucleotide polymorphisms (SNPs) in the IL13 gene and eczema, and have provided contradictory results. We investigated the relationship between the IL13 SNPs rs1800925 and rs20541 and the risk of eczema in Japanese young adult women. Included were 188 cases who met the criteria of the International Study of Asthma and Allergies in Childhood (ISAAC) for eczema. Control subjects were 1,082 women without eczema according to the ISAAC criteria, who had not been diagnosed with atopic eczema by a doctor and who had no current asthma as defined by the European Community Respiratory Health Survey criteria. Adjustment was made for age, region of residence, number of children, smoking, and education. The minor TT genotype of SNP rs1800925 was significantly associated with an increased risk of eczema in the co-dominant model: the adjusted odds ratio was 2.19 (95% confidence interval: 1.03-4.67). SNP rs20541 was not related to eczema. None of the haplotypes were significantly associated with eczema. Compared with women with the CC or CT genotype of SNP rs1800925 who had never smoked, those with the TT genotype who had ever smoked had a 2.85-fold increased risk of eczema, though the adjusted odds ratio was not statistically significant, and neither multiplicative nor additive interaction was statistically significant. Our findings suggest that the IL13 SNP rs1800925 is significantly associated with eczema in Japanese young adult women. We could not find evidence for an interaction between SNP rs1800925 and smoking with regard to eczema.
Shavrukov, Yuri; Suchecki, Radoslaw; Eliby, Serik; Abugalieva, Aigul; Kenebayev, Serik; Langridge, Peter
2014-09-28
New SNP marker platforms offer the opportunity to investigate the relationships between wheat cultivars from different regions and assess the mechanism and processes that have led to adaptation to particular production environments. Wheat breeding has a long history in Kazakhstan and the aim of this study was to explore the relationship between key varieties from Kazakhstan and germplasm from breeding programs for other regions. The study revealed 5,898 polymorphic markers amongst ten cultivars, of which 2,730 were mapped in the consensus genetic map. Mapped SNP markers were distributed almost equally across the A and B genomes, with between 279 and 484 markers assigned to each chromosome. Marker coverage was approximately 10-fold lower in the D genome. There were 863 SNP markers identified as unique to specific cultivars, and clusters of these markers (regions containing more than three closely mapped unique SNPs) showed specific patterns on the consensus genetic map for each cultivar. Significant intra-varietal genetic polymorphism was identified in three cultivars (Tzelinnaya 3C, Kazakhstanskaya rannespelaya and Kazakhstanskaya 15). Phylogenetic analysis based on inter-varietal polymorphism showed that the very old cultivar Erythrospermum 841 was the most genetically distinct from the other nine cultivars from Kazakhstan, falling in a clade together with the American cultivar Sonora and genotypes from Central and South Asia. The modern cultivar Kazakhstanskaya 19 also fell into a separate clade, together with the American cultivar Thatcher. The remaining eight cultivars shared a single sub-clade but were categorised into four clusters. The accumulated data for SNP marker polymorphisms amongst bread wheat genotypes from Kazakhstan may be used for studying genetic diversity in bread wheat, with potential application for marker-assisted selection and the preparation of a set of genotype-specific markers.
Complex nature of SNP genotype effects on gene expression in primary human leucocytes.
Heap, Graham A; Trynka, Gosia; Jansen, Ritsert C; Bruinenberg, Marcel; Swertz, Morris A; Dinesen, Lotte C; Hunt, Karen A; Wijmenga, Cisca; Vanheel, David A; Franke, Lude
2009-01-07
Genome wide association studies have been hugely successful in identifying disease risk variants, yet most variants do not lead to coding changes and how variants influence biological function is usually unknown. We correlated gene expression and genetic variation in untouched primary leucocytes (n = 110) from individuals with celiac disease - a common condition with multiple risk variants identified. We compared our observations with an EBV-transformed HapMap B cell line dataset (n = 90), and performed a meta-analysis to increase power to detect non-tissue specific effects. In celiac peripheral blood, 2,315 SNP variants influenced gene expression at 765 different transcripts (< 250 kb from SNP, at FDR = 0.05, cis expression quantitative trait loci, eQTLs). 135 of the detected SNP-probe effects (reflecting 51 unique probes) were also detected in a HapMap B cell line published dataset, all with effects in the same allelic direction. Overall gene expression differences within the two datasets predominantly explain the limited overlap in observed cis-eQTLs. Celiac associated risk variants from two regions, containing genes IL18RAP and CCR3, showed significant cis genotype-expression correlations in the peripheral blood but not in the B cell line datasets. We identified 14 genes where a SNP affected the expression of different probes within the same gene, but in opposite allelic directions. By incorporating genetic variation in co-expression analyses, functional relationships between genes can be more significantly detected. In conclusion, the complex nature of genotypic effects in human populations makes the use of a relevant tissue, large datasets, and analysis of different exons essential to enable the identification of the function for many genetic risk variants in common diseases.
Singh, Kanhaiya; Goyal, Prabhjot; Singh, Manju; Deshmukh, Sujit; Upadhyay, Divyesh; Kant, Sri; Agrawal, Neeraj K; Gupta, Sanjeev K; Singh, Kiran
2017-12-01
Retinal angiogenesis is a hallmark of diabetic retinopathy. Matrix Metalloproteinases (MMPs) are involved in degradation of extracellular matrix (ECM). Functional SNP-1562C>T in the promoter of the MMP-9 gene results increase in transcriptional activity. The present work was designed to evaluate the contribution of functional SNP-1562C>T of MMP-9 gene to the risk of proliferative diabetic retinopathy (PDR) in type 2 diabetes mellitus (T2DM) patients in north Indian Population. This Case control study comprised of a total of 645 individuals in which 320 were T2DM patients out of which 73 had PDR, 98 had non- proliferative diabetic retinopathy (NPDR), 149 T2DM cases without any eye related disease (DM) and 325 non diabetic healthy individuals as controls (non DM controls). Genotyping for SNP-1562C>T of MMP-9 was done by polymerase chain reactions followed by restriction analyses with specific endonucleases (PCR-RFLP). DNA sequencing was used to ascertain PCR-RFLP results. T allele frequency in PDR patients was 32.1%, 20.4% in NPDR, 15.4% in DM and 13.7% in controls. Statistically significant difference was observed in both allele and genotype distribution between the PDR versus non-DM control group (p<0.0001 by T allele; p=0.002 by TT and p<0.0001 by CT genotype). The present study suggests that the functional SNP-1562C>T in the promoter of the MMP-9 gene could be regarded as a major risk factor for PDR as increased MMP-9 production from high expressing T allele may promote retinal angiogenesis. Copyright © 2017 Elsevier Inc. All rights reserved.
McCue, Molly E.; Bannasch, Danika L.; Petersen, Jessica L.; Gurr, Jessica; Bailey, Ernie; Binns, Matthew M.; Distl, Ottmar; Guérin, Gérard; Hasegawa, Telhisa; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Penedo, M. Cecilia T.; Røed, Knut H.; Ryder, Oliver A.; Swinburne, June E.; Tozaki, Teruaki; Valberg, Stephanie J.; Vaudin, Mark; Lindblad-Toh, Kerstin
2012-01-01
An equine SNP genotyping array was developed and evaluated on a panel of samples representing 14 domestic horse breeds and 18 evolutionarily related species. More than 54,000 polymorphic SNPs provided an average inter-SNP spacing of ∼43 kb. The mean minor allele frequency across domestic horse breeds was 0.23, and the number of polymorphic SNPs within breeds ranged from 43,287 to 52,085. Genome-wide linkage disequilibrium (LD) in most breeds declined rapidly over the first 50–100 kb and reached background levels within 1–2 Mb. The extent of LD and the level of inbreeding were highest in the Thoroughbred and lowest in the Mongolian and Quarter Horse. Multidimensional scaling (MDS) analyses demonstrated the tight grouping of individuals within most breeds, close proximity of related breeds, and less tight grouping in admixed breeds. The close relationship between the Przewalski's Horse and the domestic horse was demonstrated by pair-wise genetic distance and MDS. Genotyping of other Perissodactyla (zebras, asses, tapirs, and rhinoceros) was variably successful, with call rates and the number of polymorphic loci varying across taxa. Parsimony analysis placed the modern horse as sister taxa to Equus przewalski. The utility of the SNP array in genome-wide association was confirmed by mapping the known recessive chestnut coat color locus (MC1R) and defining a conserved haplotype of ∼750 kb across all breeds. These results demonstrate the high quality of this SNP genotyping resource, its usefulness in diverse genome analyses of the horse, and potential use in related species. PMID:22253606
Tumino, Giorgio; Voorrips, Roeland E; Rizza, Fulvia; Badeck, Franz W; Morcia, Caterina; Ghizzoni, Roberta; Germeier, Christoph U; Paulo, Maria-João; Terzi, Valeria; Smulders, Marinus J M
2016-09-01
Infinium SNP data analysed as continuous intensity ratios enabled associating genotypic and phenotypic data from heterogeneous oat samples, showing that association mapping for frost tolerance is a feasible option. Oat is sensitive to freezing temperatures, which restricts the cultivation of fall-sown or winter oats to regions with milder winters. Fall-sown oats have a longer growth cycle, mature earlier, and have a higher productivity than spring-sown oats, therefore improving frost tolerance is an important goal in oat breeding. Our aim was to test the effectiveness of a Genome-Wide Association Study (GWAS) for mapping QTLs related to frost tolerance, using an approach that tolerates continuously distributed signals from SNPs in bulked samples from heterogeneous accessions. A collection of 138 European oat accessions, including landraces, old and modern varieties from 27 countries was genotyped using the Infinium 6K SNP array. The SNP data were analyzed as continuous intensity ratios, rather than converting them into discrete values by genotype calling. PCA and Ward's clustering of genetic similarities revealed the presence of two main groups of accessions, which roughly corresponded to Continental Europe and Mediterranean/Atlantic Europe, although a total of eight subgroups can be distinguished. The accessions were phenotyped for frost tolerance under controlled conditions by measuring fluorescence quantum yield of photosystem II after a freezing stress. GWAS were performed by a linear mixed model approach, comparing different corrections for population structure. All models detected three robust QTLs, two of which co-mapped with QTLs identified earlier in bi-parental mapping populations. The approach used in the present work shows that SNP array data of heterogeneous hexaploid oat samples can be successfully used to determine genetic similarities and to map associations to quantitative phenotypic traits.
Yang, Jing; Zhou, Haixia; Liang, Binmiao; Xiao, Jun; Su, Zhiguang; Chen, Hong; Ma, Chunlan; Li, Dengxue; Feng, Yulin; Ou, Xuemei
2014-02-01
Recent genome-wide association studies have shown associations between variants at five loci (TNS1, GSTCD, HTR4, AGER and THSD4) and chronic obstructive pulmonary disease (COPD) or lung function. However, their association with COPD has not been proven in Chinese Han population, nor have COPD-related phenotypes been studied. The objective of this study was to look for associations between five single nucleotide polymorphisms (SNP) in these novel candidate genes and COPD susceptibility or lung function in a Chinese Han population. Allele and genotype data on 680 COPD patients and 687 healthy controls for sentinel SNP in these five loci were investigated. Allele frequencies and genotype distributions were compared between cases and controls, and odds ratios were calculated. Potential relationships between these SNP and COPD-related lung function were assessed. No significant associations were found between any of the SNP and COPD in cases and controls. The SNP (rs3995090) in HTR4 was associated with COPD (adjusted P = 0.022) in never-smokers, and the SNP (rs2070600) in AGER was associated with forced expiratory volume in 1 s (FEV1 %) predicted (β = -0.066, adjusted P = 0.016) and FEV1 /forced vital capacity (β = -0.071, adjusted P = 0.009) in all subjects. The variant at HTR4 was associated with COPD in never-smokers, and the SNP in AGER was associated with pulmonary function in a Chinese Han population. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Bennett, G L; Shackelford, S D; Wheeler, T L; King, D A; Casas, E; Smith, T P L
2013-02-01
Genetic markers in casein (CSN1S1) and thyroglobulin (TG) genes have previously been associated with fat distribution in cattle. Determining the nature of these genetic associations (additive, recessive, or dominant) has been difficult, because both markers have small minor allele frequencies in most beef cattle populations. This results in few animals homozygous for the minor alleles. selection to increase the frequencies of the minor alleles for 2 SNP markers in these genes was undertaken in a composite population. The objective was to obtain better estimates of genetic effects associated with these markers and determine if there were epistatic interactions. Selection increased the frequencies of minor alleles for both SNP from <0.30 to 0.45. Bulls (n = 24) heterozygous for both SNP were used in 3 yr to produce 204 steer progeny harvested at an average age of 474 d. The combined effect of the 9 CSN1S1 × TG genotypes was associated with carcass-adjusted fat thickness (P < 0.06) and meat tenderness predicted at the abattoir by visible and near-infrared reflectance spectroscopy (P < 0.04). Genotype did not affect BW from birth through harvest, ribeye area, marbling score, slice shear force, or image-based yield grade (P > 0.10). Additive, dominance, and epistatic SNP association effects were estimated from genotypic effects for adjusted fat thickness and predicted meat tenderness. Adjusted fat thickness showed a dominance association with TG SNP (P < 0.06) and an epistatic additive CSN1S1 × additive TG association (P < 0.03). For predicted meat tenderness, heterozygous TG meat was more tender than meat from either homozygote (P < 0.002). Dominance and epistatic associations can result in different SNP allele substitution effects in populations where SNP have the same linkage disequilibrium with causal mutations but have different frequencies. Although the complex associations estimated in this study would contribute little to within-population selection response, they could be important for marker-assisted management or reciprocal selection schemes.
Zeng, K; Wu, X D; Cai, H D; Gao, Y G; Li, G; Liu, Q C; Gao, F; Chen, J H; Lin, C Z
2014-04-29
The aim of this study was to investigate the correlation between the natriuretic peptide precursor B (NPPB) gene single nucleotide polymorphism (SNP) c.-1298 G/T and pulse pressure (PP) of the Chinese Han population and the association between genotype and clinical indicators of hypertension. Peripheral blood was collected from 180 unrelated patients with hypertension and 540 healthy volunteers (control group), and DNA was extracted to amplify the 5'-flanking region and 2 exons of the NPPB gene by polymerase chain reaction; the fragment was sequenced after purification. The clinical data of all subjects were recorded, the distribution of the NPPB gene c.-1298 G/T polymorphism was determined, and differences in clinical indicators between the two groups were evaluated. The mean arterial pressure PP, and creatinine levels were significantly higher in the hypertension group than in the control group (P<0.05), but no other clinical indicators differed between the groups. There were no significant differences in genotype frequency and distribution of the NPPB gene c.-1298 G/T polymorphism between the hypertension group and the control group (P>0.05); in the control group, the mean PP of individuals with the SNP c.-1298 GG genotype was greater than that of individuals with the GT+TT genotype (P<0.05). In conclusion, there was no significant correlation between the NPPB gene c.-1298 G/T polymorphism and the incidence of essential hypertension in the Han population; however, the PP of the SNP c.-1298 GG genotype was greater than that of the GT+TT genotype in the control group.
Clarke, Shannon M.; Henry, Hannah M.; Dodds, Ken G.; Jowett, Timothy W. D.; Manley, Tim R.; Anderson, Rayna M.; McEwan, John C.
2014-01-01
Accurate pedigree information is critical to animal breeding systems to ensure the highest rate of genetic gain and management of inbreeding. The abundance of available genomic data, together with development of high throughput genotyping platforms, means that single nucleotide polymorphisms (SNPs) are now the DNA marker of choice for genomic selection studies. Furthermore the superior qualities of SNPs compared to microsatellite markers allows for standardization between laboratories; a property that is crucial for developing an international set of markers for traceability studies. The objective of this study was to develop a high throughput SNP assay for use in the New Zealand sheep industry that gives accurate pedigree assignment and will allow a reduction in breeder input over lambing. This required two phases of development- firstly, a method of extracting quality DNA from ear-punch tissue performed in a high throughput cost efficient manner and secondly a SNP assay that has the ability to assign paternity to progeny resulting from mob mating. A likelihood based approach to infer paternity was used where sires with the highest LOD score (log of the ratio of the likelihood given parentage to likelihood given non-parentage) are assigned. An 84 “parentage SNP panel” was developed that assigned, on average, 99% of progeny to a sire in a problem where there were 3,000 progeny from 120 mob mated sires that included numerous half sib sires. In only 6% of those cases was there another sire with at least a 0.02 probability of paternity. Furthermore dam information (either recorded, or by genotyping possible dams) was absent, highlighting the SNP test’s suitability for paternity testing. Utilization of this parentage SNP assay will allow implementation of progeny testing into large commercial farms where the improved accuracy of sire assignment and genetic evaluations will increase genetic gain in the sheep industry. PMID:24740141
Clarke, Shannon M; Henry, Hannah M; Dodds, Ken G; Jowett, Timothy W D; Manley, Tim R; Anderson, Rayna M; McEwan, John C
2014-01-01
Accurate pedigree information is critical to animal breeding systems to ensure the highest rate of genetic gain and management of inbreeding. The abundance of available genomic data, together with development of high throughput genotyping platforms, means that single nucleotide polymorphisms (SNPs) are now the DNA marker of choice for genomic selection studies. Furthermore the superior qualities of SNPs compared to microsatellite markers allows for standardization between laboratories; a property that is crucial for developing an international set of markers for traceability studies. The objective of this study was to develop a high throughput SNP assay for use in the New Zealand sheep industry that gives accurate pedigree assignment and will allow a reduction in breeder input over lambing. This required two phases of development--firstly, a method of extracting quality DNA from ear-punch tissue performed in a high throughput cost efficient manner and secondly a SNP assay that has the ability to assign paternity to progeny resulting from mob mating. A likelihood based approach to infer paternity was used where sires with the highest LOD score (log of the ratio of the likelihood given parentage to likelihood given non-parentage) are assigned. An 84 "parentage SNP panel" was developed that assigned, on average, 99% of progeny to a sire in a problem where there were 3,000 progeny from 120 mob mated sires that included numerous half sib sires. In only 6% of those cases was there another sire with at least a 0.02 probability of paternity. Furthermore dam information (either recorded, or by genotyping possible dams) was absent, highlighting the SNP test's suitability for paternity testing. Utilization of this parentage SNP assay will allow implementation of progeny testing into large commercial farms where the improved accuracy of sire assignment and genetic evaluations will increase genetic gain in the sheep industry.
Savio, Andrea J; Mrkonjic, Miralem; Lemire, Mathieu; Gallinger, Steven; Knight, Julia A; Bapat, Bharat
2017-01-01
Colorectal cancers (CRCs) undergo distinct genetic and epigenetic alterations. Expression of mutL homolog 1 ( MLH1 ), a mismatch repair gene that corrects DNA replication errors, is lost in up to 15% of sporadic tumours due to mutation or, more commonly, due to DNA methylation of its promoter CpG island. A single nucleotide polymorphism (SNP) in the CpG island of MLH1 ( MLH1 -93G>A or rs1800734) is associated with CpG island hypermethylation and decreased MLH1 expression in CRC tumours. Further, in peripheral blood mononuclear cell (PBMC) DNA of both CRC cases and non-cancer controls, the variant allele of rs1800734 is associated with hypomethylation at the MLH1 shore, a region upstream of its CpG island that is less dense in CpG sites . To determine whether this genotype-epigenotype association is present in other tissue types, including colorectal tumours, we assessed DNA methylation in matched normal colorectal tissue, tumour, and PBMC DNA from 349 population-based CRC cases recruited from the Ontario Familial Colorectal Cancer Registry. Using the semi-quantitative real-time PCR-based MethyLight assay, MLH1 shore methylation was significantly higher in tumour tissue than normal colon or PBMCs ( P < 0.01). When shore methylation levels were stratified by SNP genotype, normal colorectal DNA and PBMC DNA were significantly hypomethylated in association with variant SNP genotype ( P < 0.05). However, this association was lost in tumour DNA. Among distinct stages of CRC, metastatic stage IV CRC tumours incurred significant hypomethylation compared to stage I-III cases, irrespective of genotype status. Shore methylation of MLH1 was not associated with MSI status or promoter CpG island hypermethylation, regardless of genotype. To confirm these results, bisulfite sequencing was performed in matched tumour and normal colorectal specimens from six CRC cases, including two cases per genotype (wildtype, heterozygous, and homozygous variant). Bisulfite sequencing results corroborated the methylation patterns found by MethyLight, with significant hypomethylation in normal colorectal tissue of variant SNP allele carriers. These results indicate that the normal tissue types tested (colorectum and PBMC) experience dynamic genotype-associated epigenetic alterations at the MLH1 shore, whereas tumour DNA incurs aberrant hypermethylation compared to normal DNA.
NQO1 gene rs1800566 variant is not associated with risk for multiple sclerosis
2014-01-01
Background A possible role of oxidative stress in the pathogenesis of multiple sclerosis (MS) and in experimental autoimmune encephalomyelitis has been suggested. The detoxification enzyme NAD(P)H dehydrogenase, quinone 1 (NQO1) has been found up-regulated in MS lesions. A previous report described an association between the SNP rs1800566 in the NQO1 gene and the risk for MS in the Greek population. The aim of this study was to replicate a possible influence of the. SNP rs1800566 in the NQO1 gene in the risk for MS in the Spanish Caucasian population. Methods We analyzed allelic and genotypic frequency of NQO1 rs1800566 in 290 patients with MS and 310 healthy controls, using TaqMan Assays. Results NQO1 rs1800566 allelic and genotypic frequencies did not differ significantly between MS patients and controls, and were unrelated with age of onset of MS, gender, and clinical type of MS. Conclusions Our results indicate that NQO1 rs1800566 does not have an effect on MS disease risk. PMID:24755231
Genome-Wide SNP Genotyping to Infer the Effects on Gene Functions in Tomato
Hirakawa, Hideki; Shirasawa, Kenta; Ohyama, Akio; Fukuoka, Hiroyuki; Aoki, Koh; Rothan, Christophe; Sato, Shusei; Isobe, Sachiko; Tabata, Satoshi
2013-01-01
The genotype data of 7054 single nucleotide polymorphism (SNP) loci in 40 tomato lines, including inbred lines, F1 hybrids, and wild relatives, were collected using Illumina's Infinium and GoldenGate assay platforms, the latter of which was utilized in our previous study. The dendrogram based on the genotype data corresponded well to the breeding types of tomato and wild relatives. The SNPs were classified into six categories according to their positions in the genes predicted on the tomato genome sequence. The genes with SNPs were annotated by homology searches against the nucleotide and protein databases, as well as by domain searches, and they were classified into the functional categories defined by the NCBI's eukaryotic orthologous groups (KOG). To infer the SNPs' effects on the gene functions, the three-dimensional structures of the 843 proteins that were encoded by the genes with SNPs causing missense mutations were constructed by homology modelling, and 200 of these proteins were considered to carry non-synonymous amino acid substitutions in the predicted functional sites. The SNP information obtained in this study is available at the Kazusa Tomato Genomics Database (http://plant1.kazusa.or.jp/tomato/). PMID:23482505
Andrews, Kimberly R; Adams, Jennifer R; Cassirer, E Frances; Plowright, Raina K; Gardner, Colby; Dwire, Maggie; Hohenlohe, Paul A; Waits, Lisette P
2018-06-05
The development of high-throughput sequencing technologies is dramatically increasing the use of single nucleotide polymorphisms (SNPs) across the field of genetics, but most parentage studies of wild populations still rely on microsatellites. We developed a bioinformatic pipeline for identifying SNP panels that are informative for parentage analysis from restriction site-associated DNA sequencing (RADseq) data. This pipeline includes options for analysis with or without a reference genome, and provides methods to maximize genotyping accuracy and select sets of unlinked loci that have high statistical power. We test this pipeline on small populations of Mexican gray wolf and bighorn sheep, for which parentage analyses are expected to be challenging due to low genetic diversity and the presence of many closely related individuals. We compare the results of parentage analysis across SNP panels generated with or without the use of a reference genome, and between SNPs and microsatellites. For Mexican gray wolf, we conducted parentage analyses for 30 pups from a single cohort where samples were available from 64% of possible mothers and 53% of possible fathers, and the accuracy of parentage assignments could be estimated because true identities of parents were known a priori based on field data. For bighorn sheep, we conducted maternity analyses for 39 lambs from five cohorts where 77% of possible mothers were sampled, but true identities of parents were unknown. Analyses with and without a reference genome produced SNP panels with >95% parentage assignment accuracy for Mexican gray wolf, outperforming microsatellites at 78% accuracy. Maternity assignments were completely consistent across all SNP panels for the bighorn sheep, and were 74.4% consistent with assignments from microsatellites. Accuracy and consistency of parentage analysis were not reduced when using as few as 284 SNPs for Mexican gray wolf and 142 SNPs for bighorn sheep, indicating our pipeline can be used to develop SNP genotyping assays for parentage analysis with relatively small numbers of loci. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
Miyakura, Yasuyuki; Tahara, Makiko; Lefor, Alan T; Yasuda, Yoshikazu; Sugano, Kokichi
2014-11-24
Methylation of the MLH1 promoter region has been suggested to be a major mechanism of gene inactivation in sporadic microsatellite instability-positive (MSI-H) colorectal cancers (CRCs). Recently, single-nucleotide polymorphism (SNP) in the MLH1 promoter region (MLH1-93G/A; rs1800734) has been proposed to be associated with MLH1 promoter methylation, loss of MLH1 protein expression and MSI-H tumors. We examined the association of MLH1-93G/A and six other SNPs surrounding MLH1-93G/A with the methylation status in 210 consecutive sporadic CRCs in Japanese patients. Methylation of the MLH1 promoter region was evaluated by Na-bisulfite polymerase chain reaction (PCR)/single-strand conformation polymorphism (SSCP) analysis. The genotype frequencies of SNPs located in the 54-kb region surrounding the MLH1-93G/A SNP were examined by SSCP analysis. Methylation of the MLH1 promoter region was observed in 28.6% (60/210) of sporadic CRCs. The proportions of MLH1-93G/A genotypes A/A, A/G and G/G were 26% (n=54), 51% (n=108) and 23% (n=48), respectively, and they were significantly associated with the methylation status (p=0.01). There were no significant associations between genotype frequency of the six other SNPs and methylation status. The A-allele of MLH1-93G/A was more common in cases with methylation than the G-allele (p=0.0094), especially in females (p=0.0067). In logistic regression, the A/A genotype of the MLH1-93G/A SNP was shown to be the most significant risk factor for methylation of the MLH1 promoter region (odds ratio 2.82, p=0.003). Furthermore, a haplotype of the A-allele of rs2276807 located -47 kb upstream from the MLH1-93G/A SNP and the A-allele of MLH1-93G/A SNP was significantly associated with MLH1 promoter methylation. These results indicate that individuals, and particularly females, carrying the A-allele at the MLH1-93G/A SNP, especially in association with the A-allele of rs2276807, may harbor an increased risk of methylation of the MLH1 promoter region.
Perry, Brea L.; Pescosolido, Bernice A.; Bucholz, Kathleen; Edenberg, Howard; Kramer, John; Kuperman, Samuel; Schuckit, Marc Alan; Nurnberger, John I.
2015-01-01
Gender-moderated gene–environment interactions are rarely explored, raising concerns about inaccurate specification of etiological models and inferential errors. The current study examined the influence of gender, negative and positive daily life events, and GABRA2 genotype (SNP rs279871) on alcohol dependence, testing two- and three-way interactions between these variables using multilevel regression models fit to data from 2,281 White participants in the Collaborative Study on the Genetics of Alcoholism. Significant direct effects of variables of interest were identified, as well as gender-specific moderation of genetic risk on this SNP by social experiences. Higher levels of positive life events were protective for men with the high-risk genotype, but not among men with the low-risk genotype or women, regardless of genotype. Our findings support the disinhibition theory of alcohol dependence, suggesting that gender differences in social norms, constraints and opportunities, and behavioral undercontrol may explain men and women’s distinct patterns of association. PMID:23974430
IL-10 -1082 SNP and IL-10 in primary CNS and vitreoretinal lymphomas.
Ramkumar, Hema L; Shen, De Fen; Tuo, Jingsheng; Braziel, Rita M; Coupland, Sarah E; Smith, Justine R; Chan, Chi-Chao
2012-10-01
Most primary central nervous system lymphomas (PCNSLs) and primary vitreoretinal lymphomas (PVRLs) are B-cell lymphomas that produce high levels of interleukin (IL)-10, which is linked to rapid disease progression. The IL-10 (-1082) G → A polymorphism (IL-10 SNP) is associated with improved survival in certain non-CNS lymphoma patients. PDCD4 is a tumor suppressor gene and upstream regulator of IL-10. This study examined the correlation between the IL-10 SNP, PDCD4 mRNA expression, and IL-10 expression (at transcript and protein levels) in these lymphoma cells. Single-nucleotide polymorphism (SNP)-typing at IL-10 (-1082) was performed after microdissecting cytospun PVRL cells from 26 specimens. Vitreal IL-10 and IL-6 levels were measured by ELISA. PCNSL cells from 52 paraffin-embedded sections were microdissected and SNP typed on genomic DNA. RT-PCR was performed to analyze expression of IL-10 and PDCD4 mRNA. IL-10 (-1082) SNP typing was performed on blood samples of 96 healthy controls. We measured IL-10 (-1082) SNP expression in 26 PVRLs and 52 PCNSLs and examined its relationship with IL-10 protein and gene expression, respectively. More PVRL patients expressed one copy of the IL-10 ( -1082 ) G → A SNP with the GA genotype compared to controls. The frequencies of the three genotypes (AA, AG, GG) significantly differed in PVRL versus controls and in PCNSL versus controls. In PVRLs, the vitreal IL-10/IL-6 ratio was higher in IL-10 (-1082) AG and IL-10 (-1082) AA patients, compared to IL-10 (-1082) GG patients. IL-10 mRNA expression was higher in IL-10 (-1082) AG and IL-10 (-1082) AA PCNSLs, compared to IL-10 (-1082) GG PCNSLs. No correlation was found between IL-10 and PDCD4 expression levels in 37 PCNSL samples. PVRL and PCNSL patients had similar IL-10 (-1082) A allele frequencies, but genotype distributions differed from healthy controls. The findings suggest that the IL-10 (-1082) A allele is a risk factor for higher IL-10 levels in PVRLs and PCNSLs. Higher IL-10 levels have been correlated with more aggressive disease in both PVRLs and PCNSLs, making this finding an important and potentially clinically significant observation.
Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv
2018-01-01
Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
Compression and fast retrieval of SNP data.
Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio
2014-11-01
The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Compression and fast retrieval of SNP data
Sambo, Francesco; Di Camillo, Barbara; Toffolo, Gianna; Cobelli, Claudio
2014-01-01
Motivation: The increasing interest in rare genetic variants and epistatic genetic effects on complex phenotypic traits is currently pushing genome-wide association study design towards datasets of increasing size, both in the number of studied subjects and in the number of genotyped single nucleotide polymorphisms (SNPs). This, in turn, is leading to a compelling need for new methods for compression and fast retrieval of SNP data. Results: We present a novel algorithm and file format for compressing and retrieving SNP data, specifically designed for large-scale association studies. Our algorithm is based on two main ideas: (i) compress linkage disequilibrium blocks in terms of differences with a reference SNP and (ii) compress reference SNPs exploiting information on their call rate and minor allele frequency. Tested on two SNP datasets and compared with several state-of-the-art software tools, our compression algorithm is shown to be competitive in terms of compression rate and to outperform all tools in terms of time to load compressed data. Availability and implementation: Our compression and decompression algorithms are implemented in a C++ library, are released under the GNU General Public License and are freely downloadable from http://www.dei.unipd.it/~sambofra/snpack.html. Contact: sambofra@dei.unipd.it or cobelli@dei.unipd.it. PMID:25064564
USDA-ARS?s Scientific Manuscript database
Components of the growth endocrine axis regulate growth and reproduction traits in cattle. A SNP in the promoter of the signal transducer and activator of transcription 2 (STAT2) has been previously reported to be associated with postpartum rebreeding in a diallel beef population composed of 650 hei...
USDA-ARS?s Scientific Manuscript database
Genetic diversity, population structure, and genome-wide marker-trait association analyses were conducted on a special collection of 298 homozygous lettuce (Lactuca sativa L.) lines. Each of these lines was derived from a single plant that had been genotyped with 384 SNP makers using LSGermOPA. They...
Association Analysis of the Ephrin-B2 Gene in African-Americans with End-Stage Renal Disease
Hicks, Pamela J.; Staten, Jennifer L.; Palmer, Nicholette D.; Langefeld, Carl D.; Ziegler, Julie T.; Keene, Keith L.; Sale, Michele M.; Bowden, Donald W.; Freedman, Barry I.
2008-01-01
Background Genome scans in African-Americans with end-stage renal disease (ESRD) identified linkage on chromosome 13q33 in the region containing the ephrin-B2 ligand (EFNB2) genes. Interactions between the ephrin-B2 receptor and ephrin-B2 ligand play essential roles in renal angiogenesis, blood vessel maturation, and kidney disease. Methods The EFNB2 gene was evaluated as a positional candidate for non-diabetic and diabetic ESRD susceptibility in 1,071 unrelated African-American subjects; 316 with non-diabetic etiologies of ESRD, 394 with type 2 diabetes-associated ESRD and 361 healthy controls. Single nucleotide polymorphism (SNP) genotyping was performed on the Sequenom Mass Array System. Statistical analyses were computed using Dandelion version 1.26, Snpaddmix version 1.4 and Haploview version 3.32. Results Twenty-eight HapMap tag SNPs were genotyped spanning the 39 kilobases (kb) of the EFNB2 coding region, with average spacing of 1.43 kb. Analysis of 710 ESRD patient samples and 361 controls provided no evidence of single SNP associations in either diabetic or non-diabetic ESRD; although nominal evidence of association with all-cause ESRD was observed with a two SNP (p = 0.022) and three SNP (p = 0.023) haplotype, both containing SNPs rs7490924 and rs2391335 in intron 1. Conclusions Although an attractive positional candidate gene, polymorphisms in the EFNB2 gene do not appear to contribute in a substantial way to non-diabetic, diabetic or all-cause ESRD susceptibility in African-Americans. Additional genes within the chromosome 13q33 linkage interval are likely contributors to African-American non-diabetic ESRD. PMID:18580054
Szental, Joshua A; Baird, Paul N; Richardson, Andrea J; Islam, F M Amirul; Scholl, Hendrik P N; Charbel Issa, Peter; Holz, Frank G; Gillies, Mark; Guymer, Robyn H
2010-12-01
Recent imaging studies have suggested that macular pigment is decreased centrally in macular telangiectasia type 2 (MT2). The uptake of xanthophyll pigment into the macula is thought to be facilitated by a xanthophyll-binding protein (XBP). The Pi isoform of glutathione S-transferase (GSTP1) represents one such XBP with high binding affinity. This case-control study aimed to determine whether two common single-nucleotide polymorphisms (SNPs) of GSTP1 were associated with MT2. DNA samples from 39 cases and 21 controls were collected. Two polymorphic sites of Ile105Val and Ala114Val in exons 5 and 6 respectively, of the GSTP1 gene were analysed. Comparison of alleles and genotypes between cases and controls indicated that there were no statistically significant differences for either the Ile105Val SNP (P=0.43) or the Ala114Val SNP (P=0.85), or for any combinations; however, the homozygous at-risk genotype (GG) of the Ile105Val SNP was present in 8% of cases but absent in controls. This study found no statistically significant association between two common GSTP1 SNPs and MT2; however, a trend towards a greater frequency of the GG genotype of the Ile105Val SNP in cases is of great interest. The biological plausibility of disturbed macular pigment uptake in MT2 makes GSTP1 an excellent candidate gene. Further investigation is warranted in future studies of MT2.
Key glycolytic branch influences mesocarp oil content in oil palm.
Ruzlan, Nurliyana; Low, Yoke Sum Jaime; Win, Wilonita; Azizah Musa, Noor; Ong, Ai-Ling; Chew, Fook-Tim; Appleton, David; Mohd Yusof, Hirzun; Kulaveerasingam, Harikrishna
2017-08-29
The fructose-1,6-bisphosphate aldolase catalyzed glycolysis branch that forms dihydroxyacetone phosphate and glyceraldehyde-3-phosphate was identified as a key driver of increased oil synthesis in oil palm and was validated in Saccharomyces cerevisiae. Reduction in triose phosphate isomerase (TPI) activity in a yeast knockdown mutant resulted in 19% increase in lipid content, while yeast strains overexpressing oil palm fructose-1,6-bisphosphate aldolase (EgFBA) and glycerol-3-phosphate dehydrogenase (EgG3PDH) showed increased lipid content by 16% and 21%, respectively. Genetic association analysis on oil palm SNPs of EgTPI SD_SNP_000035801 and EgGAPDH SD_SNP_000041011 showed that palms harboring homozygous GG in EgTPI and heterozygous AG in EgGAPDH exhibited higher mesocarp oil content based on dry weight. In addition, AG genotype of the SNP of EgG3PDH SD_SNP_000008411 was associated with higher mean mesocarp oil content, whereas GG genotype of the EgFBA SNP SD_SNP_000007765 was favourable. Additive effects were observed with a combination of favourable alleles in TPI and FBA in Nigerian x AVROS population (family F7) with highest allele frequency GG.GG being associated with a mean increase of 3.77% (p value = 2.3E -16 ) oil content over the Family 1. An analogous effect was observed in yeast, where overexpressed EgFBA in TPI - resulted in a 30% oil increment. These results provide insights into flux balances in glycolysis leading to higher yield in mesocarp oil-producing fruit.
Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A
2015-01-01
Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.
Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.
2015-01-01
Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633
Chang, M-T; Cheng, Y-S; Huang, M-C
2013-02-01
In our previous cDNA microarray study, we found that the carbonic anhydrase II (CA2) gene is one of the differentially expressed transcripts in the duck isthmus epithelium during egg formation period. The aim of this study was to identify the single-nucleotide polymorphisms (SNPs) in the CA2 gene of Tsaiya ducks. The relationship of SNP genotype with egg production and reproduction traits was also investigated. A total of 317 ducks from two lines, a control line with no selection and a selected line, were employed for testing. Three SNPs (C37T, A62G and A65G) in the 3'-untranslated region of the CA2 gene were found. SNP-trait association analysis showed that SNP C37T and A62G were associated with duck egg weight besides fertility. The ducks with the CT and AG genotypes had a 1.46 and 1.62 g/egg lower egg weight as compared with ducks with the CC and AA genotypes, respectively (p < 0.05). But the ducks with CT and AG genotypes had 5.20% and 4.22% higher fertility than those with CC and AA genotypes, respectively (p < 0.05). Diplotype constructed on these three SNPs was associated with duck fertility, and the diplotype H1H4 was dominant for duck fertility. These findings might provide the basis for balanced selection and may be used in marker-assisted selection to improve egg weight and fertility simultaneously in the Tsaiya ducks. © 2012 Blackwell Verlag GmbH.
Hidden Markov Model-Based CNV Detection Algorithms for Illumina Genotyping Microarrays.
Seiser, Eric L; Innocenti, Federico
2014-01-01
Somatic alterations in DNA copy number have been well studied in numerous malignancies, yet the role of germline DNA copy number variation in cancer is still emerging. Genotyping microarrays generate allele-specific signal intensities to determine genotype, but may also be used to infer DNA copy number using additional computational approaches. Numerous tools have been developed to analyze Illumina genotype microarray data for copy number variant (CNV) discovery, although commonly utilized algorithms freely available to the public employ approaches based upon the use of hidden Markov models (HMMs). QuantiSNP, PennCNV, and GenoCN utilize HMMs with six copy number states but vary in how transition and emission probabilities are calculated. Performance of these CNV detection algorithms has been shown to be variable between both genotyping platforms and data sets, although HMM approaches generally outperform other current methods. Low sensitivity is prevalent with HMM-based algorithms, suggesting the need for continued improvement in CNV detection methodologies.
The Minnesota Center for Twin and Family Research Genome-Wide Association Study
Miller, Michael B.; Basu, Saonli; Cunningham, Julie; Eskin, Eleazar; Malone, Steven M.; Oetting, William S.; Schork, Nicholas; Sul, Jae Hoon; Iacono, William G.; Mcgue, Matt
2012-01-01
As part of the Genes, Environment and Development Initiative (GEDI), the Minnesota Center for Twin and Family Research (MCTFR) undertook a genome-wide association study (GWAS), which we describe here. A total of 8405 research participants, clustered in 4-member families, have been successfully genotyped on 527,829 single nucleotide polymorphism (SNP) markers using Illumina’s Human660W-Quad array. Quality control screening of samples and markers as well as SNP imputation procedures are described. We also describe methods for ancestry control and how the familial clustering of the MCTFR sample can be accounted for in the analysis using a Rapid Feasible Generalized Least Squares algorithm. The rich longitudinal MCTFR assessments provide numerous opportunities for collaboration. PMID:23363460
Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R. K.; Singh, N. K.; Singh, Rakesh
2013-01-01
Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD) derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis. PMID:24367635
Translational genomics for analysis of complex traits in peanut and sorghum
USDA-ARS?s Scientific Manuscript database
The integration of sequencing and genotype data from natural variation studies (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) facilitated the development of DNA markers in the form of single nucleotide polymorphic (SNP)...
USDA-ARS?s Scientific Manuscript database
High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...
Clinical Utility of Five Genetic Variants for Predicting Prostate Cancer Risk and Mortality
Salinas, Claudia A.; Koopmeiners, Joseph S.; Kwon, Erika M.; FitzGerald, Liesel; Lin, Daniel W.; Ostrander, Elaine A.; Feng, Ziding; Stanford, Janet L.
2009-01-01
Background A recent report suggests that the combination of five single-nucleotide polymorphisms (SNPs) at 8q24, 17q12, 17q24.3 and a family history of the disease may predict risk of prostate cancer. The present study tests the performance of these factors in prediction models for prostate cancer risk and prostate cancer-specific mortality. Methods SNPs were genotyped in population-based samples from Caucasians in King County, Washington. Incident cases (n=1308), aged 35–74, were compared to age-matched controls (n=1266) using logistic regression to estimate odds ratios (OR) associated with genotypes and family history. Cox proportional hazards models estimated hazard ratios for prostate cancer-specific mortality according to genotypes. Results The combination of SNP genotypes and family history was significantly associated with prostate cancer risk (ptrend=1.5 × 10−20). Men with ≥ five risk factors had an OR of 4.9 (95% CI 1.6 to 18.5) compared to men with none. However, this combination of factors did not improve the ROC curve after accounting for known risk predictors (i.e., age, serum PSA, family history). Neither the individual nor combined risk factors was associated with prostate cancer-specific mortality. Conclusion Genotypes for five SNPs plus family history are associated with a significant elevation in risk for prostate cancer and may explain up to 45% of prostate cancer in our population. However, they do not improve prediction models for assessing who is at risk of getting or dying from the disease, once known risk or prognostic factors are taken into account. Thus, this SNP panel may have limited clinical utility. PMID:19058137
Mohammadzadeh, Ghorban; Ghaffari, Mohammad-Ali; Heibar, Habib; Bazyar, Mohammad
2016-01-01
Background: Adiponectin, an adipocyte-secreted hormone, is known to have anti-atherogenic, anti-inflammatory, and anti-diabetic properties. In the present study, the association between two common single nucleotide polymorphisms (SNPs) (+45T/G and +276G/T) of ADIOPQ gene and coronary artery disease (CAD) was assessed in the subjects with type 2 diabetes (T2DM). Methods: Genotypes of two SNPs were determined by polymerase chain reaction-restriction fragment length polymorphism in 200 subjects with T2DM (100 subjects with CAD and 100 without CAD). Results: The frequency of TT genotype of +276G/T was significantly elevated in CAD compared to controls (χ2=7.967, P=0.019). A similar difference was found in the allele frequency of +276G/T between two groups (χ2=3.895, P=0.048). The increased risk of CAD was associated with +276 TT genotype when compared to reference GG genotype (OR=5.158; 95% CI=1.016-26.182, P=0.048). However, no similar difference was found in genotype and allele frequencies of SNP +45T/G between two groups. There was a CAD protective haplotype combination of +276 wild-type and +45 mutant-type allele (276G-45G) (OR=0.37, 95% CI=0.16-0.86, P=0.022) in the subject population. Conclusion: Our findings indicated that T allele of SNP +276G/T is more associated with the increased risk of CAD in subjects with T2DM. Also, a haplotype combination of +45G/+276G of these two SNPs has a protective effect on the risk of CAD. PMID:26781170
Gao, Ling; Li, Xiao-hong; Zhao, Jian-qing; Lu, Ji-hong; Zhao, Jia-gang; Zhu, Jia-shi
2012-06-18
To examine maturational changes in expressions of Ophiocordyceps sinensis (O.sinensis) transition and transversion mutation genotypes in Cordyceps sinensis (C.sinensis) stroma. MassARRAY single nucleotide polymorphism (SNP) matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrum genotyping was used, and 8 SNP extension primers were designed based on the scattered, multiple point mutations of known sequences for the O.sinensis mutants within their internal transcribed spacer (ITS) segments. Of the extension primers, 5 (not capable of distinguishing between the 2 AT-biased genotypes) located in rDNA ITS1 and ITS2 regions: 067721-211, 067721-240, 067721-477, 067721-531 and 067721-581. The other 3 extension primers located in 5.8S rDNA region: 067740-324, 067740-328 and 067740-360, to distinguish between the 2 AT-biased genotypes. MS chromatograms at the 8 SNP sites showed dynamic alterations of mutant alleles in C.sinensis stroma. The allele for the AT-biased genotypes at 067721-211 site showed higher peak height than its GC-biased counterpart in the premature C.sinensis stroma, but disappeared with C.sinensis maturation. Chromatograms displayed not only the transition mutation alleles, but also transversion mutants. Some of the transversion mutation alleles displayed higher peak heights than those for GC- and AT-biased alleles, but their peak heights and detection rates tended to be decreased with C.sinensis maturation. When distinguishing between the 2 AT-biases, AB067744 and AB067740 genotype alleles co-existed in the premature C.sinensis stroma. The allele peak height for AB067744 genotype was greatly decreased with C.sinensis maturation, while that for AB067740 genotype increased. Co-existence of at least 5 transition and transversion mutant genotypes of O.sinensis and the dynamic changes in their expressions in C.sinensis stroma along with C.sinensis maturation may be of extreme importance in C.sinensis stroma germination and maturation, enabling C.sinensis to complete its life cycle.
SNPConvert: SNP Array Standardization and Integration in Livestock Species.
Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra
2016-06-09
One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, conversely to that observed in whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github. com/nicolazzie/SNPConvert.git.
Kimbacher, Christine; Paar, Christian; Freystetter, Andrea; Berg, Joerg
2018-05-01
Genotyping for clinically important single nucleotide polymorphisms (SNPs) is performed by many clinical routine laboratories. To support testing, quality controls and reference materials are needed. Those may be derived from residual patient samples, left over samples of external quality assurance schemes, plasmid DNA or DNA from cell lines. DNAs from cell lines are commutable and available in large amounts. DNA from 38 cell lines were examined for suitability as controls in 11 SNP assays that are frequently used in a clinical routine laboratory: FV (1691G>A), FII (20210G>A), PAI-1 4G/5G polymorphism, MTHFR (677C>T, 1298A>C), HFE (H63D, S65C, C282Y), APOE (E2, E3, E4), LPH (-13910C>T), UGT1A1 (*28, *36, *37), TPMT (*2, *3A, *3B, *3C), VKORC1 (-1639G>A, 1173C>T), CYP2C9 (*2, *3, *5). Genotyping was performed by real-time PCR with melting curve analysis and confirmed by bi-directional sequencing. We find an almost complete spectrum of genotypic constellations within these 38 cell lines. About 12 cell lines appear sufficient as genotypic controls for the 11 SNP assays by covering almost all of the genotypes. However, hetero- and homozygous genotypes for FII and the alleles TPMT*2, UGT1A1*37 and CYP2C9*5 were not detected in any of the cell lines. DNA from most of the examined cell lines appear suitable as quality controls for these SNP assays in the laboratory routine, as to the implementation of those assays or to prepare samples for quality assurance schemes. Our study may serve as a pilot to further characterize these cell lines to arrive at the status of reference materials.
Dambrauskienė, R; Gerbutavičius, R; Ugenskienė, R; Jankauskaitė, R; Savukaitytė, A; Šimoliūnienė, R; Rudžianskienė, M; Gerbutavičienė, R; Juozaitytė, E
2017-01-01
Abstract The most important complications of Philadelphianegagive (non BCR-ABL) myeloproliferative neoplasms (MPNs) are vascular events. Our aim was to evaluate the effects of single nucleotide polymorphisms (SNPs), platelet glycoproteins (GPs) (Ia/IIa, Ibα, IIb/IIIa and VI), von Willebrand factor (vWF), coagulation factor VII (FVII), β-fibrinogen, and the risk of thrombosis in patients with non BCR-ABL MPNs at the Lithuanian University of Health Sciences. Kaunas, Lithuania. Genotyping was done for 108 patients. The TT genotype of the GP Ia/IIa c.807C>T polymorphism was more frequently found in the group of MPN patients with arterial thrombosis compared to MPN patients who were thrombosis-free [26.5 vs. 11.5%, p = 0.049; odds ratio (OR) 2.68; 95% confidence interval (95% CI) 1.01-7.38]. The CT genotype of the β-fibrinogen c.-148C>T polymorphism occurred more frequently in MPN patients with arterial, and total thrombosis compared to the wild or homozygous genotype (57.7 vs. 40.0 vs. 12.5%; p = 0.027), (64.7 vs. 44.4 vs. 25%; p = 0.032), respectively. The carrier state for the c.-323P10 variant of FVII SNP (summation of P10/10 and P0/10) was more frequent in MPN patients with thrombosis compared to the wild-type genotype carriers (71.4 vs. 43.4%; p = 0.049; OR 3.26; 95% CI 1.01-11.31). The coexistence of heterozygous β-fibrinogen c.-148C>T and FVII c.-323P0/10 SNP, increased the risk of arterial thrombosis (21.1 vs. 3.7%, p = 0.008; OR 6.93; 95% CI 1.38-34.80). The TT genotype of GP Ia/IIa c.807C>T, the CT genotype of β-fibrinogen c.-148C>T and FVII c.-323P0/10 SNP could be associated with risk of thrombosis in MPN patients. PMID:28924539
Efficient Moment-Based Inference of Admixture Parameters and Sources of Gene Flow
Levin, Alex; Reich, David; Patterson, Nick; Berger, Bonnie
2013-01-01
The recent explosion in available genetic data has led to significant advances in understanding the demographic histories of and relationships among human populations. It is still a challenge, however, to infer reliable parameter values for complicated models involving many populations. Here, we present MixMapper, an efficient, interactive method for constructing phylogenetic trees including admixture events using single nucleotide polymorphism (SNP) genotype data. MixMapper implements a novel two-phase approach to admixture inference using moment statistics, first building an unadmixed scaffold tree and then adding admixed populations by solving systems of equations that express allele frequency divergences in terms of mixture parameters. Importantly, all features of the model, including topology, sources of gene flow, branch lengths, and mixture proportions, are optimized automatically from the data and include estimates of statistical uncertainty. MixMapper also uses a new method to express branch lengths in easily interpretable drift units. We apply MixMapper to recently published data for Human Genome Diversity Cell Line Panel individuals genotyped on a SNP array designed especially for use in population genetics studies, obtaining confident results for 30 populations, 20 of them admixed. Notably, we confirm a signal of ancient admixture in European populations—including previously undetected admixture in Sardinians and Basques—involving a proportion of 20–40% ancient northern Eurasian ancestry. PMID:23709261
Muleta, Kebede T; Bulli, Peter; Zhang, Zhiwu; Chen, Xianming; Pumphrey, Michael
2017-11-01
Harnessing diversity from germplasm collections is more feasible today because of the development of lower-cost and higher-throughput genotyping methods. However, the cost of phenotyping is still generally high, so efficient methods of sampling and exploiting useful diversity are needed. Genomic selection (GS) has the potential to enhance the use of desirable genetic variation in germplasm collections through predicting the genomic estimated breeding values (GEBVs) for all traits that have been measured. Here, we evaluated the effects of various scenarios of population genetic properties and marker density on the accuracy of GEBVs in the context of applying GS for wheat ( L.) germplasm use. Empirical data for adult plant resistance to stripe rust ( f. sp. ) collected on 1163 spring wheat accessions and genotypic data based on the wheat 9K single nucleotide polymorphism (SNP) iSelect assay were used for various genomic prediction tests. Unsurprisingly, the results of the cross-validation tests demonstrated that prediction accuracy increased with an increase in training population size and marker density. It was evident that using all the available markers (5619) was unnecessary for capturing the trait variation in the germplasm collection, with no further gain in prediction accuracy beyond 1 SNP per 3.2 cM (∼1850 markers), which is close to the linkage disequilibrium decay rate in this population. Collectively, our results suggest that larger germplasm collections may be efficiently sampled via lower-density genotyping methods, whereas genetic relationships between the training and validation populations remain critical when exploiting GS to select from germplasm collections. Copyright © 2017 Crop Science Society of America.
Single nucleotide polymorphism discovery in bovine liver using RNA-seq technology
Pareek, Chandra Shekhar; Błaszczyk, Paweł; Dziuba, Piotr; Czarnik, Urszula; Fraser, Leyland; Sobiech, Przemysław; Pierzchała, Mariusz; Feng, Yaping; Kadarmideen, Haja N.; Kumar, Dibyendu
2017-01-01
Background RNA-seq is a useful next-generation sequencing (NGS) technology that has been widely used to understand mammalian transcriptome architecture and function. In this study, a breed-specific RNA-seq experiment was utilized to detect putative single nucleotide polymorphisms (SNPs) in liver tissue of young bulls of the Polish Red, Polish Holstein-Friesian (HF) and Hereford breeds, and to understand the genomic variation in the three cattle breeds that may reflect differences in production traits. Results The RNA-seq experiment on bovine liver produced 107,114,4072 raw paired-end reads, with an average of approximately 60 million paired-end reads per library. Breed-wise, a total of 345.06, 290.04 and 436.03 million paired-end reads were obtained from the Polish Red, Polish HF, and Hereford breeds, respectively. Burrows-Wheeler Aligner (BWA) read alignments showed that 81.35%, 82.81% and 84.21% of the mapped sequencing reads were properly paired to the Polish Red, Polish HF, and Hereford breeds, respectively. This study identified 5,641,401 SNPs and insertion and deletion (indel) positions expressed in the bovine liver with an average of 313,411 SNPs and indel per young bull. Following the removal of the indel mutations, a total of 195,3804, 152,7120 and 205,3184 raw SNPs expressed in bovine liver were identified for the Polish Red, Polish HF, and Hereford breeds, respectively. Breed-wise, three highly reliable breed-specific SNP-databases (SNP-dbs) with 31,562, 24,945 and 28,194 SNP records were constructed for the Polish Red, Polish HF, and Hereford breeds, respectively. Using a combination of stringent parameters of a minimum depth of ≥10 mapping reads that support the polymorphic nucleotide base and 100% SNP ratio, 4,368, 3,780 and 3,800 SNP records were detected in the Polish Red, Polish HF, and Hereford breeds, respectively. The SNP detections using RNA-seq data were successfully validated by kompetitive allele-specific PCR (KASPTM) SNP genotyping assay. The comprehensive QTL/CG analysis of 110 QTL/CG with RNA-seq data identified 20 monomorphic SNP hit loci (CARTPT, GAD1, GDF5, GHRH, GHRL, GRB10, IGFBPL1, IGFL1, LEP, LHX4, MC4R, MSTN, NKAIN1, PLAG1, POU1F1, SDR16C5, SH2B2, TOX, UCP3 and WNT10B) in all three cattle breeds. However, six SNP loci (CCSER1, GHR, KCNIP4, MTSS1, EGFR and NSMCE2) were identified as highly polymorphic among the cattle breeds. Conclusions This study identified breed-specific SNPs with greater SNP ratio and excellent mapping coverage, as well as monomorphic and highly polymorphic putative SNP loci within QTL/CGs of bovine liver tissue. A breed-specific SNP-db constructed for bovine liver yielded nearly six million SNPs. In addition, a KASPTM SNP genotyping assay, as a reliable cost-effective method, successfully validated the breed-specific putative SNPs originating from the RNA-seq experiments. PMID:28234981
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
2012-03-27
The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interimmore » report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.« less
USDA-ARS?s Scientific Manuscript database
Background: Our goal is to produce a high-throughput SNP genotyping platform for genomic analyses in rainbow trout that will enable fine mapping of QTL, whole genome association studies, genomic selection for improved aquaculture production traits, and genetic analyses of wild populations that aid ...
High-density SNP Scan of Production and Product Quality Traits in Beef Cattle
USDA-ARS?s Scientific Manuscript database
Genotypes from the BovineSNP50 BeadChip (50K) were obtained on animals derived from 150 AI sires from seven breeds (22 sires per breed; Angus, Charolais, Gelbvieh, Hereford, Limousin, Red Angus, and Simmental) as either progeny (F1; 590 steers) or grandprogeny (F1 x F1 = F1**2; 1,306 steers and 707 ...
Jia, Xiang-Jie; Wang, Chang-Fa; Yang, Gui-Wen; Huang, Jin-Ming; Li, Qiu-Ling; Zhong, Ji-Feng
2011-12-01
Three novel SNPs were found by DNA sequencing, PCR-RFLP and CRS-PCR methods were used for genotyping in 979 Chinese Holstein cattle. One SNP, G1178C, was identified in exon 2 of POU1F1 gene. Two novel SNPs, A906G and A1134G, were identified in 5'-flanking regulatory region (5'-UTR) of PRL gene. The association between polymorphisms of the two genes and milk performance traits were analyzed with PROC GLM of SAS. The results showed that GC genotype at 1178 locus of POU1F1 gene was advantageous for milk yield, milk protein yield, and milk fat yield. AG genotype at 906 locus was advantageous for milk yield. There was no significant difference between 1134 locus and milk performance traits of 5'-UTR of PRL gene. Analysis of genotype combination effect on milk production traits showed that the effect of combined genotype was not simple sum of single genotypes and the effects of gene pyramiding seemed to be more important in molecular breeding.
Korinsak, Siripar; Tangphatsornruang, Sithichoke; Pootakham, Wirulda; Wanchana, Samart; Plabpla, Anucha; Jantasuriyarat, Chatchawan; Patarapuwadol, Sujin; Vanavichit, Apichart; Toojinda, Theerayut
2018-05-15
Magnaporthe oryzae is a fungal pathogen causing blast disease in many plant species. In this study, seventy three isolates of M. oryzae collected from rice (Oryza sativa) in 1996-2014 were genotyped using a genotyping-by-sequencing approach to detect genetic variation. An association study was performed to identify single nucleotide polymorphisms (SNPs) associated with virulence genes using 831 selected SNP and infection phenotypes on local and improved rice varieties. Population structure analysis revealed eight subpopulations. The division into eight groups was not related to the degree of virulence. Association mapping showed five SNPs associated with fungal virulence on chromosome 1, 2, 3, 4 and 7. The SNP on chromosome 1 was associated with virulence against RD6-Pi7 and IRBL7-M which might be linked to the previously reported AvrPi7. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Meyer, Benedicte; Nguyen, Chinh Bkrong Thuy; Moen, Aurora; Fagermoen, Even; Sulheim, Dag; Nilsen, Hilde; Wyller, Vegard Bruun; Gjerstad, Johannes
2015-01-01
Earlier studies have shown that genetic variability in the SLC6A4 gene encoding the serotonin transporter (5-HTT) may be important for the re-uptake of serotonin (5-HT) in the central nervous system. In the present study we investigated how the 5-HTT genotype i.e. the short (S) versus long (L) 5-HTTLPR allele and the SNP rs25531 A > G affect the physical and psychosocial functioning in patients with chronic fatigue syndrome (CFS). All 120 patients were recruited from The Department of Paediatrics at Oslo University Hospital, Norway, a national referral center for young CFS patients (12-18 years). Main outcomes were number of steps per day obtained by an accelerometer and disability scored by the Functional Disability Inventory (FDI). Patients with the 5-HTT SS or SLG genotype had a significantly lower number of steps per day than patients with the 5-HTT LALG, SLA or LALA genotype. Patients with the 5-HTT SS or SLG genotype also had a significantly higher FDI score than patients with the 5-HTT LALG, SLA or LALA genotype. Thus, CFS patients with the 5-HTT SS or SLG genotype had worse 30 weeks outcome than CFS patients with the 5-HTT LALG, SLA or LALA genotype. The present study suggests that the 5-HTT genotype may be a factor that contributes to maintenance of CFS.
Meyer, Benedicte; Nguyen, Chinh Bkrong Thuy; Moen, Aurora; Fagermoen, Even; Sulheim, Dag; Nilsen, Hilde; Wyller, Vegard Bruun; Gjerstad, Johannes
2015-01-01
Earlier studies have shown that genetic variability in the SLC6A4 gene encoding the serotonin transporter (5-HTT) may be important for the re-uptake of serotonin (5-HT) in the central nervous system. In the present study we investigated how the 5-HTT genotype i.e. the short (S) versus long (L) 5-HTTLPR allele and the SNP rs25531 A > G affect the physical and psychosocial functioning in patients with chronic fatigue syndrome (CFS). All 120 patients were recruited from The Department of Paediatrics at Oslo University Hospital, Norway, a national referral center for young CFS patients (12–18 years). Main outcomes were number of steps per day obtained by an accelerometer and disability scored by the Functional Disability Inventory (FDI). Patients with the 5-HTT SS or SLG genotype had a significantly lower number of steps per day than patients with the 5-HTT LALG, SLA or LALA genotype. Patients with the 5-HTT SS or SLG genotype also had a significantly higher FDI score than patients with the 5-HTT LALG, SLA or LALA genotype. Thus, CFS patients with the 5-HTT SS or SLG genotype had worse 30 weeks outcome than CFS patients with the 5-HTT LALG, SLA or LALA genotype. The present study suggests that the 5-HTT genotype may be a factor that contributes to maintenance of CFS. PMID:26473596
Gan, W; Song, Q; Zhang, N N; Xiong, X P; Wang, D M C; Li, L
2015-06-18
The fat mass and obesity-associated gene (FTO) is an excellent candidate gene that affects energy metabolism. Single nucleotide polymorphisms (SNPs) in FTO are associated with carcass and meat quality traits in pigs, cattle, and rabbits. The aim of this study was to investigate the association between novel SNPs in the FTO coding region and carcass and meat quality traits in 95 crossbred ducks, using DNA sequencing. We found two transitions G/A (SNP 387 and 473) within exon 3. SNP 387 was a synonymous mutation, whereas SNP 473 was a missense mutation. Association analysis suggested that SNP g.387G>A was significantly associated with all of the carcass traits measured, the intramuscular fat content (IMF), cooking yield (CY), pH values 45 min after slaughter (pH45m), drip losses from the breast muscle, and the leg muscle (P < 0.05). For SNP g.473G>A, the genotype AA exhibited greater leg muscle weight than the genotypes GG or AG (P < 0.05). The D value suggested that the two SNPs exhibited strong linkage disequilibrium. Three haplotypes (G1G2, G1A2, and A1A2) were significantly associated with IMF, CY, the a* value, and all of the carcass traits measured (P < 0.05). The results suggest that FTO is a candidate locus that affects carcass and meat quality traits in ducks.
AGARWAL, SANDEEP K.; GOURH, PRAVITT; SHETE, SANJAY; PAZ, GENE; DIVECHA, DIPAL; REVEILLE, JOHN D.; ASSASSI, SHERVIN; TAN, FILEMON K.; MAYES, MAUREEN D.; ARNETT, FRANK C.
2010-01-01
Objective IL23R has been identified as a susceptibility gene for development of multiple autoimmune diseases. We investigated the possible association of IL23R with systemic sclerosis (SSc), an autoimmune disease that leads to the development of cutaneous and visceral fibrosis. Methods We tested 9 single-nucleotide polymorphisms (SNP) in IL23R for association with SSc in a cohort of 1402 SSc cases and 1038 controls. IL23R SNP tested were previously identified as SNP showing associations with inflammatory bowel disease. Results Case-control comparisons revealed no statistically significant differences between patients and healthy controls with any of the IL23R polymorphisms. Analyses of subsets of SSc patients showed that rs11209026 (Arg381Gln variant) was associated with anti-topoisomerase I antibody (ATA)-positive SSc (p = 0.001)) and rs11465804 SNP was associated with diffuse and ATA-positive SSc (p = 0.0001, p = 0.0026, respectively). These associations remained significant after accounting for multiple comparisons using the false discovery rate method. Wild-type genotype at both rs11209026 and rs11465804 showed significant protection against the presence of pulmonary hypertension (PHT). (p = 3×10−5, p = 1×10−5, respectively). Conclusion Polymorphisms in IL23R are associated with susceptibility to ATA-positive SSc and protective against development of PHT in patients with SSc. PMID:19918037
Pasaniuc, Bogdan; Zaitlen, Noah; Lettre, Guillaume; Chen, Gary K; Tandon, Arti; Kao, W H Linda; Ruczinski, Ingo; Fornage, Myriam; Siscovick, David S; Zhu, Xiaofeng; Larkin, Emma; Lange, Leslie A; Cupples, L Adrienne; Yang, Qiong; Akylbekova, Ermeg L; Musani, Solomon K; Divers, Jasmin; Mychaleckyj, Joe; Li, Mingyao; Papanicolaou, George J; Millikan, Robert C; Ambrosone, Christine B; John, Esther M; Bernstein, Leslie; Zheng, Wei; Hu, Jennifer J; Ziegler, Regina G; Nyante, Sarah J; Bandera, Elisa V; Ingles, Sue A; Press, Michael F; Chanock, Stephen J; Deming, Sandra L; Rodriguez-Gil, Jorge L; Palmer, Cameron D; Buxbaum, Sarah; Ekunwe, Lynette; Hirschhorn, Joel N; Henderson, Brian E; Myers, Simon; Haiman, Christopher A; Reich, David; Patterson, Nick; Wilson, James G; Price, Alkes L
2011-04-01
While genome-wide association studies (GWAS) have primarily examined populations of European ancestry, more recent studies often involve additional populations, including admixed populations such as African Americans and Latinos. In admixed populations, linkage disequilibrium (LD) exists both at a fine scale in ancestral populations and at a coarse scale (admixture-LD) due to chromosomal segments of distinct ancestry. Disease association statistics in admixed populations have previously considered SNP association (LD mapping) or admixture association (mapping by admixture-LD), but not both. Here, we introduce a new statistical framework for combining SNP and admixture association in case-control studies, as well as methods for local ancestry-aware imputation. We illustrate the gain in statistical power achieved by these methods by analyzing data of 6,209 unrelated African Americans from the CARe project genotyped on the Affymetrix 6.0 chip, in conjunction with both simulated and real phenotypes, as well as by analyzing the FGFR2 locus using breast cancer GWAS data from 5,761 African-American women. We show that, at typed SNPs, our method yields an 8% increase in statistical power for finding disease risk loci compared to the power achieved by standard methods in case-control studies. At imputed SNPs, we observe an 11% increase in statistical power for mapping disease loci when our local ancestry-aware imputation framework and the new scoring statistic are jointly employed. Finally, we show that our method increases statistical power in regions harboring the causal SNP in the case when the causal SNP is untyped and cannot be imputed. Our methods and our publicly available software are broadly applicable to GWAS in admixed populations.
Nawaz, Syed Kashif; Noreen, Aasma; Rani, Asima; Yousaf, Memoona; Arshad, Muhammad
2015-01-01
Objective: The present study aimed to investigate the association between the rs10757274 SNP (present on locus 9p21 in the gene for CDKN2B-AS1) and coronary artery disease (CAD) in a local population of Pakistan. Methods: It was a case-control study. An allele-specific PCR-based strategy was used for the identification of genotypes. A total of 350 samples were used for the investigation, out of which 220 samples were CAD patients and 130 samples were normal healthy individuals. Effects of parameters, like family history of CAD, smoking, presence of diabetes, and hypertension, in changing the chances of CAD were studied. Odds ratio was estimated with 95% confidence interval. Results: A strong association was observed between CAD and factors, like smoking (OR: 1.666; 95% CI: 1.042-2.664), presence of hypertension (OR: 26.55; 95% CI: 15.95-44.20), diabetes (OR: 3.009; 95% CI: 1.841-4.920), and family history of CAD (OR: 4.9; 95% CI: 2.965-8.099). Results for the association between the genotype on the basis of rs10757274 showed a strong association between the GG genotype and the occurrence of CAD (OR: 9.603; 95% CI: 5.746-16.05). Conclusion: The present results suggest the importance of the 9p21 locus in modulating the chances of CAD. PMID:25592106
Yang, Ming; Xie, Wanling; Mostaghel, Elahe; Nakabayashi, Mari; Werner, Lillian; Sun, Tong; Pomerantz, Mark; Freedman, Matthew; Ross, Robert; Regan, Meredith; Sharifi, Nima; Figg, William Douglas; Balk, Steven; Brown, Myles; Taplin, Mary-Ellen; Oh, William K.; Lee, Gwo-Shu Mary; Kantoff, Philip W.
2011-01-01
Purpose Androgen deprivation therapy (ADT), an important treatment for advanced prostate cancer, is highly variable in its effectiveness. We hypothesized that genetic variants of androgen transporter genes, SLCO2B1 and SLCO1B3, may determine time to progression on ADT. Patients and Methods A cohort of 538 patients with prostate cancer treated with ADT was genotyped for SLCO2B1 and SLCO1B3 single nucleotide polymorphisms (SNP). The biologic function of a SLCO2B1 coding SNP in transporting androgen was examined through biochemical assays. Results Three SNPs in SLCO2B1 were associated with time to progression (TTP) on ADT (P < .05). The differences in median TTP for each of these polymorphisms were about 10 months. The SLCO2B1 genotype, which allows more efficient import of androgen, enhances cell growth and is associated with a shorter TTP on ADT. Patients carrying both SLCO2B1 and SLCO1B3 genotypes, which import androgens more efficiently, exhibited a median 2-year shorter TTP on ADT, demonstrating a gene-gene interaction (Pinteraction = .041). Conclusion Genetic variants of SLCO2B1 and SLCO1B3 may function as pharmacogenomic determinants of resistance to ADT in prostate cancer. PMID:21606417
Validation and discovery of genotype-phenotype associations in chronic diseases using linked data.
Pathak, Jyotishman; Kiefer, Richard; Freimuth, Robert; Chute, Christopher
2012-01-01
This study investigates federated SPARQL queries over Linked Open Data (LOD) in the Semantic Web to validate existing, and potentially discover new genotype-phenotype associations from public datasets. In particular, we report our preliminary findings for identifying such associations for commonly occurring chronic diseases using the Online Mendelian Inheritance in Man (OMIM) and Database for SNPs (dbSNP) within the LOD knowledgebase and compare them with Gene Wiki for coverage and completeness. Our results indicate that Semantic Web technologies can play an important role for in-silico identification of novel disease-gene-SNP associations, although additional verification is required before such information can be applied and used effectively.
Validation of Pooled Whole-Genome Re-Sequencing in Arabidopsis lyrata.
Fracassetti, Marco; Griffin, Philippa C; Willi, Yvonne
2015-01-01
Sequencing pooled DNA of multiple individuals from a population instead of sequencing individuals separately has become popular due to its cost-effectiveness and simple wet-lab protocol, although some criticism of this approach remains. Here we validated a protocol for pooled whole-genome re-sequencing (Pool-seq) of Arabidopsis lyrata libraries prepared with low amounts of DNA (1.6 ng per individual). The validation was based on comparing single nucleotide polymorphism (SNP) frequencies obtained by pooling with those obtained by individual-based Genotyping By Sequencing (GBS). Furthermore, we investigated the effect of sample number, sequencing depth per individual and variant caller on population SNP frequency estimates. For Pool-seq data, we compared frequency estimates from two SNP callers, VarScan and Snape; the former employs a frequentist SNP calling approach while the latter uses a Bayesian approach. Results revealed concordance correlation coefficients well above 0.8, confirming that Pool-seq is a valid method for acquiring population-level SNP frequency data. Higher accuracy was achieved by pooling more samples (25 compared to 14) and working with higher sequencing depth (4.1× per individual compared to 1.4× per individual), which increased the concordance correlation coefficient to 0.955. The Bayesian-based SNP caller produced somewhat higher concordance correlation coefficients, particularly at low sequencing depth. We recommend pooling at least 25 individuals combined with sequencing at a depth of 100× to produce satisfactory frequency estimates for common SNPs (minor allele frequency above 0.05).
Wong, Michelle; Öhrmalm, Lars; Broliden, Kristina; Aust, Carl; Hibberd, Martin; Tolfvenstam, Thomas
2012-01-01
Mannose-binding Lectin protein (MBL) has been suggested to be relevant in the defence against infections in immunosuppressed individuals. In a Swedish adult cohort immunosuppressed from both the underlying disease and from iatrogenic treatments for their underlying disease we investigated the role of MBL in susceptibility to infection. In this cross sectional, prospective study, blood samples obtained from 96 neutropaenic febrile episodes, representing 82 individuals were analysed for single nucleotide polymorphism (SNP) in the MBL2 gene. Concurrent measurement of plasma MBL protein concentrations was also performed for observation of acute response during febrile episodes. No association was observed between MBL2 genotype or plasma MBL concentrations, and the type or frequency of infection. Adding to the literature, we found no evidence that viral infections or co-infections with virus and bacteria would be predisposed by MBL deficiency. We further saw no correlation between MBL2 genotype and the risk of fever. However, fever duration in febrile neutropaenic episodes was negatively associated with MBL2 SNP mutations (p<0.05). Patients with MBL2 SNP mutations presented a median febrile duration of 1.8 days compared with 3 days amongst patients with wildtype MBL2 genotype. We found no clear association between infection, or infection type to MBL2 genotypes or plasma MBL concentration, and add to the reports casting doubts on the benefit of recombinant MBL replacement therapy use during iatrogenic neutropaenia.
Genetic Variant of Kalirin Gene Is Associated with Ischemic Stroke in a Chinese Han Population.
Li, Hong; Yu, Shasha; Wang, Rui; Sun, Zhaoqing; Zhou, Xinghu; Zheng, Liqiang; Yin, Zhihua; Zhang, Xingang; Sun, Yingxian
2017-01-01
Ischemic stroke is a complex disorder resulting from the interplay of genetic and environmental factors. Previous studies showed that kalirin gene variations were associated with cardiovascular disease. However, the association between this gene and ischemic stroke was unknown. We performed this study to confirm if kalirin gene variation was associated with ischemic stroke. We enrolled 385 ischemic stroke patients and 362 controls from China. Three SNPs of kalirin gene were genotyped by means of ligase detection reaction-PCR method. Data was processed with SPSS and SHEsis platform. SNP rs7620580 (dominant model: OR = 1.590, p = 0.002 and adjusted OR = 1.662, p = 0.014; additive model: OR = 1.490, p = 0.002 and adjusted OR = 1.636, p = 0.005; recessive model: OR = 2.686, p = 0.039) and SNP rs1708303 (dominant model: OR = 1.523, p = 0.007 and adjusted OR = 1.604, p = 0.028; additive model: OR = 1.438, p = 0.01 and adjusted OR = 1.476, p = 0.039) were associated with ischemic stroke. The GG genotype and G allele of SNP rs7620580 were associated with a risk for ischemic stroke with an adjusted OR of 3.195 and an OR of 1.446, respectively. Haplotype analysis revealed that A-T-G,G-T-A, and A-T-A haplotypes were associated with ischemic stroke. Our results provide evidence that kalirin gene variations were associated with ischemic stroke in the Chinese Han population.
Ghrelin gene polymorphisms in rheumatoid arthritis.
Ozgen, Metin; Koca, Suleyman Serdar; Etem, Ebru Onalan; Yuce, Huseyin; Aydin, Suleyman; Isik, Ahmet
2011-07-01
Ghrelin, an endogenous orexigenic peptide, has anti-inflammatory effects, down-regulates pro-inflammatory cytokines, and its altered levels are reported in various inflammatory diseases. The human preproghrelin (ghrelin/obestatin) gene shows several single nucleotide polymorphisms (SNPs) including Arg51Gln, Leu72Met, Gln90Leu, and A-501C. The aim of this study was to investigate the frequency, and clinical significance, of these four SNPs in a small cohort of Turkish patients with rheumatoid arthritis (RA). The study included 103 patients with RA and 103 healthy controls. In the RA group, disease activity and disease-related damage were assessed using the Disease Activity Score-28 (DAS-28), and the modified Larsen scoring (MLS) methods. In all the participants, genomic DNA was isolated and genotyped by polymerase chain reaction and restriction fragment length polymorphism analysis. The frequencies of ghrelin gene SNPs were 82.5 and 79.6% in the RA and control groups, respectively, and there were no significant differences in terms of genotype distributions and allele frequencies for these four SNPs between the groups. However, the A-501C SNP was found to be associated with early disease onset, and Gln90Leu SNP with less frequent rheumatoid factor positivity, in the RA group. A-501C SNP is associated with earlier onset of RA suggesting that genetic variations in the ghrelin gene may have an impact on RA. Copyright © 2010 Société française de rhumatologie. Published by Elsevier SAS. All rights reserved.
Estimation of genomic breeding values for residual feed intake in a multibreed cattle population.
Khansefid, M; Pryce, J E; Bolormaa, S; Miller, S P; Wang, Z; Li, C; Goddard, M E
2014-08-01
Residual feed intake (RFI) is a measure of the efficiency of animals in feed utilization. The accuracies of GEBV for RFI could be improved by increasing the size of the reference population. Combining RFI records of different breeds is a way to do that. The aims of this study were to 1) develop a method for calculating GEBV in a multibreed population and 2) improve the accuracies of GEBV by using SNP associated with RFI. An alternative method for calculating accuracies of GEBV using genomic BLUP (GBLUP) equations is also described and compared to cross-validation tests. The dataset included RFI records and 606,096 SNP genotypes for 5,614 Bos taurus animals including 842 Holstein heifers and 2,009 Australian and 2,763 Canadian beef cattle. A range of models were tested for combining genotype and phenotype information from different breeds and the best model included an overall effect of each SNP, an effect of each SNP specific to a breed, and a small residual polygenic effect defined by the pedigree. In this model, the Holsteins and some Angus cattle were combined into 1 "breed class" because they were the only cattle measured for RFI at an early age (6-9 mo of age) and were fed a similar diet. The average empirical accuracy (0.31), estimated by calculating the correlation between GEBV and actual phenotypes divided by the square root of estimated heritability in 5-fold cross-validation tests, was near to that expected using the GBLUP equations (0.34). The average empirical and expected accuracies were 0.30 and 0.31, respectively, when the GEBV were estimated for each breed separately. Therefore, the across-breed reference population increased the accuracy of GEBV slightly, although the gain was greater for breeds with smaller number of individuals in the reference population (0.08 in Murray Grey and 0.11 in Hereford for empirical accuracy). In a second approach, SNP that were significantly (P < 0.001) associated with RFI in the beef cattle genomewide association studies were used to create an auxiliary genomic relationship matrix for estimating GEBV in Holstein heifers. The empirical (and expected) accuracy of GEBV within Holsteins increased from 0.33 (0.35) to 0.39 (0.36) and improved even more to 0.43 (0.50) when using a multibreed reference population. Therefore, a multibreed reference population is a useful resource to find SNP with a greater than average association with RFI in 1 breed and use them to estimate GEBV in another breed.
Mason, Annaliese S; Zhang, Jing; Tollenaere, Reece; Vasquez Teuber, Paula; Dalton-Morgan, Jessica; Hu, Liyong; Yan, Guijun; Edwards, David; Redden, Robert; Batley, Jacqueline
2015-09-01
Germplasm collections provide an extremely valuable resource for breeders and researchers. However, misclassification of accessions by species often hinders the effective use of these collections. We propose that use of high-throughput genotyping tools can provide a fast, efficient and cost-effective way of confirming species in germplasm collections, as well as providing valuable genetic diversity data. We genotyped 180 Brassicaceae samples sourced from the Australian Grains Genebank across the recently released Illumina Infinium Brassica 60K SNP array. Of these, 76 were provided on the basis of suspected misclassification and another 104 were sourced independently from the germplasm collection. Presence of the A- and C-genomes combined with principle components analysis clearly separated Brassica rapa, B. oleracea, B. napus, B. carinata and B. juncea samples into distinct species groups. Several lines were further validated using chromosome counts. Overall, 18% of samples (32/180) were misclassified on the basis of species. Within these 180 samples, 23/76 (30%) supplied on the basis of suspected misclassification were misclassified, and 9/105 (9%) of the samples randomly sourced from the Australian Grains Genebank were misclassified. Surprisingly, several individuals were also found to be the product of interspecific hybridization events. The SNP (single nucleotide polymorphism) array proved effective at confirming species, and provided useful information related to genetic diversity. As similar genomic resources become available for different crops, high-throughput molecular genotyping will offer an efficient and cost-effective method to screen germplasm collections worldwide, facilitating more effective use of these valuable resources by breeders and researchers. © 2015 John Wiley & Sons Ltd.
Abraham, Gad; Kowalczyk, Adam; Zobel, Justin; Inouye, Michael
2013-02-01
A central goal of medical genetics is to accurately predict complex disease from genotypes. Here, we present a comprehensive analysis of simulated and real data using lasso and elastic-net penalized support-vector machine models, a mixed-effects linear model, a polygenic score, and unpenalized logistic regression. In simulation, the sparse penalized models achieved lower false-positive rates and higher precision than the other methods for detecting causal SNPs. The common practice of prefiltering SNP lists for subsequent penalized modeling was examined and shown to substantially reduce the ability to recover the causal SNPs. Using genome-wide SNP profiles across eight complex diseases within cross-validation, lasso and elastic-net models achieved substantially better predictive ability in celiac disease, type 1 diabetes, and Crohn's disease, and had equivalent predictive ability in the rest, with the results in celiac disease strongly replicating between independent datasets. We investigated the effect of linkage disequilibrium on the predictive models, showing that the penalized methods leverage this information to their advantage, compared with methods that assume SNP independence. Our findings show that sparse penalized approaches are robust across different disease architectures, producing as good as or better phenotype predictions and variance explained. This has fundamental ramifications for the selection and future development of methods to genetically predict human disease. © 2012 WILEY PERIODICALS, INC.
Powrózek, Tomasz; Mlak, Radosław; Brzozowska, Anna; Mazurek, Marcin; Gołębiowski, Paweł; Małecka-Massalska, Teresa
2018-05-25
Malnutrition and cachexia are frequent among head and neck cancer (HNC) patients and these syndromes are associated with both poor quality of life and unfavorable disease prognosis. Unfortunately, there are still no established biomarkers that could predict the development of cachexia. Among potential molecular alterations related to cancer cachexia, there are single-nucleotide polymorphisms (SNPs) within genes encoding pro-inflammatory cytokines such as TNF-α. To investigate TNF-α -1031T/C SNP as a risk factor of cachexia in 62 HNC patients subjected to radiotherapy. DNA was isolated from whole blood samples and genotyping was conducted using real-time PCR method by means of TaqMan SNP Genotyping Assay. TNF-alpha Human ELISA Kit was used to determine TNF-α concentration in each extracted plasma sample. Moreover, the relationship between genotype variants of TNF-α and plasma level of TNF-α was examined. Detailed clinical-demographic and nutritional data were collected from each study participant. CC genotype carriers were at a significantly higher risk of being qualified as cachectic compared with other genotype carriers (p = 0.044; HR = 3.724). Subjects, who carried CC genotype had significantly lower body mass compared to patients with TT and CT genotype (p = 0.045). Moreover, CC individuals had the highest TNF-α plasma level (median 10.70 ± 0.72 pg/mL, p = 0.006) among the studied cases. We also noted, that CC genotype carriers had significantly higher risk of early death incidence compared to other genotype carriers [overall survival (OS): 28 vs 38 months (HR = 3.630, p = 0.013)]. Despite the differences between SGA and NRS scoring, the presence of CC genotype could be a useful objective marker allowing for the prediction of cachexia development in both parenterally nourished and non-parenterally nourished patients. Patients with CC genotype had also the highest risk of early death incidence; therefore, such individuals should be qualified for parenteral nutrition and supportive care at the time of diagnosis to improve further therapy outcomes. Moreover, this is the first study demonstrating the relationship between TNF-α -1031T/C polymorphism and plasma level of TNF-α. This is also the first paper investigating the role of TNF-α -1031T/C in cancer cachexia.
Barra, Gustavo Barcelos; Dutra, Ludmila Alves Sanches; Watanabe, Sílvia Conde; Costa, Patrícia Godoy Garcia; Cruz, Patrícia Sales Marques da; Azevedo, Monalisa Ferreira; Amato, Angélica Amorim
2012-11-01
To investigate the association of the T allele of the single nucleotide polymorphism (SNP) rs7903146 of TCF7L2 with the occurrence of T2D in a sample of subjects followed up at the Brasilia University Hospital. The SNP rs7903146 of TCF7L2 was genotyped by allele-specific PCR in 113 patients with known T2D and in 139 non-diabetic controls in Brasilia, Brazil. We found that the T allele of the SNP rs7903146 of TCF7L2 was significantly associated with T2D risk (odds ratio of 3.92 for genotype TT in the recessive genetic model, p = 0.004 and 1.5 for T allele, p = 0.032). These results reinforce previous findings on the consistent association of this genetic factor and the risk of T2D in populations of diverse ethnic backgrounds.
Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.
Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J
2010-12-01
The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df)=4.54 × 10(-5), P(rec)=7.74 × 10(-6)), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.
Granzyme B gene polymorphism associated with subacute sclerosing panencephalitis.
Yentur, Sibel P; Aydin, Hatice Nur; Gurses, Candan; Demirbilek, Veysi; Kuru, Umit; Uysal, Serap; Yapici, Zuhal; Baris, Safa; Yilmaz, Gülden; Cokar, Ozlem; Onal, Emel; Gokyigit, Ayşen; Saruhan-Direskeneli, Güher
2014-10-01
Subacute sclerosing panencephalitis (SSPE) is a late complication of measles infection. Immune dysfunction related to genetic susceptibility has been considered in disease pathogenesis. A functional single nucleotide polymorphism (SNP) of granzyme B gene (GZMB) reported in several pathologies may also be involved in susceptibility to SSPE. An SNP (rs8192917, G → A, R→Q) was screened in 118 SSPE patients and 221 healthy controls (HC) by polymerase chain reaction-restriction fragment length polymorphism. Frequencies were compared between groups. In vitro production of GZMB was measured in controls with different genotypes. The SNP had a minor allele (G) frequency of 0.22 in patients and 0.31 in controls. GG genotype was significantly less frequent in patients (odds ratio, 0.23). G allele carriers produced relatively higher levels of GZMB, when stimulated in vitro. These findings implicate possible effect of this genetic polymorphism in susceptibility to SSPE which needs to be confirmed in bigger populations. Georg Thieme Verlag KG Stuttgart · New York.
Toll-like receptors genes polymorphisms and the occurrence of HCMV infection among pregnant women.
Wujcicka, Wioletta; Paradowska, Edyta; Studzińska, Mirosława; Wilczyński, Jan; Nowakowska, Dorota
2017-03-24
Human cytomegalovirus (HCMV) is the most common cause of intrauterine infections worldwide. The toll-like receptors (TLRs) have been reported as important factors in immune response against HCMV. Particularly, TLR2, TLR4 and TLR9 have been shown to be involved in antiviral immunity. Evaluation of the role of single nucleotide polymorphisms (SNPs), located within TLR2, TLR4 and TLR9 genes, in the development of human cytomegalovirus (HCMV) infection in pregnant women and their fetuses and neonates, was performed. The study was performed for 131 pregnant women, including 66 patients infected with HCMV during pregnancy, and 65 age-matched control pregnant individuals. The patients were selected to the study, based on serological status of anti-HCMV IgG and IgM antibodies and on the presence of viral DNA in their body fluids. Genotypes in TLR2 2258 A > G, TLR4 896 G > A and 1196 C > T and TLR9 2848 G > A SNPs were determined by self-designed nested PCR-RFLP assays. Randomly selected PCR products, representative for distinct genotypes in TLR SNPs, were confirmed by sequencing. A relationship between the genotypes, alleles, haplotypes and multiple variants in the studied polymorphisms, and the occurrence of HCMV infection in pregnant women and their offsprings, was determined, using a logistic regression model. Genotypes in all the analyzed polymorphisms preserved the Hardy-Weinberg equilibrium in pregnant women, both infected and uninfected with HCMV (P > 0.050). GG homozygotic and GA heterozygotic status in TLR9 2848 G > A SNP decreased significantly the occurrence of HCMV infection (OR 0.44 95% CI 0.21-0.94 in the dominant model, P ≤ 0.050). The G allele in TLR9 SNP was significantly more frequent among the uninfected pregnant women than among the infected ones (χ 2 = 4.14, P ≤ 0.050). Considering other polymorphisms, similar frequencies of distinct genotypes, haplotypes and multiple-SNP variants were observed between the studied groups of patients. TLR9 2848 G > A SNP may be associated with HCMV infection in pregnant women.
Howie, Bryan N.; Donnelly, Peter; Marchini, Jonathan
2009-01-01
Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%–20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions. PMID:19543373
Koshy, Linda; Vijayalekshmi, S V; Harikrishnan, S; Raman, Kutty V; Jissa, V T; Jayakumaran Nair, A; Gangaprasad, A; Nair, G M; Sudhakaran, P R
2017-11-28
Epigenetic regulation of arterial blood pressure mediated through mirSNPs in renin-angiotensin aldosterone system (RAAS) genes is a less explored hypothesis. Recently, the mirSNP rs11174811 in the 3'UTR of the AVPR1A gene was associated with higher arterial blood pressure in a large study population from the Study of Myocardial Infarctions Leiden (SMILE). The aim of the present study was to replicate the association of mirSNP rs11174811 with blood pressure outcomes and hypertension in a south Indian population. Four hundred and fifteen hypertensive cases and 416 normotensive controls were genotyped using a 5' nuclease allelic discrimination assay. Logistic regression was used to test the association of mirSNP rs11174811 with the hypertension phenotype. Censored normal regression was used to test the association of the polymorphism with continuous blood pressure outcomes such as systolic and diastolic blood pressure. The mirSNP rs11174811 did not show any significant association with hypertension. The adjusted odds ratio was 1.02, with 95% CI of 0.72 to 1.45 (p = 0.909). The mean systolic and diastolic blood pressure values were not significantly different across the three genotypic groups, between hypertensives and normotensives, or when stratified by gender. Despite having a similar minor allele frequency (MAF) of 14.5% compared with the SMILE cohort, our results did not support an association of the mirSNP rs11174811 with the hypertension phenotype or with continuous blood pressure outcomes in the south Indian population.
Hu, J; Li, Q-L; Hou, S-H; Peng, H; Guo, J-J
2015-09-01
Inducible T cell costimulator (ICOS) functions to regulate cell-cell signalling, immune responses and cell proliferation. ICOS single nucleotide polymorphism (SNP) may affect protein expression and functions. This study investigated the association of ICOS SNPs with hepatitis B virus (HBV) infection and outcome in a Chinese population. A total of 1290 Chinese Han individuals were enrolled, including 63 asymptomatic HBV carriers, 220 chronic hepatitis B patients (CHB), 249 HBV-related liver cirrhosis patients (LC), 108 patients with HBV-related hepatocellular carcinoma (HCC), 338 patients with natural HBV clearance and 312 healthy subjects (as controls). DNA samples from these subjects were genotyped for four ICOS SNPs (rs11883722, rs10932029, rs1559931 and rs4675379) using TaqMan SNP Genotyping Assay and analysed. The data showed that genotype and allele frequencies of ICOS SNPs in cases and controls followed the Hardy-Weinberg distribution. The CC genotype of rs4675379 was higher in patients with HBV infection (including AC, CHB, LC and HCC) than in patients with HBV clearance (P = 0.006). Furthermore, the genotype 'GA' and the minor allele 'A' of rs1559931 were associated with a decreased HCC susceptibility (P < 0.001). Haplotype analysis data showed that 'GC' haplotype in block 2 (rs1559931 and rs4675379) had a lower frequency in patients than in HBV-cleared subjects (P = 0.034), although its overall frequency was only 1.6%. Our study found that ICOS rs1559931 SNP was associated with decreased HBV-related HCC risk in the studied Chinese Han population, except for patients with natural clearance of HBV. © 2015 The Foundation for the Scandinavian Journal of Immunology.
Llamas-Covarrubias, Iris Monserrat; Llamas-Covarrubias, Mara Anaís; Martinez-López, Erika; Zepeda-Carrillo, Eloy Alfonso; Rivera-León, Edgar Alfonso; Palmeros-Sánchez, Beatriz; Alcalá-Zermeño, Juan Luis; Sánchez-Enríquez, Sergio
2017-07-01
Obesity is a metabolic disorder that has a multifactorial etiology and affects millions of people worldwide. Ghrelin, a hormone coded by the GHRL gene, plays a role in human body composition and appetite. Single nucleotide polymorphisms (SNPs) of the GHRL gene have been associated with obesity and metabolic disorders. To evaluate the association of A-604G SNP of GHRL promoter region with serum ghrelin levels and the risk of obesity in a Mexican population. Two hundred and fifty individuals were enrolled and classified as obese or control subjects (CS) according to BMI. DNA samples, anthropometric measurements and biochemical parameters were obtained from all subjects. The A-604G SNP was genotyped using PCR-RFLPs technique. Ghrelin levels were measured using a commercial enzyme immunoassay. The G/G genotype was more frequent among obese individuals (p < 0.0001) when compared to CS. The G/A genotype and A allele were associated with protection against obesity (OR 0.29, p < 0.0001; OR 0.39, p < 0.0001 respectively), the A allele remained significant after adjusting for age and gender (OR: 0.25, p < 0.0001). Serum ghrelin levels were higher in obese patients (p = 0.004) than in CS, however, significance was lost after adjustment for age (p = 0.088). The G/G genotype was associated with higher levels of serum ghrelin (p = 0.02) independently of the effect of age. The G/G genotype of the A-604G SNP in the GHRL gene is associated with altered serum ghrelin levels and obesity. The A allele was also associated with protection against obesity in this study.
Fu, Yong-Bi; Peterson, Gregory W; Dong, Yibo
2016-04-07
Genotyping-by-sequencing (GBS) has emerged as a useful genomic approach for exploring genome-wide genetic variation. However, GBS commonly samples a genome unevenly and can generate a substantial amount of missing data. These technical features would limit the power of various GBS-based genetic and genomic analyses. Here we present software called IgCoverage for in silico evaluation of genomic coverage through GBS with an individual or pair of restriction enzymes on one sequenced genome, and report a new set of 21 restriction enzyme combinations that can be applied to enhance GBS applications. These enzyme combinations were developed through an application of IgCoverage on 22 plant, animal, and fungus species with sequenced genomes, and some of them were empirically evaluated with different runs of Illumina MiSeq sequencing in 12 plant species. The in silico analysis of 22 organisms revealed up to eight times more genome coverage for the new combinations consisted of pairing four- or five-cutter restriction enzymes than the commonly used enzyme combination PstI + MspI. The empirical evaluation of the new enzyme combination (HinfI + HpyCH4IV) in 12 plant species showed 1.7-6 times more genome coverage than PstI + MspI, and 2.3 times more genome coverage in dicots than monocots. Also, the SNP genotyping in 12 Arabidopsis and 12 rice plants revealed that HinfI + HpyCH4IV generated 7 and 1.3 times more SNPs (with 0-16.7% missing observations) than PstI + MspI, respectively. These findings demonstrate that these novel enzyme combinations can be utilized to increase genome sampling and improve SNP genotyping in various GBS applications. Copyright © 2016 Fu et al.
Kadkhodazadeh, Mahdi; Ebadian, Ahmad Reza; Gholami, Gholam Ali; Khosravi, Alireza; Tabari, Zahra Alizadeh
2013-05-01
RANK/OPG/RANKL pathway plays a significant role in osteoclastogenesis, osteoclast activation, and regulation of bone resorption. The aim of this study was to investigate the association of RANKL gene polymorphisms (rs9533156 and rs2277438) with chronic periodontitis and peri-implantitis in an Iranian population. 77 patients with chronic periodontitis, 40 patients with peri-implantitis and 89 periodontally healthy patients were enrolled in this study. 5cc of blood was obtained from the cephalic vein of subjects arms and transferred into tubes containing EDTA. Genomic DNA was extracted using Miller's Salting Out technique. The DNA was transferred into 96 division plates, transported to Kbioscience Institute in United Kingdom and analyzed using the Kbioscience Competitive Allele Specific PCR (KASP) technique. Differences in the frequencies of genotypes and alleles in the disease and control groups were analyzed using Chi-square and Fisher's exact statistical tests. Comparison of frequency of alleles in SNP rs9533156 of RANKL gene between the chronic periodontitis group with the control and peri-implantitis groups revealed statistically significant differences (P=0.024 and P=0.027, respectively). Comparison of genotype expression of SNP rs9533156 on RANKL gene between the peri-implantitis group with chronic periodontitis and control groups revealed statistically significant differences (P=0.001); the prevalence of CT genotype was significantly higher amongst the chronic periodontitis group. Regarding SNP rs2277438 of RANKL gene, comparison of prevalence of genotypes and frequency of alleles did not reveal any significant differences (P=0.641/P=0.537, respectively). The results of this study indicate that CT genotype of rs9533156 RANKL gene polymorphism was significantly associated with peri-implantitis, and may be considered as a genetic determinant for peri-implantitis. Copyright © 2012 Elsevier Ltd. All rights reserved.
2012-01-01
Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison. PMID:22908993
A high-density intraspecific SNP linkage map of pigeonpea (Cajanas cajan L. Millsp.)
Mandal, Paritra; Bhutani, Shefali; Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram Pratap; Chaudhary, A. K.; Yadav, Rekha; Gaikwad, K.; Sevanthi, Amitha Mithra; Datta, Subhojit; Raje, Ranjeet S.; Sharma, Tilak R.; Singh, Nagendra Kumar
2017-01-01
Pigeonpea (Cajanus cajan (L.) Millsp.) is a major food legume cultivated in semi-arid tropical regions including the Indian subcontinent, Africa, and Southeast Asia. It is an important source of protein, minerals, and vitamins for nearly 20% of the world population. Due to high carbon sequestration and drought tolerance, pigeonpea is an important crop for the development of climate resilient agriculture and nutritional security. However, pigeonpea productivity has remained low for decades because of limited genetic and genomic resources, and sparse utilization of landraces and wild pigeonpea germplasm. Here, we present a dense intraspecific linkage map of pigeonpea comprising 932 markers that span a total adjusted map length of 1,411.83 cM. The consensus map is based on three different linkage maps that incorporate a large number of single nucleotide polymorphism (SNP) markers derived from next generation sequencing data, using Illumina GoldenGate bead arrays, and genotyping with restriction site associated DNA (RAD) sequencing. The genotyping-by-sequencing enhanced the marker density but was met with limited success due to lack of common markers across the genotypes of mapping population. The integrated map has 547 bead-array SNP, 319 RAD-SNP, and 65 simple sequence repeat (SSR) marker loci. We also show here correspondence between our linkage map and published genome pseudomolecules of pigeonpea. The availability of a high-density linkage map will help improve the anchoring of the pigeonpea genome to its chromosomes and the mapping of genes and quantitative trait loci associated with useful agronomic traits. PMID:28654689
DOE Office of Scientific and Technical Information (OSTI.GOV)
Paez, David, E-mail: dpaez@santpau.cat; Salazar, Juliana; Pare, Laia
Purpose: Several studies have been performed to evaluate the usefulness of neoadjuvant treatment using oxaliplatin and fluoropyrimidines for locally advanced rectal cancer. However, preoperative biomarkers of outcome are lacking. We studied the polymorphisms in thymidylate synthase, epidermal growth factor receptor, glutathione S-transferase pi 1 (GSTP1), and several DNA repair genes to evaluate their usefulness as pharmacogenetic markers in a cohort of 128 rectal cancer patients treated with preoperative chemoradiotherapy. Methods and Materials: Blood samples were obtained from 128 patients with Stage II-III rectal cancer. DNA was extracted from the peripheral blood nucleated cells, and the genotypes were analyzed by polymerasemore » chain reaction amplification and automated sequencing techniques or using a 48.48 dynamic array on the BioMark system. The germline polymorphisms studied were thymidylate synthase, (VNTR/5 Prime UTR, 2R G>C single nucleotide polymorphism [SNP], 3R G>C SNP), epidermal growth factor receptor (Arg497Lys), GSTP1 (Ile105val), excision repair cross-complementing 1 (Asn118Asn, 8092C>A, 19716G>C), X-ray repair cross-complementing group 1 (XRCC1) (Arg194Trp, Arg280His, Arg399Gln), and xeroderma pigmentosum group D (Lys751Gln). The pathologic response, pathologic regression, progression-free survival, and overall survival were evaluated according to each genotype. Results: The Asterisk-Operator 3/ Asterisk-Operator 3 thymidylate synthase genotype was associated with a greater response rate (pathologic complete remission and microfoci residual tumor, 59% in Asterisk-Operator 3/ Asterisk-Operator 3 vs. 35% in Asterisk-Operator 2/ Asterisk-Operator 2 and Asterisk-Operator 2/ Asterisk-Operator 3; p = .013). For the thymidylate synthase genotype, the median progression-free survival was 103 months for the Asterisk-Operator 3/ Asterisk-Operator 3 patients and 84 months for the Asterisk-Operator 2/ Asterisk-Operator 2 and Asterisk-Operator 2/ Asterisk-Operator 3 patients (p = .039). For XRCC1 Arg399Gln SNP, the median progression-free survival was 101 months for the G/G, 78 months for the G/A, and 31 months for the A/A patients (p = .048). Conclusions: The thymidylate synthase genotype and XRCC1 Arg399Gln polymorphism might help to identify Stage II-III rectal cancer patients with a better outcome after preoperative concomitant chemoradiotherapy.« less
Kobayashi, Fuminori; Tanaka, Tsuyoshi; Kanamori, Hiroyuki; Wu, Jianzhong; Katayose, Yuichi; Handa, Hirokazu
2016-03-01
A core collection of Japanese wheat varieties (JWC) consisting of 96 accessions was established based on their passport data and breeding pedigrees. To clarify the molecular basis of the JWC collection, genome-wide single-nucleotide polymorphism (SNP) genotyping was performed using the genotyping-by-sequencing (GBS) approach. Phylogenetic tree and population structure analyses using these SNP data revealed the genetic diversity and relationships among the JWC accessions, classifying them into four groups; "varieties in the Hokkaido area", "modern varieties in the northeast part of Japan", "modern varieties in the southwest part of Japan" and "classical varieties including landraces". This clustering closely reflected the history of wheat breeding in Japan. Furthermore, to demonstrate the utility of the JWC collection, we performed a genome-wide association study (GWAS) for three traits, namely, "days to heading in autumn sowing", "days to heading in spring sowing" and "culm length". We found significantly associated SNP markers with each trait, and some of these were closely linked to known major genes for heading date or culm length on the genetic map. Our study indicates that this JWC collection is a useful set of germplasm for basic and applied research aimed at understanding and utilizing the genetic diversity among Japanese wheat varieties.
A potential molecular marker for selection against abdominal fatness in chickens.
Wu, G Q; Deng, X M; Li, J Y; Li, N; Yang, N
2006-11-01
The peroxisome proliferators-activated receptor-gamma coactivator-1alpha (PGC-1alpha) was investigated as a candidate gene for growth and fatness traits in chicken because of its prominent role in muscle fiber specialization and adipogenesis. A single nucleotide polymorphism (SNP) from G to A at position 646 of the open reading frame of chicken PGC-1alpha gene causing an Asp216Asn amino acid substitution was identified. The frequencies of alleles and genotypes were significantly different among 6 chicken breeds (P < 0.01). The White Plymouth Rock had the highest frequency (0.67) of allele G, whereas the White Leghorn had the lowest (0.18). The associations of the SNP with the growth and fatness traits were evaluated in 332 F(2) birds from an experimental cross of White Plymouth Rock x Silkies. No association was found between the SNP and growth-related traits. However, abdominal fat weight at 12 wk of age for birds with genotype GG was 34.26 and 28.71% higher than those with genotypes AA and AG, respectively (P < 0.01), indicating that the Asp216Asn polymorphism of the PGC-1alpha gene could be used as a novel potential molecular marker for selection against abdominal fatness without interfering in regular breeding for growth rate of chickens.
van der Starre, Willize E.; van Nieuwkoop, Cees; Thomson, Uginia; Zijderveld-Voshart, Marleen S. M.; Koopman, Jan Pieter R.; van der Reijden, Tanny J. K.; van Dissel, Jaap T.; van de Vosse, Esther
2015-01-01
Objective/Purpose Febrile urinary tract infection (UTI) is a common bacterial disease that may lead to substantial morbidity and mortality especially among the elderly. Little is known about biomarkers that predict a complicated course. Our aim was to determine the role of certain urinary cytokines or antimicrobial proteins, plasma vitamin D level, and genetic variation in host defense of febrile UTI and its relation with bacteremia. Methods A case-control study. Out of a cohort of consecutive adults with febrile UTI (n = 787) included in a multi-center observational cohort study, 46 cases with bacteremic E.coli UTI and 45 cases with non-bacteremic E.coli UTI were randomly selected and compared to 46 controls. Urinary IL-6, IL-8, LL37, β-defensin 2 and uromodulin as well as plasma 25-hydroxyvitamin D were measured. In 440 controls and 707 UTI patients polymorphisms were genotyped in the genes CXCR1, DEFA4, DEFB1, IL6, IL8, MYD88, UMOD, TIRAP, TLR1, TLR2, TLR5 and TNF. Results IL-6, IL-8, and LL37 are different between controls and UTI patients, although these proteins do not distinguish between patients with and without bacteremia. While uromodulin did not differ between groups, inability to produce uromodulin is more common in patients with bacteremia. Most participants in the study, including the controls, had insufficient vitamin D and, at least in winter, UTI patients have lower vitamin D than controls. Associations were found between the CC genotype of IL6 SNP rs1800795 and occurrence of bacteremia and between TLR5 SNP rs5744168 and protection from UTI. The rare GG genotype of IL6 SNP rs1800795 was associated with higher β-defensin 2 production. Conclusion Although no biomarker was able to distinguish between UTI with or without bacteremia, two risk factors for bacteremia were identified. These were inability to produce uromodulin and an IL6 rs1800795 genotype. PMID:25807366