Sample records for randomly selected SNPs

  1. SNP selection and classification of genome-wide SNP data using stratified sampling random forests.

    PubMed

    Wu, Qingyao; Ye, Yunming; Liu, Yang; Ng, Michael K

    2012-09-01

    For high-dimensional genome-wide association (GWA) case-control data on complex diseases, a large proportion of single-nucleotide polymorphisms (SNPs) is usually irrelevant to the disease. Simple random sampling in a random forest, with the default mtry parameter used to choose the feature subspace, selects too many subspaces that contain no informative SNPs. An exhaustive search for an optimal mtry is therefore often required to include useful, relevant SNPs and to exclude the vast number of non-informative ones. However, such a search is too time-consuming to be practical for high-dimensional GWA data. The main aim of this paper is to propose a stratified sampling method for feature subspace selection when generating decision trees in a random forest for high-dimensional GWA data. Our idea is to design an equal-width discretization scheme for informativeness that divides SNPs into multiple groups. In feature subspace selection, we randomly select the same number of SNPs from each group and combine them to form a subspace from which a decision tree is generated. This stratified sampling procedure ensures that each subspace contains enough useful SNPs, avoids the very high computational cost of an exhaustive search for an optimal mtry, and maintains the randomness of a random forest. We use two genome-wide SNP data sets (Parkinson case-control data comprising 408,803 SNPs and Alzheimer case-control data comprising 380,157 SNPs) to demonstrate that the proposed stratified sampling method is effective and that it generates better random forests, with higher accuracy and a lower error bound, than Breiman's random forest generation method. For the Parkinson data, we also report some interesting genes identified by the method that may be associated with neurological disorders and merit further biological investigation.
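
The stratified subspace selection described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names, the toy informativeness scores, the number of groups and the per-group sample size are all assumptions made for the example.

```python
import random

def equal_width_groups(scores, n_groups):
    """Assign each feature index to one of n_groups equal-width score bins."""
    lo, hi = min(scores), max(scores)
    width = (hi - lo) / n_groups or 1.0  # fall back to 1.0 if all scores are equal
    groups = [[] for _ in range(n_groups)]
    for idx, s in enumerate(scores):
        b = min(int((s - lo) / width), n_groups - 1)  # clamp the maximum into the last bin
        groups[b].append(idx)
    return groups

def stratified_subspace(groups, per_group, rng):
    """Draw up to per_group feature indices from every group and combine them."""
    subspace = []
    for members in groups:
        k = min(per_group, len(members))
        subspace.extend(rng.sample(members, k))
    return subspace

# Hypothetical informativeness scores for 10 SNPs (illustrative values only)
scores = [0.01, 0.02, 0.03, 0.05, 0.10, 0.40, 0.45, 0.80, 0.90, 1.00]
groups = equal_width_groups(scores, n_groups=3)
subspace = stratified_subspace(groups, per_group=2, rng=random.Random(0))
print(groups)    # three bins of SNP indices
print(subspace)  # two indices from each bin
```

Each call to `stratified_subspace` would supply the candidate features for one tree, so every tree sees SNPs from the high-score bin while the draw within each bin stays random.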

  2. Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.

    PubMed

    Nguyen, Thanh-Tung; Huang, Joshua; Wu, Qingyao; Nguyen, Thuy; Li, Mark

    2015-01-01

    Single-nucleotide polymorphism (SNP) selection and identification are the most important tasks in genome-wide association data analysis. The problem is difficult because genome-wide association data are very high-dimensional and a large portion of the SNPs in the data are irrelevant to the disease. Advanced machine learning methods have been successfully used in genome-wide association studies (GWAS) to identify genetic variants that have relatively large effects in some common, complex diseases. Among them, the most successful is Random Forests (RF). Despite performing well in terms of prediction accuracy on some data sets of moderate size, RF still struggles in GWAS when selecting informative SNPs and building accurate prediction models. In this paper, we propose a new two-stage quality-based sampling method in random forests, named ts-RF, for SNP subspace selection in GWAS. The method first applies p-value assessment to find a cut-off point that separates informative and irrelevant SNPs into two groups. The informative SNP group is further divided into two sub-groups: highly informative and weakly informative SNPs. When sampling the SNP subspace for building the trees of the forest, only SNPs from these two sub-groups are taken into account. The feature subspaces always contain highly informative SNPs when used to split a node in a tree. This approach enables one to generate more accurate trees with a lower prediction error, while possibly avoiding overfitting. It allows one to detect interactions of multiple SNPs with the diseases, and to reduce the dimensionality and the amount of genome-wide association data needed for learning the RF model. 
Extensive experiments on two genome-wide SNP data sets (Parkinson case-control data comprising 408,803 SNPs and Alzheimer case-control data comprising 380,157 SNPs) and 10 gene data sets demonstrated that the proposed model significantly reduced prediction errors and outperformed most existing state-of-the-art random forests. The proposed model identified the top 25 SNPs in the Parkinson data set, including four interesting genes associated with neurological disorders. The presented approach has been shown to be effective in selecting informative sub-groups of SNPs potentially associated with diseases that traditional statistical approaches might fail to detect. The new RF works well for data in which the number of case-control objects is much smaller than the number of SNPs, which is a typical problem in gene data and GWAS. Experimental results demonstrated the effectiveness of the proposed RF model, which outperformed state-of-the-art RFs, including Breiman's RF, GRRF and wsRF.
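
The two-stage grouping and sampling described in this abstract might look roughly like the sketch below. The abstract does not say how the informative group is split into its two sub-groups, so the second threshold (`strong_cutoff`), the toy p-values and the per-group sample sizes are assumptions for illustration only.

```python
import random

def two_stage_groups(p_values, cutoff, strong_cutoff):
    """Stage 1 drops SNPs with p > cutoff; stage 2 splits the rest into two sub-groups."""
    strong = [i for i, p in enumerate(p_values) if p <= strong_cutoff]
    weak = [i for i, p in enumerate(p_values) if strong_cutoff < p <= cutoff]
    return strong, weak

def sample_subspace(strong, weak, n_strong, n_weak, rng):
    """Draw a subspace that always includes some highly informative SNPs."""
    chosen = rng.sample(strong, min(n_strong, len(strong)))
    chosen += rng.sample(weak, min(n_weak, len(weak)))
    return chosen

# Hypothetical per-SNP association p-values (illustrative values only)
p_values = [1e-8, 0.30, 0.004, 0.75, 1e-5, 0.03, 0.50, 0.02]
strong, weak = two_stage_groups(p_values, cutoff=0.05, strong_cutoff=1e-3)
subspace = sample_subspace(strong, weak, n_strong=1, n_weak=2, rng=random.Random(1))
print(strong, weak, subspace)
```

SNPs above the cut-off never enter a subspace, which is how this kind of scheme reduces the dimensionality seen by each tree.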

  3. Bilirubin and Stroke Risk Using a Mendelian Randomization Design.

    PubMed

    Lee, Sun Ju; Jee, Yon Ho; Jung, Keum Ji; Hong, Seri; Shin, Eun Soon; Jee, Sun Ha

    2017-05-01

    Circulating bilirubin, a natural antioxidant, is associated with decreased risk of stroke. However, the nature of the relationship between the two remains unknown. We used a Mendelian randomization analysis to assess the causal effect of serum bilirubin on stroke risk in Koreans. Fourteen single-nucleotide polymorphisms (SNPs) (<10⁻⁷), including rs6742078 of uridine diphosphoglucuronyl-transferase, were selected from a genome-wide association study of bilirubin level in the KCPS-II (Korean Cancer Prevention Study-II) Biobank subcohort consisting of 4793 healthy Koreans and 806 stroke cases. A weighted genetic risk score was calculated using the 14 SNPs selected from the top SNPs. Both rs6742078 (F statistic = 138) and the weighted genetic risk score with 14 SNPs (F statistic = 187) were strongly associated with bilirubin levels. Simultaneously, serum bilirubin level was associated with decreased risk of stroke in an ordinary least-squares analysis. However, in 2-stage least-squares Mendelian randomization analysis, no causal relationship between serum bilirubin and stroke risk was found. There is no evidence that bilirubin level is causally associated with risk of stroke in Koreans. Therefore, bilirubin level is not a risk determinant of stroke. © 2017 American Heart Association, Inc.
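
As a rough illustration of the two ingredients named in this abstract, the sketch below computes a weighted genetic risk score and a single-instrument two-stage least-squares estimate. It is a linear toy model under stated assumptions: the genotypes, weights, exposure and outcome values are invented, and the study's actual model (for example, how the binary stroke outcome and covariates were handled) is not given in the abstract.

```python
def weighted_grs(allele_counts, weights):
    """Weighted genetic risk score: sum of per-SNP weight times risk-allele count."""
    return sum(w * g for w, g in zip(weights, allele_counts))

def mean(xs):
    return sum(xs) / len(xs)

def cov(xs, ys):
    mx, my = mean(xs), mean(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)

def two_stage_least_squares(z, x, y):
    """Stage 1: regress exposure x on instrument z. Stage 2: regress outcome y on fitted x."""
    b1 = cov(z, x) / cov(z, z)
    a1 = mean(x) - b1 * mean(z)
    x_hat = [a1 + b1 * zi for zi in z]
    return cov(x_hat, y) / cov(x_hat, x_hat)

# Hypothetical data: 6 individuals, 3 SNPs (illustrative values only)
genotypes = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [0, 0, 0], [1, 2, 2], [2, 2, 1]]
weights = [0.5, 0.2, 0.1]                     # assumed per-allele effects on the exposure
z = [weighted_grs(g, weights) for g in genotypes]
x = [0.6, 0.7, 0.9, 0.5, 1.0, 1.1]            # exposure, e.g. a bilirubin-like measurement
y = [1, 0, 0, 1, 0, 0]                        # outcome indicator
beta_iv = two_stage_least_squares(z, x, y)
print(beta_iv)
```

With a single instrument, the two-stage estimate reduces to the ratio cov(z, y) / cov(z, x), which is a convenient check on the implementation.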

  4. Preselection statistics and Random Forest classification identify population informative single nucleotide polymorphisms in cosmopolitan and autochthonous cattle breeds.

    PubMed

    Bertolini, F; Galimberti, G; Schiavo, G; Mastrangelo, S; Di Gerlando, R; Strillacci, M G; Bagnato, A; Portolano, B; Fontanesi, L

    2018-01-01

    Commercial single nucleotide polymorphism (SNP) arrays have recently been developed for several species and can be used to identify informative markers that differentiate breeds or populations for several downstream applications. To identify the most discriminating genetic markers among thousands of genotyped SNPs, a few statistical approaches have been proposed. In this work, we compared several SNP preselection methods (Delta, Fst and principal component analysis (PCA)) in addition to Random Forest classifications to analyse SNP data from six dairy cattle breeds, including cosmopolitan breeds (Holstein, Brown and Simmental) and autochthonous Italian breeds raised in two different regions and subjected to limited or no breeding programmes (Cinisara and Modicana, raised only in Sicily, and Reggiana, raised only in Emilia Romagna). From these classifications, two panels of 96 and 48 SNPs containing the most discriminant SNPs were created for each preselection method. These panels were evaluated in terms of their ability to discriminate as a whole and breed-by-breed, as well as linkage disequilibrium within each panel. The results showed that for the 48-SNP panels, the error rate increased mainly for autochthonous breeds, probably as a consequence of their admixed origin, lower selection pressure and ascertainment bias in the construction of the SNP chip. The 96-SNP panels were generally better able to discriminate all breeds. The panel derived by PCA-chrom (obtained by preselection chromosome by chromosome) could identify informative SNPs that were particularly useful for the assignment of minor breeds, which reached the lowest Out Of Bag error values, even for the Cinisara, whose value was quite high in all other panels. Moreover, this panel also contained the lowest number of SNPs in linkage disequilibrium. Several selected SNPs are located near genes affecting breed-specific phenotypic traits (coat colour and stature) or associated with production traits. 
In general, our results demonstrated the usefulness of Random Forest in combination with other reduction techniques to identify population informative SNPs.

  5. Geographic differences in allele frequencies of susceptibility SNPs for cardiovascular disease

    PubMed Central

    2011-01-01

    Background: We hypothesized that the frequencies of risk alleles of SNPs mediating susceptibility to cardiovascular diseases differ among populations of varying geographic origin and that population-specific selection has operated on some of these variants. Methods: From the database of genome-wide association studies (GWAS), we selected 36 cardiovascular phenotypes including coronary heart disease, hypertension, and stroke, as well as related quantitative traits (e.g., body mass index and plasma lipid levels). We identified 292 SNPs in 270 genes associated with a disease or trait at P < 5 × 10⁻⁸. As part of the Human Genome-Diversity Project (HGDP), 158 (54.1%) of these SNPs have been genotyped in 938 individuals belonging to 52 populations from seven geographic areas. A measure of population differentiation, FST, was calculated to quantify differences in risk allele frequencies (RAFs) among populations and geographic areas. Results: Large differences in RAFs were noted in populations of Africa, East Asia, America and Oceania, when compared with other geographic regions. The mean global FST (0.1042) for 158 SNPs among the populations was not significantly higher than the mean global FST of 158 autosomal SNPs randomly sampled from the HGDP database. Significantly higher global FST (P < 0.05) was noted in eight SNPs, based on an empirical distribution of global FST of 2036 putatively neutral SNPs. For four of these SNPs, additional evidence of selection was noted based on the integrated Haplotype Score. Conclusion: Large differences in RAFs for a set of common SNPs that influence risk of cardiovascular disease were noted between the major world populations. Pairwise comparisons revealed RAF differences for at least eight SNPs that might be due to population-specific selection or demographic factors. These findings are relevant to a better understanding of geographic variation in the prevalence of cardiovascular disease. PMID:21507254
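
To make the FST measure in this abstract concrete, here is a minimal sketch of a heterozygosity-based FST for one biallelic SNP, FST = (HT - HS) / HT, with equal weight given to each population. The abstract does not state which estimator the study used, so this textbook form, the function name and the toy frequencies are assumptions.

```python
def fst_biallelic(freqs):
    """FST = (H_T - H_S) / H_T for one biallelic SNP, populations weighted equally.

    freqs holds the risk-allele frequency in each population.
    """
    p_bar = sum(freqs) / len(freqs)
    h_t = 2 * p_bar * (1 - p_bar)                      # expected heterozygosity, pooled
    if h_t == 0:
        return 0.0                                     # monomorphic everywhere
    h_s = sum(2 * p * (1 - p) for p in freqs) / len(freqs)  # mean within-population
    return (h_t - h_s) / h_t

print(fst_biallelic([0.5, 0.5]))   # identical frequencies, no differentiation
print(fst_biallelic([0.2, 0.8]))   # strongly diverged frequencies
print(fst_biallelic([0.0, 1.0]))   # fixed for different alleles
```

Comparing each candidate SNP's value against the distribution of the same statistic over putatively neutral SNPs, as the abstract describes, is what turns this descriptive number into an empirical test.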

  6. Applications of random forest feature selection for fine-scale genetic population assignment.

    PubMed

    Sylvester, Emma V A; Bentzen, Paul; Bradbury, Ian R; Clément, Marie; Pearce, Jon; Horne, John; Beiko, Robert G

    2018-02-01

    Genetic population assignment used to inform wildlife management and conservation efforts requires panels of highly informative genetic markers and sensitive assignment tests. We explored the utility of machine-learning algorithms (random forest, regularized random forest and guided regularized random forest) compared with FST ranking for selection of single nucleotide polymorphisms (SNPs) for fine-scale population assignment. We applied these methods to an unpublished SNP data set for Atlantic salmon (Salmo salar) and a published SNP data set for Alaskan Chinook salmon (Oncorhynchus tshawytscha). In each species, we identified the minimum panel size required to obtain a self-assignment accuracy of at least 90% using each method to create panels of 50-700 markers. Panels of SNPs identified using random forest-based methods performed up to 7.8 and 11.2 percentage points better than FST-selected panels of similar size for the Atlantic salmon and Chinook salmon data, respectively. Self-assignment accuracy ≥90% was obtained with panels of 670 and 384 SNPs for each data set, respectively, a level of accuracy never reached for these species using FST-selected panels. Our results demonstrate a role for machine-learning approaches in marker selection across large genomic data sets to improve assignment for management and conservation of exploited populations.

  7. Genome-wide association study for backfat thickness in Canchim beef cattle using Random Forest approach

    PubMed Central

    2013-01-01

    Background: Meat quality involves many traits, such as marbling, tenderness, juiciness, and backfat thickness, all of which require attention from livestock producers. Improving backfat thickness by means of traditional selection techniques in Canchim beef cattle has been challenging because of its low heritability and because it is measured late in an animal’s life. Therefore, the implementation of new methodologies for identifying single nucleotide polymorphisms (SNPs) linked to backfat thickness is an important strategy for genetic improvement of carcass and meat quality. Results: The set of SNPs identified by the random forest approach explained as much as 50% of the deregressed estimated breeding value (dEBV) variance associated with backfat thickness, and a small set of 5 SNPs was able to explain 34% of the dEBV for backfat thickness. Several quantitative trait loci (QTL) for fat-related traits were found in the areas surrounding the SNPs, as well as many genes with roles in lipid metabolism. Conclusions: These results provide a better understanding of backfat deposition and regulation pathways, and can be considered a starting point for future implementation of a genomic selection program for backfat thickness in Canchim beef cattle. PMID:23738659

  8. Genotyping by sequencing for genomic prediction in a soybean breeding population.

    PubMed

    Jarquín, Diego; Kocak, Kyle; Posadas, Luis; Hyma, Katie; Jedlicka, Joseph; Graef, George; Lorenz, Aaron

    2014-08-29

    Advances in genotyping technology, such as genotyping by sequencing (GBS), are making genomic prediction more attractive as a way to reduce breeding cycle times and the costs associated with phenotyping. Genomic prediction and selection have been studied in several crop species, but no reports exist in soybean. The objectives of this study were to (i) evaluate prospects for genomic selection using GBS in a typical soybean breeding program and (ii) evaluate the effect of GBS marker selection and imputation on genomic prediction accuracy. To achieve these objectives, a set of soybean lines sampled from the University of Nebraska Soybean Breeding Program was genotyped using GBS and evaluated for yield and other agronomic traits at multiple Nebraska locations. Genotyping by sequencing scored 16,502 single nucleotide polymorphisms (SNPs) with minor-allele frequency (MAF) > 0.05 and percentage of missing values ≤ 5% on 301 elite soybean breeding lines. When SNPs with up to 80% missing values were included, 52,349 SNPs were scored. Prediction accuracy for grain yield, assessed using cross validation, was estimated to be 0.64, indicating good potential for using genomic selection for grain yield in soybean. Filtering SNPs based on missing data percentage had little to no effect on prediction accuracy, especially when random forest imputation was used to impute missing values. The highest accuracies were observed when random forest imputation was used on all SNPs, but differences were not significant. A standard additive G-BLUP model was robust; modeling additive-by-additive epistasis did not provide any improvement in prediction accuracy. The effect of training population size on accuracy began to plateau around 100, but accuracy steadily climbed until the largest possible size was used in this analysis. Including only SNPs with MAF > 0.30 provided higher accuracies when training populations were smaller. 
Using GBS for genomic prediction in soybean holds good potential to expedite genetic gain. Our results suggest that standard additive G-BLUP models can be used on unfiltered, imputed GBS data without loss in accuracy.
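
The marker filters quoted in this abstract (MAF > 0.05, missing values ≤ 5%, or a relaxed limit of 80% missing) can be sketched as below. The genotype coding (0/1/2 allele counts with `None` for a missing call), the SNP names and the toy data are assumptions for illustration, not the study's pipeline.

```python
def maf_and_missing(genotypes):
    """Return (minor-allele frequency, missing rate) for one SNP.

    genotypes: allele counts coded 0/1/2, with None for a missing call.
    """
    called = [g for g in genotypes if g is not None]
    missing_rate = 1 - len(called) / len(genotypes)
    if not called:
        return 0.0, missing_rate
    freq = sum(called) / (2 * len(called))
    return min(freq, 1 - freq), missing_rate

def filter_snps(matrix, min_maf=0.05, max_missing=0.05):
    """Keep SNP names with MAF > min_maf and missing rate <= max_missing."""
    kept = []
    for name, genotypes in matrix.items():
        maf, missing = maf_and_missing(genotypes)
        if maf > min_maf and missing <= max_missing:
            kept.append(name)
    return kept

# Hypothetical calls for 10 inbred lines at 3 SNPs (illustrative values only)
snps = {
    "snp_a": [0, 2, 2, 0, 2, 0, 0, 2, 0, 2],        # common, fully called
    "snp_b": [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],        # monomorphic
    "snp_c": [0, 2, None, None, 2, 0, 2, 0, 0, 2],  # 20% missing
}
strict = filter_snps(snps)                     # MAF > 0.05, missing <= 5%
relaxed = filter_snps(snps, max_missing=0.80)  # allow up to 80% missing
print(strict, relaxed)
```

Relaxing `max_missing` admits more markers, which then have to be imputed before they can be used in a prediction model.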

  9. Genotyping of 75 SNPs using arrays for individual identification in five population groups.

    PubMed

    Hwa, Hsiao-Lin; Wu, Lawrence Shih Hsin; Lin, Chun-Yen; Huang, Tsun-Ying; Yin, Hsiang-I; Tseng, Li-Hui; Lee, James Chun-I

    2016-01-01

    Single nucleotide polymorphism (SNP) typing offers promise to forensic genetics. Various strategies and panels for analyzing SNP markers for individual identification have been published. However, the best panels with fewer identity SNPs for all major population groups are still under discussion. This study aimed to find more autosomal SNPs with high heterozygosity for individual identification among Asian populations. Ninety-six autosomal SNPs of 502 DNA samples from unrelated individuals of five population groups (208 Taiwanese Han, 83 Filipinos, 62 Thais, 69 Indonesians, and 80 individuals with European, Near Eastern, or South Asian ancestry) were analyzed using arrays in an initial screening, and 75 SNPs (group A, 46 newly selected SNPs; group B, 29 SNPs based on a previous SNP panel) were selected for further statistical analyses. Some SNPs with high heterozygosity from Asian populations were identified. The combined random match probability of the best 40 and 45 SNPs was between 3.16 × 10⁻¹⁷ and 7.75 × 10⁻¹⁷ and between 2.33 × 10⁻¹⁹ and 7.00 × 10⁻¹⁹, respectively, in all five populations. These loci offer comparable power to short tandem repeats (STRs) for routine forensic profiling. In this study, we demonstrated the population genetic characteristics and forensic parameters of 75 SNPs with high heterozygosity from five population groups. This SNP panel can provide valuable genotypic information and can be helpful in forensic casework for individual identification among these populations.
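
The combined random match probability reported in this abstract can be illustrated with a short sketch. It assumes Hardy-Weinberg proportions at each biallelic SNP and independence between SNPs, and the allele frequencies are hypothetical; the study's own allele frequencies and any corrections it applied are not given in the abstract.

```python
def locus_match_probability(p):
    """Chance that two unrelated people share a genotype at one biallelic SNP.

    Under Hardy-Weinberg proportions this is the sum of squared genotype
    frequencies: (p^2)^2 + (2pq)^2 + (q^2)^2.
    """
    q = 1 - p
    return p ** 4 + (2 * p * q) ** 2 + q ** 4

def combined_match_probability(freqs):
    """Multiply per-locus match probabilities, assuming independent loci."""
    result = 1.0
    for p in freqs:
        result *= locus_match_probability(p)
    return result

# Hypothetical panel: 40 SNPs, each with allele frequency 0.5 (the best case)
best_case_40 = combined_match_probability([0.5] * 40)
print(locus_match_probability(0.5))  # per-locus value at maximum heterozygosity
print(best_case_40)
```

The per-locus value is smallest when both alleles are equally common, which is why panels for individual identification favour SNPs with high heterozygosity.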

  10. Selenium- or vitamin E-related gene variants, interaction with supplementation, and risk of high-grade prostate cancer in SELECT

    PubMed Central

    Chan, June M.; Darke, Amy K.; Penney, Kathryn L.; Tangen, Catherine M.; Goodman, Phyllis J.; Lee, Gwo-Shu Mary; Sun, Tong; Peisch, Sam; Tinianow, Alex M.; Rae, James M.; Klein, Eric A.; Thompson, Ian M.

    2016-01-01

    Background: Epidemiological studies and secondary analyses of randomized trials supported the hypothesis that selenium and vitamin E lower prostate cancer risk. However, the Selenium and Vitamin E Cancer Prevention Trial (SELECT) showed no benefit of either supplement. Genetic variants involved in selenium or vitamin E metabolism or transport may underlie the complex associations of selenium and vitamin E. Methods: We undertook a case-cohort study of SELECT participants randomized to placebo, selenium or vitamin E. The subcohort included 1,434 men; our primary outcome was high-grade prostate cancer (N=278 cases, Gleason 7 or higher cancer). We used weighted Cox regression to examine the association between SNPs and high-grade prostate cancer risk. To assess effect modification, we created interaction terms between randomization arm and genotype and calculated log likelihood statistics. Results: We noted statistically significant (p<0.05) interactions between selenium assignment, SNPs in CAT, SOD2, PRDX6, SOD3, and TXNRD2 and high-grade prostate cancer risk. Statistically significant SNPs that modified the association of vitamin E assignment and high-grade prostate cancer included SEC14L2, SOD1, and TTPA. In the placebo arm, several SNPs, hypothesized to interact with supplement assignment and risk of high-grade prostate cancer, were also directly associated with outcome. Conclusion: Variants in selenium and vitamin E metabolism/transport genes may influence risk of overall and high-grade prostate cancer, and may modify an individual man’s response to vitamin E or selenium supplementation with regard to these risks. Impact: The effect of selenium or vitamin E supplementation on high-grade prostate cancer risk may vary by genotype. PMID:27197287

  11. Resampling procedures to identify important SNPs using a consensus approach.

    PubMed

    Pardy, Christopher; Motyer, Allan; Wilson, Susan

    2011-11-29

    Our goal is to identify common single-nucleotide polymorphisms (SNPs) (minor allele frequency > 1%) that add predictive accuracy above that gained by knowledge of easily measured clinical variables. We take an algorithmic approach to predict each phenotypic variable using a combination of phenotypic and genotypic predictors. We perform our procedure on the first simulated replicate and then validate against the others. Our procedure performs well when predicting Q1 but is less successful for the other outcomes. We use resampling procedures where possible to guard against false positives and to improve generalizability. The approach is based on finding a consensus regarding important SNPs by applying random forests and the least absolute shrinkage and selection operator (LASSO) on multiple subsamples. Random forests are used first to discard unimportant predictors, narrowing our focus to roughly 100 important SNPs. A cross-validation LASSO is then used to further select variables. We combine these procedures to guarantee that cross-validation can be used to choose a shrinkage parameter for the LASSO. If the clinical variables were unavailable, this prefiltering step would be essential. We perform the SNP-based analyses simultaneously rather than one at a time to estimate SNP effects in the presence of other causal variants. We analyzed the first simulated replicate of Genetic Analysis Workshop 17 without knowledge of the true model. Post-conference knowledge of the simulation parameters allowed us to investigate the limitations of our approach. We found that many of the false positives we identified were substantially correlated with genuine causal SNPs.

  12. Screening large-scale association study data: exploiting interactions using random forests.

    PubMed

    Lunetta, Kathryn L; Hayward, L Brooke; Segal, Jonathan; Van Eerdewegh, Paul

    2004-12-10

    Genome-wide association studies for complex diseases will produce genotypes on hundreds of thousands of single nucleotide polymorphisms (SNPs). A logical first approach to dealing with massive numbers of SNPs is to use some test to screen the SNPs, retaining only those that meet some criterion for further study. For example, SNPs can be ranked by p-value, and those with the lowest p-values retained. When SNPs have large interaction effects but small marginal effects in a population, they are unlikely to be retained when univariate tests are used for screening. However, model-based screens that pre-specify interactions are impractical for data sets with thousands of SNPs. Random forest analysis is an alternative method that produces a single measure of importance for each predictor variable that takes into account interactions among variables without requiring model specification. Interactions increase the importance for the individual interacting variables, making them more likely to be given high importance relative to other variables. We test the performance of random forests as a screening procedure to identify small numbers of risk-associated SNPs from among large numbers of unassociated SNPs using complex disease models with up to 32 loci, incorporating both genetic heterogeneity and multi-locus interaction. Keeping other factors constant, if risk SNPs interact, the random forest importance measure significantly outperforms the Fisher Exact test as a screening tool. As the number of interacting SNPs increases, the improvement in performance of random forest analysis relative to Fisher Exact test for screening also increases. Random forests perform similarly to the univariate Fisher Exact test as a screening tool when SNPs in the analysis do not interact. 
In the context of large-scale genetic association studies where unknown interactions exist among true risk-associated SNPs or SNPs and environmental covariates, screening SNPs using random forest analyses can significantly reduce the number of SNPs that need to be retained for further study compared to standard univariate screening methods.
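
The univariate baseline that this abstract compares random forests against, ranking SNPs by Fisher exact test p-value, can be sketched with the standard library alone. The 2x2 allele-count tables and SNP names are invented for illustration, and the two-sided rule used here (sum the probabilities of all tables no more likely than the observed one) is one common convention rather than something stated in the abstract.

```python
from math import comb

def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1, n = a + b, c + d, a + c, a + b + c + d
    denom = comb(n, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more likely than the observed one
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))

# Hypothetical allele counts: (case allele A, case allele a, control A, control a)
tables = {
    "snp1": (30, 10, 15, 25),
    "snp2": (20, 20, 21, 19),
    "snp3": (3, 1, 1, 3),
}
p_values = {name: fisher_exact_two_sided(*t) for name, t in tables.items()}
ranked = sorted(p_values, key=p_values.get)  # smallest p-value first
print(ranked)
```

Because each SNP is tested on its own, a screen like this can only reward marginal effects, which is the limitation the abstract contrasts with random forest importance.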

  13. On marker-based parentage verification via non-linear optimization.

    PubMed

    Boerner, Vinzent

    2017-06-15

    Parentage verification by molecular markers is mainly based on short tandem repeat markers. Single nucleotide polymorphisms (SNPs) as bi-allelic markers have become the markers of choice for genotyping projects. Thus, the subsequent step is to use SNP genotypes for parentage verification as well. Recent developments of algorithms such as evaluating opposing homozygous SNP genotypes have drawbacks, for example the inability to reject all animals of a sample of potential parents. This paper describes an algorithm for parentage verification by constrained regression which overcomes the latter limitation and proves to be very fast and accurate even when the number of SNPs is as low as 50. The algorithm was tested on a sample of 14,816 animals with 50, 100 and 500 SNP genotypes randomly selected from 40k genotypes. The samples of putative parents of these animals contained either five random animals, or four random animals and the true sire. Parentage assignment was performed by ranking of regression coefficients, or by setting a minimum threshold for regression coefficients. The assignment quality was evaluated by the power of assignment (Pa) and the power of exclusion (Pe). If the sample of putative parents contained the true sire and parentage was assigned by coefficient ranking, Pa and Pe were both higher than 0.99 for the 500 and 100 SNP genotypes, and higher than 0.98 for the 50 SNP genotypes. When parentage was assigned by a coefficient threshold, Pe was higher than 0.99 regardless of the number of SNPs, but Pa decreased from 0.99 (500 SNPs) to 0.97 (100 SNPs) and 0.92 (50 SNPs). If the sample of putative parents did not contain the true sire and parentage was rejected using a coefficient threshold, the algorithm achieved a Pe of 1 (500 SNPs), 0.99 (100 SNPs) and 0.97 (50 SNPs). 
The algorithm described here is easy to implement, fast and accurate, and is able to assign parentage using genomic marker data with a size as low as 50 SNPs.
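
For context, the older check that this abstract mentions, counting opposing homozygous genotypes between an offspring and a candidate parent, can be sketched in a few lines. This illustrates only that earlier approach, not the paper's constrained regression; the 0/1/2 genotype coding and the toy genotypes are assumptions.

```python
def opposing_homozygotes(offspring, candidate):
    """Count SNPs where one animal is homozygous 0 and the other homozygous 2.

    Genotypes are coded as allele counts 0/1/2. A true parent should show
    no such conflicts, apart from genotyping errors.
    """
    return sum(1 for o, c in zip(offspring, candidate) if {o, c} == {0, 2})

# Hypothetical genotypes at 6 SNPs (illustrative values only)
offspring = [0, 2, 1, 2, 0, 2]
true_sire = [1, 2, 1, 1, 0, 2]
unrelated = [2, 0, 1, 0, 2, 2]
print(opposing_homozygotes(offspring, true_sire))  # no conflicts
print(opposing_homozygotes(offspring, unrelated))  # several conflicts
```

A count-based rule like this needs a tolerance for genotyping errors, and with few SNPs an unrelated animal can fall under that tolerance by chance.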

  14. A small number of candidate gene SNPs reveal continental ancestry in African Americans

    PubMed Central

    Kodaman, Nuri; Aldrich, Melinda C.; Smith, Jeffrey R.; Signorello, Lisa B.; Bradley, Kevin; Breyer, Joan; Cohen, Sarah S.; Long, Jirong; Cai, Qiuyin; Giles, Justin; Bush, William S.; Blot, William J.; Matthews, Charles E.; Williams, Scott M.

    2013-01-01

    SUMMARY Using genetic data from an obesity candidate gene study of self-reported African Americans and European Americans, we investigated the number of Ancestry Informative Markers (AIMs) and candidate gene SNPs necessary to infer continental ancestry. Proportions of African and European ancestry were assessed with STRUCTURE (K=2), using 276 AIMs. These reference values were compared to estimates derived using 120, 60, 30, and 15 SNP subsets randomly chosen from the 276 AIMs and from 1144 SNPs in 44 candidate genes. All subsets generated estimates of ancestry consistent with the reference estimates, with mean correlations greater than 0.99 for all subsets of AIMs, and mean correlations of 0.99±0.003; 0.98± 0.01; 0.93±0.03; and 0.81± 0.11 for subsets of 120, 60, 30, and 15 candidate gene SNPs, respectively. Among African Americans, the median absolute difference from reference African ancestry values ranged from 0.01 to 0.03 for the four AIMs subsets and from 0.03 to 0.09 for the four candidate gene SNP subsets. Furthermore, YRI/CEU Fst values provided a metric to predict the performance of candidate gene SNPs. Our results demonstrate that a small number of SNPs randomly selected from candidate genes can be used to estimate admixture proportions in African Americans reliably. PMID:23278390

  15. Statistical modelling of growth using a mixed model with orthogonal polynomials.

    PubMed

    Suchocki, T; Szyda, J

    2011-02-01

    In statistical modelling, the effects of single-nucleotide polymorphisms (SNPs) are often regarded as time-independent. However, for traits recorded repeatedly, it is of interest to investigate the behaviour of gene effects over time. In this analysis, simulated data from the 13th QTL-MAS Workshop (Wageningen, The Netherlands, April 2009) were used, and the major goal was to model genetic effects as time-dependent. For this purpose, we fitted a mixed model that describes each effect using third-order Legendre orthogonal polynomials in order to account for the correlation between consecutive measurements. In this model, SNPs are modelled as fixed effects, while the environment is modelled as a random effect. The maximum likelihood estimates of model parameters are obtained by the expectation-maximisation (EM) algorithm, and the significance of the additive SNP effects is based on the likelihood ratio test, with p-values corrected for multiple testing. For each significant SNP, the percentage of the total variance contributed by this SNP is calculated. Moreover, future yields are predicted using a model that simultaneously incorporates the effects of all of the SNPs. As a result, 179 of the 453 SNPs, covering 16 out of 18 true quantitative trait loci (QTL), were selected. The correlation between predicted and true breeding values was 0.73 for the data set with all SNPs and 0.84 for the data set with selected SNPs. In conclusion, we showed that a longitudinal approach allows for estimating changes in the variance contributed by each SNP over time and demonstrated that, for prediction, the pre-selection of SNPs plays an important role.
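
The third-order Legendre covariates mentioned in this abstract can be sketched as follows. The sketch uses the standard, unnormalised Legendre polynomials on time rescaled to [-1, 1]; whether the authors used a normalised variant, and the measurement times shown here, are assumptions.

```python
def standardise_time(t, t_min, t_max):
    """Map a measurement time onto [-1, 1], where Legendre polynomials are orthogonal."""
    return 2 * (t - t_min) / (t_max - t_min) - 1

def legendre_covariates(x):
    """Legendre polynomials P0..P3 evaluated at x in [-1, 1]."""
    return [
        1.0,
        x,
        (3 * x ** 2 - 1) / 2,
        (5 * x ** 3 - 3 * x) / 2,
    ]

# Hypothetical measurement times (illustrative values only)
times = [0, 132, 265, 397, 530]
design = [legendre_covariates(standardise_time(t, times[0], times[-1])) for t in times]
for row in design:
    print(row)
```

In a model of this kind, each time-dependent effect is a weighted sum of these four columns, so a SNP contributes four coefficients rather than one.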

  16. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

    PubMed

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. 
The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNPs selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal.

  17. Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications

    PubMed Central

    Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

    2016-01-01

    Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. 
The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNPs selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal. PMID:27583971
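
    The locus-averaged Shannon entropy mentioned in this abstract can be illustrated with a minimal sketch. It assumes biallelic SNPs, entropy measured in bits, and a plain arithmetic mean over loci; the abstract does not give the authors' exact formula or the uniformity adjustment, and the function names below are hypothetical.

```python
import math


def locus_entropy(p):
    """Shannon entropy (in bits) of a biallelic locus with allele frequency p.

    Assumption: two alleles with frequencies p and 1 - p. A fixed locus
    (p = 0 or p = 1) carries no information and returns 0.
    """
    if p <= 0.0 or p >= 1.0:
        return 0.0
    q = 1.0 - p
    return -(p * math.log2(p) + q * math.log2(q))


def locus_averaged_entropy(freqs):
    """Mean per-locus entropy over a candidate SNP set (illustrative only)."""
    if not freqs:
        raise ValueError("at least one allele frequency is required")
    return sum(locus_entropy(p) for p in freqs) / len(freqs)
```

    Under these assumptions a SNP with allele frequency 0.5 contributes the maximum of one bit, so a panel enriched for intermediate-frequency SNPs scores higher than one dominated by rare variants.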

  18. Identification and validation of single nucleotide polymorphisms in growth- and maturation-related candidate genes in sole (Solea solea L.).

    PubMed

    Diopere, Eveline; Hellemans, Bart; Volckaert, Filip A M; Maes, Gregory E

    2013-03-01

    Genomic methodologies applied in evolutionary and fisheries research have been of great benefit to understand the marine ecosystem and the management of natural resources. Although single nucleotide polymorphisms (SNPs) are attractive for the study of local adaptation, spatial stock management and traceability, and investigating the effects of fisheries-induced selection, they have rarely been exploited in non-model organisms. This is partly due to difficulties in finding and validating SNPs in species with limited or no genomic resources. Complementary to random genome-scan approaches, a targeted candidate gene approach has the potential to unveil pre-selected functional diversity and provides more in-depth information on the action of selection at specific genes. For example, genes can be under selective pressure due to climate change and sustained periods of heavy fishing pressure. In this study, we applied a candidate gene approach in sole (Solea solea L.), an important member of the demersal ecosystem. As a consumption flatfish it is heavily exploited and has experienced associated life-history changes over the last 60 years. To discover novel genetic polymorphisms in or around genes linked to important life history traits in sole, we screened a total of 76 candidate genes related to growth and maturation using a targeted resequencing approach. We identified in total 86 putative SNPs in 22 genes and validated 29 SNPs using a multiplex single-base extension genotyping assay. We found 22 informative SNPs, of which two represent non-synonymous mutations, potentially of functional relevance. These novel markers should be rapidly and broadly applicable in analyses of natural sole populations, as a measure of the evolutionary signature of overfishing and for initiatives on marker assisted selection. Copyright © 2012 Elsevier B.V. All rights reserved.

  19. SNPs selected by information content outperform randomly selected microsatellite loci for delineating genetic identification and introgression in the endangered dark European honeybee (Apis mellifera mellifera).

    PubMed

    Muñoz, Irene; Henriques, Dora; Jara, Laura; Johnston, J Spencer; Chávez-Galarza, Julio; De La Rúa, Pilar; Pinto, M Alice

    2017-07-01

    The honeybee (Apis mellifera) has been threatened by multiple factors including pests and pathogens, pesticides and loss of locally adapted gene complexes due to replacement and introgression. In western Europe, the genetic integrity of the native A. m. mellifera (M-lineage) is endangered due to trading and intensive queen breeding with commercial subspecies of eastern European ancestry (C-lineage). Effective conservation actions require reliable molecular tools to identify pure-bred A. m. mellifera colonies. Microsatellites have been preferred for identification of A. m. mellifera stocks across conservation centres. However, owing to high throughput, easy transferability between laboratories and low genotyping error, SNPs promise to become popular. Here, we compared the resolving power of a widely utilized microsatellite set to detect structure and introgression with that of different sets that combine a variable number of SNPs selected for their information content and genomic proximity to the microsatellite loci. Contrary to every SNP data set, microsatellites did not discriminate between the two lineages in the PCA space. Mean introgression proportions were identical across the two marker types, although at the individual level, microsatellites' performance was relatively poor at the upper range of Q-values, a result reflected by their lower precision. Our results suggest that SNPs are more accurate and powerful than microsatellites for identification of A. m. mellifera colonies, especially when they are selected by information content. © 2016 John Wiley & Sons Ltd.

  20. Genetic parameters and signatures of selection in two divergent laying hen lines selected for feather pecking behaviour.

    PubMed

    Grams, Vanessa; Wellmann, Robin; Preuß, Siegfried; Grashorn, Michael A; Kjaer, Jörgen B; Bessei, Werner; Bennewitz, Jörn

    2015-09-30

    Feather pecking (FP) in laying hens is a well-known and multi-factorial behaviour with a genetic background. In a selection experiment, two lines were developed for 11 generations for high (HFP) and low (LFP) feather pecking, respectively. Starting with the second generation of selection, there was a constant difference in mean number of FP bouts between both lines. We used the data from this experiment to perform a quantitative genetic analysis and to map selection signatures. Pedigree and phenotypic data were available for the last six generations of both lines. Univariate quantitative genetic analyses were conducted using mixed linear and generalized mixed linear models assuming a Poisson distribution. Selection signatures were mapped using 33,228 single nucleotide polymorphisms (SNPs) genotyped on 41 HFP and 34 LFP individuals of generation 11. For each SNP, we estimated Wright's fixation index (FST). We tested the null hypothesis that FST is driven purely by genetic drift against the alternative hypothesis that it is driven by genetic drift and selection. The mixed linear model failed to analyze the LFP data because of the large number of 0s in the observation vector. The Poisson model fitted the data well and revealed a small but continuous genetic trend in both lines. Most of the 17 genome-wide significant SNPs were located on chromosomes 3 and 4. Thirteen clusters with at least two significant SNPs within an interval of 3 Mb maximum were identified. Two clusters were mapped on chromosomes 3, 4, 8 and 19. Of the 17 genome-wide significant SNPs, 12 were located within the identified clusters. This indicates a non-random distribution of significant SNPs and points to the presence of selection sweeps. Data on FP should be analysed using generalised linear mixed models assuming a Poisson distribution, especially if the number of FP bouts is small and the distribution is heavily peaked at 0. 
The FST-based approach was suitable to map selection signatures that need to be confirmed by linkage or association mapping.
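
    A per-SNP fixation index of the kind described in this abstract can be sketched as follows. This is a minimal illustration that assumes a biallelic SNP, two equally weighted lines, and the classical heterozygosity-based definition FST = (HT - HS) / HT; the abstract does not state which estimator the authors used, and the function name is hypothetical.

```python
def fst_two_lines(p1, p2):
    """Heterozygosity-based FST for one biallelic SNP in two populations.

    p1, p2: frequency of the same allele in each line.
    Assumption: both lines get equal weight and no sample-size
    correction is applied.
    """
    p_bar = (p1 + p2) / 2.0
    h_total = 2.0 * p_bar * (1.0 - p_bar)  # expected heterozygosity, pooled
    if h_total == 0.0:
        return 0.0  # allele fixed in both lines: no variation to partition
    h_within = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0
    return (h_total - h_within) / h_total
```

    With this definition, identical allele frequencies in the two lines give a value of 0 and alternative fixation gives 1, so SNPs near 1 are the candidates for selection signatures.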

  1. A reduced number of mtSNPs saturates mitochondrial DNA haplotype diversity of worldwide population groups.

    PubMed

    Salas, Antonio; Amigo, Jorge

    2010-05-03

    The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations ("African-Americans" and "Hispanics") were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of "universal" mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of mtSNPs is more efficient than previous empirical approaches. 
In contrast to precedent findings, the results seem to indicate that only few mtSNPs are needed to reach high levels of discrimination power in a population, independently of its ancestral background.
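
    The simulation idea described in this abstract, scoring randomly chosen mtSNP subsets by haplotype diversity, can be sketched as below. This is an illustrative reading, not the authors' implementation: it assumes Nei's unbiased haplotype diversity as the measure, sequences given as equal-length strings, and hypothetical function names.

```python
import random
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's haplotype diversity: n / (n - 1) * (1 - sum of squared frequencies)."""
    n = len(haplotypes)
    if n < 2:
        return 0.0
    counts = Counter(haplotypes)
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


def diversity_of_panel(sequences, sites):
    """Diversity of the haplotypes defined only by the chosen site indices."""
    return haplotype_diversity([tuple(seq[i] for i in sites) for seq in sequences])


def best_random_panel(sequences, candidate_sites, panel_size, n_trials, seed=0):
    """Draw random panels of a fixed size and keep the most diverse one."""
    rng = random.Random(seed)
    best_h, best_sites = -1.0, ()
    for _ in range(n_trials):
        sites = tuple(sorted(rng.sample(candidate_sites, panel_size)))
        h = diversity_of_panel(sequences, sites)
        if h > best_h:
            best_h, best_sites = h, sites
    return best_h, best_sites
```

    Repeating the search for increasing panel sizes and comparing each result with the diversity obtained from all candidate sites would show the panel size at which diversity saturates.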

  2. A Reduced Number of mtSNPs Saturates Mitochondrial DNA Haplotype Diversity of Worldwide Population Groups

    PubMed Central

    Salas, Antonio; Amigo, Jorge

    2010-01-01

    Background The high levels of variation characterising the mitochondrial DNA (mtDNA) molecule are due ultimately to its high average mutation rate; moreover, mtDNA variation is deeply structured in different populations and ethnic groups. There is growing interest in selecting a reduced number of mtDNA single nucleotide polymorphisms (mtSNPs) that account for the maximum level of discrimination power in a given population. Applications of the selected mtSNP panel range from anthropologic and medical studies to forensic genetic casework. Methodology/Principal Findings This study proposes a new simulation-based method that explores the ability of different mtSNP panels to yield the maximum levels of discrimination power. The method explores subsets of mtSNPs of different sizes randomly chosen from a preselected panel of mtSNPs based on frequency. More than 2,000 complete genomes representing three main continental human population groups (Africa, Europe, and Asia) and two admixed populations (“African-Americans” and “Hispanics”) were collected from GenBank and the literature, and were used as training sets. Haplotype diversity was measured for each combination of mtSNP and compared with existing mtSNP panels available in the literature. The data indicates that only a reduced number of mtSNPs ranging from six to 22 are needed to account for 95% of the maximum haplotype diversity of a given population sample. However, only a small proportion of the best mtSNPs are shared between populations, indicating that there is not a perfect set of “universal” mtSNPs suitable for all population contexts. The discrimination power provided by these mtSNPs is much higher than the power of the mtSNP panels proposed in the literature to date. Some mtSNP combinations also yield high diversity values in admixed populations. 
Conclusions/Significance The proposed computational approach for exploring combinations of mtSNPs that optimise the discrimination power of a given set of mtSNPs is more efficient than previous empirical approaches. In contrast to precedent findings, the results seem to indicate that only few mtSNPs are needed to reach high levels of discrimination power in a population, independently of its ancestral background. PMID:20454657

  3. Linear reduction method for predictive and informative tag SNP selection.

    PubMed

    He, Jingwu; Westbrooks, Kelly; Zelikovsky, Alexander

    2005-01-01

    Constructing a complete human haplotype map is helpful when associating complex diseases with their related SNPs. Unfortunately, the number of SNPs is very large and it is costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNPs that should be sequenced to a small number of informative representatives called tag SNPs. In this paper, we propose a new linear algebra-based method for selecting and using tag SNPs. We measure the quality of our tag SNP selection algorithm by comparing actual SNPs with SNPs predicted from selected linearly independent tag SNPs. Our experiments show that for sufficiently long haplotypes, knowing only 0.4% of all SNPs the proposed linear reduction method predicts an unknown haplotype with the error rate below 2% based on 10% of the population.

  4. Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines.

    PubMed

    Cericola, Fabio; Jahoor, Ahmed; Orabi, Jihad; Andersen, Jeppe R; Janss, Luc L; Jensen, Just

    2017-01-01

    Wheat breeding programs generate a large amount of variation which cannot be completely explored because of limited phenotyping throughput. Genomic prediction (GP) has been proposed as a new tool which provides breeding values estimations without the need of phenotyping all the material produced but only a subset of it named training population (TP). However, genotyping of all the accessions under analysis is needed and, therefore, optimizing TP dimension and genotyping strategy is pivotal to implement GP in commercial breeding schemes. Here, we explored the optimum TP size and we integrated pedigree records and genome wide association studies (GWAS) results to optimize the genotyping strategy. A total of 988 advanced wheat breeding lines were genotyped with the Illumina 15K SNPs wheat chip and phenotyped across several years and locations for yield, lodging, and starch content. Cross-validation using the largest possible TP size and all the SNPs available after editing (~11k), yielded predictive abilities (rGP) ranging between 0.5-0.6. In order to explore the Training population size, rGP were computed using progressively smaller TP. These exercises showed that TP of around 700 lines were enough to yield the highest observed rGP. Moreover, rGP were calculated by randomly reducing the SNPs number. This showed that around 1K markers were enough to reach the highest observed rGP. GWAS was used to identify markers associated with the traits analyzed. A GWAS-based selection of SNPs resulted in increased rGP when compared with random selection and few hundreds SNPs were sufficient to obtain the highest observed rGP. For each of these scenarios, advantages of adding the pedigree information were shown. Our results indicate that moderate TP sizes were enough to yield high rGP and that pedigree information and GWAS results can be used to greatly optimize the genotyping strategy.

  5. SNP Discovery in the Transcriptome of White Pacific Shrimp Litopenaeus vannamei by Next Generation Sequencing

    PubMed Central

    Yu, Yang; Wei, Jiankai; Zhang, Xiaojun; Liu, Jingwen; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

    2014-01-01

    The application of next generation sequencing technology has greatly facilitated high throughput single nucleotide polymorphism (SNP) discovery and genotyping in genetic research. In the present study, SNPs were discovered based on two transcriptomes of Litopenaeus vannamei (L. vannamei) generated from Illumina sequencing platform HiSeq 2000. One transcriptome of L. vannamei was obtained through sequencing on the RNA from larvae at mysis stage and its reference sequence was de novo assembled. The data from another transcriptome were downloaded from NCBI and the reads of the two transcriptomes were mapped separately to the assembled reference by BWA. SNP calling was performed using SAMtools. A total of 58,717 and 36,277 SNPs with high quality were predicted from the two transcriptomes, respectively. SNP calling was also performed using the reads of two transcriptomes together, and a total of 96,040 SNPs with high quality were predicted. Among these 96,040 SNPs, 5,242 and 29,129 were predicted as non-synonymous and synonymous SNPs respectively. Characterization analysis of the predicted SNPs in L. vannamei showed that the estimated SNP frequency was 0.21% (one SNP per 476 bp) and the estimated ratio for transition to transversion was 2.0. Fifty SNPs were randomly selected for validation by Sanger sequencing after PCR amplification and 76% of SNPs were confirmed, which indicated that the SNPs predicted in this study were reliable. These SNPs will be very useful for genetic study in L. vannamei, especially for the high density linkage map construction and genome-wide association studies. PMID:24498047

  6. Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout.

    PubMed

    Al-Tobasei, Rafet; Ali, Ali; Leeds, Timothy D; Liu, Sixin; Palti, Yniv; Kenney, Brett; Salem, Mohamed

    2017-08-07

    Coding/functional SNPs change the biological function of a gene and, therefore, could serve as "large-effect" genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7-93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. 
The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout.

  7. BAC-End Sequence-Based SNP Mining in Allotetraploid Cotton (Gossypium) Utilizing Resequencing Data, Phylogenetic Inferences, and Perspectives for Genetic Mapping

    PubMed Central

    Hulse-Kemp, Amanda M.; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A.; Scheffler, Brian E.; Fang, David D.; Chen, Z. Jeffrey; Van Deynze, Allen; Stelly, David M.

    2015-01-01

    A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. PMID:25858960

  8. A tool for selecting SNPs for association studies based on observed linkage disequilibrium patterns.

    PubMed

    De La Vega, Francisco M; Isaac, Hadar I; Scafe, Charles R

    2006-01-01

    The design of genetic association studies using single-nucleotide polymorphisms (SNPs) requires the selection of subsets of the variants providing high statistical power at a reasonable cost. SNPs must be selected to maximize the probability that a causative mutation is in linkage disequilibrium (LD) with at least one marker genotyped in the study. The HapMap project performed a genome-wide survey of genetic variation with about a million SNPs typed in four populations, providing a rich resource to inform the design of association studies. A number of strategies have been proposed for the selection of SNPs based on observed LD, including construction of metric LD maps and the selection of haplotype tagging SNPs. Power calculations are important at the study design stage to ensure successful results. Integrating these methods and annotations can be challenging: the algorithms required to implement these methods are complex to deploy, and all the necessary data and annotations are deposited in disparate databases. Here, we present the SNPbrowser Software, a freely available tool to assist in the LD-based selection of markers for association studies. This stand-alone application provides fast query capabilities and swift visualization of SNPs, gene annotations, power, haplotype blocks, and LD map coordinates. Wizards implement several common SNP selection workflows including the selection of optimal subsets of SNPs (e.g. tagging SNPs). Selected SNPs are screened for their conversion potential to either TaqMan SNP Genotyping Assays or the SNPlex Genotyping System, two commercially available genotyping platforms, expediting the set-up of genetic studies with an increased probability of success.

  9. Population-based case-control study of DRD2 gene polymorphisms and alcoholism.

    PubMed

    Bhaskar, L V K S; Thangaraj, K; Non, A L; Singh, Lalji; Rao, V R

    2010-10-01

    Several independent lines of evidence for genetic contributions to vulnerability to alcoholism exist. Dopamine is thought to play a major role in the mechanism of reward and reinforcement in response to alcohol. D2 dopamine receptor (DRD2) gene has been among the stronger candidate genes implicated in alcoholism. In this study, alcohol use was assessed in 196 randomly selected Kota individuals of Nilgiri Hills, South India. Six DRD2 SNPs were assessed in 81 individuals with alcoholism and 151 controls to evaluate the association between single nucleotide polymorphisms (SNPs) and alcoholism. Of the three models (dominant, recessive, and additive) tested for association between alcoholism and DRD2 SNPs, only the additive model shows association for three loci (rs1116313, TaqID, and rs2734835). Of six studied polymorphisms, five are in strong linkage disequilibrium forming one single haplotype block. Though the global haplotype analysis with these five SNPs was not significant, haplotype analysis using all six SNPs yielded a global P value of .033, even after adjusting for age. These findings support the importance of dopamine receptor gene polymorphisms in alcoholism. Further studies to replicate these findings in different populations are needed to confirm these results.

  10. SU-D-204-06: Integration of Machine Learning and Bioinformatics Methods to Analyze Genome-Wide Association Study Data for Rectal Bleeding and Erectile Dysfunction Following Radiotherapy in Prostate Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Oh, J; Deasy, J; Kerns, S

    Purpose: We investigated whether integration of machine learning and bioinformatics techniques on genome-wide association study (GWAS) data can improve the performance of predictive models in predicting the risk of developing radiation-induced late rectal bleeding and erectile dysfunction in prostate cancer patients. Methods: We analyzed a GWAS dataset generated from 385 prostate cancer patients treated with radiotherapy. Using genotype information from these patients, we designed a machine learning-based predictive model of late radiation-induced toxicities: rectal bleeding and erectile dysfunction. The model building process was performed using 2/3 of samples (training) and the predictive model was tested with 1/3 of samples (validation). To identify important single nucleotide polymorphisms (SNPs), we computed the SNP importance score, resulting from our random forest regression model. We performed gene ontology (GO) enrichment analysis for nearby genes of the important SNPs. Results: After univariate analysis on the training dataset, we filtered out many SNPs with p>0.001, resulting in 749 and 367 SNPs that were used in the model building process for rectal bleeding and erectile dysfunction, respectively. On the validation dataset, our random forest regression model achieved the area under the curve (AUC)=0.70 and 0.62 for rectal bleeding and erectile dysfunction, respectively. We performed GO enrichment analysis for the top 25%, 50%, 75%, and 100% SNPs out of the select SNPs in the univariate analysis. When we used the top 50% SNPs, more plausible biological processes were obtained for both toxicities. An additional test with the top 50% SNPs improved predictive power with AUC=0.71 and 0.65 for rectal bleeding and erectile dysfunction. A better performance was achieved with AUC=0.67 when age and androgen deprivation therapy were added to the model for erectile dysfunction. 
Conclusion: Our approach that combines machine learning and bioinformatics techniques enabled designing better models and identifying more plausible biological processes associated with the outcomes.

  11. BAC-End Sequence-Based SNP Mining in Allotetraploid Cotton (Gossypium) Utilizing Resequencing Data, Phylogenetic Inferences, and Perspectives for Genetic Mapping.

    PubMed

    Hulse-Kemp, Amanda M; Ashrafi, Hamid; Stoffel, Kevin; Zheng, Xiuting; Saski, Christopher A; Scheffler, Brian E; Fang, David D; Chen, Z Jeffrey; Van Deynze, Allen; Stelly, David M

    2015-04-09

    A bacterial artificial chromosome library and BAC-end sequences for cultivated cotton (Gossypium hirsutum L.) have recently been developed. This report presents genome-wide single nucleotide polymorphism (SNP) mining utilizing resequencing data with BAC-end sequences as a reference by alignment of 12 G. hirsutum L. lines, one G. barbadense L. line, and one G. longicalyx Hutch and Lee line. A total of 132,262 intraspecific SNPs have been developed for G. hirsutum, whereas 223,138 and 470,631 interspecific SNPs have been developed for G. barbadense and G. longicalyx, respectively. Using a set of interspecific SNPs, 11 randomly selected and 77 SNPs that are putatively associated with the homeologous chromosome pair 12 and 26, we mapped 77 SNPs into two linkage groups representing these chromosomes, spanning a total of 236.2 cM in an interspecific F2 population (G. barbadense 3-79 × G. hirsutum TM-1). The mapping results validated the approach for reliably producing large numbers of both intraspecific and interspecific SNPs aligned to BAC-ends. This will allow for future construction of high-density integrated physical and genetic maps for cotton and other complex polyploid genomes. The methods developed will allow for future Gossypium resequencing data to be automatically genotyped for identified SNPs along the BAC-end sequence reference for anchoring sequence assemblies and comparative studies. Copyright © 2015 Hulse-Kemp et al.

  12. Design of the Illumina Porcine 50K+ SNP Iselect(TM) Beadchip and Characterization of the Porcine HapMap Population

    USDA-ARS's Scientific Manuscript database

    Using next generation sequencing technology the International Swine SNP Consortium has identified 500,000 SNPs and used these to design an Illumina Infinium iSelect™ SNP BeadChip with a selection of 60,218 SNPs. The selected SNPs include previously validated SNPs and SNPs identified de novo using se...

  13. Genetic prediction of type 2 diabetes using deep neural network.

    PubMed

    Kim, J; Kim, J; Kwak, M J; Bajaj, M

    2018-04-01

    Type 2 diabetes (T2DM) has strong heritability but genetic models to explain heritability have been challenging. We tested deep neural network (DNN) to predict T2DM using the nested case-control study of Nurses' Health Study (3326 females, 45.6% T2DM) and Health Professionals Follow-up Study (2502 males, 46.5% T2DM). We selected 96, 214, 399, and 678 single-nucleotide polymorphism (SNPs) through Fisher's exact test and L1-penalized logistic regression. We split each dataset randomly in 4:1 to train prediction models and test their performance. DNN and logistic regressions showed better area under the curve (AUC) of ROC curves than the clinical model when 399 or more SNPs included. DNN was superior than logistic regressions in AUC with 399 or more SNPs in male and 678 SNPs in female. Addition of clinical factors consistently increased AUC of DNN but failed to improve logistic regressions with 214 or more SNPs. In conclusion, we show that DNN can be a versatile tool to predict T2DM incorporating large numbers of SNPs and clinical information. Limitations include a relatively small number of the subjects mostly of European ethnicity. Further studies are warranted to confirm and improve performance of genetic prediction models using DNN in different ethnic groups. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. Genetically decreased vitamin D and risk of Alzheimer disease.

    PubMed

    Mokry, Lauren E; Ross, Stephanie; Morris, John A; Manousaki, Despoina; Forgetta, Vincenzo; Richards, J Brent

    2016-12-13

    To test whether genetically decreased vitamin D levels are associated with Alzheimer disease (AD) using mendelian randomization (MR), a method that minimizes bias due to confounding or reverse causation. We selected single nucleotide polymorphisms (SNPs) that are strongly associated with 25-hydroxyvitamin D (25OHD) levels (p < 5 × 10⁻⁸) from the Study of Underlying Genetic Determinants of Vitamin D and Highly Related Traits (SUNLIGHT) Consortium (N = 33,996) to act as instrumental variables for the MR study. We measured the effect of each of these SNPs on 25OHD levels in the Canadian Multicentre Osteoporosis Study (CaMos; N = 2,347) and obtained the corresponding effect estimates for each SNP on AD risk from the International Genomics of Alzheimer's Project (N = 17,008 AD cases and 37,154 controls). To produce MR estimates, we weighted the effect of each SNP on AD by its effect on 25OHD and meta-analyzed these estimates using a fixed-effects model to provide a summary effect estimate. The SUNLIGHT Consortium identified 4 SNPs to be genome-wide significant for 25OHD, which described 2.44% of the variance in 25OHD in CaMos. All 4 SNPs map to genes within the vitamin D metabolic pathway. MR analyses demonstrated that a 1-SD decrease in natural log-transformed 25OHD increased AD risk by 25% (odds ratio 1.25, 95% confidence interval 1.03-1.51, p = 0.021). After sensitivity analysis in which we removed SNPs possibly influenced by pleiotropy and population stratification, the results were largely unchanged. Our results provide evidence supporting 25OHD as a causal risk factor for AD. These findings provide further rationale to understand the effect of vitamin D supplementation on cognition and AD risk in randomized controlled trials. © 2016 American Academy of Neurology.
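
    The weighting and pooling step described in this abstract can be sketched as a per-SNP ratio estimate followed by inverse-variance weighted fixed-effects pooling. This is a minimal illustration, not the authors' code: the function names, the first-order standard-error approximation, and every number in `snps` are assumptions made for the example and are not taken from the study.

```python
import math

def wald_ratio(beta_outcome, se_outcome, beta_exposure):
    """Per-SNP estimate: SNP-outcome effect divided by SNP-exposure effect.

    The standard error uses a first-order approximation that ignores
    uncertainty in the SNP-exposure effect (an assumption of this sketch).
    """
    return beta_outcome / beta_exposure, abs(se_outcome / beta_exposure)

def fixed_effects_meta(estimates):
    """Inverse-variance weighted fixed-effects pooling of (estimate, se) pairs."""
    weights = [1.0 / se ** 2 for _, se in estimates]
    pooled = sum(w * est for w, (est, _) in zip(weights, estimates)) / sum(weights)
    return pooled, math.sqrt(1.0 / sum(weights))

# Hypothetical summary statistics per SNP:
# (effect on exposure, effect on outcome log-odds, SE of the outcome effect)
snps = [
    (-0.10, 0.020, 0.010),
    (-0.20, 0.050, 0.020),
    (-0.05, 0.012, 0.008),
    (-0.15, 0.030, 0.015),
]

ratios = [wald_ratio(b_out, se_out, b_exp) for b_exp, b_out, se_out in snps]
pooled, pooled_se = fixed_effects_meta(ratios)

# pooled is the log-odds change per unit INCREASE in exposure, so the odds
# ratio per unit DECREASE is exp(-pooled).
or_per_unit_decrease = math.exp(-pooled)
print(f"pooled = {pooled:.4f}, SE = {pooled_se:.4f}, OR per unit decrease = {or_per_unit_decrease:.3f}")
```

    Pooling with fixed effects assumes all SNPs estimate the same underlying effect; a random-effects model would be the usual alternative when that assumption is doubtful.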

  15. Genetic Polymorphisms in the Hypothalamic Pathway in Relation to Subsequent Weight Change – The DiOGenes Study

    PubMed Central

    Ängquist, Lars; Hansen, Rikke D.; van der A, Daphne L.; Holst, Claus; Tjønneland, Anne; Overvad, Kim; Jakobsen, Marianne Uhre; Boeing, Heiner; Meidtner, Karina; Palli, Domenico; Masala, Giovanna; Bouatia-Naji, Nabila; Saris, Wim H. M.; Feskens, Edith J. M.; J.Wareham, Nicolas; Sørensen, Thorkild I. A.; Loos, Ruth J. F.

    2011-01-01

    Background Single nucleotide polymorphisms (SNPs) in genes encoding the components involved in the hypothalamic pathway may influence weight gain and dietary factors may modify their effects. Aim We conducted a case-cohort study to investigate the associations of SNPs in candidate genes with weight change during an average of 6.8 years of follow-up and to examine the potential effect modification by glycemic index (GI) and protein intake. Methods and Findings Participants, aged 20–60 years at baseline, came from five European countries. Cases (‘weight gainers’) were selected from the total eligible cohort (n = 50,293) as those with the greatest unexplained annual weight gain (n = 5,584). A random subcohort (n = 6,566) was drawn with the intention to obtain an equal number of cases and noncases (n = 5,507). We genotyped 134 SNPs that captured all common genetic variation across the 15 candidate genes; 123 met the quality control criteria. Each SNP was tested for association with the risk of being a ‘weight gainer’ (logistic regression models) in the case-noncase data and with weight gain (linear regression models) in the random subcohort data. After accounting for multiple testing, none of the SNPs was significantly associated with weight change. Furthermore, we observed no significant effect modification by dietary factors, except for SNP rs7180849 in the neuromedin β gene (NMB). Carriers of the minor allele had a more pronounced weight gain at a higher GI (P = 2×10⁻⁷). Conclusions We found no evidence of association between SNPs in the studied hypothalamic genes with weight change. The interaction between GI and NMB SNP rs7180849 needs further confirmation. PMID:21390334

  16. Free and Reduced-Price Meal Application and Income Verification Practices in School Nutrition Programs in the United States

    ERIC Educational Resources Information Center

    Kwon, Junehee; Lee, Yee Ming; Park, Eunhye; Wang, Yujia; Rushing, Keith

    2017-01-01

    Purpose/Objectives: This study assessed current practices and attitudes of school nutrition program (SNP) management staff regarding free and reduced-price (F-RP) meal application and verification in SNPs. Methods: Stratified, randomly selected 1,500 SNP management staff in 14 states received a link to an online questionnaire and/or a printed…

  17. Evolutionary selective pressure on three mitochondrial SNPs is consistent with their influence on metabolic efficiency in Pima Indians.

    PubMed

    Chamala, Srikar; Beckstead, Wesley A; Rowe, Mark J; McClellan, David A

    2007-01-01

    We investigated whether the effect of evolutionary selection on three recent Single Nucleotide Polymorphisms (SNPs) in the mitochondrial sub-haplogroups of Pima Indians is consistent with their effects on metabolic efficiency. The mitochondrial SNPs impact metabolic rate and respiratory quotient, and may be adaptations to caloric restriction in a desert habitat. Using TreeSAAP software, we examined evolutionary selection in 107 mammalian species at these SNPs, characterising the biochemical shifts produced by the amino acid substitutions. Our results suggest that two SNPs were affected by selection during mammalian evolution in a manner consistent with their effects on metabolic efficiency in Pima Indians.

  18. Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.

    PubMed

    Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A

    2006-08-01

    Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
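
    The greedy binning idea attributed to LD-select in this abstract can be sketched as follows. This is a simplified single-population illustration, not the published LD-select or MultiPop-TagSelect implementation: the function name, the r² threshold of 0.8, the alphabetical tie-breaking rule, and the SNP names and r² values are all assumptions made for the example.

```python
def greedy_ld_bins(snps, r2, threshold=0.8):
    """Group SNPs into bins, each represented by one tag SNP.

    snps: iterable of SNP names.
    r2: dict mapping frozenset({a, b}) to the pairwise r^2 of SNPs a and b;
        missing pairs are treated as r^2 = 0.
    Returns a list of (tag_snp, sorted_bin_members) in the order chosen.
    """
    remaining = set(snps)
    bins = []
    while remaining:
        def partners(s):
            # Remaining SNPs whose r^2 with s meets the threshold.
            return {t for t in remaining
                    if t != s and r2.get(frozenset((s, t)), 0.0) >= threshold}

        # Greedy step: pick the SNP that tags the most remaining SNPs
        # (ties broken alphabetically so the result is deterministic).
        tag = max(sorted(remaining), key=lambda s: len(partners(s)))
        members = partners(tag) | {tag}
        bins.append((tag, sorted(members)))
        remaining -= members
    return bins

# Hypothetical pairwise r^2 values for five made-up SNPs.
example_r2 = {
    frozenset(("snpA", "snpB")): 0.90,
    frozenset(("snpA", "snpC")): 0.85,
    frozenset(("snpB", "snpC")): 0.50,
    frozenset(("snpD", "snpE")): 0.95,
}
example_bins = greedy_ld_bins(["snpA", "snpB", "snpC", "snpD", "snpE"], example_r2)
print(example_bins)
```

    A greedy pass like this does not guarantee a minimal tag set, which is consistent with the abstract describing its solutions as near-minimal rather than optimal.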

  19. selectSNP – An R package for selecting SNPs optimal for genetic evaluation

    USDA-ARS's Scientific Manuscript database

    There has been a huge increase in the number of SNPs in the public repositories. This has made it a challenge to design low and medium density SNP panels, which requires careful selection of available SNPs considering many criteria, such as map position, allelic frequency, possible biological functi...

  20. Controversial opinion: evaluation of EGR1 and LAMA2 loci for high myopia in Chinese populations.

    PubMed

    Lin, Fang-yu; Huang, Zhu; Lu, Ning; Chen, Wei; Fang, Hui; Han, Wei

    2016-03-01

    Functional studies have suggested the important role of early growth response 1 (EGR1) and Laminin α2-chain (LAMA2) in human eye development. Genetic studies have reported a significant association of the single nucleotide polymorphism (SNP) in the LAMA2 gene with myopia. This study aimed to evaluate the association of the tagging SNPs (tSNPs) in the EGR1 and LAMA2 genes with high myopia in two independent Han Chinese populations. Four tSNPs (rs11743810 in the EGR1 gene; rs2571575, rs9321170, and rs1889891 in the LAMA2 gene) were selected, according to the HapMap database (http://hapmap.ncbi.nlm.nih.gov), and were genotyped using the ligase detection reaction (LDR) approach for 167 Han Chinese nuclear families with extremely highly myopic offspring (<-10.0 diopters) and an independent group with 485 extremely highly myopic cases (<-10.0 diopters) and 499 controls. Direct sequencing was used to confirm the LDR results in twenty randomly selected subjects. Family-based association analysis was performed using the family-based association test (FBAT) software package (Version 1.5.5). Population-based association analysis was performed using the Chi-square test. The association analysis power was estimated using online software (http://design.cs.ucla.edu). The FBAT demonstrated that all four tSNPs tested did not show association with high myopia (P>0.05). Haplotype analysis of tSNPs in the LAMA2 genes also did not show a significant association (P>0.05). Meanwhile, population-based association analysis also showed no significant association results with high myopia (P>0.05). On the basis of our family- and population-based analyses for the Han Chinese population, we did not find positive association signals of the four SNPs in the LAMA2 and EGR1 genes with high myopia.

  1. A 2-Stage Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms Associated With Development of Erectile Dysfunction Following Radiation Therapy for Prostate Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kerns, Sarah L.; Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York; Stock, Richard

    2013-01-01

    Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in the replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 × 10⁻⁵ to 6.2 × 10⁻⁴). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Somers' D P value = 1.7 × 10⁻²⁹). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 × 10⁻¹⁹). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.
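
    The "Fisher combined P values" reported here pool the evidence for a SNP across the discovery and replication stages. A minimal sketch of Fisher's method is given below, assuming independent p-values; the function name and the two example p-values are hypothetical and are not taken from the study.

```python
import math

def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution with
    2k degrees of freedom under the null. For even degrees of freedom the
    survival function has the closed form
        exp(-X/2) * sum_{j=0}^{k-1} (X/2)^j / j!
    so no statistics library is needed.
    """
    k = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # this is X / 2
    return math.exp(-half_stat) * sum(
        half_stat ** j / math.factorial(j) for j in range(k)
    )

# Hypothetical stage I and stage II p-values for one SNP.
stage_one_p, stage_two_p = 0.01, 0.02
combined_p = fisher_combined_p([stage_one_p, stage_two_p])
print(f"combined p = {combined_p:.6f}")
```

    For two p-values the formula reduces to p1·p2·(1 − ln(p1·p2)), which gives a quick hand check of the result.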

  2. Silymarin-Loaded Eudragit Nanoparticles: Formulation, Characterization, and Hepatoprotective and Toxicity Evaluation.

    PubMed

    El-Nahas, Amira E; Allam, Ahmed N; Abdelmonsif, Doaa A; El-Kamel, Amal H

    2017-11-01

    The objectives of this study were to formulate and characterize silymarin-loaded Eudragit nanoparticles (SNPs) and to evaluate their hepatoprotective and cytotoxic effects after oral administration. SNPs were prepared by nanoprecipitation technique and were evaluated for particle size, entrapment efficiency, TEM, solid-state characterization, and in vitro drug release. The hepatoprotective activity was evaluated after oral administration of selected SNPs in carbon tetrachloride-intoxicated rats. Potential in vivo acute cytotoxicity was also assessed. The selected SNPs contained 50 mg silymarin and 50 mg Eudragit polymers (1:1 w/w Eudragit RS 100 & Eudragit LS 100). Morphology of the selected SNPs (particle size of 84.70 nm and entrapment efficiency of 83.45% with 100% drug release after 12 h) revealed spherical and uniformly distributed nanoparticles. DSC and FT-IR studies suggested the presence of silymarin in an amorphous state and absence of chemical interaction. The hepatoprotective evaluation of the selected SNPs in CCl4-intoxicated rats revealed significant improvement in the activities of different biochemical parameters (P ≤ 0.01) compared to the marketed product. The histopathological studies suggested that the selected SNPs produced a better hepatoprotective effect in CCl4-intoxicated rats compared with the commercially marketed product. The toxicity study revealed no evident toxic effect for blank or silymarin-loaded nanoparticles at the dose level of 50 mg/kg body weight. The obtained results suggested that the selected SNPs were safe and potentially offered enhancement in the pharmacological hepatoprotective properties of silymarin.

  3. Single nucleotide polymorphism discovery in rainbow trout by deep sequencing of a reduced representation library.

    PubMed

    Sánchez, Cecilia Castaño; Smith, Timothy P L; Wiedmann, Ralph T; Vallejo, Roger L; Salem, Mohamed; Yao, Jianbo; Rexroad, Caird E

    2009-11-25

    To enhance capabilities for genomic analyses in rainbow trout, such as genomic selection, a large suite of polymorphic markers that are amenable to high-throughput genotyping protocols must be identified. Expressed Sequence Tags (ESTs) have been used for single nucleotide polymorphism (SNP) discovery in salmonids. In those strategies, the salmonid semi-tetraploid genomes often led to assemblies of paralogous sequences and therefore resulted in a high rate of false positive SNP identification. Sequencing genomic DNA using primers identified from ESTs proved to be an effective but time consuming methodology of SNP identification in rainbow trout, therefore not suitable for high throughput SNP discovery. In this study, we employed a high-throughput strategy that used pyrosequencing technology to generate data from a reduced representation library constructed with genomic DNA pooled from 96 unrelated rainbow trout that represent the National Center for Cool and Cold Water Aquaculture (NCCCWA) broodstock population. The reduced representation library consisted of 440 bp fragments resulting from complete digestion with the restriction enzyme HaeIII; sequencing produced 2,000,000 reads providing an average 6 fold coverage of the estimated 150,000 unique genomic restriction fragments (300,000 fragment ends). Three independent data analyses identified 22,022 to 47,128 putative SNPs on 13,140 to 24,627 independent contigs. A set of 384 putative SNPs, randomly selected from the sets produced by the three analyses, were genotyped on individual fish to determine the validation rate of putative SNPs among analyses, distinguish apparent SNPs that actually represent paralogous loci in the tetraploid genome, examine Mendelian segregation, and place the validated SNPs on the rainbow trout linkage map. Approximately 48% (183) of the putative SNPs were validated; 167 markers were successfully incorporated into the rainbow trout linkage map. In addition, 2% of the sequences from the validated markers were associated with rainbow trout transcripts. The use of reduced representation libraries and pyrosequencing technology proved to be an effective strategy for the discovery of a high number of putative SNPs in rainbow trout; however, modifications to the technique to decrease the false discovery rate resulting from the evolutionarily recent genome duplication would be desirable.

  4. Global genetic differentiation of complex traits shaped by natural selection in humans.

    PubMed

    Guo, Jing; Wu, Yang; Zhu, Zhihong; Zheng, Zhili; Trzaskowski, Maciej; Zeng, Jian; Robinson, Matthew R; Visscher, Peter M; Yang, Jian

    2018-05-14

    There are mean differences in complex traits among global human populations. We hypothesize that part of the phenotypic differentiation is due to natural selection. To address this hypothesis, we assess the differentiation in allele frequencies of trait-associated SNPs among African, Eastern Asian, and European populations for ten complex traits using data of large sample size (up to ~405,000). We show that SNPs associated with height ([Formula: see text]), waist-to-hip ratio ([Formula: see text]), and schizophrenia ([Formula: see text]) are significantly more differentiated among populations than matched "control" SNPs, suggesting that these trait-associated SNPs have undergone natural selection. We further find that SNPs associated with height ([Formula: see text]) and schizophrenia ([Formula: see text]) show significantly higher variance in linkage disequilibrium (LD) scores across populations than control SNPs. Our results support the hypothesis that natural selection has shaped the genetic differentiation of complex traits, such as height and schizophrenia, among worldwide populations.

  5. Novel and efficient tag SNPs selection algorithms.

    PubMed

    Chen, Wen-Pei; Hung, Che-Lun; Tsai, Suh-Jen Jane; Lin, Yaw-Ling

    2014-01-01

    SNPs are the most abundant forms of genetic variations amongst species; the association studies between complex diseases and SNPs or haplotypes have received great attention. However, these studies are restricted by the cost of genotyping all SNPs; thus, it is necessary to find smaller subsets, or tag SNPs, representing the rest of the SNPs. In fact, the existing tag SNP selection algorithms are notoriously time-consuming. An efficient algorithm for tag SNP selection was presented, which was applied to analyze the HapMap YRI data. The experimental results show that the proposed algorithm can achieve better performance than the existing tag SNP selection algorithms; in most cases, this proposed algorithm is at least ten times faster than the existing methods. In many cases, when the redundant ratio of the block is high, the proposed algorithm can even be thousands of times faster than the previously known methods. Tools and web services for haplotype block analysis integrated by the Hadoop MapReduce framework are also developed using the proposed algorithm as computation kernels.

  6. Regulatory single nucleotide polymorphisms (rSNPs) at the promoters 1A and 1B of the human APC gene.

    PubMed

    Matveeva, Marina Yu; Kashina, Elena V; Reshetnikov, Vasily V; Bryzgalov, Leonid O; Antontseva, Elena V; Bondar, Natalia P; Merkulova, Tatiana I

    2016-12-22

    Germline mutations in the coding sequence of the tumour suppressor APC gene give rise to familial adenomatous polyposis (which leads to colorectal cancer) and are associated with many other oncopathologies. The loss of APC function because of deletion of putative promoter 1A or 1B also results in the development of colorectal cancer. Since the regions of promoters 1A and 1B contain many single nucleotide polymorphisms (SNPs), the aim of this study was to perform functional analysis of some of these SNPs by means of an electrophoretic mobility shift assay (EMSA) and a luciferase reporter assay. First, it was shown that both putative promoters of APC (1A and 1B) drive transcription in an in vitro reporter experiment. From eleven randomly selected SNPs of promoter 1A and four SNPs of promoter 1B, nine and two respectively showed differential patterns of binding of nuclear proteins to oligonucleotide probes corresponding to alternative alleles. The luciferase reporter assay showed that among the six SNPs tested, the rs75612255 C allele and rs113017087 C allele in promoter 1A as well as the rs138386816 T allele and rs115658307 T allele in promoter 1B significantly increased luciferase activity in the human erythromyeloblastoid leukaemia cell line K562. In human colorectal cancer HCT-116 cells, none of the substitutions under study had any effect, with the exception of minor allele G of rs79896135 in promoter 1B. This allele significantly decreased the luciferase reporter's activity. Conclusion: Our results indicate that many SNPs in APC promoters 1A and 1B are functionally relevant and that allele G of rs79896135 may be associated with the predisposition to colorectal cancer.

  7. Polymorphisms in the promoter region of the bovine lactoferrin gene influence milk somatic cell score and milk production traits in Chinese Holstein cows.

    PubMed

    Mao, Yongjiang; Zhu, Xiaorui; Xing, Shiyu; Zhang, Meirong; Zhang, Huimin; Wang, Xiaolong; Karrow, Niel; Yang, Liguo; Yang, Zhangping

    2015-12-01

    Lactoferrin is an iron-binding protein found in cow's milk that plays an important role in preventing mastitis caused by intramammary infection. In this study, 20 Chinese Holstein cows were selected randomly for PCR amplification and sequencing of the bovine lactoferrin gene promoter region and used for SNP discovery in the region between nucleotide positions -461 to -132. Three SNPs (-270T>C, -190G>A and -156A>G) were identified in bovine lactoferrin, then Chinese Holstein cows (n=866) were genotyped using Sequenom MassARRAY (Sequenom Inc., San Diego, CA) based on the previous SNP information in this study, and the associations between SNPs or haplotype and milk somatic cell score (SCS) and production traits were analyzed by the least squares method in the GLM procedure of SAS. SNPs -270T>C and -156A>G showed close linkage disequilibrium (r² = 0.76). The SNP -190G>A showed a significant association with SCS, and individuals with genotype GG had higher SCS than genotypes AG and AA. Associations were found between the SNPs -270T>C and -190G>A with SCS and the milk composition. The software MatInspector revealed that these SNPs were located within several potential transcription factor binding sites, including NF-κB p50, KLF7 and SP1, and may alter gene expression, but further investigation will be required to elucidate the biological and practical relevance of these SNPs. Copyright © 2015 Elsevier Ltd. All rights reserved.
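
    The linkage disequilibrium measure r² quoted in this abstract can be illustrated with the standard two-locus definition, r² = D² / (pA(1 − pA) pB(1 − pB)) with D = pAB − pA·pB. The sketch below assumes phased haplotype counts are available, which the abstract does not state; the function name and the counts are hypothetical.

```python
def ld_r_squared(n_AB, n_Ab, n_aB, n_ab):
    """r^2 between two biallelic loci from phased haplotype counts.

    n_AB, n_Ab, n_aB, n_ab are counts of the four possible haplotypes,
    where A/a are the alleles at locus 1 and B/b the alleles at locus 2.
    Both loci must be polymorphic, otherwise the denominator is zero.
    """
    total = n_AB + n_Ab + n_aB + n_ab
    p_AB = n_AB / total
    p_A = (n_AB + n_Ab) / total
    p_B = (n_AB + n_aB) / total
    d = p_AB - p_A * p_B  # disequilibrium coefficient D
    return d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))

# Hypothetical counts: complete LD, no LD, and an intermediate case.
r2_complete = ld_r_squared(50, 0, 0, 50)
r2_none = ld_r_squared(25, 25, 25, 25)
r2_partial = ld_r_squared(45, 5, 5, 45)
print(r2_complete, r2_none, round(r2_partial, 3))
```

    With unphased genotype data, haplotype frequencies would first have to be estimated (for example by an EM algorithm) before this formula applies.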

  8. Development of genetic markers in abalone through construction of a SNP database.

    PubMed

    Kang, J-H; Appleyard, S A; Elliott, N G; Jee, Y-J; Lee, J B; Kang, S W; Baek, M K; Han, Y S; Choi, T-J; Lee, Y S

    2011-06-01

    In the absence of a reference genome, single-nucleotide polymorphism (SNP) discovery in a group of abalone species was undertaken by random sequence assembly. A web-based interface was constructed, and 11 932 DNA sequences from the genus Haliotis were assembled, with 1321 contigs built. Of these, 118 contigs that consisted of at least ten annotation groups were selected. A total of 1577 putative SNPs were identified from the 118 contigs, with SNPs in several HSP70 gene contigs confirmed by PCR amplification of an 809-bp DNA fragment. SNPs in the HSP70 gene were compared across eight abalone species. A total of 129 polymorphic sites, including heterozygote sites within and among species, were observed. Phylogenetic analysis of the partial HSP70 gene region showed separation of the tested abalone into two groups, one reflecting the southern hemisphere species and the other the northern hemisphere species. Interestingly, Haliotis iris from New Zealand showed a closer relationship to species distributed in the northern Pacific region. Although HSP genes are known to be highly conserved among taxa, the validation of polymorphic SNPs from HSP70 in this mollusc demonstrates the applicability of cross-species SNP markers in abalone and the first step towards universal nuclear markers in Haliotis. © 2010 NFRDI, Animal Genetics © 2010 Stichting International Foundation for Animal Genetics.

  9. Association of calpain 10 gene polymorphisms with type 2 diabetes mellitus in Southern Indians.

    PubMed

    Bodhini, Dhanasekaran; Radha, Venkatesan; Ghosh, Saurabh; Sanapala, Krishna R; Majumder, Partha P; Rao, Manchanahalli Rangaswamy Satyanarayana; Mohan, Viswanathan

    2011-05-01

    The aim was to investigate the association between the CAPN10 gene single nucleotide polymorphisms (SNPs) -44 (rs2975760), -43 (rs3792267), -19 (rs3842570), and -63 (rs5030952) and type 2 diabetes mellitus in an Asian Indian population in Southern India. A total of 1443 subjects, 794 normal glucose tolerant (NGT) and 649 type 2 diabetes mellitus subjects, were randomly selected from the Chennai Urban Rural Epidemiology Study. These subjects were genotyped for the 4 CAPN10 SNPs using polymerase chain reaction-restriction fragment length polymorphism and validated by direct sequencing. None of the 4 SNPs showed any significant differences in the genotypic distribution among the NGT and type 2 diabetes mellitus subjects (P = .20, .86, .34, and .39 for SNPs -44, -43, -19, and -63, respectively). The NGT subjects with the 11 genotype of the SNP -63 had significantly higher 2-hour postload plasma glucose (mean ± SD, 5.66 ± 1.05 mmol/L) levels compared with the combined 12 and 22 genotype group (5.33 ± 1.11 mmol/L, P = .004). The P value remained significant even after adjusting for age, sex, body mass index, smoking, and alcohol consumption (nominal P = .008). No significant difference in the biochemical parameters was observed when the subjects were stratified according to the other SNPs. The 2111 haplotype corresponding to SNPs -44, -43, -19, and -63 showed a significant difference in the proportion among NGT (0.18) and type 2 diabetes mellitus subjects (0.22, nominal P = .014). Although the Bonferroni correction based on the asymptotic test does not preserve this significance, the test based on the empirical distribution remained significant. In conclusion, our study raises the possibility that the 2111 haplotype of SNPs -44, -43, -19, and -63 may be associated with type 2 diabetes mellitus, although none of these SNPs may be individually associated with diabetes. Copyright © 2011 Elsevier Inc. All rights reserved.

  10. Molecular footprints of domestication and improvement in soybean revealed by whole genome re-sequencing

    PubMed Central

    2013-01-01

    Background Artificial selection played an important role in the origin of modern Glycine max cultivars from the wild soybean Glycine soja. To elucidate the consequences of artificial selection accompanying the domestication and modern improvement of soybean, 25 new and 30 published whole-genome re-sequencing accessions, which represent wild, domesticated landrace, and Chinese elite soybean populations were analyzed. Results A total of 5,102,244 single nucleotide polymorphisms (SNPs) and 707,969 insertion/deletions were identified. Among the SNPs detected, 25.5% were not described previously. We found that artificial selection during domestication led to more pronounced reduction in the genetic diversity of soybean than the switch from landraces to elite cultivars. Only a small proportion (2.99%) of the whole genomic regions appear to be affected by artificial selection for preferred agricultural traits. The selection regions were not distributed randomly or uniformly throughout the genome. Instead, clusters of selection hotspots in certain genomic regions were observed. Moreover, a set of candidate genes (4.38% of the total annotated genes) significantly affected by selection underlying soybean domestication and genetic improvement were identified. Conclusions Given the uniqueness of the soybean germplasm sequenced, this study drew a clear picture of human-mediated evolution of the soybean genomes. The genomic resources and information provided by this study would also facilitate the discovery of genes/loci underlying agronomically important traits. PMID:23984715

  11. Genome-wide association analysis for feed efficiency in Angus cattle.

    PubMed

    Rolf, M M; Taylor, J F; Schnabel, R D; McKay, S D; McClure, M C; Northcutt, S L; Kerley, M S; Weaber, R L

    2012-08-01

    Estimated breeding values for average daily feed intake (AFI; kg/day), residual feed intake (RFI; kg/day) and average daily gain (ADG; kg/day) were generated using a mixed linear model incorporating genomic relationships for 698 Angus steers genotyped with the Illumina BovineSNP50 assay. Association analyses of estimated breeding values (EBVs) were performed for 41,028 single nucleotide polymorphisms (SNPs), and permutation analysis was used to empirically establish the genome-wide significance threshold (P < 0.05) for each trait. SNPs significantly associated with each trait were used in a forward selection algorithm to identify genomic regions putatively harbouring genes with effects on each trait. A total of 53, 66 and 68 SNPs explained 54.12% (24.10%), 62.69% (29.85%) and 55.13% (26.54%) of the additive genetic variation (when accounting for the genomic relationships) in steer breeding values for AFI, RFI and ADG, respectively, within this population. Evaluation by pathway analysis revealed that many of these SNPs are in genomic regions that harbour genes with metabolic functions. The presence of genetic correlations between traits resulted in 13.2% of SNPs selected for AFI and 4.5% of SNPs selected for RFI also being selected for ADG in the analysis of breeding values. While our study identifies panels of SNPs significant for efficiency traits in our population, validation of all SNPs in independent populations will be necessary before commercialization. © 2011 The Authors, Animal Genetics © 2011 Stichting International Foundation for Animal Genetics.

  12. Mendelian randomization of blood lipids for coronary heart disease

    PubMed Central

    Holmes, Michael V.; Asselbergs, Folkert W.; Palmer, Tom M.; Drenos, Fotios; Lanktree, Matthew B.; Nelson, Christopher P.; Dale, Caroline E.; Padmanabhan, Sandosh; Finan, Chris; Swerdlow, Daniel I.; Tragante, Vinicius; van Iperen, Erik P.A.; Sivapalaratnam, Suthesh; Shah, Sonia; Elbers, Clara C.; Shah, Tina; Engmann, Jorgen; Giambartolomei, Claudia; White, Jon; Zabaneh, Delilah; Sofat, Reecha; McLachlan, Stela; Doevendans, Pieter A.; Balmforth, Anthony J.; Hall, Alistair S.; North, Kari E.; Almoguera, Berta; Hoogeveen, Ron C.; Cushman, Mary; Fornage, Myriam; Patel, Sanjay R.; Redline, Susan; Siscovick, David S.; Tsai, Michael Y.; Karczewski, Konrad J.; Hofker, Marten H.; Verschuren, W. Monique; Bots, Michiel L.; van der Schouw, Yvonne T.; Melander, Olle; Dominiczak, Anna F.; Morris, Richard; Ben-Shlomo, Yoav; Price, Jackie; Kumari, Meena; Baumert, Jens; Peters, Annette; Thorand, Barbara; Koenig, Wolfgang; Gaunt, Tom R.; Humphries, Steve E.; Clarke, Robert; Watkins, Hugh; Farrall, Martin; Wilson, James G.; Rich, Stephen S.; de Bakker, Paul I.W.; Lange, Leslie A.; Davey Smith, George; Reiner, Alex P.; Talmud, Philippa J.; Kivimäki, Mika; Lawlor, Debbie A.; Dudbridge, Frank; Samani, Nilesh J.; Keating, Brendan J.; Hingorani, Aroon D.; Casas, Juan P.

    2015-01-01

    Aims: To investigate the causal role of high-density lipoprotein cholesterol (HDL-C) and triglycerides in coronary heart disease (CHD) using multiple instrumental variables for Mendelian randomization. Methods and results: We developed weighted allele scores based on single nucleotide polymorphisms (SNPs) with established associations with HDL-C, triglycerides, and low-density lipoprotein cholesterol (LDL-C). For each trait, we constructed two scores. The first was unrestricted, including all independent SNPs associated with the lipid trait identified from a prior meta-analysis (threshold P < 2 × 10⁻⁶); and the second a restricted score, filtered to remove any SNPs also associated with either of the other two lipid traits at P ≤ 0.01. Mendelian randomization meta-analyses were conducted in 17 studies including 62,199 participants and 12,099 CHD events. Both the unrestricted and restricted allele scores for LDL-C (42 and 19 SNPs, respectively) associated with CHD. For HDL-C, the unrestricted allele score (48 SNPs) was associated with CHD (OR: 0.53; 95% CI: 0.40, 0.70), per 1 mmol/L higher HDL-C, but neither the restricted allele score (19 SNPs; OR: 0.91; 95% CI: 0.42, 1.98) nor the unrestricted HDL-C allele score adjusted for triglycerides, LDL-C, or statin use (OR: 0.81; 95% CI: 0.44, 1.46) showed a robust association. For triglycerides, the unrestricted allele score (67 SNPs) and the restricted allele score (27 SNPs) were both associated with CHD (OR: 1.62; 95% CI: 1.24, 2.11 and 1.61; 95% CI: 1.00, 2.59, respectively) per 1-log unit increment. However, the unrestricted triglyceride score adjusted for HDL-C, LDL-C, and statin use gave an OR for CHD of 1.01 (95% CI: 0.59, 1.75). Conclusion: The genetic findings support a causal effect of triglycerides on CHD risk, but a causal role for HDL-C, though possible, remains less certain. PMID:24474739
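
    The two allele scores described in this abstract can be illustrated with a minimal sketch. It assumes each SNP has a per-allele weight and a list of P values for association with the other two lipid traits; the SNP names, weights and P values below are invented for illustration, and the helper names (restricted_snps, allele_score) are hypothetical, not the authors' code.

```python
from typing import Dict, List


def restricted_snps(weights: Dict[str, float],
                    other_trait_pvalues: Dict[str, List[float]],
                    p_cutoff: float = 0.01) -> Dict[str, float]:
    """Keep only SNPs NOT associated with any other trait at P <= p_cutoff."""
    return {
        snp: w
        for snp, w in weights.items()
        if all(p > p_cutoff for p in other_trait_pvalues.get(snp, []))
    }


def allele_score(dosages: Dict[str, int], weights: Dict[str, float]) -> float:
    """Weighted sum of trait-raising allele counts (0, 1 or 2 per SNP)."""
    return sum(w * dosages[snp] for snp, w in weights.items())


# Hypothetical inputs: per-allele effect sizes and P values for the two
# other lipid traits (all values are made up for this example).
weights = {"snp_a": 0.10, "snp_b": 0.05, "snp_c": 0.20}
other_p = {"snp_a": [0.30, 0.50], "snp_b": [0.004, 0.70], "snp_c": [0.20, 0.02]}
person = {"snp_a": 2, "snp_b": 1, "snp_c": 0}

unrestricted = allele_score(person, weights)
restricted = allele_score(person, restricted_snps(weights, other_p))
print(unrestricted, restricted)
```

    In this toy input only snp_b is dropped from the restricted score, because one of its P values for another trait falls at or below 0.01.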

  13. Mendelian randomization of blood lipids for coronary heart disease.

    PubMed

    Holmes, Michael V; Asselbergs, Folkert W; Palmer, Tom M; Drenos, Fotios; Lanktree, Matthew B; Nelson, Christopher P; Dale, Caroline E; Padmanabhan, Sandosh; Finan, Chris; Swerdlow, Daniel I; Tragante, Vinicius; van Iperen, Erik P A; Sivapalaratnam, Suthesh; Shah, Sonia; Elbers, Clara C; Shah, Tina; Engmann, Jorgen; Giambartolomei, Claudia; White, Jon; Zabaneh, Delilah; Sofat, Reecha; McLachlan, Stela; Doevendans, Pieter A; Balmforth, Anthony J; Hall, Alistair S; North, Kari E; Almoguera, Berta; Hoogeveen, Ron C; Cushman, Mary; Fornage, Myriam; Patel, Sanjay R; Redline, Susan; Siscovick, David S; Tsai, Michael Y; Karczewski, Konrad J; Hofker, Marten H; Verschuren, W Monique; Bots, Michiel L; van der Schouw, Yvonne T; Melander, Olle; Dominiczak, Anna F; Morris, Richard; Ben-Shlomo, Yoav; Price, Jackie; Kumari, Meena; Baumert, Jens; Peters, Annette; Thorand, Barbara; Koenig, Wolfgang; Gaunt, Tom R; Humphries, Steve E; Clarke, Robert; Watkins, Hugh; Farrall, Martin; Wilson, James G; Rich, Stephen S; de Bakker, Paul I W; Lange, Leslie A; Davey Smith, George; Reiner, Alex P; Talmud, Philippa J; Kivimäki, Mika; Lawlor, Debbie A; Dudbridge, Frank; Samani, Nilesh J; Keating, Brendan J; Hingorani, Aroon D; Casas, Juan P

    2015-03-01

    To investigate the causal role of high-density lipoprotein cholesterol (HDL-C) and triglycerides in coronary heart disease (CHD) using multiple instrumental variables for Mendelian randomization. We developed weighted allele scores based on single nucleotide polymorphisms (SNPs) with established associations with HDL-C, triglycerides, and low-density lipoprotein cholesterol (LDL-C). For each trait, we constructed two scores. The first was unrestricted, including all independent SNPs associated with the lipid trait identified from a prior meta-analysis (threshold P < 2 × 10⁻⁶); and the second a restricted score, filtered to remove any SNPs also associated with either of the other two lipid traits at P ≤ 0.01. Mendelian randomization meta-analyses were conducted in 17 studies including 62,199 participants and 12,099 CHD events. Both the unrestricted and restricted allele scores for LDL-C (42 and 19 SNPs, respectively) associated with CHD. For HDL-C, the unrestricted allele score (48 SNPs) was associated with CHD (OR: 0.53; 95% CI: 0.40, 0.70), per 1 mmol/L higher HDL-C, but neither the restricted allele score (19 SNPs; OR: 0.91; 95% CI: 0.42, 1.98) nor the unrestricted HDL-C allele score adjusted for triglycerides, LDL-C, or statin use (OR: 0.81; 95% CI: 0.44, 1.46) showed a robust association. For triglycerides, the unrestricted allele score (67 SNPs) and the restricted allele score (27 SNPs) were both associated with CHD (OR: 1.62; 95% CI: 1.24, 2.11 and 1.61; 95% CI: 1.00, 2.59, respectively) per 1-log unit increment. However, the unrestricted triglyceride score adjusted for HDL-C, LDL-C, and statin use gave an OR for CHD of 1.01 (95% CI: 0.59, 1.75). The genetic findings support a causal effect of triglycerides on CHD risk, but a causal role for HDL-C, though possible, remains less certain. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Cardiology.

  14. Analysis of single nucleotide polymorphisms in case-control studies.

    PubMed

    Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer

    2011-01-01

    Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.

  15. Genomic analysis of morphometric traits in bighorn sheep using the Ovine Infinium® HD SNP BeadChip.

    PubMed

    Miller, Joshua M; Festa-Bianchet, Marco; Coltman, David W

    2018-01-01

    Elucidating the genetic basis of fitness-related traits is a major goal of molecular ecology. Traits subject to sexual selection are particularly interesting, as non-random mate choice should deplete genetic variation and thereby their evolutionary benefits. We examined the genetic basis of three sexually selected morphometric traits in bighorn sheep (Ovis canadensis): horn length, horn base circumference, and body mass. These traits are of specific concern in bighorn sheep as artificial selection through trophy hunting opposes sexual selection. Specifically, horn size determines trophy status and, in most North American jurisdictions, whether an individual can be legally harvested. Using between 7,994 and 9,552 phenotypic measures from the long-term individual-based study at Ram Mountain (Alberta, Canada), we first showed that all three traits are heritable (h² = 0.15-0.23). We then conducted a genome-wide association study (GWAS) utilizing a set of 3,777 SNPs typed in 76 individuals using the Ovine Infinium® HD SNP BeadChip. We found suggestive association for body mass at a single locus (OAR9_91647990). The absence of strong associations with SNPs suggests that the traits are likely polygenic. These results represent a step forward for characterizing the genetic architecture of fitness-related traits in sexually dimorphic ungulates.

  16. Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences.

    PubMed

    Pang, Erli; Wu, Xiaomei; Lin, Kui

    2016-06-01

    Protein evolution plays an important role in the evolution of each genome. Because of their functional nature, in general, most of their parts or sites are differently constrained selectively, particularly by purifying selection. Most previous studies on protein evolution considered individual proteins in their entirety or compared protein-coding sequences with non-coding sequences. Less attention has been paid to the evolution of different parts within each protein of a given genome. To this end, based on PfamA annotation of all human proteins, each protein sequence can be split into two parts: domains or unassigned regions. Using this rationale, single nucleotide polymorphisms (SNPs) in protein-coding sequences from the 1000 Genomes Project were mapped according to two classifications: SNPs occurring within protein domains and those within unassigned regions. With these classifications, we found: the density of synonymous SNPs within domains is significantly greater than that of synonymous SNPs within unassigned regions; however, the density of non-synonymous SNPs shows the opposite pattern. We also found there are signatures of purifying selection on both the domain and unassigned regions. Furthermore, the selective strength on domains is significantly greater than that on unassigned regions. In addition, among all of the human protein sequences, there are 117 PfamA domains in which no SNPs are found. Our results highlight an important aspect of protein domains and may contribute to our understanding of protein evolution.

  17. Genetic variation predicting cisplatin cytotoxicity associated with overall survival in lung cancer patients receiving platinum-based chemotherapy

    PubMed Central

    Tan, Xiang-Lin; Moyer, Ann M.; Fridley, Brooke L.; Schaid, Daniel J.; Niu, Nifang; Batzler, Anthony J.; Jenkins, Gregory D.; Abo, Ryan P.; Li, Liang; Cunningham, Julie M.; Sun, Zhifu; Yang, Ping; Wang, Liewei

    2011-01-01

    Purpose: Inherited variability in the prognosis of lung cancer patients treated with platinum-based chemotherapy has been widely investigated. However, the overall contribution of genetic variation to platinum response is not well established. To identify novel candidate SNPs/genes, we performed a genome-wide association study (GWAS) for cisplatin cytotoxicity using lymphoblastoid cell lines (LCLs), followed by an association study of selected SNPs from the GWAS with overall survival (OS) in lung cancer patients. Experimental Design: GWAS for cisplatin were performed with 283 ethnically diverse LCLs. 168 top SNPs were genotyped in 222 small cell and 961 non-small cell lung cancer (SCLC, NSCLC) patients treated with platinum-based therapy. Association of the SNPs with OS was determined using the Cox regression model. Selected candidate genes were functionally validated by siRNA knockdown in human lung cancer cells. Results: Among 157 successfully genotyped SNPs, 9 and 10 SNPs were top SNPs associated with OS for patients with NSCLC and SCLC, respectively, although they were not significant after adjusting for multiple testing. Fifteen genes, including 7 located within 200 kb upstream or downstream of the four top SNPs and 8 genes for which expression was correlated with three SNPs in LCLs, were selected for siRNA screening. Knockdown of DAPK3 and METTL6, for which expression levels were correlated with the rs11169748 and rs2440915 SNPs, significantly decreased cisplatin sensitivity in lung cancer cells. Conclusions: This series of clinical and complementary laboratory-based functional studies identified several candidate genes/SNPs that might help predict treatment outcomes for platinum-based therapy of lung cancer. PMID:21775533

  18. Efficient association study design via power-optimized tag SNP selection

    PubMed Central

    Han, Buhm; Kang, Hyun Min; Seo, Myeong Seong; Zaitlen, Noah; Eskin, Eleazar

    2008-01-01

    Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SNPs) by selecting a subset of SNPs to be genotyped (tag SNPs). While many current association studies are performed using commercially available high-throughput genotyping products that define a set of tag SNPs, choosing tag SNPs remains an important problem both for custom follow-up studies and for designing the high-throughput genotyping products themselves. The most widely used tag SNP selection method optimizes over the correlation between SNPs (r²). However, tag SNPs chosen based on an r² criterion do not necessarily maximize the statistical power of an association study. We propose a study design framework that chooses SNPs to maximize power and efficiently measures the power through empirical simulation. Empirical results based on the HapMap data show that our method gains considerable power over a widely used r²-based method, or equivalently reduces the number of tag SNPs required to attain the desired power of a study. Our power-optimized 100k whole genome tag set provides equivalent power to the Affymetrix 500k chip for the CEU population. For the design of custom follow-up studies, our method provides up to twice the power increase using the same number of tag SNPs as r²-based methods. Our method is publicly available via web server at http://design.cs.ucla.edu. PMID:18702637
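
    For orientation, the r²-based baseline that this abstract compares against can be sketched as a greedy tag picker. This is not the authors' power-optimized method. It assumes genotypes coded as allele counts (0, 1, 2), uses the squared correlation of those counts as a stand-in for haplotype r², and only considers untagged SNPs as candidate tags; the function names and the 0.8 threshold are illustrative choices.

```python
from statistics import mean


def r_squared(x, y):
    """Squared Pearson correlation between two genotype vectors."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a monomorphic SNP carries no correlation information
    return sxy * sxy / (sxx * syy)


def greedy_tags(genotypes, threshold=0.8):
    """Pick tags until every SNP is a tag or has r^2 >= threshold with one."""
    untagged = set(genotypes)
    tags = []
    while untagged:
        # Choose the SNP that covers the most still-untagged SNPs
        # (ties broken alphabetically because candidates are sorted).
        best = max(
            sorted(untagged),
            key=lambda s: sum(
                r_squared(genotypes[s], genotypes[t]) >= threshold
                for t in untagged
            ),
        )
        tags.append(best)
        covered = {
            t for t in untagged
            if r_squared(genotypes[best], genotypes[t]) >= threshold
        }
        untagged -= covered
        untagged.discard(best)  # guarantees progress for monomorphic SNPs
    return tags


# Toy data: s2 duplicates s1, s3 is its mirror image, s4 is uncorrelated.
g = {
    "s1": [0, 1, 2, 1, 0, 2],
    "s2": [0, 1, 2, 1, 0, 2],
    "s3": [2, 1, 0, 1, 2, 0],
    "s4": [0, 0, 1, 2, 1, 0],
}
print(greedy_tags(g))
```

    In the toy data one tag covers s1, s2 and s3, and s4 needs its own tag. A power-based design, as proposed in the abstract, would replace the coverage count with an estimate of study power.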

  19. The role of parasite-driven selection in shaping landscape genomic structure in red grouse (Lagopus lagopus scotica).

    PubMed

    Wenzel, Marius A; Douglas, Alex; James, Marianne C; Redpath, Steve M; Piertney, Stuart B

    2016-01-01

    Landscape genomics promises to provide novel insights into how neutral and adaptive processes shape genome-wide variation within and among populations. However, there has been little emphasis on examining whether individual-based phenotype-genotype relationships derived from approaches such as genome-wide association (GWAS) manifest themselves as a population-level signature of selection in a landscape context. The two may prove irreconcilable as individual-level patterns become diluted by high levels of gene flow and complex phenotypic or environmental heterogeneity. We illustrate this issue with a case study that examines the role of the highly prevalent gastrointestinal nematode Trichostrongylus tenuis in shaping genomic signatures of selection in red grouse (Lagopus lagopus scotica). Individual-level GWAS involving 384 SNPs has previously identified five SNPs that explain variation in T. tenuis burden. Here, we examine whether these same SNPs display population-level relationships between T. tenuis burden and genetic structure across a small-scale landscape of 21 sites with heterogeneous parasite pressure. Moreover, we identify adaptive SNPs showing signatures of directional selection using F(ST) outlier analysis and relate population- and individual-level patterns of multilocus neutral and adaptive genetic structure to T. tenuis burden. The five candidate SNPs for parasite-driven selection were neither associated with T. tenuis burden on a population level, nor under directional selection. Similarly, there was no evidence of parasite-driven selection in SNPs identified as candidates for directional selection. We discuss these results in the context of red grouse ecology and highlight the broader consequences for the utility of landscape genomics approaches for identifying signatures of selection. © 2015 John Wiley & Sons Ltd.

  20. A Prediction Algorithm for Drug Response in Patients with Mesial Temporal Lobe Epilepsy Based on Clinical and Genetic Information

    PubMed Central

    Carvalho, Benilton S.; Bilevicius, Elizabeth; Alvim, Marina K. M.; Lopes-Cendes, Iscia

    2017-01-01

    Mesial temporal lobe epilepsy is the most common form of adult epilepsy in surgical series. Currently, the only characteristic used to predict poor response to clinical treatment in this syndrome is the presence of hippocampal sclerosis. Single nucleotide polymorphisms (SNPs) located in genes encoding drug transporter and metabolism proteins could influence response to therapy. Therefore, we aimed to evaluate whether combining information from clinical variables as well as SNPs in candidate genes could improve the accuracy of predicting response to drug therapy in patients with mesial temporal lobe epilepsy. For this, we divided 237 patients into two groups: 75 responsive and 162 refractory to antiepileptic drug therapy. We genotyped 119 SNPs in ABCB1, ABCC2, CYP1A1, CYP1A2, CYP1B1, CYP2C9, CYP2C19, CYP2D6, CYP2E1, CYP3A4, and CYP3A5 genes. We used 98 additional SNPs to evaluate population stratification. We assessed a first scenario using only clinical variables and a second one including SNP information. The random forests algorithm combined with leave-one-out cross-validation was used to identify the best predictive model in each scenario and compared their accuracies using the area under the curve statistic. Additionally, we built a variable importance plot to present the set of most relevant predictors on the best model. The selected best model included the presence of hippocampal sclerosis and 56 SNPs. Furthermore, including SNPs in the model improved accuracy from 0.4568 to 0.8177. Our findings suggest that adding genetic information provided by SNPs, located on drug transport and metabolism genes, can improve the accuracy for predicting which patients with mesial temporal lobe epilepsy are likely to be refractory to drug treatment, making it possible to identify patients who may benefit from epilepsy surgery sooner. PMID:28052106

  1. Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture.

    PubMed

    Ni, Guiyan; Cavero, David; Fangmann, Anna; Erbe, Malena; Simianer, Henner

    2017-01-16

    With the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is now feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations. Our objective was to compare prediction ability with high-density (HD) array data and WGS data in a commercial brown layer line with genomic best linear unbiased prediction (GBLUP) models using various approaches to weight single nucleotide polymorphisms (SNPs). A total of 892 chickens from a commercial brown layer line were genotyped with 336 K segregating SNPs (array data) that included 157 K genic SNPs (i.e. SNPs in or around a gene). For these individuals, genome-wide sequence information was imputed based on data from re-sequencing runs of 25 individuals, leading to 5.2 million (M) imputed SNPs (WGS data), including 2.6 M genic SNPs. De-regressed proofs (DRP) for eggshell strength, feed intake and laying rate were used as quasi-phenotypic data in genomic prediction analyses. Four weighting factors for building a trait-specific genomic relationship matrix were investigated: identical weights, -log10(P) from genome-wide association study results, squares of SNP effects from random regression BLUP, and variable selection based weights (known as BLUP|GA). Predictive ability was measured as the correlation between DRP and direct genomic breeding values in five replications of a fivefold cross-validation. Averaged over the three traits, the highest predictive ability (0.366 ± 0.075) was obtained when only genic SNPs from WGS data were used. Predictive abilities with genic SNPs and all SNPs from HD array data were 0.361 ± 0.072 and 0.353 ± 0.074, respectively. Prediction with -log10(P) or squares of SNP effects as weighting factors for building a genomic relationship matrix or BLUP|GA did not increase accuracy, compared to that with identical weights, regardless of the SNP set used. Our results show that little or no benefit was gained when using all imputed WGS data to perform genomic prediction compared to using HD array data regardless of the weighting factors tested. However, using only genic SNPs from WGS data had a positive effect on prediction ability.
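
    A trait-specific genomic relationship matrix with per-SNP weights can be sketched as follows. The abstract does not give the exact formula, so the normalization here is an assumption: genotypes are centred by twice the allele frequency and the weighted cross-products are divided by the weighted sum of 2p(1-p), which reduces to the familiar ZZ'/(2Σp(1-p)) form when all weights are equal. The function name and toy data are hypothetical.

```python
import math


def weighted_grm(genotypes, weights):
    """Weighted genomic relationship matrix (assumed normalization).

    genotypes: list of individuals, each a list of allele counts (0, 1, 2).
    weights:   one non-negative weight per SNP.
    """
    n, m = len(genotypes), len(weights)
    # Allele frequency of the counted allele at each SNP.
    freqs = [sum(ind[k] for ind in genotypes) / (2 * n) for k in range(m)]
    # Centre each genotype by its expected value 2p.
    z = [[ind[k] - 2 * freqs[k] for k in range(m)] for ind in genotypes]
    scale = sum(w * 2 * p * (1 - p) for w, p in zip(weights, freqs))
    return [
        [sum(weights[k] * z[i][k] * z[j][k] for k in range(m)) / scale
         for j in range(n)]
        for i in range(n)
    ]


# Toy data: four individuals, three SNPs (values invented for illustration).
geno = [[0, 1, 2], [1, 1, 0], [2, 0, 1], [1, 2, 1]]

g_equal = weighted_grm(geno, [1.0, 1.0, 1.0])         # identical weights
pvals = [0.5, 1e-4, 0.02]                             # hypothetical GWAS P values
g_logp = weighted_grm(geno, [-math.log10(p) for p in pvals])
print(g_equal[0][0], g_logp[0][0])
```

    Multiplying all weights by a constant leaves the matrix unchanged, so only the relative weights of SNPs matter in this formulation.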

  2. Current sequencing technology makes microhaplotypes a powerful new type of genetic marker for forensics.

    PubMed

    Kidd, Kenneth K; Pakstis, Andrew J; Speed, William C; Lagacé, Robert; Chang, Joseph; Wootton, Sharon; Haigh, Eva; Kidd, Judith R

    2014-09-01

    SNPs that are molecularly very close (<10 kb) will generally have extremely low recombination rates, much less than 10⁻⁴. Multiple haplotypes will often exist because of the history of the origins of the variants at the different sites, rare recombinants, and the vagaries of random genetic drift and/or selection. Such multiallelic haplotype loci are potentially important in forensic work for individual identification, for defining ancestry, and for identifying familial relationships. The new DNA sequencing capabilities currently available make possible continuous runs of a few hundred base pairs so that we can now determine the allelic combination of multiple SNPs on each chromosome of an individual, i.e., the phase, for multiple SNPs within a small segment of DNA. Therefore, we have begun to identify regions, encompassing two to four SNPs with an extent of <200 bp, that define multiallelic haplotype loci. We have identified candidate regions and have collected pilot data on many candidate microhaplotype loci. Here we present 31 microhaplotype loci that have at least three alleles, have high heterozygosity, are globally informative, and are statistically independent at the population level. This study of microhaplotype loci (microhaps) provides proof of principle that such markers exist and validates their usefulness for ancestry inference, lineage-clan-family inference, and individual identification. The true value of microhaplotypes will come with sequencing methods that can establish alleles unambiguously, including disentangling of mixtures, because a single sequencing run on a single strand of DNA will encompass all of the SNPs. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd. All rights reserved.

  3. Bioinformatic analyses to select phenotype affecting polymorphisms in HTR2C gene.

    PubMed

    Piva, Francesco; Giulietti, Matteo; Baldelli, Luisa; Nardi, Bernardo; Bellantuono, Cesario; Armeni, Tatiana; Saccucci, Franca; Principato, Giovanni

    2011-08-01

    Single nucleotide polymorphisms (SNPs) in serotonin-related genes influence mental disorders and responses to pharmacological and psychotherapeutic treatments. In planning association studies, researchers who want to investigate new SNPs have to select some among a large number of candidates. Our aim is to guide researchers in the selection of the most likely phenotype-affecting polymorphisms. Here, we studied serotonin receptor 2C (HTR2C) SNPs because, until now, only relatively few of about 2000 have been investigated. We used the most updated and assessed bioinformatic tools to predict which variations can give rise to biological effects among 2450 HTR2C SNPs. We suggest 48 SNPs that are worth considering in future association studies in the fields of psychiatry, psychology and pharmacogenomics. Moreover, our analyses point out the biological level probably affected, such as transcription, splicing, miRNA regulation and protein structure, thus suggesting directions for future molecular investigations. Although few association studies are available in the literature, their results are in agreement with our predictions, showing that our selection methods can help to guide future association studies. Copyright © 2011 John Wiley & Sons, Ltd.

  4. Adaptations to Climate-Mediated Selective Pressures in Humans

    PubMed Central

    Hancock, Angela M.; Witonsky, David B.; Alkorta-Aranburu, Gorka; Beall, Cynthia M.; Gebremedhin, Amha; Sukernik, Rem; Utermann, Gerd; Pritchard, Jonathan K.; Coop, Graham; Di Rienzo, Anna

    2011-01-01

    Humans inhabit a remarkably diverse range of environments, and adaptation through natural selection has likely played a central role in the capacity to survive and thrive in extreme climates. Unlike numerous studies that used only population genetic data to search for evidence of selection, here we scan the human genome for selection signals by identifying the SNPs with the strongest correlations between allele frequencies and climate across 61 worldwide populations. We find a striking enrichment of genic and nonsynonymous SNPs relative to non-genic SNPs among those that are strongly correlated with these climate variables. Among the most extreme signals, several overlap with those from GWAS, including SNPs associated with pigmentation and autoimmune diseases. Further, we find an enrichment of strong signals in gene sets related to UV radiation, infection and immunity, and cancer. Our results imply that adaptations to climate shaped the spatial distribution of variation in humans. PMID:21533023

  5. Use of Multiple Metabolic and Genetic Markers to Improve the Prediction of Type 2 Diabetes: the EPIC-Potsdam Study

    PubMed Central

    Schulze, Matthias B.; Weikert, Cornelia; Pischon, Tobias; Bergmann, Manuela M.; Al-Hasani, Hadi; Schleicher, Erwin; Fritsche, Andreas; Häring, Hans-Ulrich; Boeing, Heiner; Joost, Hans-Georg

    2009-01-01

    OBJECTIVE: We investigated whether metabolic biomarkers and single nucleotide polymorphisms (SNPs) improve diabetes prediction beyond age, anthropometry, and lifestyle risk factors. RESEARCH DESIGN AND METHODS: A case-cohort study within a prospective study was designed. We randomly selected a subcohort (n = 2,500) from 26,444 participants, of whom 1,962 were diabetes free at baseline. Of the 801 incident type 2 diabetes cases identified in the cohort during 7 years of follow-up, 579 remained for analyses after exclusions. Prediction models were compared by receiver operating characteristic (ROC) curve and integrated discrimination improvement. RESULTS: Case-control discrimination by the lifestyle characteristics (ROC-AUC: 0.8465) improved with plasma glucose (ROC-AUC: 0.8672, P < 0.001) and A1C (ROC-AUC: 0.8859, P < 0.001). ROC-AUC further improved with HDL cholesterol, triglycerides, γ-glutamyltransferase, and alanine aminotransferase (0.9000, P = 0.002). Twenty SNPs did not improve discrimination beyond these characteristics (P = 0.69). CONCLUSIONS: Metabolic markers, but not genotyping for 20 diabetogenic SNPs, improve discrimination of incident type 2 diabetes beyond lifestyle risk factors. PMID:19720844
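
    The ROC-AUC values quoted in this abstract measure how often a model ranks a case above a non-case. A minimal sketch of that statistic, computed by direct pairwise comparison, is given below. The scores and labels are invented, the function name is hypothetical, and the sketch ignores the sampling weights a case-cohort analysis would normally need.

```python
def roc_auc(scores, labels):
    """Probability that a random case outranks a random non-case.

    scores: predicted risks; labels: 1 for case, 0 for non-case.
    Ties between a case and a non-case count as half a win.
    """
    pos = [s for s, l in zip(scores, labels) if l == 1]
    neg = [s for s, l in zip(scores, labels) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one non-case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical predicted risks for five participants, two of them cases.
scores = [0.90, 0.80, 0.40, 0.35, 0.10]
labels = [1, 0, 1, 0, 0]
auc = roc_auc(scores, labels)
print(round(auc, 4))
```

    Comparing two prediction models then amounts to computing this statistic for each model's scores on the same participants. The pairwise loop is quadratic in sample size, which is fine for a sketch but not for a cohort of this size.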

  6. MultiBLUP: improved SNP-based prediction for complex traits.

    PubMed

    Speed, Doug; Balding, David J

    2014-09-01

    BLUP (best linear unbiased prediction) is widely used to predict complex traits in plant and animal breeding, and increasingly in human genetics. The BLUP mathematical model, which consists of a single random effect term, was adequate when kinships were measured from pedigrees. However, when genome-wide SNPs are used to measure kinships, the BLUP model implicitly assumes that all SNPs have the same effect-size distribution, which is a severe and unnecessary limitation. We propose MultiBLUP, which extends the BLUP model to include multiple random effects, allowing greatly improved prediction when the random effects correspond to classes of SNPs with distinct effect-size variances. The SNP classes can be specified in advance, for example, based on SNP functional annotations, and we also provide an adaptive procedure for determining a suitable partition of SNPs. We apply MultiBLUP to genome-wide association data from the Wellcome Trust Case Control Consortium (seven diseases), and from much larger studies of celiac disease and inflammatory bowel disease, finding that it consistently provides better prediction than alternative methods. Moreover, MultiBLUP is computationally very efficient; for the largest data set, which includes 12,678 individuals and 1.5 M SNPs, the total analysis can be run on a single desktop PC in less than a day and can be parallelized to run even faster. Tools to perform MultiBLUP are freely available in our software LDAK. © 2014 Speed and Balding; Published by Cold Spring Harbor Laboratory Press.

  7. Genetic analysis of ancestry, admixture and selection in Bolivian and Totonac populations of the New World.

    PubMed

    Watkins, W Scott; Xing, Jinchuan; Huff, Chad; Witherspoon, David J; Zhang, Yuhua; Perego, Ugo A; Woodward, Scott R; Jorde, Lynn B

    2012-05-20

    Populations of the Americas were founded by early migrants from Asia, and some have experienced recent genetic admixture. To better characterize the native and non-native ancestry components in populations from the Americas, we analyzed 815,377 autosomal SNPs, mitochondrial hypervariable segments I and II, and 36 Y-chromosome STRs from 24 Mesoamerican Totonacs and 23 South American Bolivians. We analyzed common genomic regions from native Bolivian and Totonac populations to identify 324 highly predictive Native American ancestry informative markers (AIMs). As few as 40-50 of these AIMs perform nearly as well as large panels of random genome-wide SNPs for predicting and estimating Native American ancestry and admixture levels. These AIMs have greater New World vs. Old World specificity than previous AIMs sets. We identify highly-divergent New World SNPs that coincide with high-frequency haplotypes found at similar frequencies in all populations examined, including the HGDP Pima, Maya, Colombian, Karitiana, and Surui American populations. Some of these regions are potential candidates for positive selection. European admixture in the Bolivian sample is approximately 12%, though individual estimates range from 0-48%. We estimate that the admixture occurred ~360-384 years ago. Little evidence of European or African admixture was found in Totonac individuals. Bolivians with pre-Columbian mtDNA and Y-chromosome haplogroups had 5-30% autosomal European ancestry, demonstrating the limitations of Y-chromosome and mtDNA haplogroups and the need for autosomal ancestry informative markers for assessing ancestry in admixed populations.

  8. Performance of the SNPforID 52 SNP-plex assay in paternity testing.

    PubMed

    Børsting, Claus; Sanchez, Juan J; Hansen, Hanna E; Hansen, Anders J; Bruun, Hanne Q; Morling, Niels

    2008-09-01

    The performance of a multiplex assay with 52 autosomal single nucleotide polymorphisms (SNPs) developed for human identification was tested on 124 mother-child-father trios. The typical paternity indices (PIs) were 10⁵-10⁶ for the trios and 10³-10⁴ for the child-father duos. Using the SNP profiles from the randomly selected trios and 700 previously typed individuals, a total of 83,096 comparisons between mother, child and an unrelated man were performed. On average, 9-10 mismatches per comparison were detected. Four mismatches were genetic inconsistencies and 5-6 mismatches were opposite homozygosities. In only two of the 83,096 comparisons did an unrelated man match perfectly to a mother-child duo, and in both cases the PI of the true father was much higher than the PI of the unrelated man. The trios were also typed for 15 short tandem repeats (STRs) and seven variable number of tandem repeats (VNTRs). The typical PIs based on 15 STRs or seven VNTRs were 5-50 times higher than the typical PIs based on 52 SNPs. Six mutations in tandem repeats were detected among the randomly selected trios. In contrast, no mutations were found in the SNP loci. The results showed that the 52 SNP-plex assay is a very useful alternative to currently used methods in relationship testing. The usefulness of SNP markers with low mutation rates in paternity and immigration casework is discussed.
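
    The two kinds of mismatch mentioned in this abstract can be illustrated for biallelic SNPs. The sketch assumes genotypes coded as counts of one allele (0, 1, 2), treats the mother as the true mother, and reads "opposite homozygosity" as the man and child being homozygous for different alleles, with every other Mendelian incompatibility counted as a "genetic inconsistency". That reading, the function names and the toy genotypes are assumptions, not the authors' procedure.

```python
from collections import Counter

# Alleles (0 or 1 copies of the counted allele) a parent can transmit.
TRANSMITTABLE = {0: {0}, 1: {0, 1}, 2: {1}}


def classify_locus(mother, child, man):
    """Classify one SNP in a mother-child-alleged-father comparison."""
    if {man, child} == {0, 2}:
        return "opposite_homozygosity"
    possible = {
        a + b
        for a in TRANSMITTABLE[mother]
        for b in TRANSMITTABLE[man]
    }
    if child not in possible:
        return "genetic_inconsistency"
    return "consistent"


def count_mismatches(mothers, children, men):
    """Tally locus classifications across a SNP panel."""
    return Counter(
        classify_locus(m, c, f) for m, c, f in zip(mothers, children, men)
    )


# Toy five-SNP panel (genotypes invented for illustration).
mother = [0, 2, 1, 0, 1]
child = [1, 2, 1, 0, 2]
man = [0, 0, 2, 1, 1]
counts = count_mismatches(mother, child, man)
print(dict(counts))
```

    At the first locus the child must have inherited the counted allele from the father, which the tested man does not carry, so the mismatch is only detectable because the mother's genotype is known.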

  9. FIFS: A data mining method for informative marker selection in high dimensional population genomic data.

    PubMed

    Kavakiotis, Ioannis; Samaras, Patroklos; Triantafyllidis, Alexandros; Vlahavas, Ioannis

    2017-11-01

    Single Nucleotide Polymorphisms (SNPs) are, nowadays, becoming the marker of choice for biological analyses involving a wide range of applications with great medical, biological, economic and environmental interest. Classification tasks, i.e. the assignment of individuals to groups of origin based on their (multi-locus) genotypes, are performed in many fields such as forensic investigations, discrimination between wild and/or farmed populations and others. These tasks should be performed with a small number of loci, for computational as well as biological reasons. Thus, feature selection should precede classification tasks, especially for Single Nucleotide Polymorphism (SNP) datasets, where the number of features can amount to hundreds of thousands or millions. In this paper, we present a novel data mining approach, called FIFS - Frequent Item Feature Selection, based on the use of frequent items for selection of the most informative markers from population genomic data. It is a modular method, consisting of two main components. The first one identifies the most frequent and unique genotypes for each sampled population. The second one selects the most appropriate among them, in order to create the informative SNP subsets to be returned. The proposed method (FIFS) was tested on a real dataset, which comprised a comprehensive coverage of pig breed types present in Britain. This dataset consisted of 446 individuals divided in 14 sub-populations, genotyped at 59,436 SNPs. Our method outperforms the state-of-the-art and baseline methods in every case. More specifically, our method surpassed the assignment accuracy threshold of 95% needing only half the number of SNPs selected by other methods (FIFS: 28 SNPs, Delta: 70 SNPs, Pairwise FST: 70 SNPs, In: 100 SNPs). CONCLUSION: Our approach successfully deals with the problem of informative marker selection in high dimensional genomic datasets. It offers better results compared to existing approaches and can aid biologists in selecting the most informative markers with maximum discrimination power for optimization of cost-effective panels with applications related to e.g. species identification, wildlife management, and forensics. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. SNP Discovery for mapping alien introgressions in wheat

    PubMed Central

    2014-01-01

    Background Monitoring alien introgressions in crop plants is difficult due to the lack of genetic and molecular mapping information on the wild crop relatives. The tertiary gene pool of wheat is a very important source of genetic variability for wheat improvement against biotic and abiotic stresses. By exploring the 5Mg short arm (5MgS) of Aegilops geniculata, we can apply chromosome genomics for the discovery of SNP markers and their use for monitoring alien introgressions in wheat (Triticum aestivum L). Results The short arm of chromosome 5Mg of Ae. geniculata Roth (syn. Ae. ovata L.; 2n = 4x = 28, UgUgMgMg) was flow-sorted from a wheat line in which it is maintained as a telocentric chromosome. DNA of the sorted arm was amplified and sequenced using an Illumina Hiseq 2000 with ~45x coverage. The sequence data was used for SNP discovery against wheat homoeologous group-5 assemblies. A total of 2,178 unique, 5MgS-specific SNPs were discovered. Randomly selected samples of 59 5MgS-specific SNPs were tested (44 by KASPar assay and 15 by Sanger sequencing) and 84% were validated. Of the selected SNPs, 97% mapped to a chromosome 5Mg addition to wheat (the source of t5MgS), and 94% to 5Mg introgressed from a different accession of Ae. geniculata substituting for chromosome 5D of wheat. The validated SNPs also identified chromosome segments of 5MgS origin in a set of T5D-5Mg translocation lines; eight SNPs (25%) mapped to TA5601 [T5DL · 5DS-5MgS(0.75)] and three (8%) to TA5602 [T5DL · 5DS-5MgS (0.95)]. SNPs (gsnp_5ms83 and gsnp_5ms94), tagging chromosome T5DL · 5DS-5MgS(0.95) with the smallest introgression carrying resistance to leaf rust (Lr57) and stripe rust (Yr40), were validated in two released germplasm lines with Lr57 and Yr40 genes. Conclusion This approach should be widely applicable for the identification of species/genome-specific SNPs. 
The development of a large number of SNP markers will facilitate the precise introgression and monitoring of alien segments in crop breeding programs and further enable mapping and cloning novel genes from the wild relatives of crop plants. PMID:24716476

  11. Maternal single nucleotide polymorphisms in the fatty acid desaturase 1 and 2 coding regions modify the impact of prenatal supplementation with DHA on birth weight

    PubMed Central

    Gonzalez-Casanova, Ines; Rzehak, Peter; Stein, Aryeh D; Garcia Feregrino, Raquel; Dommarco, Juan A Rivera; Barraza-Villarreal, Albino; Demmelmair, Hans; Romieu, Isabelle; Villalpando, Salvador; Martorell, Reynaldo; Koletzko, Berthold; Ramakrishnan, Usha

    2016-01-01

    Background: Specific single nucleotide polymorphisms (SNPs) in the fatty acid desaturase (FADS) gene affect the activity and efficiency of enzymes that are responsible for the conversion of polyunsaturated fatty acids (PUFAs) into their long-chain active form. A high prevalence of SNPs that are associated with slow PUFA conversion has been described in Hispanic populations. Objective: We assessed the heterogeneity of the effect of prenatal supplementation with docosahexaenoic acid (DHA) on birth weight across selected FADS SNPs in a sample of Mexican women and their offspring. Design: We obtained information on the maternal genotype from stored blood samples of 654 women who received supplementation with 400 mg DHA/d or a placebo from weeks 18 to 22 of gestation through delivery as part of a randomized controlled trial conducted in Cuernavaca, Mexico. We selected 4 tag SNPs (rs174455, rs174556, rs174602, and rs498793) in the FADS region for analysis. We used an ANOVA to test for the heterogeneity of the effect on birth weight across each of the 4 SNPs. Results: The mean ± SD birth weight was 3210 ± 470 g, and the weight-for-age z score (WAZ) was −0.24 ± 1.00. There were no intention-to-treat differences in birth weights. We showed significant heterogeneity by SNP rs174602 (P = 0.02); offspring of carriers of alleles TT and TC in the intervention group were heavier than those in the placebo group (WAZ: −0.13 ± 0.14 and −0.20 ± 0.08 compared with −0.55 ± 0.15 and −0.39 ± 0.09, respectively); there were no significant differences in offspring of rs174602 CC homozygotes (WAZ: −0.26 ± 0.09 in the intervention group compared with −0.04 ± 0.09 in the placebo group). We showed no significant heterogeneity across the other 3 FADS SNPs. Conclusion: Differential responses to prenatal DHA supplementation on the basis of the genetic makeup of target populations could explain the mixed evidence of the impact of DHA supplementation on birth weight. 
This trial was registered at clinicaltrials.gov as NCT00646360. PMID:26912491

  12. Maternal single nucleotide polymorphisms in the fatty acid desaturase 1 and 2 coding regions modify the impact of prenatal supplementation with DHA on birth weight.

    PubMed

    Gonzalez-Casanova, Ines; Rzehak, Peter; Stein, Aryeh D; Garcia Feregrino, Raquel; Rivera Dommarco, Juan A; Barraza-Villarreal, Albino; Demmelmair, Hans; Romieu, Isabelle; Villalpando, Salvador; Martorell, Reynaldo; Koletzko, Berthold; Ramakrishnan, Usha

    2016-04-01

    Specific single nucleotide polymorphisms (SNPs) in the fatty acid desaturase (FADS) gene affect the activity and efficiency of enzymes that are responsible for the conversion of polyunsaturated fatty acids (PUFAs) into their long-chain active form. A high prevalence of SNPs that are associated with slow PUFA conversion has been described in Hispanic populations. We assessed the heterogeneity of the effect of prenatal supplementation with docosahexaenoic acid (DHA) on birth weight across selected FADS SNPs in a sample of Mexican women and their offspring. We obtained information on the maternal genotype from stored blood samples of 654 women who received supplementation with 400 mg DHA/d or a placebo from weeks 18 to 22 of gestation through delivery as part of a randomized controlled trial conducted in Cuernavaca, Mexico. We selected 4 tag SNPs (rs174455, rs174556, rs174602, and rs498793) in the FADS region for analysis. We used an ANOVA to test for the heterogeneity of the effect on birth weight across each of the 4 SNPs. The mean ± SD birth weight was 3210 ± 470 g, and the weight-for-age z score (WAZ) was -0.24 ± 1.00. There were no intention-to-treat differences in birth weights. We showed significant heterogeneity by SNP rs174602 (P= 0.02); offspring of carriers of alleles TT and TC in the intervention group were heavier than those in the placebo group (WAZ: -0.13 ± 0.14 and -0.20 ± 0.08 compared with -0.55 ± 0.15 and -0.39 ± 0.09, respectively); there were no significant differences in offspring of rs174602 CC homozygotes (WAZ: -0.26 ± 0.09 in the intervention group compared with -0.04 ± 0.09 in the placebo group). We showed no significant heterogeneity across the other 3 FADS SNPs. Differential responses to prenatal DHA supplementation on the basis of the genetic makeup of target populations could explain the mixed evidence of the impact of DHA supplementation on birth weight. This trial was registered at clinicaltrials.gov as NCT00646360. 
© 2016 American Society for Nutrition.

  13. Targeted resequencing in peanuts using the fluidigm access array

    USDA-ARS's Scientific Manuscript database

    The presence of homoeologous gene copies in allotetraploid peanut makes it challenging to select homologous SNPs differentiating two or more cultivars. An integrated approach of improved bioinformatics and targeted resequencing to select homologous SNPs in tetraploid peanut is needed. Raw transcrip...

  14. The Relationship between Smoking and Replicated Sequence Variants on Chromosomes 8 and 9 with Familial Intracranial Aneurysm

    PubMed Central

    Deka, Ranjan; Koller, Daniel L.; Lai, Dongbing; Indugula, Subba Rao; Sun, Guangyun; Woo, Daniel; Sauerbeck, Laura; Moomaw, Charles J.; Hornung, Richard; Connolly, E. Sander; Anderson, Craig; Rouleau, Guy; Meissner, Irene; Bailey-Wilson, Joan E.; Huston, John; Brown, Robert D.; Kleindorfer, Dawn O.; Flaherty, Matthew L.; Langefeld, Carl; Foroud, Tatiana; Broderick, Joseph P.

    2010-01-01

    Purpose To replicate the previous association of single nucleotide polymorphisms (SNPs) with risk of intracranial aneurysm (IA) and to examine the relationship of smoking with these variants and the risk of IA. Methods White probands with an IA from families with multiple affected members were identified by 26 clinical centers located throughout North America, New Zealand, and Australia. White controls free of stroke and IA were selected by random digit dialing from the Greater Cincinnati population. SNPs previously associated with IA on chromosomes 2, 8, and 9 were genotyped using a TaqMan assay or were included in the Affymetrix 6.0 array that was part of a genome-wide association study of 406 IA cases and 392 controls. Logistic regression modeling tested whether the association of replicated SNPs with IA was modulated by smoking. Results The strongest evidence of association with IA was found with the 8q SNP rs10958409 (genotypic P = 9.2 × 10^-5; allelic P = 1.3 × 10^-5; OR = 1.86, 95% CI: 1.40–2.47). We also replicated association with both SNPs on chromosome 9p, rs1333040 and rs10757278, but were not able to replicate the previously reported association of the two SNPs on chromosome 2q. Statistical testing showed a multiplicative relationship between the risk alleles and smoking with regard to the risk of IA. Conclusion Our data provide complementary evidence that the variants on chromosome 8q and 9p are associated with IA and that the risk of IA in patients with these variants is greatly increased with cigarette smoking. PMID:20190001
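
The odds ratio and 95% confidence interval quoted in the abstract above come from the study's own models. As a rough illustration of how such a figure is typically formed, the sketch below computes an unadjusted allelic odds ratio with a Wald (Woolf) interval from a 2x2 table. The function name and all counts are hypothetical and are not taken from the study, which used logistic regression.

```python
import math

def odds_ratio_with_ci(case_risk, case_other, control_risk, control_other, z=1.96):
    """Unadjusted odds ratio and Wald (Woolf) interval from a 2x2 allele table."""
    cells = (case_risk, case_other, control_risk, control_other)
    if min(cells) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (case_risk * control_other) / (case_other * control_risk)
    # Standard error of log(OR) is the root of the summed reciprocal counts.
    se_log_or = math.sqrt(sum(1.0 / c for c in cells))
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical allele counts for illustration only (not study data).
or_value, ci_low, ci_high = odds_ratio_with_ci(30, 70, 20, 80)
print(f"OR = {or_value:.2f}, 95% CI: {ci_low:.2f}-{ci_high:.2f}")
```

An interval that excludes 1 corresponds to a nominally significant association at the chosen level; covariate adjustment, as in the study, would require a regression model instead.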

  15. Differential effects of two probiotics on the risks of eczema and atopy associated with single nucleotide polymorphisms to Toll-like receptors.

    PubMed

    Marlow, Gareth; Han, Dug Yeo; Wickens, Kristin; Stanley, Thorsten; Crane, Julian; Mitchell, Edwin A; Dekker, James; Barthow, Christine; Fitzharris, Penny; Ferguson, Lynnette R; Morgan, Angharad R

    2015-05-01

    There is strong evidence to support a genetic predisposition to eczema and more recently studies have suggested that probiotics might be used to prevent eczema by modifying the expression of putative allergy-associated genes. The aim of this present study was to investigate whether two probiotics, Lactobacillus rhamnosus HN001 (HN001) and Bifidobacterium animalis subsp. lactis HN019 (HN019), can modify the known genetic predisposition to eczema conferred by genetic variation in the Toll-like receptor (TLR) genes in a high-risk infant population. We selected 54 SNPs in the Toll-like receptor genes. These SNPs were analysed in 331 children of sole European ancestry as part of a double-blind, randomized, placebo-controlled trial examining the effects of HN001 and HN019 supplementation on eczema development and atopic sensitization. The data showed that 26 TLR SNPs interacted with HN001 resulting in a significantly reduced risk of eczema, 18 for eczema severity as defined by SCORAD ≥ 10 and 20 for atopic sensitization compared to placebo. There were only two SNPs that interacted with HN019 resulting in a reduced risk of eczema, eczema severity or atopy. This is the first study to show that the negative impact of specific TLR genotypes may be positively affected by probiotic supplementation. HN001 exhibits a much stronger effect than HN019 in this respect. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  16. Genetic association with low concentrations of high density lipoprotein-cholesterol in a pediatric population of the Middle East and North Africa: the CASPIAN-III study.

    PubMed

    Kelishadi, Roya; Haghjooy Javanmard, Shaghayegh; Tajadini, Mohammad Hasan; Mansourian, Marjan; Motlagh, Mohammad Esmaeil; Ardalan, Gelayol; Ban, Matthew

    2014-11-01

    Depressed high-density lipoprotein cholesterol (HDL-C) is prevalent in the Middle East and North Africa. Some studies have documented associations between HDL-C and several single nucleotide polymorphisms (SNPs) in candidate genes. We investigated the associations between SNP genotypes and HDL-C levels in Iranian students, aged 10-18 years. Genotyping was performed in 750 randomly selected participants among those with low HDL-C levels (below the 5th percentile), intermediate HDL-C levels (5th-95th percentile) and high HDL-C levels (above the 95th percentile). Minor allele frequencies (MAFs) of the SNPs of interest were compared between the three HDL-C groups. The vast majority of pairwise comparisons of MAFs between HDL-C groups were significant. Pairwise comparisons between low and high HDL-C groups showed significant between-group differences in MAFs for all SNPs, except for APOC3 rs5128. Pairwise comparisons between low and intermediate HDL-C groups showed significant between-group differences in MAFs for all SNPs, except for APOC3 rs5128 and APOA1 rs2893157. Pairwise comparisons between intermediate and high HDL-C groups showed significant between-group differences in MAFs for all SNPs, except for ABCA1 APOC3 rs5128 and APOA1 rs2893157. After adjustment for confounding factors, including age, sex, body mass index, low physical activity, consumption of saturated fats, and socioeconomic status, ABCA1 r1587K and CETP A373P significantly increased the risk of depressed HDL-C, and CETP Taq1 had a protective role. This study replicated several associations between HDL-C levels and candidate gene SNPs from genome-wide associations with HDL-C in Iranians from the pediatric age group. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
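
The comparison of minor allele frequencies between HDL-C groups described above can be illustrated with a small sketch. It assumes genotypes are encoded as two-character strings such as "AG" and defines the minor allele on the pooled sample so that the same allele is tracked in every group. The encoding, function names and genotypes are illustrative assumptions, not the study's pipeline or data.

```python
from collections import Counter

def allele_counts(genotypes):
    """Count alleles in a list of two-character genotype strings."""
    return Counter(allele for genotype in genotypes for allele in genotype)

def minor_allele(groups):
    """Return the least common allele across all groups pooled together."""
    pooled = Counter()
    for genotypes in groups.values():
        pooled.update(allele_counts(genotypes))
    return min(pooled, key=pooled.get)

def allele_frequency(genotypes, allele):
    """Frequency of `allele` among all alleles in one group."""
    counts = allele_counts(genotypes)
    return counts[allele] / sum(counts.values())

# Hypothetical genotypes at one biallelic SNP (not study data).
groups = {
    "low_hdl": ["AA", "AG", "GG", "AG", "AA"],
    "high_hdl": ["AA", "AA", "AG", "AA", "AA"],
}
minor = minor_allele(groups)
maf_by_group = {name: allele_frequency(g, minor) for name, g in groups.items()}
print(minor, maf_by_group)
```

Whether a difference between groups is significant would still need a statistical test, which this sketch does not perform.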

  17. Associations between Potentially Modifiable Risk Factors and Alzheimer Disease: A Mendelian Randomization Study

    PubMed Central

    Østergaard, Søren D.; Mukherjee, Shubhabrata; Sharp, Stephen J.; Proitsi, Petroula; Lotta, Luca A.; Day, Felix; Perry, John R. B.; Boehme, Kevin L.; Walter, Stefan; Kauwe, John S.; Gibbons, Laura E.; Larson, Eric B.; Powell, John F.; Langenberg, Claudia; Crane, Paul K.; Wareham, Nicholas J.; Scott, Robert A.

    2015-01-01

    Background Potentially modifiable risk factors including obesity, diabetes, hypertension, and smoking are associated with Alzheimer disease (AD) and represent promising targets for intervention. However, the causality of these associations is unclear. We sought to assess the causal nature of these associations using Mendelian randomization (MR). Methods and Findings We used SNPs associated with each risk factor as instrumental variables in MR analyses. We considered type 2 diabetes (T2D, N SNPs = 49), fasting glucose (N SNPs = 36), insulin resistance (N SNPs = 10), body mass index (BMI, N SNPs = 32), total cholesterol (N SNPs = 73), HDL-cholesterol (N SNPs = 71), LDL-cholesterol (N SNPs = 57), triglycerides (N SNPs = 39), systolic blood pressure (SBP, N SNPs = 24), smoking initiation (N SNPs = 1), smoking quantity (N SNPs = 3), university completion (N SNPs = 2), and years of education (N SNPs = 1). We calculated MR estimates of associations between each exposure and AD risk using an inverse-variance weighted approach, with summary statistics of SNP–AD associations from the International Genomics of Alzheimer’s Project, comprising a total of 17,008 individuals with AD and 37,154 cognitively normal elderly controls. We found that genetically predicted higher SBP was associated with lower AD risk (odds ratio [OR] per standard deviation [15.4 mm Hg] of SBP [95% CI]: 0.75 [0.62–0.91]; p = 3.4 × 10^−3). Genetically predicted higher SBP was also associated with a higher probability of taking antihypertensive medication (p = 6.7 × 10^−8). Genetically predicted smoking quantity was associated with lower AD risk (OR per ten cigarettes per day [95% CI]: 0.67 [0.51–0.89]; p = 6.5 × 10^−3), although we were unable to stratify by smoking history; genetically predicted smoking initiation was not associated with AD risk (OR = 0.70 [0.37, 1.33]; p = 0.28). 
We saw no evidence of causal associations between glycemic traits, T2D, BMI, or educational attainment and risk of AD (all p > 0.1). Potential limitations of this study include the small proportion of intermediate trait variance explained by genetic variants and other implicit limitations of MR analyses. Conclusions Inherited lifetime exposure to higher SBP is associated with lower AD risk. These findings suggest that higher blood pressure—or some environmental exposure associated with higher blood pressure, such as use of antihypertensive medications—may reduce AD risk. PMID:26079503
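
The inverse-variance weighted approach mentioned in the abstract above combines per-SNP ratio estimates into a single causal estimate. The sketch below shows the common fixed-effect form with first-order weights. It is an assumption that this matches the variant used in the study, and the summary statistics are invented for illustration.

```python
import math

def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance weighted MR estimate from summary statistics.

    Each SNP contributes the Wald ratio beta_outcome / beta_exposure,
    weighted by beta_exposure**2 / se_outcome**2 (first-order weights).
    """
    weights = [bx ** 2 / se ** 2 for bx, se in zip(beta_exposure, se_outcome)]
    ratios = [by / bx for bx, by in zip(beta_exposure, beta_outcome)]
    total_weight = sum(weights)
    estimate = sum(w * r for w, r in zip(weights, ratios)) / total_weight
    std_error = math.sqrt(1.0 / total_weight)
    return estimate, std_error

# Hypothetical per-SNP effects (exposure in SD units, outcome in log odds).
bx = [0.10, 0.20, 0.15]
by = [-0.02, -0.05, -0.03]
se_y = [0.010, 0.020, 0.015]

log_or, se = ivw_estimate(bx, by, se_y)
odds_ratio = math.exp(log_or)
ci = (math.exp(log_or - 1.96 * se), math.exp(log_or + 1.96 * se))
print(f"OR per SD = {odds_ratio:.2f}, 95% CI: {ci[0]:.2f}-{ci[1]:.2f}")
```

Exponentiating the pooled log odds gives an odds ratio per unit of exposure, which is the scale on which the abstract reports its results.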

  18. CLUSTAG: hierarchical clustering and graph methods for selecting tag SNPs.

    PubMed

    Ao, S I; Yip, Kevin; Ng, Michael; Cheung, David; Fong, Pui-Yee; Melhado, Ian; Sham, Pak C

    2005-04-15

    Cluster and set-cover algorithms are developed to obtain a set of tag single nucleotide polymorphisms (SNPs) that can represent all the known SNPs in a chromosomal region, subject to the constraint that all SNPs must have a squared correlation R^2 > C with at least one tag SNP, where C is specified by the user. Availability: http://hkumath.hku.hk/web/link/CLUSTAG/CLUSTAG.html Contact: mng@maths.hku.hk
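
The coverage constraint described above (every SNP must have squared correlation above C with at least one tag SNP) can be illustrated with a generic greedy set-cover heuristic. This is a minimal sketch of the general idea, not CLUSTAG's own algorithm; the function name, SNP labels and correlation values are hypothetical.

```python
def greedy_tag_snps(snps, pair_r2, threshold):
    """Pick tag SNPs greedily so every SNP is a tag or has r^2 > threshold with one.

    `pair_r2` maps unordered SNP pairs, stored as (a, b) tuples, to their
    squared correlation; missing pairs are treated as uncorrelated.
    """
    def r2(a, b):
        return pair_r2.get((a, b), pair_r2.get((b, a), 0.0))

    # Each SNP covers itself plus every SNP it is strongly correlated with.
    covers = {s: {t for t in snps if t == s or r2(s, t) > threshold} for s in snps}
    uncovered = set(snps)
    tags = []
    while uncovered:
        # Choose the SNP that covers the most still-uncovered SNPs.
        best = max(snps, key=lambda s: len(covers[s] & uncovered))
        tags.append(best)
        uncovered -= covers[best]
    return tags

# Hypothetical region with four SNPs (values are illustrative only).
snps = ["snp1", "snp2", "snp3", "snp4"]
pair_r2 = {
    ("snp1", "snp2"): 0.90,
    ("snp1", "snp3"): 0.85,
    ("snp2", "snp3"): 0.70,
    ("snp1", "snp4"): 0.10,
}
tags = greedy_tag_snps(snps, pair_r2, threshold=0.8)
print(tags)
```

Because each SNP always covers itself, the loop terminates with every SNP covered. Greedy selection does not guarantee the smallest possible tag set.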

  19. Elastic-net regularization approaches for genome-wide association studies of rheumatoid arthritis.

    PubMed

    Cho, Seoae; Kim, Haseong; Oh, Sohee; Kim, Kyunga; Park, Taesung

    2009-12-15

    The current trend in genome-wide association studies is to identify regions where the true disease-causing genes may lie by evaluating thousands of single-nucleotide polymorphisms (SNPs) across the whole genome. However, many challenges exist in detecting disease-causing genes among the thousands of SNPs. Examples include multicollinearity and multiple testing issues, especially when a large number of correlated SNPs are simultaneously tested. Multicollinearity can often occur when predictor variables in a multiple regression model are highly correlated, and can cause imprecise estimation of association. In this study, we propose a simple stepwise procedure that identifies disease-causing SNPs simultaneously by employing elastic-net regularization, a variable selection method that allows one to address multicollinearity. At Step 1, the single-marker association analysis was conducted to screen SNPs. At Step 2, the multiple-marker association was scanned based on the elastic-net regularization. The proposed approach was applied to the rheumatoid arthritis (RA) case-control data set of Genetic Analysis Workshop 16. While the selected SNPs at the screening step are located mostly on chromosome 6, the elastic-net approach identified putative RA-related SNPs on other chromosomes in an increased proportion. For some of those putative RA-related SNPs, we identified the interactions with sex, a well known factor affecting RA susceptibility.

  20. Interaction between arsenic exposure from drinking water and genetic susceptibility in carotid intima–media thickness in Bangladesh

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, Fen; Department of Environmental Medicine, New York University School of Medicine, New York, NY; Jasmine, Farzana

    Epidemiologic studies that evaluated genetic susceptibility for the effects of arsenic exposure from drinking water on subclinical atherosclerosis are limited. We conducted a cross-sectional study of 1078 participants randomly selected from the Health Effects of Arsenic Longitudinal Study in Bangladesh to evaluate whether the association between arsenic exposure and carotid artery intima–media thickness (cIMT) differs by 207 single-nucleotide polymorphisms (SNPs) in 18 genes related to arsenic metabolism, oxidative stress, inflammation, and endothelial dysfunction. Although not statistically significant after correcting for multiple testing, nine SNPs in APOE, AS3MT, PNP, and TNF genes had a nominally statistically significant interaction with well-water arsenic in cIMT. For instance, the joint presence of a higher level of well-water arsenic (≥ 40.4 μg/L) and the GG genotype of AS3MT rs3740392 was associated with a difference of 40.9 μm (95% CI = 14.4, 67.5) in cIMT, much greater than the difference of cIMT associated with the genotype alone (β = −5.1 μm, 95% CI = −31.6, 21.3) or arsenic exposure alone (β = 7.2 μm, 95% CI = −3.1, 17.5). The pattern and magnitude of the interactions were similar when urinary arsenic was used as the exposure variable. Additionally, the at-risk genotypes of the AS3MT SNPs were positively related to the proportion of monomethylarsonic acid (MMA) in urine, which is indicative of arsenic methylation capacity. The findings provide novel evidence that genetic variants related to arsenic metabolism may play an important role in arsenic-induced subclinical atherosclerosis. Future replication studies in diverse populations are needed to confirm the findings. - Highlights: • Nine SNPs had a nominally significant interaction with well-water arsenic in cIMT. • Three SNPs in AS3MT showed nominally significant interactions with urinary arsenic. • cIMT was much higher among subjects with higher arsenic exposure and AS3MT SNPs. 
• The at-risk genotypes of AS3MT SNPs were positively related to urinary MMA%.

  1. Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo)

    PubMed Central

    2012-01-01

    Background The turkey (Meleagris gallopavo) is an important agricultural species and the second largest contributor to the world’s poultry meat production. Genetic improvement is attributed largely to selective breeding programs that rely on highly heritable phenotypic traits, such as body size and breast muscle development. Commercial breeding with small effective population sizes and epistasis can result in loss of genetic diversity, which in turn can lead to reduced individual fitness and reduced response to selection. The presence of genomic diversity in domestic livestock species is therefore of great importance and a prerequisite for rapid and accurate genetic improvement of selected breeds in various environments, as well as to facilitate rapid adaptation to potential changes in breeding goals. Genomic selection requires a large number of genetic markers such as single nucleotide polymorphisms (SNPs), the most abundant source of genetic variation within the genome. Results Alignment of next generation sequencing data of 32 individual turkeys from different populations was used for the discovery of 5.49 million SNPs, which subsequently were used for the analysis of genetic diversity among the different populations. All of the commercial lines branched from a single node relative to the heritage varieties and the South Mexican turkey population. Heterozygosity of all individuals from the different turkey populations ranged from 0.17-2.73 SNPs/Kb, while heterozygosity of populations ranged from 0.73-1.64 SNPs/Kb. The average frequency of heterozygous SNPs in individual turkeys was 1.07 SNPs/Kb. Five genomic regions with very low nucleotide variation were identified in domestic turkeys that showed a state of fixation towards alleles different from wild alleles. Conclusion The turkey genome is much less diverse with a relatively low frequency of heterozygous SNPs as compared to other livestock species like chicken and pig. 
The whole genome SNP discovery study in turkey resulted in the detection of 5.49 million putative SNPs compared to the reference genome. All commercial lines appear to share a common origin. Presence of different alleles/haplotypes in the SM population highlights that specific haplotypes have been selected in the modern domesticated turkey. PMID:22891612

  2. Identification of KIF3A as a Novel Candidate Gene for Childhood Asthma Using RNA Expression and Population Allelic Frequencies Differences

    PubMed Central

    Butsch Kovacic, Melinda; Biagini Myers, Jocelyn M.; Wang, Ning; Martin, Lisa J.; Lindsey, Mark; Ericksen, Mark B.; He, Hua; Patterson, Tia L.; Baye, Tesfaye M.; Torgerson, Dara; Roth, Lindsey A.; Gupta, Jayanta; Sivaprasad, Umasundari; Gibson, Aaron M.; Tsoras, Anna M.; Hu, Donglei; Eng, Celeste; Chapela, Rocío; Rodríguez-Santana, José R.; Rodríguez-Cintrón, William; Avila, Pedro C.; Beckman, Kenneth; Seibold, Max A.; Gignoux, Chris; Musaad, Salma M.; Chen, Weiguo; Burchard, Esteban González; Khurana Hershey, Gurjit K.

    2011-01-01

    Background Asthma is a chronic inflammatory disease with a strong genetic predisposition. A major challenge for candidate gene association studies in asthma is the selection of biologically relevant genes. Methodology/Principal Findings Using epithelial RNA expression arrays, HapMap allele frequency variation, and the literature, we identified six possible candidate susceptibility genes for childhood asthma including ADCY2, DNAH5, KIF3A, PDE4B, PLAU, SPRR2B. To evaluate these genes, we compared the genotypes of 194 predominantly tagging SNPs in 790 asthmatic, allergic and non-allergic children. We found that SNPs in all six genes were nominally associated with asthma (p<0.05) in our discovery cohort and in three independent cohorts at either the SNP or gene level (p<0.05). Further, we determined that our selection approach was superior to random selection of genes either differentially expressed in asthmatics compared to controls (p = 0.0049) or selected based on the literature alone (p = 0.0049), substantiating the validity of our gene selection approach. Importantly, we observed that 7 of 9 SNPs in the KIF3A gene more than doubled the odds of asthma (OR = 2.3, p<0.0001) and increased the odds of allergic disease (OR = 1.8, p<0.008). Our data indicate that KIF3A rs7737031 (T-allele) has an asthma population attributable risk of 18.5%. The association between KIF3A rs7737031 and asthma was validated in 3 independent populations, further substantiating the validity of our gene selection approach. Conclusions/Significance Our study demonstrates that KIF3A, a member of the kinesin superfamily of microtubule associated motors that are important in the transport of protein complexes within cilia, is a novel candidate gene for childhood asthma. Polymorphisms in KIF3A may in part be responsible for poor mucus and/or allergen clearance from the airways. 
Furthermore, our study provides a promising framework for the identification and evaluation of novel candidate susceptibility genes. PMID:21912604

  3. The search for loci under selection: trends, biases and progress.

    PubMed

    Ahrens, Collin W; Rymer, Paul D; Stow, Adam; Bragg, Jason; Dillon, Shannon; Umbers, Kate D L; Dudaniec, Rachael Y

    2018-03-01

    Detecting genetic variants under selection using F_ST outlier analysis (OA) and environmental association analyses (EAAs) are popular approaches that provide insight into the genetic basis of local adaptation. Despite the frequent use of OA and EAA approaches and their increasing attractiveness for detecting signatures of selection, their application to field-based empirical data has not been synthesized. Here, we review 66 empirical studies that use Single Nucleotide Polymorphisms (SNPs) in OA and EAA. We report trends and biases across biological systems, sequencing methods, approaches, parameters, environmental variables and their influence on detecting signatures of selection. We found striking variability in both the use and reporting of environmental data and statistical parameters. For example, linkage disequilibrium among SNPs and numbers of unique SNP associations identified with EAA were rarely reported. The proportion of putatively adaptive SNPs detected varied widely among studies, and decreased with the number of SNPs analysed. We found that genomic sampling effort had a greater impact than biological sampling effort on the proportion of identified SNPs under selection. OA identified a higher proportion of outliers when more individuals were sampled, but this was not the case for EAA. To facilitate repeatability, interpretation and synthesis of studies detecting selection, we recommend that future studies consistently report geographical coordinates, environmental data, model parameters, linkage disequilibrium, and measures of genetic structure. Identifying standards for how OA and EAA studies are designed and reported will aid future transparency and comparability of SNP-based selection studies and help to progress landscape and evolutionary genomics. © 2018 John Wiley & Sons Ltd.

  4. CIDR

    Science.gov Websites

    they have high Illumina design scores or have worked in other experiments. For Golden Gate experiments design files returned by Illumina. Efficient SNP selection: The most efficient way to select your SNPs is to get Illumina design scores on all of your possible SNPs prior to narrowing down your list. You can

  5. Genetic analysis of ancestry, admixture and selection in Bolivian and Totonac populations of the New World

    PubMed Central

    2012-01-01

    Background Populations of the Americas were founded by early migrants from Asia, and some have experienced recent genetic admixture. To better characterize the native and non-native ancestry components in populations from the Americas, we analyzed 815,377 autosomal SNPs, mitochondrial hypervariable segments I and II, and 36 Y-chromosome STRs from 24 Mesoamerican Totonacs and 23 South American Bolivians. Results and Conclusions We analyzed common genomic regions from native Bolivian and Totonac populations to identify 324 highly predictive Native American ancestry informative markers (AIMs). As few as 40–50 of these AIMs perform nearly as well as large panels of random genome-wide SNPs for predicting and estimating Native American ancestry and admixture levels. These AIMs have greater New World vs. Old World specificity than previous AIMs sets. We identify highly-divergent New World SNPs that coincide with high-frequency haplotypes found at similar frequencies in all populations examined, including the HGDP Pima, Maya, Colombian, Karitiana, and Surui American populations. Some of these regions are potential candidates for positive selection. European admixture in the Bolivian sample is approximately 12%, though individual estimates range from 0–48%. We estimate that the admixture occurred ~360–384 years ago. Little evidence of European or African admixture was found in Totonac individuals. Bolivians with pre-Columbian mtDNA and Y-chromosome haplogroups had 5–30% autosomal European ancestry, demonstrating the limitations of Y-chromosome and mtDNA haplogroups and the need for autosomal ancestry informative markers for assessing ancestry in admixed populations. PMID:22606979

  6. Comparing strategies for selection of low-density SNPs for imputation-mediated genomic prediction in U. S. Holsteins.

    PubMed

    He, Jun; Xu, Jiaqi; Wu, Xiao-Lin; Bauck, Stewart; Lee, Jungjae; Morota, Gota; Kachman, Stephen D; Spangler, Matthew L

    2018-04-01

    SNP chips are commonly used for genotyping animals in genomic selection but strategies for selecting low-density (LD) SNPs for imputation-mediated genomic selection have not been addressed adequately. The main purpose of the present study was to compare the performance of eight LD (6K) SNP panels, each selected by a different strategy exploiting a combination of three major factors: evenly-spaced SNPs, increased minor allele frequencies, and SNP-trait associations either for single traits independently or for all the three traits jointly. The imputation accuracies from 6K to 80K SNP genotypes were between 96.2 and 98.2%. Genomic prediction accuracies obtained using imputed 80K genotypes were between 0.817 and 0.821 for daughter pregnancy rate, between 0.838 and 0.844 for fat yield, and between 0.850 and 0.863 for milk yield. The two SNP panels optimized on the three major factors had the highest genomic prediction accuracy (0.821-0.863), and these accuracies were very close to those obtained using observed 80K genotypes (0.825-0.868). Further exploration of the underlying relationships showed that genomic prediction accuracies did not respond linearly to imputation accuracies, but were significantly affected by genotype (imputation) errors of SNPs in association with the traits to be predicted. SNPs optimal for map coverage and MAF were favorable for obtaining accurate imputation of genotypes whereas trait-associated SNPs improved genomic prediction accuracies. Thus, optimal LD SNP panels were the ones that combined both strengths. The present results have practical implications on the design of LD SNP chips for imputation-enabled genomic prediction.

  7. Genetic variants in VEGF pathway genes in neoadjuvant breast cancer patients receiving bevacizumab: Results from the randomized phase III GeparQuinto study.

    PubMed

    Hein, Alexander; Lambrechts, Diether; von Minckwitz, Gunter; Häberle, Lothar; Eidtmann, Holger; Tesch, Hans; Untch, Michael; Hilfrich, Jörn; Schem, Christian; Rezai, Mahdi; Gerber, Bernd; Dan Costa, Serban; Blohmer, Jens-Uwe; Schwedler, Kathrin; Kittel, Kornelia; Fehm, Tanja; Kunz, Georg; Beckmann, Matthias W; Ekici, Arif B; Hanusch, Claus; Huober, Jens; Liedtke, Cornelia; Mau, Christine; Moisse, Matthieu; Müller, Volkmar; Nekljudova, Valentina; Peuteman, Gilian; Rack, Brigitte; Rübner, Matthias; Van Brussel, Thomas; Wang, Liewei; Weinshilboum, Richard M; Loibl, Sibylle; Fasching, Peter A

    2015-12-15

    Studies assessing the effect of bevacizumab (BEV) on breast cancer (BC) outcome have shown different effects on progression-free and overall survival, suggesting that a subgroup of patients may benefit from this treatment. Unfortunately, no biomarkers exist to identify these patients. Here, we investigate whether single nucleotide polymorphisms (SNPs) in VEGF pathway genes correlate with pathological complete response (pCR) in the neoadjuvant GeparQuinto trial. HER2-negative patients were randomized into treatment arms receiving either BEV combined with standard chemotherapy or chemotherapy alone. In a pre-planned biomarker study, DNA was collected from 729 and 724 patients, respectively from both treatment arms, and genotyped for 125 SNPs. Logistic regression assessed interaction between individual SNPs and both treatment arms to predict pCR. Five SNPs may be associated with a better response to BEV, but none of them remained significant after correction for multiple testing. The two SNPs most strongly associated, rs833058 and rs699947, were located upstream of the VEGF-A promoter. Odds ratios for the homozygous common, heterozygous and homozygous rare rs833058 genotypes were 2.36 (95% CI, 1.49-3.75), 1.20 (95% CI, 0.88-1.64) and 0.61 (95% CI, 0.34-1.12). Notably, some SNPs in VEGF-A exhibited a more pronounced effect in the triple-negative subgroup. Several SNPs in VEGF-A may be associated with improved pCR when receiving BEV in the neoadjuvant setting. Although none of the observed effects survived correction for multiple testing, our observations are consistent with previous studies on BEV efficacy in BC. Further research is warranted to clarify the predictive value of these markers. © 2015 UICC.

  8. Relationship Between Some Single-nucleotide Polymorphism and Response to Hydroxyurea Therapy in Iranian Patients With β-Thalassemia Intermedia.

    PubMed

    Karimi, Mehran; Zarei, Tahereh; Haghpanah, Sezaneh; Moghadam, Mohamad; Ebrahimi, Ahmad; Rezaei, Narges; Heidari, Ghazaleh; Vazin, Afsaneh; Khavari, Maryam; Miri, Hamid R

    2017-05-01

    To evaluate the possible relationship between hydroxyurea (HU) response and some single-nucleotide polymorphism (SNP) in patients affected by β-thalassemia intermedia. In this cross-sectional study, 100 β-thalassemia intermedia patients who were taking HU with a dose of 8 to 15 mg/kg body weight per day for a period of at least 6 months were randomly selected between February 2013 and October 2014 in southern Iran. HU response was defined based on decrease or cessation of the blood transfusion need and evaluation of Hb level. In univariate analysis, from all evaluated SNPs, only rs10837814 SNP of olfactory receptors (ORs) OR51B2 showed a significant association with HU response (P=0.038) and from laboratory characteristics, only nucleated red blood cells showed significant associations (116%±183%) in good responders versus (264%±286%) in poor responders (P=0.045). In multiple logistic regression, neither laboratory variables nor different SNPs, showed significant association with HU response. Three novel nucleotide variations (-665 [A→C], -1301 [T→G],-1199 delA) in OR51B2 gene were found in good responders. None of the evaluated SNPs in our study showed significant association with HU response. Further larger studies and evaluation of other genes are suggested.

  9. Genetic tests for estimating dairy breed proportion and parentage assignment in East African crossbred cattle.

    PubMed

    Strucken, Eva M; Al-Mamun, Hawlader A; Esquivelzeta-Rabell, Cecilia; Gondro, Cedric; Mwai, Okeyo A; Gibson, John P

    2017-09-12

    Smallholder dairy farming in much of the developing world is based on the use of crossbred cows that combine local adaptation traits of indigenous breeds with high milk yield potential of exotic dairy breeds. Pedigree recording is rare in such systems which means that it is impossible to make informed breeding decisions. High-density single nucleotide polymorphism (SNP) assays allow accurate estimation of breed composition and parentage assignment but are too expensive for routine application. Our aim was to determine the level of accuracy achieved with low-density SNP assays. We constructed subsets of 100 to 1500 SNPs from the 735k-SNP Illumina panel by selecting: (a) on high minor allele frequencies (MAF) in a crossbred population; (b) on large differences in allele frequency between ancestral breeds; (c) at random; or (d) with a differential evolution algorithm. These panels were tested on a dataset of 1933 crossbred dairy cattle from Kenya/Uganda and on crossbred populations from Ethiopia (N = 545) and Tanzania (N = 462). Dairy breed proportions were estimated by using the ADMIXTURE program, a regression approach, and SNP-best linear unbiased prediction, and tested against estimates obtained by ADMIXTURE based on the 735k-SNP panel. Performance for parentage assignment was based on opposing homozygotes which were used to calculate the separation value (sv) between true and false assignments. Panels of SNPs based on the largest differences in allele frequency between European dairy breeds and a combined Nelore/N'Dama population gave the best predictions of dairy breed proportion (r² = 0.962 to 0.994 for 100 to 1500 SNPs) with an average absolute bias of 0.026. Panels of SNPs based on the highest MAF in the crossbred population (Kenya/Uganda) gave the most accurate parentage assignments (sv = -1 to 15 for 100 to 1500 SNPs). Due to the different required properties of SNPs, panels that did well for breed composition did poorly for parentage assignment and vice versa. A combined panel of 400 SNPs was not able to assign parentages correctly, thus we recommend the use of 200 SNPs either for breed proportion prediction or parentage assignment, independently.
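
    The abstract above bases parentage assignment on opposing homozygotes. The following is a minimal sketch of that count only, assuming genotypes are coded as 0, 1 or 2 copies of one allele; the function name and the coding are illustrative assumptions, and the study's separation value (sv) is not reproduced here because the abstract does not define it.

```python
from typing import Sequence


def count_opposing_homozygotes(parent: Sequence[int], offspring: Sequence[int]) -> int:
    """Count loci where the two animals are homozygous for different alleles.

    Genotypes are assumed to be coded as 0, 1 or 2 copies of one allele
    (an illustrative convention, not taken from the study). A true
    parent-offspring pair should show no such loci apart from genotyping errors.
    """
    if len(parent) != len(offspring):
        raise ValueError("genotype vectors must have the same length")
    # A locus is "opposing" when one animal is 0 and the other is 2.
    return sum(1 for p, o in zip(parent, offspring) if {p, o} == {0, 2})


candidate_parent = [0, 2, 1, 0, 2]
true_offspring = [0, 1, 1, 1, 2]
unrelated_animal = [2, 0, 1, 0, 2]

print(count_opposing_homozygotes(candidate_parent, true_offspring))
print(count_opposing_homozygotes(candidate_parent, unrelated_animal))
```

    A low count is compatible with a parent-offspring relationship, while a high count argues against it; where to place the cut-off depends on panel size and genotyping error rate.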

  10. Genomic selection for fruit quality traits in apple (Malus×domestica Borkh.).

    PubMed

    Kumar, Satish; Chagné, David; Bink, Marco C A M; Volz, Richard K; Whitworth, Claire; Carlisle, Charmaine

    2012-01-01

    The genome sequence of apple (Malus×domestica Borkh.) was published more than a year ago, which helped develop an 8K SNP chip to assist in implementing genomic selection (GS). In apple breeding programmes, GS can be used to obtain genomic breeding values (GEBV) for choosing next-generation parents or selections for further testing as potential commercial cultivars at a very early stage. Thus GS has the potential to accelerate breeding efficiency significantly because of decreased generation interval or increased selection intensity. We evaluated the accuracy of GS in a population of 1120 seedlings generated from a factorial mating design of four female and two male parents. All seedlings were genotyped using an Illumina Infinium chip comprising 8,000 single nucleotide polymorphisms (SNPs), and were phenotyped for various fruit quality traits. Random-regression best linear unbiased prediction (RR-BLUP) and the Bayesian LASSO method were used to obtain GEBV, and compared using a cross-validation approach for their accuracy to predict unobserved BLUP-BV. Accuracies were very similar for both methods, varying from 0.70 to 0.90 for various fruit quality traits. The selection response per unit time using GS compared with the traditional BLUP-based selection was very high (>100%) especially for low-heritability traits. Genome-wide average estimated linkage disequilibrium (LD) between adjacent SNPs was 0.32, with a relatively slow decay of LD in the long range (r² = 0.33 and 0.19 at 100 kb and 1,000 kb respectively), contributing to the higher accuracy of GS. Distribution of estimated SNP effects revealed involvement of large effect genes with likely pleiotropic effects. These results demonstrated that genomic selection is a credible alternative to conventional selection for fruit quality traits.

  11. Predicting adaptive phenotypes from multilocus genotypes in Sitka spruce (Picea sitchensis) using random forest.

    PubMed

    Holliday, Jason A; Wang, Tongli; Aitken, Sally

    2012-09-01

    Climate is the primary driver of the distribution of tree species worldwide, and the potential for adaptive evolution will be an important factor determining the response of forests to anthropogenic climate change. Although association mapping has the potential to improve our understanding of the genomic underpinnings of climatically relevant traits, the utility of adaptive polymorphisms uncovered by such studies would be greatly enhanced by the development of integrated models that account for the phenotypic effects of multiple single-nucleotide polymorphisms (SNPs) and their interactions simultaneously. We previously reported the results of association mapping in the widespread conifer Sitka spruce (Picea sitchensis). In the current study we used the recursive partitioning algorithm 'Random Forest' to identify optimized combinations of SNPs to predict adaptive phenotypes. After adjusting for population structure, we were able to explain 37% and 30% of the phenotypic variation, respectively, in two locally adaptive traits--autumn budset timing and cold hardiness. For each trait, the leading five SNPs captured much of the phenotypic variation. To determine the role of epistasis in shaping these phenotypes, we also used a novel approach to quantify the strength and direction of pairwise interactions between SNPs and found such interactions to be common. Our results demonstrate the power of Random Forest to identify subsets of markers that are most important to climatic adaptation, and suggest that interactions among these loci may be widespread.

  12. An abbreviated SNP panel for ancestry assignment of honeybees (Apis mellifera)

    USDA-ARS's Scientific Manuscript database

    This paper examines whether an abbreviated panel of 37 single nucleotide polymorphisms (SNPs) has the same power as a larger and more expensive panel of 95 SNPs to assign ancestry of honeybees (Apis mellifera) to three ancestral lineages. We selected 37 SNPs from the original 95 SNP panel using alle...

  13. Inferring Alcoholism SNPs and Regulatory Chemical Compounds Based on Ensemble Bayesian Network.

    PubMed

    Chen, Huan; Sun, Jiatong; Jiang, Hong; Wang, Xianyue; Wu, Lingxiang; Wu, Wei; Wang, Qh

    2017-01-01

    The disturbance of consciousness is one of the most common symptoms of those who have alcoholism and may cause disability and mortality. Previous studies indicated that several single nucleotide polymorphisms (SNPs) increase the susceptibility to alcoholism. In this study, we utilized the Ensemble Bayesian Network (EBN) method to identify causal SNPs of alcoholism based on the verified GAW14 data. We built a Bayesian network combining random process and greedy search by using the Genetic Analysis Workshop 14 (GAW14) dataset to establish an EBN of SNPs. Then we predicted the association between SNPs and alcoholism by determining Bayes' prior probability. Thirteen out of eighteen SNPs directly connected with alcoholism were found to be concordant with potential risk regions of alcoholism in the OMIM database. As many SNPs were found to contribute to alterations in gene expression, known as expression quantitative trait loci (eQTLs), we further sought to identify chemical compounds acting as regulators of alcoholism genes captured by causal SNPs. Chloroprene and valproic acid were identified as the expression regulators for genes C11orf66 and SALL3 which were captured by alcoholism SNPs, respectively.

  14. Incorporating Single-nucleotide Polymorphisms Into the Lyman Model to Improve Prediction of Radiation Pneumonitis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tucker, Susan L.; Li Minghuan; Xu Ting

    2013-01-01

    Purpose: To determine whether single-nucleotide polymorphisms (SNPs) in genes associated with DNA repair, cell cycle, transforming growth factor-β, tumor necrosis factor and receptor, folic acid metabolism, and angiogenesis can significantly improve the fit of the Lyman-Kutcher-Burman (LKB) normal-tissue complication probability (NTCP) model of radiation pneumonitis (RP) risk among patients with non-small cell lung cancer (NSCLC). Methods and Materials: Sixteen SNPs from 10 different genes (XRCC1, XRCC3, APEX1, MDM2, TGFβ, TNFα, TNFR, MTHFR, MTRR, and VEGF) were genotyped in 141 NSCLC patients treated with definitive radiation therapy, with or without chemotherapy. The LKB model was used to estimate the risk of severe (grade ≥3) RP as a function of mean lung dose (MLD), with SNPs and patient smoking status incorporated into the model as dose-modifying factors. Multivariate analyses were performed by adding significant factors to the MLD model in a forward stepwise procedure, with significance assessed using the likelihood-ratio test. Bootstrap analyses were used to assess the reproducibility of results under variations in the data. Results: Five SNPs were selected for inclusion in the multivariate NTCP model based on MLD alone. SNPs associated with an increased risk of severe RP were in genes for TGFβ, VEGF, TNFα, XRCC1 and APEX1. With smoking status included in the multivariate model, the SNPs significantly associated with increased risk of RP were in genes for TGFβ, VEGF, and XRCC3. Bootstrap analyses selected a median of 4 SNPs per model fit, with the 6 genes listed above selected most often. Conclusions: This study provides evidence that SNPs can significantly improve the predictive ability of the Lyman MLD model. With a small number of SNPs, it was possible to distinguish cohorts with >50% risk vs <10% risk of RP when they were exposed to high MLDs.

  15. pLARmEB: integration of least angle regression with empirical Bayes for multilocus genome-wide association studies.

    PubMed

    Zhang, J; Feng, J-Y; Ni, Y-L; Wen, Y-J; Niu, Y; Tamba, C L; Yue, C; Song, Q; Zhang, Y-M

    2017-06-01

    Multilocus genome-wide association studies (GWAS) have become the state-of-the-art procedure to identify quantitative trait nucleotides (QTNs) associated with complex traits. However, implementation of multilocus model in GWAS is still difficult. In this study, we integrated least angle regression with empirical Bayes to perform multilocus GWAS under polygenic background control. We used an algorithm of model transformation that whitened the covariance matrix of the polygenic matrix K and environmental noise. Markers on one chromosome were included simultaneously in a multilocus model and least angle regression was used to select the most potentially associated single-nucleotide polymorphisms (SNPs), whereas the markers on the other chromosomes were used to calculate kinship matrix as polygenic background control. The selected SNPs in multilocus model were further detected for their association with the trait by empirical Bayes and likelihood ratio test. We herein refer to this method as the pLARmEB (polygenic-background-control-based least angle regression plus empirical Bayes). Results from simulation studies showed that pLARmEB was more powerful in QTN detection and more accurate in QTN effect estimation, had less false positive rate and required less computing time than Bayesian hierarchical generalized linear model, efficient mixed model association (EMMA) and least angle regression plus empirical Bayes. pLARmEB, multilocus random-SNP-effect mixed linear model and fast multilocus random-SNP-effect EMMA methods had almost equal power of QTN detection in simulation experiments. However, only pLARmEB identified 48 previously reported genes for 7 flowering time-related traits in Arabidopsis thaliana.

  16. Genome-Wide SNP Discovery, Genotyping and Their Preliminary Applications for Population Genetic Inference in Spotted Sea Bass (Lateolabrax maculatus)

    PubMed Central

    Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian

    2016-01-01

    Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus. PMID:27336696
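
    The FST values quoted above measure how strongly allele frequencies differ between the two populations. As a rough illustration only, the textbook heterozygosity-based definition for one biallelic SNP and two equally weighted populations can be sketched as follows; the function name is hypothetical, and this is not the estimator used in the study, which relied on BAYESCAN and LOSITAN for its outlier tests.

```python
def wright_fst(p1: float, p2: float) -> float:
    """Wright's FST for one biallelic locus in two equally weighted populations.

    p1 and p2 are the frequencies of the same allele in each population.
    FST = (Ht - Hs) / Ht, where Hs is the mean expected heterozygosity within
    populations and Ht is the expected heterozygosity of the pooled population.
    """
    for p in (p1, p2):
        if not 0.0 <= p <= 1.0:
            raise ValueError("allele frequencies must lie in [0, 1]")
    hs = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    p_bar = (p1 + p2) / 2
    ht = 2 * p_bar * (1 - p_bar)
    if ht == 0.0:
        # Locus is fixed for the same allele in both populations.
        return 0.0
    return (ht - hs) / ht


print(wright_fst(0.5, 0.5))  # identical frequencies, no differentiation
print(wright_fst(0.1, 0.9))  # strongly divergent frequencies
```

    Genome-wide summaries combine many loci and usually correct for sample size, so values from this single-locus sketch are not directly comparable with the figures reported in the abstract.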

  17. Genome-Wide SNP Discovery, Genotyping and Their Preliminary Applications for Population Genetic Inference in Spotted Sea Bass (Lateolabrax maculatus).

    PubMed

    Wang, Juan; Xue, Dong-Xiu; Zhang, Bai-Dong; Li, Yu-Long; Liu, Bing-Jian; Liu, Jin-Xian

    2016-01-01

    Next-generation sequencing and the collection of genome-wide single-nucleotide polymorphisms (SNPs) allow identifying fine-scale population genetic structure and genomic regions under selection. The spotted sea bass (Lateolabrax maculatus) is a non-model species of ecological and commercial importance and widely distributed in northwestern Pacific. A total of 22 648 SNPs was discovered across the genome of L. maculatus by paired-end sequencing of restriction-site associated DNA (RAD-PE) for 30 individuals from two populations. The nucleotide diversity (π) for each population was 0.0028±0.0001 in Dandong and 0.0018±0.0001 in Beihai, respectively. Shallow but significant genetic differentiation was detected between the two populations analyzed by using both the whole data set (FST = 0.0550, P < 0.001) and the putatively neutral SNPs (FST = 0.0347, P < 0.001). However, the two populations were highly differentiated based on the putatively adaptive SNPs (FST = 0.6929, P < 0.001). Moreover, a total of 356 SNPs representing 298 unique loci were detected as outliers putatively under divergent selection by FST-based outlier tests as implemented in BAYESCAN and LOSITAN. Functional annotation of the contigs containing putatively adaptive SNPs yielded hits for 22 of 55 (40%) significant BLASTX matches. Candidate genes for local selection constituted a wide array of functions, including binding, catalytic and metabolic activities, etc. The analyses with the SNPs developed in the present study highlighted the importance of genome-wide genetic variation for inference of population structure and local adaptation in L. maculatus.

  18. Identification of novel drought-tolerant-associated SNPs in common bean (Phaseolus vulgaris)

    PubMed Central

    Villordo-Pineda, Emiliano; González-Chavira, Mario M.; Giraldo-Carbajo, Patricia; Acosta-Gallegos, Jorge A.; Caballero-Pérez, Juan

    2015-01-01

    Common bean (Phaseolus vulgaris L.) is a legume in high demand for human nutrition and a very important agricultural product. Production of common bean is constrained by environmental stresses such as drought. Although conventional plant selection has been used to increase production yield and stress tolerance, drought tolerance selection based on phenotype is complicated by associated physiological, anatomical, cellular, biochemical, and molecular changes. These changes are modulated by differential gene expression. A common method to identify genes associated with phenotypes of interest is the characterization of Single Nucleotide Polymorphisms (SNPs) to link them to specific functions. In this work, we selected two drought-tolerant parental lines from Mesoamerica, Pinto Villa, and Pinto Saltillo. The parental lines were used to generate a population of 282 families (F3:5) and characterized by 169 SNPs. We associated the segregation of the molecular markers in our population with phenotypes including flowering time, physiological maturity, reproductive period, plant, seed and total biomass, reuse index, seed yield, weight of 100 seeds, and harvest index in three cultivation cycles. We observed 83 SNPs with significant association (p < 0.0003 after Bonferroni correction) with our quantified phenotypes. Phenotypes most associated were days to flowering and seed biomass with 58 and 44 associated SNPs, respectively. Thirty-seven out of the 83 SNPs were annotated to a gene with a potential function related to drought tolerance or relevant molecular/biochemical functions. Some SNPs such as SNP28 and SNP128 are related to starch biosynthesis, a common osmotic protector; and SNP18 is related to proline biosynthesis, another well-known osmotic protector. PMID:26257755
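
    The Bonferroni threshold quoted above is consistent with dividing a family-wise error rate of 0.05 by the 169 SNPs tested. Both the 0.05 level and the choice of 169 as the number of tests are assumptions here, since the abstract states only the resulting cut-off; the function name is illustrative.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test significance threshold under a Bonferroni correction."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


# Assumed inputs: family-wise alpha of 0.05 and one test per SNP (169 SNPs).
threshold = bonferroni_threshold(0.05, 169)
print(f"{threshold:.6f}")
```

    Rounded to four decimal places, this gives the 0.0003 cut-off reported in the abstract.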

  19. Identification of novel drought-tolerant-associated SNPs in common bean (Phaseolus vulgaris).

    PubMed

    Villordo-Pineda, Emiliano; González-Chavira, Mario M; Giraldo-Carbajo, Patricia; Acosta-Gallegos, Jorge A; Caballero-Pérez, Juan

    2015-01-01

    Common bean (Phaseolus vulgaris L.) is a legume in high demand for human nutrition and a very important agricultural product. Production of common bean is constrained by environmental stresses such as drought. Although conventional plant selection has been used to increase production yield and stress tolerance, drought tolerance selection based on phenotype is complicated by associated physiological, anatomical, cellular, biochemical, and molecular changes. These changes are modulated by differential gene expression. A common method to identify genes associated with phenotypes of interest is the characterization of Single Nucleotide Polymorphisms (SNPs) to link them to specific functions. In this work, we selected two drought-tolerant parental lines from Mesoamerica, Pinto Villa, and Pinto Saltillo. The parental lines were used to generate a population of 282 families (F3:5) and characterized by 169 SNPs. We associated the segregation of the molecular markers in our population with phenotypes including flowering time, physiological maturity, reproductive period, plant, seed and total biomass, reuse index, seed yield, weight of 100 seeds, and harvest index in three cultivation cycles. We observed 83 SNPs with significant association (p < 0.0003 after Bonferroni correction) with our quantified phenotypes. Phenotypes most associated were days to flowering and seed biomass with 58 and 44 associated SNPs, respectively. Thirty-seven out of the 83 SNPs were annotated to a gene with a potential function related to drought tolerance or relevant molecular/biochemical functions. Some SNPs such as SNP28 and SNP128 are related to starch biosynthesis, a common osmotic protector; and SNP18 is related to proline biosynthesis, another well-known osmotic protector.

  20. Oceanographic variation influences spatial genomic structure in the sea scallop, Placopecten magellanicus.

    PubMed

    Van Wyngaarden, Mallory; Snelgrove, Paul V R; DiBacco, Claudio; Hamilton, Lorraine C; Rodríguez-Ezpeleta, Naiara; Zhan, Luyao; Beiko, Robert G; Bradbury, Ian R

    2018-03-01

    Environmental factors can influence diversity and population structure in marine species and accurate understanding of this influence can both improve fisheries management and help predict responses to environmental change. We used 7163 SNPs derived from restriction site-associated DNA sequencing genotyped in 245 individuals of the economically important sea scallop, Placopecten magellanicus, to evaluate the correlations between oceanographic variation and a previously identified latitudinal genomic cline. Sea scallops span a broad latitudinal area (>10 degrees), and we hypothesized that climatic variation significantly drives clinal trends in allele frequency. Using a large environmental dataset, including temperature, salinity, chlorophyll a, and nutrient concentrations, we identified a suite of SNPs (285-621, depending on analysis and environmental dataset) potentially under selection through correlations with environmental variation. Principal components analysis of different outlier SNPs and environmental datasets revealed similar northern and southern clusters, with significant associations between the first axes of each (adjusted R² = .66-.79). Multivariate redundancy analysis of outlier SNPs and the environmental principal components indicated that environmental factors explained more than 32% of the variance. Similarly, multiple linear regressions and random-forest analysis identified winter average and minimum ocean temperatures as significant parameters in the link between genetic and environmental variation. This work indicates that oceanographic variation is associated with the observed genomic cline in this species and that seasonal periods of extreme cold may restrict gene flow along a latitudinal gradient in this marine benthic bivalve. Incorporating this finding into management may improve accuracy of management strategies and future predictions.

  1. Large-scale genotyping identifies 41 new loci associated with breast cancer risk.

    PubMed

    Michailidou, Kyriaki; Hall, Per; Gonzalez-Neira, Anna; Ghoussaini, Maya; Dennis, Joe; Milne, Roger L; Schmidt, Marjanka K; Chang-Claude, Jenny; Bojesen, Stig E; Bolla, Manjeet K; Wang, Qin; Dicks, Ed; Lee, Andrew; Turnbull, Clare; Rahman, Nazneen; Fletcher, Olivia; Peto, Julian; Gibson, Lorna; Dos Santos Silva, Isabel; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Adank, Muriel; van der Luijt, Rob B; Hein, Rebecca; Dahmen, Norbert; Beckman, Lars; Meindl, Alfons; Schmutzler, Rita K; Müller-Myhsok, Bertram; Lichtner, Peter; Hopper, John L; Southey, Melissa C; Makalic, Enes; Schmidt, Daniel F; Uitterlinden, Andre G; Hofman, Albert; Hunter, David J; Chanock, Stephen J; Vincent, Daniel; Bacot, François; Tessier, Daniel C; Canisius, Sander; Wessels, Lodewyk F A; Haiman, Christopher A; Shah, Mitul; Luben, Robert; Brown, Judith; Luccarini, Craig; Schoof, Nils; Humphreys, Keith; Li, Jingmei; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Couch, Fergus J; Wang, Xianshu; Vachon, Celine; Stevens, Kristen N; Lambrechts, Diether; Moisse, Matthieu; Paridaens, Robert; Christiaens, Marie-Rose; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Johnson, Nichola; Aitken, Zoe; Aaltonen, Kirsimari; Heikkinen, Tuomas; Broeks, Annegien; Veer, Laura J Van't; van der Schoot, C Ellen; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Zamora, M Pilar; Perez, Jose Ignacio Arias; Pita, Guillermo; Alonso, M Rosario; Cox, Angela; Brock, Ian W; Cross, Simon S; Reed, Malcolm W R; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J; Hollestelle, Antoinette; van den Ouweland, Ans M W; Jager, Agnes; Bui, Quang M; Stone, Jennifer; Dite, Gillian S; Apicella, Carmel; Tsimiklis, Helen; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; Fasching, Peter A; Haeberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Jones, Michael; Figueroa, Jonine; Lissowska, Jolanta; Brinton, Louise; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Brauch, Hiltrud; Hamann, Ute; Brüning, Thomas; Radice, Paolo; Peterlongo, Paolo; Manoukian, Siranoush; Bonanni, Bernardo; Devilee, Peter; Tollenaar, Rob A E M; Seynaeve, Caroline; van Asperen, Christi J; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Bogdanova, Natalia V; Antonenkova, Natalia N; Dörk, Thilo; Kristensen, Vessela N; Anton-Culver, Hoda; Slager, Susan; Toland, Amanda E; Edge, Stephen; Fostira, Florentia; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Sueta, Aiko; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Teo, Soo Hwang; Yip, Cheng Har; Phuah, Sze Yee; Cornes, Belinda K; Hartman, Mikael; Miao, Hui; Lim, Wei Yen; Sng, Jen-Hwei; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Ding, Shian-Ling; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Blot, William J; Signorello, Lisa B; Cai, Qiuyin; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Simard, Jacques; Garcia-Closas, Montse; Pharoah, Paul D P; Chenevix-Trench, Georgia; Dunning, Alison M; Benitez, Javier; Easton, Douglas F

    2013-04-01

    Breast cancer is the most common cancer among women. Common variants at 27 loci have been identified as associated with susceptibility to breast cancer, and these account for ∼9% of the familial risk of the disease. We report here a meta-analysis of 9 genome-wide association studies, including 10,052 breast cancer cases and 12,575 controls of European ancestry, from which we selected 29,807 SNPs for further genotyping. These SNPs were genotyped in 45,290 cases and 41,880 controls of European ancestry from 41 studies in the Breast Cancer Association Consortium (BCAC). The SNPs were genotyped as part of a collaborative genotyping experiment involving four consortia (Collaborative Oncological Gene-environment Study, COGS) and used a custom Illumina iSelect genotyping array, iCOGS, comprising more than 200,000 SNPs. We identified SNPs at 41 new breast cancer susceptibility loci at genome-wide significance (P < 5 × 10⁻⁸). Further analyses suggest that more than 1,000 additional loci are involved in breast cancer susceptibility.

  2. Large-scale genotyping identifies 41 new loci associated with breast cancer risk

    PubMed Central

    Michailidou, Kyriaki; Hall, Per; Gonzalez-Neira, Anna; Ghoussaini, Maya; Dennis, Joe; Milne, Roger L; Schmidt, Marjanka K; Chang-Claude, Jenny; Bojesen, Stig E; Bolla, Manjeet K; Wang, Qin; Dicks, Ed; Lee, Andrew; Turnbull, Clare; Rahman, Nazneen; Fletcher, Olivia; Peto, Julian; Gibson, Lorna; Silva, Isabel dos Santos; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Czene, Kamila; Irwanto, Astrid; Liu, Jianjun; Waisfisz, Quinten; Meijers-Heijboer, Hanne; Adank, Muriel; van der Luijt, Rob B; Hein, Rebecca; Dahmen, Norbert; Beckman, Lars; Meindl, Alfons; Schmutzler, Rita K; Müller-Myhsok, Bertram; Lichtner, Peter; Hopper, John L; Southey, Melissa C; Makalic, Enes; Schmidt, Daniel F; Uitterlinden, Andre G; Hofman, Albert; Hunter, David J; Chanock, Stephen J; Vincent, Daniel; Bacot, François; Tessier, Daniel C; Canisius, Sander; Wessels, Lodewyk F A; Haiman, Christopher A; Shah, Mitul; Luben, Robert; Brown, Judith; Luccarini, Craig; Schoof, Nils; Humphreys, Keith; Li, Jingmei; Nordestgaard, Børge G; Nielsen, Sune F; Flyger, Henrik; Couch, Fergus J; Wang, Xianshu; Vachon, Celine; Stevens, Kristen N; Lambrechts, Diether; Moisse, Matthieu; Paridaens, Robert; Christiaens, Marie-Rose; Rudolph, Anja; Nickels, Stefan; Flesch-Janys, Dieter; Johnson, Nichola; Aitken, Zoe; Aaltonen, Kirsimari; Heikkinen, Tuomas; Broeks, Annegien; Van’t Veer, Laura J; van der Schoot, C Ellen; Guénel, Pascal; Truong, Thérèse; Laurent-Puig, Pierre; Menegaux, Florence; Marme, Frederik; Schneeweiss, Andreas; Sohn, Christof; Burwinkel, Barbara; Zamora, M Pilar; Perez, Jose Ignacio Arias; Pita, Guillermo; Alonso, M Rosario; Cox, Angela; Brock, Ian W; Cross, Simon S; Reed, Malcolm W R; Sawyer, Elinor J; Tomlinson, Ian; Kerin, Michael J; Miller, Nicola; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J; Hollestelle, Antoinette; van den Ouweland, Ans M W; Jager, Agnes; Bui, Quang M; Stone, Jennifer; Dite, Gillian S; Apicella, Carmel; Tsimiklis, Helen; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; Fasching, Peter A; Haeberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Brenner, Hermann; Müller, Heiko; Arndt, Volker; Stegmaier, Christa; Swerdlow, Anthony; Ashworth, Alan; Orr, Nick; Jones, Michael; Figueroa, Jonine; Lissowska, Jolanta; Brinton, Louise; Goldberg, Mark S; Labrèche, France; Dumont, Martine; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Brauch, Hiltrud; Hamann, Ute; Brüning, Thomas; Radice, Paolo; Peterlongo, Paolo; Manoukian, Siranoush; Bonanni, Bernardo; Devilee, Peter; Tollenaar, Rob A E M; Seynaeve, Caroline; van Asperen, Christi J; Jakubowska, Anna; Lubinski, Jan; Jaworska, Katarzyna; Durda, Katarzyna; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Bogdanova, Natalia V; Antonenkova, Natalia N; Dörk, Thilo; Kristensen, Vessela N; Anton-Culver, Hoda; Slager, Susan; Toland, Amanda E; Edge, Stephen; Fostira, Florentia; Kang, Daehee; Yoo, Keun-Young; Noh, Dong-Young; Matsuo, Keitaro; Ito, Hidemi; Iwata, Hiroji; Sueta, Aiko; Wu, Anna H; Tseng, Chiu-Chen; Van Den Berg, David; Stram, Daniel O; Shu, Xiao-Ou; Lu, Wei; Gao, Yu-Tang; Cai, Hui; Teo, Soo Hwang; Yip, Cheng Har; Phuah, Sze Yee; Cornes, Belinda K; Hartman, Mikael; Miao, Hui; Lim, Wei Yen; Sng, Jen-Hwei; Muir, Kenneth; Lophatananon, Artitaya; Stewart-Brown, Sarah; Siriwanarangsan, Pornthep; Shen, Chen-Yang; Hsiung, Chia-Ni; Wu, Pei-Ei; Ding, Shian-Ling; Sangrajrang, Suleeporn; Gaborieau, Valerie; Brennan, Paul; McKay, James; Blot, William J; Signorello, Lisa B; Cai, Qiuyin; Zheng, Wei; Deming-Halverson, Sandra; Shrubsole, Martha; Long, Jirong; Simard, Jacques; Garcia-Closas, Montse; Pharoah, Paul D P; Chenevix-Trench, Georgia; Dunning, Alison M; Benitez, Javier; Easton, Douglas F

    2013-01-01

    Breast cancer is the most common cancer among women. Common variants at 27 loci have been identified as associated with susceptibility to breast cancer, and these account for ~9% of the familial risk of the disease. We report here a meta-analysis of 9 genome-wide association studies, including 10,052 breast cancer cases and 12,575 controls of European ancestry, from which we selected 29,807 SNPs for further genotyping. These SNPs were genotyped in 45,290 cases and 41,880 controls of European ancestry from 41 studies in the Breast Cancer Association Consortium (BCAC). The SNPs were genotyped as part of a collaborative genotyping experiment involving four consortia (Collaborative Oncological Gene-environment Study, COGS) and used a custom Illumina iSelect genotyping array, iCOGS, comprising more than 200,000 SNPs. We identified SNPs at 41 new breast cancer susceptibility loci at genome-wide significance (P < 5 × 10⁻⁸). Further analyses suggest that more than 1,000 additional loci are involved in breast cancer susceptibility. PMID:23535729

  3. Single nucleotide polymorphisms from Theobroma cacao expressed sequence tags associated with witches' broom disease in cacao.

    PubMed

    Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F

    2009-07-14

    In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.

  4. HapMap tagSNP transferability in multiple populations: general guidelines

    PubMed Central

    Xing, Jinchuan; Witherspoon, David J.; Watkins, W. Scott; Zhang, Yuhua; Tolpinrud, Whitney; Jorde, Lynn B.

    2008-01-01

    Linkage disequilibrium (LD) has received much recent attention because of its value in localizing disease-causing genes. Due to the extensive LD between neighboring loci in the human genome, it is believed that a subset of the single nucleotide polymorphisms in a region (tagSNPs) can be selected to capture most of the remaining SNP variants. In this study, we examined LD patterns and HapMap tagSNP transferability in more than 300 individuals. A South Indian and an African Mbuti Pygmy population sample were included to evaluate the performance of HapMap tagSNPs in geographically distinct and genetically isolated populations. Our results show that HapMap tagSNPs selected with r2 >= 0.8 can capture more than 85% of the SNPs in populations that are from the same continental group. Combined tagSNPs from HapMap CEU and CHB+JPT serve as the best reference for the Indian sample. The HapMap YRI are a sufficient reference for tagSNP selection in the Pygmy sample. In addition to our findings, we reviewed over 25 recent studies of tagSNP transferability and propose a general guideline for selecting tagSNPs from HapMap populations. PMID:18482828
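
    The r2 threshold mentioned in this abstract refers to the squared correlation between alleles at two SNPs. As a rough illustration only (not the authors' pipeline), the sketch below computes r2 from phased haplotypes with alleles coded 0/1. The function names `ld_r2` and `is_tagged` and the toy data are assumptions made for this example.

```python
def ld_r2(haplotypes):
    """Squared allelic correlation (r^2) between two biallelic SNPs.

    haplotypes: list of (a, b) pairs, one per phased chromosome, where a and b
    are the alleles (0 or 1) carried at the first and second SNP.
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("need at least one haplotype")
    p_a = sum(a for a, _ in haplotypes) / n          # frequency of allele 1 at SNP A
    p_b = sum(b for _, b in haplotypes) / n          # frequency of allele 1 at SNP B
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                             # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return 0.0                                   # a monomorphic SNP carries no LD information
    return d * d / denom


def is_tagged(haplotypes, threshold=0.8):
    """True if the two SNPs are in LD at or above the threshold (0.8 in the abstract)."""
    return ld_r2(haplotypes) >= threshold


# Hypothetical toy data: perfectly correlated versus independent SNP pairs.
linked = [(0, 0), (1, 1), (0, 0), (1, 1)]
unlinked = [(0, 0), (0, 1), (1, 0), (1, 1)]
```

    With these toy inputs, the linked pair yields r2 = 1 and would count as tagged, while the unlinked pair yields r2 = 0.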

  5. Whole-genome scanning for the litter size trait associated genes and SNPs under selection in dairy goat (Capra hircus)

    PubMed Central

    Lai, Fang-Nong; Zhai, Hong-Li; Cheng, Ming; Ma, Jun-Yu; Cheng, Shun-Feng; Ge, Wei; Zhang, Guo-Liang; Wang, Jun-Jie; Zhang, Rui-Qian; Wang, Xue; Min, Ling-Jiang; Song, Jiu-Zhou; Shen, Wei

    2016-01-01

    Dairy goats are one of the most utilized domesticated animals in China. Here, we selected extreme populations based on differential fecundity in two Laoshan dairy goat populations. Utilizing deep sequencing, we generated 68.7 and 57.8 gigabases of sequencing data and identified 12,458,711 and 12,423,128 SNPs in the low fecundity and high fecundity groups, respectively. Following selective sweep analyses, a number of loci and candidate genes in the two populations were scanned independently. The reproduction-related genes CCNB2, AR, ADCY1, DNMT3B, SMAD2, AMHR2, ERBB2, FGFR1, MAP3K12 and THEM4 were specifically selected in the high fecundity group, whereas KDM6A, TENM1, SWI5 and CYM were specifically selected in the low fecundity group. A subset of genes including SYCP2, SOX5 and POU3F4 was localized in both the high and low fecundity selection windows, suggesting that these particular genes experienced strong selection with lower genetic diversity. From the genome data, the rare nonsense mutations may not contribute to fecundity, whereas nonsynonymous SNPs likely play a predominant role. The nonsynonymous exonic SNPs in SETDB2 and CDH26, which were co-localized in the selected region, may take part in fecundity traits. These observations bring new insights into the genetic variation influencing fecundity traits within dairy goats. PMID:27905513

  6. LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources.

    PubMed

    Karchin, Rachel; Diekhans, Mark; Kelly, Libusha; Thomas, Daryl J; Pieper, Ursula; Eswar, Narayanan; Haussler, David; Sali, Andrej

    2005-06-15

    The NCBI dbSNP database lists over 9 million single nucleotide polymorphisms (SNPs) in the human genome, but currently contains limited annotation information. SNPs that result in amino acid residue changes (nsSNPs) are of critical importance in variation between individuals, including disease and drug sensitivity. We have developed LS-SNP, a genomic scale software pipeline to annotate nsSNPs. LS-SNP comprehensively maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models, and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding or severely impact human health. It currently annotates 28,043 validated SNPs that produce amino acid residue substitutions in human proteins from the SwissProt/TrEMBL database. Annotations can be viewed via a web interface either in the context of a genomic region or by selecting sets of SNPs, genes, proteins or pathways. These results are useful for identifying candidate functional SNPs within a gene, haplotype or pathway and in probing molecular mechanisms responsible for functional impacts of nsSNPs. Availability: http://www.salilab.org/LS-SNP. Contact: rachelk@salilab.org. Supplementary information: http://salilab.org/LS-SNP/supp-info.pdf

  7. SNP-associations and phenotype predictions from hundreds of microbial genomes without genome alignments.

    PubMed

    Hall, Barry G

    2014-01-01

    SNP-association studies are a starting point for identifying genes that may be responsible for specific phenotypes, such as disease traits. The vast bulk of tools for SNP-association studies are directed toward SNPs in the human genome, and I am unaware of any tools designed specifically for such studies in bacterial or viral genomes. The PPFS (Predict Phenotypes From SNPs) package described here is an add-on to kSNP, a program that can identify SNPs in a data set of hundreds of microbial genomes. PPFS identifies those SNPs that are non-randomly associated with a phenotype based on the χ² probability, then uses those diagnostic SNPs for two distinct, but related, purposes: (1) to predict the phenotypes of strains whose phenotypes are unknown, and (2) to identify those diagnostic SNPs that are most likely to be causally related to the phenotype. In the example illustrated here, from a set of 68 E. coli genomes, for 67 of which the pathogenicity phenotype was known, there were 418,500 SNPs. Using the phenotypes of 36 of those strains, PPFS identified 207 diagnostic SNPs. The diagnostic SNPs predicted the phenotypes of all of the genomes with 97% accuracy. It then identified 97 SNPs whose probability of being causally related to the pathogenic phenotype was >0.999. In a second example, from a set of 116 E. coli genome sequences, using the phenotypes of 65 strains PPFS identified 101 SNPs that predicted the source host (human or non-human) with 90% accuracy.
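
    The idea of flagging SNPs by their χ² probability can be illustrated with a generic 2×2 test of allele presence against phenotype. This is a minimal sketch of a standard Pearson χ² test with one degree of freedom, not PPFS's actual code, which is not described here. The function name `chi2_2x2` and the strain counts are hypothetical.

```python
import math


def chi2_2x2(a, b, c, d):
    """Pearson chi-square test (1 degree of freedom, no continuity correction).

    Table layout (counts of strains):
                      phenotype +   phenotype -
        allele present     a             b
        allele absent      c             d
    Returns (chi2_statistic, p_value).
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        return 0.0, 1.0  # degenerate table: no evidence of association
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts: an allele found in all 10 pathogenic strains and
# in none of 10 non-pathogenic strains, versus an uninformative allele.
strong_stat, strong_p = chi2_2x2(10, 0, 0, 10)
null_stat, null_p = chi2_2x2(5, 5, 5, 5)
```

    With small strain counts the χ² approximation is rough, so an exact test may be preferable in practice.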

  8. Machine learning shows association between genetic variability in PPARG and cerebral connectivity in preterm infants

    PubMed Central

    Krishnan, Michelle L.; Wang, Zi; Aljabar, Paul; Ball, Gareth; Mirza, Ghazala; Saxena, Alka; Counsell, Serena J.; Hajnal, Joseph V.; Montana, Giovanni

    2017-01-01

    Preterm infants show abnormal structural and functional brain development, and have a high risk of long-term neurocognitive problems. The molecular and cellular mechanisms involved are poorly understood, but novel methods now make it possible to address them by examining the relationship between common genetic variability and brain endophenotype. We addressed the hypothesis that variability in the Peroxisome Proliferator Activated Receptor (PPAR) pathway would be related to brain development. We employed machine learning in an unsupervised, unbiased, combined analysis of whole-brain diffusion tractography together with genomewide, single-nucleotide polymorphism (SNP)-based genotypes from a cohort of 272 preterm infants, using Sparse Reduced Rank Regression (sRRR) and correcting for ethnicity and age at birth and imaging. Empirical selection frequencies for SNPs associated with cerebral connectivity ranged from 0.663 to zero, with multiple highly selected SNPs mapping to genes for PPARG (six SNPs), ITGA6 (four SNPs), and FXR1 (two SNPs). SNPs in PPARG were significantly overrepresented (ranked 7–11 and 67 of 556,000 SNPs; P < 2.2 × 10⁻⁷), and were mostly in introns or regulatory regions with predicted effects including protein coding and nonsense-mediated decay. Edge-centric graph-theoretic analysis showed that highly selected white-matter tracts were consistent across the group and important for information transfer (P < 2.2 × 10⁻¹⁷); they most often connected to the insula (P < 6 × 10⁻¹⁷). These results suggest that the inhibited brain development seen in humans exposed to the stress of a premature extrauterine environment is modulated by genetic factors, and that PPARG signaling has a previously unrecognized role in cerebral development. PMID:29229843

  9. Translating natural genetic variation to gene expression in a computational model of the Drosophila gap gene regulatory network

    PubMed Central

    Kozlov, Konstantin N.; Kulakovskiy, Ivan V.; Zubair, Asif; Marjoram, Paul; Lawrie, David S.; Nuzhdin, Sergey V.; Samsonova, Maria G.

    2017-01-01

    Annotating the genotype-phenotype relationship, and developing a proper quantitative description of the relationship, requires understanding the impact of natural genomic variation on gene expression. We apply a sequence-level model of gap gene expression in the early development of Drosophila to analyze single nucleotide polymorphisms (SNPs) in a panel of natural sequenced D. melanogaster lines. Using a thermodynamic modeling framework, we provide both analytical and computational descriptions of how single-nucleotide variants affect gene expression. The analysis reveals that the sequence variants increase (decrease) gene expression if located within binding sites of repressors (activators). We show that the sign of SNP influence (activation or repression) may change in time and space and elucidate the origin of this change in specific examples. The thermodynamic modeling approach predicts non-local and non-linear effects arising from SNPs, and combinations of SNPs, in individual fly genotypes. Simulation of individual fly genotypes using our model reveals that this non-linearity reduces to almost additive inputs from multiple SNPs. Further, we see signatures of the action of purifying selection in the gap gene regulatory regions. To infer the specific targets of purifying selection, we analyze the patterns of polymorphism in the data at two phenotypic levels: the strengths of binding and expression. We find that combinations of SNPs show evidence of being under selective pressure, while individual SNPs do not. The model predicts that SNPs appear to accumulate in the genotypes of the natural population in a way biased towards small increases in activating action on the expression pattern. Taken together, these results provide a systems-level view of how genetic variation translates to the level of gene regulatory networks via combinatorial SNP effects. PMID:28898266

  10. Genome-Wide Association Study for Identification and Validation of Novel SNP Markers for Sr6 Stem Rust Resistance Gene in Bread Wheat.

    PubMed

    Mourad, Amira M I; Sallam, Ahmed; Belamkar, Vikas; Wegulo, Stephen; Bowden, Robert; Jin, Yue; Mahdy, Ezzat; Bakheit, Bahy; El-Wafaa, Atif A; Poland, Jesse; Baenziger, Peter S

    2018-01-01

    Stem rust (caused by Puccinia graminis f. sp. tritici Erikss. & E. Henn.) is a major disease in wheat (Triticum aestivum L.). However, in recent years it has occurred rarely in Nebraska due to weather and the effective selection and gene pyramiding of resistance genes. To understand the genetic basis of stem rust resistance in Nebraska winter wheat, we applied a genome-wide association study (GWAS) to a set of 270 winter wheat genotypes (A-set). Genotyping was carried out using genotyping-by-sequencing, and ∼35,000 high-quality SNPs were identified. The tested genotypes were evaluated for their resistance to the common stem rust race in Nebraska (QFCSC) in two replications. Marker-trait association identified 32 SNP markers on chromosome 2D that were significantly (Bonferroni corrected P < 0.05) associated with the resistance. The chromosomal location of the significant SNPs (chromosome 2D) matched the location of the Sr6 gene, which was expected in these genotypes based on pedigree information. A highly significant linkage disequilibrium (LD, r²) was found between the significant SNPs and the specific SSR marker for the Sr6 gene (Xcfd43). This suggests that the significant SNP markers are tagging the Sr6 gene. Out of the 32 significant SNPs, eight SNPs were in six genes that are annotated as being linked to disease resistance in the IWGSC RefSeq v1.0. The 32 significant SNP markers were located in nine haplotype blocks. All 32 significant SNPs were validated in a set of 60 different genotypes (V-set) using single marker analysis. SNP markers identified in this study can be used in marker-assisted selection and genomic selection, and to develop KASP (Kompetitive Allele Specific PCR) markers for the Sr6 gene. Novel SNPs for the Sr6 gene, an important stem rust resistance gene, were identified and validated in this study. These SNPs can be used to improve stem rust resistance in wheat.
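
    A Bonferroni correction of the kind cited in this abstract divides the significance level by the number of tests. The sketch below is illustrative only. It assumes roughly 35,000 tests (the approximate SNP count quoted above), since the exact number of tests used in the study is not given here, and the function names are hypothetical.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test p-value cutoff that keeps the family-wise error rate at alpha."""
    if n_tests <= 0:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


def bonferroni_significant(p_values, alpha=0.05):
    """Flag each raw p-value as significant or not after Bonferroni correction."""
    cutoff = bonferroni_threshold(alpha, len(p_values))
    return [p < cutoff for p in p_values]


# Assumed test count of 35,000 (approximate figure from the abstract).
cutoff_35k = bonferroni_threshold(0.05, 35000)

# Hypothetical raw p-values for three markers.
flags = bonferroni_significant([1e-8, 1e-3, 0.04], alpha=0.05)
```

    Under that assumption, a marker would need a raw p-value below about 1.4 × 10⁻⁶ to pass the corrected 0.05 level.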

  11. Functional Genomics Analysis of Big Data Identifies Novel Peroxisome Proliferator-Activated Receptor γ Target Single Nucleotide Polymorphisms Showing Association With Cardiometabolic Outcomes.

    PubMed

    Richardson, Kris; Schnitzler, Gavin R; Lai, Chao-Qiang; Ordovas, Jose M

    2015-12-01

    Cardiovascular disease and type 2 diabetes mellitus represent overlapping diseases where a large portion of the variation attributable to genetics remains unexplained. An important player in their pathogenesis is peroxisome proliferator-activated receptor γ (PPARγ) that is involved in lipid and glucose metabolism and maintenance of metabolic homeostasis. We used a functional genomics methodology to interrogate human chromatin immunoprecipitation-sequencing, genome-wide association studies, and expression quantitative trait locus data to inform selection of candidate functional single nucleotide polymorphisms (SNPs) falling in PPARγ motifs. We derived 27 328 chromatin immunoprecipitation-sequencing peaks for PPARγ in human adipocytes through meta-analysis of 3 data sets. The PPARγ consensus motif showed greatest enrichment and mapped to 8637 peaks. We identified 146 SNPs in these motifs. This number was significantly less than would be expected by chance, and Inference of Natural Selection from Interspersed Genomically coHerent elemenTs analysis indicated that these motifs are under weak negative selection. A screen of these SNPs against genome-wide association studies for cardiometabolic traits revealed significant enrichment with 16 SNPs. A screen against the MuTHER expression quantitative trait locus data revealed 8 of these were significantly associated with altered gene expression in human adipose, more than would be expected by chance. Several SNPs fall close to, or are linked by expression quantitative trait locus to, lipid-metabolism loci including CYP26A1. We demonstrated the use of functional genomics to identify SNPs of potential function. Specifically, SNPs within PPARγ motifs that bind PPARγ in adipocytes are significantly associated with cardiometabolic disease and with the regulation of transcription in adipose. This method may be used to uncover functional SNPs that do not reach significance thresholds in the agnostic approach of genome-wide association studies. © 2015 American Heart Association, Inc.

  12. Transcriptome-wide single nucleotide polymorphisms (SNPs) for abalone (Haliotis midae): validation and application using GoldenGate medium-throughput genotyping assays.

    PubMed

    Bester-Van Der Merwe, Aletta; Blaauw, Sonja; Du Plessis, Jana; Roodt-Wilding, Rouvay

    2013-09-23

    Haliotis midae is one of the most valuable commercial abalone species in the world, but is highly vulnerable, due to exploitation, habitat destruction and predation. In order to preserve wild and cultured stocks, genetic management and improvement of the species has become crucial. Fundamental to this is the availability and employment of molecular markers, such as microsatellites and single nucleotide polymorphisms (SNPs). Transcriptome sequences generated through sequencing-by-synthesis technology were utilized for the in vitro and in silico identification of 505 putative SNPs from a total of 316 selected contigs. A subset of 234 SNPs were further validated and characterized in wild and cultured abalone using two Illumina GoldenGate genotyping assays. Combined with VeraCode technology, this genotyping platform yielded a 65%-69% conversion rate (percentage polymorphic markers) with a global genotyping success rate of 76%-85% and provided a viable means for validating SNP markers in a non-model species. The utility of 31 of the validated SNPs in population structure analysis was confirmed, while a large number of SNPs (174) were shown to be informative and are, thus, good candidates for linkage map construction. The non-synonymous SNPs (50) located in coding regions of genes that showed similarities with known proteins will also be useful for genetic applications, such as the marker-assisted selection of genes of relevance to abalone aquaculture.

  13. Spectroscopic and microscopic characterization of silver nanoparticles synthesized using Justicia adhatoda flower

    NASA Astrophysics Data System (ADS)

    Singh, Tej; Shekhawat, Dharmender Singh; Jyoti, Kumari

    2018-05-01

    The synthesis of silver nanoparticles (SNPs) by chemical and physical methods produces harmful products that may cause various environmental problems, so there is an increasing demand for ecofriendly methods. The present study therefore demonstrates the biosynthesis of SNPs using Justicia adhatoda flower extract. The biosynthesized SNPs were characterized by UV-visible spectroscopy, Fourier transform-infrared spectroscopy (FTIR), transmission electron microscopy (TEM), selected area electron diffraction (SAED) and atomic force microscopy (AFM) analysis. The UV-visible spectrum peaked at 417 nm, corresponding to the plasmon absorbance of SNPs. The TEM and SAED results reveal the crystalline nature of the SNPs. FTIR spectroscopy was used to identify the possible biomolecules responsible for the conversion of silver ions to SNPs. The study concluded that Justicia adhatoda flower extract acts as an excellent reducing agent and that the green-synthesized SNPs are safer for the environment.

  14. A fast boosting-based screening method for large-scale association study in complex traits with genetic heterogeneity.

    PubMed

    Wang, Lu-Yong; Fasulo, D

    2006-01-01

    Genome-wide association studies of complex diseases generate massive amounts of single nucleotide polymorphism (SNP) data. Univariate statistical tests (e.g., the Fisher exact test) have been used to single out non-associated SNPs. However, disease-susceptibility SNPs may have little marginal effect in the population and are unlikely to be retained after univariate tests. Also, model-based methods are impractical for large-scale datasets. Moreover, genetic heterogeneity makes it harder for traditional methods to identify the genetic causes of diseases. A more recent random forest method provides a more robust way to screen SNPs at the scale of thousands. However, for larger-scale data, e.g., Affymetrix Human Mapping 100K GeneChip data, a faster method is required to screen SNPs in whole-genome, large-scale association analysis with genetic heterogeneity. We propose a boosting-based method for rapid screening in large-scale analysis of complex traits in the presence of genetic heterogeneity. It provides a relatively fast and fairly good tool for screening and limiting the candidate SNPs for further, more complex computational modeling tasks.
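
    The univariate screening step that this abstract contrasts with boosting can be pictured as one Fisher exact test per SNP. The following is a minimal pure-Python sketch of a two-sided Fisher exact test on a 2×2 table. It is a generic textbook version, not the authors' method, and the function name and counts are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test p-value for the 2x2 table [[a, b], [c, d]].

    Rows could be cases/controls and columns carrier/non-carrier of an allele.
    The p-value sums the hypergeometric probabilities of all tables (with the
    same margins) that are no more likely than the observed one.
    """
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2
    total = comb(n, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_observed = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    p_value = sum(prob(x) for x in range(lo, hi + 1)
                  if prob(x) <= p_observed * (1 + 1e-9))
    return min(1.0, p_value)


# Hypothetical counts: 3 carrier cases and 3 non-carrier controls,
# versus a perfectly balanced table.
p_extreme = fisher_exact_two_sided(3, 0, 0, 3)
p_balanced = fisher_exact_two_sided(2, 2, 2, 2)
```

    As the abstract notes, a SNP with a weak marginal effect would receive a large p-value from such a test and be screened out, which is the limitation the boosting approach aims to address.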

  15. Lack of Association for Reported Endocrine Pancreatic Cancer Risk Loci in the PANDoRA Consortium.

    PubMed

    Campa, Daniele; Obazee, Ofure; Pastore, Manuela; Panzuto, Francesco; Liço, Valbona; Greenhalf, William; Katzke, Verena; Tavano, Francesca; Costello, Eithne; Corbo, Vincenzo; Talar-Wojnarowska, Renata; Strobel, Oliver; Zambon, Carlo Federico; Neoptolemos, John P; Zerboni, Giulia; Kaaks, Rudolf; Key, Timothy J; Lombardo, Carlo; Jamroziak, Krzysztof; Gioffreda, Domenica; Hackert, Thilo; Khaw, Kay-Tee; Landi, Stefano; Milanetto, Anna Caterina; Landoni, Luca; Lawlor, Rita T; Bambi, Franco; Pirozzi, Felice; Basso, Daniela; Pasquali, Claudio; Capurso, Gabriele; Canzian, Federico

    2017-08-01

    Background: Pancreatic neuroendocrine tumors (PNETs) are rare neoplasms for which very little is known about either environmental or genetic risk factors. Only a handful of association studies have been performed so far, suggesting a small number of risk loci. Methods: To replicate the best findings, we have selected 16 SNPs suggested in previous studies to be relevant in PNET etiogenesis. We genotyped the selected SNPs (rs16944, rs1052536, rs1059293, rs1136410, rs1143634, rs2069762, rs2236302, rs2387632, rs3212961, rs3734299, rs3803258, rs4962081, rs7234941, rs7243091, rs12957119, and rs1800629) in 344 PNET sporadic cases and 2,721 controls in the context of the PANcreatic Disease ReseArch (PANDoRA) consortium. Results: After correction for multiple testing, we did not observe any statistically significant association between the SNPs and PNET risk. We also used three online bioinformatic tools (HaploReg, RegulomeDB, and GTEx) to predict a possible functional role of the SNPs, but we did not observe any clear indication. Conclusions: None of the selected SNPs were convincingly associated with PNET risk in the PANDoRA consortium. Impact: We can exclude a major role of the selected polymorphisms in PNET etiology, and this highlights the need for replication of epidemiologic findings in independent populations, especially in rare diseases such as PNETs. Cancer Epidemiol Biomarkers Prev; 26(8); 1349-51. ©2017 AACR.

  16. EST-derived SNP discovery and selective pressure analysis in Pacific white shrimp (Litopenaeus vannamei)

    NASA Astrophysics Data System (ADS)

    Liu, Chengzhang; Wang, Xia; Xiang, Jianhai; Li, Fuhua

    2012-09-01

    Pacific white shrimp has become a major aquaculture and fishery species worldwide. Although a large scale EST resource has been publicly available since 2008, the data have not yet been widely used for SNP discovery or transcriptome-wide assessment of selective pressure. In this study, a set of 155 411 expressed sequence tags (ESTs) from the NCBI database were computationally analyzed and 17 225 single nucleotide polymorphisms (SNPs) were predicted, including 9 546 transitions, 5 124 transversions and 2 481 indels. Among the 7 298 SNP substitutions located in functionally annotated contigs, 58.4% (4 262) are non-synonymous SNPs capable of introducing amino acid mutations. Two hundred and fifty nonsynonymous SNPs in genes associated with economic traits have been identified as candidates for markers in selective breeding. Diversity estimates among the synonymous nucleotides were on average 3.49 times greater than those among non-synonymous nucleotides, suggesting negative selection. The ratio of non-synonymous to synonymous substitutions (Ka/Ks) ranges from 0 to 4.01 (average 0.42, median 0.26), suggesting that the majority of the affected genes are under purifying selection. Enrichment analysis identified multiple gene ontology categories under positive or negative selection. Categories involved in innate immune response and male gamete generation are rich in positively selected genes, which is similar to reports in Drosophila and primates. This work is the first transcriptome-wide assessment of selective pressure in a Penaeid shrimp species. The functionally annotated SNPs provide a valuable resource of potential molecular markers for selective breeding.
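
    The split of substitutions into transitions and transversions reported in this abstract follows a standard rule: purine to purine or pyrimidine to pyrimidine changes are transitions, and all other substitutions are transversions. The sketch below shows that rule only. The function names and the example SNP list are hypothetical and do not come from the study.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref, alt):
    """Return 'transition' or 'transversion' for a single-base substitution."""
    ref, alt = ref.upper(), alt.upper()
    bases = PURINES | PYRIMIDINES
    if ref not in bases or alt not in bases or ref == alt:
        raise ValueError(f"not a single-base substitution: {ref}>{alt}")
    same_class = ({ref, alt} <= PURINES) or ({ref, alt} <= PYRIMIDINES)
    return "transition" if same_class else "transversion"


def ts_tv_ratio(substitutions):
    """Transition/transversion ratio for an iterable of (ref, alt) pairs."""
    kinds = [classify_substitution(ref, alt) for ref, alt in substitutions]
    n_ts = kinds.count("transition")
    n_tv = kinds.count("transversion")
    if n_tv == 0:
        raise ValueError("no transversions: ratio undefined")
    return n_ts / n_tv


# Hypothetical SNP calls: two transitions and one transversion.
example_ratio = ts_tv_ratio([("A", "G"), ("C", "T"), ("A", "C")])
```

    Applied to the counts quoted in the abstract, the same ratio would be 9 546 / 5 124, or about 1.86.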

  17. Can polymorphisms in the fatty acid desaturase (FADS) gene cluster alter the effects of fish oil supplementation on plasma and erythrocyte fatty acid profiles? An exploratory study.

    PubMed

    Meldrum, Suzanne J; Li, Yuchun; Zhang, Guicheng; Heaton, Alexandra E M; D'Vaz, Nina; Manz, Judith; Reischl, Eva; Koletzko, Berthold V; Prescott, Susan L; Simmer, Karen

    2017-09-19

    The enzymes encoded by fatty acid desaturases (FADS) genes determine the desaturation of long-chain polyunsaturated fatty acids (LCPUFA). We investigated if haplotype and single nucleotide polymorphisms (SNPs) in FADS gene cluster can influence LCPUFA status in infants who received either fish oil or placebo supplementation. Children enrolled in the Infant Fish Oil Supplementation Study (IFOS) were randomly allocated to receive either fish oil or placebo from birth to 6 months of age. Blood was collected at 6 months of age for the measurement of fatty acids and for DNA extraction. A total of 276 participant DNA samples underwent genotyping, and 126 erythrocyte and 133 plasma fatty acid measurements were available for analysis. Twenty-two FADS SNPs were selected on the basis of literature and linkage disequilibrium patterns identified from the HapMap data. Haplotype construction was completed using PHASE. For participants allocated to the fish oil group who had two copies of the FADS1 haplotype consisting of SNP minor alleles, DHA levels were significantly higher compared to other haplotypes. This finding was not observed for the placebo group. Furthermore, for members of the fish oil group only, the minor homozygous carriers of all the FADS1 SNPs investigated had significantly higher DHA than other genotypes (rs174545, rs174546, rs174548, rs174553, rs174556, rs174537, rs174448, and rs174455). Overall results of this preliminary study suggest that supplementation with fish oil may only significantly increase DHA in minor allele carriers of FADS1 SNPs. Further research is required to confirm this novel finding.

  18. Genome-Wide Association Study Identifies Candidate Genes for Starch Content Regulation in Maize Kernels

    PubMed Central

    Liu, Na; Xue, Yadong; Guo, Zhanyong; Li, Weihua; Tang, Jihua

    2016-01-01

    Kernel starch content is an important trait in maize (Zea mays L.) as it accounts for 65–75% of the dry kernel weight and positively correlates with seed yield. A number of starch synthesis-related genes have been identified in maize in recent years. However, many loci underlying variation in starch content among maize inbred lines still remain to be identified. The current study is a genome-wide association study that used a set of 263 maize inbred lines. In this panel, the average kernel starch content was 66.99%, ranging from 60.60 to 71.58% over the three study years. These inbred lines were genotyped with the SNP50 BeadChip maize array, which is comprised of 56,110 evenly spaced, random SNPs. Population structure was controlled by a mixed linear model (MLM) as implemented in the software package TASSEL. After the statistical analyses, four SNPs were identified as significantly associated with starch content (P ≤ 0.0001), among which one each are located on chromosomes 1 and 5 and two are on chromosome 2. Furthermore, 77 candidate genes associated with starch synthesis were found within the 100-kb intervals containing these four QTLs, and four highly associated genes were within 20-kb intervals of the associated SNPs. Among the four genes, Glucose-1-phosphate adenylyltransferase (APS1; Gene ID GRMZM2G163437) is known as an important regulator of kernel starch content. The identified SNPs, QTLs, and candidate genes may not only be readily used for germplasm improvement by marker-assisted selection in breeding, but can also elucidate the genetic basis of starch content. Further studies on these identified candidate genes may help determine the molecular mechanisms regulating kernel starch content in maize and other important cereal crops. PMID:27512395

  19. Transfer of genetic therapy across human populations: molecular targets for increasing patient coverage in repeat expansion diseases

    PubMed Central

    Varela, Miguel A; Curtis, Helen J; Douglas, Andrew GL; Hammond, Suzan M; O'Loughlin, Aisling J; Sobrido, Maria J; Scholefield, Janine; Wood, Matthew JA

    2016-01-01

    Allele-specific gene therapy aims to silence expression of mutant alleles through targeting of disease-linked single-nucleotide polymorphisms (SNPs). However, SNP linkage to disease varies between populations, making such molecular therapies applicable only to a subset of patients. Moreover, not all SNPs have the molecular features necessary for potent gene silencing. Here we provide knowledge to allow the maximisation of patient coverage by building a comprehensive understanding of SNPs ranked according to their predicted suitability toward allele-specific silencing in 14 repeat expansion diseases: amyotrophic lateral sclerosis and frontotemporal dementia, dentatorubral-pallidoluysian atrophy, myotonic dystrophy 1, myotonic dystrophy 2, Huntington's disease and several spinocerebellar ataxias. Our systematic analysis of DNA sequence variation shows that most annotated SNPs are not suitable for potent allele-specific silencing across populations because of suboptimal sequence features and low variability (>97% in HD). We suggest maximising patient coverage by selecting SNPs with high heterozygosity across populations, and preferentially targeting SNPs that lead to purine:purine mismatches in wild-type alleles to obtain potent allele-specific silencing. We therefore provide fundamental knowledge on strategies for optimising patient coverage of therapeutics for microsatellite expansion disorders by linking analysis of population genetic variation to the selection of molecular targets. PMID:25990798

  20. Transfer of genetic therapy across human populations: molecular targets for increasing patient coverage in repeat expansion diseases.

    PubMed

    Varela, Miguel A; Curtis, Helen J; Douglas, Andrew G L; Hammond, Suzan M; O'Loughlin, Aisling J; Sobrido, Maria J; Scholefield, Janine; Wood, Matthew J A

    2016-02-01

    Allele-specific gene therapy aims to silence expression of mutant alleles through targeting of disease-linked single-nucleotide polymorphisms (SNPs). However, SNP linkage to disease varies between populations, making such molecular therapies applicable only to a subset of patients. Moreover, not all SNPs have the molecular features necessary for potent gene silencing. Here we provide knowledge to allow the maximisation of patient coverage by building a comprehensive understanding of SNPs ranked according to their predicted suitability toward allele-specific silencing in 14 repeat expansion diseases: amyotrophic lateral sclerosis and frontotemporal dementia, dentatorubral-pallidoluysian atrophy, myotonic dystrophy 1, myotonic dystrophy 2, Huntington's disease and several spinocerebellar ataxias. Our systematic analysis of DNA sequence variation shows that most annotated SNPs are not suitable for potent allele-specific silencing across populations because of suboptimal sequence features and low variability (>97% in HD). We suggest maximising patient coverage by selecting SNPs with high heterozygosity across populations, and preferentially targeting SNPs that lead to purine:purine mismatches in wild-type alleles to obtain potent allele-specific silencing. We therefore provide fundamental knowledge on strategies for optimising patient coverage of therapeutics for microsatellite expansion disorders by linking analysis of population genetic variation to the selection of molecular targets.

  1. Genetic Variation and Recent Positive Selection in Worldwide Human Populations: Evidence from Nearly 1 Million SNPs

    PubMed Central

    Theunert, Christoph; Pugach, Irina; Li, Jing; Nandineni, Madhusudan R.; Gross, Arnd; Scholz, Markus; Stoneking, Mark

    2009-01-01

    Background: Genome-wide scans of hundreds of thousands of single-nucleotide polymorphisms (SNPs) have resulted in the identification of new susceptibility variants to common diseases and are providing new insights into the genetic structure and relationships of human populations. Moreover, genome-wide data can be used to search for signals of recent positive selection, thereby providing new insights into the genetic adaptations that occurred as modern humans spread out of Africa and around the world. Methodology: We genotyped approximately 500,000 SNPs in 255 individuals (5 individuals from each of 51 worldwide populations) from the Human Genome Diversity Panel (HGDP-CEPH). When merged with non-overlapping SNPs typed previously in 250 of these same individuals, the resulting data consist of over 950,000 SNPs. We then analyzed the genetic relationships and ancestry of individuals without assigning them to populations, and we also identified candidate regions of recent positive selection at both the population and regional (continental) level. Conclusions: Our analyses both confirm and extend previous studies; in particular, we highlight the impact of various dispersals, and the role of substructure in Africa, on human genetic diversity. We also identified several novel candidate regions for recent positive selection, and a gene ontology (GO) analysis identified several GO groups that were significantly enriched for such candidate genes, including immunity and defense related genes, sensory perception genes, membrane proteins, signal receptors, lipid binding/metabolism genes, and genes involved in the nervous system. Among the novel candidate genes identified are two genes involved in the thyroid hormone pathway that show signals of selection in African Pygmies that may be related to their short stature. PMID:19924308

  2. Combined sequence and sequence-structure-based methods for analyzing RAAS gene SNPs: a computational approach.

    PubMed

    Singh, Kh Dhanachandra; Karthikeyan, Muthusamy

    2014-12-01

    The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations in the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also yield antihypertensive therapeutic targets and diagnostic markers. Analyzing SNPs from the huge dataset of SNPs, which contains both functional and neutral SNPs, is challenging when every SNP must be tested experimentally to determine its biological significance. To explore the functional significance of genetic mutations (SNPs), we adopted a combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region, with the remainder in the non-coding region. In this study, we report a coding-region SNP as deleterious only when three or more tools predict it to be deleterious and it shows a high RMSD from the native structure. Based on these analyses, we identified two SNPs of the REN gene, eight SNPs of the AGT gene, three SNPs of the ACE gene, two SNPs of the AT1R gene, three SNPs of the CYP11B2 gene and three SNPs of the CMA1 gene in the coding region as deleterious. This type of study will help reduce the cost and time of identifying potential SNPs and help select potential SNPs from the SNP pool for experimental study.

  3. Polymorphisms Related to the Serum 25-Hydroxyvitamin D Level and Risk of Myocardial Infarction, Diabetes, Cancer and Mortality. The Tromsø Study

    PubMed Central

    Jorde, Rolf; Schirmer, Henrik; Wilsgaard, Tom; Joakimsen, Ragnar Martin; Mathiesen, Ellisiv Bøgeberg; Njølstad, Inger; Løchen, Maja-Lisa; Figenschau, Yngve; Berg, Jens Petter; Svartberg, Johan; Grimnes, Guri

    2012-01-01

    Objective: Low serum 25(OH)D levels are associated with cardiovascular risk factors, and also predict future myocardial infarction (MI), type 2 diabetes (T2DM), cancer and all-cause mortality. Recently several single nucleotide polymorphisms (SNPs) associated with serum 25-hydroxyvitamin D (25(OH)D) level have been identified. If these relations are causal one would expect a similar association between these SNPs and health. Methods: DNA was prepared from subjects who participated in the fourth survey of the Tromsø Study in 1994–1995 and who were registered with the endpoints MI, T2DM, cancer or death as well as a randomly selected control group. The endpoint registers were complete up to 2007–2010. Genotyping was performed for 17 SNPs related to the serum 25(OH)D level. Results: A total of 9528 subjects were selected for genetic analyses which were successfully performed for at least one SNP in 9471 subjects. Among these, 2025 were registered with MI, 1092 with T2DM, 2924 with cancer and 3828 had died. The mean differences in serum 25(OH)D levels between SNP genotypes with the lowest and highest serum 25(OH)D levels varied from 0.1 to 7.8 nmol/L. A genotype score based on weighted risk alleles regarding low serum 25(OH)D levels was established. There was no consistent association between the genotype score or individual SNPs and MI, T2DM, cancer, mortality or risk factors for disease. However, for rs6013897 genotypes (located at the 24-hydroxylase gene (CYP24A1)) there was a significant association with breast cancer (P<0.05). Conclusion: Our results neither support nor exclude a causal relationship between serum 25(OH)D levels and MI, T2DM, cancer or mortality, and our observation on breast cancer needs confirmation. Further genetic studies are warranted, particularly in populations with vitamin D deficiency. Trial Registration: ClinicalTrials.gov NCT01395303 PMID:22649517

  4. Integrative Bayesian variable selection with gene-based informative priors for genome-wide association studies.

    PubMed

    Zhang, Xiaoshuai; Xue, Fuzhong; Liu, Hong; Zhu, Dianwen; Peng, Bin; Wiemels, Joseph L; Yang, Xiaowei

    2014-12-10

    Genome-wide Association Studies (GWAS) are typically designed to identify phenotype-associated single nucleotide polymorphisms (SNPs) individually using univariate analysis methods. Though providing valuable insights into genetic risks of common diseases, the genetic variants identified by GWAS generally account for only a small proportion of the total heritability for complex diseases. To solve this "missing heritability" problem, we implemented a strategy called integrative Bayesian Variable Selection (iBVS), which is based on a hierarchical model that incorporates an informative prior by considering the gene interrelationship as a network. It was applied here to both simulated and real data sets. Simulation studies indicated that the iBVS method was advantageous in its performance with highest AUC in both variable selection and outcome prediction, when compared to Stepwise and LASSO based strategies. In an analysis of a leprosy case-control study, iBVS selected 94 SNPs as predictors, while LASSO selected 100 SNPs. The Stepwise regression yielded a more parsimonious model with only 3 SNPs. The prediction results demonstrated that the iBVS method had comparable performance with that of LASSO, but better than Stepwise strategies. The proposed iBVS strategy is a novel and valid method for Genome-wide Association Studies, with the additional advantage in that it produces more interpretable posterior probabilities for each variable unlike LASSO and other penalized regression methods.

  5. Joint effect of unlinked genotypes: application to type 2 diabetes in the EPIC-Potsdam case-cohort study.

    PubMed

    Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner

    2015-07-01

    Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R², suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, or non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
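The abstract above contrasts MSR with a genetic risk score built from risk allele doses. As a minimal sketch of that idea only (not the authors' implementation), the Python below sums per-SNP risk allele doses coded 0, 1 or 2, optionally with per-SNP weights. The function name `genetic_risk_score`, the example doses and the example weights are all hypothetical and are not taken from the EPIC-Potsdam data.

```python
from typing import Optional, Sequence


def genetic_risk_score(
    doses: Sequence[int],
    weights: Optional[Sequence[float]] = None,
) -> float:
    """Sum risk-allele doses (0, 1 or 2 per SNP), optionally weighted per SNP."""
    if any(d not in (0, 1, 2) for d in doses):
        raise ValueError("each dose must be 0, 1 or 2 risk alleles")
    if weights is None:
        # Unweighted score: simply count risk alleles across all SNPs.
        return float(sum(doses))
    if len(weights) != len(doses):
        raise ValueError("weights and doses must have the same length")
    # Weighted score: each SNP contributes weight * dose.
    return float(sum(w * d for w, d in zip(weights, doses)))


# Hypothetical individual genotyped at four SNPs (illustrative values only).
example_doses = [0, 1, 2, 1]
example_weights = [0.5, 1.0, 0.25, 2.0]  # assumed per-SNP weights

unweighted = genetic_risk_score(example_doses)
weighted = genetic_risk_score(example_doses, example_weights)
```

Because every SNP enters only through its additive dose, a score of this form cannot represent SNP-SNP interactions or non-linear effects, which is the limitation the abstract attributes to the GRS.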

  6. Studying the genetic basis of speciation in high gene flow marine invertebrates

    PubMed Central

    2016-01-01

    A growing number of genes responsible for reproductive incompatibilities between species (barrier loci) exhibit the signals of positive selection. However, the possibility that genes experiencing positive selection diverge early in speciation and commonly cause reproductive incompatibilities has not been systematically investigated on a genome-wide scale. Here, I outline a research program for studying the genetic basis of speciation in broadcast spawning marine invertebrates that uses a priori genome-wide information on a large, unbiased sample of genes tested for positive selection. A targeted sequence capture approach is proposed that scores single-nucleotide polymorphisms (SNPs) in widely separated species populations at an early stage of allopatric divergence. The targeted capture of both coding and non-coding sequences enables SNPs to be characterized at known locations across the genome and at genes with known selective or neutral histories. The neutral coding and non-coding SNPs provide robust background distributions for identifying FST-outliers within genes that can, in principle, identify specific mutations experiencing diversifying selection. If natural hybridization occurs between species, the neutral coding and non-coding SNPs can provide a neutral admixture model for genomic clines analyses aimed at finding genes exhibiting strong blocks to introgression. Strongylocentrotid sea urchins are used as a model system to outline the approach but it can be used for any group that has a complete reference genome available. PMID:29491951

  7. SNP mining in Crassostrea gigas EST data: transferability to four other Crassostrea species, phylogenetic inferences and outlier SNPs under selection.

    PubMed

    Zhong, Xiaoxiao; Li, Qi; Yu, Hong; Kong, Lingfeng

    2014-01-01

    Oysters, with high levels of phenotypic plasticity and wide geographic distribution, are a challenging group for taxonomists and phylogeneticists. Our study is intended to generate new EST-SNP markers and to evaluate their potential for cross-species utilization in phylogenetic study of the genus Crassostrea. In the study, 57 novel SNPs were developed from an EST database of C. gigas by the HRM (high-resolution melting) method. Transferability of 377 SNPs developed for C. gigas was examined on four other Crassostrea species: C. sikamea, C. angulata, C. hongkongensis and C. ariakensis. Among the 377 primer pairs tested, 311 (82.5%) primers showed amplification in C. sikamea, 353 (93.6%) in C. angulata, 254 (67.4%) in C. hongkongensis and 253 (67.1%) in C. ariakensis. A total of 214 SNPs were found to be transferable to all four species. Phylogenetic analyses showed that C. hongkongensis was a sister species of C. ariakensis and that this clade was sister to the clade containing C. sikamea, C. angulata and C. gigas. Within this clade, C. gigas and C. angulata had the closest relationship, with C. sikamea being the sister group. In addition, we detected eight SNPs as potentially being under selection by two outlier tests (fdist and hierarchical methods). The SNPs studied here should be useful for genetic diversity, comparative mapping and phylogenetic studies across species in Crassostrea and the candidate outlier SNPs are worth exploring in more detail regarding association genetics and functional studies.
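The transferability percentages in the abstract above can be rechecked directly from the reported counts. The short Python sketch below does that arithmetic, using only the counts given in the abstract. The variable and function names are illustrative. The share for SNPs transferable to all four species is derived here and is not a figure reported in the abstract.

```python
# Counts taken from the abstract: primer pairs tested and amplifications per species.
TESTED = 377
amplified = {
    "C. sikamea": 311,
    "C. angulata": 353,
    "C. hongkongensis": 254,
    "C. ariakensis": 253,
}


def percent(count: int, total: int) -> float:
    """Return count / total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


# Per-species transferability rate, in percent of the 377 primer pairs tested.
rates = {species: percent(n, TESTED) for species, n in amplified.items()}

# Derived value (not stated in the abstract): share transferable to all four species.
all_four_rate = percent(214, TESTED)

for species, rate in rates.items():
    print(f"{species}: {rate}%")
```

The recomputed per-species rates agree with the percentages quoted in the abstract.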

  8. Genomic variation at the tips of the adaptive radiation of Darwin's finches.

    PubMed

    Chaves, Jaime A; Cooper, Elizabeth A; Hendry, Andrew P; Podos, Jeffrey; De León, Luis F; Raeymaekers, Joost A M; MacMillan, W Owen; Uy, J Albert C

    2016-11-01

    Adaptive radiation unfolds as selection acts on the genetic variation underlying functional traits. The nature of this variation can be revealed by studying the tips of an ongoing adaptive radiation. We studied genomic variation at the tips of the Darwin's finch radiation; specifically focusing on polymorphism within, and variation among, three sympatric species of the genus Geospiza. Using restriction site-associated DNA (RAD-seq), we characterized 32 569 single-nucleotide polymorphisms (SNPs), from which 11 outlier SNPs for beak and body size were uncovered by a genomewide association study (GWAS). Principal component analysis revealed that these 11 SNPs formed four statistically linked groups. Stepwise regression then revealed that the first PC score, which included 6 of the 11 top SNPs, explained over 80% of the variation in beak size, suggesting that selection on these traits influences multiple correlated loci. The two SNPs most strongly associated with beak size were near genes associated with beak morphology across deeper branches of the radiation: delta-like 1 homologue (DLK1) and high-mobility group AT-hook 2 (HMGA2). Our results suggest that (i) key adaptive traits are associated with a small fraction of the genome (11 of 32 569 SNPs), (ii) SNPs linked to the candidate genes are dispersed throughout the genome (on several chromosomes), and (iii) micro- and macro-evolutionary variation (roots and tips of the radiation) involve some shared and some unique genomic regions. © 2016 John Wiley & Sons Ltd.

  9. Searching for ancient balanced polymorphisms shared between Neanderthals and Modern Humans

    PubMed Central

    Viscardi, Lucas Henriques; Paixão-Côrtes, Vanessa Rodrigues; Comas, David; Salzano, Francisco Mauro; Rovaris, Diego; Bau, Claiton Dotto; Amorim, Carlos Eduardo G.; Bortolini, Maria Cátira

    2018-01-01

    Hominin evolution is characterized by adaptive solutions often rooted in behavioral and cognitive changes. If balancing selection had an important and long-lasting impact on the evolution of these traits, it can be hypothesized that genes associated with them should carry an excess of shared polymorphisms (trans-SNPs) across recent Homo species. In this study, we investigate the role of balancing selection in human evolution by testing available exomes from modern (Homo sapiens) and archaic humans (H. neanderthalensis and Denisovan) for an excess of trans-SNPs in two gene sets: one associated with the immune system (IMMS) and another one with the behavioral system (BEHS). We identified a significant excess of trans-SNPs in IMMS (N=547), six of which are located within genes previously associated with schizophrenia. No excess of trans-SNPs was found in BEHS, but five genes in this system harbor potential signals for balancing selection and are associated with psychiatric or neurodevelopmental disorders. Our approach evidenced recent Homo trans-SNPs that have been previously implicated in psychiatric diseases such as schizophrenia, suggesting that a genetic repertoire common to the immune and behavioral systems could have been maintained by balancing selection starting before the split between archaic and modern humans. PMID:29658973

  10. Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk

    PubMed Central

    Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.

    2008-01-01

    Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408
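The abstract above reports Pearson's χ2 statistics with one degree of freedom alongside their P values. As a hedged illustration of how such P values follow from the statistic, the Python sketch below uses the identity that a χ2 variable with 1 df is the square of a standard normal, so its upper-tail probability equals erfc(sqrt(x / 2)). The function name `chi2_sf_df1` is illustrative, and this is not a claim about the software the authors used.

```python
import math


def chi2_sf_df1(statistic: float) -> float:
    """Upper-tail probability of a chi-square distribution with 1 degree of freedom.

    With 1 df the variable is the square of a standard normal Z, so
    P(X > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    """
    if statistic < 0:
        raise ValueError("a chi-square statistic cannot be negative")
    return math.erfc(math.sqrt(statistic / 2.0))


# Statistics quoted in the abstract (df = 1 in both cases).
p_all_cases = chi2_sf_df1(12.34)   # risk haplotype, all cases
p_ibd_cases = chi2_sf_df1(11.50)   # homozygous risk haplotype, IBD-sharing cases

print(f"chi2 = 12.34 -> P ~ {p_all_cases:.5f}")
print(f"chi2 = 11.50 -> P ~ {p_ibd_cases:.5f}")
```

Both results land close to the reported values of 0.00045 and 0.00070. Small differences in the last digit can arise from rounding of the published statistics.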

  11. Lack of replication of thirteen single-nucleotide polymorphisms implicated in Parkinson’s disease: a large-scale international study

    PubMed Central

    Elbaz, Alexis; Nelson, Lorene M; Payami, Haydeh; Ioannidis, John P A; Fiske, Brian K; Annesi, Grazia; Belin, Andrea Carmine; Factor, Stewart A; Ferrarese, Carlo; Hadjigeorgiou, Georgios M; Higgins, Donald S; Kawakami, Hideshi; Krüger, Rejko; Marder, Karen S; Mayeux, Richard P; Mellick, George D; Nutt, John G; Ritz, Beate; Samii, Ali; Tanner, Caroline M; Van Broeckhoven, Christine; Van Den Eeden, Stephen K; Wirdefeldt, Karin; Zabetian, Cyrus P; Dehem, Marie; Montimurro, Jennifer S; Southwick, Audrey; Myers, Richard M; Trikalinos, Thomas A

    2013-01-01

    Background: A genome-wide association study identified 13 single-nucleotide polymorphisms (SNPs) significantly associated with Parkinson’s disease. Small-scale replication studies were largely non-confirmatory, but a meta-analysis that included data from the original study could not exclude all SNP associations, leaving relevance of several markers uncertain. Methods: Investigators from three Michael J Fox Foundation for Parkinson’s Research-funded genetics consortia, comprising 14 teams, contributed DNA samples from 5526 patients with Parkinson’s disease and 6682 controls, which were genotyped for the 13 SNPs. Most (88%) participants were of white, non-Hispanic descent. We assessed log-additive genetic effects using fixed and random effects models stratified by team and ethnic origin, and tested for heterogeneity across strata. A meta-analysis was undertaken that incorporated data from the original genome-wide study as well as subsequent replication studies. Findings: In fixed and random-effects models no associations with any of the 13 SNPs were identified (odds ratios 0·89 to 1·09). Heterogeneity between studies and between ethnic groups was low for all SNPs. Subgroup analyses by age at study entry, ethnic origin, sex, and family history did not show any consistent associations. In our meta-analysis, no SNP showed significant association (summary odds ratios 0·95 to 1·08); there was little heterogeneity except for SNP rs7520966. Interpretation: Our results do not lend support to the finding that the 13 SNPs reported in the original genome-wide association study are genetic susceptibility factors for Parkinson’s disease. PMID:17052658

  12. Genomic Selection in Dairy Cattle: The USDA Experience.

    PubMed

    Wiggans, George R; Cole, John B; Hubbard, Suzanne M; Sonstegard, Tad S

    2017-02-08

    Genomic selection has revolutionized dairy cattle breeding. Since 2000, assays have been developed to genotype large numbers of single-nucleotide polymorphisms (SNPs) at relatively low cost. The first commercial SNP genotyping chip was released with a set of 54,001 SNPs in December 2007. Over 15,000 genotypes were used to determine which SNPs should be used in genomic evaluation of US dairy cattle. Official USDA genomic evaluations were first released in January 2009 for Holsteins and Jerseys, in August 2009 for Brown Swiss, in April 2013 for Ayrshires, and in April 2016 for Guernseys. Producers have accepted genomic evaluations as accurate indications of a bull's eventual daughter-based evaluation. The integration of DNA marker technology and genomics into the traditional evaluation system has doubled the rate of genetic progress for traits of economic importance, decreased generation interval, increased selection accuracy, reduced previous costs of progeny testing, and allowed identification of recessive lethals.

  13. Prospecting for pig single nucleotide polymorphisms in the human genome: have we struck gold?

    PubMed

    Grapes, L; Rudd, S; Fernando, R L; Megy, K; Rocha, D; Rothschild, M F

    2006-06-01

    Gene-to-gene variation in the frequency of single nucleotide polymorphisms (SNPs) has been observed in humans, mice, rats, primates and pigs, but a relationship across species in this variation has not been described. Here, the frequency of porcine coding SNPs (cSNPs) identified by in silico methods, and the frequency of murine cSNPs, were compared with the frequency of human cSNPs across homologous genes. From 150,000 porcine expressed sequence tag (EST) sequences, a total of 452 SNP-containing sequence clusters were found, totalling 1394 putative SNPs. All the clustered porcine EST annotations and SNP data have been made publicly available at http://sputnik.btk.fi/project?name=swine. Human and murine cSNPs were identified from dbSNP and were characterized as either validated or total number of cSNPs (validated plus non-validated) for comparison purposes. The correlation between in silico pig cSNP and validated human cSNP densities was found to be 0.77 (p < 0.00001) for a set of 25 homologous genes, while a correlation of 0.48 (p < 0.0005) was found for a primarily random sample of 50 homologous human and mouse genes. This is the first evidence of conserved gene-to-gene variability in cSNP frequency across species and indicates that site-directed screening of porcine genes that are homologous to cSNP-rich human genes may rapidly advance cSNP discovery in pigs.

  14. Genomewide single nucleotide polymorphism discovery in Atlantic salmon (Salmo salar): validation in wild and farmed American and European populations.

    PubMed

    Yáñez, J M; Naswa, S; López, M E; Bassini, L; Correa, K; Gilbey, J; Bernatchez, L; Norris, A; Neira, R; Lhorente, J P; Schnable, P S; Newman, S; Mileham, A; Deeb, N; Di Genova, A; Maass, A

    2016-07-01

    A considerable number of single nucleotide polymorphisms (SNPs) are required to elucidate genotype-phenotype associations and determine the molecular basis of important traits. In this work, we carried out de novo SNP discovery accounting for both genome duplication and genetic variation from American and European salmon populations. A total of 9 736 473 nonredundant SNPs were identified across a set of 20 fish by whole-genome sequencing. After applying six bioinformatic filtering steps, 200 K SNPs were selected to develop an Affymetrix Axiom® myDesign Custom Array. This array was used to genotype 480 fish representing wild and farmed salmon from Europe, North America and Chile. A total of 159 099 (79.6%) SNPs were validated as high quality based on clustering properties. A total of 151 509 validated SNPs showed a unique position in the genome. When comparing these SNPs against 238 572 markers currently available in two other Atlantic salmon arrays, only 4.6% of the SNPs overlapped with the panel developed in this study. This novel high-density SNP panel will be very useful for the dissection of economically and ecologically relevant traits, enhancing breeding programmes through genomic selection as well as supporting genetic studies in both wild and farmed populations of Atlantic salmon using high-resolution genomewide information. © 2016 John Wiley & Sons Ltd.

  15. Selected single-nucleotide polymorphisms in FOXE1, SERPINA5, FTO, EVPL, TICAM1 and SCARB1 are associated with papillary and follicular thyroid cancer risk: replication study in a German population

    PubMed Central

    Sigurdson, Alice J.; Brenner, Alina V.; Roach, James A.; Goudeva, Lilia; Müller, Jörg A.; Nerlich, Kai; Reiners, Christoph; Schwab, Robert; Pfeiffer, Liliane; Waldenberger, Melanie; Braganza, Melissa; Xu, Li; Sturgis, Erich M.; Yeager, Meredith; Chanock, Stephen J.; Pfeiffer, Ruth M.; Abend, Michael; Port, Matthias

    2016-01-01

    Several single-nucleotide polymorphisms (SNPs) have been associated with papillary and follicular thyroid cancer (PTC and FTC, respectively) risk, but few have replicated. After analyzing 17525 tag SNPs in 1129 candidate genes, we found associations with PTC risk in SERPINA5, FTO, HEMGN (near FOXE1) and other genes. Here, we report results from a replication effort in a large independent PTC/FTC case–control study conducted in Germany. We evaluated the best tagging SNPs from our previous PTC study and additionally included SNPs in or near FOXE1 and NKX2-1 genes, known susceptibility loci for thyroid cancer. We genotyped 422 PTC and 130 FTC cases and 752 controls recruited from three German clinical centers. We used polytomous logistic regression to simultaneously estimate PTC and FTC associations for 79 SNPs based on log-additive models. We assessed effect modification by body mass index (BMI), gender and age for all SNPs, and selected SNP by SNP interactions. We confirmed associations with PTC and SNPs in FOXE1/HEMGN, SERPINA5 (rs2069974), FTO (rs8047395), EVPL (rs2071194), TICAM1 (rs8120) and SCARB1 (rs11057820) genes. We found associations with SNPs in FOXE1, SERPINA5, FTO, TICAM1 and HSPA6 and FTC. We found two significant interactions between FTO (rs8047395) and BMI (P = 0.0321) and between TICAM1 (rs8120) and FOXE1 (rs10984377) (P = 0.0006). Besides the known associations with FOXE1 SNPs, we confirmed additional PTC SNP associations reported previously. We also found several new associations with FTC risk and noteworthy interactions. We conclude that multiple variants and host factors might interact in complex ways to increase risk of PTC and FTC. PMID:27207655

  16. Selected single-nucleotide polymorphisms in FOXE1, SERPINA5, FTO, EVPL, TICAM1 and SCARB1 are associated with papillary and follicular thyroid cancer risk: replication study in a German population.

    PubMed

    Sigurdson, Alice J; Brenner, Alina V; Roach, James A; Goudeva, Lilia; Müller, Jörg A; Nerlich, Kai; Reiners, Christoph; Schwab, Robert; Pfeiffer, Liliane; Waldenberger, Melanie; Braganza, Melissa; Xu, Li; Sturgis, Erich M; Yeager, Meredith; Chanock, Stephen J; Pfeiffer, Ruth M; Abend, Michael; Port, Matthias

    2016-07-01

    Several single-nucleotide polymorphisms (SNPs) have been associated with papillary and follicular thyroid cancer (PTC and FTC, respectively) risk, but few have replicated. After analyzing 17525 tag SNPs in 1129 candidate genes, we found associations with PTC risk in SERPINA5, FTO, HEMGN (near FOXE1) and other genes. Here, we report results from a replication effort in a large independent PTC/FTC case-control study conducted in Germany. We evaluated the best tagging SNPs from our previous PTC study and additionally included SNPs in or near FOXE1 and NKX2-1 genes, known susceptibility loci for thyroid cancer. We genotyped 422 PTC and 130 FTC cases and 752 controls recruited from three German clinical centers. We used polytomous logistic regression to simultaneously estimate PTC and FTC associations for 79 SNPs based on log-additive models. We assessed effect modification by body mass index (BMI), gender and age for all SNPs, and selected SNP by SNP interactions. We confirmed associations with PTC and SNPs in FOXE1/HEMGN, SERPINA5 (rs2069974), FTO (rs8047395), EVPL (rs2071194), TICAM1 (rs8120) and SCARB1 (rs11057820) genes. We found associations with SNPs in FOXE1, SERPINA5, FTO, TICAM1 and HSPA6 and FTC. We found two significant interactions between FTO (rs8047395) and BMI (P = 0.0321) and between TICAM1 (rs8120) and FOXE1 (rs10984377) (P = 0.0006). Besides the known associations with FOXE1 SNPs, we confirmed additional PTC SNP associations reported previously. We also found several new associations with FTC risk and noteworthy interactions. We conclude that multiple variants and host factors might interact in complex ways to increase risk of PTC and FTC. Published by Oxford University Press 2016.

  17. Discovery of Pod Shatter-Resistant Associated SNPs by Deep Sequencing of a Representative Library Followed by Bulk Segregant Analysis in Rapeseed

    PubMed Central

    Huang, Shunmou; Yang, Hongli; Zhan, Gaomiao; Wang, Xinfa; Liu, Guihua; Wang, Hanzhong

    2012-01-01

    Background Single nucleotide polymorphisms (SNPs) are an important class of genetic marker for target gene mapping. As of yet, there is no rapid and effective method to identify SNPs linked with agronomic traits in rapeseed and other crop species. Methodology/Principal Findings We demonstrate a novel method for identifying SNP markers in rapeseed by deep sequencing a representative library and performing bulk segregant analysis. With this method, SNPs associated with rapeseed pod shatter-resistance were discovered. Firstly, a reduced representation of the rapeseed genome was used. Genomic fragments ranging from 450–550 bp were prepared from the susceptible bulk (ten F2 plants with the silique shattering resistance index, SSRI <0.10) and the resistance bulk (ten F2 plants with SSRI >0.90), and Solexa sequencing produced 90 bp reads. Approximately 50 million of these sequence reads were assembled into contigs to a depth of 20-fold coverage. Secondly, 60,396 ‘simple SNPs’ were identified, and the statistical significance was evaluated using Fisher's exact test. Seventy associated SNPs with –log10 p values over 16 were selected for further analysis. The distribution of these SNPs showed a tight cluster, which consisted of 14 associated SNPs within a 396 kb region on chromosome A09. Our evidence indicates that this region contains a major quantitative trait locus (QTL). Finally, two associated SNPs from this region were mapped on a major QTL region. Conclusions/Significance 70 associated SNPs were discovered and a major QTL for rapeseed pod shatter-resistance was found on chromosome A09 using our novel method. The associated SNP markers were used for mapping of the QTL, and may be useful for improving pod shatter-resistance in rapeseed through marker-assisted selection and map-based cloning. This approach will accelerate the discovery of major QTLs and the cloning of functional genes for important agronomic traits in rapeseed and other crop species. PMID:22529909
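
    The significance step in the abstract above (Fisher's exact test on each candidate SNP, followed by a –log10 p cutoff of 16) can be sketched roughly as follows. This is an illustration only, not the authors' pipeline: the 2x2 layout (reads carrying each allele in each bulk), the function names, and the read counts are assumptions made for the example.

```python
from math import comb, log10


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are no more likely than the observed one.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(p, 1.0)


def neg_log10_p(a, b, c, d):
    """-log10 of the two-sided p-value, the score thresholded in the abstract."""
    return -log10(fisher_exact_two_sided(a, b, c, d))


# Hypothetical read counts: rows = resistant / susceptible bulk,
# columns = reads carrying allele 1 / allele 2 at one SNP.
score = neg_log10_p(40, 0, 0, 40)
is_associated = score > 16  # cutoff taken from the abstract
```

    Exact integer binomials keep the tail sum accurate for the very small p-values that such a cutoff implies; for large read depths a library routine would be the more practical choice.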

  18. Bone mineral density and risk of type 2 diabetes and coronary heart disease: A Mendelian randomization study.

    PubMed

    Gan, Wei; Clarke, Robert J; Mahajan, Anubha; Kulohoma, Benard; Kitajima, Hidetoshi; Robertson, Neil R; Rayner, N William; Walters, Robin G; Holmes, Michael V; Chen, Zhengming; McCarthy, Mark I

    2017-01-01

    Background: Observational studies have demonstrated that increased bone mineral density is associated with a higher risk of type 2 diabetes (T2D), but the relationship with risk of coronary heart disease (CHD) is less clear. Moreover, substantial uncertainty remains about the causal relevance of increased bone mineral density for T2D and CHD, which can be assessed by Mendelian randomisation studies. Methods: We identified 235 independent single nucleotide polymorphisms (SNPs) associated at p < 5×10⁻⁸ with estimated heel bone mineral density (eBMD) in 116,501 individuals from the UK Biobank study, accounting for 13.9% of eBMD variance. For each eBMD-associated SNP, we extracted effect estimates from the largest available GWAS studies for T2D (DIAGRAM: n=26,676 T2D cases and 132,532 controls) and CHD (CARDIoGRAMplusC4D: n=60,801 CHD cases and 123,504 controls). A two-sample design using several Mendelian randomization approaches was used to investigate the causal relevance of eBMD for risk of T2D and CHD. In addition, we explored the relationship of eBMD, instrumented by the 235 SNPs, with 12 cardiovascular and metabolic risk factors. Finally, we conducted Mendelian randomization analysis in the reverse direction to investigate reverse causality. Results: Each one standard deviation increase in genetically instrumented eBMD (equivalent to 0.14 g/cm²) was associated with an 8% higher risk of T2D (odds ratio [OR] 1.08; 95% confidence interval [CI]: 1.02 to 1.14; p = 0.012) and 5% higher risk of CHD (OR 1.05; 95% CI: 1.00 to 1.10; p = 0.034). Consistent results were obtained in sensitivity analyses using several different Mendelian randomization approaches. Equivalent increases in eBMD were also associated with lower plasma levels of HDL-cholesterol and increased insulin resistance. Mendelian randomization in the reverse direction using 94 T2D SNPs or 52 CHD SNPs showed no evidence of reverse causality with eBMD. Conclusions: These findings suggest a causal relationship between elevated bone mineral density and risks of both T2D and CHD.
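
    The abstract above does not say which two-sample Mendelian randomization estimators were used. As one common possibility, the fixed-effect inverse-variance weighted (IVW) estimate can be sketched as below. The function name and all per-SNP numbers are hypothetical and chosen only to show the arithmetic; they are not data from the study.

```python
from math import exp, sqrt


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect IVW causal estimate from per-SNP summary statistics.

    beta_exposure: SNP effects on the exposure (e.g. SD units of eBMD)
    beta_outcome:  SNP effects on the outcome (log odds ratios)
    se_outcome:    standard errors of the outcome effects
    Returns (log odds ratio per unit of exposure, its standard error).
    """
    num = sum(bx * by / se ** 2
              for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome))
    den = sum(bx ** 2 / se ** 2
              for bx, se in zip(beta_exposure, se_outcome))
    return num / den, sqrt(1.0 / den)


# Hypothetical summary statistics for three instruments.
bx = [0.10, 0.05, 0.08]
by = [0.0077, 0.0040, 0.0060]
se = [0.002, 0.003, 0.002]

log_or, log_or_se = ivw_estimate(bx, by, se)
odds_ratio = exp(log_or)
ci_low = exp(log_or - 1.96 * log_or_se)
ci_high = exp(log_or + 1.96 * log_or_se)
```

    Exponentiating the estimate converts the log odds ratio into an odds ratio per standard deviation of the exposure, the scale on which the abstract reports its results.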

  19. Association between variants in genes involved in the immune response and prostate cancer risk in men randomized to the finasteride arm in the Prostate Cancer Prevention Trial

    PubMed Central

    Winchester, Danyelle; Till, Cathee; Goodman, Phyllis J.; Tangen, Catherine M.; Santella, Regina M.; Johnson-Pais, Teresa L.; Leach, Robin J.; Xu, Jianfeng; Zheng, S. Lilly; Thompson, Ian M.; Lucia, M. Scott; Lippman, Scott M.; Parnes, Howard L.; Isaacs, William B.; De Marzo, Angelo M.; Drake, Charles G.; Platz, Elizabeth A.

    2017-01-01

    BACKGROUND We reported that some, but not all single nucleotide polymorphisms (SNPs) in select immune response genes are associated with prostate cancer, but not individually with the prevalence of intraprostatic inflammation in the Prostate Cancer Prevention Trial (PCPT) placebo arm. Here, we investigated whether these same SNPs are associated with risk of lower- and higher-grade prostate cancer in men randomized to finasteride, and with prevalence of intraprostatic inflammation among controls. METHODS 16 candidate SNPs in IL1β, IL2, IL4, IL6, IL8, IL10, IL12(p40), IFNG, MSR1, RNASEL, TLR4, and TNFA and 7 tagSNPs in IL10 were genotyped in 625 white prostate cancer cases, and 532 white controls negative for cancer on an end-of-study biopsy nested in the PCPT finasteride arm. We used logistic regression to estimate log-additive odds ratios (OR) and 95% confidence intervals (CI) adjusting for age and family history. RESULTS Minor alleles of rs2243250 (T) in IL4 (OR=1.46, 95% CI 1.03–2.08, P-trend=0.03), rs1800896 (G) in IL10 (OR=0.77, 95% CI 0.61–0.96, P-trend=0.02), rs2430561 (A) in IFNG (OR=1.33, 95% CI 1.02–1.74; P-trend=0.04), rs3747531 (C) in MSR1 (OR=0.55, 95% CI 0.32–0.95; P-trend=0.03), and possibly rs4073 (A) in IL8 (OR=0.81, 95% CI 0.64–1.01, P-trend=0.06) were associated with higher- (Gleason 7–10; N=222), but not lower- (Gleason 2–6; N=380) grade prostate cancer. In men with low PSA (<2 ng/mL), these higher-grade disease associations were attenuated and/or no longer significant, whereas associations with higher-grade disease were apparent for minor alleles of rs1800795 (C: OR=0.70, 95% CI 0.51–0.94, P-trend=0.02) and rs1800797 (A: OR=0.72, 95% CI 0.53–0.98, P-trend=0.04) in IL6. While some IL10 tagSNPs were associated with lower- and higher-grade prostate cancer, distributions of IL10 haplotypes did not differ, except possibly between higher-grade cases and controls among those with low PSA (P=0.07). 
    We did not observe an association between the studied SNPs and intraprostatic inflammation in the controls. CONCLUSION In the PCPT finasteride arm, variation in genes involved in the immune response, including possibly IL8 and IL10 as in the placebo arm, may be associated with prostate cancer, especially higher-grade disease, but not with intraprostatic inflammation. We cannot rule out PSA-associated detection bias or chance due to multiple testing. PMID:28317149

  20. Common sequence variants in CD36 gene and the levels of triglyceride and high-density lipoprotein cholesterol among ethnic Chinese in Taiwan

    PubMed Central

    2012-01-01

    Background Evidence of the genetic association between CD36 candidate gene and the risk of metabolic syndrome and its components has been inconsistent. This case–control study assessed the haplotype-tagged SNPs from CD36 on the risk of metabolic syndrome and components. Methods and results We recruited 1,000 cases and age, gender-matched controls were randomly selected from the participants with metabolic syndrome defined by International Diabetes Federation. Overall, the haplotype tagged SNPs of CD36 gene were not related to the risk of metabolic syndrome. For individuals with normal lipid levels, several SNPs were significantly associated with the triglycerides and HDL-cholesterol levels: Subjects with rs3211848 homozygote had a higher triglyceride level (99.16 ± 2.61 mg/dL), compared with non-carriers (89.27 ± 1.45 mg/dL, P = 0.001). In addition, compared with non-carriers, individuals with rs1054516 heterozygous and homozygous genotypes had a significantly lower HDL-cholesterol (46.6 ± 0.46 mg/dL for non-carrier, 44.6 ± 0.36 mg/dL for heterozygous, and 44.3 ± 0.56 mg/dL for homozygous, P = 0.0008). Conclusion The CD36 gene variants were significantly associated with triglycerides and HDL-cholesterol concentrations among ethnic Chinese in Taiwan. PMID:23249574

  1. Adaptations to Climate in Candidate Genes for Common Metabolic Disorders

    PubMed Central

    Hancock, Angela M; Witonsky, David B; Gordon, Adam S; Eshel, Gidon; Pritchard, Jonathan K; Coop, Graham; Di Rienzo, Anna

    2008-01-01

    Evolutionary pressures due to variation in climate play an important role in shaping phenotypic variation among and within species and have been shown to influence variation in phenotypes such as body shape and size among humans. Genes involved in energy metabolism are likely to be central to heat and cold tolerance. To test the hypothesis that climate shaped variation in metabolism genes in humans, we used a bioinformatics approach based on network theory to select 82 candidate genes for common metabolic disorders. We genotyped 873 tag SNPs in these genes in 54 worldwide populations (including the 52 in the Human Genome Diversity Project panel) and found correlations with climate variables using rank correlation analysis and a newly developed method termed Bayesian geographic analysis. In addition, we genotyped 210 carefully matched control SNPs to provide an empirical null distribution for spatial patterns of allele frequency due to population history alone. For nearly all climate variables, we found an excess of genic SNPs in the tail of the distributions of the test statistics compared to the control SNPs, implying that metabolic genes as a group show signals of spatially varying selection. Among our strongest signals were several SNPs (e.g., LEPR R109K, FABP2 A54T) that had previously been associated with phenotypes directly related to cold tolerance. Since variation in climate may be correlated with other aspects of environmental variation, it is possible that some of the signals that we detected reflect selective pressures other than climate. Nevertheless, our results are consistent with the idea that climate has been an important selective pressure acting on candidate genes for common metabolic disorders. PMID:18282109

  2. Conjugation of silica nanoparticles with cellulose acetate/polyethylene glycol 300 membrane for reverse osmosis using MgSO4 solution.

    PubMed

    Sabir, Aneela; Shafiq, Muhammad; Islam, Atif; Jabeen, Faiza; Shafeeq, Amir; Ahmad, Adnan; Zahid Butt, Muhammad Taqi; Jacob, Karl I; Jamil, Tahir

    2016-01-20

    The thermally-induced phase separation (TIPS) method was used to synthesize polymer matrix (PM) membranes for reverse osmosis from cellulose acetate/polyethylene glycol (CA/PEG300) conjugated with silica nanoparticles (SNPs). Experimental data showed that the conjugation of SNPs changed the surface properties, giving a dense and asymmetric composite structure. The results were explicitly determined by the permeability flux and salt rejection efficiency of the PM-SNPs membranes. The effect of SNPs conjugation on MgSO4 salt rejection was more significant in magnitude than on permeation flux, i.e. 2.38 L/m²h. FTIR verified that SNPs were successfully conjugated on the surface of the PM membrane. DSC of PM-SNPs showed an improved Tg, from 76.2 °C for PM to 101.8 °C for PM-S4. Thermal stability of the PM-SNPs membranes, observed by TGA, was significantly enhanced by the conjugation of SNPs. The SEM and AFM micrographs showed the morphological changes and an increase in the valleys and ridges on the membrane surface. Experimental data showed that the PM-S4 (0.4 wt% SNPs) membrane has the maximum salt rejection capacity, and it was selected as the optimal membrane. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. Population and performance analyses of four major populations with Illumina's FGx Forensic Genomics System.

    PubMed

    Churchill, Jennifer D; Novroski, Nicole M M; King, Jonathan L; Seah, Lay Hong; Budowle, Bruce

    2017-09-01

    The MiSeq FGx Forensic Genomics System (Illumina) enables amplification and massively parallel sequencing of 59 STRs, 94 identity informative SNPs, 54 ancestry informative SNPs, and 24 phenotypic informative SNPs. Allele frequency and population statistics data were generated for the 172 SNP loci included in this panel on four major population groups (Chinese, African Americans, US Caucasians, and Southwest Hispanics). Single-locus and combined random match probability values were generated for the identity informative SNPs. The average combined STR and identity informative SNP random match probabilities (assuming independence) across all four populations were 1.75E-67 and 2.30E-71 with length-based and sequence-based STR alleles, respectively. Ancestry and phenotype predictions were obtained using the ForenSeq™ Universal Analysis System (UAS; Illumina) based on the ancestry informative and phenotype informative SNP profiles generated for each sample. Additionally, performance metrics, including profile completeness, read depth, relative locus performance, and allele coverage ratios, were evaluated and detailed for the 725 samples included in this study. While some genetic markers included in this panel performed notably better than others, performance across populations was generally consistent. The performance and population data included in this study support that accurate and reliable profiles were generated and provide valuable background information for laboratories considering internal validation studies and implementation. Copyright © 2017 Elsevier B.V. All rights reserved.
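
    The "combined random match probability (assuming independence)" in the abstract above is a product of single-locus match probabilities. A minimal sketch for biallelic SNPs follows; it assumes Hardy-Weinberg genotype proportions, and the allele frequencies are hypothetical rather than taken from the study.

```python
from math import prod


def locus_rmp(p):
    """Match probability at one biallelic SNP under Hardy-Weinberg proportions.

    p is the frequency of one allele. Two unrelated individuals match when
    they share a genotype, so the value is the sum of squared genotype
    frequencies.
    """
    q = 1.0 - p
    genotype_freqs = (p * p, 2.0 * p * q, q * q)
    return sum(g * g for g in genotype_freqs)


def combined_rmp(allele_freqs):
    """Combined match probability across loci, assuming independent loci."""
    return prod(locus_rmp(p) for p in allele_freqs)


# Hypothetical allele frequencies for a small panel of five SNPs.
panel = [0.5, 0.4, 0.3, 0.45, 0.2]
panel_rmp = combined_rmp(panel)
```

    Because the per-locus values multiply, each added independent marker lowers the combined probability, which is why panels of many SNPs and STRs reach the very small values the abstract reports.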

  4. Employing genome-wide SNP discovery and genotyping strategy to extrapolate the natural allelic diversity and domestication patterns in chickpea

    PubMed Central

    Kujur, Alice; Bajaj, Deepak; Upadhyaya, Hari D.; Das, Shouvik; Ranjan, Rajeev; Shree, Tanima; Saxena, Maneesha S.; Badoni, Saurabh; Kumar, Vinod; Tripathi, Shailesh; Gowda, C. L. L.; Sharma, Shivali; Singh, Sube; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    The genome-wide discovery and high-throughput genotyping of SNPs in chickpea natural germplasm lines is indispensable to extrapolate their natural allelic diversity, domestication, and linkage disequilibrium (LD) patterns leading to the genetic enhancement of this vital legume crop. We discovered 44,844 high-quality SNPs by sequencing of 93 diverse cultivated desi, kabuli, and wild chickpea accessions using reference genome- and de novo-based GBS (genotyping-by-sequencing) assays that were physically mapped across eight chromosomes of desi and kabuli. Of these, 22,542 SNPs were structurally annotated in different coding and non-coding sequence components of genes. Genes with 3296 non-synonymous and 269 regulatory SNPs could functionally differentiate accessions based on their contrasting agronomic traits. A high experimental validation success rate (92%) and reproducibility (100%) along with strong sensitivity (93–96%) and specificity (99%) of GBS-based SNPs was observed. This infers the robustness of GBS as a high-throughput assay for rapid large-scale mining and genotyping of genome-wide SNPs in chickpea with sub-optimal use of resources. With 23,798 genome-wide SNPs, a relatively high intra-specific polymorphic potential (49.5%) and broader molecular diversity (13–89%)/functional allelic diversity (18–77%) was apparent among 93 chickpea accessions, suggesting their tremendous applicability in rapid selection of desirable diverse accessions/inter-specific hybrids in chickpea crossbred varietal improvement program. The genome-wide SNPs revealed complex admixed domestication pattern, extensive LD estimates (0.54–0.68) and extended LD decay (400–500 kb) in a structured population inclusive of 93 accessions. 
    These findings reflect the utility of our identified SNPs for subsequent genome-wide association study (GWAS) and selective sweep-based domestication trait dissection analysis to identify potential genomic loci (gene-associated targets) specifically regulating important complex quantitative agronomic traits in chickpea. The numerous informative genome-wide SNPs, natural allelic diversity-led domestication pattern, and LD-based information generated in our study have multidimensional applicability with respect to chickpea genomics-assisted breeding. PMID:25873920

  5. Genetic Divergence between Camellia sinensis and Its Wild Relatives Revealed via Genome-Wide SNPs from RAD Sequencing.

    PubMed

    Yang, Hua; Wei, Chao-Ling; Liu, Hong-Wei; Wu, Jun-Lan; Li, Zheng-Guo; Zhang, Liang; Jian, Jian-Bo; Li, Ye-Yun; Tai, Yu-Ling; Zhang, Jing; Zhang, Zheng-Zhu; Jiang, Chang-Jun; Xia, Tao; Wan, Xiao-Chun

    2016-01-01

    Tea is one of the most popular beverages across the world and is made exclusively from cultivars of Camellia sinensis. Many wild relatives of the genus Camellia that are closely related to C. sinensis are native to Southwest China. In this study, we first identified the distinct genetic divergence between C. sinensis and its wild relatives and provided a glimpse into the artificial selection of tea plants at a genome-wide level by analyzing 15,444 genomic SNPs that were identified from 18 cultivated and wild tea accessions using a high-throughput genome-wide restriction site-associated DNA sequencing (RAD-Seq) approach. Six distinct clusters were detected by phylogeny inference and principal component and genetic structural analyses, and these clusters corresponded to six Camellia species/varieties. Genetic divergence apparently indicated that C. taliensis var. bangwei is a semi-wild or transient landrace occupying a phylogenetic position between those wild and cultivated tea plants. Cultivated accessions exhibited greater heterozygosity than wild accessions, with the exception of C. taliensis var. bangwei. Thirteen genes with non-synonymous SNPs exhibited strong selective signals that were suggestive of putative artificial selective footprints for tea plants during domestication. The genome-wide SNPs provide a fundamental data resource for assessing genetic relationships, characterizing complex traits, comparing heterozygosity and analyzing putative artificial selection in tea plants.

  6. Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence

    PubMed Central

    2011-01-01

    Background Many plants have large and complex genomes with an abundance of repeated sequences. Many plants are also polyploid. Both of these attributes typify the genome architecture in the tribe Triticeae, whose members include economically important wheat, rye and barley. Large genome sizes, an abundance of repeated sequences, and polyploidy present challenges to genome-wide SNP discovery using next-generation sequencing (NGS) of total genomic DNA by making alignment and clustering of short reads generated by the NGS platforms difficult, particularly in the absence of a reference genome sequence. Results An annotation-based, genome-wide SNP discovery pipeline is reported using NGS data for large and complex genomes without a reference genome sequence. Roche 454 shotgun reads with low genome coverage of one genotype are annotated in order to distinguish single-copy sequences and repeat junctions from repetitive sequences and sequences shared by paralogous genes. Multiple genome equivalents of shotgun reads of another genotype generated with SOLiD or Solexa are then mapped to the annotated Roche 454 reads to identify putative SNPs. A pipeline program package, AGSNP, was developed and used for genome-wide SNP discovery in Aegilops tauschii, the diploid source of the wheat D genome, which has a genome size of 4.02 Gb, of which 90% is repetitive sequences. Genomic DNA of Ae. tauschii accession AL8/78 was sequenced with the Roche 454 NGS platform. Genomic DNA and cDNA of Ae. tauschii accession AS75 were sequenced primarily with SOLiD, although some Solexa and Roche 454 genomic sequences were also generated. A total of 195,631 putative SNPs were discovered in gene sequences, 155,580 putative SNPs were discovered in uncharacterized single-copy regions, and another 145,907 putative SNPs were discovered in repeat junctions. These SNPs were dispersed across the entire Ae. tauschii genome. To assess the false positive SNP discovery rate, DNA containing putative SNPs was amplified by PCR from AL8/78 and AS75 and resequenced with the ABI 3730xl. In a sample of 302 randomly selected putative SNPs, 84.0% in gene regions, 88.0% in repeat junctions, and 81.3% in uncharacterized regions were validated. Conclusion An annotation-based genome-wide SNP discovery pipeline for NGS platforms was developed. The pipeline is suitable for SNP discovery in genomic libraries of complex genomes and does not require a reference genome sequence. The pipeline is applicable to all current NGS platforms, provided that at least one such platform generates relatively long reads. The pipeline package, AGSNP, and the discovered 497,118 Ae. tauschii SNPs can be accessed at (http://avena.pw.usda.gov/wheatD/agsnp.shtml). PMID:21266061

  7. Evaluating information content of SNPs for sample-tagging in re-sequencing projects.

    PubMed

    Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F

    2015-05-15

    Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approaches the maximal information for discrimination. The analysis shows that as few as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient for labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency is lower than 1 in 10 thousand. This strategy of sample discrimination proved robust across large sample sizes and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and genes of interest. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.
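
    The Hamming distance in the abstract above counts the SNP positions at which two samples' genotype calls differ. A short sketch is given below; the 0/1/2 genotype coding, the function names, and the three toy profiles are assumptions for illustration and do not come from the paper's program.

```python
from itertools import combinations


def hamming_distance(profile_a, profile_b):
    """Number of SNP positions at which two genotype profiles differ."""
    if len(profile_a) != len(profile_b):
        raise ValueError("profiles must cover the same SNP panel")
    return sum(a != b for a, b in zip(profile_a, profile_b))


def pairwise_distances(profiles):
    """Hamming distances for every unordered pair of samples."""
    return [hamming_distance(a, b) for a, b in combinations(profiles, 2)]


# Toy profiles over a four-SNP panel; genotypes coded as minor-allele counts.
samples = [
    (0, 1, 2, 0),
    (0, 1, 1, 0),
    (2, 1, 2, 1),
]
distances = pairwise_distances(samples)
mean_distance = sum(distances) / len(distances)
min_distance = min(distances)
```

    A panel tags samples reliably when the minimum distance stays well above the number of genotyping errors expected per sample, so the minimum is worth tracking alongside the average.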

  8. RNA-Seq identifies SNP markers for growth traits in rainbow trout.

    PubMed

    Salem, Mohamed; Vallejo, Roger L; Leeds, Timothy D; Palti, Yniv; Liu, Sixin; Sabbagh, Annas; Rexroad, Caird E; Yao, Jianbo

    2012-01-01

    Fast growth is an important and highly desired trait, which affects the profitability of food animal production, with feed costs accounting for the largest proportion of production costs. Traditional phenotype-based selection is typically used to select for growth traits; however, genetic improvement is slow over generations. Single nucleotide polymorphisms (SNPs) explain 90% of the genetic differences between individuals; therefore, they are most suitable for genetic evaluation and strategies that employ molecular genetics for selective breeding. SNPs found within or near a coding sequence are of particular interest because they are more likely to alter the biological function of a protein. We aimed to use SNPs to identify markers and genes associated with genetic variation in growth. RNA-Seq whole-transcriptome analysis of pooled cDNA samples from a population of rainbow trout selected for improved growth versus unselected genetic cohorts (10 fish from 1 full-sib family each) identified SNP markers associated with growth-rate. The allelic imbalances (the ratio between the allele frequencies of the fast growing sample and that of the slow growing sample) were considered at scores >5.0 as an amplification and <0.2 as loss of heterozygosity. A subset of SNPs (n = 54) were validated and evaluated for association with growth traits in 778 individuals of a three-generation parent/offspring panel representing 40 families. Twenty-two SNP markers and one mitochondrial haplotype were significantly associated with growth traits. Polymorphism of 48 of the markers was confirmed in other commercially important aquaculture stocks. Many markers were clustered into genes of metabolic energy production pathways and are suitable candidates for genetic selection. The study demonstrates that RNA-Seq at low sequence coverage of divergent populations is a fast and effective means of identifying SNPs, with allelic imbalances between phenotypes. 
    This technique is suitable for marker development in non-model species lacking complete and well-annotated genome reference sequences.
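
    The allelic-imbalance score described in the abstract above is the ratio of an allele's frequency in the fast-growing sample to its frequency in the slow-growing sample, with calls at >5.0 and <0.2. A minimal sketch follows. The function names, the "no call" label, and the handling of a zero denominator are assumptions added for the example.

```python
def allelic_imbalance(freq_fast, freq_slow):
    """Ratio of allele frequency in the fast-growing vs slow-growing pool."""
    if freq_slow == 0:
        # Assumption: treat an allele absent from the slow pool as an
        # unbounded ratio rather than raising an error.
        return float("inf")
    return freq_fast / freq_slow


def classify_imbalance(score, high=5.0, low=0.2):
    """Apply the abstract's thresholds to an allelic-imbalance score."""
    if score > high:
        return "amplification"
    if score < low:
        return "loss of heterozygosity"
    return "no call"  # hypothetical label for scores between the thresholds


# Hypothetical allele frequencies estimated from pooled RNA-Seq reads.
calls = [
    classify_imbalance(allelic_imbalance(0.90, 0.10)),
    classify_imbalance(allelic_imbalance(0.05, 0.50)),
    classify_imbalance(allelic_imbalance(0.50, 0.50)),
]
```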

  9. A genome-wide association study in American Indians implicates DNER as a susceptibility locus for type 2 diabetes.

    PubMed

    Hanson, Robert L; Muller, Yunhua L; Kobes, Sayuko; Guo, Tingwei; Bian, Li; Ossowski, Victoria; Wiedrich, Kim; Sutherland, Jeffrey; Wiedrich, Christopher; Mahkee, Darin; Huang, Ke; Abdussamad, Maryam; Traurig, Michael; Weil, E Jennifer; Nelson, Robert G; Bennett, Peter H; Knowler, William C; Bogardus, Clifton; Baier, Leslie J

    2014-01-01

    Most genetic variants associated with type 2 diabetes mellitus (T2DM) have been identified through genome-wide association studies (GWASs) in Europeans. The current study reports a GWAS for young-onset T2DM in American Indians. Participants were selected from a longitudinal study conducted in Pima Indians and included 278 cases with diabetes with onset before 25 years of age, 295 nondiabetic controls ≥45 years of age, and 267 siblings of cases or controls. Individuals were genotyped on a ∼1M single nucleotide polymorphism (SNP) array, resulting in 453,654 SNPs with minor allele frequency >0.05. SNPs were analyzed for association in cases and controls, and a family-based association test was conducted. Tag SNPs (n = 311) were selected for 499 SNPs associated with diabetes (P < 0.0005 in case-control analyses or P < 0.0003 in family-based analyses), and these SNPs were genotyped in up to 6,834 additional Pima Indians to assess replication. Rs1861612 in DNER was associated with T2DM (odds ratio = 1.29 per copy of the T allele; P = 6.6 × 10⁻⁸, which represents genome-wide significance accounting for the number of effectively independent SNPs analyzed). Transfection studies in murine pancreatic β-cells suggested that DNER regulates expression of notch signaling pathway genes. These studies implicate DNER as a susceptibility gene for T2DM in American Indians.

  10. A survey of genome-wide single nucleotide polymorphisms through genome resequencing in the Périgord black truffle (Tuber melanosporum Vittad.).

    PubMed

    Payen, Thibaut; Murat, Claude; Gigant, Anaïs; Morin, Emmanuelle; De Mita, Stéphane; Martin, Francis

    2015-09-01

    The Périgord black truffle (Tuber melanosporum Vittad.), considered a gastronomic delicacy worldwide, is an ectomycorrhizal filamentous fungus that is ecologically important in Mediterranean French, Italian and Spanish woodlands. In this study, we developed a novel resource of single nucleotide polymorphisms (SNPs) for T. melanosporum using Illumina high-throughput resequencing. The genome from six T. melanosporum geographical accessions was sequenced to a depth of approximately 20×. These geographical accessions were selected from different populations within the northern and southern regions of the geographical species distribution. Approximately 80% of the reads for each of the six resequenced geographical accessions mapped against the reference T. melanosporum genome assembly, estimating the core genome size of this organism to be approximately 110 Mbp. A total of 442 326 SNPs corresponding to 3540 SNPs/Mbp were identified as being included in all seven genomes. The SNPs occurred more frequently in repeated sequences (85%), although 4501 SNPs were also identified in the coding regions of 2587 genes. Using the ratio of nonsynonymous mutations per nonsynonymous site (pN) to synonymous mutations per synonymous site (pS) and Tajima's D index scanning the whole genome, we were able to identify genomic regions and genes potentially subjected to positive or purifying selection. The SNPs identified represent a valuable resource for future population genetics and genomics studies. © 2015 John Wiley & Sons Ltd.

  11. Novel efficient genome-wide SNP panels for the conservation of the highly endangered Iberian lynx.

    PubMed

    Kleinman-Ruiz, Daniel; Martínez-Cruz, Begoña; Soriano, Laura; Lucena-Perez, Maria; Cruz, Fernando; Villanueva, Beatriz; Fernández, Jesús; Godoy, José A

    2017-07-21

    The Iberian lynx (Lynx pardinus) has been acknowledged as the most endangered felid species in the world. An intense contraction and fragmentation during the twentieth century left less than 100 individuals split in two isolated and genetically eroded populations by 2002. Genetic monitoring and management so far have been based on 36 STRs, but their limited variability and the more complex situation of current populations demand more efficient molecular markers. The recent characterization of the Iberian lynx genome identified more than 1.6 million SNPs, of which 1536 were selected and genotyped in an extended Iberian lynx sample. We validated 1492 SNPs and analysed their heterozygosity, Hardy-Weinberg equilibrium, and linkage disequilibrium. We then selected a panel of 343 minimally linked autosomal SNPs from which we extracted subsets optimized for four different typical tasks in conservation applications: individual identification, parentage assignment, relatedness estimation, and admixture classification, and compared their power to currently used STR panels. We ascribed 21 SNPs to chromosome X based on their segregation patterns, and identified one additional marker that showed significant differentiation between sexes. For all applications considered, panels of autosomal SNPs showed higher power than the currently used STR set with only a very modest increase in the number of markers. These novel panels of highly informative genome-wide SNPs provide more powerful, efficient, and flexible tools for the genetic management and non-invasive monitoring of Iberian lynx populations. This example highlights an important outcome of whole-genome studies in genetically threatened species.

  12. Prioritizing individual genetic variants after kernel machine testing using variable selection.

    PubMed

    He, Qianchuan; Cai, Tianxi; Liu, Yang; Zhao, Ni; Harmon, Quaker E; Almli, Lynn M; Binder, Elisabeth B; Engel, Stephanie M; Ressler, Kerry J; Conneely, Karen N; Lin, Xihong; Wu, Michael C

    2016-12-01

    Kernel machine learning methods, such as the SNP-set kernel association test (SKAT), have been widely used to test associations between traits and genetic polymorphisms. In contrast to traditional single-SNP analysis methods, these methods are designed to examine the joint effect of a set of related SNPs (such as a group of SNPs within a gene or a pathway) and are able to identify sets of SNPs that are associated with the trait of interest. However, as with many multi-SNP testing approaches, kernel machine testing can draw conclusions only at the SNP-set level, and does not directly indicate which SNP(s) in the identified set are actually driving the associations. A recently proposed procedure, KerNel Iterative Feature Extraction (KNIFE), provides a general framework for incorporating variable selection into kernel machine methods. In this article, we focus on quantitative traits and relatively common SNPs, adapt the KNIFE procedure to genetic association studies, and propose an approach to identify driver SNPs after the application of SKAT to gene set analysis. Our approach accommodates several kernels that are widely used in SNP analysis, such as the linear kernel and the Identity by State (IBS) kernel. The proposed approach provides practically useful utilities to prioritize SNPs, and fills the gap between SNP set analysis and biological functional studies. Both simulation studies and real data application are used to demonstrate the proposed approach. © 2016 WILEY PERIODICALS, INC.
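
    The two kernels named in this abstract can be written down directly for genotypes coded as minor-allele counts (0, 1, 2). The sketch below uses the common unweighted definitions and is only an illustration; it does not implement SKAT or KNIFE, and the function names and toy genotypes are hypothetical.

```python
def linear_kernel(genotypes):
    """K[i][j] = sum over SNPs of g_i * g_j (unweighted linear kernel)."""
    return [[sum(a * b for a, b in zip(gi, gj)) for gj in genotypes]
            for gi in genotypes]


def ibs_kernel(genotypes):
    """K[i][j] = sum over SNPs of (2 - |g_i - g_j|) / (2 * number of SNPs).

    This is the fraction of alleles two individuals share identical by state.
    """
    m = len(genotypes[0])
    return [[sum(2 - abs(a - b) for a, b in zip(gi, gj)) / (2 * m)
             for gj in genotypes]
            for gi in genotypes]


# Toy data: three individuals, three SNPs (made-up values).
G = [[0, 1, 2],
     [0, 1, 2],
     [2, 1, 0]]
print(ibs_kernel(G))
print(linear_kernel(G))
```

    Identical genotype vectors give an IBS similarity of 1, while opposite homozygotes at a SNP contribute nothing to the sum.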

  13. Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm.

    PubMed

    Hoffmann, Thomas J; Zhan, Yiping; Kvale, Mark N; Hesselson, Stephanie E; Gollub, Jeremy; Iribarren, Carlos; Lu, Yontao; Mei, Gangwu; Purdy, Matthew M; Quesenberry, Charles; Rowell, Sarah; Shapero, Michael H; Smethurst, David; Somkin, Carol P; Van den Eeden, Stephen K; Walter, Larry; Webster, Teresa; Whitmer, Rachel A; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil

    2011-12-01

    Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies. Copyright © 2011 Elsevier Inc. All rights reserved.
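
    Greedy pairwise SNP selection, as referred to in this abstract, can be sketched generically: repeatedly pick the SNP that tags the most still-uncovered targets at a chosen r² threshold. The code below is an assumption-laden illustration, not the authors' hybrid algorithm; the function name, the 0.8 threshold, the already_covered argument (standing in for targets removed because imputation covers them), and the r² values are all hypothetical.

```python
def greedy_pairwise_tags(r2, threshold=0.8, already_covered=()):
    """Greedy tag-SNP selection.

    r2: dict mapping each SNP id to a dict {other SNP id: pairwise r2},
        assumed symmetric; missing pairs are treated as r2 = 0.
    already_covered: targets dropped up front (e.g. covered by imputation).
    """
    uncovered = set(r2) - set(already_covered)
    tags = []
    while uncovered:
        def covered_by(snp):
            # Targets this SNP would cover: itself plus high-r2 partners.
            return {t for t in uncovered
                    if t == snp or r2[snp].get(t, 0.0) >= threshold}
        # Sorting the candidates makes tie-breaking deterministic.
        best = max(sorted(r2), key=lambda s: len(covered_by(s)))
        tags.append(best)
        uncovered -= covered_by(best)
    return tags


# Made-up r2 values: 'a' tags 'b' and 'c'; 'd' is tagged by nothing else.
r2 = {"a": {"b": 0.9, "c": 0.85}, "b": {"a": 0.9}, "c": {"a": 0.85}, "d": {}}
print(greedy_pairwise_tags(r2))
print(greedy_pairwise_tags(r2, already_covered=("d",)))
```

    Removing targets before the greedy rounds shrinks the set that tags must cover, which is the intuition behind combining tagging with imputation-based coverage.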

  14. Association of the GALNT2 gene polymorphisms and several environmental factors with serum lipid levels in the Mulao and Han populations.

    PubMed

    Li, Qing; Yin, Rui-Xing; Yan, Ting-Ting; Miao, Lin; Cao, Xiao-Li; Hu, Xi-Jiang; Aung, Lynn Htet Htet; Wu, Dong-Feng; Wu, Jin-Zhen; Lin, Wei-Xiong

    2011-09-20

    The association of UDP-N-acetyl-alpha-D-galactosamine: polypeptide N-acetylgalactosaminyltransferase 2 gene (GALNT2) single nucleotide polymorphisms (SNPs) and serum lipid profiles in the general population is not well known. The present study was undertaken to detect the association of GALNT2 polymorphisms and several environmental factors with serum lipid levels in the Guangxi Mulao and Han populations. A total of 775 subjects of Mulao nationality and 699 participants of Han nationality were randomly selected from our stratified randomized cluster samples. Genotyping of the GALNT2 rs2144300 and rs4846914 SNPs was performed by polymerase chain reaction and restriction fragment length polymorphism combined with gel electrophoresis, and then confirmed by direct sequencing. There were no significant differences in the genotypic and allelic frequencies of both SNPs between the two ethnic groups, or between the males and females. The subjects with TT genotype of rs2144300 in Mulao had lower serum triglyceride (TG) levels than the subjects with CC genotype in females (P < 0.01). The participants with CT/TT genotype of rs2144300 in Han had lower TG and apolipoprotein (Apo) B levels, and higher high-density lipoprotein cholesterol (HDL-C), ApoA1 levels and the ratio of ApoA1 to ApoB in males; and higher low-density lipoprotein cholesterol (LDL-C) and ApoB levels in females than the participants with CC genotype (P < 0.05-0.001). The individuals with GA/AA genotype of rs4846914 in Mulao had higher total cholesterol (TC) and LDL-C levels than the individuals with GG genotype in males (P < 0.05 for each). The subjects with AA genotype of rs4846914 in Han had higher LDL-C and ApoB levels, and lower HDL-C levels and the ratio of ApoA1 to ApoB than the subjects with GG genotype (P < 0.05 for each). The levels of TC in Mulao were correlated with the genotypes of rs4846914 in males (P < 0.05). 
The levels of ApoA1 in Han were correlated with the genotypes of both SNPs, and the levels of HDL-C and ApoB and the ratio of ApoA1 to ApoB were associated with the genotypes of rs2144300 in males (P < 0.05-0.001). The levels of LDL-C in Han were correlated with the genotypes of rs4846914 in females (P < 0.05). Serum lipid parameters were also correlated with several environmental factors. The associations of both GALNT2 rs2144300 and rs4846914 SNPs and serum lipid levels are different in the Mulao and Han populations. These discrepancies might partly result from different GALNT2 gene-environmental interactions.

  15. Two novel polymorphisms of bovine SIRT2 gene are associated with higher body weight in Nanyang cattle.

    PubMed

    Sun, Xiaomei; Li, Mingxun; Hao, Dan; Hua, Liushuai; Lan, Xianyong; Lei, Chuzhao; Hu, Shenrong; Qi, Xinglei; Chen, Hong

    2015-03-01

    Identification of polymorphisms associated with economic traits is important for successful marker-assisted selection in cattle breeding. The family of mammalian sirtuins regulates many biological functions, such as life span extension and energy metabolism. SIRT2, the most abundant sirtuin in adipocytes, acts as a crucial regulator of adipogenic differentiation and plays a key role in controlling adipose tissue function and mass. Here we investigated single nucleotide polymorphisms (SNPs) of bovine SIRT2 in 1226 cattle from five breeds and further evaluated the effects of identified SNPs on economically important traits of Nanyang cattle. Our results revealed four novel SNPs in bovine SIRT2: one was located in an intronic region and the other three were synonymous mutations. Linkage disequilibrium and haplotype analyses based on the identified SNPs showed an obvious difference between the crossbred breed and the other four beef breeds. Association analyses demonstrated that SNPs g.17333C > T and g.17578A > G have a significant effect on 18-month-old body weight in the Nanyang population. Animals with combined genotype TTGG at the above two loci exhibited especially high body weight. Our data demonstrated for the first time that polymorphisms in bovine SIRT2 are associated with economic traits of Nanyang cattle, which will be helpful for future cattle selection practices.

  16. Rapid discovery of SNPs differentiating hatchery steelhead trout from ESA-listed natural-origin steelhead trout using a 57K SNP array

    USGS Publications Warehouse

    Larson, Wesley; Palti, Yniv; Gao, G.; Warheit, Kenneth I.; Seeb, James E.

    2017-01-01

    Natural-origin steelhead trout (Oncorhynchus mykiss (Walbaum, 1792)) in the Pacific Northwest, USA, are threatened by a number of factors including habitat destruction, disease, decline in marine survival, and a potential erosion of genetic viability due to introgression from hatchery strains. Our major goal was to use a recently developed SNP array containing ∼57 000 SNPs to identify a subset of SNPs that differentiate hatchery and natural-origin populations. We analyzed 35 765 polymorphic SNPs in nine populations of steelhead trout sampled from Puget Sound, Washington, USA. We then conducted two outlier tests and found 360 loci that were candidates for divergent selection between hatchery and natural-origin populations (mean FCT = 0.29, maximum = 0.65) and 595 SNPs that were candidates for selection among natural-origin populations (mean FST = 0.25, maximum = 0.51). Comparisons with a linkage map revealed that two chromosomes (Omy05 and Omy25) contained significantly more outliers than other chromosomes, suggesting that regions on Omy05 and Omy25 may be of adaptive significance. Our results highlight several advantages of the 57 000 SNP array as a tool for population and conservation genomics studies.

  17. Identification of Functional Single-Nucleotide Polymorphisms Affecting Leaf Hair Number in Brassica rapa.

    PubMed

    Zhang, Wenting; Mirlohi, Shirin; Li, Xiaorong; He, Yuke

    2018-06-01

    Leaf traits affect plant agronomic performance; for example, leaf hair number provides a morphological indicator of drought and insect resistance. Brassica rapa crops have diverse phenotypes, and many B. rapa single-nucleotide polymorphisms (SNPs) have been identified and used as molecular markers for plant breeding. However, which SNPs are functional for leaf hair traits and, therefore, effective for breeding purposes remains unknown. Here, we identify a set of SNPs in the B. rapa ssp. pekinensis candidate gene BrpHAIRY LEAVES1 (BrpHL1) and a number of SNPs of BrpHL1 in a natural population of 210 B. rapa accessions that have hairy, margin-only hairy, and hairless leaves. BrpHL1 genes and their orthologs and paralogs have many SNPs. By intensive mutagenesis and genetic transformation, we selected the functional SNPs for leaf hairs by the exclusion of nonfunctional SNPs and the orthologous and paralogous genes. The residue tryptophan-92 of BrpHL1a was essential for direct interaction with GLABROUS3 and, thus, necessary for the formation of leaf hairs. The accessions with the functional SNP leading to substitution of the tryptophan-92 residue had hairless leaves. The orthologous BrcHL1b from B. rapa ssp. chinensis regulates hair formation on leaf margins rather than leaf surfaces. The selected SNP for the hairy phenotype could be adopted as a molecular marker for insect resistance in Brassica spp. crops. Moreover, the procedures optimized here can be used to explain the molecular mechanisms of natural variation and to facilitate the molecular breeding of many crops. © 2018 American Society of Plant Biologists. All rights reserved.

  18. Effects of methylation-sensitive enzymes on the enrichment of genic SNPs and the degree of genome complexity reduction in a two-enzyme genotyping-by-sequencing (GBS) approach: a case study in oil palm (Elaeis guineensis).

    PubMed

    Pootakham, Wirulda; Sonthirod, Chutima; Naktang, Chaiwat; Jomchai, Nukoon; Sangsrakru, Duangjai; Tangphatsornruang, Sithichoke

    2016-01-01

    Advances in next generation sequencing have facilitated large-scale single nucleotide polymorphism (SNP) discovery in many crop species. The genotyping-by-sequencing (GBS) approach couples next generation sequencing with genome complexity reduction techniques to simultaneously identify and genotype SNPs. Choice of enzymes used in GBS library preparation depends on several factors including the number of markers required, the desired level of multiplexing, and whether the enrichment of genic SNPs is preferred. We evaluated various combinations of methylation-sensitive (AatII, PstI, MspI) and methylation-insensitive (SphI, MseI) enzymes for their effectiveness in genome complexity reduction and enrichment of genic SNPs. We discovered that the use of two methylation-sensitive enzymes effectively reduced genome complexity and did not require a size selection step. By comparison, the genome coverage of libraries constructed with methylation-insensitive enzymes was quite high, and an additional size selection step may be required to increase the overall read depth. We also demonstrated the effectiveness of methylation-sensitive enzymes in enriching for SNPs located in genic regions. When two methylation-insensitive enzymes were used, only 16% of SNPs identified were located in genes and 18% in the vicinity (± 5 kb) of the genic regions, while most SNPs resided in the intergenic regions. In contrast, a remarkable degree of enrichment was observed when two methylation-sensitive enzymes were employed. Almost two thirds of the SNPs were located either inside (32-36%) or in the vicinity (28-31%) of the genic regions. These results provide useful information to help researchers choose appropriate GBS enzymes in oil palm and other crop species.

  19. Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction

    PubMed Central

    Barbero, Marina M. D.; Oliveira, Henrique N.; de Camargo, Gregório M. F.; Fernandes Júnior, Gerardo A.; Aspilcueta-Borquis, Rusbel R.; Souza, Fabio R. P.; Boligon, Arione A.; Melo, Thaise P.; Regatieri, Inaê C.; Feitosa, Fabieli L. B.; Fonseca, Larissa F. S.; Magalhães, Ana F. B.; Costa, Raphael B.; Albuquerque, Lucia G.

    2018-01-01

    Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNPs) located in the region of the candidate genes, at a distance of up to 5 kb from the boundaries of each gene, was selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on the r² statistic. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A (PAPP-A), Kell blood group complex subunit-related family (XKR4), and mannose-binding lectin (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs. PMID:29293544
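
    The Bonferroni correction mentioned in this abstract multiplies each raw p-value by the number of tests (capped at 1) before comparing it with the significance level. The sketch below shows only that generic rule; the function name and the p-values are made up and do not come from the study.

```python
def bonferroni(p_values, alpha=0.05):
    """Return (adjusted p-value, significant?) for each raw p-value."""
    m = len(p_values)  # number of tests performed
    adjusted = [min(1.0, p * m) for p in p_values]
    return [(adj, adj < alpha) for adj in adjusted]


# Made-up raw p-values for three hypothetical haplotype tests.
for adj, significant in bonferroni([0.001, 0.02, 0.04]):
    print(f"adjusted p = {adj:.3f}, significant = {significant}")
```

    Only the first test remains significant here: the other two pass an uncorrected 0.05 threshold but not the corrected one.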

  20. The Discovery of Single-Nucleotide Polymorphisms—and Inferences about Human Demographic History

    PubMed Central

    Wakeley, John; Nielsen, Rasmus; Liu-Cordero, Shau Neen; Ardlie, Kristin

    2001-01-01

    A method of historical inference that accounts for ascertainment bias is developed and applied to single-nucleotide polymorphism (SNP) data in humans. The data consist of 84 short fragments of the genome that were selected, from three recent SNP surveys, to contain at least two polymorphisms in their respective ascertainment samples and that were then fully resequenced in 47 globally distributed individuals. Ascertainment bias is the deviation, from what would be observed in a random sample, caused either by discovery of polymorphisms in small samples or by locus selection based on levels or patterns of polymorphism. The three SNP surveys from which the present data were derived differ both in their protocols for ascertainment and in the size of the samples used for discovery. We implemented a Monte Carlo maximum-likelihood method to fit a subdivided-population model that includes a possible change in effective size at some time in the past. Incorrectly assuming that ascertainment bias does not exist causes errors in inference, affecting both estimates of migration rates and historical changes in size. Migration rates are overestimated when ascertainment bias is ignored. However, the direction of error in inferences about changes in effective population size (whether the population is inferred to be shrinking or growing) depends on whether either the numbers of SNPs per fragment or the SNP-allele frequencies are analyzed. We use the abbreviation “SDL,” for “SNP-discovered locus,” in recognition of the genomic-discovery context of SNPs. When ascertainment bias is modeled fully, both the number of SNPs per SDL and their allele frequencies support a scenario of growth in effective size in the context of a subdivided population. If subdivision is ignored, however, the hypothesis of constant effective population size cannot be rejected. 
An important conclusion of this work is that, in demographic or other studies, SNP data are useful only to the extent that their ascertainment can be modeled. PMID:11704929

  1. Genomic association for sexual precocity in beef heifers using pre-selection of genes and haplotype reconstruction.

    PubMed

    Takada, Luciana; Barbero, Marina M D; Oliveira, Henrique N; de Camargo, Gregório M F; Fernandes Júnior, Gerardo A; Aspilcueta-Borquis, Rusbel R; Souza, Fabio R P; Boligon, Arione A; Melo, Thaise P; Regatieri, Inaê C; Feitosa, Fabieli L B; Fonseca, Larissa F S; Magalhães, Ana F B; Costa, Raphael B; Albuquerque, Lucia G

    2018-01-01

    Reproductive traits are of the utmost importance for any livestock farming, but are difficult to measure and to interpret since they are influenced by various factors. The objective of this study was to detect associations between known polymorphisms in candidate genes related to sexual precocity in Nellore heifers, which could be used in breeding programs. Records of 1,689 precocious and non-precocious heifers from farms participating in the Conexão Delta G breeding program were analyzed. A subset of single nucleotide polymorphisms (SNPs) located in the region of the candidate genes, at a distance of up to 5 kb from the boundaries of each gene, was selected from the panel of 777,000 SNPs of the High-Density Bovine SNP BeadChip. Linear mixed models were used for statistical analysis of early heifer pregnancy, relating the trait with isolated SNPs or with haplotype groups. The model included the contemporary group (year and month of birth) as fixed effect and parent of the animal (sire effect) as random effect. fastPHASE® and GenomeStudio® were used for reconstruction of the haplotypes and for analysis of linkage disequilibrium based on the r² statistic. A total of 125 candidate genes and 2,024 SNPs forming haplotypes were analyzed. Statistical analysis after Bonferroni correction showed that nine haplotypes exerted a significant effect (p<0.05) on sexual precocity. Four of these haplotypes were located in the Pregnancy-associated plasma protein-A2 gene (PAPP-A2), two in the Estrogen-related receptor gamma gene (ESRRG), and one each in the Pregnancy-associated plasma protein-A (PAPP-A), Kell blood group complex subunit-related family (XKR4), and mannose-binding lectin (MBL-1) genes. Although the present results indicate that the PAPP-A2, PAPP-A, XKR4, MBL-1 and ESRRG genes influence sexual precocity in Nellore heifers, further studies are needed to evaluate their possible use in breeding programs.

  2. How immunogenetically different are domestic pigs from wild boars: a perspective from single-nucleotide polymorphisms of 19 immunity-related candidate genes.

    PubMed

    Chen, Shanyuan; Gomes, Rui; Costa, Vânia; Santos, Pedro; Charneca, Rui; Zhang, Ya-ping; Liu, Xue-hong; Wang, Shao-qing; Bento, Pedro; Nunes, Jose-Luis; Buzgó, József; Varga, Gyula; Anton, István; Zsolnai, Attila; Beja-Pereira, Albano

    2013-10-01

    The coexistence of wild boars and domestic pigs across Eurasia makes it feasible to conduct comparative genetic or genomic analyses for addressing how genetically different a domestic species is from its wild ancestor. To test whether there are differences in patterns of genetic variability between wild and domestic pigs at immunity-related genes and to detect outlier loci putatively under selection that may underlie differences in immune responses, here we analyzed 54 single-nucleotide polymorphisms (SNPs) of 19 immunity-related candidate genes on 11 autosomes in three pairs of wild boar and domestic pig populations from China, Iberian Peninsula, and Hungary. Our results showed no statistically significant differences in allele frequency and heterozygosity across SNPs between three pairs of wild and domestic populations. This observation was more likely due to the widespread and long-lasting gene flow between wild boars and domestic pigs across Eurasia. In addition, we detected eight coding SNPs from six genes as outliers being under selection consistently by three outlier tests (BayeScan2.1, FDIST2, and Arlequin3.5). Among four non-synonymous outlier SNPs, one from TLR4 gene was identified as being subject to positive (diversifying) selection and three each from CD36, IFNW1, and IL1B genes were suggested as under balancing selection. All of these four non-synonymous variants were predicted as being benign by PolyPhen-2. Our results were supported by other independent lines of evidence for positive selection or balancing selection acting on these four immune genes (CD36, IFNW1, IL1B, and TLR4). Our study showed an example applying a candidate gene approach to identify functionally important mutations (i.e., outlier loci) in wild and domestic pigs for subsequent functional experiments.

  3. Novel Genetic Loci Associated with the Plasma Triglyceride Response to an Omega-3 Fatty Acid Supplementation.

    PubMed

    Vallée Marcotte, Bastien; Cormier, Hubert; Guénard, Frédéric; Rudkowska, Iwona; Lemieux, Simone; Couture, Patrick; Vohl, Marie-Claude

    2016-01-01

    A recent genome-wide association study (GWAS) by our group identified 13 loci associated with the plasma triglyceride (TG) response to omega-3 (n-3) fatty acid (FA) supplementation. This study aimed to test whether single-nucleotide polymorphisms (SNPs) within the IQCJ, NXPH1, PHF17 and MYB genes are associated with the plasma TG response to an n-3 FA supplementation. A total of 208 subjects followed a 6-week n-3 FA supplementation of 5 g/day of fish oil (1.9-2.2 g of eicosapentaenoic acid and 1.1 g of docosahexaenoic acid). Measurements of plasma lipids were made before and after the supplementation. Sixty-seven tagged SNPs were selected to increase the density of markers near GWAS hits. In a repeated model, independent effects of the genotype and the gene-supplementation interaction were associated with plasma TG. Genotype effects were observed with two SNPs of NXPH1, and gene-diet interactions were observed with ten SNPs of IQCJ, four SNPs of NXPH1 and three SNPs of MYB. Positive and negative responders showed different genotype frequencies with nine SNPs of IQCJ, two SNPs of NXPH1 and two SNPs of MYB. Fine mapping in GWAS-associated loci allowed the identification of SNPs partly explaining the large interindividual variability observed in plasma TG levels in response to an n-3 FA supplementation. © 2016 S. Karger AG, Basel.

  4. Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel

    PubMed Central

    Houle, David; Márquez, Eladio J.

    2015-01-01

    We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r² ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epistatic selection during the inbreeding and maintenance of the DGRP lines. PMID:26068573
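
    The r² statistic and the 0.5 cutoff described in this abstract can be illustrated for two biallelic sites scored as 0/1 alleles across inbred lines. The sketch below uses the standard definition r² = D² / (pA(1 - pA) pB(1 - pB)) with D = pAB - pA pB; the function names and the toy allele vectors are hypothetical, and treating each line as a single haplotype is a simplifying assumption.

```python
def ld_r2(site_a, site_b):
    """Squared allelic correlation between two sites coded 0/1 per line."""
    n = len(site_a)
    p_a = sum(site_a) / n
    p_b = sum(site_b) / n
    p_ab = sum(1 for x, y in zip(site_a, site_b) if x == 1 and y == 1) / n
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r2 is undefined for a monomorphic site")
    return d * d / denom


def highly_correlated(site_a, site_b, cutoff=0.5):
    """Apply the r2 >= 0.5 cutoff quoted in the abstract."""
    return ld_r2(site_a, site_b) >= cutoff


# Toy alleles across four hypothetical lines.
print(ld_r2([0, 0, 1, 1], [0, 0, 1, 1]))  # perfectly correlated sites
print(ld_r2([0, 0, 1, 1], [0, 1, 0, 1]))  # independent sites
```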

  5. Linkage Disequilibrium and Inversion-Typing of the Drosophila melanogaster Genome Reference Panel.

    PubMed

    Houle, David; Márquez, Eladio J

    2015-06-10

    We calculated the linkage disequilibrium between all pairs of variants in the Drosophila Genome Reference Panel with minor allele count ≥5. We used r² ≥ 0.5 as the cutoff for a highly correlated SNP. We make available the list of all highly correlated SNPs for use in association studies. Seventy-six percent of variant SNPs are highly correlated with at least one other SNP, and the mean number of highly correlated SNPs per variant over the whole genome is 83.9. Disequilibrium between distant SNPs is also common when minor allele frequency (MAF) is low: 37% of SNPs with MAF < 0.1 are highly correlated with SNPs more than 100 kb distant. Although SNPs within regions with polymorphic inversions are highly correlated with somewhat larger numbers of SNPs, and these correlated SNPs are on average farther away, the probability that a SNP in such regions is highly correlated with at least one other SNP is very similar to SNPs outside inversions. Previous karyotyping of the DGRP lines has been inconsistent, and we used LD and genotype to investigate these discrepancies. When previous studies agreed on inversion karyotype, our analysis was almost perfectly concordant with those assignments. In discordant cases, and for inversion heterozygotes, our results suggest errors in two previous analyses or discordance between genotype and karyotype. Heterozygosities of chromosome arms are, in many cases, surprisingly highly correlated, suggesting strong epistatic selection during the inbreeding and maintenance of the DGRP lines. Copyright © 2015 Houle and Márquez.

  6. Dynamics of Dark-Fly Genome Under Environmental Selections.

    PubMed

    Izutsu, Minako; Toyoda, Atsushi; Fujiyama, Asao; Agata, Kiyokazu; Fuse, Naoyuki

    2015-12-04

    Environmental adaptation is one of the most fundamental features of organisms. Modern genome science has identified some genes associated with adaptive traits of organisms, and has provided insights into environmental adaptation and evolution. However, how genes contribute to adaptive traits and how traits are selected under an environment in the course of evolution remain mostly unclear. To approach these issues, we utilize "Dark-fly", a Drosophila melanogaster line maintained in constant dark conditions for more than 60 years. Our previous analysis identified 220,000 single nucleotide polymorphisms (SNPs) in the Dark-fly genome, but did not clarify which SNPs of Dark-fly are truly adaptive for living in the dark. We found here that Dark-fly dominated over the wild-type fly in a mixed population under dark conditions, and based on this domination we designed an experiment for genome reselection to identify adaptive genes of Dark-fly. For this experiment, large mixed populations of Dark-fly and the wild-type fly were maintained in light conditions or in dark conditions, and the frequencies of Dark-fly SNPs were compared between these populations across the whole genome. We thereby detected condition-dependent selections toward approximately 6% of the genome. In addition, we observed the time-course trajectory of SNP frequency in the mixed populations through generations 0, 22, and 49, which resulted in notable categorization of the selected SNPs into three types with different combinations of positive and negative selections. Our data provided a list of about 100 strong candidate genes associated with the adaptive traits of Dark-fly. Copyright © 2016 Izutsu et al.

  7. Dynamics of Dark-Fly Genome Under Environmental Selections

    PubMed Central

    Izutsu, Minako; Toyoda, Atsushi; Fujiyama, Asao; Agata, Kiyokazu; Fuse, Naoyuki

    2015-01-01

    Environmental adaptation is one of the most fundamental features of organisms. Modern genome science has identified some genes associated with adaptive traits of organisms, and has provided insights into environmental adaptation and evolution. However, how genes contribute to adaptive traits and how traits are selected under an environment in the course of evolution remain mostly unclear. To approach these issues, we utilize “Dark-fly”, a Drosophila melanogaster line maintained in constant dark conditions for more than 60 years. Our previous analysis identified 220,000 single nucleotide polymorphisms (SNPs) in the Dark-fly genome, but did not clarify which SNPs of Dark-fly are truly adaptive for living in the dark. We found here that Dark-fly dominated over the wild-type fly in a mixed population under dark conditions, and based on this domination we designed an experiment for genome reselection to identify adaptive genes of Dark-fly. For this experiment, large mixed populations of Dark-fly and the wild-type fly were maintained in light conditions or in dark conditions, and the frequencies of Dark-fly SNPs were compared between these populations across the whole genome. We thereby detected condition-dependent selections toward approximately 6% of the genome. In addition, we observed the time-course trajectory of SNP frequency in the mixed populations through generations 0, 22, and 49, which resulted in notable categorization of the selected SNPs into three types with different combinations of positive and negative selections. Our data provided a list of about 100 strong candidate genes associated with the adaptive traits of Dark-fly. PMID:26637434

  8. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.

    PubMed

    Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D

    2013-02-28

    Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array, which is more SNPs than are needed to use genomic selection in tree breeding programs. 
Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.

  9. Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm

    PubMed Central

    Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron

    2012-01-01

    Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421

  10. A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation

    PubMed Central

    2013-01-01

    Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. 
Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355

  11. Natural variation in genes potentially involved in plant architecture and adaptation in switchgrass (Panicum virgatum L.).

    PubMed

    Bahri, Bochra A; Daverdin, Guillaume; Xu, Xiangyang; Cheng, Jan-Fang; Barry, Kerrie W; Brummer, E Charles; Devos, Katrien M

    2018-06-14

    Advances in genomic technologies have expanded our ability to accurately and exhaustively detect natural genomic variants that can be applied in crop improvement and to increase our knowledge of plant evolution and adaptation. Switchgrass (Panicum virgatum L.), an allotetraploid (2n = 4× = 36) perennial C4 grass (Poaceae family) native to North America and a feedstock crop for cellulosic biofuel production, has a large potential for genetic improvement due to its high genotypic and phenotypic variation. In this study, we analyzed single nucleotide polymorphism (SNP) variation in 372 switchgrass genotypes belonging to 36 accessions for 12 genes putatively involved in biomass production to investigate signatures of selection that could have led to ecotype differentiation and to population adaptation to geographic zones. A total of 11,682 SNPs were mined from ~ 15 Gb of sequence data, out of which 251 SNPs were retained after filtering. Population structure analysis largely grouped upland accessions into one subpopulation and lowland accessions into two additional subpopulations. The most frequent SNPs were in homozygous state within accessions. Sixty percent of the exonic SNPs were non-synonymous and, of these, 45% led to non-conservative amino acid changes. The non-conservative SNPs were largely in linkage disequilibrium with one haplotype being predominantly present in upland accessions while the other haplotype was commonly present in lowland accessions. Tajima's test of neutrality indicated that PHYB, a gene involved in photoperiod response, was under positive selection in the switchgrass population. PHYB carried a SNP leading to a non-conservative amino acid change in the PAS domain, a region that acts as a sensor for light and oxygen in signal transduction. 
Several non-conservative SNPs in genes potentially involved in plant architecture and adaptation have been identified and led to population structure and genetic differentiation of ecotypes in switchgrass. We suggest here that PHYB is a key gene involved in switchgrass natural selection. Further analyses are needed to determine whether any of the non-conservative SNPs identified play a role in the differential adaptation of upland and lowland switchgrass.

  12. Developing a new nonbinary SNP fluorescent multiplex detection system for forensic application in China.

    PubMed

    Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila

    2017-04-01

    Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs and because they can be typed in highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected an additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations in China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, combining this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed gave a cumulative probability of exclusion of 0.999991 for the 40 nonbinary SNPs, and no significant linkage disequilibrium was observed among the 40 nonbinary SNPs. Thus, we concluded that this new system of 20 new nonbinary SNPs could provide highly informative polymorphic data for forensic application and would serve as a potentially valuable supplement to forensic DNA analysis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
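
    The cumulative figures quoted in this abstract are conventionally obtained by combining per-locus values under the assumption that the loci are independent, which is consistent with the reported absence of significant linkage disequilibrium. The following is a minimal sketch of that combination rule only; the function name and the per-locus values are hypothetical and are not taken from the study.

```python
from math import prod

def cumulative_probability(per_locus):
    """Combine independent per-locus probabilities as 1 - prod(1 - p_i).

    The same rule is commonly used for both the cumulative probability of
    exclusion and the total power of discrimination across unlinked markers.
    """
    return 1.0 - prod(1.0 - p for p in per_locus)

# Hypothetical per-locus exclusion probabilities (illustrative, not study data).
example_pe = [0.30, 0.25, 0.40]
print(round(cumulative_probability(example_pe), 4))
```

    Adding loci can only raise the combined value, which is why pooling two 20-SNP panels yields a higher cumulative probability of exclusion than either panel alone.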

  13. Identification and Evaluation of Single-Nucleotide Polymorphisms in Allotetraploid Peanut (Arachis hypogaea L.) Based on Amplicon Sequencing Combined with High Resolution Melting (HRM) Analysis.

    PubMed

    Hong, Yanbin; Pandey, Manish K; Liu, Ying; Chen, Xiaoping; Liu, Hong; Varshney, Rajeev K; Liang, Xuanqiang; Huang, Shangzhi

    2015-01-01

    The cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB) species derived from the A-genome (Arachis duranensis) and B-genome (Arachis ipaensis) progenitors. Presence of two versions of a DNA sequence based on the two progenitor genomes poses a serious technical and analytical problem during single nucleotide polymorphism (SNP) marker identification and analysis. In this context, we have analyzed 200 amplicons derived from expressed sequence tags (ESTs) and genome survey sequences (GSS) to identify SNPs in a panel of genotypes consisting of 12 cultivated peanut varieties and two diploid progenitors representing the ancestral genomes. A total of 18 EST-SNPs and 44 genomic-SNPs were identified in 12 peanut varieties by aligning the sequence of A. hypogaea with diploid progenitors. The average frequency of sequence polymorphism was higher for genomic-SNPs than the EST-SNPs with one genomic-SNP every 1011 bp as compared to one EST-SNP every 2557 bp. In order to estimate the potential and further applicability of these identified SNPs, 96 peanut varieties were genotyped using high resolution melting (HRM) method. Polymorphism information content (PIC) values for EST-SNPs ranged between 0.021 and 0.413 with a mean of 0.172 in the set of peanut varieties, while genomic-SNPs ranged between 0.080 and 0.478 with a mean of 0.249. Total 33 SNPs were used for polymorphism detection among the parents and 10 selected lines from mapping population Y13Zh (Zhenzhuhei × Yueyou13). Of the total 33 SNPs, nine SNPs showed polymorphism in the mapping population Y13Zh, and seven SNPs were successfully mapped into five linkage groups. Our results showed that SNPs can be identified in allotetraploid peanut with high accuracy through amplicon sequencing and HRM assay. The identified SNPs were very informative and can be used for different genetic and breeding applications in peanut.

  14. Mucoadhesive buccal tablets containing silymarin Eudragit-loaded nanoparticles: formulation, characterisation and ex vivo permeation.

    PubMed

    El-Nahas, Amira E; Allam, Ahmed N; El-Kamel, Amal H

    2017-08-01

    Eudragit-loaded silymarin nanoparticles (SNPs) and their formulation into buccal mucoadhesive tablets were investigated to improve the low bioavailability of silymarin through buccal delivery. Characterisation of SNPs and silymarin buccal tablets (SBTs) containing the optimised NPs was performed. Ex vivo permeability of nominated SBTs was assessed using chicken pouch mucosa compared to SNPs and drug suspension, followed by histopathological examination. Selected SNPs had a small size (<150 nm) and an encapsulation efficiency >77%, with drug release of about 90% after 6 h. For SBTs, all physicochemical parameters were satisfactory for the different polymers used. DSC and FT-IR studies suggested the presence of silymarin in an amorphous state. Ex vivo permeation studies showed a marked enhancement of silymarin permeation after NP formation, and a still greater increase after formulation into BTs, relative to the corresponding drug dispersion, with membrane integrity confirmed. Incorporation of SNPs into BTs could be an efficient vehicle for delivery of silymarin.

  15. Single nucleotide polymorphisms in bone turnover-related genes in Koreans: ethnic differences in linkage disequilibrium and haplotype

    PubMed Central

    Kim, Kyung-Seon; Kim, Ghi-Su; Hwang, Joo-Yeon; Lee, Hye-Ja; Park, Mi-Hyun; Kim, Kwang-joong; Jung, Jongsun; Cha, Hyo-Soung; Shin, Hyoung Doo; Kang, Jong-Ho; Park, Eui Kyun; Kim, Tae-Ho; Hong, Jung-Min; Koh, Jung-Min; Oh, Bermseok; Kimm, Kuchan; Kim, Shin-Yoon; Lee, Jong-Young

    2007-01-01

    Background Osteoporosis is defined as the loss of bone mineral density that leads to bone fragility with aging. Population-based case-control studies have identified polymorphisms in many candidate genes that have been associated with bone mass maintenance or osteoporotic fracture. To investigate single nucleotide polymorphisms (SNPs) that are associated with osteoporosis, we examined the genetic variation among Koreans by analyzing 81 genes according to their function in bone formation and resorption during bone remodeling. Methods We resequenced all the exons, splice junctions and promoter regions of candidate osteoporosis genes using 24 unrelated Korean individuals. Using the common SNPs from our study and the HapMap database, a statistical analysis of deviation in heterozygosity was performed. Results We identified 942 variants, including 888 SNPs, 43 insertion/deletion polymorphisms, and 11 microsatellite markers. Of the SNPs, 557 (63%) had been previously identified and 331 (37%) were newly discovered in the Korean population. When SNPs in the Korean population were compared with those in the HapMap database, 1% (or less) of SNPs in the Japanese and Chinese subpopulations and 20% of those in Caucasian and African subpopulations were significantly differentiated from the Hardy-Weinberg expectations. In addition, an analysis of the genetic diversity showed that there were no significant differences among Korean, Han Chinese and Japanese populations, but African and Caucasian populations were significantly differentiated in selected genes. Nevertheless, in the detailed analysis of genetic properties, the LD and haplotype block patterns among the five sub-populations were substantially different from one another. Conclusion Through the resequencing of 81 osteoporosis candidate genes, 118 unknown SNPs with a minor allele frequency (MAF) > 0.05 were discovered in the Korean population. 
In addition, using the common SNPs between our study and HapMap, an analysis of genetic diversity and deviation in heterozygosity was performed and the polymorphisms of the above genes among the five populations were substantially differentiated from one another. Further studies of osteoporosis could utilize the polymorphisms identified in our data since they may have important implications for the selection of highly informative SNPs for future association studies. PMID:18036257
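
    The comparison against Hardy-Weinberg expectations mentioned in this abstract can be illustrated with the textbook chi-square goodness-of-fit statistic for a biallelic SNP. This is a sketch under stated assumptions: the abstract does not say which test the authors used, and the function name and genotype counts below are hypothetical.

```python
def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square statistic (1 degree of freedom) for deviation from
    Hardy-Weinberg proportions, given observed genotype counts."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p                       # frequency of allele B
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expectation (monomorphic SNPs).
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)

# Hypothetical counts for 24 individuals showing a heterozygote deficit.
stat = hwe_chi_square(10, 4, 10)
print(round(stat, 3))  # compare against 3.84, the 5% critical value for 1 df
```

    With samples as small as 24 individuals an exact test is often preferred to the chi-square approximation; the statistic above is shown only because it makes the expected-versus-observed logic explicit.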

  16. Genetic variation in prostate-specific antigen-detected prostate cancer and the effect of control selection on genetic association studies

    PubMed Central

    Knipe, Duleeka W; Evans, David M; Kemp, John P.; Eeles, Rosalind; Easton, Douglas F; Kote-Jarai, Zsofia; Al Olama, Ali Amin; Benlloch, Sara; Donovan, Jenny L.; Hamdy, Freddie C.; Neal, David E

    2014-01-01

    Background Only a minority of the genetic component of prostate cancer (PrCa) risk has been explained. Some observed associations of single nucleotide polymorphisms (SNPs) with PrCa might arise from associations of these SNPs with circulating prostate specific antigen (PSA) because PSA values are used to select controls. Methods We undertook a genome-wide association study (GWAS) of screen detected PrCa (ProtecT 1146 cases and 1804 controls); meta-analysed the results with those from the previously published UK Genetic Prostate Cancer Study (1854 cases and 1437 controls); investigated associations of SNPs with PrCa using either ‘low’ (PSA <0.5ng/ml) or ‘high’ (PSA ≥3ng/ml, biopsy negative) PSA controls; and investigated associations of SNPs with PSA. Results The ProtecT GWAS confirmed previously reported associations of PrCa at 3 loci: 10q11.23, 17q24.3 and 19q13.33. The meta-analysis confirmed associations of PrCa with SNPs near 4 previously identified loci (8q24.21,10q11.23, 17q24.3 and 19q13.33). When comparing PrCa cases with low PSA controls, alleles at genetic markers rs1512268, rs445114, rs10788160, rs11199874, rs17632542, rs266849 and rs2735839 were associated with an increased risk of PrCa, but the effect-estimates were attenuated to the null when using high PSA controls (p for heterogeneity in effect-estimates<0.04). We found a novel inverse association of rs9311171-T with circulating PSA. Conclusions Differences in effect estimates for PrCa observed when comparing low vs. high PSA controls, may be explained by associations of these SNPs with PSA. Impact These findings highlight the need for inferences from genetic studies of PrCa risk to carefully consider the influence of control selection criteria. PMID:24753544

  17. EurEAs_Gplex--A new SNaPshot assay for continental population discrimination and gender identification.

    PubMed

    Daca-Roszak, P; Pfeifer, A; Żebracka-Gala, J; Jarząb, B; Witt, M; Ziętkiewicz, E

    2016-01-01

    Assays that allow analysis of the biogeographic origin of biological samples in a standard forensic laboratory have to target a small number of highly differentiating markers. Such markers should be easy to multiplex and the assay must perform well in the degraded and scarce biological material. SNPs localized in the genome regions, which in the past were subjected to differential selective pressure in various populations, are the most widely used markers in the studies of biogeographic affiliation. SNPs reflecting biogeographic differences not related to any phenotypic traits are not sufficiently explored. The goal of our study was to identify a small set of SNPs not related to any known pigmentation/phenotype-specific genes, which would allow efficient discrimination between populations of Europe and East Asia. The selection of SNPs was based on the comparative analysis of representative European and Chinese/Japanese samples (B-lymphocyte cell lines), genotyped using the Infinium HumanOmniExpressExome microarray (Illumina). The classifier, consisting of 24 unlinked SNPs (24-SNP classifier), was selected. The performance of a 14-SNP subset of this classifier (14-SNP subclassifier) was tested using genotype data from several populations. The 14-SNP subclassifier differentiated East Asians, Europeans and Africans with ∼100% accuracy; Palestinians, representative of the Middle East, clustered with Europeans, while Amerindians and Pakistani were placed between East Asian and European populations. Based on these results, we have developed a SNaPshot assay (EurEAs_Gplex) for genotyping SNPs from the 14-SNP subclassifier, combined with an additional marker for gender identification. Forensic utility of the EurEAs_Gplex was verified using degraded and low quantity DNA samples. 
The performance of the EurEAs_Gplex was satisfactory when using degraded DNA; tests using low quantity DNA samples revealed a previously not described source of genotyping errors, potentially important for any SNaPshot-based assays. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  18. Impact of pre-imputation SNP-filtering on genotype imputation results

    PubMed Central

    2014-01-01

    Background Imputation of partially missing or unobserved genotypes is an indispensable tool for SNP data analyses. However, research and understanding of the impact of initial SNP-data quality control on imputation results is still limited. In this paper, we aim to evaluate the effect of different strategies of pre-imputation quality filtering on the performance of the widely used imputation algorithms MaCH and IMPUTE. Results We considered three scenarios: imputation of partially missing genotypes with usage of an external reference panel, without usage of an external reference panel, as well as imputation of completely un-typed SNPs using an external reference panel. We first created various datasets applying different SNP quality filters and masking certain percentages of randomly selected high-quality SNPs. We imputed these SNPs and compared the results between the different filtering scenarios by using established and newly proposed measures of imputation quality. While the established measures assess certainty of imputation results, our newly proposed measures focus on the agreement with true genotypes. These measures showed that pre-imputation SNP-filtering might be detrimental regarding imputation quality. Moreover, the strongest drivers of imputation quality were in general the burden of missingness and the number of SNPs used for imputation. We also found that using a reference panel always improves imputation quality of partially missing genotypes. MaCH performed slightly better than IMPUTE2 in most of our scenarios. Again, these results were more pronounced when using our newly defined measures of imputation quality. Conclusion Even a moderate filtering has a detrimental effect on the imputation quality. Therefore little or no SNP filtering prior to imputation appears to be the best strategy for imputing small to moderately sized datasets. 
Our results also showed that for these datasets, MaCH performs slightly better than IMPUTE2 in most scenarios at the cost of increased computing time. PMID:25112433
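
    The evaluation design described in this abstract (mask a percentage of randomly selected known genotypes, impute them, then score agreement with the true values) can be sketched as follows. This is an illustration only: the helper names are hypothetical, genotypes are assumed to be coded 0/1/2, and the simple concordance rate stands in for the paper's own agreement measures, which the abstract does not define.

```python
import random

def mask_genotypes(genotypes, fraction, seed=0):
    """Hide a random fraction of genotype calls; return the masked list
    and the sorted indices that were hidden."""
    rng = random.Random(seed)
    n_mask = round(len(genotypes) * fraction)
    masked_idx = set(rng.sample(range(len(genotypes)), n_mask))
    masked = [None if i in masked_idx else g for i, g in enumerate(genotypes)]
    return masked, sorted(masked_idx)

def concordance(true_genotypes, imputed, masked_idx):
    """Share of masked positions where the imputed call equals the truth."""
    if not masked_idx:
        return float("nan")
    hits = sum(true_genotypes[i] == imputed[i] for i in masked_idx)
    return hits / len(masked_idx)

# Toy data; a real study would run MaCH or IMPUTE instead of this placeholder.
truth = [0, 1, 2, 0, 0, 1, 0, 2, 0, 0]
masked, idx = mask_genotypes(truth, fraction=0.3)
observed = [g for g in masked if g is not None]
most_common = max(set(observed), key=observed.count)
imputed = [most_common if g is None else g for g in masked]
print(concordance(truth, imputed, idx))
```

    Scoring against held-out true genotypes is what distinguishes an agreement measure from a certainty measure, which only summarises how confident the imputation software is in its own calls.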

  19. SNP-VISTA: An Interactive SNPs Visualization Tool

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shah, Nameeta; Teplitsky, Michael V.; Pennacchio, Len A.

    2005-07-05

    Recent advances in sequencing technologies promise better diagnostics for many diseases as well as better understanding of evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it is possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease and then screen for causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples makes possible more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at http://genome.lbl.gov/vista/snpvista.

  20. Partition dataset according to amino acid type improves the prediction of deleterious non-synonymous SNPs

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Jing; Li, Yuan-Yuan; Shanghai Center for Bioinformation Technology, Shanghai 200235

    2012-03-02

    Highlights: Proper dataset partition can improve the prediction of deleterious nsSNPs. Partition according to original residue type at nsSNP is a good criterion. Similar strategy is supposed promising in other machine learning problems. Abstract: Many non-synonymous SNPs (nsSNPs) are associated with diseases, and numerous machine learning methods have been applied to train classifiers for sorting disease-associated nsSNPs from neutral ones. The continuously accumulated nsSNP data allows us to further explore better prediction approaches. In this work, we partitioned the training data into 20 subsets according to either original or substituted amino acid type at the nsSNP site. Using support vector machine (SVM), training classification models on each subset resulted in an overall accuracy of 76.3% or 74.9% depending on the two different partition criteria, while training on the whole dataset obtained an accuracy of only 72.6%. Moreover, the dataset was also randomly divided into 20 subsets, but the corresponding accuracy was only 73.2%. Our results demonstrated that partitioning the whole training dataset into subsets properly, i.e., according to the residue type at the nsSNP site, will improve the performance of the trained classifiers significantly, which should be valuable in developing better tools for predicting the disease-association of nsSNPs.
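
    The partitioning step described in this abstract amounts to grouping training records by the amino acid at the nsSNP site before fitting one classifier per group. A minimal sketch of the grouping alone is given below; the record layout (keys "original", "substituted", "label") and the function name are assumptions made for illustration, and the per-subset SVM training used in the paper is not reproduced here.

```python
from collections import defaultdict

def partition_by_residue(records, key="original"):
    """Group nsSNP records by the amino acid named under `key`
    ("original" or "substituted"); returns {residue: [records]}."""
    subsets = defaultdict(list)
    for rec in records:
        subsets[rec[key]].append(rec)
    return dict(subsets)

# Hypothetical records; labels mark disease-associated (1) or neutral (0).
records = [
    {"original": "A", "substituted": "V", "label": 0},
    {"original": "R", "substituted": "W", "label": 1},
    {"original": "A", "substituted": "T", "label": 0},
    {"original": "G", "substituted": "R", "label": 1},
]

by_original = partition_by_residue(records, key="original")
for residue, subset in sorted(by_original.items()):
    print(residue, len(subset))
```

    With the 20 standard amino acids this yields at most 20 subsets per criterion, matching the number of partitions reported in the abstract.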

  1. A functional SNP catalog of overlapping miRNA-binding sites in genes implicated in prion disease and other neurodegenerative disorders.

    PubMed

    Saba, Reuben; Medina, Sarah J; Booth, Stephanie A

    2014-10-01

    The involvement of SNPs in miRNA target sites remains poorly investigated in neurodegenerative disease. In addition to associations with disease risk, such genetic variations can also provide novel insight into mechanistic pathways that may be responsible for disease etiology and/or pathobiology. To identify SNPs associated specifically with degenerating neurons, we restricted our analysis to genes that are dysregulated in CA1 hippocampal neurons of mice during early, preclinical phase of Prion disease. The 125 genes chosen are also implicated in other numerous degenerative and neurological diseases and disorders and are therefore likely to be of fundamental importance. We predicted those SNPs that could increase, decrease, or have neutral effects on miRNA binding. This group of genes was more likely to possess DNA variants than were genes chosen at random. Furthermore, many of the SNPs are common within the human population, and could contribute to the growing awareness that miRNAs and associated SNPs could account for detrimental neurological states. Interestingly, SNPs that overlapped miRNA-binding sites in the 3'-UTR of GABA-receptor subunit coding genes were particularly enriched. Moreover, we demonstrated that SNP rs9291296 would strengthen miR-26a-5p binding to a highly conserved site in the 3'-UTR of gamma-aminobutyric acid receptor subunit alpha-4. © 2014 WILEY PERIODICALS, INC.

  2. An omnibus permutation test on ensembles of two-locus analyses can detect pure epistasis and genetic heterogeneity in genome-wide association studies.

    PubMed

    Setsirichok, Damrongrit; Tienboon, Phuwadej; Jaroonruang, Nattapong; Kittichaijaroen, Somkit; Wongseree, Waranyu; Piroonratana, Theera; Usavanarong, Touchpong; Limwongse, Chanin; Aporntewan, Chatchawit; Phadoongsidhi, Marong; Chaiyaratana, Nachol

    2013-01-01

    This article presents the ability of an omnibus permutation test on ensembles of two-locus analyses (2LOmb) to detect pure epistasis in the presence of genetic heterogeneity. The performance of 2LOmb is evaluated in various simulation scenarios covering two independent causes of complex disease where each cause is governed by a purely epistatic interaction. Different scenarios are set up by varying the number of available single nucleotide polymorphisms (SNPs) in data, number of causative SNPs and ratio of case samples from two affected groups. The simulation results indicate that 2LOmb outperforms multifactor dimensionality reduction (MDR) and random forest (RF) techniques in terms of a low number of output SNPs and a high number of correctly-identified causative SNPs. Moreover, 2LOmb is capable of identifying the number of independent interactions in tractable computational time and can be used in genome-wide association studies. 2LOmb is subsequently applied to a type 1 diabetes mellitus (T1D) data set, which is collected from a UK population by the Wellcome Trust Case Control Consortium (WTCCC). After screening for SNPs that locate within or near genes and exhibit no marginal single-locus effects, the T1D data set is reduced to 95,991 SNPs from 12,146 genes. The 2LOmb search in the reduced T1D data set reveals that 12 SNPs, which can be divided into two independent sets, are associated with the disease. The first SNP set consists of three SNPs from MUC21 (mucin 21, cell surface associated), three SNPs from MUC22 (mucin 22), two SNPs from PSORS1C1 (psoriasis susceptibility 1 candidate 1) and one SNP from TCF19 (transcription factor 19). A four-locus interaction between these four genes is also detected. The second SNP set consists of three SNPs from ATAD1 (ATPase family, AAA domain containing 1). 
Overall, the findings indicate the detection of pure epistasis in the presence of genetic heterogeneity and provide an alternative explanation for the aetiology of T1D in the UK population.
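
    For readers unfamiliar with permutation testing, the general idea behind a label-permutation p-value is sketched below. This is a generic illustration and not the 2LOmb procedure, whose details are not given in the abstract; the test statistic (absolute difference in mean minor-allele count between cases and controls), the function name and the data are all hypothetical.

```python
import random

def permutation_p_value(values, labels, n_perm=1000, seed=0):
    """Estimate a p-value by shuffling case/control labels.

    `values` are per-sample measurements (e.g. minor-allele counts) and
    `labels` are 1 for cases and 0 for controls.
    """
    rng = random.Random(seed)

    def stat(lbls):
        cases = [v for v, l in zip(values, lbls) if l == 1]
        ctrls = [v for v, l in zip(values, lbls) if l == 0]
        return abs(sum(cases) / len(cases) - sum(ctrls) / len(ctrls))

    observed = stat(labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)          # break any genotype-phenotype link
        if stat(shuffled) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)

# Toy data: cases carry the minor allele far more often than controls.
values = [2] * 10 + [0] * 10
labels = [1] * 10 + [0] * 10
print(permutation_p_value(values, labels))
```

    Because the null distribution is built from the data themselves, this approach makes no distributional assumptions, at the cost of the repeated recomputation that makes computational tractability a concern at genome-wide scale.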

  3. Association between variants in genes involved in the immune response and prostate cancer risk in men randomized to the finasteride arm in the Prostate Cancer Prevention Trial.

    PubMed

    Winchester, Danyelle A; Till, Cathee; Goodman, Phyllis J; Tangen, Catherine M; Santella, Regina M; Johnson-Pais, Teresa L; Leach, Robin J; Xu, Jianfeng; Zheng, S Lilly; Thompson, Ian M; Lucia, M Scott; Lippman, Scott M; Parnes, Howard L; Isaacs, William B; De Marzo, Angelo M; Drake, Charles G; Platz, Elizabeth A

    2017-06-01

    We reported that some, but not all single nucleotide polymorphisms (SNPs) in select immune response genes are associated with prostate cancer, but not individually with the prevalence of intraprostatic inflammation in the Prostate Cancer Prevention Trial (PCPT) placebo arm. Here, we investigated whether these same SNPs are associated with risk of lower- and higher-grade prostate cancer in men randomized to finasteride, and with prevalence of intraprostatic inflammation among controls. Methods A total of 16 candidate SNPs in IL1β, IL2, IL4, IL6, IL8, IL10, IL12(p40), IFNG, MSR1, RNASEL, TLR4, and TNFA and 7 tagSNPs in IL10 were genotyped in 625 white prostate cancer cases, and 532 white controls negative for cancer on an end-of-study biopsy nested in the PCPT finasteride arm. We used logistic regression to estimate log-additive odds ratios (OR) and 95% confidence intervals (CI) adjusting for age and family history. Minor alleles of rs2243250 (T) in IL4 (OR = 1.46, 95% CI 1.03-2.08, P-trend = 0.03), rs1800896 (G) in IL10 (OR = 0.77, 95% CI 0.61-0.96, P-trend = 0.02), rs2430561 (A) in IFNG (OR = 1.33, 95% CI 1.02-1.74; P-trend = 0.04), rs3747531 (C) in MSR1 (OR = 0.55, 95% CI 0.32-0.95; P-trend = 0.03), and possibly rs4073 (A) in IL8 (OR = 0.81, 95% CI 0.64-1.01, P-trend = 0.06) were associated with higher- (Gleason 7-10; N = 222), but not lower- (Gleason 2-6; N = 380) grade prostate cancer. In men with low PSA (<2 ng/mL), these higher-grade disease associations were attenuated and/or no longer significant, whereas associations with higher-grade disease were apparent for minor alleles of rs1800795 (C: OR = 0.70, 95% CI 0.51-0.94, P-trend = 0.02) and rs1800797 (A: OR = 0.72, 95% CI 0.53-0.98, P-trend = 0.04) in IL6. While some IL10 tagSNPs were associated with lower- and higher-grade prostate cancer, distributions of IL10 haplotypes did not differ, except possibly between higher-grade cases and controls among those with low PSA (P = 0.07). 
    We did not observe an association between the studied SNPs and intraprostatic inflammation in the controls. In the PCPT finasteride arm, variation in genes involved in the immune response, including possibly IL8 and IL10 as in the placebo arm, may be associated with prostate cancer, especially higher-grade disease, but not with intraprostatic inflammation. We cannot rule out PSA-associated detection bias or chance due to multiple testing. © 2017 Wiley Periodicals, Inc.
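
    The odds ratios and confidence intervals reported above come from logistic regression, where OR = exp(beta) and a Wald 95% CI is exp(beta ± 1.96·SE). The sketch below is illustrative only: it assumes a Wald interval (the abstract does not say which interval was used), and the function names are hypothetical. It backs out the coefficient and standard error from the reported IL4 rs2243250 estimate (OR = 1.46, 95% CI 1.03-2.08) and rebuilds the interval, which matches the published bounds only up to rounding.

```python
import math

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def odds_ratio_ci(beta, se, z=Z_95):
    """Return (OR, lower, upper) from a log-odds coefficient and its standard error."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


def coef_from_reported(odds_ratio, lower, upper, z=Z_95):
    """Back out (beta, se) from a reported OR and CI, ASSUMING a Wald interval."""
    beta = math.log(odds_ratio)
    se = (math.log(upper) - math.log(lower)) / (2 * z)
    return beta, se


# Reported values for rs2243250 (T) in IL4, taken from the abstract.
beta, se = coef_from_reported(1.46, 1.03, 2.08)
or_hat, ci_low, ci_high = odds_ratio_ci(beta, se)
print(f"beta={beta:.3f} se={se:.3f} OR={or_hat:.2f} CI=({ci_low:.2f}, {ci_high:.2f})")
```

    Because published bounds are rounded to two decimals, the reconstructed interval can differ from the reported one in the last digit.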

  4. Genetic Variation in the Acorn Barnacle from Allozymes to Population Genomics

    PubMed Central

    Flight, Patrick A.; Rand, David M.

    2012-01-01

    Understanding the patterns of genetic variation within and among populations is a central problem in population and evolutionary genetics. We examine this question in the acorn barnacle, Semibalanus balanoides, in which the allozyme loci Mpi and Gpi have been implicated in balancing selection due to varying selective pressures at different spatial scales. We review the patterns of genetic variation at the Mpi locus, compare this to levels of population differentiation at mtDNA and microsatellites, and place these data in the context of genome-wide variation from high-throughput sequencing of population samples spanning the North Atlantic. Despite considerable geographic variation in the patterns of selection at the Mpi allozyme, this locus shows rather low levels of population differentiation at ecological and trans-oceanic scales (FST ∼ 5%). Pooled population sequencing was performed on samples from Rhode Island (RI), Maine (ME), and Southwold, England (UK). Analysis of more than 650 million reads identified approximately 335,000 high-quality SNPs in 19 million base pairs of the S. balanoides genome. Much variation is shared across the Atlantic, but there are significant examples of strong population differentiation among samples from RI, ME, and UK. An FST outlier screen of more than 22,000 contigs provided a genome-wide context for interpretation of earlier studies on allozymes, mtDNA, and microsatellites. FST values for allozymes, mtDNA and microsatellites are close to the genome-wide average for random SNPs, with the exception of the trans-Atlantic FST for mtDNA. The majority of FST outliers were unique between individual pairs of populations, but some genes show shared patterns of excess differentiation. These data indicate that gene flow is high, that selection is strong on a subset of genes, and that a variety of genes are experiencing diversifying selection at large spatial scales. This survey of polymorphism in S. balanoides provides a number of genomic tools that promise to make this a powerful model for ecological genomics of the rocky intertidal. PMID:22767487

  5. The impact of single nucleotide polymorphism in monomeric alpha-amylase inhibitor genes from wild emmer wheat, primarily from Israel and Golan

    PubMed Central

    2010-01-01

    Background Various enzyme inhibitors act on key insect gut digestive hydrolases, including alpha-amylases and proteinases. Alpha-amylase inhibitors have been widely investigated for their possible use in strengthening a plant's defense against insects that are highly dependent on starch as an energy source. We attempted to unravel the diversity of monomeric alpha-amylase inhibitor genes of Israeli and Golan Heights' wild emmer wheat with different ecological factors (e.g., geography, water, and temperature). Population methods that analyze the nature and frequency of allele diversity within a species and the codon analysis method (comparing patterns of synonymous and non-synonymous changes in protein coding sequences) were used to detect natural selection. Results Three hundred and forty-eight sequences encoding monomeric alpha-amylase inhibitors (WMAI) were obtained from 14 populations of wild emmer wheat. The frequency of SNPs in WMAI genes was 1 out of 16.3 bases, where 28 SNPs were detected in the coding sequence. The results of purifying and the positive selection hypothesis (p < 0.05) showed that the sequences of WMAI were contributed by both natural selection and co-evolution, which ensured conservation of protein function and inhibition against diverse insect amylases. The majority of amino acid substitutions occurred at the C-terminal (positive selection domain), which ensured the stability of WMAI. SNPs in this gene could be classified into several categories associated with water, temperature, and geographic factors, respectively. Conclusions Great diversity at the WMAI locus, both between and within populations, was detected in the populations of wild emmer wheat. It was revealed that WMAI were naturally selected for across populations by a ratio of dN/dS as expected. Ecological factors, singly or in combination, explained a significant proportion of the variations in the SNPs. 
    A sharp genetic divergence over very short geographic distances compared to a small genetic divergence between large geographic distances also suggested that the SNPs were subjected to natural selection, and ecological factors had an important evolutionary role in polymorphisms at this locus. According to population and codon analysis, these results suggested that monomeric alpha-amylase inhibitors are adaptively selected under different environmental conditions. PMID:20534122

  6. Discovery of single nucleotide polymorphisms in candidate genes associated with fertility and production traits in Holstein cattle

    PubMed Central

    2013-01-01

    Background Identification of single nucleotide polymorphisms (SNPs) for specific genes involved in reproduction might improve reliability of genomic estimates for these low-heritability traits. Semen from 550 Holstein bulls of high (≥ 1.7; n = 288) or low (≤ −2; n = 262) daughter pregnancy rate (DPR) was genotyped for 434 candidate SNPs using the Sequenom MassARRAY® system. Three types of SNPs were evaluated: SNPs previously reported to be associated with reproductive traits or physically close to genetic markers for reproduction, SNPs in genes that are well known to be involved in reproductive processes, and SNPs in genes that are differentially expressed between physiological conditions in a variety of tissues associated in reproductive function. Eleven reproduction and production traits were analyzed. Results A total of 40 SNPs were associated (P < 0.05) with DPR. Among these were genes involved in the endocrine system, cell signaling, immune function and inhibition of apoptosis. A total of 10 genes were regulated by estradiol. In addition, 22 SNPs were associated with heifer conception rate, 33 with cow conception rate, 36 with productive life, 34 with net merit, 23 with milk yield, 19 with fat yield, 13 with fat percent, 19 with protein yield, 22 with protein percent, and 13 with somatic cell score. The allele substitution effect for SNPs associated with heifer conception rate, cow conception rate, productive life and net merit were in the same direction as for DPR. Allele substitution effects for several SNPs associated with production traits were in the opposite direction as DPR. Nonetheless, there were 29 SNPs associated with DPR that were not negatively associated with production traits. Conclusion SNPs in a total of 40 genes associated with DPR were identified as well as SNPs for other traits. It might be feasible to include these SNPs into genomic tests of reproduction and other traits. 
    The genes associated with DPR are likely to be important for understanding the physiology of reproduction. Given the large number of SNPs associated with DPR that were not negatively associated with production traits, it should be possible to select for DPR without compromising production. PMID:23759029

  7. The more from East-Asian, the better: risk prediction of colorectal cancer risk by GWAS-identified SNPs among Japanese.

    PubMed

    Abe, Makiko; Ito, Hidemi; Oze, Isao; Nomura, Masatoshi; Ogawa, Yoshihiro; Matsuo, Keitaro

    2017-12-01

    Little is known about differences in genetic predisposition to colorectal cancer (CRC) between ethnicities, although many genetic traits common to CRC have been identified. This study investigated whether adding SNPs identified by GWAS in East Asian populations could improve CRC risk prediction in Japanese individuals, and explored the possible use of genetic risk groups as an instrument of risk communication. The derivation study included 558 patients with histologically verified colorectal cancer and 1116 first-visit outpatients; the replication study included 547 cases and 547 controls. In each population, we evaluated a prediction model for CRC risk based on a genetic risk group built from SNPs identified by GWAS in European populations, and a similarly developed model that added SNPs identified by GWAS in East Asian populations. We examined whether adding East Asian-specific SNPs would improve discrimination. Six SNPs (rs6983267, rs4779584, rs4444235, rs9929218, rs10936599, rs16969681) of 23 SNPs from European-based GWAS and five SNPs (rs704017, rs11196172, rs10774214, rs647161, rs2423279) of ten SNPs from Asian-based GWAS were selected for the CRC risk prediction model. Compared with the 6-SNP model, the 11-SNP model including the Asian GWAS SNPs showed improved discrimination in receiver operating characteristic analysis. The improvement was statistically significant in both the derivation (P = 0.0039) and replication (P = 0.0018) studies. Using the genetic risk group based on 11 SNPs, we estimated that the cumulative risk of CRC at age 80 is approximately 13% in the high-risk group and 6% in the low-risk group. We constructed a more efficient CRC risk prediction model with 11 SNPs, including newly identified East Asian-based GWAS SNPs (rs704017, rs11196172, rs10774214, rs647161, rs2423279). Risk grouping based on 11 SNPs depicted a lifetime difference in CRC risk. This might be useful for effective individualized prevention in East Asians.

  8. Linked genetic variants on chromosome 10 control ear morphology and body mass among dog breeds.

    PubMed

    Webster, Matthew T; Kamgari, Nona; Perloski, Michele; Hoeppner, Marc P; Axelsson, Erik; Hedhammar, Åke; Pielberg, Gerli; Lindblad-Toh, Kerstin

    2015-06-23

    The domestic dog is a rich resource for mapping the genetic components of phenotypic variation due to its unique population history involving strong artificial selection. Genome-wide association studies have revealed a number of chromosomal regions where genetic variation associates with morphological characters that typify dog breeds. A region on chromosome 10 is among those with the highest levels of genetic differentiation between dog breeds and is associated with body mass and ear morphology, a common motif of animal domestication. We characterised variation in this region to uncover haplotype structure and identify candidate functional variants. We first identified SNPs that strongly associate with body mass and ear type by comparing sequence variation in a 3 Mb region between 19 breeds with a variety of phenotypes. We next genotyped a subset of 123 candidate SNPs in 288 samples from 46 breeds to identify the variants most highly associated with phenotype and infer haplotype structure. A cluster of SNPs that associate strongly with the drop ear phenotype is located within a narrow interval downstream of the gene MSRB3, which is involved in human hearing. These SNPs are in strong genetic linkage with another set of variants that correlate with body mass within the gene HMGA2, which affects human height. In addition we find evidence that this region has been under selection during dog domestication, and identify a cluster of SNPs within MSRB3 that are highly differentiated between dogs and wolves. We characterise genetically linked variants that potentially influence ear type and body mass in dog breeds, both key traits that have been modified by selective breeding that may also be important for domestication. The finding that variants on long haplotypes have effects on more than one trait suggests that genetic linkage can be an important determinant of the phenotypic response to selection in domestic animals.

  9. Evolutionary Quantitative Genomics of Populus trichocarpa

    PubMed Central

    McKown, Athena D.; La Mantia, Jonathan; Guy, Robert D.; Ingvarsson, Pär K.; Hamelin, Richard; Mansfield, Shawn D.; Ehlting, Jürgen; Douglas, Carl J.; El-Kassaby, Yousry A.

    2015-01-01

    Forest trees generally show high levels of local adaptation and efforts focusing on understanding adaptation to climate will be crucial for species survival and management. Here, we address fundamental questions regarding the molecular basis of adaptation in undomesticated forest tree populations to past climatic environments by employing an integrative quantitative genetics and landscape genomics approach. Using this comprehensive approach, we studied the molecular basis of climate adaptation in 433 Populus trichocarpa (black cottonwood) genotypes originating across western North America. Variation in 74 field-assessed traits (growth, ecophysiology, phenology, leaf stomata, wood, and disease resistance) was investigated for signatures of selection (comparing QST-FST) using clustering of individuals by climate of origin (temperature and precipitation). 29,354 SNPs were investigated employing three different outlier detection methods and marker-inferred relatedness was estimated to obtain the narrow-sense estimate of population differentiation in wild populations. In addition, we compared our results with previously assessed selection of candidate SNPs using the 25 topographical units (drainages) across the P. trichocarpa sampling range as population groupings. Narrow-sense QST for 53% of distinct field traits was significantly divergent from expectations of neutrality (indicating adaptive trait variation); 2,855 SNPs showed signals of diversifying selection and of these, 118 SNPs (within 81 genes) were associated with adaptive traits (based on significant QST). Many SNPs were putatively pleiotropic for functionally uncorrelated adaptive traits, such as autumn phenology, height, and disease resistance. Evolutionary quantitative genomics in P. trichocarpa provides an enhanced understanding regarding the molecular basis of climate-driven selection in forest trees and we highlight that important loci underlying adaptive trait variation also show relationship to climate of origin. We consider our approach the most comprehensive, as it uncovers the molecular mechanisms of adaptation using multiple methods and tests. We also provide a detailed outline of the required analyses for studying adaptation to the environment in a population genomics context to better understand the species’ potential adaptive capacity to future climatic scenarios. PMID:26599762

  10. Target capture enrichment of nuclear SNP markers for massively parallel sequencing of degraded and mixed samples.

    PubMed

    Bose, Nikhil; Carlberg, Katie; Sensabaugh, George; Erlich, Henry; Calloway, Cassandra

    2018-05-01

    DNA from biological forensic samples can be highly fragmented and present in limited quantity. When DNA is highly fragmented, conventional PCR based Short Tandem Repeat (STR) analysis may fail as primer binding sites may not be present on a single template molecule. Single Nucleotide Polymorphisms (SNPs) can serve as an alternative type of genetic marker for analysis of degraded samples because the targeted variation is a single base. However, conventional PCR based SNP analysis methods still require intact primer binding sites for target amplification. Recently, probe capture methods for targeted enrichment have shown success in recovering degraded DNA as well as DNA from ancient bone samples using next-generation sequencing (NGS) technologies. The goal of this study was to design and test a probe capture assay targeting forensically relevant nuclear SNP markers for clonal and massively parallel sequencing (MPS) of degraded and limited DNA samples as well as mixtures. A set of 411 polymorphic markers totaling 451 nuclear SNPs (375 SNPs and 36 microhaplotype markers) was selected for the custom probe capture panel. The SNP markers were selected for a broad range of forensic applications including human individual identification, kinship, and lineage analysis as well as for mixture analysis. Performance of the custom SNP probe capture NGS assay was characterized by analyzing read depth and heterozygote allele balance across 15 samples at 25 ng input DNA. Performance thresholds were established based on read depth ≥500X and heterozygote allele balance within ±10% deviation from 50:50, which was observed for 426 out of 451 SNPs. These 426 SNPs were analyzed in size selected samples (at ≤75 bp, ≤100 bp, ≤150 bp, ≤200 bp, and ≤250 bp) as well as mock degraded samples fragmented to an average of 150 bp. Samples selected for ≤75 bp exhibited 99-100% reportable SNPs across varied DNA amounts and as low as 0.5 ng. 
    Mock degraded samples at 1 ng and 10 ng exhibited >90% reportable SNPs. Finally, two-person male-male mixtures were tested at 10 ng in varying contributor ratios. Overall, 85-100% of alleles unique to the minor contributor were observed at all mixture ratios. Results from these studies using the SNP probe capture NGS system demonstrate proof of concept for application to forensically relevant degraded and mixed DNA samples. Copyright © 2018 Elsevier B.V. All rights reserved.
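
    The performance thresholds described above (read depth ≥500X and heterozygote allele balance within ±10% of 50:50) can be expressed as a simple per-SNP filter. This is a minimal sketch, not the authors' pipeline: it assumes "±10% deviation" means a reference-allele read fraction between 0.40 and 0.60, and the function names and read counts are hypothetical.

```python
MIN_DEPTH = 500          # read depth threshold stated in the abstract (>=500X)
MAX_BALANCE_DEV = 0.10   # ASSUMED reading: allele fraction within 0.50 +/- 0.10


def allele_balance(ref_reads, alt_reads):
    """Fraction of reads supporting the reference allele at a heterozygous SNP."""
    total = ref_reads + alt_reads
    if total == 0:
        raise ValueError("no reads at this SNP")
    return ref_reads / total


def passes_thresholds(ref_reads, alt_reads):
    """True if a heterozygous SNP meets both the depth and balance thresholds."""
    depth = ref_reads + alt_reads
    if depth < MIN_DEPTH:
        return False
    return abs(allele_balance(ref_reads, alt_reads) - 0.5) <= MAX_BALANCE_DEV


# Hypothetical read counts (ref, alt) for three heterozygous SNPs.
for ref, alt in [(300, 300), (200, 200), (700, 300)]:
    print(ref, alt, passes_thresholds(ref, alt))
```

    In this sketch the depth check runs first, so a SNP with no coverage is reported as failing rather than raising an error.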

  11. Twenty years of artificial directional selection have shaped the genome of the Italian Large White pig breed.

    PubMed

    Schiavo, G; Galimberti, G; Calò, D G; Samorè, A B; Bertolini, F; Russo, V; Gallo, M; Buttazzoni, L; Fontanesi, L

    2016-04-01

    In this study, we investigated at the genome-wide level whether 20 years of artificial directional selection, based on boar genetic evaluation obtained with a classical BLUP animal model, shaped the genome of the Italian Large White pig breed. The most influential boars of this breed (n = 192), born from 1992 (the beginning of the selection program of this breed) to 2012, with an estimated breeding value reliability of >0.85, were genotyped with the Illumina Porcine SNP60 BeadChip. After grouping the boars in eight classes according to their year of birth, filtered single nucleotide polymorphisms (SNPs) were used to evaluate the effects of time on genotype frequency changes using multinomial logistic regression models. Of these markers, 493 had a Bonferroni-corrected P < 0.10. However, there was an increasing number of SNPs with a decreasing level of allele frequency changes over time, representing a continuous profile across the genome. The largest proportion of the 493 SNPs was on porcine chromosome (SSC) 7, SSC2, SSC8 and SSC18 for a total of 204 haploblocks. Functional annotations of genomic regions, including the 493 shifted SNPs, reported a few Gene Ontology terms that might underlie the biological processes that contributed to the increased performance of the pigs over the 20 years of the selection program. The obtained results indicated that the genome of the Italian Large White pigs was shaped by a directional selection program derived from the application of methodologies assuming the infinitesimal model, which captured a continuous trend of allele frequency changes in the boar population. © 2015 Stichting International Foundation for Animal Genetics.

  12. Evidence for negative selection of gene variants that increase dependence on dietary choline in a Gambian cohort

    PubMed Central

    Silver, Matt J.; Corbin, Karen D.; Hellenthal, Garrett; da Costa, Kerry-Ann; Dominguez-Salas, Paula; Moore, Sophie E.; Owen, Jennifer; Prentice, Andrew M.; Hennig, Branwen J.; Zeisel, Steven H.

    2015-01-01

    Choline is an essential nutrient, and the amount needed in the diet is modulated by several factors. Given geographical differences in dietary choline intake and disparate frequencies of single-nucleotide polymorphisms (SNPs) in choline metabolism genes between ethnic groups, we tested the hypothesis that 3 SNPs that increase dependence on dietary choline would be under negative selection pressure in settings where choline intake is low: choline dehydrogenase (CHDH) rs12676, methylenetetrahydrofolate dehydrogenase 1 (MTHFD1) rs2236225, and phosphatidylethanolamine-N-methyltransferase (PEMT) rs12325817. Evidence of negative selection was assessed in 2 populations: one in The Gambia, West Africa, where there is historic evidence of a choline-poor diet, and the other in the United States, with a comparatively choline-rich diet. We used 2 independent methods, and confirmation of our hypothesis was sought via a comparison with SNP data from the Maasai, an East African population with a genetic background similar to that of Gambians but with a traditional diet that is higher in choline. Our results show that frequencies of SNPs known to increase dependence on dietary choline are significantly reduced in the low-choline setting of The Gambia. Our findings suggest that adequate intake levels of choline may have to be reevaluated in different ethnic groups and highlight a possible approach for identifying novel functional SNPs under the influence of dietary selective pressure. PMID:25921832

  13. Whole-Genome Resequencing of Experimental Populations Reveals Polygenic Basis of Egg-Size Variation in Drosophila melanogaster

    PubMed Central

    Jha, Aashish R.; Miles, Cecelia M.; Lippert, Nodia R.; Brown, Christopher D.; White, Kevin P.; Kreitman, Martin

    2015-01-01

    Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. PMID:26044351

  14. Genome-wide association study on growth traits in Colombian creole breeds and crossbreeds with Zebu cattle.

    PubMed

    Martínez, R; Gómez, Y; Rocha, J F M

    2014-08-25

    Whole genome selection represents an important tool for improving parameters related to the production of livestock. In order to build genomic selection indexes within a particular breed, it is important to identify polymorphisms that have the most significant association with a desired trait. A genome-wide marker association approach based on the Illumina BovineSNP50 BeadChip(TM) was used to identify genomic regions affecting birth weight (BW), weaning weight (WW), and daily weight gain (DWG) in purebred and crossbred creole cattle populations. We genotyped 654 individuals of Blanco Orejinegro (BON), Romosinuano (ROMO) and Cebú breeds and the crossbreeds BON x Cebú and ROMO x Cebú, and tested 5 genetic control models. In total, 85 single nucleotide polymorphisms (SNPs) were related (P < 0.05) to the 3 evaluated traits; BW was associated with the highest number of SNPs. For statistical false-positive correction, Bonferroni correction was used. From the results, we identified 7, 6, and 4 SNPs with strong associations with BW, WW, and DWG, respectively. Many of these SNPs were located on important coding regions of the bovine genome; their ontology and interactions are discussed herein. The results could contribute to the identification of genes involved in the physiology of beef cattle growth and the development of new strategies for breeding management via genomic selection to improve the productivity of creole cattle herds.
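
    The Bonferroni correction mentioned above divides the significance level by the number of tests, or equivalently multiplies each p-value by that number. The sketch below shows the standard procedure only: the abstract does not report the number of tests or the individual p-values, so the values and function names here are made up for illustration.

```python
def bonferroni_adjust(p_values):
    """Return Bonferroni-adjusted p-values: min(1, p * number_of_tests)."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def bonferroni_significant(p_values, alpha=0.05):
    """Flag tests whose raw p-value is at or below alpha / number_of_tests."""
    threshold = alpha / len(p_values)
    return [p <= threshold for p in p_values]


# Hypothetical raw p-values for four SNP-trait tests (not from the study).
raw = [0.001, 0.02, 0.04, 0.2]
print(bonferroni_adjust(raw))
print(bonferroni_significant(raw))
```

    With four tests the per-test threshold is 0.05 / 4 = 0.0125, so only the first hypothetical p-value stays significant even though three of the four are below the nominal 0.05.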

  15. Selectivity in Genetic Association with Sub-classified Migraine in Women

    PubMed Central

    Chasman, Daniel I.; Anttila, Verneri; Buring, Julie E.; Ridker, Paul M.; Schürks, Markus; Kurth, Tobias

    2014-01-01

    Migraine can be sub-classified not only according to presence of migraine aura (MA) or absence of migraine aura (MO), but also by additional features accompanying migraine attacks, e.g. photophobia, phonophobia, nausea, etc. all of which are formally recognized by the International Classification of Headache Disorders. It remains unclear how aura status and the other migraine features may be related to underlying migraine pathophysiology. Recent genome-wide association studies (GWAS) have identified 12 independent loci at which single nucleotide polymorphisms (SNPs) are associated with migraine. Using a likelihood framework, we explored the selective association of these SNPs with migraine, sub-classified according to aura status and the other features in a large population-based cohort of women including 3,003 active migraineurs and 18,108 free of migraine. Five loci met stringent significance for association with migraine, among which four were selective for sub-classified migraine, including rs11172113 (LRP1) for MO. The number of loci associated with migraine increased to 11 at suggestive significance thresholds, including five additional selective associations for MO but none for MA. No two SNPs showed similar patterns of selective association with migraine characteristics. At one extreme, SNPs rs6790925 (near TGFBR2) and rs2274316 (MEF2D) were not associated with migraine overall, MA, or MO but were selective for migraine sub-classified by the presence of one or more of the additional migraine features. In contrast, SNP rs7577262 (TRPM8) was associated with migraine overall and showed little or no selectivity for any of the migraine characteristics. 
    The results emphasize the multivalent nature of migraine pathophysiology and suggest that a complete understanding of the genetic influence on migraine may benefit from analyses that stratify migraine according to both aura status and the additional diagnostic features used for clinical characterization of migraine. PMID:24852292

  16. Development of a single nucleotide polymorphism barcode to genotype Plasmodium vivax infections.

    PubMed

    Baniecki, Mary Lynn; Faust, Aubrey L; Schaffner, Stephen F; Park, Daniel J; Galinsky, Kevin; Daniels, Rachel F; Hamilton, Elizabeth; Ferreira, Marcelo U; Karunaweera, Nadira D; Serre, David; Zimmerman, Peter A; Sá, Juliana M; Wellems, Thomas E; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E; Volkman, Sarah K; Wirth, Dyann F; Sabeti, Pardis C

    2015-03-01

    Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25-40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections.

  17. Development of a Single Nucleotide Polymorphism Barcode to Genotype Plasmodium vivax Infections

    PubMed Central

    Baniecki, Mary Lynn; Faust, Aubrey L.; Schaffner, Stephen F.; Park, Daniel J.; Galinsky, Kevin; Daniels, Rachel F.; Hamilton, Elizabeth; Ferreira, Marcelo U.; Karunaweera, Nadira D.; Serre, David; Zimmerman, Peter A.; Sá, Juliana M.; Wellems, Thomas E.; Musset, Lise; Legrand, Eric; Melnikov, Alexandre; Neafsey, Daniel E.; Volkman, Sarah K.; Wirth, Dyann F.; Sabeti, Pardis C.

    2015-01-01

    Plasmodium vivax, one of the five species of Plasmodium parasites that cause human malaria, is responsible for 25–40% of malaria cases worldwide. Malaria global elimination efforts will benefit from accurate and effective genotyping tools that will provide insight into the population genetics and diversity of this parasite. The recent sequencing of P. vivax isolates from South America, Africa, and Asia presents a new opportunity by uncovering thousands of novel single nucleotide polymorphisms (SNPs). Genotyping a selection of these SNPs provides a robust, low-cost method of identifying parasite infections through their unique genetic signature or barcode. Based on our experience in generating a SNP barcode for P. falciparum using High Resolution Melting (HRM), we have developed a similar tool for P. vivax. We selected globally polymorphic SNPs from available P. vivax genome sequence data that were located in putatively selectively neutral sites (i.e., intergenic, intronic, or 4-fold degenerate coding). From these candidate SNPs we defined a barcode consisting of 42 SNPs. We analyzed the performance of the 42-SNP barcode on 87 P. vivax clinical samples from parasite populations in South America (Brazil, French Guiana), Africa (Ethiopia) and Asia (Sri Lanka). We found that the P. vivax barcode is robust, as it requires only a small quantity of DNA (limit of detection 0.3 ng/μl) to yield reproducible genotype calls, and detects polymorphic genotypes with high sensitivity. The markers are informative across all clinical samples evaluated (average minor allele frequency > 0.1). Population genetic and statistical analyses show the barcode captures high degrees of population diversity and differentiates geographically distinct populations. Our 42-SNP barcode provides a robust, informative, and standardized genetic marker set that accurately identifies a genomic signature for P. vivax infections. PMID:25781890

  18. Genome-Wide Association Study of Seed Dormancy and the Genomic Consequences of Improvement Footprints in Rice (Oryza sativa L.)

    PubMed Central

    Lu, Qing; Niu, Xiaojun; Zhang, Mengchen; Wang, Caihong; Xu, Qun; Feng, Yue; Yang, Yaolong; Wang, Shan; Yuan, Xiaoping; Yu, Hanyong; Wang, Yiping; Chen, Xiaoping; Liang, Xuanqiang; Wei, Xinghua

    2018-01-01

    Seed dormancy is an important agronomic trait affecting grain yield and quality because of pre-harvest germination and is influenced by both environmental and genetic factors. However, our knowledge of the factors controlling seed dormancy remains limited. To better reveal the molecular mechanism underlying this trait, a genome-wide association study was conducted in an indica-only population consisting of 453 accessions genotyped using 5,291 SNPs. Nine known and new significant SNPs were identified on eight chromosomes. These lead SNPs explained 34.9% of the phenotypic variation, and four of them were designed as dCAPS markers in the hope of accelerating molecular breeding. Moreover, a total of 212 candidate genes was predicted and eight candidate genes showed plant tissue-specific expression in expression profile data from different public bioinformatics databases. In particular, LOC_Os03g10110, which had a maize homolog involved in embryo development, was identified as a candidate regulator for further biological function investigations. Additionally, a polymorphism information content ratio method was used to screen improvement footprints and 27 selective sweeps were identified, most of which harbored domestication-related genes. Further studies suggested that three significant SNPs were adjacent to the candidate selection signals, supporting the accuracy of our genome-wide association study (GWAS) results. These findings show that genome-wide screening for selective sweeps can be used to identify new improvement-related DNA regions, although the phenotypes are unknown. This study enhances our knowledge of the genetic variation in seed dormancy, and the new dormancy-associated SNPs will provide real benefits in molecular breeding. PMID:29354150

  19. Genetic Risk Score Mendelian Randomization Shows that Obesity Measured as Body Mass Index, but not Waist:Hip Ratio, Is Causal for Endometrial Cancer.

    PubMed

    Painter, Jodie N; O'Mara, Tracy A; Marquart, Louise; Webb, Penelope M; Attia, John; Medland, Sarah E; Cheng, Timothy; Dennis, Joe; Holliday, Elizabeth G; McEvoy, Mark; Scott, Rodney J; Ahmed, Shahana; Healey, Catherine S; Shah, Mitul; Gorman, Maggie; Martin, Lynn; Hodgson, Shirley V; Beckmann, Matthias W; Ekici, Arif B; Fasching, Peter A; Hein, Alexander; Rübner, Matthias; Czene, Kamila; Darabi, Hatef; Hall, Per; Li, Jingmei; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo B; Amant, Frederic; Annibali, Daniela; Depreeuw, Jeroen; Lambrechts, Diether; Neven, Patrick; Cunningham, Julie M; Dowdy, Sean C; Goode, Ellen L; Fridley, Brooke L; Winham, Stacey J; Njølstad, Tormund S; Salvesen, Helga B; Trovik, Jone; Werner, Henrica M J; Ashton, Katie A; Otton, Geoffrey; Proietto, Anthony; Mints, Miriam; Tham, Emma; Bolla, Manjeet K; Michailidou, Kyriaki; Wang, Qin; Tyrer, Jonathan P; Hopper, John L; Peto, Julian; Swerdlow, Anthony J; Burwinkel, Barbara; Brenner, Hermann; Meindl, Alfons; Brauch, Hiltrud; Lindblom, Annika; Chang-Claude, Jenny; Couch, Fergus J; Giles, Graham G; Kristensen, Vessela N; Cox, Angela; Pharoah, Paul D P; Tomlinson, Ian; Dunning, Alison M; Easton, Douglas F; Thompson, Deborah J; Spurdle, Amanda B

    2016-11-01

    The strongest known risk factor for endometrial cancer is obesity. To determine whether SNPs associated with increased body mass index (BMI) or waist-hip ratio (WHR) are associated with endometrial cancer risk, independent of measured BMI, we investigated relationships between 77 BMI and 47 WHR SNPs and endometrial cancer in 6,609 cases and 37,926 country-matched controls. Logistic regression analysis and fixed effects meta-analysis were used to test for associations between endometrial cancer risk and (i) individual BMI or WHR SNPs, (ii) a combined weighted genetic risk score (wGRS) for BMI or WHR. Causality of BMI for endometrial cancer was assessed using Mendelian randomization, with BMIwGRS as instrumental variable. The BMIwGRS was significantly associated with endometrial cancer risk (P = 3.4 × 10⁻¹⁷). Scaling the effect of the BMIwGRS on endometrial cancer risk by its effect on BMI, the endometrial cancer OR per 5 kg/m² of genetically predicted BMI was 2.06 [95% confidence interval (CI), 1.89-2.21], larger than the observed effect of BMI on endometrial cancer risk (OR = 1.55; 95% CI, 1.44-1.68, per 5 kg/m²). The association attenuated but remained significant after adjusting for BMI (OR = 1.22; 95% CI, 1.10-1.39; P = 5.3 × 10⁻⁴). There was evidence of directional pleiotropy (P = 1.5 × 10⁻⁴). BMI SNP rs2075650 was associated with endometrial cancer at study-wide significance (P < 4.0 × 10⁻⁴), independent of BMI. Endometrial cancer was not significantly associated with individual WHR SNPs or the WHRwGRS. BMI, but not WHR, is causally associated with endometrial cancer risk, with evidence that some BMI-associated SNPs alter endometrial cancer risk via mechanisms other than measurable BMI. The causal association between BMI SNPs and endometrial cancer has possible implications for endometrial cancer risk modeling. Cancer Epidemiol Biomarkers Prev; 25(11); 1503-10. ©2016 AACR.
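
    The scaling step described in this abstract (dividing the score's effect on the outcome by its effect on the exposure) corresponds to what is usually called a Wald ratio estimate. The following is a minimal sketch under that assumption; the function name, its parameters, and the input values are hypothetical and are not taken from the study.

```python
import math


def wald_ratio_or(beta_score_outcome, beta_score_exposure, exposure_units=5.0):
    """Odds ratio per `exposure_units` of genetically predicted exposure.

    beta_score_outcome:  log odds ratio of the outcome per unit of the genetic score
    beta_score_exposure: change in the exposure (e.g. kg/m^2) per unit of the score
    """
    if beta_score_exposure == 0:
        raise ValueError("the genetic score has no effect on the exposure")
    # Wald ratio: log odds of the outcome per one unit of the exposure.
    log_or_per_unit = beta_score_outcome / beta_score_exposure
    return math.exp(exposure_units * log_or_per_unit)


# Hypothetical inputs for illustration only (not the study's estimates).
example_or = wald_ratio_or(beta_score_outcome=0.10, beta_score_exposure=1.0)
```

    A confidence interval would additionally need the standard errors of both effects, which this sketch leaves out.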

  20. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies

    PubMed Central

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-01-01

    The joint inference of selection and past demography remains a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and considering both FST and the difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main targets of selection, such as PaPRR3 and PaGI. PMID:27172202
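
    The two per-SNP quantities mentioned in this abstract, FST and the allele frequency difference between the two domains, can be illustrated with a short sketch. It uses the textbook Wright's FST for two equally weighted populations at a biallelic site, which is an assumption: the abstract does not state which estimator was used, and corrections for pool size and sequencing depth are ignored. The function name is hypothetical.

```python
def fst_and_delta(p1, p2):
    """Return (FST, |p1 - p2|) for one biallelic SNP.

    p1, p2: frequency of the same allele in population 1 and population 2.
    """
    for p in (p1, p2):
        if not 0.0 <= p <= 1.0:
            raise ValueError("allele frequencies must lie in [0, 1]")
    delta = abs(p1 - p2)
    p_bar = (p1 + p2) / 2.0
    h_total = 2.0 * p_bar * (1.0 - p_bar)                        # pooled heterozygosity
    h_within = (2.0 * p1 * (1.0 - p1) + 2.0 * p2 * (1.0 - p2)) / 2.0  # mean within-population
    if h_total == 0.0:
        # Both populations are fixed for the same allele: no differentiation.
        return 0.0, delta
    return (h_total - h_within) / h_total, delta


# Hypothetical allele frequencies for illustration only.
fst, delta = fst_and_delta(0.2, 0.8)
```

    Ranking SNPs on both values together, rather than on FST alone, is one way to read the combined criterion the abstract alludes to.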

  1. Identifying Genetic Signatures of Natural Selection Using Pooled Population Sequencing in Picea abies.

    PubMed

    Chen, Jun; Källman, Thomas; Ma, Xiao-Fei; Zaina, Giusi; Morgante, Michele; Lascoux, Martin

    2016-07-07

    The joint inference of selection and past demography remains a costly and demanding task. We used next generation sequencing of two pools of 48 Norway spruce mother trees, one corresponding to the Fennoscandian domain, and the other to the Alpine domain, to assess nucleotide polymorphism at 88 nuclear genes. These genes are candidate genes for phenological traits, and most belong to the photoperiod pathway. Estimates of population genetic summary statistics from the pooled data are similar to previous estimates, suggesting that pooled sequencing is reliable. The nonsynonymous SNPs tended to have both lower frequency differences and lower FST values between the two domains than silent ones. These results suggest the presence of purifying selection. The divergence between the two domains based on synonymous changes was around 5 million yr, a time similar to a recent phylogenetic estimate of 6 million yr, but much larger than earlier estimates based on isozymes. Two approaches, one of them novel and considering both FST and the difference in allele frequencies between the two domains, were used to identify SNPs potentially under diversifying selection. SNPs from around 20 genes were detected, including genes previously identified as main targets of selection, such as PaPRR3 and PaGI. Copyright © 2016 Chen et al.

  2. Genomic data for 78 chickens from 14 populations

    PubMed Central

    Li, Diyan; Che, Tiandong; Chen, Binlong; Tian, Shilin; Zhou, Xuming; Zhang, Guolong; Li, Miao; Gaur, Uma; Li, Yan; Luo, Majing; Zhang, Long; Xu, Zhongxian; Zhao, Xiaoling; Yin, Huadong; Wang, Yan; Jin, Long; Tang, Qianzi; Xu, Huailiang; Yang, Mingyao; Zhou, Rongjia; Li, Ruiqiang

    2017-01-01

    Abstract Background: Since the domestication of the red jungle fowls (Gallus gallus; dating back to ∼10 000 B.P.) in Asia, domestic chickens (Gallus gallus domesticus) have been subjected to the combined effects of natural selection and human-driven artificial selection; this has resulted in marked phenotypic diversity in a number of traits, including behavior, body composition, egg production, and skin color. Population genomic variations through diversifying selection have not been fully investigated. Findings: The whole genomes of 78 domestic chickens were sequenced to an average of 18-fold coverage for each bird. By combining this data with publicly available genomes of five wild red jungle fowls and eight Xishuangbanna game fowls, we conducted a comprehensive comparative genomics analysis of 91 chickens from 17 populations. After aligning ∼21.30 gigabases (Gb) of high-quality data from each individual to the reference chicken genome, we identified ∼6.44 million (M) single nucleotide polymorphisms (SNPs) for each population. These SNPs included 1.10 M novel SNPs in 17 populations that were absent in the current chicken dbSNP (Build 145) entries. Conclusions: The current data is important for population genetics and further studies in chickens and will serve as a valuable resource for investigating diversifying selection and candidate genes for selective breeding in chickens. PMID:28431039

  3. Predicting attention-deficit/hyperactivity disorder severity from psychosocial stress and stress-response genes: a random forest regression approach

    PubMed Central

    van der Meer, D; Hoekstra, P J; van Donkelaar, M; Bralten, J; Oosterlaan, J; Heslenfeld, D; Faraone, S V; Franke, B; Buitelaar, J K; Hartman, C A

    2017-01-01

    Identifying genetic variants contributing to attention-deficit/hyperactivity disorder (ADHD) is complicated by the involvement of numerous common genetic variants with small effects, interacting with each other as well as with environmental factors, such as stress exposure. Random forest regression is well suited to explore this complexity, as it allows for the analysis of many predictors simultaneously, taking into account any higher-order interactions among them. Using random forest regression, we predicted ADHD severity, measured by Conners’ Parent Rating Scales, from 686 adolescents and young adults (of which 281 were diagnosed with ADHD). The analysis included 17 374 single-nucleotide polymorphisms (SNPs) across 29 genes previously linked to hypothalamic–pituitary–adrenal (HPA) axis activity, together with information on exposure to 24 individual long-term difficulties or stressful life events. The model explained 12.5% of variance in ADHD severity. The most important SNP, which also showed the strongest interaction with stress exposure, was located in a region regulating the expression of telomerase reverse transcriptase (TERT). Other high-ranking SNPs were found in or near NPSR1, ESR1, GABRA6, PER3, NR3C2 and DRD4. Chronic stressors were more influential than single, severe, life events. Top hits were partly shared with conduct problems. We conclude that random forest regression may be used to investigate how multiple genetic and environmental factors jointly contribute to ADHD. It is able to implicate novel SNPs of interest, interacting with stress exposure, and may explain inconsistent findings in ADHD genetics. This exploratory approach may be best combined with more hypothesis-driven research; top predictors and their interactions with one another should be replicated in independent samples. PMID:28585928

  4. Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

    PubMed

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.

  5. Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

    PubMed Central

    Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

    2014-01-01

    High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088

  6. Linkage disequilibrium and signatures of positive selection around LINE-1 retrotransposons in the human genome.

    PubMed

    Kuhn, Alexandre; Ong, Yao Min; Cheng, Ching-Yu; Wong, Tien Yin; Quake, Stephen R; Burkholder, William F

    2014-06-03

    Insertions of the human-specific subfamily of LINE-1 (L1) retrotransposon are highly polymorphic across individuals and can critically influence the human transcriptome. We hypothesized that L1 insertions could represent genetic variants determining important human phenotypic traits, and performed an integrated analysis of L1 elements and single nucleotide polymorphisms (SNPs) in several human populations. We found that a large fraction of L1s were in high linkage disequilibrium with their surrounding genomic regions and that they were well tagged by SNPs. However, L1 variants were only partially captured by SNPs on standard SNP arrays, so that their potential phenotypic impact would be frequently missed by SNP array-based genome-wide association studies. We next identified potential phenotypic effects of L1s by looking for signatures of natural selection linked to L1 insertions; significant extended haplotype homozygosity was detected around several L1 insertions. This finding suggests that some of these L1 insertions may have been the target of recent positive selection.

  7. Accumulation of slightly deleterious mutations in the mitochondrial genome: a hallmark of animal domestication.

    PubMed

    Hughes, Austin L

    2013-02-15

    The hypothesis that domestication leads to a relaxation of purifying selection on mitochondrial (mt) genomes was tested by comparative analysis of mt genes from dog, pig, chicken, and silkworm. The three vertebrate species showed mt genome phylogenies in which domestic and wild isolates were intermingled, whereas the domestic silkworm (Bombyx mori) formed a distinct cluster nested within its closest wild relative (Bombyx mandarina). In spite of these differences in phylogenetic pattern, significantly greater proportions of nonsynonymous SNPs than of synonymous SNPs were unique to the domestic populations of all four species. Likewise, in all four species, significantly greater proportions of RNA-encoding SNPs than of synonymous SNPs were unique to the domestic populations. Thus, domestic populations were characterized by an excess of unique polymorphisms in two categories generally subject to purifying selection: nonsynonymous sites and RNA-encoding sites. Many of these unique polymorphisms thus seem likely to be slightly deleterious; the latter hypothesis was supported by the generally lower gene diversities of polymorphisms unique to domestic populations in comparison to those of polymorphisms shared by domestic and wild populations. Copyright © 2012 Elsevier B.V. All rights reserved.

  8. Medaka: a promising model animal for comparative population genomics

    PubMed Central

    Matsumoto, Yoshifumi; Oota, Hiroki; Asaoka, Yoichi; Nishina, Hiroshi; Watanabe, Koji; Bujnicki, Janusz M; Oda, Shoji; Kawamura, Shoji; Mitani, Hiroshi

    2009-01-01

    Background Within-species genome diversity has been best studied in humans. The international HapMap project has revealed a tremendous amount of single-nucleotide polymorphisms (SNPs) among humans, many of which show signals of positive selection during human evolution. In most of the cases, however, functional differences between the alleles remain experimentally unverified due to the inherent difficulty of human genetic studies. It would therefore be highly useful to have a vertebrate model with the following characteristics: (1) high within-species genetic diversity, (2) a variety of gene-manipulation protocols already developed, and (3) a completely sequenced genome. Medaka (Oryzias latipes) and its congeneric species, tiny fresh-water teleosts distributed broadly in East and Southeast Asia, meet these criteria. Findings Using Oryzias species from 27 local populations, we conducted a simple screening of nonsynonymous SNPs for 11 genes with apparent orthology between medaka and humans. We found medaka SNPs for which the same sites in human orthologs are known to be highly differentiated among the HapMap populations. Importantly, some of these SNPs show signals of positive selection. Conclusion These results indicate that medaka is a promising model system for comparative population genomics exploring the functional and adaptive significance of allelic differentiations. PMID:19426554

  9. GrigoraSNPs: Optimized Analysis of SNPs for DNA Forensics.

    PubMed

    Ricke, Darrell O; Shcherbina, Anna; Michaleas, Adam; Fremont-Smith, Philip

    2018-04-16

    High-throughput sequencing (HTS) of single nucleotide polymorphisms (SNPs) enables additional DNA forensic capabilities not attainable using traditional STR panels. However, the inclusion of sets of loci selected for mixture analysis, extended kinship, phenotype, biogeographic ancestry prediction, etc., can result in large panel sizes that are difficult to analyze in a rapid fashion. GrigoraSNPs was developed to address the allele-calling bottleneck encountered when analyzing SNP panels with more than 5000 loci using HTS. GrigoraSNPs uses MapReduce parallel data processing on multiple computational threads plus a novel locus-identification hashing strategy leveraging target sequence tags. This tool optimizes the SNP calling module of the DNA analysis pipeline with runtimes that scale linearly with the number of HTS reads. Results are compared with SNP analysis pipelines implemented with SAMtools and GATK. GrigoraSNPs removes a computational bottleneck for processing forensic samples with large HTS SNP panels. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.
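
    The general idea of identifying loci by hashing sequence tags and counting alleles in a map/reduce fashion can be sketched as follows. This is not the GrigoraSNPs implementation, whose details the abstract does not give; the tag length, the panel layout, and all names below are hypothetical choices made for illustration.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

TAG_LENGTH = 8  # hypothetical tag length

# Hypothetical panel: tag at the start of a read -> (locus name, SNP offset in the read).
PANEL = {
    "ACGTACGT": ("locus_1", 8),
    "TTGACCAA": ("locus_2", 8),
}


def map_reads(reads):
    """Map step: look up each read's tag in the hash table and count its allele."""
    counts = Counter()
    for read in reads:
        hit = PANEL.get(read[:TAG_LENGTH])
        if hit is None:
            continue  # read does not start with a known tag
        locus, offset = hit
        if offset < len(read):
            counts[(locus, read[offset])] += 1
    return counts


def count_alleles(reads, n_workers=2):
    """Split reads across threads, then reduce the partial counts into one Counter."""
    chunks = [reads[i::n_workers] for i in range(n_workers)]
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial_counts = list(pool.map(map_reads, chunks))
    total = Counter()
    for partial in partial_counts:
        total.update(partial)
    return total
```

    Each read costs one dictionary lookup, so the work grows linearly with the number of reads, in line with the scaling behavior the abstract reports.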

  10. BAC-end sequence-based SNPs and Bin mapping for rapid integration of physical and genetic maps in apple.

    PubMed

    Han, Yuepeng; Chagné, David; Gasic, Ksenija; Rikkerink, Erik H A; Beever, Jonathan E; Gardiner, Susan E; Korban, Schuyler S

    2009-03-01

    A genome-wide BAC physical map of the apple, Malus × domestica Borkh., has been recently developed. Here, we report on integrating the physical and genetic maps of the apple using a SNP-based approach in conjunction with bin mapping. Briefly, BAC clones located at ends of BAC contigs were selected, and sequenced at both ends. The BAC end sequences (BESs) were used to identify candidate SNPs. Subsequently, these candidate SNPs were genetically mapped using a bin mapping strategy for the purpose of mapping the physical onto the genetic map. Using this approach, 52 (23%) out of 228 BESs tested were successfully exploited to develop SNPs. These SNPs anchored 51 contigs, spanning approximately 37 Mb in cumulative physical length, onto 14 linkage groups. The reliability of the integration of the physical and genetic maps using this SNP-based strategy is described, and the results confirm the feasibility of this approach to construct integrated physical and genetic maps for apple.

  11. Associations between incident ischemic stroke events and stroke and cardiovascular disease-related genome-wide association studies single nucleotide polymorphisms in the Population Architecture Using Genomics and Epidemiology study.

    PubMed

    Carty, Cara L; Buzková, Petra; Fornage, Myriam; Franceschini, Nora; Cole, Shelley; Heiss, Gerardo; Hindorff, Lucia A; Howard, Barbara V; Mann, Sue; Martin, Lisa W; Zhang, Ying; Matise, Tara C; Prentice, Ross; Reiner, Alexander P; Kooperberg, Charles

    2012-04-01

    Genome-wide association studies (GWAS) have identified loci associated with ischemic stroke (IS) and cardiovascular disease (CVD) in European-descent individuals, but their replication in different populations has been largely unexplored. Nine single nucleotide polymorphisms (SNPs) selected from GWAS and meta-analyses of stroke, and 86 SNPs previously associated with myocardial infarction and CVD risk factors, including blood lipids (high density lipoprotein [HDL], low density lipoprotein [LDL], and triglycerides), type 2 diabetes, and body mass index (BMI), were investigated for associations with incident IS in European Americans (EA) N=26 276, African-Americans (AA) N=8970, and American Indians (AI) N=3570 from the Population Architecture using Genomics and Epidemiology Study. Ancestry-specific fixed effects meta-analysis with inverse variance weighting was used to combine study-specific log hazard ratios from Cox proportional hazards models. Two of 9 stroke SNPs (rs783396 and rs1804689) were significantly associated with IS hazard in AA; none were significant in this large EA cohort. Of 73 CVD risk factor SNPs tested in EA, 2 (HDL and triglycerides SNPs) were associated with IS. In AA, SNPs associated with LDL, HDL, and BMI were significantly associated with IS (3 of 86 SNPs tested). Out of 58 SNPs tested in AI, 1 LDL SNP was significantly associated with IS. Our analyses showing lack of replication in spite of reasonable power for many stroke SNPs and differing results by ancestry highlight the need to follow up on GWAS findings and conduct genetic association studies in diverse populations. We found modest IS associations with BMI and lipids SNPs, though these findings require confirmation.
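
    Fixed effects meta-analysis with inverse variance weighting, as named in this abstract, combines per-study log hazard ratios by weighting each one by the reciprocal of its squared standard error. A minimal sketch of that standard calculation follows; the function name and the input values are hypothetical and are not results from the study.

```python
import math


def fixed_effects_meta(log_hrs, std_errs):
    """Inverse-variance-weighted pooled log hazard ratio and its standard error."""
    if not log_hrs or len(log_hrs) != len(std_errs):
        raise ValueError("need one standard error per log hazard ratio")
    if any(se <= 0 for se in std_errs):
        raise ValueError("standard errors must be positive")
    weights = [1.0 / se ** 2 for se in std_errs]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, log_hrs)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    return pooled, pooled_se


# Hypothetical per-study estimates for illustration only.
pooled_log_hr, pooled_se = fixed_effects_meta([0.2, 0.4], [0.1, 0.1])
pooled_hr = math.exp(pooled_log_hr)
```

    Running the same function separately on the studies from each ancestry group would give the ancestry-specific estimates the abstract describes.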

  12. A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.

    PubMed

    Wang, Yi-Ting; Sung, Pei-Yuan; Lin, Peng-Lin; Yu, Ya-Wen; Chung, Ren-Hua

    2015-05-15

    Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of an individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value = 2.5 × 10⁻⁶). Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease.

  13. Colorectal cancer-susceptibility single-nucleotide polymorphisms in Korean population.

    PubMed

    Hong, Sung Noh; Park, Changho; Kim, Jong-Il; Kim, Duk-Hwan; Kim, Hee Cheol; Chang, Dong Kyung; Rhee, Poong-Lyul; Kim, Jae J; Rhee, Jong Chul; Son, Hee Jung; Kim, Young-Ho

    2015-05-01

    Considering the significant racial and ethnic diversity in genetic variation, it is unclear whether the genome-wide association studies-identified colorectal cancer (CRC)-susceptibility single-nucleotide polymorphisms (SNPs) discovered in European populations are also relevant to the Korean population. However, studies on CRC-susceptibility SNPs in Koreans are limited. To investigate the racial and ethnic diversity of CRC-susceptibility genetic variants, we genotyped for the established European CRC-susceptibility SNPs in 198 CRC cases and 329 controls in Korea. To identify novel genetic variants using genome-wide screening in Korea, Illumina HumanHap 370K/610K BeadChips were performed on 105 CRC patients, and candidate CRC-susceptibility SNPs were selected. Subsequently, genotyping for replication was done in 189 CRC cases and 190 controls. Among the European CRC-susceptibility SNPs, rs4939827 in SMAD7 was associated with a significantly decreased risk of Korean CRC (age-/gender-adjusted odds ratio [95% confidence interval]: additive model, 0.67 [95% CI, 0.47-0.95]; dominant model, 0.59 [95% CI, 0.39-0.91]). rs4779584 and rs10795668 were associated with CRC risk in females and males, respectively. Among candidate CRC-susceptibility SNPs selected from genome-wide screening, a novel SNP, rs17051076, was found to be associated with a significantly increased risk of microsatellite instability-high CRC (age-/gender-adjusted odds ratio [95% confidence interval]: additive model, 4.25 [95% CI, 1.51-11.98]; dominant model, 3.52 [95% CI, 1.13-10.94]) in the replication study. rs4939827, rs4779584, and rs10795668 may contribute to the risk of CRC in the Korean population as well as in European populations. Novel rs17051076 could be associated with microsatellite instability-high CRC in Koreans. These associations support the ethnic diversity of CRC-susceptibility SNPs and should be taken into account in large-scale studies. © 2013 Journal of Gastroenterology and Hepatology Foundation and Wiley Publishing Asia Pty Ltd.

  14. Single nucleotide polymorphisms associated with thermoregulation in lactating dairy cows exposed to heat stress.

    PubMed

    Dikmen, S; Wang, X-z; Ortega, M S; Cole, J B; Null, D J; Hansen, P J

    2015-12-01

    Dairy cows with increased rectal temperature experience lower milk yield and fertility. Rectal temperature during heat stress is heritable, so genetic selection for body temperature regulation could reduce effects of heat stress on production. One aim of the study was to validate the relationship between genotype and heat tolerance for single nucleotide polymorphisms (SNPs) previously associated with resistance to heat stress. A second aim was to identify new SNPs associated with heat stress resistance. Thermotolerance was assessed in lactating Holsteins during the summer by measuring rectal temperature (a direct measurement of body temperature regulation; n = 435), respiration rate (an indirect measurement of body temperature regulation, n = 450) and sweating rate (the major evaporative cooling mechanism in cattle, n = 455). The association between genotype and thermotolerance was evaluated for 19 SNPs previously associated with rectal temperature from a genomewide analysis study (GWAS), four SNPs previously associated with change in milk yield during heat stress from GWAS, 2 candidate gene SNPs previously associated with rectal temperature and respiration rate during heat stress (ATPA1A and HSP70A) and 66 SNPs in genes previously shown to be associated with reproduction, production or health traits in Holsteins. For SNPs previously associated with heat tolerance, regions of BTA4, BTA6 and BTA24 were associated with rectal temperature; regions of BTA6 and BTA24 were associated with respiration rate; and regions of BTA5, BTA26 and BTA29 were associated with sweating rate. New SNPs were identified for rectal temperature (n = 12), respiration rate (n = 8) and sweating rate (n = 3) from among those previously associated with production, reproduction or health traits. The SNPs that explained the most variation were PGR and ASL for rectal temperature, ACAT2 and HSD17B7 for respiration rate, and ARL6IP1 and SERPINE2 for sweating rate. ARL6IP1 was associated with all three thermotolerance traits. In conclusion, specific genetic markers responsible for genetic variation in thermoregulation during heat stress in Holsteins were identified. These markers may prove useful in genetic selection for heat tolerance in Holstein cattle. © 2015 Blackwell Verlag GmbH.

  15. Ultra-low-density genotype panels for breed assignment of Angus and Hereford cattle.

    PubMed

    Judge, M M; Kelleher, M M; Kearney, J F; Sleator, R D; Berry, D P

    2017-06-01

    Angus and Hereford beef is marketed internationally for apparent superior meat quality attributes; DNA-based breed authenticity could be a useful instrument to ensure consumer confidence in premium meat products. The objective of this study was to develop an ultra-low-density genotype panel to accurately quantify the Angus and Hereford breed proportion in biological samples. Medium-density genotypes (13 306 single nucleotide polymorphisms (SNPs)) were available on 54 703 commercial and 4042 purebred animals. The breed proportion of the commercial animals was generated from the medium-density genotypes and this estimate was regarded as the gold-standard breed composition. Ten genotype panels (100 to 1000 SNPs) were developed from the medium-density genotypes; five methods were used to identify the most informative SNPs and these included the Delta statistic, the fixation (Fst) statistic and an index of both. Breed assignment analyses were undertaken for each breed, panel density and SNP selection method separately with a programme to infer population structure using the entire 13 306 SNP panel (representing the gold-standard measure). Breed assignment was undertaken for all commercial animals (n=54 703), animals deemed to contain some proportion of Angus based on pedigree (n=5740) and animals deemed to contain some proportion of Hereford based on pedigree (n=5187). The predicted breed proportion of all animals from the lower density panels was then compared with the gold-standard breed prediction. Panel density, SNP selection method and breed all had a significant effect on the correlation of predicted and actual breed proportion. Regardless of breed, the Index method of SNP selection numerically (but not significantly) outperformed all other selection methods in accuracy (i.e. correlation and root mean square of prediction) when panel density was ⩾300 SNPs. The correlation between actual and predicted breed proportion increased as panel density increased.
Using 300 SNPs (selected using the global index method), the correlation between predicted and actual breed proportion was 0.993 and 0.995 in the Angus and Hereford validation populations, respectively. When SNP panels optimised for breed prediction in one population were used to predict the breed proportion of a separate population, the correlation between predicted and actual breed proportion was 0.034 and 0.044 weaker in the Hereford and Angus populations, respectively (using the 300 SNP panel). It is necessary to include at least 300 to 400 SNPs (per breed) on genotype panels to accurately predict breed proportion from biological samples.
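
    A minimal sketch of how SNPs might be ranked for such a panel, assuming only per-breed allele frequencies are available. The abstract names the Delta statistic, the fixation statistic and an index of both but gives no formulas, so the definitions below (absolute allele-frequency difference, Wright's two-population fixation index with equal weights, and a plain average of the two) and all function names are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def delta_statistic(p_a, p_b):
    """Absolute allele-frequency difference between two breeds, per SNP."""
    return np.abs(np.asarray(p_a, float) - np.asarray(p_b, float))

def fst_two_pops(p_a, p_b):
    """Wright's fixation index per biallelic SNP, two equally weighted breeds."""
    p_a, p_b = np.asarray(p_a, float), np.asarray(p_b, float)
    p_bar = (p_a + p_b) / 2.0
    h_t = 2.0 * p_bar * (1.0 - p_bar)                      # total expected heterozygosity
    h_s = (2 * p_a * (1 - p_a) + 2 * p_b * (1 - p_b)) / 2  # mean within-breed heterozygosity
    with np.errstate(divide="ignore", invalid="ignore"):
        # SNPs that are monomorphic in both breeds carry no information: score 0
        return np.where(h_t > 0, (h_t - h_s) / h_t, 0.0)

def select_panel(p_a, p_b, n_snps, method="index"):
    """Return indices of the n_snps highest-scoring SNPs (hypothetical helper)."""
    delta = delta_statistic(p_a, p_b)
    fst = fst_two_pops(p_a, p_b)
    scores = {
        "delta": delta,
        "fst": fst,
        "index": (delta + fst) / 2.0,  # assumed form of the combined index
    }[method]
    # Stable sort on the negated score keeps the original SNP order for ties
    return np.argsort(-scores, kind="stable")[:n_snps]

# Toy allele frequencies for four SNPs in two breeds (made-up values)
angus = [0.9, 0.5, 0.1, 0.6]
hereford = [0.1, 0.5, 0.1, 0.4]
panel = select_panel(angus, hereford, n_snps=2)
```

    With these toy frequencies the first and fourth SNPs are selected under all three scores, because they are the only ones whose allele frequencies differ between the two breeds.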

  16. Correlates between Models of Virulence for Mycobacterium tuberculosis among Isolates of the Central Asian Lineage: a Case for Lysozyme Resistance Testing?

    PubMed Central

    Casali, Nicola; Clark, Simon O.; Hooper, Richard; Williams, Ann; Velji, Preya; Gonzalo, Ximena

    2015-01-01

    Virulence factors (VFs) contribute to the emergence of new human Mycobacterium tuberculosis strains, are lineage dependent, and are relevant to the development of M. tuberculosis drugs/vaccines. VFs were sought within M. tuberculosis lineage 3, which has the Central Asian (CAS) spoligotype. Three isolates were selected from clusters previously identified as dominant in London, United Kingdom. Strain-associated virulence was studied in guinea pig, monocyte-derived macrophage, and lysozyme resistance assays. Whole-genome sequencing, single nucleotide polymorphism (SNP) analysis, and a literature review contributed to the identification of SNPs of interest. The animal model revealed borderline differences in strain-associated pathogenicity. Ex vivo, isolate C72 exhibited statistically significant differences in intracellular growth relative to C6 and C14. SNP candidates inducing lower fitness levels included 123 unique nonsynonymous SNPs, including three located in genes (lysX, caeA, and ponA2) previously identified as VFs in the laboratory-adapted reference strain H37Rv and shown to confer lysozyme resistance. C72 growth was most affected by lysozyme in vitro. A BLAST search revealed that all three SNPs of interest (C35F, P76Q, and P780R) also occurred in Tiruvallur, India, and in Uganda. Unlike C72, however, no single isolate identified through BLAST carried all three SNPs simultaneously. CAS isolates representative of three medium-sized human clusters demonstrated differential outcomes in models commonly used to estimate strain-associated virulence, supporting the idea that virulence varies within, not just across, M. tuberculosis lineages. Three VF SNPs of interest were identified in two additional locations worldwide, which suggested independent selection and supported a role for these SNPs in virulence. The relevance of lysozyme resistance to strain virulence remains to be established. PMID:25776753

  17. Conservation genomics of anadromous Atlantic salmon across its North American range: outlier loci identify the same patterns of population structure as neutral loci.

    PubMed

    Moore, Jean-Sébastien; Bourret, Vincent; Dionne, Mélanie; Bradbury, Ian; O'Reilly, Patrick; Kent, Matthew; Chaput, Gérald; Bernatchez, Louis

    2014-12-01

    Anadromous Atlantic salmon (Salmo salar) is a species of major conservation and management concern in North America, where population abundance has been declining over the past 30 years. Effective conservation actions require the delineation of conservation units to appropriately reflect the spatial scale of intraspecific variation and local adaptation. Towards this goal, we used the most comprehensive genetic and genomic database for Atlantic salmon to date, covering the entire North American range of the species. The database included microsatellite data from 9142 individuals from 149 sampling locations and data from a medium-density SNP array providing genotypes for >3000 SNPs for 50 sampling locations. We used neutral and putatively selected loci to integrate adaptive information in the definition of conservation units. Bayesian clustering with the microsatellite data set and with neutral SNPs identified regional groupings largely consistent with previously published regional assessments. The use of outlier SNPs did not result in major differences in the regional groupings, suggesting that neutral markers can reflect the geographic scale of local adaptation despite not being under selection. We also performed assignment tests to compare power obtained from microsatellites, neutral SNPs and outlier SNPs. Using SNP data substantially improved power compared to microsatellites, and an assignment success of 97% to the population of origin and of 100% to the region of origin was achieved when all SNP loci were used. Using outlier SNPs only resulted in minor improvements to assignment success to the population of origin but improved regional assignment. We discuss the implications of these new genetic resources for the conservation and management of Atlantic salmon in North America. © 2014 John Wiley & Sons Ltd.

  18. Analysis of 60 reported glioma risk SNPs replicates published GWAS findings but fails to replicate associations from published candidate-gene studies.

    PubMed

    Walsh, Kyle M; Anderson, Erik; Hansen, Helen M; Decker, Paul A; Kosel, Matt L; Kollmeyer, Thomas; Rice, Terri; Zheng, Shichun; Xiao, Yuanyuan; Chang, Jeffrey S; McCoy, Lucie S; Bracci, Paige M; Wiemels, Joe L; Pico, Alexander R; Smirnov, Ivan; Lachance, Daniel H; Sicotte, Hugues; Eckel-Passow, Jeanette E; Wiencke, John K; Jenkins, Robert B; Wrensch, Margaret R

    2013-02-01

    Genomewide association studies (GWAS) and candidate-gene studies have implicated single-nucleotide polymorphisms (SNPs) in at least 45 different genes as putative glioma risk factors. Attempts to validate these associations have yielded variable results and few genetic risk factors have been consistently replicated. We conducted a case-control study of Caucasian glioma cases and controls from the University of California San Francisco (810 cases, 512 controls) and the Mayo Clinic (852 cases, 789 controls) in an attempt to replicate previously reported genetic risk factors for glioma. Sixty SNPs selected from the literature (eight from GWAS and 52 from candidate-gene studies) were successfully genotyped on an Illumina custom genotyping panel. Eight SNPs in/near seven different genes (TERT, EGFR, CCDC26, CDKN2A, PHLDB1, RTEL1, TP53) were significantly associated with glioma risk in the combined dataset (P < 0.05), with all associations in the same direction as in previous reports. Several SNP associations showed considerable differences across histologic subtype. All eight successfully replicated associations were first identified by GWAS, whereas none of the putative risk SNPs from candidate-gene studies was associated in the full case-control sample (all P values > 0.05). Although several confirmed associations are located near genes long known to be involved in gliomagenesis (e.g., EGFR, CDKN2A, TP53), these associations were first discovered by the GWAS approach and are in noncoding regions. These results highlight that the deficiencies of the candidate-gene approach lie in selecting both appropriate genes and relevant SNPs within these genes. © 2012 WILEY PERIODICALS, INC.

  19. Microsatellite genotyping and genome-wide single nucleotide polymorphism-based indices of Plasmodium falciparum diversity within clinical infections.

    PubMed

    Murray, Lee; Mobegi, Victor A; Duffy, Craig W; Assefa, Samuel A; Kwiatkowski, Dominic P; Laman, Eugene; Loua, Kovana M; Conway, David J

    2016-05-12

    In regions where malaria is endemic, individuals are often infected with multiple distinct parasite genotypes, a situation that may impact on evolution of parasite virulence and drug resistance. Most approaches to studying genotypic diversity have involved analysis of a modest number of polymorphic loci, although whole genome sequencing enables a broader characterisation of samples. PCR-based microsatellite typing of a panel of ten loci was performed on Plasmodium falciparum in 95 clinical isolates from a highly endemic area in the Republic of Guinea, to characterize within-isolate genetic diversity. Separately, single nucleotide polymorphism (SNP) data from genome-wide short-read sequences of the same samples were used to derive within-isolate fixation indices (Fws), an inverse measure of diversity within each isolate compared to overall local genetic diversity. The latter indices were compared with the microsatellite results, and also with indices derived by randomly sampling modest numbers of SNPs. As expected, the number of microsatellite loci with more than one allele in each isolate was highly significantly inversely correlated with the genome-wide Fws fixation index (r = -0.88, P < 0.001). However, the microsatellite analysis revealed that most isolates contained mixed genotypes, even those that had no detectable genome sequence heterogeneity. Random sampling of different numbers of SNPs showed that an Fws index derived from ten or more SNPs with minor allele frequencies of >10% had high correlation (r > 0.90) with the index derived using all SNPs. Different types of data give highly correlated indices of within-infection diversity, although PCR-based analysis detects low-level minority genotypes not apparent in bulk sequence analysis. When whole-genome data are not obtainable, quantitative assay of ten or more SNPs can yield a reasonably accurate estimate of the within-infection fixation index (Fws).
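
    The within-isolate fixation index is commonly defined as one minus the ratio of within-isolate heterozygosity to local population heterozygosity. The sketch below follows that definition in a simplified form (a ratio of mean heterozygosities across SNPs, rather than a fit across allele-frequency bins) and then repeats the calculation on a random subset of SNPs with minor allele frequency above 10%, as the abstract describes. The function names, the simplification and the toy frequencies are assumptions made for illustration; the abstract does not specify the computation.

```python
import numpy as np

def heterozygosity(p):
    """Expected heterozygosity 2p(1-p) for biallelic SNPs."""
    p = np.asarray(p, float)
    return 2.0 * p * (1.0 - p)

def fws(within_freq, pop_freq):
    """Simplified within-isolate fixation index: 1 - mean(Hw) / mean(Hs).

    within_freq: allele frequency of each SNP inside one isolate
                 (e.g. fraction of sequence reads carrying the allele)
    pop_freq:    allele frequency of the same SNPs in the local population
    """
    hw = heterozygosity(within_freq).mean()
    hs = heterozygosity(pop_freq).mean()
    return 1.0 - hw / hs

def fws_from_random_snps(within_freq, pop_freq, n_snps, min_maf=0.10, seed=0):
    """Fws from n_snps randomly drawn SNPs whose population MAF exceeds min_maf."""
    within_freq = np.asarray(within_freq, float)
    pop_freq = np.asarray(pop_freq, float)
    maf = np.minimum(pop_freq, 1.0 - pop_freq)
    eligible = np.flatnonzero(maf > min_maf)
    rng = np.random.default_rng(seed)
    # Raises ValueError if fewer than n_snps SNPs pass the MAF filter
    chosen = rng.choice(eligible, size=n_snps, replace=False)
    return fws(within_freq[chosen], pop_freq[chosen])

# Toy population frequencies for four SNPs (made-up values)
pop = np.array([0.5, 0.3, 0.2, 0.4])
clonal = np.array([0.0, 1.0, 0.0, 1.0])  # every SNP fixed within the isolate
```

    In this simplified form an isolate with a single genotype (every SNP fixed at 0 or 1) gives an index of 1, while an isolate whose internal allele frequencies match the population gives an index of 0.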

  20. Association of the GALNT2 gene polymorphisms and several environmental factors with serum lipid levels in the Mulao and Han populations

    PubMed Central

    2011-01-01

    Background The association of UDP-N-acetyl-alpha-D-galactosamine: polypeptide N-acetylgalactosaminyltransferase 2 gene (GALNT2) single nucleotide polymorphisms (SNPs) and serum lipid profiles in the general population is not well known. The present study was undertaken to detect the association of GALNT2 polymorphisms and several environmental factors with serum lipid levels in the Guangxi Mulao and Han populations. Method A total of 775 subjects of Mulao nationality and 699 participants of Han nationality were randomly selected from our stratified randomized cluster samples. Genotyping of the GALNT2 rs2144300 and rs4846914 SNPs was performed by polymerase chain reaction and restriction fragment length polymorphism combined with gel electrophoresis, and then confirmed by direct sequencing. Results There were no significant differences in the genotypic and allelic frequencies of both SNPs between the two ethnic groups, or between the males and females. The subjects with TT genotype of rs2144300 in Mulao had lower serum triglyceride (TG) levels than the subjects with CC genotype in females (P < 0.01). The participants with CT/TT genotype of rs2144300 in Han had lower TG and apolipoprotein (Apo) B levels, and higher high-density lipoprotein cholesterol (HDL-C), ApoA1 levels and the ratio of ApoA1 to ApoB in males; and higher low-density lipoprotein cholesterol (LDL-C) and ApoB levels in females than the participants with CC genotype (P < 0.05-0.001). The individuals with GA/AA genotype of rs4846914 in Mulao had higher total cholesterol (TC) and LDL-C levels than the individuals with GG genotype in males (P < 0.05 for each). The subjects with AA genotype of rs4846914 in Han had higher LDL-C and ApoB levels, and lower HDL-C levels and the ratio of ApoA1 to ApoB than the subjects with GG genotype (P < 0.05 for each). The levels of TC in Mulao were correlated with the genotypes of rs4846914 in males (P < 0.05). 
The levels of ApoA1 in Han were correlated with the genotypes of both SNPs, and the levels of HDL-C and ApoB and the ratio of ApoA1 to ApoB were associated with the genotypes of rs2144300 in males (P < 0.05-0.001). The levels of LDL-C in Han were correlated with the genotypes of rs4846914 in females (P < 0.05). Serum lipid parameters were also correlated with several environmental factors. Conclusions The associations of both GALNT2 rs2144300 and rs4846914 SNPs and serum lipid levels are different in the Mulao and Han populations. These discrepancies might partly result from different GALNT2 gene-environmental interactions. PMID:21933382

  1. In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?

    PubMed Central

    Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe

    2010-01-01

    Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. 
In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950

  2. Discovery of SNPs for individual identification by reduced representation sequencing of moose (Alces alces).

    PubMed

    Blåhed, Ida-Maria; Königsson, Helena; Ericsson, Göran; Spong, Göran

    2018-01-01

    Monitoring of wild animal populations is challenging, yet reliable information about population processes is important for both management and conservation efforts. Access to molecular markers, such as SNPs, enables population monitoring through genotyping of various DNA sources. We have developed 96 high quality SNP markers for individual identification of moose (Alces alces), an economically and ecologically important top-herbivore in boreal regions. Reduced representation libraries constructed from 34 moose were high-throughput de novo sequenced, generating nearly 50 million read pairs. About 50 000 stacks of aligned reads containing one or more SNPs were discovered with the Stacks pipeline. Several quality criteria were applied to the candidate SNPs to find markers informative on the individual level and well representative of the population. An empirical validation by genotyping of sequenced individuals and additional moose resulted in the selection of a final panel of 86 high quality autosomal SNPs. Additionally, five sex-specific SNPs and five SNPs for sympatric species diagnostics are included in the panel. The genotyping error rate was 0.002 for the total panel and probabilities of identity were low enough to separate individuals with high confidence. Moreover, the autosomal SNPs were also highly informative for population level analyses. The potential applications of this SNP panel are thus many, including investigations of population size, sex ratios, relatedness, reproductive success and population structure. Ideally, SNP-based studies could improve today's population monitoring and increase our knowledge about moose population dynamics.
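
    To illustrate why a panel of this size can separate individuals, the sketch below computes the probability of identity for biallelic SNPs using the standard population-genetic expressions for unrelated individuals and for full siblings, multiplied across loci under an assumption of independence. The abstract does not state which estimator was used or report per-SNP allele frequencies, so the formulas chosen, the function names and the idealised frequency of 0.5 at all 86 autosomal SNPs are assumptions for illustration only.

```python
import math

def pi_unrelated(p):
    """Probability that two unrelated individuals share a genotype at one
    biallelic SNP with allele frequency p: p^4 + q^4 + (2pq)^2."""
    q = 1.0 - p
    return p**4 + q**4 + (2.0 * p * q) ** 2

def pi_siblings(p):
    """Probability of identity between full siblings at one biallelic SNP:
    0.25 + 0.5*S2 + 0.5*S2^2 - 0.25*S4, with S2 = sum p_i^2, S4 = sum p_i^4."""
    q = 1.0 - p
    s2 = p**2 + q**2
    s4 = p**4 + q**4
    return 0.25 + 0.5 * s2 + 0.5 * s2**2 - 0.25 * s4

def panel_probability(freqs, per_locus):
    """Multiply per-locus probabilities, assuming independent loci."""
    return math.prod(per_locus(p) for p in freqs)

# Idealised panel: 86 autosomal SNPs, all with allele frequency 0.5 (assumption)
freqs = [0.5] * 86
pi_panel = panel_probability(freqs, pi_unrelated)
pi_sib_panel = panel_probability(freqs, pi_siblings)
```

    Real panels have less balanced allele frequencies, so per-locus values would be higher than in this best case, and linked SNPs would violate the independence assumption.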

  3. Spontaneous formation of Au-Pt alloyed nanoparticles using pure nano-counterparts as starters: a ligand and size dependent process.

    PubMed

    Usón, Laura; Sebastian, Victor; Mayoral, Alvaro; Hueso, Jose L; Eguizabal, Adela; Arruebo, Manuel; Santamaria, Jesus

    2015-06-14

    In this work we investigate the formation of PtAu monodisperse alloyed nanoparticles by ageing pure metallic Au and Pt small nanoparticles (sNPs), nanoparticle size <5 nm, under certain conditions. We demonstrate that those bimetallic entities can be obtained by controlling the size of the initial metallic sNPs separately prepared and by selecting their appropriate capping agents. The formation of this spontaneous phenomenon was studied using HR-STEM, EDS, ionic conductivity, UV-Vis spectroscopy and cyclic voltammetry. Depending on the type of capping agent used and the size of the initial Au sNPs, three different materials were obtained: (i) AuPt bimetallic sNPs showing a surface rich in Au atoms, (ii) segregated Au and Pt sNPs and (iii) a mixture of bimetallic nanoparticles as well as Pt sNPs and Au NPs. Surface segregation energies and the nature of the reaction environment are the driving forces to direct the distribution of atoms in the bimetallic sNPs. PtAu alloyed nanoparticles were obtained after 150 h of reaction at room temperature if a weak capping agent was used for the stabilization of the nanoparticles. It was also found that Au atoms diffuse towards Pt sNPs, producing a surface enriched in Au atoms. This study shows that even pure nanoparticles are prone to be modified by the surrounding nanoparticles to give rise to new nanomaterials if atomic diffusion is feasible.

  4. OAS single-nucleotide polymorphisms and haplotypes are associated with variations in immune responses to rubella vaccine

    PubMed Central

    Haralambieva, Iana H.; Dhiman, Neelam; Ovsyannikova, Inna G.; Vierkant, Robert A.; Pankratz, V. Shane; Jacobson, Robert M.; Poland, Gregory A.

    2010-01-01

    Interferon (IFN)-induced antiviral genes are crucial players in innate antiviral defense and potential determinants of immune response heterogeneity. We selected 114 candidate SNPs from 12 antiviral genes using an LD tagSNP selection approach and genotyped them in a cohort of 738 schoolchildren immunized with two doses of rubella vaccine. Associations between SNPs/haplotypes and rubella virus-specific immune measures were assessed using linear regression methodologies. We identified 23 significant associations (p<0.05) between polymorphisms within the 2′-5′-oligoadenylate synthetase (OAS) gene cluster, and rubella virus-specific IL-2, IL-10, IL-6 secretion and antibody levels. The minor allele variants of three OAS1 SNPs (rs3741981/Ser162Gly, rs1051042/Thr361Arg, rs2660), located in a linkage disequilibrium block of functional importance, were significantly associated with an increase in rubella virus-specific IL-2/Th1 response (p≤0.024). Seven OAS1 and OAS3 promoter/regulatory SNPs were similarly associated with IL-2 secretion. Importantly, two SNPs (rs3741981 and rs10774670), independently cross-regulated rubella virus-specific IL-10 secretion levels (p≤0.031). Furthermore, both global tests and individual haplotype analyses revealed significant associations between OAS1 haplotypes and rubella virus-specific cytokine secretion. Our results suggest that innate immunity and OAS genetic variations are likely involved in modulating the magnitude and quality of the adaptive immune responses to live attenuated rubella vaccine. PMID:20079393

  5. The genetic consequences of selection in natural populations.

    PubMed

    Thurman, Timothy J; Barrett, Rowan D H

    2016-04-01

    The selection coefficient, s, quantifies the strength of selection acting on a genetic variant. Despite this parameter's central importance to population genetic models, until recently we have known relatively little about the value of s in natural populations. With the development of molecular genetic techniques in the late 20th century and the sequencing technologies that followed, biologists are now able to identify genetic variants and directly relate them to organismal fitness. We reviewed the literature for published estimates of natural selection acting at the genetic level and found over 3000 estimates of selection coefficients from 79 studies. Selection coefficients were roughly exponentially distributed, suggesting that the impact of selection at the genetic level is generally weak but can occasionally be quite strong. We used both nonparametric statistics and formal random-effects meta-analysis to determine how selection varies across biological and methodological categories. Selection was stronger when measured over shorter timescales, with the mean magnitude of s greatest for studies that measured selection within a single generation. Our analyses found conflicting trends when considering how selection varies with the genetic scale (e.g., SNPs or haplotypes) at which it is measured, suggesting a need for further research. Besides these quantitative conclusions, we highlight key issues in the calculation, interpretation, and reporting of selection coefficients and provide recommendations for future research. © 2016 John Wiley & Sons Ltd.

  6. Identification of susceptible genes for complex chronic diseases based on disease risk functional SNPs and interaction networks.

    PubMed

    Li, Wan; Zhu, Lina; Huang, Hao; He, Yuehan; Lv, Junjie; Li, Weimin; Chen, Lina; He, Weiming

    2017-10-01

    Complex chronic diseases are caused by the effects of genetic and environmental factors. Single nucleotide polymorphisms (SNPs), a common type of genetic variation, play vital roles in disease. We hypothesized that disease risk functional SNPs in coding regions and protein interaction network modules were more likely to contribute to the identification of disease susceptible genes for complex chronic diseases. This could help to further reveal the pathogenesis of complex chronic diseases. Disease risk SNPs were first recognized from public SNP data for coronary heart disease (CHD), hypertension (HT) and type 2 diabetes (T2D). SNPs in coding regions that were classified as nonsense or missense by integrating several SNP functional annotation databases were treated as functional SNPs. Then, regions significantly associated with each disease were screened using random permutations for disease risk functional SNPs. Corresponding to these regions, 155, 169 and 173 potential disease susceptible genes were identified for CHD, HT and T2D, respectively. A disease-related gene product interaction network in environmental context was constructed for interacting gene products of both disease genes and potential disease susceptible genes for these diseases. After functional enrichment analysis for disease associated modules, 5 CHD susceptible genes, 7 HT susceptible genes and 3 T2D susceptible genes were finally identified, some of which had pleiotropic effects. Most of these genes were verified to be related to these diseases in literature. This was similar for disease genes identified from another method proposed by Lee et al. from a different aspect. This research could provide novel perspectives for diagnosis and treatment of complex chronic diseases and susceptible genes identification for other diseases. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Whole-Genome Resequencing of Experimental Populations Reveals Polygenic Basis of Egg-Size Variation in Drosophila melanogaster.

    PubMed

    Jha, Aashish R; Miles, Cecelia M; Lippert, Nodia R; Brown, Christopher D; White, Kevin P; Kreitman, Martin

    2015-10-01

    Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. FABP4 is a leading candidate gene associated with residual feed intake in growing Holstein calves.

    PubMed

    Cohen-Zinder, Miri; Asher, Aviv; Lipkin, Ehud; Feingersch, Roi; Agmon, Rotem; Karasik, David; Brosh, Arieh; Shabtay, Ariel

    2016-05-01

    Ecological and economic concerns drive the need to improve feed utilization by domestic animals. Residual feed intake (RFI) is one of the most acceptable measures for feed efficiency (FE). However, phenotyping RFI-related traits is complex and expensive and requires special equipment. Advances in marker technology allow the development of various DNA-based selection tools. To assimilate these technologies for the benefit of RFI-based selection, reliable phenotypic measures are prerequisite. In the current study, we identified single nucleotide polymorphisms (SNPs) associated with RFI phenotypic consistency across different ages and diets (named RFI 1-3), using DNA samples of high or low RFI ranked Holstein calves. Using targeted sequencing of chromosomal regions associated with FE- and RFI-related traits, we identified 48 top SNPs significantly associated with at least one of three defined RFIs. Eleven of these SNPs were harbored by the fatty acid binding protein 4 (FABP4). While 10 significant SNPs found in FABP4 were common for RFI 1 and RFI 3, one SNP (FABP4_5; A

  9. Replication of Caucasian loci associated with bone mineral density in Koreans.

    PubMed

    Kim, Y A; Choi, H J; Lee, J Y; Han, B G; Shin, C S; Cho, N H

    2013-10-01

    Most bone mineral density (BMD) loci were reported in Caucasian genome-wide association studies (GWAS). This study investigated the association between 59 known BMD loci (+200 suggestive SNPs) and DXA-derived BMD in an East Asian population with respect to sex and site specificity. We also identified four novel BMD candidate loci from the suggestive SNPs. Most GWAS have reported BMD-related variations in Caucasian populations. This study investigates whether the BMD loci discovered in Caucasian GWAS are also associated with BMD in East Asian ethnic samples. A total of 2,729 unrelated Korean individuals from a population-based cohort were analyzed. We selected 747 single-nucleotide polymorphisms (SNPs). These markers included 547 SNPs from 59 loci with genome-wide significance (GWS, p value less than 5 × 10⁻⁸) levels and 200 suggestive SNPs that showed weaker BMD association with p value less than 5 × 10⁻⁵. After quality control, 535 GWS SNPs and 182 suggestive SNPs were included in the replication analysis. Of the 535 GWS SNPs, 276 from 25 loci were replicated (p < 0.05) in the Korean population, a 51.6% replication rate. Of the 182 suggestive variants, 16 were replicated (p < 0.05, 8.8% replication rate), and five reached a significant combined p value (less than 7.0 × 10⁻⁵, 0.05/717 SNPs, corrected for multiple testing). Two markers (rs11711157, rs3732477) are for the same signal near the gene CPN2 (carboxypeptidase N, polypeptide 2). The other variants, rs6436440 and rs2291296, were located in the genes AP1S3 (adaptor-related protein complex 1, sigma 3 subunit) and RARB (retinoic acid receptor, beta). Our results illustrate ethnic differences in BMD susceptibility genes and underscore the need for further genetic studies in each ethnic group. We were also able to replicate some SNPs with suggestive associations. These SNPs may be BMD-related genetic markers and should be further investigated.
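
    The replication rates and the multiple-testing threshold quoted in this abstract follow directly from the SNP counts it reports. The short check below reproduces that arithmetic; the variable names are illustrative, and the threshold is treated as a plain Bonferroni correction over all SNPs that passed quality control, which is how the abstract's "0.05/717 SNPs" reads.

```python
# SNP counts reported in the abstract after quality control
n_gws = 535          # SNPs from genome-wide significant loci
n_suggestive = 182   # suggestive SNPs
n_tested = n_gws + n_suggestive  # 717 SNPs in the replication analysis

# Replicated at p < 0.05 in the Korean cohort
replicated_gws = 276
replicated_suggestive = 16

# Replication rates as percentages
rate_gws = 100.0 * replicated_gws / n_gws
rate_suggestive = 100.0 * replicated_suggestive / n_suggestive

# Bonferroni-corrected significance threshold across all tested SNPs
alpha = 0.05
bonferroni_threshold = alpha / n_tested

print(f"GWS replication rate:        {rate_gws:.1f}%")
print(f"Suggestive replication rate: {rate_suggestive:.1f}%")
print(f"Bonferroni threshold:        {bonferroni_threshold:.1e}")
```

    The computed values round to the figures given in the abstract: 51.6%, 8.8% and a threshold of about 7.0e-05.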

  10. Therapygenetics in mindfulness-based cognitive therapy: do genes have an impact on therapy-induced change in real-life positive affective experiences?

    PubMed

    Bakker, J M; Lieverse, R; Menne-Lothmann, C; Viechtbauer, W; Pishva, E; Kenis, G; Geschwind, N; Peeters, F; van Os, J; Wichers, M

    2014-04-22

    Positive affect (PA) has an important role in resilience against depression and has been shown to increase with mindfulness-based cognitive therapy (MBCT). To elucidate the underlying mechanisms of change in PA as well as develop insights that may benefit personalized medicine, the current study examined the contribution of genetic variation to individual differences in change in PA in response to MBCT. Individuals (n=126) with residual depressive symptoms were randomized to either an MBCT group or treatment as usual. PA was assessed using experience sampling methodology (ESM). Single-nucleotide polymorphisms (SNPs) in genes known to be involved in reward functioning were selected. SNPs in the genes for brain-derived neurotrophic factor (BDNF), the muscarinic acetylcholine receptor M2 (CHRM2), the dopamine receptor D4 (DRD4) and the μ1 opioid receptor (OPRM1) significantly moderated the impact of treatment condition over time on PA. Genetic variation in the genes for CHRM2 and OPRM1 specifically had an impact on the level of PA following MBCT. The current study shows that variation in response to MBCT may be contingent on genetic factors associated with the regulation of PA. These findings contribute to our understanding of the processes moderating response to treatment and prediction of treatment outcome.

  11. Soil environment is a key driver of adaptation in Medicago truncatula: new insights from landscape genomics.

    PubMed

    Guerrero, Jimena; Andrello, Marco; Burgarella, Concetta; Manel, Stephanie

    2018-07-01

    Spatial differences in environmental selective pressures interact with the genomes of organisms, ultimately leading to local adaptation. Landscape genomics is an emergent research area that uncovers genome-environment associations, thus allowing researchers to identify candidate loci for adaptation to specific environmental variables. In the present study, we used latent factor mixed models (LFMMs) and Moran spectral outlier detection/randomization (MSOD-MSR) to identify candidate loci for adaptation to 10 environmental variables (climatic, soil and atmospheric) among 43 515 single nucleotide polymorphisms (SNPs) from 202 accessions of the model legume Medicago truncatula. Soil variables were associated with a large number of candidate loci identified through both LFMMs and MSOD-MSR. Genes tagged by candidate loci associated with drought and salinity are involved in the response to biotic and abiotic stresses, while those tagged by candidates associated with soil nitrogen and atmospheric nitrogen, participate in the legume-rhizobia symbiosis. Candidate SNPs identified through both LFMMs and MSOD-MSR explained up to 56% of variance in flowering traits. Our findings highlight the importance of soil in driving adaptation in the system and elucidate the basis of evolutionary potential of M. truncatula to respond to global climate change and anthropogenic disruption of the nitrogen cycle. © 2018 The Authors New Phytologist © 2018 New Phytologist Trust.

  12. Evaluation of methods and marker Systems in Genomic Selection of oil palm (Elaeis guineensis Jacq.).

    PubMed

    Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Yeoh, Suat Hui; Appleton, David Ross; Harikrishna, Jennifer Ann

    2017-12-11

    Genomic selection (GS) uses genome-wide markers as an attempt to accelerate genetic gain in breeding programs of both animals and plants. This approach is particularly useful for perennial crops such as oil palm, which have long breeding cycles, and for which the optimal method for GS is still under debate. In this study, we evaluated the effect of different marker systems and modeling methods for implementing GS in an introgressed dura family derived from a Deli dura x Nigerian dura (Deli x Nigerian) with 112 individuals. This family is an important breeding source for developing new mother palms for superior oil yield and bunch characters. The traits of interest selected for this study were fruit-to-bunch (F/B), shell-to-fruit (S/F), kernel-to-fruit (K/F), mesocarp-to-fruit (M/F), oil per palm (O/P) and oil-to-dry mesocarp (O/DM). The marker systems evaluated were simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs). RR-BLUP, Bayesian A, B, Cπ, LASSO, Ridge Regression and two machine learning methods (SVM and Random Forest) were used to evaluate GS accuracy of the traits. The kinship coefficient between individuals in this family ranged from 0.35 to 0.62. S/F and O/DM had the highest genomic heritability, whereas F/B and O/P had the lowest. The accuracies using 135 SSRs were low, with accuracies of the traits around 0.20. The average accuracy of machine learning methods was 0.24, as compared to 0.20 achieved by other methods. The trait with the highest mean accuracy was F/B (0.28), while the lowest were both M/F and O/P (0.18). By using whole genomic SNPs, the accuracies for all traits, especially for O/DM (0.43), S/F (0.39) and M/F (0.30) were improved. The average accuracy of machine learning methods was 0.32, compared to 0.31 achieved by other methods. Due to high genomic resolution, the use of whole-genome SNPs improved the efficiency of GS dramatically for oil palm and is recommended for dura breeding programs. Machine learning slightly outperformed other methods, but required parameter optimization for GS implementation.

  13. Adaptive Set-Based Methods for Association Testing.

    PubMed

    Su, Yu-Chen; Gauderman, William James; Berhane, Kiros; Lewinger, Juan Pablo

    2016-02-01

    With a typical sample size of a few thousand subjects, a single genome-wide association study (GWAS) using traditional one single nucleotide polymorphism (SNP)-at-a-time methods can only detect genetic variants conferring a sizable effect on disease risk. Set-based methods, which analyze sets of SNPs jointly, can detect variants with smaller effects acting within a gene, a pathway, or other biologically relevant sets. Although self-contained set-based methods (those that test sets of variants without regard to variants not in the set) are generally more powerful than competitive set-based approaches (those that rely on comparison of variants in the set of interest with variants not in the set), there is no consensus as to which self-contained methods are best. In particular, several self-contained set tests have been proposed to directly or indirectly "adapt" to the a priori unknown proportion and distribution of effects of the truly associated SNPs in the set, which is a major determinant of their power. A popular adaptive set-based test is the adaptive rank truncated product (ARTP), which seeks the set of SNPs that yields the best-combined evidence of association. We compared the standard ARTP, several ARTP variations we introduced, and other adaptive methods in a comprehensive simulation study to evaluate their performance. We used permutations to assess significance for all the methods and thus provide a level playing field for comparison. We found the standard ARTP test to have the highest power across our simulations followed closely by the global model of random effects (GMRE) and a least absolute shrinkage and selection operator (LASSO)-based test. © 2015 WILEY PERIODICALS, INC.
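
    The adaptive rank truncated product described above can be sketched in a few lines. This is a minimal illustration of one common formulation, not the authors' implementation: the function names, the truncation points and the toy null p-values are all assumptions. For each truncation point K, the statistic combines the K smallest p-values in the set; permuted data sets give a p-value for each K; the smallest of these is then itself calibrated against the same permutations.

```python
import math
import random


def rtp_statistics(pvalues, truncation_points):
    """Rank truncated product on the log scale.

    For each K, return minus the sum of log p over the K smallest p-values,
    so that larger values mean stronger combined evidence.
    """
    ordered = sorted(pvalues)
    return [-sum(math.log(p) for p in ordered[:k]) for k in truncation_points]


def artp_pvalue(observed, null_sets, truncation_points):
    """Adaptive RTP p-value for one SNP set.

    observed  : per-SNP p-values from the real data
    null_sets : per-SNP p-value lists from permuted data (assumed given)
    """
    # Row 0 is the observed data set, the remaining rows are permutations.
    stats = [rtp_statistics(observed, truncation_points)]
    stats += [rtp_statistics(ps, truncation_points) for ps in null_sets]
    n = len(stats)
    columns = list(zip(*stats))  # one column of statistics per truncation point

    def perm_p(value, column):
        # Fraction of data sets (including this one) at least as extreme.
        return sum(1 for s in column if s >= value) / n

    # Smallest per-K permutation p-value for every data set.
    min_p = [min(perm_p(row[j], columns[j]) for j in range(len(columns)))
             for row in stats]
    # Calibrate the observed minimum against the permutation minima.
    return sum(1 for m in min_p if m <= min_p[0]) / n


random.seed(0)
# Toy null p-values, kept away from 0 so that the logarithm stays finite.
toy_nulls = [[random.uniform(0.001, 1.0) for _ in range(10)] for _ in range(199)]
toy_observed = [1e-6, 1e-5, 1e-4] + [0.5] * 7  # hypothetical set with 3 strong SNPs
toy_p = artp_pvalue(toy_observed, toy_nulls, truncation_points=(1, 3, 5))
print(toy_p)
```

    With 199 permutations the smallest attainable p-value is 1/200, which is what the toy set with three strong signals reaches here; real use would need many more permutations and null p-values generated by permuting phenotypes.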

  14. Interaction between arsenic exposure from drinking water and genetic susceptibility in carotid intima-media thickness in Bangladesh

    PubMed Central

    Wu, Fen; Jasmine, Farzana; Kibriya, Muhammad G.; Liu, Mengling; Cheng, Xin; Parvez, Faruque; Paul-Brutus, Rachelle; Paul, Rina Rani; Sarwar, Golam; Ahmed, Alauddin; Jiang, Jieying; Islam, Tariqul; Slavkovich, Vesna; Rundek, Tatjana; Demmer, Ryan T.; Desvarieux, Moise; Ahsan, Habibul; Chen, Yu

    2014-01-01

    Epidemiologic studies that evaluated genetic susceptibility to the effects of arsenic exposure from drinking water on subclinical atherosclerosis are limited. We conducted a cross-sectional study of 1,078 participants randomly selected from the Health Effects of Arsenic Longitudinal Study in Bangladesh to evaluate whether the association between arsenic exposure and carotid artery intima-medial thickness (cIMT) differs by 207 single-nucleotide polymorphisms (SNPs) in 18 genes related to arsenic metabolism, oxidative stress, inflammation, and endothelial dysfunction. Although not statistically significant after correcting for multiple testing, nine SNPs in APOE, AS3MT, PNP, and TNF genes had a nominally statistically significant interaction with well-water arsenic in cIMT. For instance, the joint presence of a higher level of well-water arsenic (≥ 40.4 μg/L) and the GG genotype of AS3MT rs3740392 was associated with a difference of 40.9 μm (95% CI = 14.4, 67.5) in cIMT, much greater than the difference of cIMT associated with the genotype alone (β = -5.1 μm, 95% CI = -31.6, 21.3) or arsenic exposure alone (β = 7.2 μm, 95% CI = -3.1, 17.5). The pattern and magnitude of the interactions were similar when urinary arsenic was used as the exposure variable. Additionally, the at-risk genotypes of the AS3MT SNPs were positively related to proportion of monomethylarsonic acid (MMA) in urine, which is indicative of arsenic methylation capacity. The findings provide novel evidence that genetic variants related to arsenic metabolism may play an important role in arsenic-induced subclinical atherosclerosis. Future replication studies in diverse populations are needed to confirm the findings. PMID:24593923
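
    The size of the interaction in the example above can be made explicit with a short calculation. The sketch assumes that all three differences are measured against the same reference group (lower well-water arsenic and a non-GG genotype), which the abstract implies but does not state; the function name is illustrative.

```python
# Differences in cIMT (micrometres) quoted in the abstract, assumed to share
# one reference group: lower arsenic exposure and a non-GG genotype.
JOINT = 40.9          # higher well-water arsenic AND GG genotype
GENOTYPE_ONLY = -5.1  # GG genotype alone
ARSENIC_ONLY = 7.2    # higher well-water arsenic alone


def additive_interaction(joint: float, effect_a: float, effect_b: float) -> float:
    """Excess of the joint effect over the sum of the two separate effects."""
    return joint - (effect_a + effect_b)


excess = additive_interaction(JOINT, GENOTYPE_ONLY, ARSENIC_ONLY)
print(f"Expected under additivity: {GENOTYPE_ONLY + ARSENIC_ONLY:.1f} um")
print(f"Excess due to interaction: {excess:.1f} um")
```

    Under this assumption the two separate effects add up to about 2.1 μm, so roughly 38.8 μm of the 40.9 μm joint difference is attributable to the interaction on the additive scale.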

  15. A matrix metalloproteinase 9 (MMP9) gene single nucleotide polymorphism is associated with predisposition to tick-borne encephalitis virus-induced severe central nervous system disease.

    PubMed

    Barkhash, Andrey V; Yurchenko, Andrey A; Yudin, Nikolay S; Ignatieva, Elena V; Kozlova, Irina V; Borishchuk, Inessa A; Pozdnyakova, Larisa L; Voevoda, Mikhail I; Romaschenko, Aida G

    2018-05-01

    The progression of infectious diseases depends on causative agents, the environment and the host's genetic susceptibility. To date, human genetic susceptibility to tick-borne encephalitis (TBE) virus-induced disease has not been sufficiently studied. We have combined whole-exome sequencing with a candidate gene approach to identify genes that are involved in the development of predisposition to TBE in a Russian population. Initially, six exomes from TBE patients with severe central nervous system (CNS) disease and seven exomes from control individuals were sequenced. Despite the small sample size, two nonsynonymous single nucleotide polymorphisms (SNPs) were significantly associated with TBE virus-induced severe CNS disease. One of these SNPs is rs6558394 (G/A, Pro422Leu) in the scribbled planar cell polarity protein (SCRIB) gene and the other SNP is rs17576 (A/G, Gln279Arg) in the matrix metalloproteinase 9 (MMP9) gene. Subsequently, these SNPs were genotyped in DNA samples of 150 non-immunized TBE patients with different clinical forms of the disease from two cities and 228 control randomly selected samples from the same populations. There were no statistically significant differences in genotype and allele frequencies between the case and control groups for rs6558394. However, the frequency of the rs17576 G allele was significantly higher in TBE patients with severe CNS diseases such as meningo-encephalitis (43.5%) when compared with TBE patients with milder meningitis (26.3%; P = 0.01), as well as with the population control group (32.5%; P = 0.042). The results suggest that the MMP9 gene may affect genetic predisposition to TBE in a Russian population. Copyright © 2018 Elsevier GmbH. All rights reserved.

  16. Genetic alterations within TLR genes in development of Toxoplasma gondii infection among Polish pregnant women.

    PubMed

    Wujcicka, Wioletta; Wilczyński, Jan; Nowakowska, Dorota

    2017-09-01

    The research was conducted to evaluate the role of genotypes, haplotypes and multiple-SNP variants in the range of TLR2, TLR4 and TLR9 single nucleotide polymorphisms (SNPs) in the development of Toxoplasma gondii infection among Polish pregnant women. The study was performed for 116 Polish pregnant women, including 51 patients infected with T. gondii, and 65 age-matched control pregnant individuals. Genotypes in TLR2 2258 G>A, TLR4 896 A>G, TLR4 1196 C>T and TLR9 2848 G>A SNPs were estimated by self-designed, nested PCR-RFLP assays. Randomly selected PCR products, representative for distinct genotypes in the studied polymorphisms, were confirmed by sequencing. All the genotypes were calculated for Hardy-Weinberg (H-W) equilibrium and TLR4 variants were tested for linkage disequilibrium. Relationships were assessed between alleles, genotypes, haplotypes or multiple-SNP variants in TLR polymorphisms and the occurrence of T. gondii infection in pregnant women, using a logistic regression model. All the analyzed genotypes preserved the H-W equilibrium among the studied groups of patients (P>0.050). Similar distribution of distinct alleles and individual genotypes in TLR SNPs, as well as of haplotypes in TLR4 polymorphisms, were observed in T. gondii infected and control uninfected pregnant women. However, the GACG multiple-SNP variant, within the range of all the four studied polymorphisms, was correlated with a decreased risk of the parasitic infection (OR 0.52, 95% CI 0.28-0.97; P≤0.050). The polymorphisms, located within TLR2, TLR4 and TLR9 genes, may be involved together in occurrence of T. gondii infection among Polish pregnant women. Copyright © 2017 Medical University of Bialystok. Published by Elsevier B.V. All rights reserved.
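
    A Hardy-Weinberg check of the kind mentioned above is often done with a chi-square goodness-of-fit test on genotype counts. The sketch below shows that standard calculation; it is not necessarily the procedure the authors used, and the genotype counts in the example are hypothetical.

```python
import math


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int):
    """Chi-square goodness-of-fit test for Hardy-Weinberg equilibrium.

    Takes counts of the three genotypes at a biallelic SNP and returns
    (chi-square statistic, p-value with 1 degree of freedom).
    Both alleles must be present, otherwise an expected count is zero.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p                      # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of the chi-square distribution with 1 df.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical genotype counts, not taken from the study.
chi2, p_value = hwe_chi_square(25, 50, 25)
print(chi2, p_value)
```

    A p-value above the chosen cut-off (0.050 in the abstract) means the genotype counts show no detectable departure from equilibrium. With small expected counts an exact test is usually preferred over the chi-square approximation.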

  17. Developing a novel panel of genome-wide ancestry informative markers for bio-geographical ancestry estimates.

    PubMed

    Jia, Jing; Wei, Yi-Liang; Qin, Cui-Jiao; Hu, Lan; Wan, Li-Hua; Li, Cai-Xia

    2014-01-01

    Inferring the ancestral origin of DNA samples can be helpful in correcting population stratification in disease association studies or guiding crime investigations. Populations throughout the world vary in appearance features and biological characteristics. Based on this idea, we performed a genome-wide scan for SNPs within genes that are related to physical and biological traits. Using the HapMap database, we screened 52 genes and their flanking regions. Thirty-five SNPs that displayed highly contrasting allele frequencies (F(st)>0.3, linkage disequilibrium r(2)<0.2, and Hardy-Weinberg equilibrium P>0.001) among Africans, Europeans, and East Asians were selected and validated. A multiplexed assay was developed to genotype these 35 SNPs in 357 individuals from 10 populations worldwide. This panel provided accurate estimates of individual ancestry proportions with balanced discriminatory power among the three continental ancestries: Africans, Europeans, and East Asians. It also proved very effective in evaluating admixed populations living in joint regions of continents (e.g., Uyghurs and Indians) and discriminating some subpopulations within each of the three continents. Structure analysis was performed to establish and evaluate the panel of ancestry-informative markers, and the components of each population were also described to indicate the structural composition. The 21 population structures in our study are consistent with geographic patterns, and individuals were properly assigned to their original ancestral populations with proportion analyses and random match probability calculations. Thus, the panel and its population information will be useful resources to minimize the effects of population stratification in association analyses and to assign the most likely origin of an unknown DNA contributor in forensic investigations. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
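
    The F(st) filter used to pick ancestry-informative SNPs can be illustrated with a simple heterozygosity-based calculation. The abstract does not say which estimator was used, so the sketch below assumes the basic definition F(st) = (H_T - H_S) / H_T for a biallelic SNP with equally weighted populations; the allele frequencies are hypothetical.

```python
def wright_fst(allele_freqs):
    """F(st) for one biallelic SNP from per-population allele frequencies.

    Uses F(st) = (H_T - H_S) / H_T, where H_T is the expected heterozygosity
    of the pooled population and H_S the mean within-population expected
    heterozygosity. Populations are weighted equally (an assumption).
    """
    k = len(allele_freqs)
    mean_p = sum(allele_freqs) / k
    h_total = 2.0 * mean_p * (1.0 - mean_p)
    h_within = sum(2.0 * p * (1.0 - p) for p in allele_freqs) / k
    if h_total == 0.0:
        return 0.0  # SNP is monomorphic across all populations
    return (h_total - h_within) / h_total


# Hypothetical frequencies of one allele in three continental groups.
fst = wright_fst([0.9, 0.1, 0.5])
print(f"F(st) = {fst:.3f}, passes the 0.3 filter: {fst > 0.3}")
```

    A SNP with strongly contrasting frequencies such as this one exceeds the 0.3 cut-off mentioned in the abstract, while a SNP with identical frequencies in every population gives a value of zero.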

  18. Ancestry of the Timorese: age-related macular degeneration associated genotype and allele sharing among human populations from throughout the world

    PubMed Central

    Morrison, Margaux A.; Magalhaes, Tiago R.; Ramke, Jacqueline; Smith, Silvia E.; Ennis, Sean; Simpson, Claire L.; Portas, Laura; Murgia, Federico; Ahn, Jeeyun; Dardenne, Caitlin; Mayne, Katie; Robinson, Rosann; Morgan, Denise J.; Brian, Garry; Lee, Lucy; Woo, Se J.; Zacharaki, Fani; Tsironi, Evangelia E.; Miller, Joan W.; Kim, Ivana K.; Park, Kyu H.; Bailey-Wilson, Joan E.; Farrer, Lindsay A.; Stambolian, Dwight; DeAngelis, Margaret M.

    2015-01-01

    We observed that the third leading cause of blindness in the world, age-related macular degeneration (AMD), occurs at a very low documented frequency in a population-based cohort from Timor-Leste. Thus, we determined a complete catalog of the ancestry of the Timorese by analysis of whole exome chip data and haplogroup analysis of SNP genotypes determined by sequencing the Hypervariable I and II regions of the mitochondrial genome and 17 genotyped YSTR markers obtained from 535 individuals. We genotyped 20 previously reported AMD-associated SNPs in the Timorese to examine their allele frequencies compared to and between previously documented AMD cohorts of varying ethnicities. For those without AMD (average age > 55 years), genotype and allele frequencies were similar for most SNPs with a few exceptions. The major risk allele of HTRA1 rs11200638 (10q26) was at a significantly higher frequency in the Timorese, as well as 3 of the 5 protective CFH (1q32) SNPs (rs800292, rs2284664, and rs12066959). Additionally, the most commonly associated AMD-risk SNP, CFH rs1061170 (Y402H), was also seen at a much lower frequency in the Korean and Timorese populations than in the assessed Caucasian populations (C ~7 vs. ~40%, respectively). The difference in allele frequencies between the Timorese population and the other genotyped populations, along with the haplogroup analysis, also highlight the genetic diversity of the Timorese. Specifically, the most common ancestry groupings were Oceanic (Melanesian and Papuan) and Eastern Asian (specifically Han Chinese). The low prevalence of AMD in the Timorese population (2 of 535 randomly selected participants) may be due to the enrichment of protective alleles in this population at the 1q32 locus. PMID:26217379

  19. Genetic Predictors of Depressive Symptoms in the Look AHEAD Trial.

    PubMed

    McCaffery, Jeanne M; Papandonatos, George D; Faulconbridge, Lucy F; Erar, Bahar; Peter, Inga; Wagenknecht, Lynne E; Pajewski, Nicholas M; Anderson, Andrea; Wadden, Thomas A; Wing, Rena R

    2015-01-01

    Numerous studies have found elevated depressive symptoms among individuals with Type 2 diabetes, yet the mechanisms remain unclear. We examined whether genetic loci previously associated with depressive symptoms predict depressive symptoms among overweight/obese individuals with Type 2 diabetes or change in depressive symptoms during behavioral weight loss. The Illumina CARe iSelect (IBC) chip and Cardiometabochip were characterized in 2118 overweight or obese participants with Type 2 diabetes from Look AHEAD (Action for Health in Diabetes), a randomized trial to determine the effects of intensive life-style intervention and diabetes support and education on cardiovascular morbidity and mortality. Primary analyses focused on baseline Beck Depression Inventory (BDI) scores and depressive symptom change at 1 year. Of eight single nucleotide polymorphisms (SNPs) in six loci, three a priori SNPs in two loci (chromosome 5: rs60271; LBR: rs2230419, rs1011319) were associated with baseline BDI scores, but in the opposite direction of prior research. In joint analysis of 90,003 IBC and Cardiometabochip SNPs, rs1543654 in the region of KCNE1 predicted change in BDI scores at Year 1 in diabetes support and education (β = -1.05, standard error [SE] = 0.21, p = 6.9 × 10(-7)) at the level of chip-wide significance, while also showing a nominal association with baseline BDI (β = 0.35, SE = 0.16, p = .026). Adjustment for antidepressant medication and/or limiting analyses to non-Hispanic white individuals did not meaningfully alter results. Previously reported genetic associations with depressive symptoms did not replicate in this cohort of overweight/obese individuals with Type 2 diabetes. We identified KCNE1 as a potential novel locus associated with depressive symptoms.

  20. Germline variants in the CYP19A1 gene are related to specific adverse events in aromatase inhibitor users: a substudy of Dutch patients in the TEAM trial.

    PubMed

    Fontein, Duveken B Y; Houtsma, Daniel; Nortier, Johan W R; Baak-Pablo, Renee F; Kranenbarg, Elma Meershoek-Klein; van der Straaten, Tahar R J H M; Putter, Hein; Seynaeve, Caroline; Gelderblom, Hans; van de Velde, Cornelis J H; Guchelaar, Henk-Jan

    2014-04-01

    Musculoskeletal adverse events (MSAEs) and vasomotor symptoms (VMSs) are known side-effects of aromatase inhibitors, and may be related to genetic variations of the aromatase gene (CYP19A1). We investigated the relationship between these specific AEs and single nucleotide polymorphisms (SNPs) in the CYP19A1 gene in postmenopausal, hormone receptor-positive early breast cancer (BC) patients treated with adjuvant exemestane for 5 years. Dutch patients who were randomized to receive 5 years of exemestane in the Tamoxifen Exemestane Adjuvant Multinational (TEAM) trial were included. A tagging-SNP approach was performed, covering 80 % of variations of the CYP19A1 gene with 30 SNPs. Logistic regression analyses were used to assess the risk of reporting VMSs or MSAEs in relation to genotypes within selected SNPs. Of 737 included patients, 281 patients reported at least one MSAE (n = 210) or VMS (n = 163). Homozygous AA genotype of rs934635 was associated with a significantly higher odds of MSAEs (multivariate odds ratio (OR) 4.66, p = 0.008) and VMSs (multivariate OR 2.78, p = 0.044). Regarding both rs1694189 and rs7176005, the homozygous variant genotypes (TT) were associated with a higher odds of VMSs, but not MSAEs (OR 1.758, p = 0.025 and OR 6.361, p = 0.021, respectively). Our exploratory analysis demonstrated that some CYP19A1 gene variations may be associated with MSAEs and/or VMSs. Specifically, patients with the homozygous variant rs934635 genotype reported more MSAEs and VMSs. Although further confirmatory studies are warranted, genomic profiling can help identify patients at an increased risk of reporting these specific AEs, potentiating further personalized BC treatment.

  1. Association of the homeobox transcription factor gene ENGRAILED 2 with autistic disorder in Chinese children.

    PubMed

    Yang, Pinchen; Lung, For-Wey; Jong, Yuh-Jyh; Hsieh, Hsin-Yi; Liang, Chung-Ling; Juo, Suh-Hang Hank

    2008-01-01

    Autism is a neurodevelopmental disorder with a strong genetic component. Previous studies have mapped the disease to chromosome 7q, where the homeobox transcription factor ENGRAILED 2 (EN2) gene is located. EN2 is specifically involved in patterning the region that gives rise to the cerebellum. In the present work, we carried out a case-control study to determine whether 2 intronic single-nucleotide polymorphisms (SNPs) of EN2 confer susceptibility to autism in a Han Chinese population. We enrolled 184 cases of DSM-IV-TR diagnosed autistic disorder, 225 controls of unrelated healthy volunteers and 409 randomly selected controls from the community who live in the adjacent geographical regions for this study. Two SNPs (rs1861972, rs1861973) at the EN2 gene that have been reported to be associated with autism underwent analysis among our studied cohorts. Both the UNPHASE and PHASE statistical programs were utilized for evaluating the association of EN2 SNPs with autism based on allelic and genotypic frequencies and haplotype compositions accompanied with the goodness-of-fit method of the chi(2) test. The gender difference was also investigated by using 2-sided Fisher's exact test and treated as a covariate in logistic regression analysis. Both the allelic and genotypic distributions of the 2 polymorphisms were concordant with Hardy-Weinberg equilibrium. Significant differences were found for cases versus community and overall controls. By using the UNPHASE and PHASE programs, the 2-marker haplotype A-C of EN2 was identified to have a protective effect for autism, indicating that the ethnic difference might confound the EN2 association with autism. Therefore, more EN2 gene association studies of Han Chinese populations are warranted to confirm this finding. 2008 S. Karger AG, Basel.

  2. Interaction between arsenic exposure from drinking water and genetic susceptibility in carotid intima-media thickness in Bangladesh.

    PubMed

    Wu, Fen; Jasmine, Farzana; Kibriya, Muhammad G; Liu, Mengling; Cheng, Xin; Parvez, Faruque; Paul-Brutus, Rachelle; Paul, Rina Rani; Sarwar, Golam; Ahmed, Alauddin; Jiang, Jieying; Islam, Tariqul; Slavkovich, Vesna; Rundek, Tatjana; Demmer, Ryan T; Desvarieux, Moise; Ahsan, Habibul; Chen, Yu

    2014-05-01

    Epidemiologic studies that evaluated genetic susceptibility for the effects of arsenic exposure from drinking water on subclinical atherosclerosis are limited. We conducted a cross-sectional study of 1078 participants randomly selected from the Health Effects of Arsenic Longitudinal Study in Bangladesh to evaluate whether the association between arsenic exposure and carotid artery intima-media thickness (cIMT) differs by 207 single-nucleotide polymorphisms (SNPs) in 18 genes related to arsenic metabolism, oxidative stress, inflammation, and endothelial dysfunction. Although not statistically significant after correcting for multiple testing, nine SNPs in APOE, AS3MT, PNP, and TNF genes had a nominally statistically significant interaction with well-water arsenic in cIMT. For instance, the joint presence of a higher level of well-water arsenic (≥ 40.4 μg/L) and the GG genotype of AS3MT rs3740392 was associated with a difference of 40.9 μm (95% CI = 14.4, 67.5) in cIMT, much greater than the difference of cIMT associated with the genotype alone (β = -5.1 μm, 95% CI = -31.6, 21.3) or arsenic exposure alone (β = 7.2 μm, 95% CI = -3.1, 17.5). The pattern and magnitude of the interactions were similar when urinary arsenic was used as the exposure variable. Additionally, the at-risk genotypes of the AS3MT SNPs were positively related to the proportion of monomethylarsonic acid (MMA) in urine, which is indicative of arsenic methylation capacity. The findings provide novel evidence that genetic variants related to arsenic metabolism may play an important role in arsenic-induced subclinical atherosclerosis. Future replication studies in diverse populations are needed to confirm the findings. Copyright © 2014 Elsevier Inc. All rights reserved.

  3. Polymorphisms in the bovine CIDEC gene are associated with body measurement traits and meat quality traits in Qinchuan cattle.

    PubMed

    Mei, C G; Gui, L S; Fu, C Z; Wang, H C; Wang, J L; Cheng, G; Zan, L S

    2015-08-07

    Previous studies have shown that the cell death-inducing DFF45-like effector-C (CIDEC) gene is involved in lipid storage and energy metabolism, suggesting that it is a potential candidate gene that affects body measurement traits (BMTs) and meat quality traits (MQTs). The aim of this study was to identify polymorphisms of the bovine CIDEC gene and analyze their possible associations with BMTs and MQTs in 531 randomly selected Qinchuan cattle aged between 18 and 24 months. DNA sequencing and polymerase chain reaction-restriction fragment length polymorphism were employed to detect CIDEC single nucleotide polymorphisms (SNPs). We found five SNPs: two in exon 5 (SNP1, g.9815G>A and SNP2, g.9924C>T) and three in the 3'-untranslated region (SNP3, g.13281C>T; SNP4, g.13297A>G; and SNP5, g.13307G>A). SNP1 was a missense mutation that resulted in an arginine to glutamine amino acid change, and exhibited two genotypes (GG and AG). SNP2 was a synonymous mutation that exhibited three genotypes (CC, CT, and TT). SNP3, 4, and 5 were completely linked, and only exhibited two genotypes (CC-AA-GG and CT-AG-GA). We found significant associations between these polymorphisms and BMTs and MQTs (P < 0.05); GG, CT, and CT-AG-GA appeared to be the most beneficial genotypes. Therefore, CIDEC may affect BMTs and MQTs in Qinchuan cattle, and could be used in marker-assisted selection.

  4. Mitochondrial haplotypes are not associated with mice selectively bred for high voluntary wheel running.

    PubMed

    Wone, Bernard W M; Yim, Won C; Schutz, Heidi; Meek, Thomas H; Garland, Theodore

    2018-04-04

    Mitochondrial haplotypes have been associated with human and rodent phenotypes, including nonshivering thermogenesis capacity, learning capability, and disease risk. Although the mammalian mitochondrial D-loop is highly polymorphic, D-loops in laboratory mice are identical, and variation occurs elsewhere mainly between nucleotides 9820 and 9830. Part of this region codes for the tRNA Arg gene and is associated with mitochondrial densities and number of mtDNA copies. We hypothesized that the capacity for high levels of voluntary wheel-running behavior would be associated with mitochondrial haplotype. Here, we analyzed the mtDNA polymorphic region in mice from each of four replicate lines selectively bred for 54 generations for high voluntary wheel running (HR) and from four control lines (Control) randomly bred for 54 generations. Sequencing the polymorphic region revealed a variable number of adenine repeats. Single nucleotide polymorphisms (SNPs) varied from 2 to 3 adenine insertions, resulting in three haplotypes. We found significant genetic differentiations between the HR and Control groups (F(st) = 0.779, p ≤ 0.0001), as well as among the replicate lines of mice within groups (F(sc) = 0.757, p ≤ 0.0001). Haplotypes, however, were not strongly associated with voluntary wheel running (revolutions run per day), nor with either body mass or litter size. This system provides a useful experimental model to dissect the physiological processes linking mitochondrial, genomic SNPs, epigenetics, or nuclear-mitochondrial cross-talk to exercise activity. Copyright © 2018. Published by Elsevier B.V.

  5. Identification of selection signatures in cattle breeds selected for dairy production.

    PubMed

    Stella, Alessandra; Ajmone-Marsan, Paolo; Lazzari, Barbara; Boettcher, Paul

    2010-08-01

    The genomics revolution has spurred the undertaking of HapMap studies of numerous species, allowing for population genomics to increase the understanding of how selection has created genetic differences between subspecies populations. The objectives of this study were to (1) develop an approach to detect signatures of selection in subsets of phenotypically similar breeds of livestock by comparing single nucleotide polymorphism (SNP) diversity between the subset and a larger population, (2) verify this method in breeds selected for simply inherited traits, and (3) apply this method to the dairy breeds in the International Bovine HapMap (IBHM) study. The data consisted of genotypes for 32,689 SNPs of 497 animals from 19 breeds. For a given subset of breeds, the test statistic was the parametric composite log likelihood (CLL) of the differences in allelic frequencies between the subset and the IBHM for a sliding window of SNPs. The null distribution was obtained by calculating CLL for 50,000 random subsets (per chromosome) of individuals. The validity of this approach was confirmed by obtaining extremely large CLLs at the sites of causative variation for polled (BTA1) and black-coat-color (BTA18) phenotypes. Across the 30 bovine chromosomes, 699 putative selection signatures were detected. The largest CLL was on BTA6 and corresponded to KIT, which is responsible for the piebald phenotype present in four of the five dairy breeds. Potassium channel-related genes were at the site of the largest CLL on three chromosomes (BTA14, -16, and -25) whereas integrins (BTA18 and -19) and serine/arginine rich splicing factors (BTA20 and -23) each had the largest CLL on two chromosomes. On the basis of the results of this study, the application of population genomics to farm animals seems quite promising. Comparisons between breed groups have the potential to identify genomic regions influencing complex traits with no need for complex equipment and the collection of extensive phenotypic records and can contribute to the identification of candidate genes and to the understanding of the biological mechanisms controlling complex traits.

  6. A whole genome analyses of genetic variants in two Kelantan Malay individuals.

    PubMed

    Wan Juhari, Wan Khairunnisa; Md Tamrin, Nur Aida; Mat Daud, Mohd Hanif Ridzuan; Isa, Hatin Wan; Mohd Nasir, Nurfazreen; Maran, Sathiya; Abdul Rajab, Nur Shafawati; Ahmad Amin Noordin, Khairul Bariah; Nik Hassan, Nik Norliza; Tearle, Rick; Razali, Rozaimi; Merican, Amir Feisal; Zilfalil, Bin Alwi

    2014-12-01

    Sequencing the genomes of two members of the Royal Kelantan Malay family provides insight into Kelantan Malay whole-genome sequences. The two Kelantan Malay genomes were analyzed for the SNP markers associated with thalassemia and Helicobacter pylori infection. Helicobacter pylori infection was reported to have a low prevalence in the north-east compared with the west coast of Peninsular Malaysia, and beta-thalassemia is known to be one of the most common inherited genetic disorders in Malaysia. By combining SNP information from the literature, GWAS studies and NCBI ClinVar, 18 unique SNPs were selected for further analysis. Of these 18 SNPs, 10 came from a previous study of Helicobacter pylori infection among Malay patients, 6 were from NCBI ClinVar and 2 were from GWAS studies. The analysis reveals that both Royal Kelantan Malay genomes shared all 10 SNPs identified by Maran (Single Nucleotide Polymorphims (SNPs) genotypic profiling of Malay patients with and without Helicobacter pylori infection in Kelantan, 2011) and one SNP from a GWAS study. In addition, the analysis reveals that both Royal Kelantan Malay genomes shared 3 SNP markers, HBG1 (rs1061234), HBB (rs1609812) and BCL11A (rs766432), all of which were associated with beta-thalassemia. Our findings suggest that the Royal Kelantan Malays carry SNPs that are associated with protection against Helicobacter pylori infection. In addition, they carry SNPs that are associated with beta-thalassemia. These findings are in line with the findings of other researchers who conducted studies on thalassemia and Helicobacter pylori infection in the non-royal Malay population.

  7. Design of a High Density SNP Genotyping Assay in the Pig Using SNPs Identified and Characterized by Next Generation Sequencing Technology

    PubMed Central

    Ramos, Antonio M.; Crooijmans, Richard P. M. A.; Affara, Nabeel A.; Amaral, Andreia J.; Archibald, Alan L.; Beever, Jonathan E.; Bendixen, Christian; Churcher, Carol; Clark, Richard; Dehais, Patrick; Hansen, Mark S.; Hedegaard, Jakob; Hu, Zhi-Liang; Kerstens, Hindrik H.; Law, Andy S.; Megens, Hendrik-Jan; Milan, Denis; Nonneman, Danny J.; Rohrer, Gary A.; Rothschild, Max F.; Smith, Tim P. L.; Schnabel, Robert D.; Van Tassell, Curt P.; Taylor, Jeremy F.; Wiedmann, Ralph T.; Schook, Lawrence B.; Groenen, Martien A. M.

    2009-01-01

    Background The dissection of complex traits of economic importance to the pig industry requires the availability of a significant number of genetic markers, such as single nucleotide polymorphisms (SNPs). This study was conducted to discover several hundreds of thousands of porcine SNPs using next generation sequencing technologies and use these SNPs, as well as others from different public sources, to design a high-density SNP genotyping assay. Methodology/Principal Findings A total of 19 reduced representation libraries derived from four swine breeds (Duroc, Landrace, Large White, Pietrain) and a Wild Boar population and three restriction enzymes (AluI, HaeIII and MspI) were sequenced using Illumina's Genome Analyzer (GA). The SNP discovery effort resulted in the de novo identification of over 372K SNPs. More than 549K SNPs were used to design the Illumina Porcine 60K+SNP iSelect Beadchip, now commercially available as the PorcineSNP60. A total of 64,232 SNPs were included on the Beadchip. Results from genotyping the 158 individuals used for sequencing showed a high overall SNP call rate (97.5%). Of the 62,621 loci that could be reliably scored, 58,994 were polymorphic yielding a SNP conversion success rate of 94%. The average minor allele frequency (MAF) for all scorable SNPs was 0.274. Conclusions/Significance Overall, the results of this study indicate the utility of using next generation sequencing technologies to identify large numbers of reliable SNPs. In addition, the validation of the PorcineSNP60 Beadchip demonstrated that the assay is an excellent tool that will likely be used in a variety of future studies in pigs. PMID:19654876
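    The conversion success rate quoted above is a simple ratio of polymorphic loci to reliably scored loci. As a minimal sketch, the arithmetic can be reproduced from the counts given in the abstract; the function and variable names are illustrative and do not come from the study.

```python
def conversion_rate(polymorphic: int, scorable: int) -> float:
    """Fraction of reliably scored loci that turned out to be polymorphic."""
    if scorable <= 0:
        raise ValueError("scorable must be positive")
    return polymorphic / scorable


# Counts quoted in the abstract above.
SNPS_ON_CHIP = 64_232   # SNPs included on the Beadchip
SCORABLE = 62_621       # loci that could be reliably scored
POLYMORPHIC = 58_994    # scorable loci that were polymorphic

rate = conversion_rate(POLYMORPHIC, SCORABLE)
scorable_fraction = SCORABLE / SNPS_ON_CHIP

print(f"conversion success rate: {rate:.1%}")
print(f"scorable fraction of chip content: {scorable_fraction:.1%}")
```

    Rounded to the nearest percent, the first ratio agrees with the 94% reported in the abstract.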

  8. Genome-wide analysis of central corneal thickness in primary open-angle glaucoma cases in the NEIGHBOR and GLAUGEN consortia.

    PubMed

    Ulmer, Megan; Li, Jun; Yaspan, Brian L; Ozel, Ayse Bilge; Richards, Julia E; Moroi, Sayoko E; Hawthorne, Felicia; Budenz, Donald L; Friedman, David S; Gaasterland, Douglas; Haines, Jonathan; Kang, Jae H; Lee, Richard; Lichter, Paul; Liu, Yutao; Pasquale, Louis R; Pericak-Vance, Margaret; Realini, Anthony; Schuman, Joel S; Singh, Kuldev; Vollrath, Douglas; Weinreb, Robert; Wollstein, Gadi; Zack, Donald J; Zhang, Kang; Young, Terri; Allingham, R Rand; Wiggs, Janey L; Ashley-Koch, Allison; Hauser, Michael A

    2012-07-03

    To investigate the effects of central corneal thickness (CCT)-associated variants on primary open-angle glaucoma (POAG) risk using single nucleotide polymorphisms (SNP) data from the Glaucoma Genes and Environment (GLAUGEN) and National Eye Institute (NEI) Glaucoma Human Genetics Collaboration (NEIGHBOR) consortia. A replication analysis of previously reported CCT SNPs was performed in a CCT dataset (n = 1117) and these SNPs were then tested for association with POAG using a larger POAG dataset (n = 6470). Then a CCT genome-wide association study (GWAS) was performed. Top SNPs from this analysis were selected and tested for association with POAG. cDNA libraries from fetal and adult brain and ocular tissue samples were generated and used for candidate gene expression analysis. Association with one of 20 previously published CCT SNPs was replicated: rs12447690, near the ZNF469 gene (P = 0.001; β = -5.08 μm/allele). None of these SNPs were significantly associated with POAG. In the CCT GWAS, no SNPs reached genome-wide significance. After testing 50 candidate SNPs for association with POAG, one SNP was identified, rs7481514 within the neurotrimin (NTM) gene, that was significantly associated with POAG in a low-tension subset (P = 0.00099; Odds Ratio [OR] = 1.28). Additionally, SNPs in the CNTNAP4 gene showed suggestive association with POAG (top SNP = rs1428758; P = 0.018; OR = 0.84). NTM and CNTNAP4 were shown to be expressed in ocular tissues. The results suggest previously reported CCT loci are not significantly associated with POAG susceptibility. By performing a quantitative analysis of CCT and a subsequent analysis of POAG, SNPs in two cell adhesion molecules, NTM and CNTNAP4, were identified and may increase POAG susceptibility in a subset of cases.

  9. Development and evaluation of high-density Axiom® CicerSNP Array for high-resolution genetic mapping and breeding applications in chickpea.

    PubMed

    Roorkiwal, Manish; Jain, Ankit; Kale, Sandip M; Doddamani, Dadakhalandar; Chitikineni, Annapurna; Thudi, Mahendar; Varshney, Rajeev K

    2018-04-01

    To accelerate genomics research and molecular breeding applications in chickpea, a high-throughput SNP genotyping platform 'Axiom® CicerSNP Array' has been designed, developed and validated. Screening of whole-genome resequencing data from 429 chickpea lines identified 4.9 million SNPs, from which a subset of 70 463 high-quality nonredundant SNPs was selected using different stringent filter criteria. This was further narrowed down to 61 174 SNPs based on p-convert score ≥0.3, of which 50 590 SNPs could be tiled on the array. Among these tiled SNPs, a total of 11 245 SNPs (22.23%) were from the coding regions of 3673 different genes. The developed Axiom® CicerSNP Array was used for genotyping two recombinant inbred line populations, namely ICCRIL03 (ICC 4958 × ICC 1882) and ICCRIL04 (ICC 283 × ICC 8261). Genotyping data reflected high success and polymorphic rate, with 15 140 (29.93%; ICCRIL03) and 20 018 (39.57%; ICCRIL04) polymorphic SNPs. High-density genetic maps comprising 13 679 SNPs spanning 1033.67 cM and 7769 SNPs spanning 1076.35 cM were developed for ICCRIL03 and ICCRIL04 populations, respectively. QTL analysis using multilocation, multiseason phenotyping data on these RILs identified 70 (ICCRIL03) and 120 (ICCRIL04) main-effect QTLs on the genetic maps. The higher precision and potential of this array are expected to advance chickpea genetics and breeding applications. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.

  10. SNP discovery and genotyping using Genotyping-by-Sequencing in Pekin ducks.

    PubMed

    Zhu, Feng; Cui, Qian-Qian; Hou, Zhuo-Cheng

    2016-11-15

    Genomic selection and genome-wide association studies need thousands to millions of SNPs. However, many non-model species do not have reference chips for detecting variation. Our goal was to develop and validate an inexpensive but effective method for detecting SNP variation. Genotyping by sequencing (GBS) can be a highly efficient strategy for genome-wide SNP detection, as an alternative to microarray chips. Here, we developed a GBS protocol for ducks and tested it to genotype 49 Pekin ducks. A total of 169,209 SNPs were identified from all animals, with a mean of 55,920 SNPs per individual. The average SNP density reached 1156 SNPs/Mb. In this study, the first application of GBS to ducks, we demonstrate the power and simplicity of this method. GBS can be used in genetic studies as an effective method for genome-wide SNP discovery.

  11. A genetic variation map for chicken with 2.8 million single nucleotide polymorphisms

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wong, G K; Hillier, L; Brandstrom, M

    2005-02-20

    We describe a genetic variation map for the chicken genome containing 2.8 million single nucleotide polymorphisms (SNPs), based on a comparison of the sequences of 3 domestic chickens (broiler, layer, Silkie) to their wild ancestor Red Jungle Fowl (RJF). Subsequent experiments indicate that at least 90% are true SNPs, and at least 70% are common SNPs that segregate in many domestic breeds. Mean nucleotide diversity is about 5 SNP/kb for almost every possible comparison between RJF and domestic lines, between two different domestic lines, and within domestic lines--contrary to the idea that domestic animals are highly inbred relative to their wild ancestors. In fact, most of the SNPs originated prior to domestication, and there is little to no evidence of selective sweeps for adaptive alleles on length scales of greater than 100 kb.

  12. Large-Scale Interaction Effects Reveal Missing Heritability in Schizophrenia, Bipolar Disorder and Posttraumatic Stress Disorder

    DTIC Science & Technology

    2017-04-11

    polymorphisms (SNPs) reached genome-wide significance. In contrast, when SNPs were selected in groups (containing up to thousands each) and the collective...the underlying genetic factors has been challenging because of high polygenicity, necessitating large sample sizes in meta-analyses.4 Possible ways...partners simultaneously considered beyond SNP pairs by using the regularized inference of high-dimensional interactions within large SNP groups. Over

  13. Interaction of methylation-related genetic variants with circulating fatty acids on plasma lipids: a meta-analysis of 7 studies & methylation analysis of 3 studies in the Cohorts for Heart & Aging Research

    USDA-ARS's Scientific Manuscript database

    Background: DNA methylation is influenced by diet and single nucleotide polymorphisms (SNPs), and methylation modulates gene expression. Objective: We aimed to explore whether the gene-by-diet interactions on blood lipids act through DNA methylation. Design: We selected 7 SNPs on the basis of predic...

  14. A genome-wide association study implicates diacylglycerol kinase eta (DGKH) and several other genes in the etiology of bipolar disorder

    PubMed Central

    Baum, AE; Akula, N; Cabanero, M; Cardona, I; Corona, W; Klemens, B; Schulze, TG; Cichon, S; Rietschel, M; Nöthen, MM; Georgi, A; Schumacher, J; Schwarz, M; Jamra, R Abou; Höfels, S; Propping, P; Satagopan, J; Detera-Wadleigh, SD; Hardy, J; McMahon, FJ

    2008-01-01

    The genetic basis of bipolar disorder has long been thought to be complex, with the potential involvement of multiple genes, but methods to analyze populations with respect to this complexity have only recently become available. We have carried out a genome-wide association study of bipolar disorder by genotyping over 550,000 SNPs in two independent case-control samples of European origin. The initial association screen was performed using pooled DNA; selected SNPs were confirmed by individual genotyping. While DNA pooling reduces power to detect genetic associations, there is a substantial cost savings and gain in efficiency. A total of 88 SNPs representing 80 different genes met the prior criteria for replication in both samples. Effect sizes were modest: no single SNP of large effect was detected. Of 37 SNPs selected for individual genotyping, the strongest association signal was detected at a marker within the first intron of DGKH (p = 1.5 × 10⁻⁸, experiment-wide p < 0.01, OR = 1.59). This gene encodes diacylglycerol kinase eta, a key protein in the lithium-sensitive phosphatidyl inositol pathway. This first genome-wide association study of bipolar disorder shows that several genes, each of modest effect, reproducibly influence disease risk. Bipolar disorder may be a polygenic disease. PMID:17486107

  15. A novel fluorescent aptasensor based on gold and silica nanoparticles for the ultrasensitive detection of ochratoxin A

    NASA Astrophysics Data System (ADS)

    Taghdisi, Seyed Mohammad; Danesh, Noor Mohammad; Beheshti, Hamed Reza; Ramezani, Mohammad; Abnous, Khalil

    2016-02-01

    Analytical approaches for the detection and quantitation of ochratoxin A (OTA) in blood serum and food products are in high demand. In this study, a fluorescent aptamer-based sensor (aptasensor) is developed for the selective and sensitive detection of OTA, based on a complementary strand of aptamer (CS) and two types of nanoparticles, gold nanoparticles (AuNPs) and silica nanoparticles (SNPs) coated with streptavidin. The fabricated aptasensor combines the characteristics of SNPs, which enhance fluorescence intensity; the properties of AuNPs, such as large surface area and unique optical properties; and the higher affinity of the aptamer toward its target than toward its CS. In the absence of OTA, no FAM and biotin-labeled CS is in the environment of the SNPs coated with streptavidin, which leads to no fluorescence emission. In the presence of the target, a conjugate of FAM and biotin-labeled CS with streptavidin-coated SNPs is formed, resulting in a very strong fluorescence emission. The designed fluorescent aptasensor exhibits high selectivity toward OTA with a limit of detection (LOD) as low as 0.098 nM. Furthermore, the fabricated aptasensor was successfully applied for the detection of OTA in grape juice and serum with LODs of 0.113 and 0.152 nM, respectively.

  16. Investigation of Genetic Variants Associated with Alzheimer Disease in Parkinson Disease Cognition.

    PubMed

    Barrett, Matthew J; Koeppel, Alexander F; Flanigan, Joseph L; Turner, Stephen D; Worrall, Bradford B

    2016-01-01

    Meta-analyses of genome-wide association studies have implicated multiple single nucleotide polymorphisms (SNPs) and associated genes in Alzheimer disease (AD). The role of these SNPs in cognitive impairment in Parkinson disease (PD) remains incompletely evaluated. The objective of this study was to test alleles associated with risk of AD for association with cognitive impairment in PD. Two datasets with PD subjects accessed through the NIH database of Genotypes and Phenotypes contained both single nucleotide polymorphism (SNP) arrays and mini-mental state exam (MMSE) scores. Genetic data underwent rigorous quality control and we selected SNPs for genes associated with AD other than APOE. We constructed logistic regression and ordinal regression models, adjusted for sex, age at MMSE, and duration of PD, to assess the association between selected SNPs and MMSE score. In one dataset, PICALM rs3851179 was associated with cognitive impairment (MMSE < 24) in PD subjects > 70 years old (OR = 2.3; adjusted p-value = 0.017; n = 250) but not in PD subjects ≤ 70 years old. Our finding suggests that PICALM rs3851179 could contribute to cognitive impairment in older patients with PD. It is important that future studies consider the interaction of age and genetic risk factors in the development of cognitive impairment in PD.

  17. Screening white spot syndrome virus (WSSV)-resistant molecular markers from Fenneropenaeus chinensis

    NASA Astrophysics Data System (ADS)

    Wu, Yingying; Meng, Xianhong; Kong, Jie; Luan, Sheng; Luo, Kun; Wang, Qingyin; Zheng, Yongyun

    2017-02-01

    White spot syndrome virus (WSSV)-resistant molecular markers were screened from the selectively bred new variety 'Huanghai No. 2' of Fenneropenaeus chinensis using the unlabeled-probe high-resolution melting (HRM) technique. After artificial infection with WSSV, the first 96 dead shrimp and the last 96 surviving shrimp were collected, representing WSSV-susceptible and -resistant populations, respectively. The genotypes at 39 well-developed single nucleotide polymorphism (SNP) loci were obtained. As revealed by the Chi-square test, 3 genotypes, A/A of contig C364-89AT, A/A of C2635-527CA and C/T of contig C12355-592CT, were positively correlated with disease-resistance traits. Two other genotypes, G/G of contig C283-145AG and C/C of contig C12355-592CT, were negatively correlated. Moreover, analysis of the disease-resistant SNPs with the BlastX program indicated that 3 contigs, Contig283, Contig364 and Contig12355, matched the functional genes of effector caspase of Penaeus monodon, peptide transporter family 1-like protein, and 40S ribosomal protein S2 of Perca flavescens with high sequence similarity. The results provide theoretical and technical support for molecular marker-assisted selective breeding of F. chinensis.

  18. Association of vitamin D levels and risk of ovarian cancer: a Mendelian randomization study.

    PubMed

    Ong, Jue-Sheng; Cuellar-Partida, Gabriel; Lu, Yi; Fasching, Peter A; Hein, Alexander; Burghaus, Stefanie; Beckmann, Matthias W; Lambrechts, Diether; Van Nieuwenhuysen, Els; Vergote, Ignace; Vanderstichele, Adriaan; Anne Doherty, Jennifer; Anne Rossing, Mary; Chang-Claude, Jenny; Eilber, Ursula; Rudolph, Anja; Wang-Gohrke, Shan; Goodman, Marc T; Bogdanova, Natalia; Dörk, Thilo; Dürst, Matthias; Hillemanns, Peter; Runnebaum, Ingo B; Antonenkova, Natalia; Butzow, Ralf; Leminen, Arto; Nevanlinna, Heli; Pelttari, Liisa M; Edwards, Robert P; Kelley, Joseph L; Modugno, Francesmary; Moysich, Kirsten B; Ness, Roberta B; Cannioto, Rikki; Høgdall, Estrid; Høgdall, Claus K; Jensen, Allan; Giles, Graham G; Bruinsma, Fiona; Kjaer, Susanne K; Hildebrandt, Michelle AT; Liang, Dong; Lu, Karen H; Wu, Xifeng; Bisogna, Maria; Dao, Fanny; Levine, Douglas A; Cramer, Daniel W; Terry, Kathryn L; Tworoger, Shelley S; Stampfer, Meir; Missmer, Stacey; Bjorge, Line; Salvesen, Helga B; Kopperud, Reidun K; Bischof, Katharina; Aben, Katja KH; Kiemeney, Lambertus A; Massuger, Leon FAG; Brooks-Wilson, Angela; Olson, Sara H; McGuire, Valerie; Rothstein, Joseph H; Sieh, Weiva; Whittemore, Alice S; Cook, Linda S; Le, Nhu D; Gilks, C Blake; Gronwald, Jacek; Jakubowska, Anna; Lubiński, Jan; Kluz, Tomasz; Song, Honglin; Tyrer, Jonathan P; Wentzensen, Nicolas; Brinton, Louise; Trabert, Britton; Lissowska, Jolanta; McLaughlin, John R; Narod, Steven A; Phelan, Catherine; Anton-Culver, Hoda; Ziogas, Argyrios; Eccles, Diana; Campbell, Ian; Gayther, Simon A; Gentry-Maharaj, Aleksandra; Menon, Usha; Ramus, Susan J; Wu, Anna H; Dansonka-Mieszkowska, Agnieszka; Kupryjanczyk, Jolanta; Timorek, Agnieszka; Szafron, Lukasz; Cunningham, Julie M; Fridley, Brooke L; Winham, Stacey J; Bandera, Elisa V; Poole, Elizabeth M; Morgan, Terry K; Risch, Harvey A; Goode, Ellen L; Schildkraut, Joellen M; Pearce, Celeste L; Berchuck, Andrew; Pharoah, Paul DP; Chenevix-Trench, Georgia; Gharahkhani, Puya; Neale, Rachel E; Webb, Penelope M; MacGregor, Stuart

    2016-10-01

    In vitro and observational epidemiological studies suggest that vitamin D may play a role in cancer prevention. However, the relationship between vitamin D and ovarian cancer is uncertain, with observational studies generating conflicting findings. A potential limitation of observational studies is inadequate control of confounding. To overcome this problem, we used Mendelian randomization (MR) to evaluate the association between single nucleotide polymorphisms (SNPs) associated with circulating 25-hydroxyvitamin D [25(OH)D] concentration and risk of ovarian cancer. We employed SNPs with well-established associations with 25(OH)D concentration as instrumental variables for MR: rs7944926 (DHCR7), rs12794714 (CYP2R1) and rs2282679 (GC). We included 31 719 women of European ancestry (10 065 cases, 21 654 controls) from the Ovarian Cancer Association Consortium, who were genotyped using customized Illumina Infinium iSelect (iCOGS) arrays. A two-sample (summary data) MR approach was used and analyses were performed separately for all ovarian cancer (10 065 cases) and for high-grade serous ovarian cancer (4121 cases). The odds ratio for epithelial ovarian cancer risk (10 065 cases) estimated by combining the individual SNP associations using inverse variance weighting was 1.27 (95% confidence interval: 1.06 to 1.51) per 20 nmol/L decrease in 25(OH)D concentration. The estimated odds ratio for high-grade serous epithelial ovarian cancer (4121 cases) was 1.54 (1.19, 2.01). Genetically lowered 25-hydroxyvitamin D concentrations were associated with higher ovarian cancer susceptibility in Europeans. These findings suggest that increasing plasma vitamin D levels may reduce risk of ovarian cancer. © The Author 2016; all rights reserved. Published by Oxford University Press on behalf of the International Epidemiological Association.
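    The abstract above combines per-SNP associations by inverse variance weighting. A minimal sketch of the standard fixed-effect estimator for two-sample summary data follows. The three SNP effect sizes are invented for illustration and are not values from the study, and the function name is hypothetical.

```python
import math


def ivw_estimate(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance-weighted causal estimate.

    beta_exposure: per-SNP effects on the exposure (e.g. 25(OH)D level)
    beta_outcome:  per-SNP effects on the outcome (log odds ratios)
    se_outcome:    standard errors of the outcome effects
    Returns (estimate, standard_error) on the log-odds scale.
    """
    if not (len(beta_exposure) == len(beta_outcome) == len(se_outcome)):
        raise ValueError("all inputs must have the same length")
    numerator = sum(
        bx * by / se ** 2
        for bx, by, se in zip(beta_exposure, beta_outcome, se_outcome)
    )
    denominator = sum(
        bx ** 2 / se ** 2 for bx, se in zip(beta_exposure, se_outcome)
    )
    return numerator / denominator, math.sqrt(1.0 / denominator)


# Made-up summary statistics for three instruments (illustration only).
bx = [0.10, 0.20, 0.15]
by = [0.02, 0.04, 0.03]
se = [0.01, 0.01, 0.01]

estimate, std_err = ivw_estimate(bx, by, se)
odds_ratio = math.exp(estimate)
ci_low = math.exp(estimate - 1.96 * std_err)
ci_high = math.exp(estimate + 1.96 * std_err)
print(f"OR per unit exposure: {odds_ratio:.2f} ({ci_low:.2f} to {ci_high:.2f})")
```

    Each SNP's ratio estimate (outcome effect divided by exposure effect) is weighted by the precision of that ratio, so instruments with stronger exposure effects and tighter outcome standard errors contribute more.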

  19. Toll-like receptors genes polymorphisms and the occurrence of HCMV infection among pregnant women.

    PubMed

    Wujcicka, Wioletta; Paradowska, Edyta; Studzińska, Mirosława; Wilczyński, Jan; Nowakowska, Dorota

    2017-03-24

    Human cytomegalovirus (HCMV) is the most common cause of intrauterine infections worldwide. The toll-like receptors (TLRs) have been reported as important factors in immune response against HCMV. Particularly, TLR2, TLR4 and TLR9 have been shown to be involved in antiviral immunity. The role of single nucleotide polymorphisms (SNPs) located within the TLR2, TLR4 and TLR9 genes in the development of HCMV infection in pregnant women and their fetuses and neonates was evaluated. The study was performed for 131 pregnant women, including 66 patients infected with HCMV during pregnancy, and 65 age-matched control pregnant individuals. The patients were selected for the study based on serological status of anti-HCMV IgG and IgM antibodies and on the presence of viral DNA in their body fluids. Genotypes in TLR2 2258 A > G, TLR4 896 G > A and 1196 C > T and TLR9 2848 G > A SNPs were determined by self-designed nested PCR-RFLP assays. Randomly selected PCR products, representative of distinct genotypes in TLR SNPs, were confirmed by sequencing. The relationship between the genotypes, alleles, haplotypes and multiple variants in the studied polymorphisms and the occurrence of HCMV infection in pregnant women and their offspring was determined using a logistic regression model. Genotypes in all the analyzed polymorphisms preserved the Hardy-Weinberg equilibrium in pregnant women, both infected and uninfected with HCMV (P > 0.050). GG homozygotic and GA heterozygotic status in TLR9 2848 G > A SNP significantly decreased the occurrence of HCMV infection (OR 0.44, 95% CI 0.21-0.94 in the dominant model, P ≤ 0.050). The G allele in TLR9 SNP was significantly more frequent among the uninfected pregnant women than among the infected ones (χ² = 4.14, P ≤ 0.050). Considering other polymorphisms, similar frequencies of distinct genotypes, haplotypes and multiple-SNP variants were observed between the studied groups of patients. TLR9 2848 G > A SNP may be associated with HCMV infection in pregnant women.
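    The abstract above reports that genotypes preserved Hardy-Weinberg equilibrium. One common way to check this for a biallelic SNP is a chi-square goodness-of-fit test with 1 degree of freedom; the sketch below assumes that approach, which the abstract does not specify, and uses invented genotype counts rather than data from the study.

```python
def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> float:
    """Chi-square statistic comparing observed genotype counts with
    the counts expected under Hardy-Weinberg equilibrium."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("at least one genotype must be observed")
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p                       # frequency of allele B
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    # Skip classes with zero expected count (monomorphic locus).
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


# Critical value of the chi-square distribution, 1 df, alpha = 0.05.
CRITICAL_1DF_05 = 3.841

in_equilibrium = hwe_chi_square(25, 50, 25)     # matches HWE exactly
no_heterozygotes = hwe_chi_square(50, 0, 50)    # strong deviation

for label, stat in (("25/50/25", in_equilibrium), ("50/0/50", no_heterozygotes)):
    verdict = "consistent with HWE" if stat < CRITICAL_1DF_05 else "deviates from HWE"
    print(f"{label}: chi-square = {stat:.2f}, {verdict}")
```

    A statistic below 3.841 corresponds to P > 0.05, the criterion the abstract quotes for preserved equilibrium.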

  20. SNPs at 3'-UTR of the bovine CDIPT gene associated with Qinchuan cattle meat quality traits.

    PubMed

    Fu, C Z; Wang, H; Mei, C G; Wang, J L; Jiang, B J; Ma, X H; Wang, H B; Cheng, G; Zan, L S

    2013-03-13

    CDIPT is crucial to the fatty acid metabolic pathway, intracellular signal transduction and energy metabolism in eukaryotic cells. We detected three SNPs in the 3'-untranslated region (UTR) of the CDIPT gene, named 3'-UTR_108 A > G, 3'-UTR_448 G > A and 3'-UTR_477 C > G, in 618 Qinchuan cattle using PCR-RFLP and DNA sequencing methods. At each of the three SNPs, we found three genotypes, named as follows: AA, AB, BB (3'-UTR_108 A > G); CC, CD, DD (3'-UTR_448 G > A); and EE, EF, FF (3'-UTR_477 C > G). Based on association analysis of these SNPs with ultrasound measurement traits, individuals of genotype BB had a significantly larger loin muscle area than those of genotype AA. Individuals of genotype CC had significantly thicker back fat than individuals of genotype DD. Individuals of genotype EE also had significantly thicker back fat than did individuals of genotype FF. We conclude that these SNPs of the CDIPT gene could be used as molecular markers for selecting and breeding beef cattle with superior body traits, depending on breeding goals.

  1. Design of a 9K illumina BeadChip for polar bears (Ursus maritimus) from RAD and transcriptome sequencing.

    PubMed

    Malenfant, René M; Coltman, David W; Davis, Corey S

    2015-05-01

    Single-nucleotide polymorphisms (SNPs) offer numerous advantages over anonymous markers such as microsatellites, including improved estimation of population parameters, finer-scale resolution of population structure and more precise genomic dissection of quantitative traits. However, many SNPs are needed to equal the resolution of a single microsatellite, and reliable large-scale genotyping of SNPs remains a challenge in nonmodel species. Here, we document the creation of a 9K Illumina Infinium BeadChip for polar bears (Ursus maritimus), which will be used to investigate: (i) the fine-scale population structure among Canadian polar bears and (ii) the genomic architecture of phenotypic traits in the Western Hudson Bay subpopulation. To this end, we used restriction-site associated DNA (RAD) sequencing from 38 bears across their circumpolar range, as well as blood/fat transcriptome sequencing of 10 individuals from Western Hudson Bay. Six thousand RAD SNPs and 3000 transcriptomic SNPs were selected for the chip, based primarily on genomic spacing and gene function, respectively. Of the 9000 SNPs ordered from Illumina, 8042 were successfully printed, and, after genotyping 1450 polar bears, 5441 of these SNPs were found to be well clustered and polymorphic. Using this array, we show rapid linkage disequilibrium decay among polar bears, and we demonstrate that, in a subsample of 78 individuals, our SNPs detect known genetic structure more clearly than 24 microsatellites genotyped for the same individuals and that these results are not driven by the SNP ascertainment scheme. Here, we present one of the first large-scale genotyping resources designed for a threatened species. © 2014 John Wiley & Sons Ltd.

  2. Genome-wide DNA polymorphisms in two cultivars of mei (Prunus mume sieb. et zucc.).

    PubMed

    Sun, Lidan; Zhang, Qixiang; Xu, Zongda; Yang, Weiru; Guo, Yu; Lu, Jiuxing; Pan, Huitang; Cheng, Tangren; Cai, Ming

    2013-10-06

    Mei (Prunus mume Sieb. et Zucc.) is a famous ornamental plant and fruit crop grown in East Asian countries. Limited genetic resources, especially molecular markers, have hindered the progress of mei breeding projects. Here, we performed low-depth whole-genome sequencing of Prunus mume 'Fenban' and Prunus mume 'Kouzi Yudie' to identify high-quality polymorphic markers between the two cultivars on a large scale. A total of 1464.1 Mb and 1422.1 Mb of 'Fenban' and 'Kouzi Yudie' sequencing data, respectively, were uniquely mapped to the mei reference genome with about 6-fold coverage. We detected a large number of putative polymorphic markers from the 196.9 Mb of sequencing data shared by the two cultivars, which together contained 200,627 SNPs, 4,900 InDels, and 7,063 SSRs. Among these markers, 38,773 SNPs, 174 InDels, and 418 SSRs were distributed in the 22.4 Mb CDS region, and 63.0% of these marker-containing CDS sequences were assigned to GO terms. Subsequently, 670 selected SNPs were validated using Agilent's SureSelect solution phase hybridization assay. A subset of 599 SNPs was used to assess the genetic similarity of a panel of mei germplasm samples and a plum (P. salicina) cultivar, producing a set of informative diversity data. We also analyzed the frequency and distribution of detected InDels and SSRs in the mei genome and validated their usefulness as DNA markers. These markers were successfully amplified in the cultivars and in their segregating progeny. A large set of high-quality polymorphic SNPs, InDels, and SSRs was identified in parallel between 'Fenban' and 'Kouzi Yudie' using low-depth whole-genome sequencing. The study presents extensive data on these polymorphic markers, which can be useful for constructing high-resolution genetic maps, performing genome-wide association studies, and designing genomic selection strategies in mei.

  3. Evolution of the Bovine TLR Gene Family and Member Associations with Mycobacterium avium Subspecies paratuberculosis Infection

    PubMed Central

    Fisher, Colleen A.; Bhattarai, Eric K.; Osterstock, Jason B.; Dowd, Scot E.; Seabury, Paul M.; Vikram, Meenu; Whitlock, Robert H.; Schukken, Ynte H.; Schnabel, Robert D.; Taylor, Jeremy F.; Womack, James E.; Seabury, Christopher M.

    2011-01-01

    Members of the Toll-like receptor (TLR) gene family occupy key roles in the mammalian innate immune system by functioning as sentries for the detection of invading pathogens, thereafter provoking host innate immune responses. We utilized a custom next-generation sequencing approach and allele-specific genotyping assays to detect and validate 280 biallelic variants across all 10 bovine TLR genes, including 71 nonsynonymous single nucleotide polymorphisms (SNPs) and one putative nonsense SNP. Bayesian haplotype reconstructions and median joining networks revealed haplotype sharing between Bos taurus taurus and Bos taurus indicus breeds at every locus, and specialized beef and dairy breeds could not be differentiated despite an average polymorphism density of 1 marker/158 bp. Collectively, 160 tagSNPs and two tag insertion-deletion mutations (indels) were sufficient to predict 100% of the variation at 280 variable sites for both Bos subspecies and their hybrids, whereas 118 tagSNPs and 1 tagIndel predictively captured 100% of the variation at 235 variable sites for B. t. taurus. Polyphen and SIFT analyses of amino acid (AA) replacements encoded by bovine TLR SNPs indicated that up to 32% of the AA substitutions were expected to impact protein function. Classical and newly developed tests of diversity provide strong support for balancing selection operating on TLR3 and TLR8, and purifying selection acting on TLR10. An investigation of the persistence and continuity of linkage disequilibrium (r2≥0.50) between adjacent variable sites also supported the presence of selection acting on TLR3 and TLR8. A case-control study employing validated variants from bovine TLR genes recognizing bacterial ligands revealed six SNPs potentially eliciting small effects on susceptibility to Mycobacterium avium spp paratuberculosis infection in dairy cattle. 
    The results of this study will broadly impact domestic cattle research by providing the necessary foundation to explore several avenues of bovine translational genomics, and the potential for marker-assisted vaccination. PMID:22164200
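    The linkage disequilibrium threshold quoted in this abstract (r² ≥ 0.50 between adjacent variable sites) can be illustrated with a minimal sketch. It assumes phased haplotype counts are available for two biallelic sites; the function name and the example counts are hypothetical and are not taken from the study.

```python
def ld_r2(n_AB: int, n_Ab: int, n_aB: int, n_ab: int) -> float:
    """Return r^2 between two biallelic sites from phased haplotype counts.

    n_AB, n_Ab, n_aB, n_ab are counts of the four possible haplotypes.
    """
    total = n_AB + n_Ab + n_aB + n_ab
    if total == 0:
        raise ValueError("no haplotypes supplied")
    p_A = (n_AB + n_Ab) / total          # frequency of allele A at site 1
    p_B = (n_AB + n_aB) / total          # frequency of allele B at site 2
    d = n_AB / total - p_A * p_B         # disequilibrium coefficient D
    denom = p_A * (1 - p_A) * p_B * (1 - p_B)
    if denom == 0:
        raise ValueError("r^2 is undefined when a site is monomorphic")
    return d * d / denom


# Hypothetical counts, for illustration only.
r2 = ld_r2(40, 10, 10, 40)
passes_threshold = r2 >= 0.50            # threshold quoted in the abstract
```

    With unphased genotype data, haplotype frequencies would first have to be estimated (the abstract mentions Bayesian haplotype reconstruction), which this sketch does not attempt.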

  4. Identification of novel microRNA genes in freshwater and marine ecotypes of the three-spined stickleback (Gasterosteus aculeatus).

    PubMed

    Rastorguev, S M; Nedoluzhko, A V; Sharko, F S; Boulygina, E S; Sokolov, A S; Gruzdeva, N M; Skryabin, K G; Prokhortchouk, E B

    2016-11-01

    The three-spined stickleback (Gasterosteus aculeatus L.) is an important model organism for studying the molecular mechanisms of speciation and adaptation to salinity. Despite increased interest in microRNA discovery and a recent publication on microRNA prediction in the three-spined stickleback using bioinformatics approaches, there is still a lack of experimental support for these data. In this paper, high-throughput sequencing technology was applied to identify microRNA genes in gills of the three-spined stickleback. In total, 595 miRNA genes were discovered; half of them were predicted in previous computational studies and were confirmed here as microRNAs expressed in gill tissue. Moreover, 298 novel microRNA genes were identified. The presence of miRNA genes in selected 'divergence islands' was analysed and 10 miRNA genes were identified as not randomly located in 'divergence islands'. Regulatory regions of miRNA genes were found enriched with selective SNPs that may play a role in freshwater adaptation. © 2016 John Wiley & Sons Ltd.

  5. WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation

    PubMed Central

    2013-01-01

    Background: SNPs&GO is a method for the prediction of deleterious Single Amino acid Polymorphisms (SAPs) using protein functional annotation. In this work, we present the web server implementation of SNPs&GO (WS-SNPs&GO). The server is based on Support Vector Machines (SVM) and, for a given protein, its input comprises the sequence and/or its three-dimensional structure (when available), a set of target variations and its functional Gene Ontology (GO) terms. The output of the server provides, for each protein variation, the probability of association with human disease. Results: The server consists of two main components, including updated versions of the sequence-based SNPs&GO (recently scored as one of the best algorithms for predicting deleterious SAPs) and of the structure-based SNPs&GO3d programs. Sequence and structure based algorithms are extensively tested on a large set of annotated variations extracted from the SwissVar database. Selecting a balanced dataset with more than 38,000 SAPs, the sequence-based approach achieves 81% overall accuracy, 0.61 correlation coefficient and an Area Under the Curve (AUC) of the Receiver Operating Characteristic (ROC) curve of 0.88. For the subset of ~6,600 variations mapped on protein structures available at the Protein Data Bank (PDB), the structure-based method scores with 84% overall accuracy, 0.68 correlation coefficient, and 0.91 AUC. When tested on a new blind set of variations, the results of the server are 79% and 83% overall accuracy for the sequence-based and structure-based inputs, respectively. Conclusions: WS-SNPs&GO is a valuable tool that includes in a unique framework information derived from protein sequence, structure, evolutionary profile, and protein function. WS-SNPs&GO is freely available at http://snps.biofold.org/snps-and-go. PMID:23819482

  6. Single Nucleotide Polymorphisms of Stemness Genes Predicted to Regulate RNA Splicing, microRNA and Oncogenic Signaling are Associated with Prostate Cancer Survival.

    PubMed

    Freedman, Jennifer A; Wang, Yanru; Li, Xuechan; Liu, Hongliang; Moorman, Patricia G; George, Daniel J; Lee, Norman H; Hyslop, Terry; Wei, Qingyi; Patierno, Steven R

    2018-05-03

    Prostate cancer is a clinically and molecularly heterogeneous disease, with variation in outcomes only partially predicted by grade and stage. Additional tools to distinguish indolent from aggressive disease are needed. Phenotypic characteristics of stemness correlate with poor cancer prognosis. Given this correlation, we identified single nucleotide polymorphisms (SNPs) of stemness-related genes and examined their associations with prostate cancer survival. SNPs within stemness-related genes were analyzed for association with overall survival of prostate cancer in the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial. Significant SNPs predicted to be functional were selected for linkage disequilibrium analysis and combined and stratified analyses. Identified SNPs were evaluated for association with gene expression. SNPs of CD44 (rs9666607), ABCC1 (rs35605 and rs212091) and GDF15 (rs1058587) were associated with prostate cancer survival and predicted to be functional. A role for rs9666607 of CD44 and rs35605 of ABCC1 in RNA splicing regulation, rs212091 of ABCC1 in miRNA binding site activity and rs1058587 of GDF15 in causing an amino acid change was predicted. These SNPs represent potential novel prognostic markers for overall survival of prostate cancer and support a contribution of the stemness pathway to prostate cancer patient outcome.

  7. Oxytocin receptor gene variations predict neural and behavioral response to oxytocin in autism

    PubMed Central

    Watanabe, Takamitsu; Otowa, Takeshi; Abe, Osamu; Kuwabara, Hitoshi; Aoki, Yuta; Natsubori, Tatsunobu; Takao, Hidemasa; Kakiuchi, Chihiro; Kondo, Kenji; Ikeda, Masashi; Iwata, Nakao; Kasai, Kiyoto; Sasaki, Tsukasa

    2017-01-01

    Oxytocin appears beneficial for autism spectrum disorder (ASD), and more than 20 single-nucleotide polymorphisms (SNPs) in oxytocin receptor (OXTR) are relevant to ASD. However, neither the biological functions of OXTR SNPs in ASD nor the critical OXTR SNPs that determine oxytocin’s effects on ASD are known. Here, using a machine-learning algorithm that was designed to evaluate collective effects of multiple SNPs and automatically identify the most informative SNPs, we examined relationships between 27 representative OXTR SNPs and six types of behavioral/neural response to oxytocin in ASD individuals. The oxytocin effects were extracted from our previous placebo-controlled within-participant clinical trial administering single-dose intranasal oxytocin to 38 high-functioning adult Japanese ASD males. Consequently, we identified six different SNP sets that could accurately predict the six different oxytocin efficacies, and confirmed the robustness of these SNP selections against variations of the datasets and analysis parameters. Moreover, major alleles of several prominent OXTR SNPs, including rs53576 and rs2254298, were found to have dissociable effects on the oxytocin efficacies. These findings suggest biological functions of the OXTR SNP variants on autistic oxytocin responses, and imply that clinical oxytocin efficacy may be genetically predicted before its actual administration, which would contribute to establishment of future precision medicines for ASD. PMID:27798253

  8. SNPs selection using support vector regression and genetic algorithms in GWAS

    PubMed Central

    2014-01-01

    Introduction: This paper proposes a new methodology to simultaneously select the most relevant SNP markers for the characterization of any measurable phenotype described by a continuous variable, using Support Vector Regression with the Pearson Universal kernel (PUK) as the fitness function of a binary genetic algorithm. The proposed methodology is multi-attribute, considering several markers simultaneously to explain the phenotype, and is based jointly on statistical tools, machine learning and computational intelligence. Results: The suggested method has shown potential in simulated database 1, with additive effects only, and in the real database. In this simulated database, with a total of 1,000 markers, of which 7 have a major effect on the phenotype and the other 993 SNPs represent noise, the method identified 21 markers. Of this total, 5 are among the 7 relevant SNPs and 16 are false positives. In the real database, initially with 50,752 SNPs, we reduced the set to 3,073 markers, increasing the accuracy of the model. In simulated database 2, with additive effects and interactions (epistasis), the proposed method matched the methodology most commonly used in GWAS. Conclusions: The method suggested in this paper is effective in explaining the real phenotype (PTA for milk): applying the wrapper based on a genetic algorithm and Support Vector Regression with the Pearson Universal kernel eliminated many redundant markers, improving the prediction and accuracy of the model on the real database without quality control filters. The PUK demonstrated that it can replicate the performance of linear and RBF kernels. PMID:25573332
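    The simulated-database result reported above (21 markers selected, 5 of the 7 major-effect SNPs recovered, 16 false positives) can be restated as precision and recall. The sketch below is only a worked calculation on those counts; the function name is hypothetical, and the abstract itself does not report these two metrics.

```python
def precision_recall(true_pos: int, false_pos: int, false_neg: int) -> tuple[float, float]:
    """Return (precision, recall) for a marker-selection result."""
    selected = true_pos + false_pos      # markers the method kept
    relevant = true_pos + false_neg      # markers that truly matter
    if selected == 0 or relevant == 0:
        raise ValueError("precision or recall is undefined for empty sets")
    return true_pos / selected, true_pos / relevant


# Counts from the abstract: 7 major-effect SNPs, 21 selected, 5 of them relevant.
major_effect_snps = 7
true_pos, false_pos = 5, 16
false_neg = major_effect_snps - true_pos   # 2 relevant SNPs were missed

precision, recall = precision_recall(true_pos, false_pos, false_neg)
print(f"precision = {precision:.3f}, recall = {recall:.3f}")
```

    Read this way, the method recovers most of the major-effect markers in simulated database 1 while keeping roughly three false positives for every true one.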

  9. Landscape genomic analysis of candidate genes for climate adaptation in a California endemic oak, Quercus lobata.

    PubMed

    Sork, Victoria L; Squire, Kevin; Gugger, Paul F; Steele, Stephanie E; Levy, Eric D; Eckert, Andrew J

    2016-01-01

    The ability of California tree populations to survive anthropogenic climate change will be shaped by the geographic structure of adaptive genetic variation. Our goal is to test whether climate-associated candidate genes show evidence of spatially divergent selection in natural populations of valley oak, Quercus lobata, as preliminary indication of local adaptation. Using DNA from 45 individuals from 13 localities across the species' range, we sequenced portions of 40 candidate genes related to budburst/flowering, growth, osmotic stress, and temperature stress. Using 195 single nucleotide polymorphisms (SNPs), we estimated genetic differentiation across populations and correlated allele frequencies with climate gradients using single-locus and multivariate models. The top 5% of FST estimates ranged from 0.25 to 0.68, yielding loci potentially under spatially divergent selection. Environmental analyses of SNP frequencies with climate gradients revealed three significantly correlated SNPs within budburst/flowering genes and two SNPs within temperature stress genes with mean annual precipitation, after controlling for multiple testing. A redundancy model showed a significant association between SNPs and climate variables and revealed a similar set of SNPs with high loadings on the first axis. In the RDA, climate accounted for 67% of the explained variation, when holding climate constant, in contrast to a putatively neutral SSR data set where climate accounted for only 33%. Population differentiation and geographic gradients of allele frequencies in climate-associated functional genes in Q. lobata provide initial evidence of adaptive genetic variation and background for predicting population response to climate change. © 2016 Botanical Society of America.
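    For readers unfamiliar with the FST estimates mentioned above, the following minimal sketch computes a simple heterozygosity-based FST for one biallelic SNP. It assumes equally weighted populations and uses made-up allele frequencies; the study's actual estimator and sample-size corrections are not described in the abstract and are not reproduced here.

```python
def fst_biallelic(allele_freqs: list[float]) -> float:
    """Return FST = (H_T - H_S) / H_T for one biallelic SNP.

    allele_freqs holds the frequency of one allele in each population;
    populations are weighted equally (an assumption of this sketch).
    """
    if not allele_freqs:
        raise ValueError("at least one population is required")
    p_bar = sum(allele_freqs) / len(allele_freqs)
    h_total = 2 * p_bar * (1 - p_bar)                      # pooled expected heterozygosity
    h_within = sum(2 * p * (1 - p) for p in allele_freqs) / len(allele_freqs)
    if h_total == 0:
        raise ValueError("FST is undefined for a SNP fixed in all populations")
    return (h_total - h_within) / h_total


# Hypothetical frequencies in two localities, for illustration only.
divergent = fst_biallelic([0.2, 0.8])
uniform = fst_biallelic([0.5, 0.5])
```

    A SNP whose allele frequency differs strongly between localities yields a high value, while identical frequencies give zero, which is the contrast the outlier approach in the abstract relies on.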

  10. Estimated allele substitution effects underlying genomic evaluation models depend on the scaling of allele counts.

    PubMed

    Bouwman, Aniek C; Hayes, Ben J; Calus, Mario P L

    2017-10-30

    Genomic evaluation is used to predict direct genomic values (DGV) for selection candidates in breeding programs, but also to estimate allele substitution effects (ASE) of single nucleotide polymorphisms (SNPs). Scaling of allele counts influences the estimated ASE, because scaling of allele counts results in less shrinkage towards the mean for low minor allele frequency (MAF) variants. Scaling may become relevant for estimating ASE as more low MAF variants will be used in genomic evaluations. We show the impact of scaling on estimates of ASE using real data and a theoretical framework, and in terms of power, model fit and predictive performance. In a dairy cattle dataset with 630 K SNP genotypes, the correlation between DGV for stature from a random regression model using centered allele counts (RRc) and centered and scaled allele counts (RRcs) was 0.9988, whereas the overall correlation between ASE using RRc and RRcs was 0.27. The main difference in ASE between both methods was found for SNPs with a MAF lower than 0.01. Both the ratio (ASE from RRcs/ASE from RRc) and the regression coefficient (regression of ASE from RRcs on ASE from RRc) were much higher than 1 for low MAF SNPs. Derived equations showed that scenarios with a high heritability, a large number of individuals and a small number of variants have lower ratios between ASE from RRc and RRcs. We also investigated the optimal scaling parameter [from -1 (RRcs) to 0 (RRc) in steps of 0.1] in the bovine stature dataset. We found that the log-likelihood was maximized with a scaling parameter of -0.8, while the mean squared error of prediction was minimized with a scaling parameter of -1, i.e., RRcs. Large differences in estimated ASE were observed for low MAF SNPs when allele counts were scaled or not scaled because there is less shrinkage towards the mean for scaled allele counts.
    We derived a theoretical framework that shows that the difference in ASE due to shrinkage is heavily influenced by the power of the data. Increasing the power results in smaller differences in ASE whether allele counts are scaled or not.
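    The difference between centered (RRc) and centered-and-scaled (RRcs) allele counts can be sketched as follows. The exponent form used here, multiplying centered counts by (2p(1-p)) raised to half the scaling parameter, is a common parameterization and is an assumption of this sketch; the abstract only states that the parameter runs from -1 (RRcs) to 0 (RRc). Function names and example genotypes are hypothetical.

```python
def scaling_factor(p: float, alpha: float) -> float:
    """Return (2p(1-p)) ** (alpha / 2) for allele frequency p."""
    variance = 2 * p * (1 - p)           # expected genotype variance under HWE
    if variance == 0:
        raise ValueError("monomorphic SNP cannot be scaled")
    return variance ** (alpha / 2)


def transform_allele_counts(counts: list[int], alpha: float) -> list[float]:
    """Center allele counts (0/1/2) and scale them with parameter alpha.

    alpha = 0 leaves the centered counts unscaled (RRc);
    alpha = -1 divides by the genotype standard deviation (RRcs).
    """
    p = sum(counts) / (2 * len(counts))  # allele frequency estimated from the sample
    factor = scaling_factor(p, alpha)
    return [(c - 2 * p) * factor for c in counts]


genotypes = [0, 1, 2, 1]                 # hypothetical allele counts, p = 0.5
rrc = transform_allele_counts(genotypes, alpha=0.0)
rrcs = transform_allele_counts(genotypes, alpha=-1.0)

# The factor grows as MAF falls, so rare variants get larger covariate values.
common_factor = scaling_factor(0.5, -1.0)
rare_factor = scaling_factor(0.01, -1.0)
```

    Because the scaled covariate for a rare variant is several times larger than the unscaled one, the same penalty shrinks its estimated effect less, which is consistent with the low-MAF differences the abstract reports.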

  11. Fast Screening Technology for Drug Emergency Management: Predicting Suspicious SNPs for ADR with Information Theory-based Models.

    PubMed

    Liang, Zhaohui; Liu, Jun; Huang, Jimmy X; Zeng, Xing

    2018-01-01

    The genetic polymorphism of cytochrome P450 (CYP450) is considered one of the main causes of adverse drug reactions (ADRs). To explore the latent correlations between ADRs and potentially corresponding single-nucleotide polymorphisms (SNPs) in CYP450, three algorithms based on information theory are used as the main method to predict the possible relation. The study uses a retrospective case-control design to explore the potential relation of ADRs to specific genomic locations and SNPs. The genomic data collected from 53 healthy volunteers are used for the analysis; another group of genomic data collected from 30 healthy volunteers excluded from the study is used as the control group. The SNPs on five loci, CYP2D6*2, *10, *14 and CYP1A2*1C, *1F, are detected with the Applied Biosystems 3130xl. The raw data are processed by ChromasPro to detect the specific alleles on the above loci from each sample. The secondary data are reorganized and processed in R, combined with the reports of ADRs from clinical reports. Three information theory based algorithms are implemented for the screening task: JMI, CMIM, and mRMR. If a SNP is selected by more than two algorithms, we conclude with confidence that it is related to the corresponding ADR. The selection results are compared with the control decision tree + LASSO regression model. In the study group where ADRs occur, 10 SNPs are considered relevant to the occurrence of a specific ADR by the combined information theory model. In comparison, only 5 SNPs are considered relevant to a specific ADR by the decision tree + LASSO regression model. In addition, the new method detects more relevant pairs of SNP and ADR which are affected by both SNP and dosage. This implies that the new information theory based model is effective in discovering correlations between ADRs and CYP450 SNPs and is helpful in predicting the potentially vulnerable genotype for some ADRs.
    The newly proposed information theory based model performs better at detecting the relation between SNP and ADR than the decision tree + LASSO regression model. The new model is more sensitive in detecting ADRs than the old method, while the old method is more reliable. The choice of algorithm should therefore depend on practical needs. Copyright © Bentham Science Publishers.
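    The agreement rule described in this abstract, where a SNP is kept only when enough of the three selectors (JMI, CMIM, mRMR) pick it, amounts to a vote count. The sketch below shows that step only. The selector outputs and SNP labels are hypothetical, and the vote threshold is left as a parameter because "more than two" of three algorithms, read literally, means all three.

```python
from collections import Counter


def consensus_snps(selections: dict[str, list[str]], min_votes: int) -> list[str]:
    """Return SNPs chosen by at least min_votes feature-selection algorithms.

    selections maps an algorithm name to the SNP labels it selected.
    """
    votes = Counter()
    for selected in selections.values():
        votes.update(set(selected))      # each algorithm votes once per SNP
    return sorted(snp for snp, n in votes.items() if n >= min_votes)


# Hypothetical selector outputs, for illustration only.
example = {
    "JMI": ["snp_1", "snp_2"],
    "CMIM": ["snp_2", "snp_3"],
    "mRMR": ["snp_2", "snp_1"],
}
at_least_two = consensus_snps(example, min_votes=2)
all_three = consensus_snps(example, min_votes=3)
```

    Lowering the threshold trades reliability for sensitivity, which mirrors the trade-off the authors describe between the new and old models.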

  12. Genotyping analysis and ¹⁸FDG uptake in breast cancer patients: a preliminary research.

    PubMed

    Bravatà, Valentina; Stefano, Alessandro; Cammarata, Francesco P; Minafra, Luigi; Russo, Giorgio; Nicolosi, Stefania; Pulizzi, Sabina; Gelfi, Cecilia; Gilardi, Maria C; Messa, Cristina

    2013-04-30

    Diagnostic imaging plays a relevant role in the care of patients with breast cancer (BC). Positron Emission Tomography (PET) with 18F-fluoro-2-deoxy-D-glucose (FDG) has been widely proven to be a clinical tool suitable for BC detection and staging in which the glucose analog supplies metabolic information about the tumor. A limited number of studies, sometimes controversial, describe possible associations between FDG uptake and single nucleotide polymorphisms (SNPs). For this reason this field has to be explored and clarified. We investigated the association of SNPs in GLUT1, HIF-1a, EPAS1, APEX1, VEGFA and MTHFR genes with the FDG uptake in BC. In 26 caucasian individuals with primary BC, whole-body PET-CT scans were obtained and quantitative analysis was performed by calculating the maximum Standardized Uptake Value normalized to body-weight (SUVmax) and the mean SUV normalized to body-weight corrected for partial volume effect (SUVpvc). Human Gene Mutation Database and dbSNP Short Genetic Variations database were used to analyze gene regions containing the selected SNPs. Patient genotypes were obtained using Sanger DNA sequencing analysis performed by Capillary Electrophoresis. BC patients were genotyped for the following nine SNPs: GLUT1: rs841853 and rs710218; HIF-1a: rs11549465 and rs11549467; EPAS1: rs137853037 and rs137853036; APEX1: rs1130409; VEGFA: rs3025039 and MTHFR: rs1801133. In this work correlations between the nine potentially useful polymorphisms selected and previously suggested with tracer uptake (using both SUVmax and SUVpvc) were not found. The possible functional influence of specific SNPs on FDG uptake needs further studies in human cancer. In summary, this is the first pilot study, to our knowledge, which investigates the association between a large panel of SNPs and FDG uptake specifically in BC patients. 
    This work represents a multidisciplinary and translational medicine approach to studying BC, in which the possible correlation between SNPs and tracer uptake may be considered to improve personalized cancer treatment and care.

  13. RTEL1 tagging SNPs and haplotypes were associated with glioma development.

    PubMed

    Li, Gang; Jin, Tianbo; Liang, Hongjuan; Zhang, Zhiguo; He, Shiming; Tu, Yanyang; Yang, Haixia; Geng, Tingting; Cui, Guangbin; Chen, Chao; Gao, Guodong

    2013-05-17

    Glioma ranks as the most prevalent primary solid tumor of the central nervous system, and certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk and have implications in carcinogenesis. The present case-control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been reported in the literature and in reliable databases to be related to gliomas, with a minor allele frequency (MAF) >5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the calibrated multiplexed SNP MassEXTEND assay. Two tSNPs in the RTEL1 gene were significantly associated with glioma risk (rs6010620, P=0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P=0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ² test. The genotype "GG" of rs6010620 acted as a protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P=0.0002), as did the genotype "CC" of rs2297440 (OR, 0.47; 95% CI, 0.31-0.71; P=0.0003). Furthermore, haplotype "GCT" in the RTEL1 gene was associated with risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher's P=0.0005; Pearson's P=0.0005), as was haplotype "ATT" (OR, 1.32; 95% CI, 1.12-1.57; Fisher's P=0.0013; Pearson's P=0.0013). In summary, two single variants in the RTEL1 gene, the genotypes "GG" of rs6010620 and "CC" of rs2297440, together with the two haplotypes GCT and ATT, were associated with glioma development. Screening for these RTEL1 tagging SNPs and haplotypes might be used to evaluate the risk of glioma development. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998.
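    Odds ratios with 95% confidence intervals, as reported throughout this abstract, are commonly obtained from a 2×2 table of allele or genotype counts. The sketch below uses the standard log-odds (Woolf) interval as an assumption; the abstract does not say how its intervals were computed, and the counts shown are invented, since the underlying tables are not given.

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96) -> tuple[float, float, float]:
    """Return (OR, lower, upper) for a 2x2 table.

    a = cases with the allele,    b = cases without it,
    c = controls with the allele, d = controls without it.
    z = 1.96 gives an approximate 95% interval.
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this method")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)   # standard error of ln(OR)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only.
or_value, ci_low, ci_high = odds_ratio_ci(20, 10, 10, 20)
```

    An interval that excludes 1, like those quoted for rs6010620 and rs2297440, indicates an association at the chosen confidence level; values below 1 correspond to the protective genotypes and haplotype.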

  14. Regulatory element-based prediction identifies new susceptibility regulatory variants for osteoporosis.

    PubMed

    Yao, Shi; Guo, Yan; Dong, Shan-Shan; Hao, Ruo-Han; Chen, Xiao-Feng; Chen, Yi-Xiao; Chen, Jia-Bin; Tian, Qing; Deng, Hong-Wen; Yang, Tie-Lin

    2017-08-01

    Although genome-wide association studies (GWASs) have identified many susceptibility genes for osteoporosis, a large part of the heritability remains missing. Integrating regulatory information and GWASs could offer new insights into the biological link between the susceptibility SNPs and osteoporosis. We generated five machine learning classifiers with osteoporosis-associated variants and regulatory features data. We selected the optimal classifier and applied it to genome-wide SNPs to discover susceptibility regulatory variants. We further utilized Genetic Factors for Osteoporosis Consortium (GEFOS) and three in-house GWASs samples to validate the associations for predicted positive SNPs. The random forest classifier performed best among all machine learning methods with the F1 score of 0.8871. Using the optimized model, we predicted 37,584 candidate SNPs for osteoporosis. According to the meta-analysis results, a list of regulatory variants was significantly associated with osteoporosis after multiple testing corrections and contributed to the expression of known osteoporosis-associated protein-coding genes. In summary, combining GWASs and regulatory elements through machine learning could provide additional information for understanding the mechanism of osteoporosis. The regulatory variants we predicted will provide novel targets for etiology research and treatment of osteoporosis.

  15. Data on polymorphisms in CYP2A6 associated to risk and predispose to smoking related variables.

    PubMed

    López-Flores, Luis A; Pérez-Rubio, Gloria; Ramírez-Venegas, Alejandra; Ambrocio-Ortiz, Enrique; Sansores, Raúl H; Falfán-Valencia, Ramcés

    2017-12-01

    This article contains data on the single nucleotide polymorphisms (SNPs) rs1137115, rs1801272, rs28399433 and rs4105144 in CYP2A6 associated with smoking related variables in Mexican Mestizo smokers (Pérez-Rubio et al., 2017) [1]. These SNPs were selected due to previous associations with other populations. Mexican Mestizo smokers were classified according to their smoking pattern. A genetic association test was performed.

  16. Integrating Milk Metabolite Profile Information for the Prediction of Traditional Milk Traits Based on SNP Information for Holstein Cows

    PubMed Central

    Melzer, Nina; Wittenburg, Dörte; Repsilber, Dirk

    2013-01-01

    In this study the benefit of metabolome level analysis for the prediction of genetic value of three traditional milk traits was investigated. Our proposed approach consists of three steps: First, milk metabolite profiles are used to predict three traditional milk traits of 1,305 Holstein cows. Two regression methods, both enabling variable selection, are applied to identify important milk metabolites in this step. Second, the prediction of these important milk metabolites from single nucleotide polymorphisms (SNPs) enables the detection of SNPs with significant genetic effects. Finally, these SNPs are used to predict milk traits. The observed precision of predicted genetic values was compared to the results observed for the classical genotype-phenotype prediction using all SNPs or a reduced SNP subset (reduced classical approach). To enable a comparison between SNP subsets, a special invariable evaluation design was implemented. SNPs close to or within known quantitative trait loci (QTL) were determined. This enabled us to determine if detected important SNP subsets were enriched in these regions. The results show that our approach can lead to genetic value prediction, but requires fewer than 1% of the total (40,317) SNPs. Significantly more important SNPs in known QTL regions were detected using our approach compared to the reduced classical approach. In conclusion, our approach allows a deeper insight into the associations between the different levels of the genotype-phenotype map (genotype-metabolome, metabolome-phenotype, genotype-phenotype). PMID:23990900

  17. Forensic genetic informativeness of an SNP panel consisting of 19 multi-allelic SNPs.

    PubMed

    Gao, Zehua; Chen, Xiaogang; Zhao, Yuancun; Zhao, Xiaohong; Zhang, Shu; Yang, Yiwen; Wang, Yufang; Zhang, Ji

    2018-05-01

    Current research focusing on forensic personal identification, phenotype inference and ancestry information on single-nucleotide polymorphisms (SNPs) has been widely reported. In the present study, we focused on tetra-allelic SNPs in the Chinese Han population. A total of 48 tetra-allelic SNPs were screened out from the Chinese Han population of the 1000 Genomes Database, including Chinese Han in Beijing (CHB) and Chinese Han South (CHS). Considering the forensic genetic requirement for the polymorphisms, only 11 tetra-allelic SNPs with a heterozygosity >0.06 were selected for further multiplex panel construction. In order to meet the demands of personal identification and parentage identification, an additional 8 tri-allelic SNPs were combined into the final multiplex panel. To ensure application in the degraded DNA analysis, all the PCR products were designed to be 87-188 bp. Employing multiple PCR reactions and SNaPshot minisequencing, 511 unrelated Chinese Han individuals from Sichuan were genotyped. The combined match probability (CMP), combined discrimination power (CDP), and cumulative probability of exclusion (CPE) of the panel were 6.07 × 10⁻¹¹, 0.9999999999393 and 0.996764, respectively. Based on the population data retrieved from the 1000 Genomes Project, Fst values between Chinese Han in Sichuan (SCH) and all the populations included in the 1000 Genomes Project were calculated. The results indicated that two SNPs in this panel may contain ancestry information and may be used as markers of forensic biogeographical ancestry inference. Copyright © 2018 Elsevier B.V. All rights reserved.
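    The two panel-level figures reported above are linked by a simple identity: the combined discrimination power is one minus the combined match probability, and the combined match probability is the product of the per-locus match probabilities. The sketch below checks the reported pair with exact decimal arithmetic. The function names are hypothetical, and per-locus values are not given in the abstract, so only the panel totals are used.

```python
from decimal import Decimal


def combined_match_probability(per_locus: list[Decimal]) -> Decimal:
    """Multiply per-locus match probabilities (loci assumed independent)."""
    product = Decimal(1)
    for probability in per_locus:
        product *= probability
    return product


def combined_discrimination_power(cmp_value: Decimal) -> Decimal:
    """Return CDP = 1 - CMP."""
    return Decimal(1) - cmp_value


# Panel total reported in the abstract: CMP = 6.07 x 10^-11.
reported_cmp = Decimal("6.07E-11")
cdp = combined_discrimination_power(reported_cmp)
print(cdp)
```

    Decimal arithmetic is used because binary floating point cannot represent a value this close to 1 with all thirteen reported digits intact.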

  18. Tissue expression and predicted protein structures of the bovine ANGPTL3 and association of novel SNPs with growth and meat quality traits.

    PubMed

    Chen, N B; Ma, Y; Yang, T; Lin, F; Fu, W W; Xu, Y J; Li, F; Li, J Y; Gao, S X

    2015-08-01

    Angiopoietin-like protein 3 (ANGPTL3) is a secreted protein that regulates lipid, glucose and energy metabolism. This study was conducted to better understand the effect of ANGPTL3 on important economic traits in cattle. First, transcript profiles for ANGPTL3 were measured in nine different Jiaxian cattle tissues. Second, polymorphisms were identified in the complete coding region and promoter region of the bovine ANGPTL3 gene in 707 cattle samples. Finally, an association study was carried out utilizing these single nucleotide polymorphisms (SNPs) to determine the effect of these SNPs on the growth and meat quality traits. Quantitative real-time PCR analysis showed that ANGPTL3 was mainly expressed in the liver. The promoter of the bovine ANGPTL3 contained several putative transcription factor binding sites (SF1, HNF-1, LXRα, NFκβ, HNF-3 and C/EBP). In total, four SNPs of the bovine ANGPTL3 gene were identified by direct sequencing. SNP1 (rs469906272: g.-38T>C) was identified in the promoter, SNP2 (rs451104723:g.104A>T) and SNP3 (rs482516226: g.509A>G) were identified in exon 1, and SNP4 (rs477165942: g.8661T>C) was identified in exon 6. Changes in predicted protein structures due to non-synonymous SNPs were analyzed. Haplotype frequencies and linkage disequilibrium were also investigated. Analysis of four SNPs in cattle from different native Chinese breeds (Nanyang (NY) and Jiaxian (JX)) and commercial breeds (Angus (AG), Hereford (HF), Limousin (LM), Luxi (LX), Simmental (ST) and Jinnan (JN)) revealed a significant association with growth traits (including: BW and hipbone width) and meat quality traits (including: Warner-Bratzler shear force and ribeye area). Therefore, implementation of these four mutations in selection indices in the beef industry may be beneficial in selecting individuals with superior growth and meat quality traits.

  19. A genome-wide association study reveals novel genomic regions and positional candidate genes for fat deposition in broiler chickens.

    PubMed

    Moreira, Gabriel Costa Monteiro; Boschiero, Clarissa; Cesar, Aline Silva Mello; Reecy, James M; Godoy, Thaís Fernanda; Trevisoli, Priscila Anchieta; Cantão, Maurício E; Ledur, Mônica Corrêa; Ibelli, Adriana Mércia Guaratini; Peixoto, Jane de Oliveira; Moura, Ana Silvia Alves Meira Tavares; Garrick, Dorian; Coutinho, Luiz Lehmann

    2018-05-21

    Excess fat content in chickens has a negative impact on poultry production. The discovery of QTL associated with fat deposition in the carcass allows the identification of positional candidate genes (PCGs) that might regulate fat deposition and be useful for selection against excess fat content in the chicken carcass. This study aimed to estimate genomic heritability coefficients and to identify QTLs and PCGs for abdominal fat (ABF) and skin (SKIN) traits in a broiler chicken population, originated from the White Plymouth Rock and White Cornish breeds. ABF and SKIN are moderately heritable traits in our broiler population with estimates ranging from 0.23 to 0.33. Using a high density SNP panel (355,027 informative SNPs), we detected nine unique QTLs that were associated with these fat traits. Among these, four QTLs were novel, while five have been previously reported in the literature. Thirteen PCGs were identified that might regulate fat deposition in these QTL regions: JDP2, PLCG1, HNF4A, FITM2, ADIPOR1, PTPN11, MVK, APOA1, APOA4, APOA5, ENSGALG00000000477, ENSGALG00000000483, and ENSGALG00000005043. We used sequence information from founder animals to detect 4843 SNPs in the 13 PCGs. Among those, two were classified as potentially deleterious and two as high impact SNPs. This study generated novel results that can contribute to a better understanding of fat deposition in chickens. The use of a high-density SNP array increases genome coverage and improves QTL resolution beyond what would have been achieved with a low-density array. The identified PCGs were involved in many biological processes that regulate lipid storage. The SNPs identified in the PCGs, especially those predicted as potentially deleterious and high impact, may affect fat deposition. Validation should be undertaken before using these SNPs for selection against carcass fat accumulation and to improve feed efficiency in broiler chicken production.

  20. Mendelian randomization shows sex-specific associations between long-chain PUFA-related genotypes and cognitive performance in Danish schoolchildren.

    PubMed

    Lauritzen, Lotte; Sørensen, Louise B; Harsløf, Laurine B; Ritz, Christian; Stark, Ken D; Astrup, Arne; Dyssegaard, Camilla B; Egelund, Niels; Michaelsen, Kim F; Damsgaard, Camilla T

    2017-07-01

    Background: Dietary and endogenously formed long-chain polyunsaturated fatty acids (LCPUFAs) are hypothesized to improve cognitive development, but results are inconclusive, with suggestions of sex specificity. One study suggested that single-nucleotide polymorphisms (SNPs) rs1535 and rs174448 in the fatty acid desaturase (FADS) gene cluster have opposite effects on erythrocyte LCPUFAs at 9 mo. Objective: To explore whether SNPs in FADS and elongase (ELOVL) genes were associated with school performance in a sex-specific manner, we performed a Mendelian randomization study using data from the Optimal well-being, development and health for Danish children through a healthy New Nordic Diet (OPUS) School Meal Study with 765 Danish schoolchildren 8-11 y old. Design: Associations between selected FADS1/2 SNPs (rs1535, rs174448, and rs174468) and ELOVL5 rs2397142, whole-blood fatty acid composition, and performance in the d2 Test of Attention and a reading test were analyzed in multiple regression models including all SNPs, SNP-sex interactions, and covariates related to testing conditions. Results: FADS rs1535 minor allele carriage associated with lower whole-blood arachidonic acid (P ≤ 0.002), and minor alleles of rs174448 tended to associate with lower docosahexaenoic acid (DHA) (P = 0.052). We identified sex interactions in 50% of the SNP performance sets. Sex-dependent associations were observed for rs174448 and rs1535 on the d2 Test of Attention outcomes (P < 0.03) and for the associations between reading scores and rs174448 and rs2397142 (P < 0.01). All of the sex-specific analyses showed associations in opposite directions in girls and boys. The minor allele carriage of rs174448 was associated with lower d2 Test of Attention performance (P < 0.02) and reading scores (P < 0.001) in boys but with better reading scores in girls (P ≤ 0.002). The associations were consistently the opposite for rs1535 minor allele carriage (P < 0.05).
    Associations with rs2397142 also appeared to be opposite of those of rs174448, but only for reading and not significant after adjustment for parental educational level and whole-blood DHA. Conclusions: This study showed associations between rs1535 minor allele homozygosity and rs174448 major allele carriage and improved performance in 8- to 11-y-old boys but not in girls, thereby counteracting existing sex differences. This may be a consequence of increased endogenous DHA synthesis in infancy but not at school-age. This trial was registered at clinicaltrials.gov as NCT01457794. © 2017 American Society for Nutrition.

  1. A genome-wide signature of positive selection in ancient and recent invasive expansions of the honey bee Apis mellifera

    PubMed Central

    Zayed, Amro; Whitfield, Charles W.

    2008-01-01

    Apis mellifera originated in Africa and extended its range into Eurasia in two or more ancient expansions. In 1956, honey bees of African origin were introduced into South America, their descendents admixing with previously introduced European bees, giving rise to the highly invasive and economically devastating “Africanized” honey bee. Here we ask whether the honey bee's out-of-Africa expansions, both ancient and recent (invasive), were associated with a genome-wide signature of positive selection, detected by contrasting genetic differentiation estimates (FST) between coding and noncoding SNPs. In native populations, SNPs in protein-coding regions had significantly higher FST estimates than those in noncoding regions, indicating adaptive evolution in the genome driven by positive selection. This signal of selection was associated with the expansion of honey bees from Africa into Western and Northern Europe, perhaps reflecting adaptation to temperate environments. We estimate that positive selection acted on a minimum of 852–1,371 genes or ≈10% of the bee's coding genome. We also detected positive selection associated with the invasion of African-derived honey bees in the New World. We found that introgression of European-derived alleles into Africanized bees was significantly greater for coding than noncoding regions. Our findings demonstrate that Africanized bees exploited the genetic diversity present from preexisting introductions in an adaptive way. Finally, we found a significant negative correlation between FST estimates and the local GC content surrounding coding SNPs, suggesting that AT-rich genes play an important role in adaptive evolution in the honey bee. PMID:18299560
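
    As a rough illustration of the coding-versus-noncoding contrast described above, the sketch below computes a simple heterozygosity-based FST for biallelic SNPs in two populations and compares the class means. The estimator (FST = (HT - HS)/HT with equal population weights), the function name, and every allele frequency are assumptions made for illustration; the abstract does not say which estimator or significance test the authors used.

```python
from statistics import mean


def fst_two_pops(p1: float, p2: float) -> float:
    """Heterozygosity-based FST for one biallelic SNP in two equally weighted populations."""
    p_bar = (p1 + p2) / 2
    h_total = 2 * p_bar * (1 - p_bar)                       # expected heterozygosity, pooled
    h_within = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2  # mean within-population value
    if h_total == 0:                                        # same allele fixed in both populations
        return 0.0
    return (h_total - h_within) / h_total


# Hypothetical allele frequencies (population A, population B); not data from the study.
coding = [(0.10, 0.70), (0.25, 0.80), (0.40, 0.90)]
noncoding = [(0.30, 0.45), (0.50, 0.55), (0.20, 0.35)]

mean_coding = mean(fst_two_pops(a, b) for a, b in coding)
mean_noncoding = mean(fst_two_pops(a, b) for a, b in noncoding)
print(f"mean FST: coding={mean_coding:.3f}, noncoding={mean_noncoding:.3f}")
```

    A real analysis would use a sample-size-corrected estimator and a formal test of the difference between SNP classes; this toy version only shows the direction of the comparison.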

  2. A genome-wide signature of positive selection in ancient and recent invasive expansions of the honey bee Apis mellifera.

    PubMed

    Zayed, Amro; Whitfield, Charles W

    2008-03-04

    Apis mellifera originated in Africa and extended its range into Eurasia in two or more ancient expansions. In 1956, honey bees of African origin were introduced into South America, their descendents admixing with previously introduced European bees, giving rise to the highly invasive and economically devastating "Africanized" honey bee. Here we ask whether the honey bee's out-of-Africa expansions, both ancient and recent (invasive), were associated with a genome-wide signature of positive selection, detected by contrasting genetic differentiation estimates (F(ST)) between coding and noncoding SNPs. In native populations, SNPs in protein-coding regions had significantly higher F(ST) estimates than those in noncoding regions, indicating adaptive evolution in the genome driven by positive selection. This signal of selection was associated with the expansion of honey bees from Africa into Western and Northern Europe, perhaps reflecting adaptation to temperate environments. We estimate that positive selection acted on a minimum of 852-1,371 genes or approximately 10% of the bee's coding genome. We also detected positive selection associated with the invasion of African-derived honey bees in the New World. We found that introgression of European-derived alleles into Africanized bees was significantly greater for coding than noncoding regions. Our findings demonstrate that Africanized bees exploited the genetic diversity present from preexisting introductions in an adaptive way. Finally, we found a significant negative correlation between F(ST) estimates and the local GC content surrounding coding SNPs, suggesting that AT-rich genes play an important role in adaptive evolution in the honey bee.

  3. Response to Antenatal Cholecalciferol Supplementation Is Associated With Common Vitamin D–Related Genetic Variants

    PubMed Central

    Moon, Rebecca J.; Harvey, Nicholas C.; D’Angelo, Stefania; Curtis, Elizabeth M.; Crozier, Sarah R.; Barton, Sheila J.; Robinson, Sian M.; Godfrey, Keith M.; Graham, Nikki J.; Holloway, John W.; Bishop, Nicholas J.; Kennedy, Stephen; Papageorghiou, Aris T.; Schoenmakers, Inez; Fraser, Robert; Gandhi, Saurabh V.; Prentice, Ann; Inskip, Hazel M.; Javaid, M. Kassim

    2017-01-01

    Context: Single-nucleotide polymorphisms (SNPs) in genes related to vitamin D metabolism have been associated with serum 25-hydroxyvitamin D [25(OH)D] concentration, but these relationships have not been examined following antenatal cholecalciferol supplementation. Objective: To determine whether SNPs in DHCR7, CYP2R1, CYP24A1, and GC are associated with the response to gestational cholecalciferol supplementation. Design: Within-randomization group analysis of the Maternal Vitamin D Osteoporosis Study trial of antenatal cholecalciferol supplementation. Setting: Hospital antenatal clinics. Participants: In total, 682 women of white ethnicity (351 placebo, 331 cholecalciferol) were included. SNPs at rs12785878 (DHCR7), rs10741657 (CYP2R1), rs6013897 (CYP24A1), and rs2282679 (GC) were genotyped. Interventions: 1000 IU/d cholecalciferol from 14 weeks of gestation until delivery. Main Outcome Measure: 25(OH)D at randomization and 34 weeks of gestation were measured in a single batch (Liaison; Diasorin, Dartford, UK). Associations between 25(OH)D and the SNPs were assessed by linear regression using an additive model [β represents the change in 25(OH)D per additional common allele]. Results: Only rs12785878 (DHCR7) was associated with baseline 25(OH)D [β = 3.1 nmol/L; 95% confidence interval (CI), 1.0 to 5.2 nmol/L; P < 0.004]. In contrast, rs10741657 (CYP2R1) (β = −5.2 nmol/L; 95% CI, −8.2 to −2.2 nmol/L; P = 0.001) and rs2282679 (GC) (β = 4.2 nmol/L; 95% CI, 0.9 to 7.5 nmol/L; P = 0.01) were associated with achieved 25(OH)D status following supplementation, whereas rs12785878 and rs6013897 (CYP24A1) were not. Conclusions: Genetic variation in DHCR7, which encodes 7-dehydrocholesterol reductase in the epidermal vitamin D biosynthesis pathway, appears to modify baseline 25(OH)D. In contrast, the response to antenatal cholecalciferol supplementation was associated with SNPs in CYP2R1, which may alter 25-hydroxylase activity, and GC, which may affect vitamin D binding protein synthesis or metabolite affinity. PMID:28575224
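
    The additive model mentioned above treats genotype as an allele count (0, 1 or 2), so β is the slope of 25(OH)D on that count. A minimal, unadjusted sketch follows; the function name and the data are hypothetical, and the published analysis may well have included covariates that are not modeled here.

```python
def additive_beta(allele_counts, outcome):
    """Least-squares slope of outcome on allele count: the change per additional allele."""
    n = len(allele_counts)
    if n != len(outcome) or n < 2:
        raise ValueError("need two equal-length sequences with at least two observations")
    mean_x = sum(allele_counts) / n
    mean_y = sum(outcome) / n
    sxx = sum((x - mean_x) ** 2 for x in allele_counts)
    if sxx == 0:
        raise ValueError("all genotypes are identical; the slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(allele_counts, outcome))
    return sxy / sxx


# Hypothetical genotypes (copies of the common allele) and 25(OH)D values in nmol/L.
genotypes = [0, 0, 1, 1, 1, 2, 2, 2]
vitamin_d = [41.0, 45.0, 46.0, 50.0, 48.0, 52.0, 55.0, 51.0]
print(f"beta = {additive_beta(genotypes, vitamin_d):.2f} nmol/L per additional allele")
```

    Coding genotypes as counts is what makes the model "additive": a heterozygote is assumed to sit halfway between the two homozygotes.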

  4. Response to Antenatal Cholecalciferol Supplementation Is Associated With Common Vitamin D-Related Genetic Variants.

    PubMed

    Moon, Rebecca J; Harvey, Nicholas C; Cooper, Cyrus; D'Angelo, Stefania; Curtis, Elizabeth M; Crozier, Sarah R; Barton, Sheila J; Robinson, Sian M; Godfrey, Keith M; Graham, Nikki J; Holloway, John W; Bishop, Nicholas J; Kennedy, Stephen; Papageorghiou, Aris T; Schoenmakers, Inez; Fraser, Robert; Gandhi, Saurabh V; Prentice, Ann; Inskip, Hazel M; Javaid, M Kassim

    2017-08-01

    Context: Single-nucleotide polymorphisms (SNPs) in genes related to vitamin D metabolism have been associated with serum 25-hydroxyvitamin D [25(OH)D] concentration, but these relationships have not been examined following antenatal cholecalciferol supplementation. Objective: To determine whether SNPs in DHCR7, CYP2R1, CYP24A1, and GC are associated with the response to gestational cholecalciferol supplementation. Design: Within-randomization group analysis of the Maternal Vitamin D Osteoporosis Study trial of antenatal cholecalciferol supplementation. Setting: Hospital antenatal clinics. Participants: In total, 682 women of white ethnicity (351 placebo, 331 cholecalciferol) were included. SNPs at rs12785878 (DHCR7), rs10741657 (CYP2R1), rs6013897 (CYP24A1), and rs2282679 (GC) were genotyped. Interventions: 1000 IU/d cholecalciferol from 14 weeks of gestation until delivery. Main Outcome Measure: 25(OH)D at randomization and 34 weeks of gestation were measured in a single batch (Liaison; Diasorin, Dartford, UK). Associations between 25(OH)D and the SNPs were assessed by linear regression using an additive model [β represents the change in 25(OH)D per additional common allele]. Results: Only rs12785878 (DHCR7) was associated with baseline 25(OH)D [β = 3.1 nmol/L; 95% confidence interval (CI), 1.0 to 5.2 nmol/L; P < 0.004]. In contrast, rs10741657 (CYP2R1) (β = -5.2 nmol/L; 95% CI, -8.2 to -2.2 nmol/L; P = 0.001) and rs2282679 (GC) (β = 4.2 nmol/L; 95% CI, 0.9 to 7.5 nmol/L; P = 0.01) were associated with achieved 25(OH)D status following supplementation, whereas rs12785878 and rs6013897 (CYP24A1) were not. Conclusions: Genetic variation in DHCR7, which encodes 7-dehydrocholesterol reductase in the epidermal vitamin D biosynthesis pathway, appears to modify baseline 25(OH)D. In contrast, the response to antenatal cholecalciferol supplementation was associated with SNPs in CYP2R1, which may alter 25-hydroxylase activity, and GC, which may affect vitamin D binding protein synthesis or metabolite affinity. Copyright © 2017 Endocrine Society

  5. Genome-wide association study of alcohol dependence

    PubMed Central

    Treutlein, Jens; Cichon, Sven; Ridinger, Monika; Wodarz, Norbert; Soyka, Michael; Zill, Peter; Maier, Wolfgang; Moessner, Rainald; Gaebel, Wolfgang; Dahmen, Norbert; Fehr, Christoph; Scherbaum, Norbert; Steffens, Michael; Ludwig, Kerstin U.; Frank, Josef; Wichmann, H.- Erich; Schreiber, Stefan; Dragano, Nico; Sommer, Wolfgang; Leonardi-Essmann, Fernando; Lourdusamy, Anbarasu; Gebicke-Haerter, Peter; Wienker, Thomas F.; Sullivan, Patrick F.; Nöthen, Markus M.; Kiefer, Falk; Spanagel, Rainer; Mann, Karl; Rietschel, Marcella

    2014-01-01

    Context: Identification of genes contributing to alcohol dependence will improve our understanding of the mechanisms underlying this disorder. Objective: To identify susceptibility genes for alcohol dependence through a genome-wide association study (GWAS) and follow-up study in a population of German male inpatients with an early age at onset. Design: The GWAS included 487 male inpatients with DSM-IV alcohol dependence with an age at onset below 28 years and 1,358 population-based control individuals. The follow-up study included 1,024 male inpatients and 996 age-matched male controls. All subjects were of German descent. The GWAS tested 524,396 single nucleotide polymorphisms (SNPs). All SNPs with p < 10⁻⁴ were subjected to the follow-up study. In addition, nominally significant SNPs from those genes that had also shown expression changes in rat brains after chronic alcohol consumption were selected for the follow-up step. Results: The GWAS produced 121 SNPs with nominal p < 10⁻⁴. These, together with 19 additional SNPs from homologs of rat genes showing differential expression, were genotyped in the follow-up sample. Fifteen SNPs showed significant association with the same allele as in the GWAS. In the combined analysis, two closely linked intergenic SNPs met genome-wide significance (rs7590720 p = 9.72×10⁻⁹; rs1344694 p = 1.69×10⁻⁸). They are located on chromosome 2q35, a region which has been implicated in linkage studies for alcohol phenotypes. Nine SNPs were located in genes, including the CDH13 and ADH1C genes, which have been reported to be associated with alcohol dependence. Conclusion: This is the first GWAS and follow-up study to identify a genome-wide significant association in alcohol dependence. Further independent studies are required to confirm these findings. PMID:19581569
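
    The two-stage filtering described above can be sketched as two threshold checks. The follow-up cut-off of 10⁻⁴ and the two combined p-values come from the abstract; the genome-wide cut-off of 5×10⁻⁸ is only the common convention and is an assumption here, since the abstract does not state which threshold the authors applied. Function and constant names are hypothetical.

```python
FOLLOW_UP_THRESHOLD = 1e-4     # stated in the abstract
GENOME_WIDE_THRESHOLD = 5e-8   # conventional value; an assumption, not stated in the abstract
N_SNPS_TESTED = 524_396        # stated in the abstract


def selected_for_follow_up(p_value: float) -> bool:
    """True if a discovery-stage p-value passes the follow-up cut-off."""
    return p_value < FOLLOW_UP_THRESHOLD


def genome_wide_significant(p_value: float) -> bool:
    """True if a combined-analysis p-value passes the assumed genome-wide cut-off."""
    return p_value < GENOME_WIDE_THRESHOLD


# Combined-analysis p-values reported in the abstract.
combined_p = {"rs7590720": 9.72e-9, "rs1344694": 1.69e-8}

# A Bonferroni cut-off for the number of SNPs tested, shown for comparison only.
bonferroni_cutoff = 0.05 / N_SNPS_TESTED

for rsid, p in combined_p.items():
    print(rsid, genome_wide_significant(p), p < bonferroni_cutoff)
```

    Both reported SNPs pass either cut-off, so the choice between the conventional threshold and a study-specific Bonferroni threshold does not change the conclusion in this case.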

  6. Transferability of genome-wide associated loci for asthma in African Americans.

    PubMed

    Faruque, Mezbah U; Chen, Guanjie; Doumatey, Ayo P; Zhou, Jie; Huang, Hanxia; Shriner, Daniel; Adeyemo, Adebowale A; Rotimi, Charles N; Dunston, Georgia M

    2017-01-02

    Transferability of significantly associated loci or GWAS "hits" adds credibility to genotype-disease associations and provides evidence for generalizability across different ancestral populations. We sought evidence of association of known asthma-associated single nucleotide polymorphisms (SNPs) in an African American population. Subjects comprised 661 participants (261 asthma cases and 400 controls) from the Howard University Family Study. Forty-eight SNPs previously reported to be associated with asthma by GWAS were selected for testing. We used a combined strategy, first applying an "exact" approach in which we looked up only the reported index SNP. For those index SNPs missing from our dataset, we used a "local" approach that examined all the regional SNPs in LD with the index SNP. Out of the 48 SNPs, our cohort had genotype data available for 27, which were examined for exact replication. Of these, two SNPs were found positively associated with asthma: rs10508372 (OR = 1.567 [95%CI, 1.133-2.167], P = 0.0066) and rs2378383 (OR = 2.147 [95%CI, 1.149-4.013], P = 0.0166), located on chromosomal bands 10p14 and 9q21.31, respectively. Local replication of the remaining 21 loci showed association at two chromosomal loci (9p24.1-rs2381413 and 6p21.32-rs3132947; Bonferroni-corrected P values: 0.0033 and 0.0197, respectively). Of note, multiple SNPs in LD with rs2381413 located upstream of IL33 were significantly associated with asthma. This study has successfully transferred four reported asthma-associated loci in an independent African American population. Identification of several asthma-associated SNPs upstream of IL33, a gene previously implicated in allergic inflammation of the asthmatic airway, supports the generalizability of this finding.
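
    Odds ratios with 95% confidence intervals of the kind reported above can be approximated from a 2x2 table of allele counts. The sketch below uses the Wald (Woolf) interval on the log odds ratio and a plain Bonferroni adjustment; both are assumptions about method, the function names are hypothetical, and the counts are invented (they only sum to the 522 case alleles and 800 control alleles implied by 261 cases and 400 controls).

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a, b: case alleles with / without the risk allele
    c, d: control alleles with / without the risk allele
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cells must be positive for this approximation")
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


def bonferroni(p_value: float, n_tests: int) -> float:
    """Bonferroni-adjusted p-value, capped at 1."""
    return min(1.0, p_value * n_tests)


# Hypothetical allele counts; not taken from the study.
or_, lo, hi = odds_ratio_ci(120, 402, 130, 670)
print(f"OR = {or_:.3f} [95% CI, {lo:.3f}-{hi:.3f}]")
```

    An interval that excludes 1 corresponds to a nominally significant association at the 5% level, which is why the reported intervals above are informative alongside the P values.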

  7. The Development of Quality Control Genotyping Approaches: A Case Study Using Elite Maize Lines.

    PubMed

    Chen, Jiafa; Zavala, Cristian; Ortega, Noemi; Petroli, Cesar; Franco, Jorge; Burgueño, Juan; Costich, Denise E; Hearne, Sarah J

    2016-01-01

    Quality control (QC) of germplasm identity and purity is a critical component of breeding and conservation activities. SNP genotyping technologies and increased availability of markers provide the opportunity to employ genotyping as a low-cost and robust component of this QC. In the public sector, available low-cost SNP QC genotyping methods have been developed from a very limited panel of 1,000 to 1,500 markers without broad selection of the most informative SNPs. Selection of optimal SNPs and definition of appropriate germplasm sampling, in addition to platform selection, impact on logistical and resource-use considerations for breeding and conservation applications when mainstreaming QC. In order to address these issues, we evaluated the selection and use of SNPs for QC applications from large DArTSeq data sets generated from CIMMYT maize inbred lines (CMLs). Two QC genotyping strategies were developed: the first is a "rapid QC", employing a small number of SNPs to identify potential mislabeling of seed packages or plots; the second is a "broad QC", employing a larger number of SNPs, used to identify each germplasm entry and to measure heterogeneity. The optimal marker selection strategies combined the selection of markers with high minor allele frequency, sampling of clustered SNPs in proportion to marker cluster distance, and selecting markers that maintain a uniform genomic distribution. The rapid and broad QC SNP panels selected using this approach were further validated using blind test assessments of related re-generation samples. The influence of sampling within each line was evaluated. Sampling 192 individuals would result in close to a 100% probability of detecting a 5% contamination in the entry, and approximately a 98% probability of detecting a 2% contamination of the line. These results provide a framework for the establishment of QC genotyping. A comparison of financial and time costs for use of these approaches across different platforms is discussed, providing a framework for institutions involved in maize conservation and breeding to assess the resource use effectiveness of QC genotyping. Application of these research findings, in combination with existing QC approaches, will ensure the regeneration, distribution and use in breeding of true to type inbred germplasm. These findings also provide an effective approach to optimize SNP selection for QC genotyping in other species.
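
    The sampling figures quoted above are consistent with a simple binomial model, which is an assumption here and not necessarily the authors' calculation: if each sampled individual is independently a contaminant with probability c, the chance of seeing at least one contaminant among n individuals is 1 - (1 - c)^n. The function name is hypothetical.

```python
def detection_probability(n_sampled: int, contamination: float) -> float:
    """P(at least one contaminant among n_sampled), assuming independent draws."""
    if n_sampled < 0:
        raise ValueError("n_sampled must be non-negative")
    if not 0.0 <= contamination <= 1.0:
        raise ValueError("contamination must be a proportion between 0 and 1")
    return 1.0 - (1.0 - contamination) ** n_sampled


for rate in (0.05, 0.02):
    prob = detection_probability(192, rate)
    print(f"{rate:.0%} contamination, n = 192: detection probability {prob:.4f}")
```

    Under this model, 192 individuals give a detection probability above 0.9999 for 5% contamination and roughly 0.979 for 2% contamination, in line with the "close to 100%" and "approximately 98%" figures in the abstract.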

  8. Genomic Trajectories to Desiccation Resistance: Convergence and Divergence Among Replicate Selected Drosophila Lines

    PubMed Central

    Griffin, Philippa C.; Hangartner, Sandra B.; Fournier-Level, Alexandre; Hoffmann, Ary A.

    2017-01-01

    Adaptation to environmental stress is critical for long-term species persistence. With climate change and other anthropogenic stressors compounding natural selective pressures, understanding the nature of adaptation is as important as ever in evolutionary biology. In particular, the number of alternative molecular trajectories available for an organism to reach the same adaptive phenotype remains poorly understood. Here, we investigate this issue in a set of replicated Drosophila melanogaster lines selected for increased desiccation resistance—a classical physiological trait that has been closely linked to Drosophila species distributions. We used pooled whole-genome sequencing (Pool-Seq) to compare the genetic basis of their selection responses, using a matching set of replicated control lines for characterizing laboratory (lab-)adaptation, as well as the original base population. The ratio of effective population size to census size was high over the 21 generations of the experiment at 0.52–0.88 for all selected and control lines. While selected SNPs in replicates of the same treatment (desiccation-selection or lab-adaptation) tended to change frequency in the same direction, suggesting some commonality in the selection response, candidate SNP and gene lists often differed among replicates. Three of the five desiccation-selection replicates showed significant overlap at the gene and network level. All five replicates showed enrichment for ovary-expressed genes, suggesting maternal effects on the selected trait. Divergence between pairs of replicate lines for desiccation-candidate SNPs was greater than between pairs of control lines. This difference also far exceeded the divergence between pairs of replicate lines for neutral SNPs. 
Overall, while there was overlap in the direction of allele frequency changes and the network and functional categories affected by desiccation selection, replicates showed unique responses at all levels, likely reflecting hitchhiking effects, and highlighting the challenges in identifying candidate genes from these types of experiments when traits are likely to be polygenic. PMID:28007884

  9. Diversifying Selection Between Pure-Breed and Free-Breeding Dogs Inferred from Genome-Wide SNP Analysis.

    PubMed

    Pilot, Małgorzata; Malewski, Tadeusz; Moura, Andre E; Grzybowski, Tomasz; Oleński, Kamil; Kamiński, Stanisław; Fadel, Fernanda Ruiz; Alagaili, Abdulaziz N; Mohammed, Osama B; Bogdanowicz, Wiesław

    2016-08-09

    Domesticated species are often composed of distinct populations differing in the character and strength of artificial and natural selection pressures, providing a valuable model to study adaptation. In contrast to pure-breed dogs that constitute artificially maintained inbred lines, free-ranging dogs are typically free-breeding, i.e., unrestrained in mate choice. Many traits in free-breeding dogs (FBDs) may be under similar natural and sexual selection conditions to wild canids, while relaxation of sexual selection is expected in pure-breed dogs. We used a Bayesian approach with strict false-positive control criteria to identify FST-outlier SNPs between FBDs and either European or East Asian breeds, based on 167,989 autosomal SNPs. By identifying outlier SNPs located within coding genes, we found four candidate genes under diversifying selection shared by these two comparisons. Three of them are associated with the Hedgehog (HH) signaling pathway regulating vertebrate morphogenesis. A comparison between FBDs and East Asian breeds also revealed diversifying selection on the BBS6 gene, which was earlier shown to cause snout shortening and dental crowding via disrupted HH signaling. Our results suggest that relaxation of natural and sexual selection in pure-breed dogs as opposed to FBDs could have led to mild changes in regulation of the HH signaling pathway. HH inhibits adhesion and the migration of neural crest cells from the neural tube, and minor deficits of these cells during embryonic development have been proposed as the underlying cause of "domestication syndrome." This suggests that the process of breed formation involved the same genetic and developmental pathways as the process of domestication. Copyright © 2016 Pilot et al.

  10. Diversifying Selection Between Pure-Breed and Free-Breeding Dogs Inferred from Genome-Wide SNP Analysis

    PubMed Central

    Pilot, Małgorzata; Malewski, Tadeusz; Moura, Andre E.; Grzybowski, Tomasz; Oleński, Kamil; Kamiński, Stanisław; Fadel, Fernanda Ruiz; Alagaili, Abdulaziz N.; Mohammed, Osama B.; Bogdanowicz, Wiesław

    2016-01-01

    Domesticated species are often composed of distinct populations differing in the character and strength of artificial and natural selection pressures, providing a valuable model to study adaptation. In contrast to pure-breed dogs that constitute artificially maintained inbred lines, free-ranging dogs are typically free-breeding, i.e., unrestrained in mate choice. Many traits in free-breeding dogs (FBDs) may be under similar natural and sexual selection conditions to wild canids, while relaxation of sexual selection is expected in pure-breed dogs. We used a Bayesian approach with strict false-positive control criteria to identify FST-outlier SNPs between FBDs and either European or East Asian breeds, based on 167,989 autosomal SNPs. By identifying outlier SNPs located within coding genes, we found four candidate genes under diversifying selection shared by these two comparisons. Three of them are associated with the Hedgehog (HH) signaling pathway regulating vertebrate morphogenesis. A comparison between FBDs and East Asian breeds also revealed diversifying selection on the BBS6 gene, which was earlier shown to cause snout shortening and dental crowding via disrupted HH signaling. Our results suggest that relaxation of natural and sexual selection in pure-breed dogs as opposed to FBDs could have led to mild changes in regulation of the HH signaling pathway. HH inhibits adhesion and the migration of neural crest cells from the neural tube, and minor deficits of these cells during embryonic development have been proposed as the underlying cause of “domestication syndrome.” This suggests that the process of breed formation involved the same genetic and developmental pathways as the process of domestication. PMID:27233669

  11. Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

    PubMed Central

    2011-01-01

    Background: Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, and marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results: We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequence Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map made it possible to align the 12 linkage groups of both species. Conclusions: Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing efforts using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers. PMID:21767361

  12. [Association of single nucleotide polymorphisms of susceptibility genes of type 2 diabetes mellitus with liability to gout among ethnic Han Chinese males from coastal region of Shandong].

    PubMed

    Han, Lin; Xin, Ruosai; Sun, Jian; Hou, Feng; Li, Changgui; Hu, Xinlin; Liu, Zhen; Wang, Yao; Li, Xinde; Ren, Wei; Wang, Xuefeng; Jia, Zhaotong

    2015-10-01

    OBJECTIVE: To assess the association of single nucleotide polymorphisms (SNPs) of susceptibility genes of type 2 diabetes mellitus (T2DM) with liability to gout among ethnic Han Chinese males from the coastal region of Shandong province. METHODS: Seven SNPs within the susceptibility genes of T2DM, including rs10773971 (G/C) and rs4766398 (G/C) of the WNT5B gene, rs10225163 (G/C) of the JAZF1 gene, rs2069590 (T/A) of the BDKRB2 gene, rs5745709 (G/A) of the HGF gene, rs1991914 (C/A) of the OTOP1 gene and rs2236479 (G/A) of the COL18A1 gene, were typed with a custom-made Illumina GoldenGate Genotyping assay in 480 male patients with gout and 480 male controls. Potential association was assessed with the chi-square test. RESULTS: No significant difference was detected for the 7 selected SNPs in terms of genotypic and allelic frequencies (P > 0.05). When age and body mass index (BMI) were adjusted for, the 7 genetic variants still showed no significant association with gout. CONCLUSION: The genotypes of the 7 selected SNPs are not associated with gout in ethnic Han Chinese male patients from the coastal region of Shandong province. However, the results need to be replicated in larger sets of patients collected from other regions and populations.
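
    A chi-square comparison of allele frequencies between cases and controls, as named in the methods above, reduces to a 2x2 test with one degree of freedom. The sketch below is a minimal version without continuity correction; the function name is hypothetical and the counts are invented (they only respect the 960 alleles per group implied by 480 cases and 480 controls).

```python
import math


def chi_square_2x2(a: int, b: int, c: int, d: int):
    """Pearson chi-square (1 df, no continuity correction) and p-value for a 2x2 table.

    a, b: allele 1 / allele 2 counts in cases
    c, d: allele 1 / allele 2 counts in controls
    """
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column needs at least one observation")
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    p_value = math.erfc(math.sqrt(chi2 / 2))  # upper tail of chi-square with 1 df
    return chi2, p_value


# Hypothetical allele counts; not taken from the study.
chi2, p = chi_square_2x2(410, 550, 395, 565)
print(f"chi-square = {chi2:.3f}, P = {p:.3f}")
```

    With seven SNPs tested, a multiple-testing adjustment would normally be considered, although it cannot change a result in which every unadjusted P value already exceeds 0.05.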

  13. Population structure of pigs determined by single nucleotide polymorphisms observed in assembled expressed sequence tags.

    PubMed

    Matsumoto, Toshimi; Okumura, Naohiko; Uenishi, Hirohide; Hayashi, Takeshi; Hamasima, Noriyuki; Awata, Takashi

    2012-01-01

    We have collected more than 190000 porcine expressed sequence tags (ESTs) from full-length complementary DNA (cDNA) libraries and identified more than 2800 single nucleotide polymorphisms (SNPs). In this study, we tentatively chose 222 SNPs observed in assembled ESTs to study pigs of different breeds; 104 were selected by comparing the cDNA sequences of a Meishan pig and samples of three-way cross pigs (Landrace, Large White, and Duroc: LWD), and 118 were selected from LWD samples. To evaluate the genetic variation between the chosen SNPs from pig breeds, we determined the genotypes for 192 pig samples (11 pig groups) from our DNA reference panel with matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Of the 222 reference SNPs, 186 were successfully genotyped. A neighbor-joining tree showed that the pig groups were classified into two large clusters, namely, Euro-American and East Asian pig populations. F-statistics and the analysis of molecular variance of Euro-American pig groups revealed that approximately 25% of the genetic variations occurred because of intergroup differences. As the F(IS) values were less than the F(ST) values, the clustering, based on Bayesian inference, implied that there was strong genetic differentiation among pig groups and less divergence within the groups in our samples. © 2011 The Authors. Animal Science Journal © 2011 Japanese Society of Animal Science.

  14. Physiological Study on Association between Nicotinamide N-Methyltransferase Gene Polymorphisms and Hyperlipidemia

    PubMed Central

    Zhu, Xiao-Juan; Lin, Ya-Jun; Chen, Wei; Wang, Ya-Hui; Qiu, Li-Qiang; Cai, Can-Xin; Xiong, Qun; Chen, Fei; Chen, Li-Hui; Zhou, Qiong

    2016-01-01

    Nicotinamide N-methyltransferase (NNMT) catalyzes the methylation of nicotinamide. Our previous works indicate that NNMT is involved in the body mass index and energy metabolism, and recently the association between a SNP (rs694539) of NNMT and a variety of cardiovascular diseases was reported. At present, more than 200 NNMT single nucleotide polymorphisms (SNPs) have been identified in the databases of the human genome projects; however, the association between rs694539 variation and hyperlipidemia has not been reported yet, and whether there are any SNPs in NNMT significantly associated with hyperlipidemia is still unclear. In this paper, we selected 19 SNPs in NNMT as the tagSNPs using Haploview software (Haploview 4.2) first and then performed a case-control study to observe the association between these tagSNPs and hyperlipidemia and finally applied physiological approaches to explore the possible mechanisms through which the NNMT polymorphism induces hyperlipidemia. The results show that a SNP (rs1941404) in NNMT is significantly associated with hyperlipidemia, and the influence of rs1941404 variation on the resting energy expenditure may be the possible mechanism for rs1941404 variation to induce hyperlipidemia. PMID:27999813

  15. Significant association of APOA5 and APOC3 gene polymorphisms with meat quality traits in Kele pigs.

    PubMed

    Hui, Y T; Yang, Y Q; Liu, R Y; Zhang, Y Y; Xiang, C J; Liu, Z Z; Ding, Y H; Zhang, Y L; Wang, B R

    2013-09-13

    Apolipoprotein A5 (APOA5) and C3 (APOC3) genes are involved in the PPAR lipid metabolism pathway and thus associated with elevated triglyceride levels. However, whether APOA5 and APOC3 genetic polymorphisms affect intramuscular fat deposition and other meat quality traits remains unknown in pigs. One hundred and seventy-one Kele pigs were sampled to investigate genetic variants in the APOA5 and APOC3 genes and their association with seven pork quality traits. We identified 5 single nucleotide polymorphisms (SNPs) in the promoter region of the APOA5 gene and 17 SNPs in the APOC3 gene. Linkage disequilibrium analysis revealed 5 complete linkage disequilibria among these 22 SNPs. We found that 10 SNPs were significantly correlated with meat quality traits, including the mutation A5/-769 in the APOA5 gene, which was significantly associated with cooked weight percentage, and 9 SNPs in the APOC3 gene that were significantly associated with drip loss rate, meat color value of longissimus dorsi muscle and shear force. Therefore, these SNP markers will be useful for marker-assisted selection for improved pork quality.

  16. SNP2TFBS - a database of regulatory SNPs affecting predicted transcription factor binding site affinity.

    PubMed

    Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp

    2017-01-04

    SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
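
    The abstract above says a SNP's effect on TF binding is estimated from a position weight matrix (PWM). The following is a minimal sketch of that general idea, not the SNP2TFBS pipeline itself: the 3-position matrix, its scores, and the function names are all hypothetical, only the forward strand is scanned, and the sequence is assumed to be at least as long as the PWM.

```python
# Hypothetical 3-position PWM of log-odds-style scores (consensus "ACG").
# Real PWMs come from a motif collection; these numbers are made up.
PWM = [
    {"A": 2, "C": -1, "G": -1, "T": -1},
    {"A": -1, "C": 2, "G": -1, "T": -1},
    {"A": -1, "C": -1, "G": 2, "T": -1},
]


def pwm_score(site):
    """Sum the per-position scores of a site exactly as long as the PWM."""
    return sum(PWM[i][base] for i, base in enumerate(site))


def best_overlapping_score(seq, pos):
    """Best PWM score among forward-strand windows that cover index pos."""
    width = len(PWM)
    first = max(0, pos - width + 1)
    last = min(pos, len(seq) - width)
    return max(pwm_score(seq[s:s + width]) for s in range(first, last + 1))


def snp_effect(ref_seq, pos, alt_base):
    """Return (best reference score, best alternate score, difference)."""
    alt_seq = ref_seq[:pos] + alt_base + ref_seq[pos + 1:]
    ref_best = best_overlapping_score(ref_seq, pos)
    alt_best = best_overlapping_score(alt_seq, pos)
    return ref_best, alt_best, alt_best - ref_best
```

    With this toy matrix, snp_effect("TTACGTT", 3, "T") returns (6, 3, -3): the alternate allele weakens the best site covering the SNP. Deciding whether a site is abolished or created would additionally need a score threshold, which this sketch leaves out.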

  17. Nucleotide polymorphisms in a pine ortholog of the Arabidopsis degrading enzyme cellulase KORRIGAN are associated with early growth performance in Pinus pinaster.

    PubMed

    Cabezas, José Antonio; González-Martínez, Santiago C; Collada, Carmen; Guevara, María Angeles; Boury, Christophe; de María, Nuria; Eveno, Emmanuelle; Aranda, Ismael; Garnier-Géré, Pauline H; Brach, Jean; Alía, Ricardo; Plomion, Christophe; Cervera, María Teresa

    2015-09-01

    We have carried out a candidate-gene-based association genetic study in Pinus pinaster Aiton and evaluated the predictive performance for genetic merit gain of the most significantly associated genes and single nucleotide polymorphisms (SNPs). We used a second generation 384-SNP array enriched with candidate genes for growth and wood properties to genotype mother trees collected in 20 natural populations covering most of the European distribution of the species. Phenotypic data for total height, polycyclism, root-collar diameter and biomass were obtained from a replicated provenance-progeny trial located in two sites with contrasting environments (Atlantic vs Mediterranean climate). General linear models identified strong associations between growth traits (total height and polycyclism) and four SNPs from the korrigan candidate gene, after multiple testing corrections using false discovery rate. The combined genomic breeding value predictions assessed for the four associated korrigan SNPs by ridge regression-best linear unbiased prediction (RR-BLUP) and cross-validation accounted for up to 8 and 15% of the phenotypic variance for height and polycyclic growth, respectively, and did not improve adding SNPs from other growth-related candidate genes. For root-collar diameter and total biomass, they accounted for 1.6 and 1.1% of the phenotypic variance, respectively, but increased to 15 and 4.1% when other SNPs from lp3.1, lp3.3 and cad were included in RR-BLUP models. These results point towards a desirable integration of candidate-gene studies as a means to pre-select relevant markers, and aid genomic selection in maritime pine breeding programs. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Song, Qijian; Jia, Gaofeng; Hyten, David L.

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.

  19. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean

    DOE PAGES

    Song, Qijian; Jia, Gaofeng; Hyten, David L.; ...

    2015-08-28

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad.

  20. Evaluation of a SNP map of 6q24-27 confirms diabetic nephropathy loci and identifies novel associations in type 2 diabetes patients enriched with nephropathy from an African American population

    PubMed Central

    Leak, Tennille S.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Keene, Keith L.; Gordon, Candace J.; Hicks, Pamela J.; Freedman, Barry I.; Bowden, Donald W.; Sale, Michèle M.

    2009-01-01

    Previously we performed a genome scan for type 2 diabetes (T2DM) using 638 African-American (AA) affected sibling pairs from 247 families; non-parametric linkage analysis suggested evidence of linkage at 6q24-27 (LOD 2.26). To comprehensively evaluate this region we performed a 2-stage association study by first constructing a SNP map of 754 SNPs selected from HapMap on the basis of linkage disequilibrium (LD) in 300 AA T2DM-ESRD subjects, 311 AA controls, 43 European American controls and 45 Yoruba Nigerian samples (Set 1). Replication analyses were conducted in an independent population of 283 AA T2DM-ESRD subjects and 282 AA controls (Set 2). In addition, we adjusted for the impact of admixture on association results by using ancestry informative markers (AIMs). In Stage 1, 137 (18.2%) SNPs showed nominal evidence of association (P<0.05) in one or more tests of association: allelic (n=33), dominant (n=36), additive (n=29), or recessive (n=34) genotypic models, and 2- (n=47) and 3-SNP (n=43) haplotypic analyses. These SNPs were selected for follow-up genotyping. Stage 2 analyses confirmed association with a predicted 2-SNP “risk” haplotype in the PARK2 gene. Also, two intergenic SNPs showed consistent genotypic association with T2DM-ESRD: rs12197043 and rs4897081. Combined analysis of all subjects from both stages revealed nominal associations with 17 SNPs within genes, including suggestive associations in ESR1 and PARK2. This study confirms known diabetic nephropathy loci and identifies potentially novel susceptibility variants located within 6q24-27 in AA. PMID:18560894

  1. SNP Assay Development for Linkage Map Construction, Anchoring Whole-Genome Sequence, and Other Genetic and Genomic Applications in Common Bean.

    PubMed

    Song, Qijian; Jia, Gaofeng; Hyten, David L; Jenkins, Jerry; Hwang, Eun-Young; Schroeder, Steven G; Osorno, Juan M; Schmutz, Jeremy; Jackson, Scott A; McClean, Phillip E; Cregan, Perry B

    2015-08-28

    A total of 992,682 single-nucleotide polymorphisms (SNPs) was identified as ideal for Illumina Infinium II BeadChip design after sequencing a diverse set of 17 common bean (Phaseolus vulgaris L) varieties with the aid of next-generation sequencing technology. From these, two BeadChips each with >5000 SNPs were designed. The BARCBean6K_1 BeadChip was selected for the purpose of optimizing polymorphism among market classes and, when possible, SNPs were targeted to sequence scaffolds in the Phaseolus vulgaris 14× genome assembly with sequence lengths >10 kb. The BARCBean6K_2 BeadChip was designed with the objective of anchoring additional scaffolds and to facilitate orientation of large scaffolds. Analysis of 267 F2 plants from a cross of varieties Stampede × Red Hawk with the two BeadChips resulted in linkage maps with a total of 7040 markers including 7015 SNPs. With the linkage map, a total of 432.3 Mb of sequence from 2766 scaffolds was anchored to create the Phaseolus vulgaris v1.0 assembly, which accounted for approximately 89% of the 487 Mb of available sequence scaffolds of the Phaseolus vulgaris v0.9 assembly. A core set of 6000 SNPs (BARCBean6K_3 BeadChip) with high genotyping quality and polymorphism was selected based on the genotyping of 365 dry bean and 134 snap bean accessions with the BARCBean6K_1 and BARCBean6K_2 BeadChips. The BARCBean6K_3 BeadChip is a useful tool for genetics and genomics research and it is widely used by breeders and geneticists in the United States and abroad. Copyright © 2015 Song et al.

  2. A global perspective on hepatitis B‐related single nucleotide polymorphisms and evolution during human migration

    PubMed Central

    Jeng, Wen‐Juei; Lin, Chun‐Yen

    2017-01-01

    Genome‐wide association studies have indicated that human leukocyte antigen (HLA)‐DP and HLA‐DQ play roles in persistent hepatitis B virus (HBV) infection in Asia. To understand the evolution of HBV‐related single nucleotide polymorphisms (SNPs) and to correlate these SNPs with chronic HBV infection among different populations, we conducted a global perspective study on hepatitis‐related SNPs. We selected 12 HBV‐related SNPs on the HLA locus and two HBV and three hepatitis C virus immune‐related SNPs for analysis. Five nasopharyngeal carcinoma‐related SNPs served as controls. All SNP data worldwide from 26 populations were downloaded from 1,000 genomes. We found a dramatic difference in the allele frequency in most of the HBV‐ and HLA‐related SNPs in East Asia compared to the other continents. A sharp change in allele frequency in 8 of 12 SNPs was found between Bengali populations in Bangladesh and Chinese Dai populations in Xishuangbanna, China (P < 0.001); these areas represent the junction of South and East Asia. For the immune‐related SNPs, significant changes were found after leaving Africa. Most of these genes shifted from higher expression genotypes in Africa to lower expression genotypes in either Europe or South Asia (P < 0.001). During this two‐stage adaptation, immunity adjusted toward a weak immune response, which could have been a survival strategy during human migration to East Asia. The prevalence of chronic HBV infection in Africa is as high as in Asia; however, the HBV‐related SNP genotypes are not present in Africa, and so the genetic mechanism of chronic HBV infection in Africa needs further exploration. Conclusion: Two stages of genetic changes toward a weak immune response occurred when humans migrated out of Africa. These changes could be a survival strategy for avoiding cytokine storms and surviving in new environments. (Hepatology Communications 2017;1:1005–1013) PMID:29404438

  3. Study of five novel non-synonymous polymorphisms in human brain-expressed genes in a Colombian sample.

    PubMed

    Ojeda, Diego A; Forero, Diego A

    2014-10-01

    Non-synonymous single nucleotide polymorphisms (nsSNPs) in brain-expressed genes represent interesting candidates for genetic research in neuropsychiatric disorders. The aim of this work was to study novel nsSNPs in brain-expressed genes in a sample of Colombian subjects. We applied an approach based on in silico mining of available genomic data to identify and select novel nsSNPs in brain-expressed genes. We developed novel genotyping assays, based on allele-specific PCR methods, for these nsSNPs and genotyped them in 171 Colombian subjects. Five common nsSNPs (rs6855837; p.Leu395Ile, rs2305160; p.Thr394Ala, rs10503929; p.Met289Thr, rs2270641; p.Thr4Pro and rs3822659; p.Ser735Ala) were studied, located in the CLOCK, NPAS2, NRG1, SLC18A1 and WWC1 genes. We reported allele and genotype frequencies in a sample of South American healthy subjects. There is previous experimental evidence, arising from genome-wide expression and association studies, for the involvement of these genes in several neuropsychiatric disorders and endophenotypes, such as schizophrenia, mood disorders or memory performance. Frequencies for these nsSNPs in the Colombian samples varied in comparison to different HapMap populations. Future study of these nsSNPs in brain-expressed genes, a synaptogenomics approach, will be important for a better understanding of neuropsychiatric diseases and endophenotypes in different populations.

  4. Effect of polymorphisms in the CSN3 (κ-casein) gene on milk production traits in Chinese Holstein Cattle.

    PubMed

    Alim, M A; Dong, T; Xie, Y; Wu, X P; Zhang, Yi; Zhang, Shengli; Sun, D X

    2014-11-01

    This study was designed to evaluate significant associations between single nucleotide polymorphisms (SNPs) and milk composition and milk production traits in Chinese Holstein cows. Six SNPs were identified in the κ-casein gene using pooled DNA sequencing. The identified SNPs were genotyped by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) in 507 individuals. Of the six, we identified three non-synonymous SNPs (g.10888T>C, g.10924C>A and g.10944A>G) that changed the protein product. The SIFT (Sorting Intolerant From Tolerant) prediction score (0.01) indicated that the isoleucine-to-threonine change (g.10888T>C) is predicted to affect the phenotype. Significant associations were found between the identified SNPs and three yield traits (milk, protein and fat) and two composition traits (fat and protein percentages), although the association with fat percentage did not reach significance in the haplotype analysis. Importantly, the significant SNPs in our results explained a large proportion of the phenotypic variation of milk protein yield and concentration. Our results suggest that CSN3 is an important candidate gene that influences milk production traits, and the identified polymorphisms and haplotypes could be used as genetic markers in programs of marker-assisted selection for the genetic improvement of milk production traits in dairy cattle.

  5. A TNF region haplotype offers protection from typhoid fever in Vietnamese patients

    PubMed Central

    2009-01-01

    The genomic region surrounding the TNF locus on human chromosome 6 has previously been associated with typhoid fever in Vietnam. We used a haplotypic approach to understand this association further. Eighty single nucleotide polymorphisms (SNPs) spanning a 150 kb region were genotyped in 95 Vietnamese individuals (typhoid case/mother/father trios). A subset of data from 33 SNPs with a minor allele frequency of >4.3% was used to construct haplotypes. Fifteen SNPs, which tagged the 42 constructed haplotypes were selected. The haplotype tagging SNPs (T1-T15) were genotyped in 380 confirmed typhoid cases and 380 Vietnamese ethnically matched controls. Allelic frequencies of seven SNPs (T1, T2, T3, T5, T6, T7, T8) were significantly different between typhoid cases and controls. Logistic regression results support the hypothesis that there is just one signal associated with disease at this locus. Haplotype-based analysis of the tag SNPs provided positive evidence of association with typhoid (posterior probability 0.821). The analysis highlighted a low-risk cluster of haplotypes that each carry the minor allele of T1 or T7, but not both, and otherwise carry the combination of alleles *12122*1111 at T1-T11, further supporting the one associated signal hypothesis. Finally, individuals that carry the typhoid fever protective haplotype *12122*1111 also produce a relatively low TNF-α response to LPS. PMID:17503085

  6. A population genomic scan in Chorthippus grasshoppers unveils previously unknown phenotypic divergence.

    PubMed

    Berdan, Emma L; Mazzoni, Camila J; Waurick, Isabelle; Roehr, Johannes T; Mayer, Frieder

    2015-08-01

    Understanding the genetics of speciation and the processes that drive it is a central goal of evolutionary biology. Grasshoppers of the Chorthippus species group differ strongly in calling song (and corresponding female preferences) but are exceedingly similar in other characteristics such as morphology. Here, we performed a population genomic scan on three Chorthippus species (Chorthippus biguttulus, C. mollis and C. brunneus) to gain insight into the genes and processes involved in divergence and speciation in this group. Using an RNA-seq approach, we examined functional variation between the species by calling SNPs for each of the three species pairs and using FST-based approaches to identify outliers. We found approximately 1% of SNPs in each comparison to be outliers. Between 37% and 40% of these outliers were nonsynonymous SNPs (as opposed to a global level of 17%) indicating that we recovered loci under selection. Among the outliers were several genes that may be involved in song production and hearing as well as genes involved in other traits such as food preferences and metabolism. Differences in food preferences between species were confirmed with a behavioural experiment. This indicates that multiple phenotypic differences implicating multiple evolutionary processes (sexual selection and natural selection) are present between the species. © 2015 John Wiley & Sons Ltd.

  7. Larva-mediated chalkbrood resistance-associated single nucleotide polymorphism markers in the honey bee Apis mellifera.

    PubMed

    Liu, Y; Yan, L; Li, Z; Huang, W-F; Pokhrel, S; Liu, X; Su, S

    2016-06-01

    Chalkbrood is a disease affecting honey bees that seriously impairs brood growth and productivity of diseased colonies. Although honey bees can develop chalkbrood resistance naturally, the details underlying the mechanisms of resistance are not fully understood, and no easy method is currently available for selecting and breeding resistant bees. Finding the genes involved in the development of resistance and identifying single nucleotide polymorphisms (SNPs) that can be used as molecular markers of resistance is therefore a high priority. We conducted genome resequencing to compare resistant (Res) and susceptible (Sus) larvae that were selected following in vitro chalkbrood inoculation. Twelve genomic libraries, including 14.4 Gb of sequence data, were analysed using SNP-finding algorithms. Unique SNPs derived from chromosomes 2 and 11 were analysed in this study. SNPs from resistant individuals were confirmed by PCR and Sanger sequencing using in vitro reared larvae and resistant colonies. We found strong support for an association between the C allele at SNP C2587245T and chalkbrood resistance. SNP C2587245T may be useful as a genetic marker for the selection of chalkbrood resistance and high royal jelly production honey bee lines, thereby helping to minimize the negative effects of chalkbrood on managed honey bees. © 2016 The Royal Entomological Society.

  8. Whole-Genome Sequencing of Theileria parva Strains Provides Insight into Parasite Migration and Diversification in the African Continent

    PubMed Central

    Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

    2013-01-01

    The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814–121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent. PMID:23404454
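
    The dN/dS definition quoted in the abstract above (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) can be written out as a short calculation. This is a minimal sketch under stated assumptions: the function and argument names are hypothetical, the substitution and site counts are taken as already known, and no correction for multiple substitutions at one site is applied, which dedicated estimators normally include.

```python
def dn_ds(nonsyn_subs, nonsyn_sites, syn_subs, syn_sites):
    """Uncorrected dN/dS ratio from substitution and site counts.

    Returns None when there are no synonymous substitutions, because the
    ratio is then undefined.
    """
    if nonsyn_sites <= 0 or syn_sites <= 0:
        raise ValueError("site counts must be positive")
    dn = nonsyn_subs / nonsyn_sites  # non-synonymous substitutions per site
    ds = syn_subs / syn_sites        # synonymous substitutions per site
    if ds == 0:
        return None
    return dn / ds
```

    A ratio above 1 is conventionally read as a hint of positive selection and a ratio below 1 as purifying selection. For example, 30 non-synonymous changes over 300 sites and 5 synonymous changes over 100 sites give a ratio of about 2.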

  9. Whole-genome sequencing of Theileria parva strains provides insight into parasite migration and diversification in the African continent.

    PubMed

    Hayashida, Kyoko; Abe, Takashi; Weir, William; Nakao, Ryo; Ito, Kimihito; Kajino, Kiichi; Suzuki, Yutaka; Jongejan, Frans; Geysen, Dirk; Sugimoto, Chihiro

    2013-06-01

    The disease caused by the apicomplexan protozoan parasite Theileria parva, known as East Coast fever or Corridor disease, is one of the most serious cattle diseases in Eastern, Central, and Southern Africa. We performed whole-genome sequencing of nine T. parva strains, including one of the vaccine strains (Kiambu 5), field isolates from Zambia, Uganda, Tanzania, or Rwanda, and two buffalo-derived strains. Comparison with the reference Muguga genome sequence revealed 34 814-121 545 single nucleotide polymorphisms (SNPs) that were more abundant in buffalo-derived strains. High-resolution phylogenetic trees were constructed with selected informative SNPs that allowed the investigation of possible complex recombination events among ancestors of the extant strains. We further analysed the dN/dS ratio (non-synonymous substitutions per non-synonymous site divided by synonymous substitutions per synonymous site) for 4011 coding genes to estimate potential selective pressure. Genes under possible positive selection were identified that may, in turn, assist in the identification of immunogenic proteins or vaccine candidates. This study elucidated the phylogeny of T. parva strains based on genome-wide SNPs analysis with prediction of possible past recombination events, providing insight into the migration, diversification, and evolution of this parasite species in the African continent.

  10. Whole genome sequencing in the search for genes associated with the control of SIV infection in the Mauritian macaque model.

    PubMed

    de Manuel, Marc; Shiina, Takashi; Suzuki, Shingo; Dereuddre-Bosquet, Nathalie; Garchon, Henri-Jean; Tanaka, Masayuki; Congy-Jolivet, Nicolas; Aarnink, Alice; Le Grand, Roger; Marques-Bonet, Tomas; Blancher, Antoine

    2018-05-08

    In the Mauritian macaque experimentally inoculated with SIV, gene polymorphisms potentially associated with the plasma virus load at a set point, approximately 100 days post inoculation, were investigated. Among the 42 animals inoculated with 50 AID50 of the same strain of SIV, none of which received any preventive or curative treatment, nine individuals were selected: three with a plasma virus load (PVL) among the lowest, three with intermediate PVL values and three among the highest PVL values. The complete genomes of these nine animals were then analyzed. Initially, attention was focused on variants with a potential functional impact on protein encoding genes (non-synonymous SNPs (NS-SNPs) and splicing variants). Thus, 424 NS-SNPs possibly associated with PVL were detected. The 424 candidate SNPs were genotyped in these 42 SIV experimentally infected animals (including the nine animals subjected to whole genome sequencing). The genes containing variants most probably associated with PVL at a set time point are analyzed herein.

  11. Fine-mapping identifies multiple prostate cancer risk loci at 5p15, one of which associates with TERT expression

    PubMed Central

    Kote-Jarai, Zsofia; Saunders, Edward J.; Leongamornlert, Daniel A.; Tymrakiewicz, Malgorzata; Dadaev, Tokhir; Jugurnauth-Little, Sarah; Ross-Adams, Helen; Al Olama, Ali Amin; Benlloch, Sara; Halim, Silvia; Russel, Roslin; Dunning, Alison M.; Luccarini, Craig; Dennis, Joe; Neal, David E.; Hamdy, Freddie C.; Donovan, Jenny L.; Muir, Ken; Giles, Graham G.; Severi, Gianluca; Wiklund, Fredrik; Gronberg, Henrik; Haiman, Christopher A.; Schumacher, Fredrick; Henderson, Brian E.; Le Marchand, Loic; Lindstrom, Sara; Kraft, Peter; Hunter, David J.; Gapstur, Susan; Chanock, Stephen; Berndt, Sonja I.; Albanes, Demetrius; Andriole, Gerald; Schleutker, Johanna; Weischer, Maren; Canzian, Federico; Riboli, Elio; Key, Tim J.; Travis, Ruth C.; Campa, Daniele; Ingles, Sue A.; John, Esther M.; Hayes, Richard B.; Pharoah, Paul; Khaw, Kay-Tee; Stanford, Janet L.; Ostrander, Elaine A.; Signorello, Lisa B.; Thibodeau, Stephen N.; Schaid, Dan; Maier, Christiane; Vogel, Walther; Kibel, Adam S.; Cybulski, Cezary; Lubinski, Jan; Cannon-Albright, Lisa; Brenner, Hermann; Park, Jong Y.; Kaneva, Radka; Batra, Jyotsna; Spurdle, Amanda; Clements, Judith A.; Teixeira, Manuel R.; Govindasami, Koveela; Guy, Michelle; Wilkinson, Rosemary A.; Sawyer, Emma J.; Morgan, Angela; Dicks, Ed; Baynes, Caroline; Conroy, Don; Bojesen, Stig E.; Kaaks, Rudolf; Vincent, Daniel; Bacot, François; Tessier, Daniel C.; Easton, Douglas F.; Eeles, Rosalind A.

    2013-01-01

    Associations between single nucleotide polymorphisms (SNPs) at 5p15 and multiple cancer types have been reported. We have previously shown evidence for a strong association between prostate cancer (PrCa) risk and rs2242652 at 5p15, intronic in the telomerase reverse transcriptase (TERT) gene that encodes TERT. To comprehensively evaluate the association between genetic variation across this region and PrCa, we performed a fine-mapping analysis by genotyping 134 SNPs using a custom Illumina iSelect array or Sequenom MassArray iPlex, followed by imputation of 1094 SNPs in 22 301 PrCa cases and 22 320 controls in The PRACTICAL consortium. Multiple stepwise logistic regression analysis identified four signals in the promoter or intronic regions of TERT that independently associated with PrCa risk. Gene expression analysis of normal prostate tissue showed evidence that SNPs within one of these regions also associated with TERT expression, providing a potential mechanism for predisposition to disease. PMID:23535824

  12. [Association analysis between SNPs of the growth hormone receptor gene and growth traits in arctic fox].

    PubMed

    DU, Zhi-Heng; Liu, Zong-Yue; Bai, Xiu-Juan

    2010-06-01

    Using single-strand conformation polymorphism (PCR-SSCP) and DNA sequencing, single nucleotide polymorphisms (SNPs) of the growth hormone receptor (GHR) gene were detected in an arctic fox population. Correlation analysis between GHR polymorphisms and growth traits was carried out using the appropriate model. Four SNPs, G3A in the 5'UTR, C99T in the first exon, and T59C and G65A in the fifth exon, were identified in the arctic fox GHR gene. The G3A and C99T polymorphisms of GHR were associated with female fox body weight (P<0.05), and the T59C and G65A polymorphisms of GHR were associated with male fox body weight (P<0.05) and the skin length of the female fox (P<0.01). Therefore, marker-assisted selection on body weight and skin length of arctic foxes using these SNPs can be applied to obtain large, high-quality arctic foxes.

  13. Interpopulation hybridization results in widespread viability selection across the genome in Tigriopus californicus

    PubMed Central

    2011-01-01

    Background Genetic interactions within hybrids influence their overall fitness. Understanding the details of these interactions can improve our understanding of speciation. One experimental approach is to investigate deviations from Mendelian expectations (segregation distortion) in the inheritance of mapped genetic markers. In this study, we used the copepod Tigriopus californicus, a species which exhibits high genetic divergence between populations and a general pattern of reduced fitness in F2 interpopulation hybrids. Previous studies have implicated both nuclear-cytoplasmic and nuclear-nuclear interactions in causing this fitness reduction. We identified and mapped population-diagnostic single nucleotide polymorphisms (SNPs) and used these to examine segregation distortion across the genome within F2 hybrids. Results We generated a linkage map which included 45 newly elucidated SNPs and 8 population-diagnostic microsatellites used in previous studies. The map, the first available for the Copepoda, was estimated to cover 75% of the genome and included markers on all 12 T. californicus chromosomes. We observed little segregation distortion in newly hatched F2 hybrid larvae (fewer than 10% of markers at p < 0.05), but strikingly higher distortion in F2 hybrid adult males (45% of markers at p < 0.05). Hence, segregation distortion was primarily caused by selection against particular genetic combinations which acted between hatching and maturity. Distorted markers were not distributed randomly across the genome but clustered on particular chromosomes. In contrast to other studies in this species we found little evidence for cytonuclear coadaptation. Instead, different linkage groups exhibited markedly different patterns of distortion, which appear to have been influenced by nuclear-nuclear epistatic interactions and may also reflect genetic load carried within the parental lines. Conclusion Adult male F2 hybrids between two populations of T. 
californicus exhibit dramatic segregation distortion across the genome. Distorted loci are clustered within specific linkage groups, and the direction of distortion differs between chromosomes. This segregation distortion is due to selection acting between hatching and adulthood. PMID:21639918
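
    The abstract does not say which statistical test was used to flag distorted markers. As a rough illustration only, the sketch below assumes a chi-square goodness-of-fit test of F2 genotype counts against the Mendelian 1:2:1 expectation; the function name and the counts are hypothetical.

```python
import math


def segregation_distortion(n_aa, n_ab, n_bb):
    """Chi-square goodness-of-fit test of F2 genotype counts against 1:2:1.

    Assumed test for illustration; the study's actual method is not given
    in the abstract.
    """
    total = n_aa + n_ab + n_bb
    observed = (n_aa, n_ab, n_bb)
    expected = (total / 4, total / 2, total / 4)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # With 2 degrees of freedom the chi-square survival function
    # has the closed form exp(-x / 2), so no statistics library is needed.
    p_value = math.exp(-chi2 / 2)
    return chi2, p_value


# Hypothetical marker: 40 AA, 50 AB, 10 BB among 100 F2 adults.
chi2, p = segregation_distortion(40, 50, 10)
print(f"chi2 = {chi2:.2f}, p = {p:.2e}")
```

    A marker would count as distorted at the abstract's threshold when the returned p-value is below 0.05.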

  14. Increasing the discrimination power of ancestry- and identity-informative SNP loci within the ForenSeq™ DNA Signature Prep Kit.

    PubMed

    King, Jonathan L; Churchill, Jennifer D; Novroski, Nicole M M; Zeng, Xiangpei; Warshauer, David H; Seah, Lay-Hong; Budowle, Bruce

    2018-06-06

    The use of single nucleotide polymorphisms (SNPs) in forensic genetics has been limited to challenged samples with low template and/or degraded DNA. The recent introduction of massively parallel sequencing (MPS) technologies has expanded the potential applications of these markers and increased the discrimination power of well-established loci by considering variation in the flanking regions of target loci. The ForenSeq Signature Preparation Kit contains 165 SNP amplicons for ancestry- (aiSNPs), identity- (iiSNPs), and phenotype-inference (piSNPs). In this study, 714 individuals from four major populations (African American, AFA; East Asian, ASN; US Caucasian, CAU; and Southwest US Hispanic, HIS) previously reported by Churchill et al. [Forensic Sci Int Genet. 30 (2017) 81-92; DOI: https://doi.org/10.1016/j.fsigen.2017.06.004] were assessed using STRait Razor v2s to determine the level of diversity in the flanking regions of these amplicons. The results show that nearly 70% of loci showed some level of flanking region variation with 22 iiSNPs and 8 aiSNPs categorized as microhaplotypes in this study. The heterozygosities of these microhaplotypes approached, and in one instance surpassed, those of some core STR loci. Also, the impact of the flanking region on other forensic parameters (e.g., power of exclusion and power of discrimination) was examined. Sixteen of the 94 iiSNPs had an effective allele number greater than 2.00 across the four populations. To assess what effect the flanking region information had on the ancestry inference, genotype probabilities and likelihood ratios were determined. Additionally, concordance with the ForenSeq UAS and Nextera Rapid Capture was evaluated, and patterns of heterozygote imbalance were identified. Pairwise comparison of the iiSNP diplotypes determined the probability of detecting a mixture (i.e., observing ≥ 3 haplotypes) using these loci alone was 0.9952. 
The improvement in random match probabilities for the full regions over the target iiSNPs was found to be significant. When combining the iiSNPs with the autosomal STRs, the combined match probabilities ranged from 6.40 × 10^-73 (ASN) to 1.02 × 10^-79 (AFA). Copyright © 2018 Elsevier B.V. All rights reserved.
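
    The quantities named in this abstract (heterozygosity, effective allele number, match probability) have standard textbook definitions. The sketch below is a minimal illustration under the assumptions of Hardy-Weinberg proportions and independent loci; the allele frequencies are hypothetical and the code is not the study's own pipeline.

```python
from itertools import combinations_with_replacement
from math import prod


def expected_heterozygosity(freqs):
    """Expected heterozygosity: 1 minus the sum of squared allele frequencies."""
    return 1.0 - sum(p * p for p in freqs)


def effective_allele_number(freqs):
    """Effective number of alleles: reciprocal of the sum of squared frequencies."""
    return 1.0 / sum(p * p for p in freqs)


def random_match_probability(freqs):
    """Sum of squared genotype frequencies, assuming Hardy-Weinberg proportions."""
    rmp = 0.0
    for i, j in combinations_with_replacement(range(len(freqs)), 2):
        genotype = freqs[i] ** 2 if i == j else 2 * freqs[i] * freqs[j]
        rmp += genotype ** 2
    return rmp


def combined_match_probability(loci):
    """Product of per-locus match probabilities, assuming independent loci."""
    return prod(random_match_probability(freqs) for freqs in loci)


# Hypothetical loci: a biallelic SNP and the same region scored as a
# four-haplotype microhaplotype once flanking variation is included.
snp = [0.5, 0.5]
microhaplotype = [0.25, 0.25, 0.25, 0.25]
print(effective_allele_number(snp), effective_allele_number(microhaplotype))
print(random_match_probability(snp), random_match_probability(microhaplotype))
```

    A biallelic SNP cannot exceed an effective allele number of 2, which is why flanking-region haplotypes are needed to pass the 2.00 mark mentioned in the abstract.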

  15. Genetic contributions to the association between adult height and testicular germ cell tumors.

    PubMed

    Cook, Michael B; Chia, Victoria M; Berndt, Sonja I; Graubard, Barry I; Chanock, Stephen J; Rubertone, Mark V; Erickson, Ralph L; Hayes, Richard B; McGlynn, Katherine A

    2011-06-01

    Previously, we have shown that increasing adult height is associated with increased risk of testicular germ-cell tumor (TGCT). Recently, a number of single nucleotide polymorphisms (SNPs) have been found to be related to height. We examined whether these SNPs were associated with TGCT and whether they explained the relationship between height and TGCT. We genotyped 15 height-related SNPs in the US Servicemen's Testicular Tumor Environmental and Endocrine Determinants (STEED) case-control study. DNA was extracted from buccal cell samples and Taqman assays were used to type the selected SNPs. We used logistic regression models to estimate odds ratios (ORs) and 95% confidence intervals (95%CIs). There were 561 cases and 676 controls for analysis. Two SNPs were found to be associated with risk of TGCT, rs6060373 (CC vs TT, OR = 1.51, 95% CI: 1.06-2.15) and rs143384 (CC vs TT, OR = 1.53, 95% CI: 1.09-2.15). rs6060373 is an intronic polymorphism of ubiquinol-cytochrome c reductase complex chaperone (UQCC), and rs143384 is a 5'UTR polymorphism of growth differentiation factor 5 (GDF5). No individual SNP attenuated the association between height and TGCT. Adjustment for all SNPs previously associated with adult height reduced the associations between adult height and TGCT by ~8.5%, although the P-value indicated only weak evidence that this difference was important (P = 0.26). This novel analysis provides tentative evidence that SNPs which are associated with adult height may also share an association with risk of TGCT.
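
    The study estimated odds ratios with logistic regression. As a simplified stand-in, the sketch below computes an unadjusted odds ratio and a Wald (Woolf) 95% confidence interval from a 2x2 table; the counts are hypothetical and do not come from the STEED data.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald (Woolf) confidence interval.

    a: cases with the genotype of interest, b: cases with the reference genotype,
    c: controls with the genotype of interest, d: controls with the reference genotype.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only.
or_, lo, hi = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {or_:.2f}, 95% CI: {lo:.2f}-{hi:.2f}")
```

    An interval that excludes 1, as in the two SNPs reported above, corresponds to significance at roughly the 5% level under this approximation.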

  16. Genetic analysis of candidate SNPs for metabolic syndrome in obstructive sleep apnea (OSA)

    PubMed Central

    Grilo, Antonio; Ruiz-Granados, Elena S.; Moreno-Rey, Concha; Rivera, Jose M.; Ruiz, Agustin; Real, Luis M.; Sáez, Maria E.

    2014-01-01

    Obstructive sleep apnea (OSA) is a common disorder characterized by the reduction or complete cessation in airflow resulting from an obstruction of the upper airway. Several studies have observed an increased risk for cardiovascular morbidity and mortality among OSA patients. Metabolic syndrome (MetS), a cluster of cardiovascular risk factors characterized by the presence of insulin resistance, is often found in patients with OSA, but the complex interplay between these two syndromes is not well understood. In this study, we present the results of a genetic association analysis of 373 candidate SNPs for MetS selected in a previous genome wide association analysis (GWAS). The 384 selected SNPs were genotyped using the Illumina VeraCode Technology in 387 subjects retrospectively assessed at the Internal Medicine Unit of the “Virgen de Valme” University Hospital (Seville, Spain). In order to increase the power of this study and to validate our findings in an independent population, we used data from the Framingham Sleep study which comprises 368 individuals. Only the rs11211631 polymorphism was associated with OSA in both populations, with an estimated OR=0.57 (0.42-0.79) in the joint analysis (p=7.21 × 10^-4). This SNP was selected in the previous GWAS for MetS components using a digenic approach, but was not significant in the monogenic study. We have also identified two SNPs (rs2687855 and rs4299396) with a protective effect against OSA only in the abdominal obese subpopulation. As a whole, our study does not support the hypothesis that OSA and MetS share major genetic determinants, although both syndromes share common epidemiological and clinical features. PMID:23524009

  17. Increased fire frequency promotes stronger spatial genetic structure and natural selection at regional and local scales in Pinus halepensis Mill

    PubMed Central

    González-Martínez, Santiago C.; Navascués, Miguel; Burgarella, Concetta; Mosca, Elena; Lorenzo, Zaida; Zabal-Aguirre, Mario; Vendramin, Giovanni G.; Verdú, Miguel; Pausas, Juli G.

    2017-01-01

    Background and Aims The recurrence of wildfires is predicted to increase due to global climate change, resulting in severe impacts on biodiversity and ecosystem functioning. Recurrent fires can drive plant adaptation and reduce genetic diversity; however, the underlying population genetic processes have not been studied in detail. In this study, the neutral and adaptive evolutionary effects of contrasting fire regimes were examined in the keystone tree species Pinus halepensis Mill. (Aleppo pine), a fire-adapted conifer. The genetic diversity, demographic history and spatial genetic structure were assessed at local (within-population) and regional scales for populations exposed to different crown fire frequencies. Methods Eight natural P. halepensis stands were sampled in the east of the Iberian Peninsula, five of them in a region exposed to frequent crown fires (HiFi) and three of them in an adjacent region with a low frequency of crown fires (LoFi). Samples were genotyped at nine neutral simple sequence repeats (SSRs) and at 251 single nucleotide polymorphisms (SNPs) from coding regions, some of them potentially important for fire adaptation. Key Results Fire regime had no effects on genetic diversity or demographic history. Three high-differentiation outlier SNPs were identified between HiFi and LoFi stands, suggesting fire-related selection at the regional scale. At the local scale, fine-scale spatial genetic structure (SGS) was overall weak as expected for a wind-pollinated and wind-dispersed tree species. HiFi stands displayed a stronger SGS than LoFi stands at SNPs, which probably reflected the simultaneous post-fire recruitment of co-dispersed related seeds. SNPs with exceptionally strong SGS, a proxy for microenvironmental selection, were only reliably identified under the HiFi regime. Conclusions An increasing fire frequency as predicted due to global change can promote increased SGS with stronger family structures and alter natural selection in P. 
halepensis and in plants with similar life history traits. PMID:28159988

  18. A Comparison Between Genotyping-by-sequencing and Array-based Scoring of SNPs for Genomic Prediction Accuracy in Winter Wheat

    USDA-ARS's Scientific Manuscript database

    The utilization of DNA molecular markers in plant breeding to maximize selection response via marker assisted selection (MAS) and genomic selection (GS) has the potential to revolutionize plant breeding. A key factor affecting GS applicability is the choice of molecular marker platform. Genotypying-...

  19. Evaluation of 41 Candidate Gene Variants for Obesity in the EPIC-Potsdam Cohort by Multi-Locus Stepwise Regression

    PubMed Central

    Knüppel, Sven; Rohde, Klaus; Meidtner, Karina; Drogan, Dagmar; Holzhütter, Hermann-Georg; Boeing, Heiner; Fisher, Eva

    2013-01-01

    Objective Obesity has become a leading preventable cause of morbidity and mortality in many parts of the world. It is thought to originate from multiple genetic and environmental determinants. The aim of the current study was to introduce haplotype-based multi-locus stepwise regression (MSR) as a method to investigate combinations of unlinked single nucleotide polymorphisms (SNPs) for obesity phenotypes. Methods In 2,122 healthy randomly selected men and women of the EPIC-Potsdam cohort, the association between 41 SNPs from 18 obesity-candidate genes and either body mass index (BMI, mean = 25.9 kg/m^2, SD = 4.1) or waist circumference (WC, mean = 85.2 cm, SD = 12.6) was assessed. Single SNP analyses were done by using linear regression adjusted for age, sex, and other covariates. Subsequently, MSR was applied to search for the ‘best’ SNP combinations. Combinations were selected according to specific AICc and p-value criteria. Model uncertainty was accounted for by a permutation test. Results The strongest single SNP effects on BMI were found for TBC1D1 rs637797 (β = −0.33, SE = 0.13), FTO rs9939609 (β = 0.28, SE = 0.13), MC4R rs17700144 (β = 0.41, SE = 0.15), and MC4R rs10871777 (β = 0.34, SE = 0.14). All these SNPs showed similar effects on waist circumference. The two ‘best’ six-SNP combinations for BMI (global p-value = 3.45 × 10^-6 and 6.82 × 10^-6) showed effects ranging from −1.70 (SE = 0.34) to 0.74 kg/m^2 (SE = 0.21) per allele combination. We selected two six-SNP combinations on waist circumference (global p-value = 7.80 × 10^-6 and 9.76 × 10^-6) with an allele combination effect of −2.96 cm (SE = 0.76) at maximum. Additional adjustment for BMI revealed 15 three-SNP combinations (global p-values ranged from 3.09 × 10^-4 to 1.02 × 10^-2). However, after carrying out the permutation test all SNP combinations lost significance indicating that the statistical associations might have occurred by chance. 
Conclusion MSR provides a tool to search for risk-related SNP combinations of common traits or diseases. However, the search process does not always find meaningful SNP combinations in a dataset. PMID:23874820

  20. Transcriptome sequencing of Eucalyptus camaldulensis seedlings subjected to water stress reveals functional single nucleotide polymorphisms and genes under selection

    PubMed Central

    2012-01-01

    Background Water stress limits plant survival and production in many parts of the world. Identification of genes and alleles responding to water stress conditions is important in breeding plants better adapted to drought. Currently there are no studies examining the transcriptome wide gene and allelic expression patterns under water stress conditions. We used RNA sequencing (RNA-seq) to identify the candidate genes and alleles and to explore the evolutionary signatures of selection. Results We studied the effect of water stress on gene expression in Eucalyptus camaldulensis seedlings derived from three natural populations. We used reference-guided transcriptome mapping to study gene expression. Several genes showed differential expression between control and stress conditions. Gene ontology (GO) enrichment tests revealed up-regulation of 140 stress-related gene categories and down-regulation of 35 metabolic and cell wall organisation gene categories. More than 190,000 single nucleotide polymorphisms (SNPs) were detected and 2737 of these showed differential allelic expression. Allelic expression of 52% of these variants was correlated with differential gene expression. Signatures of selection patterns were studied by estimating the proportion of nonsynonymous to synonymous substitution rates (Ka/Ks). The average Ka/Ks ratio among the 13,719 genes was 0.39 indicating that most of the genes are under purifying selection. Among the positively selected genes (Ka/Ks > 1.5) apoptosis and cell death categories were enriched. Of the 287 positively selected genes, ninety genes showed differential expression and 27 SNPs from 17 positively selected genes showed differential allelic expression between treatments. Conclusions Correlation of allelic expression of several SNPs with total gene expression indicates that these variants may be the cis-acting variants or in linkage disequilibrium with such variants. 
Enrichment of apoptosis and cell death gene categories among the positively selected genes reveals the past selection pressures experienced by the populations used in this study. PMID:22853646

  1. Genome-wide identification of allele-specific expression (ASE) in response to Marek's disease virus infection using next generation sequencing.

    PubMed

    Maceachern, Sean; Muir, William M; Crosby, Seth; Cheng, Hans H

    2011-06-03

    Marek's disease (MD), a T cell lymphoma induced by the highly oncogenic α-herpesvirus Marek's disease virus (MDV), is the main chronic infectious disease concern threatening the poultry industry. Enhancing genetic resistance to MD in commercial poultry is an attractive method to augment MD vaccination, which is currently the control method of choice. In order to optimally implement this control strategy through marker-assisted selection (MAS) and to gain biological information, it is necessary to identify specific genes that influence MD incidence. A genome-wide screen for allele-specific expression (ASE) in response to MDV infection was conducted. The highly inbred ADOL chicken lines 6 (MD resistant) and 7 (MD susceptible) were inter-mated in reciprocal crosses and half of the progeny challenged with MDV. Splenic RNA pools for each treatment group at a single time point after infection were generated, sequenced using a next generation sequencer, then analyzed for ASE. To validate and extend the results, Illumina GoldenGate assays for selected cSNPs were developed and used on all RNA samples from all 6 time points following MDV challenge. RNA sequencing resulted in 11-13+ million mappable reads per treatment group, 1.7+ Gb total sequence, and 22,655 high-confidence cSNPs. Analysis of these cSNPs revealed that 5360 cSNPs in 3773 genes exhibited statistically significant allelic imbalance. Of the 1536 GoldenGate assays, 1465 were successfully scored with all but 19 exhibiting evidence for allelic imbalance. ASE is an efficient method to identify potentially all or most of the genes influencing this complex trait. The identified cSNPs can be further evaluated in resource populations to determine their allelic direction and size of effect on genetic resistance to MD as well as being directly implemented in genomic selection programs. 
The described method, although demonstrated in inbred chicken lines, is applicable to all traits in any diploid species, and should prove to be a simple method to identify the majority of genes controlling any complex trait.

  2. RTEL1 tagging SNPs and haplotypes were associated with glioma development

    PubMed Central

    2013-01-01

    Abstract Glioma is the most prevalent primary solid tumor of the central nervous system, and certain single-nucleotide polymorphisms (SNPs) may be related to increased glioma risk and have implications in carcinogenesis. The present case–control study was carried out to elucidate how common variants contribute to glioma susceptibility. Ten candidate tagging SNPs (tSNPs) were selected from seven genes whose polymorphisms have been reported in the literature and in reliable databases to be related to gliomas, and with a minor allele frequency (MAF) > 5% in the HapMap Asian population. The selected tSNPs were genotyped in 629 glioma patients and 645 controls from a Han Chinese population using the multiplexed SNP MassEXTEND assay. Two tSNPs in the RTEL1 gene were significantly associated with glioma risk (rs6010620, P = 0.0016, OR: 1.32, 95% CI: 1.11-1.56; rs2297440, P = 0.001, OR: 1.33, 95% CI: 1.12-1.58) by χ2 test. The genotype “GG” of rs6010620 was identified as a protective genotype for glioma (OR, 0.46; 95% CI, 0.31-0.7; P = 0.0002), as was the genotype “CC” of rs2297440 (OR, 0.47; 95% CI, 0.31-0.71; P = 0.0003). Furthermore, haplotype “GCT” in the RTEL1 gene was associated with a reduced risk of glioma (OR, 0.7; 95% CI, 0.57-0.86; Fisher’s P = 0.0005; Pearson’s P = 0.0005), and haplotype “ATT” was associated with an increased risk of glioma (OR, 1.32; 95% CI, 1.12-1.57; Fisher’s P = 0.0013; Pearson’s P = 0.0013). Two single variants in the RTEL1 gene, the “GG” genotype of rs6010620 and the “CC” genotype of rs2297440, together with the two haplotypes GCT and ATT, were thus identified as associated with glioma development. Screening for these RTEL1 tagging SNPs and haplotypes might be used to evaluate the risk of glioma development. 
Virtual slides The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1993021136961998 PMID:23683922

  3. Signatures of negative selection in the genetic architecture of human complex traits.

    PubMed

    Zeng, Jian; de Vlaming, Ronald; Wu, Yang; Robinson, Matthew R; Lloyd-Jones, Luke R; Yengo, Loic; Yap, Chloe X; Xue, Angli; Sidorenko, Julia; McRae, Allan F; Powell, Joseph E; Montgomery, Grant W; Metspalu, Andres; Esko, Tonu; Gibson, Greg; Wray, Naomi R; Visscher, Peter M; Yang, Jian

    2018-05-01

    We develop a Bayesian mixed linear model that simultaneously estimates single-nucleotide polymorphism (SNP)-based heritability, polygenicity (proportion of SNPs with nonzero effects), and the relationship between SNP effect size and minor allele frequency for complex traits in conventionally unrelated individuals using genome-wide SNP data. We apply the method to 28 complex traits in the UK Biobank data (N = 126,752) and show that on average, 6% of SNPs have nonzero effects, which in total explain 22% of phenotypic variance. We detect significant (P < 0.05/28) signatures of natural selection in the genetic architecture of 23 traits, including reproductive, cardiovascular, and anthropometric traits, as well as educational attainment. The significant estimates of the relationship between effect size and minor allele frequency in complex traits are consistent with a model of negative (or purifying) selection, as confirmed by forward simulation. We conclude that negative selection acts pervasively on the genetic variants associated with human complex traits.

  4. Validating genetic markers of response to recombinant human growth hormone in children with growth hormone deficiency and Turner syndrome: the PREDICT validation study

    PubMed Central

    Stevens, Adam; Murray, Philip; Wojcik, Jerome; Raelson, John; Koledova, Ekaterina; Chatelain, Pierre

    2016-01-01

    Objective Single-nucleotide polymorphisms (SNPs) associated with the response to recombinant human growth hormone (r-hGH) have previously been identified in growth hormone deficiency (GHD) and Turner syndrome (TS) children in the PREDICT long-term follow-up (LTFU) study (Nbib699855). Here, we describe the PREDICT validation (VAL) study (Nbib1419249), which aimed to confirm these genetic associations. Design and methods Children with GHD (n = 293) or TS (n = 132) were recruited retrospectively from 29 sites in nine countries. All children had completed 1 year of r-hGH therapy. 48 SNPs previously identified as associated with first year growth response to r-hGH were genotyped. Regression analysis was used to assess the association between genotype and growth response using clinical/auxological variables as covariates. Further analysis was undertaken using random forest classification. Results The children were younger, and the growth response was higher in VAL study. Direct genotype analysis did not replicate what was found in the LTFU study. However, using exploratory regression models with covariates, a consistent relationship with growth response in both VAL and LTFU was shown for four genes – SOS1 and INPPL1 in GHD and ESR1 and PTPN1 in TS. The random forest analysis demonstrated that only clinical covariates were important in the prediction of growth response in mild GHD (>4 to <10 μg/L on GH stimulation test), however, in severe GHD (≤4 μg/L) several SNPs contributed (in IGF2, GRB10, FOS, IGFBP3 and GHRHR). Conclusions The PREDICT validation study supports, in an independent cohort, the association of four of 48 genetic markers with growth response to r-hGH treatment in both pre-pubertal GHD and TS children after controlling for clinical/auxological covariates. However, the contribution of these SNPs in a prediction model of first-year response is not sufficient for routine clinical use. PMID:27651465

  5. Validating genetic markers of response to recombinant human growth hormone in children with growth hormone deficiency and Turner syndrome: the PREDICT validation study.

    PubMed

    Stevens, Adam; Murray, Philip; Wojcik, Jerome; Raelson, John; Koledova, Ekaterina; Chatelain, Pierre; Clayton, Peter

    2016-12-01

    Single-nucleotide polymorphisms (SNPs) associated with the response to recombinant human growth hormone (r-hGH) have previously been identified in growth hormone deficiency (GHD) and Turner syndrome (TS) children in the PREDICT long-term follow-up (LTFU) study (Nbib699855). Here, we describe the PREDICT validation (VAL) study (Nbib1419249), which aimed to confirm these genetic associations. Children with GHD (n = 293) or TS (n = 132) were recruited retrospectively from 29 sites in nine countries. All children had completed 1 year of r-hGH therapy. 48 SNPs previously identified as associated with first year growth response to r-hGH were genotyped. Regression analysis was used to assess the association between genotype and growth response using clinical/auxological variables as covariates. Further analysis was undertaken using random forest classification. The children were younger, and the growth response was higher in VAL study. Direct genotype analysis did not replicate what was found in the LTFU study. However, using exploratory regression models with covariates, a consistent relationship with growth response in both VAL and LTFU was shown for four genes - SOS1 and INPPL1 in GHD and ESR1 and PTPN1 in TS. The random forest analysis demonstrated that only clinical covariates were important in the prediction of growth response in mild GHD (>4 to <10 μg/L on GH stimulation test), however, in severe GHD (≤4 μg/L) several SNPs contributed (in IGF2, GRB10, FOS, IGFBP3 and GHRHR). The PREDICT validation study supports, in an independent cohort, the association of four of 48 genetic markers with growth response to r-hGH treatment in both pre-pubertal GHD and TS children after controlling for clinical/auxological covariates. However, the contribution of these SNPs in a prediction model of first-year response is not sufficient for routine clinical use. © 2016 European Society of Endocrinology.

  6. No association between polymorphisms and haplotypes of COL1A1 and COL1A2 genes and osteoporotic fracture in postmenopausal Chinese women

    PubMed Central

    Hu, Wei-wei; He, Jin-wei; Zhang, Hao; Wang, Chun; Gu, Jie-mei; Yue, Hua; Ke, Yao-hua; Hu, Yun-qiu; Fu, Wen-zhen; Li, Miao; Liu, Yu-juan; Zhang, Zhen-lin

    2011-01-01

    Aim: To study whether genetic polymorphisms of COL1A1 and COL1A2 genes affected the onset of fracture in postmenopausal Chinese women. Methods: SNPs in COL1A1 and COL1A2 genes were identified via direct sequencing in 32 unrelated postmenopausal Chinese women. Ten SNPs were genotyped in 1252 postmenopausal Chinese women. The associations were examined using both single-SNP and haplotype tests using logistic regression. Results: Twenty four (4 novel) and 28 (7 novel) SNPs were identified in COL1A1 and COL1A2 gene, respectively. The distribution frequencies of 2 SNPs in COL1A1 (rs2075554 and rs2586494) and 3 SNPs in COL1A2 (rs42517, rs1801182, and rs42524) were significantly different from those documented for the European Caucasian population. No significant difference was observed between fracture and control groups with respect to allele frequency or genotype distribution in 9 selected SNPs and haplotype. No significant association was found between fragility fracture and each SNP or haplotype. The results remained the same after additional corrections for other risk factors such as weight, height, and bone mineral density. Conclusion: Our results show no association between common genetic variations of COL1A1 and COL1A2 genes and fracture, suggesting the complex genetic background of osteoporotic fractures. PMID:21602843

  7. Finding a Needle in a Haystack: Distinguishing Mexican Maize Landraces Using a Small Number of SNPs

    PubMed Central

    Caldu-Primo, Jose L.; Mastretta-Yanes, Alicia; Wegier, Ana; Piñero, Daniel

    2017-01-01

    In Mexico's territory, the center of origin and domestication of maize (Zea mays), there is a large phenotypic diversity of this crop. This diversity has been classified into “landraces.” Previous studies have reported that genomic variation in Mexican maize is better explained by environmental factors, particularly those related with altitude, than by landrace. Still, landraces are extensively used by agronomists, who recognize them as stable and discriminatory categories for the classification of samples. In order to investigate the genomic foundation of maize landraces, we analyzed genomic data (35,909 SNPs from Illumina MaizeSNP50 BeadChip) obtained from 50 samples representing five maize landraces (Comiteco, Conejo, Tehua, Zapalote Grande, and Zapalote Chico), and searched for markers suitable for landrace assignment. Landrace clusters could not be identified taking all the genomic information, but they become manifest taking only a subset of SNPs with high FST among landraces. Discriminant analysis of principal components was conducted to classify samples using SNP data. Two classification analyses were done, first classifying samples by landrace and then by altitude category. Through this classification method, we identified 20 landrace-informative SNPs and 14 altitude-informative SNPs, with only 6 SNPs in common for both analyses. These results show that Mexican maize phenotypic diversity can be classified in landraces using a small number of genomic markers, given the fact that landrace genomic diversity is influenced by environmental factors as well as artificial selection due to bio-cultural practices. PMID:28458682
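
    The abstract reports that landrace clusters only emerge when the analysis is restricted to SNPs with high FST. The sketch below illustrates that filtering step with a basic Wright's FST for a biallelic SNP, weighting populations equally; the estimator, the SNP names and the frequencies are assumptions for illustration, since the abstract does not specify how FST was computed.

```python
def fst(allele_freqs):
    """Wright's FST for one biallelic SNP from per-population allele frequencies.

    Populations are weighted equally (an assumption made for simplicity).
    """
    k = len(allele_freqs)
    p_bar = sum(allele_freqs) / k
    h_total = 2 * p_bar * (1 - p_bar)  # expected heterozygosity, pooled
    if h_total == 0:
        return 0.0  # monomorphic SNP carries no differentiation signal
    h_within = sum(2 * p * (1 - p) for p in allele_freqs) / k
    return (h_total - h_within) / h_total


def top_fst_snps(freq_table, n):
    """Return the n SNP identifiers with the highest FST."""
    ranked = sorted(freq_table, key=lambda snp: fst(freq_table[snp]), reverse=True)
    return ranked[:n]


# Hypothetical allele frequencies in two landraces.
freq_table = {
    "snp_a": [0.5, 0.5],
    "snp_b": [0.0, 1.0],
    "snp_c": [0.2, 0.8],
}
print(top_fst_snps(freq_table, 2))
```

    Only the retained SNPs would then be passed to a classifier such as the discriminant analysis of principal components mentioned in the abstract.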

  8. A novel fluorescent aptasensor based on hairpin structure of complementary strand of aptamer and nanoparticles as a signal amplification approach for ultrasensitive detection of cocaine.

    PubMed

    Emrani, Ahmad Sarreshtehdar; Danesh, Noor Mohammad; Ramezani, Mohammad; Taghdisi, Seyed Mohammad; Abnous, Khalil

    2016-05-15

    Cocaine is one of the most commonly misused stimulants and can affect the central nervous system. In this study, a fluorescent aptamer-based sensor (aptasensor) was designed for sensitive and selective detection of cocaine, based on the hairpin structure of the complementary strand of the aptamer (CS), target-induced release of the aptamer (Apt) from the CS, and two kinds of nanoparticles: silica nanoparticles (SNPs) coated with streptavidin and gold nanoparticles (AuNPs). The designed aptasensor exploits the unique optical properties and large surface area of AuNPs, the ability of SNPs to amplify fluorescence intensity, the higher affinity of Apt for its target than for its CS, and the hairpin structure of the CS, which brings the fluorophore (FAM) into close proximity to the surface of the SNPs. In the absence of cocaine, FAM is in close proximity to the surface of the AuNPs, resulting in weak fluorescence emission. In the presence of the target, FAM comes into close proximity to the surface of the SNPs because the CS forms a hairpin structure, leading to very strong fluorescence emission. The fabricated fluorescent aptasensor exhibited good selectivity toward cocaine with a limit of detection (LOD) as low as 209 pM. Moreover, the designed aptasensor was successfully used to detect cocaine in serum with an LOD as low as 293 pM. Copyright © 2015 Elsevier B.V. All rights reserved.

  9. Mining the transcriptomes of four commercially important shellfish species for single nucleotide polymorphisms within biomineralization genes.

    PubMed

    Vendrami, David L J; Shah, Abhijeet; Telesca, Luca; Hoffman, Joseph I

    2016-06-01

    Transcriptional profiling not only provides insights into patterns of gene expression, but also generates sequences that can be mined for molecular markers, which in turn can be used for population genetic studies. As part of a large-scale effort to better understand how commercially important European shellfish species may respond to ocean acidification, we therefore mined the transcriptomes of four species (the Pacific oyster Crassostrea gigas, the blue mussel Mytilus edulis, the great scallop Pecten maximus and the blunt gaper Mya truncata) for single nucleotide polymorphisms (SNPs). Illumina data for C. gigas, M. edulis and P. maximus and 454 data for M. truncata were interrogated using GATK and SWAP454 respectively to identify between 8267 and 47,159 high quality SNPs per species (total=121,053 SNPs residing within 34,716 different contigs). We then annotated the transcripts containing SNPs to reveal homology to diverse genes. Finally, as oceanic pH affects the ability of organisms to incorporate calcium carbonate, we honed in on genes implicated in the biomineralization process to identify a total of 1899 SNPs in 157 genes. These provide good candidates for biomarkers with which to study patterns of selection in natural or experimental populations. Copyright © 2016 Elsevier B.V. All rights reserved.

  10. Mining SNPs from EST sequences using filters and ensemble classifiers.

    PubMed

    Wang, J; Zou, Q; Guo, M Z

    2010-05-04

    Abundant single nucleotide polymorphisms (SNPs) provide the most complete information for genome-wide association studies. However, due to the bottleneck of manual discovery of putative SNPs and the inaccessibility of the original sequencing reads, it is essential to develop a more efficient and accurate computational method for automated SNP detection. We propose a novel computational method to rapidly find true SNPs in publicly available EST (expressed sequence tag) databases; this method is implemented as SNPDigger. EST sequences are clustered and aligned. SNP candidates are then obtained according to a measure of redundant frequency. Several new informative biological features, such as the structural neighbor profiles and the physical position of the SNP, were extracted from EST sequences, and the effectiveness of these features was demonstrated. An ensemble classifier, which employs a carefully selected feature set, was included to handle the imbalanced training data. The sensitivity and specificity of our method both exceeded 80% for human genetic data in cross-validation. Our method enables detection of SNPs from the user's own EST dataset and can be used on species for which there is no genome data. Our tests showed that this method can effectively guide SNP discovery in ESTs and can help reduce the cost of biological analyses.

  11. Genome-wide screening for highly discriminative SNPs for personal identification and their assessment in world populations.

    PubMed

    Li, Liming; Wang, Yi; Yang, Shuping; Xia, Mingying; Yang, Yajun; Wang, Jiucun; Lu, Daru; Pan, Xingwei; Ma, Teng; Jiang, Pei; Yu, Ge; Zhao, Ziqin; Ping, Yuan; Zhou, Huaigu; Zhao, Xueying; Sun, Hui; Liu, Bing; Jia, Dongtao; Li, Chengtao; Hu, Rile; Lu, Hongzhou; Liu, Xiaoyang; Chen, Wenqing; Mi, Qin; Xue, Fuzhong; Su, Yongdong; Jin, Li; Li, Shilin

    2017-05-01

    DNA profiling is applied in forensic investigations to identify perpetrators, missing family members and disaster victims. Forensic applications based on single nucleotide polymorphisms (SNPs) are emerging rapidly, with the potential to replace the short tandem repeat (STR) panels that are now widely used, and a well-designed SNP panel is needed to support this transition. Here we present a panel of 175 SNP markers (referred to as Fudan ID Panel or FID), selected from ∼3.6 million SNPs, for the application of personal identification. We optimized and validated the FID panel in 729 Chinese individuals using next generation sequencing (NGS) technology. We showed that the SNPs in the panel possess very high heterozygosity as well as low within- and among-continent differentiation, enabling the FID panel to exhibit discrimination power in both regional and worldwide populations, with the average match probabilities ranging from 4.77×10^-71 to 1.06×10^-64 across 54 world populations. With the advance of biomedical research, SNPs linked to physical anthropological, physiological, behavioral and phenotypic traits will eventually be added to forensic panels, which will revolutionize criminal investigation. Copyright © 2017 Elsevier B.V. All rights reserved.
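
    How a panel-level match probability of this magnitude arises can be illustrated with a short sketch. It assumes independent biallelic SNPs in Hardy-Weinberg equilibrium, which the abstract does not state; the function names and the allele frequency of 0.5 are illustrative and not taken from the paper.

```python
import math


def locus_match_probability(p: float) -> float:
    """Chance that two unrelated people share a genotype at one biallelic SNP.

    Assumes Hardy-Weinberg genotype frequencies p^2, 2pq and q^2.
    """
    q = 1.0 - p
    genotype_freqs = (p * p, 2.0 * p * q, q * q)
    return sum(f * f for f in genotype_freqs)


def panel_log10_match_probability(allele_freqs) -> float:
    """log10 of the panel match probability, assuming independent loci."""
    return sum(math.log10(locus_match_probability(p)) for p in allele_freqs)


# Hypothetical best case: 175 SNPs, each with allele frequency 0.5.
best_case = panel_log10_match_probability([0.5] * 175)
print(f"log10(match probability) = {best_case:.1f}")
```

    Under these assumptions a single SNP with allele frequency 0.5 has a match probability of 0.375, so 175 such SNPs give a lower bound of roughly 10^-74.5. The reported averages of 4.77×10^-71 to 1.06×10^-64 lie above that bound, as expected for real allele frequencies that deviate from 0.5.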

  12. Genetic Variation in FABP4 and Evaluation of Its Effects on Beef Cattle Fat Content.

    PubMed

    Goszczynski, Daniel E; Papaleo-Mazzucco, Juliana; Ripoli, María V; Villarreal, Edgardo L; Rogberg-Muñoz, Andrés; Mezzadra, Carlos A; Melucci, Lilia M; Giovambattista, Guillermo

    2017-07-03

    FABP4 is a protein primarily expressed in adipocytes and macrophages that plays a key role in fatty acid trafficking and lipid hydrolysis. FABP4 gene polymorphisms have been associated with meat quality traits in cattle, mostly in Asian breeds under feedlot conditions. The objectives of this work were to characterize FABP4 genetic variation in several worldwide cattle breeds and evaluate possible genotype effects on fat content in a pasture-fed crossbred (Angus-Hereford-Limousin) population. We re-sequenced 43 unrelated animals from nine cattle breeds (Angus, Brahman, Creole, Hereford, Holstein, Limousin, Nelore, Shorthorn, and Wagyu) and obtained 22 single nucleotide polymorphisms (SNPs) over 3,164 bp, including four novel polymorphisms. Haplotypes and linkage disequilibrium analyses showed a high variability. Five SNPs were selected to perform validation and association studies in our crossbred population. Four SNPs showed well-balanced allele frequencies (minor frequency > 0.159), and three showed no significant deviations from Hardy-Weinberg proportions. SNPs showed significant effects on backfat thickness and fatty acid composition (P < 0.05). The protein structure of one of the missense SNPs was analyzed to elucidate its possible effect on fat content in our studied population. Our results revealed a possible blockage of the fatty acid binding site by the missense mutation.
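
    The check for deviations from Hardy-Weinberg proportions can be sketched as a chi-square goodness-of-fit test. This is a minimal illustration, assuming a biallelic SNP and a 1-degree-of-freedom test; the abstract does not say which test the authors used, and the genotype counts below are hypothetical.

```python
def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int) -> float:
    """Chi-square statistic comparing genotype counts with Hardy-Weinberg expectations."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)


CRITICAL_VALUE_DF1 = 3.841  # chi-square cutoff for alpha = 0.05, 1 degree of freedom

# Hypothetical genotype counts, not data from the study.
stat = hwe_chi_square(30, 50, 20)
print(f"chi2 = {stat:.3f}, deviates from HWE: {stat > CRITICAL_VALUE_DF1}")
```

    For small samples an exact test is often preferred over the chi-square approximation.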

  13. Sequence Analysis of APOA5 Among the Kuwaiti Population Identifies Association of rs2072560, rs2266788, and rs662799 With TG and VLDL Levels

    PubMed Central

    Jasim, Anfal A.; Al-Bustan, Suzanne A.; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda

    2018-01-01

    Common variants of Apolipoprotein A5 (APOA5) have been associated with lipid levels, yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. The Sanger method was used for direct sequencing of the full 3.7 kb APOA5 gene, and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3′ UTR 3222C>T polymorphism at genomic position 116660500, based on human genome assembly GRCh37/hg19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested, and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found to be significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism. PMID:29686695

  14. SNPs in stress-responsive rice genes: validation, genotyping, functional relevance and population structure

    PubMed Central

    2012-01-01

    Background Single nucleotide polymorphism (SNP) validation and large-scale genotyping are required to maximize the use of DNA sequence variation and determine the functional relevance of candidate genes for complex stress tolerance traits through genetic association in rice. We used the bead array platform-based Illumina GoldenGate assay to validate and genotype SNPs in a select set of stress-responsive genes to understand their functional relevance and study the population structure in rice. Results Of the 384 putative SNPs assayed, we successfully validated and genotyped 362 (94.3%). Of these 325 (84.6%) showed polymorphism among the 91 rice genotypes examined. Physical distribution, degree of allele sharing, admixtures and introgression, and amino acid replacement of SNPs in 263 abiotic and 62 biotic stress-responsive genes provided clues for identification and targeted mapping of trait-associated genomic regions. We assessed the functional and adaptive significance of validated SNPs in a set of contrasting drought tolerant upland and sensitive lowland rice genotypes by correlating their allelic variation with amino acid sequence alterations in catalytic domains and three-dimensional secondary protein structure encoded by stress-responsive genes. We found a strong genetic association among SNPs in the nine stress-responsive genes with upland and lowland ecological adaptation. Higher nucleotide diversity was observed in indica accessions compared with other rice sub-populations based on different population genetic parameters. The inferred ancestry of 16% among rice genotypes was derived from admixed populations with the maximum between upland aus and wild Oryza species. Conclusions SNPs validated in biotic and abiotic stress-responsive rice genes can be used in association analyses to identify candidate genes and develop functional markers for stress tolerance in rice. PMID:22921105
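
    The percentages in the Results can be reproduced with a few lines of arithmetic. The sketch below uses only the counts given in the abstract; the variable names are illustrative.

```python
assayed = 384       # putative SNPs assayed
validated = 362     # successfully validated and genotyped
polymorphic = 325   # polymorphic among the 91 rice genotypes

validated_pct = f"{validated / assayed:.1%}"
polymorphic_pct = f"{polymorphic / assayed:.1%}"

print(f"validated:   {validated_pct}")
print(f"polymorphic: {polymorphic_pct}")
print(f"abiotic + biotic: {263 + 62}")
```

    Both percentages use the 384 assayed SNPs as the denominator, and the 263 abiotic plus 62 biotic SNPs account for all 325 polymorphic SNPs.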

  15. Sequence Analysis of APOA5 Among the Kuwaiti Population Identifies Association of rs2072560, rs2266788, and rs662799 With TG and VLDL Levels.

    PubMed

    Jasim, Anfal A; Al-Bustan, Suzanne A; Al-Kandari, Wafa; Al-Serri, Ahmad; AlAskar, Huda

    2018-01-01

    Common variants of Apolipoprotein A5 (APOA5) have been associated with lipid levels, yet very few studies have reported full sequence data from various ethnic groups. The purpose of this study was to analyse the full APOA5 gene sequence to identify variants in 100 healthy Kuwaitis of Arab ethnicities and assess their association with variation in lipid levels in a cohort of 733 samples. The Sanger method was used for direct sequencing of the full 3.7 kb APOA5 gene, and multiple sequence alignment was used to identify variants. The complete APOA5 sequence in Kuwaiti Arabs has been deposited in GenBank (KJ401315). A total of 20 reported single nucleotide polymorphisms (SNPs) were identified. Two novel SNPs were also identified: a synonymous 2197G>A polymorphism at genomic position 116661525 and a 3' UTR 3222C>T polymorphism at genomic position 116660500, based on human genome assembly GRCh37/hg19. Five SNPs along with the two novel SNPs were selected for validation in the cohort. Association of those SNPs with lipid levels was tested, and minor alleles of three SNPs (rs2072560, rs2266788, and rs662799) were found to be significantly associated with TG and VLDL levels. This is the first study to report the full APOA5 sequence and SNPs in an Arab ethnic group. Analysis of the variants identified and comparison to other populations suggests a distinctive genetic component in Arabs. The positive association observed for rs2072560 and rs2266788 with TG and VLDL levels confirms their role in lipid metabolism.

  16. Association study between monoamine oxidase A (MAOA) gene polymorphisms and schizophrenia: lack of association with schizophrenia and possible association with affective disturbances of schizophrenia.

    PubMed

    Kim, Su Kang; Park, Hae Jeong; Seok, Hosik; Jeon, Hye Sook; Chung, Joo-Ho; Kang, Won Sub; Kim, Jong Woo; Yu, Gyeong Im; Shin, Dong Hoon

    2014-05-01

    Monoamine oxidase A (MAOA) catalyzes the degradation of monoamine neurotransmitters including dopamine, 5-hydroxytryptamine (5-HT, serotonin), and norepinephrine. MAOA also plays a key role in emotional regulation. The aim of this study was to investigate the associations between the exonic single nucleotide polymorphisms (SNPs) of the MAOA gene located on the X chromosome and schizophrenia. We also analyzed the relationships between these SNPs and the common clinical symptoms of schizophrenia such as persecutory delusion, auditory hallucinations, affective disturbances, and poor concentration. Two hundred seventy-five Korean schizophrenia patients and 289 control subjects were recruited. Three SNPs [rs6323 (Arg294Arg), rs1137070 (Asp470Asp), and rs3027407 (3'-untranslated region)] of the MAOA gene were selected and genotyped by direct sequencing. The common clinical symptoms of schizophrenia according to the Operation Criteria Checklist were analyzed. None of the three examined SNPs was associated with schizophrenia in either males or females (p>0.05). In the analysis of the common clinical symptoms of schizophrenia patients, the three examined SNPs were associated with affective disturbances in male patients, especially restricted affect and blunted affect (restricted affect, p=0.002, OR=2.71, 95% CI 1.45-5.00; blunted affect, p=0.009, OR=2.25, 95% CI 1.22-4.12). The SNPs were not associated with other clinical symptoms of schizophrenia (persecutory delusion, auditory hallucinations, and poor concentration). These results suggest that exonic SNPs (rs6323, rs1137070, and rs3027407) of the MAOA gene may contribute to affective disturbances in Korean male schizophrenia patients, especially restricted affect and blunted affect.
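
    An odds ratio with a 95% confidence interval, the form in which the symptom associations are reported, can be computed from a 2x2 table as sketched below. This uses the Wald interval on the log odds ratio; the abstract does not say how the authors obtained their estimates (a regression model is equally plausible), and the counts are hypothetical.

```python
import math


def odds_ratio_ci(a: int, b: int, c: int, d: int, z: float = 1.96):
    """Odds ratio and Wald confidence interval for a 2x2 table.

    a, b: carriers with / without the symptom
    c, d: non-carriers with / without the symptom
    """
    odds_ratio = (a * d) / (b * c)
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, not data from the study.
or_, lo, hi = odds_ratio_ci(40, 60, 20, 80)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

    An interval that excludes 1, as both reported intervals do, corresponds to a nominally significant association at the 5% level.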

  17. Association of single-nucleotide polymorphisms of the tau gene with late-onset Parkinson disease.

    PubMed

    Martin, E R; Scott, W K; Nance, M A; Watts, R L; Hubble, J P; Koller, W C; Lyons, K; Pahwa, R; Stern, M B; Colcher, A; Hiner, B C; Jankovic, J; Ondo, W G; Allen, F H; Goetz, C G; Small, G W; Masterman, D; Mastaglia, F; Laing, N G; Stajich, J M; Ribble, R C; Booze, M W; Rogala, A; Hauser, M A; Zhang, F; Gibson, R A; Middleton, L T; Roses, A D; Haines, J L; Scott, B L; Pericak-Vance, M A; Vance, J M

    2001-11-14

    The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. We aimed to investigate whether the tau gene is involved in idiopathic PD. Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Associations were assessed with family-based tests, calculated using asymptotic distributions. Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P < .001) was detected between 4 of the 5 SNPs (SNPs 3, 9i, 9ii, and 11). This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD.

  18. The contribution of individual and pairwise combinations of SNPs in the APOA1 and APOC3 genes to interindividual HDL-C variability.

    PubMed

    Brown, C M; Rea, T J; Hamon, S C; Hixson, J E; Boerwinkle, E; Clark, A G; Sing, C F

    2006-07-01

    Apolipoproteins (apo) A-I and C-III are components of high-density lipoprotein-cholesterol (HDL-C), a quantitative trait negatively correlated with risk of cardiovascular disease (CVD). We analyzed the contribution of individual and pairwise combinations of single nucleotide polymorphisms (SNPs) in the APOA1/APOC3 genes to HDL-C variability to evaluate (1) consistency of published single-SNP studies with our single-SNP analyses; (2) consistency of single-SNP and two-SNP phenotype-genotype relationships across race-, gender-, and geographical location-dependent contexts; and (3) the contribution of single SNPs and pairs of SNPs to variability beyond that explained by plasma apo A-I concentration. We analyzed 45 SNPs in 3,831 young African-American (N=1,858) and European-American (N=1,973) females and males ascertained by the Coronary Artery Risk Development in Young Adults (CARDIA) study. We found three SNPs that significantly impact HDL-C variability in both the literature and the CARDIA sample. Single-SNP analyses identified only one of five significant HDL-C SNP genotype relationships in the CARDIA study that was consistent across all race-, gender-, and geographical location-dependent contexts. The other four were consistent across geographical locations for a particular race-gender context. The portion of total phenotypic variance explained by single-SNP genotypes and genotypes defined by pairs of SNPs was less than 3%, an amount that is minuscule compared to the contribution explained by variability in plasma apo A-I concentration. Our findings illustrate the impact of context-dependence on SNP selection for prediction of CVD risk factor variability.

  19. Signatures of natural selection between life cycle stages separated by metamorphosis in European eel.

    PubMed

    Pujolar, J M; Jacobsen, M W; Bekkevold, D; Lobón-Cervià, J; Jónsson, B; Bernatchez, L; Hansen, M M

    2015-08-13

    Species showing complex life cycles provide excellent opportunities to study the genetic associations between life cycle stages, as selective pressures may differ before and after metamorphosis. The European eel presents a complex life cycle with two metamorphoses, a first metamorphosis from larvae into glass eels (juvenile stage) and a second metamorphosis into silver eels (adult stage). We tested the hypothesis that different genes and gene pathways will be under selection at different life stages when comparing the genetic associations between glass eels and silver eels. We used two sets of markers to test for selection: first, we genotyped individuals using a panel of 80 coding-gene single nucleotide polymorphisms (SNPs) developed in American eel; second, we investigated selection at the genome level using a total of 153,423 RAD-sequencing generated SNPs widely distributed across the genome. Using the RAD approach, outlier tests identified a total of 2413 (1.57%) potentially selected SNPs. Functional annotation analysis identified signal transduction pathways as the most over-represented group of genes, including MAPK/Erk signalling, calcium signalling and GnRH (gonadotropin-releasing hormone) signalling. Many of the over-represented pathways were related to growth, while others could result from the different conditions that eels inhabit during their life cycle. The observation of different genes and gene pathways under selection when comparing glass eels vs. silver eels supports the adaptive decoupling hypothesis for the benefits of metamorphosis. Partitioning the life cycle into discrete morphological phases may be overall beneficial since it allows the different life stages to respond independently to their unique selection pressures. This might translate into a more effective use of food and niche resources and/or performance of phase-specific tasks (e.g. feeding in the case of glass eels, migrating and reproducing in the case of silver eels).

  20. DNA sequence variation and selection of tag single-nucleotide polymorphisms at candidate genes for drought-stress response in Pinus taeda L.

    PubMed

    González-Martínez, Santiago C; Ersoz, Elhan; Brown, Garth R; Wheeler, Nicholas C; Neale, David B

    2006-03-01

    Genetic association studies are rapidly becoming the experimental approach of choice to dissect complex traits, including tolerance to drought stress, which is the most common cause of mortality and yield losses in forest trees. Optimization of association mapping requires knowledge of the patterns of nucleotide diversity and linkage disequilibrium and the selection of suitable polymorphisms for genotyping. Moreover, standard neutrality tests applied to DNA sequence variation data can be used to select candidate genes or amino acid sites that are putatively under selection for association mapping. In this article, we study the pattern of polymorphism of 18 candidate genes for drought-stress response in Pinus taeda L., an important tree crop. Data analyses based on a set of 21 putatively neutral nuclear microsatellites did not show population genetic structure or genomewide departures from neutrality. Candidate genes had moderate average nucleotide diversity at silent sites (pi(sil) = 0.00853), varying 100-fold among single genes. The level of within-gene LD was low, with an average pairwise r2 of 0.30, decaying rapidly from approximately 0.50 to approximately 0.20 at 800 bp. No apparent LD among genes was found. A selective sweep may have occurred at the early-response-to-drought-3 (erd3) gene, although population expansion can also explain our results and evidence for selection was not conclusive. One other gene, ccoaomt-1, a methylating enzyme involved in lignification, showed dimorphism (i.e., two highly divergent haplotype lineages at equal frequency), which is commonly associated with the long-term action of balancing selection. Finally, a set of haplotype-tagging SNPs (htSNPs) was selected. Using htSNPs, a reduction of genotyping effort of approximately 30-40%, while sampling most common allelic variants, can be gained in our ongoing association studies for drought tolerance in pine.
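
    The pairwise r² statistic used to summarize within-gene linkage disequilibrium can be computed from phased haplotypes as in the sketch below. It assumes biallelic sites coded 0/1 and known phase; the haplotypes are hypothetical and the function name is an assumption, not the authors' software.

```python
def ld_r_squared(haplotypes) -> float:
    """Pairwise r^2 between two biallelic sites.

    haplotypes: iterable of (allele_at_site_1, allele_at_site_2), alleles coded 0/1.
    """
    haps = list(haplotypes)
    n = len(haps)
    p_a = sum(a for a, _ in haps) / n                        # allele 1 frequency, site 1
    p_b = sum(b for _, b in haps) / n                        # allele 1 frequency, site 2
    p_ab = sum(1 for a, b in haps if a == 1 and b == 1) / n  # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                                     # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both sites must be polymorphic")
    return d * d / denom


# Hypothetical sample of ten haplotypes.
sample = [(1, 1)] * 4 + [(0, 0)] * 4 + [(1, 0), (0, 1)]
print(f"r^2 = {ld_r_squared(sample):.2f}")
```

    Selecting haplotype-tagging SNPs relies on this kind of redundancy: when two sites have high r², genotyping one of them captures most of the information in the other.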

  1. An SNP resource for rice genetics and breeding based on subspecies indica and japonica genome alignments.

    PubMed

    Feltus, F Alex; Wan, Jun; Schulze, Stefan R; Estill, James C; Jiang, Ning; Paterson, Andrew H

    2004-09-01

    Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% +/- 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% +/- 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp.

  2. An SNP Resource for Rice Genetics and Breeding Based on Subspecies Indica and Japonica Genome Alignments

    PubMed Central

    Feltus, F. Alex; Wan, Jun; Schulze, Stefan R.; Estill, James C.; Jiang, Ning; Paterson, Andrew H.

    2004-01-01

    Dense coverage of the rice genome with polymorphic DNA markers is an invaluable tool for DNA marker-assisted breeding, positional cloning, and a wide range of evolutionary studies. We have aligned drafts of two rice subspecies, indica and japonica, and analyzed levels and patterns of genetic diversity. After filtering multiple copy and low quality sequence, 408,898 candidate DNA polymorphisms (SNPs/INDELs) were discerned between the two subspecies. These filters have the consequence that our data set includes only a subset of the available SNPs (in particular excluding large numbers of SNPs that may occur between repetitive DNA alleles) but increase the likelihood that this subset is useful: Direct sequencing suggests that 79.8% ± 7.5% of the in silico SNPs are real. The SNP sample in our database is not randomly distributed across the genome. In fact, 566 rice genomic regions had unusually high (328 contigs/48.6 Mb/13.6% of genome) or low (237 contigs/64.7 Mb/18.1% of genome) polymorphism rates. Many SNP-poor regions were substantially longer than most SNP-rich regions, covering up to 4 Mb, and possibly reflecting introgression between the respective gene pools that may have occurred hundreds of years ago. Although 46.2% ± 8.3% of the SNPs differentiate other pairs of japonica and indica genotypes, SNP rates in rice were not predictive of evolutionary rates for corresponding genes in another grass species, sorghum. The data set is freely available at http://www.plantgenome.uga.edu/snp. PMID:15342564

  3. Application of Multi-SNP Approaches Bayesian LASSO and AUC-RF to Detect Main Effects of Inflammatory-Gene Variants Associated with Bladder Cancer Risk

    PubMed Central

    Calle, M. Luz; Rothman, Nathaniel; Urrea, Víctor; Kogevinas, Manolis; Petrus, Sandra; Chanock, Stephen J.; Tardón, Adonina; García-Closas, Montserrat; González-Neira, Anna; Vellalta, Gemma; Carrato, Alfredo; Navarro, Arcadi; Lorente-Galdós, Belén; Silverman, Debra T.; Real, Francisco X.; Wu, Xifeng; Malats, Núria

    2013-01-01

    The relationship between inflammation and cancer is well established in several tumor types, including bladder cancer. We performed an association study between 886 inflammatory-gene variants and bladder cancer risk in 1,047 cases and 988 controls from the Spanish Bladder Cancer (SBC)/EPICURO Study. A preliminary exploration with the widely used univariate logistic regression approach did not identify any significant SNP after correcting for multiple testing. We further applied two more comprehensive methods to capture the complexity of bladder cancer genetic susceptibility: Bayesian Threshold LASSO (BTL), a regularized regression method, and AUC-Random Forest (AUC-RF), a machine-learning algorithm. Both approaches explore the joint effect of markers. BTL analysis identified a signature of 37 SNPs in 34 genes showing an association with bladder cancer. AUC-RF detected an optimal predictive subset of 56 SNPs. Thirteen SNPs were identified by both methods in the total population. Using resources from the Texas Bladder Cancer study we were able to replicate 30% of the SNPs assessed. The associations between inflammatory SNPs and bladder cancer were reexamined among non-smokers to eliminate the effect of tobacco, one of the strongest and most prevalent environmental risk factors for this tumor. A 9-SNP signature was detected by BTL. Here we report, for the first time, a set of SNPs in inflammatory genes jointly associated with bladder cancer risk. These results highlight the importance of the complex structure of genetic susceptibility associated with cancer risk. PMID:24391818

  4. Bacopa monniera Stabilized Silver Nanoparticles Attenuates Oxidative Stress Induced by Aluminum in Albino Mice.

    PubMed

    Mahitha, B; Deva Prasad Raju, B; Mallikarjuna, K; Durga Mahalakshmi, Ch N; Sushmal, N John

    2015-02-01

    In recent years, nanomedicine has emerged as a promising strategy for improving medical treatment. Ecofriendly synthesized silver nanoparticles have introduced a new opportunity to increase the efficacy of a drug while reducing its side effects. In the present study, we investigated the antioxidant property of Bacopa monniera stabilized silver nanoparticles (BmSNPs) against aluminum induced toxicity in albino mice. Forty male albino mice were randomly divided into five groups. The first group was treated as control, the second group received aluminum acetate (5 mg/kg b.w.), the third group received Bacopa monniera extract (5 mg/kg b.w.), the fourth group received BmSNPs (5 mg/kg b.w.), and the fifth group received aluminum acetate plus BmSNPs. Exposure to aluminum acetate significantly increased lipid peroxidation levels, with a significant decrease in the activities of antioxidant enzymes such as superoxide dismutase (SOD), catalase (CAT) and glutathione peroxidase (GPx) in the brain, liver and kidney of mice. Degenerative changes were also observed in the brain, liver and kidney of aluminum treated mice. No significant changes in oxidative stress were observed in mice treated with Bacopa monniera or BmSNPs alone. In contrast, co-administration of BmSNPs to Al treated mice produced a significant decrease in lipid peroxidation levels and a significant increase in SOD, CAT and GPx, indicating the antioxidant potential of the nanoparticles in counteracting Al induced oxidative stress and histological response in male albino mice. These findings clearly indicate that BmSNPs are able to eradicate the oxidative stress and prevent the tissue damage in aluminum exposed mice.

  5. Is High-Density Lipoprotein Cholesterol Causally Related to Kidney Function? Evidence From Genetic Epidemiological Studies.

    PubMed

    Coassin, Stefan; Friedel, Salome; Köttgen, Anna; Lamina, Claudia; Kronenberg, Florian

    2016-11-01

    A recent observational study with almost 2 million men reported an association between low high-density lipoprotein (HDL) cholesterol and worse kidney function. The causality of this association would be strongly supported if genetic variants associated with HDL cholesterol were also associated with kidney function. We used 68 genetic variants (single-nucleotide polymorphisms [SNPs]) associated with HDL cholesterol in genome-wide association studies including >188 000 subjects and tested their association with estimated glomerular filtration rate (eGFR) using summary statistics from another genome-wide association study meta-analysis of kidney function including ≤133 413 subjects. Fourteen of the 68 SNPs (21%) had a P value <0.05 compared with the 5% expected by chance (binomial test P=5.8×10^-6). After Bonferroni correction, 6 SNPs were still significantly associated with eGFR. The genetic variants with the strongest associations with HDL cholesterol concentrations were not the same as those with the strongest association with kidney function and vice versa. An evaluation of pleiotropy indicated that the effects of the HDL-associated SNPs on eGFR were not mediated by HDL cholesterol. In addition, we performed a Mendelian randomization analysis. This analysis revealed a positive but nonsignificant causal effect of HDL cholesterol-increasing variants on eGFR. In summary, our findings indicate that HDL cholesterol does not causally influence eGFR and suggest pleiotropic effects on eGFR for some HDL cholesterol-associated SNPs. This may cause the observed association by mechanisms other than the mere HDL cholesterol concentration. © 2016 The Authors.
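
    The enrichment test and the Bonferroni threshold described above can be sketched in a few lines. The sketch assumes a one-sided exact binomial test and independent SNPs; the abstract does not state whether the reported test was one- or two-sided, and the function name is an assumption.

```python
from math import comb


def binomial_upper_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_snps = 68      # HDL-associated SNPs tested against eGFR
n_nominal = 14   # SNPs with P < 0.05
alpha = 0.05

p_enrichment = binomial_upper_tail(n_nominal, n_snps, alpha)
bonferroni_threshold = alpha / n_snps

print(f"expected by chance: {n_snps * alpha:.1f} SNPs")
print(f"P(X >= {n_nominal}) = {p_enrichment:.1e}")
print(f"Bonferroni threshold = {bonferroni_threshold:.2e}")
```

    Under these assumptions the upper-tail probability is on the order of 10^-6, in line with the reported binomial P value, and a SNP must reach a P value below roughly 7.4×10^-4 to survive Bonferroni correction for 68 tests.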

  6. Fine-mapping additive and dominant SNP effects using group-LASSO and Fractional Resample Model Averaging

    PubMed Central

    Sabourin, Jeremy; Nobel, Andrew B.; Valdar, William

    2014-01-01

    Genomewide association studies sometimes identify loci at which both the number and identities of the underlying causal variants are ambiguous. In such cases, statistical methods that model effects of multiple SNPs simultaneously can help disentangle the observed patterns of association and provide information about how those SNPs could be prioritized for follow-up studies. Current multi-SNP methods, however, tend to assume that SNP effects are well captured by additive genetics; yet when genetic dominance is present, this assumption translates to reduced power and faulty prioritizations. We describe a statistical procedure for prioritizing SNPs at GWAS loci that efficiently models both additive and dominance effects. Our method, LLARRMA-dawg, combines a group LASSO procedure for sparse modeling of multiple SNP effects with a resampling procedure based on fractional observation weights; it estimates for each SNP the robustness of association with the phenotype both to sampling variation and to competing explanations from other SNPs. In producing a SNP prioritization that best identifies underlying true signals, we show that: our method easily outperforms a single marker analysis; when additive-only signals are present, our joint model for additive and dominance is equivalent to or only slightly less powerful than modeling additive-only effects; and, when dominance signals are present, even in combination with substantial additive effects, our joint model is unequivocally more powerful than a model assuming additivity. We also describe how performance can be improved through calibrated randomized penalization, and discuss how dominance in ungenotyped SNPs can be incorporated through either heterozygote dosage or multiple imputation. PMID:25417853
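
    Modeling additive and dominance effects jointly requires two predictors per SNP. The sketch below shows one common coding, assumed here for illustration: allele dosage for the additive term and a heterozygote indicator for the dominance term. The abstract does not specify the parameterization used by LLARRMA-dawg, and the function name is hypothetical.

```python
def encode_additive_dominance(minor_allele_counts):
    """Map genotypes (0, 1 or 2 minor alleles) to (additive, dominance) pairs."""
    encoded = []
    for g in minor_allele_counts:
        if g not in (0, 1, 2):
            raise ValueError(f"unexpected genotype code: {g!r}")
        additive = g                    # allele dosage
        dominance = 1 if g == 1 else 0  # heterozygote indicator
        encoded.append((additive, dominance))
    return encoded


print(encode_additive_dominance([0, 1, 2, 1]))
# prints [(0, 0), (1, 1), (2, 0), (1, 1)]
```

    In a group LASSO, the two columns belonging to one SNP form a group that is penalized together, so the SNP enters or leaves the model as a unit rather than one effect at a time.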

  7. Genome-Wide Association Study of Serum 25-Hydroxyvitamin D in US Women.

    PubMed

    O'Brien, Katie M; Sandler, Dale P; Shi, Min; Harmon, Quaker E; Taylor, Jack A; Weinberg, Clarice R

    2018-01-01

    Genetic factors likely influence individuals' concentrations of 25-hydroxyvitamin D [25(OH)D], a biomarker of vitamin D exposure previously linked to reduced risk of several chronic diseases. We conducted a genome-wide association study of serum 25(OH)D (assessed using liquid chromatography-tandem mass spectrometry) and 386,449 single nucleotide polymorphisms (SNPs). Our sample consisted of 1,829 participants randomly selected from the Sister Study, a cohort of women who had a sister with breast cancer but had never had breast cancer themselves. 19,741 SNPs were associated with 25(OH)D (p < 0.05). We re-assessed these hits in an independent sample of 1,534 participants who later developed breast cancer. After pooling, 32 SNPs had genome-wide significant associations (p < 5 × 10^-8). These were located in or near GC, the vitamin D binding protein, or CYP2R1, a cytochrome P450 enzyme that hydroxylates vitamin D to form 25(OH)D. The top hit was rs4588, a missense GC polymorphism associated with a 3.5 ng/mL decrease in 25(OH)D per copy of the minor allele (95% confidence interval [CI]: -4.1, -3.0; p = 4.5 × 10^-38). The strongest SNP near CYP2R1 was rs12794714, a synonymous variant (p = 3.8 × 10^-12; β = 1.8 ng/mL decrease in 25(OH)D per minor allele [CI: -2.2, -1.3]). Serum 25(OH)D concentrations from samples collected from some participants 3-10 years after baseline (811 cases, 780 non-cases) were also strongly associated with both loci. These findings augment our understanding of genetic influences on 25(OH)D and the possible role of vitamin D binding proteins and cytochrome P450 enzymes in determining measured levels. These results may help to identify individuals genetically predisposed to vitamin D insufficiency.
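
    The discovery-stage count is easier to interpret next to the number of hits expected by chance. The sketch below assumes independent tests with uniformly distributed null p-values, a simplification that ignores linkage disequilibrium between SNPs; the variable names are illustrative.

```python
n_snps = 386_449        # SNPs tested
alpha_discovery = 0.05  # nominal threshold in the discovery sample
observed_hits = 19_741  # SNPs with p < 0.05

expected_by_chance = n_snps * alpha_discovery
excess = observed_hits - expected_by_chance

print(f"expected at p < 0.05 under the null: {expected_by_chance:,.0f}")
print(f"observed: {observed_hits:,}")
print(f"excess over chance: {excess:,.0f}")
```

    Under these assumptions about 19,300 nominal hits are expected from chance alone, so nearly all of the 19,741 discovery-stage SNPs are likely false positives. This is why re-assessment in an independent sample and the genome-wide threshold of 5 × 10^-8 are needed.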

  8. Nucleotide-binding oligomerization domain containing 1 (NOD1) haplotypes and single nucleotide polymorphisms modify susceptibility to inflammatory bowel diseases in a New Zealand caucasian population: a case-control study

    PubMed Central

    Huebner, Claudia; Ferguson, Lynnette R; Han, Dug Yeo; Philpott, Martin; Barclay, Murray L; Gearry, Richard B; McCulloch, Alan; Demmers, Pieter S; Browning, Brian L

    2009-01-01

    Background The nucleotide-binding oligomerization domain containing 1 (NOD1) gene encodes a pattern recognition receptor that senses pathogens, leading to downstream responses characteristic of innate immunity. We investigated the role of NOD1 single nucleotide polymorphisms (SNPs) on IBD risk in a New Zealand Caucasian population, and studied Nod1 expression in response to bacterial invasion in the Caco2 cell line. Findings DNA samples from 388 Crohn's disease (CD), 405 ulcerative colitis (UC), 27 indeterminate colitis patients and 201 randomly selected controls, from Canterbury, New Zealand were screened for 3 common SNPs in NOD1, using the MassARRAY® iPLEX Gold assay. Transcriptional activation of the protein produced by NOD1 (Nod1) was studied after infection of Caco2 cells with Escherichia coli LF82. Carrying the rs2075818 G allele decreased the risk of CD (OR = 0.66, 95% CI = 0.50–0.88, p < 0.002) but not UC. There was an increased frequency of the three SNP (rs2075818, rs2075822, rs2907748) haplotype, CTG (p = 0.004) and a decreased frequency of the GTG haplotype (p = 0.02) in CD. The rs2075822 CT or TT genotypes were at an increased frequency (genotype p value = 0.02), while the rs2907748 AA or AG genotypes showed decreased frequencies in UC (p = 0.04), but not in CD. Functional assays showed that Nod1 is produced 6 hours after bacterial invasion of the Caco2 cell line. Conclusion The NOD1 gene is important in signalling invasion of colonic cells by pathogenic bacteria, indicative of its key role in innate immunity. Carrying specific SNPs in this gene significantly modifies the risk of CD and/or UC in a New Zealand Caucasian population. PMID:19327158

  9. Analyses of single nucleotide polymorphisms in selected nutrient-sensitive genes in weight-regain prevention: the DIOGENES study.

    PubMed

    Larsen, Lesli H; Angquist, Lars; Vimaleswaran, Karani S; Hager, Jörg; Viguerie, Nathalie; Loos, Ruth J F; Handjieva-Darlenska, Teodora; Jebb, Susan A; Kunesova, Marie; Larsen, Thomas M; Martinez, J Alfredo; Papadaki, Angeliki; Pfeiffer, Andreas F H; van Baak, Marleen A; Sørensen, Thorkild Ia; Holst, Claus; Langin, Dominique; Astrup, Arne; Saris, Wim H M

    2012-05-01

    Differences in the interindividual response to dietary intervention could be modified by genetic variation in nutrient-sensitive genes. This study examined single nucleotide polymorphisms (SNPs) in presumed nutrient-sensitive candidate genes for obesity and obesity-related diseases for main and dietary interaction effects on weight, waist circumference, and fat mass regain over 6 mo. In total, 742 participants who had lost ≥ 8% of their initial body weight were randomly assigned to follow 1 of 5 different ad libitum diets with different glycemic indexes and contents of dietary protein. The SNP main and SNP-diet interaction effects were analyzed by using linear regression models, corrected for multiple testing by using Bonferroni correction and evaluated by using quantile-quantile (Q-Q) plots. After correction for multiple testing, none of the SNPs were significantly associated with weight, waist circumference, or fat mass regain. Q-Q plots showed that ALOX5AP rs4769873 showed a higher observed than predicted P value for the association with less waist circumference regain over 6 mo (-3.1 cm/allele; 95% CI: -4.6, -1.6; P/Bonferroni-corrected P = 0.000039/0.076), independently of diet. Additional associations were identified by using Q-Q plots for SNPs in ALOX5AP, TNF, and KCNJ11 for main effects; in LPL and TUB for glycemic index interaction effects on waist circumference regain; in GHRL, CCK, MLXIPL, and LEPR on weight; in PPARC1A, PCK2, ALOX5AP, PYY, and ADRB3 on waist circumference; and in PPARD, FABP1, PLAUR, and LPIN1 on fat mass regain for dietary protein interaction. The observed effects of SNP-diet interactions on weight, waist, and fat mass regain suggest that genetic variation in nutrient-sensitive genes can modify the response to diet. This trial was registered at clinicaltrials.gov as NCT00390637.
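
The Bonferroni correction mentioned in this abstract multiplies each raw P value by the number of tests (capped at 1). A minimal sketch follows, using the only pair of values the abstract reports (P = 0.000039, corrected P = 0.076). The number of tests is not stated in the abstract; the value computed below is a back-calculation from those two rounded figures and should be read as an approximation, not a reported quantity. The function name `bonferroni_adjust` is hypothetical.

```python
def bonferroni_adjust(p_value, n_tests):
    """Bonferroni-adjusted P value: raw P times number of tests, capped at 1."""
    return min(1.0, p_value * n_tests)


# Values reported in the abstract for ALOX5AP rs4769873.
RAW_P = 0.000039
CORRECTED_P = 0.076

# Back-calculated (approximate) number of tests implied by the two figures.
implied_tests = CORRECTED_P / RAW_P

if __name__ == "__main__":
    print(f"implied number of tests: about {implied_tests:.0f}")
    print(f"check: {bonferroni_adjust(RAW_P, round(implied_tests)):.3f}")
```

This shows why a raw P value that looks very small can still miss the 0.05 threshold once the total number of SNP-by-outcome tests is taken into account.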

  10. Adaptive Genetic Divergence Despite Significant Isolation-by-Distance in Populations of Taiwan Cow-Tail Fir (Keteleeria davidiana var. formosana)

    PubMed Central

    Shih, Kai-Ming; Chang, Chung-Te; Chung, Jeng-Der; Chiang, Yu-Chung; Hwang, Shih-Ying

    2018-01-01

    Double digest restriction site-associated DNA sequencing (ddRADseq) is a tool for delivering genome-wide single nucleotide polymorphism (SNP) markers for non-model organisms useful in resolving fine-scale population structure and detecting signatures of selection. This study performs population genetic analysis, based on ddRADseq data, of a coniferous species, Keteleeria davidiana var. formosana, disjunctly distributed in northern and southern Taiwan, for investigation of population adaptive divergence in response to environmental heterogeneity. A total of 13,914 SNPs were detected and used to assess genetic diversity, FST outlier detection, population genetic structure, and individual assignments of five populations (62 individuals) of K. davidiana var. formosana. Principal component analysis (PCA), individual assignments, and the neighbor-joining tree were successful in differentiating individuals between northern and southern populations of K. davidiana var. formosana, but apparent gene flow between the southern DW30 population and northern populations was also revealed. Fifteen of 23 highly differentiated SNPs identified were found to be strongly associated with environmental variables, suggesting isolation-by-environment (IBE). However, multiple matrix regression with randomization analysis revealed strong IBE as well as significant isolation-by-distance. Environmental impacts on divergence were found between populations of the North and South regions and also between the two southern neighboring populations. BLASTN annotation of the sequences flanking outlier SNPs gave significant hits for three of 23 markers that might have biological relevance to mitochondrial homeostasis involved in the survival of locally adapted lineages. Species delimitation between K. davidiana var. formosana and its ancestor, K. davidiana, was also examined (72 individuals). 
This study has produced highly informative population genomic data for the understanding of population attributes, such as diversity, connectivity, and adaptive divergence associated with large- and small-scale environmental heterogeneity in K. davidiana var. formosana. PMID:29449860

  11. Single nucleotide polymorphisms for feed efficiency and performance in crossbred beef cattle

    PubMed Central

    2014-01-01

    Background This study was conducted to: (1) identify new SNPs for residual feed intake (RFI) and performance traits within candidate genes identified in a genome wide association study (GWAS); (2) estimate the proportion of variation in RFI explained by the detected SNPs; (3) estimate the effects of detected SNPs on carcass traits to avoid undesirable correlated effects on these economically important traits when selecting for feed efficiency; and (4) map the genes to biological mechanisms and pathways. A total number of 339 SNPs corresponding to 180 genes were tested for association with phenotypes using a single locus regression (SLRM) and genotypic model on 726 and 990 crossbred animals for feed efficiency and carcass traits, respectively. Results Strong evidence of associations for RFI were located on chromosomes 8, 15, 16, 18, 19, 21, and 28. The strongest association with RFI (P = 0.0017) was found with a newly discovered SNP located on BTA 8 within the ELP3 gene. SNPs rs41820824 and rs41821600 on BTA 16 within the gene HMCN1 were strongly associated with RFI (P = 0.0064 and P = 0.0033, respectively). A SNP located on BTA 18 within the ZNF423 gene provided strong evidence for association with RFI (P = 0.0028). Genomic estimated breeding values (GEBV) from 98 significant SNPs were moderately correlated (0.47) to the estimated breeding values (EBVs) from a mixed animal model. The significant (P < 0.05) SNPs (98) explained 26% of the genetic variance for RFI. In silico functional analysis for the genes suggested 35 and 39 biological processes and pathways, respectively for feed efficiency traits. Conclusions This study identified several positional and functional candidate genes involved in important biological mechanisms associated with feed efficiency and performance. Significant SNPs should be validated in other populations to establish their potential utilization in genetic improvement programs. PMID:24476087

  12. Tomato breeding in the genomics era: insights from a SNP array.

    PubMed

    Víquez-Zamora, Marcela; Vosman, Ben; van de Geest, Henri; Bovy, Arnaud; Visser, Richard G F; Finkers, Richard; van Heusden, Adriaan W

    2013-05-27

    The major bottleneck in genetic and linkage studies in tomato has been the lack of a sufficient number of molecular markers. This has radically changed with the application of next generation sequencing and high throughput genotyping. A set of 6000 SNPs was identified and 5528 of them were used to evaluate tomato germplasm at the level of species, varieties and segregating populations. From the 5528 SNPs, 1980 originated from 454-sequencing, 3495 from Illumina Solexa sequencing and 53 were additional known markers. Genotyping different tomato samples allowed the evaluation of the level of heterozygosity and introgressions among commercial varieties. Cherry tomatoes were especially different from round/beefs in chromosomes 4, 5 and 12. We were able to identify a set of 750 unique markers distinguishing S. lycopersicum 'Moneymaker' from all its distantly related wild relatives. Clustering and neighbour joining analysis among varieties and species showed expected grouping patterns, with S. pimpinellifolium as the most closely related to commercial tomatoes, in line with earlier results. Our results show that a SNP search in only a few breeding lines already provides generally applicable markers in tomato and its wild relatives. It also shows that the Illumina bead array generated data are highly reproducible. Our SNPs can roughly be divided into two categories: SNPs of which both forms are present in the wild relatives and in domesticated tomatoes (originating from common ancestors) and SNPs unique for the domesticated tomato (originating from after the domestication event). The SNPs can be used for genotyping, identification of varieties, comparison of genetic and physical linkage maps and to confirm (phylogenetic) relations. In the SNPs used for the array there is hardly any overlap with the SolCAP array and it is strongly recommended to combine both SNP sets and to select a core collection of robust SNPs completely covering the entire tomato genome.

  13. Candidate gene association analysis for milk yield, composition, urea nitrogen and somatic cell scores in Brown Swiss cows.

    PubMed

    Cecchinato, A; Ribeca, C; Chessa, S; Cipolat-Gotet, C; Maretto, F; Casellas, J; Bittante, G

    2014-07-01

    The aim of this study was to investigate 96 single-nucleotide polymorphisms (SNPs) from 54 candidate genes, and test the associations of the polymorphic SNPs with milk yield, composition, milk urea nitrogen (MUN) content and somatic cell score (SCS) in individual milk samples from Italian Brown Swiss cows. Milk and blood samples were collected from 1271 cows sampled once from 85 herds. Milk production, quality traits (i.e. protein, casein, fat and lactose percentages), MUN and SCS were measured for each milk sample. Genotyping was performed using a custom Illumina VeraCode GoldenGate approach. A Bayesian linear animal model that considered the effects of herd, days in milk, parity, SNP genotype and additive polygenic effect was used for the association analysis. Our results showed that 14 of the 51 polymorphic SNPs had relevant additive effects on at least one of the aforementioned traits. Polymorphisms in the glucocorticoid receptor DNA-binding factor 1 (GRLF1), prolactin receptor (PRLR) and chemokine ligand 2 (CCL2) were associated with milk yield; an SNP in the stearoyl-CoA desaturase (SCD-1) was related to fat content; SNPs in the caspase recruitment domain 15 protein (CARD15) and lipin 1 (LPIN1) affected the protein and casein contents; SNPs in growth hormone 1 (GH1), lactotransferrin (LTF) and SCD-1 were relevant for casein number; variants in beta casein (CSN2), GH1, GRLF1 and LTF affected lactose content; SNPs in beta-2 adrenergic receptor (ADRB2), serpin peptidase inhibitor (PI) and SCD-1 were associated with MUN; and SNPs in acetyl-CoA carboxylase alpha (ACACA) and signal transducer and activator of transcription 5A (STAT5A) were relevant in explaining the variation of SCS. Although further research is needed to validate these SNPs in other populations and breeds, the association between these markers and milk yield, composition, MUN and SCS could be exploited in gene-assisted selection programs for genetic improvement purposes.

  14. Transferability of genome-wide associated loci for asthma in African Americans

    PubMed Central

    Faruque, Mezbah U.; Chen, Guanjie; Doumatey, Ayo P.; Zhou, Jie; Huang, Hanxia; Shriner, Daniel; Adeyemo, Adebowale A.; Rotimi, Charles N.; Dunston, Georgia M.

    2017-01-01

    Objective Transferability of significantly associated loci or GWAS “hits” adds credibility to genotype-disease associations and provides evidence for generalizability across different ancestral populations. We sought evidence of association of known asthma-associated single nucleotide polymorphisms (SNPs) in an African American population. Methods Subjects comprised 661 participants (261 asthma cases and 400 controls) from the Howard University Family Study. Forty-eight SNPs previously reported to be associated with asthma by GWAS were selected for testing. We adopted a combined strategy, first using an “exact” approach in which we looked up only the reported index SNP. For those index SNPs missing from our dataset, we used a “local” approach that examined all the regional SNPs in LD with the index SNP. Results Out of the 48 SNPs, our cohort had genotype data available for 27, which were examined for exact replication. Of these, two SNPs were found positively associated with asthma. These included: rs10508372 (OR = 1.567 [95%CI, 1.133–2.167], P = 0.0066) and rs2378383 (OR = 2.147 [95%CI, 1.149–4.013], P = 0.0166), located on chromosomal bands 10p14 and 9q21.31, respectively. Local replication of the remaining 21 loci showed association at two chromosomal loci (9p24.1-rs2381413 and 6p21.32-rs3132947; Bonferroni-corrected P values: 0.0033 and 0.0197, respectively). Of note, multiple SNPs in LD with rs2381413 located upstream of IL33 were significantly associated with asthma. Conclusions This study has successfully transferred four reported asthma-associated loci in an independent African American population. Identification of several asthma-associated SNPs upstream of IL33, a gene previously implicated in allergic inflammation of asthmatic airway, supports the generalizability of this finding. PMID:27177148
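
The odds ratios, confidence intervals, and P values reported in this abstract can be cross-checked against each other. The sketch below assumes the intervals are Wald intervals that are symmetric on the log-odds scale (an assumption; the abstract does not name the test used), recovers the standard error of log(OR) from the interval width, and derives a two-sided P value from the resulting z statistic. The function name `wald_p_from_or_ci` is hypothetical.

```python
import math


def wald_p_from_or_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover SE(log OR), z, and a two-sided P value from an OR and its 95% CI.

    Assumes a Wald interval: log(OR) +/- z_crit * SE.
    """
    log_or = math.log(odds_ratio)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = log_or / se
    # Two-sided normal tail probability: P(|Z| > |z|) = erfc(|z| / sqrt(2)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p_value


if __name__ == "__main__":
    # Figures as reported in the abstract.
    for snp, or_, lo, hi in [
        ("rs10508372", 1.567, 1.133, 2.167),
        ("rs2378383", 2.147, 1.149, 4.013),
    ]:
        se, z, p = wald_p_from_or_ci(or_, lo, hi)
        print(f"{snp}: SE(log OR)={se:.3f}, z={z:.2f}, P={p:.4f}")
```

Under the Wald assumption, the recovered P values should land close to the reported 0.0066 and 0.0166, which is a quick consistency check when reading summary statistics of this kind.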

  15. Transcriptomic SNP discovery for custom genotyping arrays: impacts of sequence data, SNP calling method and genotyping technology on the probability of validation success.

    PubMed

    Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I

    2016-08-26

    Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. 
They also highlight possible differences between alternative genotyping technologies that could be explored in future studies of non-model organisms.

  16. A genome-wide association study of seed composition traits in wild soybean (Glycine soja).

    PubMed

    Leamy, Larry J; Zhang, Hengyou; Li, Changbao; Chen, Charles Y; Song, Bao-Hua

    2017-01-05

    Cultivated soybean (Glycine max) is a major agricultural crop that provides a crucial source of edible protein and oil. Decreased amounts of saturated palmitic acid and increased amounts of unsaturated oleic acid in soybean oil are considered optimal for human cardiovascular health and therefore there has been considerable interest by breeders in discovering genes affecting the relative concentrations of these fatty acids. Using a genome-wide association (GWA) approach with nearly 30,000 single nucleotide polymorphisms (SNPs), we investigated the genetic basis of protein, oil and all five fatty acid levels in seeds from a sample of 570 wild soybeans (Glycine soja), the progenitor of domesticated soybean, to identify quantitative trait loci (QTLs) affecting these seed composition traits. We discovered 29 SNPs located on ten different chromosomes that are significantly associated with the seven seed composition traits in our wild soybean sample. Eight SNPs co-localized with QTLs previously uncovered in linkage or association mapping studies conducted with cultivated soybean samples, while the remaining SNPs appeared to be in novel locations. Twenty-four of the SNPs were significantly associated with fatty acid variation, with the majority located on chromosomes 14 (6 SNPs) and 7 (8 SNPs). Two SNPs were common for two or more fatty acids, suggesting loci with pleiotropic effects. We also identified some candidate genes that are involved in fatty acid metabolism and regulation. For each of the seven traits, most of the SNPs produced differences between the average phenotypic values of the two homozygotes of about one-half standard deviation and contributed over 3% of their total variability. This is the first GWA study conducted on seed composition traits solely in wild soybean populations, and a number of QTLs were found that have not been previously discovered. 
Some of these may be useful to breeders who select for increased protein/oil content or altered fatty acid ratios in the seeds. The results also provide additional insight into the genetic architecture of these traits in a large sample of wild soybean, and suggest some new candidate genes whose molecular effects on these traits need to be further studied.

  17. Effect of polymorphisms in candidate genes on carcass and meat quality traits in double muscled Piemontese cattle.

    PubMed

    Ribeca, C; Bonfatti, V; Cecchinato, A; Albera, A; Gallo, L; Carnier, P

    2014-03-01

    The aim of this study was to investigate the association between 10 candidate genes and carcass weight and conformation, carcass daily gain, and meat quality (pH, color, cooking loss, drip loss and shear force) in 990 double-muscled Piemontese young bulls. Animals were genotyped at each of the following genes: growth hormone, growth hormone receptor, pro-opiomelanocortin, pro-opiomelanocortin class 1 homeobox 1, melanocortin-4 receptor, corticotrophin-releasing hormone, diacylglycerol O-acyltransferase-1, thyroglobulin, carboxypeptidase E and gamma-3 regulatory subunit of the AMP-activated protein kinase. All the investigated SNPs had additive effects which were relevant for at least one of the traits. Relevant associations between the investigated SNPs and carcass weight, carcass daily gain and carcass conformation were detected, whereas associations of SNPs with meat quality were moderate. Results confirmed some of previously reported associations, but diverged for others. Validation in other cattle breeds is required to use these SNPs in gene-assisted selection programs for enhancement of carcass traits and meat quality. Copyright © 2013 Elsevier Ltd. All rights reserved.

  18. Genome-Wide Association Study for Identifying Loci that Affect Fillet Yield, Carcass, and Body Weight Traits in Rainbow Trout (Oncorhynchus mykiss).

    PubMed

    Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M; Kenney, P Brett; Vallejo, Roger L; Palti, Yniv; Leeds, Timothy D

    2016-01-01

    Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0-1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (intron and exons). 
Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout.

  19. Genome-Wide Association Study for Identifying Loci that Affect Fillet Yield, Carcass, and Body Weight Traits in Rainbow Trout (Oncorhynchus mykiss)

    PubMed Central

    Gonzalez-Pena, Dianelys; Gao, Guangtu; Baranski, Matthew; Moen, Thomas; Cleveland, Beth M.; Kenney, P. Brett; Vallejo, Roger L.; Palti, Yniv; Leeds, Timothy D.

    2016-01-01

    Fillet yield (FY, %) is an economically-important trait in rainbow trout aquaculture that affects production efficiency. Despite that, FY has received little attention in breeding programs because it is difficult to measure on a large number of fish and cannot be directly measured on breeding candidates. The recent development of a high-density SNP array for rainbow trout has provided the needed tool for studying the underlying genetic architecture of this trait. A genome-wide association study (GWAS) was conducted for FY, body weight at 10 (BW10) and 13 (BW13) months post-hatching, head-off carcass weight (CAR), and fillet weight (FW) in a pedigreed rainbow trout population selectively bred for improved growth performance. The GWAS analysis was performed using the weighted single-step GBLUP method (wssGWAS). Phenotypic records of 1447 fish (1.5 kg at harvest) from 299 full-sib families in three successive generations, of which 875 fish from 196 full-sib families were genotyped, were used in the GWAS analysis. A total of 38,107 polymorphic SNPs were analyzed in a univariate model with hatch year and harvest group as fixed effects, harvest weight as a continuous covariate, and animal and common environment as random effects. A new linkage map was developed to create windows of 20 adjacent SNPs for use in the GWAS. The two windows with largest effect for FY and FW were located on chromosome Omy9 and explained only 1.0–1.5% of genetic variance, thus suggesting a polygenic architecture affected by multiple loci with small effects in this population. One window on Omy5 explained 1.4 and 1.0% of the genetic variance for BW10 and BW13, respectively. Three windows located on Omy27, Omy17, and Omy9 (same window detected for FY) explained 1.7, 1.7, and 1.0%, respectively, of genetic variance for CAR. Among the detected 100 SNPs, 55% were located directly in genes (intron and exons). 
Nucleotide sequences of intragenic SNPs were blasted to the Mus musculus genome to create a putative gene network. The network suggests that differences in the ability to maintain a proliferative and renewable population of myogenic precursor cells may affect variation in growth and fillet yield in rainbow trout. PMID:27920797

  20. Association of SNPs from IL1A, IL1B, and IL6 Genes with Human Cytomegalovirus Infection Among Pregnant Women.

    PubMed

    Wujcicka, Wioletta Izabela; Wilczyński, Jan Szczęsny; Nowakowska, Dorota Ewa

    2017-05-01

    The study aimed to estimate the role and prevalence rates of genotypes, haplotypes, and alleles, located within the single-nucleotide polymorphisms (SNPs) of interleukin (IL) 1A, IL1B, and IL6 genes, in the occurrence and development of human cytomegalovirus (HCMV) infection among pregnant women. The study was conducted in 129 pregnant women, of whom 65 were HCMV infected and 64 were age-matched uninfected controls. HCMV DNA was quantitated for UL55 gene by the real-time Q PCR in the body fluids. The genotypic statuses within the SNPs were determined by nested PCR-RFLP assays and confirmed by sequencing of randomly selected representative PCR products. A relationship between the genotypes and alleles, as well as haplotypes and multiple variants in the studied polymorphisms, and the occurrence of HCMV infection in pregnant women, was determined using a logistic regression model. TT genotype within IL1A polymorphism significantly decreased the risk of HCMV infection (OR 0.32, 95% CI 0.09-1.05; p ≤ 0.050). Considering IL6 SNP, the prevalence rate of GC genotype was significantly decreased among the HCMV infected, compared to the uninfected control individuals (OR 0.45, 95% CI 0.21-0.99; p ≤ 0.050). Moreover, CC homozygotic status in IL6 SNP, found in pregnant women, significantly decreased the risk of congenital infection with HCMV in their offspring (OR 0.12; p ≤ 0.050). In multiple SNP analysis, TC haplotype within the IL1 polymorphisms significantly decreased the risk of the infection in pregnant women (OR 0.38, 95% CI 0.15-0.96; p ≤ 0.050). In addition, TTG complex variants for all the studied polymorphisms and TG variants for IL1B and IL6 SNPs were significantly more prevalent among the infected offspring with symptomatic congenital cytomegaly than among the asymptomatic cases (p ≤ 0.050). 
In conclusion, the analyzed IL1A -889 C>T, IL1B +3954 C>T, and IL6 -174 G>C polymorphisms may be associated with the occurrence and development of HCMV infection among studied patients.

  1. Type 2 Diabetes Risk Allele Loci in the Qatari Population

    PubMed Central

    Abi Khalil, Charbel; Fakhro, Khalid A.; Robay, Amal; Ramstetter, Monica D.; Al-Azwani, Iman K.; Malek, Joel A.; Zirie, Mahmoud; Jayyousi, Amin; Badii, Ramin; Al-Nabet Al-Marri, Ajayeb; Chiuchiolo, Maria J.; Al-Shakaki, Alya; Chidiac, Omar; Gharbiah, Maey; Bener, Abdulbari; Stadler, Dora; Hackett, Neil R.; Mezey, Jason G.; Crystal, Ronald G.

    2016-01-01

    Background The prevalence of type 2 diabetes (T2D) is increasing in the Middle East. However, the genetic risk factors for T2D in the Middle Eastern populations are not known, as the majority of studies of genetic risk for T2D are in Europeans and Asians. Methods All subjects were ≥3 generation Qataris. Cases with T2D (n = 1,124) and controls (n = 590) were randomly recruited and assigned to the 3 known Qatari genetic subpopulations [Bedouin (Q1), Persian/South Asian (Q2) and African (Q3)]. Subjects underwent genotyping for 37 single nucleotide polymorphisms (SNPs) in 29 genes known to be associated with T2D in Europeans and/or Asian populations, and an additional 27 tag SNPs related to these susceptibility loci. Pre-study power analysis suggested that with the known incidence of T2D in adult Qataris (22%), the study population size would be sufficient to detect significant differences if the SNPs were risk factors among Qataris, assuming that the odds ratio (OR) for T2D SNPs in Qataris is greater than or equal to the SNP with highest known OR in other populations. Results Haplotype analysis demonstrated that Qatari haplotypes in the region of known T2D risk alleles in Q1 and Q2 genetic subpopulations were similar to European haplotypes. After Benjamini-Hochberg adjustment for multiple testing, only two SNPs (rs7903146 and rs4506565), both associated with transcription factor 7-like 2 (TCF7L2), achieved statistical significance in the whole study population. When T2D subjects and control subjects were assigned to the known 3 Qatari subpopulations, and analyzed individually and with the Q1 and Q2 genetic subpopulations combined, one of these SNPs (rs4506565) was also significant in the admixed group. No other SNPs were associated with T2D in all Qataris or individual genetic subpopulations. 
Conclusions With the caveats of the power analysis, the European/Asian T2D SNPs do not contribute significantly to the high prevalence of T2D in the Qatari population, suggesting that the genetic risks for T2D are likely different in Qataris compared to Europeans and Asians. PMID:27383215
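
The Benjamini-Hochberg adjustment named in this abstract controls the false discovery rate by scaling each sorted P value by (number of tests / rank) and then enforcing monotonicity from the largest P value downward. The sketch below is a plain-Python version of that standard step-up procedure; the input P values in the usage example are hypothetical and are not taken from the study, and the function name `benjamini_hochberg` is an illustrative choice.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the original input order."""
    m = len(pvalues)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value (rank m) down to the smallest (rank 1),
    # keeping a running minimum so adjusted values never decrease with rank.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


if __name__ == "__main__":
    # Hypothetical raw P values for five SNPs (not from the study).
    raw = [0.001, 0.008, 0.039, 0.041, 0.6]
    for p, q in zip(raw, benjamini_hochberg(raw)):
        print(f"raw P = {p:<6} adjusted = {q:.5f}")
```

A SNP is then declared significant when its adjusted value falls below the chosen false discovery rate, for example 0.05.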

  2. Going where traditional markers have not gone before: utility of and promise for RAD sequencing in marine invertebrate phylogeography and population genomics.

    PubMed

    Reitzel, A M; Herrera, S; Layden, M J; Martindale, M Q; Shank, T M

    2013-06-01

    Characterization of large numbers of single-nucleotide polymorphisms (SNPs) throughout a genome has the power to refine the understanding of population demographic history and to identify genomic regions under selection in natural populations. To this end, population genomic approaches that harness the power of next-generation sequencing to understand the ecology and evolution of marine invertebrates represent a boon to test long-standing questions in marine biology and conservation. We employed restriction-site-associated DNA sequencing (RAD-seq) to identify SNPs in natural populations of the sea anemone Nematostella vectensis, an emerging cnidarian model with a broad geographic range in estuarine habitats in North and South America, and portions of England. We identified hundreds of SNP-containing tags in thousands of RAD loci from 30 barcoded individuals inhabiting four locations from Nova Scotia to South Carolina. Population genomic analyses using high-confidence SNPs resulted in a highly-resolved phylogeography, a result not achieved in previous studies using traditional markers. Plots of locus-specific FST against heterozygosity suggest that a majority of polymorphic sites are neutral, with a smaller proportion suggesting evidence for balancing selection. Loci inferred to be under balancing selection were mapped to the genome, where 90% were located in gene bodies, indicating potential targets of selection. The results from analyses with and without a reference genome supported similar conclusions, further highlighting RAD-seq as a method that can be efficiently applied to species lacking existing genomic resources. We discuss the utility of RAD-seq approaches in burgeoning Nematostella research as well as in other cnidarian species, particularly corals and jellyfishes, to determine phylogeographic relationships of populations and identify regions of the genome undergoing selection. © 2013 John Wiley & Sons Ltd.

  3. Going where traditional markers have not gone before: utility of and promise for RAD sequencing in marine invertebrate phylogeography and population genomics

    PubMed Central

    Reitzel, A.M.; Herrera, S.; Layden, M.J.; Martindale, M.Q.; Shank, T.M.

    2013-01-01

    Characterization of large numbers of single nucleotide polymorphisms (SNPs) throughout a genome has the power to refine the understanding of population demographic history and to identify genomic regions under selection in natural populations. To this end, population genomic approaches that harness the power of next-generation sequencing to understand the ecology and evolution of marine invertebrates represent a boon to test long-standing questions in marine biology and conservation. We employed restriction-site-associated DNA sequencing (RAD-seq) to identify SNPs in natural populations of the sea anemone Nematostella vectensis, an emerging cnidarian model with a broad geographic range in estuarine habitats in North and South America, and portions of England. We identified hundreds of SNP-containing tags in thousands of RAD loci from 30 barcoded individuals inhabiting four locations from Nova Scotia to South Carolina. Population genomic analyses using high-confidence SNPs resulted in a highly-resolved phylogeography, a result not achieved in previous studies using traditional markers. Plots of locus-specific FST against heterozygosity suggest that a majority of polymorphic sites are neutral, with a smaller proportion suggesting evidence for balancing selection. Loci inferred to be under balancing selection were mapped to the genome, where 90% were located in gene bodies, indicating potential targets of selection. Results from analyses with and without a reference genome supported similar conclusions, further supporting RAD-seq as a method that can be efficiently applied to species lacking existing genomic resources. We discuss the utility of RAD-seq approaches in burgeoning Nematostella research as well as in other cnidarian species, particularly corals, to determine phylogeographic relationships of populations and identify regions of the genome undergoing selection. PMID:23473066

  4. Selection signature analysis in Holstein cattle identified genes known to affect reproduction

    USDA-ARS's Scientific Manuscript database

    Using direct comparison of 45,878 SNPs between a group of Holstein cattle unselected since 1964 and contemporary Holsteins that on average take 30 days longer for successful conception than the 1964 Holsteins, we conducted selection signature analyses to identify genomic regions associated with dair...

  5. Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

    PubMed Central

    Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry

    2014-01-01

    Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. 
The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. The SNP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/. PMID:24784581
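
    The weighted f-measure reported in this record is commonly computed as the support-weighted mean of per-class F1 scores. The sketch below assumes that definition; the class labels follow Problem 3 of the abstract, but the counts are hypothetical and are not results from the paper.

```python
def weighted_f_measure(per_class_counts):
    """Support-weighted mean of per-class F1 scores.

    per_class_counts maps a class label to a tuple
    (true_positives, false_positives, false_negatives).
    """
    total_support = 0
    weighted_sum = 0.0
    for tp, fp, fn in per_class_counts.values():
        support = tp + fn            # number of true members of the class
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        weighted_sum += support * f1
        total_support += support
    return weighted_sum / total_support


# Hypothetical confusion counts for a 3-class problem.
counts = {
    "detrimental": (40, 10, 10),
    "neutral": (30, 10, 20),
    "beneficial": (10, 10, 0),
}
score = weighted_f_measure(counts)
```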

  6. Mining of haplotype-based expressed sequence tag single nucleotide polymorphisms in citrus

    PubMed Central

    2013-01-01

    Background Single nucleotide polymorphisms (SNPs), the most abundant variations in a genome, have been widely used in various studies. Detection and characterization of citrus haplotype-based expressed sequence tag (EST) SNPs will greatly facilitate further utilization of these gene-based resources. Results In this paper, haplotype-based SNPs were mined out of publicly available citrus expressed sequence tags (ESTs) from different citrus cultivars (genotypes) individually and collectively for comparison. There were a total of 567,297 ESTs belonging to 27 cultivars in varying numbers and consequentially yielding different numbers of haplotype-based quality SNPs. Sweet orange (SO) had the most (213,830) ESTs, generating 11,182 quality SNPs in 3,327 out of 4,228 usable contigs. Summed from all the individually mining results, a total of 25,417 quality SNPs were discovered – 15,010 (59.1%) were transitions (AG and CT), 9,114 (35.9%) were transversions (AC, GT, CG, and AT), and 1,293 (5.0%) were insertion/deletions (indels). A vast majority of SNP-containing contigs consisted of only 2 haplotypes, as expected, but the percentages of 2 haplotype contigs varied widely in these citrus cultivars. BLAST of the 25,417 25-mer SNP oligos to the Clementine reference genome scaffolds revealed 2,947 SNPs had “no hits found”, 19,943 had 1 unique hit / alignment, 1,571 had one hit and 2+ alignments per hit, and 956 had 2+ hits and 1+ alignment per hit. Of the total 24,293 scaffold hits, 23,955 (98.6%) were on the main scaffolds 1 to 9, and only 338 were on 87 minor scaffolds. Most alignments had 100% (25/25) or 96% (24/25) nucleotide identities, accounting for 93% of all the alignments. Considering almost all the nucleotide discrepancies in the 24/25 alignments were at the SNP sites, it served well as in silico validation of these SNPs, in addition to and consistent with the rate (81%) validated by sequencing and SNaPshot assay. 
Conclusions High-quality EST-SNPs from different citrus genotypes were detected, and compared to estimate the heterozygosity of each genome. All the SNP oligo sequences were aligned with the Clementine citrus genome to determine their distribution and uniqueness and for in silico validation, in addition to SNaPshot and sequencing validation of selected SNPs. PMID:24175923
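
    The transition/transversion split reported in this record follows from the purine/pyrimidine classes of the two alleles. The sketch below shows that rule and checks that the three reported category counts add up to the reported total; the function and variable names are illustrative only.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def classify_substitution(ref, alt):
    """Classify a single-base substitution as a transition or transversion."""
    ref, alt = ref.upper(), alt.upper()
    pair = {ref, alt}
    if ref == alt or not pair <= PURINES | PYRIMIDINES:
        raise ValueError("expected two different bases from A, C, G, T")
    # Purine<->purine or pyrimidine<->pyrimidine changes are transitions.
    if pair <= PURINES or pair <= PYRIMIDINES:
        return "transition"
    return "transversion"


# Category counts as reported in the abstract.
reported = {"transition": 15010, "transversion": 9114, "indel": 1293}
total = sum(reported.values())
shares = {kind: count / total for kind, count in reported.items()}
```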

  7. Contributions of Caucasian-associated bone mass loci to the variation in bone mineral density in Vietnamese population.

    PubMed

    Ho-Pham, Lan T; Nguyen, Sing C; Tran, Bich; Nguyen, Tuan V

    2015-07-01

    Bone mineral density (BMD) is under strong genetic regulation, but it is not clear which genes are involved in the regulation, particularly in Asian populations. This study sought to determine the association between 29 genes discovered by Caucasian-based genome-wide association studies and BMD in a Vietnamese population. The study involved 564 Vietnamese men and women aged 18 years and over (average age: 47 years) who were randomly sampled from Ho Chi Minh City. BMD at the femoral neck, lumbar spine, total hip and whole body was measured by DXA (Hologic QDR4500, Bedford, MA, USA). Thirty-two single nucleotide polymorphisms (SNPs) in 29 genes were genotyped using Sequenom MassARRAY technology. The magnitude of association between SNPs and BMD was analyzed by the linear regression model. The Bayesian model average method was used to identify SNPs that are independently associated with BMD. The distribution of genotypes of all but two SNPs was consistent with the Hardy-Weinberg equilibrium law. After adjusting for age, gender and weight, 3 SNPs were associated with BMD: rs2016266 (SP7 gene), rs7543680 (ZBTB40 gene), and rs1373004 (MBL2/DKK1 gene). Among the three genetic variants, the SNP rs2016266 had the strongest association, with each minor allele being associated with ~0.02 g/cm² increase in BMD at the femoral neck and whole body. Each of these genetic variants explained about 0.2 to 1.1% variance of BMD. All other SNPs were not significantly associated with BMD. These results suggest that genetic variants in the SP7, ZBTB40 and MBL2/DKK1 genes are associated with BMD in the Vietnamese population, and that the effect of these genes on BMD is likely to be modest. Copyright © 2015 Elsevier Inc. All rights reserved.
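
    A check of genotype counts against Hardy-Weinberg equilibrium, as mentioned in this record, is often done with a one-degree-of-freedom chi-square test. The sketch below assumes that test (the abstract does not say which test was used) and uses hypothetical genotype counts for a biallelic SNP.

```python
import math


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test of Hardy-Weinberg proportions for a biallelic SNP.

    Returns (chi2, p_value); the p-value uses 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)   # frequency of allele A
    q = 1.0 - p
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts: the first set sits exactly at equilibrium,
# the second (all heterozygotes) departs from it strongly.
chi2_ok, p_ok = hwe_chi_square(141, 282, 141)
chi2_bad, p_bad = hwe_chi_square(0, 100, 0)
```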

  8. Novel genetic markers of the carbonic anhydrase II gene associated with egg production and reproduction traits in Tsaiya ducks.

    PubMed

    Chang, M-T; Cheng, Y-S; Huang, M-C

    2013-02-01

    In our previous cDNA microarray study, we found that the carbonic anhydrase II (CA2) gene is one of the differentially expressed transcripts in the duck isthmus epithelium during the egg formation period. The aim of this study was to identify the single-nucleotide polymorphisms (SNPs) in the CA2 gene of Tsaiya ducks. The relationship of SNP genotype with egg production and reproduction traits was also investigated. A total of 317 ducks from two lines, a control line with no selection and a selected line, were employed for testing. Three SNPs (C37T, A62G and A65G) in the 3'-untranslated region of the CA2 gene were found. SNP-trait association analysis showed that SNP C37T and A62G were associated with duck egg weight as well as fertility. The ducks with the CT and AG genotypes had a 1.46 and 1.62 g/egg lower egg weight as compared with ducks with the CC and AA genotypes, respectively (p < 0.05). But the ducks with CT and AG genotypes had 5.20% and 4.22% higher fertility than those with CC and AA genotypes, respectively (p < 0.05). Diplotype constructed on these three SNPs was associated with duck fertility, and the diplotype H1H4 was dominant for duck fertility. These findings might provide the basis for balanced selection and may be used in marker-assisted selection to improve egg weight and fertility simultaneously in the Tsaiya ducks. © 2012 Blackwell Verlag GmbH.

  9. Adaptations to Climate-Mediated Selective Pressures in Sheep

    PubMed Central

    Lv, Feng-Hua; Agha, Saif; Kantanen, Juha; Colli, Licia; Stucki, Sylvie; Kijas, James W.; Joost, Stéphane; Li, Meng-Hua; Ajmone Marsan, Paolo

    2014-01-01

    Following domestication, sheep (Ovis aries) have become essential farmed animals across the world through adaptation to a diverse range of environments and varied production systems. Climate-mediated selective pressure has shaped phenotypic variation and has left genetic “footprints” in the genome of breeds raised in different agroecological zones. Unlike numerous studies that have searched for evidence of selection using only population genetics data, here, we conducted an integrated coanalysis of environmental data with single nucleotide polymorphism (SNP) variation. By examining 49,034 SNPs from 32 old, autochthonous sheep breeds that are adapted to a spectrum of different regional climates, we identified 230 SNPs with evidence for selection that is likely due to climate-mediated pressure. Among them, 189 (82%) showed significant correlation (P ≤ 0.05) between allele frequency and climatic variables in a larger set of native populations from a worldwide range of geographic areas and climates. Gene ontology analysis of genes colocated with significant SNPs identified 17 candidates related to GTPase regulator and peptide receptor activities in the biological processes of energy metabolism and endocrine and autoimmune regulation. We also observed high linkage disequilibrium and significant extended haplotype homozygosity for the core haplotype TBC1D12-CH1 of TBC1D12. The global frequency distribution of the core haplotype and allele OAR22_18929579-A showed an apparent geographic pattern and significant (P ≤ 0.05) correlations with climatic variation. Our results imply that adaptations to local climates have shaped the spatial distribution of some variants that are candidates to underpin adaptive variation in sheep. PMID:25249477

  10. Polymorphisms in ERAP1 and ERAP2 are shared by Caninae and segregate within and between random- and pure-breeds of dogs.

    PubMed

    Pedersen, N C; Dhanota, J K; Liu, H

    2016-10-15

    Specific polymorphisms in the endoplasmic reticulum amino peptidase genes ERAP1 and ERAP2, when present with certain MHC class I receptor types, have been associated with increased risk for specific cancers, infectious diseases and autoimmune disorders in humans. This increased risk has been linked to distinct polymorphisms in both ERAPs and MHC class I receptors that affect the way cell-generated peptides are screened for antigenicity. The incidence of cancer, infectious disease and autoimmune disorders differs greatly among pure breeds of dogs as it does in humans and it is possible that this heightened susceptibility is also due to specific polymorphisms in ERAP1 and ERAP2. In order to determine if such polymorphisms exist, the ERAP1 and ERAP2 genes of 10 dogs of nine diverse breeds were sequenced and SNPs causing synonymous or non-synonymous amino acid changes, deletions or insertions were identified. Eight ERAP1 and 10 ERAP2 SNPs were used to create a Sequenom MassARRAY iPLEX based test panel which defined 24 ERAP1, 36 ERAP2 and 128 ERAP1/2 haplotypes. The prevalence of these haplotypes was then measured among dog, wolf, coyote, jackal and red fox populations. Some haplotypes were species specific, while others were shared across species, especially between dog, wolf, coyote and jackal. The prevalence of these haplotypes was then compared among various canid populations, and in particular between various populations of random- and pure-bred dogs. Human-directed positive selection has led to loss of ERAP diversity and segregation of certain haplotypes among various dog breeds. A phylogenetic tree generated from 45 of the most common ERAP1/2 haplotypes demonstrated three distinct clades, all of which were rooted with haplotypes either shared among species or specific to contemporary dogs, coyote and wolf. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  11. SNPranker 2.0: a gene-centric data mining tool for diseases associated SNP prioritization in GWAS.

    PubMed

    Merelli, Ivan; Calabria, Andrea; Cozzi, Paolo; Viti, Federica; Mosca, Ettore; Milanesi, Luciano

    2013-01-01

    The capability of correlating specific genotypes with human diseases is a complex issue in spite of all advantages arisen from high-throughput technologies, such as Genome Wide Association Studies (GWAS). New tools for genetic variants interpretation and for Single Nucleotide Polymorphisms (SNPs) prioritization are actually needed. Given a list of the most relevant SNPs statistically associated to a specific pathology as result of a genotype study, a critical issue is the identification of genes that are effectively related to the disease by re-scoring the importance of the identified genetic variations. Vice versa, given a list of genes, it can be of great importance to predict which SNPs can be involved in the onset of a particular disease, in order to focus the research on their effects. We propose a new bioinformatics approach to support biological data mining in the analysis and interpretation of SNPs associated to pathologies. This system can be employed to design custom genotyping chips for disease-oriented studies and to re-score GWAS results. The proposed method relies (1) on the data integration of public resources using a gene-centric database design, (2) on the evaluation of a set of static biomolecular annotations, defined as features, and (3) on the SNP scoring function, which computes SNP scores using parameters and weights set by users. We employed a machine learning classifier to set default feature weights and an ontological annotation layer to enable the enrichment of the input gene set. We implemented our method as a web tool called SNPranker 2.0 (http://www.itb.cnr.it/snpranker), improving our first published release of this system. A user-friendly interface allows the input of a list of genes, SNPs or a biological process, and to customize the features set with relative weights. As result, SNPranker 2.0 returns a list of SNPs, localized within input and ontologically enriched genes, combined with their prioritization scores. 
Different databases and resources are already available for SNPs annotation, but they do not prioritize or re-score SNPs relying on a-priori biomolecular knowledge. SNPranker 2.0 attempts to fill this gap through a user-friendly integrated web resource. End users, such as researchers in medical genetics and epidemiology, may find in SNPranker 2.0 a new tool for data mining and interpretation able to support SNPs analysis. Possible scenarios are GWAS data re-scoring, SNPs selection for custom genotyping arrays and SNPs/diseases association studies.

  12. A System-Level Pathway-Phenotype Association Analysis Using Synthetic Feature Random Forest

    PubMed Central

    Pan, Qinxin; Hu, Ting; Malley, James D.; Andrew, Angeline S.; Karagas, Margaret R.; Moore, Jason H.

    2015-01-01

    As the cost of genome-wide genotyping decreases, the number of genome-wide association studies (GWAS) has increased considerably. However, the transition from GWAS findings to the underlying biology of various phenotypes remains challenging. As a result, due to its system-level interpretability, pathway analysis has become a popular tool for gaining insights on the underlying biology from high-throughput genetic association data. In pathway analyses, gene sets representing particular biological processes are tested for significant associations with a given phenotype. Most existing pathway analysis approaches rely on single-marker statistics and assume that pathways are independent of each other. As biological systems are driven by complex biomolecular interactions, embracing the complex relationships between single-nucleotide polymorphisms (SNPs) and pathways needs to be addressed. To incorporate the complexity of gene-gene interactions and pathway-pathway relationships, we propose a system-level pathway analysis approach, synthetic feature random forest (SF-RF), which is designed to detect pathway-phenotype associations without making assumptions about the relationships among SNPs or pathways. In our approach, the genotypes of SNPs in a particular pathway are aggregated into a synthetic feature representing that pathway via Random Forest (RF). Multiple synthetic features are analyzed using RF simultaneously and the significance of a synthetic feature indicates the significance of the corresponding pathway. We further complement SF-RF with pathway-based Statistical Epistasis Network (SEN) analysis that evaluates interactions among pathways. By investigating the pathway SEN, we hope to gain additional insights into the genetic mechanisms contributing to the pathway-phenotype association. We apply SF-RF to a population-based genetic study of bladder cancer and further investigate the mechanisms that help explain the pathway-phenotype associations using SEN. 
The bladder cancer associated pathways we found are both consistent with existing biological knowledge and reveal novel and plausible hypotheses for future biological validations. PMID:24535726

  13. Higher Magnesium Intake Is Associated with Lower Fasting Glucose and Insulin, with No Evidence of Interaction with Select Genetic Loci, in a Meta-Analysis of 15 CHARGE Consortium Studies

    PubMed Central

    Hruby, Adela; Ngwa, Julius S.; Renström, Frida; Wojczynski, Mary K.; Ganna, Andrea; Hallmans, Göran; Houston, Denise K.; Jacques, Paul F.; Kanoni, Stavroula; Lehtimäki, Terho; Lemaitre, Rozenn N.; Manichaikul, Ani; North, Kari E.; Ntalla, Ioanna; Sonestedt, Emily; Tanaka, Toshiko; van Rooij, Frank J. A.; Bandinelli, Stefania; Djoussé, Luc; Grigoriou, Efi; Johansson, Ingegerd; Lohman, Kurt K.; Pankow, James S.; Raitakari, Olli T.; Riserus, Ulf; Yannakoulia, Mary; Zillikens, M. Carola; Hassanali, Neelam; Liu, Yongmei; Mozaffarian, Dariush; Papoutsakis, Constantina; Syvänen, Ann-Christine; Uitterlinden, André G.; Viikari, Jorma; Groves, Christopher J.; Hofman, Albert; Lind, Lars; McCarthy, Mark I.; Mikkilä, Vera; Mukamal, Kenneth; Franco, Oscar H.; Borecki, Ingrid B.; Cupples, L. Adrienne; Dedoussis, George V.; Ferrucci, Luigi; Hu, Frank B.; Ingelsson, Erik; Kähönen, Mika; Kao, W. H. Linda; Kritchevsky, Stephen B.; Orho-Melander, Marju; Prokopenko, Inga; Rotter, Jerome I.; Siscovick, David S.; Witteman, Jacqueline C. M.; Franks, Paul W.; Meigs, James B.; McKeown, Nicola M.; Nettleton, Jennifer A.

    2013-01-01

    Favorable associations between magnesium intake and glycemic traits, such as fasting glucose and insulin, are observed in observational and clinical studies, but whether genetic variation affects these associations is largely unknown. We hypothesized that single nucleotide polymorphisms (SNPs) associated with either glycemic traits or magnesium metabolism affect the association between magnesium intake and fasting glucose and insulin. Fifteen studies from the CHARGE (Cohorts for Heart and Aging Research in Genomic Epidemiology) Consortium provided data from up to 52,684 participants of European descent without known diabetes. In fixed-effects meta-analyses, we quantified 1) cross-sectional associations of dietary magnesium intake with fasting glucose (mmol/L) and insulin (ln-pmol/L) and 2) interactions between magnesium intake and SNPs related to fasting glucose (16 SNPs), insulin (2 SNPs), or magnesium (8 SNPs) on fasting glucose and insulin. After adjustment for age, sex, energy intake, BMI, and behavioral risk factors, magnesium (per 50-mg/d increment) was inversely associated with fasting glucose [β = −0.009 mmol/L (95% CI: −0.013, −0.005), P < 0.0001] and insulin [−0.020 ln-pmol/L (95% CI: −0.024, −0.017), P < 0.0001]. No magnesium-related SNP or interaction between any SNP and magnesium reached significance after correction for multiple testing. However, rs2274924 in magnesium transporter-encoding TRPM6 showed a nominal association (uncorrected P = 0.03) with glucose, and rs11558471 in SLC30A8 and rs3740393 near CNNM2 showed a nominal interaction (uncorrected, both P = 0.02) with magnesium on glucose. Consistent with other studies, a higher magnesium intake was associated with lower fasting glucose and insulin. Nominal evidence of TRPM6 influence and magnesium interaction with select loci suggests that further investigation is warranted. PMID:23343670
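
    The fixed-effects meta-analysis named in this record is usually carried out by inverse-variance weighting of the per-study estimates. The sketch below assumes that weighting (the abstract does not specify it) and uses two hypothetical per-study effect estimates and standard errors, not values from the CHARGE cohorts.

```python
import math


def fixed_effects_meta(estimates, standard_errors):
    """Inverse-variance fixed-effects pooling of per-study estimates.

    Returns (pooled_estimate, pooled_se, (ci_low, ci_high)) with a
    normal-approximation 95% confidence interval.
    """
    weights = [1.0 / se ** 2 for se in standard_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    ci = (pooled - 1.96 * pooled_se, pooled + 1.96 * pooled_se)
    return pooled, pooled_se, ci


# Hypothetical per-study betas (mmol/L per 50 mg/d magnesium) and SEs.
betas = [-0.010, -0.008]
ses = [0.002, 0.002]
pooled, pooled_se, ci = fixed_effects_meta(betas, ses)
```

    Studies with smaller standard errors receive proportionally more weight, so large cohorts dominate the pooled estimate.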

  14. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).

    PubMed

    Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair

    2014-02-06

    Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. 
This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.

  15. Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)

    PubMed Central

    2014-01-01

    Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. 
This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230

  16. IMHOTEP—a composite score integrating popular tools for predicting the functional consequences of non-synonymous sequence variants

    PubMed Central

    Knecht, Carolin; Mort, Matthew; Junge, Olaf; Cooper, David N.; Krawczak, Michael

    2017-01-01

    Abstract The in silico prediction of the functional consequences of mutations is an important goal of human pathogenetics. However, bioinformatic tools that classify mutations according to their functionality employ different algorithms so that predictions may vary markedly between tools. We therefore integrated nine popular prediction tools (PolyPhen-2, SNPs&GO, MutPred, SIFT, MutationTaster2, Mutation Assessor and FATHMM as well as conservation-based Grantham Score and PhyloP) into a single predictor. The optimal combination of these tools was selected by means of a wide range of statistical modeling techniques, drawing upon 10 029 disease-causing single nucleotide variants (SNVs) from Human Gene Mutation Database and 10 002 putatively ‘benign’ non-synonymous SNVs from UCSC. Predictive performance was found to be markedly improved by model-based integration, whilst maximum predictive capability was obtained with either random forest, decision tree or logistic regression analysis. A combination of PolyPhen-2, SNPs&GO, MutPred, MutationTaster2 and FATHMM was found to perform as well as all tools combined. Comparison of our approach with other integrative approaches such as Condel, CoVEC, CAROL, CADD, MetaSVM and MetaLR using an independent validation dataset, revealed the superiority of our newly proposed integrative approach. An online implementation of this approach, IMHOTEP (‘Integrating Molecular Heuristics and Other Tools for Effect Prediction’), is provided at http://www.uni-kiel.de/medinfo/cgi-bin/predictor/. PMID:28180317

  17. Human cardiovascular disease IBC chip-wide association with weight loss and weight regain in the look AHEAD trial.

    PubMed

    McCaffery, Jeanne M; Papandonatos, George D; Huggins, Gordon S; Peter, Inga; Erar, Bahar; Kahn, Steven E; Knowler, William C; Lipkin, Edward W; Kitabchi, Abbas E; Wagenknecht, Lynne E; Wing, Rena R

    2013-01-01

    The present study identified genetic predictors of weight change during behavioral weight loss treatment. Participants were 3,899 overweight/obese individuals with type 2 diabetes from Look AHEAD, a randomized controlled trial to determine the effects of intensive lifestyle intervention (ILI), including weight loss and physical activity, relative to diabetes support and education, on cardiovascular outcomes. Analyses focused on associations of single nucleotide polymorphisms (SNPs) on the Illumina CARe iSelect (IBC) chip (minor allele frequency >5%; n = 31,959) with weight change at year 1 and year 4, and weight regain at year 4, among individuals who lost ≥ 3% at year 1. Two novel regions of significant chip-wide association with year-1 weight loss in ILI were identified (p < 2.96E-06). ABCB11 rs484066 was associated with 1.16 kg higher weight per minor allele at year 1, whereas TNFRSF11A, or RANK, rs17069904 was associated with 1.70 kg lower weight per allele at year 1. This study, the largest to date on genetic predictors of weight loss and regain, indicates that SNPs within ABCB11, related to bile salt transfer, and TNFRSF11A, implicated in adipose tissue physiology, predict the magnitude of weight loss during behavioral intervention. These results provide new insights into potential biological mechanisms and may ultimately inform weight loss treatment. © 2013 S. Karger AG, Basel.

  18. S187. SEARCHING FOR BRAIN CO-EXPRESSION MODULES THAT CONTRIBUTE DISPROPORTIONATELY TO THE COMMON POLYGENIC RISK FOR SCHIZOPHRENIA

    PubMed Central

    Costas, Javier; Paramo, Mario; Arrojo, Manuel

    2018-01-01

    Abstract Background Genomic research has revealed that schizophrenia is a highly polygenic disease. Recent estimates indicate that at least 71% of genomic segments of 1 Mb include one or more risk loci for schizophrenia (Loh et al., Nature Genet 2015). This extremely high polygenicity represents a challenge to decipher the biological basis of schizophrenia, as it is expected that any sufficiently large set of SNPs will be associated with the disorder. Among the different gene sets available for study (such as those from Gene Ontology, KEGG pathway, Reactome pathways or protein-protein interaction datasets), those based on brain co-expression networks represent putative functional relationships in the relevant tissue. The aim of this work was to identify brain co-expression networks that contribute disproportionately to the common polygenic risk for schizophrenia to gain more insight into schizophrenia etiopathology. Methods We analyzed a case-control dataset consisting of 582 schizophrenia patients from Galicia, NW Spain, and 591 ancestrally matched controls, genotyped with the Illumina PsychArray. Using as discovery sample the summary results from the largest GWAS of schizophrenia to date (Psychiatric Genomics Consortium, SCZ2), we generated polygenic risk scores (PRS) in our sample based on SNPs located at genes belonging to brain co-expression modules determined by the CommonMind Consortium (Fromer et al., Nature Neurosci 2016). PRS were generated using the clumping procedure of PLINK, considering several different thresholds to select SNPs from the discovery sample. In order to test if any specific module increased risk of schizophrenia more than expected from its size, we generated up to 10,000 random permutations of the same number of SNPs, matched by frequency, distance to nearest gene, number of SNPs in LD and gene density, using SNPsnap. Results As expected, most modules with a sufficient number of independent SNPs belonging to them showed a significant increase in Nagelkerke’s R2 in our case-control sample after the addition of the module-specific PRS in a logistic regression model. Our permutation strategy revealed that most modules did not show an excess of risk, measured by increase in Nagelkerke’s R2, in comparison to an equal number of SNPs with similar characteristics. But one module, M2c from Fromer et al., remained highly significant after multiple-testing correction. Reactome pathways analysis revealed an over-representation of genes involved in “Neuronal System” and “Axon guidance” among genes from this module. Using the same protocol, we detected that the 84 genes from the neuronal system pathway at this module, representing less than 6% of the genes from the module, explained a higher level of risk than expected. “Voltage-gated Potassium channels” and “Neurexins and neuroligins” are overrepresented among the Neuronal System genes from module M2c. Discussion Here, we show that, in spite of the high polygenicity of schizophrenia, it is possible to identify gene sets contributing disproportionately to total risk, as was the case for the M2c module from Fromer et al. These authors have previously reported that the M2c module was enriched in GWAS signals, as well as CNVs and rare variants associated with schizophrenia. Therefore, this module shows a disproportionate contribution to schizophrenia risk. Study supported by Grant PI14/01020 from Instituto de Salud Carlos III, Ministry of Health, Spanish Government.
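
    The permutation comparison described in this abstract (a module's gain in Nagelkerke's R2 set against the gains from matched random SNP sets) can be illustrated with a minimal sketch. The function name and the numbers are hypothetical, and the add-one correction is a common convention for empirical p-values rather than something the abstract states.

```python
def empirical_p_value(observed_gain, null_gains):
    """Share of matched random SNP sets whose R2 gain is at least the observed gain.

    The +1 in numerator and denominator counts the observed value as one
    permutation, so the estimate is never exactly zero (an assumed convention).
    """
    as_extreme = sum(1 for gain in null_gains if gain >= observed_gain)
    return (as_extreme + 1) / (len(null_gains) + 1)


if __name__ == "__main__":
    # Hypothetical R2 gains: one module versus five matched random SNP sets.
    null = [0.001, 0.004, 0.002, 0.003, 0.0015]
    print(empirical_p_value(0.006, null))
```

    With 10,000 permutations, the smallest p-value this estimator can return is 1/10,001, which bounds how significant a module can appear after correction for the number of modules tested.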

  19. Effects of GWAS-Associated Genetic Variants on lncRNAs within IBD and T1D Candidate Loci

    PubMed Central

    Brorsson, Caroline A.; Pociot, Flemming

    2014-01-01

    Long non-coding RNAs are a new class of non-coding RNAs that are at the crosshairs in many human diseases such as cancers, cardiovascular disorders, inflammatory and autoimmune disease like Inflammatory Bowel Disease (IBD) and Type 1 Diabetes (T1D). Nearly 90% of the phenotype-associated single-nucleotide polymorphisms (SNPs) identified by genome-wide association studies (GWAS) lie outside of the protein coding regions, and map to the non-coding intervals. However, the relationship between phenotype-associated loci and the non-coding regions including the long non-coding RNAs (lncRNAs) is poorly understood. Here, we systemically identified all annotated IBD and T1D loci-associated lncRNAs, and mapped nominally significant GWAS/ImmunoChip SNPs for IBD and T1D within these lncRNAs. Additionally, we identified tissue-specific cis-eQTLs, and strong linkage disequilibrium (LD) signals associated with these SNPs. We explored sequence and structure based attributes of these lncRNAs, and also predicted the structural effects of mapped SNPs within them. We also identified lncRNAs in IBD and T1D that are under recent positive selection. Our analysis identified putative lncRNA secondary structure-disruptive SNPs within and in close proximity (+/−5 kb flanking regions) of IBD and T1D loci-associated candidate genes, suggesting that these RNA conformation-altering polymorphisms might be associated with diseased-phenotype. Disruption of lncRNA secondary structure due to presence of GWAS SNPs provides valuable information that could be potentially useful for future structure-function studies on lncRNAs. PMID:25144376

  20. Increased fire frequency promotes stronger spatial genetic structure and natural selection at regional and local scales in Pinus halepensis Mill.

    PubMed

    Budde, Katharina B; González-Martínez, Santiago C; Navascués, Miguel; Burgarella, Concetta; Mosca, Elena; Lorenzo, Zaida; Zabal-Aguirre, Mario; Vendramin, Giovanni G; Verdú, Miguel; Pausas, Juli G; Heuertz, Myriam

    2017-04-01

    The recurrence of wildfires is predicted to increase due to global climate change, resulting in severe impacts on biodiversity and ecosystem functioning. Recurrent fires can drive plant adaptation and reduce genetic diversity; however, the underlying population genetic processes have not been studied in detail. In this study, the neutral and adaptive evolutionary effects of contrasting fire regimes were examined in the keystone tree species Pinus halepensis Mill. (Aleppo pine), a fire-adapted conifer. The genetic diversity, demographic history and spatial genetic structure were assessed at local (within-population) and regional scales for populations exposed to different crown fire frequencies. Eight natural P. halepensis stands were sampled in the east of the Iberian Peninsula, five of them in a region exposed to frequent crown fires (HiFi) and three of them in an adjacent region with a low frequency of crown fires (LoFi). Samples were genotyped at nine neutral simple sequence repeats (SSRs) and at 251 single nucleotide polymorphisms (SNPs) from coding regions, some of them potentially important for fire adaptation. Fire regime had no effects on genetic diversity or demographic history. Three high-differentiation outlier SNPs were identified between HiFi and LoFi stands, suggesting fire-related selection at the regional scale. At the local scale, fine-scale spatial genetic structure (SGS) was overall weak as expected for a wind-pollinated and wind-dispersed tree species. HiFi stands displayed a stronger SGS than LoFi stands at SNPs, which probably reflected the simultaneous post-fire recruitment of co-dispersed related seeds. SNPs with exceptionally strong SGS, a proxy for microenvironmental selection, were only reliably identified under the HiFi regime. An increasing fire frequency as predicted due to global change can promote increased SGS with stronger family structures and alter natural selection in P. halepensis and in plants with similar life history traits. © The Author 2017. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  1. Novel genes identified in a high-density genome wide association study for nicotine dependence.

    PubMed

    Bierut, Laura Jean; Madden, Pamela A F; Breslau, Naomi; Johnson, Eric O; Hatsukami, Dorothy; Pomerleau, Ovide F; Swan, Gary E; Rutter, Joni; Bertelsen, Sarah; Fox, Louis; Fugman, Douglas; Goate, Alison M; Hinrichs, Anthony L; Konvicka, Karel; Martin, Nicholas G; Montgomery, Grant W; Saccone, Nancy L; Saccone, Scott F; Wang, Jen C; Chase, Gary A; Rice, John P; Ballinger, Dennis G

    2007-01-01

    Tobacco use is a leading contributor to disability and death worldwide, and genetic factors contribute in part to the development of nicotine dependence. To identify novel genes for which natural variation contributes to the development of nicotine dependence, we performed a comprehensive genome wide association study using nicotine dependent smokers as cases and non-dependent smokers as controls. To allow the efficient, rapid, and cost effective screen of the genome, the study was carried out using a two-stage design. In the first stage, genotyping of over 2.4 million single nucleotide polymorphisms (SNPs) was completed in case and control pools. In the second stage, we selected SNPs for individual genotyping based on the most significant allele frequency differences between cases and controls from the pooled results. Individual genotyping was performed in 1050 cases and 879 controls using 31 960 selected SNPs. The primary analysis, a logistic regression model with covariates of age, gender, genotype and gender by genotype interaction, identified 35 SNPs with P-values less than 10^-4 (minimum P-value 1.53 × 10^-6). Although none of the individual findings is statistically significant after correcting for multiple tests, additional statistical analyses support the existence of true findings in this group. Our study nominates several novel genes, such as Neurexin 1 (NRXN1), in the development of nicotine dependence while also identifying a known candidate gene, the beta3 nicotinic cholinergic receptor. This work anticipates the future directions of large-scale genome wide association studies with state-of-the-art methodological approaches and sharing of data with the scientific community.

  2. The easy road to genome-wide medium density SNP screening in a non-model species: development and application of a 10 K SNP-chip for the house sparrow (Passer domesticus).

    PubMed

    Hagen, Ingerid J; Billing, Anna M; Rønning, Bernt; Pedersen, Sindre A; Pärn, Henrik; Slate, Jon; Jensen, Henrik

    2013-05-01

    With the advent of next generation sequencing, new avenues have opened to study genomics in wild populations of non-model species. Here, we describe a successful approach to a genome-wide medium density Single Nucleotide Polymorphism (SNP) panel in a non-model species, the house sparrow (Passer domesticus), through the development of a 10 K Illumina iSelect HD BeadChip. Genomic DNA and cDNA derived from six individuals were sequenced on a 454 GS FLX system and generated a total of 1.2 million sequences, in which SNPs were detected. As no reference genome exists for the house sparrow, we used the zebra finch (Taeniopygia guttata) reference genome to determine the most likely position of each SNP. The 10 000 SNPs on the SNP-chip were selected to be distributed evenly across 31 chromosomes, giving on average one SNP per 100 000 bp. The SNP-chip was screened across 1968 individual house sparrows from four island populations. Of the original 10 000 SNPs, 7413 were found to be variable, and 99% of these SNPs were successfully called in at least 93% of all individuals. We used the SNP-chip to demonstrate the ability of such genome-wide marker data to detect population sub-division, and compared these results to similar analyses using microsatellites. The SNP-chip will be used to map Quantitative Trait Loci (QTL) for fitness-related phenotypic traits in natural populations. © 2013 Blackwell Publishing Ltd.

  3. Inferring Geographic Coordinates of Origin for Europeans Using Small Panels of Ancestry Informative Markers

    PubMed Central

    Paschou, Peristera

    2010-01-01

    Recent large-scale studies of European populations have demonstrated the existence of population genetic structure within Europe and the potential to accurately infer individual ancestry when information from hundreds of thousands of genetic markers is used. In fact, when genomewide genetic variation of European populations is projected down to a two-dimensional Principal Components Analysis plot, a surprising correlation with actual geographic coordinates of self-reported ancestry has been reported. This substructure can hamper the search of susceptibility genes for common complex disorders leading to spurious correlations. The identification of genetic markers that can correct for population stratification becomes therefore of paramount importance. Analyzing 1,200 individuals from 11 populations genotyped for more than 500,000 SNPs (Population Reference Sample), we present a systematic exploration of the extent to which geographic coordinates of origin within Europe can be predicted, with small panels of SNPs. Markers are selected to correlate with the top principal components of the dataset, as we have previously demonstrated. Performing thorough cross-validation experiments we show that it is indeed possible to predict individual ancestry within Europe down to a few hundred kilometers from actual individual origin, using information from carefully selected panels of 500 or 1,000 SNPs. Furthermore, we show that these panels can be used to correctly assign the HapMap Phase 3 European populations to their geographic origin. The SNPs that we propose can prove extremely useful in a variety of different settings, such as stratification correction or genetic ancestry testing, and the study of the history of European populations. PMID:20805874

  4. Association of MYF5 and KLF15 gene polymorphisms with carcass traits in domestic pigeons (Columba livia).

    PubMed

    Yin, Z Z; Dong, X Y; Dong, D J; Ma, Y Z

    2016-10-01

    Single nucleotide polymorphisms (SNPs) in the exons of the myogenic factor 5 (MYF5) and Kruppel-like factor 15 (KLF15) genes were identified and analysed by using DNA sequencing methods in 60 female domestic pigeons (Columba livia). Five SNPs (T5067A, C5084T, C5101T, T5127A and C5154G) were detected in exon 3 of MYF5 and 6 SNPs (C1398T, C1464T, G1542A, C1929T, G1965A and A2355G) were found in exon 2 of KLF15, respectively. The analysis revealed three genotypes, in which the AA genotype was dominant and the A allele showed a dominant advantage. For the MYF5 gene, the C5084T and T5127A SNP genotypes were significantly associated with carcass traits of pigeons. Within those two SNPs, the BB genotype showed relatively higher trait association values than those of AA or AB genotypes. No significant association was observed between the KLF15 SNP genotypes and carcass traits. These results indicated that the MYF5 gene is a potential major gene affecting carcass traits in domestic pigeons. The BB genotype of the C5084T and T5127A SNPs could be a potential candidate genetic marker for marker-assisted selection in pigeon.

  5. Activating Transcription Factor 6 (ATF6) Sequence Polymorphisms in Type 2 Diabetes and Pre-Diabetic Traits

    PubMed Central

    Chu, Winston S.; Das, Swapan Kumar; Wang, Hua; Chan, Juliana C.; Deloukas, Panos; Froguel, Philippe; Baier, Leslie J.; Jia, Weiping; McCarthy, Mark I.; Ng, Maggie C.Y.; Damcott, Coleen; Shuldiner, Alan R.; Zeggini, Eleftheria; Elbein, Steven C.

    2009-01-01

    Activating transcription factor 6 (ATF6) is located within the region of linkage to type 2 diabetes on chromosome 1q21-q23 and is a key activator of the endoplasmic reticulum stress response. We evaluated 78 single nucleotide polymorphisms (SNPs) spanning >213 kb in 95 people, from which we selected 64 SNPs for evaluation in 191 Caucasian case subjects from Utah and between 165 and 188 control subjects. Six SNPs showed nominal associations with type 2 diabetes (P = 0.001-0.04), including the nonsynonymous SNP rs1058405 (M67V) in exon 3 and rs11579627 in the 3′ flanking region. Only rs11579627 remained significant on permutation testing. The associations were not replicated in 353 African-American case subjects and 182 control subjects, nor were ATF6 SNPs associated with altered insulin secretion or insulin sensitivity in nondiabetic Caucasian individuals. No association with type 2 diabetes was found in a subset of 44 SNPs in Caucasian (n = 2,099), Pima Indian (n = 293), and Chinese (n = 287) samples. Allelic expression imbalance was found in transformed lymphocyte cDNA for 3′ untranslated region variants, thus suggesting cis-acting regulatory variants. ATF6 does not appear to play a major role in type 2 diabetes, but further work is required to identify the cause of the allelic expression imbalance. PMID:17327457

  6. Recapitulation of Candidate Systemic Lupus Erythematosus-Associated Variants in Koreans

    PubMed Central

    Kwon, Ki-Sung; Cho, Hye-Young

    2016-01-01

    Systemic lupus erythematosus (SLE) is a chronic autoimmune disease that affects multiple organ systems. Although the etiology of SLE remains unclear, it is widely accepted that genetic factors could be involved in its pathogenesis. A number of genome-wide association studies (GWASs) have identified novel single-nucleotide polymorphisms (SNPs) associated with the risk of SLE in diverse populations. However, not all the SNP candidates identified from non-Asian populations have been validated in Koreans. In this study, we aimed to replicate the SNPs that were recently discovered in the GWAS; these SNPs have not been validated in Koreans or have only been replicated in Koreans with an insufficient sample size to conclude any association. For this, we selected five SNPs (rs1801274 in FCGR2A and rs2286672 in PLD2, rs887369 in CXorf21, rs9782955 in LYST, and rs3794060 in NADSYN1). Through the replication study with 656 cases and 622 controls, rs1801274 in FCGR2A was found to be significantly associated with SLE in Koreans (odds ratio, 1.26, 95% confidence interval, 1.06 to 1.50; p = 0.01 in allelic model). This association was also significant in two other models (dominant and recessive). The other four SNPs did not show a significant association. Our data support that FCGR polymorphisms play important roles in the susceptibility to SLE in diverse populations, including Koreans. PMID:27729837
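
    The allelic-model result quoted above (an odds ratio with a 95% confidence interval) is the kind of figure that can be derived from a 2 x 2 table of allele counts. The sketch below uses the standard Wald interval on the log odds ratio; the abstract does not report allele counts or say which interval method was used, so the function name and the example counts are hypothetical.

```python
from math import exp, log, sqrt


def allelic_odds_ratio(case_risk, case_other, control_risk, control_other, z=1.96):
    """Odds ratio and Wald confidence interval from allele counts.

    Counts are alleles (two per genotyped individual), not individuals.
    z=1.96 gives an approximate 95% interval.
    """
    odds_ratio = (case_risk * control_other) / (case_other * control_risk)
    # Standard error of log(OR) from the four cell counts.
    se_log_or = sqrt(1 / case_risk + 1 / case_other
                     + 1 / control_risk + 1 / control_other)
    lower = exp(log(odds_ratio) - z * se_log_or)
    upper = exp(log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical allele counts, not taken from the study.
    print(allelic_odds_ratio(300, 700, 250, 750))
```

    An interval whose lower bound stays above 1, as in the reported 1.06 to 1.50, indicates an association at the corresponding significance level under this approximation.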

  7. Recapitulation of Candidate Systemic Lupus Erythematosus-Associated Variants in Koreans.

    PubMed

    Kwon, Ki-Sung; Cho, Hye-Young; Chung, Yeun-Jun

    2016-09-01

    Systemic lupus erythematosus (SLE) is a chronic autoimmune disease that affects multiple organ systems. Although the etiology of SLE remains unclear, it is widely accepted that genetic factors could be involved in its pathogenesis. A number of genome-wide association studies (GWASs) have identified novel single-nucleotide polymorphisms (SNPs) associated with the risk of SLE in diverse populations. However, not all the SNP candidates identified from non-Asian populations have been validated in Koreans. In this study, we aimed to replicate the SNPs that were recently discovered in the GWAS; these SNPs have not been validated in Koreans or have only been replicated in Koreans with an insufficient sample size to conclude any association. For this, we selected five SNPs (rs1801274 in FCGR2A and rs2286672 in PLD2, rs887369 in CXorf21, rs9782955 in LYST, and rs3794060 in NADSYN1). Through the replication study with 656 cases and 622 controls, rs1801274 in FCGR2A was found to be significantly associated with SLE in Koreans (odds ratio, 1.26, 95% confidence interval, 1.06 to 1.50; p = 0.01 in allelic model). This association was also significant in two other models (dominant and recessive). The other four SNPs did not show a significant association. Our data support that FCGR polymorphisms play important roles in the susceptibility to SLE in diverse populations, including Koreans.

  8. SNP-VISTA: An interactive SNP visualization tool

    PubMed Central

    Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L

    2005-01-01

    Background Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID:16336665

  9. A function accounting for training set size and marker density to model the average accuracy of genomic prediction.

    PubMed

    Erbe, Malena; Gredler, Birgit; Seefried, Franz Reinhold; Bapst, Beat; Simianer, Henner

    2013-01-01

    Prediction of genomic breeding values is of major practical relevance in dairy cattle breeding. Deterministic equations have been suggested to predict the accuracy of genomic breeding values in a given design which are based on training set size, reliability of phenotypes, and the number of independent chromosome segments ([Formula: see text]). The aim of our study was to find a general deterministic equation for the average accuracy of genomic breeding values that also accounts for marker density and can be fitted empirically. Two data sets of 5'698 Holstein Friesian bulls genotyped with 50 K SNPs and 1'332 Brown Swiss bulls genotyped with 50 K SNPs and imputed to ∼600 K SNPs were available. Different k-fold (k = 2-10, 15, 20) cross-validation scenarios (50 replicates, random assignment) were performed using a genomic BLUP approach. A maximum likelihood approach was used to estimate the parameters of different prediction equations. The highest likelihood was obtained when using a modified form of the deterministic equation of Daetwyler et al. (2010), augmented by a weighting factor (w) based on the assumption that the maximum achievable accuracy is [Formula: see text]. The proportion of genetic variance captured by the complete SNP sets ([Formula: see text]) was 0.76 to 0.82 for Holstein Friesian and 0.72 to 0.75 for Brown Swiss. When modifying the number of SNPs, w was found to be proportional to the log of the marker density up to a limit which is population and trait specific and was found to be reached with ∼20'000 SNPs in the Brown Swiss population studied.

  10. Multigene interactions and the prediction of depression in the Wisconsin Longitudinal Study

    PubMed Central

    Roetker, Nicholas S; Yonker, James A; Lee, Chee; Chang, Vicky; Basson, Jacob J; Roan, Carol L; Hauser, Taissa S; Hauser, Robert M

    2012-01-01

    Objectives Single genetic loci offer little predictive power for the identification of depression. This study examined whether an analysis of gene–gene (G × G) interactions of 78 single nucleotide polymorphisms (SNPs) in genes associated with depression and age-related diseases would identify significant interactions with increased predictive power for depression. Design A retrospective cohort study. Setting A survey of participants in the Wisconsin Longitudinal Study. Participants A total of 4811 persons (2464 women and 2347 men) who provided saliva for genotyping; the group comes from a randomly selected sample of Wisconsin high school graduates from the class of 1957 as well as a randomly selected sibling, almost all of whom are non-Hispanic white. Primary outcome measure Depression as determined by the Composite International Diagnostic Interview–Short-Form. Results Using a classification tree approach (recursive partitioning (RP)), the authors identified a number of candidate G × G interactions associated with depression. The primary SNP splits revealed by RP (ANKK1 rs1800497 (also known as DRD2 Taq1A) in men and DRD2 rs224592 in women) were found to be significant as single factors by logistic regression (LR) after controlling for multiple testing (p=0.001 for both). Without considering interaction effects, only one of the five subsequent RP splits reached nominal significance in LR (FTO rs1421085 in women, p=0.008). However, after controlling for G × G interactions by running LR on RP-specific subsets, every split became significant and grew larger in magnitude (OR (before) → (after): men: GNRH1 novel SNP: (1.43 → 1.57); women: APOC3 rs2854116: (1.28 → 1.55), ACVR2B rs3749386: (1.11 → 2.17), FTO rs1421085: (1.32 → 1.65), IL6 rs1800795: (1.12 → 1.85)). Conclusions The results suggest that examining G × G interactions improves the identification of genetic associations predictive of depression. Four of the SNPs identified in these interactions were located in two pathways well known to impact depression: neurotransmitter (ANKK1 and DRD2) and neuroendocrine (GNRH1 and ACVR2B) signalling. This study demonstrates the utility of RP analysis as an efficient and powerful exploratory analysis technique for uncovering genetic and molecular pathway interactions associated with disease aetiology. PMID:22761283

  11. Signatures of positive selection in East African Shorthorn Zebu: a genome-wide SNP analysis

    USDA-ARS's Scientific Manuscript database

    The small East African Shorthorn Zebu is the main indigenous cattle across East Africa. A recent genome-wide SNP analysis has revealed their ancient stable African taurine x Asian zebu admixture. Here, we assess the presence of candidate signatures of positive selection in their genome, with the aim...

  12. Genome changes due to forty years of artificial selection associated with divergent dairy production and reproduction

    USDA-ARS's Scientific Manuscript database

    Artificial selection in dairy cattle since 1964 has achieved a steady increase in milk production that was accompanied by unintended declines in fertility. Direct comparison of 45,878 SNPs between a group of Holstein cattle unselected since 1964 and the contemporary Holsteins was conducted to identify...

  13. Genome-wide scan for commons SNPs affecting bovine leukemia virus infection level in dairy cattle.

    PubMed

    Carignano, Hugo A; Roldan, Dana L; Beribe, María J; Raschia, María A; Amadio, Ariel; Nani, Juan P; Gutierrez, Gerónimo; Alvarez, Irene; Trono, Karina; Poli, Mario A; Miretti, Marcos M

    2018-02-13

    Bovine leukemia virus (BLV) infection is omnipresent in dairy herds causing direct economic losses due to trade restrictions and lymphosarcoma-related deaths. Drops in milk production and increases in the culling rate are also relevant and usually neglected. The BLV provirus persists throughout a lifetime and an inter-individual variation is observed in the level of infection (LI) in vivo. High LI is strongly correlated with disease progression and BLV transmission among herd mates. In a context of high prevalence, classical control strategies are economically prohibitive. Alternatively, host genomics studies aiming to dissect loci associated with LI are potentially useful tools for genetic selection programs tending to abrogate viral spread. The LI was measured through the proviral load (PVL) set-point and white blood cells (WBC) counts. The goals of this work were to gain insight into the contribution of SNPs (bovine 50KSNP panel) on LI variability and to identify genomic regions underlying this trait. We quantified anti-p24 response and total leukocytes count in peripheral blood from 1800 cows and used these to select 800 individuals with extreme phenotypes in WBCs and PVL. Two case-control genomic association studies using linear mixed models (LMMs) considering population stratification were performed. The proportion of the variance captured by all QC-passed SNPs represented 0.63 (SE ± 0.14) of the phenotypic variance for PVL and 0.56 (SE ± 0.15) for WBCs. Overall, significant associations (Bonferroni-corrected -log10 p > 5.94) were shared for both phenotypes by 24 SNPs within the Bovine MHC. Founder haplotypes were used to measure the linkage disequilibrium (LD) extent (r2 = 0.22 ± 0.27 at inter-SNP distance of 25-50 kb). The SNPs and LD blocks indicated genes potentially associated with LI in infected cows: i.e. relevant immune response related genes (DQA1, DRB3, BOLA-A, LTA, LTB, TNF, IER3, GRP111, CRISP1), several genes involved in cell cytoskeletal reorganization (CD2AP, PKHD1, FLOT1, TUBB5) and modelling of the extracellular matrix (TRAM2, TNXB). Host transcription factors (TFs) were also highlighted (TFAP2D; ABT1, GCM1, PRRC2A). Data obtained represent a step forward to understand the biology of BLV-bovine interaction, and provide genetic information potentially applicable to selective breeding programs.
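
    The Bonferroni threshold quoted in this abstract is expressed on the -log10 scale. As a rough illustration of how such a threshold relates to the number of tests, the sketch below converts in both directions. The significance level of 0.05 and the function names are assumptions, since the abstract reports neither the alpha used nor the number of QC-passed SNPs.

```python
from math import log10


def bonferroni_neglog10(alpha, n_tests):
    """-log10 of the per-test p-value threshold under Bonferroni correction."""
    return -log10(alpha / n_tests)


def implied_number_of_tests(alpha, neglog10_threshold):
    """Number of tests that would produce a given -log10 threshold at level alpha."""
    return alpha * 10 ** neglog10_threshold


if __name__ == "__main__":
    # Assuming alpha = 0.05, a threshold of 5.94 would imply roughly this many tests.
    print(round(implied_number_of_tests(0.05, 5.94)))
```

    Under the assumed alpha of 0.05, a threshold of 5.94 would correspond to somewhat under 44,000 tests, which is at least plausible for a 50K panel after quality control, though the abstract does not confirm this.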

  14. Pooled-DNA sequencing identifies genomic regions of selection in Nigerian isolates of Plasmodium falciparum.

    PubMed

    Oyebola, Kolapo M; Idowu, Emmanuel T; Olukosi, Yetunde A; Awolola, Taiwo S; Amambua-Ngwa, Alfred

    2017-06-29

    The burden of falciparum malaria is especially high in sub-Saharan Africa. Differences in pressure from host immunity and antimalarial drugs lead to adaptive changes responsible for high level of genetic variations within and between the parasite populations. Population-specific genetic studies to survey for genes under positive or balancing selection resulting from drug pressure or host immunity will allow for refinement of interventions. We performed a pooled sequencing (pool-seq) of the genomes of 100 Plasmodium falciparum isolates from Nigeria. We explored allele-frequency based neutrality test (Tajima's D) and integrated haplotype score (iHS) to identify genes under selection. Fourteen shared iHS regions that had at least 2 SNPs with a score > 2.5 were identified. These regions code for genes that were likely to have been under strong directional selection. Two of these genes were the chloroquine resistance transporter (CRT) on chromosome 7 and the multidrug resistance 1 (MDR1) on chromosome 5. There was a weak signature of selection in the dihydrofolate reductase (DHFR) gene on chromosome 4 and MDR5 genes on chromosome 13, with only 2 and 3 SNPs respectively identified within the iHS window. We observed strong selection pressure attributable to continued chloroquine and sulfadoxine-pyrimethamine use despite their official proscription for the treatment of uncomplicated malaria. There was also a major selective sweep on chromosome 6 which had 32 SNPs within the shared iHS region. Tajima's D of circumsporozoite protein (CSP), erythrocyte-binding antigen (EBA-175), merozoite surface proteins - MSP3 and MSP7, merozoite surface protein duffy binding-like (MSPDBL2) and serine repeat antigen (SERA-5) were 1.38, 1.29, 0.73, 0.84 and 0.21, respectively. We have demonstrated the use of pool-seq to understand genomic patterns of selection and variability in P. falciparum from Nigeria, which bears the highest burden of infections. This investigation identified known genomic signatures of selection from drug pressure and host immunity. This is evidence that P. falciparum populations explore common adaptive strategies that can be targeted for the development of new interventions.
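
For readers unfamiliar with the neutrality test named in this abstract, the following is a minimal sketch of the classical Tajima's D statistic for individually sequenced samples. It is not the authors' pipeline: pooled sequencing normally needs estimators corrected for pool size and coverage, which this sketch does not attempt. The function name and the inputs used in the example are assumptions for illustration.

```python
import math


def tajimas_d(n: int, seg_sites: int, pi: float) -> float:
    """Classical Tajima's D.

    n: number of sampled sequences
    seg_sites: number of segregating sites (S)
    pi: mean number of pairwise differences between sequences
    """
    if n < 4 or seg_sites < 1:
        raise ValueError("need at least 4 sequences and 1 segregating site")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    theta_w = seg_sites / a1  # Watterson's estimator of theta
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    return (pi - theta_w) / math.sqrt(variance)


# Hypothetical inputs: 10 sequences, 10 segregating sites.
print(tajimas_d(10, 10, 5.0))  # pi above S/a1 gives a positive D
print(tajimas_d(10, 10, 2.0))  # pi below S/a1 gives a negative D
```

Positive values, like those listed for the antigen genes, indicate an excess of intermediate-frequency alleles relative to neutral expectation.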

  15. Predicting the disease of Alzheimer with SNP biomarkers and clinical data using data mining classification approach: decision tree.

    PubMed

    Erdoğan, Onur; Aydin Son, Yeşim

    2014-01-01

    Single Nucleotide Polymorphisms (SNPs) are the most common genomic variations where only a single nucleotide differs between individuals. Individual SNPs and SNP profiles associated with diseases can be utilized as biological markers. But there is a need to determine the SNP subsets and patients' clinical data which is informative for the diagnosis. Data mining approaches have the highest potential for extracting the knowledge from genomic datasets and selecting the representative SNPs as well as most effective and informative clinical features for the clinical diagnosis of the diseases. In this study, we have applied one of the widely used data mining classification methodology: "decision tree" for associating the SNP biomarkers and significant clinical data with the Alzheimer's disease (AD), which is the most common form of "dementia". Different tree construction parameters have been compared for the optimization, and the most accurate tree for predicting the AD is presented.

  16. Efficient SNP Discovery by Combining Microarray and Lab-on-a-Chip Data for Animal Breeding and Selection

    PubMed Central

    Huang, Chao-Wei; Lin, Yu-Tsung; Ding, Shih-Torng; Lo, Ling-Ling; Wang, Pei-Hwa; Lin, En-Chung; Liu, Fang-Wei; Lu, Yen-Wen

    2015-01-01

    The genetic markers associated with economic traits have been widely explored for animal breeding. Among these markers, single-nucleotide polymorphisms (SNPs) are gradually becoming a prevalent and effective evaluation tool. Because SNPs focus only on the genetic sequences of interest, they reduce the evaluation time and cost. Compared to traditional approaches, SNP genotyping techniques incorporate informative genetic background, improve the breeding prediction accuracy and acquiesce breeding quality on the farm. This article therefore reviews the typical procedures of animal breeding using SNPs and the current status of related techniques. The associated SNP information and genotyping techniques, including microarray and Lab-on-a-Chip based platforms, along with their potential are highlighted. Examples in pig and poultry with different SNP loci linked to high economic trait values are given. The recommendations for utilizing SNP genotyping in animal breeding are summarized. PMID:27600241

  17. Genome-environment associations in sorghum landraces predict adaptive traits

    PubMed Central

    Lasky, Jesse R.; Upadhyaya, Hari D.; Ramu, Punna; Deshpande, Santosh; Hash, C. Tom; Bonnette, Jason; Juenger, Thomas E.; Hyma, Katie; Acharya, Charlotte; Mitchell, Sharon E.; Buckler, Edward S.; Brenton, Zachary; Kresovich, Stephen; Morris, Geoffrey P.

    2015-01-01

    Improving environmental adaptation in crops is essential for food security under global change, but phenotyping adaptive traits remains a major bottleneck. If associations between single-nucleotide polymorphism (SNP) alleles and environment of origin in crop landraces reflect adaptation, then these could be used to predict phenotypic variation for adaptive traits. We tested this proposition in the global food crop Sorghum bicolor, characterizing 1943 georeferenced landraces at 404,627 SNPs and quantifying allelic associations with bioclimatic and soil gradients. Environment explained a substantial portion of SNP variation, independent of geographical distance, and genic SNPs were enriched for environmental associations. Further, environment-associated SNPs predicted genotype-by-environment interactions under experimental drought stress and aluminum toxicity. Our results suggest that genomic signatures of environmental adaptation may be useful for crop improvement, enhancing germplasm identification and marker-assisted selection. Together, genome-environment associations and phenotypic analyses may reveal the basis of environmental adaptation. PMID:26601206

  18. Genetic structure characterization of Chileans reflects historical immigration patterns.

    PubMed

    Eyheramendy, Susana; Martinez, Felipe I; Manevy, Federico; Vial, Cecilia; Repetto, Gabriela M

    2015-03-17

    Identifying the ancestral components of genomes of admixed individuals helps uncovering the genetic basis of diseases and understanding the demographic history of populations. We estimate local ancestry on 313 Chileans and assess the contribution from three continental populations. The distribution of ancestry block-length suggests an average admixing time around 10 generations ago. Sex-chromosome analyses confirm imbalanced contribution of European men and Native-American women. Previously known genes under selection contain SNPs showing large difference in allele frequencies. Furthermore, we show that assessing ancestry is harder at SNPs with higher recombination rates and easier at SNPs with large difference in allele frequencies at the ancestral populations. Two observations, that African ancestry proportions systematically decrease from North to South, and that European ancestry proportions are highest in central regions, show that the genetic structure of Chileans is under the influence of a diffusion process leading to an ancestry gradient related to geography.

  19. Genetic structure characterization of Chileans reflects historical immigration patterns

    PubMed Central

    Eyheramendy, Susana; Martinez, Felipe I.; Manevy, Federico; Vial, Cecilia; Repetto, Gabriela M.

    2015-01-01

    Identifying the ancestral components of genomes of admixed individuals helps uncovering the genetic basis of diseases and understanding the demographic history of populations. We estimate local ancestry on 313 Chileans and assess the contribution from three continental populations. The distribution of ancestry block-length suggests an average admixing time around 10 generations ago. Sex-chromosome analyses confirm imbalanced contribution of European men and Native-American women. Previously known genes under selection contain SNPs showing large difference in allele frequencies. Furthermore, we show that assessing ancestry is harder at SNPs with higher recombination rates and easier at SNPs with large difference in allele frequencies at the ancestral populations. Two observations, that African ancestry proportions systematically decrease from North to South, and that European ancestry proportions are highest in central regions, show that the genetic structure of Chileans is under the influence of a diffusion process leading to an ancestry gradient related to geography. PMID:25778948

  20. Selection and sex-biased dispersal in a coastal shark: the influence of philopatry on adaptive variation.

    PubMed

    Portnoy, D S; Puritz, J B; Hollenbeck, C M; Gelsleichter, J; Chapman, D; Gold, J R

    2015-12-01

    Sex-biased dispersal is expected to homogenize nuclear genetic variation relative to variation in genetic material inherited through the philopatric sex. When site fidelity occurs across a heterogeneous environment, local selective regimes may alter this pattern. We assessed spatial patterns of variation in nuclear-encoded, single nucleotide polymorphisms (SNPs) and sequences of the mitochondrial control region in bonnethead sharks (Sphyrna tiburo), a species thought to exhibit female philopatry, collected from summer habitats used for gestation. Geographic patterns of mtDNA haplotypes and putatively neutral SNPs confirmed female philopatry and male-mediated gene flow along the northeastern coast of the Gulf of Mexico. A total of 30 outlier SNP loci were identified; alleles at over half of these loci exhibited signatures of latitude-associated selection. Our results indicate that in species with sex-biased dispersal, philopatry can facilitate sorting of locally adaptive variation, with the dispersing sex facilitating movement of potentially adaptive variation among locations and environments. © 2015 John Wiley & Sons Ltd.

  1. Association Genetics of Wood Physical Traits in the Conifer White Spruce and Relationships With Gene Expression

    PubMed Central

    Beaulieu, Jean; Doerksen, Trevor; Boyle, Brian; Clément, Sébastien; Deslauriers, Marie; Beauseigle, Stéphanie; Blais, Sylvie; Poulin, Pier-Luc; Lenz, Patrick; Caron, Sébastien; Rigault, Philippe; Bicho, Paul; Bousquet, Jean; MacKay, John

    2011-01-01

    Marker-assisted selection holds promise for highly influencing tree breeding, especially for wood traits, by considerably reducing breeding cycles and increasing selection accuracy. In this study, we used a candidate gene approach to test for associations between 944 single-nucleotide polymorphism markers from 549 candidate genes and 25 wood quality traits in white spruce. A mixed-linear model approach, including a weak but nonsignificant population structure, was implemented for each marker–trait combination. Relatedness among individuals was controlled using a kinship matrix estimated either from the known half-sib structure or from the markers. Both additive and dominance effect models were tested. Between 8 and 21 single-nucleotide polymorphisms (SNPs) were found to be significantly associated (P ≤ 0.01) with each of earlywood, latewood, or total wood traits. After controlling for multiple testing (Q ≤ 0.10), 13 SNPs were still significant across as many genes belonging to different families, each accounting for between 3 and 5% of the phenotypic variance in 10 wood characters. Transcript accumulation was determined for genes containing SNPs associated with these traits. Significantly different transcript levels (P ≤ 0.05) were found among the SNP genotypes of a 1-aminocyclopropane-1-carboxylate oxidase, a β-tonoplast intrinsic protein, and a long-chain acyl-CoA synthetase 9. These results should contribute toward the development of efficient marker-assisted selection in an economically important tree species. PMID:21385726

  2. Multilocus adaptation associated with heat resistance in reef-building corals.

    PubMed

    Bay, Rachael A; Palumbi, Stephen R

    2014-12-15

    The evolution of tolerance to future climate change depends on the standing stock of genetic variation for resistance to climate-related impacts, but genes contributing to climate tolerance in wild populations are poorly described in number and effect. Physiology and gene expression patterns have shown that corals living in naturally high-temperature microclimates are more resistant to bleaching because of both acclimation and fixed effects, including adaptation. To search for potential genetic correlates of these fixed effects, we genotyped 15,399 single nucleotide polymorphisms (SNPs) in 23 individual tabletop corals, Acropora hyacinthus, within a natural temperature mosaic in backreef lagoons on Ofu Island, American Samoa. Despite overall lack of population substructure, we identified 114 highly divergent SNPs as candidates for environmental selection, via multiple stringent outlier tests, and correlations with temperature. Corals from the warmest reef location had higher minor allele frequencies across these candidate SNPs, a pattern not seen for noncandidate loci. Furthermore, within backreef pools, colonies in the warmest microclimates had a higher number and frequency of alternative alleles at candidate loci. These data suggest mild selection for alternate alleles at many loci in these corals during high heat episodes and possible maintenance of extensive polymorphism through multilocus balancing selection in a heterogeneous environment. In this case, a natural population harbors a reservoir of alleles preadapted to high temperatures, suggesting potential for future evolutionary response to climate change. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. Revealing phenotype-associated functional differences by genome-wide scan of ancient haplotype blocks

    PubMed Central

    Onuki, Ritsuko; Yamaguchi, Rui; Shibuya, Tetsuo; Kanehisa, Minoru; Goto, Susumu

    2017-01-01

    Genome-wide scans for positive selection have become important for genomic medicine, and many studies aim to find genomic regions affected by positive selection that are associated with risk allele variations among populations. Most such studies are designed to detect recent positive selection. However, we hypothesize that ancient positive selection is also important for adaptation to pathogens, and has affected current immune-mediated common diseases. Based on this hypothesis, we developed a novel linkage disequilibrium-based pipeline, which aims to detect regions associated with ancient positive selection across populations from single nucleotide polymorphism (SNP) data. By applying this pipeline to the genotypes in the International HapMap project database, we show that genes in the detected regions are enriched in pathways related to the immune system and infectious diseases. The detected regions also contain SNPs reported to be associated with cancers and metabolic diseases, obesity-related traits, type 2 diabetes, and allergic sensitization. These SNPs were further mapped to biological pathways to determine the associations between phenotypes and molecular functions. Assessments of candidate regions to identify functions associated with variations in incidence rates of these diseases are needed in the future. PMID:28445522

  4. A Hierarchical Feature and Sample Selection Framework and Its Application for Alzheimer’s Disease Diagnosis

    NASA Astrophysics Data System (ADS)

    An, Le; Adeli, Ehsan; Liu, Mingxia; Zhang, Jun; Lee, Seong-Whan; Shen, Dinggang

    2017-03-01

    Classification is one of the most important tasks in machine learning. Due to feature redundancy or outliers in samples, using all available data for training a classifier may be suboptimal. For example, the Alzheimer’s disease (AD) is correlated with certain brain regions or single nucleotide polymorphisms (SNPs), and identification of relevant features is critical for computer-aided diagnosis. Many existing methods first select features from structural magnetic resonance imaging (MRI) or SNPs and then use those features to build the classifier. However, with the presence of many redundant features, the most discriminative features are difficult to be identified in a single step. Thus, we formulate a hierarchical feature and sample selection framework to gradually select informative features and discard ambiguous samples in multiple steps for improved classifier learning. To positively guide the data manifold preservation process, we utilize both labeled and unlabeled data during training, making our method semi-supervised. For validation, we conduct experiments on AD diagnosis by selecting mutually informative features from both MRI and SNP, and using the most discriminative samples for training. The superior classification results demonstrate the effectiveness of our approach, as compared with the rivals.

  5. Genetic Dissection of End-Use Quality Traits in Adapted Soft White Winter Wheat

    PubMed Central

    Jernigan, Kendra L.; Godoy, Jayfred V.; Huang, Meng; Zhou, Yao; Morris, Craig F.; Garland-Campbell, Kimberly A.; Zhang, Zhiwu; Carter, Arron H.

    2018-01-01

    Soft white wheat is used in domestic and foreign markets for various end products requiring specific quality profiles. Phenotyping for end-use quality traits can be costly, time-consuming and destructive in nature, so it is advantageous to use molecular markers to select experimental lines with superior traits. An association mapping panel of 469 soft white winter wheat cultivars and advanced generation breeding lines was developed from regional breeding programs in the U.S. Pacific Northwest. This panel was genotyped on a wheat-specific 90 K iSelect single nucleotide polymorphism (SNP) chip. A total of 15,229 high quality SNPs were selected and combined with best linear unbiased predictions (BLUPs) from historical phenotypic data of the genotypes in the panel. Genome-wide association mapping was conducted using the Fixed and random model Circulating Probability Unification (FarmCPU). A total of 105 significant marker-trait associations were detected across 19 chromosomes. Potentially new loci for total flour yield, lactic acid solvent retention capacity, flour sodium dodecyl sulfate sedimentation and flour swelling volume were also detected. Better understanding of the genetic factors impacting end-use quality enable breeders to more effectively discard poor quality germplasm and increase frequencies of favorable end-use quality alleles in their breeding populations. PMID:29593752

  6. Association study of IL10, IL1beta, and IL1RN and schizophrenia using tag SNPs from a comprehensive database: suggestive association with rs16944 at IL1beta.

    PubMed

    Shirts, Brian H; Wood, Joel; Yolken, Robert H; Nimgaonkar, Vishwajit L

    2006-12-01

    Genetic association studies of several candidate cytokine genes have been motivated by evidence of immune dysfunction among patients with schizophrenia. Intriguing but inconsistent associations have been reported with polymorphisms of three positional candidate genes, namely IL1beta, IL1RN, and IL10. We used comprehensive sequencing data from the Seattle SNPs database to select tag SNPs that represent all common polymorphisms in the Caucasian population at these loci. Associations with 28 tag SNPs were evaluated in 478 cases and 501 unscreened control individuals, while accounting for population sub-structure using the genomic control method. The samples were also stratified by gender, diagnostic category, and exposure to infectious agents. Significant association was not detected after correcting for multiple comparisons. However, meta-analysis of our data combined with previously published association studies of rs16944 (IL1beta -511) suggests that the C allele confers modest risk for schizophrenia among individuals reporting Caucasian ancestry, but not Asians (Caucasians, n=819 cases, 1292 controls; p=0.0013, OR=1.24, 95% CI 1.09, 1.41).

  7. Polymorphisms in the SIRT5 gene and their association with body measurement and ultrasound traits in Qinchuan cattle.

    PubMed

    Gui, L S; Wang, H C; Liu, G Y; Zan, L S

    2015-04-22

    Silent information regulator 5 (SIRT5), a member of the Sirtuin family class III nicotinamide adenine dinucleotide-dependent protein deacetylases, plays an important role in metabolic and aging processes in mammals. We identified 4 single-nucleotide polymorphisms (SNPs) (G22010A, G22052A, G22119T, and G22245C) in the 3' untranslated regions of the SIRT5 gene from 572 Qinchuan cattle by sequencing and investigated their association with growth and ultrasound traits. The frequencies of genotype GG and allele G were high at the 4 SNPs. Based on the χ² test, the genotypic distributions of the 4 SNPs were not in Hardy-Weinberg equilibrium (P < 0.05 or P < 0.01). Association analysis of individual SNPs and haplotype combinations revealed that the 4 loci were significantly associated with some body measurement and ultrasound traits in Qinchuan cattle, and the H1H5 (AG-GA-GG-GG) diplotypes had better performance than other combinations in Qinchuan cattle. Our results demonstrate that SIRT5 may be a candidate for marker-assisted selection in future breeding programs for Qinchuan cattle.
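
The Hardy-Weinberg check mentioned in this abstract can be sketched as a one-degree-of-freedom chi-square goodness-of-fit test on genotype counts. The code below is an illustrative sketch only: the function name is hypothetical, the genotype counts are invented, and the study's own counts are not given in the abstract.

```python
import math


def hwe_chi_square(n_aa: int, n_ab: int, n_bb: int):
    """Chi-square test of Hardy-Weinberg proportions for one biallelic SNP.

    Returns (chi2, p_value); the p-value uses 1 degree of freedom.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2.0 * n)  # frequency of allele A
    q = 1.0 - p
    if p == 0.0 or q == 0.0:
        raise ValueError("locus is monomorphic")
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Invented counts: the first set matches HWE exactly, the second lacks heterozygotes.
print(hwe_chi_square(25, 50, 25))
print(hwe_chi_square(40, 20, 40))
```

A p-value below the chosen cutoff (0.05 or 0.01 in the abstract) is read as departure from Hardy-Weinberg equilibrium.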

  8. Effects of DGAT1 gene on meat and carcass fatness quality in Chinese commercial cattle.

    PubMed

    Yuan, Zhengrong; Li, Junya; Li, Jiao; Gao, Xue; Gao, Huijiang; Xu, Shangzhong

    2013-02-01

    This study was designed to investigate the candidate single nucleotide polymorphisms (SNPs) in the exon regions of the bovine diacylglycerol O-acyltransferase (DGAT1) gene using bioinformatics and experimental methods. A total of 17 SNPs were screened from public data resources and DNA sequencing. Three of these candidate SNPs (c.572A>G, c.1241C>T and c.1416T>G) were genotyped by created restriction site-polymerase chain reaction (CRS-PCR) methods. The gene-specific SNP markers and their effects on meat and carcass fatness quality traits were evaluated in Chinese commercial cattle. The c.572A>G and c.1416T>G SNPs significantly affected backfat thickness, longissimus muscle area, marbling score, fat color and Warner-Bratzler shear force. No significant association was detected between c.1241C>T and the measured traits. Results from this study suggested that the SNP markers may be effective for the marker-assisted selection of meat and carcass fatness quality traits, and added new evidence that the DGAT1 gene is an important candidate gene for the improvement of meat and carcass fatness quality in the beef cattle industry.

  9. Association of maternal weight with FADS and ELOVL genetic variants and fatty acid levels- The PREOBE follow-up

    PubMed Central

    de la Garza Puentes, Andrea; Montes Goyanes, Rosa; Chisaguano Tonato, Aida Maribel; Torres-Espínola, Francisco José; Arias García, Miriam; de Almeida, Leonor; Bonilla Aguirre, María; Guerendiain, Marcela; Castellote Bargalló, Ana Isabel; Segura Moreno, Maite; García-Valdés, Luz; Campoy, Cristina; Lopez-Sabater, M. Carmen

    2017-01-01

    Single nucleotide polymorphisms (SNPs) in the genes encoding the fatty acid desaturase (FADS) and elongase (ELOVL) enzymes affect long-chain polyunsaturated fatty acid (LC-PUFA) production. We aimed to determine if these SNPs are associated with body mass index (BMI) or affect fatty acids (FAs) in pregnant women. Participants (n = 180) from the PREOBE cohort were grouped according to pre-pregnancy BMI: normal-weight (BMI = 18.5–24.9, n = 88) and overweight/obese (BMI≥25, n = 92). Plasma samples were analyzed at 24 weeks of gestation to measure FA levels in the phospholipid fraction. Selected SNPs were genotyped (7 in FADS1, 5 in FADS2, 3 in ELOVL2 and 2 in ELOVL5). Minor allele carriers of rs174545, rs174546, rs174548 and rs174553 (FADS1), and rs1535 and rs174583 (FADS2) were nominally associated with an increased risk of having a BMI≥25. Only for the normal-weight group, minor allele carriers of rs174537, rs174545, rs174546, and rs174553 (FADS1) were negatively associated with AA:DGLA index. Normal-weight women who were minor allele carriers of FADS SNPs had lower levels of AA, AA:DGLA and AA:LA indexes, and higher levels of DGLA, compared to major homozygotes. Among minor allele carriers of FADS2 and ELOVL2 SNPs, overweight/obese women showed higher DHA:EPA index than the normal-weight group; however, they did not present higher DHA concentrations than the normal-weight women. In conclusion, minor allele carriers of FADS SNPs have an increased risk of obesity. Maternal weight changes the effect of genotype on FA levels. Only in the normal-weight group, minor allele carriers of FADS SNPs displayed reduced enzymatic activity and FA levels. This suggests that women with a BMI≥25 are less affected by FADS genetic variants in this regard. In the presence of FADS2 and ELOVL2 SNPs, overweight/obese women showed higher n-3 LC-PUFA production indexes than women with normal weight, but this was not enough to obtain a higher n-3 LC-PUFA concentration. PMID:28598979

  10. WASP: a Web-based Allele-Specific PCR assay designing tool for detecting SNPs and mutations

    PubMed Central

    Wangkumhang, Pongsakorn; Chaichoompu, Kridsadakorn; Ngamphiw, Chumpol; Ruangrit, Uttapong; Chanprasert, Juntima; Assawamakin, Anunchai; Tongsima, Sissades

    2007-01-01

    Background Allele-specific (AS) Polymerase Chain Reaction is a convenient and inexpensive method for genotyping Single Nucleotide Polymorphisms (SNPs) and mutations. It is applied in many recent studies including population genetics, molecular genetics and pharmacogenomics. Using known AS primer design tools to create primers is a cumbersome process for inexperienced users since information about SNP/mutation must be acquired from public databases prior to the design. Furthermore, most of these tools do not offer the mismatch enhancement to designed primers. The available web applications do not provide user-friendly graphical input interface and intuitive visualization of their primer results. Results This work presents a web-based AS primer design application called WASP. This tool can efficiently design AS primers for human SNPs as well as mutations. To assist scientists with collecting necessary information about target polymorphisms, this tool provides a local SNP database containing over 10 million SNPs of various populations from public domain databases, namely NCBI dbSNP, HapMap and JSNP respectively. This database is tightly integrated with the tool so that users can perform the design for existing SNPs without going off the site. To guarantee specificity of AS primers, the proposed system incorporates a primer specificity enhancement technique widely used in experiment protocol. In particular, WASP makes use of different destabilizing effects by introducing one deliberate 'mismatch' at the penultimate (second to last of the 3'-end) base of AS primers to improve the resulting AS primers. Furthermore, WASP offers a graphical user interface through scalable vector graphics (SVG) drawings that allow users to select SNPs and graphically visualize designed primers and their conditions. Conclusion WASP offers a tool for designing AS primers for both SNPs and mutations. By integrating the database for known SNPs (using gene ID or rs number), this tool facilitates the awkward process of getting flanking sequences and other related information from public SNP databases. It takes into account the underlying destabilizing effect to ensure the effectiveness of designed primers. With user-friendly SVG interface, WASP intuitively presents resulting designed primers, which assist users to export or to make further adjustment to the design. This software can be freely accessed at . PMID:17697334
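
The penultimate-base mismatch idea described in this abstract can be illustrated with a short sketch. This is not WASP's code: the function name, the default primer length and, in particular, the base-substitution rule are placeholder assumptions. WASP is described as choosing mismatches by destabilizing effect, and that selection table is not reproduced here.

```python
from typing import Dict, Sequence

# Placeholder substitution rule (an assumption, not WASP's mismatch table):
# any base that differs from the template-matching base creates a mismatch.
SUBSTITUTE = {"A": "C", "C": "A", "G": "T", "T": "G"}


def as_primers(upstream: str, alleles: Sequence[str], length: int = 20) -> Dict[str, str]:
    """Build one forward allele-specific primer per allele.

    upstream: sequence immediately 5' of the SNP, read 5'->3'
    alleles: the SNP alleles; each becomes the 3'-terminal base of a primer
    length: total primer length (hypothetical default)
    """
    upstream = upstream.upper()
    if len(upstream) < length - 1:
        raise ValueError("upstream flank is shorter than the primer body")
    body = upstream[-(length - 1):]              # bases ending right before the SNP
    mismatched = body[:-1] + SUBSTITUTE[body[-1]]  # alter the penultimate primer base
    return {allele.upper(): mismatched + allele.upper() for allele in alleles}


# Hypothetical flank and alleles.
primers = as_primers("ACGTACGTACGTACGTACGTACG", ("A", "G"))
for allele, primer in primers.items():
    print(allele, primer)
```

Each primer ends in its own allele, so the 3' end matches only the intended template, while the shared penultimate mismatch further destabilizes extension on the other allele.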

  11. Multi-Ancestral Analysis of Inflammation-Related Genetic Variants and C-Reactive Protein in the Population Architecture using Genomics and Epidemiology (PAGE) Study

    PubMed Central

    Kocarnik, Jonathan M.; Pendergrass, Sarah A.; Carty, Cara L.; Pankow, James S.; Schumacher, Fredrick R.; Cheng, Iona; Durda, Peter; Ambite, JoséLuis; Deelman, Ewa; Cook, Nancy R.; Liu, Simin; Wactawski-Wende, Jean; Hutter, Carolyn; Brown-Gentry, Kristin; Wilson, Sarah; Best, Lyle G.; Pankratz, Nathan; Hong, Ching-Ping; Cole, Shelley A.; Voruganti, V. Saroja; Bůžková, Petra; Jorgensen, Neal W.; Jenny, Nancy S.; Wilkens, Lynne R.; Haiman, Christopher A.; Kolonel, Laurence N.; LaCroix, Andrea; North, Kari; Jackson, Rebecca; Le Marchand, Loic; Hindorff, Lucia A.; Crawford, Dana C.; Gross, Myron; Peters, Ulrike

    2014-01-01

    Background C-reactive protein (CRP) is a biomarker of inflammation. Genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with CRP concentrations and inflammation-related traits such as cardiovascular disease, type 2 diabetes, and obesity. We aimed to replicate previous CRP-SNP associations, assess whether these associations generalize to additional race/ethnicity groups, and evaluate inflammation-related SNPs for a potentially pleiotropic association with CRP. Methods and Results We selected and analyzed 16 CRP-associated and 250 inflammation-related GWAS SNPs among 40,473 African American, American Indian, Asian/Pacific Islander, European American, and Hispanic participants from 7 studies collaborating in the Population Architecture using Genomics and Epidemiology (PAGE) study. Fixed-effect meta-analyses combined study-specific race/ethnicity-stratified linear regression estimates to evaluate the association between each SNP and high-sensitivity CRP. Overall, 18 SNPs in 8 loci were significantly associated with CRP (Bonferroni-corrected p<3.1×10−3 for replication, p<2.0×10−4 for pleiotropy): Seven of these were specific to European Americans, while 9 additionally generalized to African Americans (1), Hispanics (5), or both (3); 1 SNP was seen only in African Americans and Hispanics. Two SNPs in the CELSR2/PSRC1/SORT1 locus showed a potentially novel association with CRP: rs599839 (p=2.0×10−6) and rs646776 (p=3.1×10−5). Conclusions We replicated 16 SNP-CRP associations, 10 of which generalized to African Americans and/or Hispanics. We also identified potentially novel pleiotropic associations with CRP for two SNPs previously associated with coronary artery disease and LDL cholesterol. These findings demonstrate the benefit of evaluating genotype-phenotype associations in multiple race/ethnicity groups, and of looking for pleiotropic relationships among SNPs previously associated with related phenotypes. PMID:24622110
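
The two Bonferroni cutoffs quoted in this abstract are consistent with dividing a family-wise error rate by the number of SNPs tested in each set. The sketch below shows that arithmetic; the error rate of 0.05 is an assumption, since the abstract does not state it, and the function name is illustrative.

```python
def bonferroni_threshold(alpha: float, n_tests: int) -> float:
    """Per-test p-value cutoff that keeps the family-wise error rate at alpha."""
    if n_tests < 1:
        raise ValueError("n_tests must be positive")
    return alpha / n_tests


ALPHA = 0.05  # assumed family-wise error rate (not stated in the abstract)

replication = bonferroni_threshold(ALPHA, 16)   # 16 CRP-associated SNPs: 0.05 / 16
pleiotropy = bonferroni_threshold(ALPHA, 250)   # 250 inflammation-related SNPs: 0.05 / 250
print(replication, pleiotropy)
```

Under that assumption the results are 0.003125 and 0.0002, which agree with the reported p<3.1×10−3 and p<2.0×10−4.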

  12. Genetic modifiers of menopausal hormone replacement therapy and breast cancer risk: A genome-wide interaction study

    PubMed Central

    Rudolph, Anja; Hein, Rebecca; Lindström, Sara; Beckmann, Lars; Behrens, Sabine; Liu, Jianjun; Aschard, Hugues; Bolla, Manjeet K.; Wang, Jean; Truong, Thérèse; Cordina-Duverger, Emilie; Menegaux, Florence; Brüning, Thomas; Harth, Volker; Severi, Gianluca; Baglietto, Laura; Southey, Melissa; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Eriksson, Mikael; Humpreys, Keith; Darabi, Hatef; Olson, Janet E.; Stevens, Kristen N.; Vachon, Celine M.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Webb, Penny M.; Guénel, Pascal; Brauch, Hiltrud; Giles, Graham; García-Closas, Montserrat; Czene, Kamila; Chenevix-Trench, Georgia; Couch, Fergus J.; Andrulis, Irene L.; Swerdlow, Anthony; Hunter, David J.; Flesch-Janys, Dieter; Easton, Douglas F.; Hall, Per; Nevanlinna, Heli; Kraft, Peter; Chang-Claude, Jenny

    2013-01-01

    Women using menopausal hormone therapy (MHT) are at increased risk of developing breast cancer (BC). To detect genetic modifiers of the association between current use of MHT and BC risk, we conducted a meta-analysis of four genome-wide case-only studies followed by replication in eleven case-control studies. We used a case-only design to assess interactions between single nucleotide polymorphisms (SNPs) and current MHT use on risk of overall and lobular BC. The discovery stage included 2,920 cases (541 lobular) from four genome-wide association studies. The top 1,391 SNPs showing P-values for interaction (Pint) <3.0×10−03 were selected for replication using pooled case-control data from eleven studies of the Breast Cancer Association Consortium, including 7,689 cases (676 lobular) and 9,266 controls. Fixed effects meta-analysis was used to derive combined Pint. No SNP reached genome-wide significance in either the discovery or combined stage. We observed effect modification of current MHT use on overall BC risk by two SNPs on chr13 near POMP (combined Pint≤8.9×10−06), two SNPs in SLC25A21 (combined Pint≤4.8×10−05), and three SNPs in PLCG2 (combined Pint≤4.5×10−05). The association between current MHT use and lobular BC risk was potentially modified by one SNP in TMEFF2 (combined Pint≤2.7×10−05), one SNP in CD80 (combined Pint≤8.2×10−06), three SNPs on chr17 near TMEM132E (combined Pint≤2.2×10−06), and two SNPs on chr18 near SLC25A52 (combined Pint≤4.6×10−05). In conclusion, polymorphisms in genes related to solute transportation in mitochondria, transmembrane signaling and immune cell activation may modify the BC risk associated with current use of MHT. These findings warrant replication in independent studies. PMID:24080446
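
    The fixed-effects meta-analysis step in this record can be illustrated with a minimal inverse-variance sketch in Python. The abstract does not say which estimator or software was used, so the inverse-variance weighting, the function name `fixed_effect_meta`, and the per-study numbers below are assumptions made purely for illustration.

```python
import math

def fixed_effect_meta(estimates, std_errors):
    """Pool per-study effect estimates with inverse-variance weights."""
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided normal p-value
    return pooled, pooled_se, z, p_value

# Hypothetical log-odds-ratio interaction estimates from four studies
# (illustrative values only, not taken from the paper).
betas = [0.21, 0.15, 0.30, 0.18]
ses = [0.10, 0.12, 0.15, 0.09]
pooled, pooled_se, z, p_value = fixed_effect_meta(betas, ses)
print(f"pooled={pooled:.3f} se={pooled_se:.3f} z={z:.2f} p={p_value:.2e}")
```

    Studies with smaller standard errors receive larger weights, so the pooled estimate is pulled toward the most precise studies.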

  13. Multiancestral analysis of inflammation-related genetic variants and C-reactive protein in the population architecture using genomics and epidemiology study.

    PubMed

    Kocarnik, Jonathan M; Pendergrass, Sarah A; Carty, Cara L; Pankow, James S; Schumacher, Fredrick R; Cheng, Iona; Durda, Peter; Ambite, José Luis; Deelman, Ewa; Cook, Nancy R; Liu, Simin; Wactawski-Wende, Jean; Hutter, Carolyn; Brown-Gentry, Kristin; Wilson, Sarah; Best, Lyle G; Pankratz, Nathan; Hong, Ching-Ping; Cole, Shelley A; Voruganti, V Saroja; Bůžkova, Petra; Jorgensen, Neal W; Jenny, Nancy S; Wilkens, Lynne R; Haiman, Christopher A; Kolonel, Laurence N; Lacroix, Andrea; North, Kari; Jackson, Rebecca; Le Marchand, Loic; Hindorff, Lucia A; Crawford, Dana C; Gross, Myron; Peters, Ulrike

    2014-04-01

    C-reactive protein (CRP) is a biomarker of inflammation. Genome-wide association studies (GWAS) have identified single-nucleotide polymorphisms (SNPs) associated with CRP concentrations and inflammation-related traits such as cardiovascular disease, type 2 diabetes mellitus, and obesity. We aimed to replicate previous CRP-SNP associations, assess whether these associations generalize to additional race/ethnicity groups, and evaluate inflammation-related SNPs for a potentially pleiotropic association with CRP. We selected and analyzed 16 CRP-associated and 250 inflammation-related GWAS SNPs among 40 473 African American, American Indian, Asian/Pacific Islander, European American, and Hispanic participants from 7 studies collaborating in the Population Architecture using Genomics and Epidemiology (PAGE) study. Fixed-effect meta-analyses combined study-specific race/ethnicity-stratified linear regression estimates to evaluate the association between each SNP and high-sensitivity CRP. Overall, 18 SNPs in 8 loci were significantly associated with CRP (Bonferroni-corrected P<3.1×10(-3) for replication, P<2.0×10(-4) for pleiotropy): Seven of these were specific to European Americans, while 9 additionally generalized to African Americans (1), Hispanics (5), or both (3); 1 SNP was seen only in African Americans and Hispanics. Two SNPs in the CELSR2/PSRC1/SORT1 locus showed a potentially novel association with CRP: rs599839 (P=2.0×10(-6)) and rs646776 (P=3.1×10(-5)). We replicated 16 SNP-CRP associations, 10 of which generalized to African Americans and/or Hispanics. We also identified potentially novel pleiotropic associations with CRP for two SNPs previously associated with coronary artery disease and/or low-density lipoprotein-cholesterol. These findings demonstrate the benefit of evaluating genotype-phenotype associations in multiple race/ethnicity groups and looking for pleiotropic relationships among SNPs previously associated with related phenotypes.
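
    The two Bonferroni thresholds quoted in this record are consistent with dividing a family-wise alpha of 0.05 by the number of SNPs in each set (16 for replication, 250 for pleiotropy). The alpha of 0.05 and the one-test-per-SNP count are assumptions, since the abstract does not state them; the short Python check below shows the arithmetic under those assumptions.

```python
def bonferroni_threshold(alpha, n_tests):
    """Per-test significance threshold under a Bonferroni correction."""
    return alpha / n_tests

ALPHA = 0.05  # assumed family-wise error rate; not stated in the abstract

replication = bonferroni_threshold(ALPHA, 16)   # 16 CRP-associated SNPs
pleiotropy = bonferroni_threshold(ALPHA, 250)   # 250 inflammation-related SNPs

print(f"replication threshold: {replication:.1e}")
print(f"pleiotropy threshold:  {pleiotropy:.1e}")

# The two CELSR2/PSRC1/SORT1 p-values reported in the abstract.
for snp, p in {"rs599839": 2.0e-6, "rs646776": 3.1e-5}.items():
    print(snp, "passes pleiotropy threshold:", p < pleiotropy)
```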

  14. Association of Single-Nucleotide Polymorphisms of the Tau Gene With Late-Onset Parkinson Disease

    PubMed Central

    Martin, Eden R.; Scott, William K.; Nance, Martha A.; Watts, Ray L.; Hubble, Jean P.; Koller, William C.; Lyons, Kelly; Pahwa, Rajesh; Stern, Matthew B.; Colcher, Amy; Hiner, Bradley C.; Jankovic, Joseph; Ondo, William G.; Allen, Fred H.; Goetz, Christopher G.; Small, Gary W.; Masterman, Donna; Mastaglia, Frank; Laing, Nigel G.; Stajich, Jeffrey M.; Ribble, Robert C.; Booze, Michael W.; Rogala, Allison; Hauser, Michael A.; Zhang, Fengyu; Gibson, Rachel A.; Middleton, Lefkos T.; Roses, Allen D.; Haines, Jonathan L.; Scott, Burton L.; Pericak-Vance, Margaret A.; Vance, Jeffery M.

    2013-01-01

    Context The human tau gene, which promotes assembly of neuronal microtubules, has been associated with several rare neurologic diseases that clinically include parkinsonian features. We recently observed linkage in idiopathic Parkinson disease (PD) to a region on chromosome 17q21 that contains the tau gene. These factors make tau a good candidate for investigation as a susceptibility gene for idiopathic PD, the most common form of the disease. Objective To investigate whether the tau gene is involved in idiopathic PD. Design, Setting, and Participants Among a sample of 1056 individuals from 235 families selected from 13 clinical centers in the United States and Australia and from a family ascertainment core center, we tested 5 single-nucleotide polymorphisms (SNPs) within the tau gene for association with PD, using family-based tests of association. Both affected (n = 426) and unaffected (n = 579) family members were included; 51 individuals had unclear PD status. Analyses were conducted to test individual SNPs and SNP haplotypes within the tau gene. Main Outcome Measure Family-based tests of association, calculated using asymptotic distributions. Results Analysis of association between the SNPs and PD yielded significant evidence of association for 3 of the 5 SNPs tested: SNP 3, P = .03; SNP 9i, P = .04; and SNP 11, P = .04. The 2 other SNPs did not show evidence of significant association (SNP 9ii, P = .11, and SNP 9iii, P = .87). Strong evidence of association was found with haplotype analysis, with a positive association with one haplotype (P = .009) and a negative association with another haplotype (P = .007). Substantial linkage disequilibrium (P<.001) was detected between 4 of the 5 SNPs (SNPs 3,9i, 9ii, and 11). Conclusions This integrated approach of genetic linkage and positional association analyses implicates tau as a susceptibility gene for idiopathic PD. PMID:11710889

  15. Contribution of Selected Gene Mutations to Resistance in Clinical Isolates of Vancomycin-Intermediate Staphylococcus aureus

    PubMed Central

    Hafer, Cory; Lin, Ying; Kornblum, John; Lowy, Franklin D.

    2012-01-01

    Infections with vancomycin-intermediate Staphylococcus aureus (VISA) have been associated with vancomycin treatment failures and poor clinical outcomes. Routine identification of clinical isolates with increased vancomycin MICs remains challenging, and no molecular marker exists to aid in diagnosis of VISA strains. We tested vancomycin susceptibilities by using microscan, Etest, and population analyses in a collection of putative VISA, methicillin-resistant S. aureus, and methicillin-sensitive S. aureus (VSSA) infectious isolates from community- or hospital-associated S. aureus infections (n = 77) and identified 22 VISA and 9 heterogeneous VISA (hVISA) isolates. Sequencing of VISA candidate loci vraS, vraR, yvqF, graR, graS, walR, walK, and rpoB revealed a high diversity of nonsynonymous single-nucleotide polymorphisms (SNPs). For vraS, vraR, yvqF, walK, and rpoB, SNPs were more frequently present in VISA and hVISA than in VSSA isolates, whereas mutations in graR, graS, and walR were exclusively detected in VISA isolates. For each of the individual loci, SNPs were only detected in about half of the VISA isolates. All but one VISA isolate had at least one SNP in any of the genes sequenced, and isolates with an MIC of 6 or 8 μg/ml harbored at least 2 SNPs. Overall, increasing vancomycin MICs were paralleled by a higher proportion of isolates with SNPs. Depending on the clonal background, SNPs appeared to preferentially accumulate in vraS and vraR for sequence type 8 (ST8) and in walK and walR for ST5 isolates. Taken together, by comparing VISA, hVISA, and VSSA controls, we observed preferential clustering of SNPs in VISA candidate genes, with an unexpectedly high diversity across these loci. Our results support a polygenetic etiology of VISA. PMID:22948864

  16. Association of RBP4 genetic variants with childhood obesity and cardiovascular risk factors.

    PubMed

    Codoñer-Franch, Pilar; Carrasco-Luna, Joaquín; Allepuz, Paula; Codoñer-Alejos, Alan; Guillem, Vicent

    2016-12-01

    Recent data suggest that retinol-binding protein 4 (RBP4) gene variants could be associated with a risk of obesity and its co-morbidities, such as metabolic syndrome, which increases the risk of developing type 2 diabetes mellitus and cardiovascular disease. The present study examined the potential association of RBP4 single nucleotide polymorphisms (SNPs) with childhood obesity and its metabolic complications. Four RBP4 SNPs, rs3758538 (3944A>C), rs3758539 (4406G>A), rs12265684 (12177G>C) and rs34571439 (14684T>G), were genotyped in a population of 180 Spanish Caucasian children (97 obese and 83 normal-weight children). Association of RBP4 SNPs with obesity, metabolic risk factors (blood pressure, triglycerides, high-density lipoprotein cholesterol, insulin resistance) and markers of vascular inflammation, such as high-sensitivity C-reactive protein (hs-CRP), was tested. We found SNP rs3758538 to be associated with obesity (p = 0.007). Specifically, each copy of the minor allele C was associated with a more than twofold increase in the risk of obesity relative to being homozygous for the major allele A (odds ratio = 2.4; 95% confidence interval = 1.2-4.8). The rs3758538 and rs34571439 RBP4 SNPs correlated with plasma RBP4 levels. The SNPs rs12265684 and rs34571439 correlated with plasma triglyceride levels. The rs34571439 SNP was also associated with hs-CRP levels. Marginal association of RBP4 SNPs with plasma high-density lipoprotein levels (rs34571439), blood pressure (rs12265684) and insulin resistance (rs3758539) was also observed. These findings suggest that childhood obesity may be associated with variations in the RBP4 gene. The presence of selective SNPs in the RBP4 gene may account for metabolic complications. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
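
    As a consistency check on the reported odds ratio, the sketch below recovers the point estimate and the standard error of log(OR) implied by the 95% confidence interval. It assumes a Wald-type interval that is symmetric on the log scale, which the abstract does not state; the function name `log_or_summary` is hypothetical.

```python
import math

def log_or_summary(lower, upper, z_crit=1.959964):
    """Recover the point estimate and SE implied by a Wald-type 95% CI."""
    log_lo, log_hi = math.log(lower), math.log(upper)
    point = math.exp((log_lo + log_hi) / 2.0)   # geometric midpoint of the CI
    se = (log_hi - log_lo) / (2.0 * z_crit)     # standard error of log(OR)
    return point, se

point, se = log_or_summary(1.2, 4.8)
print(f"implied OR={point:.2f}, SE(log OR)={se:.3f}")
```

    The geometric midpoint of 1.2 and 4.8 is 2.4, which matches the reported odds ratio, so the interval is consistent with a log-scale construction.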

  17. Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio)

    PubMed Central

    2014-01-01

    Background A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. Results The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C. carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C. carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, which yielded between 53,526 and 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. Conclusions The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species. PMID:24762296
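
    The array statistics in this record can be checked with simple arithmetic. The Python sketch below reproduces the polymorphic fraction from the reported counts and computes the genome span implied by 250,000 SNPs at a mean spacing of 6.6 kb. The implied span is a derived figure, not one reported in the abstract.

```python
total_snps = 250_000
polymorphic = 185_150
mean_spacing_kb = 6.6

# Share of array SNPs found to be polymorphic.
polymorphic_pct = 100.0 * polymorphic / total_snps

# Span implied by evenly spaced SNPs, converted from kb to Gb.
covered_gb = total_snps * mean_spacing_kb / 1_000_000

print(f"{polymorphic_pct:.2f}% polymorphic")
print(f"~{covered_gb:.2f} Gb spanned at the mean spacing")
```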

  18. Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio).

    PubMed

    Xu, Jian; Zhao, Zixia; Zhang, Xiaofeng; Zheng, Xianhu; Li, Jiongtang; Jiang, Yanliang; Kuang, Youyi; Zhang, Yan; Feng, Jianxin; Li, Chuangju; Yu, Juhua; Li, Qiang; Zhu, Yuanyuan; Liu, Yuanyuan; Xu, Peng; Sun, Xiaowen

    2014-04-24

    A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C. carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C. carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, which yielded between 53,526 and 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species.

  19. The analysis of APOL1 genetic variation and haplotype diversity provided by 1000 Genomes project.

    PubMed

    Peng, Ting; Wang, Li; Li, Guisen

    2017-08-11

    Variants of the APOL1 gene have been shown to be associated with an increased risk of multiple kinds of diseases, particularly in African Americans, but not in Caucasians and Asians. In this study, we explored the single nucleotide polymorphism (SNP) and haplotype diversity of the APOL1 gene in different races using data from the 1000 Genomes Project. Variants of the APOL1 gene in the 1000 Genomes Project were obtained, and SNPs located in the regulatory region or coding region were selected for genetic variation analysis. A total of 2504 individuals from 26 populations were classified into four groups: African, European, Asian and Admixed populations. Tag SNPs were selected to evaluate haplotype diversity in the four groups using HaploStats software. The APOL1 gene is surrounded by some of the most polymorphic genes in the human genome, and variation in APOL1 was common, with up to 613 SNPs reported by the 1000 Genomes Project, 99 of which (16.2%) had MAF ≥ 1%. There were 79 SNPs in the URR and 92 SNPs in the 3'UTR; of these, 12 SNPs in the URR and 24 SNPs in the 3'UTR were considered common variants with MAF ≥ 1%. Notably, URR-1 was present at lower frequencies in European populations, while the other three haplotypes showed the opposite pattern. The 3'UTR contained several high-frequency variant sites in a short segment, and the differences in its haplotypes among populations were significant (P < 0.01): UTR-1 and UTR-5 were much more frequent in the African population, while UTR-2, UTR-3 and UTR-4 were much less frequent. In the coding region, the two higher-frequency SNPs of G1 lowered the frequency of haplotype H-1 when all populations were pooled, and the diversity among the four populations was widened by the two G1 mutations (P1 = 3.33E-4 vs P2 = 3.61E-30). The distributions of APOL1 gene variants and haplotypes differed significantly among populations in both the regulatory and coding regions. These results could provide clues for future genetic studies of APOL1-related diseases.
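
    The MAF ≥ 1% filter used in this record can be sketched as follows. The helper names and the example allele counts are hypothetical; only the sample size (2504 individuals) and the 99-of-613 tally come from the abstract, and the sketch assumes biallelic autosomal sites.

```python
def minor_allele_frequency(alt_count, total_alleles):
    """MAF is the frequency of the less common allele at a biallelic site."""
    alt_freq = alt_count / total_alleles
    return min(alt_freq, 1.0 - alt_freq)

def is_common(alt_count, total_alleles, threshold=0.01):
    """True when the site passes the MAF >= 1% filter."""
    return minor_allele_frequency(alt_count, total_alleles) >= threshold

# 2504 diploid individuals carry 2 * 2504 allele copies at an autosomal site.
total_alleles = 2 * 2504
print(is_common(60, total_alleles))  # hypothetical site with 60 alternate copies
print(is_common(10, total_alleles))  # hypothetical site with 10 alternate copies

# Share of reported SNPs that pass the filter, using the counts in the abstract.
print(f"{100 * 99 / 613:.1f}%")
```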

  20. Empirical Distributions of FST from Large-Scale Human Polymorphism Data

    PubMed Central

    Elhaik, Eran

    2012-01-01

    Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright’s FST, which apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high FST may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchical FST analysis showed that only a small fraction (12%) of the total genetic variation is distributed between continental populations and an even smaller fraction (1%) is found between intra-continental populations. Second, the global FST distribution closely follows an exponential distribution. Third, although the overall FST distribution is similarly shaped (inverse J), FST distributions vary markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean FST of these groups is linear in allele frequency. These results suggest that investigating the extremes of the FST distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection. PMID:23185452
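
    For readers unfamiliar with the statistic, the sketch below computes FST for a single biallelic SNP in the textbook heterozygosity form, FST = (HT − HS) / HT. The abstract does not specify which estimator the study used, so this form, the function name `wright_fst`, and the example allele frequencies are assumptions for illustration only.

```python
def wright_fst(freqs, sizes=None):
    """FST = (H_T - H_S) / H_T for one biallelic SNP across subpopulations.

    freqs: allele frequency of one allele in each subpopulation.
    sizes: optional subpopulation sizes used as weights (equal if omitted).
    """
    if sizes is None:
        sizes = [1] * len(freqs)
    total = sum(sizes)
    weights = [s / total for s in sizes]
    p_bar = sum(w * p for w, p in zip(weights, freqs))
    h_total = 2.0 * p_bar * (1.0 - p_bar)               # expected heterozygosity, pooled
    h_sub = sum(w * 2.0 * p * (1.0 - p) for w, p in zip(weights, freqs))
    if h_total == 0.0:
        return 0.0                                       # monomorphic site: no variation
    return (h_total - h_sub) / h_total

# Hypothetical allele frequencies in three populations.
print(f"{wright_fst([0.10, 0.15, 0.12]):.4f}")  # similar frequencies, low FST
print(f"{wright_fst([0.05, 0.50, 0.95]):.4f}")  # divergent frequencies, high FST
```

    Identical frequencies in every population give FST = 0, and fixation of different alleles in different populations gives FST = 1.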

  1. Empirical distributions of F(ST) from large-scale human polymorphism data.

    PubMed

    Elhaik, Eran

    2012-01-01

    Studies of the apportionment of human genetic variation have long established that most human variation is within population groups and that the additional variation between population groups is small but greatest when comparing different continental populations. These studies often used Wright's F(ST), which apportions the standardized variance in allele frequencies within and between population groups. Because local adaptations increase population differentiation, high F(ST) may be found at closely linked loci under selection and used to identify genes undergoing directional or heterotic selection. We re-examined these processes using HapMap data. We analyzed 3 million SNPs on 602 samples from eight worldwide populations and a consensus subset of 1 million SNPs found in all populations. We identified four major features of the data: First, a hierarchical F(ST) analysis showed that only a small fraction (12%) of the total genetic variation is distributed between continental populations and an even smaller fraction (1%) is found between intra-continental populations. Second, the global F(ST) distribution closely follows an exponential distribution. Third, although the overall F(ST) distribution is similarly shaped (inverse J), F(ST) distributions vary markedly by allele frequency when divided into non-overlapping groups by allele frequency range. Because the mean allele frequency is a crude indicator of allele age, these distributions mark the time-dependent change in genetic differentiation. Finally, the change in mean F(ST) of these groups is linear in allele frequency. These results suggest that investigating the extremes of the F(ST) distribution for each allele frequency group is more efficient for detecting selection. Consequently, we demonstrate that such extreme SNPs are more clustered along the chromosomes than expected from linkage disequilibrium for each allele frequency group. These genomic regions are therefore likely candidates for natural selection.

  2. SiNoPsis: Single Nucleotide Polymorphisms selection and promoter profiling.

    PubMed

    Boloc, Daniel; Rodríguez, Natalia; Gassó, Patricia; Abril, Josep F; Bernardo, Miquel; Lafuente, Amalia; Mas, Sergi

    2017-09-14

    The selection of a Single Nucleotide Polymorphism (SNP) using bibliographic methods can be a very time-consuming task. Moreover, a SNP selected in this way may not be easily visualized in its genomic context by a standard user hoping to correlate it with other valuable information. Here we propose a web form built on top of Circos that can assist SNP-centred screening, based on their location in the genome and the regulatory modules they can disrupt. Its use may allow researchers to prioritize SNPs in genotyping and disease studies. SiNoPsis is bundled as a web portal. It focuses on the different structures involved in the genomic expression of a gene, especially those found in the core promoter upstream region. These structures include transcription factor binding sites (for promoter and enhancer signals), histones, and promoter flanking regions. Additionally, the tool provides eQTL and linkage disequilibrium (LD) properties for a given SNP query, yielding further clues about other indirectly associated SNPs. Possible disruptions of the aforementioned structures affecting gene transcription are reported using multiple resource databases. SiNoPsis has a simple user-friendly interface, which allows single queries by gene symbol, genomic coordinates, Ensembl gene identifiers, RefSeq transcript identifiers and SNPs. It is the only portal providing useful SNP selection based on regulatory modules and LD with functional variants in both textual and graphic modes (by properly defining the arguments and parameters needed to run Circos). SiNoPsis is freely available at https://compgen.bio.ub.edu/SiNoPsis/. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  3. Evidence for pleiotropism and recent selection in the PLAG1 region in Australian Beef cattle.

    PubMed

    Fortes, M R S; Kemper, K; Sasazaki, S; Reverter, A; Pryce, J E; Barendse, W; Bunch, R; McCulloch, R; Harrison, B; Bolormaa, S; Zhang, Y D; Hawken, R J; Goddard, M E; Lehnert, S A

    2013-12-01

    A putative functional mutation (rs109231213) near PLAG1 (BTA14) associated with stature was studied in beef cattle. Data from 8199 Bos taurus, Bos indicus and Tropical Composite cattle were used to test the associations between rs109231213 and various phenotypes. Further, 23 496 SNPs located on BTA14 were tested for association with these phenotypes, both independently and fitted together with rs109231213. The C allele of rs109231213 significantly increased hip height, weight, net food intake, age at puberty in males and females and decreased IGF-I concentration in blood and fat depth. When rs109231213 was fitted as a fixed effect in the model, there was an overall reduction in associations between other SNPs and these traits but some SNPs remained associated (P < 10(-4)). Frequency of the mutant C allele of rs109231213 differed among B. indicus (0.52), B. taurus (0.96) and Tropical Composite (0.68). Most chromosomes carrying the C allele had the same surrounding 10 SNP haplotype, probably because the C allele was introgressed into Brahman from B. taurus cattle. A region of reduced heterozygosity surrounds the C allele; this is small in B. taurus but 20 Mb long in Brahmans, indicating recent and strong selection for the mutant allele. Thus, the C allele appears to mark a mutation that has been selected almost to fixation in the B. taurus breeds studied here and introduced into Brahman cattle during grading up and selected to a frequency of 0.52 despite its negative effects on fertility. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.

  4. Adaptations to climate-mediated selective pressures in sheep.

    PubMed

    Lv, Feng-Hua; Agha, Saif; Kantanen, Juha; Colli, Licia; Stucki, Sylvie; Kijas, James W; Joost, Stéphane; Li, Meng-Hua; Ajmone Marsan, Paolo

    2014-12-01

    Following domestication, sheep (Ovis aries) have become essential farmed animals across the world through adaptation to a diverse range of environments and varied production systems. Climate-mediated selective pressure has shaped phenotypic variation and has left genetic "footprints" in the genome of breeds raised in different agroecological zones. Unlike numerous studies that have searched for evidence of selection using only population genetics data, here, we conducted an integrated coanalysis of environmental data with single nucleotide polymorphism (SNP) variation. By examining 49,034 SNPs from 32 old, autochthonous sheep breeds that are adapted to a spectrum of different regional climates, we identified 230 SNPs with evidence for selection that is likely due to climate-mediated pressure. Among them, 189 (82%) showed significant correlation (P ≤ 0.05) between allele frequency and climatic variables in a larger set of native populations from a worldwide range of geographic areas and climates. Gene ontology analysis of genes colocated with significant SNPs identified 17 candidates related to GTPase regulator and peptide receptor activities in the biological processes of energy metabolism and endocrine and autoimmune regulation. We also observed high linkage disequilibrium and significant extended haplotype homozygosity for the core haplotype TBC1D12-CH1 of TBC1D12. The global frequency distribution of the core haplotype and allele OAR22_18929579-A showed an apparent geographic pattern and significant (P ≤ 0.05) correlations with climatic variation. Our results imply that adaptations to local climates have shaped the spatial distribution of some variants that are candidates to underpin adaptive variation in sheep. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
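
    The correlation between allele frequency and climatic variables described in this record can be illustrated with a plain Pearson coefficient. The abstract does not name the correlation statistic, so Pearson's r is an assumption here, and the per-breed values are hypothetical toy data rather than results from the study.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical per-breed values: allele frequency and mean annual temperature (deg C).
allele_freq = [0.10, 0.25, 0.40, 0.55, 0.70]
mean_temp_c = [2.0, 8.0, 13.0, 19.0, 26.0]
print(f"r = {pearson_r(allele_freq, mean_temp_c):.3f}")

# Share of the 230 selected SNPs that correlated with climate, from the abstract.
print(f"{100 * 189 / 230:.0f}%")
```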

  5. SNPs in PTGS2 and LTA Predict Pain and Quality of Life in Long Term Lung Cancer Survivors

    PubMed Central

    Rausch, Sarah M.; Gonzalez, Brian D.; Clark, Matthew M.; Patten, Christi; Felten, Sara; Liu, Heshan; Li, Yafei; Sloan, Jeff; Yang, Ping

    2015-01-01

    PURPOSE Lung cancer survivors report the lowest quality of life relative to other cancer survivors. Pain is one of the most devastating, persistent, and incapacitating symptoms for lung cancer survivors. Prevalence rates vary, with 80–100% of survivors experiencing cancer pain, and healthcare costs are five times higher in cancer survivors with uncontrolled pain. Cancer pain often has a considerable impact on quality of life among cancer patients and cancer survivors. Therefore, early identification and treatment are important. Although recent studies have suggested a relationship between single nucleotide polymorphisms (SNPs) in several cytokine and inflammation genes with cancer prognosis, associations with cancer pain are not clear. Therefore, the primary aim of this study was to identify SNPs related to pain in long term lung cancer survivors. PATIENTS AND METHODS Participants were enrolled in the Mayo Clinic Lung Cancer Cohort upon diagnosis of their lung cancer. 1149 Caucasian lung cancer survivors (440 surviving < 3 years; 354 surviving 3–5 years; and 355 surviving > 5 years) completed study questionnaires and had genetic samples available. Ten SNPs from PTGS2 and LTA genes were selected based on the serum literature. Outcomes included pain and quality of life as measured by the SF-8. RESULTS Of the 10 SNPs evaluated in LTA and PTGS2 genes, 3 were associated with pain severity (rs5277; rs1799964), social function (rs5277) and mental health (rs5275). These results suggested both specificity and consistency of these inflammatory gene SNPs in predicting pain severity in long term lung cancer survivors. CONCLUSION These results provide support for genetic predisposition to pain severity and may aid in identification of lung cancer survivors at high risk for morbidity and poor QOL. PMID:22464751

  6. Involvement of PTPN5, the gene encoding the STriatal-Enriched protein tyrosine Phosphatase (STEP), in schizophrenia and cognition

    PubMed Central

    Pelov, Ilana; Teltsh, Omri; Greenbaum, Lior; Rigbi, Amihai; Kanyas-Sarner, Kyra; Lerer, Bernard; Lombroso, Paul; Kohn, Yoav

    2013-01-01

    Objective STriatal-Enriched protein tyrosine Phosphatase (STEP) is a brain-specific member of the PTP family that has been implicated in learning and memory. In this study, we examined the association of the PTPN5 (protein-tyrosine-phosphatase non-receptor 5) gene, which encodes STEP, with both schizophrenia and cognitive functioning in the Israeli Jewish population. Methods A schizophrenia (SZ) case-control study of 868 subjects was performed (286 cases and 582 controls). Eleven STEP tagging SNPs were selected, and single-marker and haplotype association analyses were performed. A cognitive variability study included 437 healthy females who completed a computerized cognitive battery. We performed univariate associations between the SNPs and cognitive performance. The possible functional role of these variants was examined by studying their association with gene expression levels in the brain. Results In the SZ study, we found nominal association in the whole sample between rs4075664 and SZ. SZ males showed a more significant association for 3 SNPs (rs4075664, rs2278732, rs4757710). Haplotypes of the studied SNPs were associated with SZ both in the overall sample and within the male sub-sample. Expression analysis provided some support for the effects of the associated SNPs on PTPN5 expression level. The cognitive variability study showed positive associations between PTPN5 SNPs and different cognitive subtests. Principal component analysis demonstrated an “Attention Index” neurocognitive component that was associated with two SNP pairs (rs10832983*rs10766504 and rs7932938*rs4757718). Conclusion The results imply a model in which PTPN5 may play a role in normal cognitive functioning and contribute to aspects of the neuropathology of schizophrenia. PMID:22555153

  7. A novel Markov Blanket-based repeated-fishing strategy for capturing phenotype-related biomarkers in big omics data.

    PubMed

    Li, Hongkai; Yuan, Zhongshang; Ji, Jiadong; Xu, Jing; Zhang, Tao; Zhang, Xiaoshuai; Xue, Fuzhong

    2016-03-09

    We propose a novel Markov Blanket-based repeated-fishing strategy (MBRFS) in an attempt to increase the power of the existing Markov Blanket method (DASSO-MB) while maintaining its advantages in omics data analysis. Both simulation and real data analyses were conducted to assess its performance by comparison with other methods, including the χ² test with Bonferroni and B-H adjustment, the least absolute shrinkage and selection operator (LASSO), and DASSO-MB. A series of simulation studies showed that the true discovery rate (TDR) of the proposed MBRFS was always close to zero under the null hypothesis (odds ratio = 1 for each SNP), with excellent stability in all three scenarios: independent phenotype-related SNPs without linkage disequilibrium (LD) around them, correlated phenotype-related SNPs without LD around them, and phenotype-related SNPs with strong LD around them. As expected, under different odds ratios and minor allele frequencies (MAFs), MBRFS always had the best performance in capturing the true phenotype-related biomarkers, with a higher Matthews correlation coefficient (MCC) in all three scenarios. More importantly, because the proposed MBRFS uses the repeated-fishing strategy, it still captures more phenotype-related SNPs with minor effects when phenotype-related SNPs become non-significant under the χ² test after Bonferroni multiple correction. The various real omics data analyses, including GWAS data, DNA methylation data, gene expression data and metabolite data, indicated that the proposed MBRFS always detected relatively reasonable biomarkers. The proposed MBRFS can accurately capture the true phenotype-related biomarkers with a reduced false negative rate when the phenotype-related biomarkers are independent or correlated, as well as when phenotype-related biomarkers are associated with non-phenotype-related ones.
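
    The comparison methods and the evaluation metric named in this abstract are standard and can be sketched briefly. The Python below is an illustrative sketch only, not the authors' code: it shows Bonferroni and Benjamini-Hochberg (B-H) p-value adjustment and the Matthews correlation coefficient. All function names are hypothetical, and the MBRFS procedure itself is not reproduced because the abstract does not specify it.

```python
import math


def bonferroni(pvalues):
    """Bonferroni-adjusted p-values: multiply by the number of tests, cap at 1."""
    m = len(pvalues)
    return [min(1.0, p * m) for p in pvalues]


def benjamini_hochberg(pvalues):
    """Benjamini-Hochberg adjusted p-values (step-up FDR procedure)."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value to the smallest so that the adjusted
    # values stay monotone in the rank of the raw p-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def matthews_corrcoef(tp, fp, tn, fn):
    """MCC from confusion-matrix counts; returns 0.0 when it is undefined."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


if __name__ == "__main__":
    raw = [0.01, 0.04, 0.03, 0.005]  # made-up p-values for illustration
    print(bonferroni(raw))
    print(benjamini_hochberg(raw))
    print(matthews_corrcoef(tp=5, fp=0, tn=5, fn=0))
```

    Bonferroni controls the family-wise error rate and is the more conservative of the two adjustments, which is consistent with the abstract's remark that small-effect SNPs can become non-significant after Bonferroni correction.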

  8. Testing of diabetes-associated WFS1 polymorphisms in the Diabetes Prevention Program

    PubMed Central

    Florez, J. C.; Jablonski, K. A.; McAteer, J.; Sandhu, M. S.; Wareham, N. J.; Barroso, I.; Franks, P. W.; Altshuler, D.; Knowler, W. C.

    2008-01-01

    Aims/hypothesis: Wolfram syndrome (diabetes insipidus, diabetes mellitus, optic atrophy and deafness) is caused by mutations in the WFS1 gene. Recently, single nucleotide polymorphisms (SNPs) in WFS1 have been reproducibly associated with type 2 diabetes. We therefore examined the effects of these variants on diabetes incidence and response to interventions in the Diabetes Prevention Program (DPP), in which a lifestyle intervention or metformin treatment was compared with placebo. Methods: We genotyped the WFS1 SNPs rs10010131, rs752854 and rs734312 (H611R) in 3,548 DPP participants and performed Cox regression analysis using genotype, intervention and their interactions as predictors of diabetes incidence. We also evaluated the effect of these SNPs on insulin resistance and beta cell function at 1 year. Results: Although none of the three SNPs was associated with diabetes incidence in the overall cohort, white homozygotes for the previously reported protective alleles appeared less likely to develop diabetes in the lifestyle arm. Examination of the publicly available Diabetes Genetics Initiative genome-wide association dataset revealed that rs10012946, which is in strong linkage disequilibrium with the three WFS1 SNPs (r² = 0.88–1.0), was associated with type 2 diabetes (allelic odds ratio 0.85, 95% CI 0.75–0.97, p = 0.026). In the DPP, we noted a trend towards increased insulin secretion in carriers of the protective variants, although for most SNPs this was seen as compensatory for the diminished insulin sensitivity. Conclusions/interpretation: The previously reported protective effect of select WFS1 alleles may be magnified by a lifestyle intervention. These variants appear to confer an improvement in beta cell function. PMID:18060660

  9. Genome-Wide Association of CKD Progression: The Chronic Renal Insufficiency Cohort Study.

    PubMed

    Parsa, Afshin; Kanetsky, Peter A; Xiao, Rui; Gupta, Jayanta; Mitra, Nandita; Limou, Sophie; Xie, Dawei; Xu, Huichun; Anderson, Amanda Hyre; Ojo, Akinlolu; Kusek, John W; Lora, Claudia M; Hamm, L Lee; He, Jiang; Sandholm, Niina; Jeff, Janina; Raj, Dominic E; Böger, Carsten A; Bottinger, Erwin; Salimi, Shabnam; Parekh, Rulan S; Adler, Sharon G; Langefeld, Carl D; Bowden, Donald W; Groop, Per-Henrik; Forsblom, Carol; Freedman, Barry I; Lipkowitz, Michael; Fox, Caroline S; Winkler, Cheryl A; Feldman, Harold I

    2017-03-01

    The rate of decline of renal function varies significantly among individuals with CKD. To understand better the contribution of genetics to CKD progression, we performed a genome-wide association study among participants in the Chronic Renal Insufficiency Cohort Study. Our outcome of interest was CKD progression measured as change in eGFR over time among 1331 blacks and 1476 whites with CKD. We stratified all analyses by race and subsequently, diabetes status. Single-nucleotide polymorphisms (SNPs) that surpassed a significance threshold of P < 1×10⁻⁶ for association with eGFR slope were selected as candidates for follow-up and secondarily tested for association with proteinuria and time to ESRD. We identified 12 such SNPs among black patients and six such SNPs among white patients. We were able to conduct follow-up analyses of three candidate SNPs in similar (replication) cohorts and eight candidate SNPs in phenotype-related (validation) cohorts. Among blacks without diabetes, rs653747 in LINC00923 replicated in the African American Study of Kidney Disease and Hypertension cohort (discovery P = 5.42×10⁻⁷; replication P = 0.039; combined P = 7.42×10⁻⁹). This SNP also associated with ESRD (hazard ratio, 2.0 (95% confidence interval, 1.5 to 2.7); P = 4.90×10⁻⁶). Similarly, rs931891 in LINC00923 associated with eGFR decline (P = 1.44×10⁻⁴) in white patients without diabetes. In summary, SNPs in LINC00923, an RNA gene expressed in the kidney, significantly associated with CKD progression in individuals with nondiabetic CKD. However, the lack of equivalent cohorts hampered replication for most discovery loci. Further replication of our findings in comparable study populations is warranted. Copyright © 2017 by the American Society of Nephrology.

  10. Systematic investigation of the relationship between high myopia and polymorphisms of the MMP2, TIMP2, and TIMP3 genes by a DNA pooling approach.

    PubMed

    Leung, Kim Hung; Yiu, Wai Chi; Yap, Maurice K H; Ng, Po Wah; Fung, Wai Yan; Sham, Pak Chung; Yip, Shea Ping

    2011-06-01

    This study examined the relationship between high myopia and three myopia candidate genes--matrix metalloproteinase 2 (MMP2) and tissue inhibitor of metalloproteinase-2 and -3 (TIMP2 and TIMP3)--involved in scleral remodeling. Recruited for the study were unrelated adult Han Chinese who were high myopes (spherical equivalent, ≤ -6.0 D in both eyes; cases) and emmetropes (within ±1.0 D in both eyes; controls). Sample set 1 had 300 cases and 300 controls, and sample set 2 had 356 cases and 354 controls. Forty-nine tag single-nucleotide polymorphisms (SNPs) were selected from these candidate genes. The first stage was an initial screen of six case pools and six control pools constructed from sample set 1, each pool consisting of 50 distinct subjects of the same affection status. In the second stage, positive SNPs from the first stage were confirmed by genotyping individual samples forming the DNA pools. In the third stage, positive SNPs from stage 2 were replicated, with sample set 2 genotyped individually. Of the 49 SNPs screened by DNA pooling, three passed the lenient threshold of P < 0.10 (nested ANOVA) and were followed up by individual genotyping. Of the three SNPs genotyped, two TIMP3 SNPs were found to be significantly associated with high myopia by single-marker or haplotype analysis. However, the initial positive results could not be replicated in sample set 2. MMP2, TIMP2, and TIMP3 genes were not associated with high myopia in this Chinese sample and hence are unlikely to play a major role in the genetic susceptibility to high myopia.

  11. Identification of type 2 diabetes-associated combination of SNPs using support vector machine.

    PubMed

    Ban, Hyo-Jeong; Heo, Jee Yeon; Oh, Kyung-Soo; Park, Keun-Joon

    2010-04-23

    Type 2 diabetes mellitus (T2D), a metabolic disorder characterized by insulin resistance and relative insulin deficiency, is a complex disease of major public health importance. Its incidence is rapidly increasing in developed countries. Complex diseases are caused by interactions between multiple genes and environmental factors. Most association studies aim to identify individual susceptibility markers using a simple disease model. Recent studies have tried to estimate the effects of multiple genes and multiple loci in genome-wide association. However, estimating these effects is very difficult. We aim to assess the rules for classifying diseased and normal subjects by evaluating potential gene-gene interactions in the same or distinct biological pathways. We analyzed the importance of gene-gene interactions in T2D susceptibility by investigating 408 single nucleotide polymorphisms (SNPs) in 87 genes involved in major T2D-related pathways in 462 T2D patients and 456 healthy controls from the Korean cohort studies. We evaluated the support vector machine (SVM) method to differentiate between cases and controls using SNP information in a 10-fold cross-validation test. We achieved a 65.3% prediction rate with a combination of 14 SNPs in 12 genes by using the radial basis function (RBF)-kernel SVM. Similarly, we investigated subpopulation data sets of men and women and identified different SNP combinations with prediction rates of 70.9% and 70.6%, respectively. As high-throughput technology for genome-wide SNPs improves, it is likely that a much higher prediction rate with a biologically more interesting combination of SNPs can be achieved using this method. The SVM-based feature selection method used in this research found a novel association between combinations of SNPs and T2D in a Korean population.
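
    The 10-fold cross-validation test mentioned above can be sketched in a few lines. This is a minimal illustration of the evaluation loop only, assuming a shuffled (non-stratified) split. The classifier is passed in as two callables, and the majority-class placeholder used here is NOT an SVM; it merely stands in for whatever model is being evaluated. All names are hypothetical.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    if n_samples < k:
        raise ValueError("need at least k samples")
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def cross_validate(features, labels, fit, predict, k=10, seed=0):
    """Return the mean held-out accuracy over k folds."""
    folds = k_fold_indices(len(labels), k, seed)
    accuracies = []
    for held_out in folds:
        held = set(held_out)
        train = [i for i in range(len(labels)) if i not in held]
        model = fit([features[i] for i in train], [labels[i] for i in train])
        correct = sum(predict(model, features[i]) == labels[i] for i in held_out)
        accuracies.append(correct / len(held_out))
    return sum(accuracies) / k


# Placeholder classifier (assumption): always predicts the majority class.
def fit_majority(train_x, train_y):
    return max(set(train_y), key=train_y.count)


def predict_majority(model, x):
    return model


if __name__ == "__main__":
    # Same group sizes as the abstract (462 cases, 456 controls), dummy features.
    labels = [1] * 462 + [0] * 456
    features = [[0]] * len(labels)
    print(cross_validate(features, labels, fit_majority, predict_majority))
```

    A majority-class baseline of this kind scores close to 50% on nearly balanced groups, which gives a reference point for reading the reported prediction rates.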

  12. Genomic prediction of piglet response to infection with one of two porcine reproductive and respiratory syndrome virus isolates.

    PubMed

    Waide, Emily H; Tuggle, Christopher K; Serão, Nick V L; Schroyen, Martine; Hess, Andrew; Rowland, Raymond R R; Lunney, Joan K; Plastow, Graham; Dekkers, Jack C M

    2018-02-01

    Genomic prediction of the pig's response to the porcine reproductive and respiratory syndrome (PRRS) virus (PRRSV) would be a useful tool in the swine industry. This study investigated the accuracy of genomic prediction based on porcine SNP60 Beadchip data using training and validation datasets from populations with different genetic backgrounds that were challenged with different PRRSV isolates. Genomic prediction accuracy averaged 0.34 for viral load (VL) and 0.23 for weight gain (WG) following experimental PRRSV challenge, which demonstrates that genomic selection could be used to improve response to PRRSV infection. Training on WG data during infection with a less virulent PRRSV, KS06, resulted in poor accuracy of prediction for WG during infection with a more virulent PRRSV, NVSL. Inclusion of single nucleotide polymorphisms (SNPs) that are in linkage disequilibrium with a major quantitative trait locus (QTL) on chromosome 4 was vital for accurate prediction of VL. Overall, SNPs that were significantly associated with either trait in single SNP genome-wide association analysis were unable to predict the phenotypes with an accuracy as high as that obtained by using all genotyped SNPs across the genome. Inclusion of data from close relatives into the training population increased whole genome prediction accuracy by 33% for VL and by 37% for WG but did not affect the accuracy of prediction when using only SNPs in the major QTL region. Results show that genomic prediction of response to PRRSV infection is moderately accurate and, when using all SNPs on the porcine SNP60 Beadchip, is not very sensitive to differences in virulence of the PRRSV in training and validation populations. Including close relatives in the training population increased prediction accuracy when using the whole genome or SNPs other than those near a major QTL.

  13. LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs

    PubMed Central

    Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong

    2009-01-01

    Background: Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results: We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotyping is available for all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r², δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion: LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
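
    Three of the LD statistics listed above (D, D' and r²) follow directly from haplotype and allele frequencies at two biallelic loci. The sketch below uses the textbook definitions and is not taken from LD2SNPing; the function name and the example frequencies are hypothetical, and D' is returned as an absolute value.

```python
def ld_stats(p_ab, p_a, p_b):
    """Return (D, |D'|, r^2) for two biallelic loci.

    p_ab : frequency of the haplotype carrying allele A at locus 1 and B at locus 2
    p_a  : frequency of allele A at locus 1
    p_b  : frequency of allele B at locus 2
    """
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b
    # D is bounded by the allele frequencies; the bound depends on its sign.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = abs(d) / d_max if d_max > 0 else 0.0
    r2 = d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))
    return d, d_prime, r2


if __name__ == "__main__":
    print(ld_stats(p_ab=0.5, p_a=0.5, p_b=0.5))    # complete LD
    print(ld_stats(p_ab=0.25, p_a=0.5, p_b=0.5))   # linkage equilibrium
    print(ld_stats(p_ab=0.30, p_a=0.4, p_b=0.6))   # intermediate LD
```

    D' rescales D by its maximum attainable value given the allele frequencies, whereas r² is the squared correlation between the two loci, so the two can differ widely when allele frequencies are unequal.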

  14. Genetic analysis of the Yavapai Native Americans from West-Central Arizona using the Illumina MiSeq FGx™ forensic genomics system.

    PubMed

    Wendt, Frank R; Churchill, Jennifer D; Novroski, Nicole M M; King, Jonathan L; Ng, Jillian; Oldt, Robert F; McCulloh, Kelly L; Weise, Jessica A; Smith, David Glenn; Kanthaswamy, Sreetharan; Budowle, Bruce

    2016-09-01

    Forensically-relevant genetic markers were typed for sixty-two Yavapai Native Americans using the ForenSeq™ DNA Signature Prep Kit. These data are invaluable to the human identity community due to the greater genetic differentiation among Native American tribes than among other subdivisions within major populations of the United States. Autosomal, X-chromosomal, and Y-chromosomal short tandem repeat (STR) and identity-informative (iSNPs), ancestry-informative (aSNPs), and phenotype-informative (pSNPs) single nucleotide polymorphism (SNP) allele frequencies are reported. Sequence-based allelic variants were observed in 13 autosomal, 3 X, and 3 Y STRs. These observations increased observed and expected heterozygosities for autosomal STRs by 0.081±0.068 and 0.073±0.063, respectively, and decreased single-locus random match probabilities by 0.051±0.043 for 13 autosomal STRs. The autosomal random match probabilities (RMPs) were 2.37×10⁻²⁶ and 2.81×10⁻²⁹ for length-based and sequence-based alleles, respectively. There were 22 and 25 unique Y-STR haplotypes among 26 males, generating haplotype diversities of 0.95 and 0.96, for length-based and sequence-based alleles, respectively. Of the 26 haplotypes generated, 17 were assigned to haplogroup Q, three to haplogroup R1b, two each to haplogroups E1b1b and L, and one each to haplogroups R1a and I1. Male and female sequence-based X-STR random match probabilities were 3.28×10⁻⁷ and 1.22×10⁻⁶, respectively. The average observed and expected heterozygosities for 94 iSNPs were 0.39±0.12 and 0.39±0.13, respectively, and the combined iSNP RMP was 1.08×10⁻³². The combined STR and iSNP RMPs were 2.55×10⁻⁵⁸ and 3.02×10⁻⁶¹ for length-based and sequence-based STR alleles, respectively. Ancestry and phenotypic SNP information, performed using the ForenSeq™ Universal Analysis Software, predicted black hair, brown eyes, and some probability of East Asian ancestry for all but one sample that clustered between European and Admixed American ancestry on a principal components analysis. These data serve as the first population assessment using the ForenSeq™ panel and highlight the value of employing sequence-based alleles for forensic DNA typing to increase heterozygosity, which is beneficial for identity testing in populations with reduced genetic diversity. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
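
    The combined random match probabilities quoted in this abstract behave like products of the per-panel values, which is what one would expect if the marker sets are treated as independent. The sketch below illustrates that multiplication using the rounded figures from the abstract. The function names are hypothetical, and the independence assumption is stated here for illustration rather than taken from the authors' methods.

```python
import math


def combined_rmp(rmps):
    """Multiply random match probabilities, assuming the markers are independent."""
    return math.prod(rmps)


def combined_log10_rmp(rmps):
    """Same product on the log10 scale, which avoids underflow for many loci."""
    return sum(math.log10(p) for p in rmps)


if __name__ == "__main__":
    autosomal_str_length = 2.37e-26    # length-based STR alleles (abstract)
    autosomal_str_sequence = 2.81e-29  # sequence-based STR alleles (abstract)
    isnp = 1.08e-32                    # combined iSNP RMP (abstract)

    print(f"{combined_rmp([autosomal_str_length, isnp]):.2e}")
    print(f"{combined_rmp([autosomal_str_sequence, isnp]):.2e}")
    print(round(combined_log10_rmp([autosomal_str_length, isnp]), 2))
```

    The products of the rounded inputs come out within about 1% of the reported combined values (2.55×10⁻⁵⁸ and 3.02×10⁻⁶¹); the small differences are plausibly due to rounding of the published per-panel figures.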

  15. Genomic Variability of Serial Human Isolates of Salmonella enterica Serovar Typhimurium Associated with Prolonged Carriage.

    PubMed

    Octavia, Sophie; Wang, Qinning; Tanaka, Mark M; Sintchenko, Vitali; Lan, Ruiting

    2015-11-01

    Salmonella enterica serovar Typhimurium is an important foodborne human pathogen that often causes self-limiting but severe gastroenteritis. Prolonged excretion of S. Typhimurium after the infection can lead to secondary transmissions. However, little is known about within-host genomic variation in bacteria associated with asymptomatic shedding. Genomes of 35 longitudinal isolates of S. Typhimurium recovered from 11 patients (children and adults) with culture-confirmed gastroenteritis were sequenced. There were three or four isolates obtained from each patient. Single nucleotide polymorphisms (SNPs) were analyzed in these isolates, which were recovered between 1 and 279 days after the initial diagnosis. Limited genomic variation (5 SNPs or fewer) was associated with short- and long-term carriage of S. Typhimurium. None of the isolates was shown to be due to reinfection. SNPs occurred randomly, and the majority of the SNPs were nonsynonymous. Two nonsense mutations were observed. A nonsense mutation in flhC rendered the isolate nonmotile, whereas the significance of a nonsense mutation in yihV is unknown. The estimated mutation rate is 1.49 × 10⁻⁶ substitutions per site per year. S. Typhimurium isolates excreted in stools following acute gastroenteritis in children and adults demonstrated limited genomic variability over time, regardless of the duration of carriage. These findings have important implications for the detection of possible transmission events suspected by public health genomic surveillance of S. Typhimurium infections. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
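
    A per-site, per-year rate is easier to interpret once converted to expected substitutions per genome over a sampling interval. The sketch below does that conversion. The genome size of roughly 4.9 million base pairs is an assumption introduced here for illustration and is not given in the abstract; the function name is likewise hypothetical.

```python
def expected_substitutions(rate_per_site_per_year, genome_size_bp, days):
    """Expected number of substitutions accumulated genome-wide over a period."""
    years = days / 365.25
    return rate_per_site_per_year * genome_size_bp * years


if __name__ == "__main__":
    RATE = 1.49e-6              # substitutions per site per year (abstract)
    GENOME_SIZE_BP = 4_900_000  # ASSUMPTION: approximate genome size
    for days in (1, 30, 279):   # 1 and 279 days span the sampling window
        value = expected_substitutions(RATE, GENOME_SIZE_BP, days)
        print(days, round(value, 2))
```

    Under that assumed genome size, the rate corresponds to roughly five to six substitutions over the longest interval of 279 days, the same order as the "5 SNPs or fewer" observed.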

  16. A large-scale assessment of two-way SNP interactions in breast cancer susceptibility using 46 450 cases and 42 461 controls from the breast cancer association consortium

    PubMed Central

    Milne, Roger L.; Herranz, Jesús; Michailidou, Kyriaki; Dennis, Joe; Tyrer, Jonathan P.; Zamora, M. Pilar; Arias-Perez, José Ignacio; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Wang, Qin; Bolla, Manjeet K.; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Li, Jingmei; Anton-Culver, Hoda; Neuhausen, Susan L.; Ziogas, Argyrios; Clarke, Christina A.; Hopper, John L.; Dite, Gillian S.; Apicella, Carmel; Southey, Melissa C.; Chenevix-Trench, Georgia; Swerdlow, Anthony; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Bojesen, Stig E.; Nordestgaard, Børge G.; Flyger, Henrik; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Wang, Xianshu; Olson, Janet E.; Vachon, Celine; Purrington, Kristen; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Dunning, Alison M.; Shah, Mitul; Guénel, Pascal; Truong, Thérèse; Sanchez, Marie; Mulot, Claire; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J.; Hollestelle, Antoinette; Collée, J. Margriet; Jager, Agnes; Cox, Angela; Brock, Ian W.; Reed, Malcolm W.R.; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Dumont, Martine; Soucy, Penny; Dörk, Thilo; Bogdanova, Natalia V.; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Fasching, Peter A.; Häberle, Lothar; Ekici, Arif B.; Beckmann, Matthias W.; Fletcher, Olivia; Johnson, Nichola; dos Santos Silva, Isabel; Peto, Julian; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Mariani, Paolo; Giles, Graham G.; Severi, Gianluca; Baglietto, Laura; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marme, Federik; Burwinkel, Barbara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Lambrechts, Diether; Yesilyurt, Betul T.; Floris, Giuseppe; Leunen, Karin; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børresen-Dale, Anne-Lise; García-Closas, Montserrat; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Schmidt, Marjanka K.; Broeks, Annegien; Verhoef, Senno; Rutgers, Emiel J.; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Couch, Fergus J.; Toland, Amanda E.; Yannoukakos, Drakoulis; Pharoah, Paul D.P.; Hall, Per; Benítez, Javier; Malats, Núria; Easton, Douglas F.

    2014-01-01

    Part of the substantial unexplained familial aggregation of breast cancer may be due to interactions between common variants, but few studies have had adequate statistical power to detect interactions of realistic magnitude. We aimed to assess all two-way interactions in breast cancer susceptibility between 70 917 single nucleotide polymorphisms (SNPs) selected primarily based on prior evidence of a marginal effect. Thirty-eight international studies contributed data for 46 450 breast cancer cases and 42 461 controls of European origin as part of a multi-consortium project (COGS). First, SNPs were preselected based on evidence (P < 0.01) of a per-allele main effect, and all two-way combinations of those were evaluated by a per-allele (1 d.f.) test for interaction using logistic regression. Second, all 2.5 billion possible two-SNP combinations were evaluated using Boolean operation-based screening and testing, and SNP pairs with the strongest evidence of interaction (P < 10⁻⁴) were selected for more careful assessment by logistic regression. Under the first approach, 3277 SNPs were preselected, but an evaluation of all possible two-SNP combinations (1 d.f.) identified no interactions at P < 10⁻⁸. Results from the second analytic approach were consistent with those from the first (P > 10⁻¹⁰). In summary, we observed little evidence of two-way SNP interactions in breast cancer susceptibility, despite the large number of SNPs with potential marginal effects considered and the very large sample size. This finding may have important implications for risk prediction, simplifying the modelling required. Further comprehensive, large-scale genome-wide interaction studies may identify novel interacting loci if the inherent logistic and computational challenges can be overcome. PMID:24242184
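
    The scale of the pairwise search described above follows from simple counting: n SNPs give n(n − 1)/2 unordered pairs. The sketch below reproduces the "2.5 billion" figure from the 70 917 SNPs and shows what a Bonferroni-style threshold would look like for the 3277 preselected SNPs. The 0.05 family-wise level is an assumption for illustration; the abstract does not say how its P-value thresholds were chosen.

```python
import math


def n_pairs(n_snps):
    """Number of unordered two-SNP combinations among n_snps SNPs."""
    return math.comb(n_snps, 2)


if __name__ == "__main__":
    all_pairs = n_pairs(70917)
    preselected_pairs = n_pairs(3277)
    print(f"all pairs: {all_pairs:,}")
    print(f"preselected pairs: {preselected_pairs:,}")

    ALPHA = 0.05  # ASSUMPTION: family-wise error level, not stated in the abstract
    print(f"Bonferroni threshold, preselected pairs: {ALPHA / preselected_pairs:.1e}")
    print(f"Bonferroni threshold, all pairs: {ALPHA / all_pairs:.1e}")
```

    With about 5.4 million preselected pairs, a 0.05 Bonferroni threshold is on the order of 10⁻⁸, and with about 2.5 billion pairs it is on the order of 10⁻¹¹, which gives context for the thresholds quoted in the abstract.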

  17. A large-scale assessment of two-way SNP interactions in breast cancer susceptibility using 46,450 cases and 42,461 controls from the breast cancer association consortium.

    PubMed

    Milne, Roger L; Herranz, Jesús; Michailidou, Kyriaki; Dennis, Joe; Tyrer, Jonathan P; Zamora, M Pilar; Arias-Perez, José Ignacio; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Wang, Qin; Bolla, Manjeet K; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Li, Jingmei; Anton-Culver, Hoda; Neuhausen, Susan L; Ziogas, Argyrios; Clarke, Christina A; Hopper, John L; Dite, Gillian S; Apicella, Carmel; Southey, Melissa C; Chenevix-Trench, Georgia; Swerdlow, Anthony; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Wang, Xianshu; Olson, Janet E; Vachon, Celine; Purrington, Kristen; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Dunning, Alison M; Shah, Mitul; Guénel, Pascal; Truong, Thérèse; Sanchez, Marie; Mulot, Claire; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J; Hollestelle, Antoinette; Collée, J Margriet; Jager, Agnes; Cox, Angela; Brock, Ian W; Reed, Malcolm W R; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Dumont, Martine; Soucy, Penny; Dörk, Thilo; Bogdanova, Natalia V; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Fasching, Peter A; Häberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Fletcher, Olivia; Johnson, Nichola; dos Santos Silva, Isabel; Peto, Julian; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Mariani, Paolo; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marme, Federik; Burwinkel, Barbara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Lambrechts, Diether; Yesilyurt, Betul T; Floris, Giuseppe; Leunen, Karin; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børresen-Dale, Anne-Lise; García-Closas, Montserrat; Chanock, Stephen J; Lissowska, Jolanta; Figueroa, Jonine D; Schmidt, Marjanka K; Broeks, Annegien; Verhoef, Senno; Rutgers, Emiel J; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Couch, Fergus J; Toland, Amanda E; Yannoukakos, Drakoulis; Pharoah, Paul D P; Hall, Per; Benítez, Javier; Malats, Núria; Easton, Douglas F

    2014-04-01

    Part of the substantial unexplained familial aggregation of breast cancer may be due to interactions between common variants, but few studies have had adequate statistical power to detect interactions of realistic magnitude. We aimed to assess all two-way interactions in breast cancer susceptibility between 70,917 single nucleotide polymorphisms (SNPs) selected primarily based on prior evidence of a marginal effect. Thirty-eight international studies contributed data for 46,450 breast cancer cases and 42,461 controls of European origin as part of a multi-consortium project (COGS). First, SNPs were preselected based on evidence (P < 0.01) of a per-allele main effect, and all two-way combinations of those were evaluated by a per-allele (1 d.f.) test for interaction using logistic regression. Second, all 2.5 billion possible two-SNP combinations were evaluated using Boolean operation-based screening and testing, and SNP pairs with the strongest evidence of interaction (P < 10⁻⁴) were selected for more careful assessment by logistic regression. Under the first approach, 3277 SNPs were preselected, but an evaluation of all possible two-SNP combinations (1 d.f.) identified no interactions at P < 10⁻⁸. Results from the second analytic approach were consistent with those from the first (P > 10⁻¹⁰). In summary, we observed little evidence of two-way SNP interactions in breast cancer susceptibility, despite the large number of SNPs with potential marginal effects considered and the very large sample size. This finding may have important implications for risk prediction, simplifying the modelling required. Further comprehensive, large-scale genome-wide interaction studies may identify novel interacting loci if the inherent logistic and computational challenges can be overcome.

  18. A genetic risk score based on direct associations with coronary heart disease improves coronary heart disease risk prediction in the Atherosclerosis Risk in Communities (ARIC), but not in the Rotterdam and Framingham Offspring, Studies

    PubMed Central

    Brautbar, Ariel; Pompeii, Lisa A.; Dehghan, Abbas; Ngwa, Julius S.; Nambi, Vijay; Virani, Salim S.; Rivadeneira, Fernando; Uitterlinden, André G.; Hofman, Albert; Witteman, Jacqueline C.M.; Pencina, Michael J.; Folsom, Aaron R.; Cupples, L. Adrienne; Ballantyne, Christie M.; Boerwinkle, Eric

    2013-01-01

    Objective: Multiple studies have identified single-nucleotide polymorphisms (SNPs) that are associated with coronary heart disease (CHD). We examined whether SNPs selected based on predefined criteria will improve CHD risk prediction when added to traditional risk factors (TRFs). Methods: SNPs were selected from the literature based on association with CHD, lack of association with a known CHD risk factor, and successful replication. A genetic risk score (GRS) was constructed based on these SNPs. Cox proportional hazards model was used to calculate CHD risk based on the Atherosclerosis Risk in Communities (ARIC) and Framingham CHD risk scores with and without the GRS. Results: The GRS was associated with risk for CHD (hazard ratio [HR] = 1.10; 95% confidence interval [CI]: 1.07–1.13). Addition of the GRS to the ARIC risk score significantly improved discrimination, reclassification, and calibration beyond that afforded by TRFs alone in non-Hispanic whites in the ARIC study. The area under the receiver operating characteristic curve (AUC) increased from 0.742 to 0.749 (Δ = 0.007; 95% CI, 0.004–0.013), and the net reclassification index (NRI) was 6.3%. Although the risk estimates for CHD in the Framingham Offspring (HR = 1.12; 95% CI: 1.10–1.14) and Rotterdam (HR = 1.08; 95% CI: 1.02–1.14) Studies were significantly improved by adding the GRS to TRFs, improvements in AUC and NRI were modest. Conclusion: Addition of a GRS based on direct associations with CHD to TRFs significantly improved discrimination and reclassification in white participants of the ARIC Study, with no significant improvement in the Rotterdam and Framingham Offspring Studies. PMID:22789513
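
    The abstract does not say how its genetic risk score was computed, so the sketch below shows only the two most common constructions: an unweighted count of risk alleles, and a sum weighted by per-SNP effect sizes such as log odds ratios. The function name, genotypes and weights are all hypothetical and are not taken from the study.

```python
def genetic_risk_score(risk_allele_counts, weights=None):
    """Sum risk-allele counts (0, 1 or 2 per SNP), optionally weighted per SNP."""
    if any(count not in (0, 1, 2) for count in risk_allele_counts):
        raise ValueError("each risk-allele count must be 0, 1 or 2")
    if weights is None:
        weights = [1.0] * len(risk_allele_counts)
    if len(weights) != len(risk_allele_counts):
        raise ValueError("one weight per SNP is required")
    return sum(w * c for w, c in zip(weights, risk_allele_counts))


if __name__ == "__main__":
    genotype = [0, 1, 2, 1]                      # hypothetical individual, 4 SNPs
    log_odds_ratios = [0.10, 0.20, 0.05, 0.15]   # hypothetical effect sizes
    print(genetic_risk_score(genotype))                    # unweighted count
    print(genetic_risk_score(genotype, log_odds_ratios))   # weighted score
```

    In either form, the score is a single number per individual that can be entered as one covariate alongside the traditional risk factors in a Cox model.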

  19. Development of a novel forensic STR multiplex for ancestry analysis and extended identity testing.

    PubMed

    Phillips, Chris; Fernandez-Formoso, Luis; Gelabert-Besada, Miguel; Garcia-Magariños, Manuel; Santos, Carla; Fondevila, Manuel; Carracedo, Angel; Lareu, Maria Victoria

    2013-04-01

    There is growing interest in developing additional DNA typing techniques to provide better investigative leads in forensic analysis. These include inference of genetic ancestry and prediction of common physical characteristics of DNA donors. To date, forensic ancestry analysis has centered on population-divergent SNPs but these binary loci cannot reliably detect DNA mixtures, common in forensic samples. Furthermore, STR genotypes, forming the principal DNA profiling system, are not routinely combined with forensic SNPs to strengthen frequency data available for ancestry inference. We report development of a 12-STR multiplex composed of ancestry informative marker STRs (AIM-STRs) selected from 434 tetranucleotide repeat loci. We adapted our online Bayesian classifier for AIM-SNPs: Snipper, to handle multiallele STR data using frequency-based training sets. We assessed the ability of the 12-plex AIM-STRs to differentiate CEPH Human Genome Diversity Panel populations, plus their informativeness combined with established forensic STRs and AIM-SNPs. We found combining STRs and SNPs improves the success rate of ancestry assignments while providing a reliable mixture detection system lacking from SNP analysis alone. As the 12 STRs generally show a broad range of alleles in all populations, they provide highly informative supplementary STRs for extended relationship testing and identification of missing persons with incomplete reference pedigrees. Lastly, mixed marker approaches (combining STRs with binary loci) for simple ancestry inference tests beyond forensic analysis bring advantages and we discuss the genotyping options available. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. Genetic contribution to iron status: SNPs related to iron deficiency anaemia and fine mapping of CACNA2D3 calcium channel subunit.

    PubMed

    Baeza-Richer, Carlos; Arroyo-Pardo, Eduardo; Blanco-Rojo, Ruth; Toxqui, Laura; Remacha, Angel; Vaquero, M Pilar; López-Parra, Ana M

    2015-12-01

    Numerous studies associate genetic markers with iron- and erythrocyte-related parameters, but few relate them to iron-clinical phenotypes. Novel SNP rs1375515, located in a subunit of the calcium channel gene CACNA2D3, is associated with a higher risk of anaemia. The aim of this study is to further investigate the association of this SNP with iron-related parameters and iron-clinical phenotypes, and to explore the potential role of calcium channel subunit region in iron regulation. Furthermore, we aim to replicate the association of other SNPs reported previously in our population. We tested 45 SNPs selected via systematic review and fine mapping of CACNA2D3 region, with haematological and biochemical traits in 358 women of reproductive age. Multivariate analyses include back-step logistic regression and decision trees. The results replicate the association of SNPs with iron-related traits, and also confirm the protective effect of both A allele of rs1800562 (HFE) and G allele of rs4895441 (HBS1L-MYB). The risk of developing anaemia is increased in reproductive age women carriers of A allele of rs1868505 (CACNA2D3) and/or T allele of rs13194491 (HIST1H2BJ). Association of SNPs from fine mapping with ferritin and serum iron suggests that calcium channels could be a potential pathway for iron uptake in physiological conditions. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Genome-wide high-throughput SNP discovery and genotyping for understanding natural (functional) allelic diversity and domestication patterns in wild chickpea

    PubMed Central

    Bajaj, Deepak; Das, Shouvik; Badoni, Saurabh; Kumar, Vinod; Singh, Mohar; Bansal, Kailash C.; Tyagi, Akhilesh K.; Parida, Swarup K.

    2015-01-01

    We identified 82489 high-quality genome-wide SNPs from 93 wild and cultivated Cicer accessions through integrated reference genome- and de novo-based GBS assays. High intra- and inter-specific polymorphic potential (66–85%) and broader natural allelic diversity (6–64%) detected by genome-wide SNPs among accessions signify their efficacy for monitoring introgression and transferring target trait-regulating genomic (gene) regions/allelic variants from wild to cultivated Cicer gene pools for genetic improvement. The population-specific assignment of wild Cicer accessions pertaining to the primary gene pool are more influenced by geographical origin/phenotypic characteristics than species/gene-pools of origination. The functional significance of allelic variants (non-synonymous and regulatory SNPs) scanned from transcription factors and stress-responsive genes in differentiating wild accessions (with potential known sources of yield-contributing and stress tolerance traits) from cultivated desi and kabuli accessions, fine-mapping/map-based cloning of QTLs and determination of LD patterns across wild and cultivated gene-pools are suitably elucidated. The correlation between phenotypic (agromorphological traits) and molecular diversity-based admixed domestication patterns within six structured populations of wild and cultivated accessions via genome-wide SNPs was apparent. This suggests utility of whole genome SNPs as a potential resource for identifying naturally selected trait-regulating genomic targets/functional allelic variants adaptive to diverse agroclimatic regions for genetic enhancement of cultivated gene-pools. PMID:26208313

  2. High throughput SNP discovery and genotyping in hexaploid wheat.

    PubMed

    Rimbert, Hélène; Darrier, Benoît; Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre; Paux, Etienne

    2018-01-01

    Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction of a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.

  3. Steroid Sex Hormones, Sex Hormone-Binding Globulin, and Diabetes Incidence in the Diabetes Prevention Program.

    PubMed

    Mather, K J; Kim, C; Christophi, C A; Aroda, V R; Knowler, W C; Edelstein, S E; Florez, J C; Labrie, F; Kahn, S E; Goldberg, R B; Barrett-Connor, E

    2015-10-01

    Steroid sex hormones and SHBG may modify metabolism and diabetes risk, with implications for sex-specific diabetes risk and effects of prevention interventions. This study aimed to evaluate the relationships of steroid sex hormones, SHBG and SHBG single-nucleotide polymorphisms (SNPs) with diabetes risk factors and with progression to diabetes in the Diabetes Prevention Program (DPP). This was a secondary analysis of a multicenter randomized clinical trial involving 27 U.S. academic institutions. The study included 2898 DPP participants: 969 men, 948 premenopausal women not taking exogenous sex hormones, 550 postmenopausal women not taking exogenous sex hormones, and 431 postmenopausal women taking exogenous sex hormones. Participants were randomized to receive intensive lifestyle intervention, metformin, or placebo. The main outcomes were associations of steroid sex hormones, SHBG, and SHBG SNPs with glycemia and diabetes risk factors, and with incident diabetes over a median of 3.0 years (maximum, 5.0 y). T and DHT were inversely associated with fasting glucose in men, and estrone sulfate was directly associated with 2-hour post-challenge glucose in men and premenopausal women. SHBG was associated with fasting glucose in premenopausal women not taking exogenous sex hormones, and in postmenopausal women taking exogenous sex hormones, but not in the other groups. Diabetes incidence was directly associated with estrone and estradiol and inversely with T in men; the association with T was lost after adjustment for waist circumference. Sex steroids were not associated with diabetes outcomes in women. SHBG and SHBG SNPs did not predict incident diabetes in the DPP population. Estrogens and T predicted diabetes risk in men but not in women. SHBG and its polymorphisms did not predict risk in men or women. Diabetes risk is more potently determined by obesity and glycemia than by sex hormones.

  4. High-Dose Vitamin D3 during Tuberculosis Treatment in Mongolia. A Randomized Controlled Trial.

    PubMed

    Ganmaa, Davaasambuu; Munkhzul, Baatar; Fawzi, Wafaie; Spiegelman, Donna; Willett, Walter C; Bayasgalan, Purev; Baasansuren, Erkhembayar; Buyankhishig, Burneebaatar; Oyun-Erdene, Sereeter; Jolliffe, David A; Xenakis, Theodoros; Bromage, Sabri; Bloom, Barry R; Martineau, Adrian R

    2017-09-01

    Existing trials of adjunctive vitamin D in the treatment of pulmonary tuberculosis (PTB) are variously limited by small sample sizes, inadequate dosing regimens, and high baseline vitamin D status among participants. Comprehensive analyses of the effects of genetic variation in the vitamin D pathway on response to vitamin D supplementation are lacking. We aimed to determine the effect of high-dose vitamin D3 on response to antimicrobial therapy for PTB and to evaluate the influence of single-nucleotide polymorphisms (SNPs) in vitamin D pathway genes on response to adjunctive vitamin D3. We conducted a clinical trial in 390 adults with PTB in Ulaanbaatar, Mongolia, who were randomized to receive four biweekly doses of 3.5 mg (140,000 IU) vitamin D3 (n = 190) or placebo (n = 200) during intensive-phase antituberculosis treatment. The intervention elevated 8-week serum 25-hydroxyvitamin D concentrations (154.5 nmol/L vs. 15.2 nmol/L in active vs. placebo arms, respectively; 95% confidence interval for difference, 125.9-154.7 nmol/L; P < 0.001) but did not influence time to sputum culture conversion overall (adjusted hazard ratio, 1.09; 95% confidence interval, 0.86-1.36; P = 0.48). Adjunctive vitamin D3 accelerated sputum culture conversion in patients with one or more minor alleles for SNPs in genes encoding the vitamin D receptor (rs4334089, rs11568820) and 25-hydroxyvitamin D 1α-hydroxylase (CYP27B1: rs4646536) (adjusted hazard ratio ≥ 1.47; P for interaction ≤ 0.02). Vitamin D3 did not influence time to sputum culture conversion in the study population overall. Effects of the intervention were modified by SNPs in VDR and CYP27B1. Clinical trial registered with www.clinicaltrials.gov (NCT01657656).

  5. Genetic diversity and signatures of selection in various goat breeds revealed by genome-wide SNP markers.

    PubMed

    Brito, Luiz F; Kijas, James W; Ventura, Ricardo V; Sargolzaei, Mehdi; Porto-Neto, Laercio R; Cánovas, Angela; Feng, Zeny; Jafarikia, Mohsen; Schenkel, Flávio S

    2017-03-14

    The detection of signatures of selection has the potential to elucidate the identities of genes and mutations associated with phenotypic traits important for livestock species. It is also very relevant to investigate the levels of genetic diversity of a population, as genetic diversity represents the raw material essential for breeding and has practical implications for implementation of genomic selection. A total of 1151 animals from nine goat populations selected for different breeding goals and genotyped with the Illumina Goat 50K single nucleotide polymorphisms (SNP) Beadchip were included in this investigation. The proportion of polymorphic SNPs ranged from 0.902 (Nubian) to 0.995 (Rangeland). The overall mean H_O and H_E was 0.374 ± 0.021 and 0.369 ± 0.023, respectively. The average pairwise genetic distance (D) ranged from 0.263 (Toggenburg) to 0.323 (Rangeland). The overall average for the inbreeding measures F_EH, F_VR, F_LEUT, F_ROH and F_PED was 0.129, -0.012, -0.010, 0.038 and 0.030, respectively. Several regions located on 19 chromosomes were potentially under selection in at least one of the goat breeds. The genomic population tree constructed using all SNPs differentiated breeds based on selection purpose, while genomic population tree built using only SNPs in the most significant region showed a great differentiation between LaMancha and the other breeds. We hypothesized that this region is related to ear morphogenesis. Furthermore, we identified genes potentially related to reproduction traits, adult body mass, efficiency of food conversion, abdominal fat deposition, conformation traits, liver fat metabolism, milk fatty acids, somatic cells score, milk protein, thermo-tolerance and ear morphogenesis. In general, moderate to high levels of genetic variability were observed for all the breeds and a characterization of runs of homozygosity gave insights into the breeds' development history. The information reported here will be useful for the implementation of genomic selection and other genomic studies in goats. We also identified various genome regions under positive selection using smoothed F_ST and hapFLK statistics and suggested genes, which are potentially under selection. These results can now provide a foundation to formulate biological hypotheses related to selection processes in goats.
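
    The observed and expected heterozygosity values reported in the record above can be illustrated with a minimal Python sketch. It assumes a single biallelic SNP with genotypes coded as 0, 1 or 2 copies of one allele and None for a missing call; the function names are hypothetical, and the heterozygote-deficit estimate F = 1 - H_O/H_E is a generic textbook quantity, not necessarily any of the five inbreeding measures used in the study.

```python
def heterozygosity(genotypes):
    """Return (H_O, H_E) for one biallelic SNP.

    genotypes: iterable of 0, 1 or 2 (allele copy counts); None marks a missing call.
    """
    calls = [g for g in genotypes if g is not None]
    n = len(calls)
    if n == 0:
        raise ValueError("no called genotypes")
    # Observed heterozygosity: fraction of called individuals that are heterozygous.
    h_obs = sum(1 for g in calls if g == 1) / n
    # Allele frequency from allele counts (two alleles per individual).
    p = sum(calls) / (2 * n)
    # Expected heterozygosity under Hardy-Weinberg proportions.
    h_exp = 2 * p * (1 - p)
    return h_obs, h_exp


def heterozygote_deficit(h_obs, h_exp):
    """Generic F = 1 - H_O / H_E (illustrative only, assumes h_exp > 0)."""
    return 1 - h_obs / h_exp


if __name__ == "__main__":
    # Made-up genotypes for illustration, not data from the study.
    h_o, h_e = heterozygosity([0, 1, 1, 2, None])
    print(h_o, h_e, heterozygote_deficit(h_o, h_e))
```

    Averaging the per-SNP values over all markers would give population-level means comparable in kind to those quoted in the abstract.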

  6. The effect of missing data on linkage disequilibrium mapping and haplotype association analysis in the GAW14 simulated datasets

    PubMed Central

    McCaskie, Pamela A; Carter, Kim W; McCaskie, Simon R; Palmer, Lyle J

    2005-01-01

    We used our newly developed linkage disequilibrium (LD) plotting software, JLIN, to plot linkage disequilibrium between pairs of single-nucleotide polymorphisms (SNPs) for three chromosomes of the Genetic Analysis Workshop 14 Aipotu simulated population to assess the effect of missing data on LD calculations. Our haplotype analysis program, SIMHAP, was used to assess the effect of missing data on haplotype-phenotype association. Genotype data was removed at random, at levels of 1%, 5%, and 10%, and the LD calculations and haplotype association results for these levels of missingness were compared to those for the complete dataset. It was concluded that ignoring individuals with missing data substantially affects the number of regions of LD detected which, in turn, could affect tagging SNPs chosen to generate haplotypes. PMID:16451612
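
    The experiment described in the record above (removing genotypes at random and recomputing LD on the remaining individuals) can be sketched in Python as follows. This is not JLIN or SIMHAP: it assumes genotypes coded 0/1/2, uses the squared genotype correlation as a simple stand-in for haplotype-based r^2, and all function names are hypothetical.

```python
import random


def r2_complete_cases(snp_a, snp_b):
    """Squared genotype correlation between two SNPs.

    Individuals with a missing call (None) at either SNP are dropped.
    """
    pairs = [(a, b) for a, b in zip(snp_a, snp_b) if a is not None and b is not None]
    n = len(pairs)
    if n < 2:
        raise ValueError("need at least two complete genotype pairs")
    mean_a = sum(a for a, _ in pairs) / n
    mean_b = sum(b for _, b in pairs) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in pairs)
    var_a = sum((a - mean_a) ** 2 for a, _ in pairs)
    var_b = sum((b - mean_b) ** 2 for _, b in pairs)
    if var_a == 0 or var_b == 0:
        return 0.0  # a monomorphic SNP carries no LD information
    return cov * cov / (var_a * var_b)


def mask_at_random(genotypes, rate, rng):
    """Replace each genotype with None independently with probability `rate`."""
    return [None if rng.random() < rate else g for g in genotypes]


if __name__ == "__main__":
    rng = random.Random(14)  # fixed seed so the sketch is repeatable
    # Made-up genotypes for illustration, not GAW14 data.
    snp_a = [rng.choice([0, 1, 2]) for _ in range(200)]
    snp_b = [a if rng.random() < 0.8 else rng.choice([0, 1, 2]) for a in snp_a]
    for rate in (0.0, 0.01, 0.05, 0.10):
        masked_a = mask_at_random(snp_a, rate, rng)
        masked_b = mask_at_random(snp_b, rate, rng)
        print(rate, round(r2_complete_cases(masked_a, masked_b), 3))
```

    Comparing the printed values across missingness rates mirrors, in a very reduced form, the comparison against the complete dataset that the abstract describes.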

  7. Genetic signatures of natural selection in response to air pollution in red spruce (Picea rubens, Pinaceae).

    PubMed

    Bashalkhanov, Stanislav; Eckert, Andrew J; Rajora, Om P

    2013-12-01

    One of the most important drivers of local adaptation for forest trees is climate. Coupled to these patterns, however, are human-induced disturbances through habitat modification and pollution. The confounded effects of climate and disturbance have rarely been investigated with regard to selective pressure on forest trees. Here, we have developed and used a population genetic approach to search for signals of selection within a set of 36 candidate genes chosen for their putative effects on adaptation to climate and human-induced air pollution within five populations of red spruce (Picea rubens Sarg.), distributed across its natural range and air pollution gradient in eastern North America. Specifically, we used FST outlier and environmental correlation analyses to highlight a set of seven single nucleotide polymorphisms (SNPs) that were overly correlated with climate and levels of sulphate pollution after correcting for the confounding effects of population history. Use of three age cohorts within each population allowed the effects of climate and pollution to be separated temporally, as climate-related SNPs (n = 7) showed the strongest signals in the oldest cohort, while pollution-related SNPs (n = 3) showed the strongest signals in the youngest cohorts. These results highlight the usefulness of population genetic scans for the identification of putatively nonneutral evolution within genomes of nonmodel forest tree species, but also highlight the need for the development and application of robust methodologies to deal with the inherent multivariate nature of the genetic and ecological data used in these types of analyses. © 2013 John Wiley & Sons Ltd.

  8. African and Non-African Admixture Components in African Americans and An African Caribbean Population

    PubMed Central

    Murray, Tanda; Beaty, Terri H.; Mathias, Rasika A.; Rafaels, Nicholas; Grant, Audrey Virginia; Faruque, Mezbah U.; Watson, Harold R.; Ruczinski, Ingo; Dunston, Georgia M.; Barnes, Kathleen C.

    2013-01-01

    Admixture is a potential source of confounding in genetic association studies, so it becomes important to detect and estimate admixture in a sample of unrelated individuals. Populations of African descent in the US and the Caribbean share similar historical backgrounds but the distributions of African admixture may differ. We selected 416 ancestry informative markers (AIMs) to estimate and compare admixture proportions using STRUCTURE in 906 unrelated African Americans (AAs) and 294 Barbadians (ACs) from a study of asthma. This analysis showed AAs on average were 72.5% African, 19.6% European and 8% Asian, while ACs were 77.4% African, 15.9% European, and 6.7% Asian which were significantly different. A principal components analysis based on these AIMs yielded one primary eigenvector that explained 54.04% of the variation and captured a gradient from West African to European admixture. This principal component was highly correlated with African vs. European ancestry as estimated by STRUCTURE (r^2 = 0.992, r^2 = 0.912, respectively). To investigate other African contributions to African American and Barbadian admixture, we performed PCA on ~14,000 (14k) genome-wide SNPs in AAs, ACs, Yorubans, Luhya and Maasai African groups, and estimated genetic distances (FST). We found AAs and ACs were closest genetically (FST = 0.008), and both were closer to the Yorubans than the other East African populations. In our sample of individuals of African descent, ~400 well-defined AIMs were just as good for detecting substructure as ~14,000 random SNPs drawn from a genome-wide panel of markers. PMID:20717976
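
    The FST genetic distances mentioned in the record above can be illustrated with a small Python sketch. The abstract does not say which FST estimator was used, so this assumes the simplest textbook form for one biallelic SNP and two equally weighted populations, FST = (H_T - H_S) / H_T; the function name is hypothetical.

```python
def wright_fst(p1, p2):
    """FST for one biallelic SNP from allele frequencies in two populations.

    Uses FST = (H_T - H_S) / H_T with equal population weights. This is an
    assumption for illustration, not necessarily the study's estimator.
    """
    # Mean within-population expected heterozygosity.
    h_s = (2 * p1 * (1 - p1) + 2 * p2 * (1 - p2)) / 2
    # Expected heterozygosity of the pooled population.
    p_bar = (p1 + p2) / 2
    h_t = 2 * p_bar * (1 - p_bar)
    if h_t == 0:
        return 0.0  # SNP is fixed for the same allele in both populations
    return (h_t - h_s) / h_t


if __name__ == "__main__":
    # Made-up allele frequencies for illustration.
    print(wright_fst(0.30, 0.30))  # identical frequencies
    print(wright_fst(0.10, 0.60))  # diverged frequencies
```

    In practice a genome-wide value would combine many SNPs (for example as a ratio of averaged numerators and denominators) and would correct for sample size, which this sketch omits.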

  9. Integrated genetic and epigenetic prediction of coronary heart disease in the Framingham Heart Study.

    PubMed

    Dogan, Meeshanthini V; Grumbach, Isabella M; Michaelson, Jacob J; Philibert, Robert A

    2018-01-01

    An improved method for detecting coronary heart disease (CHD) could have substantial clinical impact. Building on the idea that systemic effects of CHD risk factors are a conglomeration of genetic and environmental factors, we use machine learning techniques and integrate genetic, epigenetic and phenotype data from the Framingham Heart Study to build and test a Random Forest classification model for symptomatic CHD. Our classifier was trained on n = 1,545 individuals and consisted of four DNA methylation sites, two SNPs, age and gender. The methylation sites and SNPs were selected during the training phase. The final trained model was then tested on n = 142 individuals. The test data comprised individuals removed based on relatedness to those in the training dataset. This integrated classifier was capable of classifying symptomatic CHD status of those in the test set with an accuracy, sensitivity and specificity of 78%, 0.75 and 0.80, respectively. In contrast, a model using only conventional CHD risk factors as predictors had an accuracy and sensitivity of only 65% and 0.42, respectively, but with a specificity of 0.89 in the test set. Regression analyses of the methylation signatures illustrate our ability to map these signatures to known risk factors in CHD pathogenesis. These results demonstrate the capability of an integrated approach to effectively model symptomatic CHD status. These results also suggest that future studies of biomaterial collected from longitudinally informative cohorts that are specifically characterized for cardiac disease at follow-up could lead to the introduction of sensitive, readily employable integrated genetic-epigenetic algorithms for predicting onset of future symptomatic CHD.
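
    The accuracy, sensitivity and specificity figures quoted in the record above are standard functions of a classifier's confusion matrix. A minimal Python sketch of those definitions follows; the function name is hypothetical and the counts in the usage example are invented for illustration, since the abstract does not report the underlying confusion matrix.

```python
def classification_metrics(tp, fp, tn, fn):
    """Accuracy, sensitivity and specificity from confusion-matrix counts.

    tp/fn: cases correctly/incorrectly classified;
    tn/fp: controls correctly/incorrectly classified.
    Assumes at least one case and one control are present.
    """
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,   # fraction of all calls that are correct
        "sensitivity": tp / (tp + fn),   # fraction of true cases detected
        "specificity": tn / (tn + fp),   # fraction of true controls cleared
    }


if __name__ == "__main__":
    # Hypothetical counts, not taken from the study.
    print(classification_metrics(tp=8, fp=1, tn=9, fn=2))
```

    The sketch also shows why a model can have high specificity but low sensitivity: the two ratios are computed over disjoint groups (controls and cases), so one can be high while the other is poor.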

  10. Identification of a Serum amyloid A gene and the association of SNPs with Vibrio-resistance and growth traits in the clam Meretrix meretrix.

    PubMed

    Zou, Linhu; Liu, Baozhong

    2015-04-01

    Serum amyloid A (SAA), an acute response protein as well as an apolipoprotein, is considered to play crucial roles in both innate immunity and lipid metabolism. In this study, a SAA gene (MmSAA) was identified in the clam Meretrix meretrix. The full length DNA of MmSAA was 1407 bp, consisting of three exons and two introns. The distribution of MmSAA in clam tissues was examined with the highest expression in hepatopancreas. In response to the Vibrio parahaemolyticus challenge, MmSAA mRNA showed significantly higher expression at 24 h post-challenge in experimental clams (P < 0.05). Forty-eight single nucleotide polymorphisms (SNPs) in the DNA partial sequence of MmSAA were discovered and examined for their association with Vibrio-resistance and growth traits, respectively. The single SNP association analysis indicated that five single SNPs (g.42, g.72, g.82, g.147 and g.165) were significantly associated with Vibrio-resistance (P < 0.05). Haplotype analysis produced additional support for association with the Chi-square values 6.393 (P = 0.012). Among the five selected SNPs, the effect of a missense mutation (g.82, A → G) was detected by site-directed mutagenesis with fusion expression of protein assay, and the result showed that the recombinant plasmids containing wild-type pET30a-MmSAA had more inhibition effect than the mutant ones on the growth rate of the host bacteria. In addition, four growth traits of the clams in 09G3SPSB population were recorded and the SNP g.176 was found to be significantly associated with the growth traits with the Global score value 0.790 (P = 0.015). Our findings suggested that common genetic variation in MmSAA might contribute to the risk of susceptibility to Vibrio infection and might be associated with the growth traits in the clams M. meretrix, and more work is still needed to validate these SNPs as potential markers for actual selective breeding. Copyright © 2015 Elsevier Ltd. All rights reserved.

  11. High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

    PubMed Central

    2011-01-01

    Background: High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results: We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions: This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity ≥2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus. PMID:21492434

  12. Maximizing the reliability of genomic selection by optimizing the calibration set of reference individuals: comparison of methods in two diverse groups of maize inbreds (Zea mays L.).

    PubMed

    Rincent, R; Laloë, D; Nicolas, S; Altmann, T; Brunel, D; Revilla, P; Rodríguez, V M; Moreno-Gonzalez, J; Melchinger, A; Bauer, E; Schoen, C-C; Meyer, N; Giauffret, C; Bauland, C; Jamin, P; Laborde, J; Monod, H; Flament, P; Charcosset, A; Moreau, L

    2012-10-01

    Genomic selection refers to the use of genotypic information for predicting breeding values of selection candidates. A prediction formula is calibrated with the genotypes and phenotypes of reference individuals constituting the calibration set. The size and the composition of this set are essential parameters affecting the prediction reliabilities. The objective of this study was to maximize reliabilities by optimizing the calibration set. Different criteria based on the diversity or on the prediction error variance (PEV) derived from the realized additive relationship matrix-best linear unbiased predictions model (RA-BLUP) were used to select the reference individuals. For the latter, we considered the mean of the PEV of the contrasts between each selection candidate and the mean of the population (PEVmean) and the mean of the expected reliabilities of the same contrasts (CDmean). These criteria were tested with phenotypic data collected on two diversity panels of maize (Zea mays L.) genotyped with a 50k SNPs array. In the two panels, samples chosen based on CDmean gave higher reliabilities than random samples for various calibration set sizes. CDmean also appeared superior to PEVmean, which can be explained by the fact that it takes into account the reduction of variance due to the relatedness between individuals. Selected samples were close to optimality for a wide range of trait heritabilities, which suggests that the strategy presented here can efficiently sample subsets in panels of inbred lines. A script to optimize reference samples based on CDmean is available on request.

  13. SNPs of bovine HGF gene and their association with growth traits in Nanyang cattle.

    PubMed

    Cai, Hanfang; Lan, Xianyong; Li, Aimin; Zhou, Yang; Sun, Jiajie; Lei, Chuzhao; Zhang, Chunlei; Chen, Hong

    2013-10-01

    Hepatocyte growth factor (HGF) is a multifunctional cell factor that regulates cellular proliferation, motility and morphogenesis in mammals, and it is of considerable significance for medical research. In this paper, polymorphisms of the HGF gene were investigated in 1433 healthy and unrelated Chinese cattle by PCR-RFLP and DNA sequencing. Ten novel single nucleotide polymorphisms (SNPs) were identified, which included one missense mutation, g.72801G>A, in the coding region, and the others in introns. Association analysis was performed between four of them (g.288T>C, g.72801G>A, g.77172G>T, and g.77408T>G) and growth traits in Nanyang cattle. The results indicated that SNPs within the bovine HGF gene were significantly associated with growth traits. Phylogenetic analysis showed that the genetic background of Caoyuan Red cattle was different from that of the other tested breeds. The findings will provide a background for application of the bovine HGF gene in the selection program in Chinese cattle. Copyright © 2013 Elsevier Ltd. All rights reserved.

  14. Interleukin-2 and Interleukin-8 Gene Polymorphisms and Acquired Aplastic Anemia Risk in a Chinese Population.

    PubMed

    Zhang, Xuejie; Lin, Shengyun; Yang, Yan; Rong, Liucheng; He, Guangsheng; He, Hailong; Xue, Yao; Fang, Yongjun; Wang, Yaping

    2017-01-01

    Cytokines IL-2 and IL-8 both participate in immune regulation. However, the relationship between polymorphisms in these two cytokines and the risk of acquired aplastic anemia (acquired AA) has not been explored. We selected five SNPs including rs11575812, rs2069772 and rs2069762 of IL-2, rs2227306 and rs2227543 of IL-8. SNaPshot genotyping was used to test the genotypes of IL-2 and IL-8 polymorphisms in a population of 101 acquired AA patients and 165 healthy controls. The rs2069762 G allele appeared to be a protective mutation, but no significant differences were found in the other four SNPs. We also found that rs2069762 had an impact on transcriptional regulation. It could be assumed that the rs2069762 polymorphism might reduce the risk of acquired aplastic anemia, while the remaining four SNPs might not contribute to susceptibility to acquired AA in a Chinese population. © 2017 The Author(s). Published by S. Karger AG, Basel.

  15. Development and validation of a D-loop mtDNA SNP assay for the screening of specimens in forensic casework.

    PubMed

    Chemale, Gustavo; Paneto, Greiciane Gaburro; Menezes, Meiga Aurea Mendes; de Freitas, Jorge Marcelo; Jacques, Guilherme Silveira; Cicarelli, Regina Maria Barretto; Fagundes, Paulo Roberto

    2013-05-01

    Mitochondrial DNA (mtDNA) analysis is usually a last resort in routine forensic DNA casework. However, it has become a powerful tool for the analysis of highly degraded samples or samples containing too little or no nuclear DNA, such as old bones and hair shafts. The gold standard methodology still constitutes the direct sequencing of polymerase chain reaction (PCR) products or cloned amplicons from the HVS-1 and HVS-2 (hypervariable segment) control region segments. Identifications using mtDNA are time consuming, expensive and can be very complex, depending on the amount and nature of the material being tested. The main goal of this work is to develop a less labour-intensive and less expensive screening method for mtDNA analysis, in order to aid in the exclusion of non-matching samples and as a presumptive test prior to final confirmatory DNA sequencing. We have selected 14 highly discriminatory single nucleotide polymorphisms (SNPs) based on simulations performed by Salas and Amigo (2010) to be typed using SNaPShot(TM) (Applied Biosystems, Foster City, CA, USA). The assay was validated by typing more than 100 HVS-1/HVS-2 sequenced samples. No differences were observed between the SNP typing and DNA sequencing when results were compared, with the exception of allelic dropouts observed in a few haplotypes. Haplotype diversity simulations were performed using 172 mtDNA sequences representative of the Brazilian population and a score of 0.9794 was obtained when the 14 SNPs were used, showing that the theoretical prediction approach for the selection of highly discriminatory SNPs suggested by Salas and Amigo (2010) was confirmed in the population studied. As the main goal of the work is to develop a screening assay to skip the sequencing of all samples in a particular case, a pair-wise comparison of the sequences was done using the selected SNPs. When both HVS-1/HVS-2 SNPs were used for simulations, at least two differences were observed in 93.2% of the comparisons performed. The assay was validated with casework samples. Results show that the method is straightforward and can be used for exclusionary purposes, saving time and laboratory resources. The assay confirms the theoretical prediction suggested by Salas and Amigo (2010). All forensic advantages, such as high sensitivity and power of discrimination, as well as the disadvantages, such as the occurrence of allele dropouts, are discussed throughout the article. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  16. A genome-wide association study identifies colorectal cancer susceptibility loci on chromosomes 10p14 and 8q23.3.

    PubMed

    Tomlinson, Ian P M; Webb, Emily; Carvajal-Carmona, Luis; Broderick, Peter; Howarth, Kimberley; Pittman, Alan M; Spain, Sarah; Lubbe, Steven; Walther, Axel; Sullivan, Kate; Jaeger, Emma; Fielding, Sarah; Rowan, Andrew; Vijayakrishnan, Jayaram; Domingo, Enric; Chandler, Ian; Kemp, Zoe; Qureshi, Mobshra; Farrington, Susan M; Tenesa, Albert; Prendergast, James G D; Barnetson, Rebecca A; Penegar, Steven; Barclay, Ella; Wood, Wendy; Martin, Lynn; Gorman, Maggie; Thomas, Huw; Peto, Julian; Bishop, D Timothy; Gray, Richard; Maher, Eamonn R; Lucassen, Anneke; Kerr, David; Evans, D Gareth R; Schafmayer, Clemens; Buch, Stephan; Völzke, Henry; Hampe, Jochen; Schreiber, Stefan; John, Ulrich; Koessler, Thibaud; Pharoah, Paul; van Wezel, Tom; Morreau, Hans; Wijnen, Juul T; Hopper, John L; Southey, Melissa C; Giles, Graham G; Severi, Gianluca; Castellví-Bel, Sergi; Ruiz-Ponte, Clara; Carracedo, Angel; Castells, Antoni; Försti, Asta; Hemminki, Kari; Vodicka, Pavel; Naccarati, Alessio; Lipton, Lara; Ho, Judy W C; Cheng, K K; Sham, Pak C; Luk, J; Agúndez, Jose A G; Ladero, Jose M; de la Hoya, Miguel; Caldés, Trinidad; Niittymäki, Iina; Tuupanen, Sari; Karhu, Auli; Aaltonen, Lauri; Cazier, Jean-Baptiste; Campbell, Harry; Dunlop, Malcolm G; Houlston, Richard S

    2008-05-01

    To identify colorectal cancer (CRC) susceptibility alleles, we conducted a genome-wide association study. In phase 1, we genotyped 550,163 tagSNPs in 940 familial colorectal tumor cases (627 CRC, 313 high-risk adenoma) and 965 controls. In phase 2, we genotyped 42,708 selected SNPs in 2,873 CRC cases and 2,871 controls. In phase 3, we evaluated 11 SNPs showing association at P < 10⁻⁴ in a joint analysis of phases 1 and 2 in 4,287 CRC cases and 3,743 controls. Two SNPs were taken forward to phase 4 genotyping (10,731 CRC cases and 10,961 controls from eight centers). In addition to the previously reported 8q24, 15q13 and 18q21 CRC risk loci, we identified two previously unreported associations: rs10795668, located at 10p14 (P = 2.5 × 10⁻¹³ overall; P = 6.9 × 10⁻¹² replication), and rs16892766, at 8q23.3 (P = 3.3 × 10⁻¹⁸ overall; P = 9.6 × 10⁻¹⁷ replication), which tags a plausible causative gene, EIF3H. These data provide further evidence for the 'common-disease common-variant' model of CRC predisposition.

  17. A second generation human haplotype map of over 3.1 million SNPs.

    PubMed

    Frazer, Kelly A; Ballinger, Dennis G; Cox, David R; Hinds, David A; Stuve, Laura L; Gibbs, Richard A; Belmont, John W; Boudreau, Andrew; Hardenbol, Paul; Leal, Suzanne M; Pasternak, Shiran; Wheeler, David A; Willis, Thomas D; Yu, Fuli; Yang, Huanming; Zeng, Changqing; Gao, Yang; Hu, Haoran; Hu, Weitao; Li, Chaohua; Lin, Wei; Liu, Siqi; Pan, Hao; Tang, Xiaoli; Wang, Jian; Wang, Wei; Yu, Jun; Zhang, Bo; Zhang, Qingrun; Zhao, Hongbin; Zhao, Hui; Zhou, Jun; Gabriel, Stacey B; Barry, Rachel; Blumenstiel, Brendan; Camargo, Amy; Defelice, Matthew; Faggart, Maura; Goyette, Mary; Gupta, Supriya; Moore, Jamie; Nguyen, Huy; Onofrio, Robert C; Parkin, Melissa; Roy, Jessica; Stahl, Erich; Winchester, Ellen; Ziaugra, Liuda; Altshuler, David; Shen, Yan; Yao, Zhijian; Huang, Wei; Chu, Xun; He, Yungang; Jin, Li; Liu, Yangfan; Shen, Yayun; Sun, Weiwei; Wang, Haifeng; Wang, Yi; Wang, Ying; Xiong, Xiaoyan; Xu, Liang; Waye, Mary M Y; Tsui, Stephen K W; Xue, Hong; Wong, J Tze-Fei; Galver, Luana M; Fan, Jian-Bing; Gunderson, Kevin; Murray, Sarah S; Oliphant, Arnold R; Chee, Mark S; Montpetit, Alexandre; Chagnon, Fanny; Ferretti, Vincent; Leboeuf, Martin; Olivier, Jean-François; Phillips, Michael S; Roumy, Stéphanie; Sallée, Clémentine; Verner, Andrei; Hudson, Thomas J; Kwok, Pui-Yan; Cai, Dongmei; Koboldt, Daniel C; Miller, Raymond D; Pawlikowska, Ludmila; Taillon-Miller, Patricia; Xiao, Ming; Tsui, Lap-Chee; Mak, William; Song, You Qiang; Tam, Paul K H; Nakamura, Yusuke; Kawaguchi, Takahisa; Kitamoto, Takuya; Morizono, Takashi; Nagashima, Atsushi; Ohnishi, Yozo; Sekine, Akihiro; Tanaka, Toshihiro; Tsunoda, Tatsuhiko; Deloukas, Panos; Bird, Christine P; Delgado, Marcos; Dermitzakis, Emmanouil T; Gwilliam, Rhian; Hunt, Sarah; Morrison, Jonathan; Powell, Don; Stranger, Barbara E; Whittaker, Pamela; Bentley, David R; Daly, Mark J; de Bakker, Paul I W; Barrett, Jeff; Chretien, Yves R; Maller, Julian; McCarroll, Steve; Patterson, Nick; Pe'er, Itsik; Price, Alkes; Purcell, Shaun; Richter, 
Daniel J; Sabeti, Pardis; Saxena, Richa; Schaffner, Stephen F; Sham, Pak C; Varilly, Patrick; Altshuler, David; Stein, Lincoln D; Krishnan, Lalitha; Smith, Albert Vernon; Tello-Ruiz, Marcela K; Thorisson, Gudmundur A; Chakravarti, Aravinda; Chen, Peter E; Cutler, David J; Kashuk, Carl S; Lin, Shin; Abecasis, Gonçalo R; Guan, Weihua; Li, Yun; Munro, Heather M; Qin, Zhaohui Steve; Thomas, Daryl J; McVean, Gilean; Auton, Adam; Bottolo, Leonardo; Cardin, Niall; Eyheramendy, Susana; Freeman, Colin; Marchini, Jonathan; Myers, Simon; Spencer, Chris; Stephens, Matthew; Donnelly, Peter; Cardon, Lon R; Clarke, Geraldine; Evans, David M; Morris, Andrew P; Weir, Bruce S; Tsunoda, Tatsuhiko; Mullikin, James C; Sherry, Stephen T; Feolo, Michael; Skol, Andrew; Zhang, Houcan; Zeng, Changqing; Zhao, Hui; Matsuda, Ichiro; Fukushima, Yoshimitsu; Macer, Darryl R; Suda, Eiko; Rotimi, Charles N; Adebamowo, Clement A; Ajayi, Ike; Aniagwu, Toyin; Marshall, Patricia A; Nkwodimmah, Chibuzor; Royal, Charmaine D M; Leppert, Mark F; Dixon, Missy; Peiffer, Andy; Qiu, Renzong; Kent, Alastair; Kato, Kazuto; Niikawa, Norio; Adewole, Isaac F; Knoppers, Bartha M; Foster, Morris W; Clayton, Ellen Wright; Watkin, Jessica; Gibbs, Richard A; Belmont, John W; Muzny, Donna; Nazareth, Lynne; Sodergren, Erica; Weinstock, George M; Wheeler, David A; Yakub, Imtaz; Gabriel, Stacey B; Onofrio, Robert C; Richter, Daniel J; Ziaugra, Liuda; Birren, Bruce W; Daly, Mark J; Altshuler, David; Wilson, Richard K; Fulton, Lucinda L; Rogers, Jane; Burton, John; Carter, Nigel P; Clee, Christopher M; Griffiths, Mark; Jones, Matthew C; McLay, Kirsten; Plumb, Robert W; Ross, Mark T; Sims, Sarah K; Willey, David L; Chen, Zhu; Han, Hua; Kang, Le; Godbout, Martin; Wallenburg, John C; L'Archevêque, Paul; Bellemare, Guy; Saeki, Koji; Wang, Hongguang; An, Daochang; Fu, Hongbo; Li, Qing; Wang, Zhen; Wang, Renwu; Holden, Arthur L; Brooks, Lisa D; McEwen, Jean E; Guyer, Mark S; Wang, Vivian Ota; Peterson, Jane L; Shi, Michael; 
Spiegel, Jack; Sung, Lawrence M; Zacharia, Lynn F; Collins, Francis S; Kennedy, Karen; Jamieson, Ruth; Stewart, John

    2007-10-18

    We describe the Phase II HapMap, which characterizes over 3.1 million human single nucleotide polymorphisms (SNPs) genotyped in 270 individuals from four geographically diverse populations and includes 25-35% of common SNP variation in the populations surveyed. The map is estimated to capture untyped common variation with an average maximum r² of between 0.9 and 0.96 depending on population. We demonstrate that the current generation of commercial genome-wide genotyping products captures common Phase II SNPs with an average maximum r² of up to 0.8 in African and up to 0.95 in non-African populations, and that potential gains in power in association studies can be obtained through imputation. These data also reveal novel aspects of the structure of linkage disequilibrium. We show that 10-30% of pairs of individuals within a population share at least one region of extended genetic identity arising from recent ancestry and that up to 1% of all common variants are untaggable, primarily because they lie within recombination hotspots. We show that recombination rates vary systematically around genes and between genes of different function. Finally, we demonstrate increased differentiation at non-synonymous, compared to synonymous, SNPs, resulting from systematic differences in the strength or efficacy of natural selection between populations.
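
    The "maximum r-squared" coverage measure mentioned in this abstract can be illustrated with a small sketch. It assumes phased haplotypes coded 0/1 at each biallelic SNP and uses the standard definition of the squared correlation between two loci, D² / (pA(1 - pA) pB(1 - pB)); the function names and toy data are hypothetical and are not taken from the HapMap analysis.

```python
def r_squared(snp_a, snp_b):
    """Squared LD correlation between two biallelic SNPs.

    snp_a and snp_b are equal-length lists of 0/1 alleles, one entry per
    phased haplotype.
    """
    n = len(snp_a)
    p_a = sum(snp_a) / n
    p_b = sum(snp_b) / n
    p_ab = sum(1 for a, b in zip(snp_a, snp_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("both SNPs must be polymorphic")
    return d * d / denom


def max_r_squared(target, tags):
    """Best r-squared between an untyped target SNP and any typed tag SNP."""
    return max(r_squared(target, tag) for tag in tags)


# Toy data: four haplotypes, one untyped target SNP and two typed tag SNPs.
target = [0, 1, 0, 1]
tags = [[0, 0, 1, 1], [0, 1, 0, 1]]
best = max_r_squared(target, tags)
```

    Averaging `max_r_squared` over all untyped common SNPs would give an "average maximum r²" style summary for a given set of tags.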

  18. Genome-wide association study reveals candidate genes influencing lipids and diterpenes contents in Coffea arabica L.

    PubMed

    Sant'Ana, Gustavo C; Pereira, Luiz F P; Pot, David; Ivamoto, Suzana T; Domingues, Douglas S; Ferreira, Rafaelle V; Pagiatto, Natalia F; da Silva, Bruna S R; Nogueira, Lívia M; Kitzberger, Cintia S G; Scholz, Maria B S; de Oliveira, Fernanda F; Sera, Gustavo H; Padilha, Lilian; Labouisse, Jean-Pierre; Guyot, Romain; Charmetant, Pierre; Leroy, Thierry

    2018-01-11

    Lipids, including the diterpenes cafestol and kahweol, are key compounds that contribute to the quality of coffee beverages. We determined total lipid content and cafestol and kahweol concentrations in green beans and genotyped 107 Coffea arabica accessions, including wild genotypes from the historical FAO collection from Ethiopia. A genome-wide association study was performed to identify genomic regions associated with lipid, cafestol and kahweol contents and cafestol/kahweol ratio. Using the diploid Coffea canephora genome as a reference, we identified 6,696 SNPs. Population structure analyses suggested the presence of two to three groups (K = 2 and K = 3) corresponding to the east and west sides of the Great Rift Valley and an additional group formed by wild accessions collected in western forests. We identified 5 SNPs associated with lipid content, 4 with cafestol, 3 with kahweol and 9 with cafestol/kahweol ratio. Most of these SNPs are located inside or near candidate genes related to metabolic pathways of these chemical compounds in coffee beans. In addition, three trait-associated SNPs showed evidence of directional selection among cultivated and wild coffee accessions. Our results also confirm a great allelic richness in wild accessions from Ethiopia, especially in accessions originating from forests in the west side of the Great Rift Valley.

  19. SNPs of melanocortin 4 receptor (MC4R) associated with body weight in Beagle dogs.

    PubMed

    Zeng, Ruixia; Zhang, Yibo; Du, Peng

    2014-01-01

    Melanocortin 4 receptor (MC4R), which is associated with inherited human obesity, is involved in the regulation of food intake and body weight in mammals. To study the relationship between MC4R gene polymorphisms and body weight in Beagle dogs, we determined and compared the nucleotide sequence of the whole coding region and the 3'- and 5'-flanking regions of the dog MC4R gene (1214 bp). In 120 Beagle dogs, two SNPs (A420C, C895T) were identified, and their relationship with body weight was analyzed with the RFLP-PCR method. The results showed that the A420C SNP, which changes amino acid 101 of the MC4R protein from asparagine to threonine, was significantly associated with canine body weight, while body weight variation was significant in female dogs carrying the MC4R nonsense mutation at C895T. These results suggest that the two SNPs might affect the function of the MC4R gene, which is related to body weight in Beagle dogs. Therefore, MC4R is a candidate gene for selecting dogs of different sizes, and the MC4R SNPs (A420C, C895T) are potentially valuable as genetic markers.

  20. Validation of genetic polymorphisms on BTA14 associated with carcass trait in a commercial Hanwoo population.

    PubMed

    Sharma, A; Dang, C G; Kim, K S; Kim, J J; Lee, H K; Kim, H C; Yeon, S H; Kang, H S; Lee, S H

    2014-12-01

    The objective of this study was to validate the association of significant SNPs identified from a previous genome-wide association study with carcass weight (CWT) in a commercial Hanwoo population. We genotyped 13 SNPs located on BTA14 in 867 steers from Korea Hanwoo feedlot bulls. Of these 13 SNPs, five SNPs, namely rs29021868, rs110061498, rs109546980, rs42404006 and rs42303720, were found to be significantly associated (P < 0.001) with CWT. These five significant markers spanned the 24.3 to 29.4 Mb region of BTA14. The most significant marker (rs29021868) for CWT in this study had a 13.07 kg allele substitution effect and accounted for 2.4% of the additive genetic variance in the commercial Hanwoo population. The SNP marker rs109546980 was found to be significantly associated with both CWT (P < 0.001) and eye muscle area (P < 0.001) and could potentially be exploited for marker-assisted selection in Hanwoo cattle. We also genotyped the ss319607402 variation, which maps to intron 2 of the PLAG1 gene and has already been reported to be associated with height, to identify any significant association with carcass weight; however, no such association was observed in this Hanwoo commercial population. © 2014 Stichting International Foundation for Animal Genetics.

  1. Analytical and statistical consideration on the use of the ISAG-ICAR-SNP bovine panel for parentage control, using the Illumina BeadChip technology: example on the German Holstein population.

    PubMed

    Schütz, Ekkehard; Brenig, Bertram

    2015-02-05

    Parentage control is moving from short tandem repeats- to single nucleotide polymorphism (SNP) systems. For SNP-based parentage control in cattle, the ISAG-ICAR Committee proposes a set of 100/200 SNPs but quality criteria are lacking. Regarding German Holstein-Friesian cattle with only a limited number of evaluated individuals, the exclusion probability is not well-defined. We propose a statistical procedure for excluding single SNPs from parentage control, based on case-by-case evaluation of the GenCall score, to minimize parentage exclusion, based on miscalled genotypes. Exclusion power of the ISAG-ICAR SNPs used for the German Holstein-Friesian population was adjusted based on the results of more than 25,000 individuals. Experimental data were derived from routine genomic selection analyses of the German Holstein-Friesian population using the Illumina BovineSNP50 v2 BeadChip (20,000 individuals) or the EuroG10K variant (7000 individuals). Averages and standard deviations of GenCall scores for the 200 SNPs of the ISAG-ICAR recommended panel were calculated and used to calculate the downward Z-value. Based on minor allelic frequencies in the Holstein-Friesian population, one minus exclusion probability was equal to 1.4×10⁻¹⁰ and 7.2×10⁻²⁶, with one and two parents, respectively. Two monomorphic SNPs from the 100-SNP ISAG-ICAR core-panel did not contribute. Simulation of 10,000 parentage control combinations, using the GenCall score data from both BeadChips, showed that with a Z-value greater than 3.66 only about 2.5% parentages were excluded, based on the ISAG-ICAR recommendations (core-panel: ≥ 90 SNPs for one, ≥ 85 SNPs for two parents). When applied to real data from 1750 single parentage assessments, the optimal threshold was determined to be Z = 5.0, with only 34 censored cases and reduction to four (0.2%) doubtful parentages. About 70 parentage exclusions due to weak genotype calls were avoided, whereas true exclusions (n = 34) were unaffected. 
Using SNPs for parentage evaluation provides a high exclusion power also for parent identification. SNPs with a low GenCall score show a high tendency towards intra-molecular secondary structures and substantially contribute to false exclusion of parentages. We propose a method that controls this error without excluding too many parent combinations from the evaluation.
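
    The censoring idea described in this abstract can be sketched roughly as below. The abstract does not give the exact formula, so the sketch assumes the "downward Z-value" is the number of standard deviations by which an individual GenCall score falls below the SNP's population mean, and that calls above a threshold (Z = 5.0 is the value reported for real data) are dropped from the comparison. The SNP names and reference values are hypothetical.

```python
def downward_z(score, mean, sd):
    """Standard deviations by which a GenCall score lies below the SNP mean."""
    return (mean - score) / sd


def usable_snps(scores, reference, z_threshold=5.0):
    """Return the SNPs whose calls are kept for parentage comparison.

    scores:    {snp_name: GenCall score of this individual}
    reference: {snp_name: (population mean, population standard deviation)}
    """
    kept = []
    for snp, score in scores.items():
        mean, sd = reference[snp]
        if downward_z(score, mean, sd) <= z_threshold:
            kept.append(snp)
    return kept


# Hypothetical reference statistics and one animal's scores.
reference = {"snp1": (0.90, 0.02), "snp2": (0.85, 0.03)}
scores = {"snp1": 0.88, "snp2": 0.55}
kept = usable_snps(scores, reference)  # snp2 is censored as a weak call
```

    Censoring a weak call removes that SNP from the parent-offspring comparison instead of counting it as a possible exclusion, which is how false exclusions from miscalled genotypes are avoided in this reading of the method.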

  2. A robust and efficient statistical method for genetic association studies using case and control samples from multiple cohorts

    PubMed Central

    2013-01-01

    Background: The theoretical basis of genome-wide association studies (GWAS) is statistical inference of linkage disequilibrium (LD) between any polymorphic marker and a putative disease locus. Most methods widely implemented for such analyses are vulnerable to several key demographic factors and deliver poor statistical power for detecting genuine associations as well as a high false positive rate. Here, we present a likelihood-based statistical approach that properly accounts for the non-random nature of case–control samples with regard to the genotypic distribution at the loci in the populations under study, and that confers flexibility to test for genetic association in the presence of confounding factors such as population structure and non-randomness of samples. Results: We implemented this novel method together with several popular methods from the GWAS literature to re-analyze recently published Parkinson’s disease (PD) case–control samples. The real data analysis and computer simulation show that the new method confers not only significantly improved statistical power for detecting associations but also robustness to the difficulties stemming from non-random sampling and genetic structure when compared to its rivals. In particular, the new method detected 44 significant SNPs within 25 chromosomal regions of size < 1 Mb, but only 6 SNPs in two of these regions were previously detected by the trend-test-based methods. It discovered two SNPs located 1.18 Mb and 0.18 Mb from the PD candidates, FGF20 and PARK8, without invoking false positive risk. Conclusions: We developed a novel likelihood-based method that provides adequate estimation of LD and other population model parameters using case and control samples, eases the integration of these samples from multiple genetically divergent populations, and thus confers statistically robust and powerful analyses of GWAS. 
On the basis of simulation studies and analysis of real datasets, we demonstrated significant improvement of the new method over the non-parametric trend test, which is the most widely implemented method in the GWAS literature. PMID:23394771

  3. Polymorphisms in the estrogen receptor alpha gene (ESR1), daily cycling estrogen and mammographic density phenotypes.

    PubMed

    Fjeldheim, F N; Frydenberg, H; Flote, V G; McTiernan, A; Furberg, A-S; Ellison, P T; Barrett, E S; Wilsgaard, T; Jasienska, G; Ursin, G; Wist, E A; Thune, I

    2016-10-07

    Single nucleotide polymorphisms (SNPs) involved in the estrogen pathway and SNPs in the estrogen receptor alpha gene (ESR1 6q25) have been linked to breast cancer development, and mammographic density is an established breast cancer risk factor. Whether there is an association between daily estradiol levels, SNPs in ESR1 and premenopausal mammographic density phenotypes is unknown. We assessed estradiol in daily saliva samples throughout an entire menstrual cycle in 202 healthy premenopausal women in the Norwegian Energy Balance and Breast Cancer Aspects I study. DNA was genotyped using the Illumina Golden Gate platform. Mammograms were taken between days 7 and 12 of the menstrual cycle, and digitized mammographic density was assessed using a computer-assisted method (Madena). Multivariable regression models were used to study the association between SNPs in ESR1, premenopausal mammographic density phenotypes and daily cycling estradiol. We observed inverse linear associations between the minor alleles of eight measured SNPs (rs3020364, rs2474148, rs12154178, rs2347867, rs6927072, rs2982712, rs3020407, rs9322335) and percent mammographic density (p-values: 0.002-0.026); these associations were strongest in lean women (BMI ≤23.6 kg/m²). The odds of above-median percent mammographic density (>28.5 %) among women with major homozygous genotypes were 3-6 times higher than those of women with minor homozygous genotypes in seven SNPs. Women with rs3020364 major homozygous genotype had an OR of 6.46 for above-median percent mammographic density (OR: 6.46; 95 % Confidence Interval 1.61, 25.94) when compared to women with the minor homozygous genotype. These associations were not observed in relation to absolute mammographic density. No associations between SNPs and daily cycling estradiol were observed. 
However, we suggest, based on results of borderline significance (p values: 0.025-0.079) that the level of 17β-estradiol for women with the minor genotype for rs3020364, rs24744148 and rs2982712 were lower throughout the cycle in women with low (<28.5 %) percent mammographic density and higher in women with high (>28.5 %) percent mammographic density, when compared to women with the major genotype. Our results support an association between eight selected SNPs in the ESR1 gene and percent mammographic density. The results need to be confirmed in larger studies.

  4. Placental genome and maternal-placental genetic interactions: a genome-wide and candidate gene association study of placental abruption.

    PubMed

    Denis, Marie; Enquobahrie, Daniel A; Tadesse, Mahlet G; Gelaye, Bizu; Sanchez, Sixto E; Salazar, Manuel; Ananth, Cande V; Williams, Michelle A

    2014-01-01

    While available evidence supports the role of genetics in the pathogenesis of placental abruption (PA), PA-related placental genome variations and maternal-placental genetic interactions have not been investigated. Maternal blood and placental samples collected from participants in the Peruvian Abruptio Placentae Epidemiology study were genotyped using Illumina's Cardio-Metabochip platform. We examined 118,782 genome-wide SNPs and 333 SNPs in 32 candidate genes from mitochondrial biogenesis and oxidative phosphorylation pathways in placental DNA from 280 PA cases and 244 controls. We assessed maternal-placental interactions in the candidate gene SNPs and two imprinted regions (IGF2/H19 and C19MC). Univariate and penalized logistic regression models were fit to estimate odds ratios. We examined the combined effect of multiple SNPs on PA risk using weighted genetic risk scores (WGRS) with repeated ten-fold cross-validations. A multinomial model was used to investigate maternal-placental genetic interactions. In placental genome-wide and candidate gene analyses, no SNP was significant after false discovery rate correction. The top genome-wide association study (GWAS) hits were rs544201, rs1484464 (CTNNA2), rs4149570 (TNFRSF1A) and rs13055470 (ZNRF3) (p-values: 1.11e-05 to 3.54e-05). The top 200 SNPs of the GWAS overrepresented genes involved in cell cycle, growth and proliferation. The top candidate gene hits were rs16949118 (COX10) and rs7609948 (THRB) (p-values: 6.00e-03 and 8.19e-03). Participants in the highest quartile of WGRS based on cross-validations using SNPs selected from the GWAS and candidate gene analyses had an 8.40-fold (95% CI: 5.8-12.56) and a 4.46-fold (95% CI: 2.94-6.72) higher odds of PA compared to participants in the lowest quartile. We found maternal-placental genetic interactions on PA risk for two SNPs in PPARG (chr3:12313450 and chr3:12412978) and maternal imprinting effects for multiple SNPs in the C19MC and IGF2/H19 regions. 
Variations in the placental genome and interactions between maternal-placental genetic variations may contribute to PA risk. Larger studies may help advance our understanding of PA pathogenesis.
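
    A weighted genetic risk score of the kind used in this abstract can be sketched as follows. The abstract does not state how the weights were defined, so this sketch assumes the common convention of weighting each SNP's risk-allele count (0, 1 or 2) by the natural log of its estimated odds ratio; the SNP labels and odds ratios are hypothetical.

```python
import math


def weighted_genetic_risk_score(allele_counts, odds_ratios):
    """Sum of risk-allele counts weighted by log odds ratios.

    allele_counts: {snp_name: number of risk alleles carried (0, 1 or 2)}
    odds_ratios:   {snp_name: per-allele odds ratio used as the weight source}
    """
    return sum(
        math.log(odds_ratios[snp]) * count
        for snp, count in allele_counts.items()
    )


# Hypothetical per-allele odds ratios and one participant's genotypes.
odds_ratios = {"snpA": 1.5, "snpB": 2.0, "snpC": 1.2}
participant = {"snpA": 2, "snpB": 1, "snpC": 0}
score = weighted_genetic_risk_score(participant, odds_ratios)
```

    Scores computed this way for all participants can then be split into quartiles, and the odds of the outcome in the highest quartile compared with the lowest, as the abstract reports.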

  5. The population genomic signature of environmental selection in the widespread insect-pollinated tree species Frangula alnus at different geographical scales

    PubMed Central

    De Kort, H; Vandepitte, K; Mergeay, J; Mijnsbrugge, K V; Honnay, O

    2015-01-01

    The evaluation of the molecular signatures of selection in species lacking an available closely related reference genome remains challenging, yet it may provide valuable fundamental insights into the capacity of populations to respond to environmental cues. We screened 25 native populations of the tree species Frangula alnus subsp. alnus (Rhamnaceae), covering three different geographical scales, for 183 annotated single-nucleotide polymorphisms (SNPs). Standard population genomic outlier screens were combined with individual-based and multivariate landscape genomic approaches to examine the strength of selection relative to neutral processes in shaping genomic variation, and to identify the main environmental agents driving selection. Our results demonstrate a more distinct signature of selection with increasing geographical distance, as indicated by the proportion of SNPs (i) showing exceptional patterns of genetic diversity and differentiation (outliers) and (ii) associated with climate. Both temperature and precipitation have an important role as selective agents in shaping adaptive genomic differentiation in F. alnus subsp. alnus, although their relative importance differed among spatial scales. At the ‘intermediate’ and ‘regional’ scales, where limited genetic clustering and high population diversity were observed, some indications of natural selection may suggest a major role for gene flow in safeguarding adaptability. High genetic diversity, at loci under selection in particular, indicated considerable adaptive potential, which may nevertheless be compromised by the combined effects of climate change and habitat fragmentation. PMID:25944466

  6. Association of MAP4K4 gene single nucleotide polymorphism with mastitis and milk traits in Chinese Holstein cattle.

    PubMed

    Bhattarai, Dinesh; Chen, Xing; Ur Rehman, Zia; Hao, Xingjie; Ullah, Farman; Dad, Rahim; Talpur, Hira Sajjad; Kadariya, Ishwari; Cui, Lu; Fan, Mingxia; Zhang, Shujun

    2017-02-01

    The objective of the studies presented in this Research Communication was to investigate the association of single nucleotide polymorphisms present in the MAP4K4 gene with different milk traits in dairy cows. Based on previous QTL fine mapping results on bovine chromosome 11, the MAP4K4 gene was selected as a candidate gene to evaluate its effect on somatic cell count and milk traits in Chinese Holstein cows. Milk production traits including milk yield, fat percentage, and protein percentage of each cow were collected using 305 d lactation records. Association between MAP4K4 genotype and the milk traits and somatic cell score (SCS) was tested using a general linear regression model in R. For two SNPs in exon 18 (c.2061T > G and c.2196T > C), the TT genotype at both SNPs was associated with significantly higher SCS. The exon 18 SNP c.2061T > G had a significant effect on protein percentage, milk yield and SCS. We identified SNPs at different locations in the bovine MAP4K4 gene, and several of them were significantly associated with SCS and other milk traits. Thus, MAP4K4 could be a useful candidate gene for selection of dairy cattle against mastitis, and the identified polymorphisms might be strong genetic markers.

  7. Large-scale SNP discovery and construction of a high-density genetic map of Colossoma macropomum through genotyping-by-sequencing

    PubMed Central

    Nunes, José de Ribamar da Silva; Liu, Shikai; Pértille, Fábio; Perazza, Caio Augusto; Villela, Priscilla Marqui Schmidt; de Almeida-Val, Vera Maria Fonseca; Hilsdorf, Alexandre Wagner Silva; Liu, Zhanjiang; Coutinho, Luiz Lehmann

    2017-01-01

    Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs. PMID:28387238
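
    The reported average marker interval can be checked from the figures in this abstract. The short calculation below is only a consistency check: the abstract does not say whether the average was taken per marker or per interval within linkage groups, so both are shown, and both agree with the reported value after rounding.

```python
total_length_cm = 2811   # total map length in centimorgans
mapped_markers = 7192    # markers placed on the final map
linkage_groups = 27      # number of linkage groups

# Average spacing if the length is simply divided by the marker count.
per_marker = total_length_cm / mapped_markers  # about 0.391 cM

# Average spacing over actual intervals: each linkage group with k markers
# has k - 1 intervals, so there are (markers - groups) intervals in total.
per_interval = total_length_cm / (mapped_markers - linkage_groups)  # about 0.392 cM
```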

  8. Different level of population differentiation among human genes.

    PubMed

    Wu, Dong-Dong; Zhang, Ya-Ping

    2011-01-14

    During the colonization of the world after dispersal out of Africa, modern humans encountered changeable environments, and substantial phenotypic variation, involving diverse behaviors, lifestyles and cultures, was generated among the different modern human populations. Here, we study the level of population differentiation of human genes among different populations. Intriguingly, genes involved in osteoblast development were identified as being enriched with higher FST SNPs, a result consistent with the proposed role of the skeletal system in accounting for variation among human populations. Genes involved in the development of hair follicles, where hair is produced, were also found to have higher levels of population differentiation, consistent with hair morphology being a distinctive trait among human populations. Other genes that showed higher levels of population differentiation include those involved in pigmentation, spermatid, nervous system and organ development, and some metabolic pathways, but few involved with the immune system. Disease-related genes show an excess of SNPs with lower levels of population differentiation, probably due to purifying selection. Surprisingly, we find that Mendelian-disease genes appear to have a significant excess of SNPs with high levels of population differentiation, possibly because the incidence and susceptibility of these diseases show differences among populations. As expected, microRNA-regulated genes show lower levels of population differentiation due to purifying selection. Our analysis demonstrates different levels of population differentiation among human populations for different gene groups.

  9. Fine Mapping Causal Variants with an Approximate Bayesian Method Using Marginal Test Statistics

    PubMed Central

    Chen, Wenan; Larrabee, Beth R.; Ovsyannikova, Inna G.; Kennedy, Richard B.; Haralambieva, Iana H.; Poland, Gregory A.; Schaid, Daniel J.

    2015-01-01

    Two recently developed fine-mapping methods, CAVIAR and PAINTOR, demonstrate better performance over other fine-mapping methods. They also have the advantage of using only the marginal test statistics and the correlation among SNPs. Both methods leverage the fact that the marginal test statistics asymptotically follow a multivariate normal distribution and are likelihood based. However, their relationship with Bayesian fine mapping, such as BIMBAM, is not clear. In this study, we first show that CAVIAR and BIMBAM are actually approximately equivalent to each other. This leads to a fine-mapping method using marginal test statistics in the Bayesian framework, which we call CAVIAR Bayes factor (CAVIARBF). Another advantage of the Bayesian framework is that it can answer both association and fine-mapping questions. We also used simulations to compare CAVIARBF with other methods under different numbers of causal variants. The results showed that both CAVIARBF and BIMBAM have better performance than PAINTOR and other methods. Compared to BIMBAM, CAVIARBF has the advantage of using only marginal test statistics and takes about one-quarter to one-fifth of the running time. We applied different methods on two independent cohorts of the same phenotype. Results showed that CAVIARBF, BIMBAM, and PAINTOR selected the same top 3 SNPs; however, CAVIARBF and BIMBAM had better consistency in selecting the top 10 ranked SNPs between the two cohorts. Software is available at https://bitbucket.org/Wenan/caviarbf. PMID:25948564

  10. Association of genetic variation in the tachykinin receptor 3 locus with hot flashes and night sweats in the Women's Health Initiative Study.

    PubMed

    Crandall, Carolyn J; Manson, JoAnn E; Hohensee, Chancellor; Horvath, Steve; Wactawski-Wende, Jean; LeBlanc, Erin S; Vitolins, Mara Z; Nassir, Rami; Sinsheimer, Janet S

    2017-03-01

    Vasomotor symptoms (VMS, ie, hot flashes or night sweats) are reported by many, but not all, women. The extent to which VMS are genetically determined is unknown. We evaluated the relationship of genetic variation and VMS. In this observational study, we accessed data from three genome-wide association studies (GWAS) (SNP Health Association Resource cohort [SHARe], WHI Memory Study cohort [WHIMS+], and Genome-Wide Association Studies of Treatment Response in Randomized Clinical Trials [GARNET] studies, total n = 17,695) of European American, African American, and Hispanic American postmenopausal women aged 50 to 79 years at baseline in the Women's Health Initiative Study. We examined genetic variation in relation to VMS (yes/no) in each study and using trans-ethnic inverse variance fixed-effects meta-analysis. A total of 11,078,977 single-nucleotide polymorphisms (SNPs) met the quality criteria. After adjustment for covariates and population structure, three SNPs (on chromosomes 3 and 11) were associated with VMS at the genome-wide threshold of 5 × 10(-8) in the African American SHARe GWAS, but were not associated in the other cohorts. In the meta-analysis, 14 SNPs, all located on chromosome 4 in the tachykinin receptor 3 (TACR3) locus, however, had P < 5 × 10(-8). These SNPs' effect sizes were similar across studies/participants' ancestry (odds ratio ∼1.5). Genetic variation in TACR3 may contribute to the risk of VMS. To our knowledge, this is the first GWAS to examine SNPs associated with VMS. These results support the biological hypothesis of a role for TACR3 in VMS, which was previously hypothesized from animal and human studies. Further study of these variants may lead to new insights into the biological pathways involved in VMS, which are poorly understood.
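An inverse-variance fixed-effects meta-analysis of the kind named in this abstract can be sketched in a few lines. This is a generic illustration, not the study's pipeline; the per-cohort log odds ratios and standard errors below are made-up values, and the function name is hypothetical.

```python
import math

def fixed_effects_meta(betas, ses):
    """Combine per-study effects (e.g. log odds ratios) by inverse-variance weighting."""
    weights = [1.0 / se ** 2 for se in ses]          # weight = 1 / variance
    total = sum(weights)
    beta = sum(w * b for w, b in zip(weights, betas)) / total
    se = math.sqrt(1.0 / total)                      # SE of the pooled estimate
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2))             # two-sided normal p-value
    return beta, se, z, p

# Hypothetical per-cohort estimates for one SNP (log odds ratios and SEs).
betas = [0.41, 0.38, 0.44]
ses = [0.10, 0.15, 0.20]
beta, se, z, p = fixed_effects_meta(betas, ses)
print(f"pooled OR = {math.exp(beta):.2f}, z = {z:.2f}, p = {p:.2e}")
```

Studies with smaller standard errors receive larger weights, so the pooled estimate is dominated by the most precisely estimated cohorts.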

  11. Mutation analysis of BRCA1/2 mutations with special reference to polymorphic SNPs in Indian breast cancer patients.

    PubMed

    Shah, Nidhi D; Shah, Parth S; Panchal, Yash Y; Katudia, Kalpesh H; Khatri, Nikunj B; Ray, Hari Shankar P; Bhatiya, Upti R; Shah, Sandip C; Shah, Bhavini S; Rao, Mandava V

    2018-01-01

    Germline mutations in BRCA1 and BRCA2 contribute almost equally to the causation of breast cancer (BC). The types of mutation that cause this condition in the Indian population are largely unknown. In this cohort, 79 randomized BC patients were screened for various types of BRCA1 and BRCA2 mutations including frameshift, nonsense, missense, in-frame and splice site types. The purified extracted DNA of each referral patient was subjected to Sanger gene sequencing using Codon Code Analyzer and Mutation Surveyor and next-generation sequencing (NGS) methods with Ion torrent software, after appropriate care. The data revealed that 35 cases were positive for BRCA1 or BRCA2 (35/79: 44.3%). BRCA2 mutations were more frequent (52.4%) than BRCA1 mutations (47.6%). Five novel mutations detected in this study were p.pro163 frameshift, p.asn997 frameshift, p.ser148 frameshift and two splice site single-nucleotide polymorphisms (SNPs). Additionally, four nonsense and one in-frame deletion were identified, which all seemed to be pathogenic. Polymorphic SNPs contributed the highest percentage of mutations (72/82: 87.8%) and were classified as pathogenic, likely pathogenic, likely benign, benign and variant of unknown significance (VUS). Young age groups (20-60 years) had a high frequency of germline mutations (62/82: 75.6%) in the Indian population. This study suggested that polymorphic SNPs contributed a high percentage of mutations along with five novel types. Younger age groups are prone to having BC with a higher mutational rate. Furthermore, the SNPs detected in exons 10, 11 and 16 of BRCA1 and BRCA2 were more numerous than those at the polymorphic sites in exons 2, 3 and 9 of the two germline genes. These may contribute to BC, although missense types are known to confer susceptibility to cancer depending on the type of amino acid replaced in the protein and are associated with pathologic events. Accordingly, appropriate counseling and treatment may be suggested.

  12. Kernel machine SNP set analysis provides new insight into the association between obesity and polymorphisms located on the chromosomal 16q.12.2 region: Tehran Lipid and Glucose Study.

    PubMed

    Javanrouh, Niloufar; Daneshpour, Maryam S; Soltanian, Ali Reza; Tapak, Leili

    2018-06-05

    Obesity is a serious health problem that leads to low quality of life and early mortality. For the purposes of prevention and gene therapy for such a worldwide disease, the genome-wide association study is a powerful tool for finding SNPs associated with increased risk of obesity. For association analysis, kernel machine regression, a generalized regression method, has the advantage of accounting for epistatic effects as well as for the correlation between individuals due to unknown factors. In this study, data from participants in the Tehran cardio-metabolic genetic study were used. They were genotyped for the chromosomal region 16q12.2 (build hg38), in which 986 variants were evaluated. Kernel machine regression and single-SNP analysis were used to assess the association between obesity and the genotyped SNP data. We found that the SNP sets associated with obesity were mostly in the FTO (P = 0.01), AIKTIP (P = 0.02) and MMP2 (P = 0.02) genes. Moreover, two SNPs, i.e., rs10521296 and rs11647470, showed significant association with obesity using kernel regression (P = 0.02). In conclusion, significant sets were randomly distributed throughout the region, with higher density around the FTO, AIKTIP and MMP2 genes. Furthermore, two intergenic SNPs showed significant association after using kernel machine regression. Therefore, more studies have to be conducted to assess their functionality or precise mechanism. Copyright © 2018 Elsevier B.V. All rights reserved.

  13. Development of a Genetic Map for Onion (Allium cepa L.) Using Reference-Free Genotyping-by-Sequencing and SNP Assays

    PubMed Central

    Jo, Jinkwan; Purushotham, Preethi M.; Han, Koeun; Lee, Heung-Ryul; Nah, Gyoungju; Kang, Byoung-Cheorl

    2017-01-01

    Single nucleotide polymorphisms (SNPs) play important roles as molecular markers in plant genomics and breeding studies. Although onion (Allium cepa L.) is an important crop globally, relatively few molecular marker resources have been reported due to its large genome and high heterozygosity. Genotyping-by-sequencing (GBS) offers a greater degree of complexity reduction followed by concurrent SNP discovery and genotyping for species with complex genomes. In this study, GBS was employed for SNP mining in onion, which currently lacks a reference genome. A segregating F2 population, derived from a cross between ‘NW-001’ and ‘NW-002,’ as well as multiple parental lines were used for GBS analysis. A total of 56.15 Gbp of raw sequence data were generated and 1,851,428 SNPs were identified from the de novo assembled contigs. Stringent filtering resulted in 10,091 high-fidelity SNP markers. Robust SNPs that satisfied the segregation ratio criteria and with even distribution in the mapping population were used to construct an onion genetic map. The final map contained eight linkage groups and spanned a genetic length of 1,383 centiMorgans (cM), with an average marker interval of 8.08 cM. These robust SNPs were further analyzed using the high-throughput Fluidigm platform for marker validation. This is the first study in onion to develop genome-wide SNPs using GBS. The resulting SNP markers and developed linkage map will be valuable tools for genetic mapping of important agronomic traits and marker-assisted selection in onion breeding programs. PMID:28959273

  14. Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs)

    PubMed Central

    Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Alonso, M. Rosario; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Benitez, Javier; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M.; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F.; Fasching, Peter A.; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G.; Goldberg, Mark S.; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A.; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L.; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L.; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C.; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H.; Tessier, Daniel C.; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M.; Vincent, Daniel; Winqvist, Robert; Wu, Anna H.; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D. P.; Hall, Per; Edwards, Stacey L.; Simard, Jacques; French, Juliet D.; Chenevix-Trench, Georgia; Dunning, Alison M.

    2016-01-01

    Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90–0.94; P = 8.96 × 10−15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10−09, r2 = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10−11, r2 = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus. PMID:27600471

  15. A survey of the population genetic variation in the human kinome.

    PubMed

    Zhang, Wei; Catenacci, Daniel V T; Duan, Shiwei; Ratain, Mark J

    2009-08-01

    Protein kinases are key regulators of various biological processes, such as control of cell growth, metabolism, differentiation and apoptosis. Therefore, protein kinases have been an important class of targets for anticancer drugs. Health-related disparities such as differential drug response have been observed between human populations. A survey of the human kinases and their ligand genes for those containing population-specific genetic variants could provide new insights into the mechanisms of these health disparities and suggest novel targets for ethnicity-specific personalized medicine. Using the International HapMap Project genotypic data on single-nucleotide polymorphisms (SNPs), the protein kinase complement of the human genome (kinome) and some experimentally verified ligand genes were scanned for the existence of population-specific SNPs (eSNPs). In general, protein kinases were found to contain a much higher proportion of eSNPs than the whole genome background, indicating a stronger pressure for adaptation in individual populations. In contrast, the proportion of ligand genes containing eSNPs was not different from that of the whole genome background. Although with some important limitations, our results suggest that human kinases are more likely to be under recent positive selection than ligands. Our findings suggest that the health-related disparities associated with kinase signaling pathways are more likely to be driven by the genetic variation in the kinase genes than their cognate ligands. Illustrating the role of molecular evolution in the genetic variation of the human kinome could provide a promising route to understand the ethnic differences in cancer and facilitate the realization of ethnicity-based individualized medicine.

  16. Genetic Variants in the Hedgehog Interacting Protein Gene Are Associated with the FEV1/FVC Ratio in Southern Han Chinese Subjects with Chronic Obstructive Pulmonary Disease

    PubMed Central

    Zhang, Zili; Wang, Jian; Zheng, Zeguang; Chen, Xindong; Zeng, Xiansheng; Zhang, Yi; Li, Defu; Shu, Jiaze; Yang, Kai; Lai, Ning; Dong, Lian

    2017-01-01

    Background Convincing evidence has demonstrated associations between HHIP and FAM13a polymorphisms and COPD in non-Asian populations. Here genetic variants in HHIP and FAM13a were investigated in Southern Han Chinese subjects with COPD. Methods A case-control study was conducted, including 989 cases and 999 controls. Associations between SNP genotypes and COPD were assessed with a logistic regression model; for SNPs and COPD-related phenotypes such as lung function, COPD severity, pack-years of smoking, and smoking status, a linear regression model was employed. Effects of risk alleles, genotypes, and haplotypes of the 3 significant SNPs in the HHIP gene on FEV1/FVC were also assessed in a linear regression model in COPD. Results The mean FEV1/FVC% value was 46.8 in the combined COPD population. None of the 8 selected SNPs was apparently related to COPD susceptibility. However, three SNPs (rs12509311, rs13118928, and rs182859) in HHIP were significantly associated with FEV1/FVC% (Pmax = 4.1 × 10−4) in COPD after adjusting for gender, age, and smoking pack-years. Moreover, significant associations between risk alleles and FEV1/FVC% (P = 2.3 × 10−4) and between risk genotypes and FEV1/FVC% (P = 3.5 × 10−4) were also observed in COPD. Conclusions Genetic variants in HHIP were related to FEV1/FVC in COPD. Significant relationships between risk alleles and risk genotypes and FEV1/FVC in COPD were also identified. PMID:28929109

  17. Genome-wide association studies and epistasis analyses of candidate genes related to age at menarche and age at natural menopause in a Korean population.

    PubMed

    Pyun, Jung-A; Kim, Sunshin; Cho, Nam H; Koh, InSong; Lee, Jong-Young; Shin, Chol; Kwack, KyuBum

    2014-05-01

    The aim of this study was to identify polymorphisms and gene-gene interactions that are significantly associated with age at menarche and age at menopause in a Korean population. A total of 3,452 and 1,827 women participated in studies of age at menarche and age at natural menopause, respectively. Linear regression analyses adjusted for residence area were used to perform genome-wide association studies (GWAS), candidate gene association studies, and interactions between the candidate genes for age at menarche and age at natural menopause. In GWAS, four single nucleotide polymorphisms (SNPs; rs7528241, rs1324329, rs11597068, and rs6495785) were strongly associated with age at natural menopause (lowest P = 9.66 × 10). However, GWAS of age at menarche did not reveal any strong associations. In candidate gene association studies, SNPs with P < 0.01 were selected to test their synergistic interactions. For age at natural menopause, there was a significant interaction between intronic SNPs on ADAM metallopeptidase with thrombospondin type I motif 9 (ADAMTS9) and SMAD family member 3 (SMAD3) genes (P = 9.52 × 10). For age at menarche, there were three significant interactions between three intronic SNPs on follicle-stimulating hormone receptor (FSHR) gene and one SNP located at the 3' flanking region of insulin-like growth factor 2 receptor (IGF2R) gene (lowest P = 1.95 × 10). Novel SNPs and synergistic interactions between candidate genes are significantly associated with age at menarche and age at natural menopause in a Korean population.

  18. Investigation of the estrogen receptor-alpha gene with type 2 diabetes and/or nephropathy in African-American and European-American populations.

    PubMed

    Gallagher, Carla J; Keene, Keith L; Mychaleckyj, Josyf C; Langefeld, Carl D; Hirschhorn, Joel N; Henderson, Brian E; Gordon, Candace J; Freedman, Barry I; Rich, Stephen S; Bowden, Donald W; Sale, Michèle M

    2007-03-01

    The estrogen receptor-alpha gene (ESR1) was selected as a positional candidate under a type 2 diabetes linkage peak at 6q24-27. A total of 42 ESR1 single nucleotide polymorphisms (SNPs) were genotyped in 380 African-American type 2 diabetic case subjects with end-stage renal disease (ESRD) and 276 African-American control subjects. A total of 22 ancestry informative markers were also genotyped, and the program Admixmap was used to adjust allelic and haplotypic association tests for individual estimates of admixture. The most significant association with type 2 diabetes-ESRD was with rs1033182 in intron 2 (P = 0.013, admixture-adjusted P(a) = 0.021). Genotyping 17 SNPs across a region of ESR1 intron 1-intron 2 in an expanded population of 851 case and 635 control subjects supported association with rs1033182 (P = 0.004, P(a) = 0.027) and with an independent six-SNP haplotype of high linkage disequilibrium spanning 6.4 kb (P < 0.0001, P(a) < 0.0001). The same 17 ESR1 SNPs were genotyped in 300 European-American type 2 diabetes-ESRD case subjects and 310 European-American control subjects. Two intron 2 SNPs, rs2431260 (P = 0.015) and rs1709183 (P = 0.019), and a four-SNP haplotype containing these SNPs (P = 0.033) were associated with type 2 diabetes and/or ESRD. Results suggest that intron 1 and intron 2 of the ESR1 gene may contain functionally important regions related to type 2 diabetes or ESRD risk.

  19. Genome-Wide Analysis in Brazilians Reveals Highly Differentiated Native American Genome Regions

    PubMed Central

    Havt, Alexandre; Nayak, Uma; Pinkerton, Relana; Farber, Emily; Concannon, Patrick; Lima, Aldo A.; Guerrant, Richard L.

    2017-01-01

    Despite its population, geographic size, and emerging economic importance, disproportionately little genome-scale research exists into genetic factors that predispose Brazilians to disease, or the population genetics of risk. After identification of suitable proxy populations and careful analysis of tri-continental admixture in 1,538 North-Eastern Brazilians to estimate individual ancestry and ancestral allele frequencies, we computed 400,000 genome-wide locus-specific branch length (LSBL) Fst statistics of Brazilian Amerindian ancestry compared to European and African; and a similar set of differentiation statistics for their Amerindian component compared with the closest Asian 1000 Genomes population (surprisingly, Bengalis in Bangladesh). After ranking SNPs by these statistics, we identified the top 10 highly differentiated SNPs in five genome regions in the LSBL tests of Brazilian Amerindian ancestry compared to European and African; and the top 10 SNPs in eight regions comparing their Amerindian component to the closest Asian 1000 Genomes population. We found SNPs within or proximal to the genes CIITA (rs6498115), SMC6 (rs1834619), and KLHL29 (rs2288697) were most differentiated in the Amerindian-specific branch, while SNPs in the genes ADAMTS9 (rs7631391), DOCK2 (rs77594147), SLC28A1 (rs28649017), ARHGAP5 (rs7151991), and CIITA (rs45601437) were most highly differentiated in the Asian comparison. These genes are known to influence immune function, metabolic and anthropometry traits, and embryonic development. These analyses have identified candidate genes for selection within Amerindian ancestry, and by comparison of the two analyses, those for which the differentiation may have arisen during the migration from Asia to the Americas. PMID:28100790
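The locus-specific branch length statistic used to rank SNPs in this abstract is commonly defined from three pairwise Fst values as LSBL_A = (Fst_AB + Fst_AC − Fst_BC) / 2. The sketch below applies that standard definition to invented numbers; the SNP labels and Fst values are hypothetical and are not taken from the study.

```python
def lsbl(fst_ab, fst_ac, fst_bc):
    """Locus-specific branch length for population A given three pairwise Fst values."""
    return (fst_ab + fst_ac - fst_bc) / 2.0

# Hypothetical per-SNP pairwise Fst values:
# (Amerindian vs European, Amerindian vs African, European vs African)
pairwise_fst = {
    "snp_a": (0.45, 0.50, 0.15),
    "snp_b": (0.10, 0.20, 0.18),
    "snp_c": (0.30, 0.25, 0.05),
}

# Rank SNPs by the length of the Amerindian-specific branch, largest first.
ranked = sorted(pairwise_fst, key=lambda s: lsbl(*pairwise_fst[s]), reverse=True)
for snp in ranked:
    print(snp, round(lsbl(*pairwise_fst[snp]), 3))
```

A large value means the allele-frequency change is concentrated on the branch leading to population A rather than shared with the other two populations.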

  20. Genome-wide association study of vitamin D concentrations in Hispanic Americans: the IRAS family study.

    PubMed

    Engelman, Corinne D; Meyers, Kristin J; Ziegler, Julie T; Taylor, Kent D; Palmer, Nicholette D; Haffner, Steven M; Fingerlin, Tasha E; Wagenknecht, Lynne E; Rotter, Jerome I; Bowden, Donald W; Langefeld, Carl D; Norris, Jill M

    2010-10-01

    Vitamin D deficiency is associated with many adverse health outcomes. There are several well established environmental predictors of vitamin D concentrations, yet studies of the genetic determinants of vitamin D concentrations are in their infancy. Our objective was to conduct a pilot genome-wide association (GWA) study of 25-hydroxyvitamin D (25[OH]D) and 1,25-dihydroxyvitamin D (1,25[OH](2)D) concentrations in a subset of 229 Hispanic subjects, followed by replication genotyping of 50 single nucleotide polymorphisms (SNPs) in the entire sample of 1190 Hispanics from San Antonio, Texas and San Luis Valley, Colorado. Of the 309,200 SNPs that met all quality control criteria, three SNPs in high linkage disequilibrium (LD) with each other were significantly associated with 1,25[OH](2)D (rs6680429, rs9970802, and rs10889028) at a Bonferroni corrected P-value threshold of 1.62 × 10(-7), however none met the threshold for 25[OH]D. Of the 50 SNPs selected for replication genotyping, five for 25[OH]D (rs2806508, rs10141935, rs4778359, rs1507023, and rs9937918) and eight for 1,25[OH](2)D (rs6680429, rs1348864, rs4559029, rs12667374, rs7781309, rs10505337, rs2486443, and rs2154175) were replicated in the entire sample of Hispanics (P<0.01). In conclusion, we identified several SNPs that were associated with vitamin D metabolite concentrations in Hispanics. These candidate polymorphisms merit further investigation in independent populations and other ethnicities. Copyright © 2010 Elsevier Ltd. All rights reserved.
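The Bonferroni threshold quoted in this abstract can be reproduced with simple arithmetic, assuming a family-wise significance level of 0.05 (the abstract does not state the level; 0.05 is an assumption that matches the reported value).

```python
# Bonferroni correction: divide the family-wise alpha by the number of tests.
alpha = 0.05          # assumed family-wise significance level
n_snps = 309_200      # SNPs passing quality control, as reported in the abstract

threshold = alpha / n_snps
print(f"{threshold:.2e}")   # per-SNP P-value threshold
```

Under that assumption the result rounds to 1.62 × 10(-7), which agrees with the threshold reported above.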

  1. Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs).

    PubMed

    Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Alonso, M Rosario; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Benitez, Javier; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F; Fasching, Peter A; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G; Goldberg, Mark S; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H; Tessier, Daniel C; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M; Vincent, Daniel; Winqvist, Robert; Wu, Anna H; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D P; Hall, Per; Edwards, Stacey L; Simard, Jacques; French, Juliet D; Chenevix-Trench, Georgia; Dunning, Alison M

    2016-09-07

    Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10(-15))) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10(-09), r(2) = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10(-11), r(2) = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.

  2. Folate network genetic variation, plasma homocysteine, and global genomic methylation content: a genetic association study

    PubMed Central

    2011-01-01

    Background Sequence variants in genes functioning in folate-mediated one-carbon metabolism are hypothesized to lead to changes in levels of homocysteine and DNA methylation, which, in turn, are associated with risk of cardiovascular disease. Methods 330 SNPs in 52 genes were studied in relation to plasma homocysteine and global genomic DNA methylation. SNPs were selected based on functional effects and gene coverage, and assays were completed on the Illumina Goldengate platform. Age-, smoking-, and nutrient-adjusted genotype--phenotype associations were estimated in regression models. Results Using a nominal P ≤ 0.005 threshold for statistical significance, 20 SNPs were associated with plasma homocysteine, 8 with Alu methylation, and 1 with LINE-1 methylation. Using a more stringent false discovery rate threshold, SNPs in FTCD, SLC19A1, and SLC19A3 genes remained associated with plasma homocysteine. Gene by vitamin B-6 interactions were identified for both Alu and LINE-1 methylation, and epistatic interactions with the MTHFR rs1801133 SNP were identified for the plasma homocysteine phenotype. Pleiotropy involving the MTHFD1L and SARDH genes for both plasma homocysteine and Alu methylation phenotypes was identified. Conclusions No single gene was associated with all three phenotypes, and the set of the most statistically significant SNPs predictive of homocysteine or Alu or LINE-1 methylation was unique to each phenotype. Genetic variation in folate-mediated one-carbon metabolism, other than the well-known effects of the MTHFR c.665C>T (known as c.677 C>T, rs1801133, p.Ala222Val), is predictive of cardiovascular disease biomarkers. PMID:22103680

  3. High throughput SNP discovery and genotyping in hexaploid wheat

    PubMed Central

    Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre

    2018-01-01

    Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction of a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research. PMID:29293495

  4. SNP-markers in Allium species to facilitate introgression breeding in onion.

    PubMed

    Scholten, Olga E; van Kaauwen, Martijn P W; Shahin, Arwa; Hendrickx, Patrick M; Keizer, L C Paul; Burger, Karin; van Heusden, Adriaan W; van der Linden, C Gerard; Vosman, Ben

    2016-08-31

    Within onion, Allium cepa L., the availability of disease resistance is limited. The identification of sources of resistance in related species, such as Allium roylei and Allium fistulosum, was a first step towards the improvement of onion cultivars by breeding. SNP markers linked to resistance and polymorphic between these related species and onion cultivars are a valuable tool to efficiently introgress disease resistance genes. In this paper we describe the identification and validation of SNP markers valuable for onion breeding. Transcriptome sequencing resulted in 192 million RNA seq reads from the interspecific F1 hybrid between A. roylei and A. fistulosum (RF) and nine onion cultivars. After assembly, reliable SNPs were discovered in about 36 % of the contigs. For genotyping of the interspecific three-way cross population, derived from a cross between an onion cultivar and the RF (CCxRF), 1100 SNPs that are polymorphic in RF and monomorphic in the onion cultivars (RF SNPs) were selected for the development of KASP assays. A molecular linkage map based on 667 RF-SNP markers was constructed for CCxRF. In addition, KASP assays were developed for 1600 onion-SNPs (SNPs polymorphic among onion cultivars). A second linkage map was constructed for an F2 of onion x A. roylei (F2(CxR)) that consisted of 182 onion-SNPs and 119 RF-SNPs, and 76 previously mapped markers. Markers co-segregating in both the F2(CxR) and the CCxRF population were used to assign the linkage groups of RF to onion chromosomes. To validate usefulness of these SNP markers, QTL mapping was applied in the CCxRF population that segregates for resistance to Botrytis squamosa and resulted in a QTL for resistance on chromosome 6 of A. roylei. Our research has more than doubled the publicly available marker sequences of expressed onion genes and two onion-related species. It resulted in a detailed genetic map for the interspecific CCxRF population. 
    This is the first paper that reports the detection of a QTL for resistance to B. squamosa in A. roylei.

  5. "Contrasting patterns of selection at Pinus pinaster Ait. Drought stress candidate genes as revealed by genetic differentiation analyses".

    PubMed

    Eveno, Emmanuelle; Collada, Carmen; Guevara, M Angeles; Léger, Valérie; Soto, Alvaro; Díaz, Luis; Léger, Patrick; González-Martínez, Santiago C; Cervera, M Teresa; Plomion, Christophe; Garnier-Géré, Pauline H

    2008-02-01

    The importance of natural selection for shaping adaptive trait differentiation among natural populations of allogamous tree species has long been recognized. Determining the molecular basis of local adaptation remains largely unresolved, and the respective roles of selection and demography in shaping population structure are actively debated. Using a multilocus scan that aims to detect outliers from simulated neutral expectations, we analyzed patterns of nucleotide diversity and genetic differentiation at 11 polymorphic candidate genes for drought stress tolerance in phenotypically contrasted Pinus pinaster Ait. populations across its geographical range. We compared 3 coalescent-based methods: 2 frequentist-like, including 1 approach specifically developed for biallelic single nucleotide polymorphisms (SNPs) here and 1 Bayesian. Five genes showed outlier patterns that were robust across methods at the haplotype level for 2 of them. Two genes presented higher F(ST) values than expected (PR-AGP4 and erd3), suggesting that they could have been affected by the action of diversifying selection among populations. In contrast, 3 genes presented lower F(ST) values than expected (dhn-1, dhn2, and lp3-1), which could represent signatures of homogenizing selection among populations. A smaller proportion of outliers were detected at the SNP level suggesting the potential functional significance of particular combinations of sites in drought-response candidate genes. The Bayesian method appeared robust to low sample sizes, flexible to assumptions regarding migration rates, and powerful for detecting selection at the haplotype level, but the frequentist-like method adapted to SNPs was more efficient for the identification of outlier SNPs showing low differentiation. Population-specific effects estimated in the Bayesian method also revealed populations with lower immigration rates, which could have led to favorable situations for local adaptation. 
    Outlier patterns are discussed in relation to the different genes' putative involvement in drought tolerance responses, from published results in transcriptomics and association mapping in P. pinaster and other related species. These genes clearly constitute relevant candidates for future association studies in P. pinaster.

  6. Genome-Wide Associations and Functional Genomic Studies of Musculoskeletal Adverse Events in Women Receiving Aromatase Inhibitors

    PubMed Central

    Ingle, James N.; Schaid, Daniel J.; Goss, Paul E.; Liu, Mohan; Mushiroda, Taisei; Chapman, Judy-Anne W.; Kubo, Michiaki; Jenkins, Gregory D.; Batzler, Anthony; Shepherd, Lois; Pater, Joseph; Wang, Liewei; Ellis, Matthew J.; Stearns, Vered; Rohrer, Daniel C.; Goetz, Matthew P.; Pritchard, Kathleen I.; Flockhart, David A.; Nakamura, Yusuke; Weinshilboum, Richard M.

    2010-01-01

    Purpose We performed a case-control genome-wide association study (GWAS) to identify single nucleotide polymorphisms (SNPs) associated with musculoskeletal adverse events (MS-AEs) in women treated with aromatase inhibitors (AIs) for early breast cancer. Patients and Methods A nested case-control design was used to select patients enrolled onto the MA.27 phase III trial comparing anastrozole with exemestane. Cases were matched to two controls and were defined as patients with grade 3 or 4 MS-AEs (according to the National Cancer Institute's Common Terminology Criteria for Adverse Events v3.0) or those who discontinued treatment for any grade of MS-AE within the first 2 years. Genotyping was performed with the Illumina Human610-Quad BeadChip. Results The GWAS included 293 cases and 585 controls. A total of 551,358 SNPs were analyzed, followed by imputation and fine mapping of a region of interest on chromosome 14. Four SNPs on chromosome 14 had the lowest P values (2.23E-06 to 6.67E-07). T-cell leukemia 1A (TCL1A) was the gene closest (926-7000 bp) to the four SNPs. Functional genomic studies revealed that one of these SNPs (rs11849538) created an estrogen response element and that TCL1A expression was estrogen dependent, was associated with the variant SNP genotypes in estradiol-treated lymphoblastoid cells transfected with estrogen receptor alpha and was directly related to interleukin 17 receptor A (IL17RA) expression. Conclusion This GWAS identified SNPs associated with MS-AEs in women treated with AIs and with a gene (TCL1A) which, in turn, was related to a cytokine (IL17). These findings provide a focus for further research to identify patients at risk for MS-AEs and to explore the mechanisms for these adverse events. PMID:20876420

  7. Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

    PubMed Central

    Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

    2012-01-01

    As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718
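
The density and polymorphism figures quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the variable names are ours, and the implied sequence length is simply the product of the two reported numbers, not a figure stated by the authors.

```python
# Figures quoted in the abstract above.
snps_detected = 2_113_120     # SNPs found by re-sequencing 27 cultivars
bp_per_snp = 288              # reported average spacing (one SNP per 288 bp)
array_snps = 7_867            # SNPs placed on the IRSC apple 8K SNP array v1
polymorphic_snps = 5_554      # array SNPs found polymorphic on evaluation

# Sequence length implied by the reported density
# (derived here as an assumption-free product, but not stated in the abstract).
implied_length_mb = snps_detected * bp_per_snp / 1e6

# Share of array SNPs that turned out to be polymorphic.
polymorphic_fraction = polymorphic_snps / array_snps

print(f"Implied sequence length: {implied_length_mb:.1f} Mb")
print(f"Polymorphic SNPs on array: {polymorphic_fraction:.1%}")
```

The polymorphic share is the more useful of the two numbers in practice, since it indicates how many array positions are likely to be informative in a new germplasm set.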

  8. [Association between aryl hydrocarbon receptor gene polymorphisms and chromosomal damage in coke-oven workers].

    PubMed

    Bin, Ping; Leng, Shuguang; Liang, Xuemiao; Cheng, Juan

    2007-11-01

    To investigate the association of single nucleotide polymorphisms (SNPs) or haplotypes of the aryl hydrocarbon receptor (AHR) gene with chromosomal damage in peripheral blood lymphocytes among coke-oven workers. Eighty-nine coke-oven workers exposed to a high level of polycyclic aromatic hydrocarbons (PAHs) and sixty non-exposed workers were selected as the study subjects. Urinary 1-hydroxypyrene (1-OHPyr) levels were measured as the internal dose of PAH exposure. Chromosomal damage in peripheral lymphocytes was measured by the cytokinesis-block micronucleus (CBMN) assay. Two SNPs in the AHR gene, rs6960165 and rs2282885, were detected by PCR-RFLP. The AHR haplotypes were estimated by a Bayesian statistical method with the software PHASE Version 2.1. The associations between SNPs or haplotype pairs and CBMN were assessed by analysis of covariance in the coke-oven workers and non-exposed workers. The level of 1-OHPyr among coke-oven workers was significantly higher than that among non-exposed workers (P < 0.01). The CBMN frequency among coke-oven workers was significantly higher than that among non-exposed workers (P < 0.01). After adjusting for age and the level of 1-OHPyr, the genotypes of AHR rs6960165 in coke-oven workers were related to the CBMN frequencies (P = 0.014), but no association between the genotypes of AHR rs2282885 and the rates of CBMN was observed in coke-oven workers (P = 0.586). Neither SNP was associated with CBMN in the controls (P = 0.308 and P = 0.415, respectively). The haplotypes in coke-oven workers were significantly related to the rates of CBMN (P = 0.007), while there was no significant association in non-exposed workers (P = 0.768). Our results suggested that SNP rs6960165 or haplotypes of AHR were associated with the CBMN frequencies in coke-oven workers.

  9. Polymorphisms in genes of the renin-angiotensin-aldosterone system and renal cell cancer risk: interplay with hypertension and intakes of sodium, potassium and fluid.

    PubMed

    Deckers, Ivette A; van den Brandt, Piet A; van Engeland, Manon; van Schooten, Frederik-Jan; Godschalk, Roger W; Keszei, András P; Schouten, Leo J

    2015-03-01

    Hypertension is an established risk factor for renal cell cancer (RCC). The renin-angiotensin-aldosterone system (RAAS) regulates blood pressure and is closely linked to hypertension. RAAS additionally influences homeostasis of electrolytes (e.g. sodium and potassium) and fluid. We investigated single nucleotide polymorphisms (SNPs) in RAAS and their interactions with hypertension and intakes of sodium, potassium and fluid regarding RCC risk in the Netherlands Cohort Study (NLCS), which was initiated in 1986 and included 120,852 participants aged 55 to 69 years. Diet and lifestyle were assessed by questionnaires and toenail clippings were collected. Genotyping of toenail DNA was performed using the SEQUENOM® MassARRAY® platform for a literature-based selection of 13 candidate SNPs in seven key RAAS genes. After 20.3 years of follow-up, Cox regression analyses were conducted using a case-cohort approach including 3,583 subcohort members and 503 RCC cases. Two SNPs in AGTR1 were associated with RCC risk. AGTR1_rs1492078 (AA vs. GG) decreased RCC risk [hazard ratio (HR) (95% confidence interval (CI)): 0.70(0.49-1.00)], whereas AGTR1_rs5186 (CC vs. AA) increased RCC risk [HR(95%CI): 1.49(1.08-2.05)]. Associations were stronger in participants with hypertension. The RCC risk for AGT_rs3889728 (AG + AA vs. GG) was modified by hypertension (p interaction = 0.039). SNP-diet interactions were not significant, although HRs suggested interaction between SNPs in ACE and sodium intake. SNPs in AGTR1 and AGT influenced RCC susceptibility, and their effects were modified by hypertension. Sodium intake was differentially associated with RCC risk across genotypes of several SNPs, yet some analyses had probably inadequate power to show significant interaction. Results suggest that RAAS may be a candidate pathway in RCC etiology. © 2014 UICC.

  10. Cytochrome P450 2E1 gene polymorphisms/haplotypes and anti-tuberculosis drug-induced hepatitis in a Chinese cohort.

    PubMed

    Tang, Shaowen; Lv, Xiaozhen; Zhang, Yuan; Wu, Shanshan; Yang, Zhirong; Xia, Yinyin; Tu, Dehua; Deng, Peiyuan; Ma, Yu; Chen, Dafang; Zhan, Siyan

    2013-01-01

    The pathogenic mechanism of anti-tuberculosis (anti-TB) drug-induced hepatitis is associated with drug metabolizing enzymes. No studies of tagging single-nucleotide polymorphisms (tSNPs) of cytochrome P450 2E1 (CYP2E1) in relation to the risk of anti-TB drug-induced hepatitis have been reported. The present study was aimed at exploring the role of tSNPs in the CYP2E1 gene in a population-based anti-TB treatment cohort. A nested case-control study was designed. Each hepatitis case was matched 1:4 with controls by age, gender, treatment history, disease severity and drug dosage. The tSNPs were selected by using Haploview 4.2 based on the HapMap database of Han Chinese in Beijing, and detected by using TaqMan allelic discrimination technology. Eighty-nine anti-TB drug-induced hepatitis cases and 356 controls were included in this study. Six tSNPs (rs2031920, rs2070672, rs915908, rs8192775, rs2515641, rs2515644) were genotyped, and their minor allele frequencies were 21.9%, 23.0%, 19.1%, 23.6%, 20.8% and 44.4% in the cases and 20.9%, 22.7%, 18.9%, 23.2%, 18.2% and 43.2% in the controls, respectively. No significant difference was observed in genotypes or allele frequencies of the 6 tSNPs between the case group and the control group, and no haplotype in block 1 or block 2 was significantly associated with the development of hepatitis. Based on the Chinese anti-TB treatment cohort, we did not find a statistically significant association between genetic polymorphisms of CYP2E1 and the risk of anti-TB drug-induced hepatitis. None of the haplotypes showed a significant association with the development of hepatitis in the Chinese TB population.
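
To see how small the case-control differences in this abstract are, the reported minor allele frequencies can be turned into crude allelic odds ratios. This is a rough sketch under stated assumptions: it uses only the rounded percentages quoted above, ignores the matched design (which the authors would have handled with a conditional analysis), and gives no confidence intervals. The function and variable names are hypothetical.

```python
# Minor allele frequencies (%) quoted in the abstract: (cases, controls).
maf_percent = {
    "rs2031920": (21.9, 20.9),
    "rs2070672": (23.0, 22.7),
    "rs915908": (19.1, 18.9),
    "rs8192775": (23.6, 23.2),
    "rs2515641": (20.8, 18.2),
    "rs2515644": (44.4, 43.2),
}


def allelic_odds_ratio(case_pct, control_pct):
    """Odds of the minor allele in cases divided by the odds in controls."""
    p_case = case_pct / 100
    p_control = control_pct / 100
    return (p_case / (1 - p_case)) / (p_control / (1 - p_control))


# Crude, unadjusted odds ratio for each tSNP.
odds_ratios = {snp: allelic_odds_ratio(*pair) for snp, pair in maf_percent.items()}

for snp, value in odds_ratios.items():
    print(f"{snp}: crude allelic OR = {value:.2f}")
```

Every ratio computed this way lies close to 1, which is in line with the abstract's conclusion that no tSNP differed significantly between cases and controls.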

  11. Development of a Medium Density Combined-Species SNP Array for Pacific and European Oysters (Crassostrea gigas and Ostrea edulis).

    PubMed

    Gutierrez, Alejandro P; Turner, Frances; Gharbi, Karim; Talbot, Richard; Lowe, Natalie R; Peñaloza, Carolina; McCullough, Mark; Prodöhl, Paulo A; Bean, Tim P; Houston, Ross D

    2017-07-05

    SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster (Crassostrea gigas) and European flat oyster (Ostrea edulis), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples (n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families (n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs. Copyright © 2017 Gutierrez et al.

  12. Genome-Wide Association Mapping of Flowering and Ripening Periods in Apple.

    PubMed

    Urrestarazu, Jorge; Muranty, Hélène; Denancé, Caroline; Leforestier, Diane; Ravon, Elisa; Guyader, Arnaud; Guisnel, Rémi; Feugey, Laurence; Aubourg, Sébastien; Celton, Jean-Marc; Daccord, Nicolas; Dondini, Luca; Gregori, Roberto; Lateur, Marc; Houben, Patrick; Ordidge, Matthew; Paprstein, Frantisek; Sedlak, Jiri; Nybom, Hilde; Garkava-Gustavsson, Larisa; Troggio, Michela; Bianco, Luca; Velasco, Riccardo; Poncet, Charles; Théron, Anthony; Moriya, Shigeki; Bink, Marco C A M; Laurens, François; Tartarini, Stefano; Durel, Charles-Eric

    2017-01-01

    Deciphering the genetic control of flowering and ripening periods in apple is essential for breeding cultivars adapted to their growing environments. We implemented a large Genome-Wide Association Study (GWAS) at the European level using an association panel of 1,168 different apple genotypes distributed over six locations and phenotyped for these phenological traits. The panel was genotyped at a high-density of SNPs using the Axiom®Apple 480 K SNP array. We ran GWAS with a multi-locus mixed model (MLMM), which handles the putatively confounding effect of significant SNPs elsewhere on the genome. Genomic regions were further investigated to reveal candidate genes responsible for the phenotypic variation. At the whole population level, GWAS retained two SNPs as cofactors on chromosome 9 for flowering period, and six for ripening period (four on chromosome 3, one on chromosome 10 and one on chromosome 16) which, together accounted for 8.9 and 17.2% of the phenotypic variance, respectively. For both traits, SNPs in weak linkage disequilibrium were detected nearby, thus suggesting the existence of allelic heterogeneity. The geographic origins and relationships of apple cultivars accounted for large parts of the phenotypic variation. Variation in genotypic frequency of the SNPs associated with the two traits was connected to the geographic origin of the genotypes (grouped as North+East, West and South Europe), and indicated differential selection in different growing environments. Genes encoding transcription factors containing either NAC or MADS domains were identified as major candidates within the small confidence intervals computed for the associated genomic regions. A strong microsynteny between apple and peach was revealed in all the four confidence interval regions. 
    This study shows how association genetics can unravel the genetic control of important horticultural traits in apple, as well as reduce the confidence intervals of the associated regions identified by linkage mapping approaches. Our findings can be used for the improvement of apple through marker-assisted breeding strategies that take advantage of the accumulating additive effects of the identified SNPs.

  13. No association of single nucleotide polymorphisms involved in GHRL and GHSR with cancer risk: a meta-analysis.

    PubMed

    Zhu, Shengjie; Shao, Bin; Hao, Yaoyao; Li, Zongxian; Liu, Houqiang; Li, Hong; Wang, Ming; Wang, Kai

    2015-01-01

    Ghrelin has been associated with several cancers, but studies of SNPs in the GHRL and GHSR genes have reported conflicting results. This meta-analysis therefore evaluates these associations. A systematic literature search of the PubMed database was conducted up to October 2013. We used odds ratios (ORs) with 95% confidence intervals (CIs) to assess the strength of the association under a fixed-effect model and a random-effect model. A total of 7 studies were included, comprising 3 studies for breast cancer, 2 for colorectal cancer, 1 for hepatocellular carcinoma, 1 for esophageal cancer and 1 for non-Hodgkin lymphoma. When all the GHRL SNPs were analyzed across all kinds of cancers, there was a significant difference between cancer patients and controls under the recessive model (OR 0.938, 95% CI 0.890-0.989, p=0.017), while no significant difference was found under the additive model (OR 0.9903, 95% CI 0.957-1.024, p=0.558) or the dominant model (OR 1.014, 95% CI 0.970-1.061, p=0.536). When all the GHSR SNPs were analyzed across all kinds of cancers, no significant difference was observed. Our results suggest that SNPs in GHRL and GHSR may have only a weak association with cancer risk, especially with breast cancer risk.
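
The odds ratios, confidence intervals and p-values in this abstract are linked by a standard relationship: on the log scale, a 95% CI spans roughly 2 × 1.96 standard errors, so a z-statistic and a two-sided p-value can be approximated from the OR and its CI alone. The sketch below applies that normal approximation to the three pooled GHRL estimates. It is a consistency check, not the authors' method, and the function name is hypothetical.

```python
import math


def p_from_or_and_ci(odds_ratio, ci_low, ci_high, z_crit=1.96):
    """Approximate z and two-sided p from an OR and its 95% CI (normal approximation)."""
    # Standard error of log(OR), recovered from the CI width on the log scale.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided p-value for a standard normal statistic.
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Pooled GHRL estimates quoted in the abstract: (OR, CI low, CI high).
reported = {
    "recessive": (0.938, 0.890, 0.989),
    "additive": (0.9903, 0.957, 1.024),
    "dominant": (1.014, 0.970, 1.061),
}

results = {model: p_from_or_and_ci(*values) for model, values in reported.items()}

for model, (z, p) in results.items():
    print(f"{model}: z = {z:.2f}, approximate p = {p:.3f}")
```

Because the published ORs and CIs are rounded, the approximate p-values can differ slightly from those reported, but only the recessive model should fall below 0.05.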

  14. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed Central

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina

    2018-01-01

    The germline JAK2 haplotype known as “GGCC or 46/1 haplotype” (haplotype GGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INSL4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a “GGCC” combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression. PMID:29641446

  15. The JAK2 GGCC (46/1) Haplotype in Myeloproliferative Neoplasms: Causal or Random?

    PubMed

    Anelli, Luisa; Zagaria, Antonella; Specchia, Giorgina; Albano, Francesco

    2018-04-11

    The germline JAK2 haplotype known as "GGCC or 46/1 haplotype" (haplotype GGCC_46/1) consists of a combination of single nucleotide polymorphisms (SNPs) mapping in a region of about 250 kb, extending from the JAK2 intron 10 to the Insulin-like 4 (INSL4) gene. Four main SNPs (rs3780367, rs10974944, rs12343867, and rs1159782) generating a "GGCC" combination are more frequently indicated to represent the JAK2 haplotype. These SNPs are inherited together and are frequently associated with the onset of myeloproliferative neoplasms (MPN) positive for both JAK2 V617 and exon 12 mutations. The association between the JAK2 haplotype GGCC_46/1 and mutations in other genes, such as thrombopoietin receptor (MPL) and calreticulin (CALR), or the association with triple negative MPN, is still controversial. This review provides an overview of the frequency and the role of the JAK2 haplotype GGCC_46/1 in the pathogenesis of different myeloid neoplasms and describes the hypothetical mechanisms at the basis of the association with JAK2 gene mutations. Moreover, possible clinical implications are discussed, as different papers reported contrasting data about the correlation between the JAK2 haplotype GGCC_46/1 and blood cell count, survival, or disease progression.

  16. Transposon Insertions, Structural Variations, and SNPs Contribute to the Evolution of the Melon Genome.

    PubMed

    Sanseverino, Walter; Hénaff, Elizabeth; Vives, Cristina; Pinosio, Sara; Burgos-Paz, William; Morgante, Michele; Ramos-Onsins, Sebastián E; Garcia-Mas, Jordi; Casacuberta, Josep Maria

    2015-10-01

    The availability of extensive databases of crop genome sequences should allow analysis of crop variability at an unprecedented scale, which should have an important impact in plant breeding. However, up to now the analysis of genetic variability at the whole-genome scale has been mainly restricted to single nucleotide polymorphisms (SNPs). This is a strong limitation as structural variation (SV) and transposon insertion polymorphisms are frequent in plant species and have had an important mutational role in crop domestication and breeding. Here, we present the first comprehensive analysis of melon genetic diversity, which includes a detailed analysis of SNPs, SV, and transposon insertion polymorphisms. The variability found among seven melon varieties representing the species diversity and including wild accessions and highly bred lines, is relatively high due in part to the marked divergence of some lineages. The diversity is distributed nonuniformly across the genome, being lower at the extremes of the chromosomes and higher in the pericentromeric regions, which is compatible with the effect of purifying selection and recombination forces over functional regions. Additionally, this variability is greatly reduced among elite varieties, probably due to selection during breeding. We have found some chromosomal regions showing a high differentiation of the elite varieties versus the rest, which could be considered as strongly selected candidate regions. Our data also suggest that transposons and SV may be at the origin of an important fraction of the variability in melon, which highlights the importance of analyzing all types of genetic variability to understand crop genome evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  17. Genotyping-By-Sequencing (GBS) Detects Genetic Structure and Confirms Behavioral QTL in Tame and Aggressive Foxes (Vulpes vulpes)

    PubMed Central

    Johnson, Jennifer L.; Wittgenstein, Helena; Mitchell, Sharon E.; Hyma, Katie E.; Temnykh, Svetlana V.; Kharlamova, Anastasiya V.; Gulevich, Rimma G.; Vladimirova, Anastasiya V.; Fong, Hiu Wa Flora; Acland, Gregory M.; Trut, Lyudmila N.; Kukekova, Anna V.

    2015-01-01

    The silver fox (Vulpes vulpes) offers a novel model for studying the genetics of social behavior and animal domestication. Selection of foxes, separately, for tame and for aggressive behavior has yielded two strains with markedly different, genetically determined, behavioral phenotypes. Tame strain foxes are eager to establish human contact while foxes from the aggressive strain are aggressive and difficult to handle. These strains have been maintained as separate outbred lines for over 40 generations but their genetic structure has not been previously investigated. We applied a genotyping-by-sequencing (GBS) approach to provide insights into the genetic composition of these fox populations. Sequence analysis of EcoT22I genomic libraries of tame and aggressive foxes identified 48,294 high quality SNPs. Population structure analysis revealed genetic divergence between the two strains and more diversity in the aggressive strain than in the tame one. Significant differences in allele frequency between the strains were identified for 68 SNPs. Three of these SNPs were located on fox chromosome 14 within an interval of a previously identified behavioral QTL, further supporting the importance of this region for behavior. The GBS SNP data confirmed that significant genetic diversity has been preserved in both fox populations despite many years of selective breeding. Analysis of SNP allele frequencies in the two populations identified several regions of genetic divergence between the tame and aggressive foxes, some of which may represent targets of selection for behavior. The GBS protocol used in this study significantly expanded genomic resources for the fox, and can be adapted for SNP discovery and genotyping in other canid species. PMID:26061395

  18. Genotyping-By-Sequencing (GBS) Detects Genetic Structure and Confirms Behavioral QTL in Tame and Aggressive Foxes (Vulpes vulpes).

    PubMed

    Johnson, Jennifer L; Wittgenstein, Helena; Mitchell, Sharon E; Hyma, Katie E; Temnykh, Svetlana V; Kharlamova, Anastasiya V; Gulevich, Rimma G; Vladimirova, Anastasiya V; Fong, Hiu Wa Flora; Acland, Gregory M; Trut, Lyudmila N; Kukekova, Anna V

    2015-01-01

    The silver fox (Vulpes vulpes) offers a novel model for studying the genetics of social behavior and animal domestication. Selection of foxes, separately, for tame and for aggressive behavior has yielded two strains with markedly different, genetically determined, behavioral phenotypes. Tame strain foxes are eager to establish human contact while foxes from the aggressive strain are aggressive and difficult to handle. These strains have been maintained as separate outbred lines for over 40 generations but their genetic structure has not been previously investigated. We applied a genotyping-by-sequencing (GBS) approach to provide insights into the genetic composition of these fox populations. Sequence analysis of EcoT22I genomic libraries of tame and aggressive foxes identified 48,294 high quality SNPs. Population structure analysis revealed genetic divergence between the two strains and more diversity in the aggressive strain than in the tame one. Significant differences in allele frequency between the strains were identified for 68 SNPs. Three of these SNPs were located on fox chromosome 14 within an interval of a previously identified behavioral QTL, further supporting the importance of this region for behavior. The GBS SNP data confirmed that significant genetic diversity has been preserved in both fox populations despite many years of selective breeding. Analysis of SNP allele frequencies in the two populations identified several regions of genetic divergence between the tame and aggressive foxes, some of which may represent targets of selection for behavior. The GBS protocol used in this study significantly expanded genomic resources for the fox, and can be adapted for SNP discovery and genotyping in other canid species.

  19. A fine structure genetic analysis evaluating ecoregional adaptability of a Bos taurus breed (Hereford)

    PubMed Central

    Krehbiel, B.; Ericsson, S. A.; Wilson, C.; Caetano, A. R.; Paiva, S. R.

    2017-01-01

    Ecoregional differences contribute to genetic environmental interactions and impact animal performance. These differences may become more important under climate change scenarios. Utilizing genetic diversity within a species to address such problems has not been fully explored. In this study Hereford cattle were genotyped with 50K Bead Chip or 770K Bovine Bead Chip to test the existence of genetic structure in five U.S. ecoregions characterized by precipitation, temperature and humidity and designated: cool arid (CA), cool humid (CH), transition zone (TZ), warm arid (WA), and warm humid (WH). SNP data were analyzed in three sequential analyses. Broad genetic structure was evaluated with STRUCTURE, and ADMIXTURE software using 14,312 SNPs after passing quality control variables. The second analysis was performed using principal coordinate analysis with 66 Tag SNPs associated in the literature with various aspects of environmental stressors (e.g., heat tolerance) or production (e.g., milk production). In the third analysis TreeSelect was used with the 66 SNPs to evaluate if ecoregional allelic frequencies deviated from a central frequency and by so doing are indicative of directional selection. The three analyses suggested subpopulation structures associated with ecoregions from where animals were derived. ADMIXTURE and PCA results illustrated the importance of temperature and humidity and confirm subpopulation assignments. Comparisons of allele frequencies with TreeSelect showed ecoregion differences, in particular the divergence between arid and humid regions. Patterns of genetic variability obtained by medium and high density SNP chips can be used to acclimatize a temperately derived breed to various ecoregions. As climate change becomes an important factor in cattle production, this study should be used as a proof of concept to review future breeding and conservation schemes aimed at adaptation to climatic events. PMID:28459870

  20. A simple aloe vera plant-extracted microwave and conventional combustion synthesis: Morphological, optical, magnetic and catalytic properties of CoFe2O4 nanostructures

    NASA Astrophysics Data System (ADS)

    Manikandan, A.; Sridhar, R.; Arul Antony, S.; Ramakrishna, Seeram

    2014-11-01

    Nanocrystalline magnetic spinel CoFe2O4 was synthesized by a simple microwave combustion method (MCM) using ferric nitrate, cobalt nitrate and Aloe vera plant extract solution. For comparison, it was also prepared by a conventional combustion method (CCM). Powder X-ray diffraction, energy dispersive X-ray and selected-area electron diffraction results indicate that the as-synthesized samples have a single-phase spinel structure with high crystallinity and without other phase impurities. The crystal structure and morphology of the powders, examined by high resolution scanning electron microscopy and transmission electron microscopy, show that the MCM CoFe2O4 samples contain sphere-like nanoparticles (SNPs), whereas the CCM samples consist of flake-like nanoplatelets (FNPs). The band gap of the samples was determined by UV-Visible diffuse reflectance and photoluminescence spectroscopy. The magnetization (Ms) results showed ferromagnetic behavior of the CoFe2O4 nanostructures. The Ms value of CoFe2O4-SNPs (77.62 emu/g) is higher than that of CoFe2O4-FNPs (25.46 emu/g). The higher Ms value suggests that the MCM technique is suitable for preparing high quality nanostructures for magnetic applications. Both samples were successfully tested as catalysts for the conversion of benzyl alcohol. The resulting spinel ferrites were highly selective for the oxidation of benzyl alcohol and exhibited important differences in their activities. The CoFe2O4-SNPs catalyst showed the best performance, achieving 99.5% selectivity for benzaldehyde at close to 93.2% conversion.
